Showing 120 of 120on this page. Filters & sort apply to loaded results; URL updates for sharing.120 of 120 on this page
GitHub - qwopqwop200/AutoAWQ-windows: AutoAWQ implements the AWQ ...
AWQ: How Its Code Works. A walkthrough of the AutoAWQ library | by ...
十五分钟简略走读 AutoAWQ 代码 - 知乎
AutoAwq 试用记录 - 知乎
After using autoawq to quantify the model, an error occurs when ...
is quantization with AutoAWQ supported for Qwen2-VL? · Issue #2398 ...
Can't install AutoAWQ 0.2 on Windows · Issue #377 · casper-hansen ...
[Feature request] AutoAWQ support · Issue #345 · NVIDIA/TensorRT-LLM ...
AutoAWQ installing incompatible pytorch version · Issue #226 · casper ...
Qwen/Qwen2.5-VL-32B-Instruct-AWQ · There is a conflict between autoawq ...
autoawq · PyPI
AutoAWQ Windows Fix – Get It Running! · Issue #704 · casper-hansen ...
AutoAWQ Crash after generating 1 token when fused attention is enabled ...
The version of transformers, auto_gptq, autoawq · Issue #88 · OpenGVLab ...
AutoAWQ load error · Issue #4337 · oobabooga/text-generation-webui · GitHub
TheBloke/Mistral-7B-OpenOrca-AWQ · AutoAWQ loader fails
autoawq aarch64 unavailable · Issue #4887 · oobabooga/text-generation ...
Add AutoAWQ as backend · Issue #3782 · oobabooga/text-generation-webui ...
Support AutoAWQ in `awq-py` · Issue #4701 · ggml-org/llama.cpp · GitHub
cannot load autoawq Model text-generation-webui1.7 how i fix this ...
Qwen/Qwen2-72B-Instruct-AWQ · Error AutoAWQ tensor 4 vllm
qwen2-72B can not be quantized by autoawq · Issue #498 · casper-hansen ...
Can autoAWQ be used by hiascend's npu? · Issue #616 · casper-hansen ...
AutoAWQ: 基于AWQ算法的4位量化推理加速工具 - 懂AI
AutoAWQ-INT4-gs128 - a fbaldassarri Collection
AutoAWQ/docs/examples.md at main · casper-hansen/AutoAWQ · GitHub
大模型量化技术原理-AWQ、AutoAWQ - 知乎
vLLM-0013-量化 01-AutoAWQ - 知乎
cant import awq · Issue #559 · casper-hansen/AutoAWQ · GitHub
GitHub - matrix-yang/AutoAWQ
QuixiAI/DeepSeek-R1-AWQ · What is the calibration set used when using ...
使用AutoAWQ量化自己的模型 - 知乎
From Fine-Tuning to Inference: The New LLM Optimization Stack with ...
auto-awq kernels is needed to be installed to use `.backward()` · Issue ...
After using AutoAWQ, Qwen decreased by 10 points · Issue #223 · casper ...
openbmb/MiniCPM4.1-8B-AutoAWQ · Hugging Face
AutoAWQ量化方法用于Bloom-560m - 知乎
Exploring the Potential of Dynamic Quantisation for Variable-Length ...
about the shape of qzeros in awq quantization model · Issue #566 ...
9.10-9.11-AutoAWQ代码解析_autoawq github-CSDN博客
You current version of `autoawq` does not support module quantization ...
【量化】AutoAWQ的quant_config配置参数 - 知乎
vLLM + AutoAWQ: Fastest Way To Serve LLMs | by Agent Native | Dev Genius
ruikangliu/DeepSeek-R1-Distill-Qwen-32B-quantized.awq-autoawq-w4g128 ...
AWQ量化及AutoAWQ代码详解-CSDN博客
vLLM + AutoAWQ: Fastest Way To Serve LLMs | by Datadrifters | Dev Genius
Support Qwen2 72 Awq quantization? · Issue #509 · casper-hansen/AutoAWQ ...
Add LoRA fine-tuning to AWQ · Issue #85 · casper-hansen/AutoAWQ · GitHub
Can not import AutoAWQForCausalLM on google colab · Issue #156 · casper ...
ImportError: Loading an AWQ quantized model requires auto-awq - Quick ...
大模型量化技术原理-AWQ、AutoAWQ - 掘金
awq quantization is not fully optimized yet. The speed can be slower ...
Bugs in AWQ models deployed in multiple GPUs. · Issue #662 · casper ...
使用conda安装autoawq,报错 · Issue #73 · casper-hansen/AutoAWQ · GitHub
大模型量化技术原理-AWQ、AutoAWQ近年来,随着Transformer、MOE架构的提出,使得深度学习模型轻松突破 - 掘金
AutoAWQ/docs/index.md at main · casper-hansen/AutoAWQ · GitHub
AutoAWQ项目对Qwen模型的支持与量化实践 - GitCode博客
GitHub - casper-hansen/AutoAWQ_kernels