MLC Models

Available Models

Model ID Quantization Link
Llama-3-8B-Instruct q0f16 HuggingFace
Llama-3-8B-Instruct q3f16_1 HuggingFace
Llama-3-8B-Instruct q4f16_1 HuggingFace
Llama-3-8B-Instruct q4f32_1 HuggingFace
Llama-3.1-70B-Instruct q0f16 HuggingFace
Llama-3.1-70B-Instruct q3f16_1 HuggingFace
Llama-3.1-70B-Instruct q4f16_1 HuggingFace
Llama-3.1-70B-Instruct q4f32_1 HuggingFace
Llama-3.1-8B q0f16 HuggingFace
Llama-3.1-8B q4f16_1 HuggingFace
Llama-3.1-8B q4f32_1 HuggingFace
Llama-3.1-8B-Instruct q0f16 HuggingFace
Llama-3.1-8B-Instruct q3f16_0 HuggingFace
Llama-3.1-8B-Instruct q3f16_1 HuggingFace
Llama-3.1-8B-Instruct q4f16_1 HuggingFace
Llama-3.1-8B-Instruct q4f32_1 HuggingFace
Llama-3.2-1B-Instruct q0f16 HuggingFace
Llama-3.2-1B-Instruct q0f32 HuggingFace
Llama-3.2-1B-Instruct q4f16_0 HuggingFace
Llama-3.2-1B-Instruct q4f16_1 HuggingFace
Llama-3.2-1B-Instruct q4f32_1 HuggingFace
Llama-3.2-3B-Instruct q0f16 HuggingFace
Llama-3.2-3B-Instruct q0f32 HuggingFace
Llama-3.2-3B-Instruct q4f16_0 HuggingFace
Llama-3.2-3B-Instruct q4f16_1 HuggingFace
Llama-3.2-3B-Instruct q4f32_1 HuggingFace
Hermes-2-Pro-Llama-3-8B q0f16 HuggingFace
Hermes-2-Pro-Llama-3-8B q3f16_1 HuggingFace
Hermes-2-Pro-Llama-3-8B q4f16_1 HuggingFace
Hermes-2-Pro-Llama-3-8B q4f32_1 HuggingFace
Hermes-2-Theta-Llama-3-70B q0f16 HuggingFace
Hermes-2-Theta-Llama-3-70B q3f16_1 HuggingFace
Hermes-2-Theta-Llama-3-70B q4f16_1 HuggingFace
Hermes-2-Theta-Llama-3-70B q4f32_1 HuggingFace
Hermes-2-Theta-Llama-3-8B q0f16 HuggingFace
Hermes-2-Theta-Llama-3-8B q3f16_1 HuggingFace
Hermes-2-Theta-Llama-3-8B q4f16_1 HuggingFace
Hermes-2-Theta-Llama-3-8B q4f32_1 HuggingFace
Hermes-3-Llama-3.1-8B q0f16 HuggingFace
Hermes-3-Llama-3.1-8B q3f16_1 HuggingFace
Hermes-3-Llama-3.1-8B q4f16_1 HuggingFace
Hermes-3-Llama-3.1-8B q4f32_1 HuggingFace
Phi-3-mini-128k-instruct q0f16 HuggingFace
Phi-3-mini-128k-instruct q4f16_1 HuggingFace
Phi-3-mini-128k-instruct q4f32_1 HuggingFace
Phi-3.5-mini-instruct q0f16 HuggingFace
Phi-3.5-mini-instruct q4f16_0 HuggingFace
Phi-3.5-mini-instruct q4f16_1 HuggingFace
Phi-3.5-mini-instruct q4f32_1 HuggingFace
Phi-3.5-vision-instruct q0f16 HuggingFace
Phi-3.5-vision-instruct q3f16_1 HuggingFace
Phi-3.5-vision-instruct q4f16_1 HuggingFace
Phi-3.5-vision-instruct q4f32_1 HuggingFace
Mistral-7B-Instruct-v0.3 q0f16 HuggingFace
Mistral-7B-Instruct-v0.3 q3f16_1 HuggingFace
Mistral-7B-Instruct-v0.3 q4f16_0 HuggingFace
Mistral-7B-Instruct-v0.3 q4f16_1 HuggingFace
Mistral-7B-Instruct-v0.3 q4f32_1 HuggingFace
Qwen1.5-0.5B-Chat q0f16 HuggingFace
Qwen1.5-0.5B-Chat q4f16_1 HuggingFace
Qwen1.5-0.5B-Chat q4f32_1 HuggingFace
Qwen1.5-1.8B-Chat q0f16 HuggingFace
Qwen1.5-1.8B-Chat q4f16_1 HuggingFace
Qwen1.5-1.8B-Chat q4f32_1 HuggingFace
Qwen2-0.5B-Instruct q0f16 HuggingFace
Qwen2-0.5B-Instruct q0f32 HuggingFace
Qwen2-0.5B-Instruct q4f16_0 HuggingFace
Qwen2-0.5B-Instruct q4f16_1 HuggingFace
Qwen2-0.5B-Instruct q4f32_1 HuggingFace
Qwen2-1.5B-Instruct q0f16 HuggingFace
Qwen2-1.5B-Instruct q4f16_0 HuggingFace
Qwen2-1.5B-Instruct q4f16_1 HuggingFace
Qwen2-1.5B-Instruct q4f32_1 HuggingFace
Qwen2-72B-Instruct q0f16 HuggingFace
Qwen2-72B-Instruct q4f16_1 HuggingFace
Qwen2-72B-Instruct q4f32_1 HuggingFace
Qwen2-7B-Instruct q0f16 HuggingFace
Qwen2-7B-Instruct q4f16_1 HuggingFace
Qwen2-7B-Instruct q4f32_1 HuggingFace
Qwen2-Math-1.5B-Instruct q0f16 HuggingFace
Qwen2-Math-1.5B-Instruct q4f16_1 HuggingFace
Qwen2-Math-1.5B-Instruct q4f32_1 HuggingFace
Qwen2-Math-72B-Instruct q0f16 HuggingFace
Qwen2-Math-72B-Instruct q4f16_1 HuggingFace
Qwen2-Math-72B-Instruct q4f32_1 HuggingFace
Qwen2-Math-7B-Instruct q0f16 HuggingFace
Qwen2-Math-7B-Instruct q4f16_1 HuggingFace
Qwen2-Math-7B-Instruct q4f32_1 HuggingFace
Qwen2.5-0.5B-Instruct q0f16 HuggingFace
Qwen2.5-0.5B-Instruct q0f32 HuggingFace
Qwen2.5-0.5B-Instruct q4f16_1 HuggingFace
Qwen2.5-0.5B-Instruct q4f32_1 HuggingFace
Qwen2.5-1.5B-Instruct q0f16 HuggingFace
Qwen2.5-1.5B-Instruct q4f16_1 HuggingFace
Qwen2.5-1.5B-Instruct q4f32_1 HuggingFace
Qwen2.5-14B-Instruct q0f16 HuggingFace
Qwen2.5-14B-Instruct q4f16_1 HuggingFace
Qwen2.5-14B-Instruct q4f32_1 HuggingFace
Qwen2.5-32B-Instruct q0f16 HuggingFace
Qwen2.5-32B-Instruct q4f16_1 HuggingFace
Qwen2.5-32B-Instruct q4f32_1 HuggingFace
Qwen2.5-3B-Instruct q0f16 HuggingFace
Qwen2.5-3B-Instruct q4f16_1 HuggingFace
Qwen2.5-3B-Instruct q4f32_1 HuggingFace
Qwen2.5-72B-Instruct q0f16 HuggingFace
Qwen2.5-72B-Instruct q4f16_1 HuggingFace
Qwen2.5-72B-Instruct q4f32_1 HuggingFace
Qwen2.5-7B-Instruct q0f16 HuggingFace
Qwen2.5-7B-Instruct q4f16_1 HuggingFace
Qwen2.5-7B-Instruct q4f32_1 HuggingFace
Qwen2.5-Coder-0.5B-Instruct q0f16 HuggingFace
Qwen2.5-Coder-0.5B-Instruct q0f32 HuggingFace
Qwen2.5-Coder-0.5B-Instruct q4f16_0 HuggingFace
Qwen2.5-Coder-0.5B-Instruct q4f16_1 HuggingFace
Qwen2.5-Coder-0.5B-Instruct q4f32_1 HuggingFace
Qwen2.5-Coder-1.5B-Instruct q0f16 HuggingFace
Qwen2.5-Coder-1.5B-Instruct q4f16_1 HuggingFace
Qwen2.5-Coder-1.5B-Instruct q4f32_1 HuggingFace
Qwen2.5-Coder-14B-Instruct q0f16 HuggingFace
Qwen2.5-Coder-14B-Instruct q4f16_1 HuggingFace
Qwen2.5-Coder-14B-Instruct q4f32_1 HuggingFace
Qwen2.5-Coder-32B-Instruct q0f16 HuggingFace
Qwen2.5-Coder-32B-Instruct q4f16_1 HuggingFace
Qwen2.5-Coder-32B-Instruct q4f32_1 HuggingFace
Qwen2.5-Coder-3B-Instruct q0f16 HuggingFace
Qwen2.5-Coder-3B-Instruct q4f16_1 HuggingFace
Qwen2.5-Coder-3B-Instruct q4f32_1 HuggingFace
Qwen2.5-Coder-7B-Instruct q0f16 HuggingFace
Qwen2.5-Coder-7B-Instruct q4f16_1 HuggingFace
Qwen2.5-Coder-7B-Instruct q4f32_1 HuggingFace
Qwen2.5-Math-1.5B-Instruct q0f16 HuggingFace
Qwen2.5-Math-1.5B-Instruct q4f16_1 HuggingFace
Qwen2.5-Math-1.5B-Instruct q4f32_1 HuggingFace
Qwen2.5-Math-72B-Instruct q0f16 HuggingFace
Qwen2.5-Math-72B-Instruct q4f16_1 HuggingFace
Qwen2.5-Math-72B-Instruct q4f32_1 HuggingFace
DeepSeek-V2-Lite-Chat q0f16 HuggingFace
DeepSeek-V2-Lite-Chat q4f16_1 HuggingFace
DeepSeek-V2-Lite-Chat q4f32_1 HuggingFace
Mixtral-8x7B-Instruct-v0.1 q0f16 HuggingFace
Mixtral-8x7B-Instruct-v0.1 q4f16_1 HuggingFace
Mixtral-8x7B-Instruct-v0.1 q4f32_1 HuggingFace
SmolLM-1.7B-Instruct q0f16 HuggingFace
SmolLM-1.7B-Instruct q0f32 HuggingFace
SmolLM-1.7B-Instruct q4f16_1 HuggingFace
SmolLM-1.7B-Instruct q4f32_1 HuggingFace
SmolLM-135M-Instruct q0f16 HuggingFace
SmolLM-135M-Instruct q0f32 HuggingFace
SmolLM-135M-Instruct q4f16_1 HuggingFace
SmolLM-135M-Instruct q4f32_1 HuggingFace
SmolLM-360M-Instruct q0f16 HuggingFace
SmolLM-360M-Instruct q0f32 HuggingFace
SmolLM-360M-Instruct q4f16_1 HuggingFace
SmolLM-360M-Instruct q4f32_1 HuggingFace
SmolLM2-1.7B-Instruct q0f16 HuggingFace
SmolLM2-1.7B-Instruct q4f16_1 HuggingFace
SmolLM2-1.7B-Instruct q4f32_1 HuggingFace
SmolLM2-135M-Instruct q0f16 HuggingFace
SmolLM2-135M-Instruct q0f32 HuggingFace
SmolLM2-135M-Instruct q4f16_1 HuggingFace
SmolLM2-135M-Instruct q4f32_1 HuggingFace
SmolLM2-360M-Instruct q0f16 HuggingFace
SmolLM2-360M-Instruct q0f32 HuggingFace
SmolLM2-360M-Instruct q4f16_1 HuggingFace
SmolLM2-360M-Instruct q4f32_1 HuggingFace
gemma-2-27b-it q0f16 HuggingFace
gemma-2-27b-it q4f16_1 HuggingFace
gemma-2-27b-it q4f32_1 HuggingFace
gemma-2-2b-it q0f16 HuggingFace
gemma-2-2b-it q0f32 HuggingFace
gemma-2-2b-it q4f16_0 HuggingFace
gemma-2-2b-it q4f16_1 HuggingFace
gemma-2-2b-it q4f32_1 HuggingFace
gemma-2-2b-jpn-it q0f16 HuggingFace
gemma-2-2b-jpn-it q0f32 HuggingFace
gemma-2-2b-jpn-it q4f16_1 HuggingFace
gemma-2-2b-jpn-it q4f32_1 HuggingFace
gemma-2-9b-it q0f16 HuggingFace
gemma-2-9b-it q3f16_1 HuggingFace
gemma-2-9b-it q4f16_1 HuggingFace
gemma-2-9b-it q4f32_1 HuggingFace
internlm2_5-1_8b q0f16 HuggingFace
internlm2_5-1_8b q4f16_1 HuggingFace
internlm2_5-1_8b q4f32_1 HuggingFace
internlm2_5-1_8b-chat q0f16 HuggingFace
internlm2_5-1_8b-chat q4f16_1 HuggingFace
internlm2_5-1_8b-chat q4f32_1 HuggingFace
internlm2_5-20b q0f16 HuggingFace
internlm2_5-20b q4f16_1 HuggingFace
internlm2_5-20b q4f32_1 HuggingFace
internlm2_5-20b-chat q0f16 HuggingFace
internlm2_5-20b-chat q4f16_1 HuggingFace
internlm2_5-20b-chat q4f32_1 HuggingFace
internlm2_5-7b q0f16 HuggingFace
internlm2_5-7b q4f16_1 HuggingFace
internlm2_5-7b q4f32_1 HuggingFace
internlm2_5-7b-chat q0f16 HuggingFace
internlm2_5-7b-chat q4f16_1 HuggingFace
internlm2_5-7b-chat q4f32_1 HuggingFace