nvidia/Qwen3.5-397B-A17B-NVFP4
Text Generation
Safetensors
Model Optimizer
qwen3_5_moe
nvidia
ModelOpt
Qwen3.5
quantized
FP4
fp4
conversational
modelopt
License: apache-2.0
refs/pr/9
Qwen3.5-397B-A17B-NVFP4
251 GB
3 contributors
History: 8 commits
Latest commit: wangshangsam, "Add vLLM as one of the supported inference engines in the model card." (10fbf11, verified), about 2 months ago
File | Scan | Size | Storage | Last commit | Updated
.gitattributes | Safe | 1.64 kB | - | Upload folder using huggingface_hub | 3 months ago
README.md | - | 6.93 kB | - | Add vLLM as one of the supported inference engines in the model card. | about 2 months ago
chat_template.jinja | Safe | 7.76 kB | - | Upload folder using huggingface_hub | 3 months ago
config.json | Safe | 17 kB | - | Do not quantize shared expert | 2 months ago
generation_config.json | Safe | 244 Bytes | - | Upload folder using huggingface_hub | 3 months ago
hf_quant_config.json | Safe | 11.9 kB | - | Do not quantize shared expert | 2 months ago
model-00001-of-00011.safetensors | Safe | 25 GB | xet | Do not quantize shared expert | 2 months ago
model-00002-of-00011.safetensors | Safe | 25 GB | xet | Do not quantize shared expert | 2 months ago
model-00003-of-00011.safetensors | Safe | 25 GB | xet | Do not quantize shared expert | 2 months ago
model-00004-of-00011.safetensors | Safe | 25 GB | xet | Do not quantize shared expert | 2 months ago
model-00005-of-00011.safetensors | Safe | 25 GB | xet | Do not quantize shared expert | 2 months ago
model-00006-of-00011.safetensors | Safe | 25 GB | xet | Do not quantize shared expert | 2 months ago
model-00007-of-00011.safetensors | Safe | 25 GB | xet | Do not quantize shared expert | 2 months ago
model-00008-of-00011.safetensors | Safe | 25 GB | xet | Do not quantize shared expert | 2 months ago
model-00009-of-00011.safetensors | Safe | 25 GB | xet | Do not quantize shared expert | 2 months ago
model-00010-of-00011.safetensors | - | 25 GB | xet | Do not quantize shared expert | 2 months ago
model-00011-of-00011.safetensors | Safe | 1.15 GB | xet | Do not quantize shared expert | 2 months ago
model.safetensors.index.json | Safe | 41.1 MB | xet | Do not quantize shared expert | 2 months ago
preprocessor_config.json | Safe | 390 Bytes | - | Upload folder using huggingface_hub | 3 months ago
processor_config.json | Safe | 1.3 kB | - | Upload folder using huggingface_hub | 3 months ago
tokenizer.json | Safe | 12.8 MB | xet | Upload folder using huggingface_hub | 3 months ago
tokenizer_config.json | Safe | 16.7 kB | - | Upload folder using huggingface_hub | 3 months ago
video_preprocessor_config.json | Safe | 385 Bytes | - | Upload folder using huggingface_hub | 3 months ago
vocab.json | Safe | 6.72 MB | - | Upload folder using huggingface_hub | 3 months ago