Inference Providers
Active filters: ModelOpt
nvidia/Qwen3-235B-A22B-Eagle3
Text Generation
• 0.3B • Updated • 246
• 12
NVFP4/Qwen3-235B-A22B-Thinking-2507-FP4
Text Generation
• 118B • Updated • 8
• 2
BitPhinix/DeepSeek-V3-0324-FP4
Text Generation
• 397B • Updated • 1
NVFP4/Qwen3-30B-A3B-Instruct-2507-FP4
Text Generation
• 16B • Updated • 546
• 12
NVFP4/Qwen3-30B-A3B-Thinking-2507-FP4
Text Generation
• 16B • Updated • 1.35k
• 4
NVFP4/Qwen3-Coder-30B-A3B-Instruct-FP4
Text Generation
• 16B • Updated • 7.29k
• 27
Text Generation
• 0.4B • Updated • 337
• 2
nvidia/gpt-oss-120b-Eagle3-long-context
Text Generation
• 0.2B • Updated • 20.4k
• 70
jonlizardo/affine-gpt-oss-120b-light
Text Generation
• 0.2B • Updated • 4
nvidia/Qwen3-235B-A22B-Thinking-2507-Eagle3
Text Generation
• 0.3B • Updated • 100
• 1
nvidia/Qwen3-30B-A3B-Thinking-2507-Eagle3
Text Generation
• 0.1B • Updated • 176
• 3
nvidia/Phi-4-multimodal-instruct-NVFP4
4B • Updated • 9.32k
• 11
nvidia/Phi-4-multimodal-instruct-FP8
6B • Updated • 690
• 7
nvidia/Phi-4-reasoning-plus-FP8
15B • Updated • 168
• 6
nvidia/Phi-4-reasoning-plus-NVFP4
8B • Updated • 819
• 9
Text Generation
• 5B • Updated • 62.4k
• 17
Text Generation
• 8B • Updated • 22.7k
• 5
Text Generation
• 8B • Updated • 19.7k
• 11
Text Generation
• 15B • Updated • 1.73k
• 5
Text Generation
• 17B • Updated • 60.6k
• 16
nvidia/Qwen2.5-VL-7B-Instruct-FP8
Text Generation
• 8B • Updated • 1.35k
• 8
nvidia/Qwen2.5-VL-7B-Instruct-NVFP4
Text Generation
• 5B • Updated • 235k
• 15
nvidia/gpt-oss-120b-Eagle3-short-context
Text Generation
• Updated • 3.16k
• 16
nvidia/Llama-3.3-70B-Instruct-Eagle3
Text Generation
• Updated • 128
• 2
nvidia/DeepSeek-V3.1-NVFP4
Text Generation
• 394B • Updated • 16.8k
• 16
nvidia/Kimi-K2-Thinking-NVFP4
Text Generation
• Updated • 11.9k
• 30
nvidia/gpt-oss-120b-Eagle3-throughput
Text Generation
• Updated • 916
• 34
nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4
Text Generation
• Updated • 12.9k
• 39
nvidia/Qwen3-235B-A22B-Thinking-2507-FP4-Eagle3
Text Generation
• Updated • 54
nvidia/Qwen3-VL-235B-A22B-Instruct-NVFP4
119B • Updated • 832
• 3