⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
567,580 results found
Trending
Model Name
Input
Output
Type
openbmb
MiniCPM5-1B
Base
Deploy
nvidia
Orchestrator-8B
Fine-tuned
deepseek-ai
DeepSeek-V4-Pro
numind
NuExtract3
tencent
Hy-MT2-1.8B
Qwen
Qwen3.6-27B
zai-org
GLM-5.1
CohereLabs
command-a-plus-05-2026-w4a4
Quantized
google
gemma-4-31B-it
GLM-5
Qwen3.6-35B-A3B
DeepSeek-V4-Flash
GLM-4.6
meta-llama
Llama-3.1-8B-Instruct
gemma-4-E4B-it
moonshotai
Kimi-K2.6
mistralai
Magistral-Small-2506
HuggingFaceBio
Carbon-3B
skt
A.X-3.1
gemma-4-26B-A4B-it
Hy-MT2-7B
Qwen3-235B-A22B-Thinking-2507
Qwen3-235B-A22B-Instruct-2507
THUDM
GLM-4.1V-9B-Thinking
DeepSeek-R1
Jackrong
Qwopus3.6-27B-v2
0xSero
MiniMax-M2.1-REAP-50-W4A16
black-forest-labs
FLUX.1-dev
Qwen3.5-9B
syvai
cohere-transcribe-diarize
Llama-3.3-70B-Instruct
MiniMaxAI
MiniMax-M2.7
gemma-4-E2B-it
DeepSeek-V3.2
HiDream-ai
HiDream-O1-Image
openai
whisper-large-v3
MiniCPM5-1B-SFT
Qwen3-Coder-Next
gpt-oss-120b
sakamakismile
Qwen3.6-27B-NVFP4
Devstral-Small-2505
Llama-3.2-1B