Free/Pay-as-you-go
$0/month

- Pay only for usage
- No recurring monthly charge

Hobby
$5/month

- 250 daily requests
- Access to all models

Pro
$10/month

- All Hobby benefits
- 1,000 daily requests
- Priority Support


Large Language Model (LLM) Pricing
| Model | Quantization | Context / Max output (tokens) | $ / M input tokens | $ / M output tokens | Speed |
| --- | --- | --- | --- | --- | --- |
| kimi-k2-thinking | Q4_0 | 262,144 / 262,144 | $0.40 | $1.50 | ~91 t/s |
| kimi-k2-thinking-turbo | fp8 | 262,144 / 262,144 | $1.00 | $3.00 | ~42 t/s |
| deepseek-v3.2 | Q4_0 | 163,840 / 163,840 | $0.28 | $0.38 | ~3 t/s |
| deepseek-v3.2-chat | fp8 | 163,840 / 163,840 | $0.28 | $0.38 | ~50 t/s |
| deepseek-v3.2-precision | fp8 | 163,840 / 163,840 | $0.35 | $0.45 | ~63 t/s |
| deepseek-v3.2-speciale | fp8 | 163,840 / 163,840 | $0.35 | $0.45 | ~61 t/s |
| devstral-2 | fp8 | 262,144 / 262,144 | $0.07 | $0.31 | ~45 t/s |
| kimi-k2-0905 | Q8_0 | 131,072 / 131,072 | $0.15 | $0.55 | ~613 t/s |
| kimi-k2-0905-turbo | fp8 | 131,072 / 8,192 | $0.35 | $1.00 | ~998 t/s |
| glm-4.6-turbo | fp8 | 202,752 / 202,752 | $0.50 | $2.25 | ~98 t/s |
| minimax-m2 | fp8 | 196,608 / 196,608 | $0.00 | $0.00 | ~229 t/s |
| qwen3-coder:free | fp8 | 256,000 / 256,000 | $0.00 | $0.00 | ~123 t/s |
| intellect-3 | Q8_0 | 128,000 / 128,000 | $0.15 | $1.00 | ~119 t/s |
| kimi-k2-eco | Q2_k | 131,072 / 131,072 | $0.05 | $0.10 | ~6 t/s |
| glm-4.6 | fp8 | 131,072 / 131,072 | $0.30 | $0.60 | ~94 t/s |
| glm-4.5 | fp8 | 131,072 / 131,072 | $0.20 | $0.40 | ~124 t/s |
| ring-1t | Q4_0 | 131,072 / 131,072 | $0.40 | $1.00 | ~184 t/s |
| deepseek-v3.2-exp | Q4_0 | 131,072 / 131,072 | $0.15 | $0.30 | ~19 t/s |
| deepseek-v3.1-terminus | Q4_0 | 131,072 / 131,072 | $0.20 | $0.50 | ~0 t/s |
| deepseek-v3.1-terminus-reasoner | Q4_0 | 131,072 / 131,072 | $0.20 | $0.50 | ~340 t/s |
| deepseek-v3.1 | Q4_0 | 131,072 / 131,072 | $0.15 | $0.50 | ~18 t/s |
| deepseek-v3.1-reasoner | Q4_0 | 131,072 / 131,072 | $0.15 | $0.50 | ~340 t/s |
| deepseek-v3-0324 | Q4_0 | 131,072 / 8,192 | $0.20 | $0.25 | ~219 t/s |
| deepseek-v3-0324-turbo | Q4_0 | 131,072 / 8,192 | $0.50 | $1.00 | ~341 t/s |
| deepseek-r1-0528 | Q4_0 | 131,072 / 131,072 | $0.25 | $0.25 | ~62 t/s |
| deepseek-r1-0528-turbo | Q4_0 | 131,072 / 131,072 | $1.00 | $2.00 | ~49 t/s |
| qwen3-next-80b-a3b-instruct | Q8_0 | 262,144 / 262,144 | $0.08 | $0.38 | ~300 t/s |
| qwen3-235b-a22b-2507-instruct | Q8_0 | 131,072 / 131,072 | $0.10 | $0.25 | ~405 t/s |
| qwen3-235b-a22b-2507-thinking | Q8_0 | 131,072 / 131,072 | $0.10 | $0.30 | ~333 t/s |
| qwen3-coder | fp8 | 131,072 / 131,072 | $0.15 | $0.35 | ~141 t/s |
| qwen3-coder-turbo | fp8 | 131,072 / 131,072 | $0.20 | $0.50 | ~375 t/s |
| gpt-oss-120b | Q4_0 | 131,072 / 131,072 | $0.07 | $0.27 | ~170 t/s |
| gpt-oss-safeguard-120b | Q8_0 | 131,072 / 131,072 | $0.07 | $0.27 | ~299 t/s |
| gemma-3-27b-it | Q8_0 | 131,072 / 131,072 | $0.04 | $0.10 | ~128 t/s |
| llama-4-scout | fp8 | 262,144 / 16,384 (vision) | $0.08 | $0.40 | ~804 t/s |
| llama3.3-70b | fp8 | 131,072 / 8,192 | $0.12 | $0.20 | ~574 t/s |
| deepseek-r1-distill-llama-70b | fp8 | 65,536 / 65,536 | $0.10 | $0.10 | ~162 t/s |
| deepseek-r1-distill-qwen-32b | fp8 | 65,536 / 65,536 | $0.10 | $0.10 | ~164 t/s |
| stok-0.4.1 | stok | 2,048 / 2,048 | $0.00 | $0.00 | ~3697 t/s |
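
Because prices are quoted per million tokens, the cost of a single request is easy to estimate from its token counts. The sketch below is only an illustration: the `PRICES` dictionary and the `estimate_cost` helper are hypothetical names, the three entries are copied from the prices listed above, and it assumes billing is strictly linear in tokens with no minimums, rounding, or plan discounts.

```python
from decimal import Decimal

# USD per million tokens as (input, output), copied from the price list above.
# Only a few models are included here for illustration.
PRICES = {
    "kimi-k2-thinking": (Decimal("0.40"), Decimal("1.50")),
    "glm-4.6": (Decimal("0.30"), Decimal("0.60")),
    "gemma-3-27b-it": (Decimal("0.04"), Decimal("0.10")),
}

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request.

    Assumes cost scales linearly with token counts (an assumption,
    not something the price list states explicitly).
    """
    price_in, price_out = PRICES[model]
    total = price_in * input_tokens + price_out * output_tokens
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # Example: a 12,000-token prompt with a 2,000-token reply.
    for name in PRICES:
        print(name, estimate_cost(name, 12_000, 2_000))
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift when summed over many requests.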