Cloudflare AI

Llama 3.2 3B Cost Calculator

Estimate your exact monthly API cost for Llama 3.2 3B. Enter your token volume and see the math — OpenRouter vs Neureus side-by-side.

$0.046 Input /M (Neureus)
$0.30 Output /M (Neureus)
128K Context window
−10% vs OpenRouter
Published pricing
Input (OpenRouter) $0.051/M
Input (Neureus) $0.046/M
Output (OpenRouter) $0.34/M
Output (Neureus) $0.30/M

How much will Llama 3.2 3B cost?

M
M
Standard cost (OpenRouter)
N Neureus cost
You save per month with Neureus
Cost breakdown
Type Volume Rate (OpenRouter) Total (OpenRouter) Rate (Neureus) Total (Neureus)
Input $0.051/M $0.046/M
Output $0.34/M $0.30/M
Total
Context window
128K tokens
Maximum combined input + output size per request
Max output
4K tokens
Maximum tokens in a single response
Modalities
text
Input types supported by this model
Provider
Cloudflare AI
Access this model through Neureus or directly

Meta's Llama 3.2 3B — ultra-fast edge inference for latency-sensitive applications.

Full model documentation →

Use Llama 3.2 3B for free.

Free tier includes 5M tokens — no credit card. Access Llama 3.2 3B and 34 other models through one API key, priced 10% below OpenRouter.