Meta

Llama 3.1 8B Instruct Cost Calculator

Estimate your exact monthly API cost for Llama 3.1 8B Instruct. Enter your token volume and see the math — OpenRouter vs Neureus side-by-side.

$0.045 Input /M (Neureus)
$0.045 Output /M (Neureus)
128K Context window
−10% vs OpenRouter
Published pricing
Input (OpenRouter) $0.050/M
Input (Neureus) $0.045/M
Output (OpenRouter) $0.050/M
Output (Neureus) $0.045/M

How much will Llama 3.1 8B Instruct cost?

M
M
Standard cost (OpenRouter)
N Neureus cost
You save per month with Neureus
Cost breakdown
Type Volume Rate (OpenRouter) Total (OpenRouter) Rate (Neureus) Total (Neureus)
Input $0.050/M $0.045/M
Output $0.050/M $0.045/M
Total
Context window
128K tokens
Maximum combined input + output size per request
Max output
4K tokens
Maximum tokens in a single response
Modalities
text, code
Input types supported by this model
Provider
Meta
Access this model through Neureus or directly

Meta's fastest open-source model. Runs at the edge — sub-50ms latency, zero egress fees. Free in Neureus edge tier.

Full model documentation →

Use Llama 3.1 8B Instruct for free.

Free tier includes 5M tokens — no credit card. Access Llama 3.1 8B Instruct and 34 other models through one API key, priced 10% below OpenRouter.