โ† Back to AI Cost Calculator

Cheapest AI Models in 2026

The 25 cheapest AI model APIs ranked by per-token price. Updated May 2026 from live pricing data across 461 models.

Cheapest input price: $0.01 / 1M tokens
Cheapest combined (in+out): $0.03 / 1M tokens
Free tiers available: Ollama Cloud Free, Tencent Lite (free), Baidu Speed (free)
Models tracked: 402 per-token + 20 subscription

The cheapest AI APIs cluster around $0.01–$0.10 per million tokens for input. Chinese providers (Zhipu, Tencent, Alibaba, ByteDance) dominate the budget tier alongside OpenRouter community-hosted models. Most cheap models are open-weight (Llama, Qwen, Mistral); you trade frontier reasoning for affordability.

All prices below are live. Click any model name to jump to the AI Cost Calculator for a full monthly cost breakdown with your usage volume.

Top 25 Cheapest AI Models by Combined Price

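| # | Model | Provider | Type | Input $/1M | Output $/1M | Combined | Context |
|---|-------|----------|------|------------|-------------|----------|---------|
| 1 | GLM-4 Air | Zhipu AI | Direct | $0.01 | $0.01 | $0.03 | 128K |
| 2 | GLM-4 Long | Zhipu AI | Direct | $0.01 | $0.01 | $0.03 | 1M |
| 3 | Hunyuan Standard | Tencent | Direct | $0.01 | $0.01 | $0.03 | 32K |
| 4 | Mistral Nemo | Mistral | OpenRouter | $0.02 | $0.03 | $0.05 | 131K |
| 5 | Llama 3.1 8B | Meta | OpenRouter | $0.02 | $0.05 | $0.07 | 16K |
| 6 | Llama 3 8B | Meta | OpenRouter | $0.03 | $0.04 | $0.07 | 8K |
| 7 | Qwen Turbo | Alibaba Cloud | Direct | $0.02 | $0.06 | $0.08 | 131K |
| 8 | MiniMax M2.5 | MiniMax | Direct | $0.04 | $0.04 | $0.08 | 131K |
| 9 | ABAB 6.5 | MiniMax | Direct | $0.04 | $0.04 | $0.08 | 245K |
| 10 | Llama 3 8B Lunaris | Sao10K | OpenRouter | $0.04 | $0.05 | $0.09 | 8K |
| 11 | Doubao Lite | ByteDance | Direct | $0.04 | $0.08 | $0.12 | 131K |
| 12 | Yi Medium | 01.AI | Direct | $0.04 | $0.08 | $0.12 | 32K |
| 13 | Mixtral 8x7B (Groq) | Groq | Direct | $0.06 | $0.06 | $0.12 | 32K |
| 14 | Gemma 3 4B | Google | OpenRouter | $0.04 | $0.08 | $0.12 | 131K |
| 15 | MythoMax 13B | Gryphe | OpenRouter | $0.06 | $0.06 | $0.12 | 4K |
| 16 | Granite 4.0 Micro | IBM | OpenRouter | $0.02 | $0.11 | $0.13 | 131K |
| 17 | Mistral Small 3 | Mistral | OpenRouter | $0.05 | $0.08 | $0.13 | 32K |
| 18 | Step-1V (Vision) | StepFun | Direct | $0.07 | $0.07 | $0.14 | 32K |
| 19 | Qwen 2.5 7B | Qwen | OpenRouter | $0.04 | $0.10 | $0.14 | 32K |
| 20 | LFM2-24B-A2B | Liquid AI | OpenRouter | $0.03 | $0.12 | $0.15 | 32K |
| 21 | MiniMax M2.7 | MiniMax | Direct | $0.08 | $0.08 | $0.16 | 131K |
| 22 | Qwen-Turbo | Qwen | OpenRouter | $0.03 | $0.13 | $0.16 | 131K |
| 23 | Gemma 3 12B | Google | OpenRouter | $0.04 | $0.13 | $0.17 | 131K |
| 24 | GPT-OSS-20B | OpenAI | OpenRouter | $0.03 | $0.14 | $0.17 | 131K |
| 25 | Qwen3 235B A22B | Qwen | OpenRouter | $0.07 | $0.10 | $0.17 | 262K |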
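
To turn the per-million prices in the table above into a monthly bill, multiply each price by your token volume. The following is a minimal Python sketch using three rows from the table. The function name `monthly_cost` and the workload figures are assumptions chosen for illustration; they are not part of the calculator.

```python
# Prices in USD per 1M tokens (input, output), copied from the table above.
PRICES = {
    "Mistral Nemo": (0.02, 0.03),
    "Qwen Turbo": (0.02, 0.06),
    "GPT-OSS-20B": (0.03, 0.14),
}


def monthly_cost(model, input_tokens, output_tokens):
    """Return the estimated monthly cost in USD for the given token volumes."""
    input_price, output_price = PRICES[model]
    input_cost = (input_tokens / 1_000_000) * input_price
    output_cost = (output_tokens / 1_000_000) * output_price
    return input_cost + output_cost


# Hypothetical workload: 500M input tokens and 100M output tokens per month.
for name in PRICES:
    cost = monthly_cost(name, 500_000_000, 100_000_000)
    print(f"{name}: ${cost:.2f}")
```

Because many workloads send far more input than they receive as output, a ranking by combined price may not match the ranking by actual cost for your usage.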

Free and Subscription Options

Several providers offer completely free tiers, including Ollama Cloud Free, Tencent Lite, and Baidu Speed.

Tips for Keeping AI Costs Low

Use prompt caching. OpenAI and Anthropic discount cached input tokens, by roughly 50% or more depending on the provider and model. If your system prompt or context is repetitive, enable caching in the calculator and set a 50-80% cache hit rate.
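
The effect of caching can be estimated as a blended input price. This is a minimal sketch, assuming a flat 50% discount on cached tokens and no extra charge for writing to the cache; real providers differ on both points. The function name `effective_input_price` and the $1.00 base price are hypothetical.

```python
def effective_input_price(base_price, cache_hit_rate, cached_discount=0.5):
    """Blended input price per 1M tokens for a given cache hit rate.

    cached_discount is the fraction taken off cached tokens (0.5 means 50% off).
    Assumption: cache writes cost the same as ordinary input tokens.
    """
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    cached_price = base_price * (1.0 - cached_discount)
    return cache_hit_rate * cached_price + (1.0 - cache_hit_rate) * base_price


# Hypothetical base price of $1.00 per 1M input tokens.
for hit_rate in (0.5, 0.8):
    price = effective_input_price(1.00, hit_rate)
    print(f"{hit_rate:.0%} hit rate: ${price:.2f} per 1M input tokens")
```

Under these assumptions, a 50% hit rate lowers the blended input price by a quarter and an 80% hit rate lowers it by 40%.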

Keep context short. Per-token billing charges for every token you actually send, not for the size of the model's window, but a massive context window (1M+) makes it easy to pay for context the task does not need. Match the context to your task.

Use batch processing. Anthropic and OpenAI offer a 50% discount on batch requests, with results returned within 24 hours. This suits eval runs, data labeling, and bulk summarization.
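
A rough way to size the batch discount is to ask what share of your spend can tolerate a 24-hour turnaround. This is a minimal sketch, assuming the 50% discount applies uniformly to the batchable share; the function name `cost_with_batching` and the dollar figures are hypothetical.

```python
def cost_with_batching(standard_cost, batch_fraction, batch_discount=0.5):
    """Monthly cost after moving a fraction of the workload to batch requests.

    standard_cost: monthly cost in USD with no batching.
    batch_fraction: share of that cost (0 to 1) that can wait for batch results.
    """
    if not 0.0 <= batch_fraction <= 1.0:
        raise ValueError("batch_fraction must be between 0 and 1")
    return standard_cost * (1.0 - batch_fraction * batch_discount)


# Hypothetical: $200/month at standard rates, 60% of it batchable.
print(f"${cost_with_batching(200.0, 0.6):.2f}")
```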

Prototype with the cheapest model. Use a free or cheap model for experimentation and prototyping, then switch to a frontier model only for final production runs.

Find your cheapest model

Enter your actual usage volume and get a personalized monthly cost comparison across all 461 models.
