OPUS 4.8: $5 / $25 per Mtok
GPT-5.5: $5 / $30 per Mtok
GEMINI 2.5 PRO: $1.25 / $10 per Mtok
SONNET 4.6: $3 / $15 per Mtok
SWE-BENCH leader: GPT-5.5, 68.7%
MMLU-PRO leader: GPT-5.5, 94.2
GPQA leader: GPT-5.5, 78.3
AFTA v1.0 whitepaper live at /whitepaper

Open-Weights Deployment

What you actually need to self-host the major open-weights models: VRAM per quantization (FP16, FP8, AWQ INT4, GGUF Q4_K_M), recommended GPU class, license, and capabilities. This is the “I want to run this myself” companion to /inference-providers, which covers hosted pricing.
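For a rough sense of where per-quantization VRAM numbers come from, the sketch below estimates weight memory from parameter count and bits per weight. The bits-per-weight values and the 20% runtime overhead are illustrative assumptions, not figures taken from this page's data; real requirements also depend on context length, batch size, and the serving runtime.

```python
# Rough VRAM estimate: weights = parameters * bits_per_weight / 8.
# All constants here are assumptions for illustration only.
BITS_PER_WEIGHT = {
    "FP16": 16.0,
    "FP8": 8.0,
    "AWQ INT4": 4.0,      # assumed; ignores per-group scales and zero-points
    "GGUF Q4_K_M": 4.85,  # assumed rough average for a mixed-precision format
}


def weight_memory_gb(params_billion: float, quant: str) -> float:
    """Memory for the weights alone, in GB (10^9 bytes)."""
    bits = BITS_PER_WEIGHT[quant]
    # params_billion * 1e9 params * bits / 8 bytes, divided by 1e9 bytes per GB
    return params_billion * bits / 8


def estimate_vram_gb(params_billion: float, quant: str,
                     overhead_fraction: float = 0.2) -> float:
    """Weights plus an assumed flat allowance for KV cache and activations."""
    return weight_memory_gb(params_billion, quant) * (1 + overhead_fraction)


if __name__ == "__main__":
    for quant in BITS_PER_WEIGHT:
        print(f"70B @ {quant}: ~{estimate_vram_gb(70, quant):.0f} GB")
```

Treat the result as a lower-bound sanity check when reading the table, not as a replacement for the listed values.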

Machine-readable JSON: /api/open-weights

For agents: the same data is served at /api/open-weights. Filter with ?family=Meta|DeepSeek|Alibaba|Mistral|Google|Microsoft. Free, no auth, cached for 10 minutes.
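A minimal client sketch for the endpoint, using only the Python standard library. The host in `BASE_URL` is a hypothetical placeholder (the page gives only the path), the helper names are made up for illustration, and the code assumes one family value per request. The response schema is not documented here, so the parsed JSON is returned as-is.

```python
import json
import urllib.parse
import urllib.request
from typing import Any, Optional

BASE_URL = "https://example.com"  # hypothetical placeholder; substitute the real host
FAMILIES = {"Meta", "DeepSeek", "Alibaba", "Mistral", "Google", "Microsoft"}


def build_url(family: Optional[str] = None, base_url: str = BASE_URL) -> str:
    """Build the /api/open-weights URL, optionally filtered by one family."""
    url = base_url.rstrip("/") + "/api/open-weights"
    if family is None:
        return url
    if family not in FAMILIES:
        raise ValueError(f"unknown family: {family!r}")
    return url + "?" + urllib.parse.urlencode({"family": family})


def fetch_open_weights(family: Optional[str] = None,
                       base_url: str = BASE_URL,
                       timeout: float = 10.0) -> Any:
    """GET the endpoint and return the parsed JSON body (no auth required)."""
    with urllib.request.urlopen(build_url(family, base_url), timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    print(build_url("DeepSeek"))
```

Since responses are cached for 10 minutes, polling more often than that is unlikely to return fresher data.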