Serverless LLM Hosting - Featherless.ai
Instantly run any Llama model from HuggingFace without setting up any servers. Over 12,200+ models available. Starting at $10/month for unlimited acce...
Synthetic offers subscription-based access to open-source language models including DeepSeek, Llama, Qwen, and others with context windows ranging from 128k to 256k tokens.
Standard and Pro subscriptions include unlimited access to all always-on models and LoRA fine-tuning capabilities at a flat monthly rate without per-token billing. On-demand models from Hugging Face can be run separately at GPU-based pricing, with embedding models included at no additional cost.