Synthetic LLM Hosted Models
Chat with open-source models privately
Trinity Mini is a 26B-parameter sparse mixture-of-experts language model with 3B active parameters per token, designed for efficient reasoning and multi-step agent workflows, with a context window of up to 131k tokens.
The model has 128 total experts, 8 of which are active per token, and supports function calling. Trinity Mini is available for free through OpenRouter, with a reported uptime of 100%, a throughput of approximately 213 tokens per second, and a latency of 0.32 seconds.
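To make the "128 experts, 8 active per token" figure concrete, here is a minimal sketch of generic top-k expert routing in plain Python. It is an illustration only: the function name `route_token`, the random router logits, and the softmax-over-selected-experts weighting are assumptions, and nothing here is confirmed to match how Trinity Mini's router is actually implemented.

```python
import math
import random

NUM_EXPERTS = 128  # total experts, taken from the description above
TOP_K = 8          # experts active per token, taken from the description above


def route_token(router_logits, k=TOP_K):
    """Hypothetical helper: return (expert_index, weight) pairs for the k top-scoring experts."""
    # Indices of the k largest router logits.
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i], reverse=True)[:k]
    # Softmax over the selected logits only, so the k weights sum to 1.
    # (Assumed weighting scheme; real routers may normalize differently.)
    peak = max(router_logits[i] for i in top)
    exps = [math.exp(router_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


# Stand-in router scores for a single token; a real model computes these
# from the token's hidden state.
rng = random.Random(0)
logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
selected = route_token(logits)
print(len(selected), "of", NUM_EXPERTS, "experts active for this token")
```

Only the selected experts run for a given token, which is why the number of active parameters (3B) is much smaller than the total (26B).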