Provider 6 & 8 Launch, Flux 2 Series & Major Infrastructure Updates
This update introduces two new providers to the platform with an extensive catalog of models, while addressing ongoing quota challenges across multiple providers. Provider 6 brings Groq-powered high-performance models, and Provider 8 debuts with a comprehensive suite of 36 chat and image generation models. We have also implemented strategic adjustments to Provider 2 and Provider 3 in response to quota limitations.
⚠️ Important Service Notices
Critical Updates Regarding Model Availability:
-
GPT Model Availability: We regret to inform our users that GPT models have been experiencing significant quota exhaustion across multiple providers. Our team is actively working to secure new provider partnerships to restore GPT model access. We appreciate your patience during this transition period and will issue an update as soon as new providers are onboarded.
-
Gemini Model Status: Due to strict API restrictions imposed by Google, Gemini model availability remains limited and should be considered experimental. Quota constraints continue to affect service reliability, and users are advised to have fallback options in place when utilizing Gemini models.
-
Claude Model Stability: We are pleased to report that Claude models remain stable and fully operational as usual. No service disruptions have been reported. However, please note that the newly added Claude models in Provider 8 are from an experimental provider and may not exhibit the same stability as Provider 7.
-
Free Tier Compensation: To mitigate any inconvenience caused by the ongoing quota challenges, we have ensured extensive Free tier access across Provider 6 and Provider 8. We encourage all users to explore these new providers for continued access to high-quality AI models.
🔧 Roo Code Compatibility Notice
XML Function Deprecation in Roo Code:
Roo Code has deprecated the XML function calling format and implemented native function calling exclusively. Users experiencing compatibility issues or unexpected behavior are advised to downgrade to Roo Code version 3.16.x until we release a compatibility fix.
Our development team is actively working on addressing this change, and a fix will be deployed in an upcoming update. We sincerely apologize for any inconvenience this may cause and appreciate your understanding.
Provider 6 - Groq-Powered Model Launch
High-Performance Groq Integration: Provider 6 debuts with 11 models optimized for speed and efficiency through Groq's specialized inference infrastructure.
Free Tier Models (8 models):
-
provider-6/gpt-oss-20b- Free, Basic, Pro, Ultra- Open-source GPT variant with 20B parameters
- Optimized for general-purpose chat and reasoning tasks
-
provider-6/compound-mini- Free, Basic, Pro, Ultra- Compact compound reasoning model
- Efficient multi-step problem solving
-
provider-6/llama-4-scout-17b-16e-instruct- Free, Basic, Pro, Ultra- Meta's 17B Scout variant with 16 experts
- Advanced instruction following capabilities
-
provider-6/llama-3.1-8b-instant- Free, Basic, Pro, Ultra- Ultra-fast 8B Llama variant
- Optimized for low-latency applications
-
provider-6/compound- Free, Basic, Pro, Ultra- Full compound reasoning engine
- Complex multi-hop reasoning support
-
provider-6/llama-3.3-70b-versatile- Free, Basic, Pro, Ultra- 70B versatile flagship model
- Comprehensive general-purpose capabilities
-
provider-6/qwen3-32b- Free, Basic, Pro, Ultra- Qwen3 32B instruction-tuned model
- Strong multilingual and coding performance
-
provider-6/llama-4-maverick-17b-128e-instruct- Free, Basic, Pro, Ultra- Meta's 17B Maverick with 128 experts
- Maximum Mixture-of-Experts configuration
Basic Tier and Above (3 models):
-
provider-6/kimi-k2-instruct-0905- Basic, Pro, Ultra- Moonshot AI's K2 September checkpoint
- Enhanced instruction following
-
provider-6/kimi-k2-instruct- Basic, Pro, Ultra- Moonshot AI's K2 flagship instruct model
- Advanced reasoning and analysis
-
provider-6/gpt-oss-120b- Basic, Pro, Ultra- Large-scale 120B open-source GPT variant
- Premium reasoning capabilities
Provider 8 - Comprehensive Model Suite Launch
New Multi-Purpose Provider: Provider 8 introduces 36 models spanning chat, reasoning, and image generation, offering extensive Free tier access.
Chat & Reasoning Models (27 models)
Free Tier Models (20 models):
-
provider-8/gpt-oss-120b- Free, Basic, Pro, Ultra- Large-scale 120B open-source GPT variant
-
provider-8/gpt-oss-20b- Free, Basic, Pro, Ultra- Efficient 20B open-source GPT variant
-
provider-8/gemini-2.0-flash- Free, Basic, Pro, Ultra- Google's fast Gemini 2.0 variant
- Optimized for speed and efficiency
-
provider-8/kimi-k2-0905- Free, Basic, Pro, Ultra- Moonshot AI K2 September checkpoint
-
provider-8/kimi-k2- Free, Basic, Pro, Ultra- Moonshot AI K2 flagship model
-
provider-8/char- Free, Basic, Pro, Ultra- Character-focused conversational model
-
provider-8/kimi-k2-thinking- Free, Basic, Pro, Ultra- K2 with enhanced reasoning capabilities
-
provider-8/deepseek-v3- Free, Basic, Pro, Ultra- DeepSeek V3 flagship model
- Advanced general-purpose capabilities
-
provider-8/deepseek-terminus- Free, Basic, Pro, Ultra- DeepSeek Terminus variant
- Specialized hybrid reasoning
-
provider-8/seed-rp- Free, Basic, Pro, Ultra- Seed roleplay-optimized model
-
provider-8/llama-4-scout- Free, Basic, Pro, Ultra- Meta's Llama 4 Scout variant
-
provider-8/llama-4-maverick- Free, Basic, Pro, Ultra- Meta's Llama 4 Maverick variant
-
provider-8/glm-4.5-air- Free, Basic, Pro, Ultra- Zhipu AI GLM 4.5 efficient variant
-
provider-8/glm-4.5- Free, Basic, Pro, Ultra- Zhipu AI GLM 4.5 flagship
-
provider-8/glm-4.6- Free, Basic, Pro, Ultra- Zhipu AI GLM 4.6 model
-
provider-8/glm-4.6-thinking- Free, Basic, Pro, Ultra- GLM 4.6 with thinking capabilities
-
provider-8/glm-4.7- Free, Basic, Pro, Ultra- Zhipu AI GLM 4.7 latest model
-
provider-8/glm-4.7-thinking- Free, Basic, Pro, Ultra- GLM 4.7 with enhanced reasoning
-
provider-8/glm-4.6v-thinking- Free, Basic, Pro, Ultra- GLM 4.6 vision-thinking variant
-
provider-8/qwen3-235b- Free, Basic, Pro, Ultra- Qwen3 235B flagship model
- Maximum-scale Qwen architecture
Basic Tier and Above (5 models):
-
provider-8/mimo-v2-flash- Basic, Pro, Ultra- Fast mimo variant for efficient inference
-
provider-8/gemini-3-flash- Basic, Pro, Ultra- Google's Gemini 3 Flash variant
- Next-generation speed optimization
-
provider-8/claude-sonnet-4.5- Basic, Pro, Ultra- Anthropic's Claude Sonnet 4.5
- ⚠️ Experimental provider - may have reduced stability compared to Provider 7
-
provider-8/qwen3-next-80b-a3b-instruct- Basic, Pro, Ultra- Qwen3 Next 80B with 3B active parameters
-
provider-8/mistral-small-creative- Basic, Pro, Ultra- Mistral AI creative-optimized variant
Pro Tier and Above (2 models):
-
provider-8/gemini-3-pro- Pro, Ultra- Google's Gemini 3 Pro flagship
- Maximum capability Gemini variant
-
provider-8/grok-4.1-fast-non-reasoning- Pro, Ultra- xAI Grok 4.1 optimized for speed
- Non-reasoning variant for rapid responses
Image Generation Models (9 models)
All Tiers - Free, Basic, Pro, Ultra:
-
- Creative fire and frost themed generation
-
- Versatile image generation model
-
- Efficient compact image generator
-
- Doubao Seedream 4.5 image model
- Enhanced dream-style synthesis
-
- Google Imagen 3 image generation
-
- Google Imagen 4 latest generation
- Maximum quality image synthesis
-
- Black Forest Labs Flux 2 Pro
- Professional-grade generation
-
- Flux 2 Flex configurable variant
- Maximum parameter flexibility
-
- OpenAI GPT Image 1.5 model
- Enhanced realistic generation
Provider 2 - Flux 2 Image Generation Suite
Flux 2 Series Retained: Despite quota exhaustion affecting GPT and Gemini models, the Flux 2 image generation suite remains fully operational and serves as the cornerstone of Provider 2.
⚠️ Models Removed Due to Quota Exhaustion:
- All GPT models previously available on Provider 2 have been removed due to complete quota exhaustion
- All Gemini models previously available on Provider 2 have been removed due to complete quota exhaustion
Active Flux 2 Models:
-
provider-2/flux-2-dev- Pro & Ultra Tiers- Development-grade model with maximum configurability
- Generation Config: Steps: 28, Guidance Scale: 3.5
- Price: $0.025 per image
- Ideal for iterative creative workflows and prototyping
-
provider-2/flux-2-pro- Pro & Ultra Tiers- Professional-grade model with integrated safety controls
- Generation Config: Safety Tolerance: 2
- Price: $0.05 per image
- Optimized for production-ready outputs with content moderation
-
provider-2/flux-2-flex- Pro & Ultra Tiers- Most configurable model supporting all generation parameters
- Generation Config: Steps: 28, Guidance Scale: 3.5, Safety Tolerance: 2
- Price: $0.035 per image
- Maximum flexibility for advanced users requiring full parameter control
-
provider-2/flux-2-max- Pro & Ultra Tiers- Maximum quality model for premium image generation
- Generation Config: Safety Tolerance: 2
- Price: $0.06 per image
- Highest fidelity outputs for professional creative applications
The Flux 2 series represents the most important addition to Provider 2 and helps sustain the provider's value proposition for image generation workflows.
Provider 3 - Service Status Update
Gemini Models Temporarily Unavailable:
We regret to inform our users that Gemini models on Provider 3 have become temporarily unavailable due to provider-side resource constraints. However, GPT models on Provider 3 remain fully operational.
- Gemini Models: Currently unavailable. The provider has indicated that an update is forthcoming, and we anticipate Gemini services to be restored in a future release.
- GPT Models: Fully operational with no reported issues.
We will issue an update as soon as Provider 3 restores Gemini model availability.
Backend Optimizations
Dependency Cleanup: Removed unused aioredis dependency from the main application module, streamlining the codebase and reducing unnecessary imports. This optimization improves application startup time and reduces memory footprint without affecting functionality.
Platform Impact
- New Provider Expansion: Provider 6 and Provider 8 significantly expand the platform's model catalog with 47 new models across chat, reasoning, and image generation
- Free Tier Enhancement: Extensive Free tier access across new providers ensures continued AI access despite quota challenges on legacy providers
- Image Generation Strength: Flux 2 series on Provider 2 combined with 9 image models on Provider 8 provides comprehensive image generation capabilities
- Groq Performance: Provider 6's Groq integration delivers optimized inference speeds for latency-sensitive applications
- Multi-Provider Redundancy: New providers offer alternative access paths for models affected by quota exhaustion on other providers
Important Notes
- GPT Model Restoration: Our team is actively seeking new provider partnerships to restore GPT model availability. Updates will be communicated as soon as new sources are secured.
- Gemini Experimental Status: All Gemini models should be considered experimental due to Google's strict API quota restrictions. Please plan accordingly with fallback options.
- Claude Provider 8 Notice: Claude Sonnet 4.5 on Provider 8 is from an experimental provider. For maximum stability, we recommend using Provider 7 for Claude models.
- Free Tier Compensation: Provider 6 and Provider 8 offer extensive Free tier access to compensate for quota-related disruptions. We encourage users to explore these new options.
- Roo Code Users: If experiencing issues with the new native function calling format, please downgrade to Roo Code version 3.16.x. A compatibility fix is in development.
- Flux 2 Pricing: Flux 2 models use per-image pricing ranging from $0.025 to $0.06 based on model tier and output quality requirements.