v19.5.0 • December 24, 2025

Provider 6 & 8 Launch, Flux 2 Series & Major Infrastructure Updates

This update introduces two new providers to the platform with an extensive catalog of models, while addressing ongoing quota challenges across multiple providers. Provider 6 brings Groq-powered high-performance models, and Provider 8 debuts with a comprehensive suite of 36 chat and image generation models. We have also implemented strategic adjustments to Provider 2 and Provider 3 in response to quota limitations.

⚠️ Important Service Notices

Critical Updates Regarding Model Availability:

GPT Model Availability: We regret to inform our users that GPT models have been experiencing significant quota exhaustion across multiple providers. Our team is actively working to secure new provider partnerships to restore GPT model access. We appreciate your patience during this transition period and will issue an update as soon as new providers are onboarded.
Gemini Model Status: Due to strict API restrictions imposed by Google, Gemini model availability remains limited and should be considered experimental. Quota constraints continue to affect service reliability, and users are advised to keep fallback options in place when using Gemini models.
Claude Model Stability: We are pleased to report that Claude models remain stable and fully operational, with no service disruptions reported. However, please note that the newly added Claude model on Provider 8 (claude-sonnet-4.5) comes from an experimental provider and may not be as stable as the Claude models on Provider 7.
Free Tier Compensation: To mitigate any inconvenience caused by the ongoing quota challenges, we have ensured extensive Free tier access across Provider 6 and Provider 8. We encourage all users to explore these new providers for continued access to high-quality AI models.
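
The fallback advice above can be sketched in client code. This is a minimal illustration, not the platform's API: the `send` callable, the `ModelUnavailable` exception, and the chosen fallback order are all assumptions made for the example, and `fake_send` merely stands in for a real client.

```python
from typing import Callable, Optional, Sequence, Tuple


class ModelUnavailable(Exception):
    """Hypothetical error raised when a model is out of quota or offline."""


def complete_with_fallback(
    prompt: str,
    models: Sequence[str],
    send: Callable[[str, str], str],
) -> Tuple[str, str]:
    """Try each model in order; return (model_used, reply) from the first success."""
    last_error: Optional[Exception] = None
    for model in models:
        try:
            return model, send(model, prompt)
        except ModelUnavailable as exc:
            last_error = exc  # remember the failure and try the next model
    raise RuntimeError(f"all models failed: {list(models)}") from last_error


# Stand-in for a real API client (assumption): Gemini is treated as unavailable.
def fake_send(model: str, prompt: str) -> str:
    if "gemini" in model:
        raise ModelUnavailable(model)
    return f"[{model}] reply to: {prompt}"


used, reply = complete_with_fallback(
    "Hello",
    ["provider-8/gemini-2.0-flash", "provider-6/llama-3.3-70b-versatile"],
    fake_send,
)
print(used)
print(reply)
```

Keeping the model order as plain data makes it easy to reorder or extend the fallback chain as provider availability changes.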

🔧 Roo Code Compatibility Notice

XML Function Deprecation in Roo Code:

Roo Code has deprecated its XML function calling format and now uses native function calling exclusively. Users who experience compatibility issues or unexpected behavior are advised to downgrade to Roo Code version 3.16.x until we release a compatibility fix.

Our development team is actively working on addressing this change, and a fix will be deployed in an upcoming update. We sincerely apologize for any inconvenience this may cause and appreciate your understanding.

Provider 6 - Groq-Powered Model Launch

High-Performance Groq Integration: Provider 6 debuts with 11 models optimized for speed and efficiency through Groq's specialized inference infrastructure.

Free Tier Models (8 models):

provider-6/gpt-oss-20b - Free, Basic, Pro, Ultra
- Open-source GPT variant with 20B parameters
- Optimized for general-purpose chat and reasoning tasks
provider-6/compound-mini - Free, Basic, Pro, Ultra
- Compact compound reasoning model
- Efficient multi-step problem solving
provider-6/llama-4-scout-17b-16e-instruct - Free, Basic, Pro, Ultra
- Meta's 17B Scout variant with 16 experts
- Advanced instruction following capabilities
provider-6/llama-3.1-8b-instant - Free, Basic, Pro, Ultra
- Ultra-fast 8B Llama variant
- Optimized for low-latency applications
provider-6/compound - Free, Basic, Pro, Ultra
- Full compound reasoning engine
- Complex multi-hop reasoning support
provider-6/llama-3.3-70b-versatile - Free, Basic, Pro, Ultra
- 70B versatile flagship model
- Comprehensive general-purpose capabilities
provider-6/qwen3-32b - Free, Basic, Pro, Ultra
- Qwen3 32B instruction-tuned model
- Strong multilingual and coding performance
provider-6/llama-4-maverick-17b-128e-instruct - Free, Basic, Pro, Ultra
- Meta's 17B Maverick with 128 experts
- Maximum Mixture-of-Experts configuration

Basic Tier and Above (3 models):

provider-6/kimi-k2-instruct-0905 - Basic, Pro, Ultra
- Moonshot AI's K2 September checkpoint
- Enhanced instruction following
provider-6/kimi-k2-instruct - Basic, Pro, Ultra
- Moonshot AI's K2 flagship instruct model
- Advanced reasoning and analysis
provider-6/gpt-oss-120b - Basic, Pro, Ultra
- Large-scale 120B open-source GPT variant
- Premium reasoning capabilities
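
The tier labels after each model ID list every plan that can use the model, which amounts to a minimum-tier rule. The sketch below illustrates that rule under the assumption that tiers are ordered Free < Basic < Pro < Ultra. The `MIN_TIER` table holds only a few entries copied from the list above, and the `can_access` helper is hypothetical rather than part of any platform SDK.

```python
TIER_ORDER = ["Free", "Basic", "Pro", "Ultra"]  # assumed ordering, lowest to highest

# Minimum tier per model, copied from a few entries in the list above.
MIN_TIER = {
    "provider-6/gpt-oss-20b": "Free",
    "provider-6/llama-3.3-70b-versatile": "Free",
    "provider-6/kimi-k2-instruct": "Basic",
    "provider-6/gpt-oss-120b": "Basic",
}


def can_access(user_tier: str, model: str) -> bool:
    """Return True if user_tier is at or above the model's minimum tier."""
    return TIER_ORDER.index(user_tier) >= TIER_ORDER.index(MIN_TIER[model])


print(can_access("Free", "provider-6/gpt-oss-20b"))   # True
print(can_access("Free", "provider-6/gpt-oss-120b"))  # False
print(can_access("Pro", "provider-6/gpt-oss-120b"))   # True
```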

Provider 8 - Comprehensive Model Suite Launch

New Multi-Purpose Provider: Provider 8 introduces 36 models spanning chat, reasoning, and image generation, offering extensive Free tier access.

Chat & Reasoning Models (27 models)

Free Tier Models (20 models):

provider-8/gpt-oss-120b - Free, Basic, Pro, Ultra
- Large-scale 120B open-source GPT variant
provider-8/gpt-oss-20b - Free, Basic, Pro, Ultra
- Efficient 20B open-source GPT variant
provider-8/gemini-2.0-flash - Free, Basic, Pro, Ultra
- Google's fast Gemini 2.0 variant
- Optimized for speed and efficiency
provider-8/kimi-k2-0905 - Free, Basic, Pro, Ultra
- Moonshot AI K2 September checkpoint
provider-8/kimi-k2 - Free, Basic, Pro, Ultra
- Moonshot AI K2 flagship model
provider-8/char - Free, Basic, Pro, Ultra
- Character-focused conversational model
provider-8/kimi-k2-thinking - Free, Basic, Pro, Ultra
- K2 with enhanced reasoning capabilities
provider-8/deepseek-v3 - Free, Basic, Pro, Ultra
- DeepSeek V3 flagship model
- Advanced general-purpose capabilities
provider-8/deepseek-terminus - Free, Basic, Pro, Ultra
- DeepSeek Terminus variant
- Specialized hybrid reasoning
provider-8/seed-rp - Free, Basic, Pro, Ultra
- Seed roleplay-optimized model
provider-8/llama-4-scout - Free, Basic, Pro, Ultra
- Meta's Llama 4 Scout variant
provider-8/llama-4-maverick - Free, Basic, Pro, Ultra
- Meta's Llama 4 Maverick variant
provider-8/glm-4.5-air - Free, Basic, Pro, Ultra
- Zhipu AI GLM 4.5 efficient variant
provider-8/glm-4.5 - Free, Basic, Pro, Ultra
- Zhipu AI GLM 4.5 flagship
provider-8/glm-4.6 - Free, Basic, Pro, Ultra
- Zhipu AI GLM 4.6 model
provider-8/glm-4.6-thinking - Free, Basic, Pro, Ultra
- GLM 4.6 with thinking capabilities
provider-8/glm-4.7 - Free, Basic, Pro, Ultra
- Zhipu AI GLM 4.7 latest model
provider-8/glm-4.7-thinking - Free, Basic, Pro, Ultra
- GLM 4.7 with enhanced reasoning
provider-8/glm-4.6v-thinking - Free, Basic, Pro, Ultra
- GLM 4.6 vision-thinking variant
provider-8/qwen3-235b - Free, Basic, Pro, Ultra
- Qwen3 235B flagship model
- Maximum-scale Qwen architecture

Basic Tier and Above (5 models):

provider-8/mimo-v2-flash - Basic, Pro, Ultra
- Fast mimo variant for efficient inference
provider-8/gemini-3-flash - Basic, Pro, Ultra
- Google's Gemini 3 Flash variant
- Next-generation speed optimization
provider-8/claude-sonnet-4.5 - Basic, Pro, Ultra
- Anthropic's Claude Sonnet 4.5
- ⚠️ Experimental provider - may have reduced stability compared to Provider 7
provider-8/qwen3-next-80b-a3b-instruct - Basic, Pro, Ultra
- Qwen3 Next 80B with 3B active parameters
provider-8/mistral-small-creative - Basic, Pro, Ultra
- Mistral AI creative-optimized variant

Pro Tier and Above (2 models):

provider-8/gemini-3-pro - Pro, Ultra
- Google's Gemini 3 Pro flagship
- Maximum capability Gemini variant
provider-8/grok-4.1-fast-non-reasoning - Pro, Ultra
- xAI Grok 4.1 optimized for speed
- Non-reasoning variant for rapid responses

Image Generation Models (9 models)

All Tiers - Free, Basic, Pro, Ultra:

provider-8/firefrost
- Creative fire and frost themed generation
provider-8/z-image
- Versatile image generation model
provider-8/nano-banana-pro
- Efficient compact image generator
provider-8/seedream-4.5
- Doubao Seedream 4.5 image model
- Enhanced dream-style synthesis
provider-8/imagen-3
- Google Imagen 3 image generation
provider-8/imagen-4
- Google Imagen 4 latest generation
- Maximum quality image synthesis
provider-8/flux-2-pro
- Black Forest Labs Flux 2 Pro
- Professional-grade generation
provider-8/flux-2-flex
- Flux 2 Flex configurable variant
- Maximum parameter flexibility
provider-8/gpt-image-1.5
- OpenAI GPT Image 1.5 model
- Enhanced realistic generation

Provider 2 - Flux 2 Image Generation Suite

Flux 2 Series Retained: Despite quota exhaustion affecting GPT and Gemini models, the Flux 2 image generation suite remains fully operational and serves as the cornerstone of Provider 2.

⚠️ Models Removed Due to Quota Exhaustion:

All GPT models previously available on Provider 2 have been removed due to complete quota exhaustion
All Gemini models previously available on Provider 2 have been removed due to complete quota exhaustion

Active Flux 2 Models:

provider-2/flux-2-dev - Pro & Ultra Tiers
- Development-grade model with maximum configurability
- Generation Config: Steps: 28, Guidance Scale: 3.5
- Price: $0.025 per image
- Ideal for iterative creative workflows and prototyping
provider-2/flux-2-pro - Pro & Ultra Tiers
- Professional-grade model with integrated safety controls
- Generation Config: Safety Tolerance: 2
- Price: $0.05 per image
- Optimized for production-ready outputs with content moderation
provider-2/flux-2-flex - Pro & Ultra Tiers
- Most configurable model supporting all generation parameters
- Generation Config: Steps: 28, Guidance Scale: 3.5, Safety Tolerance: 2
- Price: $0.035 per image
- Maximum flexibility for advanced users requiring full parameter control
provider-2/flux-2-max - Pro & Ultra Tiers
- Maximum quality model for premium image generation
- Generation Config: Safety Tolerance: 2
- Price: $0.06 per image
- Highest fidelity outputs for professional creative applications

With the GPT and Gemini models removed, the Flux 2 series is what sustains Provider 2's value for image generation workflows.
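
To budget a batch job against the per-image prices listed above, a rough estimate can be computed as follows. This sketch assumes billing is strictly linear per image with no volume discounts or minimum charges, which the notes above do not state either way. The `batch_cost` helper is illustrative.

```python
from decimal import Decimal

# Per-image prices in USD, taken from the Provider 2 list above.
PRICE_PER_IMAGE = {
    "provider-2/flux-2-dev": Decimal("0.025"),
    "provider-2/flux-2-flex": Decimal("0.035"),
    "provider-2/flux-2-pro": Decimal("0.05"),
    "provider-2/flux-2-max": Decimal("0.06"),
}


def batch_cost(model: str, images: int) -> Decimal:
    """Estimated cost in USD for generating `images` images with `model`."""
    if images < 0:
        raise ValueError("images must be non-negative")
    # Assumption: cost scales linearly with the number of images.
    return PRICE_PER_IMAGE[model] * images


for model in PRICE_PER_IMAGE:
    print(f"{model}: ${batch_cost(model, 200)} for 200 images")
```

`Decimal` is used instead of `float` so that currency amounts such as 0.035 are represented exactly.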

Provider 3 - Service Status Update

Gemini Models Temporarily Unavailable:

We regret to inform our users that Gemini models on Provider 3 have become temporarily unavailable due to provider-side resource constraints. However, GPT models on Provider 3 remain fully operational.

Gemini Models: Currently unavailable. The provider has indicated that an update is forthcoming, and we anticipate Gemini services to be restored in a future release.
GPT Models: Fully operational with no reported issues.

We will issue an update as soon as Provider 3 restores Gemini model availability.

Backend Optimizations

Dependency Cleanup: Removed the unused aioredis dependency from the main application module, which streamlines the codebase and eliminates an unnecessary import. This reduces application startup time and memory footprint without affecting functionality.

Platform Impact

New Provider Expansion: Provider 6 and Provider 8 significantly expand the platform's model catalog with 47 new models across chat, reasoning, and image generation
Free Tier Enhancement: Extensive Free tier access across new providers ensures continued AI access despite quota challenges on legacy providers
Image Generation Strength: Flux 2 series on Provider 2 combined with 9 image models on Provider 8 provides comprehensive image generation capabilities
Groq Performance: Provider 6's Groq integration delivers optimized inference speeds for latency-sensitive applications
Multi-Provider Redundancy: New providers offer alternative access paths for models affected by quota exhaustion on other providers
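
As a quick cross-check of the totals quoted above, the per-section counts from the Provider 6 and Provider 8 listings can be tallied. The dictionary keys are just labels chosen for this example.

```python
# Model counts as stated in the Provider 6 and Provider 8 sections.
provider_6 = {"free": 8, "basic_and_above": 3}
provider_8_chat = {"free": 20, "basic_and_above": 5, "pro_and_above": 2}
provider_8_image = {"all_tiers": 9}

p6_total = sum(provider_6.values())
p8_total = sum(provider_8_chat.values()) + sum(provider_8_image.values())

# Models reachable on the Free tier across both new providers.
free_total = provider_6["free"] + provider_8_chat["free"] + provider_8_image["all_tiers"]

print(p6_total, p8_total, p6_total + p8_total)  # 11 36 47
print(free_total)  # 37
```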

Important Notes

GPT Model Restoration: Our team is actively seeking new provider partnerships to restore GPT model availability. Updates will be communicated as soon as new sources are secured.
Gemini Experimental Status: All Gemini models should be considered experimental due to Google's strict API quota restrictions. Please plan accordingly with fallback options.
Claude Provider 8 Notice: Claude Sonnet 4.5 on Provider 8 is from an experimental provider. For maximum stability, we recommend using Provider 7 for Claude models.
Free Tier Compensation: Provider 6 and Provider 8 offer extensive Free tier access to compensate for quota-related disruptions. We encourage users to explore these new options.
Roo Code Users: If you experience issues with the new native function calling format, please downgrade to Roo Code version 3.16.x. A compatibility fix is in development.
Flux 2 Pricing: Flux 2 models on Provider 2 use per-image pricing ranging from $0.025 (flux-2-dev) to $0.06 (flux-2-max), depending on the model.