Provider 2 Major Model Surge, Vision Capabilities Restoration & Token Accuracy Enhancement
This major update introduces 42 new chat/completion models to Provider 2 through a new API integration, significantly expanding model diversity across Qwen, DeepSeek, Hermes, GLM, Mistral, Gemma, and other families. A critical vision bug fix restores full multimodal functionality to 11 vision-capable models. Provider 7 receives accurate token counting using official provider data, and Provider 8 receives a major context window expansion.
Provider 2
New API Integration & Model Expansion (42 models)
Major Infrastructure Enhancement: New API integration enables access to 42 new chat/completion models spanning diverse model families with function calling, reasoning, and vision capabilities.
Qwen Models (13 models)
Free Tier:
provider-2/qwen3-32b- 40K context, function calling, reasoningprovider-2/qwen3-14b- 40K context, function calling, reasoningprovider-2/qwen3-30b-a3b- 40K context, function calling, reasoningprovider-2/qwen2.5-72b-instruct- 32K context, function callingprovider-2/qwen2.5-vl-32b-instruct- 16K context, vision
Basic Tier:
provider-2/qwen3-235b-a22b-instruct-2507- 262K context, function callingprovider-2/qwen3-235b-a22b-thinking-2507- 262K context, function calling, reasoningprovider-2/qwen3-30b-a3b-instruct-2507- 262K context, function callingprovider-2/qwen3-next-80b-a3b-instruct- 262K context, function callingprovider-2/qwen2.5-vl-72b-instruct- 32K context, vision
Pro Tier:
provider-2/qwen3-coder-480b-a35b-instruct- 262K context, function callingprovider-2/qwen3-vl-235b-a22b-instruct- 262K context, function calling, visionprovider-2/qwen3-vl-235b-a22b-thinking- 262K context, function calling, vision, reasoning
DeepSeek Models (6 models)
Free Tier:
provider-2/deepseek-r1-0528- 163K context, function calling, reasoningprovider-2/deepseek-r1- 163K context, reasoningprovider-2/deepseek-v3- 163K contextprovider-2/deepseek-v3.1- 163K context, function calling, reasoningprovider-2/deepseek-r1-distill-llama-70b- 131K context, function calling, reasoning
Basic Tier:
provider-2/deepseek-v3.2-tee- 163K context, function calling, reasoning
NousResearch Hermes Models (3 models)
Free Tier:
provider-2/hermes-4-70b- 131K context, function calling, reasoningprovider-2/hermes-4-14b- 40K context, function calling, reasoning
Pro Tier:
provider-2/hermes-4-405b- 131K context, function calling, reasoning
Zhipu AI GLM Models (5 models)
Free Tier:
provider-2/glm-4.5-air- 131K context, function calling, reasoning
Basic Tier:
provider-2/glm-4.5- 131K context, function calling, reasoningprovider-2/glm-4.6-tee- 202K context, function calling, reasoningprovider-2/glm-4.6v- 131K context, function calling, vision, reasoningprovider-2/glm-4.7-tee- 202K context, function calling, reasoning
OpenAI OSS Models (2 models)
Free Tier:
provider-2/gpt-oss-120b- 131K context, function calling, reasoningprovider-2/gpt-oss-20b- 131K context, function calling, reasoning
Moonshot AI Kimi Models (2 models)
Basic Tier:
provider-2/kimi-k2-instruct-0905- 262K context, function callingprovider-2/kimi-k2-thinking-tee- 262K context, function calling, reasoning
Mistral Models (3 models)
Free Tier:
provider-2/mistral-small-3.1-24b-instruct- 131K context, function calling, visionprovider-2/mistral-small-3.2-24b-instruct- 131K context, function calling, vision
Basic Tier:
provider-2/devstral-2-123b-instruct- 262K context, function calling
Google Gemma Models (3 models)
Free Tier:
provider-2/gemma-3-27b-it- 96K context, function calling, visionprovider-2/gemma-3-12b-it- 131K context, visionprovider-2/gemma-3-4b-it- 96K context, vision
Other Models (5 models)
Free Tier:
provider-2/tongyi-deepresearch-30b-a3b- 131K context, function calling, reasoningprovider-2/tng-r1t-chimera- 163K context, reasoningprovider-2/tng-r1t2-chimera- 163K context, function calling, reasoning
Basic Tier:
provider-2/internvl3-78b- 32K context, visionprovider-2/minimax-m2- 196K context, function calling, reasoning
Vision Bug Fix (1 fix)
Critical Fix: Resolved a bug where multimodal/vision content was being stripped from API requests, rendering vision-capable models non-functional for image analysis.
11 Vision Models Restored:
provider-2/qwen2.5-vl-72b-instruct,provider-2/qwen2.5-vl-32b-instructprovider-2/qwen3-vl-235b-a22b-instruct,provider-2/qwen3-vl-235b-a22b-thinkingprovider-2/glm-4.6vprovider-2/mistral-small-3.1-24b-instruct,provider-2/mistral-small-3.2-24b-instructprovider-2/gemma-3-27b-it,provider-2/gemma-3-12b-it,provider-2/gemma-3-4b-itprovider-2/internvl3-78b
Fix Details:
- Multimodal content arrays (images, audio) now properly preserved
- Base64 and URL image formats fully supported
- Backward compatible with text-only requests
Provider 7
Token Counting Enhancement
Accuracy Improvement: Token counting now uses official provider data instead of estimation, with automatic fallback when needed.
- Primary Source: Official token counts from provider API responses
- Fallback Mechanism: Automatic fallback to estimation when provider data unavailable
- Scope: Provider 7 only - no impact on other providers
- Benefit: More accurate billing and usage tracking
Provider 8
Context Window Expansion
Performance Enhancement: Major context window expansion for improved long-document processing capabilities.
- Context Window: Expanded from 40K to 262K tokens (~6x increase)
- Tier Allocation: 131K (Free) / 183K (Basic) / 262K (Pro/Ultra)
- Standard Rule Applied: 50%/70%/100%/100% distribution for free/basic/pro/ultra tiers
Feature Distribution Summary
New Model Capabilities Breakdown:
- Function Calling: 35 models with tool/function calling support
- Reasoning: 26 models with enhanced reasoning capabilities
- Vision: 11 models with image analysis support
Platform Impact
- Model Diversity: 42 new models across 9 model families provide unprecedented choice for specialized tasks
- Vision Restoration: 11 vision-capable models now fully functional for image analysis workflows
- Token Accuracy: Provider 7 users benefit from precise token counting for accurate usage tracking
- Context Expansion: Provider 8 context window expansion enables processing of larger documents
- Free Tier Value: 24 of the 42 new models available on Free tier, expanding accessible AI capabilities
Important Notes
- Context Window Allocation: All new models follow the standard 50%/70%/100%/100% distribution rule for free/basic/pro/ultra tiers.
- Vision Models: The vision bug fix applies only to Provider 2 models. Other providers' vision capabilities remain unchanged.
- Token Counting: The token counting enhancement applies exclusively to Provider 7. Other providers continue using their existing token counting methods.
- Model Availability: New models are available immediately. Check the Models page for complete specifications and tier availability.