v20.1.0

Provider 5 (Nebius) Revocation & New Provider Infrastructure, Provider 2 Image Generation Restored

Sreejan
SreejanAuthor

This major infrastructure update brings significant changes to Provider 5 with the complete revocation of the Nebius provider and introduction of a new high-performance infrastructure. Provider 2 image generation capabilities have been fully restored after recent issues. Additionally, new video generation models are expected to be added soon.

⚠️ Provider 5 (Nebius) - Complete Revocation

41

Platform Discontinuation: Provider 5 powered by Nebius has been completely revoked from the platform. All 41 models have been discontinued effective immediately.

Chat/Completion Models Removed - Free Tier (22 models)

Meta Llama Family:

NVIDIA Models:

Google Gemma:

Qwen Series:

DeepSeek Reasoning:

Zhipu AI:

OpenAI Open-Source:

Moonshot AI:

NousResearch:

PrimeIntellect:


Chat/Completion Models Removed - Basic Tier (11 models)


Chat/Completion Models Removed - Pro Tier (2 models)


Embedding Models Removed (4 models)


Image Generation Models Removed (2 models)

Provider 5 - New Infrastructure Launch

13

New High-Performance Provider: A new Provider 5 infrastructure has been integrated with 13 premium models spanning Google Gemini, Anthropic Claude, and OpenAI families.

Google Gemini Models (9 models)

Free Tier:

  • provider-5/gemini-2.5-flash-lite - Free, Basic, Pro, Ultra

    • Lightweight and cost-effective variant optimized for high-volume tasks
    • Context Window: 524K (Free) / 786K (Basic) / 1M (Pro/Ultra)
    • Output Tokens: 65,536
    • Features: vision, function_calling
  • provider-5/gpt-oss-120b-medium - Free, Basic, Pro, Ultra

    • OpenAI's balanced open-weight reasoning model optimized for medium-complexity tasks
    • Context Window: 64K (Free) / 90K (Basic) / 128K (Pro/Ultra)
    • Output Tokens: 32,768
    • Features: reasoning, vision, function_calling

Basic Tier:

  • provider-5/gemini-2.5-flash-thinking - Basic, Pro, Ultra

    • Enhanced reasoning variant of Gemini 2.5 Flash with chain-of-thought capabilities
    • Context Window: 256K (Basic) / 512K (Pro) / 1M (Ultra)
    • Output Tokens: 65,536
    • Features: reasoning, vision, function_calling
  • provider-5/gemini-2.5-flash - Basic, Pro, Ultra

    • Google's best model for balancing reasoning and speed with 1M token context
    • Context Window: 256K (Basic) / 512K (Pro) / 1M (Ultra)
    • Output Tokens: 65,536
    • Features: reasoning, vision, function_calling, audio

Pro Tier:

  • provider-5/gemini-2.5-pro - Pro, Ultra
    • Google's most capable model with advanced reasoning and 1M token context
    • Context Window: 1M (Pro/Ultra)
    • Output Tokens: 65,536
    • Features: reasoning, vision, function_calling, audio

Ultra Tier Exclusive:

  • provider-5/gemini-3-pro-low - Ultra Only

    • Gemini 3 Pro Low - Google's efficient variant for faster inference
    • Context Window: 1M (Ultra)
    • Output Tokens: 65,536
    • Features: reasoning, vision, function_calling
  • provider-5/gemini-3-pro-high - Ultra Only

    • Gemini 3 Pro High - Google's premium variant with maximum quality
    • Context Window: 1M (Ultra)
    • Output Tokens: 65,536
    • Features: reasoning, vision, function_calling
  • provider-5/gemini-3-pro-image - Ultra Only

    • Gemini 3 Pro Image - Vision-enabled model for image understanding and analysis
    • Context Window: 1M (Ultra)
    • Output Tokens: 65,536
    • Features: vision
  • provider-5/gemini-3-flash - Ultra Only

    • Google's fast and efficient multimodal model
    • Context Window: 1M (Ultra)
    • Output Tokens: 65,536
    • Features: reasoning, vision, function_calling, audio

Anthropic Claude Models (4 models)

Ultra Tier Exclusive:

  • provider-5/claude-sonnet-4-5 - Ultra Only

    • Claude Sonnet 4.5 - Advanced reasoning model with excellent performance across diverse tasks
    • Context Window: 200K (Ultra)
    • Output Tokens: 64,000
    • Features: vision, function_calling
  • provider-5/claude-sonnet-4-5-thinking - Ultra Only

    • Claude Sonnet 4.5 Thinking - Extended reasoning variant with enhanced chain-of-thought
    • Context Window: 200K (Ultra)
    • Output Tokens: 64,000
    • Features: reasoning, vision, function_calling
  • provider-5/claude-opus-4-5 - Ultra Only

    • Claude Opus 4.5 - Anthropic's most powerful model with exceptional reasoning and creative capabilities
    • Context Window: 200K (Ultra)
    • Output Tokens: 64,000
    • Features: reasoning, vision, function_calling
  • provider-5/claude-opus-4-5-thinking - Ultra Only

    • Claude Opus 4.5 Thinking - Anthropic's most powerful reasoning model with extended chain-of-thought
    • Context Window: 200K (Ultra)
    • Output Tokens: 64,000
    • Features: reasoning, vision, function_calling

Provider 2 - Image Generation Restored

1

Service Restoration: All Provider 2 image generation models have been fully restored and are now operational after recent infrastructure issues.

  • Full Functionality: All image generation endpoints are now working correctly
  • Model Availability: All previously available image generation models are accessible
  • Stability: Infrastructure issues have been resolved ensuring consistent performance
  • No Action Required: Users can resume image generation workflows immediately

This fix addresses the issues that were affecting Provider 2 image generation capabilities, ensuring reliable access for all users.

🔮 Coming Soon: Video Generation Models

1

Upcoming Feature: New video generation models are expected to be added to Provider 2 very soon.

  • Timeline: Expected deployment within the next day
  • Provider: Video generation will be available through Provider 2
  • Details: Full model specifications will be announced upon release

Stay tuned for the official announcement with complete details on the new video generation capabilities.

New Model Capabilities Summary

3

Provider 5 New Infrastructure Breakdown:

  • Google Gemini Models: 9 models with vision, function calling, reasoning, and audio capabilities
  • Anthropic Claude Models: 4 models with advanced reasoning and vision support
  • Free Tier Access: 2 models available on Free tier (Gemini 2.5 Flash Lite, GPT OSS 120B Medium)
  • Ultra Tier Premium: 8 exclusive models including Gemini 3 series and Claude Opus 4.5

Feature Distribution:

  • Vision: 13 models with image analysis support
  • Function Calling: 12 models with tool/function calling support
  • Reasoning: 10 models with enhanced reasoning capabilities
  • Audio: 3 models with audio processing capabilities

Platform Impact

5
  • Provider Transition: Nebius infrastructure fully replaced with new high-performance provider
  • Premium Model Access: Ultra tier users gain access to cutting-edge Gemini 3 and Claude Opus 4.5 models
  • Image Generation Reliability: Provider 2 image generation fully restored for uninterrupted creative workflows
  • Free Tier Value: 2 high-quality models remain accessible on Free tier despite provider transition
  • Video Generation Preview: New video generation capabilities coming soon to expand platform offerings

Important Notes

5
  • Provider 5 Migration: All users of the previous Provider 5 (Nebius) models should migrate to alternative providers. Model IDs have changed completely.
  • Embedding Models: No embedding models are available in the new Provider 5. Use Provider 2 or other providers for embedding needs.
  • Image Generation: No image generation models in new Provider 5. Use Provider 2 for image generation (now fully restored).
  • Ultra Tier Value: 8 of the 13 new models are Ultra-exclusive, providing significant value for Ultra subscribers.
  • Video Generation: Watch for the upcoming announcement on new video generation capabilities expected tomorrow.