What's New
Changelog
Stay up to date with the latest features, improvements, and fixes
8 Releases, 35 New Features, 25 Improvements, 7 Bug Fixes
v2.1.0
Minor
January 20, 2025
DeepSeek R1 & Enhanced Analytics
Added DeepSeek R1 reasoning model and redesigned usage analytics dashboard.
New Features
- DeepSeek R1 reasoning model now available - state-of-the-art reasoning capabilities
- New usage analytics dashboard with real-time charts and breakdowns
- Model comparison tool - compare up to 4 models side by side
- Prompt template library with 50+ ready-to-use templates
- Webhook notifications for usage alerts and API events
Improvements
- Response streaming now 40% faster with optimized chunking
- Dashboard load time reduced by 60% with lazy loading
- Better error messages with actionable suggestions
- Enhanced rate limit headers with remaining quota info
Bug Fixes
- Fixed token counting for multi-byte characters
- Resolved timeout issues with long-running requests
- Fixed API key rotation not taking effect immediately
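To illustrate why multi-byte characters can skew counting, here is a minimal sketch that compares code-point and UTF-8 byte counts for a few strings. It is not COZHUB's tokenizer, and the helper name `text_sizes` is made up for this example; it only shows that any counter assuming one byte per character will drift on non-ASCII input.

```python
def text_sizes(text: str) -> dict:
    """Return the code-point count and UTF-8 byte count of a string."""
    return {
        "code_points": len(text),
        "utf8_bytes": len(text.encode("utf-8")),
    }


# ASCII, Latin-1 accent, CJK, and emoji samples (written as escapes for clarity).
samples = ["hello", "h\u00e9llo", "\u65e5\u672c\u8a9e", "\U0001F44D"]

for sample in samples:
    # ascii() keeps the output printable on any terminal encoding.
    print(ascii(sample), text_sizes(sample))
```

The ASCII sample has equal counts, while the other three need two, three, and four bytes per character respectively.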
v2.0.0
Major
January 10, 2025
COZHUB 2.0 - Major Platform Update
Complete platform redesign with new features, improved performance, and expanded model support.
New Features
- Completely redesigned dashboard with modern UI
- Added 20+ new AI models including GPT-4o, Claude 3.5, Gemini 2.0
- New playground with side-by-side model comparison
- Team collaboration features with role-based access control
- API versioning support (v1, v2) for backwards compatibility
- New SDK packages for TypeScript, Python, Go, and Ruby
Improvements
- 40% faster response times with edge-optimized routing
- Rebuilt infrastructure on global CDN for lower latency
- Enhanced security with SOC 2 Type II compliance
- Improved documentation with interactive examples
Deprecated
- Legacy v0 API endpoints (migration guide available)
- Old dashboard UI (auto-redirects to new dashboard)
v1.9.0
Minor
December 15, 2024
Gemini 2.0 Flash & Image Generation
Added Google Gemini 2.0 Flash model and expanded image generation capabilities.
New Features
- Google Gemini 2.0 Flash - fastest multimodal model
- FLUX 1.1 Pro image generation model
- Batch API for processing multiple requests efficiently
- Usage export to CSV/JSON for billing integration
Improvements
- Image generation API now supports custom aspect ratios
- Better handling of concurrent requests
- Improved model availability monitoring
Bug Fixes
- Fixed memory leak in streaming connections
- Resolved billing discrepancy for cached responses
v1.8.0
Minor
November 20, 2024
Claude 3.5 Sonnet & Function Calling
Added Anthropic Claude 3.5 Sonnet and improved function calling support.
New Features
- Anthropic Claude 3.5 Sonnet - best-in-class for coding tasks
- Enhanced function calling with parallel tool execution
- New /models endpoint for dynamic model discovery
- Request logging with 30-day retention
Improvements
- Function calling response time reduced by 25%
- Better JSON mode reliability across all models
- Enhanced streaming for tool use responses
Security
- Added API key scoping for read-only and write access
- Implemented request signing for enhanced security
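For readers unfamiliar with request signing, the sketch below shows one common scheme: an HMAC-SHA256 over the method, path, timestamp, and body. The changelog does not specify COZHUB's actual signing format, so the message layout, the function names, and the example path `/v2/chat` are all assumptions made for illustration.

```python
import hashlib
import hmac


def sign_request(secret: bytes, method: str, path: str,
                 timestamp: str, body: bytes) -> str:
    """Return a hex HMAC-SHA256 signature over the request parts.

    The newline-joined layout is an assumed format, not a documented one.
    """
    header = "\n".join([method.upper(), path, timestamp]).encode("utf-8")
    message = header + b"\n" + body
    return hmac.new(secret, message, hashlib.sha256).hexdigest()


def verify_request(secret: bytes, method: str, path: str,
                   timestamp: str, body: bytes, signature: str) -> bool:
    """Recompute the signature and compare it in constant time."""
    expected = sign_request(secret, method, path, timestamp, body)
    return hmac.compare_digest(expected, signature)


# Hypothetical values, for demonstration only.
secret = b"example-secret"
signature = sign_request(secret, "POST", "/v2/chat", "1700000000", b'{"q": 1}')
print(verify_request(secret, "POST", "/v2/chat", "1700000000", b'{"q": 1}', signature))
```

Including the timestamp in the signed message lets a server reject replays of old requests, and `hmac.compare_digest` avoids leaking information through comparison timing.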
v1.7.0
Minor
October 25, 2024
Embeddings API & Vector Search
Launched embeddings API with support for multiple embedding models.
New Features
- Embeddings API with OpenAI text-embedding-3 models
- Cohere embed-v3 multilingual embeddings
- Jina embeddings for long document support
- Batch embedding endpoint for bulk processing
Improvements
- Reduced embedding latency by 30%
- Added dimension reduction option for embeddings
- Better documentation for RAG use cases
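The changelog does not say how the dimension reduction option works. One common approach, shown here purely as an illustrative sketch, is to keep the leading components of a vector and rescale the result to unit length. The function name `reduce_dimensions` is hypothetical, and this technique only preserves quality for models trained to support truncation.

```python
import math


def reduce_dimensions(vector: list[float], dims: int) -> list[float]:
    """Keep the first `dims` components and L2-normalize the result."""
    if not 0 < dims <= len(vector):
        raise ValueError("dims must be between 1 and len(vector)")
    head = vector[:dims]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        # An all-zero prefix cannot be normalized; return it unchanged.
        return head
    return [x / norm for x in head]


# Toy 3-dimensional vector reduced to 2 dimensions.
print(reduce_dimensions([3.0, 4.0, 12.0], 2))
```

Renormalizing after truncation keeps cosine similarity and dot-product comparisons on the same scale across the shortened vectors.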
v1.6.0
Minor
September 30, 2024
OpenAI o1 Models & Reasoning
Added OpenAI o1-preview and o1-mini reasoning models.
New Features
- OpenAI o1-preview - advanced reasoning model
- OpenAI o1-mini - cost-effective reasoning
- New reasoning token tracking in usage stats
- Extended context support up to 128K tokens
Improvements
- Better handling of long-running reasoning requests
- Improved timeout handling for complex queries
Bug Fixes
- Fixed context truncation for very long conversations
- Resolved rate limit calculation for streaming requests
v1.5.0
Minor
August 15, 2024
Vision API & Multi-modal Support
Added vision capabilities for image understanding across multiple models.
New Features
- Vision API support for GPT-4V and Claude 3 Vision
- Image URL and base64 input support
- Multi-image conversations
- Image token estimation for cost prediction
Improvements
- Unified vision API across all providers
- Better image compression for faster uploads
- Enhanced error messages for unsupported formats
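As a rough illustration of base64 image input, the sketch below encodes raw image bytes into a data URL using only the Python standard library. Whether the Vision API expects a data URL or a bare base64 string is not stated in this changelog, so the output format and the helper name `to_data_url` are assumptions.

```python
import base64


def to_data_url(image_bytes: bytes, mime_type: str = "image/png") -> str:
    """Encode raw image bytes as a base64 data URL."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"


# Placeholder bytes stand in for real image data read from a file.
print(to_data_url(b"hi", "image/png"))
```

Base64 inflates the payload by roughly a third, so passing an image URL is usually the lighter option when the image is already hosted.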
v1.4.0
Minor
July 10, 2024
Llama 3.1 & Open Source Models
Expanded support for open source models including Llama 3.1 family.
New Features
- Meta Llama 3.1 8B, 70B, and 405B models
- Mistral Large and Mistral Nemo
- Qwen 2 series from Alibaba
- New model filtering by license type
Improvements
- Open source models now offer 99.9% availability
- Reduced pricing for open source model inference
- Better system prompt handling for Llama models