What's New

Changelog

Stay up to date with the latest features, improvements, and fixes

Releases

New Features

Improvements

Bug Fixes

Stay Updated

Get notified when we release new features

RSS Feed

v2.1.0

Minor

January 20, 2025

DeepSeek R1 & Enhanced Analytics

Added DeepSeek R1 reasoning model and redesigned usage analytics dashboard.

New Features

•DeepSeek R1 reasoning model now available - state-of-the-art reasoning capabilities
•New usage analytics dashboard with real-time charts and breakdowns
•Model comparison tool - compare up to 4 models side by side
•Prompt template library with 50+ ready-to-use templates
•Webhook notifications for usage alerts and API events

Improvements

•Response streaming now 40% faster with optimized chunking
•Dashboard load time reduced by 60% with lazy loading
•Better error messages with actionable suggestions
•Enhanced rate limit headers with remaining quota info

Bug Fixes

•Fixed token counting for multi-byte characters
•Resolved timeout issues with long-running requests
•Fixed API key rotation not taking effect immediately

v2.0.0

Major

January 10, 2025

COZHUB 2.0 - Major Platform Update

Complete platform redesign with new features, improved performance, and expanded model support.

New Features

•Completely redesigned dashboard with modern UI
•Added 20+ new AI models including GPT-4o, Claude 3.5, Gemini 2.0
•New playground with side-by-side model comparison
•Team collaboration features with role-based access control
•API versioning support (v1, v2) for backwards compatibility
•New SDK packages for TypeScript, Python, Go, and Ruby

Improvements

•40% faster response times with edge-optimized routing
•Rebuilt infrastructure on global CDN for lower latency
•Enhanced security with SOC 2 Type II compliance
•Improved documentation with interactive examples

Deprecated

•Legacy v0 API endpoints (migration guide available)
•Old dashboard UI (auto-redirects to new dashboard)

v1.9.0

Minor

December 15, 2024

Gemini 2.0 Flash & Image Generation

Added Google Gemini 2.0 Flash model and expanded image generation capabilities.

New Features

•Google Gemini 2.0 Flash - fastest multimodal model
•FLUX 1.1 Pro image generation model
•Batch API for processing multiple requests efficiently
•Usage export to CSV/JSON for billing integration

Improvements

•Image generation API now supports custom aspect ratios
•Better handling of concurrent requests
•Improved model availability monitoring

Bug Fixes

•Fixed memory leak in streaming connections
•Resolved billing discrepancy for cached responses

v1.8.0

Minor

November 20, 2024

Claude 3.5 Sonnet & Function Calling

Added Anthropic Claude 3.5 Sonnet and improved function calling support.

New Features

•Anthropic Claude 3.5 Sonnet - best-in-class for coding tasks
•Enhanced function calling with parallel tool execution
•New /models endpoint for dynamic model discovery
•Request logging with 30-day retention

Improvements

•Function calling response time reduced by 25%
•Better JSON mode reliability across all models
•Enhanced streaming for tool use responses

Security

•Added API key scoping for read-only and write access
•Implemented request signing for enhanced security

v1.7.0

Minor

October 25, 2024

Embeddings API & Vector Search

Launched embeddings API with support for multiple embedding models.

New Features

•Embeddings API with OpenAI text-embedding-3 models
•Cohere embed-v3 multilingual embeddings
•Jina embeddings for long document support
•Batch embedding endpoint for bulk processing

Improvements

•Reduced embedding latency by 30%
•Added dimension reduction option for embeddings
•Better documentation for RAG use cases

v1.6.0

Minor

September 30, 2024

OpenAI o1 Models & Reasoning

Added OpenAI o1-preview and o1-mini reasoning models.

New Features

•OpenAI o1-preview - advanced reasoning model
•OpenAI o1-mini - cost-effective reasoning
•New reasoning token tracking in usage stats
•Extended context support up to 128K tokens

Improvements

•Better handling of long-running reasoning requests
•Improved timeout handling for complex queries

Bug Fixes

•Fixed context truncation for very long conversations
•Resolved rate limit calculation for streaming requests

v1.5.0

Minor

August 15, 2024

Vision API & Multi-modal Support

Added vision capabilities for image understanding across multiple models.

New Features

•Vision API support for GPT-4V and Claude 3 Vision
•Image URL and base64 input support
•Multi-image conversations
•Image token estimation for cost prediction

Improvements

•Unified vision API across all providers
•Better image compression for faster uploads
•Enhanced error messages for unsupported formats

v1.4.0

Minor

July 10, 2024

Llama 3.1 & Open Source Models

Expanded support for open source models including Llama 3.1 family.

New Features

•Meta Llama 3.1 8B, 70B, and 405B models
•Mistral Large and Mistral Nemo
•Qwen 2 series from Alibaba
•New model filtering by license type

Improvements

•Open source models now with 99.9% availability
•Reduced pricing for open source model inference
•Better system prompt handling for Llama models