Cohere
Enterprise-first provider focused on retrieval, grounding, multilingual control, and private deployment. Command A+ is the newest open-source workhorse, while Command A Reasoning, Rerank, Embed, and Transcribe stay central to the stack.
Latest model families
- Command A+ - newest open-source enterprise workhorse
- Command A / Command A Reasoning - flagship enterprise models
- Command A Vision / Command A Translate - multimodal and translation lines
- Command R7B / Command R+ / Command R - retrieval and grounding family
- Aya Expanse / Aya Vision - multilingual and multimodal lines
- Embed / Rerank / Transcribe - retrieval and audio stack
Best for
- RAG and enterprise search
- Grounded assistants
- Private and sovereign deployments
Amazon Nova / Bedrock
A production platform rather than a single model lab. Bedrock is the control plane; Nova is Amazon's own model line for multimodal, reasoning, and speech workloads.
Latest model families
- Amazon Nova 2 Omni / Nova 2 Pro / Nova 2 Lite
- Nova 2 Sonic - speech and conversational voice
- Nova Multimodal Embeddings - semantic retrieval
- Third-party model access via Bedrock
- Agent and guardrail tooling around model use
Best for
- AWS-native production apps
- Enterprise governance and routing
- Multi-provider deployments
Microsoft Phi
Microsoft's small-model family is optimized for strong performance per parameter and edge use cases. The current work centers on compact reasoning, multimodal SLMs, and vision reasoning.
Latest model families
- Phi-4-reasoning-vision-15B - compact multimodal reasoning
- Phi-4-reasoning / Phi-4-reasoning-plus - compact reasoning
- Phi-4 - small-model flagship
- Phi-4-mini - compact variant
- Phi-4-multimodal - vision, audio, text
- Phi-3.5 - still widely used in light-footprint setups
Best for
- Small-footprint deployments
- STEM-heavy tasks
- Edge and local inference
Baidu ERNIE (China)
Baidu's China enterprise stack is still one of the clearest in the market, with a strong emphasis on search, multimodal understanding, and document-heavy workflows.
Latest model families
- ERNIE 5.0 / ERNIE 5.0 Thinking / ERNIE 5.0 Preview
- ERNIE X1.1 / X1.1 Preview
- ERNIE X1 Turbo / X1 Turbo Preview
- ERNIE 4.5 Turbo / ERNIE 4.5 Turbo VL
- PaddleOCR-VL / PP-StructureV3 - document stack
Best for
- Chinese enterprise deployments
- Search-augmented assistants
- Document and multimodal knowledge work
Zhipu AI / GLM (China)
One of China's strongest general-purpose model lines, with a clear push into long-horizon coding, multimodal reasoning, and image generation.
Latest model families
- GLM-5.1 / GLM-5.1-HighSpeed - flagship agentic coding line
- GLM-4.7 / GLM-4.7-FlashX - high-intelligence general line
- GLM-5V-Turbo / GLM-4.6V-Flash - multimodal coding and vision
- GLM-4.6 / GLM-4.5 - still used in broader deployments
- CogView-4 - image generation
Best for
- Agentic coding and long tasks
- Chinese-language workflows
- Multimodal app stacks
Moonshot / Kimi (China)
A fast-moving China-side assistant platform with a strong long-context and coding angle. Kimi is increasingly relevant for agentic workflows.
Latest model families
- Kimi K2.6 - latest flagship
- Kimi K2.5 - multimodal general model
- Kimi K2-thinking / Kimi K2-thinking-turbo - reasoning models
- moonshot-v1 vision previews - older long-context line still in circulation
Best for
- Long-context assistants
- Chinese first-party product experiences
- Agent and code-heavy workflows
MiniMax (China)
MiniMax is broader than many outsiders realize: text, speech, video, and music are all active product lines.
Latest model families
- MiniMax M2.1 / M2.1-lightning / M2 - text and agent models
- MiniMax M2-her - role-play and long-dialogue model
- MiniMax Speech 2.6 / 2.6 HD - voice agents
- MiniMax Hailuo 2.3 / 2.3 Fast - video
- MiniMax Music 2.6 / 2.5+ - music generation
Best for
- Voice and media products
- Agentic coding
- Multimodal consumer apps
Tencent Hunyuan (China)
Tencent's Hunyuan stack is moving through a major refresh. The current public signal is Hy3 preview for language work, Hunyuan-role-latest for role-play, and HY-3D-3.1 for 3D.
Latest model families
- Hy3 preview - latest flagship language / reasoning model
- Hunyuan-role-latest - role-play and conversational character model
- HY-MT2-Pro - translation-focused model
- HY-3D-3.1 / HY-3D-3.0 - 3D generation
- HunyuanVideo / HunyuanVideo-Avatar - video generation
Best for
- 3D, simulation, and interactive content
- Creative tooling
- Media-heavy AI experiences