Cohere's open Command A+ makes sovereign AI more practical

What happened

Cohere released Command A+, a 218 billion parameter mixture-of-experts model built for enterprise agentic workloads, multimodal document processing, RAG and private deployment. The model is available under an Apache 2.0 license, with weights on Hugging Face and support for vLLM and Transformers. Cohere says the model has 25 billion active parameters per generation step, a 128K input context window, tool use, image input and support for 48 languages.

The important part is not only model quality. Cohere is pushing Command A+ as a practical sovereign AI building block. The model is offered in BF16, FP8 and W4A4 quantized versions, with the smallest deployment target listed as one Blackwell B200 or two H100 GPUs. Cohere also says the 4-bit version keeps quality nearly intact while improving speed and latency compared with earlier Command models.

Command A+ also includes native citation behavior for retrieval and tool use. When the model uses external sources, it can produce grounding spans that connect statements back to source documents or database rows. That matters for enterprise teams that need AI outputs to be checkable, not just fluent.

Why it matters

Enterprise AI is moving away from the question of which chatbot is smartest and toward a more operational question: where does the runtime live, who controls it and can the output be audited? Command A+ lands directly in that debate. A permissively licensed model that can run in a customer controlled environment gives buyers another path between black-box API usage and expensive internal model research.

For European and regulated organisations, the licensing and deployment model may matter as much as the benchmark table. Running a model closer to internal documents, systems and logs can reduce data movement, simplify governance and create clearer accountability. It also gives IT teams more leverage over cost, because inference can be managed as infrastructure instead of an open-ended token bill spread across disconnected SaaS tools.

The citation features are equally relevant. Many RAG projects fail not because retrieval is impossible, but because users cannot see why an answer should be trusted. Native grounding does not remove the need for evaluation, permissions and workflow design, but it gives teams a better primitive for document-heavy operations such as compliance checks, customer service, legal review and internal knowledge work.

Laava perspective

This is the kind of model release that makes sovereign AI more concrete. It is not a reason to buy a loose server and hope value appears. The value comes when a capable model is placed inside a managed runtime, connected to the right documents, guarded by permissions, monitored for quality and integrated into real workflows. That is where Laava sees the market moving.

Laava's Sovereign Runtime fits this pattern as a deployment form inside Laava Agents and Custom Solutions. The customer is not buying hardware as the product. The customer is buying managed runtime, agents, integrations, logging, monitoring and ongoing improvement. A model like Command A+ can become one option inside that model-agnostic layer, selected when sovereignty, auditability, multilingual document work or predictable inference cost are important.

The bigger lesson is architectural. A strong open model is useful, but it is only one layer. Production AI still needs context, reasoning and action. It needs access to SharePoint, ERP, CRM, mailboxes and process rules. It needs human approval where actions are risky. It needs audit trails when a user asks why an answer was generated. Without those pieces, even a very capable model remains another isolated tool.

What you can do

If you are evaluating AI for sensitive documents or workflow automation, use this release as a prompt to review your deployment assumptions. Which workloads can stay on hosted APIs, and which ones need more control over data, logs, cost and model choice? Which outputs need citations before they can be used in a process?

Start with one operational bottleneck rather than a platform rebuild. A proof of pilot around document Q&A, dossier review, customer question triage or workflow preparation can show whether a sovereign runtime is worth the extra control. The model is not the strategy. The managed operating layer around it is where enterprise value becomes repeatable.

Cohere's open Command A+ makes sovereign AI more practical

What happened

Why it matters

Laava perspective

What you can do

Determine where this affects you first for real

From news to a concrete first route