Microsoft releases foundational AI models targeting enterprises

Silicon Republicby Suhasini SrinivasaragavanApril 3, 20261 min read1 views

Microsoft wants to offer the 'most complete AI and app agent factory'. Read more: Microsoft releases foundational AI models targeting enterprises

Microsoft wants to offer the ‘most complete AI and app agent factory’.

Microsoft has released three new AI foundational models, created in-house, in a move that places the company in direct competition with enterprise AI rivals, despite its deep ties with OpenAI.

The new foundational models target three of the most commercially viable modalities: transcription, voice and images. The models are already powering Microsoft’s products, including Copilot, Bing and Azure Speech, the company said, and will be available in a preview via the Microsoft Foundry and MAI Playground.

With this, Microsoft is furthering its goals of delivering “the most complete AI and app agent factory”, it said.

‘MAI-Transcribe-1’ is a first-generation speech recognition model expected to deliver “enterprise-grade accuracy” across 25 languages at around 50pc lower GPU costs than its alternatives. The model scores lower than 4pc average ‘word error rate’ on accuracy benchmarks, while GPT-Transcribe is at 4.2pc and Gemini 3.1 Flash is at 4.9pc.

‘MAI-Voice-1’ is a speech generation model that, according to Microsoft, can produce 60 seconds of expressive audio in under one second on a single GPU.

Together, the two models are meant to deliver an audio AI stack capable of assisting in call-centre workflows and other voice-driven services, such as providing live captioning, automatic subtitling and converting interactions into structured data for research.

Microsoft’s second-generation image model, ‘MAI-Image-2’, is expected to offer artists a way to “explore” different visual directions. The model is created in “close collaboration” with artists, the company said, and is meant to help enterprises create branding and communication material.

MAI-Image-2 debuted in third spot on the Arena.ai leaderboard for image model families, and is currently ranked fifth.

Microsoft, valued at $2.7trn, already offers several AI-embedded apps and platform services. Its Copilot Studio lets users build agents, while the Foundry services offer a place to train and scale models.

Meanwhile, a recently announced Copilot integration with Anthropic’s Claude Cowork is meant to target the growing demand for autonomous agents.

Microsoft backed OpenAI in its recent $122bn funding round alongside the likes of Amazon, Nvidia and SoftBank. Late last year, the company announced a $10bn investment plan for a data centre in Portugal. It also announced a $37.5bn quarterly capital expenditure bill at the end of January.

Don’t miss out on the knowledge you need to succeed. Sign up for the Daily Brief, Silicon Republic’s digest of need-to-know sci-tech news.

Original source

Silicon Republic

https://www.siliconrepublic.com/machines/microsoft-releases-foundational-ai-models-mai-transcribe-mai-voice-mai-image

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modelreleaseagent

ReleasesFresh

The Axios supply chain attack used individually targeted social engineering

The Axios team have published a full postmortem on the supply chain attack which resulted in a malware dependency going out in a release the other day , and it involved a sophisticated social engineering campaign targeting one of their maintainers directly. Here's Jason Saayman'a description of how that worked : so the attack vector mimics what google has documented here: https://cloud.google.com/blog/topics/threat-intelligence/unc1069-targets-cryptocurrency-ai-social-engineering they tailored this process specifically to me by doing the following: they reached out masquerading as the founder of a company they had cloned the companys founders likeness as well as the company itself. they then invited me to a real slack workspace. this workspace was branded to the companies ci and named in a

Simon Willison Blog

2mabout 5 hours ago

Models

Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models - WSJ

Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models WSJ

Google News: LLM

1m3 days ago

Models

Anthropic Races to Contain Leak of Code Behind Claude AI Agent - WSJ

Anthropic Races to Contain Leak of Code Behind Claude AI Agent WSJ

Google News: Claude

1m2 days ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 182 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Models

ModelsLive

Automated Security Assertion Generation Using LLMs (U. of Florida) - Semiconductor Engineering

Automated Security Assertion Generation Using LLMs (U. of Florida) Semiconductor Engineering

Google News: LLM

1m36 minutes ago

Models

Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models - WSJ

Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models WSJ

Google News: LLM

1m3 days ago

ModelsFresh

Can JavaScript Escape a CSP Meta Tag Inside an Iframe?

Research: Can JavaScript Escape a CSP Meta Tag Inside an Iframe? In trying to build my own version of Claude Artifacts I got curious about options for applying CSP headers to content in sandboxed iframes without using a separate domain to host the files. Turns out you can inject tags at the top of the iframe content and they'll be obeyed even if subsequent untrusted JavaScript tries to manipulate them. Tags: iframes , security , javascript , content-security-policy , sandboxing

Simon Willison Blog

1mabout 3 hours ago

Models

Exclusive | The Sudden Fall of OpenAI’s Most Hyped Product Since ChatGPT - WSJ

Exclusive | The Sudden Fall of OpenAI’s Most Hyped Product Since ChatGPT WSJ

Google News: ChatGPT

1m5 days ago