Microsoft releases foundational AI models targeting enterprises
Microsoft wants to offer the 'most complete AI and app agent factory'. Read more: Microsoft releases foundational AI models targeting enterprises
Microsoft wants to offer the ‘most complete AI and app agent factory’.
Microsoft has released three new AI foundational models, created in-house, in a move that places the company in direct competition with enterprise AI rivals, despite its deep ties with OpenAI.
The new foundational models target three of the most commercially viable modalities: transcription, voice and images. The models are already powering Microsoft’s products, including Copilot, Bing and Azure Speech, the company said, and will be available in a preview via the Microsoft Foundry and MAI Playground.
With this, Microsoft is furthering its goals of delivering “the most complete AI and app agent factory”, it said.
‘MAI-Transcribe-1’ is a first-generation speech recognition model expected to deliver “enterprise-grade accuracy” across 25 languages at around 50pc lower GPU costs than its alternatives. The model scores lower than 4pc average ‘word error rate’ on accuracy benchmarks, while GPT-Transcribe is at 4.2pc and Gemini 3.1 Flash is at 4.9pc.
‘MAI-Voice-1’ is a speech generation model that, according to Microsoft, can produce 60 seconds of expressive audio in under one second on a single GPU.
Together, the two models are meant to deliver an audio AI stack capable of assisting in call-centre workflows and other voice-driven services, such as providing live captioning, automatic subtitling and converting interactions into structured data for research.
Microsoft’s second-generation image model, ‘MAI-Image-2’, is expected to offer artists a way to “explore” different visual directions. The model is created in “close collaboration” with artists, the company said, and is meant to help enterprises create branding and communication material.
MAI-Image-2 debuted in third spot on the Arena.ai leaderboard for image model families, and is currently ranked fifth.
Microsoft, valued at $2.7trn, already offers several AI-embedded apps and platform services. Its Copilot Studio lets users build agents, while the Foundry services offer a place to train and scale models.
Meanwhile, a recently announced Copilot integration with Anthropic’s Claude Cowork is meant to target the growing demand for autonomous agents.
Microsoft backed OpenAI in its recent $122bn funding round alongside the likes of Amazon, Nvidia and SoftBank. Late last year, the company announced a $10bn investment plan for a data centre in Portugal. It also announced a $37.5bn quarterly capital expenditure bill at the end of January.
Don’t miss out on the knowledge you need to succeed. Sign up for the Daily Brief, Silicon Republic’s digest of need-to-know sci-tech news.
Silicon Republic
https://www.siliconrepublic.com/machines/microsoft-releases-foundational-ai-models-mai-transcribe-mai-voice-mai-imageSign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
modelreleaseagent
The Axios supply chain attack used individually targeted social engineering
The Axios team have published a full postmortem on the supply chain attack which resulted in a malware dependency going out in a release the other day , and it involved a sophisticated social engineering campaign targeting one of their maintainers directly. Here's Jason Saayman'a description of how that worked : so the attack vector mimics what google has documented here: https://cloud.google.com/blog/topics/threat-intelligence/unc1069-targets-cryptocurrency-ai-social-engineering they tailored this process specifically to me by doing the following: they reached out masquerading as the founder of a company they had cloned the companys founders likeness as well as the company itself. they then invited me to a real slack workspace. this workspace was branded to the companies ci and named in a
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Models

Can JavaScript Escape a CSP Meta Tag Inside an Iframe?
Research: Can JavaScript Escape a CSP Meta Tag Inside an Iframe? In trying to build my own version of Claude Artifacts I got curious about options for applying CSP headers to content in sandboxed iframes without using a separate domain to host the files. Turns out you can inject tags at the top of the iframe content and they'll be obeyed even if subsequent untrusted JavaScript tries to manipulate them. Tags: iframes , security , javascript , content-security-policy , sandboxing



Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!