Microsoft releases foundational AI models targeting enterprises

Silicon Republic, by Suhasini Srinivasaragavan, April 3, 2026

Microsoft wants to offer the ‘most complete AI and app agent factory’.

Microsoft has released three new AI foundational models, created in-house, in a move that places the company in direct competition with enterprise AI rivals, despite its deep ties with OpenAI.

The new foundational models target three of the most commercially viable modalities: transcription, voice and images. The models are already powering Microsoft’s products, including Copilot, Bing and Azure Speech, the company said, and will be available in a preview via the Microsoft Foundry and MAI Playground.

With this, Microsoft is furthering its goals of delivering “the most complete AI and app agent factory”, it said.

‘MAI-Transcribe-1’ is a first-generation speech recognition model expected to deliver “enterprise-grade accuracy” across 25 languages at around 50pc lower GPU cost than alternatives. The model scores an average ‘word error rate’ below 4pc on accuracy benchmarks, compared with 4.2pc for GPT-Transcribe and 4.9pc for Gemini 3.1 Flash.
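
For readers unfamiliar with the metric, word error rate is conventionally the number of word substitutions, deletions and insertions needed to turn a model's transcript into the reference transcript, divided by the number of words in the reference. The Python sketch below illustrates that standard definition only. The function name `word_error_rate` and the simple lowercase-and-split normalisation are assumptions made for illustration; the article does not describe which benchmarks or text normalisation Microsoft used.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Illustrative only: normalisation here is just lowercasing and
    whitespace splitting, which is an assumption, not Microsoft's method.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One wrong word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on a mat"))
```

On this definition, a score below 4pc means fewer than four word-level errors per 100 reference words, averaged over the benchmark set.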

‘MAI-Voice-1’ is a speech generation model that, according to Microsoft, can produce 60 seconds of expressive audio in under one second on a single GPU.

Together, the two models are meant to deliver an audio AI stack capable of assisting in call-centre workflows and other voice-driven services, such as providing live captioning, automatic subtitling and converting interactions into structured data for research.

Microsoft’s second-generation image model, ‘MAI-Image-2’, is expected to offer artists a way to “explore” different visual directions. The model was created in “close collaboration” with artists, the company said, and is meant to help enterprises create branding and communication material.

MAI-Image-2 debuted in third spot on the Arena.ai leaderboard for image model families, and is currently ranked fifth.

Microsoft, valued at $2.7trn, already offers several AI-embedded apps and platform services. Its Copilot Studio lets users build agents, while the Foundry services offer a place to train and scale models.

Meanwhile, a recently announced Copilot integration with Anthropic’s Claude Cowork is meant to target the growing demand for autonomous agents.

Microsoft backed OpenAI in its recent $122bn funding round alongside the likes of Amazon, Nvidia and SoftBank. Late last year, the company announced a $10bn investment plan for a data centre in Portugal. It also announced a $37.5bn quarterly capital expenditure bill at the end of January.
