Microsoft Goes Beyond LLMs With New Voice, Image Models

AI Business | by Scarlett Evans | April 2, 2026

The new AI models signal a stronger push toward Microsoft-developed AI systems.

Microsoft on Thursday unveiled three new AI models, marking an expansion beyond typical large language models to multimodal, in-house capabilities.

The models were introduced under the Microsoft AI (MAI) division.

The release includes MAI-Transcribe-1, a new speech-to-text system, as well as MAI-Voice-1, a voice generation model, and MAI-Image-2, an image model. All three are the first models of their kind for Microsoft and are available on Microsoft Foundry and the MAI Playground.

MAI-Transcribe-1 is Microsoft’s first dedicated transcription model, designed to convert audio into text across 25 languages. Potential applications include video captioning, meeting transcription and voice-enabled agents.

According to Microsoft, the model can operate at speeds up to 2.5 times faster than its existing Azure Fast transcription model.

MAI-Voice-1, meanwhile, is designed for high-quality speech generation.

The model can generate up to a minute of audio in a single second, with an emphasis on natural, emotional tone and speaker personality.


The third release, MAI-Image-2, represents the second generation of Microsoft’s in-house image model. The company says it offers at least twice the generation speed of its predecessor while providing more realistic details, such as skin tone, lighting and textures.

The model is aimed at the creative industries and is already being rolled out across Microsoft products, with integrations planned for the Bing search engine and PowerPoint.

Early customers include marketing and communications firm WPP, Microsoft said.

“MAI-Image-2 is a genuine game-changer,” Rob Reilly, global chief creative officer at WPP, said in an MAI blog post on the launch. “It’s a platform that not only responds to the intricate nuance of creative direction, but deeply respects the sheer craft involved in generating real-world, campaign-ready images.”

In the post, Microsoft said the updates come as it pursues a more "humanist" AI.

“We have a distinct view when creating our AI models -- putting humans at the center, optimizing for how people actually communicate, training for practical use,” the company said.

The launches also reflect a broader strategic shift as Microsoft looks to diversify its AI portfolio and reduce reliance on external partners such as OpenAI. It is also aiming to strengthen its competitive standing against rivals such as Google and Amazon, both of which have been investing heavily in proprietary AI stacks.

About the Author

Contributing Writer

Scarlett Evans is a freelance writer with a focus on emerging technologies and the minerals industry. Previously, she served as assistant editor at IoT World Today, where she specialized in robotics and smart city technologies. Scarlett also has a background in the mining and resources sector, with experience at Mine Australia, Mine Technology and Power Technology. She joined Informa in April 2022 before transitioning to freelance work.
