Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessA suspected system failure caused a number of Baidu robotaxis to stop across Wuhan, trapping passengers and reportedly causing traffic disruptions and crashes (Zeyi Yang/Wired)TechmemeManaging Secret For Your Golang Apps With The GCP Secret ManagerDEV CommunityLiquid AI Released LFM2.5-350M: A Compact 350M Parameter Model Trained on 28T Tokens with Scaled Reinforcement LearningMarkTechPostThe Role of a Team LeadDEV CommunityGrab, in partnership with WeRide, launches a robotaxi service in Singapore, becoming Southeast Asia's first ride-hailing provider to offer a driverless service (Bloomberg)TechmemeMachines are in loop, to plan, code and pair reviewDEV CommunityWhat 10 Real AI Agent Disasters Taught Me About Autonomous SystemsDEV CommunityI built Newsroulette: the anti-feed for tech newsDEV CommunityMichael Jordan, 63, credits one trait for making him great: 'It keeps me young'Business InsiderHow We Finally Solved Test DiscoveryDEV CommunityWhat 100% Test Coverage Can't MeasureDEV CommunityBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessA suspected system failure caused a number of Baidu robotaxis to stop across Wuhan, trapping passengers and reportedly causing traffic disruptions and crashes (Zeyi Yang/Wired)TechmemeManaging Secret For Your Golang Apps With The GCP Secret ManagerDEV CommunityLiquid AI Released LFM2.5-350M: A Compact 350M Parameter Model Trained on 28T Tokens with Scaled Reinforcement LearningMarkTechPostThe Role of a Team LeadDEV CommunityGrab, in partnership with WeRide, launches a robotaxi service in Singapore, becoming Southeast Asia's first ride-hailing provider to offer a driverless service (Bloomberg)TechmemeMachines are in loop, to plan, code and pair reviewDEV CommunityWhat 10 Real AI Agent Disasters Taught Me About Autonomous SystemsDEV CommunityI built Newsroulette: the anti-feed for tech newsDEV CommunityMichael Jordan, 63, credits one trait for making him great: 'It keeps me young'Business InsiderHow We Finally Solved Test DiscoveryDEV CommunityWhat 100% Test Coverage Can't MeasureDEV Community

Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models

arXivby [Submitted on 20 Mar 2026]March 30, 20261 min read1 views
Source Quiz

arXiv:2603.25750v1 Announce Type: cross Abstract: As the paradigm of AI shifts from text-based LLMs to Speech Language Models (SLMs), there is a growing demand for full-duplex systems capable of real-time, natural human-computer interaction. However, the development of such models is constrained by the scarcity of high-quality, multi-speaker conversational data, as existing large-scale resources are predominantly single-speaker or limited in volume. Addressing the complex dynamics of natural dialogue, such as overlapping and back-channeling remains a challenge, with standard processing pipelin — Kyudan Jung, Jihwan Kim, Soyoon Kim, Jeongoon Kim, Jaegul Choo, Cheonbok Park

View PDF

Abstract:As the paradigm of AI shifts from text-based LLMs to Speech Language Models (SLMs), there is a growing demand for full-duplex systems capable of real-time, natural human-computer interaction. However, the development of such models is constrained by the scarcity of high-quality, multi-speaker conversational data, as existing large-scale resources are predominantly single-speaker or limited in volume. Addressing the complex dynamics of natural dialogue, such as overlapping and back-channeling remains a challenge, with standard processing pipelines suffering from diarization errors and ASR hallucinations. To bridge this gap, we present a robust and scalable open-source data processing pipeline designed for full-duplex model.

Comments: 34 pages, 7 figures, 11 tables

Subjects:

Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)

Cite as: arXiv:2603.25750 [cs.SD]

(or arXiv:2603.25750v1 [cs.SD] for this version)

https://doi.org/10.48550/arXiv.2603.25750

arXiv-issued DOI via DataCite

Submission history

From: Kyudan Jung [view email] [v1] Fri, 20 Mar 2026 09:10:43 UTC (3,412 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Sommelier: …researchpaperarxivaiartificial-…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 208 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers