Contact UsSiteMap
Home Puzzle game Online tutoring Life information Financial community Real Estate Agents More

Announcing the Multilingual Conversational Speech Language Model (MLC-SLM) Challenge

2025-03-20 IDOPRESS

MONROVIA,Calif.,March 19,2025 -- Nexdata,a leading global provider of AI data services today announces the start of The Multilingual Conversational Speech LLM (MLC-SLM) Challenge,an officially approved satellite event of Interspeech 2025.

This challenge,hosted by Meta,Google,Samsung,Naver,China Mobile,Northwestern Polytechnical University and Nexdata,aims to advance multilingual conversational speech AI by providing a real-world dataset and encouraging innovation in speech language models.

The challenge consists of two tasks,both of which require participants to explore the development of speech language models (SLMs):

Task I: Multilingual Conversational Speech Recognition

Objective: Develop a multilingual LLM-based ASR model. Participants will be provided with oracle segmentation and speaker labels for each conversation.

Task II: Multilingual Conversational Speech Diarization and Recognition

Objective: Develop a system for both speaker diarization (identifying who is speaking when),and recognition (transcribing speech to text). No prior or oracle information will be provided during evaluation (e.g.,no pre-segmented utterances or speaker labels). Both pipeline-based and end-to-end systems are encouraged,providing flexibility in system design and implementation.

The training set (Train) comprises approximately 11 languages: English (en),French (fr),German (de),Italian (it),Portuguese (pt),Spanish (es),Japanese (jp),Korean (ko),Russian (ru),Thai (th),Vietnamese (vi). It's designed to provide a rich resource for training and evaluating multilingual conversational speech language models (MLC-SLM),addressing the challenges of linguistic diversity,speaker variability,and contextual understanding.

Important Dates (AOT Time)

March 10,2025: Registration opens


March 15,2025: Training data release


March 20,2025: Development set and baseline system release


May 15,2025: Evaluation set release and Leaderboard open


May 30,2025: Leaderboard freeze and paper submission portal opens (CMT system)


June 15,2025: Paper submission deadline


July 1,2025: Notification of acceptance


August 18,2025: Workshop date

We have set a prize pool of $20,000 for the winners. Based on performance,the top three teams in each track will be awarded:


1st Prize: $5,000


2nd Prize: $3,000


3rd Prize: $2,000

For more details,please check out the challenge website: https://www.nexdata.ai/competition/mlc-slm

Participate here: https://docs.google.com/forms/d/e/1FAIpQLSftZCRQQWvO5NZd-bPo1VT2Xsaieu_ZYCklw6MhW6LqjWnuYQ/viewform?usp=send_form

For inquiries:mlc-slmw@nexdata.ai

Join us in shaping the future of multilingual conversational AI and be part of this groundbreaking challenge!

About Nexdata

Nexdata provides top-notch training data solutions and serves as your reliable partner. With an extensive array of off-the-shelf datasets and flexible data collection and annotation services,our mission revolves around unleashing AI's full potential and expediting the AI industry's growth.

©copyright2009-2020 Canadian Finance Weekly    Contact Us SiteMap