About the SALMA Workshop

The rapid advancement of foundational large language models (LLMs) has transformed multiple domains by significantly boosting performance across a wide range of downstream tasks. In recent years, LLMs have also been increasingly applied to core speech and audio processing tasks such as automatic speech recognition (ASR) and audio captioning, as well as to newer tasks such as open-ended question answering. Despite the growing interest, adoption of LLMs for speech and audio tasks has been slower than in other domains because of several challenges: the limited availability of high-quality data, especially in non-English languages; the absence of comprehensive evaluation metrics; and the need for improved architectures and training methodologies that can address the unique complexities of speech and audio processing.

The first Workshop on Speech and Audio Language Models (SALMA), co-located with ICASSP 2025, explores how LLMs can be used to advance speech and audio processing. The workshop aims to bring together researchers specializing in speech, audio, and language models to foster in-depth discussions and identify synergies. The goal is to develop effective methodologies for leveraging LLMs to improve performance across tasks in the speech, audio, and music domains, including classification, generation, and retrieval. The workshop will also address fundamental questions such as:

Invited Speakers

Bhuvana Ramabhadran

Speech Research Manager
Google, USA

Oriol Nieto

Senior Audio Research Scientist
Adobe Research, USA

Zhuo Chen

Research Scientist Lead
ByteDance, USA

Organizers

Sreyan Ghosh

Ph.D. Candidate
University of Maryland, College Park

Soham Deshmukh

Ph.D. Candidate, Applied Scientist
Carnegie Mellon University, Microsoft

Shinji Watanabe

Associate Professor
Carnegie Mellon University

Dinesh Manocha

Distinguished Professor
University of Maryland, College Park

Bhiksha Raj

Professor
Carnegie Mellon University

Ramani Duraiswami

Professor
University of Maryland, College Park

Nima Mesgarani

Associate Professor
Columbia University

Huaming Wang

Partner Audio Manager
Microsoft, USA

Contact

For any questions, please email salmaicassp2025@gmail.com.