26th Interspeech 2025: Rotterdam, The Netherlands

Keynote 1 - Roger Moore: From Talking and Listening Devices to Intelligent Communicative Machines

Spoken Machine Translation 1

Real-time Speech Enhancement

Multilinguality, Cross-linguistic Studies, L2 Speech

Speech Emotion Recognition 1

Multimodal Resources

Interpretability in Audio and Speech Technology

Summarization

Show and Tell 1: ASR / Tools

Models of Speech Production

Speech and Grammar/Articulatory Analyses

Speaking Styles, Register and Conversational Speech

Emotional Distress in Speech

Prosody in Speech Synthesis

Depression Detection and Assessment 1

Speech Analysis, Detection and Classification 1

Speech-based Cognitive Assessment 1

Large Language Models in Speech Recognition

Speech Coding and Echo Cancellation

Decoding Algorithms

Queer and Trans Speech Science and Technology

Tone

Cross-Lingual and Multilingual Processing

Echo Cancellation, Feedback Control, and Near-end Enhancement

Pathological Speech Analysis 1

Hearing Disorders

Interspeech 2025 URGENT Challenge

Spoken Machine Translation 2

Spatial Audio and Acoustics 1

Articulatory and Vocal Tract Modelling

Acoustic Assessment of Respiratory Health

Advances in Modelling and Imaging

Conversation, Communication and Interaction 1

Robust Speaker Verification

Multilingual ASR

Multi-channel Speech Enhancement

Self-supervised Learning

Singing Voice and Audio Synthesis

Acoustic and Articulatory Cues in Speech Perception

Audio Event Detection and Classification

Inclusivity

Voice Conversion 1

Speech-based Cognitive Assessment 2

Source Separation 1

Language and Accent Identification and Speaker Privacy

Source Tracing: The Origins of Synthetic or Manipulated Speech

Speaker Diarization 1

Multilingual Speech Synthesis and Special Applications 1

Characterization and Multimodal Approaches for Speaker Recognition

Acoustic Analysis and Bioacoustics

Keynote 2 - Alexander Waibel: From Speech Science to Language Transparence

Spoken Dialogue Systems 1

Speech Assessment

Audio-Visual ASR and Multimodal System

Speech and Voice Disorders 1

Multimodal Information Based Speech Processing (MISP) 2025 Challenge

Speaker Extraction 1

Low Resource Speech Recognition

Computational Resource Constrained ASR

Speech and Language Technology for Health Applications

Responsible Speech Foundation Models + SUPERB Challenge

Dysarthric Speech Assessment 1

Show and Tell 2: Speech Synthesis

Databases and Progress in Methodology

Novel Architectures for ASR

Deepfake Detection

Tools for Speech Analysis

Text Processing and Evaluation for Speech Synthesis 1

Segmental and Tonal Units

Speech Quality Assessment

Speech Enhancement

Language Learning and Assessment

Speech Synthesis Paradigms and Methods 1

Spatial Audio and Acoustics 2

Text Processing and Evaluation for Speech Synthesis 2

General Topics in ASR

Acoustic Event Detection and Classification

Keyword Spotting and Retrieval

Multimodal Systems

Dysarthric Speech Assessment 2

Dialect Identification in Different Languages

Connecting Speech Science and Speech Technology for Children's Speech

Brain and Cognition

Regional, Social and Diachronic Variation

Speaker Extraction 2

Multimodal Emotion Recognition

Conversation, Communication and Interaction 2

Multimodal Speech and Language Processing in Healthcare Settings

Music and Audio Analysis

Audio Analysis, Generation and Assessment

Other Topics in Speech Recognition

Privacy and Anonymization

Language Modeling for Conversational Systems

Speech Accessibility Project Challenge

Neural Network Training Methods 1

Diversity: Age, Sex, Gender, Ethnicity, and More

Anomalous Sound Detection

Far-field and Robust Speech Recognition

Speech Synthesis Paradigms and Methods 2

Keynote 3 - Carol Y. Espy-Wilson: Speech Kinematic Analysis from Acoustics: Scientific, Clinical and Practical Applications

Articulatory Analyses

Speech and Audio Analysis and Representation

Show and Tell 3: Signal Processing / Multimodal processing

Speech and Voice Disorders 2

Neural Network Training Methods 2

Disentanglement of Information for Speaker Recognition

Error Correction and Confidence Estimation

Training and Scoring Methods for Speaker Recognition

Pathological Speech Analysis 2

Multimodal and Visual Speech Synthesis

Lexicon and Grammar

Noise Reduction and Dereverberation

Neural Network Training Methods and Architectures

Challenges in Speech Data Collection, Curation and Annotation - Part 1

Evaluation and Forensic Applications of Speaker Recognition

Language Resources

Bandwidth Expansion and Diffusion-based Speech Enhancement

Spoken Language Understanding

Multilingual Speech Synthesis and Special Applications 2

Prosody and Voice Quality

Generative Models for Audio

Challenges in Speech Data Collection, Curation and Annotation - Part 2

Speech Emotion Recognition 3

Emotion and Expressivity in Speech Synthesis and Voice Conversion

Streaming ASR

L1 and L2 Acquisition, Perception and Processing

Speech Emotion Recognition 2

Speaker Traits Recognition

Spoofing and Adversarial Attacks

Voice Conversion 2

Pathological Speech Analysis 3

Speech Emotion Recognition in Naturalistic Conditions Challenge

Prosody, Phoneme and Stress Modeling in ASR

Segments

Datasets and Tools for Speech Synthesis

Spoken Dialogue Systems 2

Speech Enhancement and Representation Learning

Neural Codecs and Vocoders

Adaptation and Target-speaker ASR

Show and Tell 4: Education / Assistive Technology

Source Separation 2

Speech Coding

Multimodality

Speech Assessment and Language Learning

Watermarking and Anonymization

Single-channel Speech Enhancement

Contextual Biasing and Adaptation

Speaker Diarization 2

Depression Detection and Assessment 2

Keynote 4 - Judith Holler: Using and comprehending language in face-to-face conversation

Pathological Speech Analysis 4

Speech Deepfakes

Prosody

Speech Analysis and Quality Assessment

Emotions and Foundational Models

Prediction and Evaluation of Speech Quality and Intelligibility

Multi-Talker ASR

Speech Synthesis Paradigms and Methods 3

Biosignal-enabled Spoken Communication

Speech Deepfakes, Antispoofing and Backdoor Attacks

Pathological Speech Analysis 5

ASR Assessment and Foundational Models

Speaker Recognition

Speech Analysis, Detection and Classification 2