Catherine Lai
Orcid: 0000-0003-2411-8954
According to our database, Catherine Lai authored at least 71 papers between 2004 and 2025.
  
  
Bibliography
  2025
    CoRR, June, 2025
    
  
    Proceedings of the 27th International Conference on Multimodal Interaction, 2025
    
  
Revise, Reason, and Recognize: LLM-Based Emotion Recognition via Emotion-Specific Prompts and ASR Error Correction.
    
  
    Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
    
  
Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling.
    
  
    Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
    
  
Can We "Cherry-Pick"? Investigating Multiple Renditions from a Generative Speech Synthesis Model.
    
  
    Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
    
  
  2024
Large Language Model Based Generative Error Correction: A Challenge and Baselines For Speech Recognition, Speaker Tagging, and Emotion Recognition.
    
  
    Proceedings of the IEEE Spoken Language Technology Workshop, 2024
    
  
    Proceedings of the IEEE Spoken Language Technology Workshop, 2024
    
  
Speech Emotion Recognition With ASR Transcripts: a Comprehensive Study on Word Error Rate and Fusion Techniques.
    
  
    Proceedings of the IEEE Spoken Language Technology Workshop, 2024
    
  
1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem.
    
  
    Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
    
  
Well, what can you do with messy data? Exploring the prosody and pragmatic function of the discourse marker "well" with found data and speech synthesis.
    
  
    Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
    
  
Layer-Wise Analysis of Self-Supervised Acoustic Word Embeddings: A Study on Speech Emotion Recognition.
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2024
    
  
Language Technologies as If People Mattered: Centering Communities in Language Technology Development.
    
  
    Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
    
  
  2023
Cross-Attention is Not Enough: Incongruity-Aware Multimodal Sentiment Analysis and Emotion Recognition.
    
  
    CoRR, 2023
    
  
    Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023
    
  
    Proceedings of the 2023 Workshop on Speech, Music and Mind, 2023
    
  
    Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, 2023
    
  
    Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
    
  
    Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
    
  
ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition.
    
  
    Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
    
  
    Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2023
    
  
    Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023
    
  
I Know Your Feelings Before You Do: Predicting Future Affective Reactions in Human-Computer Dialogue.
    
  
    Proceedings of the Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems, 2023
    
  
Empowering Dialogue Systems with Affective and Adaptive Interaction: Integrating Social Intelligence.
    
  
    Proceedings of the 11th International Conference on Affective Computing and Intelligent Interaction, ACII 2023, 2023
    
  
  2022
A Cross-Domain Approach for Continuous Impression Recognition from Dyadic Audio-Visual-Physio Signals.
    
  
    CoRR, 2022
    
  
    CoRR, 2022
    
  
    Proceedings of the IEEE Spoken Language Technology Workshop, 2022
    
  
    Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
    
  
    Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
    
  
Combining conversational speech with read speech to improve prosody in Text-to-Speech synthesis.
    
  
    Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
    
  
    Proceedings of the IEEE International Conference on Acoustics, 2022
    
  
Alzheimer's Dementia Detection through Spontaneous Dialogue with Proactive Robotic Listeners.
    
  
    Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, 2022
    
  
  2021
    IEEE Trans. Affect. Comput., 2021
    
  
    Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021
    
  
Location, Location: Enhancing the Evaluation of Text-to-Speech synthesis using the Rapid Prosody Transcription Paradigm.
    
  
    Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021
    
  
It's Not What You Said, it's How You Said it: Discriminative Perception of Speech as a Multichannel Communication System.
    
  
    Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
    
  
  2020
    Speech Commun., 2020
    
  
Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0.
    
  
    CoRR, 2020
    
  
  2019
    Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
    
  
    Proceedings of the Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems, 2019
    
  
  2018
    Proceedings of the Group Interaction Frontiers in Technology Workshop, 2018
    
  
    Proceedings of the 2018 International Conference on Multimodal Interaction, 2018
    
  
    Proceedings of the Workshop on Modeling Cognitive Processes from Multimodal Data, 2018
    
  
  2017
    Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
    
  
    Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
    
  
    Proceedings of the 1st ACM SIGCHI International Workshop on Investigating Social Interactions with Artificial Agents, 2017
    
  
Recognizing induced emotions of movie audiences: Are induced and perceived emotions the same?
    
  
    Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017
    
  
  2016
Recognizing emotions in spoken dialogue with hierarchically fused acoustic and lexical features.
    
  
    Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
    
  
    Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
    
  
  2015
    Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
    
  
    Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
    
  
    Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
    
  
    Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
    
  
  2014
Incorporating lexical and prosodic information at different levels for meeting summarization.
    
  
    Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
    
  
    Proceedings of the Computational Linguistics and Intelligent Text Processing, 2014
    
  
  2013
    Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2013
    
  
Detecting summarization hot spots in meetings using group level involvement and turn-taking features.
    
  
    Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
    
  
  2010
What do you mean, you're uncertain?: the interpretation of cue words and rising intonation in dialogue.
    
  
    Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
    
  
  2009
Perceiving surprise on cue words: prosody and semantics interact on right and really.
    
  
    Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
    
  
  2008
    Proceedings of the ISMIR 2008, 2008
    
  
  2007
    Bull. IEEE Tech. Comm. Digit. Libr., 2007
    
  
    Proceedings of the 8th International Conference on Music Information Retrieval, 2007
    
  
    Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
    
  
  2006
Data Dictionary: Metadata for Phonograph Records.
  
    Proceedings of the ISMIR 2006, 2006
    
  
  2005
Metadata for Phonograph Records: Facilitating New Forms of Use and Access to Analog Sound Recordings.
    
  
    Bull. IEEE Tech. Comm. Digit. Libr., 2005
    
  
    Proceedings of the 19th Pacific Asia Conference on Language, Information and Computation, 2005
    
  
    Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2005
    
  
    Proceedings of the ISMIR 2005, 2005
    
  
  2004
    Proceedings of the Australasian Language Technology Workshop, 2004