Catherine Lai

Orcid: 0000-0003-2411-8954

According to our database1, Catherine Lai authored at least 67 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling.
CoRR, 2024

Revise, Reason, and Recognize: LLM-Based Emotion Recognition via Emotion-Specific Prompts and ASR Error Correction.
CoRR, 2024

Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition.
CoRR, 2024

Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques.
CoRR, 2024

Crossmodal ASR Error Correction with Discrete Speech Units.
CoRR, 2024

1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

Layer-Wise Analysis of Self-Supervised Acoustic Word Embeddings: A Study on Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

Language Technologies as If People Mattered: Centering Communities in Language Technology Development.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Cross-Attention is Not Enough: Incongruity-Aware Multimodal Sentiment Analysis and Emotion Recognition.
CoRR, 2023

Synthesising turn-taking cues using natural conversational data.
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

Into the prosodic dimension: Finding meaning in the non-lexical aspects of speech.
Proceedings of the 2023 Workshop on Speech, Music and Mind, 2023

Synthesising Personality with Neural Speech Synthesis.
Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, 2023

Quantifying the perceptual value of lexical and non-lexical channels in speech.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Everyone has an accent.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Transfer Learning for Personality Perception via Speech Emotion Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Multimodal Dyadic Impression Recognition via Listener Adaptive Cross-Domain Fusion.
Proceedings of the IEEE International Conference on Acoustics, 2023

Do dialogue representations align with perception? An empirical study.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

I Know Your Feelings Before You Do: Predicting Future Affective Reactions in Human-Computer Dialogue.
Proceedings of the Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

Empowering Dialogue Systems with Affective and Adaptive Interaction: Integrating Social Intelligence.
Proceedings of the 11th International Conference on Affective Computing and Intelligent Interaction, ACII 2023, 2023

2022
A Cross-Domain Approach for Continuous Impression Recognition from Dyadic Audio-Visual-Physio Signals.
CoRR, 2022

Robotic Speech Synthesis: Perspectives on Interactions, Scenarios, and Ethics.
CoRR, 2022

Exploration of a Self-Supervised Speech Model: A Study on Emotional Corpora.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Investigating perception of spoken dialogue acceptability through surprisal.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Voice Puppetry with FastPitch.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Combining conversational speech with read speech to improve prosody in Text-to-Speech synthesis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Fusing ASR Outputs in Joint Training for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Alzheimer's Dementia Detection through Spontaneous Dialogue with Proactive Robotic Listeners.
Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, 2022

2021
Recognizing Induced Emotions of Movie Audiences from Multimodal Information.
IEEE Trans. Affect. Comput., 2021

Factors Affecting the Evaluation of Synthetic Speech in Context.
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

Location, Location: Enhancing the Evaluation of Text-to-Speech synthesis using the Rapid Prosody Transcription Paradigm.
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

It's Not What You Said, it's How You Said it: Discriminative Perception of Speech as a Multichannel Communication System.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
Integrating lexical and prosodic features for automatic paragraph segmentation.
Speech Commun., 2020

Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0.
CoRR, 2020

2019
Detecting Topic-Oriented Speaker Stance in Conversational Speech.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

"Why is the Doctor a Man": Reactions of Older Adults to a Virtual Training Doctor.
Proceedings of the Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems, 2019

2018
Polarity and Intensity: the Two Aspects of Sentiment Analysis.
CoRR, 2018

Multimodal Analysis of Group Attitudes Towards Meeting Management.
Proceedings of the Group Interaction Frontiers in Technology Workshop, 2018

Group Interaction Frontiers in Technology.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

Predicting group satisfaction in meeting discussions.
Proceedings of the Workshop on Modeling Cognitive Processes from Multimodal Data, 2018

2017
Using Prosody to Classify Discourse Relations.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A System for Real Time Collaborative Transcription Correction.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Recognizing emotions in spoken dialogue with acoustic and lexical cues.
Proceedings of the 1st ACM SIGCHI International Workshop on Investigating Social Interactions with Artificial Agents, 2017

Recognizing induced emotions of movie audiences: Are induced and perceived emotions the same?
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

2016
Recognizing emotions in spoken dialogue with hierarchically fused acoustic and lexical features.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Automatic Paragraph Segmentation with Lexical and Prosodic Features.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Towards automatic detection of reported speech in dialogue using prosodic cues.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A system for automatic broadcast news summarisation, geolocation and translation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Recognizing emotions in dialogues with acoustic and lexical features.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Emotion recognition in spontaneous and acted dialogues.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014
Incorporating lexical and prosodic information at different levels for meeting summarization.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Word-Level Emotion Recognition Using High-Level Features.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2014

2013
Applying rhythm metrics to non-native spontaneous speech.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2013

Detecting summarization hot spots in meetings using group level involvement and turn-taking features.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2010
Querying Linguistic Trees.
J. Log. Lang. Inf., 2010

What do you mean, you're uncertain?: the interpretation of cue words and rising intonation in dialogue.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Perceiving surprise on cue words: prosody and semantics interact on right and really.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008
Automatic Identification of Simultaneous Singers in Duet Recordings.
Proceedings of the ISMIR 2008, 2008

2007
Metadata Data Dictionary for Analog Sound Recordings.
Bull. IEEE Tech. Comm. Digit. Libr., 2007

Metadata Infrastructure for Sound Recordings.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Perception of disfluency: language differences and listener bias.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006
Data Dictionary: Metadata for Phonograph Records.
Proceedings of the ISMIR 2006, 2006

2005
Metadata for Phonograph Records: Facilitating New Forms of Use and Access to Analog Sound Recordings.
Bull. IEEE Tech. Comm. Digit. Libr., 2005

LPath+: A First-Order Complete Language for Linguistic Tree Query.
Proceedings of the 19st Pacific Asia Conference on Language, Information and Computation, 2005

The challenges in developing digital collections of phonograph records.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2005

Preservation Digitization of David Edelberg's Handel LP Collection: A Pilot Project.
Proceedings of the ISMIR 2005, 2005

2004
Querying and Updating Treebanks: A Critical Survey and Requirements Analysis.
Proceedings of the Australasian Language Technology Workshop, 2004


  Loading...