Shri Narayanan
ORCID: 0000-0002-1052-6204
Affiliations:
- University of Southern California, Signal Analysis and Interpretation Lab, Los Angeles, USA
According to our database, Shri Narayanan authored at least 912 papers between 1993 and 2024.
Awards
ACM Fellow 2023, "For contributions to speech, language, multimedia processing, affective computing, and their human-centered applications".
IEEE Fellow 2009, "For contributions to human-centric multimodal signal processing and applications".
Links
Online presence:
- on sail.usc.edu
- on orcid.org
- on id.loc.gov
- on d-nb.info
Bibliography
2024
IEEE J. Biomed. Health Informatics, October, 2024
Speech2rtMRI: Speech-Guided Diffusion Model for Real-time MRI Video of the Vocal Tract during Speech.
CoRR, 2024
CoRR, 2024
Egocentric Speaker Classification in Child-Adult Dyadic Interactions: From Sensing to Computational Modeling.
CoRR, 2024
Larger Language Models Don't Care How You Think: Why Chain-of-Thought Prompting Fails in Subjective Tasks.
CoRR, 2024
ModalityMirror: Improving Audio Classification in Modality Heterogeneity Federated Learning with Multimodal Distillation.
CoRR, 2024
Early Detection of Coffee Leaf Rust Through Convolutional Neural Networks Trained on Low-Resolution Images.
CoRR, 2024
Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understanding.
CoRR, 2024
Can Synthetic Audio From Generative Foundation Models Assist Audio Recognition and Speech Modeling?
CoRR, 2024
Exploring Speech Foundation Models for Speaker Diarization in Child-Adult Dyadic Interactions.
CoRR, 2024
ConPro: Learning Severity Representation for Medical Images using Contrastive Learning and Preference Optimization.
CoRR, 2024
TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality.
CoRR, 2024
The Strong Pull of Prior Knowledge in Large Language Models and Its Impact on Emotion Recognition.
CoRR, 2024
The NeurIPS 2023 Machine Learning for Audio Workshop: Affective Audio Benchmarks and Novel Data.
CoRR, 2024
Can Text-to-image Model Assist Multi-modal Learning for Visual Recognition with Visual Modality Missing?
CoRR, 2024
Understanding Stress, Burnout, and Behavioral Patterns in Medical Residents Using Large-scale Longitudinal Wearable Recordings.
CoRR, 2024
A Multi-Perspective Machine Learning Approach to Evaluate Police-Driver Interaction in Los Angeles.
CoRR, 2024
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Proceedings of the Eighteenth International AAAI Conference on Web and Social Media, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Does Video Summarization Require Videos? Quantifying the Effectiveness of Language in Video Summarization.
Proceedings of the IEEE International Conference on Acoustics, 2024
Foundation Model Assisted Automatic Speech Emotion Recognition: Transcribing, Annotating, and Augmenting.
Proceedings of the IEEE International Conference on Acoustics, 2024
TRUST-SER: On The Trustworthiness Of Fine-Tuning Pre-Trained Speech Embeddings For Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
ConPro: Learning Severity Representation for Medical Images using Contrastive Learning and Preference Optimization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
An Engineering View on Emotions and Speech: From Analysis and Predictive Models to Responsible Human-Centered Applications.
Proc. IEEE, October, 2023
Comput. Speech Lang., April, 2023
Modeling inter-individual differences in ambulatory-based multimodal signals via metric learning: a case study of personalized well-being estimation of healthcare workers.
Frontiers Digit. Health, March, 2023
IEEE Trans. Multim., 2023
Explainable Severity ranking via pairwise n-hidden comparison: a case study of glaucoma.
CoRR, 2023
CoRR, 2023
Learning Behavioral Representations of Routines From Large-scale Unlabeled Wearable Time-series Data Streams using Hawkes Point Process.
CoRR, 2023
Unlocking Foundation Models for Privacy-Enhancing Speech Understanding: An Early Study on Low Resource Speech Training Leveraging Label-guided Synthetic Speech Content.
CoRR, 2023
TrustSER: On the Trustworthiness of Fine-tuning Pre-trained Speech Embeddings For Speech Emotion Recognition.
CoRR, 2023
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
Understanding Spoken Language Development of Children with ASD Using Pre-trained Speech Embeddings.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Robust Self Supervised Speech Embeddings for Child-Adult Classification in Interactions involving Children with Autism.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Tensor Embedding: A Supervised Framework for Human Behavioral Data Mining and Prediction.
Proceedings of the 11th IEEE International Conference on Healthcare Informatics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Can Knowledge of End-to-End Text-to-Speech Models Improve Neural Midi-to-Audio Synthesis Systems?
Proceedings of the IEEE International Conference on Acoustics, 2023
Toward Privacy-Enhancing Ambulatory-Based Well-Being Monitoring: Investigating User Re-Identification Risk in Multimodal Data.
Proceedings of the IEEE International Conference on Acoustics, 2023
A Context-Aware Computational Approach for Measuring Vocal Entrainment in Dyadic Conversations.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Using Emotion Embeddings to Transfer Knowledge between Emotions, Languages, and Annotation Formats.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Signal Processing Grand Challenge 2023 - E-Prevention: Sleep Behavior as an Indicator of Relapses in Psychotic Patients.
Proceedings of the IEEE International Conference on Acoustics, 2023
Navigating and Reaching Therapeutic Goals with Dynamical Systems in Conversation-Based Interventions.
Proceedings of the IEEE International Conference on Acoustics, 2023
Designing and Evaluating Speech Emotion Recognition Systems: A Reality Check Case Study with IEMOCAP.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Context Unlocks Emotions: Text-based Emotion Classification Dataset Auditing with Large Language Models.
Proceedings of the 11th International Conference on Affective Computing and Intelligent Interaction, 2023
PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models.
Proceedings of the 11th International Conference on Affective Computing and Intelligent Interaction, 2023
2022
Robust Character Labeling in Movie Videos: Data Resources and Self-Supervised Feature Adaptation.
IEEE Trans. Multim., 2022
IEEE Trans. Affect. Comput., 2022
Modeling Vocal Entrainment in Conversational Speech Using Deep Unsupervised Learning.
IEEE Trans. Affect. Comput., 2022
IEEE Trans. Affect. Comput., 2022
IEEE J. Sel. Top. Signal Process., 2022
Studying Large-Scale Behavioral Differences in Auschwitz-Birkenau with Simulation of Gendered Narratives.
Digit. Humanit. Q., 2022
End-to-end neural systems for automatic children speech recognition: An empirical study.
Comput. Speech Lang., 2022
Comput. Speech Lang., 2022
Causal indicators for assessing the truthfulness of child speech in forensic interviews.
Comput. Speech Lang., 2022
An automated quality evaluation framework of psychotherapy conversations with local quality estimates.
Comput. Speech Lang., 2022
Exploring Workplace Behaviors through Speaking Patterns using Large-scale Multimodal Wearable Recordings: A Study of Healthcare Providers.
CoRR, 2022
A Review of Speech-centric Trustworthy Machine Learning: Privacy, Safety, and Fairness.
CoRR, 2022
Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection.
CoRR, 2022
CoRR, 2022
Unsupervised active speaker detection in media content using cross-modal information.
CoRR, 2022
VAuLT: Augmenting the Vision-and-Language Transformer with the Propagation of Deep Language Representations.
CoRR, 2022
Mel Frequency Spectral Domain Defenses against Adversarial Attacks on Speech Recognition Systems.
CoRR, 2022
Audio visual character profiles for detecting background characters in entertainment media.
CoRR, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
User-Level Differential Privacy against Attribute Inference Attack of Speech Emotion Recognition on Federated Learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Semi-FedSER: Semi-supervised Learning for Speech Emotion Recognition On Federated Learning using Multiview Pseudo-Labeling.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Automating Detection of Papilledema in Pediatric Fundus Images with Explainable Machine Learning.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022
Enhancing Privacy Through Domain Adaptive Noise Injection For Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022
Leveraging Open Data and Task Augmentation to Automated Behavioral Coding of Psychotherapy Conversations in Low-Resource Scenarios.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Proceedings of the 10th International Conference on Affective Computing and Intelligent Interaction, 2022
2021
IEEE Trans. Signal Process., 2021
Evidence of Task-Independent Person-Specific Signatures in EEG Using Subspace Techniques.
IEEE Trans. Inf. Forensics Secur., 2021
Meta-Learning With Latent Space Clustering in Generative Adversarial Network for Speaker Diarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
IEEE Signal Process. Lett., 2021
Proc. IEEE, 2021
Deep multiple instance learning for foreground speech localization in ambient audio from wearable devices.
EURASIP J. Audio Speech Music. Process., 2021
Unsupervised speech representation learning for behavior modeling using triplet enhanced contextualized networks.
Comput. Speech Lang., 2021
Comput. Speech Lang., 2021
An analysis of observation length requirements for machine understanding of human behaviors from spoken language.
Comput. Speech Lang., 2021
Attribute Inference Attack of Speech Emotion Recognition in Federated Learning Settings.
CoRR, 2021
Representation of professions in entertainment media: Insights into frequency and sentiment trends through computational text analysis.
CoRR, 2021
Perceptual-based deep-learning denoiser as a defense against adversarial attacks on ASR systems.
CoRR, 2021
Front-end Diarization for Percussion Separation in Taniavartanam of Carnatic Music Concerts.
CoRR, 2021
Automated Quality Assessment of Cognitive Behavioral Therapy Sessions Through Highly Contextualized Language Representations.
CoRR, 2021
"Am I A Good Therapist?" Automated Evaluation Of Psychotherapy Skills Using Speech And Language Technologies.
CoRR, 2021
A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images.
CoRR, 2021
Attention-gated convolutional neural networks for off-resonance correction of spiral real-time MRI.
CoRR, 2021
Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords.
CoRR, 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the Social, Cultural, and Behavioral Modeling, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Acted vs. Improvised: Domain Adaptation for Elicitation Approaches in Audio-Visual Emotion Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Analyzing Short Term Dynamic Speech Features for Understanding Behavioral Traits of Children with Autism Spectrum Disorder.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Feature Fusion Strategies for End-to-End Evaluation of Cognitive Behavior Therapy Sessions.
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021
Proceedings of the 18th International Conference on Content-Based Multimedia Indexing, 2021
Proceedings of the 18th International Conference on Content-Based Multimedia Indexing, 2021
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
Proceedings of the 9th International Conference on Affective Computing and Intelligent Interaction, 2021
Proceedings of the 9th International Conference on Affective Computing and Intelligent Interaction, 2021
2020
Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap.
IEEE Signal Process. Lett., 2020
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2020
Investigating Group-Specific Models of Hospital Workers' Well-Being: Implications for Algorithmic Bias.
Int. J. Semantic Comput., 2020
Leveraging Linguistic Context in Dyadic Interactions to Improve Automatic Speech Recognition for Children.
Comput. Speech Lang., 2020
Multi-Face: Self-supervised Multiview Adaptation for Robust Face Clustering in Videos.
CoRR, 2020
CoRR, 2020
Having a Bad Day? Detecting the Impact of Atypical Life Events Using Wearable Sensors.
CoRR, 2020
Affective Conditioning on Hierarchical Networks applied to Depression Detection from Transcribed Clinical Interviews.
CoRR, 2020
CoRR, 2020
A Label Proportions Estimation Technique for Adversarial Domain Adaptation in Text Classification.
CoRR, 2020
IEEE Access, 2020
Proceedings of the Social, Cultural, and Behavioral Modeling, 2020
An Empirical Analysis of Information Encoded in Disentangled Neural Speaker Representations.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
ATQAM/MAST'20: Joint Workshop on Aesthetic and Technical Quality Assessment of Multimedia and Media Analytics for Societal Trends.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
MediaEval 2020 Emotion and Theme Recognition in Music Task: Loss Function Approaches for Multi-label Music Tagging.
Proceedings of the Working Notes Proceedings of the MediaEval 2020 Workshop, 2020
Affective Conditioning on Hierarchical Attention Networks Applied to Depression Detection from Transcribed Clinical Interviews.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Sentence Level Estimation of Psycholinguistic Norms Using Joint Multidimensional Annotations.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Exploiting Conic Affinity Measures to Design Speech Enhancement Systems Operating in Unseen Noise Conditions.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020
Fifty Shades of Green: Towards a Robust Measure of Inter-annotator Agreement for Continuous Signals.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020
Proceedings of the 8th IEEE International Conference on Healthcare Informatics, 2020
Bringing in the Outliers: A Sparse Subspace Clustering Approach to Learn a Dictionary of Mouse Ultrasonic Vocalizations.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Learning Domain Invariant Representations for Child-Adult Classification from Speech.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Vocal Tract Articulatory Contour Detection in Real-Time Magnetic Resonance Images Using Spatio-Temporal Context.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
The Role of Annotation Fusion Methods in the Study of Human-Reported Emotion Experience During Music Listening.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Modeling Behavioral Consistency in Large-Scale Wearable Recordings of Human Bio-Behavioral Signals.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Modeling Behavior as Mutual Dependency between Physiological Signals and Indoor Location in Large-Scale Wearable Sensor Study.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Automatic Prediction of Suicidal Risk in Military Couples Using Multimodal Interaction Cues from Couples Conversations.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Trapezoidal Segment Sequencing: A Novel Approach for Fusion of Human-Produced Continuous Annotations.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Dynamical systems modeling of day-to-day signal-based patterns of emotional self-regulation and stress spillover in highly-demanding health professions.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020
Towards end-2-end learning for predicting behavior codes from spoken utterances in psychotherapy conversations.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the First Joint Workshop on Narrative Understanding, Storylines, and Events, 2020
2019
IEEE Signal Process. Lett., 2019
Pattern Recognit. Lett., 2019
Comput. Speech Lang., 2019
An analysis of observation length requirements in spoken language for machine understanding of human behaviors.
CoRR, 2019
Characterizing dynamically varying acoustic scenes from egocentric audio recordings in workplace setting.
CoRR, 2019
CoRR, 2019
Report of 2017 NSF Workshop on Multimedia Challenges, Opportunities and Research Roadmaps.
CoRR, 2019
Towards Adapting NMF Dictionaries Using Total Variability Modeling for Noise-Robust Acoustic Features.
CoRR, 2019
A system for the 2019 Sentiment, Emotion and Cognitive State Task of DARPAs LORELEI project.
CoRR, 2019
CoRR, 2019
Understanding affective expressions and experiences through behavioral machine intelligence.
Proceedings of the 2019 Workshop on Speech, Music and Mind, 2019
Using Shared Vector Representations of Words and Chords in Music for Genre Classification.
Proceedings of the 2019 Workshop on Speech, Music and Mind, 2019
A Multimodal View into Music's Effect on Human Neural, Physiological, and Emotional Experience.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Modeling Interpersonal Linguistic Coordination in Conversations Using Word Mover's Distance.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Multi-Task Discriminative Training of Hybrid DNN-TVM Model for Speaker Verification with Noisy and Far-Field Speech.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Proceedings of the 2019 IEEE International Conference on Healthcare Informatics, 2019
Reinforcing Self-expressive Representation with Constraint Propagation for Face Clustering in Movies.
Proceedings of the IEEE International Conference on Acoustics, 2019
An Empirical Study of Speech Processing in the Brain by Analyzing the Temporal Syllable Structure in Speech-input Induced EEG.
Proceedings of the IEEE International Conference on Acoustics, 2019
Speaker Agnostic Foreground Speech Detection from Audio Recordings in Workplace Settings from Wearable Recorders.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
On Role and Location of Normalization before Model-based Data Augmentation in Residual Blocks for Classification Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2019
Robust Speech Activity Detection in Movie Audio: Data Resources and Experimental Evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Role Specific Lattice Rescoring for Speaker Role Recognition from Speech Recognition Outputs.
Proceedings of the IEEE International Conference on Acoustics, 2019
Discovering Optimal Variable-length Time Series Motifs in Large-scale Wearable Recordings of Human Bio-behavioral Signals.
Proceedings of the IEEE International Conference on Acoustics, 2019
Improving the Prediction of Therapist Behaviors in Addiction Counseling by Exploiting Class Confusions.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the 27th European Signal Processing Conference, 2019
Stress and Anxiety Measurement "In-the-Wild" Using Quality-aware Multi-scale HRV Features.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019
A Comparative Study of Stress and Anxiety Estimation in Ecological Settings Using a Smart-shirt and a Smart-bracelet.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019
Imputing Missing Data In Large-Scale Multivariate Biomedical Wearable Recordings Using Bidirectional Recurrent Neural Networks With Temporal Activation Regularization.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019
Proceedings of the 2019 International Conference on Content-Based Multimedia Indexing, 2019
Prediction of Psychological Flexibility with multi-scale Heart Rate Variability and Breathing Features in an "in-the-wild" Setting.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2019
A system for the 2019 Sentiment, Emotion and Cognitive State Task of DARPA's LORELEI project.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction, 2019
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction, 2019
Trapezoidal Segmented Regression: A Novel Continuous-scale Real-time Annotation Approximation Algorithm.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
IEEE Trans. Multim., 2018
Acoustic Denoising Using Dictionary Learning With Spectral and Temporal Regularization.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Modeling Multiple Time Series Annotations as Noisy Distortions of the Ground Truth: An Expectation-Maximization Approach.
IEEE Trans. Affect. Comput., 2018
IEEE Trans. Affect. Comput., 2018
Phonetica, 2018
The ELISA Situation Frame extraction for low resource languages pipeline for LoReHLT'2016.
Mach. Transl., 2018
Normalization Before Shaking Toward Learning Symmetrically Distributed Representation Without Margin in Speech Emotion Recognition.
CoRR, 2018
Shaking Acoustic Spectral Sub-bands Can Better Regularize Learning in Affective Computing.
CoRR, 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
NTUA-SLP at SemEval-2018 Task 1: Predicting Affective Content in Tweets with Deep Attentive RNNs and Transfer Learning.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018
Proceedings of the 4th ACM Workshop on Wearable Systems and Applications, 2018
Proceedings of the 2018 on Audio/Visual Emotion Challenge and Workshop, 2018
Using Prosodic and Lexical Information for Learning Utterance-level Behaviors in Psychotherapy.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Denoising and Raw-waveform Networks for Weakly-Supervised Gender Identification on Noisy Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Exploring the Relationship between Conic Affinity of NMF Dictionaries and Speech Enhancement Metrics.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Towards an Unsupervised Entrainment Distance in Conversational Speech Using Deep Neural Networks.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
A Knowledge Driven Structural Segmentation Approach for Play-Talk Classification During Autism Assessment.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Language Features for Automated Evaluation of Cognitive Behavior Psychotherapy Sessions.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018
Improving Semi-Supervised Classification for Low-Resource Speech Interaction Applications.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Shaking Acoustic Spectral Sub-Bands can Better Regularize Learning in Affective Computing.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Semi-Supervised and Transfer Learning Approaches for Low Resource Sentiment Classification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018
Discovering Latent Psychological Structures from Self-Report Assessments of Hospital Workers.
Proceedings of the 5th International Conference on Behavioral, 2018
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018
2017
IEEE Trans. Affect. Comput., 2017
Signal Processing and Machine Learning for Mental Health Research and Clinical Applications [Perspectives].
IEEE Signal Process. Mag., 2017
Characterizing Types of Convolution in Deep Convolutional Recurrent Neural Networks for Robust Speech Emotion Recognition.
CoRR, 2017
Computer, 2017
Tweester at SemEval-2017 Task 4: Fusion of Semantic-Affective and pairwise classification models for sentiment analysis in Twitter.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Test-Retest Repeatability of Articulatory Strategies Using Real-Time Magnetic Resonance Imaging.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Semantic Edge Detection for Tracking Vocal Tract Air-Tissue Boundaries in Real-Time Magnetic Resonance Images.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Comparison of Basic Beatboxing Articulations Between Expert and Novice Artists Using Real-Time Magnetic Resonance Imaging.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Global SNR Estimation of Speech Signals for Unknown Noise Conditions Using Noise Adapted Non-Linear Regression.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Complexity in Speech and its Relation to Emotional Bond in Therapist-Patient Interactions During Suicide Risk Assessment Interviews.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Exploiting Intra-Annotator Rating Consistency Through Copeland's Method for Estimation of Ground Truth Labels in Couples' Therapy.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Extracting Situation Frames from Non-English Speech: Evaluation Framework and Pilot Results.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Transfer Learning Between Concepts for Human Behavior Modeling: An Application to Sincerity and Deception Prediction.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Multi-Scale Context Adaptation for Improving Child Automatic Speech Recognition in Child-Adult Spoken Interactions.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
An Affect Prediction Approach Through Depression Severity Parameter Incorporation in Neural Networks.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Acoustic-Prosodic and Physiological Response to Stressful Interactions in Children with Autism Spectrum Disorder.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Deep convolutional recurrent neural network with attention mechanism for robust speech emotion recognition.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Quantifying regulation mechanisms in dating couples through a dynamical systems model of acoustic and physiological arousal.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
A knowledge-driven framework for ECG representation and interpretation for wearable applications.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Multimodal detection of fake social media use through a fusion of classification and pairwise ranking systems.
Proceedings of the 25th European Signal Processing Conference, 2017
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
Weighted geodesic flow kernel for interpersonal mutual influence modeling and emotion recognition in dyadic interactions.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017
Exploring sparse representation measures of physiological synchrony for romantic couples.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017
Proceedings of the 2017 Conference on Designing Interactive Systems, 2017
2016
Markov Chain Monte Carlo Inference of Parametric Dictionaries for Sparse Bayesian Approximations.
IEEE Trans. Signal Process., 2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing.
IEEE Trans. Affect. Comput., 2016
A technology prototype system for rating therapist empathy from audio recordings in addiction counseling.
PeerJ Comput. Sci., 2016
The USC CreativeIT database of multimodal dyadic interactions: from speech and full body motion capture to continuous emotional annotations.
Lang. Resour. Evaluation, 2016
Online rate adjustment for adaptive random access compressed sensing of time-varying fields.
EURASIP J. Adv. Signal Process., 2016
Directly data-derived articulatory gesture-like representations retain discriminatory information about phone categories.
Comput. Speech Lang., 2016
Speaker verification based on the fusion of speech acoustics and inverted articulatory signals.
Comput. Speech Lang., 2016
Analysis of engagement behavior in children during dyadic interactions using prosodic cues.
Comput. Speech Lang., 2016
Detecting paralinguistic events in audio stream using context in features and probabilistic decisions.
Comput. Speech Lang., 2016
Inferring object rankings based on noisy pairwise comparisons from multiple annotators.
CoRR, 2016
Tweester at SemEval-2016 Task 4: Sentiment Analysis in Twitter Using Semantic-Affective Model Adaptation.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016
Understanding individual-level speech variability: From novel speech production data to robust speaker recognition.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016
Proceedings of the 18th IEEE International Workshop on Multimedia Signal Processing, 2016
Comparison of feature-level and kernel-level data fusion methods in multi-sensory fall detection.
Proceedings of the 18th IEEE International Workshop on Multimedia Signal Processing, 2016
Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, 2016
Proceedings of the 26th IEEE International Workshop on Machine Learning for Signal Processing, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Behavioral Coding of Therapist Language in Addiction Counseling Using Recurrent Neural Networks.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Convex Hull Convolutive Non-Negative Matrix Factorization for Uncovering Temporal Patterns in Multivariate Time-Series Data.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Non-Iterative Parameter Estimation for Total Variability Model Using Randomized Singular Value Decomposition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Illustrating the Production of the International Phonetic Alphabet Sounds Using Fast Real-Time Magnetic Resonance Imaging.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Sensitivity of Quantitative RT-MRI Metrics of Vocal Tract Dynamics to Image Reconstruction Settings.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
An Expectation Maximization Approach to Joint Modeling of Multidimensional Ratings Derived from Multiple Annotators.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Noise Aware and Combined Noise Models for Speech Denoising in Unknown Noise Conditions.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Complexity in Prosody: A Nonlinear Dynamical Systems Approach for Dyadic Conversations; Behavior and Outcomes in Couples Therapy.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
State-of-the-Art MRI Protocol for Comprehensive Assessment of Vocal Tract Structure and Function.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Improved Depiction of Tissue Boundaries in Vocal Tract Real-Time MRI Using Automatic Off-Resonance Correction.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Investigation of Speed-Accuracy Tradeoffs in Speech Production Using Real-Time Magnetic Resonance Imaging.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Objective Language Feature Analysis in Children with Neurodevelopmental Disorders During Autism Assessment.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Attention Assisted Discovery of Sub-Utterance Structure in Speech Emotion Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Laughter Valence Prediction in Motivational Interviewing Based on Lexical and Acoustic Cues.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Acoustic-Prosodic and Turn-Taking Features in Interactions with Children with Neurodevelopmental Disorders.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Lightly-supervised utterance-level emotion identification using latent topic modeling of multimodal words.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Pathological speech processing: State-of-the-art, current challenges, and future directions.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE Global Conference on Signal and Information Processing, 2016
EDA-gram: Designing electrodermal activity fingerprints for visualization and feature extraction.
Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2016
Developing an Automated Report Card for Addiction Counseling: The Counselor Observer Ratings Expert for MI (CORE-MI).
Proceedings of the AMIA 2016, 2016
Proceedings of the 50th Asilomar Conference on Signals, Systems and Computers, 2016
Proceedings of the 2016 AAAI Spring Symposia, 2016
2015
IEEE Trans. Multim., 2015
IEEE Trans. Biomed. Eng., 2015
Comput. Speech Lang., 2015
Structured sparse methods for active ocean observation systems with communication constraints.
IEEE Commun. Mag., 2015
Keynote speech 4: Extraction of linguistic and paralinguistic information from audio-visual data.
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015
Analyzing speech rate entrainment and its relation to therapist empathy in drug addiction counseling.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Ensemble of Gaussian mixture localized neural networks with application to phone recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Experimental assessment of the tongue incompressibility hypothesis during speech production.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
An analysis of the relationship between signal-derived vocal arousal score and human emotion production and perception.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Analysis and modeling of the role of laughter in motivational interviewing based psychotherapy conversations.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Predicting therapist empathy in motivational interviews using language features inspired by psycholinguistic norms.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
A dialog act tagging approach to behavioral coding: a case study of addiction counseling conversations.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Acoustic-prosodic correlates of 'awkward' prosody in story retellings from adolescents with autism.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Automated evaluation of non-native English pronunciation quality: combining knowledge- and data-driven features at multiple time scales.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
A discriminative reliability-aware classification model with applications to intelligibility classification in pathological speech.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Factor analysis of vocal-tract outlines derived from real-time magnetic resonance imaging data.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015
Proceedings of the 18th International Congress of Phonetic Sciences, 2015
Gestural coordination of Brazilian Portuguese nasal vowels in CV syllables: A real-time MRI study.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015
Systematic variation in the articulation of the Korean liquid across prosodic positions.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Redundancy analysis of behavioral coding for couples therapy and improved estimation of behavior from noisy annotations.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
A mixture of experts approach towards intelligibility classification of pathological speech.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
On quantifying facial expression-related atypicality of children with Autism Spectrum Disorder.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Quantifying EDA synchrony through joint sparse representation: A case-study of couples' interactions.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 23rd European Signal Processing Conference, 2015
A quantitative analysis of gender differences in movies using psycholinguistic normatives.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
Modeling head motion entrainment for prediction of couples' behavioral characteristics.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
Context-sensitive learning for enhanced audiovisual emotion classification (Extended abstract).
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
2014
Analysis and Predictive Modeling of Body Language Behavior in Dyadic Interactions From Multimodal Interlocutor Cues.
IEEE Trans. Multim., 2014
Theoretical Analysis of Diversity in an Ensemble of Automatic Speech Recognition Systems.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Robust Unsupervised Arousal Rating: A Rule-Based Framework with Knowledge-Inspired Vocal Features.
IEEE Trans. Affect. Comput., 2014
Gestural Control in the English Past-Tense Suffix: An Articulatory Study Using Real-Time MRI.
Phonetica, 2014
Simplified supervised i-vector modeling with application to robust and efficient language identification and speaker verification.
Comput. Speech Lang., 2014
Computing vocal entrainment: A signal-derived PCA-based quantification scheme with application to affect analysis in married couple interactions.
Comput. Speech Lang., 2014
Intoxicated speech detection: A fusion framework with speaker-normalized hierarchical functionals and GMM supervectors.
Comput. Speech Lang., 2014
Improving speech recognition for children using acoustic adaptation and pronunciation modeling.
Proceedings of the 4th Workshop on Child, Computer and Interaction, 2014
SAIL-GRS: Grammar Induction for Spoken Dialogue Systems using CF-IRF Rule Similarity.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014
Multimodal Prediction of Affective Dimensions and Depression in Human-Computer Interactions.
Proceedings of the 4th International Workshop on Audio/Visual Emotion Challenge, 2014
Detection of Musical Event Drop from Crowdsourced Annotations Using a Noisy Channel Model.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Joint filtering and factorization for recovering latent structure from noisy speech data.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Modified-prior i-vector estimation for language identification of short duration utterances.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Motor control primitives arising from a learned dynamical systems model of speech articulation.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Selection of optimal vocal tract regions using real-time magnetic resonance imaging for robust voice activity detection.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 2nd International Workshop on Speech, Language and Audio in Multimedia, 2014
Estimation of the movement trajectories of non-crucial articulators based on the detection of crucial moments and physiological constraints.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
A study of invariant properties and variation patterns in the converter/distributor model for emotional speech.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Predicting client's inclination towards target behavior change in motivational interviewing and investigating the role of laughter.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
An investigation of vocal arousal dynamics in child-psychologist interactions using synchrony measures and a conversation-based model.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014
Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Power-spectral analysis of head motion signal for behavioral modeling in human interaction.
Proceedings of the IEEE International Conference on Acoustics, 2014
Energy-constrained minimum variance response filter for robust vowel spectral estimation.
Proceedings of the IEEE International Conference on Acoustics, 2014
Classification of clean and noisy bilingual movie audio for speech-to-speech translation corpora design.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
A non-homogeneous poisson process model of Skin Conductance Responses integrated with observed regulatory behaviors for Autism intervention.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Hull detection based on largest empty sector angle with application to analysis of realtime MR images.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
IEEE Trans. Medical Imaging, 2013
IEEE Trans. Knowl. Data Eng., 2013
IEEE Trans. Speech Audio Process., 2013
IEEE Trans. Affect. Comput., 2013
Statistical methods for estimation of direct and differential kinematics of the vocal tract.
Speech Commun., 2013
Toward automating a human behavioral coding system for married couples' interactions using speech acoustic features.
Speech Commun., 2013
Proc. IEEE, 2013
Behavioral Signal Processing: Deriving Human Behavioral Informatics From Speech and Language.
Proc. IEEE, 2013
A Globally-Variant Locally-Constant Model for Fusion of Labels from Multiple Diverse Experts without Using Reference Labels.
IEEE Trans. Pattern Anal. Mach. Intell., 2013
Tracking continuous emotional trends of participants during affective dyadic interactions using body language and speech information.
Image Vis. Comput., 2013
High-quality bilingual subtitle document alignments with application to spontaneous speech translation.
Comput. Speech Lang., 2013
Enriching machine-mediated speech-to-speech translation using contextual information.
Comput. Speech Lang., 2013
Enabling effective design of multimodal interfaces for speech-to-speech translation system: An empirical study of longitudinal user behaviors over time and user strategies for coping with errors.
Comput. Speech Lang., 2013
Comput. Speech Lang., 2013
Automatic speaker age and gender recognition using acoustic and prosodic level information fusion.
Comput. Speech Lang., 2013
Comput. Speech Lang., 2013
DeepPurple: Lexical, String and Affective Feature Fusion for Sentence-Level Semantic Similarity Estimation.
Proceedings of the Second Joint Conference on Lexical and Computational Semantics, 2013
Proceedings of the SIGDIAL 2013 Conference, 2013
Proceedings of the 7th International Workshop on Semantic Evaluation, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
A two-step technique for MRI audio enhancement using dictionary learning and wavelet packet analysis.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Multi-band long-term signal variability features for robust voice activity detection.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
A robust frontend for VAD: exploiting contextual, discriminative and spectral cues of human voice.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Articulatory settings facilitate mechanically advantageous motor control of vocal tract articulators.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Vocal tract cross-distance estimation from real-time MRI using region-of-interest analysis.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Paralinguistic event detection from speech using probabilistic time-series smoothing and masking.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Information theoretic acoustic feature selection for acoustic-to-articulatory inversion.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Analyzing the structure of parent-moderated narratives from children with ASD using an entity-based approach.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
On the computation of document frequency statistics from spoken corpora using factor automata.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Acoustic-prosodic, turn-taking, and language cues in child-psychologist interactions for varying social demand.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Classifying language-related developmental disorders from speech cues: the promise and the potential confounds.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Empirical link between hypothesis diversity and fusion performance in an ensemble of automatic speech recognition systems.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013
Quantifying atypicality in affective facial expressions of children with autism spectrum disorders.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013
An audio-visual approach to learning salient behaviors in couples' problem solving discussions.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013
Toward body language generation in dyadic interaction settings from interlocutor multimodal cues.
Proceedings of the IEEE International Conference on Acoustics, 2013
Data driven modeling of head motion towards analysis of behaviors in couple interactions.
Proceedings of the IEEE International Conference on Acoustics, 2013
A study on the effect of prosodic emphasis transfer on overall speech translation quality.
Proceedings of the IEEE International Conference on Acoustics, 2013
Combining window predictions efficiently - A new imputation approach for noise robust automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013
A robust frontend for ASR: Combining denoising, noise masking and feature normalization.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Spatial and temporal alignment of multimodal human speech production data: Real time imaging, flesh point tracking and audio.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Using physiology and language cues for modeling verbal response latencies of children with ASD.
Proceedings of the IEEE International Conference on Acoustics, 2013
Annotation and processing of continuous emotional attributes: Challenges and opportunities.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
2012
IEEE Trans. Inf. Theory, 2012
KNOWME: An Energy-Efficient Multimodal Body Area Network for Physical Activity Monitoring.
ACM Trans. Embed. Comput. Syst., 2012
Novel Variations of Group Sparse Regularization Techniques With Applications to Noise Robust Automatic Speech Recognition.
IEEE Trans. Speech Audio Process., 2012
IEEE Trans. Affect. Comput., 2012
Pattern Recognit., 2012
EURASIP J. Adv. Signal Process., 2012
IEEE Commun. Mag., 2012
Proceedings of the Third Workshop on Child, Computer and Interaction, 2012
A reranking approach for recognition and classification of speech input in conversational dialogue systems.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012
Based on Isolated Saliency or Causal Integration? Toward a Better Understanding of Human Annotation Process using Multiple Instance Learning and Sequential Probability Ratio Test.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Intelligibility classification of pathological speech using fusion of multiple high level descriptors.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Emphatic segments and emphasis spread in Lebanese Arabic: a Real-time Magnetic Resonance Imaging Study.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Interplay between verbal response latency and physiology of children with autism during ECA interactions.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
A Case Study: Detecting Counselor Reflections in Psychotherapy for Addictions using Linguistic Features.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
A Robust Unsupervised Arousal Rating Framework using Prosody with Cross-Corpora Evaluation.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Spontaneous-Speech Acoustic-Prosodic Features of Children with Autism and the Interacting Psychologist.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Speaker Personality Classification Using Systems Based on Acoustic-Lexical Cues and an Optimal Tree-Structured Bayesian Network.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Multimodal detection of salient behaviors of approach-avoidance in dyadic interactions.
Proceedings of the International Conference on Multimodal Interaction, 2012
Analyzing the memory of BLSTM Neural Networks for enhanced emotion classification in dyadic spoken interactions.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
A hierarchical framework for modeling multimodality and emotional evolution in affective dialogs.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Speaker states recognition using latent factor analysis based Eigenchannel factor vector modeling.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Object classification in sidescan sonar images with sparse representation techniques.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
An acoustic analysis of shared enjoyment in ECA interactions of children with autism.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Improvements in predicting children's overall reading ability by modeling variability in evaluators' subjective judgments.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Analyzing quality of crowd-sourced speech transcriptions of noisy audio for acoustic model adaptation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Supervised acoustic topic model with a consequent classifier for unstructured audio classification.
Proceedings of the 10th International Workshop on Content-Based Multimedia Indexing, 2012
Analyzing the language of therapist empathy in Motivational Interview based psychotherapy.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
Exploiting speech production information for automatic speech and speaker modeling and recognition - possibilities and new opportunities.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
Speaker verification using Lasso based sparse total variability supervector with PLDA modeling.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
Using measures of vocal entrainment to inform outcome-related behaviors in marital conflicts.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
A study of emotional information present in articulatory movements estimated using acoustic-to-articulatory inversion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
A System for Real-time Twitter Sentiment Analysis of 2012 U.S. Presidential Election Cycle.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, 2012
2011
IEEE Trans. Signal Process., 2011
Introduction to the special issue on speech and language processing of children's speech for child-machine interaction applications.
ACM Trans. Speech Lang. Process., 2011
Automatically assessing the ABCs: Verification of children's spoken letter-names and letter-sounds.
ACM Trans. Speech Lang. Process., 2011
IEEE Trans. Speech Audio Process., 2011
IEEE ACM Trans. Audio Speech Lang. Process., 2011
IEEE Trans. Speech Audio Process., 2011
IEEE Trans. Speech Audio Process., 2011
Automatic Prediction of Children's Reading Ability for High-Level Literacy Assessment.
IEEE ACM Trans. Audio Speech Lang. Process., 2011
Speech Commun., 2011
Joint source-filter optimization for robust glottal source estimation in the presence of shimmer and jitter.
Speech Commun., 2011
Comput. Speech Lang., 2011
EmotiWord: Affective Lexicon Creation with Application to Interaction and Multimedia Data.
Proceedings of the Computational Intelligence for Multimedia Understanding, 2011
Behavioral signal processing for understanding (distressed) dyadic interactions: some recent developments.
Proceedings of the 2011 joint ACM workshop on Human gesture and behavior understanding, 2011
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Automatic Data-Driven Learning of Articulatory Primitives from Real-Time MRI Data Using Convolutive NMF with Sparseness Constraints.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Direct Estimation of Articulatory Kinematics from Real-Time Magnetic Resonance Image Sequences.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
An Analysis of PCA-Based Vocal Entrainment Measures in Married Couples' Affective Spoken Interactions.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Morphological Variation in the Adult Vocal Tract: A Modeling Study of its Potential Acoustic Impact.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Visualization of Vocal Tract Shape Using Interleaved Real-Time MRI of Multiple Scan Planes.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
An Exploratory Study of the Relations Between Perceived Emotion Strength and Articulatory Kinematics.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Automatic Identification of Salient Acoustic Instances in Couples' Behavioral Interactions Using Diverse Density Support Vector Machines.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Analysis of Inter-Articulator Correlation in Acoustic-to-Articulatory Inversion Using Generalized Smoothness Criterion.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Enhancements to the Training Process of Classifier-Based Speech Translator via Topic Modeling.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Intoxicated Speech Detection by Fusion of Speaker Normalized Hierarchical Features and GMM Supervectors.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
"You made me do it": Classification of Blame in Married Couples' Interactions by Fusing Automatically Derived Speech and Language Information.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
The USC CARE Corpus: Child-Psychologist Interactions of Children with Autism Spectrum Disorders.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Rachel: Design of an emotionally targeted interactive agent for children with autism.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011
Overlapped speech detection using long-term spectro-temporal similarity in stereo recording.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Estimation of ordinal approach-avoidance labels in dyadic interactions: Ordinal logistic regression approach.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Robust talking face video verification using joint factor analysis and sparse representation on GMM mean shifted supervectors.
Proceedings of the IEEE International Conference on Acoustics, 2011
Directional descriptors using Zernike moment phases for object orientation estimation in underwater sonar images.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Emotion classification from speech using evaluator reliability-weighted combination of ranked lists.
Proceedings of the IEEE International Conference on Acoustics, 2011
Accurate transcription of broadcast news speech using multiple noisy transcribers and unsupervised reliability metrics.
Proceedings of the IEEE International Conference on Acoustics, 2011
Modeling high-level descriptions of real-life physical activities using latent topic modeling of multimodal sensor signals.
Proceedings of the 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2011
Affective State Recognition in Married Couples' Interactions Using PCA-Based Vocal Entrainment Measures with Multiple Instance Learning.
Proceedings of the Affective Computing and Intelligent Interaction, 2011
Proceedings of the Affective Computing and Intelligent Interaction, 2011
Proceedings of the Affective Computing and Intelligent Interaction, 2011
Proceedings of the Affective Computing and Intelligent Interaction, 2011
"That's Aggravating, Very Aggravating": Is It Possible to Classify Behaviors in Couple Interactions Using Automatically Derived Lexical Features?
Proceedings of the Affective Computing and Intelligent Interaction, 2011
2010
Nonproduct data-dependent partitions for mutual information estimation: strong consistency and applications.
IEEE Trans. Signal Process., 2010
Optimal Arousal Identification and Classification for Affective Computing Using Physiological Signals: Virtual Reality Stroop Task.
IEEE Trans. Affect. Comput., 2010
IEEE Signal Process. Lett., 2010
Multimodal Speaker Segmentation and Identification in Presence of Overlapped Speech Segments.
J. Multim., 2010
Robust Multimodal Person Recognition Using Low-Complexity Audio-Visual Feature Fusion Approaches.
Int. J. Semantic Comput., 2010
Towards modeling user behavior in interactions mediated through an automated bidirectional speech translation system.
Comput. Speech Lang., 2010
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010
Proceedings of the 2010 IEEE International Workshop on Multimedia Signal Processing, 2010
Proceedings of the Intelligent Virtual Agents, 10th International Conference, 2010
Proceedings of the IEEE International Symposium on Information Theory, 2010
A near-optimal (minimax) tree-structured partition for mutual information estimation.
Proceedings of the IEEE International Symposium on Information Theory, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional LSTM modeling.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Investigating articulatory setting - pauses, ready position, and rest - using real-time MRI.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Rapid semi-automatic segmentation of real-time magnetic resonance images for parametric vocal tract analysis.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
A cluster-profile representation of emotion using agglomerative hierarchical clustering.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Vocal tract contour analysis of emotional speech by the functional data curve representation.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Quantification of prosodic entrainment in affective spontaneous spoken interactions of married couples.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Improved real-time MRI of oral-velar coordination using a golden-ratio spiral view order.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
A study of interplay between articulatory movement and prosodic characteristics in emotional speech production.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
A variable frame length and rate algorithm based on the spectral kurtosis measure for speaker verification.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
An improved cluster model selection method for agglomerative hierarchical speaker clustering using incremental Gaussian mixture models.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
A study of intra-speaker and inter-speaker affective variability using electroglottograph and inverse filtered glottal waveforms.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Data-dependent evaluator modeling and its application to emotional valence classification from speech.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 20th International Conference on Pattern Recognition, 2010
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Decision level combination of multiple modalities for recognition and analysis of emotional expression.
Proceedings of the IEEE International Conference on Acoustics, 2010
Visual emotion recognition using compact facial representations and viseme information.
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the 18th European Signal Processing Conference, 2010
2009
IEEE Trans. Signal Process., 2009
Human Perception of Audio-Visual Synthetic Character Emotion Expression in the Presence of Ambiguous and Conflicting Information.
IEEE Trans. Multim., 2009
Region Segmentation in the Frequency Domain Applied to Upper Airway Real-Time Magnetic Resonance Images.
IEEE Trans. Medical Imaging, 2009
Automatic Detection of Disfluency Boundaries in Spontaneous Speech of Children Using Audio-Visual Information.
IEEE Trans. Speech Audio Process., 2009
An Iterative Relative Entropy Minimization-Based Data Selection Approach for n-Gram Model Adaptation.
IEEE Trans. Speech Audio Process., 2009
Prominence Detection Using Auditory Attention Cues and Task-Dependent High Level Information.
IEEE Trans. Speech Audio Process., 2009
IEEE Trans. Speech Audio Process., 2009
Analysis of Emotionally Salient Aspects of Fundamental Frequency for Emotion Detection.
IEEE Trans. Speech Audio Process., 2009
Unsupervised Adaptation of Categorical Prosody Models for Prosody Labeling and Speech Recognition.
IEEE Trans. Speech Audio Process., 2009
IEEE Signal Process. Lett., 2009
Assessment of emerging reading skills in young native speakers and language learners.
Speech Commun., 2009
Timing effects of syllable structure and stress on nasals: A real-time MRI examination.
J. Phonetics, 2009
Combining lexical, syntactic and prosodic cues for improved online dialog act tagging.
Comput. Speech Lang., 2009
Proceedings of the Second Workshop on Child, Computer and Interaction, 2009
Proceedings of the Second Workshop on Child, Computer and Interaction, 2009
Comparison of child-human and child-computer interactions based on manual annotations.
Proceedings of the Second Workshop on Child, Computer and Interaction, 2009
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009
Saliency-driven unstructured acoustic scene classification using latent perceptual indexing.
Proceedings of the 2009 IEEE International Workshop on Multimedia Signal Processing, 2009
A Low-Complexity Dynamic Face-Voice Feature Fusion Approach to Multimodal Person Recognition.
Proceedings of the 11th IEEE International Symposium on Multimedia, 2009
Proceedings of the IEEE International Symposium on Information Theory, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Evaluating evaluators: a case study in understanding the benefits and pitfalls of multi-evaluator modeling.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Modeling mutual influence of interlocutor emotion states in dyadic spoken interactions.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Signature cluster model selection for incremental Gaussian mixture cluster modeling in agglomerative hierarchical speaker clustering.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Improved speaker diarization of meeting speech with recurrent selection of representative speech segments and participant interaction pattern modeling.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
A divide-and-conquer approach to Latent Perceptual Indexing of audio for large Web 2.0 applications.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009
Robust word boundary detection in spontaneous speech using acoustic and lexical cues.
Proceedings of the IEEE International Conference on Acoustics, 2009
Accelerated 3D MRI of vocal tract shaping using compressed sensing and parallel imaging.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Automatic pronunciation verification of English letter-names for early literacy assessment of preliterate children.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the Distributed Computing in Sensor Systems, 2009
Proceedings of the 4th International ICST Conference on Body Area Networks, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
Proceedings of the Affective Computing and Intelligent Interaction, 2009
2008
IEEE Trans. Signal Process., 2008
IEEE Trans. Signal Process., 2008
IEEE Trans. Speech Audio Process., 2008
Using Articulatory Representations to Detect Segmental Errors in Nonnative Pronunciation.
IEEE Trans. Speech Audio Process., 2008
Exploiting Acoustic and Syntactic Features for Automatic Prosody Labeling in a Maximum Entropy Framework.
IEEE Trans. Speech Audio Process., 2008
Strategies to Improve the Robustness of Agglomerative Hierarchical Clustering Under Data Source Variation for Speaker Diarization.
IEEE Trans. Speech Audio Process., 2008
IEEE Trans. Speech Audio Process., 2008
Seeing speech: Capturing vocal tract shaping using real-time magnetic resonance imaging [Exploratory DSP].
IEEE Signal Process. Mag., 2008
Lang. Resour. Evaluation, 2008
Proceedings of the First Workshop on Child, Computer and Interaction, 2008
Proceedings of the First Workshop on Child, Computer and Interaction, 2008
An empirical analysis of user uncertainty in problem-solving child-machine interactions.
Proceedings of the First Workshop on Child, Computer and Interaction, 2008
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008
Proceedings of the International Workshop on Multimedia Signal Processing, 2008
Proceedings of the International Workshop on Multimedia Signal Processing, 2008
Proceedings of the Tenth IEEE International Symposium on Multimedia (ISM2008), 2008
Selection of Emotionally Salient Audio-Visual Features for Modeling Human Evaluations of Synthetic Character Emotion Displays.
Proceedings of the Tenth IEEE International Symposium on Multimedia (ISM2008), 2008
Proceedings of the Tenth IEEE International Symposium on Multimedia (ISM2008), 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Relation between geometry and kinematics of articulatory trajectory associated with emotional speech production.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
An interval type-2 fuzzy logic system to translate between emotion-related vocabularies.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Combining task-dependent information with auditory attention cues for prominence detection in speech.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Agglomerative hierarchical speaker clustering using incremental Gaussian mixture cluster modeling.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Scripted dialogs versus improvisation: lessons learned about emotional elicitation techniques from the IEMOCAP database.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
The expression and perception of emotions: comparing assessments of self versus others.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
An analysis of vocal tract shaping in English sibilant fricatives using real-time magnetic resonance imaging.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Estimation of children's reading ability by fusion of automatic pronunciation verification and fluency detection.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Classification of sound clips by two schemes: Using onomatopoeia and semantic labels.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
Joint-processing of audio-visual signals in human perception of conflicting synthetic character emotions.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Modeling the intonation of discourse segments for improved online dialog act tagging.
Proceedings of the IEEE International Conference on Acoustics, 2008
Computation as estimation: Estimation-theoretic IC design improves robustness and reduces power consumption.
Proceedings of the IEEE International Conference on Acoustics, 2008
Human perception of synthetic character emotions in the presence of conflicting and congruent vocal and facial expressions.
Proceedings of the IEEE International Conference on Acoustics, 2008
A top-down auditory attention model for learning task dependent influences on prominence detection in speech.
Proceedings of the IEEE International Conference on Acoustics, 2008
Novel inter-cluster distance measure combining GLR and ICR for improved agglomerative hierarchical speaker clustering.
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Recognition for synthesis: Automatic parameter selection for resynthesis of emotional speech from neutral speech.
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Automatic classification of question turns in spontaneous speech using lexical and prosodic evidence.
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the workshop on Speech Processing for Safety Critical Translation and Pervasive Applications@COLING 2008, 2008
Proceedings of the ACL 2008, 2008
2007
IEEE Trans. Speech Audio Process., 2007
IEEE Trans. Speech Audio Process., 2007
Interrelation Between Speech and Facial Gestures in Emotional Utterances: A Single Subject Study.
IEEE Trans. Speech Audio Process., 2007
IEEE Trans. Speech Audio Process., 2007
Speech Commun., 2007
Pattern Recognit. Lett., 2007
Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, 2007
Investigating Implicit Cues for User State Estimation in Human-Robot Interaction Using Physiological Measurements.
Proceedings of the IEEE RO-MAN 2007, 2007
Exploiting Acoustic and Syntactic Features for Prosody Labeling in a Maximum Entropy Framework.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007
Experiments in Automatic Genre Classification of Full-length Music Tracks using Audio Activity Rate.
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007
Analyzing the Multimodal Behaviors of Users of a Speech-to-Speech Translation Device by using Concept Matching Scores.
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007
Multimodal Meeting Monitoring: Improvements on Speaker Tracking and Segmentation through a Modified Mixture Particle Filter.
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007
Real-time Emotion Detection System using Speech: Multi-modal Fusion of Different Timescale Features.
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007
Joint Analysis of the Emotional Fingerprint in the Face and Speech: A single subject study.
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007
A System for Technology Based Assessment of Language and Literacy in Young Children: the Role of Multiple Information Sources.
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007
Proceedings of the IEEE International Symposium on Information Theory, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Exploiting prosodic features for dialog act tagging in a discriminative modeling framework.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
A saliency-based auditory attention model with applications to unsupervised prominent syllable detection in speech.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Automatic detection and classification of disfluent reading miscues in young children's speech for the purpose of assessment.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Discriminating Two Types of Noise Sources using Cortical Representation and Dimension Reduction Technique.
Proceedings of the IEEE International Conference on Acoustics, 2007
Information Theoretic Analysis of Direct Articulatory Measurements for Phonetic Discrimination.
Proceedings of the IEEE International Conference on Acoustics, 2007
Optimal Wavelet Packets Decomposition Based on a Rate-Distortion Optimality Criterion.
Proceedings of the IEEE International Conference on Acoustics, 2007
Data Driven Approach for Language Model Adaptation using Stepwise Relative Entropy Minimization.
Proceedings of the IEEE International Conference on Acoustics, 2007
Support Vector Regression for Automatic Recognition of Spontaneous Emotions in Speech.
Proceedings of the IEEE International Conference on Acoustics, 2007
Real-Time Monitoring of Participants' Interaction in a Meeting using Audio-Visual Sensors.
Proceedings of the IEEE International Conference on Acoustics, 2007
A Statistical Approach for Modeling Prosody Features using POS Tags for Emotional Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2007
Improved Speech Recognition using Acoustic and Lexical Correlates of Pitch Accent in a N-Best Rescoring Framework.
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the 15th European Signal Processing Conference, 2007
Robust speaker clustering strategies to data source variation for improved speaker diarization.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
2006
Expressive Facial Animation Synthesis by Learning Speech Coarticulation and Expression Spaces.
IEEE Trans. Vis. Comput. Graph., 2006
Average divergence distance as a statistical discrimination measure for hidden Markov models.
IEEE Trans. Speech Audio Process., 2006
Speech Commun., 2006
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006
Selecting relevant text subsets from web-data for building topic specific language models.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006
An attribute-based approach to audio description applied to segmenting vocal sections in popular music songs.
Proceedings of the IEEE 8th Workshop on Multimedia Signal Processing, 2006
Proceedings of the 1st ACM international workshop on Human-centered multimedia, 2006
Using model trees for evaluating dialog error conditions based on acoustic information.
Proceedings of the 1st ACM international workshop on Human-centered multimedia, 2006
Upper Bound Kullback-Leibler Divergence for Hidden Markov Models with Application as Discrimination Measure for Speech Recognition.
Proceedings of the 2006 IEEE International Symposium on Information Theory, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
A study of emotional speech articulation using a fast magnetic resonance imaging technique.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Automatic detection of voice onset time contrasts for use in pronunciation assessment.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Combining acoustic, lexical, and syntactic evidence for automatic unsupervised prosody labeling.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Speech Recognition Engineering Issues in Speech to Speech Translation System Design for Low Resource Languages and Domains.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Analyzing Children's Speech: An Acoustic Study of Consonants and Consonant-Vowel Transition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Proceedings of the 14th European Signal Processing Conference, 2006
Proceedings of the 14th European Signal Processing Conference, 2006
Proceedings of the EMNLP 2006, 2006
Proceedings of the 28th International Conference of the IEEE Engineering in Medicine and Biology Society, 2006
Efficient Rotation Invariant Retrieval of Shapes with Applications in Medical Databases.
Proceedings of the 19th IEEE International Symposium on Computer-Based Medical Systems (CBMS 2006), 2006
Proceedings of the Aurally Informed Performance: Integrating Machine Listening and Auditory Presentation in Robotic Systems, 2006
Proceedings of the Aurally Informed Performance: Integrating Machine Listening and Auditory Presentation in Robotic Systems, 2006
2005
IEEE Trans. Speech Audio Process., 2005
Multichannel audio synthesis by subband-based spectral conversion and parameter adaptation.
IEEE Trans. Speech Audio Process., 2005
IEEE Trans. Speech Audio Process., 2005
Creating data resources for designing user-centric frontends for query-by-humming systems.
Multim. Syst., 2005
Comput. Animat. Virtual Worlds, 2005
Proceedings of the 6th SIGdial Workshop on Discourse and Dialogue, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Detecting Politeness and frustration state of a child in a conversational computer game.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Investigating the role of phoneme-level modifications in emotional speech resynthesis.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Automatic Syllable Stress Detection Using Prosodic Features for Pronunciation Evaluation of Language Learners.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
An Automatic Prosody Recognizer using a Coupled Multi-Stream Acoustic Model and a Syntactic-Prosodic Language Model.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Transonics: A Practical Speech-to-Speech Translator for English-Farsi Medical Dialogs.
Proceedings of the ACL 2005, 2005
2004
IEEE Trans. Circuits Syst. Video Technol., 2004
IEEE Trans. Speech Audio Process., 2004
Pattern Recognit. Lett., 2004
Proceedings of the 2004 ACM SIGMM Workshop on Effective Telepresence, 2004
A statistical approach to retrieval under user-dependent uncertainty in query-by-humming systems.
Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2004
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004
Proceedings of the Intelligent Tutoring Systems, 7th International Conference, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Reference marking in children's computer-directed speech: an integrated analysis of discourse and gestures.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Analysis of emotion recognition using facial expressions, speech and multimodal information.
Proceedings of the 6th International Conference on Multimodal Interfaces, 2004
A multi-pass linear fold algorithm for sentence boundary detection using prosodic cues.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Enhanced standard compliant distributed speech recognition (Aurora encoder) using rate allocation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
The Transonics Spoken Dialogue Translator: An Aid for English-Persian Doctor-Patient Interviews.
Proceedings of the Dialogue Systems for Health Communication, 2004
2003
EURASIP J. Adv. Signal Process., 2003
Proceedings of the IEEE International Conference on Systems, 2003
Creating data resources for designing user-centric frontends for query by humming systems.
Proceedings of the 5th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Towards optimal encoding for classification with applications to distributed speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
A statistical multidimensional humming transcription using phone level hidden Markov models for query by humming systems.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Multidimensional humming transcription using a statistical approach for query by humming systems.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Split-lexicon based hierarchical recognition of speech using syllable and word level acoustic units.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
IEEE Trans. Speech Audio Process., 2002
Proceedings of the Storage and Retrieval for Media Databases 2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002
Proceedings of the IEEE International Conference on Acoustics, 2002
Proceedings of the IEEE International Conference on Acoustics, 2002
Proceedings of the 11th European Signal Processing Conference, 2002
2001
Proceedings of the First International Conference on Human Language Technology Research, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001
Proceedings of the IEEE International Conference on Acoustics, 2001
Proceedings of the CHI 2001 Extended Abstracts on Human Factors in Computing Systems, 2001
2000
IEEE Trans. Speech Audio Process., 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Effects of dialog initiative and multi-modal presentation strategies on large directory information access.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Web-based monitoring, logging and reporting tools for multi-service multi-modal systems.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Unifying Conversational Multimedia Interfaces for Accessing Network Services Across Communication Devices.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000
1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
Probing the relationship between qualitative and quantitative performance measures for voice-enabled telecommunication services.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for Email.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998
1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
From MRI and acoustic data to articulatory synthesis: a case study of the lateral approximants in American English.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of the 1994 International Conference on Image Processing, 1994
1993
Strange attractors and chaotic dynamics in the production of voiced and voiceless fricatives.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993