Alexandros Potamianos

Orcid: 0009-0007-1532-5288

According to our database1, Alexandros Potamianos authored at least 189 papers between 1993 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Awards

IEEE Fellow

IEEE Fellow 2016, "For contributions to human-centered speech and multimodal signal analysis".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems: A Case Study for Modern Greek.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

$\mathcal {P}$owMix: A Versatile Regularizer for Multimodal Sentiment Analysis.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Aggregation Artifacts in Subjective Tasks Collapse Large Language Models' Posteriors.
CoRR, 2024

BloomWise: Enhancing Problem-Solving capabilities of Large Language Models using Bloom's-Taxonomy-Inspired Prompts.
CoRR, 2024

LC-Protonets: Multi-label Few-shot learning for world music audio tagging.
CoRR, 2024

Y-Drop: A Conductance based Dropout for fully connected layers.
CoRR, 2024

Enhancing Fast Feed Forward Networks with Load Balancing and a Master Leaf Node.
CoRR, 2024

The Strong Pull of Prior Knowledge in Large Language Models and Its Impact on Emotion Recognition.
CoRR, 2024

2023
PowMix: A Versatile Regularizer for Multimodal Sentiment Analysis.
CoRR, 2023

SeqAug: Sequential Feature Resampling as a modality agnostic augmentation method.
CoRR, 2023

Efficient Audio Captioning Transformer with Patchout and Text Guidance.
CoRR, 2023

Depression detection in social media posts using affective and social norm features.
CoRR, 2023

From West to East: Who Can Understand the Music of the Others Better?
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

A Zero-Shot Approach for Multi-User Task-Oriented Dialog Generation.
Proceedings of the 16th International Natural Language Generation Conference, 2023

Adapted Multimodal Bert with Layer-Wise Fusion for Sentiment Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2023

Self-Attention Based Generative Adversarial Networks For Unsupervised Video Summarization.
Proceedings of the 31st European Signal Processing Conference, 2023

Multi-User MultiWOZ: Task-Oriented Dialogues among Multiple Users.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
Alternating Objectives Generates Stronger PGD-Based Adversarial Attacks.
CoRR, 2022

Regotron: Regularizing the Tacotron2 Architecture Via Monotonic Alignment Loss.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

A Dataset for Greek Traditional and Folk Music: Lyra.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Extending Compositional Attention Networks for Social Reasoning in Videos.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Multi-Task BERT Model for Schema-Guided Dialogue State Tracking.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Mmlatch: Bottom-Up Top-Down Fusion For Multimodal Sentiment Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
EmpBot: A T5-based Empathetic Chatbot focusing on Sentiments.
CoRR, 2021

End-to-end Generative Zero-shot Learning via Few-shot Learning.
CoRR, 2021

UDALM: Unsupervised Domain Adaptation through Language Modeling.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

M<sup>3</sup>: MultiModal Masking Applied to Sentiment Analysis.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
Affective Conditioning on Hierarchical Networks applied to Depression Detection from Transcribed Clinical Interviews.
CoRR, 2020

Affective Conditioning on Hierarchical Attention Networks Applied to Depression Detection from Transcribed Clinical Interviews.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019
SEQ<sup>3</sup>: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for Unsupervised Abstractive Sentence Compression.
CoRR, 2019

An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Cross-Topic Distributional Semantic Representations Via Unsupervised Mappings.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

SEQˆ3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for Unsupervised Abstractive Sentence Compression.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Unsupervised Low-Rank Representations for Speech Emotion Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Deep Hierarchical Fusion with Application in Sentiment Analysis.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Data Augmentation Using GANs for Speech Emotion Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Using Oliver API for emotion-aware movie content characterization.
Proceedings of the 2019 International Conference on Content-Based Multimedia Indexing, 2019

Attention-based Conditioning Methods for External Knowledge Integration.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Speech understanding for spoken dialogue systems: From corpus harvesting to grammar rule induction.
Comput. Speech Lang., 2018

Pattern Search Multidimensional Scaling.
CoRR, 2018

Hierarchical bi-directional attention-based RNNs for supporting document classification on protein-protein interactions affected by genetic mutations.
Database J. Biol. Databases Curation, 2018

NTUA-SLP at IEST 2018: Ensemble of Neural Transfer Methods for Implicit Emotion Classification.
Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, 2018

NTUA-SLP at SemEval-2018 Task 3: Tracking Ironic Tweets using Ensembles of Word and Character Level Attentive RNNs.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

NTUA-SLP at SemEval-2018 Task 2: Predicting Emojis using RNNs with Context-aware Attention.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

NTUA-SLP at SemEval-2018 Task 1: Predicting Affective Content in Tweets with Deep Attentive RNNs and Transfer Learning.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

Mixture of Topic-Based Distributional Semantic and Affective Models.
Proceedings of the 12th IEEE International Conference on Semantic Computing, 2018

Integrating Recurrence Dynamics for Speech Emotion Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Neural Activation Semantic Models: Computational lexical semantic models of localized neural activations.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017
COGNIMUSE: a multimodal video database annotated with saliency, events, semantics and emotion with application to summarization.
EURASIP J. Image Video Process., 2017

Lexical and affective models in early acquisition of semantics.
Proceedings of the 6th International Workshop on Child Computer Interaction, 2017

Tweester at SemEval-2017 Task 4: Fusion of Semantic-Affective and pairwise classification models for sentiment analysis in Twitter.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

Engagement detection for children with Autism Spectrum Disorder.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Structural Attention Neural Networks for improved sentiment analysis.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Segment-based speech emotion recognition using recurrent neural networks.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

2016
Audio-based Distributional Semantic Models for Music Auto-tagging and Similarity Measurement.
CoRR, 2016

A semantic-affective compositional approach for the affective labelling of adjective-noun and noun-noun pairs.
Proceedings of the 7th Workshop on Computational Approaches to Subjectivity, 2016

Tweester at SemEval-2016 Task 4: Sentiment Analysis in Twitter Using Semantic-Affective Model Adaptation.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Affective Lexicon Creation for the Greek Language.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

The SpeDial datasets: datasets for Spoken Dialogue Systems analytics.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Crossmodal Network-Based Distributional Semantic Models.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Cognitively Motivated Distributional Representations of Meaning.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Audio-Based Distributional Representations of Meaning Using a Fusion of Feature Encodings.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Root Cause Analysis of Miscommunication Hotspots in Spoken Dialogue Systems.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Speech Emotion Recognition Using Affective Saliency.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Similarity computation using semantic networks created from web-harvested data.
Nat. Lang. Eng., 2015

Open Challenges in Modelling, Analysis and Synthesis of Human Behaviour in Human-Human and Human-Machine Interactions.
Cogn. Comput., 2015

Quality evaluation of computational models for movie summarization.
Proceedings of the Seventh International Workshop on Quality of Multimedia Experience, 2015

Feeling is Understanding: From Affective to Semantic Spaces.
Proceedings of the 11th International Conference on Computational Semantics, 2015

Valence, arousal and dominance estimation for English, German, Greek, Portuguese and Spanish lexica using semantic models.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Predicting audio-visual salient events based on visual, audio and text modalities for movie summarization.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Audio salient event detection and summarization using audio and text modalities.
Proceedings of the 23rd European Signal Processing Conference, 2015

Fusion of Compositional Network-based and Lexical Function Distributional Semantic Models.
Proceedings of the 6th Workshop on Cognitive Modeling and Computational Linguistics, 2015

2014
Improving speech recognition for children using acoustic adaptation and pronunciation modeling.
Proceedings of the 4st Workshop on Child, Computer and Interaction, 2014

Using lexical, syntactic and semantic features for non-terminal grammar rule induction in Spoken Dialogue Systems.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

SAIL: Sentiment Analysis using Semantic Similarity and Contrast Features.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

SemEval-2014 Task 2: Grammar Induction for Spoken Dialogue Systems.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

tucSage: Grammar Rule Induction for Spoken Dialogue Systems via Probabilistic Candidate Selection.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

Multimodal Prediction of Affective Dimensions and Depression in Human-Computer Interactions.
Proceedings of the 4th International Workshop on Audio/Visual Emotion Challenge, 2014

Word Semantic Similarity for Morphologically Rich Languages.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Classification of cognitive load from speech using an i-vector framework.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Fusion of knowledge-based and data-driven approaches to grammar induction.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

An investigation of vocal arousal dynamics in child-psychologist interactions using synchrony measures and a conversation-based model.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Cognitive Multimodal Processing: from Signal to Behavior.
Proceedings of the 2014 Workshop on Roadmapping the Future of Multimodal Interaction Research including Business Opportunities and Challenges, 2014

Spoken dialogue grammar induction from crowdsourced data.
Proceedings of the IEEE International Conference on Acoustics, 2014

Affective language model adaptation via corpus selection.
Proceedings of the IEEE International Conference on Acoustics, 2014

Low-Dimensional Manifold Distributional Semantic Models.
Proceedings of the COLING 2014, 2014

2013
Multimodal Saliency and Fusion for Movie Summarization Based on Aural, Visual, and Textual Attention.
IEEE Trans. Multim., 2013

Toward the Automatic Extraction of Policy Networks Using Web Links and Documents.
IEEE Trans. Knowl. Data Eng., 2013

Distributional Semantic Models for Affective Text Analysis.
IEEE Trans. Speech Audio Process., 2013

DeepPurple: Lexical, String and Affective Feature Fusion for Sentence-Level Semantic Similarity Estimation.
Proceedings of the Second Joint Conference on Lexical and Computational Semantics, 2013

SAIL: A hybrid approach to sentiment analysis.
Proceedings of the 7th International Workshop on Semantic Evaluation, 2013

Semantic Similarity Computation for Abstract and Concrete Nouns Using Network-based Distributional Semantic Models.
Proceedings of the 10th International Conference on Computational Semantics, 2013

An affective evaluation tool using brain signals.
Proceedings of the 18th International Conference on Intelligent User Interfaces, 2013

Multi-band long-term signal variability features for robust voice activity detection.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Affective classification of generic audio clips using regression models.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Affective evaluation of multimodal dialogue games for preschoolers using physiological signals.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Web data harvesting for speech understanding grammar induction.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Instantaneous frequency and bandwidth estimation using filterbank arrays.
Proceedings of the IEEE International Conference on Acoustics, 2013

Continuous models of affect from text using n-grams.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Affective evaluation of a mobile multimodal dialogue system using brain signals.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

DeepPurple: Estimating Sentence Semantic Similarity using N-gram Regression Models and Web Snippets.
Proceedings of the 6th International Workshop on Semantic Evaluation, 2012

Up from Limited Dialog Systems!
Proceedings of the Workshop on Future directions and needs in the Spoken Dialog Community: Tools and Data, 2012

SemSim: Resources for Normalized Semantic Similarity Computation Using Lexical Networks.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Associative and Semantic Features Extracted From Web-Harvested Corpora.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

A saliency-based approach to audio event detection and summarization.
Proceedings of the 20th European Signal Processing Conference, 2012

2011
Introduction to the special issue on speech and language processing of children's speech for child-machine interaction applications.
ACM Trans. Speech Lang. Process., 2011

On the Effects of Filterbank Design and Energy Computation on Robust Speech Recognition.
IEEE Trans. Speech Audio Process., 2011

Detecting emotional state of a child in a conversational computer game.
Comput. Speech Lang., 2011

EmotiWord: Affective Lexicon Creation with Application to Interaction and Multimedia Data.
Proceedings of the Computational Intelligence for Multimedia Understanding, 2011

Kernel Models for Affective Lexicon Creation.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A supervised approach to movie emotion tracking.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Unsupervised Semantic Similarity Computation between Terms Using Web Documents.
IEEE Trans. Knowl. Data Eng., 2010

Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech Mixtures.
IEEE Trans. Speech Audio Process., 2010

Spectral Moment Features Augmented by Low Order Cepstral Coefficients for Robust ASR.
IEEE Signal Process. Lett., 2010

BabyExp: Constructing a Huge Multimodal Resource to Acquire Commonsense Knowledge Like Children Do.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

On the effect of fundamental frequency on amplitude and frequency modulation patterns in speech resonances.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
A comparison of the squared energy and Teager-Kaiser operators for short-term energy estimation in additive noise.
IEEE Trans. Signal Process., 2009

Unsupervised Stream-Weights Computation in Classification and Recognition Tasks.
IEEE Trans. Speech Audio Process., 2009

Fantasy, curiosity and challenge as adaptation indicators in multimodal dialogue systems for preschoolers.
Proceedings of the Second Workshop on Child, Computer and Interaction, 2009

A review of ASR technologies for children's speech.
Proceedings of the Second Workshop on Child, Computer and Interaction, 2009

Towards adapting fantasy, curiosity and challenge in multimodal dialogue systems for preschoolers.
Proceedings of the 11th International Conference on Multimodal Interfaces, 2009

Statistical analysis of amplitude modulation in speech signals using an AM-FM model.
Proceedings of the IEEE International Conference on Acoustics, 2009

Video event detection and summarization using audio, visual and text saliency.
Proceedings of the IEEE International Conference on Acoustics, 2009

Multiple time resolution analysis of speech signal using MCE training with application to speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

Short-time instantaneous frequency and bandwidth features for speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Transition features for CRF-based speech recognition and boundary detection.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
A Study in Efficiency and Modality Usage in Multimodal Form Filling Systems.
IEEE Trans. Speech Audio Process., 2008

Linguistic analysis of spontaneous children speech.
Proceedings of the First Workshop on Child, Computer and Interaction, 2008

Region-based vocal tract length normalization for ASR.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Multimodal system evaluation using modality efficiency and synergy metrics.
Proceedings of the 10th International Conference on Multimodal Interfaces, 2008

Movie summarization based on audiovisual saliency detection.
Proceedings of the International Conference on Image Processing, 2008

On the effectiveness of PARAFAC-based estimation for blind speech separation.
Proceedings of the IEEE International Conference on Acoustics, 2008

IDesign Principles for Multimodal Spoken Dialogue Systems.
Proceedings of the Multimodal Processing and Interaction, Audio, Video, Text, 2008

Human-Computer Interfaces to Multimedia Content a Review.
Proceedings of the Multimodal Processing and Interaction, Audio, Video, Text, 2008

Audiovisual Attention Modeling and Salient Event Detection.
Proceedings of the Multimodal Processing and Interaction, Audio, Video, Text, 2008

2007
Information Seeking Spoken Dialogue Systems- Part II: Multimodal Dialogue.
IEEE Trans. Multim., 2007

Information Seeking Spoken Dialogue Systems- Part I: Semantics and Pragmatics.
IEEE Trans. Multim., 2007

Unsupervised Semantic Similarity Computation usingWeb Search Engines.
Proceedings of the 2007 IEEE / WIC / ACM International Conference on Web Intelligence, 2007

Multimodal User Interface for Augmented Assembly.
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007

A review of the acoustic and linguistic properties of children's speech.
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007

A soft-clustering algorithm for automatic induction of semantic classes.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Advanced front-end for robust speech recognition in extremely adverse environments.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

The effect of input mode on inactivity and interaction times of multimodal systems.
Proceedings of the 9th International Conference on Multimodal Interfaces, 2007

Unsupervised Stream Weight Estimation using Anti-Models.
Proceedings of the IEEE International Conference on Acoustics, 2007

Demonstration of assembly work using augmented reality.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

2006
Blending speech and Visual Input in Multimodal Dialogue Systems.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

Unsupervised Combination of Metrics for Semantic Class Induction.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

Stream Weight Computation for Multi-Stream Classifiers.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Blind Speech Separation Using Parafac Analysis and Integer Least Squares.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Adaptive categorical understanding for spoken dialogue systems.
IEEE Trans. Speech Audio Process., 2005

Robust AM-FM Features for Speech Recognition.
IEEE Signal Process. Lett., 2005

Detecting Politeness and frustration state of a child in a conversational computer game.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Auditory Teager energy cepstrum coefficients for robust speech recognition.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004
Auto-induced semantic classes.
Speech Commun., 2004

2003
Robust recognition of children's speech.
IEEE Trans. Speech Audio Process., 2003

2002
An error-protected speech recognition system for wireless communications.
IEEE Trans. Wirel. Commun., 2002

Creating conversational interfaces for children.
IEEE Trans. Speech Audio Process., 2002

DARPA communicator: cross-system results for the 2001 evaluation.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

DARPA communicator evaluation: progress from 2000 to 2001.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Adaptive language models for spoken dialogue systems.
Proceedings of the IEEE International Conference on Acoustics, 2002

Modulation features for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Time-frequency distributions for automatic speech recognition.
IEEE Trans. Speech Audio Process., 2001


Metrics for measuring domain independence of semantic classes.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Hybrid natural language generation for spoken dialogue systems.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Ambiguity representation and resolution in spoken dialogue systems.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Speech recognition for wireless applications.
Proceedings of the IEEE International Conference on Communications, 2001

Soft-feature decoding for speech recognition over wireless channels.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Statistical recursive finite state machine parsing for speech understanding.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Dialogue management in the Bell Labs communicator system.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Cross-domain classification using generalized domain acts.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Speech analysis and synthesis using an AM-FM modulation model.
Speech Commun., 1999

Categorical understanding using statistical ngram models.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Speaker adaptation for audio-visual speech recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Multimodal systems for children: building a prototype.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Language model adaptation for spoken language systems.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Spoken dialog systems for children.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Multi-band speech recognition in noisy environments.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Unsupervised HMM adaptation based on speech-silence discrimination.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Automatic speech recognition for children.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

On using fractal features of speech sounds in automatic speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Analysis of children's speech: duration, pitch and formants.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

On combining frequency warping and spectral shaping in HMM based speech recognition.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1995
Higher order differential energy operators.
IEEE Signal Process. Lett., 1995

A feature-space transformation for telephone based speech recognition.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Speech formant frequency and bandwidth tracking using multiband energy demodulation.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
A system for finding speech formants and modulations via energy separation.
IEEE Trans. Speech Audio Process., 1994

A comparison of the energy operator and the Hilbert transform approach to signal and speech demodulation.
Signal Process., 1994

1993
Finding speech formants and modulations via energy separation: with application to a vocoder.
Proceedings of the IEEE International Conference on Acoustics, 1993


  Loading...