Mathew Magimai-Doss
Orcid: 0000-0002-8714-1409
According to our database1,
Mathew Magimai-Doss
authored at least 157 papers
between 2001 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
CoRR, 2024
SSL-TTS: Leveraging Self-Supervised Embeddings and kNN Retrieval for Zero-Shot Multi-speaker TTS.
CoRR, 2024
Towards interfacing large language models with ASR systems using confidence measures and prompting.
CoRR, 2024
CoRR, 2024
Predicting Heart Activity from Speech using Data-driven and Knowledge-based features.
CoRR, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track.
CoRR, 2022
Comparing Biosignal and Acoustic feature Representation for Continuous Emotion Recognition.
Proceedings of the MuSe@MM 2022: Proceedings of the 3rd International on Multimodal Sentiment Analysis Workshop and Challenge, 2022
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the International Conference on Multimodal Interaction, 2022
Proceedings of the International Conference on Multimodal Interaction, 2022
Modeling of Pre-Trained Neural Network Embeddings Learned From Raw Waveform for COVID-19 Infection Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
On Joint Optimization of Automatic Speaker Verification and Anti-Spoofing in the Embedding Space.
IEEE Trans. Inf. Forensics Secur., 2021
Utterance Verification-Based Dysarthric Speech Intelligibility Assessment Using Phonetic Posterior Features.
IEEE Signal Process. Lett., 2021
Signal-to-signal neural networks for improved spike estimation from calcium imaging data.
PLoS Comput. Biol., 2021
Deep learning architectures for estimating breathing signal and respiratory parameters from speech recordings.
Neural Networks, 2021
Fusion of Acoustic and Linguistic Information using Supervised Autoencoder for Improved Emotion Recognition.
Proceedings of the MuSe '21: Proceedings of the 2nd on Multimodal Sentiment Analysis Challenge, 2021
Late Fusion of the Available Lexicon and Raw Waveform-Based Acoustic Modeling for Depression and Dementia Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
On Modeling Glottal Source Information for Phonation Assessment in Parkinson's Disease.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Handling Acoustic Variation in Dysarthric Speech Recognition Systems Through Model Combination.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021
On The Relationship Between Speech-Based Breathing Signal Prediction Evaluation Measures and Breathing Parameters Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the 29th European Signal Processing Conference, 2021
Proceedings of the 14th ITG Conference on Speech Communication, online, September 29, 2021
2020
An HMM Approach with Inherent Model Selection for Sign Language and Gesture Recognition.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the Companion Publication of the 2020 International Conference on Multimodal Interaction, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Detection Of S1 And S2 Locations In Phonocardiogram Signals Using Zero Frequency Filter.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Estimating the Degree of Sleepiness by Integrating Articulatory Feature Knowledge in Raw Waveform Based CNNS.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
End-to-end acoustic modeling using convolutional neural networks for HMM-based automatic speech recognition.
Speech Commun., 2019
Subunits Inference and Lexicon Development Based on Pairwise Comparison of Utterances and Signs.
Inf., 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Using Speech Production Knowledge for Raw Waveform Modelling Based Styrian Dialect Identification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
HMM-based Approaches to Model Multichannel Information in Sign Language Inspired from Articulatory Features-based Speech Processing.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Segment-level Training of ANNs Based on Acoustic Confidence Measures for Hybrid HMM/ANN Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019
Improving Children Speech Recognition through Feature Learning from Raw Speech Signal.
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
Towards weakly supervised acoustic subword unit discovery and lexicon development using hidden Markov models.
Speech Commun., 2018
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Denoising and Raw-waveform Networks for Weakly-Supervised Gender Identification on Noisy Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
On Learning Vocal Tract System Related Speaker Discriminative Information from Raw Signal Using CNNs.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
IEEE Signal Process. Lett., 2017
Proceedings of the 2017 IEEE International Joint Conference on Biometrics, 2017
2016
Acoustic data-driven grapheme-to-phoneme conversion in the probabilistic lexical modeling framework.
Speech Commun., 2016
Articulatory feature based continuous speech recognition using probabilistic lexical modeling.
Comput. Speech Lang., 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Presentation Attack Detection Using Long-Term Spectral Statistics for Trustworthy Speaker Verification.
Proceedings of the 2016 International Conference of the Biometrics Special Interest Group, 2016
2015
Acoustic and lexical resource constrained ASR using language-independent acoustic model and language-dependent probabilistic lexical model.
Speech Commun., 2015
Learning linearly separable features for speech recognition using convolutional neural networks.
Proceedings of the 3rd International Conference on Learning Representations, 2015
Objective intelligibility assessment of text-to-speech systems through utterance verification.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Automatic accentedness evaluation of non-native speech using phonetic and sub-phonetic posterior probabilities.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Objective speech intelligibility assessment through comparison of phoneme class conditional probability sequences.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
An HMM-based formalism for automatic subword unit derivation and pronunciation generation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Integrated pronunciation learning for automatic speech recognition using probabilistic lexical modeling.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Convolutional Neural Networks-based continuous speech recognition using raw speech signal.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
Feature mapping of multiple beamformed sources for robust overlapping speech recognition using a microphone array.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
On modeling context-dependent clustered states: Comparing HMM/GMM, hybrid HMM/ANN and KL-HMM approaches.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014
2013
Applying Multi- and Cross-Lingual Stochastic Phone Space Transformations to Non-Native Speech Recognition.
IEEE Trans. Speech Audio Process., 2013
IEEE Signal Process. Lett., 2013
CoRR, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Grapheme and multilingual posterior features for under-resourced speech recognition: A study on Scottish Gaelic.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
2012
IEEE Trans. Inf. Forensics Secur., 2012
Speech Commun., 2012
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Template-based ASR using posterior features and synthetic references: comparing different TTS systems.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Combining Acoustic Data Driven G2P and Letter-to-Sound Rules for Under Resource Lexicon Generation.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Joint detection and localization of multiple speakers using a probabilistic interpretation of the steered response power.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 20th European Signal Processing Conference, 2012
2011
Transcribing Mandarin Broadcast Speech Using Multi-Layer Perceptron Acoustic Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2011
IEEE Trans. Speech Audio Process., 2011
IEEE ACM Trans. Audio Speech Lang. Process., 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Improving Non-Native ASR Through Stochastic Multilingual Phoneme Space Transformations.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 2011 IEEE International Joint Conference on Biometrics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Integrating articulatory features using Kullback-Leibler divergence based acoustic model for phoneme recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011
Language dependent universal phoneme posterior estimation for mixed language speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2011, 2011
Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Evaluating the robustness of privacy-sensitive audio features for speech detection in personal audio log scenarios.
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Investigating privacy-sensitive features for speech detection in multiparty conversations.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 11th International Conference on Multimodal Interfaces, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Non-linear mapping for multi-channel speech separation and robust overlapping spech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
2008
Proceedings of the Machine Learning for Multimodal Interaction, 5th International Workshop, 2008
Neural network based regression for robust overlapping speech recognition using microphone arrays.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the 2008 16th European Signal Processing Conference, 2008
Proceedings of the 2008 16th European Signal Processing Conference, 2008
2007
Proceedings of the Machine Learning for Multimodal Interaction , 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer workshop.
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
A Generalized Dynamic Composition Algorithm of Weighted Finite State Transducers for Large Vocabulary Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the Multimodal Technologies for Perception of Humans, 2007
Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPS.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
2006
Proceedings of the Machine Learning for Multimodal Interaction, 2006
Threshold Selection for Unsupervised Detection, With an Application to Microphone Arrays.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
A sector-based, frequency-domain approach to detection and localization of multiple speakers.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
2004
IEEE Trans. Speech Audio Process., 2004
Proceedings of the Machine Learning for Multimodal Interaction, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
Dynamic Bayesian network based speech recognition with pitch and energy as auxiliary variables.
Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing, 2002
Auxiliary variables in conditional Gaussian mixtures for automatic speech recognition.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 16th International Conference on Pattern Recognition, 2002
2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001