Athanasios Mouchtaris

ORCID: 0000-0001-7583-0189

According to our database, Athanasios Mouchtaris authored at least 124 papers between 1998 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning.
CoRR, 2024

Max-Margin Transducer Loss: Improving Sequence-Discriminative Training Using a Large-Margin Learning Strategy.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Conmer: Streaming Conformer Without Self-attention for Interactive Voice Assistants.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers.
Proceedings of the International Conference on Machine Learning, 2023

Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Multilingual End-To-End Spoken Language Understanding For Ultra-Low Footprint Applications.
Proceedings of the IEEE International Conference on Acoustics, 2023

Robust Acoustic And Semantic Contextual Biasing In Neural Transducers For Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Gated Contextual Adapters For Selective Contextual Biasing In Neural Transducers.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
A neural prosody encoder for end-to-end dialogue act classification.
CoRR, 2022

Sub-8-Bit Quantization for On-Device Speech Recognition: A Regularization-Free Approach.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Accelerator-Aware Training for Transducer-Based Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Sub-8-Bit Quantization Aware Training for 8-Bit Neural Network Accelerator with On-Device Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Compute Cost Amortized Transformer for Streaming ASR.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

ConvRNN-T: Convolutional Augmented Recurrent Neural Network Transducers for Streaming Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Knowledge Distillation via Module Replacing for Automatic Speech Recognition with Recurrent Neural Network Transducer.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Neural Prosody Encoder for End-to-End Dialogue Act Classification.
Proceedings of the IEEE International Conference on Acoustics, 2022

Contextual Adapters for Personalized Speech Recognition in Neural Transducers.
Proceedings of the IEEE International Conference on Acoustics, 2022

TINYS2I: A Small-Footprint Utterance Classification Model with Contextual Support for On-Device SLU.
Proceedings of the IEEE International Conference on Acoustics, 2022

Caching Networks: Capitalizing on Common Speech for ASR.
Proceedings of the IEEE International Conference on Acoustics, 2022

Tie Your Embeddings Down: Cross-Modal Latent Spaces for End-to-end Spoken Language Understanding.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Exploiting Large-Scale Teacher-Student Training for On-Device Acoustic Models.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

CoDERT: Distilling Encoder Representations with Co-Learning for Transducer-Based Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Evaluating the Vulnerability of End-to-End Automatic Speech Recognition Models to Membership Inference Attacks.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

End-to-End Spoken Language Understanding for Generalized Voice Assistants.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

FANS: Fusing ASR and NLU for On-Device SLU.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Phonetically Induced Subwords for End-to-End Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Multi-Channel Transformer Transducer for Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Sparsification via Compressed Sensing for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Joint ASR and Language Identification Using RNN-T: An Efficient Approach to Dynamic Language Switching.
Proceedings of the IEEE International Conference on Acoustics, 2021

End-to-End Multi-Channel Transformer for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

In Pursuit of Babel - Multilingual End-to-End Spoken Language Understanding.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Context-Aware Transformer Transducer for Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Tie Your Embeddings Down: Cross-Modal Latent Spaces for End-to-end Spoken Language Understanding.
CoRR, 2020

Streaming End-to-End Bilingual ASR Systems with Joint Language Identification.
CoRR, 2020

End-to-End Neural Transformer Based Spoken Language Understanding.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Quantization Aware Training with Absolute-Cosine Regularization for Automatic Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Semantic Complexity in End-to-End Spoken Language Understanding.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Multilingual Grapheme-To-Phoneme Conversion with Byte Representation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2018
Multiple Sound Source Location Estimation in Wireless Acoustic Sensor Networks Using DOA Estimates: The Data-Association Problem.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Convolutional Neural Networks for Video Quality Assessment.
CoRR, 2018

Multiple Source Location Estimation on a Dataset of Real Recordings in a Wireless Acoustic Sensor Network.
Proceedings of the 20th IEEE International Workshop on Multimedia Signal Processing, 2018

Normalization of Partly Overlapping Audio Recordings from the Same Event Based on Relative Signal Powers.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Acoustic Beamforming in Front of a Reflective Plane.
Proceedings of the 26th European Signal Processing Conference, 2018

2017
Wireless Acoustic Sensor Networks and Applications.
Wirel. Commun. Mob. Comput., 2017

A Survey of Sound Source Localization Methods in Wireless Acoustic Sensor Networks.
Wirel. Commun. Mob. Comput., 2017

Corrections to "Perpendicular Cross-Spectra Fusion for Sound Source Localization With a Planar Microphone Array".
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Perpendicular Cross-Spectra Fusion for Sound Source Localization With a Planar Microphone Array.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Maximum component elimination in mixing of user generated audio recordings.
Proceedings of the 19th IEEE International Workshop on Multimedia Signal Processing, 2017

Automatic matching and synchronization of user generated videos from a large scale sport event.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

DOA estimation with histogram analysis of spatially constrained active intensity vectors.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Towards wireless acoustic sensor networks for location estimation and counting of multiple speakers in real-life conditions.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A subjective evaluation on mixtures of crowdsourced audio recordings.
Proceedings of the 25th European Signal Processing Conference, 2017

2016
3D DOA estimation of multiple sound sources based on spatially constrained beamforming driven by intensity vectors.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Capturing and reproduction of a crowded sound scene using a circular microphone array.
Proceedings of the 24th European Signal Processing Conference, 2016

Direction of arrival estimation in front of a reflective plane using a circular microphone array.
Proceedings of the 24th European Signal Processing Conference, 2016

3D localization of multiple audio sources utilizing 2D DOA histograms.
Proceedings of the 24th European Signal Processing Conference, 2016

Development and evaluation of a digital MEMS microphone array for spatial audio.
Proceedings of the 24th European Signal Processing Conference, 2016

Improving narrowband DOA estimation of sound sources using the complex Watson distribution.
Proceedings of the 24th European Signal Processing Conference, 2016

2015
Speech Analysis and Synthesis with a Computationally Efficient Adaptive Harmonic Model.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Localizing multiple audio sources in a wireless acoustic sensor network.
Signal Process., 2015

Multiple sound source location estimation and counting in a wireless acoustic sensor network.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

The MusiNet project: Addressing the challenges in Networked Music Performance systems.
Proceedings of the 6th International Conference on Information, 2015

Foreground suppression for capturing and reproduction of crowded acoustic environments.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A multi-sensor approach for real-time detection and classification of impact sounds.
Proceedings of the 23rd European Signal Processing Conference, 2015

3D localization of multiple sound sources with intensity vector estimates in single source zones.
Proceedings of the 23rd European Signal Processing Conference, 2015

Addressing the data-association problem for multiple sound source localization using DOA estimates.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
Reconstruction of missing features based on a low-rank assumption for robust speaker identification.
Proceedings of the 5th International Conference on Information, 2014

The MusiNet project: Towards unraveling the full potential of Networked Music Performance systems.
Proceedings of the 5th International Conference on Information, 2014

Instantaneous Detection and Classification of Impact Sound: Turning Simple Objects into Powerful Musical Control Interfaces.
Proceedings of the Music Technology meets Philosophy, 2014

A computationally efficient refinement of the fundamental frequency estimate for the Adaptive Harmonic Model.
Proceedings of the IEEE International Conference on Acoustics, 2014

Joint low-rank representation and matrix completion under a singular value thresholding framework.
Proceedings of the 22nd European Signal Processing Conference, 2014

Analysis of emotional speech using an adaptive sinusoidal model.
Proceedings of the 22nd European Signal Processing Conference, 2014

Real-time localization of multiple audio sources in a wireless acoustic sensor network.
Proceedings of the 22nd European Signal Processing Conference, 2014

Breaking down the cocktail party: Capturing and isolating sources in a soundscape.
Proceedings of the 22nd European Signal Processing Conference, 2014

2013
Real-Time Multiple Sound Source Localization and Counting Using a Circular Microphone Array.
IEEE Trans. Speech Audio Process., 2013

Capturing and Reproducing Spatial Audio Based on a Circular Microphone Array.
J. Electr. Comput. Eng., 2013

Localizing multiple audio sources from DOA estimates in a wireless acoustic sensor network.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Evaluating how well filtered white noise models the residual from sinusoidal modeling of musical instrument sounds.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Compressive sensing in footstep sounds, hand tremors and speech using K-SVD dictionaries.
Proceedings of the 18th International Conference on Digital Signal Processing, 2013

Directional coding of audio using a circular microphone array.
Proceedings of the IEEE International Conference on Acoustics, 2013

Sparsity based robust speaker identification using a discriminative dictionary learning approach.
Proceedings of the 21st European Signal Processing Conference, 2013

Adaptive sinusoidal modeling of percussive musical instrument sounds.
Proceedings of the 21st European Signal Processing Conference, 2013

2012
Post-Nonlinear Sparse Component Analysis Using Single-Source Zones and Functional Data Clustering.
CoRR, 2012

Nonlinear blind mixture identification using local source sparsity and functional data clustering.
Proceedings of the IEEE 7th Sensor Array and Multichannel Signal Processing Workshop, 2012

Source counting in real-time sound source localization using a circular microphone array.
Proceedings of the IEEE 7th Sensor Array and Multichannel Signal Processing Workshop, 2012

Real-time multiple sound source localization using a circular microphone array based on single-source confidence measures.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Real-time multiple speaker DOA estimation in a circular microphone array based on Matching Pursuit.
Proceedings of the 20th European Signal Processing Conference, 2012

The Role of Time in Music Emotion Recognition: Modeling Musical Emotions from Time-Varying Music Features.
Proceedings of the From Sounds to Music and Emotions - 9th International Symposium, 2012

2011
Single-Channel and Multi-Channel Sinusoidal Audio Coding Using Compressed Sensing.
IEEE Trans. Speech Audio Process., 2011

Tempo Estimation Based on Linear Prediction and Perceptual Modelling.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Perceptually-Driven Scalable MDCT Enhancement of Compressed Audio Based on Statistical Conversion.
Proceedings of the 2011 IEEE International Symposium on Multimedia, 2011

Post-nonlinear speech mixture identification using single-source temporal zones & curve clustering.
Proceedings of the 19th European Signal Processing Conference, 2011

2010
Sinusoidal spatial audio coding for low-bitrate binaural reproduction.
Proceedings of the IEEE International Conference on Acoustics, 2010

Top-down strategies in parameter selection of sinusoidal modeling of audio.
Proceedings of the IEEE International Conference on Acoustics, 2010

Robust text-independent speaker identification using short test and training sessions.
Proceedings of the 18th European Signal Processing Conference, 2010

Speaker identification using sparsely excited speech signals and compressed sensing.
Proceedings of the 18th European Signal Processing Conference, 2010

Multichannel audio coding using sinusoidal modelling and compressed sensing.
Proceedings of the 18th European Signal Processing Conference, 2010

2009
A Multichannel Sinusoidal Model Applied to Spot Microphone Signals for Immersive Audio.
IEEE Trans. Speech Audio Process., 2009

The ASPIRE Project - Sensor Networks for Immersive Multimedia Environments.
ERCIM News, 2009

Encoding the sinusoidal model of an audio signal using compressed sensing.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Bandwidth extension of low bitrate compressed audio based on statistical conversion.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

2008
Low Bitrate Coding of Spot Audio Signals for Interactive and Immersive Audio Applications.
Proceedings of the New Directions in Intelligent Interactive Multimedia, 2008

Multiresolution Source/Filter Model for Low Bitrate Coding of Spot Microphone Signals.
EURASIP J. Audio Speech Music. Process., 2008

Quality Enhancement of Compressed Audio Based on Statistical Conversion.
EURASIP J. Audio Speech Music. Process., 2008

Modeling and coding of spot microphone signals for immersive audio based on the sinusoidal model.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

Synthesis of enhanced audio from low bitrate compressed audio based on unit selection and statistical conversion methods.
Proceedings of the 42nd Asilomar Conference on Signals, Systems and Computers, 2008

2007
A Spectral Conversion Approach to Single-Channel Speech Enhancement.
IEEE Trans. Speech Audio Process., 2007

Enhanced Multichannel Audio Resynthesis Through Residual Processing and Features Alignment.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Conditional Vector Quantization for Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2007

Sinusoidal modeling of spot microphone signals based on noise transplantation for multichannel audio coding.
Proceedings of the 15th European Signal Processing Conference, 2007

2006
Nonparallel training for voice conversion based on a parameter adaptation approach.
IEEE Trans. Speech Audio Process., 2006

Musical Genre Classification VIA Generalized Gaussian and Alpha-Stable Modeling.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Multiband source/filter representation of multichannel audio for reduction of inter-channel redundancy.
Proceedings of the 14th European Signal Processing Conference, 2006

2005
Multichannel audio synthesis by subband-based spectral conversion and parameter adaptation.
IEEE Trans. Speech Audio Process., 2005

A spectral conversion approach to feature denoising and speech enhancement.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004
A spectral conversion approach to the iterative Wiener filter for speech enhancement.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Non-parallel training for voice conversion by maximum likelihood constrained adaptation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Virtual Microphones for Multichannel Audio Resynthesis.
EURASIP J. Adv. Signal Process., 2003

2002
Multiresolution spectral conversion for multichannel audio resynthesis.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Efficient multichannel audio resynthesis by subband-based spectral conversion.
Proceedings of the 11th European Signal Processing Conference, 2002

2000
Inverse Filter Design for Immersive Audio Rendering Over Loudspeakers.
IEEE Trans. Multim., 2000

Virtual Microphones for Multichannel Audio Applications.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

1999
Non-minimum phase inverse filter methods for immersive audio rendering.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
Head-related transfer function synthesis for immersive audio.
Proceedings of the Second IEEE Workshop on Multimedia Signal Processing, 1998

