Ahmed Hussen Abdelaziz
ORCID: 0000-0001-8027-4666
According to our database, Ahmed Hussen Abdelaziz authored at least 43 papers between 2012 and 2024.
Bibliography
2024
Device-Directed Speech Detection for Follow-up Conversations Using Large Language Models.
CoRR, 2024
Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels.
CoRR, 2024
CoRR, 2024
Multimodal Large Language Models with Fusion Low Rank Adaptation for Device Directed Speech Detection.
CoRR, 2024
Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness.
CoRR, 2024
Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
CoRR, 2024
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models.
CoRR, 2024
Modality Drop-Out for Multimodal Device Directed Speech Detection Using Verbal and Non-Verbal Features.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Modality Dropout for Multimodal Device Directed Speech Detection using Verbal and Non-Verbal Features.
CoRR, 2023
Less Is More: A Unified Architecture for Device-Directed Speech Detection with Multiple Invocation Types.
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the 32nd British Machine Vision Conference 2021, 2021
2020
Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement.
CoRR, 2020
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020
2019
Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models.
Proceedings of the International Conference on Multimodal Interaction, 2019
2018
IEEE ACM Trans. Audio Speech Lang. Process., 2018
2017
NTCD-TIMIT: A New Database and Baseline for Noise-Robust Audio-Visual Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
2016
Noise-robust HMM-based pattern recognition using multimodal features and observation uncertainties.
PhD thesis, 2016
General hybrid framework for uncertainty-decoding-based automatic speech recognition systems.
Speech Commun., 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 12th ITG Symposium on Speech Communication, 2016
2015
Learning Dynamic Stream Weights For Coupled-HMM-Based Audio-Visual Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Robust speech processing using observation uncertainty and uncertainty propagation: session and paper overview.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
2014
The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
Dynamic stream weight estimation in coupled-HMM-based audio-visual speech recognition using multilayer perceptrons.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, 2014
2013
Using twin-HMM-based audio-visual speech enhancement as a front-end for robust audio-visual speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the Innovative and Creative Developments in Multimodal Interaction Systems, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Decoding of Uncertain Features Using the Posterior Distribution of the Clean Data for Robust Speech Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 10th ITG Conference on Speech Communication, 2012