Zakaria Aldeneh

Orcid: 0000-0003-4599-2448

According to our database1, Zakaria Aldeneh authored at least 27 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Learning Spatially-Aware Language and Audio Embedding.
CoRR, 2024

Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels.
CoRR, 2024

Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models.
CoRR, 2024

Towards Automatic Assessment of Self-Supervised Speech Models using Rank.
CoRR, 2024

dMel: Speech Tokenization made Simple.
CoRR, 2024

Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
CoRR, 2024

ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models.
CoRR, 2024

2023
You're Not You When You're Angry: Robust Emotion Features Emerge by Recognizing Speakers.
IEEE Trans. Affect. Comput., 2023

Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Naturalistic Head Motion Generation from Speech.
Proceedings of the IEEE International Conference on Acoustics, 2023

On the Role of LIP Articulation in Visual Speech Perception.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Towards a Perceptual Model for Estimating the Quality of Visual Speech.
CoRR, 2022

2021
Learning Paralinguistic Features from Audiobooks through Style Voice Conversion.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

On The Role of Visual Cues in Audiovisual Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Robust Methods for the Automatic Quantification and Prediction of Affect in Spoken Interactions.
PhD thesis, 2020

Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement.
CoRR, 2020

Aphasic Speech Recognition Using a Mixture of Speech Intelligibility Experts.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019
Identifying Mood Episodes Using Dialogue Features from Clinical Interviews.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Controlling for Confounders in Multimodal Emotion Classification via Adversarial Learning.
Proceedings of the International Conference on Multimodal Interaction, 2019

Muse-ing on the Impact of Utterance Ordering on Crowdsourced Emotion Annotations.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Improving End-of-Turn Detection in Spoken Dialogues by Detecting Speaker Intentions as a Secondary Task.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Discretized Continuous Speech Emotion Recognition with Multi-Task Deep Recurrent Neural Network.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Capturing Long-Term Temporal Dependencies with Convolutional Networks for Continuous Emotion Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Progressive Neural Networks for Transfer Learning in Emotion Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Pooling acoustic and lexical features for the prediction of valence.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

Using regional saliency for speech emotion recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Wild wild emotion: a multimodal ensemble approach.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016


  Loading...