Midia Yousefi

According to our database1, Midia Yousefi authored at least 18 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

2016
2017
2018
2019
2020
2021
2022
2023
2024
0
1
2
3
4
5
6
7
8
4
1
2
3
1
3
1
1
1
1

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Isochrony-Controlled Speech-to-Text Translation: A study on translating from Sino-Tibetan to Indo-European Languages.
CoRR, 2024

Investigating Neural Audio Codecs for Speech Language Model-Based Speech Generation.
CoRR, 2024

TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation.
CoRR, 2024

CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations.
CoRR, 2024

CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Profile-Error-Tolerant Target-Speaker Voice Activity Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Single-channel speech separation using soft-minimum permutation invariant training.
Speech Commun., June, 2023

Speaker Diarization for ASR Output with T-vectors: A Sequence Classification Approach.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2021
Block-Based High Performance CNN Architectures for Frame-Level Overlapping Speech Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Speaker conditioning of acoustic models using affine transformation for multi-speaker speech recognition.
CoRR, 2021

Real-Time Speaker Counting in a Cocktail Party Scenario Using Attention-Guided Convolutional Neural Network.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Audio-based Toxic Language Classification using Self-attentive Convolutional Neural Network.
Proceedings of the 29th European Signal Processing Conference, 2021

Speaker Conditioning of Acoustic Models Using Affine Transformation for Multi-Speaker Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Frame-Based Overlapping Speech Detection Using Convolutional Neural Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Probabilistic Permutation Invariant Training for Speech Separation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
Assessing Speaker Engagement in 2-Person Debates: Overlap Detection in United States Presidential Debates.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2016
Supervised speech enhancement using online Group-Sparse Convolutive NMF.
Proceedings of the 8th International Symposium on Telecommunications, 2016


  Loading...