Midia Yousefi

According to our database¹, Midia Yousefi authored at least 17 papers between 2016 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Isochrony-Controlled Speech-to-Text Translation: A study on translating from Sino-Tibetan to Indo-European Languages.

[BibT_eX]

[DOI]

CoRR, 2024

TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation.

[BibT_eX]

[DOI]

CoRR, 2024

Investigating Neural Audio Codecs For Speech Language Model-Based Speech Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Profile-Error-Tolerant Target-Speaker Voice Activity Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Single-channel speech separation using soft-minimum permutation invariant training.

[BibT_eX]

[DOI]

Midia Yousefi

John H. L. Hansen

Speech Commun., June, 2023

Speaker Diarization for ASR Output with T-vectors: A Sequence Classification Approach.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2021

Block-Based High Performance CNN Architectures for Frame-Level Overlapping Speech Detection.

[BibT_eX]

[DOI]

Midia Yousefi

John H. L. Hansen

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Speaker conditioning of acoustic models using affine transformation for multi-speaker speech recognition.

[BibT_eX]

[DOI]

Midia Yousefi

John H. L. Hanse

CoRR, 2021

Real-Time Speaker Counting in a Cocktail Party Scenario Using Attention-Guided Convolutional Neural Network.

[BibT_eX]

[DOI]

Midia Yousefi

John H. L. Hansen

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Audio-based Toxic Language Classification using Self-attentive Convolutional Neural Network.

[BibT_eX]

[DOI]

Midia Yousefi

Dimitra Emmanouilidou

Proceedings of the 29th European Signal Processing Conference, 2021

Speaker Conditioning of Acoustic Models Using Affine Transformation for Multi-Speaker Speech Recognition.

[BibT_eX]

[DOI]

Midia Yousefi

John H. L. Hansen

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

Frame-Based Overlapping Speech Detection Using Convolutional Neural Networks.

[BibT_eX]

[DOI]

Midia Yousefi

John H. L. Hansen

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Probabilistic Permutation Invariant Training for Speech Separation.

[BibT_eX]

[DOI]

Midia Yousefi

Soheil Khorram

John H. L. Hansen

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018

Assessing Speaker Engagement in 2-Person Debates: Overlap Detection in United States Presidential Debates.

[BibT_eX]

[DOI]

Midia Yousefi

Navid Shokouhi

John H. L. Hansen

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2016

Supervised speech enhancement using online Group-Sparse Convolutive NMF.

[BibT_eX]

[DOI]

Midia Yousefi

Mohammad Hassan Savoji

Proceedings of the 8th International Symposium on Telecommunications, 2016

Midia Yousefi

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...