Ruijie Tao

Orcid: 0000-0003-0021-5661

According to our database1, Ruijie Tao authored at least 25 papers between 2020 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Deep Cross-Modal Retrieval Between Spatial Image and Acoustic Speech.
IEEE Trans. Multim., 2024

Unified Audio Event Detection.
CoRR, 2024

A Benchmark for Multi-speaker Anonymization.
CoRR, 2024

Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection.
CoRR, 2024

Target Speech Diarization with Multimodal Prompts.
CoRR, 2024

How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio?
CoRR, 2024

Audio-Visual Target Speaker Extraction with Reverse Selective Auditory Attention.
CoRR, 2024

Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2024

Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-Talker Speech.
Proceedings of the IEEE International Conference on Acoustics, 2024

Prompt-Driven Target Speech Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Self-Supervised Training of Speaker Encoder With Multi-Modal Diverse Positive Pairs.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

USED: Universal Speaker Extraction and Diarization.
CoRR, 2023

Target Active Speaker Detection with Audio-visual Cues.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Speaker Recognition with Two-Step Multi-Modal Deep Cleansing.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Selective Listening by Synchronizing Speech With Lips.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

I4U System Description for NIST SRE'20 CTS Challenge.
CoRR, 2022

Self-Supervised Speaker Recognition with Loss-Gated Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022


2021
Ego4D: Around the World in 3, 000 Hours of Egocentric Video.
CoRR, 2021

Is Someone Speaking?: Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Muse: Multi-Modal Target Speaker Extraction with Visual Cues.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition Evaluation.
CoRR, 2020

Audio-Visual Speaker Recognition with a Cross-Modal Discriminative Network.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

HLT-NUS Submission for 2019 NIST Multimedia Speaker Recognition Evaluation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020


  Loading...