Yidi Jiang

Orcid: 0000-0002-2317-9571

According to our database1, Yidi Jiang authored at least 12 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Unified Audio Event Detection.
CoRR, 2024

Flow-TSVAD: Target-Speaker Voice Activity Detection via Latent Flow Matching.
CoRR, 2024

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling.
CoRR, 2024

Target Speech Diarization with Multimodal Prompts.
CoRR, 2024

Audio-Visual Target Speaker Extraction with Reverse Selective Auditory Attention.
CoRR, 2024

Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Prompt-Driven Target Speech Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
The NUS-HLT System for ICASSP2024 ICMC-ASR Grand Challenge.
CoRR, 2023

EEG-Derived Voice Signature for Attended Speaker Detection.
CoRR, 2023

Target Active Speaker Detection with Audio-visual Cues.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2021
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021


  Loading...