Danwei Cai

Orcid: 0000-0002-5122-0623

According to our database1, Danwei Cai authored at least 30 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Leveraging ASR Pretrained Conformers for Speaker Verification Through Transfer Learning and Knowledge Distillation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Self-supervised Reflective Learning through Self-distillation and Online Clustering for Speaker Representation Learning.
CoRR, 2024

Joint Inference of Speaker Diarization and ASR with Multi-Stage Information Sharing.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Robust Multi-Channel Far-Field Speaker Verification Under Different In-Domain Data Availability Scenarios.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Pretraining Conformer with ASR for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2023

Identifying Source Speakers for Voice Conversion Based Spoofing Attacks on Speaker Verification Systems.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Similarity Measurement of Segment-Level Speaker Embeddings in Speaker Diarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Incorporating Visual Information in Audio Based Self-Supervised Speaker Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

2021
The DKU-DukeECE System for the Self-Supervision Speaker Verification Task of the 2021 VoxCeleb Speaker Recognition Challenge.
CoRR, 2021

The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge.
CoRR, 2021

The DKU-Duke-Lenovo System Description for the Third DIHARD Speech Diarization Challenge.
CoRR, 2021

Embedding Aggregation for Far-Field Speaker Verification with Distributed Microphone Arrays.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

The DKU-Duke-Lenovo System Description for the Fearless Steps Challenge Phase III.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

An Iterative Framework for Self-Supervised Deep Speaker Representation Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Within-Sample Variability-Invariant Loss for Robust Speaker Recognition Under Noisy Environments.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Far-Field End-to-End Text-Dependent Speaker Verification Based on Mixed Training Data with Transfer Learning and Enrollment Data Augmentation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Survey Talk: End-to-End Deep Neural Network Based Speaker and Language Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, and Fusion.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multi-Channel Training for End-to-End Speaker Recognition Under Reverberant and Noisy Environment.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

The DKU System for the Speaker Recognition Task of the 2019 VOiCES from a Distance Challenge.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

The DKU-SMIIP System for NIST 2018 Speaker Recognition Evaluation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Utterance-level End-to-end Language Identification Using Attention-based CNN-BLSTM.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Cancellable speech template via random binary orthogonal matrices projection hashing.
Pattern Recognit., 2018

End-to-end Language Identification using NetFV and NetVLAD.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

The DKU-JNU-EMA Electromagnetic Articulography Database on Mandarin and Chinese Dialects with Tandem Feature based Acoustic-to-Articulatory Inversion.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Deep Speaker Embeddings with Convolutional Neural Network on Supervector for Text-Independent Speaker Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
End-to-End Deep Learning Framework for Speech Paralinguistics Detection Based on Perception Aware Spectrum.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Countermeasures for Automatic Speaker Verification Replay Spoofing Attack : On Data Augmentation, Feature Representation, Classification and Fusion.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Mandarin electrolaryngeal voice conversion with combination of Gaussian mixture model and non-negative matrix factorization.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Locality sensitive discriminant analysis for speaker verification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016


  Loading...