We stand with Ukraine

We stand with Ukraine

Danwei Cai

Orcid: 0000-0002-5122-0623

According to our database¹, Danwei Cai authored at least 30 papers between 2016 and 2024.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Leveraging ASR Pretrained Conformers for Speaker Verification Through Transfer Learning and Knowledge Distillation.

[BibT_eX]

[DOI]

,

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Self-supervised Reflective Learning through Self-distillation and Online Clustering for Speaker Representation Learning.

[BibT_eX]

[DOI]

,

,

CoRR, 2024

Joint Inference of Speaker Diarization and ASR with Multi-Stage Information Sharing.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Robust Multi-Channel Far-Field Speaker Verification Under Different In-Domain Data Availability Scenarios.

[BibT_eX]

[DOI]

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Pretraining Conformer with ASR for Speaker Verification.

[BibT_eX]

[DOI]

,

,

,

,

Chuanzeng Huang

Proceedings of the IEEE International Conference on Acoustics, 2023

Identifying Source Speakers for Voice Conversion Based Spoofing Attacks on Speaker Verification Systems.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Similarity Measurement of Segment-Level Speaker Embeddings in Speaker Diarization.

[BibT_eX]

[DOI]

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Incorporating Visual Information in Audio Based Self-Supervised Speaker Recognition.

[BibT_eX]

[DOI]

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2022

2021

The DKU-DukeECE System for the Self-Supervision Speaker Verification Task of the 2021 VoxCeleb Speaker Recognition Challenge.

[BibT_eX]

[DOI]

,

CoRR, 2021

The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2021

The DKU-Duke-Lenovo System Description for the Third DIHARD Speech Diarization Challenge.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2021

Embedding Aggregation for Far-Field Speaker Verification with Distributed Microphone Arrays.

[BibT_eX]

[DOI]

,

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

The DKU-Duke-Lenovo System Description for the Fearless Steps Challenge Phase III.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

An Iterative Framework for Self-Supervised Deep Speaker Representation Learning.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Within-Sample Variability-Invariant Loss for Robust Speaker Recognition Under Noisy Environments.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Far-Field End-to-End Text-Dependent Speaker Verification Based on Mixed Training Data with Transfer Learning and Enrollment Data Augmentation.

[BibT_eX]

[DOI]

,

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Survey Talk: End-to-End Deep Neural Network Based Speaker and Language Recognition.

[BibT_eX]

[DOI]

,

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, and Fusion.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multi-Channel Training for End-to-End Speaker Recognition Under Reverberant and Noisy Environment.

[BibT_eX]

[DOI]

,

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

The DKU System for the Speaker Recognition Task of the 2019 VOiCES from a Distance Challenge.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

The DKU-SMIIP System for NIST 2018 Speaker Recognition Evaluation.

[BibT_eX]

[DOI]

,

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Utterance-level End-to-end Language Identification Using Attention-based CNN-BLSTM.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Cancellable speech template via random binary orthogonal matrices projection hashing.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Pattern Recognit., 2018

End-to-end Language Identification using NetFV and NetVLAD.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

The DKU-JNU-EMA Electromagnetic Articulography Database on Mandarin and Chinese Dialects with Tandem Feature based Acoustic-to-Articulatory Inversion.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Deep Speaker Embeddings with Convolutional Neural Network on Supervector for Text-Independent Speaker Recognition.

[BibT_eX]

[DOI]

,

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017

End-to-End Deep Learning Framework for Speech Paralinguistics Detection Based on Perception Aware Spectrum.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Countermeasures for Automatic Speaker Verification Replay Spoofing Attack : On Data Augmentation, Feature Representation, Classification and Fusion.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Mandarin electrolaryngeal voice conversion with combination of Gaussian mixture model and non-negative matrix factorization.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

Locality sensitive discriminant analysis for speaker verification.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Loading...