Xiangang Li

Orcid: 0000-0002-7810-1077

According to our database¹, Xiangang Li authored at least 77 papers between 2012 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

An Effective Magnetic Anomaly Detection Using Orthonormal Basis of Magnetic Gradient Tensor Invariants.

[BibT_eX]

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2024

OpenChat: Advancing Open-source Language Models with Mixed-Quality Data.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

A Compensation Method in Magnetic Distortion Through Regularized Inverse Problems.

[BibT_eX]

[DOI]

IEEE Trans. Instrum. Meas., 2023

A Comparative Study between Full-Parameter and LoRA-based Fine-Tuning on Chinese Instruction Data for Instruction Following Large Language Model.

[BibT_eX]

[DOI]

CoRR, 2023

Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation.

[BibT_eX]

[DOI]

CoRR, 2023

CornerFormer: Boosting Corner Representation for Fine-Grained Structured Reconstruction.

[BibT_eX]

[DOI]

CoRR, 2023

Exploring the Impact of Instruction Data Scaling on Large Language Models: An Empirical Study on Real-World Use Cases.

[BibT_eX]

[DOI]

CoRR, 2023

Exploring ChatGPT's Ability to Rank Content: A Preliminary Study on Consistency with Human Preferences.

[BibT_eX]

[DOI]

CoRR, 2023

Semi-Supervised 2D Human Pose Estimation Driven by Position Inconsistency Pseudo Label Correction Module.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Domain-Adapted Dependency Parsing for Cross-Domain Named Entity Recognition.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Audio Deep Fake Detection System with Neural Stitching for ADD 2022.

[BibT_eX]

[DOI]

CoRR, 2022

BEIKE NLP at SemEval-2022 Task 4: Prompt-Based Paragraph Classification for Patronizing and Condescending Language Detection.

[BibT_eX]

[DOI]

Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

To Answer or Not To Answer? Improving Machine Reading Comprehension Model with Span-based Contrastive Learning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Audio Deepfake Detection System with Neural Stitching for ADD 2022.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Audio-Visual Wake Word Spotting System for MISP Challenge 2021.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Time Domain Adversarial Voice Conversion for ADD 2022.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10, 000 Hours of Transcribed Audio.

[BibT_eX]

[DOI]

CoRR, 2021

KeSpeech: An Open Source Speech Dataset of Mandarin and Its Eight Subdialects.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Order-aware Pairwise Intoxication Detection.

[BibT_eX]

[DOI]

Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Semantic Data Augmentation for End-to-End Mandarin Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Speech SimCLR: Combining Contrastive and Reconstruction Objective for Self-Supervised Speech Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

GigaSpeech: An Evolving, Multi-Domain ASR Corpus with 10, 000 Hours of Transcribed Audio.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Transformer Based Unsupervised Pre-Training for Acoustic Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

A Further Study of Unsupervised Pretraining for Transformer Based Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Didispeech: A Large Scale Mandarin Speech Corpus.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning.

[BibT_eX]

[DOI]

CoRR, 2020

A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition.

[BibT_eX]

[DOI]

CoRR, 2020

Adversarial Multi-Binary Neural Network for Multi-class Classification.

[BibT_eX]

[DOI]

CoRR, 2020

DiDi's Machine Translation System for WMT2020.

[BibT_eX]

[DOI]

Proceedings of the Fifth Conference on Machine Translation, 2020

On Loss Functions and Recurrency Training for GAN-Based Speech Enhancement Systems.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Generative Adversarial Network Based Acoustic Echo Cancellation.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

TMT: A Transformer-Based Modal Translator for Improving Multimodal Sequence Representations in Audio Visual Scene-Aware Dialog.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Conv-TasSAN: Separative Adversarial Network Based on Conv-TasNet.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Selective Attention Encoders by Syntactic Graph Convolutional Networks for Document Summarization.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

DNN-based Mask Estimation Integrating Spectral and Spatial Features for Robust Beamforming.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

TCT: A Cross-supervised Learning Method for Multimodal Sequence Representation.

[BibT_eX]

[DOI]

Wubo Li

Wei Zou

Xiangang Li

CoRR, 2019

Cross-task pre-training for acoustic scene classification.

[BibT_eX]

[DOI]

Ruixiong Zhang

Wei Zou

Xiangang Li

CoRR, 2019

Improving Transformer-based Speech Recognition Using Unsupervised Pre-training.

[BibT_eX]

[DOI]

CoRR, 2019

DELTA: A DEep learning based Language Technology plAtform.

[BibT_eX]

[DOI]

CoRR, 2019

Learning Alignment for Multimodal Emotion Recognition from Speech.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Environment-Dependent Attention-Driven Recurrent Convolutional Neural Network for Robust Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Learning Syntactic and Dynamic Selective Encoding for Document Summarization.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2019

NVSRN: A Neural Variational Scaling Reasoning Network for Initiative Response Generation.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Replay Attack Detection Using Magnitude and Phase Information with Attention-based Adaptive Filters.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

The TJU-Didi-Huiyan system for Blizzard Challenge 2019.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2019, Vienna, Austria, September 23, 2019, 2019

Deep Segment Attentive Embedding for Duration Robust Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

Towards End-to-End Code-Switching Speech Recognition.

[BibT_eX]

[DOI]

CoRR, 2018

A comparable study of modeling units for end-to-end Mandarin speech recognition.

[BibT_eX]

[DOI]

CoRR, 2018

Comparable Study Of Modeling Units For End-To-End Mandarin Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

An Analysis of Decoding for Attention-Based End-to-End Mandarin Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Pitch Synchronized Relative Phase with Peak Error Detection For Noise-robust Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Multiple Phase Information Combination for Replay Attacks Detection.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Speech Emotion Recognition by Combining Amplitude and Phase Information Using Convolutional Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Interaction-Aware Topic Model for Microblog Conversations through Network Embedding and User Attention.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Computational Linguistics, 2018

Implicit Discourse Relation Recognition using Neural Tensor Network with Interactive Attention and Sparse Learning.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017

Deep Speaker: an End-to-End Neural Speaker Embedding System.

[BibT_eX]

[DOI]

CoRR, 2017

Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

2016

Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin.

[BibT_eX]

[DOI]

Proceedings of the 33nd International Conference on Machine Learning, 2016

2015

A comparative study on selecting acoustic modeling units in deep neural networks based large vocabulary Chinese speech recognition.

[BibT_eX]

[DOI]

Neurocomputing, 2015

I-vector dependent feature space transformations for adaptive speech recognition.

[BibT_eX]

[DOI]

Xiangang Li

Xihong Wu

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Long short-term memory based convolutional recurrent neural networks for large vocabulary speech recognition.

[BibT_eX]

[DOI]

Xiangang Li

Xihong Wu

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Modeling speaker variability using long short-term memory networks for speech recognition.

[BibT_eX]

[DOI]

Xiangang Li

Xihong Wu

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Improving long short-term memory networks using maxout units for large vocabulary speech recognition.

[BibT_eX]

[DOI]

Xiangang Li

Xihong Wu

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Constructing long short-term memory based deep recurrent neural networks for large vocabulary speech recognition.

[BibT_eX]

[DOI]

Xiangang Li

Xihong Wu

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Chinese syllable-to-character conversion with recurrent neural network based supervised sequence labelling.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Integrating prosodic information into recurrent neural network language model for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014

Error-driven pronunciation dictionary construction for Mandarin speech recognition.

[BibT_eX]

[DOI]

Yi Liu

Xiangang Li

Xihong Wu

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Decision tree based state tying for speech recognition using DNN derived embeddings.

[BibT_eX]

[DOI]

Xiangang Li

Xihong Wu

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Labeling unsegmented sequence data with DNN-HMM and its application for speech recognition.

[BibT_eX]

[DOI]

Xiangang Li

Xihong Wu

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Recurrent neural network language model with part-of-speech for Mandarin speech recognition.

[BibT_eX]

[DOI]

Caixia Gong

Xiangang Li

Xihong Wu

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Query-based composition for large-scale language model in LVCSR.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

A Comparative Study on Selecting Acoustic Modeling Units in Deep Neural Networks Based Large Vocabulary Chinese Speech Recognition.

[BibT_eX]

[DOI]

Xiangang Li

Yuning Yang

Xihong Wu

Proceedings of the Intelligence Science and Big Data Engineering, 2013

Overview of SHRC-Ginkgo speech synthesis system for Blizzard Challenge 2013.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2013, 2013

Deep neural networks for syllable based acoustic modeling in Chinese speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

The effect of part-of-speech on Mandarin speech recognition.

[BibT_eX]

[DOI]

Caixia Gong

Xiangang Li

Xihong Wu

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012

Lightly Supervised Acoustic Model Training for Mandarin Continuous Speech Recognition.

[BibT_eX]

[DOI]

Xiangang Li

Zaihu Pang

Xihong Wu

Proceedings of the Intelligent Science and Intelligent Data Engineering, 2012

Probabilistic Speaker-Class based Acoustic Modeling for Large Vocabulary Continuous Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Xiangang Li

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...