Xiangang Li

Orcid: 0000-0002-7810-1077

According to our database1, Xiangang Li authored at least 77 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
An Effective Magnetic Anomaly Detection Using Orthonormal Basis of Magnetic Gradient Tensor Invariants.
IEEE Trans. Geosci. Remote. Sens., 2024

OpenChat: Advancing Open-source Language Models with Mixed-Quality Data.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
A Compensation Method in Magnetic Distortion Through Regularized Inverse Problems.
IEEE Trans. Instrum. Meas., 2023

A Comparative Study between Full-Parameter and LoRA-based Fine-Tuning on Chinese Instruction Data for Instruction Following Large Language Model.
CoRR, 2023

Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation.
CoRR, 2023

CornerFormer: Boosting Corner Representation for Fine-Grained Structured Reconstruction.
CoRR, 2023

Exploring the Impact of Instruction Data Scaling on Large Language Models: An Empirical Study on Real-World Use Cases.
CoRR, 2023

Exploring ChatGPT's Ability to Rank Content: A Preliminary Study on Consistency with Human Preferences.
CoRR, 2023

Semi-Supervised 2D Human Pose Estimation Driven by Position Inconsistency Pseudo Label Correction Module.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Domain-Adapted Dependency Parsing for Cross-Domain Named Entity Recognition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Audio Deep Fake Detection System with Neural Stitching for ADD 2022.
CoRR, 2022

BEIKE NLP at SemEval-2022 Task 4: Prompt-Based Paragraph Classification for Patronizing and Condescending Language Detection.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

To Answer or Not To Answer? Improving Machine Reading Comprehension Model with Span-based Contrastive Learning.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Audio Deepfake Detection System with Neural Stitching for ADD 2022.
Proceedings of the IEEE International Conference on Acoustics, 2022

Audio-Visual Wake Word Spotting System for MISP Challenge 2021.
Proceedings of the IEEE International Conference on Acoustics, 2022

Time Domain Adversarial Voice Conversion for ADD 2022.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10, 000 Hours of Transcribed Audio.
CoRR, 2021

KeSpeech: An Open Source Speech Dataset of Mandarin and Its Eight Subdialects.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Order-aware Pairwise Intoxication Detection.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Semantic Data Augmentation for End-to-End Mandarin Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Speech SimCLR: Combining Contrastive and Reconstruction Objective for Self-Supervised Speech Representation Learning.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

GigaSpeech: An Evolving, Multi-Domain ASR Corpus with 10, 000 Hours of Transcribed Audio.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Transformer Based Unsupervised Pre-Training for Acoustic Representation Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Further Study of Unsupervised Pretraining for Transformer Based Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Didispeech: A Large Scale Mandarin Speech Corpus.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning.
CoRR, 2020

A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition.
CoRR, 2020

Adversarial Multi-Binary Neural Network for Multi-class Classification.
CoRR, 2020

DiDi's Machine Translation System for WMT2020.
Proceedings of the Fifth Conference on Machine Translation, 2020

On Loss Functions and Recurrency Training for GAN-Based Speech Enhancement Systems.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Generative Adversarial Network Based Acoustic Echo Cancellation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

TMT: A Transformer-Based Modal Translator for Improving Multimodal Sequence Representations in Audio Visual Scene-Aware Dialog.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Conv-TasSAN: Separative Adversarial Network Based on Conv-TasNet.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Selective Attention Encoders by Syntactic Graph Convolutional Networks for Document Summarization.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

DNN-based Mask Estimation Integrating Spectral and Spatial Features for Robust Beamforming.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
TCT: A Cross-supervised Learning Method for Multimodal Sequence Representation.
CoRR, 2019

Cross-task pre-training for acoustic scene classification.
CoRR, 2019

Improving Transformer-based Speech Recognition Using Unsupervised Pre-training.
CoRR, 2019

DELTA: A DEep learning based Language Technology plAtform.
CoRR, 2019

Learning Alignment for Multimodal Emotion Recognition from Speech.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Environment-Dependent Attention-Driven Recurrent Convolutional Neural Network for Robust Speech Enhancement.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Learning Syntactic and Dynamic Selective Encoding for Document Summarization.
Proceedings of the International Joint Conference on Neural Networks, 2019

NVSRN: A Neural Variational Scaling Reasoning Network for Initiative Response Generation.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Replay Attack Detection Using Magnitude and Phase Information with Attention-based Adaptive Filters.
Proceedings of the IEEE International Conference on Acoustics, 2019

The TJU-Didi-Huiyan system for Blizzard Challenge 2019.
Proceedings of the Blizzard Challenge 2019, Vienna, Austria, September 23, 2019, 2019

Deep Segment Attentive Embedding for Duration Robust Speaker Verification.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Towards End-to-End Code-Switching Speech Recognition.
CoRR, 2018

A comparable study of modeling units for end-to-end Mandarin speech recognition.
CoRR, 2018

Comparable Study Of Modeling Units For End-To-End Mandarin Speech Recognition.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

An Analysis of Decoding for Attention-Based End-to-End Mandarin Speech Recognition.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Pitch Synchronized Relative Phase with Peak Error Detection For Noise-robust Speaker Recognition.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Multiple Phase Information Combination for Replay Attacks Detection.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Speech Emotion Recognition by Combining Amplitude and Phase Information Using Convolutional Neural Network.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Interaction-Aware Topic Model for Microblog Conversations through Network Embedding and User Attention.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Implicit Discourse Relation Recognition using Neural Tensor Network with Interactive Attention and Sparse Learning.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017
Deep Speaker: an End-to-End Neural Speaker Embedding System.
CoRR, 2017

Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling.
Proceedings of the 34th International Conference on Machine Learning, 2017

2016

2015
A comparative study on selecting acoustic modeling units in deep neural networks based large vocabulary Chinese speech recognition.
Neurocomputing, 2015

I-vector dependent feature space transformations for adaptive speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Long short-term memory based convolutional recurrent neural networks for large vocabulary speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Modeling speaker variability using long short-term memory networks for speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Improving long short-term memory networks using maxout units for large vocabulary speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Constructing long short-term memory based deep recurrent neural networks for large vocabulary speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Chinese syllable-to-character conversion with recurrent neural network based supervised sequence labelling.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Integrating prosodic information into recurrent neural network language model for speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Error-driven pronunciation dictionary construction for Mandarin speech recognition.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Decision tree based state tying for speech recognition using DNN derived embeddings.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Labeling unsegmented sequence data with DNN-HMM and its application for speech recognition.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Recurrent neural network language model with part-of-speech for Mandarin speech recognition.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Query-based composition for large-scale language model in LVCSR.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
A Comparative Study on Selecting Acoustic Modeling Units in Deep Neural Networks Based Large Vocabulary Chinese Speech Recognition.
Proceedings of the Intelligence Science and Big Data Engineering, 2013

Overview of SHRC-Ginkgo speech synthesis system for Blizzard Challenge 2013.
Proceedings of the Blizzard Challenge 2013, 2013

Deep neural networks for syllable based acoustic modeling in Chinese speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

The effect of part-of-speech on Mandarin speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Lightly Supervised Acoustic Model Training for Mandarin Continuous Speech Recognition.
Proceedings of the Intelligent Science and Intelligent Data Engineering, 2012

Probabilistic Speaker-Class based Acoustic Modeling for Large Vocabulary Continuous Speech Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012


  Loading...