Xiaodong Cui

Orcid: 0000-0002-4762-6161

According to our database1, Xiaodong Cui authored at least 131 papers between 2000 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
MAL-YOLO: a lightweight algorithm for target detection in side-scan sonar images based on multi-scale feature fusion and attention mechanism.
Int. J. Digit. Earth, December, 2024

Optimizing multi-classifier fusion for seabed sediment classification using machine learning.
Int. J. Digit. Earth, December, 2024

Node Injection Attack Based on Label Propagation Against Graph Neural Network.
IEEE Trans. Comput. Soc. Syst., October, 2024

A Novel Multi-Feature Fusion Model Based on Pre-Trained Wav2vec 2.0 for Underwater Acoustic Target Recognition.
Remote. Sens., July, 2024

Application of Sample Enhancement Method Combining Superpixel Segmentation and Active Learning in MBES Seafloor Sediment Classification.
IEEE Trans. Geosci. Remote. Sens., 2024

Anomaly Detection in Multibeam Bathymetric Point Clouds Integrating Prior Constraints With Geostatistical Prediction.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024

A knowledge-guided graph attention network for emotion-cause pair extraction.
Knowl. Based Syst., 2024

Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis.
CoRR, 2024

Training Nonlinear Transformers for Efficient In-Context Learning: A Theoretical Learning and Generalization Analysis.
CoRR, 2024

How Do Nonlinear Transformers Learn and Generalize in In-Context Learning?
Proceedings of the Forty-first International Conference on Machine Learning, 2024

How Can Personalized Context Help? Exploring Joint Retrieval of Passage and Personalized Context.
Proceedings of the IEEE International Conference on Acoustics, 2024

Reparameterization Head for Efficient Multi-Input Networks.
Proceedings of the IEEE International Conference on Acoustics, 2024

Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Multimodal fake news detection through data augmentation-based contrastive learning.
Appl. Soft Comput., March, 2023

A Sample Enhancement Method Based on Simple Linear Iterative Clustering Superpixel Segmentation Applied to Multibeam Seabed Classification.
IEEE Trans. Geosci. Remote. Sens., 2023

Seafloor Habitat Mapping by Combining Multiple Features From Optic and Acoustic Data: A Case Study From Ganquan Island, South China Sea.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2023

Soft Random Sampling: A Theoretical and Empirical Analysis.
CoRR, 2023

How Can Context Help? Exploring Joint Retrieval of Passage and Personalized Context.
CoRR, 2023

Improving RNN Transducer Acoustic Models for English Conversational Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

A Transformer-Based OFDM Receiver for Underwater Acoustic Communication.
Proceedings of the IEEE International Conference on Signal Processing, 2023

Compressed Decentralized Proximal Stochastic Gradient Method for Nonconvex Composite Problems with Heterogeneous Data.
Proceedings of the International Conference on Machine Learning, 2023

Diagonal State Space Augmented Transformers for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

HEPT Attack: Heuristic Perpendicular Trial for Hard-label Attacks under Limited Query Budgets.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Distributed Offline Policy Optimization Over Batch Data.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Deep-Sea Sediment Mixed Pixel Decomposition Based on Multibeam Backscatter Intensity Segmentation.
IEEE Trans. Geosci. Remote. Sens., 2022

Promotion of Interface Fusion of Solid Polymer Electrolyte and Cathode by Ultrasonic Vibration.
Sensors, 2022

MBES Seabed Sediment Classification Based on a Decision Fusion Method Using Deep Learning Model.
Remote. Sens., 2022

Speech Emotion Recognition with Complementary Acoustic Representations.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

A Stochastic Linearized Augmented Lagrangian Method for Decentralized Bilevel Optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Decentralized Bilevel Optimization for Personalized Client Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Asynchronous Decentralized Distributed Training of Acoustic Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Loss Landscape Dependent Self-Adjusting Learning Rates in Decentralized Stochastic Gradient Descent.
CoRR, 2021

On Sample Based Explanation Methods for NLP: Efficiency, Faithfulness, and Semantic Evaluation.
CoRR, 2021

4-Bit Quantization of LSTM-Based Speech Recognition Models.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Reducing Exposure Bias in Training Recurrent Neural Network Transducers.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Federated Acoustic Modeling for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

On Sample Based Explanation Methods for NLP: Faithfulness, Efficiency and Semantic Evaluation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition: A comparison of current training strategies.
IEEE Signal Process. Mag., 2020

Change Detection from Remote Sensing to Guide OpenStreetMap Labeling.
ISPRS Int. J. Geo Inf., 2020

Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition.
CoRR, 2020

The effects of heterogeneity of updating rules on cooperation in spatial network.
Appl. Math. Comput., 2020

Ultra-Low Precision 4-bit Training of Deep Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Decentralized Parallel Algorithm for Training Generative Adversarial Nets.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

ScaleCom: Scalable Sparsified Gradient Compression for Communication-Efficient Distributed Training.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Map Generation from Large Scale Incomplete and Inaccurate Data Labels.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Task-Based Learning via Task-Oriented Prediction Network with Applications in Finance.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Towards Better Understanding of Adaptive Gradient Algorithms in Generative Adversarial Nets.
Proceedings of the 8th International Conference on Learning Representations, 2020

Improving Efficiency in Large-Scale Decentralized Distributed Training.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Decentralized Parallel Algorithm for Training Generative Adversarial Nets.
CoRR, 2019

Task-Based Learning via Task-Oriented Prediction Network.
CoRR, 2019

Hybrid 8-bit Floating Point (HFP8) Training and Inference for Deep Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

A Highly Efficient Distributed Deep Learning System for Automatic Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Challenging the Boundaries of Speech Recognition: The MALACH Corpus.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Acoustic Model Optimization Based on Evolutionary Stochastic Gradient Descent with Anchors for Automatic Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Distributed Deep Learning Strategies for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Cyclegan Bandwidth Extension Acoustic Modeling for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
MeTDiff: A Novel Differential RNA Methylation Analysis for MeRIP-Seq Data.
IEEE ACM Trans. Comput. Biol. Bioinform., 2018

Evolutionary Stochastic Gradient Descent for Optimization of Deep Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017
Dilated Recurrent Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

English Conversational Telephone Speech Recognition by Humans and Machines.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Embedding-Based Speaker Adaptive Training of Deep Neural Networks.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Network architectures for multilingual speech representation learning.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Maximum Likelihood Nonlinear Transformations Based on Deep Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

A novel algorithm for calling mRNA m<sup>6</sup>A peaks by modeling biological variances in MeRIP-seq data.
Bioinform., 2016

Efficient non-linear feature adaptation using Maxout networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Data Augmentation for Deep Neural Network Acoustic Modeling.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Annealed dropout trained maxout networks for improved LVCSR.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Data augmentation for deep convolutional neural network acoustic modeling.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Modeling of replicates variances for detecting RNA methylation site in MERIP-SEQ data.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Sketching the distribution of transcriptomic features on RNA transcripts with Travis coordinates.
Proceedings of the 2015 IEEE International Conference on Bioinformatics and Biomedicine, 2015

Multilingual representations for low resource speech recognition and keyword search.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Exploiting vocal-source features to improve ASR accuracy for low-resource languages.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Recent improvements in neural network acoustic modeling for LVCSR in low resource languages.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Improving deep neural network acoustic modeling for audio corpus indexing under the IARPA babel program.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

A family of discriminative training criteria based on the F-divergence for deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

Differential analysis of RNA methylome with improved spatial resolution.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

Detecting differentially methylated mRNA from MeRIP-Seq with likelihood ratio test.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

2013
The IBM speech-to-speech translation system for smartphone: Improvements for resource-constrained tasks.
Comput. Speech Lang., 2013

Stereo hidden Markov modeling for noise robust speech recognition.
Comput. Speech Lang., 2013

Exome-based analysis for RNA epigenome sequencing data.
Bioinform., 2013

Adaptive stereo-based stochastic mapping.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Mixtures of Bayesian joint factor analyzers for noise robust automatic speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

System combination and score normalization for spoken term detection.
Proceedings of the IEEE International Conference on Acoustics, 2013

A high-performance Cantonese keyword search system.
Proceedings of the IEEE International Conference on Acoustics, 2013

Developing speech recognition systems for corpus indexing under the IARPA Babel program.
Proceedings of the IEEE International Conference on Acoustics, 2013

Differential analysis of rna methylation sequencing data.
Proceedings of the IEEE Global Conference on Signal and Information Processing, 2013

An HMM-based Exome Peak-finding package for RNA epigenome sequencing data.
Proceedings of the 2013 IEEE International Workshop on Genomic Signal Processing and Statistics, 2013

Unveiling the dynamics in RNA epigenetic regulations.
Proceedings of the 2013 IEEE International Conference on Bioinformatics and Biomedicine, 2013

An empirical study of confusion modeling in keyword search for low resource languages.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Hidden Markov Acoustic Modeling With Bootstrap and Restructuring for Low-Resourced Languages.
IEEE Trans. Speech Audio Process., 2012

Multi-View and Multi-Objective Semi-Supervised Learning for HMM-Based Automatic Speech Recognition.
IEEE Trans. Speech Audio Process., 2012

Sparse Bayesian Factor Analysis for Stereo-based Stochastic Mapping.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Stereo-based stochastic mapping with context using probabilistic PCA for noise robust automatic speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Towards High Performance LVCSR in Speech-to-Speech Translation System on Smart Phones.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Acoustic Modeling with Bootstrap and Restructuring Based on Full Covariance.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Multi-view and multi-objective semi-supervised learning for large vocabulary continuous speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Clustering of bootstrapped acoustic model with full covariance.
Proceedings of the IEEE International Conference on Acoustics, 2011

An investigation of heuristic, manual and statistical pronunciation derivation for Pashto.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Applying scalable phonetic context similarity in unit selection of concatenative text-to-speech.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Acoustic modeling with bootstrap and restructuring for low-resourced languages.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A comparative study on system combination schemes for LVCSR.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Stereo-Based Stochastic Mapping for Robust Speech Recognition.
IEEE Trans. Speech Audio Process., 2009

A study of bootstrapping with multiple acoustic features for improved automatic speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Stereo-based stochastic mapping with discriminative training for noise robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

Improving online incremental speaker adaptation with eigen feature space MLLR.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
High-performance low-latency speech recognition via multi-layered feature streaming and fast Gaussian computation.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

N-best based stochastic mapping on stereo HMM for noise robust speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Developing high performance asr in the IBM multilingual speech-to-speech translation system.
Proceedings of the IEEE International Conference on Acoustics, 2008

MMSE-based stereo feature stochastic mapping for noise robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Speaker Adaptation With Limited Data Using Regression-Tree-Based Spectral Peak Alignment.
IEEE Trans. Speech Audio Process., 2007

A Study of Variable-Parameter Gaussian Mixture Hidden Markov Modeling for Noisy Speech Recognition.
IEEE Trans. Speech Audio Process., 2007

Robust Speaker Adaptation by Weighted Model Averaging Based on the Minimum Description Length Criterion.
IEEE Trans. Speech Audio Process., 2007

2006
Adaptation of children's speech with limited data based on formant-like peak alignment.
Comput. Speech Lang., 2006

Rapid speaker adaptation using regression-tree based spectral peak alignment.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

A Database of Vocal Tract Resonance Trajectories for Research in Speech Processing.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Modeling Variance Variation in a Variable Parameter HMM Framework for Noise Robust Speech Recognition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR.
IEEE Trans. Speech Audio Process., 2005

TBALL data collection: the making of a young children's speech corpus.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

MLLR-like speaker adaptation based on linearization of VTLN with MFCC features.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004
Combining feature compensation and weighted Viterbi decoding for noise robust speech recognition with limited adaptation data.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Can back-ends be more robust than front-ends? Investigation over the Aurora-2 database.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
A noise-robust ASR back-end technique based on weighted viterbi recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Variable parameter Gaussian mixture hidden Markov modeling for speech recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Evaluation of noise robust features on the Aurora databases.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Efficient adaptation text design based on the Kullback-Leibler measure.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Noise robust feature extraction for ASR using the Aurora 2 database.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
A language model adaptation approach based on text classification.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000


  Loading...