Bo Xu
Orcid: 0000-0002-1111-1529
Affiliations:
- University of Science and Technology of China, Department of Automation, Hefei, China
- Chinese Academy of Sciences, Center for Excellence in Brain Science and Intelligence Technology, Beijing, China
- Chinese Academy of Sciences, Institute of Automation, National Laboratory of Pattern Recognition, Beijing, China
According to our database, Bo Xu authored at least 495 papers between 1991 and 2024.
Online presence:
- on orcid.org
- on cebs.ac.cn
Bibliography
2024
Self-Lateral Propagation Elevates Synaptic Modifications in Spiking Neural Networks for the Efficient Spatial and Temporal Classification.
IEEE Trans. Neural Networks Learn. Syst., November, 2024
Tuning Synaptic Connections Instead of Weights by Genetic Algorithm in Spiking Policy Network.
Mach. Intell. Res., October, 2024
Network model with internal complexity bridges artificial intelligence and neuroscience.
Nat. Comput. Sci., August, 2024
Learning Top-K Subtask Planning Tree Based on Discriminative Representation Pretraining for Decision-making.
Mach. Intell. Res., August, 2024
Mach. Intell. Res., April, 2024
A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction.
Mach. Intell. Res., February, 2024
Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
SSCFormer: Push the Limit of Chunk-Wise Conformer for Streaming ASR Using Sequentially Sampled Chunks and Chunked Causal Convolution.
IEEE Signal Process. Lett., 2024
Neural Networks, 2024
Multiscale fusion enhanced spiking neural network for invasive BCI neural signal decoding.
CoRR, 2024
Integer-Valued Training and Spike-Driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection.
CoRR, 2024
CoRR, 2024
Enhanced Spatiotemporal Prediction Using Physical-guided And Frequency-enhanced Recurrent Neural Networks.
CoRR, 2024
Biologically-Plausible Topology Improved Spiking Actor Network for Efficient Deep Reinforcement Learning.
CoRR, 2024
Fourier or Wavelet bases as counterpart self-attention in spikformer for efficient visual classification.
CoRR, 2024
RSC-SNN: Exploring the Trade-off Between Adversarial Robustness and Accuracy in Spiking Neural Networks via Randomized Smoothing Coding.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
CIEASR: Contextual Image-Enhanced Automatic Speech Recognition for Improved Homophone Discrimination.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Bridge the Query and Document: Contrastive Learning for Generative Document Retrieval.
Proceedings of the International Joint Conference on Neural Networks, 2024
Proceedings of the International Joint Conference on Neural Networks, 2024
Long Short-Term Reasoning Network with Theory of Mind for Efficient Multi-Agent Cooperation.
Proceedings of the International Joint Conference on Neural Networks, 2024
Proceedings of the International Joint Conference on Neural Networks, 2024
Proceedings of the International Joint Conference on Neural Networks, 2024
High-Performance Temporal Reversible Spiking Neural Networks with O(L) Training Memory and O(1) Inference Cost.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
ViLaS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024
A New Pre-Training Paradigm for Offline Multi-Agent Reinforcement Learning with Suboptimal Data.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
A Brain-Inspired Approach for Probabilistic Estimation and Efficient Planning in Precision Physical Interaction.
IEEE Trans. Cybern., October, 2023
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023
Neurocomputing, April, 2023
Mach. Intell. Res., April, 2023
Origin of the efficiency of spike timing-based neural computation for processing temporal information.
Neural Networks, March, 2023
IEEE CAA J. Autom. Sinica, 2023
Learning Top-k Subtask Planning Tree based on Discriminative Representation Pre-training for Decision Making.
CoRR, 2023
CoRR, 2023
Double Reverse Regularization Network Based on Self-Knowledge Distillation for SAR Object Classification.
CoRR, 2023
Local Convolution Enhanced Global Fourier Neural Operator For Multiscale Dynamic Spaces Prediction.
CoRR, 2023
ApproBiVT: Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeoff Guided Early Stopping and Checkpoint Averaging.
CoRR, 2023
CoRR, 2023
Probabilistic Modeling: Proving the Lottery Ticket Hypothesis in Spiking Neural Network.
CoRR, 2023
Mixture of personality improved Spiking actor network for efficient multi-agent cooperation.
CoRR, 2023
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages.
CoRR, 2023
Proceedings of the Machine Learning and Knowledge Discovery in Databases: Research Track, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Enhancing Visual Question Answering via Deconstructing Questions and Explicating Answers.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs.
Proceedings of the International Joint Conference on Neural Networks, 2023
Make Spoken Document Readable: Leveraging Graph Attention Networks for Chinese Document-Level Spoken-to-Written Simplification.
Proceedings of the Neural Information Processing - 30th International Conference, 2023
Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Task-Prompt Generalised World Model in Multi-Environment Offline Reinforcement Learning.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Corrigendum: A brain-inspired decision-making spiking neural network and its application in unmanned aerial vehicle.
Frontiers Neurorobotics, September, 2022
Tuning Convolutional Spiking Neural Network With Biologically Plausible Reward Propagation.
IEEE Trans. Neural Networks Learn. Syst., 2022
A Brain-Inspired Approach for Collision-Free Movement Planning in the Small Operational Space.
IEEE Trans. Neural Networks Learn. Syst., 2022
Sequence-Level Speaker Change Detection With Difference-Based Continuous Integrate-and-Fire.
IEEE Signal Process. Lett., 2022
IEEE Robotics Autom. Lett., 2022
Compressing speaker extraction model with ultra-low precision quantization and knowledge distillation.
Neural Networks, 2022
Train from scratch: Single-stage joint training of speech separation and recognition.
Comput. Speech Lang., 2022
Motif-topology improved Spiking Neural Network for the Cocktail Party Effect and McGurk Effect.
CoRR, 2022
MCascade R-CNN: A Modified Cascade R-CNN for Detection of Calcified on Coronary Artery Angiography Images.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Joint Modeling of Document and Label with Clause Interaction Hypergraph for ICD Medical Code Assignment.
Proceedings of the International Joint Conference on Neural Networks, 2022
Proceedings of the International Joint Conference on Neural Networks, 2022
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Proceedings of the 2022 International Conference on Robotics and Automation, 2022
Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition, 2022
Motif-Topology and Reward-Learning Improved Spiking Neural Network for Efficient Multi-Sensory Integration.
Proceedings of the IEEE International Conference on Acoustics, 2022
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
A Multi Domain Knowledge Enhanced Matching Network for Response Selection in Retrieval-Based Dialogue Systems.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022
Proceedings of the Thirty-Second International Conference on Automated Planning and Scheduling, 2022
Multi-Sacle Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
IEEE Trans. Ind. Informatics, 2021
Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-Resource Speech Recognition.
IEEE Signal Process. Lett., 2021
Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem.
CoRR, 2021
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks.
CoRR, 2021
Promoting Coordination Through Electing First-move Agent in Multi-Agent Reinforcement Learning.
CoRR, 2021
CoRR, 2021
Population-coding and Dynamic-neurons improved Spiking Actor Network for Reinforcement Learning.
CoRR, 2021
Counterfactual Supporting Facts Extraction for Explainable Medical Record Based Diagnosis with Graph Network.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
Proceedings of the International Joint Conference on Neural Networks, 2021
A Language Model Based Pseudo-Sample Deliberation for Semi-supervised Speech Recognition.
Proceedings of the International Joint Conference on Neural Networks, 2021
Proceedings of the International Joint Conference on Neural Networks, 2021
Proceedings of the International Joint Conference on Neural Networks, 2021
Proceedings of the International Joint Conference on Neural Networks, 2021
Proceedings of the ICCAI '21: 2021 7th International Conference on Computing and Artificial Intelligence, Tianjin, China, April 23, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
MACCIF-TDNN: Multi Aspect Aggregation of Channel and Context Interdependence Features in TDNN-Based Speaker Verification.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
Listen, Understand and Translate: Triple Supervision Decouples End-to-end Speech-to-text Translation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
IEEE Trans. Cogn. Dev. Syst., 2020
Chinese Short Text Classification with Mutual-Attention Convolutional Neural Networks.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2020
A biologically plausible supervised learning method for spiking neural networks using the symmetric STDP rule.
Neural Networks, 2020
CoRR, 2020
Audio-visual Speech Separation with Adversarially Disentangled Visual Representation.
CoRR, 2020
CoRR, 2020
A Comparison of Label-Synchronous and Frame-Synchronous End-to-End Models for Speech Recognition.
CoRR, 2020
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
A Unified Framework for Low-Latency Speaker Extraction in Cocktail Party Environments.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
LISNN: Improving Spiking Neural Networks with Lateral Interactions for Robust Object Recognition.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Proceedings of the Neural Information Processing - 27th International Conference, 2020
Low-Frequency Guided Self-Supervised Learning For High-Fidelity 3d Face Reconstruction In The Wild.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Bridging the Gap between Prior and Posterior Knowledge Selection for Knowledge-Grounded Dialogue Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020
Convolution Pyramid Network: A Classification Network on Coronary Artery Angiogram Images.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020
Knowledge Aware Emotion Recognition in Textual Conversations via Multi-Task Incremental Transformer.
Proceedings of the 28th International Conference on Computational Linguistics, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Pattern Recognit. Lett., 2019
Concept learning through deep reinforcement learning with memory-augmented neural networks.
Neural Networks, 2019
Neurocomputing, 2019
Neurocomputing, 2019
How social media usage affects employees' job satisfaction and turnover intention: An empirical study in China.
Inf. Manag., 2019
CoRR, 2019
Modelling Speaker-dependent Auditory Attention Using A Spiking Neural Network with Temporal Coding and Supervised Learning.
Aust. J. Intell. Inf. Process. Syst., 2019
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Boosting Character-Based Chinese Speech Synthesis via Multi-Task Learning and Dictionary Tutoring.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
RevCuT Tree Search Method in Complex Single-player Game with Continuous Search Space.
Proceedings of the International Joint Conference on Neural Networks, 2019
Proceedings of the International Joint Conference on Neural Networks, 2019
Proceedings of the International Joint Conference on Neural Networks, 2019
Proceedings of the International Joint Conference on Neural Networks, 2019
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019
Efficient and Accurate Face Shape Reconstruction by Fusion of Multiple Landmark Databases.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019
Self-attention Aligner: A Latency-control End-to-end Model for ASR Using Self-attention Network and Chunk-hopping.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
A Basal Ganglia Network Centric Reinforcement Learning Model and Its Application in Unmanned Aerial Vehicle.
IEEE Trans. Cogn. Dev. Syst., 2018
Neural Networks, 2018
Neural Networks, 2018
A Brain-Inspired Decision-Making Spiking Neural Network and Its Application in Unmanned Aerial Vehicle.
Frontiers Neurorobotics, 2018
A Fast Contour Detection Model Inspired by Biological Mechanisms in Primary Vision System.
Frontiers Comput. Neurosci., 2018
Multilingual End-to-End Speech Recognition with A Single Transformer on Low-Resource Languages.
CoRR, 2018
A Brain-Inspired Decision Making Model Based on Top-Down Biasing of Prefrontal Cortex to Basal Ganglia and Its Application in Autonomous UAV Explorations.
Cogn. Comput., 2018
Toward Robot Self-Consciousness (II): Brain-Inspired Robot Bodily Self Model for Self-Recognition.
Cogn. Comput., 2018
Improving Neural Machine Translation with Conditional Sequence Generative Adversarial Nets.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Syllable-Based Acoustic Modeling with CTC for Multi-Scenarios Mandarin speech recognition.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Listen, Think and Listen Again: Capturing Top-down Auditory Attention for Speaker-independent Speech Separation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Proceedings of the 24th International Conference on Pattern Recognition, 2018
Proceedings of the 24th International Conference on Pattern Recognition, 2018
Recurrent Neural Network Based Small-footprint Wake-up-word Speech Recognition System with a Score Calibration Method.
Proceedings of the 24th International Conference on Pattern Recognition, 2018
Proceedings of the 24th International Conference on Pattern Recognition, 2018
A Comparison of Modeling Units in Sequence-to-Sequence Speech Recognition with the Transformer on Mandarin Chinese.
Proceedings of the Neural Information Processing - 25th International Conference, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
CBLDNN-Based Speaker-Independent Speech Separation Via Generative Adversarial Training.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Speech-Transformer: A No-Recurrence Sequence-to-Sequence Model for Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
Proceedings of the 27th International Conference on Computational Linguistics, 2018
Which Mapping Rule in the Fireworks Algorithm is Better for Large Scale Optimization.
Proceedings of the 2018 IEEE Congress on Evolutionary Computation, 2018
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018
Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Encoder-decoder recurrent network model for interactive character animation generation.
Vis. Comput., 2017
Neural Networks, 2017
Neurocomputing, 2017
Computación y Sistemas, 2017
Sci. China Inf. Sci., 2017
Proceedings of the Natural Language Processing and Chinese Computing, 2017
Multilingual Recurrent Neural Networks with Residual Learning for Low-Resource Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
A class-specific copy network for handling the rare word problem in neural machine translation.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017
Hierarchical Hybrid Attention Networks for Chinese Conversation Topic Classification.
Proceedings of the Neural Information Processing - 24th International Conference, 2017
Word-Level Permutation and Improved Lower Frame Rate for RNN-Based Acoustic Modeling.
Proceedings of the Neural Information Processing - 24th International Conference, 2017
Proceedings of the Neural Information Processing - 24th International Conference, 2017
Proceedings of the Neural Information Processing - 24th International Conference, 2017
Proceedings of the Neural Information Processing - 24th International Conference, 2017
Combining unidirectional long short-term memory with convolutional output layer for high-performance speech synthesis.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
Joint Extraction of Multiple Relations and Entities by Using a Hybrid Neural Network.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2017
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2017
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
2016
A neural network framework for relation extraction: Learning entity semantic and relation pattern.
Knowl. Based Syst., 2016
HCNN: A Neural Network Model for Combining Local and Global Features Towards Human-Like Classification.
Int. J. Pattern Recognit. Artif. Intell., 2016
Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification.
Neurocomputing, 2016
Parallel Brain Simulator: A Multi-scale and Parallel Brain-Inspired Neural Network Modeling and Simulation Platform.
Cogn. Comput., 2016
Proceedings of the 2016 IEEE/WIC/ACM International Conference on Web Intelligence, 2016
Proceedings of the 2016 IEEE International Conference on Systems, Man, and Cybernetics, 2016
Proceedings of the 2016 IEEE International Conference on Systems, Man, and Cybernetics, 2016
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2016
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Applying connectionist temporal classification objective function to Chinese Mandarin speech recognition.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Multidimensional Residual Learning Based on Recurrent Neural Networks for Acoustic Modeling.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
First Step Towards End-to-End Parametric TTS Synthesis: Generating Spectral Parameters with Neural Attention.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Gating recurrent mixture density networks for acoustic modeling in statistical parametric speech synthesis.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Text Classification Improved by Integrating Bidirectional LSTM with Two-dimensional Max Pooling.
Proceedings of the COLING 2016, 2016
Proceedings of the COLING 2016, 2016
Proceedings of the COLING 2016, 2016
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016
Proceedings of the Brain Informatics and Health - International Conference, 2016
Proceedings of the Brain Informatics and Health - International Conference, 2016
Proceedings of the Brain Informatics and Health - International Conference, 2016
A Spiking Neural Network Based Autonomous Reinforcement Learning Model and Its Application in Decision Making.
Proceedings of the Advances in Brain Inspired Cognitive Systems, 2016
Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
2015
Proceedings of the 24th International Conference on World Wide Web Companion, 2015
Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2015
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2015
Bilingually-Constrained Recursive Neural Networks with Syntactic Constraints for Hierarchical Translation Model.
Proceedings of the Natural Language Processing and Chinese Computing - 4th CCF Conference, 2015
Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing, 2015
Proceedings of the 2015 IEEE International Conference on Intelligence and Security Informatics, 2015
Towards end-to-end speech recognition for Chinese Mandarin using long short-term memory recurrent neural networks.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2015
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015
2014
Multim. Syst., 2014
Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), Warsaw, Poland, August 11-14, 2014, 2014
Proceedings of the Social Media Processing - Third National Conference, 2014
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014
Proceedings of the Natural Language Processing and Chinese Computing, 2014
Proceedings of the Natural Language Processing and Chinese Computing, 2014
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the IEEE Joint Intelligence and Security Informatics Conference, 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Investigation of stochastic Hessian-Free optimization in Deep neural networks for speech recognition.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Improving wideband acoustic models using mixed-bandwidth training data via DNN adaptation.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
An empirical study of multilingual and low-resource spoken term detection using deep neural networks.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
A robust framework for short text categorization based on topic model and integrated classifier.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014
Proceedings of the Neural Information Processing - 21st International Conference, 2014
Image character recognition using deep convolutional neural network learned from different languages.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
An investigation of summed-channel speaker recognition with multi-session enrollment.
Proceedings of the IEEE International Conference on Acoustics, 2014
Recursive neural network based word topology model for hierarchical phrase-based speech translation.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Chinese Image Character Recognition Using DNN and Machine Simulated Training Samples.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2014, 2014
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2014
Neuronal Morphology Modeling Based on Microscopy Reconstruction Data in the Public Repositories.
Proceedings of the Brain Informatics and Health - International Conference, 2014
Proceedings of the 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2014
Proceedings of the Advances in Artificial Intelligence, 2014
Learning New Semi-Supervised Deep Auto-encoder Features for Statistical Machine Translation.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
2013
Proceedings of the IEEE International Conference on Systems, 2013
Proceedings of the Natural Language Processing and Chinese Computing, 2013
Pseudo In-Domain Data Selection from Large-Scale Web Corpus for Spoken Language Translation.
Proceedings of the Natural Language Processing and Chinese Computing, 2013
Proceedings of the Natural Language Processing and Chinese Computing, 2013
Fusion of Audio-Visual Features and Statistical Property for Commercial Segmentation.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013
A general Framework of video segmentation to logical unit based on conditional random fields.
Proceedings of the International Conference on Multimedia Retrieval, 2013
CASIA-KB: A Multi-source Chinese Semantic Knowledge Base Built from Structured and Unstructured Web Data.
Proceedings of the Semantic Technology - Third Joint International Conference, 2013
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013
Joint and Coupled Bilingual Topic Model Based Sentence Representations for Language Model Adaptation.
Proceedings of the IJCAI 2013, 2013
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Integrating Multi-source Bilingual Information for Chinese Word Segmentation in Statistical Machine Translation.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2013
2012
J. Comput. Sci. Technol., 2012
From English pitch accent detection to Mandarin stress detection, where is the difference?
Comput. Speech Lang., 2012
Statistical and Structural Analysis of Web-Based Collaborative Knowledge Bases Generated from Wiki Encyclopedia.
Proceedings of the 2012 IEEE/WIC/ACM International Conferences on Web Intelligence, 2012
Phrase-based data selection for language model adaptation in spoken language translation.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Effective near-duplicate image retrieval with image-specific visual phrase selection.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012
Discriminative training of weighted polynomial vector for acoustic language recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Unsupervised training of subspace gaussian mixture models for conversational telephone speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
TV commercial detection using constrained viterbi algorithm based on time distribution.
Proceedings of the 9th International Conference on Fuzzy Systems and Knowledge Discovery, 2012
Translation Model Based Cross-Lingual Language Model Adaptation: from Word Models to Phrase Models.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012
Automated Essay Scoring Based on Finite State Transducer: towards ASR Transcription of Oral English Speech.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012
2011
Sci. China Inf. Sci., 2011
Comput. Graph., 2011
Comput. Aided Geom. Des., 2011
Data-Driven UBM Generation via Tied Gaussians for GMM-Supervector Based Accent Identification.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Restoring the Residual Speaker Information in Total Variability Modeling for Speaker Verification.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Automatic Prosodic Events Detection by Using Syllable-Based Acoustic, Lexical and Syntactic Features.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Context-Dependent Duration Modeling with Backoff Strategy and Look-Up Tables for Pronunciation Assessment and Mispronunciation Detection.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 2011 International Joint Conference on Neural Networks, 2011
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Exploring nuisance attribute projection and score normalization for GLDS-SVM based automatic mispronunciation detection method.
Proceedings of the IEEE International Conference on Acoustics, 2011
Structured precision modelling with Cholesky Basis Superposition for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
EURASIP J. Audio Speech Music. Process., 2010
Comput. Speech Lang., 2010
Proceedings of the 4th International Universal Communication Symposium, 2010
Proceedings of the 4th International Universal Communication Symposium, 2010
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
A new approach for automatic tone error detection in strong accented Mandarin based on dominant set.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
On the use of Gaussian component information in the generative likelihood ratio estimation for speaker verification.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
An investigation into direct scoring methods without SVM training in speaker verification.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Automatic reference independent evaluation of prosody quality using multiple knowledge fusions.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Mandarin stress detection using hierarchical model based boosting classification and regression tree.
Proceedings of the International Joint Conference on Neural Networks, 2010
Simplified Residual Factor Analysis for Text-Independent Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
High performance automatic mispronunciation detection method based on neural network and TRAP features.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Automatic pronunciation error detection based on linguistic knowledge and pronunciation space.
Proceedings of the IEEE International Conference on Acoustics, 2009
An efficient mispronunciation detection method using GLDS-SVM and formant enhanced features.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Context Dependent Feature Based Bottom-up Rescoring SVM Classifier in Children's English Stress Mis-pronunciation Detection.
Proceedings of the 9th IEEE International Conference on Advanced Learning Technologies, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Automatic Prosody Boundary Labeling of Mandarin Using Both Text and Acoustic Information.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Improving searching speed and accuracy of query by humming system based on three methods: feature fusion, candidates set reduction and multiple similarity measurement rescoring.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the Fourth International Conference on Natural Computation, 2008
Query by humming via multiscale transportation distance in random query occurrence context.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
Proceedings of the IJCAI 2007, 2007
A Novel Phone-State Matrix Based Vocabulary-Independent Keyword Spotting Method for Spontaneous Speech.
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the EMNLP-CoNLL 2007, 2007
Proceedings of the Machine Learning: ECML 2007, 2007
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007
2006
Monaural Speech Separation Based on Computational Auditory Scene Analysis and Objective Quality Assessment of Speech.
IEEE Trans. Speech Audio Process., 2006
An approach to automatic acquisition of translation templates based on phrase structure extraction and alignment.
IEEE Trans. Speech Audio Process., 2006
A Fast Framework for the Constrained Mean Trajectory Segment Model by Avoidance of Redundant Computation on Segment.
Int. J. Comput. Linguistics Chin. Lang. Process., 2006
Int. J. Comput. Linguistics Chin. Lang. Process., 2006
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006
Multi-Pitch Detection for Co-Channel Speech Utilizing Frequency Channel Piecewise Integration and Morphological Feedback Verification Tracking.
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition.
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006
A quality measure method using Gaussian mixture models and divergence measure for speaker identification.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006
Proceedings of the Advances in Biometrics, International Conference, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
A Novel Noise Robust Front-End Using First Order VTS in Construction of Mel-Warped Wiener Filter.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
An Improved Mandarin Keyword Spotting System Using MCE Training and Context-Enhanced Verification.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Applying Pitch Target Model to Convert F0 Contour for Expressive Mandarin Speech Synthesis.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Proceedings of the Fifth Workshop on Chinese Language Processing, 2006
2005
Proceedings of the HLT/EMNLP 2005, 2005
Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005
Proceedings of the ISMIR 2005, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the Information Retrieval Technology, 2005
Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing, 2005
Proceedings of the Affective Computing and Intelligent Interaction, 2005
Proceedings of the Affective Computing and Intelligent Interaction, 2005
2004
IEEE Signal Process. Lett., 2004
Outline of Research Activities on Speech-to-speech Translation in Institute of Automation, Chinese Academy of Sciences.
J. Chin. Lang. Comput., 2004
Cross-Language Acoustic Modeling in Large Vocabulary Continuous Speech Recognition.
J. Chin. Lang. Comput., 2004
Hand-Free Speech Recognition in Adverse Environment with Microphone Arrays.
J. Chin. Lang. Comput., 2004
A Novel Polyspectra-Based End Point Detector In Noisy Environments.
J. Chin. Lang. Comput., 2004
A co-chunk based method for spoken-language translation.
J. Chin. Lang. Comput., 2004
Research on IF-based Chinese and English Generation Approach.
J. Chin. Lang. Comput., 2004
Int. J. Speech Technol., 2004
Proceedings of the Thirteenth Text REtrieval Conference, 2004
Proceedings of the IEEE International Conference on Systems, 2004
Improvement of Speaker Identification by Combining Prosodic Features with Acoustic Features.
Proceedings of the Advances in Biometric Person Authentication, 2004
Text-independent speaker identification using GMM-UBM and frame level likelihood normalization.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004
A new multicomponent AM-FM demodulation with predicting frequency boundaries and its application to formant estimation.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Multi-layer structure MLLR adaptation algorithm with subspace regression classes and tying.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Combining agglomerative and tree-based state clustering for high accuracy acoustic modeling.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Exploring high-performance speech recognition in noisy environments using high-order taylor series expansion.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Bilingual Chunk Alignment Based on Interactional Matching and Probabilistic Latent Semantic Indexing.
Proceedings of the Natural Language Processing, 2004
Proceedings of the Natural Language Processing, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
Proceedings of The Twelfth Text REtrieval Conference, 2003
Geometric constrained maximum likelihood linear regression on Mandarin dialect adaptation.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Statistical speech-to-speech translation with multilingual speech recognition and bilingual-chunk parsing.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Joint model and feature based compensation for robust speech recognition under non-stationary noise environments.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Discriminative optimization of large vocabulary Mandarin conversational speech recognition system.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Sequential MAP estimation based speech feature enhancement for noise robust speech recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
A vector statistical piecewise polynomial approximation algorithm for environment compensation in telephone LVCSR.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Fast speaker adaptation using triple diagonal and shared block diagonal transform matrices.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2003
Proceedings of the Workshop on Multilingual and Mixed-language Named Entity Recognition, 2003
2002
Proceedings of the SIGDIAL 2002 Workshop, 2002
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Comparison between the spectral estimation techniques by different spectral-distortion measures.
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Accuracy improving method for parametric trajectory modeling and its use in A* search.
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Structure-based compensation using an improved statistical linear approximation for Mandarin speech recognition over telephone.
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Comparisons of MLLR and CDCN for speech recognition in additive noise by experiments.
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Improving parametric trajectory modeling by integration of pitch and tone information.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Codebook dependent dynamic channel estimation for Mandarin speech recognition over telephone.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Chinese spoken language analyzing based on combination of statistical and rule methods.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002
Proceedings of the IEEE International Conference on Acoustics, 2002
Using nonstandard SVM for combination of Speaker Verification and Verbal Information Verification in speaker authentication system.
Proceedings of the IEEE International Conference on Acoustics, 2002
Including detailed information feature in MFCC for large vocabulary continuous speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2002
Proceedings of the IEEE International Conference on Acoustics, 2002
Proceedings of the IEEE International Conference on Acoustics, 2002
Proceedings of the 19th International Conference on Computational Linguistics, 2002
Proceedings of the Workshop on Speech-to-Speech Translation: Algorithms and Systems@ACL 2002, 2002
2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
2000
Design And Implementation of A Chinese-To-English Spoken Language Translation System.
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000
Block Analysis of Bilingual Corpus for Chinese-English Statistical Machine Translation.
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000
Statistical Approach to Chinese-English Spoken-language Translation in Hotel Reservation Domain.
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Incorporating HMM-state sequence confusion for rapid MLLR adaptation to new speakers.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Approach to Recognition and Understanding of the Time Constituents in the Spoken Chinese Language Translation.
Proceedings of the Advances in Multimodal Interfaces, 2000
Statistical Analysis of Chinese Language and Language Modeling Based on Huge Text Corpora.
Proceedings of the Advances in Multimodal Interfaces, 2000
Mandarin accent adaptation based on context-independent/context-dependent pronunciation modeling.
Proceedings of the IEEE International Conference on Acoustics, 2000
Acoustic modeling for Chinese speech recognition: a comparative study of Mandarin and Cantonese.
Proceedings of the IEEE International Conference on Acoustics, 2000
Proceedings of the IEEE International Conference on Acoustics, 2000
1999
Regression class selection and speaker adaptation with MLLR in Mandarin continuous speech recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
1998
Class-Triphone Acoustic Modelling Based On Decision Tree for Mandarin Continuous Speech Recognition.
Proceedings of the 1998 International Symposium on Chinese Spoken Language Processing, 1998
Proceedings of the 1998 International Symposium on Chinese Spoken Language Processing, 1998
A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
1992
Proceedings of the Second International Conference on Spoken Language Processing, 1992
1991
Proceedings of the 1991 International Conference on Acoustics, 1991