Xinhui Hu

Orcid: 0000-0001-9847-0788

According to our database1, Xinhui Hu authored at least 58 papers between 1994 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Beyond Functionality: Co-Designing Voice User Interfaces for Older Adults' Well-being.
CoRR, 2024

SC-MoE: Switch Conformer Mixture of Experts for Unified Streaming and Non-streaming Code-Switching ASR.
CoRR, 2024

Developing Library and Data Storytelling Toolkits: Scenarios and Personas.
Proceedings of the Wisdom, Well-Being, Win-Win, 2024

A Deep Representation Learning-Based Speech Enhancement Method Using Complex Convolution Recurrent Variational Autoencoder.
Proceedings of the IEEE International Conference on Acoustics, 2024

The Royalflush Automatic Speech Diarization and Recognition System for In-Car Multi-Channel Automatic Speech Recognition Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2024

Learning Emotion-Invariant Speaker Representations for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Advancing the study of Large-Scale Learning in Overlapped Speech Detection.
CoRR, 2023

Semi-supervised Multimodal Emotion Recognition with Consensus Decision-making and Label Correction.
Proceedings of the 1st International Workshop on Multimodal and Responsible Affective Computing, 2023

A Scoping Review of Mental Model Research in HCI from 2010 to 2021.
Proceedings of the HCI International 2023 - Late Breaking Papers, 2023

Using Experience-Based Participatory Approach to Design Interactive Voice User Interfaces for Delivering Physical Activity Programs with Older Adults.
Proceedings of the International Conference on Human-Agent Interaction, 2023

LE-SSL-MOS: Self-Supervised Learning MOS Prediction with Listener Enhancement.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Hybrid-Regressive Paradigm for Accurate and Speed-Robust Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Hybrid-Regressive Neural Machine Translation.
CoRR, 2022

The Royalflush System for VoxCeleb Speaker Recognition Challenge 2022.
CoRR, 2022

Royalflush Speaker Diarization System for ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge.
CoRR, 2022

Multiple Enhancements to LSTM for Learning Emotion-Salient Features in Speech Emotion Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

The Royalflush System of Speech Recognition for M2met Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2022

Toward Designing Trustworthy Autonomous Systems: Probing the Role of Humans' Ethical Perspectives.
Proceedings of the 25th IEEE International Conference on Computer Supported Cooperative Work in Design, 2022

2021
Bursting through the blocks in the human mind: enhancing creativity with extended reality technologies.
Interactions, 2021

An End-to-End Dialect Identification System with Transfer Learning from a Multilingual Automatic Speech Recognition Model.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

An Investigation of Using Hybrid Modeling Units for Improving End-to-End Speech Recognition System.
Proceedings of the IEEE International Conference on Acoustics, 2021

Hierarchical Prosody Analysis Improves Categorical and Dimensional Emotion Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Data Augmentation for Code-Switch Language Modeling by Fusing Multiple Text Generation Methods.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

The RoyalFlush Synthesis System for Blizzard Challenge 2020.
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

2019
The RoyalFlush Synthesis System for Blizzard Challenge 2019.
Proceedings of the Blizzard Challenge 2019, Vienna, Austria, September 23, 2019, 2019

2018
Classification of surface electromyogram signals based on directed acyclic graphs and support vector machines.
Turkish J. Electr. Eng. Comput. Sci., 2018

A High Precision Recommendation Algorithm Based on Combination Features.
Proceedings of the Database Systems for Advanced Applications, 2018

2016
Combination of multiple acoustic models with unsupervised adaptation for lecture speech transcription.
Speech Commun., 2016

2015
Economical Aspects of Resource Allocation under Discounts.
PhD thesis, 2015

Competitive Strategies for Online Cloud Resource Allocation with Discounts: The 2-Dimensional Parking Permit Problem.
Proceedings of the 35th IEEE International Conference on Distributed Computing Systems, 2015

A Myanmar large vocabulary continuous speech recognition system.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
The NCT ASR system for IWSLT 2014.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

Mandarin speech recognition using convolution neural network with augmented tone features.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Translating TED speeches by recurrent neural network based translation model.
Proceedings of the IEEE International Conference on Acoustics, 2014

Incorporating tone features to convolutional neural network to improve Mandarin/Thai speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
Collecting Colloquial and Spontaneous-like Sentences from Web Resources for Constructing Chinese Language Models of Speech Recognition.
J. Inf. Process., 2013

Overview of the NTCIR-10 SpokenDoc-2 Task.
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Multilingual Speech-to-Speech Translation System: VoiceTra.
Proceedings of the 2013 IEEE 14th International Conference on Mobile Data Management, Milan, Italy, June 3-6, 2013, 2013

Optimal Migration Contracts in Virtual Networks: Pay-as-You-Come vs Pay-as-You-Go Pricing.
Proceedings of the Distributed Computing and Networking, 14th International Conference, 2013

2012
Distributed speech translation technologies for multiparty multilingual communication.
ACM Trans. Speech Lang. Process., 2012

Collecting sentences from web resources for constructing spontaneous Chinese language model.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

2010
Constructing Japanese test collections for spoken term detection.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Construction and evaluations of an annotated Chinese conversational corpus in travel domain for the language model of speech recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Cluster-based language model for spoken document retrieval using NMF-based document clustering.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Spoken document retrieval using topic models.
Proceedings of the 3rd International Universal Communication Symposium, 2009

Japanese Spontaneous Spoken Document Retrieval Using NMF-Based Topic Models.
Proceedings of the Information Retrieval Technology, 2009

Construction of Chinese Segmented and POS-tagged Conversational Corpora and Their Evaluations on Spontaneous Speech Recognitions.
Proceedings of the 7th Workshop on Asian Language Resources, 2009

2008
Using Mutual Information Criterion to Design an Efficient Phoneme Set for Chinese Speech Recognition.
IEICE Trans. Inf. Syst., 2008

Utilization of Huge Written Text Corpora for Conversational Speech Recognition.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

2007
A Priority MAC Protocol for Ad Hoc Networks with Multiple Channels.
Proceedings of the IEEE 18th International Symposium on Personal, 2007

Learning Unsupervised SVM Classifier for Answer Selection in Web Question Answering.
Proceedings of the EMNLP-CoNLL 2007, 2007

Mining redundancy in candidate-bearing snippets to improve web question answering.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

2006
Chinese Character-based Segmentation & POS-tagging and Named Entity Identification with a CRF Chunker.
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

Language modeling of Chinese personal names based on character units for continuous Chinese speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Automatic Derivation of a Phoneme Set with Tone Information for Chinese Speech Recognition Based on Mutual Information Criterion.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

1995
Tone Recognition of Chinese Dissyllables Using Hidden Markov Models.
IEICE Trans. Inf. Syst., 1995

HMM-based tone recognition of Chinese trisyllables using double codebooks on fundamental frequency and waveform power.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1994
Recognition of Chinese tones in monosyllabic and disyllabic speech using HMM.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994


  Loading...