Dong-Yan Huang

Orcid: 0000-0001-8844-8579

According to our database, Dong-Yan Huang authored at least 93 papers between 1995 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.








ParaLBench: A Large-Scale Benchmark for Computational Paralinguistics over Acoustic Foundation Models.
CoRR, 2024

Electronic sensing combined with machine learning models for predicting soil nutrient content.
Comput. Electron. Agric., 2024

Computation Offloading Strategies for LEO Satellite Edge Computing Systems Based on Different Multiple Access Methods.
IEEE Access, 2024

Automatically Select the Training Loss based on the Data Distribution for Talking Head Synthesis.
Proceedings of the IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications, 2024

An Efficient DAG Blockchain Architecture for IoT.
IEEE Internet Things J., January, 2023

Multi-Layer Resource Computing Offloading Based on Power Price Difference.
Proceedings of the 23rd IEEE International Conference on Communication Technology, 2023

LEO laser microwave hybrid inter-satellite routing strategy based on modified Q-routing algorithm.
EURASIP J. Wirel. Commun. Netw., 2022

IEEE SLT 2021 Alpha-Mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Improving Model Stability and Training Efficiency in Fast, High Quality Expressive Voice Conversion System.
Proceedings of the ICMI '21 Companion: Companion Publication of the 2021 International Conference on Multimodal Interaction, Montreal, QC, Canada, October 18, 2021

ASMMC21: The 6th International Workshop on Affective Social Multimedia Computing.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

FER by Modeling the Conditional Independence between the Spatial Cues and the Spatial Attention Distributions.
Proceedings of the ICMI '21 Companion: Companion Publication of the 2021 International Conference on Multimodal Interaction, Montreal, QC, Canada, October 18, 2021

A Profit Maximization Strategy of MEC Resource Provider in the Satellite-Terrestrial Double Edge Computing System.
Proceedings of the 21st International Conference on Communication Technology, 2021

LSSED: A Large-Scale Dataset and Benchmark for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Performance Analysis of the Raft Consensus Algorithm for Private Blockchains.
IEEE Trans. Syst. Man Cybern. Syst., 2020

Feature Point Registration Model of Farmland Surface and Its Application Based on a Monocular Camera.
Sensors, 2020

Adaptive Domain-Aware Representation Learning for Speech Emotion Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

A Joint Computation Offloading and Resource Allocation Strategy for LEO Satellite Edge Computing System.
Proceedings of the 20th IEEE International Conference on Communication Technology, 2020

Anomaly Detection for Consortium Blockchains Based on Machine Learning Classification Algorithm.
Proceedings of the Computational Data and Social Networks - 9th International Conference, 2020

A Novel Method for Soil Organic Matter Determination by Using an Artificial Olfactory System.
Sensors, 2019

High-quality Speech Synthesis Using Super-resolution Mel-Spectrogram.
CoRR, 2019

Context-aware Deep Learning for Multi-modal Depression Detection.
Proceedings of the IEEE International Conference on Acoustics, 2019

Discriminative Feature Learning for Speech Emotion Recognition.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2019: Text and Time Series, 2019

What Affects the Performance of Convolutional Neural Networks for Audio Event Classification.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2019

Speech Emotion Recognition using Spectral Normalized CycleGAN.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2019

ASMMC-MMAC 2018: The Joint Workshop of 4th the Workshop on Affective Social Multimedia Computing and first Multi-Modal Affective Computing of Large-Scale Multimedia Data Workshop.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Investigation on Joint Representation Learning for Robust Feature Extraction in Speech Emotion Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

The I2R-NWPU-NUS Text-to-Speech System for Blizzard Challenge 2018.
Proceedings of the Blizzard Challenge 2018, Hyderabad, India, September 8, 2018, 2018

On the Localization Algorithm of Wireless Sensor Network and Its Application.
Int. J. Online Eng., 2017

Denoising Recurrent Neural Network for Deep Bidirectional LSTM Based Voice Conversion.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Multimodal Prediction of Affective Dimensions via Fusing Multiple Regression Techniques.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

The Study of the Work Parameters of the Corn Harvester Cutter.
Proceedings of the Computer and Computing Technologies in Agriculture XI, 2017

Application of Growth Curve in Agricultural Scientific Research.
Proceedings of the Computer and Computing Technologies in Agriculture XI, 2017

Audio-visual emotion recognition using deep transfer learning and multiple temporal models.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

The I2R-NWPU Text-to-Speech System for Blizzard Challenge 2017.
Proceedings of the Blizzard Challenge 2017, Stockholm, Sweden, August 25, 2017, 2017

Statistical parametric speech synthesis using generative adversarial networks under a multi-task learning framework.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Voichap: A standalone real-time voice change application on iOS platform.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Speech emotion recognition via ensembling neural networks.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Facial action recognition using very deep networks for highly imbalanced class distribution.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

An Automatic Voice Conversion Evaluation Strategy Based on Perceptual Background Noise Distortion and Speaker Similarity.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

I2RNTU at SemEval-2016 Task 4: Classifier Fusion for Polarity Classification in Twitter.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Deep Bidirectional LSTM Modeling of Timbre and Prosody for Emotional Voice Conversion.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Audio and face video emotion recognition in the wild using deep neural networks and small datasets.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

Deep neural network derived bottleneck features for accurate audio classification.
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

Exemplar-based sparse representation of timbre and prosody for voice conversion.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Combining multiple kernel models for automatic intelligibility detection of pathological speech.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Regularized non-negative matrix factorization using alternating direction method of multipliers and its application to source separation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

An alternating optimization approach for phase retrieval.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A real-time variable-q non-stationary Gabor transform for pitch shifting.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Performance scoring of singing voice.
Proceedings of the 2015 International Conference on Asian Language Processing, 2015

Perceptual speech quality improvement for vocoder based on amplitude spectrum of residual signal.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Non-negative matrix factorization using stable alternating direction method of multipliers for source separation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Mapping frames with DNN-HMM recognizer for non-parallel voice conversion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Fundamental frequency modeling using wavelets for emotional voice conversion.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Speaker state classification based on fusion of asymmetric simple partial least squares (SIMPLS) and support vector machines.
Comput. Speech Lang., 2014

Energy-efficient sleep strategy for distributed MIMO systems.
Proceedings of the 25th IEEE Annual International Symposium on Personal, 2014

Soft constrained leading voice separation with music score guidance.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Acoustic emotion recognition based on fusion of multiple feature-dependent deep Boltzmann machines.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

I²R speech2singing perfects everyone's singing.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Intelligibility detection of pathological speech using asymmetric sparse kernel partial least squares classifier.
Proceedings of the IEEE International Conference on Acoustics, 2014

Learning optimal features for music transcription.
Proceedings of the IEEE China Summit & International Conference on Signal and Information Processing, 2014

Energy-efficient subcarrier-bit-power allocation based on genetic algorithm.
Proceedings of the 9th International Conference on Communications and Networking in China, 2014

Emotional facial expression transfer based on temporal restricted Boltzmann machines.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Ensemble Nyström method for predicting conflict level from speech.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Energy-Efficient Spectrum Sensing Strategy in Cognitive Radio Networks.
IEEE Commun. Lett., 2013

Optimized cognitive terminal assignment strategy for coordinated spectrum sensing.
Proceedings of the 24th IEEE Annual International Symposium on Personal, 2013

A dynamic Gaussian process for voice conversion.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Prioritized Spectrum Sensing Scheme Based on Semi-Markov Process.
Proceedings of the 75th IEEE Vehicular Technology Conference, 2012

An energy-efficient cooperative MIMO strategy for Wireless Sensor Networks with intra-body channel.
Proceedings of the International Symposium on Communications and Information Technologies, 2012

A comparison of SVM and asymmetric SIMPLS in emotion recognition from naturalistic dialogues.
Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

Detecting Intelligibility by Linear Dimensionality Reduction and Normalized Voice Quality Hierarchical Features.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Speaker State Classification Based on Fusion of Asymmetric SIMPLS and Support Vector Machines.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

High level emotional speech morphing using STRAIGHT.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Lombard effect mimicking.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Biologically inspired algorithm for enhancement of speech intelligibility over telephone channel.
Proceedings of the 2009 IEEE International Workshop on Multimedia Signal Processing, 2009

The Misadjustment of the Cascaded LMS Prediction Filter.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

I2R Text-to-Speech System for Blizzard Challenge 2009.
Proceedings of the Blizzard Challenge 2009, Edinburgh, Scotland, UK, September 4, 2009, 2009

Cascaded RLS-LMS Prediction in MPEG-4 Lossless Audio Coding.
IEEE Trans. Speech Audio Process., 2008

Convergence Performance of the Cascaded RLS-LMS Prediction.
Proceedings of the 67th IEEE Vehicular Technology Conference, 2008

Eigenstructure algorithms for multirate adaptive lossless FIR filters.
IEEE Trans. Signal Process., 2006

A performance bound for a cascade LMS predictor.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Software simulation tools on forward error correction schemes for the wireless transmission of MPEG4 AAC audio bitstreams.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Characterization of a cascade LMS predictor.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Speech pitch detection in noisy environment using multi-rate adaptive lossless FIR filters.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

Sensitivity analysis of a cascade RLS-LMS algorithm for different resolution audio signals.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Performance analysis of an RLS-LMS algorithm for lossless audio compression.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Soft decision unequal error protection scheme for MPEG advanced audio coding.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Integer fast modified cosine transform.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Robust and Inaudible Multi-echo Audio Watermarking.
Proceedings of the Advances in Multimedia Information Processing, 2002

An Attack Processing of Audio Signal for Reducing Pre-echo in a Low Bit-Rate Audio Coding System.
Proceedings of the Signal and Image Processing (SIP), 1999

Implementation of the MPEG-4 advanced audio coding encoder on ADSP-21060 SHARC.
Proceedings of the 1999 International Symposium on Circuits and Systems, ISCAS 1999, Orlando, Florida, USA, May 30, 1999

Comparison of two eigenstructure algorithms for lossless multirate filter optimization.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

An adaptive projection algorithm for multirate filter bank optimization.
Proceedings of the 8th European Signal Processing Conference, 1996

Attainable error bounds in multirate adaptive lossless FIR filters.
Proceedings of the 1995 International Conference on Acoustics, 1995
