Mao-shen Jia

Orcid: 0000-0002-3452-3913

According to our database, Mao-shen Jia authored at least 68 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Exploring the power of pure attention mechanisms in blind room parameter estimation.
EURASIP J. Audio Speech Music. Process., December, 2024

TIM-Net: A multi-label classification network for TCM tongue images fusing global-local features.
IET Image Process., May, 2024

Joint DOA Estimation and Dereverberation Based on Multi-Channel Linear Prediction Filtering and Azimuth Sparsity.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Harmonic-Aware Frequency and Time Attention for Automatic Piano Transcription.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

First-Order Relative Harmonic Coefficient-Based Time-Frequency Points Selection for Multi-Source DOA Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Three-Dimensional Room Transfer Function Parameterization Based on Multiple Concentric Planar Circular Arrays.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

A distortionless convolution beamformer design method based on the weighted minimum mean square error for joint dereverberation and denoising.
Speech Commun., 2024

SS-BRPE: Self-Supervised Blind Room Parameter Estimation Using Attention Mechanisms.
CoRR, 2024

Attention Is All You Need For Blind Room Volume Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Deep encoder/decoder dual-path neural network for speech separation in noisy reverberation environments.
EURASIP J. Audio Speech Music. Process., December, 2023

Separation of Multiple Speech Sources in Reverberant Environments Based on Sparse Component Enhancement.
Circuits Syst. Signal Process., October, 2023

Multisource localization based on angle distribution of time-frequency points using an FOA microphone.
CAAI Trans. Intell. Technol., September, 2023

Diffuseness Estimation-Based SSTP Detection for Multiple Sound Source Localization in Reverberant Environments.
Circuits Syst. Signal Process., August, 2023

Study of MVDR Beamforming with Spatially Distributed Source: Theoretical Analysis and Efficient Microphone Array Geometry Optimization Method.
Circuits Syst. Signal Process., August, 2023

Multiple-Speech-Source DOA Estimation Based on Single-Source Cluster Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Multi-Source Localization Using Optimized Time-Frequency Representation and Sparsity Component Analysis.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Adaptive learning Unet-based adversarial network with CNN and transformer for segmentation of hard exudates in diabetes retinopathy.
IET Image Process., 2023

Sound Source Localization by Combining Phase Consistency and Angle Deviation.
Proceedings of the 9th International Conference on Computing and Artificial Intelligence, 2023

Single Source Zone Detection in the Spherical Harmonic Domain for Multisource Localization.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Speech Enhancement With Robust Beamforming for Spatially Overlapped and Distributed Sources.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Cross-corpus speech emotion recognition using subspace learning and domain adaption.
EURASIP J. Audio Speech Music. Process., 2022

A 3D U-Net-Based Approach for Intracranial Aneurysm Detection.
Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition, 2022

A Symmetric Dual-Attention Generative Adversarial Network with Channel and Spatial Features Fusion.
Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition, 2022

DOA Estimation of Multiple Sources based on the Angle Distribution of Time-frequency Points in Single-source Zone.
Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition, 2022

Speech Recognition Method based on CTC Multilayer Loss.
Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition, 2022

Speech Emotion Recognition by using Philips Fingerprint and Spectral Entropy.
Proceedings of the ICCAI '22: 8th International Conference on Computing and Artificial Intelligence, Tianjin, China, March 18, 2022

2021
Multi-Source DOA Estimation in Reverberant Environments by Jointing Detection and Modeling of Time-Frequency Points.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Multi-source localization by using offset residual weight.
EURASIP J. Audio Speech Music. Process., 2021

Person Re-identification Based on Hash.
Proceedings of the Intelligent Computing Theories and Application, 2021

A Hierarchical Retrieval Method Based on Hash Table for Audio Fingerprinting.
Proceedings of the Intelligent Computing Theories and Application, 2021

Multi-source Localization by Using the Correlation between Single-Source Components.
Proceedings of the ICCPR '21: 10th International Conference on Computing and Pattern Recognition, Shanghai, China, October 15, 2021

A multi-source localization method based on clustering and outlier removal.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Multiple Sound Source Separation by Jointing Single Source Zone Detection and Linearly Constrained Minimum Variance.
Proceedings of the ICCPR 2020: 9th International Conference on Computing and Pattern Recognition, Xiamen, China, October 30, 2020

Multiple Speech Source Separation by Using MVDR for B-Format Recordings.
Proceedings of the ICCPR 2020: 9th International Conference on Computing and Pattern Recognition, Xiamen, China, October 30, 2020

2019
Sound Field Reproduction in Reverberant Room Using the Alternating Direction Method of Multipliers Based Lasso and Regularized Least-Square.
Proceedings of the Intelligent Computing Theories and Application, 2019

Multiple Sound Sources Localization by using Statistical Source Component Equalization.
Proceedings of the ICCPR '19: 8th International Conference on Computing and Pattern Recognition, 2019

2018
Design of a Planar First-Order Loudspeaker Array for Global Active Noise Control.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Separation of multiple speech sources by recovering sparse and non-sparse components from B-format microphone recordings.
Speech Commun., 2018

Multiple Sound Sources Localization with Frame-by-Frame Component Removal of Statistically Dominant Source.
Sensors, 2018

Multiple Source Localization by Using Improved Single Source Bins Detection.
J. Inf. Hiding Multim. Signal Process., 2018

Multiple Speech Source Separation with Non-Sparse Components Recovery by Using Dual Similarity Determination.
IEICE Trans. Inf. Syst., 2018

Sound Field Reproduction via the Alternating Direction Method of Multipliers Based Lasso Plus Regularized Least-Square.
IEEE Access, 2018

Optical Character Detection and Recognition for Image-Based in Natural Scene.
Proceedings of the Intelligent Computing Methodologies - 14th International Conference, 2018

2017
Real-time multiple sound source localization and counting using a soundfield microphone.
J. Ambient Intell. Humaniz. Comput., 2017

Simulating the Three-Dimensional Room Transfer Function for a Rotatable Complex Source.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017

Multiple audio source separation by using intra-object-sparsity encoding framework.
Proceedings of the 2017 IEEE International Conference on Signal Processing, 2017

Multiple source localization by using energy weighted single source zone detection.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
HMM-based cue parameters estimation for speech enhancement.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Measurement of the acoustic transfer function using compressed sensing techniques.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Encoding Multiple Audio Objects Using Intra-Object Sparsity.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

3D multizone soundfield reproduction using spherical harmonic analysis.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Conversion of multichannel sound signals based on spherical harmonics with L1-norm constraint.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

3D multizone soundfield reproduction in the reverberant room using a spherical loudspeaker array.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

An analysis-by-synthesis encoding approach for multiple audio objects.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
The design of Ambisonic reproduction system based on dynamic gain parameters.
Proceedings of the IEEE International Conference on Acoustics, 2014

The design of ambisonics decoders for irregular speaker array conforming to subjective perception.
Proceedings of the International Conference on Audio, 2014

Relative distance estimation in multi-channel spatial audio signal.
Proceedings of the International Conference on Audio, 2014

Multi-source sound field reproduction using cylindrical harmonic analysis.
Proceedings of the IEEE China Summit & International Conference on Signal and Information Processing, 2014

Speech enhancement based on a few shapes of speech spectrum.
Proceedings of the IEEE China Summit & International Conference on Signal and Information Processing, 2014

The design of HOA irregular decoders based on the optimal symmetrical virtual microphone response.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Range extrapolation of Head-Related Transfer Function using improved Higher Order Ambisonics.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Speech enhancement based on a novel weighting spectral distortion measure.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

A novel speech enhancement method using power spectra smooth in Wiener filtering.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2011
A MDCT-based click noise reduction method for MPEG-4 AAC codec.
Proceedings of the 2011 International Conference on Wireless Communications & Signal Processing, 2011

A sinusoidal audio and speech analysis/synthesis model based on improved EMD by adding pure tone.
Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

An embedded stereo speech and audio coding method based on principal component analysis.
Proceedings of the 2011 IEEE International Symposium on Signal Processing and Information Technology, 2011

2010
High frequency reconstruction of audio signal based on chaotic prediction theory.
Proceedings of the IEEE International Conference on Acoustics, 2010

2008
A 8.32 kb/s embedded wideband speech coding candidate for ITU-T EV-VBR standardization.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

