Jing Wang

Orcid: 0000-0002-3653-9951

Affiliations:
  • Beijing Institute of Technology, School of Information and Electronics, China (PhD 2007)


According to our database1, Jing Wang authored at least 72 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Audio-Visual Temporal Forgery Detection Using Embedding-Level Fusion and Multi-Dimensional Contrastive Loss.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

A Perceptually Motivated Approach for Low-Complexity Speech Semantic Communication.
IEEE Internet Things J., June, 2024

ListenFormer: Responsive Listening Head Generation with Non-autoregressive Transformers.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Semi-supervised Cross-Lingual Speech Recognition Exploiting Articulatory Features.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

Investigating the Potential of VR in Language Education: A Study of Cybersickness and Presence Metrics.
Proceedings of the 13th International Conference on Educational and Information Technology, 2024

Visually Guided Binaural Audio Generation with Cross-Modal Consistency.
Proceedings of the IEEE International Conference on Acoustics, 2024

Non-Intrusive Speech Quality Assessment with Multi-Task Learning Based on Tensor Network.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Deep-Reinforcement-Learning-Based NOMA-Aided Slotted ALOHA for LEO Satellite IoT Networks.
IEEE Internet Things J., October, 2023

Multisource localization based on angle distribution of time-frequency points using an FOA microphone.
CAAI Trans. Intell. Technol., September, 2023

Diffuseness Estimation-Based SSTP Detection for Multiple Sound Source Localization in Reverberant Environments.
Circuits Syst. Signal Process., August, 2023

Attention-based neural network for end-to-end music separation.
CAAI Trans. Intell. Technol., June, 2023

Multiple-Speech-Source DOA Estimation Based on Single-Source Cluster Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Multi-Source Localization Using Optimized Time-Frequency Representation and Sparsity Component Analysis.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Semi-Supervised Sound Event Detection with Pre-Trained Model.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Secure Aerial Computing: Convergence of Mobile Edge Computing and Blockchain for UAV Networks.
IEEE Trans. Veh. Technol., 2022

ASA-Net: Deep representation learning between object silhouette and attributes.
Neurocomputing, 2022

Speaker-Independent Audio-Visual Speech Separation Based on Transformer in Multi-Talker Environments.
IEICE Trans. Inf. Syst., 2022

MetaMGC: a music generation framework for concerts in metaverse.
EURASIP J. Audio Speech Music. Process., 2022

Human Sound Classification based on Feature Fusion Method with Air and Bone Conducted Signal.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

BIT-MI Deep Learning-based Model to Non-intrusive Speech Quality Assessment Challenge in Online Conferencing Applications.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Binaural Sound Source Localization based on Neural Networks in Mismatched HRTF Condition.
Proceedings of the ICCAI '22: 8th International Conference on Computing and Artificial Intelligence, Tianjin, China, March 18, 2022

MOS Predictor for Synthetic Speech with I-Vector Inputs.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Fusion of Machine Learning and Privacy Preserving for Secure Facial Expression Recognition.
Secur. Commun. Networks, 2021

Empirical Investigation of Multimodal Sensors in Novel Deep Facial Expression Recognition In-the-Wild.
J. Sensors, 2021

Neural network-based non-intrusive speech quality assessment using attention pooling function.
EURASIP J. Audio Speech Music. Process., 2021

A Separable Temporal Convolution Neural Network with Attention for Small-Footprint Keyword Spotting.
CoRR, 2021

Multi-Stream Gated and Pyramidal Temporal Convolutional Neural Networks for Audio-Visual Speech Separation in Multi-Talker Environments.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A Target Speaker Separation Neural Network with Joint-Training.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Frequency Axis Pooling Method for Weakly Labeled Sound Event Detection and Classification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Separable Temporal Convolution plus Temporally Pooled Attention for Lightweight High-Performance Keyword Spotting.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
QoE Evaluation Methods for 360-Degree VR Video Transmission.
IEEE J. Sel. Top. Signal Process., 2020

Binaural sound localization based on deep neural network and affinity propagation clustering in mismatched HRTF condition.
EURASIP J. Audio Speech Music. Process., 2020

Measuring quality of experience for 360-degree videos in virtual reality.
Sci. China Inf. Sci., 2020

An efficient method for generating assembly precedence constraints on 3D models based on a block sequence structure.
Comput. Aided Des., 2020

Subjective QoE of 360-Degree Virtual Reality Videos and Machine Learning Predictions.
IEEE Access, 2020

Impact of the Impairment in 360-Degree Videos on Users VR Involvement and Machine Learning-Based QoE Predictions.
IEEE Access, 2020

2019
Output-based speech quality assessment using autoencoder and support vector regression.
Speech Commun., 2019

Trajectory-Based 3D Convolutional Descriptors for Human Action Recognition.
J. Inf. Sci. Eng., 2019

Video Representation via Fusion of Static and Motion Features Applied to Human Activity Recognition.
KSII Trans. Internet Inf. Syst., 2019

3D-CNN-Based Fused Feature Maps with LSTM Applied to Action Recognition.
Future Internet, 2019

Compression of Head-Related Transfer Function Based on Tucker and Tensor Train Decomposition.
IEEE Access, 2019

An Interactive Virtual Training System for Assembly and Disassembly Based on Precedence Constraints.
Proceedings of the Advances in Computer Graphics, 2019

Speech Recognition Based on Deep Tensor Neural Network and Multifactor Feature.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Robust Speech Recognition based on Multi-Objective Learning with GRU Network.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Multiple Sound Sources Localization with Frame-by-Frame Component Removal of Statistically Dominant Source.
Sensors, 2018

QoE-Aware Mobile VR HAS Cache Management With Coding Helper.
IEEE Access, 2018

Sound Field Reproduction via the Alternating Direction Method of Multipliers Based Lasso Plus Regularized Least-Square.
IEEE Access, 2018

Automatic Personality Perception from Speech in Mandarin.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Non-intrusive Speech Quality Assessment Using Deep Belief Network and Backpropagation Neural Network.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

An Audio-Visual Quality Assessment Methodology in Virtual Reality Environment.
Proceedings of the 2018 IEEE International Conference on Multimedia & Expo Workshops, 2018

Attribute Driven Zero-Shot Classification and Segmentation.
Proceedings of the 2018 IEEE International Conference on Multimedia & Expo Workshops, 2018

Node Risk Propagation Capability Modeling of Supply Chain Network based on Structural Attributes.
Proceedings of the 9th International Conference on E-business, Management and Economics, 2018

Nonlinear Manifold Feature Extraction Based on Spectral Supervised Canonical Correlation Analysis for Facial Expression Recognition with RRNN.
Proceedings of the 11th International Congress on Image and Signal Processing, 2018

2017
HAS QoE prediction based on dynamic video features with data mining in LTE network.
Sci. China Inf. Sci., 2017

An objective assessment method based on multi-level factors for panoramic videos.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Subjective and objective quality assessment of panoramic videos in virtual reality environments.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Customer Satisfaction Evaluation Model of E-commerce Website based on Tensor Analysis.
Proceedings of the 8th International Conference on E-business, Management and Economics, 2017

2016
Microphone array speech denoising modeled by tensor filtering.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

2015
Prediction Model of Multi-channel Audio Quality Based on Multiple Linear Regression.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Objective Measurement of Spatial Audio Coding Quality Based on MNLR Mapping Model.
Proceedings of the 2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2015

2014
Tensor-based blind signal recovery for multi-carrier amplify-and-forward relay networks.
Sci. China Inf. Sci., 2014

A real-time QoE methodology for AMR codec voice in mobile network.
Sci. China Inf. Sci., 2014

A Dynamic Clustering Algorithm Design for C-RAN Based on Multi-Objective Optimization Theory.
Proceedings of the IEEE 79th Vehicular Technology Conference, 2014

Multi-channel audio signal retrieval based on multi-factor data mining with tensor decomposition.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014

2013
Context-based adaptive arithmetic coding in time and frequency domain for the lossless compression of audio coding parameters at variable rate.
EURASIP J. Audio Speech Music. Process., 2013

Multi-channel Audio Compression Method Based on ITU-T G.719 Codec.
Proceedings of the Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2013

Multichannel audio signal compression based on tensor decomposition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Application of Tucker Decomposition in Speech Signal Feature Extraction.
Proceedings of the 2013 International Conference on Asian Language Processing, 2013

2012
Comparison and optimization of packet loss recovery methods based on AMR-WB for VoIP.
Speech Commun., 2012

The lossless adaptive arithmetic coding based on context for ITU-T G.719 at variable rate.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

2008
A CSI and Rate-Distortion Based Packet Loss Recovery Algorithm for VoIP.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

2006
A Closed-loop Multimode Variable Bit Rate Characteristic Waveform Interpolation Coder.
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006


  Loading...