Bing Yang
Orcid: 0000-0002-8978-2322Affiliations:
- Peking University, Shenzhen Graduate School, Key Laboratory of Machine Perception, Beijing, China
According to our database1,
Bing Yang
authored at least 23 papers
between 2017 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Self-Supervised Learning of Spatial Acoustic Representation With Cross-Channel Signal Reconstruction and Multi-Channel Conformer.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
2022
Enhancing direct-path relative transfer function using deep neural network for robust sound source localization.
CAAI Trans. Intell. Technol., 2022
Head-related transfer function-reserved time-frequency masking for robust binaural sound source localization.
CAAI Trans. Intell. Technol., 2022
SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization.
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Supervised Direct-Path Relative Transfer Function Learning for Binaural Sound Source Localization.
Proceedings of the IEEE International Conference on Acoustics, 2021
Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
An Adaptive Method Based on Multiscale Dilated Convolutional Network for Binaural Speech Source Localization.
Complex., 2020
Deep Metric Learning-Assisted 3D Audio-Visual Speaker Tracking via Two-Layer Particle Filter.
Complex., 2020
Proceedings of the 2020 IEEE International Conference on Systems, Man, and Cybernetics, 2020
Lip Graph Assisted Audio-Visual Speech Recognition Using Bidirectional Synchronous Fusion.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Mutual Alignment between Audiovisual Features for End-to-End Audiovisual Speech Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
A Base-Derivative Framework for Cross-Modality RGB-Infrared Person Re-Identification.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
2019
Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics, 2019
Synergistic Optimization based Binaural Time-Frequency Masking for Speech Source Localization.
Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics, 2019
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
2018
Multiple Concurrent Sound Source Tracking Based on Observation-Guided Adaptive Particle Filter.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
2017
Multiple Sound Source Counting and Localization Based on Spatial Principal Eigenvector.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Multiple sound source localization based on TDOA clustering and multi-path matching pursuit.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017