G2EMOT: Guided Embedding Enhancement for Multiple Object Tracking in Complex Scenes.
IEEE Trans. Instrum. Meas., 2024
Gotta Hear Them All: Sound Source Aware Vision to Audio Generation.
CoRR, 2024
Diff-SAGe: End-to-End Spatial Audio Generation Using Diffusion Models.
CoRR, 2024
A unified multichannel far-field speech recognition system: combining neural beamforming with attention based end-to-end model.
CoRR, 2024
STCMOT: Spatio-Temporal Cohesion Learning for UAV-Based Multiple Object Tracking.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
A shrinking synchronization clustering algorithm based on a linear weighted Vicsek model.
J. Intell. Fuzzy Syst., December, 2023
Low latency transformers for speech processing.
CoRR, 2023
An End-to-End Far-Field Keyword Spotting System with Neural Beamforming.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
Intelligent Parking Space Sharing System.
Proceedings of the 3rd International Conference on Artificial Intelligence and Advanced Manufacture, 2021
Modelling and compensation techniques for short duration speaker verification.
PhD thesis, 2019
Generalized Variability Model for Speaker Verification.
IEEE Signal Process. Lett., 2018
Speaker-Phonetic Vector Estimation for Short Duration Speaker Verification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Incorporating Local Acoustic Variability Information into Short Duration Speaker Verification.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Twin Model G-PLDA for Duration Mismatch Compensation in Text-Independent Speaker Verification.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Parallel Speaker and Content Modelling for Text-Dependent Speaker Verification.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Recovery Mechanism of Online Certification Chain in Grid Computing.
Proceedings of the The First International Conference on Availability, 2006
Recovering Frontal-Pose Image from a Single Profile Image.
Proceedings of the 2000 International Conference on Image Processing, 2000
Region Correspondence by Global Configuration Matching and Progressive Delaunay Triangulation.
Proceedings of the 2000 Conference on Computer Vision and Pattern Recognition (CVPR 2000), 2000
Dense Shape and Motion from Region Correspondences by Factorization.
Proceedings of the 1998 Conference on Computer Vision and Pattern Recognition (CVPR '98), 1998