Yidi Li

Orcid: 0000-0002-5236-7010

According to our database1, Yidi Li authored at least 24 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
DepthGAN: GAN-based depth generation from semantic layouts.
Comput. Vis. Media, June, 2024

Audio-visual keyword transformer for unconstrained sentence-level keyword spotting.
CAAI Trans. Intell. Technol., February, 2024

MVSSC: Meta-reinforcement learning based visual indoor navigation using multi-view semantic spatial context.
Pattern Recognit. Lett., January, 2024

Feature Completion Transformer for Occluded Person Re-Identification.
IEEE Trans. Multim., 2024

STNet: Deep Audio-Visual Fusion Network for Robust Speaker Tracking.
CoRR, 2024

PVAFN: Point-Voxel Attention Fusion Network with Multi-Pooling Enhancing for 3D Object Detection.
CoRR, 2024

Global-Local Distillation Network-Based Audio-Visual Speaker Tracking with Incomplete Modalities.
CoRR, 2024

Adaptive Fourier Decomposition Based Signal Extraction on Weak Electromagnetic Field.
Proceedings of the IEEE International Conference on Acoustics, 2024

AttA-NET: Attention Aggregation Network for Audio-Visual Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
On-device audio-visual multi-person wake word spotting.
CAAI Trans. Intell. Technol., December, 2023

Cascade RDN: Towards Accurate Localization in Industrial Visual Anomaly Detection With Structural Anomaly Generation.
IEEE Robotics Autom. Lett., September, 2023

Transparency study of architectural space based on a scalar field function.
Spatial Cogn. Comput., July, 2023

1-Bit Hilbert Transform for Signed Signals with Sparse Prior.
Circuits Syst. Signal Process., March, 2023

Joint Adversarial and Collaborative Learning for Self-Supervised Action Recognition.
CoRR, 2023

Feature Completion Transformer for Occluded Person Re-identification.
CoRR, 2023

Self-Supervised 3D Skeleton Representation Learning with Active Sampling and Adaptive Relabeling for Action Recognition.
Proceedings of the IEEE International Conference on Image Processing, 2023

Boosting Person Re-Identification with Viewpoint Contrastive Learning and Adversarial Training.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
DepthGAN: GAN-based Depth Generation of Indoor Scenes from Semantic Layouts.
CoRR, 2022

Audio-Visual Fusion Network Based on Conformer for Multimodal Emotion Recognition.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2020
Deep Metric Learning-Assisted 3D Audio-Visual Speaker Tracking via Two-Layer Particle Filter.
Complex., 2020

Improving the Data Quality for Credit Card Fraud Detection.
Proceedings of the IEEE International Conference on Intelligence and Security Informatics, 2020

3D Audio-Visual Speaker Tracking with A Novel Particle Filter.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

2019
3D Audio-Visual Speaker Tracking with A Two-Layer Particle Filter.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019


  Loading...