Shengeng Tang

Orcid: 0000-0001-6313-2543

According to our database1, Shengeng Tang authored at least 22 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

2019
2020
2021
2022
2023
2024
2025
0
5
10
15
1
14
1
3
1
1
1

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Leveraging vision-language prompts for real-world image restoration and enhancement.
Comput. Vis. Image Underst., 2025

2024
Intermediary-Generated Bridge Network for RGB-D Cross-Modal Re-Identification.
ACM Trans. Intell. Syst. Technol., December, 2024

Emotional Video Captioning With Vision-Based Emotion Interpretation Network.
IEEE Trans. Image Process., 2024

Linguistics-Vision Monotonic Consistent Network for Sign Language Production.
CoRR, 2024

Temporal-Frequency State Space Duality: An Efficient Paradigm for Speech Emotion Recognition.
CoRR, 2024

Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production.
CoRR, 2024

Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal Granularity Collaboration.
CoRR, 2024

Patch-level Sounding Object Tracking for Audio-Visual Question Answering.
CoRR, 2024

Towards Pixel-Level Prediction for Gaze Following: Benchmark and Approach.
CoRR, 2024

Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observation.
CoRR, 2024

Modality Alignment Meets Federated Broadcasting.
CoRR, 2024

Dataset Distillers Are Good Label Denoisers In the Wild.
CoRR, 2024

Shaping a Stabilized Video by Mitigating Unintended Changes for Concept-Augmented Video Editing.
CoRR, 2024

Micro-gesture Online Recognition using Learnable Query Points.
CoRR, 2024

A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+.
CoRR, 2024

Comprehensive Survey on Person Identification: Queries, Methods, and Datasets.
Proceedings of the 1st ICMR Workshop on Multimedia Object Re-Identification, 2024

Micro-gesture Online Recognition using Learnable Query Points.
Proceedings of IJCAI 2024 Workshop&Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA 2024) co-located with 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024), 2024


2023
Emotion-Prior Awareness Network for Emotional Video Captioning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022
Graph-Based Multimodal Sequential Embedding for Sign Language Translation.
IEEE Trans. Multim., 2022

Gloss Semantic-Enhanced Network with Online Back-Translation for Sign Language Production.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2019
Connectionist Temporal Modeling of Video and Language: a Joint Model for Translation and Sign Labeling.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019


  Loading...