We stand with Ukraine

We stand with Ukraine

Shengeng Tang

Orcid: 0000-0001-6313-2543

According to our database¹, Shengeng Tang authored at least 24 papers between 2019 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Knowledge Swapping via Learning and Unlearning.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, February, 2025

Navigating Semantic Drift in Task-Agnostic Class-Incremental Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, February, 2025

Efficient Vision Language Model Fine-tuning for Text-based Person Anomaly Search.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, February, 2025

Leveraging vision-language prompts for real-world image restoration and enhancement.

[BibT_eX]

[DOI]

,

,

,

,

,

Comput. Vis. Image Underst., 2025

2024

Intermediary-Generated Bridge Network for RGB-D Cross-Modal Re-Identification.

[BibT_eX]

[DOI]

,

,

ACM Trans. Intell. Syst. Technol., December, 2024

Emotional Video Captioning With Vision-Based Emotion Interpretation Network.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Image Process., 2024

Linguistics-Vision Monotonic Consistent Network for Sign Language Production.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

Temporal-Frequency State Space Duality: An Efficient Paradigm for Speech Emotion Recognition.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal Granularity Collaboration.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

Patch-level Sounding Object Tracking for Audio-Visual Question Answering.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observation.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

Modality Alignment Meets Federated Broadcasting.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

Dataset Distillers Are Good Label Denoisers In the Wild.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

Shaping a Stabilized Video by Mitigating Unintended Changes for Concept-Augmented Video Editing.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

Micro-gesture Online Recognition using Learnable Query Points.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

Comprehensive Survey on Person Identification: Queries, Methods, and Datasets.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 1st ICMR Workshop on Multimedia Object Re-Identification, 2024

Micro-gesture Online Recognition using Learnable Query Points.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of IJCAI 2024 Workshop&Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA 2024) co-located with 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024), 2024

NTIRE 2024 Dense and Non-Homogeneous Dehazing Challenge Report.

[BibT_eX]

[DOI]

Codruta O. Ancuti

,

,

Florin-Alexandru Vasluianu

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Seyed Amirreza Mousavi

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Nikhil Akalwadi

,

,

Ramesh Ashok Tabib

,

,

,

Jagadeesh Kalyanshetti

,

Vijayalaxmi Ashok Aralikatti

,

Palani Yashaswini

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Emotion-Prior Awareness Network for Emotional Video Captioning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022

Graph-Based Multimodal Sequential Embedding for Sign Language Translation.

[BibT_eX]

[DOI]

,

,

,

IEEE Trans. Multim., 2022

Gloss Semantic-Enhanced Network with Online Back-Translation for Sign Language Production.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2019

Connectionist Temporal Modeling of Video and Language: a Joint Model for Translation and Sign Labeling.

[BibT_eX]

[DOI]

,

,

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Loading...