Shaofei Huang

Orcid: 0000-0001-8996-9907

Affiliations:
  • Chinese Academy of Sciences, Institute of Information Engineering, Beijing, China
  • University of Chinese Academy of Sciences, School of Cyber Security, Beijing, China
  • SenseTime Research, Science Park, Hong Kong


According to our database1, Shaofei Huang authored at least 21 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor Regression.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2025

LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding.
CoRR, January, 2025

2024
Modality adaptation via feature difference learning for depth human parsing.
Comput. Vis. Image Underst., 2024

FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instruction.
CoRR, 2024

Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation.
CoRR, 2024

Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Reference Prompted Model Adaptation for Referring Camouflaged Object Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Language-Aware Spatial-Temporal Collaboration for Referring Video Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Fine-Grained Face Editing via Personalized Spatial-Aware Affine Modulation.
IEEE Trans. Multim., 2023

Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Discovering Sounding Objects by Audio Queries for Audio Visual Segmentation.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Anchor3DLane: Learning to Regress 3D Anchors for Monocular 3D Lane Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Cross-Modal Progressive Comprehension for Referring Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Cross-Modality Domain Adaptation for Freespace Detection: A Simple yet Effective Baseline.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

A Keypoint-based Global Association Network for Lane Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
ORDNet: Capturing Omni-Range Dependencies for Scene Parsing.
IEEE Trans. Image Process., 2020

Linguistic Structure Guided Context Modeling for Referring Image Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Referring Image Segmentation via Cross-Modal Progressive Comprehension.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020


  Loading...