Si Liu
Orcid: 0000-0002-9180-2935Affiliations:
- Beihang University, School of Computer Science and Engineering, Beijing Key Laboratory of Digital Media, China
- National University of Singapore, Department of Electrical and Computer Engineering, Learning and Vision Research Group, Singapore (former)
- Chinese Academy of Sciences, Institute of Information Engineering, State Key Laboratory of Information Security (SKLOIS), Beijing, China (former)
- Chinese Academy of Sciences, Institute of Automation, National Laboratory of Pattern Recognition, Beijing, China (PhD 2012)
According to our database1,
Si Liu
authored at least 233 papers
between 2009 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
IEEE Trans. Pattern Anal. Mach. Intell., March, 2025
RGB-T Tracking With Template-Bridged Search Interaction and Target-Preserved Template Updating.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2025
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding.
CoRR, January, 2025
2024
IEEE Trans. Pattern Anal. Mach. Intell., October, 2024
IEEE Trans. Pattern Anal. Mach. Intell., September, 2024
IEEE Trans. Pattern Anal. Mach. Intell., August, 2024
Pattern Recognit., April, 2024
Delving Into the Devils of Bird's-Eye-View Perception: A Review, Evaluation and Recipe.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024
Region-adaptive and context-complementary cross modulation for RGB-T semantic segmentation.
Pattern Recognit., March, 2024
IEEE Trans. Pattern Anal. Mach. Intell., February, 2024
IEEE Trans. Multim., 2024
IEEE Trans. Multim., 2024
CoRR, 2024
CoRR, 2024
TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation.
CoRR, 2024
VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection.
CoRR, 2024
Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology.
CoRR, 2024
CoRR, 2024
Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation.
CoRR, 2024
MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection.
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024
Proceedings of the IEEE International Conference on Robotics and Automation, 2024
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
LAROD-HD: Low-Cost Adaptive Real-Time Object Detection for High-Resolution Video Surveillance.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024
Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023
IEEE Trans. Circuits Syst. Video Technol., September, 2023
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023
IEEE Trans. Multim., 2023
IEEE Trans. Multim., 2023
LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions.
CoRR, 2023
Towards Vehicle-to-everything Autonomous Driving: A Survey on Collaborative Perception.
CoRR, 2023
CoRR, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
DUSA: Decoupled Unsupervised Sim2Real Adaptation for Vehicle-to-Everything Collaborative Perception.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Enriching Phrases with Coupled Pixel and Object Contexts for Panoptic Narrative Grounding.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Proceedings of the IEEE International Conference on Robotics and Automation, 2023
CSDNet: Contrastive Similarity Distillation Network for Multi-lingual Image-Text Retrieval.
Proceedings of the Image and Graphics - 12th International Conference, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
DETR with Additional Global Aggregation for Cross-domain Weakly Supervised Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Progressive Language-Customized Visual Feature Learning for One-Stage Visual Grounding.
IEEE Trans. Image Process., 2022
IEEE Trans. Circuits Syst. Video Technol., 2022
IEEE Trans. Pattern Anal. Mach. Intell., 2022
IEEE Trans. Pattern Anal. Mach. Intell., 2022
IEEE Trans. Pattern Anal. Mach. Intell., 2022
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe.
CoRR, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Cross-Modality Domain Adaptation for Freespace Detection: A Simple yet Effective Baseline.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022
PoseTrans: A Simple yet Effective Pose Transformation Augmentation for Human Pose Estimation.
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
IEEE Trans. Neural Networks Learn. Syst., 2021
ACM Trans. Intell. Syst. Technol., 2021
Discriminative Triad Matching and Reconstruction for Weakly Referring Expression Grounding.
IEEE Trans. Pattern Anal. Mach. Intell., 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
IEEE Trans. Image Process., 2020
Proceedings of the Pattern Recognition and Computer Vision, Third Chinese Conference, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
PPDM: Parallel Point Detection and Matching for Real-Time Human-Object Interaction Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
A Real-Time Cross-Modality Correlation Filtering Method for Referring Expression Comprehension.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Tree-Structured Policy Based Progressive Reinforcement Learning for Temporally Language Grounding in Video.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
IEEE Trans. Multim., 2019
IEEE Trans. Image Process., 2019
IEEE Trans. Circuits Syst. Video Technol., 2019
PPDM: Parallel Point Detection and Matching for Real-time Human-Object Interaction Detection.
CoRR, 2019
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019
RGB-Infrared Cross-Modality Person Re-Identification via Joint Pixel and Feature Alignment.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
2018
IEEE Trans. Circuits Syst. Video Technol., 2018
Comput. Vis. Media, 2018
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Proceedings of the 10th International Conference on Internet Multimedia Computing and Service, 2018
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Adult Image and Video Recognition by a Deep Multicontext Network and Fine-to-Coarse Strategy.
ACM Trans. Intell. Syst. Technol., 2017
Pattern Recognit., 2017
Multim. Tools Appl., 2017
J. Comput. Sci. Technol., 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016
Proceedings of the Computer Vision - ECCV 2016, 2016
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
2015
SLED: Semantic Label Embedding Dictionary Representation for Multilabel Image Annotation.
IEEE Trans. Image Process., 2015
IEEE Trans. Circuits Syst. Video Technol., 2015
IEEE Trans. Pattern Anal. Mach. Intell., 2015
Int. J. Comput. Vis., 2015
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015
Towards Computational Baby Learning: A Weakly-Supervised Approach for Object Detection.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015
2014
Wirel. Pers. Commun., 2014
ACM Trans. Multim. Comput. Commun. Appl., 2014
ACM Trans. Intell. Syst. Technol., 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the Computer Vision - ECCV 2014, 2014
2013
ACM Trans. Multim. Comput. Commun. Appl., 2013
Mining Semantic Context Information for Intelligent Video Surveillance of Traffic Scenes.
IEEE Trans. Ind. Informatics, 2013
M<sup>4</sup>L: Maximum margin Multi-instance Multi-cluster Learning for scene modeling.
Pattern Recognit., 2013
Int. J. Comput. Vis., 2013
Proceedings of the ACM Multimedia Conference, 2013
Proceedings of the IEEE International Conference on Computer Vision, 2013
Proceedings of the IEEE International Conference on Computer Vision, 2013
Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, 2013
2012
IEEE Trans. Multim., 2012
IEEE Trans. Multim., 2012
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Street-to-shop: cross-scenario clothing retrieval via parts alignment and auxiliary set.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Low-complexity PAPR reduction algorithm in OFDM systems by designing data subcarriers.
Proceedings of the 2012 IEEE Global Communications Conference, 2012
Proceedings of the Computer Vision - ECCV 2012, 2012
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012
Street-to-shop: Cross-scenario clothing retrieval via parts alignment and auxiliary set.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012
2011
IEEE Trans. Circuits Syst. Video Technol., 2011
Pattern Recognit., 2011
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011
2010
Proceedings of the Unifying Theories of Programming - Third International Symposium, 2010
Proceedings of the Advances in Multimedia Modeling, 2010
Proceedings of the 18th International Conference on Multimedia 2010, 2010
Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010
2009
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009
Proceedings of the 12th IEEE International Conference on Computer Vision Workshops, 2009