Si Liu

Orcid: 0000-0002-9180-2935

Affiliations:

Beihang University, School of Computer Science and Engineering, Beijing Key Laboratory of Digital Media, China
National University of Singapore, Department of Electrical and Computer Engineering, Learning and Vision Research Group, Singapore (former)
Chinese Academy of Sciences, Institute of Information Engineering, State Key Laboratory of Information Security (SKLOIS), Beijing, China (former)
Chinese Academy of Sciences, Institute of Automation, National Laboratory of Pattern Recognition, Beijing, China (PhD 2012)

According to our database¹, Si Liu authored at least 233 papers between 2009 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor Regression.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., March, 2025

RGB-T Tracking With Template-Bridged Search Interaction and Target-Preserved Template Updating.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., January, 2025

LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding.

[BibT_eX]

[DOI]

CoRR, January, 2025

2024

PPDM++: Parallel Point Detection and Matching for Fast and Accurate HOI Detection.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., October, 2024

FeatAug-DETR: Enriching One-to-Many Matching for DETRs With Feature Augmentation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., September, 2024

Multi-Person Pose Regression With Distribution-Aware Single-Stage Models.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., August, 2024

MI3C: Mining intra- and inter-image context for person search.

[BibT_eX]

[DOI]

Pattern Recognit., April, 2024

Delving Into the Devils of Bird's-Eye-View Perception: A Review, Evaluation and Recipe.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

Region-adaptive and context-complementary cross modulation for RGB-T semantic segmentation.

[BibT_eX]

[DOI]

Pattern Recognit., March, 2024

Room-Object Entity Prompting and Reasoning for Embodied Referring Expression.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., February, 2024

Linker: Learning Long Short-term Associations for Robust Visual Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2024

MAC: Masked Contrastive Pre-Training for Efficient Video-Text Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2024

GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance.

[BibT_eX]

[DOI]

CoRR, 2024

Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation.

[BibT_eX]

[DOI]

CoRR, 2024

TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation.

[BibT_eX]

[DOI]

CoRR, 2024

VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection.

[BibT_eX]

[DOI]

CoRR, 2024

Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology.

[BibT_eX]

[DOI]

CoRR, 2024

MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More.

[BibT_eX]

[DOI]

CoRR, 2024

FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instruction.

[BibT_eX]

[DOI]

CoRR, 2024

Knowledge Distillation via Query Selection for Detection Transformer.

[BibT_eX]

[DOI]

CoRR, 2024

LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation.

[BibT_eX]

[DOI]

CoRR, 2024

Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation.

[BibT_eX]

[DOI]

CoRR, 2024

MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection.

[BibT_eX]

[DOI]

CoRR, 2024

Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT.

[BibT_eX]

[DOI]

CoRR, 2024

Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane Priors.

[BibT_eX]

[DOI]

CoRR, 2024

V2X-PC: Vehicle-to-everything Collaborative Perception via Point Cluster.

[BibT_eX]

[DOI]

CoRR, 2024

Data Augmentation in Human-Centric Vision.

[BibT_eX]

[DOI]

CoRR, 2024

Lumina-Next : Making Lumina-T2X Stronger and Faster with Next-DiT.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Image Understanding Makes for A Good Tokenizer for Image Generation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

GPD-VVTO: Preserving Garment Details in Video Virtual Try-On.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Collaborative Training of Tiny-Large Vision Language Models.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Realistic Rainy Weather Simulation for LiDARs in CARLA Simulator.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Reference Prompted Model Adaptation for Referring Camouflaged Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

AutoVP: An Automated Visual Prompting Framework and Benchmark.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

LAROD-HD: Low-Cost Adaptive Real-Time Object Detection for High-Resolution Video Surveillance.

[BibT_eX]

[DOI]

Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Controllable Navigation Instruction Generation with Chain of Thought Prompting.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Asynchronous Large Language Model Enhanced Planner for Autonomous Driving.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

EASE-DETR: Easing the Competition among Object Queries.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Teach-DETR: Better Training DETR With Teachers.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Unified Transformer With Isomorphic Branches for Natural Language Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., September, 2023

Language-Aware Spatial-Temporal Collaboration for Referring Video Segmentation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Virtual Try-On With Garment Self-Occlusion Conditions.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Simultaneously Training and Compressing Vision-and-Language Pre-Training Model.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Fine-Grained Face Editing via Personalized Spatial-Aware Affine Modulation.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Realistic Rainy Weather Simulation for LiDARs in CARLA Simulator.

[BibT_eX]

[DOI]

CoRR, 2023

LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions.

[BibT_eX]

[DOI]

CoRR, 2023

Octavius: Mitigating Task Interference in MLLMs via MoE.

[BibT_eX]

[DOI]

CoRR, 2023

Towards Vehicle-to-everything Autonomous Driving: A Survey on Collaborative Perception.

[BibT_eX]

[DOI]

CoRR, 2023

Sparse Dense Fusion for 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2023

Object as Query: Equipping Any 2D Object Detector with 3D Detection Ability.

[BibT_eX]

[DOI]

CoRR, 2023

Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

DUSA: Decoupled Unsupervised Sim2Real Adaptation for Vehicle-to-Everything Collaborative Perception.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Enriching Phrases with Coupled Pixel and Object Contexts for Panoptic Narrative Grounding.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Discovering Sounding Objects by Audio Queries for Audio Visual Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Analyzing Infrastructure LiDAR Placement with Realistic LiDAR Simulation Library.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

CSDNet: Contrastive Similarity Distillation Network for Multi-lingual Image-Text Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Image and Graphics - 12th International Conference, 2023

Video Background Music Generation: Dataset, Method and Evaluation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Object as Query: Lifting any 2D Object Detector to 3D Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Optimizing the Placement of Roadside LiDARs for Autonomous Driving.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Strong Detector with Simple Tracker.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DETR with Additional Global Aggregation for Cross-domain Weakly Supervised Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Bridging Search Region Interaction with Template for RGB-T Tracking.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Anchor3DLane: Learning to Regress 3D Anchors for Monocular 3D Lane Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Adaptive Zone-aware Hierarchical Planner for Vision-Language Navigation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Progressive Language-Customized Visual Feature Learning for One-Stage Visual Grounding.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Human-Centric Spatio-Temporal Video Grounding With Visual Transformers.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2022

Human-Centric Relation Segmentation: Dataset and Solution.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Fine-Grained Human-Centric Tracklet Segmentation with Single Frame Supervision.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

PSGAN++: Robust Detail-Preserving Makeup Transfer and Removal.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Cross-Modal Progressive Comprehension for Referring Segmentation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Masked Contrastive Pre-Training for Efficient Video-Text Retrieval.

[BibT_eX]

[DOI]

CoRR, 2022

Analyzing Infrastructure LiDAR Placement with Realistic LiDAR.

[BibT_eX]

[DOI]

CoRR, 2022

Video Background Music Generation: Dataset, Method and Evaluation.

[BibT_eX]

[DOI]

CoRR, 2022

Multi-view Human Body Mesh Translator.

[BibT_eX]

[DOI]

CoRR, 2022

Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe.

[BibT_eX]

[DOI]

CoRR, 2022

TR-MOT: Multi-Object Tracking by Reference.

[BibT_eX]

[DOI]

CoRR, 2022

Target-Driven Structured Transformer Planner for Vision-Language Navigation.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Cross-Modality Domain Adaptation for Freespace Detection: A Simple yet Effective Baseline.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

PIC'22: 4th Person in Context Workshop.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

HEAD: HEtero-Assists Distillation for Heterogeneous Object Detectors.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

The Tenth Visual Object Tracking VOT2022 Challenge Results.

[BibT_eX]

[DOI]

Joni-Kristian Kämäräinen

Alireza Memarmoghadam

Christian Micheloni

Payman Moallem

Le Thanh Nguyen-Meidine

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

PoseTrans: A Simple yet Effective Pose Transformation Augmentation for Human Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Distribution-Aware Single-Stage Models for Multi-Person 3D Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive Selection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Reinforced Structured State-Evolution for Vision-Language Navigation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Scene Graph Generation With Hierarchical Context.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2021

Attentive Excitation and Aggregation for Bilingual Referring Image Segmentation.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2021

Discriminative Triad Matching and Reconstruction for Weakly Referring Expression Grounding.

[BibT_eX]

[DOI]

John Yannis Goulermas

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Improved Pillar with Fine-grained Feature for 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2021

Mining the Benefits of Two-stage and One-stage HOI Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Video Background Music Generation with Controllable Music Transformer.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Language-Guided Global Image Editing via Cross-Modal Cyclic Mechanism.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

General Instance Distillation for Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Reformulating HOI Detection As Adaptive Set Prediction.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

SRKTDN: Applying Super Resolution Method to Dehazing Task.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Confidence-aware Non-repetitive Multimodal Transformers for TextCaps.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

ORDNet: Capturing Omni-Range Dependencies for Scene Parsing.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Recapture as You Want.

[BibT_eX]

[DOI]

CoRR, 2020

Multi-granularity Multimodal Feature Interaction for Referring Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision, Third Chinese Conference, 2020

Cross-Modal Omni Interaction Modeling for Phrase Grounding.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Video Relation Detection with Trajectory-aware Multi-modal Features.

[BibT_eX]

[DOI]

Wentao Xie

Guanghui Ren

Si Liu

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Beautify As You Like.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

InteractGAN: Learning to Generate Human-Object Interaction.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Linguistic Structure Guided Context Modeling for Referring Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

PPDM: Parallel Point Detection and Matching for Real-Time Human-Object Interaction Detection.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

A Real-Time Cross-Modality Correlation Filtering Method for Referring Expression Comprehension.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Referring Image Segmentation via Cross-Modal Progressive Comprehension.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

AdversarialNAS: Adversarial Neural Architecture Search for GANs.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Tree-Structured Policy Based Progressive Reinforcement Learning for Temporally Language Grounding in Video.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Rule-Guided Compositional Representation Learning on Knowledge Graphs.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

RotateView: A Video Composition System for Interactive Product Display.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2019

Accurate Facial Image Parsing at Real-Time Speed.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2019

Magic-Wall: Visualizing Room Decoration by Enhanced Wall Segmentation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2019

Improved Search in Hamming Space Using Deep Multi-Index Hashing.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2019

PPDM: Parallel Point Detection and Matching for Real-time Human-Object Interaction Detection.

[BibT_eX]

[DOI]

CoRR, 2019

PSGAN: Pose-Robust Spatial-Aware GAN for Customizable Makeup Transfer.

[BibT_eX]

[DOI]

CoRR, 2019

UGAN: Untraceable GAN for Multi-Domain Face Translation.

[BibT_eX]

[DOI]

CoRR, 2019

Finding Images by Dialoguing with Image.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

GPS: Group People Segmentation with Detailed Part Inference.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Enhanced Memory Network for Video Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

RGB-Infrared Cross-Modality Person Re-Identification via Joint Pixel and Feature Alignment.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Building Detail-Sensitive Semantic Segmentation Networks With Polynomial Pooling.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Correlation Particle Filter for Visual Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2018

Composing Semantic Collage for Image Retargeting.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2018

Robust Target Tracking by Online Random Forests and Superpixels.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2018

Human parsing by weak structural label.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2018

Watch fashion shows to tell clothing attributes.

[BibT_eX]

[DOI]

Neurocomputing, 2018

Learning adaptive receptive fields for deep image parsing networks.

[BibT_eX]

[DOI]

Comput. Vis. Media, 2018

Multimodal Fusion for Traditional Chinese Painting Generation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Structured Deep Learning for Pixel-level Understanding.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Ensemble Soft-Margin Softmax Loss for Image Classification.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Automatic makeup based on generative adversarial nets.

[BibT_eX]

[DOI]

Proceedings of the 10th International Conference on Internet Multimedia Computing and Service, 2018

Person Re-Identification with Hybrid Loss and Hard Triplets Mining.

[BibT_eX]

[DOI]

Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Cross-Domain Human Parsing via Adversarial Feature and Label Adaptation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Adult Image and Video Recognition by a Deep Multicontext Network and Fine-to-Coarse Strategy.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2017

Retrieving Objects by Partitioning.

[BibT_eX]

[DOI]

IEEE Trans. Big Data, 2017

A weakly supervised method for makeup-invariant face verification.

[BibT_eX]

[DOI]

Pattern Recognit., 2017

Hierarchical deep semantic hashing for fast image retrieval.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2017

Objectness Region Enhancement Networks for Scene Parsing.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2017

Fast Deep Matting for Portrait Animation on Mobile Phone.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

RSVP: A Real-Time Surveillance Video Parsing System with Single Frame Supervision.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Time Traveler: A Real-time Face Aging System.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Magic-wall: Visualizing Room Decoration.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Face Aging with Contextual Generative Adversarial Nets.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Learning Adaptive Receptive Fields for Deep Image Parsing Network.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Surveillance Video Parsing with Single Frame Supervision.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Robust Visual Tracking via Exclusive Context Modeling.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2016

Surveillance Video Parsing with Single Frame Supervision.

[BibT_eX]

[DOI]

CoRR, 2016

Beauty eMakeup: A Deep Makeup Transfer System.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Makeup Like a Superstar: Deep Localized Makeup Transfer Network.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Deep multi-context Network for FINE-GRAINED VISUAL RECOGNITION.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

Single Image Dehazing via Multi-scale Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2016, 2016

Structural Correlation Filter for Robust Visual Tracking.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

SketchNet: Sketch Classification with Web Images.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

Fashion Parsing With Video Context.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2015

SLED: Semantic Label Embedding Dictionary Representation for Multilabel Image Annotation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2015

Clothing Attributes Assisted Person Reidentification.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2015

Deep Human Parsing with Active Template Regression.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2015

Robust Visual Tracking Via Consistent Low-Rank Sparse Learning.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2015

Deep People Counting in Extremely Dense Crowds.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Human-Centric Images and Videos Analysis.

[BibT_eX]

[DOI]

Si Liu

Bingbing Ni

Liang Lin

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Low-Rank Tensor Constrained Multiview Subspace Clustering.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Human Parsing with Contextualized Convolutional Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Towards Computational Baby Learning: A Weakly-Supervised Approach for Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Structural Sparse Tracking.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Matching-CNN meets KNN: Quasi-parametric human parsing.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Diversity-induced Multi-view Subspace Clustering.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Multimedia Analysis with Deep Learning.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

2014

A Low-Complexity Compressive Sensing Algorithm for PAPR Reduction.

[BibT_eX]

[DOI]

Wirel. Pers. Commun., 2014

"Wow! You Are So Beautiful Today!".

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2014

Circle & Search: Attribute-Aware Shoe Retrieval.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2014

Fashion Parsing With Weak Color-Category Labels.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2014

PicWords: Render a Picture by Packing Keywords.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2014

Snap & Play: Auto-Generated Personalized Find-the-Difference Game.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2014

Fashion Analysis: Current Techniques and Future Directions.

[BibT_eX]

[DOI]

Si Liu

Luoqi Liu

Shuicheng Yan

IEEE Multim., 2014

Computational Baby Learning.

[BibT_eX]

[DOI]

CoRR, 2014

Fashion Parsing with Video Context.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Image Retrieval and Ranking via Consistently Reconstructing Multi-attribute Queries.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2014, 2014

2013

Towards decrypting attractiveness via multi-modality cues.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2013

Mining Semantic Context Information for Intelligent Video Surveillance of Traffic Scenes.

[BibT_eX]

[DOI]

IEEE Trans. Ind. Informatics, 2013

M<sup>4</sup>L: Maximum margin Multi-instance Multi-cluster Learning for scene modeling.

[BibT_eX]

[DOI]

Pattern Recognit., 2013

Robust Visual Tracking via Structured Multi-Task Sparse Learning.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2013

eHeritage of shadow puppetry: creation and manipulation.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

Low-Rank Sparse Coding for Image Classification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2013

SYM-FISH: A Symmetry-Aware Flip Invariant Sketch Histogram Shape Descriptor.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2013

Magic Mirror: An Intelligent Fashion Recommendation System.

[BibT_eX]

[DOI]

Si Liu

Luoqi Liu

Shuicheng Yan

Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, 2013

2012

A Generic Framework for Video Annotation via Semi-Supervised Learning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2012

Weakly Supervised Graph Propagation Towards Collective Image Parsing.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2012

Sense beauty via face, dressing, and/or voice.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Street-to-shop: cross-scenario clothing retrieval via parts alignment and auxiliary set.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Hi, magic closet, tell me what to wear!

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Hi, magic closet, tell me what to wear!

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Low-complexity PAPR reduction algorithm in OFDM systems by designing data subcarriers.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Global Communications Conference, 2012

Low-Rank Sparse Learning for Robust Visual Tracking.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2012, 2012

Robust visual tracking via multi-task sparse learning.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Street-to-shop: Cross-scenario clothing retrieval via parts alignment and auxiliary set.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011

Boosted Exemplar Learning for Action Recognition and Annotation.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2011

Boosted multi-class semi-supervised learning for human action recognition.

[BibT_eX]

[DOI]

Pattern Recognit., 2011

A Physical Topology-Aware Chord Model based on ACO.

[BibT_eX]

[DOI]

J. Comput., 2011

Snap & play: auto-generate personalized find-the-difference mobile game.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Size Adaptive Selection of Most Informative Features.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010

Promoting Models.

[BibT_eX]

[DOI]

Proceedings of the Unifying Theories of Programming - Third International Symposium, 2010

Human Action Recognition in Videos Using Hybrid Motion Features.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Modeling, 2010

A generic framework for event detection in various video domains.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Human action recognition via multi-view learning.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

2009

IVA-NLPR-IA-CAS TRECVID 2009: High LevelFeatures Extraction.

[BibT_eX]

[DOI]

Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Human action recognition in videos using motion impression image.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Boosted Exemplar Learning for human action recognition.

[BibT_eX]

[DOI]

Proceedings of the 12th IEEE International Conference on Computer Vision Workshops, 2009

Si Liu

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...