Guanbin Li

Orcid: 0000-0002-4805-0926

According to our database1, Guanbin Li authored at least 251 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Separating Noisy Samples From Tail Classes for Long-Tailed Image Classification With Label Noise.
IEEE Trans. Neural Networks Learn. Syst., November, 2024

Structure Embedded Nucleus Classification for Histopathology Images.
IEEE Trans. Medical Imaging, September, 2024

ECC-PolypDet: Enhanced CenterNet With Contrastive Learning for Automatic Polyp Detection.
IEEE J. Biomed. Health Informatics, August, 2024

TIP-Editor: An Accurate 3D Editor Following Both Text-Prompts And Image-Prompts.
ACM Trans. Graph., July, 2024

Inter-domain mixup for semi-supervised domain adaptation.
Pattern Recognit., February, 2024

Mapping medical image-text to a joint space via masked modeling.
Medical Image Anal., January, 2024

Contrastive Open-Set Active Learning-Based Sample Selection for Image Classification.
IEEE Trans. Image Process., 2024

SENSE: Self-Evolving Learning for Self-Supervised Monocular Depth Estimation.
IEEE Trans. Image Process., 2024

Uncertainty-Aware Active Domain Adaptive Salient Object Detection.
IEEE Trans. Image Process., 2024

Let Video Teaches You More: Video-to-Image Knowledge Distillation using DEtection TRansformer for Medical Video Lesion Detection.
CoRR, 2024

SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation.
CoRR, 2024

High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model.
CoRR, 2024

Style-Preserving Lip Sync via Audio-Aware Style Reference.
CoRR, 2024

MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection.
CoRR, 2024

ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation.
CoRR, 2024

WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models.
CoRR, 2024

Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI.
CoRR, 2024

Fine-grained Spatial-temporal MLP Architecture for Metro Origin-Destination Prediction.
CoRR, 2024

UniFL: Improve Stable Diffusion via Unified Feedback Learning.
CoRR, 2024

Annotation-Efficient Polyp Segmentation via Active Learning.
CoRR, 2024

Large Multimodal Agents: A Survey.
CoRR, 2024

Multimodal Embodied Interactive Agent for Cafe Scene.
CoRR, 2024

Universal Semi-supervised Model Adaptation via Collaborative Consistency Training.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

TreeReward: Improve Diffusion Model via Tree-Structured Feedback Learning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Multi-modal Denoising Diffusion Pre-training for Whole-Slide Image Classification.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Semi- and Weakly-Supervised Learning for Mammogram Mass Segmentation with Limited Annotations.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

Enrichment, Borrowing, And Mining: A Data-Driven Approach To Colonoscopic Lesion Classification.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

ColonCLIP: An Adaptable Prompt-Driven Multi-Modal Strategy for Colonoscopy Image Diagnosis.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

Source-Free Semi-Supervised Domain Adaptation for Tuberculosis Recognition.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

Intensity Confusion Matters: An Intensity-Distance Guided Loss For Bronchus Segmentation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

VersVideo: Leveraging Enhanced Temporal Diffusion Models for Versatile Video Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Credible Teacher for Semi-Supervised Object Detection in Open Scene.
Proceedings of the IEEE International Conference on Acoustics, 2024

OVER-NAV: Elevating Iterative Vision-and-Language Navigation with Open-Vocabulary Detection and StructurEd Representation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Decoupled Pseudo-Labeling for Semi-Supervised Monocular 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Open-Vocabulary Segmentation with Semantic-Assisted Calibration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MMAPS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

GraphVAE: Unveiling Dynamic Stock Relationships with Variational Autoencoder-based Factor Modeling.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Cell Graph Transformer for Nuclei Classification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Removing Interference and Recovering Content Imaginatively for Visible Watermark Removal.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

UniCell: Universal Cell Nucleus Classification via Prompt Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Variance-Insensitive and Target-Preserving Mask Refinement for Interactive Image Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
PointMatch: A consistency training framework for weakly supervised semantic segmentation of 3D point clouds.
Comput. Graph., November, 2023

Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Unbiased curriculum learning enhanced global-local graph neural network for protein thermodynamic stability prediction.
Bioinform., October, 2023

Multi-Task Learning With Hierarchical Guidance for Locating and Stratifying Submucosal Tumors.
IEEE J. Biomed. Health Informatics, September, 2023

Aerial Images Meet Crowdsourced Trajectories: A New Approach to Robust Road Extraction.
IEEE Trans. Neural Networks Learn. Syst., July, 2023

Language-Aware Spatial-Temporal Collaboration for Referring Video Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Urban regional function guided traffic flow prediction.
Inf. Sci., July, 2023

Fashion Retrieval via Graph Reasoning Networks on a Similarity Pyramid.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Which Pixel to Annotate: A Label-Efficient Nuclei Segmentation Framework.
IEEE Trans. Medical Imaging, April, 2023

Online Metro Origin-Destination Prediction via Heterogeneous Information Aggregation.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Thyroid region prior guided attention for ultrasound segmentation of thyroid nodules.
Comput. Biol. Medicine, March, 2023

Taylor Neural Network for Real-World Image Super-Resolution.
IEEE Trans. Image Process., 2023

Adaptive Betweenness Clustering for Semi-Supervised Domain Adaptation.
IEEE Trans. Image Process., 2023

Hybrid-Order Representation Learning for Electricity Theft Detection.
IEEE Trans. Ind. Informatics, 2023

GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance.
CoRR, 2023

M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce.
CoRR, 2023

WMFormer++: Nested Transformer for Visible Watermark Removal via Implict Joint Learning.
CoRR, 2023

Exploration and Exploitation of Unlabeled Data for Open-Set Semi-Supervised Learning.
CoRR, 2023

CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning.
CoRR, 2023

Visual Causal Scene Refinement for Video Question Answering.
CoRR, 2023

Causality-aware Visual Scene Discovery for Cross-Modal Question Reasoning.
CoRR, 2023

Urban Regional Function Guided Traffic Flow Prediction.
CoRR, 2023

Visual-Linguistic Causal Intervention for Radiology Report Generation.
CoRR, 2023

DreamEditor: Text-Driven 3D Scene Editing with Neural Fields.
Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

Visual Causal Scene Refinement for Video Question Answering.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Mammo-SAM: Adapting Foundation Segment Anything Model for Automatic Breast Mass Segmentation in Whole Mammograms.
Proceedings of the Machine Learning in Medical Imaging - 14th International Workshop, 2023

Self- and Semi-supervised Learning for Gastroscopic Lesion Detection.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Diffusion-Based Data Augmentation for Nuclei Image Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

YONA: You Only Need One Adjacent Reference-Frame for Accurate and Fast Video Polyp Detection.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Automatic Bleeding Risk Rating System of Gastric Varices.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Prompt-Based Grouping Transformer for Nucleus Detection and Classification.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

ArSDM: Colonoscopy Images Synthesis with Adaptive Refinement Semantic Diffusion Models.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Unpaired Image-to-Image Translation Based Domain Adaptation for Polyp Segmentation.
Proceedings of the 20th IEEE International Symposium on Biomedical Imaging, 2023

Long-term Wind Power Forecasting with Hierarchical Spatial-Temporal Transformer.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

DenseLight: Efficient Control for Large-scale Traffic Signals with Dense Feedback.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

RankMatch: Fostering Confidence and Consistency in Learning with Noisy Labels.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Real-World Burst Image Super-Resolution: Benchmark and Method.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Enhanced Soft Label for Semi-Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Identity-Preserving Talking Face Generation with Landmark and Appearance Priors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Improved Distribution Matching for Dataset Condensation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Semi-DETR: Semi-Supervised Object Detection with Detection Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SCoDA: Domain Adaptive Shape Completion for Real Scans.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Being Comes from Not-Being: Open-Vocabulary Text-to-Motion Generation with Wordless Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Parametric Implicit Face Representation for Audio-Driven Facial Reenactment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Divide and Adapt: Active Domain Adaptation via Customized Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

De-biased Teacher: Rethinking IoU Matching for Semi-supervised Object Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Adapting Object Size Variance and Class Imbalance for Semi-supervised Object Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Structured Attention Network for Referring Image Segmentation.
IEEE Trans. Multim., 2022

VQAMix: Conditional Triplet Mixup for Medical Visual Question Answering.
IEEE Trans. Medical Imaging, 2022

Physical-Virtual Collaboration Modeling for Intra- and Inter-Station Metro Ridership Prediction.
IEEE Trans. Intell. Transp. Syst., 2022

Human-Centric Spatio-Temporal Video Grounding With Visual Transformers.
IEEE Trans. Circuits Syst. Video Technol., 2022

A Hamiltonian Monte Carlo Method for Probabilistic Adversarial Attack and Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Cross-Modal Progressive Comprehension for Referring Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Causal Reasoning Meets Visual Representation Learning: A Prospective Study.
Int. J. Autom. Comput., 2022

OhMG: Zero-shot Open-vocabulary Human Motion Generation.
CoRR, 2022

Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering.
CoRR, 2022

BronchusNet: Region and Structure Prior Embedded Representation Learning for Bronchus Segmentation and Classification.
CoRR, 2022

Dual Adversarial Adaptation for Cross-Device Real-World Image Super-Resolution.
CoRR, 2022

Causal Reasoning with Spatial-temporal Representation Learning: A Prospective Study.
CoRR, 2022

Open Set Domain Adaptation By Novel Class Discovery.
CoRR, 2022

PointMatch: A Consistency Training Framework for Weakly SupervisedSemantic Segmentation of 3D Point Clouds.
CoRR, 2022

Gradient-Rebalanced Uncertainty Minimization for Cross-Site Adaptation of Medical Image Segmentation.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Divide and Contrast: Source-free Domain Adaptation via Adaptive Contrastive Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Multi-stream Cell Segmentation with Low-level Cues for Multi-modality Images.
Proceedings of The Cell Segmentation Challenge in Multi-modality High-Resolution Microscopy Images, 2022

Audio-driven Talking Head Generation with Transformer and 3D Morphable Model.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Compound Batch Normalization for Long-tailed Image Classification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Align, Reason and Learn: Enhancing Medical Vision-and-Language Pre-training with Knowledge.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Semi-supervised Spatial Temporal Attention Network for Video Polyp Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Lesion-Aware Dynamic Kernel for Polyp Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

BoxPolyp: Boost Generalized Polyp Segmentation Using Extra Coarse Bounding Box Annotations.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Attentive Symmetric Autoencoder for Brain MRI Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Less is More: Adaptive Curriculum Learning for Thyroid Nodule Diagnosis.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Multi-modal Masked Autoencoders for Medical Vision-and-Language Pre-training.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Cross-Level Contrastive Learning and Consistency Constraint for Semi-Supervised Medical Image Segmentation.
Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022

Early Prediction of Blastocyst Development via Time-Lapse Video Analysis.
Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022

View-Disentangled Transformer for Brain Lesion Detection.
Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022

Semantic-Aware Temporal Channel-Wise Attention for Cardiac Function Assessment.
Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022

Multi-level Consistency Learning for Semi-supervised Domain Adaptation.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Double-Check Soft Teacher for Semi-Supervised Object Detection.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Discovering Implicit Classes Achieves Open Set Domain Adaptation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

HairGAN: Spatial-Aware Palette GAN for Hair Color Transfer.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Multimodal Crowd Counting with Mutual Attention Transformers.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels.
Proceedings of the Computer Vision - ECCV 2022, 2022

Neighborhood Collective Estimation for Noisy Label Identification and Correction.
Proceedings of the Computer Vision, 2022

A Multi-granularity Retrieval System for Natural Language-based Vehicle Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

X -Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Dual Adversarial Adaptation for Cross-Device Real-World Image Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unsupervised Domain Adaptive Salient Object Detection through Uncertainty-Aware Pseudo-Label Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

A Causal Debiasing Framework for Unsupervised Salient Object Detection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

A Causal Inference Look at Unsupervised Video Anomaly Detection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Contralaterally Enhanced Networks for Thoracic Disease Detection.
IEEE Trans. Medical Imaging, 2021

Dynamic Spatial-Temporal Representation Learning for Traffic Flow Prediction.
IEEE Trans. Intell. Transp. Syst., 2021

Semantics-Aware Adaptive Knowledge Distillation for Sensor-to-Vision Action Recognition.
IEEE Trans. Image Process., 2021

Hierarchical Reasoning Network for Human-Object Interaction Detection.
IEEE Trans. Image Process., 2021

Depthwise Nonlocal Module for Fast Salient Object Detection Using a Single Thread.
IEEE Trans. Cybern., 2021

Relationship-Embedded Representation Learning for Grounding Referring Expressions.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Deep CockTail Networks.
Int. J. Comput. Vis., 2021

Instance-level salient object segmentation.
Comput. Vis. Image Underst., 2021

Road Network Guided Fine-Grained Urban Traffic Flow Inference.
CoRR, 2021

Online Metro Origin-Destination Prediction via Heterogeneous Information Aggregation.
CoRR, 2021

Explore before Moving: A Feasible Path Estimation and Memory Recalling Framework for Embodied Navigation.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Robust Real-World Image Super-Resolution against Adversarial Attacks.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Cross-Modal Self-Attention with Multi-Task Pre-Training for Medical Visual Question Answering.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Self-supervised Correction Learning for Semi-supervised Biomedical Image Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Colorectal Polyp Classification from White-Light Colonoscopy Images via Domain Alignment.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Deep Transformers For Fast Small Intestine Grounding In Capsule Endoscope Video.
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

Multi-Task Learning For Thyroid Nodule Segmentation With Thyroid Region Prior.
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

Weakly-Supervised Spatio-Temporal Anomaly Detection in Surveillance Video.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Towards Interpretable Deep Networks for Monocular Depth Estimation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

LapsCore: Language-guided Person Search via Color Reasoning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Trash to Treasure: Harvesting OOD Data with Cross-Modal Matching for Open-Set Semi-Supervised Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Bottom-Up Shift and Reasoning for Referring Image Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Robust and Online Vehicle Counting at Crowded Intersections.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Scene-Intuitive Agent for Remote Embodied Visual Grounding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

SYSU-HCP at VQA-Med 2021: A Data-centric Model with Efficient Training Methodology for Medical Visual Question Answering.
Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum, Bucharest, Romania, September 21st - to, 2021

Multi-Layer Networks for Ensemble Precipitation Forecasts Postprocessing.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Online Alternate Generator Against Adversarial Attacks.
IEEE Trans. Image Process., 2020

Self-Enhanced Convolutional Network for Facial Video Hallucination.
IEEE Trans. Image Process., 2020

ROSA: Robust Salient Object Detection Against Adversarial Attacks.
IEEE Trans. Cybern., 2020

Face Hallucination by Attentive Sequence Optimization with Reinforcement Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Lightweight adversarial network for salient object detection.
Neurocomputing, 2020

Semantics-aware Adaptive Knowledge Distillation for Sensor-to-Vision Action Recognition.
CoRR, 2020

Depthwise Non-local Module for Fast Salient Object Detection Using a Single Thread.
CoRR, 2020

Physical-Virtual Collaboration Graph Network for Station-Level Metro Ridership Prediction.
CoRR, 2020

Modularized Framework with Category-Sensitive Abnormal Filter for City Anomaly Detection.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Efficient Crowd Counting via Structured Knowledge Transfer.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Active Object Search.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Adaptive Context Selection for Polyp Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Collaborative Training Between Region Proposal Localization and Classification for Domain Adaptive Object Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

Propagating Over Phrase Relations for One-Stage Visual Grounding.
Proceedings of the Computer Vision - ECCV 2020, 2020

Peeking into Occluded Joints: A Novel Framework for Crowd Pose Estimation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Linguistic Structure Guided Context Modeling for Referring Image Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

MetaSelection: Metaheuristic Sub-Structure Selection for Neural Network Pruning Using Evolutionary Algorithm.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, August 29 - September 8, 2020, 2020

Graph-Structured Referring Expression Reasoning in the Wild.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

A Real-Time Cross-Modality Correlation Filtering Method for Referring Expression Comprehension.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Multi-Granularity Tracking with Modularlized Components for Unsupervised Vehicles Anomaly Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Referring Image Segmentation via Cross-Modal Progressive Comprehension.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

HCP-MIC at VQA-Med 2020: Effective Visual Representation for Medical Visual Question Answering.
Proceedings of the Working Notes of CLEF 2020, 2020

An Adversarial Perturbation Oriented Domain Adaptation Approach for Semantic Segmentation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Tree-Structured Policy Based Progressive Reinforcement Learning for Temporally Language Grounding in Video.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Knowledge Graph Transfer Network for Few-Shot Recognition.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Harvesting Visual Objects from Internet Images via Deep-Learning-Based Objectness Assessment.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Facial Landmark Machines: A Backbone-Branches Architecture With Progressive Representation Learning.
IEEE Trans. Multim., 2019

Contextualized Spatial-Temporal Network for Taxi Origin-Destination Demand Prediction.
IEEE Trans. Intell. Transp. Syst., 2019

Cross-Modal Attentional Context Learning for RGB-D Object Detection.
IEEE Trans. Image Process., 2019

Context-Aware Semantic Inpainting.
IEEE Trans. Cybern., 2019

ACFM: A Dynamic Spatial-Temporal Network for Traffic Prediction.
CoRR, 2019

Automatic Color Sketch Generation Using Deep Style Transfer.
IEEE Computer Graphics and Applications, 2019

Attention Embedded Spatio-Temporal Network for Video Salient Object Detection.
IEEE Access, 2019

Simultaneous Lung Field Detection and Segmentation for Pediatric Chest Radiographs.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Globally Guided Progressive Fusion Network for 3D Pancreas Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

3D Enhanced Multi-scale Network for Thoracic Organs Segmentation.
Proceedings of the 2019 Challenge on Segmentation of THoracic Organs at Risk in CT Images, 2019

Lightweight Contrast Modeling for Attention-Aware Visual Localization.
Proceedings of the International Conference on Robotics and Automation, 2019

Multivariate-Information Adversarial Ensemble for Scalable Joint Distribution Matching.
Proceedings of the 36th International Conference on Machine Learning, 2019

Crowd Counting via Multi-view Scale Aggregation Networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Taxi Origin-Destination Demand Prediction with Contextualized Spatial-Temporal Network.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Dynamic Graph Attention for Referring Expression Comprehension.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Semi-Supervised Video Salient Object Detection Using Pseudo-Labels.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Larger Norm More Transferable: An Adaptive Feature Norm Approach for Unsupervised Domain Adaptation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Crowd Counting With Deep Structured Scale Integration Network.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Motion Guided Attention for Video Salient Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Semi-Supervised Skin Detection by Network With Mutual Guidance.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Cross-Modal Relationship Inference for Grounding Referring Expressions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

ClusterNet: Deep Hierarchical Cluster Network With Rigorously Rotation-Invariant Representation for Point Cloud Analysis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

CamDrop: A New Explanation of Dropout and A Guided Regularization Method for Deep Neural Networks.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Instruction-guided object detection.
Proceedings of the ACM Turing Celebration Conference - China, 2019

Semantic Relationships Guided Representation Learning for Facial Action Unit Recognition.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Non-Local Context Encoder: Robust Biomedical Image Segmentation against Adversarial Attacks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

FRAME Revisited: An Interpretation View Based on Particle Evolution.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Contrast-Oriented Deep Neural Networks for Salient Object Detection.
IEEE Trans. Neural Networks Learn. Syst., 2018

Learning deep representations for semantic image parsing: a comprehensive overview.
Frontiers Comput. Sci., 2018

Deep RBFNet: Point Cloud Feature Learning using Radial Basis Functions.
CoRR, 2018

Unsupervised Domain Adaptation: An Adaptive Feature Norm Approach.
CoRR, 2018

Attentive Crowd Flow Machines.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Non-locally Enhanced Encoder-Decoder Network for Single Image De-raining.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Crowd Counting using Deep Recurrent Spatial-Aware Network.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Interpretable Video Captioning via Trajectory Structured Localization.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Flow Guided Recurrent Neural Encoder for Video Salient Object Detection.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Visual Question Reasoning on General Dependency Tree.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Facial Landmark Localization in the Wild by Backbone-Branches Representation Learning.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Weakly Supervised Salient Object Detection Using Image Labels.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Recurrent Attentional Reinforcement Learning for Multi-Label Image Recognition.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Automatic Colorization with Improved Spatial Coherence and Boundary Localization.
J. Comput. Sci. Technol., 2017

Context-Aware Semantic Inpainting.
CoRR, 2017

ColorSketch: A Drawing Assistant for Generating Color Sketches from Photos.
IEEE Computer Graphics and Applications, 2017

Multi-label Image Recognition by Recurrently Discovering Attentional Regions.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Instance-Level Salient Object Segmentation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Attention-Aware Face Hallucination via Deep Reinforcement Learning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Deep saliency detection and color sketch generation.
PhD thesis, 2016

Visual Saliency Detection Based on Multiscale Deep CNN Features.
IEEE Trans. Image Process., 2016

Deep Contrast Learning for Salient Object Detection.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Visual saliency based on multiscale deep features.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Weighted attentional blocks for probabilistic object tracking.
Vis. Comput., 2014

2012
Online boosted tracking with discriminative feature selection and scale adaptation.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012


  Loading...