Guosheng Lin

Orcid: 0000-0002-0329-7458

According to our database, Guosheng Lin authored at least 225 papers between 2012 and 2025.

Bibliography

2025
NVDS+: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2025

2024
Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Self-Supervised 3D Scene Flow Estimation and Motion Prediction Using Local Rigidity Prior.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Learning Temporal Variations for 4D Point Cloud Segmentation.
Int. J. Comput. Vis., December, 2024

ManiCLIP: Multi-attribute Face Manipulation from Text.
Int. J. Comput. Vis., October, 2024

An Adaptive Correlation Filtering Method for Text-Based Person Search.
Int. J. Comput. Vis., October, 2024

Robust-EQA: Robust Learning for Embodied Question Answering With Noisy Labels.
IEEE Trans. Neural Networks Learn. Syst., September, 2024

Towards Robust Monocular Depth Estimation: A New Baseline and Benchmark.
Int. J. Comput. Vis., July, 2024

Indoor Smartphone SLAM With Acoustic Echoes.
IEEE Trans. Mob. Comput., June, 2024

Dense Supervision Propagation for Weakly Supervised Semantic Segmentation on 3D Point Clouds.
IEEE Trans. Circuits Syst. Video Technol., June, 2024

Reliability-Adaptive Consistency Regularization for Weakly-Supervised Point Cloud Segmentation.
Int. J. Comput. Vis., June, 2024

Harmonizing Base and Novel Classes: A Class-Contrastive Approach for Generalized Few-Shot Segmentation.
Int. J. Comput. Vis., April, 2024

LCReg: Long-tailed image classification with Latent Categories based Recognition.
Pattern Recognit., January, 2024

Neural Logic Vision Language Explainer.
IEEE Trans. Multim., 2024

ViTA: Video Transformer Adaptor for Robust Video Depth Estimation.
IEEE Trans. Multim., 2024

A Unified Transformer Framework for Group-Based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection.
IEEE Trans. Multim., 2024

CMNet: Component-Aware Matching Network for Few-Shot Point Cloud Classification.
IEEE Trans. Multim., 2024

Neural Radiance Selector: Find the best 2D representations of 3D data for CLIP based 3D tasks.
Knowl. Based Syst., 2024

Meta-Exploiting Frequency Prior for Cross-Domain Few-Shot Learning.
CoRR, 2024

Hybrid Mamba for Few-Shot Segmentation.
CoRR, 2024

High Quality Human Image Animation using Regional Supervision and Motion Blur Condition.
CoRR, 2024

MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior.
CoRR, 2024

Prim2Room: Layout-Controllable Room Mesh Generation from Primitives.
CoRR, 2024

3D-VirtFusion: Synthetic 3D Data Augmentation through Generative Diffusion Models and Controllable Editing.
CoRR, 2024

MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization.
CoRR, 2024

MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers.
CoRR, 2024

OrientDream: Streamlining Text-to-3D Generation with Explicit Orientation Control.
CoRR, 2024

Text-to-Image Rectified Flow as Plug-and-Play Priors.
CoRR, 2024

Sync4D: Video Guided Controllable Dynamics for Physics-Based 4D Generation.
CoRR, 2024

PS-CAD: Local Geometry Guidance via Prompting and Selection for CAD Reconstruction.
CoRR, 2024

Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion.
CoRR, 2024

Style-Consistent 3D Indoor Scene Synthesis with Decoupled Objects.
CoRR, 2024

HMR-Adapter: A Lightweight Adapter with Dual-Path Cross Augmentation for Expressive Human Mesh Recovery.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Rethinking the Effect of Uninformative Class Name in Prompt Learning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

iControl3D: An Interactive System for Controllable 3D Scene Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Sentiment-oriented Sarcasm Integration for Video Sentiment Analysis Enhancement with Sarcasm Assistance.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency.
Proceedings of the Computer Vision - ECCV 2024, 2024

Learn to Optimize Denoising Scores: A Unified and Improved Diffusion Prior for 3D Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Eliminating Feature Ambiguity for Few-Shot Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

3DFG-PIFu: 3D Feature Grids for Human Digitization from Sparse Views.
Proceedings of the Computer Vision - ECCV 2024, 2024

AttriHuman-3D: Editable 3D Human Avatar Generation with Attribute Decomposition and Indexing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Self-Supervised Class-Agnostic Motion Prediction with Spatial and Temporal Consistency Regularizations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

REACTO: Reconstructing Articulated Objects from a Single Video.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

S-DyRF: Reference-Based Stylized Radiance Fields for Dynamic Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

R-Cyclic Diffuser: Reductive and Cyclic Latent Diffusion for 3D Clothed Human Digitalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Diverse and Stable 2D Diffusion Guided Text to 3D Generation with Noise Recalibration.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Semi-supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

IT3D: Improved Text-to-3D Generation with Explicit View Synthesis.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Fine Structure-Aware Sampling: A New Sampling Training Scheme for Pixel-Aligned Implicit Models in Single-View Human Reconstruction.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Temporal Feature Matching and Propagation for Semantic Segmentation on 3D Point Cloud Sequences.
IEEE Trans. Circuits Syst. Video Technol., December, 2023

Contrastive Generative Network with Recursive-Loop for 3D point cloud generalized zero-shot classification.
Pattern Recognit., December, 2023

Self-Training Vision Language BERTs With a Unified Conditional Model.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

Unsupervised 3D Pose Transfer With Cross Consistency and Dual Reconstruction.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Depth and Video Segmentation Based Visual Attention for Embodied Question Answering.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

DeepEMD: Differentiable Earth Mover's Distance for Few-Shot Learning.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Improving Tail-Class Representation with Centroid Contrastive Learning.
Pattern Recognit. Lett., April, 2023

Learning Structural Representations for Recipe Generation and Food Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Guided by Meta-Set: A Data-Driven Method for Fine-Grained Visual Recognition.
IEEE Trans. Multim., 2023

Effective End-to-End Vision Language Pretraining With Semantic Visual Loss.
IEEE Trans. Multim., 2023

Few-Shot Segmentation With Optimal Transport Matching and Message Flow.
IEEE Trans. Multim., 2023

Cross-Image Region Mining With Region Prototypical Network for Weakly Supervised Segmentation.
IEEE Trans. Multim., 2023

Semantic Consistent Embedding for Domain Adaptive Zero-Shot Learning.
IEEE Trans. Image Process., 2023

Efficient Few-Shot Object Detection via Knowledge Inheritance.
IEEE Trans. Image Process., 2023

Single-View 3D Mesh Reconstruction for Seen and Unseen Categories.
IEEE Trans. Image Process., 2023

Learn to Optimize Denoising Scores for 3D Generation: A Unified and Improved Diffusion Prior on NeRF and 3D Gaussian Splatting.
CoRR, 2023

Few-shot Image Generation via Style Adaptation and Content Preservation.
CoRR, 2023

SARA: Controllable Makeup Transfer with Spatial Alignment and Region-Adaptive Normalization.
CoRR, 2023

DI-Net: Decomposed Implicit Garment Transfer Network for Digital Clothed 3D Human.
CoRR, 2023

Learning-Based Biharmonic Augmentation for Point Cloud Classification.
CoRR, 2023

Leveraging Large-Scale Pretrained Vision Foundation Models for Label-Efficient 3D Point Cloud Segmentation.
CoRR, 2023

Neural Vector Fields: Generalizing Distance Vector Fields by Codebooks and Zero-Curl Regularization.
CoRR, 2023

Improving Video Violence Recognition with Human Interaction Learning on 3D Skeleton Point Clouds.
CoRR, 2023

IT3D: Improved Text-to-3D Generation with Explicit View Synthesis.
CoRR, 2023

StableLLaVA: Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data.
CoRR, 2023

Unlimited Knowledge Distillation for Action Recognition in the Dark.
CoRR, 2023

Weakly Supervised 3D Instance Segmentation without Instance-level Annotations.
CoRR, 2023

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation.
CoRR, 2023

OR-NeRF: Object Removing from 3D Scenes Guided by Multiview Segmentation with Neural Radiance Fields.
CoRR, 2023

MoDA: Modeling Deformable 3D Objects from Casual Videos.
CoRR, 2023

StarNet: Style-Aware 3D Point Cloud Generation.
CoRR, 2023

Toward Re-Identifying Any Animal.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Make-It-4D: Synthesizing a Consistent Long-Term Dynamic Scene Video from a Single Image.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Iterative Refinement for Multi-Source Visual Domain Adaptation (Extended abstract).
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Self-Calibrated Cross Attention Network for Few-Shot Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Collaborative Propagation on Multiple Instance Graphs for 3D Instance Segmentation with Single-point Supervision.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Neural Video Depth Stabilizer.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Label-Guided Knowledge Distillation for Continual Semantic Segmentation on 2D Images and 3D Point Clouds.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Neural Vector Fields: Implicit Representation by Explicit Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

3D Cinemagraphy from a Single Image.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Weakly Supervised Class-agnostic Motion Prediction for Autonomous Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Iterative Refinement for Multi-Source Visual Domain Adaptation.
IEEE Trans. Knowl. Data Eng., 2022

Cross-Modal Graph With Meta Concepts for Video Captioning.
IEEE Trans. Image Process., 2022

Dense Semantics-Assisted Networks for Video Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2022

Decomposing generation networks with structure prediction for recipe generation.
Pattern Recognit., 2022

Feature flow: In-network feature flow estimation for video object detection.
Pattern Recognit., 2022

Tackling background ambiguities in multi-class few-shot point cloud semantic segmentation.
Knowl. Based Syst., 2022

Online Active Proposal Set Generation for weakly supervised object detection.
Knowl. Based Syst., 2022

Learning language to symbol and language to vision mapping for visual grounding.
Image Vis. Comput., 2022

CRCNet: Few-Shot Segmentation with Cross-Reference and Region-Global Conditional Networks.
Int. J. Comput. Vis., 2022

Generalizable Person Re-Identification via Viewpoint Alignment and Fusion.
CoRR, 2022

ManiCLIP: Multi-Attribute Face Manipulation from Text.
CoRR, 2022

RWSeg: Cross-graph Competing Random Walks for Weakly Supervised 3D Instance Segmentation.
CoRR, 2022

ZeroMesh: Zero-shot Single-view 3D Mesh Reconstruction.
CoRR, 2022

3D Cartoon Face Generation with Controllable Expressions from a Single GAN Image.
CoRR, 2022

Learning Spatial and Temporal Variations for 4D Point Cloud Segmentation.
CoRR, 2022

Long-tailed Recognition by Learning from Latent Categories.
CoRR, 2022

Efficient Few-Shot Object Detection via Knowledge Inheritance.
CoRR, 2022

A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection.
CoRR, 2022

Indoor Smartphone SLAM with Learned Echoic Location Features.
Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems, 2022

S-PIFu: Integrating Parametric Human Models with PIFu for Single-view Clothed Human Reconstruction.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Few-shot Open-set Recognition Using Background as Unknowns.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Dual Adaptive Transformations for Weakly Supervised Point Cloud Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

General Object Pose Transformation Network from Unpaired Data.
Proceedings of the Computer Vision - ECCV 2022, 2022

Dynamically Transformed Instance Normalization Network for Generalizable Person Re-Identification.
Proceedings of the Computer Vision - ECCV 2022, 2022

Learning Regional Purity for Instance Segmentation on 3D Point Clouds.
Proceedings of the Computer Vision - ECCV 2022, 2022

IntegratedPIFu: Integrated Pixel Aligned Implicit Function for Single-View Human Reconstruction.
Proceedings of the Computer Vision - ECCV 2022, 2022

Expanding Large Pre-trained Unimodal Models with Multimodal Information Injection for Image-Text Multimodal Classification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

RigidFlow: Self-Supervised Scene Flow Learning on Point Clouds by Local Rigidity Prior.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Weakly Supervised Segmentation on Outdoor 4D point clouds with Temporal Matching and Spatial Graph Propagation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SymmNeRF: Learning to Explore Symmetry Prior for Single-View View Synthesis.
Proceedings of the Computer Vision - ACCV 2022, 2022

Self-Supervised Object Localization with Joint Graph Partition.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Point Discriminative Learning for Data-efficient 3D Point Cloud Analysis.
Proceedings of the International Conference on 3D Vision, 2022

2021
CycleSegNet: Object Co-Segmentation With Cycle Refinement and Region Correspondence.
IEEE Trans. Image Process., 2021

Progressive Self-Guided Loss for Salient Object Detection.
IEEE Trans. Image Process., 2021

On Lightweight Privacy-preserving Collaborative Learning for Internet of Things by Independent Random Projections.
ACM Trans. Internet Things, 2021

Weakly-Supervised Cross-Domain Road Scene Segmentation via Multi-Level Curriculum Adaptation.
IEEE Trans. Circuits Syst. Video Technol., 2021

Guided Co-Segmentation Network for Fast Video Object Segmentation.
IEEE Trans. Circuits Syst. Video Technol., 2021

Graph neural network for 6D object pose estimation.
Knowl. Based Syst., 2021

Few-shot fine-grained classification with Spatial Attentive Comparison.
Knowl. Based Syst., 2021

CNN-Based RGB-D Salient Object Detection: Learn, Select, and Fuse.
Int. J. Comput. Vis., 2021

Calibrating Class Activation Maps for Long-Tailed Visual Recognition.
CoRR, 2021

Few-Shot Segmentation with Global and Local Contrastive Learning.
CoRR, 2021

M2IOSR: Maximal Mutual Information Open Set Recognition.
CoRR, 2021

Point Discriminative Learning for Unsupervised Representation Learning on 3D Point Clouds.
CoRR, 2021

Remember What You have drawn: Semantic Image Manipulation with Memory.
CoRR, 2021

CycleSegNet: Object Co-segmentation with Cycle Refinement and Region Correspondence.
CoRR, 2021

3D Pose Transfer with Correspondence Learning and Mesh Refinement.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

MV-TON: Memory-based Video Virtual Try-on network.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Cycle-Consistent Inverse GAN for Text-to-Image Synthesis.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Modeling the Uncertainty for Self-supervised 3D Skeleton Action Representation Learning.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Learning Meta-class Memory for Few-Shot Semantic Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Context Decoupling Augmentation for Weakly Supervised Semantic Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Self-supervised 3D Skeleton Action Representation Learning with Motion Consistency and Continuity.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Attention is not Enough: Mitigating the Distribution Discrepancy in Asynchronous Multimodal Sequence Fusion.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Meta Navigator: Search for a Good Adaptation Policy for Few-shot Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Few-Shot Incremental Learning With Continually Evolved Classifiers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

CT-Net: Complementary Transfering Network for Garment Transfer With Arbitrary Geometric Changes.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Progressive Modality Reinforcement for Human Multimodal Emotion Recognition From Unaligned Multimodal Sequences.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Self-Point-Flow: Self-Supervised Scene Flow Estimation From Point Clouds With Optimal Transport and Random Walk.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

HCRF-Flow: Scene Flow From Point Clouds With Continuous High-Order CRFs and Position-Aware Flow Embedding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
A Dilated Inception Network for Visual Saliency Prediction.
IEEE Trans. Multim., 2020

Video Object Segmentation and Tracking: A Survey.
ACM Trans. Intell. Syst. Technol., 2020

RGBD Salient Object Detection via Disentangled Cross-Modal Fusion.
IEEE Trans. Image Process., 2020

Motion Context Network for Weakly Supervised Object Detection in Videos.
IEEE Signal Process. Lett., 2020

RefineNet: Multi-Path Refinement Networks for Dense Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Bottom-Up Scene Text Detection with Markov Clustering Networks.
Int. J. Comput. Vis., 2020

Compositional Prototype Network with Multi-view Comparision for Few-Shot Point Cloud Semantic Segmentation.
CoRR, 2020

LAGNet: Logic-Aware Graph Network for Human Interaction Understanding.
CoRR, 2020

Open Set Recognition with Conditional Probabilistic Generative Models.
CoRR, 2020

Decomposed Generation Networks with Structure Prediction for Recipe Generation from Food Images.
CoRR, 2020

Weakly Supervised Segmentation with Maximum Bipartite Graph Matching.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Splitting Vs. Merging: Mining Object Regions with Discrepancy and Intersection Loss for Weakly Supervised Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

TRRNet: Tiered Relation Reasoning for Compositional Visual Question Answering.
Proceedings of the Computer Vision - ECCV 2020, 2020

Structure-Aware Generation Network for Recipe Generation from Images.
Proceedings of the Computer Vision - ECCV 2020, 2020

Human Interaction Learning on 3D Skeleton Point Clouds for Video Violence Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020

Graph Edit Distance Reward: Learning to Edit Scene Graph.
Proceedings of the Computer Vision - ECCV 2020, 2020

DeepEMD: Few-Shot Image Classification With Differentiable Earth Mover's Distance and Structured Classifiers.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Exploring Bottom-Up and Top-Down Cues With Attentive Learning for Webly Supervised Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Multi-Path Region Mining for Weakly Supervised 3D Semantic Segmentation on Point Clouds.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

SpSequenceNet: Semantic Segmentation Network on 4D Point Clouds.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Cross-Domain Semantic Segmentation via Domain-Invariant Interactive Relation Transfer.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

CRNet: Cross-Reference Networks for Few-Shot Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Decoupled Spatial Neural Attention for Weakly Supervised Semantic Segmentation.
IEEE Trans. Multim., 2019

Semantics-Aware Visual Object Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2019

Keypoint based weakly supervised human parsing.
Image Vis. Comput., 2019

Local fusion networks with chained residual pooling for video action recognition.
Image Vis. Comput., 2019

On Lightweight Privacy-Preserving Collaborative Learning for IoT Objects.
CoRR, 2019

Task-in-all Domain Adaptation for Semantic Segmentation.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

Semantic Segmentation via Domain Adaptation with Global Structure Embedding.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

M2E-Try On Net: Fashion from Model to Everyone.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

On lightweight privacy-preserving collaborative learning for internet-of-things objects.
Proceedings of the International Conference on Internet of Things Design and Implementation, 2019

Pyramid Graph Networks With Connection Attentions for Region-Based One-Shot Semantic Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

SegEQA: Video Segmentation Based Visual Attention for Embodied Question Answering.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

CANet: Class-Agnostic Segmentation Networks With Iterative Refinement and Attentive Few-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Towards Robust Curve Text Detection With Conditional Spatial Expansion.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Structured Learning of Tree Potentials in CRF for Image Segmentation.
IEEE Trans. Neural Networks Learn. Syst., 2018

Crowd Counting via Weighted VLAD on a Dense Attribute Feature Map.
IEEE Trans. Circuits Syst. Video Technol., 2018

Efficient dense labelling of human activity sequences from wearables using fully convolutional networks.
Pattern Recognit., 2018

Exploring Context with Deep Structured Models for Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Correlation Propagation Networks for Scene Text Detection.
CoRR, 2018


Domain Adaptive Semantic Segmentation Through Structure Enhancement.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

MoNet: Deep Motion Exploitation for Video Object Segmentation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Bootstrapping the Performance of Webly Supervised Semantic Segmentation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning Markov Clustering Networks for Scene Text Detection.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Discriminative Training of Deep Fully Connected Continuous CRFs With Task-Specific Loss.
IEEE Trans. Image Process., 2017

Structured Learning of Binary Codes with Column Generation for Optimizing Ranking Measures.
Int. J. Comput. Vis., 2017

Efficient Dense Labeling of Human Activity Sequences from Wearables using Fully Convolutional Networks.
CoRR, 2017

Learning Multi-level Region Consistency with Dense Multi-label Networks for Semantic Segmentation.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

RefineNet: Multi-path Refinement Networks for High-Resolution Semantic Segmentation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Sequential Person Recognition in Photo Albums with a Recurrent Network.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Weakly Supervised Semantic Segmentation Based on Co-segmentation.
Proceedings of the British Machine Vision Conference 2017, 2017

2016
Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Crowd Counting via Weighted VLAD on Dense Attribute Feature Maps.
CoRR, 2016

Discriminative Training of Deep Fully-connected Continuous CRF with Task-specific Loss.
CoRR, 2016

Structured Learning of Binary Codes with Column Generation.
CoRR, 2016

Fast Training of Triplet-Based Deep Binary Embedding Networks.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
CRF learning with CNN features for image segmentation.
Pattern Recognit., 2015

Supervised Hashing Using Graph Cuts and Boosted Decision Trees.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Deeply Learning the Messages in Message Passing Inference.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Sequence searching with deep-learnt depth for condition- and viewpoint-invariant route-based place recognition.
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015

Deep convolutional neural fields for depth estimation from a single image.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
StructBoost: Boosting Methods for Predicting Structured Output Variables.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Optimizing Ranking Measures for Compact Binary Code Learning.
Proceedings of the Computer Vision - ECCV 2014, 2014

Fast Supervised Hashing with Decision Trees for High-Dimensional Data.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Learning Hash Functions Using Column Generation.
Proceedings of the 30th International Conference on Machine Learning, 2013

Approximate constraint generation for efficient structured boosting.
Proceedings of the IEEE International Conference on Image Processing, 2013

A General Two-Step Approach to Learning-Based Hashing.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2012
Fast Training of Effective Multi-class Boosting Using Coordinate Descent Optimization.
Proceedings of the Computer Vision, 2012

