Liang Lin

Orcid: 0000-0003-2248-3755

According to our database1, Liang Lin authored at least 523 papers between 2006 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
A novel linear discriminant analysis based on alternate ratio sum minimization.
Inf. Sci., 2025

2024
Exploration and Exploitation of Unlabeled Data for Open-Set Semi-supervised Learning.
Int. J. Comput. Vis., December, 2024

Heterogeneous Semantic Transfer for Multi-label Recognition with Partial Labels.
Int. J. Comput. Vis., December, 2024

Routing User-Interest Markov Tree for Scalable Personalized Knowledge-Aware Recommendation.
IEEE Trans. Neural Networks Learn. Syst., October, 2024

Template-Based Contrastive Distillation Pretraining for Math Word Problem Solving.
IEEE Trans. Neural Networks Learn. Syst., September, 2024

Contrastive Transformer Learning With Proximity Data Generation for Text-Based Person Search.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

TIP-Editor: An Accurate 3D Editor Following Both Text-Prompts And Image-Prompts.
ACM Trans. Graph., July, 2024

An advanced subway train localization system using a vision-based kilometer marker recognition-assisted multi-sensor fusion method.
Int. J. Gen. Syst., July, 2024

Accelerating Massively Distributed Deep Learning Through Efficient Pseudo-Synchronous Update Method.
Int. J. Parallel Program., June, 2024

Generic Sensitivity: Generics-Guided Context Sensitivity for Pointer Analysis.
IEEE Trans. Software Eng., May, 2024

DNA Family: Boosting Weight-Sharing NAS With Block-Wise Supervisions.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Prototypical Graph Contrastive Learning.
IEEE Trans. Neural Networks Learn. Syst., February, 2024

Hand Gesture Authentication by Discovering Fine-Grained Spatiotemporal Identity Characteristics.
IEEE Trans. Circuits Syst. Video Technol., January, 2024

A Prior Guided Wavelet-Spatial Dual Attention Transformer Framework for Heavy Rain Image Restoration.
IEEE Trans. Multim., 2024

Dual-View Data Hallucination With Semantic Relation Guidance for Few-Shot Image Recognition.
IEEE Trans. Multim., 2024

Improving Network Interpretability via Explanation Consistency Evaluation.
IEEE Trans. Multim., 2024

Extensible Max-Min Collaborative Retention for Online Mini-Batch Learning Hash Retrieval.
IEEE Trans. Multim., 2024

NiteDR: Nighttime Image De-Raining With Cross-View Sensor Cooperative Learning for Dynamic Driving Scenes.
IEEE Trans. Multim., 2024

Category-Adaptive Label Discovery and Noise Rejection for Multi-Label Recognition With Partial Positive Labels.
IEEE Trans. Multim., 2024

Adaptive Global-Local Representation Learning and Selection for Cross-Domain Facial Expression Recognition.
IEEE Trans. Multim., 2024

Multi-Person 3D Pose Estimation With Occlusion Reasoning.
IEEE Trans. Multim., 2024

DPHANet: Discriminative Parallel and Hierarchical Attention Network for Natural Language Video Localization.
IEEE Trans. Multim., 2024

Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation.
IEEE Trans. Image Process., 2024

Uncertainty-Aware Active Domain Adaptive Salient Object Detection.
IEEE Trans. Image Process., 2024

Dynamic Correlation Learning and Regularization for Multi-Label Confidence Calibration.
IEEE Trans. Image Process., 2024

IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2024

Diff-Mosaic: Augmenting Realistic Representations in Infrared Small Target Detection via Diffusion Prior.
IEEE Trans. Geosci. Remote. Sens., 2024

SIRST-5K: Exploring Massive Negatives Synthesis With Self-Supervised Learning for Robust Infrared Small Target Detection.
IEEE Trans. Geosci. Remote. Sens., 2024

An Introspective Data Augmentation Method for Training Math Word Problem Solvers.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

MirrorDiffusion: Stabilizing Diffusion Process in Zero-Shot Image Translation by Prompts Redescription and Beyond.
IEEE Signal Process. Lett., 2024

A multi-aware graph convolutional network for driver drowsiness detection.
Knowl. Based Syst., 2024

Dual-perspective semantic-aware representation blending for multi-label image recognition with partial labels.
Expert Syst. Appl., 2024

Integration of Communication and Computational Imaging.
CoRR, 2024

All Robots in One: A New Standard and Unified Dataset for Versatile, General-Purpose Embodied Agents.
CoRR, 2024

High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model.
CoRR, 2024

Style-Preserving Lip Sync via Audio-Aware Style Reference.
CoRR, 2024

VideoQA in the Era of LLMs: An Empirical Study.
CoRR, 2024

CrossDehaze: Scaling Up Image Dehazing with Cross-Data Vision Alignment and Augmentation.
CoRR, 2024

Fuse, Reason and Verify: Geometry Problem Solving with Parsed Clauses from Diagram.
CoRR, 2024

Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI.
CoRR, 2024

Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification.
CoRR, 2024

Quantifying In-Context Reasoning Effects and Memorization Effects in LLMs.
CoRR, 2024

Two-Phase Dynamics of Interactions Explains the Starting Point of a DNN Learning Over-Fitted Features.
CoRR, 2024

Fine-grained Spatial-temporal MLP Architecture for Metro Origin-Destination Prediction.
CoRR, 2024

ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning.
CoRR, 2024

Towards Better Adversarial Purification via Adversarial Denoising Diffusion Training.
CoRR, 2024

AnimateZoo: Zero-shot Video Generation of Cross-Species Animation via Subject Alignment.
CoRR, 2024

AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis.
CoRR, 2024

Multimodal Embodied Interactive Agent for Cafe Scene.
CoRR, 2024

Mirror Gradient: Towards Robust Multimodal Recommender Systems via Exploring Flat Local Minima.
Proceedings of the ACM on Web Conference 2024, 2024

Flux: Decoupled Auto-Scaling for Heterogeneous Query Workload in Alibaba AnalyticDB.
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

Exploring Out-of-Distribution Scene Text Recognition for Driving Scenes with Hybrid Test-Time Adaptation.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Diversity Matters: User-Centric Multi-Interest Learning for Conversational Movie Recommendation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Self-Supervised Emotion Representation Disentanglement for Speech-Preserving Facial Expression Manipulation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Decoder-Only LLMs are Better Controllers for Diffusion Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Kepler codebook.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

AttNS: Attention-Inspired Numerical Solving For Limited Data Scenarios.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Lottery Ticket Hypothesis for Attention Mechanism in Residual Convolutional Neural Network<sup>*</sup>.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Credible Teacher for Semi-Supervised Object Detection in Open Scene.
Proceedings of the IEEE International Conference on Acoustics, 2024

Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

Stripe Observation Guided Inference Cost-Free Attention Mechanism.
Proceedings of the Computer Vision - ECCV 2024, 2024

WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Learning Adaptive Spatial Coherent Correlations for Speech-Preserving Facial Expression Manipulation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HyCoRec: Hypergraph-Enhanced Multi-Preference Learning for Alleviating Matthew Effect in Conversational Recommendation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

VisDiaHalBench: A Visual Dialogue Benchmark For Diagnosing Hallucination in Large Vision-Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

FacetCRS: Multi-Faceted Preference Learning for Pricking Filter Bubbles in Conversational Recommender System.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Diagnosing and Rectifying Fake OOD Invariance: A Restructured Causal Approach.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
IRA-FSOD: Instant-Response and Accurate Few-Shot Object Detector.
IEEE Trans. Circuits Syst. Video Technol., November, 2023

Towards Causality-Aware Inferring: A Sequential Discriminative Approach for Medical Diagnosis.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Discourse-Aware Graph Networks for Textual Logical Reasoning.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Aerial Images Meet Crowdsourced Trajectories: A New Approach to Robust Road Extraction.
IEEE Trans. Neural Networks Learn. Syst., July, 2023

Learning image blind denoisers without explicit noise modeling.
Multim. Tools Appl., July, 2023

Urban regional function guided traffic flow prediction.
Inf. Sci., July, 2023

Reward-Adaptive Reinforcement Learning: Dynamic Policy Gradient Optimization for Bipedal Locomotion.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Fashion Retrieval via Graph Reasoning Networks on a Similarity Pyramid.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Online Metro Origin-Destination Prediction via Heterogeneous Information Aggregation.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Semantic representation and dependency learning for multi-label image recognition.
Neurocomputing, March, 2023

OccluMix: Towards De-Occlusion Virtual Try-on by Semantically-Guided Mixup.
IEEE Trans. Multim., 2023

Multi-Stage Spatio-Temporal Aggregation Transformer for Video Person Re-Identification.
IEEE Trans. Multim., 2023

Fine-Grained Face Editing via Personalized Spatial-Aware Affine Modulation.
IEEE Trans. Multim., 2023

Real-World Image Super-Resolution by Exclusionary Dual-Learning.
IEEE Trans. Multim., 2023

Graph-Convolved Factorization Machines for Personalized Recommendation.
IEEE Trans. Knowl. Data Eng., 2023

Taylor Neural Network for Real-World Image Super-Resolution.
IEEE Trans. Image Process., 2023

Hybrid-Order Representation Learning for Electricity Theft Detection.
IEEE Trans. Ind. Informatics, 2023

Anser: Adaptive Information Sharing Framework of AnalyticDB.
Proc. VLDB Endow., 2023

ADASR: An Adversarial Auto-Augmentation Framework for Hyperspectral and Multispectral Data Fusion.
IEEE Geosci. Remote. Sens. Lett., 2023

Object-aware navigation for remote embodied visual referring expression.
Neurocomputing, 2023

SQLNet: Scale-Modulated Query and Localization Network for Few-Shot Class-Agnostic Counting.
CoRR, 2023

VisualProg Distiller: Learning to Fine-tune Non-differentiable Visual Programming Frameworks.
CoRR, 2023

Towards Real-World Burst Image Super-Resolution: Benchmark and Method.
CoRR, 2023

Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMs.
CoRR, 2023

CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning.
CoRR, 2023

Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models.
CoRR, 2023

Visual Causal Scene Refinement for Video Question Answering.
CoRR, 2023

Causality-aware Visual Scene Discovery for Cross-Modal Question Reasoning.
CoRR, 2023

ASR: Attention-alike Structural Re-parameterization.
CoRR, 2023

Open-World Pose Transfer via Sequential Test-Time Adaption.
CoRR, 2023

Urban Regional Function Guided Traffic Flow Prediction.
CoRR, 2023

Visual-Linguistic Causal Intervention for Radiology Report Generation.
CoRR, 2023

On Robust Numerical Solver for ODE via Self-Attention Mechanism.
CoRR, 2023

Research on Digital Monitoring Technology for Airport High-Pressure Rotary Jet Piles.
IEEE Access, 2023

DreamEditor: Text-Driven 3D Scene Editing with Neural Fields.
Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

FIRE: Fine Implicit Reconstruction Enhancement with Detailed Body Part Labels and Geometric Features.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

RecFormer: Recurrent Multi-modal Transformer with History-Aware Contrastive Learning for Visual Dialog.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Ego Lane Estimation Using Visual Information and High Definition Map.
Proceedings of the IEEE/ION Position, Location and Navigation Symposium, 2023

A low-cost lane-level navigation algorithm based on visual information.
Proceedings of the IEEE/ION Position, Location and Navigation Symposium, 2023

ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Visual Causal Scene Refinement for Video Question Answering.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Long-term Wind Power Forecasting with Hierarchical Spatial-Temporal Transformer.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

DenseLight: Efficient Control for Large-scale Traffic Signals with Dense Feedback.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Adversarially Robust Source-free Domain Adaptation with Relaxed Adversarial Training.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Multi-object Video Generation from Single Frame Layouts.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

RankMatch: Fostering Confidence and Consistency in Learning with Noisy Labels.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Real-World Burst Image Super-Resolution: Benchmark and Method.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Enhanced Soft Label for Semi-Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Understanding Self-attention Mechanism via Dynamical System Perspective.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Retrospect to Multi-prompt Learning across Vision and Language.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

HutCRS: Hierarchical User-Interest Tracking for Conversational Recommender System.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Identity-Preserving Talking Face Generation with Landmark and Appearance Priors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Masked Images Are Counterfactual Samples for Robust Fine-Tuning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Being Comes from Not-Being: Open-Vocabulary Text-to-Motion Generation with Wordless Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Scene Graph to Image Synthesis via Knowledge Consensus.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

De-biased Teacher: Rethinking IoU Matching for Semi-supervised Object Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Adapting Object Size Variance and Class Imbalance for Semi-supervised Object Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Actional Atomic-Concept Learning for Demystifying Vision-Language Navigation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Configurable Graph Reasoning for Visual Relationship Detection.
IEEE Trans. Neural Networks Learn. Syst., 2022

Joint Learning of Neural Transfer and Architecture Adaptation for Image Recognition.
IEEE Trans. Neural Networks Learn. Syst., 2022

Knowledge-Routed Visual Question Reasoning: Challenges for Deep Representation Embedding.
IEEE Trans. Neural Networks Learn. Syst., 2022

Structured Attention Network for Referring Image Segmentation.
IEEE Trans. Multim., 2022

Physical-Virtual Collaboration Modeling for Intra- and Inter-Station Metro Ridership Prediction.
IEEE Trans. Intell. Transp. Syst., 2022

TCGL: Temporal Contrastive Graph for Self-Supervised Video Representation Learning.
IEEE Trans. Image Process., 2022

Unifying Relational Sentence Generation and Retrieval for Medical Image Report Composition.
IEEE Trans. Cybern., 2022

A Hamiltonian Monte Carlo Method for Probabilistic Adversarial Attack and Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Graphonomy: Universal Image Parsing via Graph Reasoning and Transfer.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Learning Spatially Variant Linear Representation Models for Joint Filtering.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Cross-Domain Facial Expression Recognition: A Unified Evaluation Benchmark and Adversarial Graph Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Multi-label image recognition with attentive transformer-localizer module.
Multim. Tools Appl., 2022

Fast Spectral Embedded Clustering Based on Structured Graph Learning for Large-Scale Hyperspectral Image.
IEEE Geosci. Remote. Sens. Lett., 2022

Structured graph optimization for joint spectral embedding and clustering.
Neurocomputing, 2022

Causal Reasoning Meets Visual Representation Learning: A Prospective Study.
Int. J. Autom. Comput., 2022

DreamArtist: Towards Controllable One-Shot Text-to-Image Generation via Contrastive Prompt-Tuning.
CoRR, 2022

Category-Adaptive Label Discovery and Noise Rejection for Multi-label Image Recognition with Partial Positive Labels.
CoRR, 2022

Layer-wise Shared Attention Network on Dynamical System Perspective.
CoRR, 2022

OhMG: Zero-shot Open-vocabulary Human Motion Generation.
CoRR, 2022

Accelerating Numerical Solvers for Large-Scale Simulation of Dynamical System via NeurVec.
CoRR, 2022

Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering.
CoRR, 2022

The Lottery Ticket Hypothesis for Self-attention in Convolutional Neural Network.
CoRR, 2022

Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels.
CoRR, 2022

Dual Adversarial Adaptation for Cross-Device Real-World Image Super-Resolution.
CoRR, 2022

Audio-Visual Contrastive Learning for Self-supervised Action Recognition.
CoRR, 2022

Causal Reasoning with Spatial-temporal Representation Learning: A Prospective Study.
CoRR, 2022

Semantic Representation and Dependency Learning for Multi-Label Image Recognition.
CoRR, 2022

Open Set Domain Adaptation By Novel Class Discovery.
CoRR, 2022

Divide and Contrast: Source-free Domain Adaptation via Adaptive Contrastive Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Structure-Preserving 3D Garment Modeling with Neural Sewing Machines.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Privacy Preserving CSI Fingerprint Device-Free Localization.
Proceedings of the Machine Learning for Cyber Security - 4th International Conference, 2022

Double-Check Soft Teacher for Semi-Supervised Object Detection.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Discovering Implicit Classes Achieves Open Set Domain Adaptation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Cross-Domain Action Recognition via Prototypical Graph Alignment.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Multimodal Crowd Counting with Mutual Attention Transformers.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Enhancing Prototypical Few-Shot Learning By Leveraging The Local-Level Strategy.
Proceedings of the IEEE International Conference on Acoustics, 2022

LogicSolver: Towards Interpretable Math Word Problem Solving with Logical Prompt-enhanced Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Adversarially-Aware Robust Object Detector.
Proceedings of the Computer Vision - ECCV 2022, 2022

Continual Object Detection via Prototypical Task Correlation Guided Gating Mechanism.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Dual Adversarial Adaptation for Cross-Device Real-World Image Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Semantic-Aware Auto-Encoders for Self-supervised Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Content-Aware Hierarchical Representation Selection for Cross-View Geo-Localization.
Proceedings of the Computer Vision - ACCV 2022, 2022

Unsupervised Domain Adaptive Salient Object Detection through Uncertainty-Aware Pseudo-Label Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Structured Semantic Transfer for Multi-Label Recognition with Partial Labels.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Graph Reasoning Networks and Applications.
Proceedings of the Neuro-Symbolic Artificial Intelligence: The State of the Art, 2021

Weakly Supervised Person Re-ID: Differentiable Graphical Learning and a New Benchmark.
IEEE Trans. Neural Networks Learn. Syst., 2021

Deductive Reinforcement Learning for Visual Autonomous Urban Driving Navigation.
IEEE Trans. Neural Networks Learn. Syst., 2021

Fine-Grained Image Captioning With Global-Local Discriminative Objective.
IEEE Trans. Multim., 2021

Unsupervised Multi-View Clustering by Squeezing Hybrid Knowledge From Cross View and Each View.
IEEE Trans. Multim., 2021

Dynamic Spatial-Temporal Representation Learning for Traffic Flow Prediction.
IEEE Trans. Intell. Transp. Syst., 2021

GTAE: Graph Transformer-Based Auto-Encoders for Linguistic-Constrained Text Style Transfer.
ACM Trans. Intell. Syst. Technol., 2021

Image Comes Dancing With Collaborative Parsing-Flow Video Synthesis.
IEEE Trans. Image Process., 2021

Semantics-Aware Adaptive Knowledge Distillation for Sensor-to-Vision Action Recognition.
IEEE Trans. Image Process., 2021

Hierarchical Reasoning Network for Human-Object Interaction Detection.
IEEE Trans. Image Process., 2021

Depthwise Nonlocal Module for Fast Salient Object Detection Using a Single Thread.
IEEE Trans. Cybern., 2021

Interpretable Visual Question Answering by Reasoning on Dependency Trees.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Deep CockTail Networks.
Int. J. Comput. Vis., 2021

Instance-level salient object segmentation.
Comput. Vis. Image Underst., 2021

Plug-and-Play Few-shot Object Detection with Meta Strategy and Explicit Localization Inference.
CoRR, 2021

Road Network Guided Fine-Grained Urban Traffic Flow Inference.
CoRR, 2021

Hybrid and dynamic policy gradient optimization for bipedal robot locomotion.
CoRR, 2021

Online Metro Origin-Destination Prediction via Heterogeneous Information Aggregation.
CoRR, 2021

Prototypical Graph Contrastive Learning.
CoRR, 2021

Learning Class-Agnostic Pseudo Mask Generation for Box-Supervised Semantic Segmentation.
CoRR, 2021

Temporal Contrastive Graph for Self-supervised Video Representation Learning.
CoRR, 2021

Rethinking the Pruning Criteria for Convolutional Neural Network.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Explore before Moving: A Feasible Path Estimation and Memory Recalling Framework for Embodied Navigation.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Robust Real-World Image Super-Resolution against Adversarial Attacks.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Weakly-Supervised Spatio-Temporal Anomaly Detection in Surveillance Video.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

AU-Expression Knowledge Constrained Representation Learning for Facial Expression Recognition.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Continuous Transition: Improving Sample Efficiency for Continuous Control Problems via MixUp.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Unifying Dynamic Optimizer Search and Network Architecture Search.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Hierarchical Transformer: Unsupervised Representation Learning for Skeleton-Based Human Action Recognition.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Solving Inefficiency of Self-supervised Representation Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Pi-NAS: Improving Neural Architecture Search by Reducing Supernet Training Consistency Shift.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Trash to Treasure: Harvesting OOD Data with Cross-Modal Matching for Open-Set Semi-Supervised Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Linguistically Routing Capsule Network for Out-of-distribution Visual Question Answering.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Towards Quantifiable Dialogue Coherence Evaluation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Graph-Evolving Meta-Learning for Low-Resource Medical Dialogue Generation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Deductive Learning for Weakly-Supervised 3D Human Pose Estimation via Uncalibrated Cameras.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Learning Semisupervised Multilabel Fully Convolutional Network for Hierarchical Object Parsing.
IEEE Trans. Neural Networks Learn. Syst., 2020

Online Alternate Generator Against Adversarial Attacks.
IEEE Trans. Image Process., 2020

Unifying Temporal Context and Multi-Feature With Update-Pacing Framework for Visual Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2020

DDet: Dual-Path Dynamic Enhancement Network for Real-World Image Super-Resolution.
IEEE Signal Process. Lett., 2020

Obtaining World Coordinate Information of UAV in GNSS Denied Environments.
Sensors, 2020

3D Human Pose Machines with Self-Supervised Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Face Hallucination by Attentive Sequence Optimization with Reinforcement Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Crowd counting via scale-communicative aggregation networks.
Neurocomputing, 2020

Lightweight adversarial network for salient object detection.
Neurocomputing, 2020

REM-Net: Recursive Erasure Memory Network for Commonsense Evidence Refinement.
CoRR, 2020

Semantics-aware Adaptive Knowledge Distillation for Sensor-to-Vision Action Recognition.
CoRR, 2020

Look into Facial Expression Domain Adaptation: Adversarial Graph Learning and A Fair Evaluation Benchmark.
CoRR, 2020

EagleEye: Fast Sub-net Evaluation for Efficient Neural Network Pruning.
CoRR, 2020

Linguistically Driven Graph Capsule Network for Visual Question Reasoning.
CoRR, 2020

Learning Reinforced Agents with Counterfactual Simulation for Medical Automatic Diagnosis.
CoRR, 2020

Depthwise Non-local Module for Fast Salient Object Detection Using a Single Thread.
CoRR, 2020

Physical-Virtual Collaboration Graph Network for Station-Level Metro Ridership Prediction.
CoRR, 2020

Auto-Panoptic: Cooperative Multi-Component Architecture Search for Panoptic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Motion-transformer: self-supervised pre-training for skeleton-based action recognition.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Efficient Crowd Counting via Structured Knowledge Transfer.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Active Object Search.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Adversarial Graph Representation Adaptation for Cross-Domain Facial Expression Recognition.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Grammatically Recognizing Images with Tree Convolution.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Semantically-Aligned Universal Tree-Structured Solver for Math Word Problems.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

GRADE: Automatic Graph-Enhanced Coherence Metric for Evaluating Open-Domain Dialogue Systems.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Collaborative Training Between Region Proposal Localization and Classification for Domain Adaptive Object Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

Component Divide-and-Conquer for Real-World Image Super-Resolution.
Proceedings of the Computer Vision - ECCV 2020, 2020



Bidirectional Graph Reasoning Network for Panoptic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Block-Wisely Supervised Neural Architecture Search With Knowledge Distillation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Transferable, Controllable, and Inconspicuous Adversarial Attacks on Person Re-identification With Deep Mis-Ranking.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Performance Analysis of Blockchain-Based Internet of Vehicles Under the DSRC Architecture.
Proceedings of the Communications and Networking, 2020

An Adversarial Perturbation Oriented Domain Adaptation Approach for Semantic Segmentation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Tree-Structured Policy Based Progressive Reinforcement Learning for Temporally Language Grounding in Video.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Knowledge Graph Transfer Network for Few-Shot Recognition.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Human Centric Visual Analysis with Deep Learning
Springer, ISBN: 978-981-13-2386-7, 2020

2019
Cost-Effective Object Detection: Active Sample Mining With Switchable Selection Criteria.
IEEE Trans. Neural Networks Learn. Syst., 2019

Facial Landmark Machines: A Backbone-Branches Architecture With Progressive Representation Learning.
IEEE Trans. Multim., 2019

Neural Task Planning With AND-OR Graph Representations.
IEEE Trans. Multim., 2019

Contextualized Spatial-Temporal Network for Taxi Origin-Destination Demand Prediction.
IEEE Trans. Intell. Transp. Syst., 2019

SCAN: Self-and-Collaborative Attention Network for Video Person Re-Identification.
IEEE Trans. Image Process., 2019

Cross-Modal Attentional Context Learning for RGB-D Object Detection.
IEEE Trans. Image Process., 2019

Context-Aware Semantic Inpainting.
IEEE Trans. Cybern., 2019

AnalyticDB: Real-time OLAP Database System at Alibaba Cloud.
Proc. VLDB Endow., 2019

Progressively diffused networks for semantic visual parsing.
Pattern Recognit., 2019

Learning Support Correlation Filters for Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Hierarchical Scene Parsing by Weakly Supervised Learning with Image Descriptions.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Look into Person: Joint Body Parsing & Pose Estimation Network and a New Benchmark.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Visual Tracking via Dynamic Graph Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Blockwisely Supervised Neural Architecture Search with Knowledge Distillation.
CoRR, 2019

Explainable High-order Visual Question Reasoning: A New Benchmark and Knowledge-routed Network.
CoRR, 2019

ACFM: A Dynamic Spatial-Temporal Network for Traffic Prediction.
CoRR, 2019

Learning Compact Target-Oriented Feature Representations for Visual Tracking.
CoRR, 2019

Weakly Supervised Person Re-identification: Cost-effective Learning with A New Benchmark.
CoRR, 2019

Weakly-Supervised Discovery of Geometry-Aware Representation for 3D Human Pose Estimation.
CoRR, 2019

Perceptual Image Enhancement by Relativistic Discriminant Learning With Cross-Scale Aggregated Representation.
IEEE Access, 2019

Attention Embedded Spatio-Temporal Network for Video Salient Object Detection.
IEEE Access, 2019

Simultaneous Lung Field Detection and Segmentation for Pediatric Chest Radiographs.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Lightweight Contrast Modeling for Attention-Aware Visual Localization.
Proceedings of the International Conference on Robotics and Automation, 2019

Multivariate-Information Adversarial Ensemble for Scalable Joint Distribution Matching.
Proceedings of the 36th International Conference on Machine Learning, 2019

Concrete Image Captioning by Integrating Content Sensitive and Global Discriminative Objective.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Crowd Counting via Multi-view Scale Aggregation Networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Taxi Origin-Destination Demand Prediction with Contextualized Spatial-Temporal Network.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

SNAS: stochastic neural architecture search.
Proceedings of the 7th International Conference on Learning Representations, 2019

NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning.
Proceedings of the 7th International Conference on Learning Representations, 2019

Few-Shot Structured Domain Adaptation for Virtual-to-Real Scene Parsing.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Semi-Supervised Video Salient Object Detection Using Pseudo-Labels.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Meta R-CNN: Towards General Solver for Instance-Level Low-Shot Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Larger Norm More Transferable: An Adaptive Feature Norm Approach for Unsupervised Domain Adaptation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Crowd Counting With Deep Structured Scale Integration Network.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Layout-Graph Reasoning for Fashion Landmark Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Reasoning-RCNN: Unifying Adaptive Global Reasoning Into Large-Scale Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Adaptively Connected Neural Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Spatially Variant Linear Representation Models for Joint Filtering.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019


Graphonomy: Universal Human Parsing via Graph Transfer Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Blending-Target Domain Adaptation by Adversarial Meta-Adaptation Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Knowledge-Embedded Routing Network for Scene Graph Generation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

ClusterNet: Deep Hierarchical Cluster Network With Rigorously Rotation-Invariant Representation for Point Cloud Analysis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Weakly-Supervised Discovery of Geometry-Aware Representation for 3D Human Pose Estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019


CamDrop: A New Explanation of Dropout and A Guided Regularization Method for Deep Neural Networks.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

End-to-End Knowledge-Routed Relational Dialogue System for Automatic Diagnosis.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Semantic Relationships Guided Representation Learning for Facial Action Unit Recognition.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

FRAME Revisited: An Interpretation View Based on Particle Evolution.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
F-SVM: Combination of Feature Transformation and SVM Learning via Convex Relaxation.
IEEE Trans. Neural Networks Learn. Syst., 2018

High-Precision Camera Localization in Scenes with Repetitive Patterns.
ACM Trans. Intell. Syst. Technol., 2018

Learning to Segment Object Candidates via Recursive Neural Networks.
IEEE Trans. Image Process., 2018

Guest Editorial Introduction to the Special Issue on Large Scale and Nonlinear Similarity Learning for Intelligent Video Analysis.
IEEE Trans. Circuits Syst. Video Technol., 2018

Deep Co-Space: Sample Mining Across Feature Transformation for Semi-Supervised Learning.
IEEE Trans. Circuits Syst. Video Technol., 2018

Active Self-Paced Learning for Cost-Effective and Progressive Face Identification.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Proposal-Free Network for Instance-Level Object Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Learning deep representations for semantic image parsing: a comprehensive overview.
Frontiers Comput. Sci., 2018

FANet: Quality-Aware Feature Aggregation Network for RGB-T Tracking.
CoRR, 2018

Unsupervised Domain Adaptation: An Adaptive Feature Norm Approach.
CoRR, 2018

Toward Characteristic-Preserving Image-based Virtual Try-On Network.
CoRR, 2018

Batch Kalman Normalization: Towards Training Deep Neural Networks with Micro-Batches.
CoRR, 2018

Structured Inhomogeneous Density Map Learning for Crowd Counting.
CoRR, 2018

Kalman Normalization: Normalizing Internal Representations Across Network Layers.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Symbolic Graph Reasoning Meets Convolutions.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Hybrid Knowledge Routed Modules for Large-scale Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Adaptive Temporal Encoding Network for Video Instance-level Human Parsing.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Structured Deep Learning for Pixel-level Understanding.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Attentive Crowd Flow Machines.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Non-locally Enhanced Encoder-Decoder Network for Single Image De-raining.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Fine-Grained Representation Learning and Recognition by Exploiting Hierarchical Semantic Embedding.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Embedding Temporally Consistent Depth Recovery for Real-time Dense Mapping in Visual-inertial Odometry.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Convolutional Memory Blocks for Depth Data Representation Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Deep Reasoning with Knowledge Graph for Social Relationship Understanding.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

DRPose3D: Depth Ranking in 3D Human Pose Estimation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Crowd Counting using Deep Recurrent Spatial-Aware Network.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Knowledge-Embedded Representation Learning for Fine-Grained Image Recognition.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Avoidance of High-Speed Obstacles Based on Velocity Obstacles.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Fusing Object Context to Detect Functional Area for Cognitive Robots.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Robust Object-Aware Sample Consensus with Application to Lidar Odometry.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Toward Characteristic-Preserving Image-Based Virtual Try-On Network.
Proceedings of the Computer Vision - ECCV 2018, 2018

Generative Semantic Manipulation with Mask-Contrasting GAN.
Proceedings of the Computer Vision - ECCV 2018, 2018

Learning Warped Guidance for Blind Face Restoration.
Proceedings of the Computer Vision - ECCV 2018, 2018


Instance-Level Human Parsing via Part Grouping Network.
Proceedings of the Computer Vision - ECCV 2018, 2018

Monocular Depth Estimation with Affinity, Vertical Pooling, and Label Enhancement.
Proceedings of the Computer Vision - ECCV 2018, 2018

Unsupervised Image Super-Resolution Using Cycle-in-Cycle Generative Adversarial Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Deep Cocktail Network: Multi-Source Unsupervised Domain Adaptation With Category Shift.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Interpretable Video Captioning via Trajectory Structured Localization.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Towards Human-Machine Cooperation: Self-Supervised Sample Mining for Object Detection.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

LSTM Pose Machines.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Single View Stereo Matching.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multi-Level Wavelet-CNN for Image Restoration.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Flow Guided Recurrent Neural Encoder for Video Salient Object Detection.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Visual Question Reasoning on General Dependency Tree.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Facial Landmark Localization in the Wild by Backbone-Branches Representation Learning.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Weakly Supervised Salient Object Detection Using Image Labels.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Recurrent Attentional Reinforcement Learning for Multi-Label Image Recognition.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Learning a Wavelet-Like Auto-Encoder to Accelerate Deep Neural Networks.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Saliency Detection on Light Field: A Multi-Cue Approach.
ACM Trans. Multim. Comput. Commun. Appl., 2017

Structure-Preserving Image Super-Resolution via Contextualized Multitask Learning.
IEEE Trans. Multim., 2017

Distance Metric Learning via Iterated Support Vector Machines.
IEEE Trans. Image Process., 2017

Content-Adaptive Sketch Portrait Generation by Decompositional Representation Learning.
IEEE Trans. Image Process., 2017

Cost-Effective Active Learning for Deep Image Classification.
IEEE Trans. Circuits Syst. Video Technol., 2017

Weighted Low-Rank Decomposition for Robust Grayscale-Thermal Foreground Detection.
IEEE Trans. Circuits Syst. Video Technol., 2017

Cross-Domain Visual Matching via Generalized Similarity Measure and Feature Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Human Parsing with Contextualized Convolutional Neural Network.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Learning to Segment Human by Watching YouTube.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Context-Aware Semantic Inpainting.
CoRR, 2017

Learning a Wavelet-like Auto-Encoder to Accelerate Deep Neural Networks.
CoRR, 2017

Visual Tracking via Learning Dynamic Patch-based Graph Representation.
CoRR, 2017

Scene Parsing by Weakly Supervised Learning with Image Descriptions.
CoRR, 2017

Progressively Diffused Networks for Semantic Image Segmentation.
CoRR, 2017

Structure-Preserving Image Super-resolution via Contextualized Multi-task Learning.
CoRR, 2017

Look into Person: Self-supervised Structure-sensitive Learning and A New Benchmark for Human Parsing.
CoRR, 2017

Inversely Proportional Carrier Sense Threshold and Transmit Power Setting Towards Green WLANs.
Proceedings of the 86th IEEE Vehicular Technology Conference, 2017

Place-centric Visual Urban Perception with Deep Multi-instance Regression.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Decentralized navigation of multiple agents based on ORCA and model predictive control.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Knowledge-guided recurrent neural network learning for task-oriented action prediction.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Multi-label Image Recognition by Recurrently Discovering Attentional Regions.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Deep Dual Learning for Semantic Image Segmentation.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Joint Detection and Identification Feature Learning for Person Search.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Learning Object Interactions and Descriptions for Semantic Image Segmentation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017


Recurrent 3D Pose Sequence Machines.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Interpretable Structure-Evolving LSTM.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Instance-Level Salient Object Segmentation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Look into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Attention-Aware Face Hallucination via Deep Reinforcement Learning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Face Recognition by Coarse-to-Fine Landmark Regression with Application to ATM Surveillance.
Proceedings of the Computer Vision - Second CCF Chinese Conference, 2017

Learning Patch-Based Dynamic Graph for Visual Tracking.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Learning Compositional Shape Models of Multiple Distance Metrics by Information Projection.
IEEE Trans. Neural Networks Learn. Syst., 2016

DISC: Deep Image Saliency Computing via Progressive Representation Learning.
IEEE Trans. Neural Networks Learn. Syst., 2016

Clothes Co-Parsing Via Joint Image Segmentation and Labeling With Application to Clothing Retrieval.
IEEE Trans. Multim., 2016

Recognizing Focal Liver Lesions in CEUS With Dynamically Trained Latent Structured Models.
IEEE Trans. Medical Imaging, 2016

An Approach to Streaming Video Segmentation With Sub-Optimal Low-Rank Decomposition.
IEEE Trans. Image Process., 2016

Learning Collaborative Sparse Representation for Grayscale-Thermal Tracking.
IEEE Trans. Image Process., 2016

Inference With Collaborative Model for Interactive Tumor Segmentation in Medical Image Sequences.
IEEE Trans. Cybern., 2016

Detection-Free Multiobject Tracking by Reconfigurable Inference With Bundle Representations.
IEEE Trans. Cybern., 2016

Compositional models and Structured learning for visual recognition.
Pattern Recognit., 2016

Deep Learning for Remote Sensing Image Understanding.
J. Sensors, 2016

Deep Boosting: Joint feature selection and analysis dictionary learning in hierarchy.
Neurocomputing, 2016

Special issue on Chinese Conference on Computer Vision 2015.
Neurocomputing, 2016

Parallel nonparametric binarization for degraded document images.
Neurocomputing, 2016

Discovering similar Chinese characters in online handwriting with deep convolutional neural networks.
Int. J. Document Anal. Recognit., 2016

A Deep Structured Model with Radius-Margin Bound for 3D Human Activity Recognition.
Int. J. Comput. Vis., 2016

End-to-End Deep Learning for Person Search.
CoRR, 2016

RGB-D Scene Labeling with Long Short-Term Memorized Fusion Model.
CoRR, 2016

Learning to Segment Object Proposals via Recursive Neural Networks.
CoRR, 2016

High-level representation sketch for video event retrieval.
Sci. China Inf. Sci., 2016

Human Pose Estimation from Depth Images via Inference Embedded Multi-task Learning.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Geometric Scene Parsing with Hierarchical LSTM.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

A Stochastic Image Grammar for Fine-Grained 3D Scene Reconstruction.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Learning a lightweight deep convolutional network for joint age and gender recognition.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Local- and holistic-structure preserving image super resolution via deep joint component learning.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Character proposal network for robust text extraction.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Is Faster R-CNN Doing Well for Pedestrian Detection?
Proceedings of the Computer Vision - ECCV 2016, 2016

Semantic Object Parsing with Graph LSTM.
Proceedings of the Computer Vision - ECCV 2016, 2016

LSTM-CF: Unifying Context Modeling and Fusion with LSTMs for RGB-D Scene Labeling.
Proceedings of the Computer Vision - ECCV 2016, 2016

Joint Learning of Single-Image and Cross-Image Representations for Person Re-identification.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Dictionary Pair Classifier Driven Convolutional Neural Networks for Object Detection.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Deep Structured Scene Parsing by Learning with Image Descriptions.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Reversible Recursive Instance-Level Object Segmentation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Semantic Object Parsing with Local-Global Long Short-Term Memory.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Boosting Zero-Shot Image Classification via Pairwise Relationship Learning.
Proceedings of the Computer Vision - ACCV 2016, 2016

DARI: Distance Metric and Representation Integration for Person Verification.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Fashion Parsing With Video Context.
IEEE Trans. Multim., 2015

Bit-Scalable Deep Hashing With Regularized Similarity Learning for Image Retrieval and Person Re-Identification.
IEEE Trans. Image Process., 2015

PISA: Pixelwise Image Saliency by Aggregating Complementary Appearance Contrast Measures With Edge-Preserving Coherence.
IEEE Trans. Image Process., 2015

Hierarchical Ensemble of Background Models for PTZ-Based Video Surveillance.
IEEE Trans. Cybern., 2015

Adaptive Scene Category Discovery With Generative Learning and Compositional Sampling.
IEEE Trans. Circuits Syst. Video Technol., 2015

Deep feature learning with relative distance comparison for person re-identification.
Pattern Recognit., 2015

Discriminatively Trained And-Or Graph Models for Object Shape Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Deep Human Parsing with Active Template Regression.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Towards a solid solution of real-time fire and flame detection.
Multim. Tools Appl., 2015

Kernel sparse representation for time series classification.
Inf. Sci., 2015

Data-Driven Scene Understanding with Adaptively Retrieved Exemplars.
IEEE Multim., 2015

Iterated Support Vector Machines for Distance Metric Learning.
CoRR, 2015

F-SVM: Combination of Feature Transformation and SVM Learning via Convex Relaxation.
CoRR, 2015

Human-Centric Images and Videos Analysis.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

End-to-End Photo-Sketch Generation via Fully Convolutional Representation Learning.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Human Parsing with Contextualized Convolutional Neural Network.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Towards Computational Baby Learning: A Weakly-Supervised Approach for Object Detection.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Discriminative learning of iteration-wise priors for blind deconvolution.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Matching-CNN meets KNN: Quasi-parametric human parsing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

SOLD: Sub-optimal low-rank decomposition for efficient video segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

A Deep Joint Learning Approach for Age Invariant Face Verification.
Proceedings of the Computer Vision - CCF Chinese Conference, 2015

2014
Complex Background Subtraction by Pursuing Dynamic Spatio-Temporal Models.
IEEE Trans. Image Process., 2014

Robust Feature Point Matching With Sparse Model.
IEEE Trans. Image Process., 2014

Salient object detection based on regions.
Multim. Tools Appl., 2014

Computational Baby Learning.
CoRR, 2014

sapFinder: an R/Bioconductor package for detection of variant peptides in shotgun proteomics experiments.
Bioinform., 2014

Deep Joint Task Learning for Generic Object Extraction.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Person Search in a Scene by Jointly Modeling People Commonness and Person Uniqueness.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

3D Human Activity Recognition with Reconfigurable Convolutional Neural Networks.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Fashion Parsing with Video Context.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Recognizing focal liver lesions in contrast-enhanced ultrasound with discriminatively trained spatio-temporal model.
Proceedings of the IEEE 11th International Symposium on Biomedical Imaging, 2014

Deep boosting: Layered feature mining for general image classification.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Data-driven scene understanding by adaptive exemplar retrieval.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

An expressive deep model for human action parsing from a single image.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Clothing Co-parsing by Joint Image Segmentation and Labeling.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Discovering Video Shot Categories by Unsupervised Stochastic Graph Partition.
IEEE Trans. Multim., 2013

Video Stylization: Painterly Rendering and Optimization With Content Extraction.
IEEE Trans. Circuits Syst. Video Technol., 2013

Sparse Learning-to-Rank via an Efficient Primal-Dual Algorithm.
IEEE Trans. Computers, 2013

Learning latent spatio-temporal compositional model for human action recognition.
Proceedings of the ACM Multimedia Conference, 2013

Human Re-identification by Matching Compositional Template with Cluster Sampling.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Correntropy Induced L2 Graph for Robust Subspace Clustering.
Proceedings of the IEEE International Conference on Computer Vision, 2013

SYM-FISH: A Symmetry-Aware Flip Invariant Sketch Histogram Shape Descriptor.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Integrating multi-stage depth-induced contextual information for human action recognition and localization.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Incorporating Structural Alternatives and Sharing into Hierarchy for Multiclass Object Recognition and Detection.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

PISA: Pixelwise Image Saliency by Aggregating Complementary Appearance Contrast Measures with Spatial Priors.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Robust Region Grouping via Internal Patch Statistics.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Integrating Graph Partitioning and Matching for Trajectory Analysis in Video Surveillance.
IEEE Trans. Image Process., 2012

Object categorization with sketch representation and generalized samples.
Pattern Recognit., 2012

Representing and recognizing objects with massive local image patches.
Pattern Recognit., 2012

On the zeroth-order general Randic index of cacti.
Ars Comb., 2012

Dynamical And-Or Graph Learning for Object Shape Modeling and Detection.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Joint semantic segmentation by searching for compatible-competitive references.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Robust stroke-based video animation via layered motion and correspondence.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Realtime object-of-interest tracking by learning Composite Patch-based Templates.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Object-Layout-Aware Image Retrieval for Personal Album Management.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Cross-based local multipoint filtering.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Learning contour-fragment-based shape model with And-Or tree representation.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
High-Resolution Face Fusion for Gender Conversion.
IEEE Trans. Syst. Man Cybern. Part A, 2011

Integrating Spatio-Temporal Context With Multiview Representation for Object Recognition in Visual Surveillance.
IEEE Trans. Circuits Syst. Video Technol., 2011

Adaptive Object Tracking by Learning Hybrid Template Online.
IEEE Trans. Circuits Syst. Video Technol., 2011

Tenzing A SQL Implementation On The MapReduce Framework.
Proc. VLDB Endow., 2011

Color style transfer by constraint locally linear embedding.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Interactive CT image segmentation with online discriminative learning.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Segment an image by looking into an image corpus.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
I2T: Image Parsing to Text Description.
Proc. IEEE, 2010

Layered Graph Matching with Composite Cluster Sampling.
IEEE Trans. Pattern Anal. Mach. Intell., 2010

Skeletonization with Particle Filters.
Int. J. Pattern Recognit. Artif. Intell., 2010

Painterly animation using video semantics and feature correspondence.
Proceedings of the 8th International Symposium on Non-Photorealistic Animation and Rendering, 2010

Tracking Objects with Adaptive Feature Patches for PTZ Camera Visual Surveillance.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

A Discriminative Model for Object Representation and Detection via Sparse Features.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Semantics-driven portrait cartoon stylization.
Proceedings of the International Conference on Image Processing, 2010

Learning Shape Detector by Quantizing Curve Segments with Multiple Distance Metrics.
Proceedings of the Computer Vision, 2010

2009
Semantic event representation and recognition using syntactic attribute graph grammar.
Pattern Recognit. Lett., 2009

A stochastic graph grammar for compositional object representation and recognition.
Pattern Recognit., 2009

Marker-less registration based on template tracking for augmented reality.
Multim. Tools Appl., 2009

Stochastic Programming Models and Hybrid Intelligent Algorithm for Unbalanced Bidding Problem.
Comput. Inf. Sci., 2009

Interactive rotoscoping: Extracting and tracking object sketch.
Proceedings of the International Conference on Image Processing, 2009

Hierarchical 3D perception from a single image.
Proceedings of the International Conference on Image Processing, 2009

Accurate semantic image labeling by fast Geodesic Propagation.
Proceedings of the International Conference on Image Processing, 2009

Trajectory parsing by cluster sampling in spatio-temporal graph.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Layered graph matching by composite cluster sampling with collaborative and competitive interactions.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008
Spatio-temporal patches for night background modeling by subspace learning.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

An empirical study of facial components classification by integrating dimensionality reduction and clustering.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Layered shape matching and registration: Stochastic sampling with hierarchical graph representation.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Object-of-interest extraction by integrating stochastic inference with learnt active shape sketch.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

2007
An Empirical Study of Object Category Recognition: Sequential Testing with Generalized Samples.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Fuzzy Fixed Charge Solid Transportation Problem and Its Algorithm.
Proceedings of the Fourth International Conference on Fuzzy Systems and Knowledge Discovery, 2007

Object Category Recognition Using Generative Template Boosting.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2007

Layered Graph Match with Graph Editing.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
Key Issues for AR-Based Digital Reconstruction of Yuanmingyuan Garden.
Presence Teleoperators Virtual Environ., 2006


  Loading...