Xi Li

Orcid: 0000-0003-3023-1662

Affiliations:
  • Zhejiang University, College of Computer Science and Technology, China
  • University of Adelaide, Australian Centre for Visual Technology, SA, Australia (2010 - 2014)
  • CNRS Telecom ParisTech, France (2009 - 2010)
  • Chinese Academy of Sciences, Institute of Automation, National Laboratory of Pattern Recognition, Beijing, China (PhD 2009)


According to our database, Xi Li authored at least 257 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Temporal-Frequency Attention Focusing for Time Series Extrinsic Regression via Auxiliary Task.
IEEE Trans. Neural Networks Learn. Syst., November, 2024

Multimodal pre-train then transfer learning approach for speaker recognition.
Multim. Tools Appl., September, 2024

Unsupervised Domain Adaptation With Class-Aware Memory Alignment.
IEEE Trans. Neural Networks Learn. Syst., July, 2024

Ultra Fast Deep Lane Detection With Hybrid Anchor Driven Ordinal Classification.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

MgSvF: Multi-Grained Slow versus Fast Framework for Few-Shot Class-Incremental Learning.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

Epoch-Evolving Gaussian Process Guided Learning for Classification.
IEEE Trans. Neural Networks Learn. Syst., January, 2024

Self-Paced Multi-Grained Cross-Modal Interaction Modeling for Referring Expression Comprehension.
IEEE Trans. Image Process., 2024

IDNet: Information Decomposition Network for Fast Panoptic Segmentation.
IEEE Trans. Image Process., 2024

MLMG-SGG: Multilabel Scene Graph Generation With Multigrained Features.
IEEE Trans. Image Process., 2024

GaitMPL: Gait Recognition With Memory-Augmented Progressive Learning.
IEEE Trans. Image Process., 2024

Reading order detection in visually-rich documents with multi-modal layout-aware relation prediction.
Pattern Recognit., 2024

CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities.
CoRR, 2024

CLASH: Complementary Learning with Neural Architecture Search for Gait Recognition.
CoRR, 2024

GVDIFF: Grounded Text-to-Video Generation with Diffusion Models.
CoRR, 2024

SemanticMIM: Marring Masked Image Modeling with Semantics Compression for General Visual Representation.
CoRR, 2024

CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation.
CoRR, 2024

SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model.
CoRR, 2024

BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-Based Roadside 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ScanFormer: Referring Expression Comprehension by Iteratively Scanning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Elastic Knowledge Distillation by Learning From Recollection.
IEEE Trans. Neural Networks Learn. Syst., May, 2023

Uncertainty-Aware Scene Graph Generation.
Pattern Recognit. Lett., March, 2023

Forgery face detection via adaptive learning from multiple experts.
Neurocomputing, March, 2023

A Large-Scale Synthetic Gait Dataset Towards in-the-Wild Simulation and Comparison Study.
ACM Trans. Multim. Comput. Commun. Appl., January, 2023

Adaptive cooperative exploration for reinforcement learning from imperfect demonstrations.
Pattern Recognit. Lett., January, 2023

D³T-GAN: Data-Dependent Domain Transfer GANs for Image Generation with Limited Data.
ACM Trans. Multim. Comput. Commun. Appl., 2023

A Review on Methods and Applications in Multimodal Deep Learning.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Fuzzy Semantics for Arbitrary-Shaped Scene Text Detection.
IEEE Trans. Image Process., 2023

TextFusion: Unveiling the Power of Textual Semantics for Controllable Image Fusion.
CoRR, 2023

Bridging Cross-task Protocol Inconsistency for Distillation in Dense Object Detection.
CoRR, 2023

HeightFormer: Explicit Height Modeling without Extra Data for Camera-only 3D Object Detection in Bird's Eye View.
CoRR, 2023

FusionBooster: A Unified Image Fusion Boosting Paradigm.
CoRR, 2023

Personalized Behavior-Aware Transformer for Multi-Behavior Sequential Recommendation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Finding Cycles in Graph: A Unified Approach for Various NER Tasks.
Proceedings of the International Joint Conference on Neural Networks, 2023

DenseDINO: Boosting Dense Self-Supervised Learning with Token-Based Point-Level Consistency.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Bridging Cross-task Protocol Inconsistency for Distillation in Dense Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

UniFusion: Unified Multi-view Fusion Transformer for Spatial-Temporal Representation in Bird's-Eye-View.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

LayoutDiffusion: Controllable Diffusion Model for Layout-to-Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Language Adaptive Weight Generation for Multi-Task Visual Grounding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Action Probability Calibration for Efficient Naturalistic Driving Action Localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

RWSC-Fusion: Region-Wise Style-Controlled Fusion Network for the Prohibited X-ray Security Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

GaitGCI: Generative Counterfactual Intervention for Gait Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Referring Expression Comprehension Using Language Adaptive Inference.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
What and Where: Learn to Plug Adapters via NAS for Multidomain Learning.
IEEE Trans. Neural Networks Learn. Syst., 2022

Memory-Efficient Class-Incremental Learning for Image Classification.
IEEE Trans. Neural Networks Learn. Syst., 2022

Global Context Assisted Structure-Aware Vehicle Retrieval.
IEEE Trans. Intell. Transp. Syst., 2022

Bias-Eliminated Semantic Refinement for Any-Shot Learning.
IEEE Trans. Image Process., 2022

Progressive Multistage Learning for Discriminative Tracking.
IEEE Trans. Cybern., 2022

Forgery-Domain-Supervised Deepfake Detection With Non-Negative Constraint.
IEEE Signal Process. Lett., 2022

AFD-StackGAN: Automatic Mask Generation Network for Face De-Occlusion Using StackGAN.
Sensors, 2022

Reparameterized attention for convolutional neural networks.
Pattern Recognit. Lett., 2022

Memory-efficient distribution-guided experience sampling for policy consolidation.
Pattern Recognit. Lett., 2022

Unified curiosity-Driven learning with smoothed intrinsic reward estimation.
Pattern Recognit., 2022

CoDiNet: Path Distribution Modeling With Consistency and Diversity for Dynamic Routing.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

TapLab: A Fast Framework for Semantic Video Segmentation Tapping Into Compressed-Domain Knowledge.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Attend and select: A segment selective transformer for microblog hashtag generation.
Knowl. Based Syst., 2022

Structure-conditioned adversarial learning for unsupervised domain adaptation.
Neurocomputing, 2022

PcmNet: Position-sensitive context modeling network for temporal action localization.
Neurocomputing, 2022

Learnable Depth-Sensitive Attention for Deep RGB-D Saliency Detection with Multi-modal Fusion Architecture Search.
Int. J. Comput. Vis., 2022

A Survey on Generative Adversarial Networks: Variants, Applications, and Training.
ACM Comput. Surv., 2022

Adaptive Edge-to-Edge Interaction Learning for Point Cloud Analysis.
CoRR, 2022

UniFormer: Unified Multi-view Fusion Transformer for Spatial-Temporal Representation in Bird's-Eye-View.
CoRR, 2022

F3A-GAN: Facial Flow for Face Animation with Generative Adversarial Networks.
CoRR, 2022

D3T-GAN: Data-Dependent Domain Transfer GANs for Few-shot Image Generation.
CoRR, 2022

Referring Expression Comprehension via Cross-Level Multi-Modal Fusion.
CoRR, 2022

A Review on Methods and Applications in Multimodal Deep Learning.
CoRR, 2022

Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image Generation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

End-to-End Compound Table Understanding with Multi-Modal Modeling.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Explicitly Modeling Importance and Coherence for Timeline Summarization.
Proceedings of the IEEE International Conference on Acoustics, 2022

Entropy-Driven Sampling and Training Scheme for Conditional Diffusion Generation.
Proceedings of the Computer Vision - ECCV 2022, 2022

RBC: Rectifying the Biased Context in Continual Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Saliency Hierarchy Modeling via Generative Kernels for Salient Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Adaptive Cross-domain Learning for Generalizable Person Re-identification.
Proceedings of the Computer Vision - ECCV 2022, 2022

SP-Net: Slowly Progressing Dynamic Inference Networks.
Proceedings of the Computer Vision - ECCV 2022, 2022

The Tenth Visual Object Tracking VOT2022 Challenge Results.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

MetaGait: Learning to Learn an Omni Sample Adaptive Representation for Gait Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting.
Proceedings of the Computer Vision - ECCV 2022, 2022

MonoGround: Detecting Monocular 3D Objects from the Ground.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
End-to-End Video Saliency Detection via a Deep Contextual Spatiotemporal Network.
IEEE Trans. Neural Networks Learn. Syst., 2021

Deep Attentive Video Summarization With Distribution Consistency Learning.
IEEE Trans. Neural Networks Learn. Syst., 2021

F³A-GAN: Facial Flow for Face Animation With Generative Adversarial Networks.
IEEE Trans. Image Process., 2021

Condition-Aware Comparison Scheme for Gait Recognition.
IEEE Trans. Image Process., 2021

Efficient Style-Corpus Constrained Learning for Photorealistic Style Transfer.
IEEE Trans. Image Process., 2021

Multitask Non-Autoregressive Model for Human Motion Prediction.
IEEE Trans. Image Process., 2021

ResKD: Residual-Guided Knowledge Distillation.
IEEE Trans. Image Process., 2021

Multitask Identity-Aware Image Steganography via Minimax Optimization.
IEEE Trans. Image Process., 2021

Efficient Person Search via Expert-Guided Knowledge Distillation.
IEEE Trans. Cybern., 2021

Progressive Class-Based Expansion Learning for Image Classification.
IEEE Signal Process. Lett., 2021

Anytime Recognition with Routing Convolutional Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

FD-StackGAN: Face De-occlusion Using Stacked Generative Adversarial Networks.
KSII Trans. Internet Inf. Syst., 2021

Real-Time Semantic Segmentation via Auto Depth, Downsampling Joint Decision and Feature Aggregation.
Int. J. Comput. Vis., 2021

Event prediction based on evolutionary event ontology knowledge.
Future Gener. Comput. Syst., 2021

When Video Classification Meets Incremental Classes.
CoRR, 2021

Attend and Select: A Segment Attention based Selection Mechanism for Microblog Hashtag Generation.
CoRR, 2021

Automated Timeline Length Selection for Flexible Timeline Summarization.
CoRR, 2021

Recent Advances and Trends in Multimodal Deep Learning: A Review.
CoRR, 2021

Unsupervised Domain Adaptation for Image Classification via Structure-Conditioned Adversarial Learning.
CoRR, 2021

VersatileGait: A Large-Scale Synthetic Gait Dataset with Fine-Grained Attributes and Complicated Scenarios.
CoRR, 2021

When Video Classification Meets Incremental Classes.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

MGH: Metadata Guided Hypergraph Modeling for Unsupervised Person Re-identification.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

RDI-Net: Relational Dynamic Inference Networks.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

FcaNet: Frequency Channel Attention Networks.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Deep RGB-D Saliency Detection With Depth-Sensitive Attention and Automatic Multi-Modal Fusion.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Learning Multi-Level Density Maps for Crowd Counting.
IEEE Trans. Neural Networks Learn. Syst., 2020

Adaptive Graph Representation Learning for Video Person Re-Identification.
IEEE Trans. Image Process., 2020

Context-Aware Graph Label Propagation Network for Saliency Detection.
IEEE Trans. Image Process., 2020

Semantic Neighborhood-Aware Deep Facial Expression Recognition.
IEEE Trans. Image Process., 2020

Context-Aware Deep Spatiotemporal Network for Hand Pose Estimation From Depth Images.
IEEE Trans. Cybern., 2020

Human-Centric Clothing Segmentation via Deformable Semantic Locality-Preserving Network.
IEEE Trans. Circuits Syst. Video Technol., 2020

Towards a new generation of artificial intelligence in China.
Nat. Mach. Intell., 2020

Learning to Generate Content-Aware Dynamic Detectors.
CoRR, 2020

What and Where: Learn to Plug Adapters via NAS for Multi-Domain Learning.
CoRR, 2020

Few-Shot Class-Incremental Learning via Feature Space Composition.
CoRR, 2020

Epoch-evolving Gaussian Process Guided Learning.
CoRR, 2020

ResKD: Residual-Guided Knowledge Distillation.
CoRR, 2020

Dynamic Routing with Path Diversity and Consistency for Compact Network Learning.
CoRR, 2020

How to Train Your Dragon: Tamed Warping Network for Semantic Video Segmentation.
CoRR, 2020

Progressive Multi-Stage Learning for Discriminative Tracking.
CoRR, 2020

Real-Time Driving Scene Semantic Segmentation.
IEEE Access, 2020

TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Stacked Pooling for Boosting Scale Invariance of Crowd Counting.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Multi-Way Multi-View Deep Autoencoder for Image Feature Learning with Multi-Level Graph Regularization.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Ultra Fast Structure-Aware Deep Lane Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

Graph-Guided Architecture Search for Real-Time Semantic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

BANet: Bidirectional Aggregation Network With Occlusion Handling for Panoptic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Editorial: Booming of Neural Networks and Learning Systems.
IEEE Trans. Neural Networks Learn. Syst., 2019

Stylized Aesthetic QR Code.
IEEE Trans. Multim., 2019

Deep Q Learning Driven CT Pancreas Segmentation With Geometry-Aware U-Net.
IEEE Trans. Medical Imaging, 2019

Deep Group-Wise Fully Convolutional Network for Co-Saliency Detection With Graph Propagation.
IEEE Trans. Image Process., 2019

User-Ranking Video Summarization With Multi-Stage Spatio-Temporal Representation.
IEEE Trans. Image Process., 2019

A Bilinear Ranking SVM for Knowledge Based Relation Prediction and Classification.
IEEE Trans. Big Data, 2019

Multi-Task Structure-Aware Context Modeling for Robust Keypoint-Based Object Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

State Distribution-Aware Sampling for Deep Q-Learning.
Neural Process. Lett., 2019

Adaptive Graph Representation Learning for Video Person Re-identification.
CoRR, 2019

OVSNet: Towards One-Pass Real-Time Video Object Segmentation.
CoRR, 2019

GroundNet: Monocular Ground Plane Normal Estimation with Geometric Consistency.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Learning a Key-Value Memory Co-Attention Matching Network for Person Re-Identification.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Spatio-Temporal Graph Routing for Skeleton-Based Action Recognition.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Identifying Objective and Subjective Words via Topic Modeling.
IEEE Trans. Neural Networks Learn. Syst., 2018

Weakly Supervised Object Detection via Object-Specific Pixel Gradient.
IEEE Trans. Neural Networks Learn. Syst., 2018

Deep Air Learning: Interpolation, Prediction, and Feature Analysis of Fine-Grained Air Quality.
IEEE Trans. Knowl. Data Eng., 2018

Deep Context-Sensitive Facial Landmark Detection With Tree-Structured Modeling.
IEEE Trans. Image Process., 2018

Body Structure Aware Deep Crowd Counting.
IEEE Trans. Image Process., 2018

Transductive Zero-Shot Learning With a Self-Training Dictionary Approach.
IEEE Trans. Cybern., 2018

Multimodal Deep Embedding via Hierarchical Grounded Compositional Semantics.
IEEE Trans. Circuits Syst. Video Technol., 2018

Perceiving Physical Equation by Observing Visual Scenarios.
CoRR, 2018

Context-Aware Deep Spatio-Temporal Network for Hand Pose Estimation from Depth Images.
CoRR, 2018

Stacked Pooling: Improving Crowd Counting by Boosting Scale Invariance.
CoRR, 2018

Stylize Aesthetic QR Code.
CoRR, 2018

GNAS: A Greedy Neural Architecture Search Method for Multi-Attribute Learning.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Deep Convolutional Neural Networks with Merge-and-Run Mappings.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Progressive Blockwise Knowledge Distillation for Neural Network Acceleration.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Semantic Locality-Aware Deformable Network for Clothing Segmentation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Knowledge-Guided Agent-Tactic-Aware Learning for StarCraft Micromanagement.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Geometry-Aware Scene Text Detection With Instance Transformation Network.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multi-Channel Pyramid Person Matching Network for Person Re-Identification.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

FR-ANet: A Face Recognition Guided Facial Attribute Classification Network.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Learning Bregman Distance Functions for Structural Learning to Rank.
IEEE Trans. Knowl. Data Eng., 2017

Data-Dependent Label Distribution Learning for Age Estimation.
IEEE Trans. Image Process., 2017

Regularized Deep Belief Network for Image Attribute Detection.
IEEE Trans. Circuits Syst. Video Technol., 2017

Joint entity-relation knowledge embedding via cost-sensitive learning.
Frontiers Inf. Technol. Electron. Eng., 2017

KeyphraseDS: Automatic generation of survey by exploiting keyphrase information.
Neurocomputing, 2017

DR-Net: Transmission Steered Single Image Dehazing Network with Weakly Supervised Refinement.
CoRR, 2017

Deep Optical Flow Estimation Via Multi-Scale Correspondence Structure Learning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Group-wise Deep Co-saliency Detection.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Boosted Zero-Shot Learning with Semantic Correlation Regularization.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Graph-theoretic spatiotemporal context modeling for video saliency detection.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Deeply-Learned Part-Aligned Representations for Person Re-identification.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Pyramid Person Matching Network for Person Re-identification.
Proceedings of The 9th Asian Conference on Machine Learning, 2017

2016
Scalable Linear Visual Feature Learning via Online Parallel Nonnegative Matrix Factorization.
IEEE Trans. Neural Networks Learn. Syst., 2016

Joint Multilabel Classification With Community-Aware Label Graph Learning.
IEEE Trans. Image Process., 2016

DeepSaliency: Multi-Task Deep Neural Network Model for Salient Object Detection.
IEEE Trans. Image Process., 2016

Deep Learning Driven Visual Path Prediction From a Single Image.
IEEE Trans. Image Process., 2016

Learning A Superpixel-Driven Speed Function for Level Set Tracking.
IEEE Trans. Cybern., 2016

Aspect Learning for Multimedia Summarization via Nonparametric Bayesian.
IEEE Trans. Circuits Syst. Video Technol., 2016

Structure-Aware Slow Feature Analysis for Age Estimation.
IEEE Signal Process. Lett., 2016

Online Metric-Weighted Linear Representations for Robust Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Fusing ℝ Features and Local Features with Context-Aware Kernels for Action Recognition.
Int. J. Comput. Vis., 2016

On the Connection of Deep Fusion to Ensembling.
CoRR, 2016

Semantics-Aware Deep Correspondence Structure Learning for Robust Person Re-Identification.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Diverse Image Captioning via GroupTalk.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Self-Paced Boost Learning for Classification.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015
Structured Visual Feature Learning for Classification via Supervised Probabilistic Tensor Factorization.
IEEE Trans. Multim., 2015

Joint Structural Learning to Rank with Deep Linear Feature Learning.
IEEE Trans. Knowl. Data Eng., 2015

Cross-Modal Learning to Rank via Latent Joint Representation.
IEEE Trans. Image Process., 2015

Multimedia Retrieval via Deep Learning to Rank.
IEEE Signal Process. Lett., 2015

Dynamic spatio-temporal modeling for example-based human silhouette recovery.
Signal Process., 2015

Deep learning driven blockwise moving object detection with binary scene modeling.
Neurocomputing, 2015

Tracking news article evolution by dense subgraph learning.
Neurocomputing, 2015

Deep Compositional Cross-modal Learning to Rank via Local-Global Alignment.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

3D Hand Pose Estimation Using Randomized Decision Forest with Segmentation Index Points.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Metric Learning Driven Multi-Task Structured Output Optimization for Robust Keypoint Tracking.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Structured Embedding via Pairwise Relations and Long-Range Interactions in Knowledge Base.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Context-Aware Hypergraph Construction for Robust Spectral Clustering.
IEEE Trans. Knowl. Data Eng., 2014

Modeling Geometric-Temporal Context With Directional Pyramid Co-Occurrence for Action Recognition.
IEEE Trans. Image Process., 2014

Ranking consistency for image matching and object retrieval.
Pattern Recognit., 2014

Multi-modal Mutual Topic Reinforce Modeling for Cross-media Retrieval.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Jointly Discovering Fine-grained and Coarse-grained Sentiments via Topic Modeling.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Learning Multimodal Neural Network with Ranking Examples.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Structural Bregman Distance Functions Learning to Rank with Self-Reinforcement.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

Context Based Re-ranking for Object Retrieval.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
A survey of appearance models in visual object tracking.
ACM Trans. Intell. Syst. Technol., 2013

Visual Tracking With Spatio-Temporal Dempster-Shafer Information Fusion.
IEEE Trans. Image Process., 2013

Incremental Learning of 3D-DCT Compact Representations for Robust Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

An Incremental DPMM-Based Method for Trajectory Clustering, Modeling, and Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Spatially aware feature selection and weighting for object retrieval.
Image Vis. Comput., 2013

An Improved Hierarchical Dirichlet Process-Hidden Markov Model and Its Application to Trajectory Modeling and Retrieval.
Int. J. Comput. Vis., 2013

Contextual Hypergraph Modelling for Salient Object Detection.
CoRR, 2013

Learning Hash Functions Using Column Generation.
Proceedings of the 30th International Conference on Machine Learning, 2013

Contextual Hypergraph Modeling for Salient Object Detection.
Proceedings of the IEEE International Conference on Computer Vision, 2013

3D R Transform on Spatio-temporal Interest Points for Action Recognition.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Learning Compact Binary Codes for Visual Tracking.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Boosting Object Retrieval With Group Queries.
IEEE Signal Process. Lett., 2012

Single and Multiple Object Tracking Using Log-Euclidean Riemannian Subspace and Block-Division Appearance Model.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Superpixel-driven level set tracking.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Adaptive human silhouette reconstruction based on the exploration of temporal information.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Visual Distance Measures for Object Retrieval.
Proceedings of the 2012 International Conference on Digital Image Computing Techniques and Applications, 2012

Non-sparse linear representations for visual tracking with online reservoir metric learning.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Visual tracking via dynamic tensor analysis with mean update.
Neurocomputing, 2011

Incremental Tensor Subspace Learning and Its Applications to Foreground Segmentation and Tracking.
Int. J. Comput. Vis., 2011

Robust visual tracking via transfer learning.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Graph mode-based contextual kernels for robust SVM tracking.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Efficient block-division model for robust multiple object tracking.
Proceedings of the IEEE International Conference on Acoustics, 2011

Superpixel-based object class segmentation using conditional random fields.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Heat Kernel Based Local Binary Pattern for Face Representation.
IEEE Signal Process. Lett., 2010

Robust object tracking using a spatial pyramid heat kernel structural information representation.
Neurocomputing, 2010

Linear discriminant analysis using rotational invariant L₁ norm.
Neurocomputing, 2010

Context dependent SVMs for interconnected image network annotation.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Semi-supervised Trajectory Learning Using a Multi-Scale Key Point Based Trajectory Representation.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Spatio-Temporal Proximity Distribution Kernels for Action Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

TELECOM ParisTech at ImageCLEF 2010 Photo Annotation Task: Combining Tags and Visual Features for Learning-Based Image Annotation.
Proceedings of the CLEF 2010 LABs and Workshops, 2010

Context-Based Support Vector Machines for Interconnected Image Annotation.
Proceedings of the Computer Vision - ACCV 2010, 2010

2009
Adaptive Distributed Intrusion Detection Using Parametric Model.
Proceedings of the 2009 IEEE/WIC/ACM International Conference on Web Intelligence, 2009

A Boosted Semi-supervised Learning Framework for Web Page Filtering.
Proceedings of the IEEE International Conference on Systems, 2009

Video shot segmentation using graph-based dominant-set clustering.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Image spam filtering using Fourier-Mellin invariant features.
Proceedings of the IEEE International Conference on Acoustics, 2009

Segment Model Based Vehicle Motion Analysis.
Proceedings of the Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009

Human Action Recognition Using Pyramid Vocabulary Tree.
Proceedings of the Computer Vision, 2009

Human Action Recognition under Log-Euclidean Riemannian Metric.
Proceedings of the Computer Vision, 2009

Spectral Graph Partitioning Based on a Random Walk Diffusion Similarity Measure.
Proceedings of the Computer Vision, 2009

2008
User oriented link function classification.
Proceedings of the 17th International Conference on World Wide Web, 2008

Distributed detection of network intrusions based on a parametric model.
Proceedings of the IEEE International Conference on Systems, 2008

Robust foreground segmentation based on two effective background models.
Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008

SVD based Kalman particle filter for robust visual tracking.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Boosted cannabis image recognition.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Multiclass spectral clustering based on discriminant analysis.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Level set tracking with dynamical shape priors.
Proceedings of the International Conference on Image Processing, 2008

Robust Visual Tracking Based on an Effective Appearance Model.
Proceedings of the Computer Vision, 2008

Sequential particle swarm optimization for visual tracking.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Visual tracking via incremental Log-Euclidean Riemannian subspace learning.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Trajectory-Based Video Retrieval Using Dirichlet Process Mixture Models.
Proceedings of the British Machine Vision Conference 2008, Leeds, UK, September 2008, 2008

2007
Customizable Instance-Driven Webpage Filtering Based on Semi-Supervised Learning.
Proceedings of the 2007 IEEE / WIC / ACM International Conference on Web Intelligence, 2007

Corner Detection of Contour Images using Spectral Clustering.
Proceedings of the International Conference on Image Processing, 2007

Graph Based Discriminative Learning for Robust and Efficient Object Tracking.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Robust Visual Tracking Based on Incremental Tensor Subspace Learning.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

2006
A Coarse-to-Fine Strategy for Vehicle Motion Trajectory Clustering.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

