Jinqiao Wang

Orcid: 0000-0002-9118-2780

Affiliations:
  • Chinese Academy of Sciences, National Laboratory of Pattern Recognition


According to our database1, Jinqiao Wang authored at least 304 papers between 2005 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Efficient Masked Autoencoders With Self-Consistency.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Multi-Model Style-Aware Diffusion Learning for Semantic Image Synthesis.
ACM Trans. Multim. Comput. Commun. Appl., November, 2024

Relation-Associated Instructions & Hallucination Benchmark.
Dataset, July, 2024

Structural Dependence Learning Based on Self-attention for Face Alignment.
Mach. Intell. Res., June, 2024

Dual-Path Transformer for 3D Human Pose Estimation.
IEEE Trans. Circuits Syst. Video Technol., May, 2024

Objformer: Boosting 3D object detection via instance-wise interaction.
Pattern Recognit., February, 2024

Artificial intelligence for automatic surgical phase recognition of laparoscopic gastrectomy in gastric cancer.
Int. J. Comput. Assist. Radiol. Surg., February, 2024

A fast mask synthesis method for face recognition.
Vis. Intell., 2024

MAC: Masked Contrastive Pre-Training for Efficient Video-Text Retrieval.
IEEE Trans. Multim., 2024

Pixel-Level Contrastive Pretrainer for Industrial Image Representation.
IEEE Trans. Instrum. Meas., 2024

ImFusion: Boosting Two-Stage 3D Object Detection via Image Candidates.
IEEE Signal Process. Lett., 2024

Learning facial structural dependency in 3D aligned space for face alignment.
Image Vis. Comput., 2024

SlowFastFormer for 3D human pose estimation.
Comput. Vis. Image Underst., 2024

Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models.
CoRR, 2024

The BRAVO Semantic Segmentation Challenge Results in UNCV2024.
CoRR, 2024

MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation.
CoRR, 2024

AnyDesign: Versatile Area Fashion Editing via Mask-Free Diffusion.
CoRR, 2024

Recurrent Context Compression: Efficiently Expanding the Context Window of LLM.
CoRR, 2024

VS-Assistant: Versatile Surgery Assistant on the Demand of Surgeons.
CoRR, 2024

Pattern-Aware Chain-of-Thought Prompting in Large Language Models.
CoRR, 2024

PM-VIS: High-Performance Box-Supervised Video Instance Segmentation.
CoRR, 2024

Optimization of Prompt Learning via Multi-Knowledge Representation for Vision-Language Models.
CoRR, 2024

Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring.
CoRR, 2024

FiLo: Zero-Shot Anomaly Detection by Fine-Grained Description and High-Quality Localization.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Auto DragGAN: Editing the Generative Image Manifold in an Autoregressive Manner.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

PFDM: Parser-Free Virtual Try-On via Diffusion Model.
Proceedings of the IEEE International Conference on Acoustics, 2024

BFRFormer: Transformer-Based Generator for Real-World Blind Face Restoration.
Proceedings of the IEEE International Conference on Acoustics, 2024

The Devil is in Details: Delving Into Lite FFN Design for Vision Transformers.
Proceedings of the IEEE International Conference on Acoustics, 2024

SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Griffon: Spelling Out All Object Locations at Any Granularity with Large Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Enhancing Text-to-SQL Capabilities of Large Language Models via Domain Database Knowledge Injection.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Self-Supervised Representation Learning from Arbitrary Scenarios.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Fluctuation-Based Adaptive Structured Pruning for Large Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Learning Semantics-Consistent Stripes With Self-Refinement for Person Re-Identification.
IEEE Trans. Neural Networks Learn. Syst., November, 2023

Progressive Direction-Aware Pose Grammar for Human Pose Estimation.
IEEE Trans. Biom. Behav. Identity Sci., October, 2023

Bi-Level Implicit Semantic Data Augmentation for Vehicle Re-Identification.
IEEE Trans. Intell. Transp. Syst., April, 2023

Pseudo Label Rectification With Joint Camera Shift Adaptation and Outlier Progressive Recycling for Unsupervised Person Re-Identification.
IEEE Trans. Intell. Transp. Syst., March, 2023

Human Parsing With Part-Aware Relation Modeling.
IEEE Trans. Multim., 2023

Pruning-aware Sparse Regularization for Network Pruning.
Int. J. Autom. Comput., 2023

Mitigating Hallucination in Visual Language Models with Visual Supervision.
CoRR, 2023

Continual Instruction Tuning for Large Multimodal Models.
CoRR, 2023

Surgical Temporal Action-aware Network with Sequence Regularization for Phase Recognition.
CoRR, 2023

ChineseWebText: Large-scale High-quality Chinese Web Text Extracted with Effective Evaluation Model.
CoRR, 2023

IAIFNet: An Illumination-Aware Infrared and Visible Image Fusion Network.
CoRR, 2023

SSPFusion: A Semantic Structure-Preserving Approach for Infrared and Visible Image Fusion.
CoRR, 2023

FastBCSD: Fast and Efficient Neural Network for Binary Code Similarity Detection.
CoRR, 2023

Fast Segment Anything.
CoRR, 2023

FreConv: Frequency Branch-and-Integration Convolutional Networks.
CoRR, 2023

ZBS: Zero-shot Background Subtraction via Instance-level Background Modeling and Foreground Selection.
CoRR, 2023

Efficient Masked Autoencoders with Self-Consistency.
CoRR, 2023

Uncertainty-Aware Boundary Attention Network for Real-Time Semantic Segmentation.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Instance-Proxy Loss for Semi-supervised Learning with Coarse Labels.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Temporal-Channel Topology Enhanced Network for Skeleton-Based Action Recognition.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Surgical Video Captioning with Mutual-Modal Concept Alignment.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

ShiftFormer: Spatial-Temporal Shift Operation in Video Transformer.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

FreConv: Frequency Branch-and-Integration Convolutional Networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Explicit Attention Modeling for Pedestrian Attribute Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

ZBS: Zero-Shot Background Subtraction via Instance-Level Background Modeling and Foreground Selection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Temporal Action-aware Network with Sequence Regularization for Phase Recognition.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2023

2022
Hybrid Modality Metric Learning for Visible-Infrared Person Re-Identification.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Multi-Granularity Mutual Learning Network for Object Re-Identification.
IEEE Trans. Intell. Transp. Syst., 2022

Grammar-Induced Wavelet Network for Human Parsing.
IEEE Trans. Image Process., 2022

Dynamic Orthogonal Projection Constrained Discriminative Tracking.
IEEE Signal Process. Lett., 2022

Fine-Grained Human-Centric Tracklet Segmentation with Single Frame Supervision.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Masked Contrastive Pre-Training for Efficient Video-Text Retrieval.
CoRR, 2022

Plug-and-Play Pseudo Label Correction Network for Unsupervised Person Re-identification.
CoRR, 2022

Part-Aware Self-Supervised Pre-Training for Person Re-Identification.
CoRR, 2022

PruneFaceDet: Pruning lightweight face detection network by sparsity training.
Cogn. Comput. Syst., 2022

Global Patch Cross-Attention for Point Cloud Analysis.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

TaiSu: A 166M Large-scale High-Quality Dataset for Chinese Vision-Language Pre-training.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Graph Neural Networks Based Multi-granularity Feature Representation Learning for Fine-Grained Visual Categorization.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

When Skeleton Meets Appearance: Adaptive Appearance Information Enhancement for Skeleton Based Action Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Transfering Low-Frequency Features for Domain Adaptation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification.
Proceedings of the Computer Vision - ECCV 2022, 2022

Regularizing Vector Embedding in Bottom-Up Human Pose Estimation.
Proceedings of the Computer Vision - ECCV 2022, 2022

C2AM Loss: Chasing a Better Decision Boundary for Long-Tail Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

UniVIP: A Unified Framework for Self-Supervised Visual Pre-training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

An Ensemble of One-Stage and Two-Stage Detectors Approach for Road Damage Detection.
Proceedings of the IEEE International Conference on Big Data, 2022

2021
Antidecay LSTM for Siamese Tracking With Adversarial Learning.
IEEE Trans. Neural Networks Learn. Syst., 2021

Siamese Regression Tracking With Reinforced Template Updating.
IEEE Trans. Image Process., 2021

Semi-Supervised Scene Text Recognition.
IEEE Trans. Image Process., 2021

Enhanced Bounding Box Estimation with Distribution Calibration for Visual Tracking.
Sensors, 2021

STN-enhanced message passing guided by adversarial learning for human pose estimation.
Neurocomputing, 2021

Macro-micro mutual learning inside compositional model for human pose estimation.
Neurocomputing, 2021

Unsupervised cycle-consistent person pose transfer.
Neurocomputing, 2021

OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation.
CoRR, 2021

AAformer: Auto-Aligned Transformer for Person Re-Identification.
CoRR, 2021

Fast Kernelized Correlation Filter without Boundary Effect.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

High-Performance Discriminative Tracking with Target-Aware Feature Embeddings.
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021

MST: Masked Self-Supervised Transformer for Visual Representation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

DPT: Deformable Patch-based Transformer for Visual Recognition.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Attention-Guided Knowledge Distillation for Efficient Single-Stage Detector.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

High-Performance Discriminative Tracking with Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Improving Multiple Object Tracking With Single Object Tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Adaptive Class Suppression Loss for Long-Tail Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Consistent-Separable Feature Representation for Semantic Segmentation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Recall What You See Continually Using GridLSTM in Image Captioning.
IEEE Trans. Multim., 2020

A Comparison of Correlation Filter-Based Trackers and Struck Trackers.
IEEE Trans. Circuits Syst. Video Technol., 2020

An end-to-end exemplar association for unsupervised person Re-identification.
Neural Networks, 2020

Food det: Detecting foods in refrigerator with supervised transformer network.
Neurocomputing, 2020

Siamese Deformable Cross-Correlation Network for Real-Time Visual Tracking.
Neurocomputing, 2020

Semantic-spatial fusion network for human parsing.
Neurocomputing, 2020

A novel data augmentation scheme for pedestrian detection with attribute preserving GAN.
Neurocomputing, 2020

Progressive rectification network for irregular text recognition.
Sci. China Inf. Sci., 2020

Siamese Attentive Graph Tracking.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Task Decoupled Knowledge Distillation For Lightweight Face Detectors.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Unsupervised Domain Adaptive Re-Identification with Feature Adversarial Learning and Self-similarity Clustering.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

High-Speed And Accurate Scale Estimation For Visual Tracking With Gaussian Process Regression.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

PruneFaceDet: Pruning Lightweight Face Detection Network by Sparsity Training.
Proceedings of the ICCPR 2020: 9th International Conference on Computing and Pattern Recognition, Xiamen, China, October 30, 2020

Identity-Guided Human Semantic Parsing for Person Re-identification.
Proceedings of the Computer Vision - ECCV 2020, 2020

Occlusion-Aware Siamese Network for Human Pose Estimation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning Feature Embeddings for Discriminant Model Based Tracking.
Proceedings of the Computer Vision - ECCV 2020, 2020

Blended Grammar Network for Human Parsing.
Proceedings of the Computer Vision - ECCV 2020, 2020

Adaptive Variance Based Label Distribution Learning for Facial Age Estimation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Large Batch Optimization for Object Detection: Training COCO in 12 minutes.
Proceedings of the Computer Vision - ECCV 2020, 2020

Part-Aware Context Network for Human Parsing.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Progressive Bi-C3D Pose Grammar for Human Pose Estimation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Dynamic Collaborative Tracking.
IEEE Trans. Neural Networks Learn. Syst., 2019

Multi-Correlation Filters With Triangle-Structure Constraints for Object Tracking.
IEEE Trans. Multim., 2019

Attention CoupleNet: Fully Convolutional Attention Coupling Network for Object Detection.
IEEE Trans. Image Process., 2019

Two-Level Attention Network With Multi-Grain Ranking Loss for Vehicle Re-Identification.
IEEE Trans. Image Process., 2019

Feature Distilled Tracking.
IEEE Trans. Cybern., 2019

Adversarial Deep Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2019

Pixelwise Deep Sequence Learning for Moving Object Detection.
IEEE Trans. Circuits Syst. Video Technol., 2019

Real-Time Multi-Scale Face Detector on Embedded Devices.
Sensors, 2019

Elite Loss for scene text detection.
Neurocomputing, 2019

Reading scene text with fully convolutional sequence modeling.
Neurocomputing, 2019

Adversarial image generation by combining content and style.
IET Image Process., 2019

Class Regularization: Improve Few-shot Image Classification by Reducing Meta Shift.
CoRR, 2019

Learning Features with Differentiable Closed-Form Solver for Tracking.
CoRR, 2019

Color-Sensitive Person Re-Identification.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Mask Guided Knowledge Distillation for Single Shot Detector.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Bi-Directional Message Passing Based Scanet for Human Pose Estimation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Vehicle Re-Identification with Refined Part Model.
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Pose-Weighted Gan for Photorealistic Face Frontalization.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Cascade Attention Network for Person Re-Identification.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

The Seventh Visual Object Tracking VOT2019 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Fast-deepKCF Without Boundary Effect.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

FLDet: A CPU Real-time Joint Face and Landmark Detector.
Proceedings of the 2019 International Conference on Biometrics, 2019

In Defense of Color Names for Small-Scale Person Re-Identification.
Proceedings of the 2019 International Conference on Biometrics, 2019

Learning Discriminative and Complementary Patches for Face Recognition.
Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019

Semantic Alignment: Finding Semantically Consistent Ground-Truth for Facial Landmark Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Gate-based Bidirectional Interactive Decoding Network for Scene Text Recognition.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
Appearance features in Encoding Color Space for visual surveillance.
Neurocomputing, 2018

Recurrent Calibration Network for Irregular Text Recognition.
CoRR, 2018

High Speed Kernelized Correlation Filters without Boundary Effect.
CoRR, 2018

Multi-view pedestrian captioning with an attention topic CNN model.
Comput. Ind., 2018

Domain Adaptation Tracker With Global and Local Searching.
IEEE Access, 2018

Learning Robust Gaussian Process Regression for Visual Tracking.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Tree Hierarchical CNNs for Object Parsing.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Dense Chained Attention Network for Scene Text Recognition.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

The Sixth Visual Object Tracking VOT2018 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

High-Speed Tracking With Multi-Kernel Correlation Filters.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Improved Single Shot Object Detector Using Enhanced Features and Predicting Heads.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Progressive Cognitive Human Parsing.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Learning Coarse-to-Fine Structured Feature Embedding for Vehicle Re-Identification.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Learning discriminative context models for concurrent collective activity recognition.
Multim. Tools Appl., 2017

Automatic group activity annotation for mobile videos.
Multim. Syst., 2017

On the Relations of Correlation Filter Based Trackers and Struck.
CoRR, 2017

Reading Scene Text with Attention Convolutional Sequence Modeling.
CoRR, 2017

Fast Deep Matting for Portrait Animation on Mobile Phone.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

DenseTracker: A multi-task dense network for visual tracking.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Joint background reconstruction and foreground segmentation via a two-stage convolutional neural network.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Deep embedding network for robust age estimation.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Joint Visual Context for Pedestrian Captioning.
Proceedings of the Internet Multimedia Computing and Service, 2017

Automatic Watermeter Digit Recognition on Mobile Devices.
Proceedings of the Internet Multimedia Computing and Service, 2017

CoupleNet: Coupling Global Structure with Local Parts for Object Detection.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Learning Adaptive Receptive Fields for Deep Image Parsing Network.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Improving Visual Saliency Computing With Emotion Intensity.
IEEE Trans. Neural Networks Learn. Syst., 2016

Multi-View 3D Object Retrieval With Deep Embedding Network.
IEEE Trans. Image Process., 2016

Adaptive Content Condensation Based on Grid Optimization for Thumbnail Image Generation.
IEEE Trans. Circuits Syst. Video Technol., 2016

Real-time people counting for indoor scenes.
Signal Process., 2016

A unified model sharing framework for moving object detection.
Signal Process., 2016

ActiveAd: A novel framework of linking ad videos to online products.
Neurocomputing, 2016

Multiple deep features learning for object retrieval in surveillance videos.
IET Comput. Vis., 2016

Clustering based ensemble correlation tracking.
Comput. Vis. Image Underst., 2016

Learning weighted part models for object tracking.
Comput. Vis. Image Underst., 2016

WHU-NERCMS at TRECVID2016: Instance Search Task.
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016

Robust Crowd Segmentation and Counting in Indoor Scenes.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Natural image classification driven by human brain activity.
Proceedings of the Medical Imaging 2016: Biomedical Applications in Molecular, Structural, and Functional Imaging, San Diego, California, United States, 27 February, 2016

Scale-Adaptive Low-Resolution Person Re-Identification via Learning a Discriminating Surface.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Person re-identification via rich color-gradient feature.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Boosted local classifiers for visual tracking.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Multi-scale blocks based image emotion classification using multiple instance learning.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Extensive Comparison of Visual Features for Person Re-identification.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2016

Deep People Counting with Faster R-CNN and Correlation Tracking.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2016

Scale-Adaptive Deconvolutional Regression Network for Pedestrian Detection.
Proceedings of the Computer Vision - ACCV 2016, 2016

Piecewise Video Condensation for Complex Scenes.
Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

MC-HOG Correlation Tracking with Saliency Proposal.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Weighted Part Context Learning for Visual Tracking.
IEEE Trans. Image Process., 2015

Image Tag Refinement With View-Dependent Concept Representations.
IEEE Trans. Circuits Syst. Video Technol., 2015

Finding logos in real-world images with point-context representation-based region search.
Multim. Syst., 2015

A Real-Time People Counting Approach in Indoor Environment.
Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

Learning Multi-view Deep Features for Small Object Retrieval in Surveillance Scenarios.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Mobile Media Thumbnailing.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Learning sharable models for robust background subtraction.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Color names learning using convolutional neural networks.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Learning deep compact descriptor with bagging auto-encoders for object retrieval.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Multiple features based shared models for background subtraction.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

60 Hz self-tuning background modeling.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

Concurrent group activity classification with context modeling.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

Relaxing from Vocabulary: Robust Weakly-Supervised Deep Learning for Vocabulary-Free Image Tagging.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Collaborative Correlation Tracking.
Proceedings of the British Machine Vision Conference 2015, 2015


2014
Bilayer Sparse Topic Model for Scene Analysis in Imbalanced Surveillance Videos.
IEEE Trans. Image Process., 2014

Spatiotemporal Grid Flow for Video Retargeting.
IEEE Trans. Image Process., 2014

Spatiotemporal Group Context for Pedestrian Counting.
IEEE Trans. Circuits Syst. Video Technol., 2014

A hybrid domain enhanced framework for video retargeting with spatial-temporal importance and 3D grid optimization.
Signal Process., 2014

Sparse representation for robust abnormality detection in crowded scenes.
Pattern Recognit., 2014

Key observation selection-based effective video synopsis for camera network.
Mach. Vis. Appl., 2014

A three-level framework for affective content analysis and its case studies.
Multim. Tools Appl., 2014

Interactive ads recommendation with contextual search on product topic space.
Multim. Tools Appl., 2014

A Hybrid Image Retargeting Approach via Combining Seam Carving and Grid Warping.
J. Multim., 2014

Online video synopsis of structured motion.
Neurocomputing, 2014

Group latent factor model for recommendation with multiple user behaviors.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

A Curvature Filter and Normal Clustering Based Approach to Detecting Cylinder on 3D Medical Model.
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014

Mask Assisted Object Coding with Deep Learning for Object Retrieval in Surveillance Videos.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Estimate Gaze Density by Incorporating Emotion.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Discriminative Context Models for Collective Activity Recognition.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

A coarse-to-fine logo recognition method in video streams.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Recommendation on Flickr by combining community user ratings and item importance.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Object tracking with part-based discriminative context models.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Part Context Learning for Visual Tracking.
Proceedings of the British Machine Vision Conference, 2014

Clustering Ensemble Tracking.
Proceedings of the Computer Vision - ACCV 2014, 2014

Learning a Representative and Discriminative Part Model with Deep Convolutional Features for Scene Recognition.
Proceedings of the Computer Vision - ACCV 2014, 2014

What Visual Attributes Characterize an Object Class?
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
Exploiting content relevance and social relevance for personalized ad recommendation on internet TV.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Context-Aware Video Retargeting via Graph Model.
IEEE Trans. Multim., 2013

Dynamic scene understanding by improved sparse topical coding.
Pattern Recognit., 2013

Multiple Hypotheses Based Spatial-Temporal Association for Stable Pedestrian Counting.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Collaborative Tracking: Dynamically Fusing Short-Term Trackers and Long-Term Detector.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Subspace learning based active learning for image retrieval.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Improving scene classification with weakly spatial symmetry information.
Proceedings of the IEEE International Conference on Image Processing, 2013

Brand Image Detection in Broadcast Video Streams.
Proceedings of the Seventh International Conference on Image and Graphics, 2013

Classification Related Manifold Dimension Estimation with Restricted Boltzmann Machine.
Proceedings of the Seventh International Conference on Image and Graphics, 2013

2012
Enhanced 3-D Modeling for Landmark Image Classification.
IEEE Trans. Multim., 2012

Real-Time Probabilistic Covariance Tracking With Efficient Model Update.
IEEE Trans. Image Process., 2012

Real-time multiple object instances detection.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Key observation selection for effective video synopsis.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Learning Semantic Motion Patterns for Dynamic Scenes by Improved Sparse Topical Coding.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Anomaly detection in crowded scene via appearance and dynamics joint modeling.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Object-centered narratives for video surveillance.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Multiple features fusion for crowd density estimation.
Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Bag of features using sparse coding for gender classification.
Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Point-context descriptor based region search for logo recognition.
Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Fast seam carving with strip constraints.
Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Weighted Interaction Force Estimation for Abnormality Detection in Crowd Scenes.
Proceedings of the Computer Vision - ACCV 2012, 2012

Fusing Warping, Cropping, and Scaling for Optimal Image Thumbnail Generation.
Proceedings of the Computer Vision, 2012

Efficient Clothing Retrieval with Semantic-Preserving Visual Phrases.
Proceedings of the Computer Vision, 2012

2011
Boosting part-sense multi-feature learners toward effective object detection.
Comput. Vis. Image Underst., 2011

Adaptive Model for Robust Pedestrian Counting.
Proceedings of the Advances in Multimedia Modeling, 2011

Grid-Based Retargeting with Transformation Consistency Smoothing.
Proceedings of the Advances in Multimedia Modeling, 2011

Landmark recognition and retrieval: from 2D to 3D.
Proceedings of the 2011 joint ACM workshop on Human gesture and behavior understanding, 2011

Fast retargeting with adaptive grid optimization.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Using context saliency for movie shot classification.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Specific vehicle detection and tracking in road environment.
Proceedings of the ICIMCS 2011, 2011

Global Trajectory Construction across Multi-cameras via Graph Matching.
Proceedings of the Sixth International Conference on Image and Graphics, 2011

Video Reshuffling with Narratives toward Effective Video Browsing.
Proceedings of the Sixth International Conference on Image and Graphics, 2011

2010
People Detection by Boosting Features in Nonlinear Subspace.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Visual Attention Model Based Object Tracking.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Personalized Sports Video Customization for Mobile Devices.
Proceedings of the Advances in Multimedia Modeling, 2010

AdVR: Linking Ad Video with Products or Service.
Proceedings of the Advances in Multimedia Modeling, 2010

Landmark image classification using 3D point clouds.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Effective logo retrieval with adaptive local feature selection.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Fast feature selection and training for AdaBoost-based concept detection with large scale datasets.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Interactive Web Video Advertising with Context Analysis and Search.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Interactive service recommendation based on ad concept hierarchy.
Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

A improved silhouette tracking approach integrating particle filter with graph cuts.
Proceedings of the IEEE International Conference on Acoustics, 2010

Multi-level trajectory modeling for video copy detection.
Proceedings of the IEEE International Conference on Acoustics, 2010

Image Classification Using Spatial Pyramid Coding and Visual Word Reweighting.
Proceedings of the Computer Vision - ACCV 2010, 2010

2009
IVA-NLPR-IA-CAS TRECVID 2009: High LevelFeatures Extraction.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Based on the Reinforcement Learning Association Rules Recommendation Study.
Proceedings of the Fifth International Conference on Semantics, Knowledge and Grid, 2009

A Hierarchical Semantics-Matching Approach for Sports Video Annotation.
Proceedings of the Advances in Multimedia Information Processing, 2009

Sports video retargeting.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Consumer video retargeting: context assisted spatial-temporal grid optimization.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Linking video ADS with product or service information by web search.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Context saliency based image summarization.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Learning local features for object categorization.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Boosted forest for human detection.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Spatial pyramid based histogram representation for visual tracking with partial occlusion.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Semantic Linking between Video Ads and Web Services with Progressive Search.
Proceedings of the ICDM Workshops 2009, 2009

Real-time visual tracking via Incremental Covariance Tensor Learning.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Robust Bayesian tracking on Riemannian manifolds via fragments-based representation.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
A Multimodal Scheme for Program Segmentation and Representation in Broadcast Video Streams.
IEEE Trans. Multim., 2008

Digesting Commercial Clips from TV Streams.
IEEE Multim., 2008

A Spatial-Temporal-Scale Registration Approach for Video Copy Detection.
Proceedings of the Advances in Multimedia Information Processing, 2008

Boosting relative spaces for categorizing objects with large intra-class variation.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Hand posture recognition with co-training.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Online video advertising based on user's attention relavancy computing.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

A novel contextual descriptors for category recognition.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

2007
Automatic TV Logo Detection, Tracking and Removal in Broadcast Video.
Proceedings of the Advances in Multimedia Modeling, 2007

TV ad video categorization with probabilistic latent concept learning.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Semantic Event Extraction from Basketball Games using Multi-Modal Analysis.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Robust Commercial Retrieval in Video Streams.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

2006
A Semantic Image Category for Structuring TV Broadcast Video Streams.
Proceedings of the Advances in Multimedia Information Processing, 2006

Segmentation, categorization, and identification of commercial clips from TV streams using multimodal analysis.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

A Robust Method for TV Logo Tracking in Video Streams.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Web Image Mining Based on Modeling Concept-Sensitive Salient Regions.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Target Tracking in Infrared Image Sequences Using Diverse AdaBoostSVM.
Proceedings of the First International Conference on Innovative Computing, Information and Control (ICICIC 2006), 30 August, 2006

A Mid-Level Scene Change Representation Via Audiovisual Alignment.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Shot-Level Camera Motion Estimation Based on a Parametric Model.
Proceedings of the 2005 TREC Video Retrieval Evaluation, 2005


  Loading...