Junsong Yuan

ACM Trans. Multim. Comput. Commun. Appl., April, 2024

A Dual Reinforcement Learning Framework for Weakly Supervised Phrase Grounding.

IEEE Trans. Multim., 2024

Shared Latent Membership Enables Joint Shape Abstraction and Segmentation With Deformable Superquadrics.

IEEE Trans. Image Process., 2024

A Graph-Based Approach for Relating Integer Programs.

INFORMS J. Comput., 2024

Pluralistic Salient Object Detection.

CoRR, 2024

FADE: A Dataset for Detecting Falling Objects around Buildings in Video.

CoRR, 2024

Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation.

CoRR, 2024

STAT: Towards Generalizable Temporal Action Localization.

CoRR, 2024

AM^2-EmoJE: Adaptive Missing-Modality Emotion Recognition in Conversation via Joint Embedding Learning.

Naresh Kumar Devulapally

Sidharth Anand

CoRR, 2024

Show Your Face: Restoring Complete Facial Images from Partial Observations for VR Meeting.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation.

Proceedings of the Computer Vision - ECCV 2024, 2024

IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation.

Proceedings of the Computer Vision - ECCV 2024, 2024

Forecasting Future Videos from Novel Views via Disentangled 3D Scene Representation.

Sudhir Yarram

Proceedings of the Computer Vision - ECCV 2024, 2024

GRiT: A Generative Region-to-Text Transformer for Object Understanding.

Proceedings of the Computer Vision - ECCV 2024, 2024

Interaction-Centric Spatio-Temporal Context Reasoning for Multi-person Video HOI Recognition.

Proceedings of the Computer Vision - ECCV 2024, 2024

Divide and Fuse: Body Part Mesh Recovery from Partially Visible Human Images.

Proceedings of the Computer Vision - ECCV 2024, 2024

FSC: Few-Point Shape Completion.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Spectrum AUC Difference (SAUCD): Human-Aligned 3D Shape Evaluation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

SignRing: Continuous American Sign Language Recognition Using IMU Rings and Virtual IMU Data.

Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., September, 2023

Consistent 3D Hand Reconstruction in Video via Self-Supervised Learning.

IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Adaptive Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization.

IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

NeRFPlayer: A Streamable Dynamic Scene Representation with Decomposed Neural Radiance Fields.

IEEE Trans. Vis. Comput. Graph., 2023

Joint-Bone Fusion Graph Convolutional Network for Semi-Supervised Skeleton Action Recognition.

IEEE Trans. Multim., 2023

Federated Learning With Privacy-Preserving Ensemble Attention Distillation.

IEEE Trans. Medical Imaging, 2023

DTCM: Joint Optimization of Dark Enhancement and Action Recognition in Videos.

IEEE Trans. Image Process., 2023

Eyelid's Intrinsic Motion-Aware Feature Learning for Real-Time Eyeblink Detection in the Wild.

IEEE Trans. Inf. Forensics Secur., 2023

Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models.

CoRR, 2023

RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture.

CoRR, 2023

Dynamic Voxel Grid Optimization for High-Fidelity RGB-D Supervised Surface Reconstruction.

CoRR, 2023

Harnessing Low-Frequency Neural Fields for Few-Shot View Synthesis.

CoRR, 2023

Self-Supervised Distilled Learning for Multi-modal Misinformation Identification.

Michael Mu

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Semantics-Depth-Symbiosis: Deeply Coupled Semi-Supervised Learning of Semantics and Depth.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Chain-of-Look Prompting for Verb-centric Surgical Triplet Recognition in Endoscopic Videos.

Nan Xi

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Exploring the Knowledge Transferred by Response-Based Teacher-Student Distillation.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Relit-NeuLF: Efficient Relighting and Novel View Synthesis via Neural 4D Light Field.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Multi-label Emotion Analysis in Conversation via Multimodal Knowledge Distillation.

Sidharth Anand

Naresh Kumar Devulapally

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Language-guided Human Motion Synthesis with Atomic Actions.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Source-Free Domain Adaptation for Medical Image Segmentation via Prototype-Anchored Feature Alignment and Contrastive Learning.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Open Set Video HOI detection from Action-centric Chain-of-Look Prompting.

Nan Xi

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

NeuRBF: A Neural Fields Representation with Adaptive Radial Basis Functions.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Uncertainty-aware State Space Transformer for Egocentric 3D Hand Trajectory Forecasting.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SOAR: Scene-debiasing Open-set Action Recognition.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Generic Image Manipulation Detection with Weakly-Supervised Self-Consistency Learning.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

POINTACL: Adversarial Contrastive Learning for Robust Point Clouds Representation Under Adversarial Attack.

Proceedings of the IEEE International Conference on Acoustics, 2023

3D-aware Facial Landmark Detection via Multi-view Consistent Training on Synthetic Data.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

High Fidelity 3D Hand Shape Reconstruction via Scalable Graph Frequency Decomposition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Neural Voting Field for Camera-Space 3D Hand Pose Estimation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

AMuSE: Adaptive Multimodal Analysis for Speaker Emotion Recognition in Group Conversations.

Sidharth Anand

Naresh Kumar Devulapally

Yu-Ping Chang

Proceedings of the Ninth IEEE Multimedia Big Data, 2023

Progressive Multi-View Human Mesh Recovery with Self-Supervision.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

ForestDet: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation.

IEEE Trans. Multim., 2022

Motion-Driven Visual Tempo Learning for Video-Based Action Recognition.

Yuanzhong Liu

Zhigang Tu

IEEE Trans. Image Process., 2022

MAT: Multianchor Visual Tracking With Selective Search Region.

IEEE Trans. Cybern., 2022

Forest Graph Convolutional Network for Surgical Action Triplet Recognition in Endoscopic Videos.

Nan Xi

IEEE Trans. Circuits Syst. Video Technol., 2022

AppFuse: An Appearance Fusion Framework for Saliency Cues.

IEEE Trans. Circuits Syst. Video Technol., 2022

Hierarchical domain adaptation with local feature patterns.

Pattern Recognit., 2022

Video anomaly detection with spatio-temporal dissociation.

Pattern Recognit., 2022

Adversarial structured prediction for domain-adaptive semantic segmentation.

Sudhir Yarram

Mach. Vis. Appl., 2022

An Image-Based Approach to Detecting Structural Similarity Among Mixed Integer Programs.

INFORMS J. Comput., 2022

Slow-Fast Visual Tempo Learning for Video-based Action Recognition.

CoRR, 2022

Optical flow for video super-resolution: a survey.

Artif. Intell. Rev., 2022

NeuLF: Efficient Novel View Synthesis with Neural 4D Light Field.

Proceedings of the 33rd Eurographics Symposium on Rendering, 2022

Personalized Prediction of Indoor Comfort Using Graph Convolutional Matrix Completion.

Proceedings of the 5th IEEE International Conference on Multimedia Information Processing and Retrieval, 2022

Multimodal Attentive Learning for Real-time Explainable Emotion Recognition in Conversations.

Balaji Arumugam

Proceedings of the IEEE International Symposium on Circuits and Systems, 2022

Multi-view Knowledge Graph for Explainable Course Content Recommendation in Course Discussion Posts.

Jnana Sai Abhishek Varma Gokaraju

Abhilash Kalwa

Proceedings of the 26th International Conference on Pattern Recognition, 2022

Joint Global-Local Alignment for Domain Adaptive Semantic Segmentation.

Proceedings of the IEEE International Conference on Acoustics, 2022

Deformable VisTR: Spatio Temporal Deformable Attention for Video Instance Segmentation.

Proceedings of the IEEE International Conference on Acoustics, 2022

Generation for Unsupervised Domain Adaptation: A Gan-Based Approach for Object Classification with 3D Point Cloud Data.

Junxuan Huang

Chunming Qiao

Proceedings of the IEEE International Conference on Acoustics, 2022

PREF: Predictability Regularized Neural Motion Fields.

Proceedings of the Computer Vision - ECCV 2022, 2022

Neural Correspondence Field for Object Pose Estimation.

Proceedings of the Computer Vision - ECCV 2022, 2022

AiATrack: Attention in Attention for Transformer Visual Tracking.

Proceedings of the Computer Vision - ECCV 2022, 2022

MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Efficient Video Instance Segmentation via Tracklet Query and Proposal.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning Transferable Human-Object Interaction Detector with Natural Language Supervision.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

OVIS: Open-Vocabulary Visual Instance Search via Visual-Semantic Aligned Representation Learning.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Survey on depth and RGB image-based 3D hand shape and pose estimation.

Virtual Real. Intell. Hardw., 2021

Introduction to the Special Issue on Explainable AI on Multimedia Computing.

ACM Trans. Multim. Comput. Commun. Appl., 2021

Hierarchical Soft Quantization for Skeleton-Based Human Action Recognition.

IEEE Trans. Multim., 2021

3D Object Representation Learning: A Set-to-Set Matching Perspective.

IEEE Trans. Image Process., 2021

Image Co-Skeletonization via Co-Segmentation.

Jiangbo Lu

IEEE Trans. Image Process., 2021

Joint Hand-Object 3D Reconstruction From a Single Image With Cross-Branch Feature Fusion.

IEEE Trans. Image Process., 2021

Revisiting Modified Greedy Algorithm for Monotone Submodular Maximization with a Knapsack Constraint.

Proc. ACM Meas. Anal. Comput. Syst., 2021

SibNet: Sibling Convolutional Encoder for Video Captioning.

Sheng Liu

IEEE Trans. Pattern Anal. Mach. Intell., 2021

3D Hand Pose Estimation Using Synthetic Data and Weakly Labeled RGB Images.

Yujun Cai

Liuhao Ge

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Human pose estimation and its application to action recognition: A survey.

J. Vis. Commun. Image Represent., 2021

Pseudo Supervised Monocular Depth Estimation with Teacher-Student Network.

CoRR, 2021

Two-Stream Consensus Network: Submission to HACS Challenge 2021 Weakly-Supervised Learning Track.

CoRR, 2021

NeLF: Practical Novel View Synthesis with Neural Light Field.

CoRR, 2021

NeCH: Neural Clothed Human Model.

Proceedings of the International Conference on Visual Communications and Image Processing, 2021

Learning to Detect Monoclonal Protein in Electrophoresis Images.

Hanyu Li

Sabrina Racine-Brzostek

Proceedings of the International Conference on Visual Communications and Image Processing, 2021

Handling Difficult Labels for Multi-label Image Classification via Uncertainty Distillation.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Learning Kinematic Formulas from Multiple View Videos.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Rethinking Soft Labels for Knowledge Distillation: A Bias-Variance Tradeoff Perspective.

Proceedings of the 9th International Conference on Learning Representations, 2021

Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Stacked Homography Transformations for Multi-View Pedestrian Detection.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A Unified 3D Human Motion Synthesis Model via Conditional Variational Auto-Encoder.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

High Quality Disparity Remapping with Two-Stage Warping.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Track To Detect and Segment: An Online Multi-Object Tracker.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Model-Based 3D Hand Reconstruction via Self-Supervised Learning.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Multimodal Co-training for Fake News Identification Using Attention-aware Fusion.

Proceedings of the Pattern Recognition - 6th Asian Conference, 2021

Proactive Student Persistence Prediction in MOOCs via Multi-domain Adversarial Learning.

Proceedings of the Pattern Recognition - 6th Asian Conference, 2021

Robust Knowledge Transfer via Hybrid Forward on the Teacher-Student Model.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Pruning 3D Filters For Accelerating 3D ConvNets.

IEEE Trans. Multim., 2020

Unsupervised Learning of Optical Flow With CNN-Based Non-Local Filtering.

IEEE Trans. Image Process., 2020

Context-Integrated and Feature-Refined Network for Lightweight Object Parsing.

IEEE Trans. Image Process., 2020

Towards Real-Time Eyeblink Detection in the Wild: Dataset, Theory and Practices.

IEEE Trans. Inf. Forensics Secur., 2020

Occlusion Pattern Discovery for Object Detection and Occlusion Reasoning.

IEEE Trans. Circuits Syst. Video Technol., 2020

Early Action Recognition With Category Exclusion Using Policy-Based Reinforcement Learning.

IEEE Trans. Circuits Syst. Video Technol., 2020

Asymmetric Mapping Quantization for Nearest Neighbor Search.

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Detecting spatiotemporal irregularities in videos via a 3D convolutional autoencoder.

J. Vis. Commun. Image Represent., 2020

Product Quantization Network for Fast Visual Search.

Int. J. Comput. Vis., 2020

Interventional Domain Adaptation.

CoRR, 2020

Attention-Aware Noisy Label Learning for Image Classification.

CoRR, 2020

Deep Reinforcement Learning with Label Embedding Reward for Supervised Image Hashing.

CoRR, 2020

Towards Understanding the Adversarial Vulnerability of Skeleton-based Action Recognition.

CoRR, 2020

Temporal Pulses Driven Spiking Neural Network for Fast Object Recognition in Autonomous Driving.

CoRR, 2020

Learning Depth for Scene Reconstruction Using an Encoder-Decoder Model.

IEEE Access, 2020

Multipath Event-Based Network for Low-Power Human Action Recognition.

Xiao Wu

Proceedings of the 6th IEEE World Forum on Internet of Things, 2020

Self-Mimic Learning for Small-scale Pedestrian Detection.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection.

Ye Liu

Chang Wen Chen

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical Understanding of Outdoor Scene.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

HOT-Net: Non-Autoregressive Transformer for 3D Hand-Object Pose Estimation.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Dynamic Graph CNN for Event-Camera Based Gesture Recognition.

Proceedings of the IEEE International Symposium on Circuits and Systems, 2020

Temporal Pulses Driven Spiking Neural Network for Time and Power Efficient Object Recognition in Autonomous Driving.

Proceedings of the 25th International Conference on Pattern Recognition, 2020

S3F: A Multi-View Slow-Fast Network For Alzheimer's Disease Diagnosis.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization.

Proceedings of the Computer Vision - ECCV 2020, 2020

Structure-Aware Human-Action Generation.

Proceedings of the Computer Vision - ECCV 2020, 2020

Temporal Distinct Representation Learning for Action Recognition.

Proceedings of the Computer Vision - ECCV 2020, 2020

Hand-Transformer: Non-Autoregressive Structured Modeling for 3D Hand Pose Estimation.

Proceedings of the Computer Vision - ECCV 2020, 2020

Clustering Driven Deep Autoencoder for Video Anomaly Detection.

Proceedings of the Computer Vision - ECCV 2020, 2020

Learning Progressive Joint Propagation for Human Motion Prediction.

Proceedings of the Computer Vision - ECCV 2020, 2020

Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation Under Hand-Object Interaction.

Proceedings of the Computer Vision - ECCV 2020, 2020

Temporal-Context Enhanced Detection of Heavily Occluded Pedestrians.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Discovering Human Interactions With Novel Objects via Zero-Shot Learning.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

3DV: 3D Dynamic Voxel for Action Recognition in Depth Video.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Semantic Enhanced Sketch Based Image Retrieval with Incomplete Multimodal Query.

Proceedings of the 6th IEEE International Conference on Multimedia Big Data, 2020

Learning Diverse Stochastic Human-Action Generators by Learning Smooth Latent Transitions.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Boosting Positive and Unlabeled Learning for Anomaly Detection With Multi-Features.

IEEE Trans. Multim., 2019

Codebook-Free Compact Descriptor for Scalable Visual Search.

IEEE Trans. Multim., 2019

Action-Stage Emphasized Spatiotemporal VLAD for Video Action Recognition.

IEEE Trans. Image Process., 2019

Hough Forest With Optimized Leaves for Global Hand Pose Estimation With Arbitrary Postures.

IEEE Trans. Cybern., 2019

Dictionary Learning-Based, Directional, and Optimized Prediction for Lenslet Image Coding.

IEEE Trans. Circuits Syst. Video Technol., 2019

Discriminative Spatio-Temporal Pattern Discovery for 3D Action Recognition.

IEEE Trans. Circuits Syst. Video Technol., 2019

Semantic Cues Enhanced Multimodality Multistream CNN for Action Recognition.

IEEE Trans. Circuits Syst. Video Technol., 2019

Robust Distracter-Resistive Tracker via Learning a Multi-Component Discriminative Dictionary.

IEEE Trans. Circuits Syst. Video Technol., 2019

Efficient Video Object Co-Localization With Co-Saliency Activated Tracklets.

IEEE Trans. Circuits Syst. Video Technol., 2019

Real-Time Detection of Fall From Bed Using a Single Depth Camera.

IEEE Trans. Autom. Sci. Eng., 2019

A survey of variational and CNN-based optical flow techniques.

Signal Process. Image Commun., 2019

Multi-label learning of part detectors for occluded pedestrian detection.

Pattern Recognit., 2019

Learning a robust representation via a deep network on symmetric positive definite manifolds.

Pattern Recognit., 2019

Real-Time 3D Hand Pose Estimation with 3D Convolutional Neural Networks.

IEEE Trans. Pattern Anal. Mach. Intell., 2019

SLTFNet: A spatial and language-temporal tensor fusion network for video moment retrieval.

Inf. Process. Manag., 2019

Context-Integrated and Feature-Refined Network for Lightweight Urban Scene Parsing.

CoRR, 2019

Progress Regression RNN for Online Spatial-Temporal Action Localization in Unconstrained Videos.

CoRR, 2019

Space-Time Event Clouds for Gesture Recognition: From RGB Cameras to Event Cameras.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention.

Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

SPAGAN: Shortest Path Graph Attention Network.

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Bayesian Uncertainty Matching for Unsupervised Domain Adaptation.

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Motion-Let Clustering for Skeleton-Based Action Recognition.

Chen Zhu

Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Spatio-Temporal Multi-scale Soft Quantization Learning for Skeleton-Based Human Action Recognition.

Chen Zhu

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Discriminative Feature Transformation for Occluded Pedestrian Detection.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

PointCloud Saliency Maps.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Temporal Structure Mining for Weakly Supervised Action Detection.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation From a Single Depth Image.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

SO-HandNet: Self-Organizing Network for 3D Hand Pose Estimation With Semi-Supervised Learning.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Exploiting Spatial-Temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Kervolutional Neural Networks.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Joint Representative Selection and Feature Learning: A Semi-Supervised Approach.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

3D Hand Shape and Pose Estimation From a Single RGB Image.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Multi-View, Generative, Transfer Learning for Distributed Time Series Classification.

Abdullah al-Raihan Nayeem

S. George Djorgovski

Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

Exploiting Local Feature Patterns for Unsupervised Domain Adaptation.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Traffic-Optimized Data Placement for Social Media.

IEEE Trans. Multim., 2018

Quality-Guided Fusion-Based Co-Saliency Estimation for Image Co-Segmentation and Colocalization.

IEEE Trans. Multim., 2018

Query Adaptive Multiview Object Instance Search and Localization Using Sketches.

IEEE Trans. Multim., 2018

Profit Maximization for Viral Marketing in Online Social Networks: Algorithms and Analysis.

IEEE Trans. Knowl. Data Eng., 2018

Simultaneously Discovering and Localizing Common Objects in Wild Images.

IEEE Trans. Image Process., 2018

Video Summarization Via Multiview Representative Selection.

IEEE Trans. Image Process., 2018

Fried Binary Embedding: From High-Dimensional Visual Features to High-Dimensional Binary Codes.

IEEE Trans. Image Process., 2018

Robust 3D Hand Pose Estimation From Single Depth Images Using Multi-View CNNs.

IEEE Trans. Image Process., 2018

Minimizing Reconstruction Bias Hashing via Joint Projection Learning and Quantization.

IEEE Trans. Image Process., 2018

Local Large-Margin Multi-Metric Learning for Face and Kinship Verification.

IEEE Trans. Circuits Syst. Video Technol., 2018

Representative Selection on a Hypersphere.

IEEE Signal Process. Lett., 2018

An efficient and effective hop-based approach for influence maximization in social networks.

Soc. Netw. Anal. Min., 2018

Multi-stream CNN: Learning representations based on human-related regions for action recognition.

Pattern Recognit., 2018

Temporally enhanced image object proposals for online video object and action detections.

Jiong Yang

J. Vis. Commun. Image Represent., 2018

Learning Saliency Maps for Adversarial Point-Cloud Generation.

CoRR, 2018

Online Processing Algorithms for Influence Maximization.

Proceedings of the 2018 International Conference on Management of Data, 2018

Towards Profit Maximization for Online Social Network Providers.

Proceedings of the 2018 IEEE Conference on Computer Communications, 2018

3D Convolutional Generative Adversarial Networks for Detecting Temporal Irregularities in Videos.

Mengjia Yan

Proceedings of the 24th International Conference on Pattern Recognition, 2018

Unsupervised Multiple-Instance Learning for Instance Search.

Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Selecting Informative Frames for Action Recognition with Partial Observations.

Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Bi-box Regression for Pedestrian Detection and Occlusion Estimation.

Proceedings of the Computer Vision - ECCV 2018, 2018

Product Quantization Network for Fast Image Retrieval.

Proceedings of the Computer Vision - ECCV 2018, 2018

Deformable Pose Traversal Convolution for 3D Action and Gesture Recognition.

Proceedings of the Computer Vision - ECCV 2018, 2018

Point-to-Point Regression PointNet for 3D Hand Pose Estimation.

Liuhao Ge

Proceedings of the Computer Vision - ECCV 2018, 2018

Weakly-Supervised 3D Hand Pose Estimation from Monocular RGB Images.

Proceedings of the Computer Vision - ECCV 2018, 2018

Depth-Based 3D Hand Pose Estimation: From Current Achievements to Future Goals.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multi-View Harmonized Bilinear Network for 3D Object Recognition.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Salience Guided Depth Calibration for Perceptually Optimized Compressive Light Field 3D Display.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Recognizing Human Actions as the Evolution of Pose Estimation Maps.

Mengyuan Liu

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Conditional Generative Adversarial Network for Structured Domain Adaptation.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Hand PointNet: 3D Hand Pose Estimation Using Point Sets.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Understanding Human-Object Interaction in RGB-D videos for Human Robot Interaction.

Zhiwen Fang

Proceedings of Computer Graphics International 2018, 2018

Actor-Action Semantic Segmentation with Region Masks.

Proceedings of the British Machine Vision Conference 2018, 2018

Kernel Cross-Correlator.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Tensorized Projection for High-Dimensional Binary Embedding.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Distributed Composite Quantization.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Visual Pattern Discovery and Recognition.

Springer Briefs in Computer Science, Springer, ISBN: 978-981-10-4839-5, 2017

Sound-Event Classification Using Robust Texture Features for Robot Hearing.

IEEE Trans. Multim., 2017

Person Reidentification Using Multiple Egocentric Views.

Anirban Chakraborty

Bappaditya Mandal

IEEE Trans. Circuits Syst. Video Technol., 2017

Discovering Class-Specific Spatial Layouts for Scene Recognition.

IEEE Signal Process. Lett., 2017

LBP-Structure Optimization With Symmetry and Uniformity Regularizations for Scene Classification.

IEEE Signal Process. Lett., 2017

Representative Selection with Structured Sparsity.

Pattern Recognit., 2017

Fusing disparate object signatures for salient object detection in video.

Pattern Recognit., 2017

Learning location constrained pixel classifiers for image parsing.

J. Vis. Commun. Image Represent., 2017

3D Hand Pose Estimation: From Current Achievements to Future Goals.

CoRR, 2017

Non-Iterative Localization and Fast Mapping.

Chen Wang

Lihua Xie

CoRR, 2017

Non-Iterative SLAM: A Fast Dense Method for Inertial-Visual SLAM.

Chen Wang

Lihua Xie

CoRR, 2017

Positive and Unlabeled Learning for Anomaly Detection with Multi-features.

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Efficient tracking of closely spaced objects in depth data using sequential dirichlet process clustering.

Michael Hoy

Justin Dauwels

Proceedings of the IEEE Intelligent Vehicles Symposium, 2017

Efficient ground object segmentation in 3D LIDAR based on cascaded mode seeking.

Michael Hoy

Justin Dauwels

Proceedings of the 20th IEEE International Conference on Intelligent Transportation Systems, 2017

Is My Object in This Video? Reconstruction-based Object Search in Videos.

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Real time hand gesture recognition via finger-emphasized multi-scale description.

Chen Zhu

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Temporally enhanced image object proposals for videos.

Jiang Yang

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Context-aware graph-based analysis for detecting anomalous activities.

Jiaqi Zhang

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Efficient directional and L1-optimized intra-prediction for light field image compression.

Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Video Summarization via Multi-view Representative Selection.

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Multi-label Learning of Part Detectors for Heavily Occluded Pedestrian Detection.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Compressive Quantization for Fast Object Instance Search in Videos.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Common Action Discovery and Localization in Unconstrained Videos.

Jiong Yang

Proceedings of the IEEE International Conference on Computer Vision, 2017

Non-iterative SLAM.

Chen Wang

Lihua Xie

Proceedings of the 18th International Conference on Advanced Robotics, 2017

Real-time hierarchical fusion system for semantic segmentation in offroad scenes.

Proceedings of the 20th International Conference on Information Fusion, 2017

HOPE: Hierarchical Object Prototype Encoding for Efficient Object Instance Search in Videos.

Yuwei Wu

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Spatio-Temporal Naive-Bayes Nearest-Neighbor (ST-NBNN) for Skeleton-Based Action Recognition.

Junwu Weng

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Object Co-skeletonization with Co-segmentation.

Jiangbo Lu

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Fried Binary Embedding for High-Dimensional Visual Features.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

3D Convolutional Neural Networks for Efficient and Robust Hand Pose Estimation from Single Depth Images.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Influence Maximization Meets Efficiency and Effectiveness: A Hop-Based Approach.

Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017, Sydney, Australia, July 31, 2017

A framework based on deep learning and mathematical morphology for cabin door detection in an automated aerobridge docking system.

Proceedings of the 11th Asian Control Conference, 2017

Common visual pattern discovery and search.

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Efficient Object Instance Search Using Fuzzy Objects Matching.

Yuwei Wu

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Body Movement Analysis and Recognition.

Proceedings of the Context Aware Human-Robot and Human-Agent Interaction, 2016

Face and Facial Expressions Recognition and Analysis.

Proceedings of the Context Aware Human-Robot and Human-Agent Interaction, 2016

Object Instance Search in Videos via Spatio-Temporal Trajectory Discovery.

IEEE Trans. Multim., 2016

Image Co-segmentation via Saliency Co-fusion.

IEEE Trans. Multim., 2016

Query-Adaptive Small Object Search Using Object Proposals and Shape-Aware Descriptors.

Ling-Yu Duan

IEEE Trans. Multim., 2016

Fast Appearance Modeling for Automatic Primary Video Object Segmentation.

IEEE Trans. Image Process., 2016

Adobe Boxes: Locating Object Proposals Using Object Adobes.

IEEE Trans. Image Process., 2016

Discovering Primary Objects in Videos by Saliency Fusion and Iterative Appearance Estimation.

IEEE Trans. Circuits Syst. Video Technol., 2016

Introduction of New Associate Editors.

IEEE Trans. Circuits Syst. Video Technol., 2016

Discriminative Action States Discovery for Online Action Recognition.

Bo Hu

Yuwei Wu

IEEE Signal Process. Lett., 2016

Parsing 3D motion trajectory for gesture recognition.

Youfu Li

J. Vis. Commun. Image Represent., 2016

Finding spatio-temporal salient paths for video objects discovery.

Ye Luo

Jianwei Lu

J. Vis. Commun. Image Represent., 2016

Guest Editorial: Human Activity Understanding from 2D and 3D Data.

Int. J. Comput. Vis., 2016

Invariant multi-scale descriptor for shape representation, matching and retrieval.

Comput. Vis. Image Underst., 2016

Barehanded music: real-time hand interaction for virtual piano.

Proceedings of the 20th ACM SIGGRAPH Symposium on Interactive 3D Graphics and Games, 2016

L1-optimized linear prediction for light field image compression.

Proceedings of the 2016 Picture Coding Symposium, 2016

A Compact Binary Aggregated Descriptor via Dual Selection for Visual Search.

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Learning a Multi-class Discriminative Dictionary with Nonredundancy Constraints for Visual Classification.

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Query Adaptive Instance Search using Object Sketches.

Xiang Ruan

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

To Project More or to Quantize More: Minimize Reconstruction Bias for Learning Compact Binary Codes.

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Profit maximization for viral marketing in Online Social Networks.

Proceedings of the 24th IEEE International Conference on Network Protocols, 2016

Collaborative multi-view metric learning for visual classification.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Invariant multi-scale shape descriptor for object matching and recognition.

Haoran Xu

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Bayesian tracking of multiple objects with vision and radar.

Proceedings of the 14th International Conference on Control, 2016

CATS: Co-saliency Activated Tracklet Selection for Video Co-localization.

Proceedings of the Computer Vision - ECCV 2016, 2016

From Keyframes to Key Objects: Video Summarization by Representative Object Proposal Selection.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Robust 3D Hand Pose Estimation in Single Depth Images: From Single-View CNN to Multi-View CNNs.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Learning to Integrate Occlusion-Specific Detectors for Heavily Occluded Pedestrian Detection.

Proceedings of the Computer Vision - ACCV 2016, 2016

Random Forest with Suppressed Leaves for Hough Voting.

Proceedings of the Computer Vision - ACCV 2016, 2016

Multi-layer light field display characterisation.

Proceedings of the True Vision - Capture, Transmission and Display of 3D Video, 2016

2015

Efficient Mining of Optimal AND/OR Patterns for Visual Recognition.

IEEE Trans. Multim., 2015

Topical Video Object Discovery From Key Frames by Modeling Word Co-Occurrence Prior.

IEEE Trans. Image Process., 2015

Robust Discriminative Tracking via Landmark-Based Label Propagation.

IEEE Trans. Image Process., 2015

Manifold Kernel Sparse Representation of Symmetric Positive-Definite Matrices and Its Applications.

IEEE Trans. Image Process., 2015

A Chi-Squared-Transformed Subspace of LBP Histogram for Visual Recognition.

IEEE Trans. Image Process., 2015

Randomized Spatial Context for Object Search.

IEEE Trans. Image Process., 2015

Collaborative Multifeature Fusion for Transductive Spectral Learning.

IEEE Trans. Cybern., 2015

Propagative Hough Voting for Human Activity Detection and Recognition.

IEEE Trans. Circuits Syst. Video Technol., 2015

Resolving Ambiguous Hand Pose Predictions by Exploiting Part Correlations.

IEEE Trans. Circuits Syst. Video Technol., 2015

LBP Encoding Schemes Jointly Utilizing the Information of Current Bit and Other LBP Bits.

IEEE Signal Process. Lett., 2015

Learning LBP structure by maximizing the conditional mutual information.

Pattern Recognit., 2015

Flexible Trajectory Indexing for 3D Motion Recognition.

Youfu Li

Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

First-Person Palm Pose Tracking and Gesture Recognition in Augmented Reality.

Proceedings of the Computer Vision, Imaging and Computer Graphics Theory and Applications, 2015

Two-layer optimized light field display using depth initialization.

Proceedings of the 2015 Visual Communications and Image Processing, 2015

Glasses-free light field 3D display.

Proceedings of the 2015 Visual Communications and Image Processing, 2015

QCCE: Quality constrained co-saliency estimation for common object detection.

Proceedings of the 2015 Visual Communications and Image Processing, 2015

AR in Hand: Egocentric Palm Pose Tracking and Gesture Recognition for Augmented Reality Applications.

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Query-Adaptive Logo Search using Shape-Aware Descriptors.

Lingyu Duan

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Egocentric hand pose estimation and distance recovery in a single RGB image.

Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Fast object instance search in videos from one example.

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Group saliency propagation for large scale and quick image co-segmentation.

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Optimizing Inter-server Communication for Online Social Networks.

Proceedings of the 35th IEEE International Conference on Distributed Computing Systems, 2015

Adaptive Exponential Smoothing for Online Filtering of Pixel Prediction Maps.

Jiong Yang

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Quantized fuzzy LBP for face recognition.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Fast action proposals for human action detection and search.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Automatic Visual Pattern Discovery via Cohesive Subgraph Mining.

Proceedings of the Mobile Cloud Visual Media Computing - From Interaction to Service, 2015

2014

Visual pattern discovery in image and video data: a brief survey.

WIREs Data Mining Knowl. Discov., 2014

Parsing the Hand in Depth Images.

IEEE Trans. Multim., 2014

mCENTRIST: A Multi-Channel Feature Generation Mechanism for Scene Categorization.

Yang Xiao

Jianxin Wu

IEEE Trans. Image Process., 2014

Context-Aware Discovery of Visual Co-Occurrence Patterns.

IEEE Trans. Image Process., 2014

Optimizing LBP Structure For Visual Recognition Using Binary Quadratic Programming.

IEEE Signal Process. Lett., 2014

Entropic image thresholding based on GLGM histogram.

Yang Xiao

Zhiguo Cao

Pattern Recognit. Lett., 2014

Modelling Multi-Party Interactions among Virtual Characters, Robots, and Humans.

Zerrin Yumak

Presence Teleoperators Virtual Environ., 2014

Human-Robot Interaction by Understanding Upper Body Gestures.

Presence Teleoperators Virtual Environ., 2014

Learning Actionlet Ensemble for 3D Human Action Recognition.

IEEE Trans. Pattern Anal. Mach. Intell., 2014

Video Event Detection: From Subvolume Localization to Spatiotemporal Path Search.

Du Tran

David A. Forsyth

IEEE Trans. Pattern Anal. Mach. Intell., 2014

Fusion of 3D-LIDAR and camera data for scene parsing.

J. Vis. Commun. Image Represent., 2014

Tracking and fusion for multiparty interaction with a virtual character and a social robot.

Zerrin Yumak

Proceedings of the SIGGRAPH Asia 2014 Autonomous Virtual Humans and Social Robot for Telepresence, 2014

Activity recognition in unconstrained RGB-D video using 3D trajectories.

Proceedings of the SIGGRAPH Asia 2014 Autonomous Virtual Humans and Social Robot for Telepresence, 2014

Boosting cross-media retrieval via visual-auditory feature analysis and relevance feedback.

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Scalable forest hashing for fast similarity search.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Hierarchical multi-feature fusion for multimodal data analysis.

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Automatic image co-segmentation using geometric mean saliency.

Fanman Meng

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Efficient Online Spatio-Temporal Filtering for Video Event Detection.

Xinchen Yan

Proceedings of the Computer Vision - ECCV 2014 Workshops, 2014

Multi-feature Spectral Clustering with Minimax Optimization.

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Non-rectangular Part Discovery for Object Detection.

Proceedings of the British Machine Vision Conference, 2014

Location Constrained Pixel Classifiers for Image Parsing with Regular Spatial Layout.

Proceedings of the British Machine Vision Conference, 2014

Discriminative Orderlet Mining for Real-Time Recognition of Human-Object Interaction.

Proceedings of the Computer Vision - ACCV 2014, 2014

Large Margin Multi-metric Learning for Face and Kinship Verification in the Wild.

Proceedings of the Computer Vision - ACCV 2014, 2014

Height Gradient Histogram (HIGH) for 3D Scene Labeling.

Proceedings of the 2nd International Conference on 3D Vision, 2014

Low-Rank Online Metric Learning.

Proceedings of the Low-Rank and Sparse Modeling for Visual Analysis, 2014

2013

Model-based hand pose estimation via spatial-temporal hand parsing and 3D fingertip localization.

Vis. Comput., 2013

Robust Part-Based Hand Gesture Recognition Using Kinect Sensor.

IEEE Trans. Multim., 2013

Action Search by Example Using Randomized Visual Vocabularies.

IEEE Trans. Image Process., 2013

Noise-Resistant Local Binary Pattern With an Embedded Error-Correction Mechanism.

IEEE Trans. Image Process., 2013

Self-Supervised Online Metric Learning With Low Rank Constraint for Scene Categorization.

IEEE Trans. Image Process., 2013

Video Anomaly Search in Crowded Scenes via Spatio-Temporal Motion Context.

Yandong Tang

IEEE Trans. Inf. Forensics Secur., 2013

Hybrid Saliency Detection for Images.

Zhenzhong Chen

IEEE Signal Process. Lett., 2013

A complete and fully automated face verification system on mobile devices.

Pattern Recognit., 2013

Abnormal event detection in crowded scenes using sparse representation.

Ji Liu

Pattern Recognit., 2013

Minimum Near-Convex Shape Decomposition.

Wenyu Liu

IEEE Trans. Pattern Anal. Mach. Intell., 2013

Human-virtual human interaction by upper body gesture understanding.

Yang Xiao

Proceedings of the 19th ACM Symposium on Virtual Reality Software and Technology, 2013

Salient object detection in videos by optimal spatio-temporal path discovery.

Ye Luo

Proceedings of the ACM Multimedia Conference, 2013

Mobile media communication, processing, and analysis: A review of recent advances.

Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Hierarchical sparse coding based on spatial pooling and multi-feature fusion.

Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Direct mining co-occurrence features for visual recognition: A branch and bound method.

Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Learning weighted geometric pooling for image classification.

Proceedings of the IEEE International Conference on Image Processing, 2013

Relaxed local ternary pattern for face recognition.

Proceedings of the IEEE International Conference on Image Processing, 2013

Learning binarized pixel-difference pattern for scene recognition.

Proceedings of the IEEE International Conference on Image Processing, 2013

Voxel labelling in CT images with data-driven contextual features.

Ho Yee Tiong

Proceedings of the IEEE International Conference on Image Processing, 2013

Thematic Saliency Detection Using Spatial-Temporal Context.

Ye Luo

Proceedings of the 2013 IEEE International Conference on Computer Vision Workshops, 2013

Dynamic texture recognition using enhanced LBP features.

Proceedings of the IEEE International Conference on Acoustics, 2013

Topical Video Object Discovery from Key Frames by Modeling Word Co-occurrence Prior.

Gang Hua

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Object instance search in videos.

Proceedings of the 9th International Conference on Information, 2013

2012

Mining Visual Collocation Patterns via Self-Supervised Subspace Learning.

IEEE Trans. Syst. Man Cybern. Part B, 2012

Towards Scalable Summarization of Consumer Videos Via Sparse Dictionary Selection.

Jiebo Luo

IEEE Trans. Multim., 2012

Discovering Thematic Objects in Image Collections and Videos.

IEEE Trans. Image Process., 2012

Spatial Locality-Aware Sparse Coding and Dictionary Learning.

Proceedings of the 4th Asian Conference on Machine Learning, 2012

Location Discriminative Vocabulary Coding for Mobile Landmark Search.

Int. J. Comput. Vis., 2012

Hand pose estimation by combining fingertip tracking and articulated ICP.

Proceedings of the Virtual Reality Continuum and its Applications in Industry, 2012

Max-Margin Structured Output Regression for Spatio-Temporal Action Localization.

Du Tran

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Predicting human activities using spatio-temporal structure of interest points.

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

3D fingertip and palm tracking in depth image sequences.

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Rapid object search engine for contextual advertisement.

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Social Image Tagging by Mining Sparse Tag Patterns from Auxiliary Data.

Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Curb detection and tracking using 3D-LIDAR scanner.

Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Learning sparse tag patterns for social image classification.

Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Object tracking via online metric learning.

Yandong Tang

Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Fusion of Velodyne and camera data for scene parsing.

Xuhong Xiao

Proceedings of the 15th International Conference on Information Fusion, 2012

Propagative Hough Voting for Human Activity Recognition.

Proceedings of the Computer Vision - ECCV 2012, 2012

Randomized Spatial Partition for Scene Recognition.

Proceedings of the Computer Vision - ECCV 2012, 2012

Mining actionlet ensemble for action recognition with depth cameras.

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Randomized visual phrases for object search.

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Arbitrary-Shape Object Localization Using Adaptive Image Grids.

Proceedings of the Computer Vision - ACCV 2012, 2012

2011

Fast Action Detection via Discriminative Random Forest Voting and Top-K Subvolume Search.

IEEE Trans. Multim., 2011

Saliency Density Maximization for Efficient Visual Objects Discovery.

IEEE Trans. Circuits Syst. Video Technol., 2011

Discriminative Video Pattern Search for Efficient Action Detection.

IEEE Trans. Pattern Anal. Mach. Intell., 2011

Discovering the Thematic Object in Commercial Videos.

IEEE Multim., 2011

Learning spatio-temporal dependency of local patches for complex motion segmentation.

Jiang Xu

Comput. Vis. Image Underst., 2011

Anomalous video event detection using spatiotemporal context.

Fan Jiang

Sotirios A. Tsaftaris

Comput. Vis. Image Underst., 2011

Real-time human action search using random forest based hough voting.

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Robust hand gesture recognition based on finger-earth mover's distance with a commodity depth camera.

Zhengyou Zhang

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Robust hand gesture recognition with kinect sensor.

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Grassmann Hashing for approximate nearest neighbor search in high dimensional space.

Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Salient region detection and its application to video retargeting.

Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Grid-based local feature bundling for efficient object search and localization.

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Discovering Thematic Patterns in Videos via Cohesive Sub-graph Mining.

Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Combining Feature Context and Spatial Context for Image Pattern Discovery.

Proceedings of the 11th IEEE International Conference on Data Mining, 2011

A fast and accurate cascade subspace face/eye detector on mobile devices.

Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Minimum near-convex decomposition for robust shape representation.

Proceedings of the IEEE International Conference on Computer Vision, 2011

Mining discriminative co-occurrence patterns for visual recognition.

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Unsupervised random forest indexing for fast action search.

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Optimal spatio-temporal path discovery for video event detection.

Du Tran

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Sparse reconstruction cost for abnormal event detection.

Ji Liu

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Multiple instance boosting with global smoothness regularization.

Gang Hua

Proceedings of the 8th International Conference on Information, 2011

Depth camera based hand gesture recognition and its applications in Human-Computer-Interaction.

Proceedings of the 8th International Conference on Information, 2011

2010

Mining Compositional Features From GPS and Visual Cues for Event Recognition in Photo Collections.

Jiebo Luo

IEEE Trans. Multim., 2010

Mining and cropping common objects from images.

Proceedings of the 18th International Conference on Multimedia 2010, 2010

KPB-SIFT: a compact local feature descriptor.

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Interactive visual object search through mutual information maximization.

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Bipolar grouping.

Jiang Xu

Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Efficient search of Top-K video subvolumes for multi-instance action detection.

Norberto A. Goussies

Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Video anomaly detection in spatiotemporal context.

Fan Jiang

Sotirios A. Tsaftaris

Proceedings of the International Conference on Image Processing, 2010

Middle-Level Representation for Human Activities Recognition: The Role of Spatio-Temporal Relationships.

Fei Yuan

Véronique Prinet

Proceedings of the Trends and Topics in Computer Vision, 2010

Saliency Density Maximization for Object Detection and Localization.

Proceedings of the Computer Vision - ACCV 2010, 2010

2009

Mining Repetitive Patterns in Multimedia Data.

Proceedings of the Encyclopedia of Data Warehousing and Mining, Second Edition (4 Volumes), 2009

Speeding up spatio-temporal sliding-window search for efficient event detection in crowded videos.

Proceedings of the 1st ACM international workshop on Events in multimedia, 2009

Multimodal partial estimates fusion.

Jiang Xu

Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Discriminative subvolume search for efficient action detection.

Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Multimedia Data Indexing.

Thomas S. Huang

Proceedings of the Semantic Mining Technologies for Multimedia Databases, 2009

2008

Mining Recurring Events Through Forest Growing.

IEEE Trans. Circuits Syst. Video Technol., 2008

Locality Versus Globality: Query-Driven Localized Linear Models for Facial Image Computing.

IEEE Trans. Circuits Syst. Video Technol., 2008

Mining GPS traces and visual words for event classification.

Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008

Mining Motifs from Human Motion.

[BibT_eX]

[DOI]

Proceedings of the 29th Annual Conference of the European Association for Computer Graphics, 2008

Context-aware clustering.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Mining compositional features for boosting.

[BibT_eX]

[DOI]

Jiebo Luo

Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007

Mining repetitive clips through finding continuous paths.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on Multimedia 2007, 2007

From frequent itemsets to semantically meaningful visual patterns.

[BibT_eX]

[DOI]

Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Query Driven Localized Linear Discriminant Models for Head Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Common Spatial Pattern Discovery by Efficient Candidate Pruning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2007

Query-Driven Locally Adaptive Fisher Faces and Expert-Model for Face Recognition.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2007

Spatial Random Partition for Common Visual Pattern Discovery.

[BibT_eX]

[DOI]

Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Discovery of Collocation Patterns: from Visual Words to Visual Phrases.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Spatial selection for attentional visual tracking.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2004

Fast and Robust Short Video Clip Search for Copy Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Fast and robust video clip search using index structure.

[BibT_eX]

[DOI]

Proceedings of the 12th ACM International Conference on Multimedia, 2004

Fast and robust short video clip search using an index structure.

[BibT_eX]

[DOI]