We stand with Ukraine

We stand with Ukraine

Gang Yu

Orcid: 0000-0001-5570-2710

Affiliations:

StepFun
Tencent, Shanghai, China (2019 - 2024)
Megvii, Beijing, China (2014 - 2019)
Nanyang Technological University, Singapore (PhD 2014)
Shanghai Jiao Tong University, China (former)

According to our database¹, Gang Yu authored at least 123 papers between 2009 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

On csauthors.net:

Bibliography

2025

MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, February, 2025

2024

Unbounded-GS: Extending 3D Gaussian Splatting With Hybrid Representation for Unbounded Large-Scale Scene Reconstruction.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

IEEE Robotics Autom. Lett., December, 2024

Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-Shot Metric Depth and Surface Normal Estimation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., November, 2024

Enhancing quality of pose-varied face restoration with local weak feature sensing and GAN prior.

[BibT_eX]

[DOI]

,

,

,

,

,

Neural Comput. Appl., January, 2024

Lightweight Model Pre-Training via Language Guided Knowledge Distillation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

IEEE Trans. Multim., 2024

MikuDance: Animating Character Art with Mixed Motion Dynamics.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

MeshXL: Neural Coordinate Field for Generative 3D Foundation Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

Generative Motion Stylization within Canonical Motion Space.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

Disentangled Pre-training for Image Matting.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

MeshXL: Neural Coordinate Field for Generative 3D Foundation Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Generative Motion Stylization of Cross-structure Characters within Canonical Motion Space.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

TapMo: Shape-aware Motion Generation of Skeleton-free Characters.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

M3DBench: Towards Omni 3D Assistant with Interleaved Multi-modal Instructions.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

MotionChain: Conversational Motion Controllers via Multimodal Prompts.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

Paint3D: Paint Anything 3D With Lighting-Less Texture Diffusion Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

PM-INR: Prior-Rich Multi-Modal Implicit Large-Scale Scene Neural Representation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

IT3D: Improved Text-to-3D Generation with Explicit View Synthesis.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

UAV-Served Energy Harvesting-Enabled M2M Networks for Green Industry - A Perspective of Energy Efficient Resource Management Scheme.

[BibT_eX]

[DOI]

,

,

,

,

,

Arumugam Nallanathan

IEEE Trans. Green Commun. Netw., December, 2023

DCNet: Large-Scale Point Cloud Semantic Segmentation With Discriminative and Efficient Feature Aggregation.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., August, 2023

AppAgent: Multimodal Agents as Smartphone Users.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2023

M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2023

FaceStudio: Put Your Face Everywhere in Seconds.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2023

ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2023

ChartLlama: A Multimodal LLM for Chart Understanding and Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2023

VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2023

Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Joey Tianyi Zhou

,

CoRR, 2023

StableLLaVA: Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2023

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2023

A Large-Scale Outdoor Multi-modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2023

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MotionGPT: Human Motion as a Foreign Language.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Capturing the Motion of Every Joint: 3D Human Pose and Shape Estimation with Independent Tokens.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Joey Tianyi Zhou

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Large-Scale Outdoor Multi-modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

End-to-End 3D Dense Captioning with Vote2Cap-DETR.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Executing your Commands via Motion Diffusion in Latent Space.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Stochastic Game for Resource Management in Cellular Zero-Touch Deterministic Industrial M2M Networks.

[BibT_eX]

[DOI]

,

,

Yang-Gang Zhang

,

IEEE Wirel. Commun. Lett., 2022

Sample-Centric Feature Generation for Semi-Supervised Few-Shot Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

IEEE Trans. Image Process., 2022

Executing your Commands via Motion Diffusion in Latent Space.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2022

Learning Variational Motion Prior for Video-based Motion Capture.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2022

Hierarchical Normalization for Robust Monocular Depth Estimation.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2022

Resource allocation for UAV-aided energy harvesting-powered D2D communications: A reinforcement learning-based scheme.

[BibT_eX]

[DOI]

,

,

,

Ad Hoc Networks, 2022

Hierarchical Normalization for Robust Monocular Depth Estimation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Coordinates Are NOT Lonely - Codebook Prior Helps Implicit Neural 3D representations.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

D &D: Learning Human Dynamics from Dynamic Camera.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2022, 2022

Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report.

[BibT_eX]

[DOI]

,

Grigory Malivenko

,

,

Lukasz Treszczotko

,

,

,

Michal Lopuszynski

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Marcos V. Conde

,

Denis Sapozhnikov

,

Byeong Hyun Lee

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video: Dataset, Methods and Results.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Shakarim Soltanayev

,

Chervontsev Sergey

,

Magauiya Zhussip

,

,

,

Pablo Navarrete Michelini

,

,

,

,

,

,

Chengjian Zheng

,

,

,

,

Thuong Nguyen Canh

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Sunder Ali Khowaja

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021

Generative Adversarial LSTM Networks Learning for Resource Allocation in UAV-Served M2M Communications.

[BibT_eX]

[DOI]

,

,

,

IEEE Wirel. Commun. Lett., 2021

Human pose estimation and its application to action recognition: A survey.

[BibT_eX]

[DOI]

,

,

,

J. Vis. Commun. Image Represent., 2021

BiSeNet V2: Bilateral Network with Guided Aggregation for Real-Time Semantic Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

Int. J. Comput. Vis., 2021

Sketch Me A Video.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2021

Fine-grained Identity Preserving Landmark Synthesis for Face Reenactment.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2021

Shuffle Transformer with Feature Alignment for Video Face Parsing.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2021

Shuffle Transformer: Rethinking Spatial Shuffle for Vision Transformer.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2021

Fast and Accurate Single-Image Depth Estimation on Mobile Devices, Mobile AI 2021 Challenge: Report.

[BibT_eX]

[DOI]

,

Grigory Malivenko

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Fausto T. Benavide

CoRR, 2021

Object-aware Long-short-range Spatial Alignment for Few-Shot Fine-Grained Image Classification.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Attribute-specific Control Units in StyleGAN for Fine-grained Image Manipulation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

HSEGAN: Hair Synthesis and Editing Using Structure-Adaptive Normalization on Generative Adversarial Network.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

A Simple Baseline for Fast and Accurate Depth Estimation on Mobile Devices.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020

AnchorFace: An Anchor-based Facial Landmark Detector Across Large Poses.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2020

Context Prior for Scene Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

High-Order Information Matters: Learning Relation and Topology for Occluded Person Re-Identification.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

State-Aware Tracker for Real-Time Video Object Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

SiamFC++: Towards Robust and Accurate Visual Tracking with Target Estimation Guidelines.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Real-Time Semantic Segmentation via Multiply Spatial Fusion Network.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2019

Double Anchor R-CNN for Human Detection in a Crowd.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2019

Class-balanced Grouping and Sampling for Point Cloud 3D Object Detection.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2019

Shape Robust Text Detection with Progressive Scale Expansion Network.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2019

ThunderNet: Towards Real-time Generic Object Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2019

WIDER Face and Pedestrian Challenge 2018: Methods and Results.

[BibT_eX]

[DOI]

Chen Change Loy

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Artem Kukharenko

,

,

Nikolay Sergievskiy

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2019

Rethinking on Multi-Stage Networks for Human Pose Estimation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2019

Learnable Tree Filter for Structure-preserving Feature Transform.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Efficient and Accurate Arbitrary-Shaped Text Detection With Pixel Aggregation Network.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

ThunderNet: Towards Real-Time Generic Object Detection on Mobile Devices.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Objects365: A Large-Scale, High-Quality Dataset for Object Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Shape Robust Text Detection With Progressive Scale Expansion Network.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

TACNet: Transition-Aware Context Network for Spatio-Temporal Action Detection.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

An End-To-End Network for Panoptic Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Modeling Local Geometric Structure of 3D Point Clouds Using Geo-CNN.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Scene Text Detection with Supervised Pyramid Context Network.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Attention-Based Multi-Context Guiding for Few-Shot Semantic Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

Cees G. M. Snoek

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

CrowdHuman: A Benchmark for Detecting Human in a Crowd.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2018

SFace: An Efficient Network for Face Detection in Large Scale Variations.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2018

DetNet: A Backbone network for Object Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2018

Selecting Informative Frames for Action Recognition with Partial Observations.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

BiSeNet: Bilateral Segmentation Network for Real-Time Semantic Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2018, 2018

DetNet: Design Backbone for Object Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2018, 2018

Associating Inter-image Salient Instances for Weakly Supervised Semantic Segmentation.

[BibT_eX]

[DOI]

,

,

Ming-Ming Cheng

,

,

Ralph R. Martin

,

Proceedings of the Computer Vision - ECCV 2018, 2018

Learning a Discriminative Feature Network for Semantic Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

MegDet: A Large Mini-Batch Object Detector.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Cascaded Pyramid Network for Multi-Person Pose Estimation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

R-FCN++: Towards Accurate Region-Based Fully Convolutional Networks for Object Detection.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

SOT for MOT.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2017

Light-Head R-CNN: In Defense of Two-Stage Object Detector.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2017

Face Attention Network: An Effective Face Detector for the Occluded Faces.

[BibT_eX]

[DOI]

,

,

CoRR, 2017

Large Kernel Matters - Improve Semantic Segmentation by Global Convolutional Network.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2015

Propagative Hough Voting for Human Activity Detection and Recognition.

[BibT_eX]

[DOI]

,

,

IEEE Trans. Circuits Syst. Video Technol., 2015

Fast action proposals for human action detection and search.

[BibT_eX]

[DOI]

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014

Scalable forest hashing for fast similarity search.

[BibT_eX]

[DOI]

,

Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Discriminative Orderlet Mining for Real-Time Recognition of Human-Object Interaction.

[BibT_eX]

[DOI]

,

,

Proceedings of the Computer Vision - ACCV 2014, 2014

2013

Action Search by Example Using Randomized Visual Vocabularies.

[BibT_eX]

[DOI]

,

,

IEEE Trans. Image Process., 2013

2012

Predicting human activities using spatio-temporal structure of interest points.

[BibT_eX]

[DOI]

,

,

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Propagative Hough Voting for Human Activity Recognition.

[BibT_eX]

[DOI]

,

,

Proceedings of the Computer Vision - ECCV 2012, 2012

Randomized Spatial Partition for Scene Recognition.

[BibT_eX]

[DOI]

,

,

Proceedings of the Computer Vision - ECCV 2012, 2012

2011

Fast Action Detection via Discriminative Random Forest Voting and Top-K Subvolume Search.

[BibT_eX]

[DOI]

,

Norberto A. Goussies

,

,

IEEE Trans. Multim., 2011

Robust object tracking with occlusion handle.

[BibT_eX]

[DOI]

,

,

,

Neural Comput. Appl., 2011

Real-time human action search using random forest based hough voting.

[BibT_eX]

[DOI]

,

,

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Unsupervised random forest indexing for fast action search.

[BibT_eX]

[DOI]

,

,

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2009

Robust Incremental Subspace Learning for Object Tracking.

[BibT_eX]

[DOI]

,

,

Proceedings of the Neural Information Processing, 16th International Conference, 2009

Illumination Invariant Object Tracking with Incremental Subspace Learning.

[BibT_eX]

[DOI]

,

Proceedings of the Fifth International Conference on Image and Graphics, 2009

Loading...