Ming Yang

Orcid: 0000-0003-1691-6817

Affiliations:
  • Ant Group, Seattle, WA, USA
  • Facebook AI Research, Menlo Park, CA, USA (former)
  • Horizon Robotics Inc., Beijing, China (former)
  • NEC Laboratories America, Cupertino, CA, USA (former)
  • Northwestern University, Department of EECS, Evanston, IL, USA (PhD 2008)


According to our database1, Ming Yang authored at least 90 papers between 2004 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Adapting Vision-Language Models via Learning to Inject Knowledge.
IEEE Trans. Image Process., 2024

Animate-X: Universal Character Image Animation with Enhanced Motion Representation.
CoRR, 2024

Social Debiasing for Fair Multi-modal LLMs.
CoRR, 2024

Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight.
CoRR, 2024

SHE-Net: Syntax-Hierarchy-Enhanced Text-Video Retrieval.
CoRR, 2024

M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval.
CoRR, 2024

M<sub>2</sub>-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining.
CoRR, 2024

M<sup>2</sup>-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Parameter-Efficient Complementary Expert Learning for Long-Tailed Visual Recognition.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

EVE: Efficient Zero-Shot Text-Based Video Editing With Depth Map Guidance and Temporal Consistency Constraints.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

POA: Pre-training Once for Models of All Sizes.
Proceedings of the Computer Vision - ECCV 2024, 2024

StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Better Vision-Inspired Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Fine-grained Pseudo Labels for Scene Text Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022
ForestDet: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation.
IEEE Trans. Multim., 2022

BDCN: Bi-Directional Cascade Network for Perceptual Edge Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Adversarial structured prediction for domain-adaptive semantic segmentation.
Mach. Vis. Appl., 2022

Asymmetric Label Propagation for Video Object Segmentation.
Proceedings of the 4th ACM International Conference on Multimedia in Asia, 2022

Joint Global-Local Alignment for Domain Adaptive Semantic Segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
3D Object Representation Learning: A Set-to-Set Matching Perspective.
IEEE Trans. Image Process., 2021

Handling Difficult Labels for Multi-label Image Classification via Uncertainty Distillation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Stacked Homography Transformations for Multi-View Pedestrian Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Track To Detect and Segment: An Online Multi-Object Tracker.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Robust Knowledge Transfer via Hybrid Forward on the Teacher-Student Model.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Learning Recurrent 3D Attention for Video-Based Person Re-Identification.
IEEE Trans. Image Process., 2020

Self-Mimic Learning for Small-scale Pedestrian Detection.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Temporal-Context Enhanced Detection of Heavily Occluded Pedestrians.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Learning Semantics-Preserving Attention and Contextual Interaction for Group Activity Recognition.
IEEE Trans. Image Process., 2019

Spatial-Temporal Attention-Aware Learning for Video-Based Person Re-Identification.
IEEE Trans. Image Process., 2019

Self-Guided Hash Coding for Large-Scale Person Re-identification.
Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

Resolution-invariant Person Re-Identification.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Discriminative Feature Transformation for Occluded Pedestrian Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

SSAP: Single-Shot Instance Segmentation With Affinity Pyramid.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Bi-Directional Cascade Network for Perceptual Edge Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Assessing Image Retrieval Quality at the First Glance.
IEEE Trans. Image Process., 2018

Collaborative Active Visual Recognition from Crowds: A Distributed Ensemble Approach.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Mining Semantics-Preserving Attention for Group Activity Recognition.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Deep Reinforcement Learning with Iterative Shift for Visual Tracking.
Proceedings of the Computer Vision - ECCV 2018, 2018

Conditional Generative Adversarial Network for Structured Domain Adaptation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2016
Scalable Feature Matching by Dual Cascaded Scalar Quantization for Image Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

2015
Semantic-Aware Co-Indexing for Image Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Query Specific Rank Fusion for Image Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Regionlets for Generic Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Web-scale training for face identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Towards Codebook-Free: Scalable Cascaded Hashing for Mobile Image Search.
IEEE Trans. Multim., 2014

Compressing Deep Convolutional Networks using Vector Quantization.
CoRR, 2014

DeepFace: Closing the Gap to Human-Level Performance in Face Verification.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Accurate Object Detection with Location Relaxation and Regionlets Re-localization.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
3D Convolutional Neural Networks for Human Action Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Collaborative Active Learning of a Kernel Machine Ensemble for Recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2012
Query Specific Fusion for Image Retrieval.
Proceedings of the Computer Vision - ECCV 2012, 2012

2011
Real-time clothing recognition in surveillance videos.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Contextual weighting for vocabulary tree based image retrieval.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Mining discriminative co-occurrence patterns for visual recognition.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Correspondence driven adaptation for human profile recognition.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Large-scale image classification: Fast feature extraction and SVM training.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Efficient re-ranking in vocabulary tree based image retrieval.
Proceedings of the Conference Record of the Forty Fifth Asilomar Conference on Signals, 2011

2010
AdaBoost-based face detection for embedded systems.
Comput. Vis. Image Underst., 2010

Videos Semantic Indexing using Image Classification.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

2009
Tracking Nonstationary Visual Appearances by Data-Driven Adaptation.
IEEE Trans. Image Process., 2009

Context-Aware Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Detecting Human Actions in Surveillance Videos.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Detecting video events based on action recognition in complex scenes using spatio-temporal descriptor.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Human action detection by boosting efficient motion features.
Proceedings of the 12th IEEE International Conference on Computer Vision Workshops, 2009

Detection driven adaptive multi-cue integration for multiple human tracking.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Semi Supervised Image Spam Hunter: A Regularized Discriminant EM Approach.
Proceedings of the Advanced Data Mining and Applications, 5th International Conference, 2009

2008
Surveillance Event Detection.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

A bi-subspace model for robust visual tracking.
Proceedings of the International Conference on Image Processing, 2008

Image spam hunter.
Proceedings of the IEEE International Conference on Acoustics, 2008

Granularity and elasticity adaptation in visual tracking.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Vital sign estimation from passive thermal video.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
Multiple Collaborative Kernel Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2007

From frequent itemsets to semantically meaningful visual patterns.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Mining Auxiliary Objects for Tracking by Multibody Grouping.
Proceedings of the International Conference on Image Processing, 2007

Game-Theoretic Multiple Target Tracking.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

False Positive Reduction in Lung GGO Nodule Detection with 3D Volume Shape Descriptor.
Proceedings of the IEEE International Conference on Acoustics, 2007

Discovery of Collocation Patterns: from Visual Words to Visual Phrases.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Spatial selection for attentional visual tracking.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Detector Ensemble.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
Face detection for automatic exposure control in handheld camera.
Proceedings of the 2006 IEEE International Conference on Computer Vision Systems, 2006

Tracking Motion-Blurred Targets in Video.
Proceedings of the International Conference on Image Processing, 2006

Intelligent Collaborative Tracking by Mining Auxiliary Objects.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Efficient Optimal Kernel Placement for Reliable Visual Tracking.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

2005
Tracking Non-Stationary Appearances and Dynamic Feature Selection.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

2004
Fast macroblock mode selection based on motion content classification in H.264/AVC.
Proceedings of the 2004 International Conference on Image Processing, 2004


  Loading...