Jingdong Wang

Orcid: 0000-0002-4888-4445

Affiliations:

Baidu, AI Group, Sunnyvale, CA, USA
Microsoft Research Asia, Beijing, China (former)
Hong Kong University of Science and Technology, Hong Kong (PhD 2007)
Tsinghua University, Department of Automation, Beijing, China (1997 - 2004)

According to our database¹, Jingdong Wang authored at least 397 papers between 2003 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

Towards Lightweight Super-Resolution With Dual Regression Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

CSDG-FAS: Closed-Space Domain Generalization for Face Anti-spoofing.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., November, 2024

A Survey of Multimodal Controllable Diffusion Models.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., May, 2024

Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., February, 2024

Recent advances in artificial intelligence generated content.

[BibT_eX]

[DOI]

Frontiers Inf. Technol. Electron. Eng., January, 2024

Context Autoencoder for Self-supervised Representation Learning.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., January, 2024

MaskOCR: Scene Text Recognition with Masked Vision-Language Pre-training.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts.

[BibT_eX]

[DOI]

CoRR, 2024

Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing.

[BibT_eX]

[DOI]

CoRR, 2024

Improving Multi-modal Large Language Model through Boosting Vision Capabilities.

[BibT_eX]

[DOI]

CoRR, 2024

MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction.

[BibT_eX]

[DOI]

CoRR, 2024

Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation.

[BibT_eX]

[DOI]

CoRR, 2024

Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery.

[BibT_eX]

[DOI]

CoRR, 2024

MonoFormer: One Transformer for Both Diffusion and Autoregression.

[BibT_eX]

[DOI]

CoRR, 2024

Learning Multiple Probabilistic Decisions from Latent World Model in Autonomous Driving.

[BibT_eX]

[DOI]

CoRR, 2024

FullAnno: A Data Engine for Enhancing Image Comprehension of MLLMs.

[BibT_eX]

[DOI]

CoRR, 2024

SpotActor: Training-Free Layout-Controlled Consistent Image Generation.

[BibT_eX]

[DOI]

CoRR, 2024

EasyChauffeur: A Baseline Advancing Simplicity and Efficiency on Waymax.

[BibT_eX]

[DOI]

CoRR, 2024

Disentangled Noisy Correspondence Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Add-SD: Rational Generation without Manual Reference.

[BibT_eX]

[DOI]

CoRR, 2024

LION: Linear Group RNN for 3D Object Detection in Point Clouds.

[BibT_eX]

[DOI]

CoRR, 2024

Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2024

Surfel-based Gaussian Inverse Rendering for Fast and Relightable Dynamic Human Reconstruction from Monocular Video.

[BibT_eX]

[DOI]

CoRR, 2024

OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer.

[BibT_eX]

[DOI]

CoRR, 2024

Evaluation of Text-to-Video Generation Models: A Dynamics Perspective.

[BibT_eX]

[DOI]

CoRR, 2024

XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis.

[BibT_eX]

[DOI]

CoRR, 2024

VDG: Vision-Only Dynamic Gaussian for Driving Simulation.

[BibT_eX]

[DOI]

CoRR, 2024

Assessing Model Generalization in Vicinity.

[BibT_eX]

[DOI]

CoRR, 2024

Skim then Focus: Integrating Contextual and Fine-grained Views for Repetitive Action Counting.

[BibT_eX]

[DOI]

CoRR, 2024

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation.

[BibT_eX]

[DOI]

CoRR, 2024

LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection.

[BibT_eX]

[DOI]

CoRR, 2024

OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond.

[BibT_eX]

[DOI]

CoRR, 2024

Are Image Distributions Indistinguishable to Humans Indistinguishable to Classifiers?

[BibT_eX]

[DOI]

CoRR, 2024

Dense Connector for MLLMs.

[BibT_eX]

[DOI]

CoRR, 2024

Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation.

[BibT_eX]

[DOI]

CoRR, 2024

Automated Multi-level Preference for MLLMs.

[BibT_eX]

[DOI]

CoRR, 2024

Training-Free Unsupervised Prompt for Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

CLIP-GS: CLIP-Informed Gaussian Splatting for Real-time and View-consistent 3D Semantic Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

TryOn-Adapter: Efficient Fine-Grained Clothing Identity Adaptation for High-Fidelity Virtual Try-On.

[BibT_eX]

[DOI]

CoRR, 2024

DreamSalon: A Staged Diffusion Framework for Preserving Identity-Context in Editable Face Generation.

[BibT_eX]

[DOI]

CoRR, 2024

TexRO: Generating Delicate Textures of 3D Models by Recursive Optimization.

[BibT_eX]

[DOI]

CoRR, 2024

GVA: Reconstructing Vivid 3D Gaussian Avatars from Monocular Videos.

[BibT_eX]

[DOI]

CoRR, 2024

Collaborative Position Reasoning Network for Referring Image Segmentation.

[BibT_eX]

[DOI]

CoRR, 2024

M2-CLIP: A Multimodal, Multi-task Adapting Framework for Video Action Recognition.

[BibT_eX]

[DOI]

CoRR, 2024

TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024

RTG-SLAM: Real-time 3D Reconstruction at Scale using Gaussian Splatting.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

Generating Action-conditioned Prompts for Open-vocabulary Video Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Uni4DAL: A Unified Baseline for Multi-dataset 4D Auto-Labeling.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition - 27th International Conference, 2024

Mobile Attention: Mobile-Friendly Linear-Attention for Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Towards Unified Multi-granularity Text Detection with Interactive Attention.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

IRGen: Generative Modeling for Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Interactive 3D Object Detection with Prompts.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Make Your ViT-Based Multi-view 3D Detectors Faster via Token Compression.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Timestep-Aware Correction for Quantized Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

SEED: A Simple and Effective 3D DETR in Point Clouds.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

GGRt: Towards Pose-Free Generalizable 3D Gaussian Splatting in Real-Time.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

OPEN: Object-Wise Position Embedding for Multi-view 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

ReSyncer: Rewiring Style-Based Generator for Unified Audio-Visually Synced Facial Performer.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

MS-DETR: Efficient DETR Training with Mixed Supervision.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Decoupled Pseudo-Labeling for Semi-Supervised Monocular 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-Based Roadside 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VRP-SAM: SAM with Visual Reference Prompt.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Multi-Domain Incremental Learning for Face Presentation Attack Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

A Multimodal, Multi-Task Adapting Framework for Video Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-Form Layout-to-Image Generation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Guest editorial: special issue on human pose estimation and its applications.

[BibT_eX]

[DOI]

Wei Tang

Zhou Ren

Jingdong Wang

Mach. Vis. Appl., November, 2023

Structured Knowledge Distillation for Dense Prediction.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Understanding Self-Supervised Pretraining with Part-Aware Representation Learning.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

CAE v2: Context Autoencoder with CLIP Latent Alignment.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., 2023

A Survey of Reasoning with Foundation Models.

[BibT_eX]

[DOI]

CoRR, 2023

GIR: 3D Gaussian Inverse Rendering for Relightable Scene Factorization.

[BibT_eX]

[DOI]

CoRR, 2023

Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future.

[BibT_eX]

[DOI]

CoRR, 2023

Layered Rendering Diffusion Model for Zero-Shot Guided Image Synthesis.

[BibT_eX]

[DOI]

CoRR, 2023

GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?

[BibT_eX]

[DOI]

CoRR, 2023

GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding.

[BibT_eX]

[DOI]

CoRR, 2023

Disentangled Representation Learning with Transmitted Information Bottleneck.

[BibT_eX]

[DOI]

CoRR, 2023

Accelerating Vision Transformers Based on Heterogeneous Attention Patterns.

[BibT_eX]

[DOI]

CoRR, 2023

PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement.

[BibT_eX]

[DOI]

CoRR, 2023

Unified Frequency-Assisted Transformer Framework for Detecting and Grounding Multi-Modal Manipulation.

[BibT_eX]

[DOI]

CoRR, 2023

VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation.

[BibT_eX]

[DOI]

CoRR, 2023

Learning Implicit Entity-object Relations by Bidirectional Generative Alignment for Multimodal NER.

[BibT_eX]

[DOI]

CoRR, 2023

Multimodal Adaptation of CLIP for Few-Shot Action Recognition.

[BibT_eX]

[DOI]

CoRR, 2023

Enhancing Your Trained DETRs with Box Refinement.

[BibT_eX]

[DOI]

CoRR, 2023

Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation.

[BibT_eX]

[DOI]

CoRR, 2023

Vision Transformer with Attention Map Hallucination and FFN Compaction.

[BibT_eX]

[DOI]

CoRR, 2023

Rethinking the Open-Loop Evaluation of End-to-End Autonomous Driving in nuScenes.

[BibT_eX]

[DOI]

CoRR, 2023

Multi-Modal 3D Object Detection by Box Matching.

[BibT_eX]

[DOI]

CoRR, 2023

Exploring Effective Factors for Improving Visual In-Context Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Task-Oriented Multi-Modal Mutual Leaning for Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every Detection Box.

[BibT_eX]

[DOI]

CoRR, 2023

Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation.

[BibT_eX]

[DOI]

CoRR, 2023

Efficient Video Portrait Reenactment via Grid-based Codebook.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGGRAPH 2023 Conference Proceedings, 2023

HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DAC-DETR: Divide the Attention Layers and Conquer.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Graph Contrastive Learning for Skeleton-based Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images.

[BibT_eX]

[DOI]

Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Boosting Few-shot Action Recognition with Graph-guided Hybrid Matching.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

What Can Simple Arithmetic Operations Do for Temporal Modeling?

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Unified Pre-training with Pseudo Texts for Text-To-Image Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Task-Oriented Multi-Modal Mutual Learning for Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CPCM: Contextual Point Cloud Modeling for Weakly-supervised Point Cloud Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CFCG: Semi-Supervised Semantic Segmentation via Cross-Fusion and Contour Guidance Supervision.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Forward Flow for Novel View Synthesis of Dynamic Scenes.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

UATVR: Uncertainty-Adaptive Text-Video Retrieval.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

σ-Adaptive Decoupled Prototype for Few-Shot Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Augmentation Matters: A Simple-Yet-Effective Approach to Semi-Supervised Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Instance-Specific and Model-Adaptive Supervision for Semi-Supervised Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Semi-DETR: Semi-Supervised Object Detection with Detection Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CAPE: Camera View Position Embedding for Multi-View 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PSVT: End-to-End Multi-Person 3D Pose and Shape Estimation with Progressive Video Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Beyond Attentive Tokens: Incorporating Token Importance and Diversity for Efficient Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Cyclically Disentangled Feature Translation for Face Anti-spoofing.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Robust Video Portrait Reenactment via Personalized Representation Quantization.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Distillation-Guided Residual Learning for Binary Convolutional Neural Networks.

[BibT_eX]

[DOI]

Jianming Ye

Jingdong Wang

Shiliang Zhang

IEEE Trans. Neural Networks Learn. Syst., 2022

Guest Editorial: Introduction to the Special Section on Fine-Grained Visual Categorization.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Few-Shot Image and Sentence Matching via Aligned Cross-Modal Memory.

[BibT_eX]

[DOI]

Yan Huang

Jingdong Wang

Liang Wang

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition.

[BibT_eX]

[DOI]

CoRR, 2022

CAE v2: Context Autoencoder with CLIP Target.

[BibT_eX]

[DOI]

CoRR, 2022

Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining.

[BibT_eX]

[DOI]

CoRR, 2022

It Takes Two: Masked Appearance-Motion Modeling for Self-supervised Video Transformer Pre-training.

[BibT_eX]

[DOI]

CoRR, 2022

NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields.

[BibT_eX]

[DOI]

CoRR, 2022

TRUST: An Accurate and End-to-End Table structure Recognizer Using Splitting-based Transformers.

[BibT_eX]

[DOI]

CoRR, 2022

Group DETR: Fast Training Convergence with Decoupled One-to-Many Label Assignment.

[BibT_eX]

[DOI]

CoRR, 2022

Detecting Deepfake by Creating Spatio-Temporal Regularity Disruption.

[BibT_eX]

[DOI]

CoRR, 2022

Conditional DETR V2: Efficient Detection Transformer with Box Queries.

[BibT_eX]

[DOI]

CoRR, 2022

MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining.

[BibT_eX]

[DOI]

CoRR, 2022

Efficient Video Segmentation Models with Per-frame Inference.

[BibT_eX]

[DOI]

CoRR, 2022

Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers.

[BibT_eX]

[DOI]

Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022

Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Delving into Sequential Patches for Deepfake Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Paint and Distill: Boosting 3D Object Detection with Semantic Passing Network.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Self-Guided Hard Negative Generation for Unsupervised Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

On the Connection between Local Attention and Dynamic Depth-wise Convolution.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Learning Versatile Neural Architectures by Propagating Network Codes.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Fatigue Life Evaluation of Rubber Tyred Gantry Crane based on Minner Criterion.

[BibT_eX]

[DOI]

Proceedings of the 2nd International Conference on Control and Intelligent Robotics, 2022

StyleSwap: Style-Based Generator Empowers Robust Face Swapping.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

UFO: Unified Feature Optimization.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Diverse Learner: Exploring Diverse Supervision for Semi-supervised Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

GitNet: Geometric Prior-Based Transformation for Birds-Eye-View Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

DaViT: Dual Attention Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision, 2022

Action Quality Assessment with Temporal Parsing Transformer.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Human-Object Interaction Detection via Disentangled Transformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Implicit Sample Extension for Unsupervised Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Few-Shot Font Generation by Learning Fine-Grained Local Styles.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Few-Shot Head Swapping in the Wild.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Expressive Talking Head Generation with Granular Audio-Visual Control.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MixFormer: Mixing Features across Windows and Dimensions.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Learning to Segment Video Object With Accurate Boundaries.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2021

Group Reidentification with Multigrained Matching and Integration.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2021

Deep High-Resolution Representation Learning for Visual Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Content-aware convolutional neural networks.

[BibT_eX]

[DOI]

Neural Networks, 2021

OCNet: Object Context for Semantic Segmentation.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2021

SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor Search.

[BibT_eX]

[DOI]

CoRR, 2021

HRFormer: High-Resolution Transformer for Dense Prediction.

[BibT_eX]

[DOI]

CoRR, 2021

Realistic Image Synthesis with Configurable 3D Scene Layouts.

[BibT_eX]

[DOI]

CoRR, 2021

Cross-Modal Attention Consistency for Video-Audio Unsupervised Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Demystifying Local Vision Transformer: Sparse Connectivity, Weight Sharing, and Dynamic Weight.

[BibT_eX]

[DOI]

CoRR, 2021

Whole brain segmentation with full volume neural network.

[BibT_eX]

[DOI]

Comput. Medical Imaging Graph., 2021

HRFormer: High-Resolution Vision Transformer for Dense Predict.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Results of the NeurIPS'21 Challenge on Billion-Scale Approximate Nearest Neighbor Search.

[BibT_eX]

[DOI]

Harsha Vardhan Simhadri

Ravishankar Krishnaswamy

Gopal Srinivasa

Suhas Jayaram Subramanya

Jingdong Wang

Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021

SPANN: Highly-efficient Billion-scale Approximate Nearest Neighborhood Search.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Hybrid Network Compression via Meta-Learning.

[BibT_eX]

[DOI]

Jianming Ye

Shiliang Zhang

Jingdong Wang

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Admix: Enhancing the Transferability of Adversarial Attacks.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Conditional DETR for Fast Training Convergence.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Lite-HRNet: A Lightweight High-Resolution Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Bottom-Up Human Pose Estimation via Disentangled Keypoint Regression.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Semi-Supervised Semantic Segmentation With Cross Pseudo Supervision.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Boosting Adversarial Transferability through Enhanced Momentum.

[BibT_eX]

[DOI]

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

Parameter Distribution Balanced CNNs.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2020

Guest Editorial Multimedia Computing With Interpretable Machine Learning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2020

Semantic Image Segmentation by Scale-Adaptive Networks.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Improving Person Re-Identification With Iterative Impression Aggregation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Object Detection in Videos by High Quality Object Linking.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2020

S4Net: Single stage salient-instance segmentation.

[BibT_eX]

[DOI]

Comput. Vis. Media, 2020

Bottom-Up Human Pose Estimation by Ranking Heatmap-Guided Adaptive Keypoint Estimates.

[BibT_eX]

[DOI]

CoRR, 2020

Informative Dropout for Robust Representation Learning: A Shape-bias Perspective.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

SegFix: Model-Agnostic Boundary Refinement for Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Object-Contextual Representations for Semantic Segmentation.

[BibT_eX]

[DOI]

Yuhui Yuan

Xilin Chen

Jingdong Wang

Proceedings of the Computer Vision - ECCV 2020, 2020

Point-Set Anchors for Object Detection, Instance Segmentation and Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Efficient Semantic Video Segmentation with Per-Frame Inference.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Weakly-Supervised Action Localization by Generative Attention Modeling.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Closed-Loop Matters: Dual Regression Networks for Single Image Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Balanced Decoupled Spatial Convolution for CNNs.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2019

Learning Attentional Recurrent Neural Network for Visual Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2019

Automatic Ensemble Diffusion for 3D Shape and Image Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2019

A Bilinear Ranking SVM for Knowledge Based Relation Prediction and Classification.

[BibT_eX]

[DOI]

IEEE Trans. Big Data, 2019

Composite Quantization.

[BibT_eX]

[DOI]

Jingdong Wang

Ting Zhang

IEEE Trans. Pattern Anal. Mach. Intell., 2019

Ordinal Constraint Binary Coding for Approximate Nearest Neighbor Search.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2019

Joint salient object detection and existence prediction.

[BibT_eX]

[DOI]

Frontiers Comput. Sci., 2019

Bottom-up Higher-Resolution Networks for Multi-Person Pose Estimation.

[BibT_eX]

[DOI]

CoRR, 2019

Interlaced Sparse Self-Attention for Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2019

MMDetection: Open MMLab Detection Toolbox and Benchmark.

[BibT_eX]

[DOI]

CoRR, 2019

Beyond Intra-modality Discrepancy: A Comprehensive Survey of Heterogeneous Person Re-identification.

[BibT_eX]

[DOI]

CoRR, 2019

Group Re-Identification with Multi-grained Matching and Integration.

[BibT_eX]

[DOI]

CoRR, 2019

High-Resolution Representations for Labeling Pixels and Regions.

[BibT_eX]

[DOI]

CoRR, 2019

Disparity-preserved Deep Cross-platform Association for Cross-platform Video Recommendation.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Cross View Fusion for 3D Human Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Global-Local Temporal Representations for Video Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Structured Knowledge Distillation for Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Deep High-Resolution Representation Learning for Human Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Face Alignment With Deep Regression.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2018

A Survey on Learning to Hash.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2018

Multi-Dimensional Sparse Models.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2018

Multiview Cross-Media Hashing with Semantic Consistency.

[BibT_eX]

[DOI]

IEEE Multim., 2018

Accelerating Deep Neural Networks with Spatial Bottleneck Modules.

[BibT_eX]

[DOI]

CoRR, 2018

OCNet: Object Context Network for Scene Parsing.

[BibT_eX]

[DOI]

Yuhui Yuan

Jingdong Wang

CoRR, 2018

IGCV2: Interleaved Structured Sparse Convolutional Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2018

Object Detection in Videos by Short and Long Range Object Linking.

[BibT_eX]

[DOI]

CoRR, 2018

On the Large-Scale Transferability of Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Trends and Applications in Knowledge Discovery and Data Mining, 2018

Weakly Supervised Dense Event Captioning in Videos.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Group Re-Identification: Leveraging and Integrating Multi-Grain Information.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Deep Triplet Quantization.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Deep Convolutional Neural Networks with Merge-and-Run Mappings.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Rethinking ReLU to Train Better CNNs.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Pattern Recognition, 2018

Feature Incay for Representation Regularization.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

Part-Aligned Bilinear Representations for Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Interleaved Structured Sparse Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Global Versus Localized Generative Adversarial Nets.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Weakly-Supervised Semantic Segmentation Network With Deep Seeded Region Growing.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

IGCV3: Interleaved Low-Rank Group Convolutions for Efficient Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference 2018, 2018

Decoupled Convolutions for CNNs.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Learning Correspondence Structures for Person Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2017

Multi-Timescale Collaborative Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2017

Special issue on intelligent urban computing with big data.

[BibT_eX]

[DOI]

Mach. Vis. Appl., 2017

Towards Reversal-Invariant Image Representation.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2017

Salient Object Detection: A Discriminative Regional Feature Integration Approach.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2017

Exemplar-Guided Similarity Learning on Polynomial Kernel Feature Map for Person Re-identification.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2017

Training Better CNNs Requires to Rethink ReLU.

[BibT_eX]

[DOI]

CoRR, 2017

Interleaved Group Convolutions for Deep Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2017

Orthogonal and Idempotent Transformations for Learning Deep Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2017

Finding the Secret of CNN Parameter Layout under Strict Size Constraint.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Mixture Factorized Ornstein-Uhlenbeck Processes for Time-Series Forecasting.

[BibT_eX]

[DOI]

Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

Random Shifting for CNN: a Solution to Reduce Information Loss in Down-Sampling Layers.

[BibT_eX]

[DOI]

Gangming Zhao

Jingdong Wang

Zhaoxiang Zhang

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Deeply-Learned Part-Aligned Representations for Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Interleaved Group Convolutions.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Human Pose Estimation Using Global and Local Normalization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Ensemble Diffusion for Retrieval.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

2016

Generalized Deep Transfer Networks for Knowledge Propagation in Heterogeneous Domains.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2016

Dual Low-Rank Pursuit: Learning Salient Features for Saliency Detection.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2016

A Distance-Computation-Free Search Scheme for Binary Code Databases.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2016

Weakly Supervised Metric Learning for Traffic Sign Recognition in a LIDAR-Equipped Vehicle.

[BibT_eX]

[DOI]

IEEE Trans. Intell. Transp. Syst., 2016

A Diffusion and Clustering-Based Approach for Finding Coherent Motions and Understanding Crowd Scenes.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2016

Joint Multilabel Classification With Community-Aware Label Graph Learning.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2016

DeepSaliency: Multi-Task Deep Neural Network Model for Salient Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2016

Guest Editorial: Big Media Data: Understanding, Search, and Mining.

[BibT_eX]

[DOI]

IEEE Trans. Big Data, 2016

Incorporating visual adjectives for image classification.

[BibT_eX]

[DOI]

Neurocomputing, 2016

Accurate Image Search with Multi-Scale Contextual Evidences.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2016

Detection of Co-salient Objects by Looking Deep and Wide.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2016

Good Practice in CNN Feature Transfer.

[BibT_eX]

[DOI]

CoRR, 2016

On the Connection of Deep Fusion to Ensembling.

[BibT_eX]

[DOI]

CoRR, 2016

Deeply-Fused Nets.

[BibT_eX]

[DOI]

CoRR, 2016

Self-Paced Cross-Modal Subspace Matching.

[BibT_eX]

[DOI]

Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Fast Nearest Neighbor Search in the Hamming Space.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Binary Optimized Hashing.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

MARS: A Video Benchmark for Large-Scale Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2016, 2016

Geometric Neural Phrase Pooling: Modeling the Spatial Co-occurrence of Neurons.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2016, 2016

Collaborative Quantization for Cross-Modal Similarity Search.

[BibT_eX]

[DOI]

Ting Zhang

Jingdong Wang

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

InterActive: Inter-Layer Activeness Propagation.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

DisturbLabel: Regularizing CNN on the Loss Layer.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Supervised Quantization for Similarity Search.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

Fine-Grained Image Search.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2015

Optimized Cartesian K-Means.

[BibT_eX]

[DOI]

IEEE Trans. Knowl. Data Eng., 2015

Exploratory Product Image Search With Circle-to-Search Interaction.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2015

Guest Editorial: Big Media Data: Understanding, Search, and Mining (Part 2).

[BibT_eX]

[DOI]

IEEE Trans. Big Data, 2015

Guest Editorial: Ad Hoc Web Multimedia Analysis with Limited Supervision.

[BibT_eX]

[DOI]

Yahong Han

Yi Yang

Jingdong Wang

Multim. Tools Appl., 2015

Group $K$-Means.

[BibT_eX]

[DOI]

CoRR, 2015

Deep kinship verification.

[BibT_eX]

[DOI]

Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

Weakly-Shared Deep Transfer Networks for Heterogeneous-Domain Knowledge Propagation.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Quantized Correlation Hashing for Fast Cross-Modal Search.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Scalable Person Re-identification: A Benchmark.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

RIDE: Reversal Invariant Descriptor Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Person Re-Identification with Correspondence Structure Learning.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Sparse composite quantization.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Co-saliency detection via looking deep and wide.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Similarity learning on an explicit polynomial kernel feature map for person re-identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Fast Neighborhood Graph Search Using Cartesian Concatenation.

[BibT_eX]

[DOI]

Proceedings of the Multimedia Data Mining and Analytics - Disruptive Innovation, 2015

Fast Approximate K-Means via Cluster Closures.

[BibT_eX]

[DOI]

Proceedings of the Multimedia Data Mining and Analytics - Disruptive Innovation, 2015

2014

Personalized Video Recommendation through Graph Propagation.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2014

Browse-to-Search: Interactive Exploratory Search with Visual Entities.

[BibT_eX]

[DOI]

ACM Trans. Inf. Syst., 2014

Regularized Tree Partitioning and Its Application to Unsupervised Image Segmentation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Trinary-Projection Trees for Approximate Nearest Neighbor Search.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2014

Image tag refinement by regularized latent Dirichlet allocation.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2014

Low-rank SIFT: An Affine Invariant Feature for Place Recognition.

[BibT_eX]

[DOI]

CoRR, 2014

Hashing for Similarity Search: A Survey.

[BibT_eX]

[DOI]

CoRR, 2014

Deep Regression for Face Alignment.

[BibT_eX]

[DOI]

CoRR, 2014

Salient Object Detection: A Discriminative Regional Feature Integration Approach.

[BibT_eX]

[DOI]

CoRR, 2014

Inner Product Similarity Search using Compositional Codes.

[BibT_eX]

[DOI]

Chao Du

Jingdong Wang

CoRR, 2014

Transductive 3D Shape Segmentation using Sparse Reconstruction.

[BibT_eX]

[DOI]

Comput. Graph. Forum, 2014

Optimized Distances for Binary Code Ranking.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Composite Quantization for Approximate Nearest Neighbor Search.

[BibT_eX]

[DOI]

Ting Zhang

Chao Du

Jingdong Wang

Proceedings of the 31th International Conference on Machine Learning, 2014

Low-rank SIFT: An affine invariant feature for place recognition.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Finding Coherent Motions and Semantic Regions in Crowd Scenes: A Diffusion and Clustering Approach.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2014, 2014

Orientational Pyramid Matching for Recognizing Indoor Scenes.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

How Fashion Talks: Clothing-Region-Based Gender Recognition.

[BibT_eX]

[DOI]

Shengnan Cai

Jingdong Wang

Long Quan

Proceedings of the Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, 2014

2013

Interactive Multimodal Visual Search on Mobile Device.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2013

Structure-Sensitive Superpixels via Geodesic Distance.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2013

Hybrid Affinity Propagation.

[BibT_eX]

[DOI]

CoRR, 2013

Scalable $k$-NN graph construction.

[BibT_eX]

[DOI]

CoRR, 2013

Order preserving hashing for approximate nearest neighbor search.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

Image search by graph-based label propagation with image representation from DNN.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

Clickage: towards bridging semantic and intent gaps via mining click logs of search engines.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

Fixed-Point Model For Structured Labeling.

[BibT_eX]

[DOI]

Proceedings of the 30th International Conference on Machine Learning, 2013

Two dimensional synthesis sparse model.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Two dimensional analysis sparse model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2013

Learning CRFs for Image Parsing with Adaptive Subgradient Descent.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2013

Fast Neighborhood Graph Search Using Cartesian Concatenation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2013

Online Robust Non-negative Dictionary Learning for Visual Tracking.

[BibT_eX]

[DOI]

Naiyan Wang

Jingdong Wang

Dit-Yan Yeung

Proceedings of the IEEE International Conference on Computer Vision, 2013

Supervised Kernel Descriptors for Visual Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Salient Object Detection: A Discriminative Regional Feature Integration Approach.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012

An interactive approach to semantic modeling of indoor scenes with an RGBD camera.

[BibT_eX]

[DOI]

ACM Trans. Graph., 2012

Correction to "Bayesian Visual Reranking".

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2012

Recommending Flickr groups with social topic model.

[BibT_eX]

[DOI]

Inf. Retr., 2012

Color filter for image search.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Scalable similar image search by joint indices.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Query-driven iterated neighborhood graph search for large scale indexing.

[BibT_eX]

[DOI]

Jingdong Wang

Shipeng Li

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Similar image search with a tiny bag-of-delegates representation.

[BibT_eX]

[DOI]

Weiwen Tu

Rong Pan

Jingdong Wang

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Browse-to-search.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Personalized video recommendation through tripartite graph propagation.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Contextual Dominant Color Name Extraction for Web Image Search.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012

A Probabilistic Approach to Robust Matrix Factorization.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2012, 2012

Scalable k-NN graph construction for visual descriptors.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Salient object detection for searched web images via global saliency.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Fast approximate k-means via cluster closures.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Image search results refinement via outlier detection using deep contexts.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011

Bayesian Visual Reranking.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2011

Interactive Image Search by Color Map.

[BibT_eX]

[DOI]

Jingdong Wang

Xian-Sheng Hua

ACM Trans. Intell. Syst. Technol., 2011

A transductive multi-label learning approach for video concept detection.

[BibT_eX]

[DOI]

Pattern Recognit., 2011

Learning to Detect a Salient Object.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2011

Interactive browsing via diversified visual summarization for image search results.

[BibT_eX]

[DOI]

Jingdong Wang

Liyan Jia

Xian-Sheng Hua

Multim. Syst., 2011

Discriminative Sketch-based 3D Model Retrieval via Robust Shape Matching.

[BibT_eX]

[DOI]

Comput. Graph. Forum, 2011

Document clustering with universum.

[BibT_eX]

[DOI]

Dan Zhang

Jingdong Wang

Luo Si

Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Hybrid image summarization.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

JIGSAW: interactive mobile visual search with multimodal queries.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Web-scale image search by color sketch.

[BibT_eX]

[DOI]

Jingdong Wang

Xian-Sheng Hua

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Robust visual reranking via sparsity and ranking constraints.

[BibT_eX]

[DOI]

Nobuyuki Morioka

Jingdong Wang

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Contextual image search.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Complementary hashing for approximate nearest neighbor search.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2011

Multi-task low-rank affinity pursuit for image segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2011

A non-convex relaxation approach to sparse dictionary learning.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Automatic salient object segmentation based on context and shape prior.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference, 2011

2010

Interactive image search by 2D semantic map.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on World Wide Web, 2010

Image search by concept map.

[BibT_eX]

[DOI]

Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Dynamic Video Collage.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Modeling, 2010

Learning to combine multi-resolution spatially-weighted co-occurrence matrices for image representation.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Optimizing kd-trees for scalable visual descriptor indexing.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009

Picture Collage.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2009

Linear Neighborhood Propagation and Its Applications.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2009

Graph-based semi-supervised learning with multiple labels.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2009

Tag refinement by regularized LDA.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Summarizing tagged image collections by cross-media representativeness voting.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

2008

MSRA atT TRECVID 2008: High-Level Feature Extraction and Automatic Search.

[BibT_eX]

[DOI]

Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

Semi-Supervised Classification with Universum.

[BibT_eX]

[DOI]

Proceedings of the SIAM International Conference on Data Mining, 2008

Bayesian video search reranking.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Multimedia 2008, 2008

Finding image exemplars using fast sparse affinity propagation.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Multimedia 2008, 2008

Transductive multi-label learning for video concept detection.

[BibT_eX]

[DOI]

Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008

Graph-based semi-supervised learning with multi-label.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Optimized video scene segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Transductive video annotation via local learnable kernel classifier.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Augmented tree partitioning for interactive image segmentation.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2008

Maximum Margin Clustering with Pairwise Constraints.

[BibT_eX]

[DOI]

Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Joint multi-label multi-instance learning for image classification.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Normalized tree partitioning for image segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007

Image-based tree modeling.

[BibT_eX]

[DOI]

ACM Trans. Graph., 2007

Face recognition using spectral features.

[BibT_eX]

[DOI]

Pattern Recognit., 2007

Image-Based Modeling by Joint Segmentation.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2007

Joint Affinity Propagation for Multiple View Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

2006

Image-based plant modeling.

[BibT_eX]

[DOI]

ACM Trans. Graph., 2006

Semi-Supervised Classification Using Linear Neighborhood Propagation.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Picture Collage.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

2005

Visual object recognition using probabilistic kernel subspace similarity.

[BibT_eX]

[DOI]

Pattern Recognit., 2005

2004

Probabilistic tangent subspace: a unified view.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 2004

Multi-view EM algorithm and its application to color image segmentation.

[BibT_eX]

[DOI]

Xing Yi

Changshui Zhang

Jingdong Wang

Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

2003

Kernel GMM and its application to image binarization.

[BibT_eX]

[DOI]

Jingdong Wang

Jianguo Lee

Changshui Zhang

Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Color Image Segmentation: Kernel Do the Feature Space.

[BibT_eX]

[DOI]

Jianguo Lee

Jingdong Wang

Changshui Zhang

Proceedings of the Machine Learning: ECML 2003, 2003

Kernel Trick Embedded Gaussian Mixture Model.

[BibT_eX]

[DOI]

Jingdong Wang

Jianguo Lee

Changshui Zhang

Proceedings of the Algorithmic Learning Theory, 14th International Conference, 2003

Jingdong Wang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...