Zhiding Yu

Orcid: 0000-0003-1776-996X

According to our database, Zhiding Yu authored at least 127 papers between 2009 and 2024.


Bibliography

2024
Prismer: A Vision-Language Model with Multi-Task Experts.
Trans. Mach. Learn. Res., 2024

Neural Eulerian Scene Flow Fields.
CoRR, 2024

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders.
CoRR, 2024

Exploring Camera Encoder Designs for Autonomous Driving Perception.
CoRR, 2024

Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation.
CoRR, 2024

X-VILA: Cross-Modality Alignment for Large Language Model.
CoRR, 2024

Memorize What Matters: Emergent Scene Decomposition from Multitraverse.
CoRR, 2024

OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning.
CoRR, 2024

T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching.
CoRR, 2024

Differentially Private Video Activity Recognition.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

SF3D: SlowFast Temporal 3D Object Detection.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2024

A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties.
Proceedings of the Computer Vision - ECCV 2024, 2024

LITA: Language Instructed Temporal-Localization Assistant.
Proceedings of the Computer Vision - ECCV 2024, 2024

Improving Distant 3D Object Detection Using 2D Box Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

What is Point Supervision Worth in Video Instance Segmentation?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Real-Time Radiance Fields for Single-Image Portrait View Synthesis.
ACM Trans. Graph., August, 2023

Partial Convolution for Padding, Inpainting, and Image Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Delving Deeper into Anti-Aliasing in ConvNets.
Int. J. Comput. Vis., 2023

Taxonomy of Machine Learning Safety: A Survey and Primer.
ACM Comput. Surv., 2023

A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties.
CoRR, 2023

FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation.
CoRR, 2023

SSCBench: A Large-Scale 3D Semantic Scene Completion Benchmark for Autonomous Driving.
CoRR, 2023

Prismer: A Vision-Language Model with An Ensemble of Experts.
CoRR, 2023

Learning Calibrated Uncertainties for Domain Shift: A Distributionally Robust Learning Approach.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

A Critical Revisit of Adversarial Robustness in 3D Point Cloud Recognition with Diffusion-Driven Purification.
Proceedings of the International Conference on Machine Learning, 2023

Fully Attentional Networks with Self-emerging Token Labeling.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FB-BEV: BEV Representation from Forward-Backward View Transformations.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

End-to-end 3D Tracking with Decoupled Queries.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FocalFormer3D: Focusing on Hard Instance for 3D Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

VoxFormer: Sparse Voxel Transformer for Camera-Based 3D Semantic Scene Completion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Vision Transformers are Good Mask Auto-Labelers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Correction to: Learning Contrastive Representation for Semantic Correspondence.
Int. J. Comput. Vis., 2022

Learning Contrastive Representation for Semantic Correspondence.
Int. J. Comput. Vis., 2022

1st Place Solution of The Robust Vision Challenge (RVC) 2022 Semantic Segmentation Track.
CoRR, 2022

PointDP: Diffusion-driven Purification against Adversarial Attacks on 3D Point Cloud Recognition.
CoRR, 2022

M²BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation.
CoRR, 2022

Benchmarking Robustness of 3D Point Cloud Recognition Against Common Corruptions.
CoRR, 2022

Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Understanding The Robustness in Vision Transformers.
Proceedings of the International Conference on Machine Learning, 2022

RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

FreeSOLO: Learning to Segment Objects without Annotations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

How Much More Data Do I Need? Estimating Requirements for Downstream Tasks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Not All Labels Are Equal: Rationalizing The Labeling Costs for Training Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Domain Stylization: A Fast Covariance Matching Framework Towards Domain Adaptation.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Panoptic SegFormer.
CoRR, 2021

Towards Reducing Labeling Cost in Deep Object Detection.
CoRR, 2021

Practical Machine Learning Safety: A Survey and Primer.
CoRR, 2021

Coupled Segmentation and Edge Learning via Dynamic Graph Propagation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

AugMax: Adversarial Composition of Random Augmentations for Robust Training.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Adversarially Robust 3D Point Cloud Recognition Using Self-Supervisions.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Unsupervised Controllable Generation with Self-Training.
Proceedings of the International Joint Conference on Neural Networks, 2021

SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies.
Proceedings of the 38th International Conference on Machine Learning, 2021

Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection.
Proceedings of the 38th International Conference on Machine Learning, 2021

Contrastive Syn-to-Real Generalization.
Proceedings of the 9th International Conference on Learning Representations, 2021

DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Uncertainty-aware multi-view co-training for semi-supervised medical image segmentation and domain adaptation.
Medical Image Anal., 2020

UFO²: A Unified Framework towards Omni-supervised Object Detection.
CoRR, 2020

Distributionally Robust Learning for Unsupervised Domain Adaptation.
CoRR, 2020

Transposer: Universal Texture Synthesis Using Feature Maps as Transposed Convolution Filter.
CoRR, 2020

Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Neural Networks with Recurrent Generative Feedback.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Automated Synthetic-to-Real Generalization.
Proceedings of the 37th International Conference on Machine Learning, 2020

Angular Visual Hardness.
Proceedings of the 37th International Conference on Machine Learning, 2020

Joint Disentangling and Adaptation for Cross-Domain Person Re-Identification.
Proceedings of the Computer Vision - ECCV 2020, 2020

UFO²: A Unified Framework Towards Omni-supervised Object Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Regularizing Neural Networks via Minimizing Hyperspherical Energy.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Delving Deeper into Anti-aliasing in ConvNets.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

2019
Compressive Hyperspherical Energy Minimization.
CoRR, 2019

Confidence Regularized Self-Training.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Joint Discriminative and Generative Learning for Person Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Partial Convolution based Padding.
CoRR, 2018

Domain Adaptation for Semantic Segmentation via Class-Balanced Self-Training.
CoRR, 2018

Learning towards Minimum Hyperspherical Energy.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Unsupervised Domain Adaptation for Semantic Segmentation via Class-Balanced Self-training.
Proceedings of the Computer Vision - ECCV 2018, 2018

Simultaneous Edge Alignment and Learning.
Proceedings of the Computer Vision - ECCV 2018, 2018

Learning Strict Identity Mappings in Deep Residual Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Decoupled Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Learning Structured and Deep Representations for Traffic Scene Understanding.
PhD thesis, 2017

Constructing the L2-Graph for Robust Subspace Learning and Subspace Clustering.
IEEE Trans. Cybern., 2017

Deep Hyperspherical Learning.
CoRR, 2017

CASENet: Deep Category-Aware Semantic Edge Detection.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

SphereFace: Deep Hypersphere Embedding for Face Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Semi-supervised subspace learning with L2graph.
Neurocomputing, 2016

Large-Margin Softmax Loss for Convolutional Neural Networks.
Proceedings of the 33rd International Conference on Machine Learning, 2016

Jointly Learning Non-negative Projection and Dictionary with Discriminative Graph Constraints for Classification.
Proceedings of the British Machine Vision Conference 2016, 2016

On Order-Constrained Transitive Distance Clustering.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
KCRC-LCD: Discriminative kernel collaborative representation with locality constrained dictionary for visual categorization.
Pattern Recognit., 2015

Jointly Learning Non-negative Projection and Dictionary with Discriminative Graph Constraints for Classification.
CoRR, 2015

Robust Elastic Net Regression.
CoRR, 2015

Structured Hough Voting for Vision-Based Highway Border Detection.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Locality constrained transitive distance clustering on speech data.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Generalized Transitive Distance with Minimum Spanning Random Forest.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Image based Static Facial Expression Recognition with Multiple Deep Network Learning.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Joint kernel dictionary and classifier learning for sparse coding via locality preserving K-SVD.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Multi-kernel collaborative representation for image classification.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Efficient autism spectrum disorder prediction with eye movement: A machine learning framework.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014
An iterative framework for unsupervised learning in the PLDA based speaker verification.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Robust rear-view ground surface detection with hidden state conditional random field and confidence propagation.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Transitive Distance Clustering with K-Means Duality.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Multi-Task Regularization with Covariance Dictionary for Linear Classifiers.
CoRR, 2013

Joint recognition / segmentation with cascaded multi-level feature classification and confidence propagation.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

2012
Texture optimization for seamless view synthesis through energy minimization.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Bag of textons for image segmentation via soft clustering and convex shift.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Automatic object segmentation from large scale 3D urban point clouds through manifold embedded mode seeking.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Towards robust and efficient segmentation: An approach based on inter-region contour and intra-region content analysis.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Compressing similar image sets using low frequency template.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

How anti-aliasing filter affects image contrast: An analysis from majorization theory perspective.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Adaptive depth map assisted matting in 3D video.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Data hiding in dot diffused halftone images.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Multi-scale analysis of color and texture for salient object detection.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Image Interpolation Using Autoregressive Model and Gauss-Seidel Optimization.
Proceedings of the Sixth International Conference on Image and Graphics, 2011

Nonparametric density estimation on a graph: Learning framework, fast approximation and application in image segmentation.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
An adaptive unsupervised approach toward pixel clustering and color image segmentation.
Pattern Recognit., 2010

Graph segmentation revisited: Detailed analysis and density learning based implementation.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

2009
Heuristic Search for Cluster Centroids: An Ant-Based Approach for FCM Initialization.
Proceedings of the Advances in Neural Networks, 2009

On ACO-Based Fuzzy Clustering for Image Segmentation.
Proceedings of the Advances in Neural Networks, 2009

Image Segmentation Based on Local Ant Colony Optimization.
Proceedings of the Fifth International Conference on Natural Computation, 2009

A modified fuzzy c-means algorithm with adaptive spatial information for color image segmentation.
Proceedings of the 2009 IEEE Symposium on Computational Intelligence for Image Processing, 2009

Noise-robust Binary segmentation based on Ant Colony System and Modified Fuzzy C-Means algorithm.
Proceedings of the IEEE Congress on Evolutionary Computation, 2009

