Xianzhi Du

According to our database, Xianzhi Du authored at least 45 papers between 2011 and 2024.

Bibliography

2024
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning.
CoRR, 2024

Apple Intelligence Foundation Language Models.
CoRR, 2024

Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training.
CoRR, 2024

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training.
CoRR, 2024

Empowering Unsupervised Domain Adaptation with Large-scale Pre-trained Vision-Language Models.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Ferret: Refer and Ground Anything Anywhere at Any Granularity.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MOFI: Learning Image Representations from Noisy Entity Annotated Images.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Compressing LLMs: The Truth is Rarely Pure and Never Simple.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Guiding Instruction-based Image Editing via Multimodal Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024


VeCLIP: Improving CLIP Training via Visual-Enriched Captions.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
From Scarcity to Efficiency: Improving CLIP Training via Visual-enriched Captions.
CoRR, 2023

Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts.
CoRR, 2023

MOFI: Learning Image Representations from Noisy Entity Annotated Images.
CoRR, 2023

Less is More: Removing Text-regions Improves CLIP Training Efficiency and Robustness.
CoRR, 2023

ISAA: Boost Repair Process by Constructing the Degree Constrained Optimal Repair Tree for Erasure-coded Systems.
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023

AdaMV-MoE: Adaptive Multi-Task Vision Mixture-of-Experts.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Optimizing Anchor-based Detectors for Autonomous Driving Scenes.
CoRR, 2022

Back Razor: Memory-Efficient Transfer Learning by Self-Sparsified Backpropagation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance.
Proceedings of the International Conference on Machine Learning, 2022

Auto-scaling Vision Transformers without Training.
Proceedings of the Tenth International Conference on Learning Representations, 2022

A Genetic Algorithm-based Construction of Fractional Repetition Codes.
Proceedings of the IEEE Global Communications Conference, 2022

A Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation.
CoRR, 2021

Towards a Unified Foundation Model: Jointly Pre-Training Transformers on Unpaired Images and Text.
CoRR, 2021

Revisiting 3D ResNets for Video Recognition.
CoRR, 2021

Simple Training Strategies and Model Scaling for Object Detection.
CoRR, 2021

Dilated SpineNet for Semantic Segmentation.
CoRR, 2021

Revisiting ResNets: Improved Training and Scaling Strategies.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Multitask Deep Neural Networks for Tele-Wide Stereo Matching.
IEEE Access, 2020

FBA-AMNET: Foreground-Background Aware Atrous Multiscale Networks for Stereo Disparity Estimation.
Proceedings of the 2020 IEEE International Conference on Consumer Electronics (ICCE), 2020

Efficient Scale-Permuted Backbone with Learned Resource Distribution.
Proceedings of the Computer Vision - ECCV 2020, 2020

SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
TW-SMNet: Deep Multitask Learning of Tele-Wide Stereo Matching.
CoRR, 2019

AMNet: Deep Atrous Multiscale Stereo Disparity Estimation Networks.
CoRR, 2019

Multi-Task Learning of Depth from Tele and Wide Stereo Image Pairs.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Boundary-sensitive Network for Portrait Segmentation.
Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019

2018
Fused Deep Neural Networks for Efficient Pedestrian Detection.
CoRR, 2018

2017
Computer Vision and Deep Learning with Applications to Object Detection, Segmentation, and Document Analysis.
PhD thesis, 2017

Fused DNN: A Deep Neural Network Fusion Approach to Fast and Robust Pedestrian Detection.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Cyber-physical system enabled nearby traffic flow modelling for autonomous vehicles.
Proceedings of the 36th IEEE International Performance Computing and Communications Conference, 2017

2015
A graphical model approach for matching partial signatures.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Signature Matching Using Supervised Topic Models.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

2013
Large-Scale Signature Matching Using Multi-stage Hashing.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

2011
A Novel Wideband Spatial Power Combining Amplifier Based on Turnstile-Junction Waveguide Divider/Combiner.
IEICE Trans. Electron., 2011
