Bo Zhang

ORCID: 0000-0001-8052-782X

Affiliations:
  • Shanghai AI Laboratory, China
  • Fudan University, MoE Key Laboratory for Information Science of Electromagnetic Waves, Shanghai, China (PhD 2022)


According to our database, Bo Zhang authored at least 47 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Few-Shot Cross-Domain Object Detection With Instance-Level Prototype-Based Meta-Learning.
IEEE Trans. Circuits Syst. Video Technol., October, 2024

Push-and-Pull: A General Training Framework With Differential Augmentor for Domain Generalized Point Cloud Classification.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

MinerU: An Open-Source Solution for Precise Document Content Extraction.
CoRR, 2024

CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation.
CoRR, 2024

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models.
CoRR, 2024

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text.
CoRR, 2024

Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving.
CoRR, 2024

How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites.
CoRR, 2024

UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition.
CoRR, 2024

ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning.
CoRR, 2024

OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving.
CoRR, 2024

Cross-Task Linearity Emerges in the Pretraining-Finetuning Paradigm.
CoRR, 2024

On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Reg-TTA3D: Better Regression Makes Better Test-Time Adaptive 3D Object Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Performance-Aware Approximation of Global Channel Pruning for Multitask CNNs.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

A Closer Look at Few-Shot 3D Point Cloud Classification.
Int. J. Comput. Vis., March, 2023

Rethinking Cross-Domain Pedestrian Detection: A Background-Focused Distribution Alignment Framework for Instance-Free One-Stage Detectors.
IEEE Trans. Image Process., 2023

PAN-Guided Multiresolution Fusion Network Using Swin Transformer for Pansharpening.
IEEE Geosci. Remote. Sens. Lett., 2023

Rethinking of Feature Interaction for Multi-task Learning on Dense Prediction.
CoRR, 2023

Towards Knowledge-driven Autonomous Driving.
CoRR, 2023

REVO-LION: Evaluating and Refining Vision-Language Instruction Tuning Datasets.
CoRR, 2023

StructChart: Perception, Structuring, Reasoning for Visual Chart Understanding.
CoRR, 2023

SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving.
CoRR, 2023

Multi-view Vision-Prompt Fusion Network: Can 2D Pre-trained Model Boost 3D Point Cloud Data-scarce Learning?
CoRR, 2023

AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Generative Diffusion Prior for Unified Image Restoration and Enhancement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Joint Distribution Alignment via Adversarial Learning for Domain Adaptive Object Detection.
IEEE Trans. Multim., 2022

Sample-Centric Feature Generation for Semi-Supervised Few-Shot Learning.
IEEE Trans. Image Process., 2022

Curriculum-Style Local-to-Global Adaptation for Cross-Domain Remote Sensing Image Segmentation.
IEEE Trans. Geosci. Remote. Sens., 2022

Densely Semantic Enhancement for Domain Adaptive Region-Free Detectors.
IEEE Trans. Circuits Syst. Video Technol., 2022

Few-Shot Object Detection With Self-Adaptive Global Similarity and Two-Way Foreground Stimulator in Remote Sensing Images.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2022

ADAS: A Simple Active-and-Adaptive Baseline for Cross-Domain 3D Semantic Segmentation.
CoRR, 2022

Instance-aware Model Ensemble With Distillation For Unsupervised Domain Adaptation.
CoRR, 2022

Learning Cross-Image Object Semantic Relation in Transformer for Few-Shot Fine-Grained Image Classification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021
Coarse-to-Fine Joint Distribution Alignment for Cross-Domain Hyperspectral Image Classification.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2021

Scale-Aware Anchor-Free Object Detection via Curriculum Learning for Remote Sensing Images.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2021

Domain adaptive detection system for concealed objects using millimeter wave images.
Neural Comput. Appl., 2021

Object-aware Long-short-range Spatial Alignment for Few-Shot Fine-Grained Image Classification.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2017
Fast Deep Matting for Portrait Animation on Mobile Phone.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

2016
Virtual experiment teaching and research oriented to college computer curriculum.
Proceedings of the 11th International Conference on Computer Science & Education, 2016

