Shanghang Zhang

Orcid: 0000-0003-4047-3526

According to our database¹, Shanghang Zhang authored at least 193 papers between 2012 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

2012

2014

2016

2018

2020

2022

2024

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Empowering Corner Case Detection in Autonomous Vehicles With Multimodal Large Language Models.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2025

2024

DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., September, 2024

Exploring Generalizable Distillation for Efficient Medical Image Segmentation.

[BibT_eX]

[DOI]

IEEE J. Biomed. Health Informatics, July, 2024

BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for Multi-View BEV 3D Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Intell. Veh., January, 2024

DECOR: Dynamic Decoupling and Multiobjective Optimization for Long-Tailed Remote Sensing Image Classification.

[BibT_eX]

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2024

A lightweight multi-layer perceptron for efficient multivariate time series forecasting.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2024

The Emerging Issues in Bioimaging AI Publications and Research (Dagstuhl Seminar 24042).

[BibT_eX]

[DOI]

Dagstuhl Reports, 2024

Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, 2024

Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective.

[BibT_eX]

[DOI]

CoRR, 2024

EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting.

[BibT_eX]

[DOI]

CoRR, 2024

MC-LLaVA: Multi-Concept Personalized Vision-Language Model.

[BibT_eX]

[DOI]

CoRR, 2024

Learning from Different Samples: A Source-free Framework for Semi-supervised Domain Adaptation.

[BibT_eX]

[DOI]

CoRR, 2024

Training-free Regional Prompting for Diffusion Transformers.

[BibT_eX]

[DOI]

CoRR, 2024

Subgraph Aggregation for Out-of-Distribution Generalization on Graphs.

[BibT_eX]

[DOI]

CoRR, 2024

Towards Unifying Understanding and Generation in the Era of Vision Foundation Models: A Survey from the Autoregression Perspective.

[BibT_eX]

[DOI]

CoRR, 2024

EVA: An Embodied World Model for Future Video Anticipation.

[BibT_eX]

[DOI]

CoRR, 2024

SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference.

[BibT_eX]

[DOI]

CoRR, 2024

Expert-level vision-language foundation model for real-world radiology and comprehensive evaluation.

[BibT_eX]

[DOI]

CoRR, 2024

Discovering Long-Term Effects on Parameter Efficient Fine-tuning.

[BibT_eX]

[DOI]

CoRR, 2024

FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions.

[BibT_eX]

[DOI]

CoRR, 2024

Multimodal Large Language Models for Bioimage Analysis.

[BibT_eX]

[DOI]

CoRR, 2024

MAVIS: Mathematical Visual Instruction Tuning.

[BibT_eX]

[DOI]

CoRR, 2024

Fisher-aware Quantization for DETR Detectors with Critical-category Objectives.

[BibT_eX]

[DOI]

CoRR, 2024

MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception.

[BibT_eX]

[DOI]

CoRR, 2024

RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation.

[BibT_eX]

[DOI]

CoRR, 2024

S<sup>3</sup>Gaussian: Self-Supervised Street Gaussians for Autonomous Driving.

[BibT_eX]

[DOI]

CoRR, 2024

Implicit Neural Image Field for Biological Microscopy Image Compression.

[BibT_eX]

[DOI]

CoRR, 2024

Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation.

[BibT_eX]

[DOI]

CoRR, 2024

CoCoGesture: Toward Coherent Co-speech 3D Gesture Generation in the Wild.

[BibT_eX]

[DOI]

CoRR, 2024

Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation.

[BibT_eX]

[DOI]

CoRR, 2024

Unveiling the Tapestry of Consistency in Large Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention.

[BibT_eX]

[DOI]

CoRR, 2024

Intuition-aware Mixture-of-Rank-1-Experts for Parameter Efficient Finetuning.

[BibT_eX]

[DOI]

CoRR, 2024

Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

SpikeNVS: Enhancing Novel View Synthesis from Blurry Images via Spike Camera.

[BibT_eX]

[DOI]

CoRR, 2024

Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want.

[BibT_eX]

[DOI]

CoRR, 2024

Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2024

DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing.

[BibT_eX]

[DOI]

CoRR, 2024

A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge - Multi-Task Robustness Track.

[BibT_eX]

[DOI]

CoRR, 2024

Building Flexible Machine Learning Models for Scientific Computing at Scale.

[BibT_eX]

[DOI]

CoRR, 2024

Proximity QA: Unleashing the Power of Multi-Modal Large Language Models for Spatial Proximity Analysis.

[BibT_eX]

[DOI]

CoRR, 2024

VeCAF: VLM-empowered Collaborative Active Finetuning with Training Objective Awareness.

[BibT_eX]

[DOI]

CoRR, 2024

RustNeRF: Robust Neural Radiance Field with Low-Quality Images.

[BibT_eX]

[DOI]

CoRR, 2024

TCP: Triplet Contrastive-relationship Preserving for Class-Incremental Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Distribution-Aware Continual Test-Time Adaptation for Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Unsupervised Spike Depth Estimation via Cross-modality Cross-domain Knowledge Transfer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Compositional Few-Shot Class-Incremental Learning.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language Model.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

VLUReID: Exploiting Vision-Language Knowledge for Unsupervised Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Enhanced Blind Watermarking Against Black-Box Noise: Leveraging CIN Framework.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

I-MedSAM: Implicit Medical Image Segmentation with Segment Anything.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Gradient-based Parameter Selection for Efficient Fine-Tuning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FreeKD: Knowledge Distillation via Semantic Frequency Prompt.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PromptCoT: Align Prompt Distribution via Adapted Chain-of-Thought.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

NTO3D: Neural Target Object 3D Reconstruction with Segment Anything.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Cloud-Device Collaborative Learning for Multimodal Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-Speech Gesture Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FM-OV3D: Foundation Model-Based Cross-Modal Knowledge Blending for Open-Vocabulary 3D Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Efficient Deweahter Mixture-of-Experts with Uncertainty-Aware Feature-Wise Linear Modulation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Frame-Recurrent Video Crowd Counting.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., September, 2023

Learning Deep Features for Robotic Inference From Physical Interactions.

[BibT_eX]

[DOI]

IEEE Trans. Cogn. Dev. Syst., September, 2023

Expanding the prediction capacity in long sequence time-series forecasting.

[BibT_eX]

[DOI]

Artif. Intell., May, 2023

P2FEViT: Plug-and-Play CNN Feature Embedded Hybrid Vision Transformer for Remote Sensing Image Classification.

[BibT_eX]

[DOI]

Remote. Sens., April, 2023

Caching in Dynamic Environments: A Near-Optimal Online Learning Approach.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation.

[BibT_eX]

[DOI]

CoRR, 2023

Cloud-Device Collaborative Learning for Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Iterative Prompt Relabeling for diffusion model with RLDF.

[BibT_eX]

[DOI]

CoRR, 2023

FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection.

[BibT_eX]

[DOI]

CoRR, 2023

LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding.

[BibT_eX]

[DOI]

CoRR, 2023

Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation.

[BibT_eX]

[DOI]

CoRR, 2023

Customize-It-3D: High-Quality 3D Creation from A Single Image Using Subject-Specific Knowledge Prior.

[BibT_eX]

[DOI]

CoRR, 2023

Split & Merge: Unlocking the Potential of Visual Adapters via Sparse Training.

[BibT_eX]

[DOI]

CoRR, 2023

MoEC: Mixture of Experts Implicit Neural Compression.

[BibT_eX]

[DOI]

CoRR, 2023

ChatIllusion: Efficient-Aligning Interleaved Generation ability with Visual Instruction Model.

[BibT_eX]

[DOI]

CoRR, 2023

COLE: A Hierarchical Generation Framework for Graphic Design.

[BibT_eX]

[DOI]

CoRR, 2023

Heterogenous Memory Augmented Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2023

Distribution-Aware Continual Test Time Adaptation for Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

NOC: High-Quality Neural Object Cloning with 3D Lifting of Segment Anything.

[BibT_eX]

[DOI]

CoRR, 2023

RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision.

[BibT_eX]

[DOI]

CoRR, 2023

PM-DETR: Domain Adaptive Prompt Memory for Object Detection with Transformers.

[BibT_eX]

[DOI]

CoRR, 2023

DiffuseIR: Diffusion Models For Isotropic Reconstruction of 3D Microscopic Images.

[BibT_eX]

[DOI]

CoRR, 2023

UniOcc: Unifying Vision-Centric 3D Occupancy Prediction with Geometric and Semantic Rendering.

[BibT_eX]

[DOI]

CoRR, 2023

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation.

[BibT_eX]

[DOI]

CoRR, 2023

Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Chain of Thought Prompt Tuning in Vision Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

MoWE: Mixture of Weather Experts for Multiple Adverse Weather Removal.

[BibT_eX]

[DOI]

CoRR, 2023

Exploring Sparse Visual Prompt for Cross-domain Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

When Visible Light (Backscatter) Communication Meets Neuromorphic Cameras in V2X.

[BibT_eX]

[DOI]

Proceedings of the 24th International Workshop on Mobile Computing Systems and Applications, 2023

RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery.

[BibT_eX]

[DOI]

Proceedings of the 33rd Workshop on Network and Operating System Support for Digital Audio and Video, 2023

PAD: A Dataset and Benchmark for Pose-agnostic Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DiffuseIR: Diffusion Models for Isotropic Reconstruction of 3D Microscopic Images.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Electroencephalogram-Based Driver Emotional State Detection with Manifold Learning.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2023

A Text Prompt-Based Approach for Zero-Shot Corner Case Object Detection in Autonomous Driving.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2023

Uncertainty-Aware Dynamic Learning for Cross-Domain Few-Shot Scene Classification from Remote Sensing Imagery.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2023

Wasserstein Barycenter Matching for Graph Size Generalization of Message Passing Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

QD-BEV : Quantization-aware View-guided Distillation for Multi-view 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Q-Diffusion: Quantizing Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

BadRes: Reveal the Backdoors Through Residual Connection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

CSQ: Growing Mixed-Precision Quantization Scheme with Bi-level Continuous Sparsification.

[BibT_eX]

[DOI]

Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

Improving Generalization of Meta-Learning with Inverted Regularization at Inner-Level.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Annealing-based Label-Transfer Learning for Open World Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Open-Vocabulary Point-Cloud Object Detection without 3D Annotation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Cloud-Device Collaborative Adaptation to Continual Changing Environments in the Real-World.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

A Review of Single-Source Deep Unsupervised Visual Domain Adaptation.

[BibT_eX]

[DOI]

Alberto L. Sangiovanni-Vincentelli

Sanjit A. Seshia

Kurt Keutzer

IEEE Trans. Neural Networks Learn. Syst., 2022

Active Gradual Domain Adaptation: Dataset and Approach.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2022

BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for BEV 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2022

Multi-latent Space Alignments for Unsupervised Domain Adaptation in Multi-view 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2022

PointCLIP V2: Adapting CLIP for Powerful 3D Open-world Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Uncertainty Guided Depth Fusion for Spike Camera.

[BibT_eX]

[DOI]

CoRR, 2022

Unsupervised Spike Depth Estimation via Cross-modality Cross-domain Knowledge Transfer.

[BibT_eX]

[DOI]

CoRR, 2022

Open-Vocabulary 3D Detection via Image-level Class and Debiased Cross-modal Contrastive Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Domain-Adaptive Text Classification with Structured Knowledge from Unlabeled Data.

[BibT_eX]

[DOI]

CoRR, 2022

UnrealNAS: Can We Search Neural Architectures with Unreal Data?

[BibT_eX]

[DOI]

CoRR, 2022

Cross-Domain Object Detection with Mean-Teacher Transformer.

[BibT_eX]

[DOI]

CoRR, 2022

Self-Supervised Pretraining Improves Self-Supervised Pretraining.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Margin-Based Few-Shot Class-Incremental Learning with Class-Level Overfitting Mitigation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Jump Self-attention: Capturing High-order Statistics in Transformers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Domain-Adaptive Text Classification with Structured Knowledge from Unlabeled Data.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Prototype-Voxel Contrastive Learning for LiDAR Point Cloud Panoptic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

DNA: Domain Generalization with Diversified Neural Averaging.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Temporal Efficient Training of Spiking Neural Network via Gradient Re-weighting.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

MTTrans: Cross-domain Object Detection with Mean Teacher Transformer.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Efficient Meta-Tuning for Content-Aware Neural Video Delivery.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Delving Deep into the Generalization of Vision Transformers under Distribution Shifts.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Online Continual Adaptation with Active Self-Training.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021

Learning graph attention-aware knowledge graph embedding.

[BibT_eX]

[DOI]

Neurocomputing, 2021

2nd Place Solution for VisDA 2021 Challenge - Universally Domain Adaptive Image Recognition.

[BibT_eX]

[DOI]

CoRR, 2021

Delving Deep into the Generalization of Vision Transformers under Distribution Shifts.

[BibT_eX]

[DOI]

CoRR, 2021

Differentiable Spike: Rethinking Gradient-Descent for Training Spiking Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Revisiting Mid-Level Patterns for Cross-Domain Few-Shot Recognition.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Annotation-Efficient Untrimmed Video Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Triplet Attention: Rethinking the Similarity in Transformers.

[BibT_eX]

[DOI]

Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Decoupling Global and Local Representations via Invertible Generative Flows.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

MERITS: Medication Recommendation for Chronic Disease with Irregular Time-Series.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Data Mining, 2021

Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Contrastive Multimodal Fusion with TupleInfoNCE.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Cross-Domain Sentiment Classification with Contrastive Learning and Mutual Information Maximization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Prototypical Cross-Domain Self-Supervised Learning for Few-Shot Unsupervised Domain Adaptation.

[BibT_eX]

[DOI]

Alberto L. Sangiovanni-Vincentelli

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Invariant Representations and Risks for Semi-Supervised Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Modeling relation paths for knowledge base completion via joint adversarial training.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2020

P4Contrast: Contrastive Learning with Pairs of Point-Pixel Pairs for RGB-D Scene Understanding.

[BibT_eX]

[DOI]

CoRR, 2020

Cross-Domain Sentiment Classification with In-Domain Contrastive Learning.

[BibT_eX]

[DOI]

CoRR, 2020

Revisiting Mid-Level Patterns for Distant-Domain Few-Shot Recognition.

[BibT_eX]

[DOI]

CoRR, 2020

Transfer Learning or Self-supervised Learning? A Tale of Two Pretraining Paradigms.

[BibT_eX]

[DOI]

CoRR, 2020

Rethinking Distributional Matching Based Domain Adaptation.

[BibT_eX]

[DOI]

CoRR, 2020

Decoupling Global and Local Representations from/for Image Generation.

[BibT_eX]

[DOI]

CoRR, 2020

Compositional Few-Shot Recognition with Primitive Discovery and Enhancing.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Generalized Zero-Shot Text Classification for ICD Coding.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Instance Adaptive Self-training for Unsupervised Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

TCGM: An Information-Theoretic Framework for Semi-supervised Multi-modality Learning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Multi-Source Distilling Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Generalized Zero-shot ICD Coding.

[BibT_eX]

[DOI]

CoRR, 2019

Feature Fusion for Image Retrieval With Adaptive Bitrate Allocation and Hard Negative Mining.

[BibT_eX]

[DOI]

Chuang Zhu

Huihui Dong

Shanghang Zhang

IEEE Access, 2019

Dual Adversarial Semantics-Consistent Network for Generalized Zero-Shot Learning.

[BibT_eX]

[DOI]

Jian Ni

Shanghang Zhang

Haiyong Xie

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

MaCow: Masked Convolutional Generative Flow.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018

Deep Understanding of Urban Mobility from CityscapeWebcams.

[BibT_eX]

[DOI]

Shanghang Zhang

PhD thesis, 2018

Hierarchical Attention Networks for Knowledge Base Completion via Joint Adversarial Training.

[BibT_eX]

[DOI]

CoRR, 2018

Adversarial Multiple Source Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Multiple Source Domain Adaptation with Adversarial Learning.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

A Deep Learning Approach to IoT Authentication.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Communications, 2018

Learning to Understand Image Blur.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Topology adaptive graph convolutional networks.

[BibT_eX]

[DOI]

CoRR, 2017

Multiple Source Domain Adaptation with Adversarial Training of Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2017

FCN-rLSTM: Deep Spatio-Temporal Neural Networks for Vehicle Counting in City Cameras.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Understanding Traffic Density from Large-Scale Web Camera Data.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2015

Traffic flow from a low frame rate city camera.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

2014

Bayesian model fusion: Enabling test cost reduction of analog/RF circuits via wafer-level spatial variation modeling.

[BibT_eX]

[DOI]

Shanghang Zhang

Xin Li

Ronald D. Blanton

José Machado da Silva

John M. Carulli Jr.

Kenneth M. Butler

Proceedings of the 2014 International Test Conference, 2014

2013

On a Highly Efficient RDO-Based Mode Decision Pipeline Design for AVS.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2013

A high-throughput low-latency arithmetic encoder design for HDTV.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

2012

An efficient foreground-based surveillance video coding scheme in low bit-rate compression.

[BibT_eX]

[DOI]

Proceedings of the 2012 Visual Communications and Image Processing, 2012

A flexible and high-performance hardware video encoder architecture.

[BibT_eX]

[DOI]

Proceedings of the 2012 Picture Coding Symposium, 2012

An Optimized Hardware Video Encoder for AVS with Level C+ Data Reuse Scheme for Motion Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Shanghang Zhang

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...