Cihang Xie

ORCID: 0000-0003-1243-8045

According to our database, Cihang Xie authored at least 91 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
On the Adversarial Robustness of Camera-based 3D Object Detection.
Trans. Mach. Learn. Res., 2024

Unleashing the Power of Visual Prompting At the Pixel Level.
Trans. Mach. Learn. Res., 2024

Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies.
Trans. Mach. Learn. Res., 2024

FedConv: Enhancing Convolutional Neural Networks for Handling Data Heterogeneity in Federated Learning.
Trans. Mach. Learn. Res., 2024

AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation.
CoRR, 2024

Causal Image Modeling for Efficient Visual Understanding.
CoRR, 2024

VHELM: A Holistic Evaluation of Vision Language Models.
CoRR, 2024

A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?
CoRR, 2024

VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges.
CoRR, 2024

MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine.
CoRR, 2024

VideoHallucer: Evaluating Intrinsic and Extrinsic Hallucinations in Large Video-Language Models.
CoRR, 2024

What If We Recaption Billions of Web Images with LLaMA-3?
CoRR, 2024

Autoregressive Pretraining with Mamba in Vision.
CoRR, 2024

Scaling White-Box Transformers for Vision.
CoRR, 2024

ARVideo: Autoregressive Pretraining for Self-Supervised Video Representation Learning.
CoRR, 2024

Mamba-R: Vision Mamba ALSO Needs Registers.
CoRR, 2024

HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing.
CoRR, 2024

3D-TransUNet for Brain Metastases Segmentation in the BraTS2023 Challenge.
CoRR, 2024

AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability.
CoRR, 2024

SPFormer: Enhancing Vision Transformer with Superpixel Representation.
CoRR, 2024

Navigation as Attackers Wish? Towards Building Robust Embodied Agents under Federated Learning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Brain Tumor Segmentation Through Supervoxel Transformer.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

Rejuvenating image-GPT as Strong Visual Representation Learners.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

From Pixels to Objects: A Hierarchical Approach for Part and Object Segmentation Using Local and Global Aggregation.
Proceedings of the Computer Vision - ECCV 2024, 2024

A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties.
Proceedings of the Computer Vision - ECCV 2024, 2024

How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs.
Proceedings of the Computer Vision - ECCV 2024, 2024

Localization vs. Semantics: Visual Representations in Unimodal and Multimodal Models.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

L2B: Learning to Bootstrap Robust Models for Combating Label Noise.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Masked Autoencoders are Secretly Efficient Learners.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Benchmarking Robustness in Neural Radiance Fields.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Revisiting Adversarial Training at Scale.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
BNET: Batch Normalization With Enhanced Linear Transformation.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties.
CoRR, 2023

Compress & Align: Curating Image-Text Data with Human Knowledge.
CoRR, 2023

Audio-Visual LLM for Video Understanding.
CoRR, 2023

How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs.
CoRR, 2023

MixCon3D: Synergizing Multi-View and Cross-Modal Contrastive Learning for Enhancing 3D Representation.
CoRR, 2023

Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics.
CoRR, 2023

CLIPA-v2: Scaling CLIP Training with 81.1% Zero-shot ImageNet Accuracy within a $10,000 Budget; An Extra $4,000 Unlocks 81.8% Accuracy.
CoRR, 2023

An Inverse Scaling Law for CLIP Training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Consistency-Guided Meta-learning for Bootstrapping Semi-supervised Medical Image Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

SwinMM: Masked Multi-view with Swin Transformers for 3D Medical Image Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

One-Pixel Shortcut: On the Learning Preference of Deep Neural Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Can CNNs Be More Robust Than Transformers?
Proceedings of the Eleventh International Conference on Learning Representations, 2023

SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Diffusion Models as Masked Autoencoders.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Masked Autoencoders Enable Efficient Knowledge Distillers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Practical Disruption of Image Translation Deepfake Networks.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Localization vs. Semantics: How Can Language Benefit Visual Representation Learning?
CoRR, 2022

Navigation as the Attacker Wishes? Towards Building Byzantine-Robust Embodied Agents under Federated Learning.
CoRR, 2022

Bag of Tricks for FGSM Adversarial Training.
CoRR, 2022

Learning to Bootstrap for Combating Label Noise.
CoRR, 2022

Finding Differences Between Transformers and ConvNets Using Counterfactual Simulation Testing.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Adversarial Attack on Attackers: Post-Process to Mitigate Black-Box Score-Based Query Attacks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Image BERT Pre-training with Online Tokenizer.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Fast AdvProp.
Proceedings of the Tenth International Conference on Learning Representations, 2022

ViP: Unified Certified Detection and Recovery for Patch Attack with Vision Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

In Defense of Image Pre-Training for Spatiotemporal Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

Simulated Adversarial Testing of Face Recognition Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

A Simple Data Mixing Prior for Improving Self-Supervised Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
iBOT: Image BERT Pre-Training with Online Tokenizer.
CoRR, 2021

Are Transformers more robust than CNNs?
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Shape-Texture Debiased Neural Network Training.
Proceedings of the 9th International Conference on Learning Representations, 2021

Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Robust and Accurate Object Detection via Adversarial Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Batch Normalization with Enhanced Linear Transformation.
CoRR, 2020

Smooth Adversarial Training.
CoRR, 2020

Intriguing Properties of Adversarial Training at Scale.
Proceedings of the 8th International Conference on Learning Representations, 2020

PatchAttack: A Black-Box Texture-Based Attack with Reinforcement Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

Regional Homogeneity: Towards Learning Transferable Universal Adversarial Perturbations Against Defenses.
Proceedings of the Computer Vision - ECCV 2020, 2020

Adversarial Examples Improve Image Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Neural Architecture Search for Lightweight Non-Local Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Universal Physical Camouflage Attacks on Object Detectors.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning Transferable Adversarial Examples via Ghost Networks.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
UPC: Learning Universal Physical Camouflage Attacks on Object Detectors.
CoRR, 2019

Intriguing properties of adversarial training.
CoRR, 2019

Improving Transferability of Adversarial Examples With Input Diversity.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Feature Denoising for Improving Adversarial Robustness.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Adversarial Attacks and Defences Competition.
CoRR, 2018

Improving Transferability of Adversarial Examples with Input Diversity.
CoRR, 2018

Mitigating Adversarial Effects Through Randomization.
Proceedings of the 6th International Conference on Learning Representations, 2018

DeepVoting: A Robust and Explainable Deep Network for Semantic Part Detection Under Partial Occlusion.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Single-Shot Object Detection With Enriched Semantics.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Visual Concepts and Compositional Voting.
CoRR, 2017

DeepVoting: An Explainable Framework for Semantic Part Detection under Partial Occlusion.
CoRR, 2017

Adversarial Examples for Semantic Segmentation and Object Detection.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Detecting Semantic Parts on Partially Occluded Objects.
Proceedings of the British Machine Vision Conference 2017, 2017

2016
Prediction-based MAC-layer sensing in cognitive radio networks.
Wirel. Commun. Mob. Comput., 2016

