2024

FedConv: Enhancing Convolutional Neural Networks for Handling Data Heterogeneity in Federated Learning.

[DOI]

Peiran Xu

Zeyu Wang

Trans. Mach. Learn. Res., 2024

TransUNet: Rethinking the U-Net architecture design for medical image segmentation through the lens of transformers.

[DOI]

Medical Image Anal., 2024

AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation.

[DOI]

CoRR, 2024

What If We Recaption Billions of Web Images with LLaMA-3?

[DOI]

CoRR, 2024

Autoregressive Pretraining with Mamba in Vision.

[DOI]

CoRR, 2024

Medical Vision Generalist: Unifying Medical Imaging Tasks in Context.

[DOI]

CoRR, 2024

Mamba-R: Vision Mamba ALSO Needs Registers.

[DOI]

CoRR, 2024

SPFormer: Enhancing Vision Transformer with Superpixel Representation.

[DOI]

CoRR, 2024

Brain Tumor Segmentation Through Supervoxel Transformer.

[DOI]

Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning.

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

From Pixels to Objects: A Hierarchical Approach for Part and Object Segmentation Using Local and Global Aggregation.

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties.

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference.

[DOI]

Feng Wang

Jieru Mei

Alan L. Yuille

Proceedings of the Computer Vision - ECCV 2024, 2024

Masked Autoencoders are Secretly Efficient Learners.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

BNET: Batch Normalization With Enhanced Linear Transformation.

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties.

[DOI]

CoRR, 2023

SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference.

[DOI]

Feng Wang

Jieru Mei

Alan L. Yuille

CoRR, 2023

3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers.

[DOI]

CoRR, 2023

3D-TransUNet for Brain Metastases Segmentation in the BraTS2023 Challenge.

[DOI]

Proceedings of the Brain Tumor Segmentation, and Cross-Modality Domain Adaptation for Medical Image Segmentation, 2023

SwinMM: Masked Multi-view with Swin Transformers for 3D Medical Image Segmentation.

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Superpixel Transformers for Efficient Semantic Segmentation.

[DOI]

IROS, 2023

3D-Aware Neural Body Fitting for Occlusion Robust 3D Human Pose Estimation.

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

Waymo Open Dataset: Panoramic Video Panoptic Segmentation.

[DOI]

CoRR, 2022

Fast AdvProp.

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Waymo Open Dataset: Panoramic Video Panoptic Segmentation.

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

In Defense of Image Pre-Training for Spatiotemporal Recognition.

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

Are Transformers more robust than CNNs?

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Shape-Texture Debiased Neural Network Training.

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks.

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Batch Normalization with Enhanced Linear Transformation.

[DOI]

CoRR, 2020

CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Network.

[DOI]

CoRR, 2020

AtomNAS: Fine-Grained End-to-End Neural Architecture Search.

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

Neural Architecture Search for Lightweight Non-Local Networks.

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Learning to Refine 3D Human Pose Sequences.

[DOI]

Proceedings of the 2019 International Conference on 3D Vision, 2019

2018

Online Dictionary Learning for Approximate Archetypal Analysis.

[DOI]

Jieru Mei

Chunyu Wang

Wenjun Zeng

Proceedings of the Computer Vision - ECCV 2018, 2018

2016

Scene text script identification with Convolutional Recurrent Neural Networks.

[DOI]

Proceedings of the 23rd International Conference on Pattern Recognition, 2016