2024
Intent-Oriented Dynamic Interest Modeling for Personalized Web Search.
ACM Trans. Inf. Syst., July, 2024
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Analyzing The Language of Visual Tokens.
CoRR, 2024
KiVA: Kid-inspired Visual Analogies for Testing Large Multimodal Models.
CoRR, 2024
LLARVA: Vision-Action Instruction Tuning Enhances Robot Learning.
CoRR, 2024
Evaluating Multiview Object Consistency in Humans and Image Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
The Impact of Blockchain Implementations on Supply Chain Collaboration.
Proceedings of the Advances in Production Management Systems. Production Management Systems for Volatile, Uncertain, Complex, and Ambiguous Environments, 2024
Discovering Failure Modes of Text-guided Diffusion Models via Adversarial Search.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Learning Dynamic Multi-attribute Interest for Personalized Product Search.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Finding Visual Task Vectors.
Proceedings of the Computer Vision - ECCV 2024, 2024
Masked Autoencoders are Secretly Efficient Learners.
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Sequential Modeling Enables Scalable Learning for Large Vision Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Understanding Pan-Sharpening via Generalized Inverse.
CoRR, 2023
Intriguing Properties of Text-guided Diffusion Models.
CoRR, 2023
Delving into Masked Autoencoders for Multi-Label Thorax Disease Classification.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023
CoKe: Contrastive Learning for Robust Keypoint Detection.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023
Making Your First Choice: To Address Cold Start Problem in Medical Active Learning.
Proceedings of the Medical Imaging with Deep Learning, 2023
Can CNNs Be More Robust Than Transformers?
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Masked Autoencoders Enable Efficient Knowledge Distillers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Low-frequency oscillations link frontal and parietal cortex with subthalamic nucleus in conflicts.
,
,
,
,
,
,
,
,
,
,
,
,
NeuroImage, 2022
Making Your First Choice: To Address Cold Start Problem in Vision Active Learning.
CoRR, 2022
CateNorm: Categorical Normalization for Robust Medical Image Segmentation.
Proceedings of the Domain Adaptation and Representation Transfer - 4th MICCAI Workshop, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Point-Level Region Contrast for Object Detection Pre-Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
TransFG: A Transformer Architecture for Fine-Grained Recognition.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
TransFG: A Transformer Architecture for Fine-grained Recognition.
CoRR, 2021
Glance-and-Gaze Vision Transformer.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Are Transformers more robust than CNNs?
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Mask Guided Matting via Progressive Refinement Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
Unsupervised Part Discovery via Feature Alignment.
CoRR, 2020
Can Temporal Information Help with Contrastive Self-Supervised Learning?
CoRR, 2020
CoKe: Localized Contrastive Learning for Robust Keypoint Detection.
CoRR, 2020
C2FNAS: Coarse-to-Fine Neural Architecture Search for 3D Medical Image Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
SSDC-DenseNet: A Cost-Effective End-to-End Spectral-Spatial Dual-Channel Dense Network for Hyperspectral Image Classification.
IEEE Access, 2019
Semantic Part Detection via Matching: Learning to Generalize to Novel Viewpoints From Limited Training Data.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
CLEVR-Ref+: Diagnosing Visual Reasoning With Referring Expressions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
2018
A computer-assisted instructional method based on machine learning in software testing class.
Comput. Appl. Eng. Educ., 2018
2017
Underactuated control of swing in orbit debris towing removal via tether space robots.
Proceedings of the 2017 IEEE International Conference on Robotics and Biomimetics, 2017