Sipeng Zheng

CoRR, 2024

LLaMA-Rider: Spurring Large Language Models to Explore the Open World.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

UniCode: Learning a Unified Codebook for Multimodal Large Language Models.

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

No-frills Temporal Video Grounding: Multi-Scale Neighboring Attention and Zoom-in Boundary Detection.

Qi Zhang

CoRR, 2023

POV: Prompt-Oriented View-Agnostic Learning for Egocentric Hand-Object Interaction in the Multi-view World.

Boshen Xu

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Anchor-Based Detection for Natural Language Localization in Ego-Centric Videos.

Proceedings of the IEEE International Conference on Consumer Electronics, 2023

Open-Category Human-Object Interaction Pre-training via Language Modeling Framework.

Boshen Xu

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Accommodating Audio Modality in CLIP for Multimodal Processing.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Exploring Anchor-based Detection for Ego4D Natural Language Query.

CoRR, 2022

Few-Shot Action Recognition with Hierarchical Matching and Contrastive Learning.

Proceedings of the Computer Vision - ECCV 2022, 2022

VRDFormer: End-to-End Video Visual Relation Detection with Transformers.

Juan Antonio Hernández Tamames

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

MR imaging for the quantitative assessment of brain iron in aceruloplasminemia: A postmortem validation study.

Lena H. P. Vroegindeweij

Janneke G. Langendonk

NeuroImage, 2021

2020

Skeleton-Based Interactive Graph Network For Human Object Interaction Detection.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

2019

Visual Relation Detection with Multi-Level Attention.
