Kashu Yamazaki

Remote. Sens., August, 2024

HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model.

[BibT_eX]

[DOI]

CoRR, 2024

Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

R<sup>2</sup>-Bench: Benchmarking the Robustness of Referring Perception Models Under Perturbations.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2023

AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation.

[BibT_eX]

[DOI]

Jackson David Cothren

CoRR, 2023

CLIP-TSA: Clip-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2023

DNA: Deformable Neural Articulations Network for Template-free Dynamic 3D Human Reconstruction from Monocular RGB-D Video.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning.

[BibT_eX]

[DOI]

CoRR, 2022

Meta-Learning of NAS for Few-shot Learning in Medical Image Applications.

[BibT_eX]

[DOI]

CoRR, 2022

Deep reinforcement learning in computer vision: a comprehensive survey.

[BibT_eX]

[DOI]

Khoa Luu

Marios Savvides

Artif. Intell. Rev., 2022

VLCAP: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning.

[BibT_eX]

[DOI]

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

AISFormer: Amodal Instance Segmentation with Transformer.

[BibT_eX]

[DOI]

Minh Q. Tran

Khoa Vo

Arthur A. F. Fernandes

Michael Kidd

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Contextual Explainable Video Representation: Human Perception-based Understanding.

[BibT_eX]

[DOI]

Proceedings of the 56th Asilomar Conference on Signals, Systems, and Computers, ACSSC 2022, Pacific Grove, CA, USA, October 31, 2022

2021

Invertible Residual Network with Regularization for Effective Medical Image Segmentation.

[BibT_eX]

[DOI]

T. Hoang Ngan Le

CoRR, 2021

ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation.

[BibT_eX]

[DOI]

IEEE Access, 2021

Invertible residual network with regularization for effective volumetric segmentation.

[BibT_eX]

[DOI]

T. Hoang Ngan Le

Proceedings of the Medical Imaging 2021: Image Processing, Online, February 15-19, 2021, 2021

Agent-Environment Network for Temporal Action Proposal Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation.

[BibT_eX]

[DOI]

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Roughness Index and Roughness Distance for Benchmarking Medical Segmentation.

[BibT_eX]

[DOI]