Kashu Yamazaki

Orcid: 0000-0001-6569-6860

According to our database1, Kashu Yamazaki authored at least 23 papers between 2020 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
AerialFormer: Multi-Resolution Transformer for Aerial Image Segmentation.
Remote. Sens., August, 2024

HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model.
CoRR, 2024

Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

R<sup>2</sup>-Bench: Benchmarking the Robustness of Referring Perception Models Under Perturbations.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation.
Int. J. Comput. Vis., 2023

AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation.
CoRR, 2023

CLIP-TSA: Clip-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection.
Proceedings of the IEEE International Conference on Image Processing, 2023

DNA: Deformable Neural Articulations Network for Template-free Dynamic 3D Human Reconstruction from Monocular RGB-D Video.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning.
CoRR, 2022

Meta-Learning of NAS for Few-shot Learning in Medical Image Applications.
CoRR, 2022

Deep reinforcement learning in computer vision: a comprehensive survey.
Artif. Intell. Rev., 2022

VLCAP: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

AISFormer: Amodal Instance Segmentation with Transformer.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Contextual Explainable Video Representation: Human Perception-based Understanding.
Proceedings of the 56th Asilomar Conference on Signals, Systems, and Computers, ACSSC 2022, Pacific Grove, CA, USA, October 31, 2022

2021
Invertible Residual Network with Regularization for Effective Medical Image Segmentation.
CoRR, 2021

ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation.
IEEE Access, 2021

Invertible residual network with regularization for effective volumetric segmentation.
Proceedings of the Medical Imaging 2021: Image Processing, Online, February 15-19, 2021, 2021

Agent-Environment Network for Temporal Action Proposal Generation.
Proceedings of the IEEE International Conference on Acoustics, 2021

AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Roughness Index and Roughness Distance for Benchmarking Medical Segmentation.
Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies, 2021

2020
A Multi-task Contextual Atrous Residual Network for Brain Tumor Detection & Segmentation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Offset Curves Loss for Imbalanced Problem in Medical Segmentation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020


  Loading...