2025
Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs.
CoRR, June, 2025

Efficient Parallel Implementation of Non-Local Means Algorithm on GPU.
Proceedings of the 17th Workshop on General Purpose Processing Using GPU, 2025

2024
An Optimized GPU Implementation for GIST Descriptor.
ACM Trans. Archit. Code Optim., December, 2024

2023
Multi-directional Sobel operator kernel on GPUs.
J. Parallel Distributed Comput., July, 2023

StereoVAE: A lightweight stereo-matching system using embedded GPUs.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023