Junyu Han

Orcid: 0000-0001-9917-7268

According to our database¹, Junyu Han authored at least 72 papers between 2011 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

BinVPR: Binary Neural Networks towards Real-Valued for Visual Place Recognition.

[BibT_eX]

[DOI]

Sensors, July, 2024

MaskOCR: Scene Text Recognition with Masked Vision-Language Pre-training.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2024

Multi-Domain Incremental Learning for Face Presentation Attack Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

CAE v2: Context Autoencoder with CLIP Latent Alignment.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

Layered Rendering Diffusion Model for Zero-Shot Guided Image Synthesis.

[BibT_eX]

[DOI]

CoRR, 2023

Accelerating Vision Transformers Based on Heterogeneous Attention Patterns.

[BibT_eX]

[DOI]

CoRR, 2023

MataDoc: Margin and Text Aware Document Dewarping for Arbitrary Boundary.

[BibT_eX]

[DOI]

CoRR, 2023

Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation.

[BibT_eX]

[DOI]

CoRR, 2023

HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MSAbox: A spatially stable face detector.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Graph Contrastive Learning for Skeleton-based Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CFCG: Semi-Supervised Semantic Segmentation via Cross-Fusion and Contour Guidance Supervision.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Semi-DETR: Semi-Supervised Object Detection with Detection Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PSVT: End-to-End Multi-Person 3D Pose and Shape Estimation with Progressive Video Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Cyclically Disentangled Feature Translation for Face Anti-spoofing.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Encoder-Decoder Structure with Multiscale Receptive Field Block for Unsupervised Depth Estimation from Monocular Video.

[BibT_eX]

[DOI]

Remote. Sens., 2022

CAE v2: Context Autoencoder with CLIP Target.

[BibT_eX]

[DOI]

CoRR, 2022

Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling.

[BibT_eX]

[DOI]

CoRR, 2022

Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining.

[BibT_eX]

[DOI]

CoRR, 2022

MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining.

[BibT_eX]

[DOI]

CoRR, 2022

Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

StyleSwap: Style-Based Generator Empowers Robust Face Swapping.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

UFO: Unified Feature Optimization.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Few-Shot Font Generation by Learning Fine-Grained Local Styles.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Few-Shot Head Swapping in the Wild.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Expressive Talking Head Generation with Granular Audio-Visual Control.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MobileFaceSwap: A Lightweight Framework for Video Face Swapping.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

A novel loop closure detection method with the combination of points and lines based on information entropy.

[BibT_eX]

[DOI]

Junyu Han

Dong Ruifang

Jiangming Kan

J. Field Robotics, 2021

StrucTexT: Structured Text Understanding with Multi-Modal Transformers.

[BibT_eX]

[DOI]

CoRR, 2021

StrucTexT: Structured Text Understanding with Multi-Modal Transformers.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Dynamic Class Queue for Large Scale Face Recognition in the Wild.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

FaceController: Controllable Attribute Editing for Face in the Wild.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Real Image Super Resolution Via Heterogeneous Model using GP-NAS.

[BibT_eX]

[DOI]

CoRR, 2020

NTIRE 2020 Challenge on Real Image Denoising: Dataset, Methods and Results.

[BibT_eX]

[DOI]

Abdelrahman Abdelhamed

Krzysztof Trojanowski

Yanhong Wu

Pablo Navarrete Michelini

CoRR, 2020

Learning Generalized Spoof Cues for Face Anti-spoofing.

[BibT_eX]

[DOI]

CoRR, 2020

Towards Accurate Scene Text Recognition with Semantic Reasoning Networks.

[BibT_eX]

[DOI]

CoRR, 2020

Learning Global Structure Consistency for Robust Object Tracking.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

AIM 2020 Challenge on Real Image Super-Resolution: Methods and Results.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Real Image Super Resolution via Heterogeneous Model Ensemble Using GP-NAS.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Towards Accurate Scene Text Recognition With Semantic Reasoning Networks.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

HAMBox: Delving Into Mining High-Quality Anchors on Face Detection.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

NTIRE 2020 Challenge on Real Image Denoising: Dataset, Methods and Results.

[BibT_eX]

[DOI]

Abdelrahman Abdelhamed

Krzysztof Trojanowski

Yanhong Wu

Pablo Navarrete Michelini

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

HAMBox: Delving into Online High-quality Anchors Mining for Detecting Outer Faces.

[BibT_eX]

[DOI]

CoRR, 2019

PyramidBox++: High Performance Detector for Finding Tiny Face.

[BibT_eX]

[DOI]

CoRR, 2019

Detecting Text in the Wild with Deep Character Embedding Network.

[BibT_eX]

[DOI]

CoRR, 2019

Editing Text in the Wild.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

An End-to-End Video Text Detector with Online Tracking.

[BibT_eX]

[DOI]

Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

ICDAR 2019 Competition on Large-Scale Street View Text with Partial Labeling - RRC-LSVT.

[BibT_eX]

[DOI]

Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

EATEN: Entity-Aware Attention for Single Shot Visual Text Extraction.

[BibT_eX]

[DOI]

Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text - RRC-ArT.

[BibT_eX]

[DOI]

Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

ACFNet: Attentional Class Feature Network for Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Chinese Street View Text: Large-Scale Chinese Text Reading With Partially Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

TextNet: Irregular Text Reading from Images with an End-to-End Trainable Network.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2018, 2018

Detecting Text in the Wild with Deep Character Embedding Network.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2018, 2018

2017

WordSup: Exploiting Word Annotations for Character Based Text Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Vision-based illegal human ladder climbing action recognition in substation.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Advanced Computational Intelligence, 2017

2016

Context-aware mathematical expression recognition: An end-to-end framework and a benchmark.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

STAR-Net: A SpaTial Attention Residue Network for Scene Text Recognition.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference 2016, 2016

2013

Structure guided fusion for depth map inpainting.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2013

2011

Gradient sparsity for piecewise continuous optical flow estimation.

[BibT_eX]

[DOI]

Junyu Han

Fei Qi

Guangming Shi

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Enhancing Gradient Sparsity for Parametrized Motion Estimation.

[BibT_eX]

[DOI]

Junyu Han

Fei Qi

Guangming Shi

Proceedings of the British Machine Vision Conference, 2011

Junyu Han

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...