Junyu Han

ORCID: 0009-0006-5901-3254

According to our database, Junyu Han authored at least 75 papers between 2011 and 2024.

Bibliography

2024
CSDG-FAS: Closed-Space Domain Generalization for Face Anti-spoofing.
Int. J. Comput. Vis., November, 2024

BASL-AD SLAM: A Robust Deep-Learning Feature-Based Visual SLAM System With Adaptive Motion Model.
IEEE Trans. Intell. Transp. Syst., September, 2024

BinVPR: Binary Neural Networks towards Real-Valued for Visual Place Recognition.
Sensors, July, 2024

MaskOCR: Scene Text Recognition with Masked Vision-Language Pre-training.
Trans. Mach. Learn. Res., 2024

Decoupled Pseudo-Labeling for Semi-Supervised Monocular 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

KD-DETR: Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Multi-Domain Incremental Learning for Face Presentation Attack Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
CAE v2: Context Autoencoder with CLIP Latent Alignment.
Trans. Mach. Learn. Res., 2023

Layered Rendering Diffusion Model for Zero-Shot Guided Image Synthesis.
CoRR, 2023

Accelerating Vision Transformers Based on Heterogeneous Attention Patterns.
CoRR, 2023

MataDoc: Margin and Text Aware Document Dewarping for Arbitrary Boundary.
CoRR, 2023

Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation.
CoRR, 2023

HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MSAbox: A spatially stable face detector.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Graph Contrastive Learning for Skeleton-based Action Recognition.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CFCG: Semi-Supervised Semantic Segmentation via Cross-Fusion and Contour Guidance Supervision.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Semi-DETR: Semi-Supervised Object Detection with Detection Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PSVT: End-to-End Multi-Person 3D Pose and Shape Estimation with Progressive Video Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Cyclically Disentangled Feature Translation for Face Anti-spoofing.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Encoder-Decoder Structure with Multiscale Receptive Field Block for Unsupervised Depth Estimation from Monocular Video.
Remote. Sens., 2022

CAE v2: Context Autoencoder with CLIP Target.
CoRR, 2022

Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling.
CoRR, 2022

Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining.
CoRR, 2022

MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining.
CoRR, 2022

Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

StyleSwap: Style-Based Generator Empowers Robust Face Swapping.
Proceedings of the Computer Vision - ECCV 2022, 2022

UFO: Unified Feature Optimization.
Proceedings of the Computer Vision - ECCV 2022, 2022

Few-Shot Font Generation by Learning Fine-Grained Local Styles.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Few-Shot Head Swapping in the Wild.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Expressive Talking Head Generation with Granular Audio-Visual Control.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MobileFaceSwap: A Lightweight Framework for Video Face Swapping.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
A novel loop closure detection method with the combination of points and lines based on information entropy.
J. Field Robotics, 2021

StrucTexT: Structured Text Understanding with Multi-Modal Transformers.
CoRR, 2021

StrucTexT: Structured Text Understanding with Multi-Modal Transformers.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Dynamic Class Queue for Large Scale Face Recognition in the Wild.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

FaceController: Controllable Attribute Editing for Face in the Wild.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Real Image Super Resolution Via Heterogeneous Model using GP-NAS.
CoRR, 2020

NTIRE 2020 Challenge on Real Image Denoising: Dataset, Methods and Results.
CoRR, 2020

Learning Generalized Spoof Cues for Face Anti-spoofing.
CoRR, 2020

Towards Accurate Scene Text Recognition with Semantic Reasoning Networks.
CoRR, 2020

Learning Global Structure Consistency for Robust Object Tracking.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Real Image Super Resolution via Heterogeneous Model Ensemble Using GP-NAS.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Towards Accurate Scene Text Recognition With Semantic Reasoning Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

HAMBox: Delving Into Mining High-Quality Anchors on Face Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
HAMBox: Delving into Online High-quality Anchors Mining for Detecting Outer Faces.
CoRR, 2019

PyramidBox++: High Performance Detector for Finding Tiny Face.
CoRR, 2019

Detecting Text in the Wild with Deep Character Embedding Network.
CoRR, 2019

Editing Text in the Wild.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

An End-to-End Video Text Detector with Online Tracking.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

ICDAR 2019 Competition on Large-Scale Street View Text with Partial Labeling - RRC-LSVT.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

EATEN: Entity-Aware Attention for Single Shot Visual Text Extraction.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text - RRC-ArT.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

ACFNet: Attentional Class Feature Network for Semantic Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Chinese Street View Text: Large-Scale Chinese Text Reading With Partially Supervised Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
TextNet: Irregular Text Reading from Images with an End-to-End Trainable Network.
Proceedings of the Computer Vision - ACCV 2018, 2018

Detecting Text in the Wild with Deep Character Embedding Network.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
WordSup: Exploiting Word Annotations for Character Based Text Detection.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Vision-based illegal human ladder climbing action recognition in substation.
Proceedings of the Ninth International Conference on Advanced Computational Intelligence, 2017

2016
Context-aware mathematical expression recognition: An end-to-end framework and a benchmark.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

STAR-Net: A SpaTial Attention Residue Network for Scene Text Recognition.
Proceedings of the British Machine Vision Conference 2016, 2016

2013
Structure guided fusion for depth map inpainting.
Pattern Recognit. Lett., 2013

2011
Gradient sparsity for piecewise continuous optical flow estimation.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Enhancing Gradient Sparsity for Parametrized Motion Estimation.
Proceedings of the British Machine Vision Conference, 2011

