Kun Yao

Orcid: 0000-0001-7155-4076

According to our database1, Kun Yao authored at least 55 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Sequential Learning Network With Residual Blocks: Incorporating Temporal Convolutional Information Into Recurrent Neural Networks.
IEEE Trans. Cogn. Dev. Syst., February, 2024

MaskOCR: Scene Text Recognition with Masked Vision-Language Pre-training.
Trans. Mach. Learn. Res., 2024

Irregular text block recognition via decoupling visual, linguistic, and positional information.
Pattern Recognit., 2024

FullAnno: A Data Engine for Enhancing Image Comprehension of MLLMs.
CoRR, 2024

Add-SD: Rational Generation without Manual Reference.
CoRR, 2024

OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer.
CoRR, 2024

Skim then Focus: Integrating Contextual and Fine-grained Views for Repetitive Action Counting.
CoRR, 2024

LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection.
CoRR, 2024

StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond.
CoRR, 2024

Towards Unified Multi-granularity Text Detection with Interactive Attention.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

FROSTER: Frozen CLIP is A Strong Teacher for Open-Vocabulary Action Recognition.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Textual Grounding for Open-Vocabulary Visual Information Extraction in Layout-Diversified Documents.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
CAE v2: Context Autoencoder with CLIP Latent Alignment.
Trans. Mach. Learn. Res., 2023

MataDoc: Margin and Text Aware Document Dewarping for Arbitrary Boundary.
CoRR, 2023

Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation.
CoRR, 2023

HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Fast-StrucTexT: An Efficient Hourglass Transformer with Modality-guided Dynamic Token Merge for Document Understanding.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Determination of Students' Misconceptions using the Electric Circuit Concept Diagnostic (ECCD) Instrument.
Proceedings of the IEEE Frontiers in Education Conference, 2023

2022
Development of Sapphire Optical Temperature Sensing System Used in Harsh Environment Sensing.
IEEE Trans. Instrum. Meas., 2022

A GAN-based method for time-dependent cloud workload generation.
J. Parallel Distributed Comput., 2022

Real-time evaluation method of turbine rotor's thermal stress with governing stage partial admission condition at low-load.
J. Comput. Methods Sci. Eng., 2022

CAE v2: Context Autoencoder with CLIP Target.
CoRR, 2022

Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining.
CoRR, 2022

TRUST: An Accurate and End-to-End Table structure Recognizer Using Splitting-based Transformers.
CoRR, 2022

MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining.
CoRR, 2022

Decoupling Recognition from Detection: Single Shot Self-Reliant Scene Text Spotter.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Research on Recognition Algorithm of Industrial Instrument Based on Convolutional Neural Network.
Proceedings of the 6th International Symposium on Computer Science and Intelligent Control, 2022

ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Fiber Vector Magnetometer Based on Balloon-Like Fiber Structure and Magnetic Fluid.
IEEE Trans. Instrum. Meas., 2021

Design and Analysis of a Combined FBG Sensor for the Measurement of Three Parameters.
IEEE Trans. Instrum. Meas., 2021

Data Mining Technology Application in False Text Information Recognition.
Mob. Inf. Syst., 2021

The depth utilization method of condensate system's energy-storage to improve large turbine generator units' load response characteristic.
J. Comput. Methods Sci. Eng., 2021

StrucTexT: Structured Text Understanding with Multi-Modal Transformers.
CoRR, 2021

Temperature and pressure dual-parameter sensing based on Fiber Bragg Grating.
Proceedings of the 16th IEEE International Conference on Nano/Micro Engineered and Molecular Systems, 2021

StrucTexT: Structured Text Understanding with Multi-Modal Transformers.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Design and load test method of 2×250kN large-span hoist.
Proceedings of the EEET 2021: 4th International Conference on Electronics and Electrical Engineering Technology, Nanjing, China, December 3, 2021

2020
Separation characteristics between time domain and frequency domain of wireless power communication signal in wind farm.
EURASIP J. Wirel. Commun. Netw., 2020

Molecular modeling studies to discover novel mIDH2 inhibitors with high selectivity for the primary and secondary mutants.
Comput. Biol. Chem., 2020

2019
Design and Analysis of a Combined Strain-Vibration-Temperature Sensor with Two Fiber Bragg Gratings and a Trapezoidal Beam.
Sensors, 2019

Robust Deep Feature Extraction Method for Acoustic Scene Classification.
Proceedings of the 19th IEEE International Conference on Communication Technology, 2019

Improving the Spectra Recovering of Bone-Conducted Speech via Structural SIMilarity Loss Function.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
High Temperature High Sensitivity Multipoint Sensing System Based on Three Cascade Mach-Zehnder Interferometers.
Sensors, 2018

Modeling and Analysis of a Combined Stress-Vibration Fiber Bragg Grating Sensor.
Sensors, 2018

Ordinary Optical Fiber Sensor for Ultra-High Temperature Measurement Based on Infrared Radiation.
Sensors, 2018

Temperature Sensor Based on Multimode Fiber Bragg Grating.
Proceedings of the 13th IEEE Annual International Conference on Nano/Micro Engineered and Molecular Systems, 2018

2017
Application of implicit symplectic difference scheme in calculating propagation trajectory of wave in non-magnetized plasma.
J. Comput. Methods Sci. Eng., 2017

2014
PathSimExt: Revisiting PathSim in Heterogeneous Information Networks.
Proceedings of the Web-Age Information Management - 15th International Conference, 2014

2007
An Improved Multi-particle Swarm Co-evolution Algorithm.
Proceedings of the Third International Conference on Natural Computation, 2007

2006
Performance of MC-CDMA Based on Feedback Pre-equalization Method in Uplink Channel.
Proceedings of the First International Conference on Innovative Computing, Information and Control (ICICIC 2006), 30 August, 2006


  Loading...