Yuliang Liu

Orcid: 0000-0003-1404-2239

According to our database1, Yuliang Liu authored at least 135 papers between 2001 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Toward real text manipulation detection: New dataset and new solution.
Pattern Recognit., 2025

Enhancing scene text detectors with realistic text image synthesis using diffusion models.
Comput. Vis. Image Underst., 2025

2024
Turning a CLIP Model Into a Scene Text Spotter.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2024

A Global Seawater Density Distribution Model Using a Convolutional Neural Network.
Sensors, March, 2024

Improving Handwritten Mathematical Expression Recognition via Similar Symbol Distinguishing.
IEEE Trans. Multim., 2024

Accountable Textual-Visual Chat Learns to Reject Human Instructions in Image Re-creation.
Trans. Mach. Learn. Res., 2024

Realistic Pulse Waveforms Estimation via Contrastive Learning in Remote Photoplethysmography.
IEEE Trans. Instrum. Meas., 2024

R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models.
CoRR, 2024

PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling.
CoRR, 2024

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models.
CoRR, 2024

Mini-Monkey: Multi-Scale Adaptive Cropping for Multimodal Large Language Models.
CoRR, 2024

Multi-Prompting Decoder Helps Better Language Understanding.
CoRR, 2024

MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks.
CoRR, 2024

PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling.
CoRR, 2024

MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering.
CoRR, 2024

VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization.
CoRR, 2024

TextSquare: Scaling up Text-Centric Visual Instruction Tuning.
CoRR, 2024

TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document.
CoRR, 2024

An open dataset for oracle bone script recognition and decipherment.
CoRR, 2024

An open dataset for the evolution of oracle bone characters: EVOBC.
CoRR, 2024

SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting.
CoRR, 2024

Injected Harmonic Feature Based Protection Scheme for Active Distribution Networks With High Proportion IIDGs.
IEEE Access, 2024

Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Parallelism.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Person Re-identification Method Based on Dual Feature Attention Backbone Network.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

A YOLOv7-Based Defect Detection Method for Metal Surfaces.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

Exploring the Capabilities of Large Multimodal Models on Dense Text.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

ICDAR 2024 Competition on Artistic Text Recognition.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

Knowledge Mining of Scene Text for Referring Expression Comprehension.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

The First Swahili Language Scene Text Detection and Recognition Dataset.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

Progressive Evolution from Single-Point to Polygon for Scene Text.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

OMNIPARSER: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Monkey: Image Resolution and Text Label are Important Things for Large Multi-Modal Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Bridging the Gap Between End-to-End and Two-Step Text Spotting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Deciphering Oracle Bone Language with Diffusion Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
SPTS v2: Single-Point Scene Text Spotting.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Drift Error Compensation Algorithm for Heterodyne Optical Seawater Refractive Index Monitoring of Unstable Signals.
Sensors, October, 2023

SIERRA: A robust bilateral feature upsampler for dense prediction.
Comput. Vis. Image Underst., October, 2023

ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks.
CoRR, 2023

KwaiYiiMath: Technical Report.
CoRR, 2023

Box-DETR: Understanding and Boxing Conditional Spatial Queries.
CoRR, 2023

On Point Affiliation in Feature Upsampling.
CoRR, 2023

Looking and Listening: Audio Guided Text Recognition.
CoRR, 2023

On the Hidden Mystery of OCR in Large Multimodal Models.
CoRR, 2023

New Benchmarks for Accountable Text-based Visual Re-creation.
CoRR, 2023

Colossal-Auto: Unified Automation of Parallelization and Activation Checkpoint for Large-scale Models.
CoRR, 2023

Unsupervised Readability Assessment via Learning from Weak Readability Signals.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel Training.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

A Faster and More Robust Momentum Observer for Robot Collision Detection Based on Loop Shaping Techniques.
Proceedings of the Intelligent Robotics and Applications - 16th International Conference, 2023

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ICDAR 2023 Competition on Reading the Seal Title.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ICDAR 2023 Competition on Detecting Tampered Text in Images.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

TextREC: A Dataset for Referring Expression Comprehension with Reading Comprehension.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ICDAR 2023 Competition on Recognition of Multi-line Handwritten Mathematical Expressions.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TCM Automatic Diagnosis System Based on Knowledge Graph and BERT.
Proceedings of the 4th International Conference on Artificial Intelligence and Computer Engineering, 2023

Effects of the Conversation and Recommendation Mechanism on Chatbots' Recommendation Effectiveness.
Proceedings of the HCI International 2023 - Late Breaking Papers, 2023

Turning a CLIP Model into a Scene Text Detector.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Surface Defect Detection of Heat Sink Based on Lightweight Fully Convolutional Network.
IEEE Trans. Instrum. Meas., 2022

Semantic Segmentation of Hyperspectral Remote Sensing Images Based on PSE-UNet Model.
Sensors, 2022

Arbitrarily shaped scene text detection with dynamic convolution.
Pattern Recognit., 2022

ABCNet v2: Adaptive Bezier-Curve Network for Real-Time End-to-End Text Spotting.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Structured Multimodal Attentions for TextVQA.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

PageNet: Towards End-to-End Weakly Supervised Page-Level Handwritten Chinese Text Recognition.
Int. J. Comput. Vis., 2022

DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation.
CoRR, 2022

MSDS: A Large-Scale Chinese Signature and Token Digit String Dataset for Handwriting Verification.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SAPA: Similarity-Aware Point Affiliation for Feature Upsampling.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SPTS: Single-Point Text Spotting.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

ICPR 2022 Challenge on Multi-Modal Subtitle Recognition.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context.
Proceedings of the Computer Vision - ECCV 2022, 2022

SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
OPMP: An Omnidirectional Pyramid Mask Proposal Network for Arbitrary-Shape Scene Text Detection.
IEEE Trans. Multim., 2021

Separating Content from Style Using Adversarial Learning for Recognizing Text in the Wild.
Int. J. Comput. Vis., 2021

Exploring the Capacity of an Orderless Box Discretization Network for Multi-orientation Scene Text Detection.
Int. J. Comput. Vis., 2021

SPTS: Single-Point Text Spotting.
CoRR, 2021

ICDAR 2021 Competition on Integrated Circuit Text Spotting and Aesthetic Assessment.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

2020
EraseNet: End-to-End Text Removal in the Wild.
IEEE Trans. Image Process., 2020

Arbitrarily Shaped Scene Text Detection With a Mask Tightness Text Detector.
IEEE Trans. Image Process., 2020

Construction of interpretive structure modeling for the influencing factors of emergency industry development.
J. Intell. Fuzzy Syst., 2020

Recognition of distorted QR codes with one missing position detection pattern.
IET Image Process., 2020

Robust License Plate Recognition With Shared Adversarial Training Network.
IEEE Access, 2020

A Financial Transaction Methods Based on MapReduce Technology and Blockchain.
Proceedings of the 3rd International Conference on Smart BlockChain, 2020

A Data Management Method Based on Blockchain Technology.
Proceedings of the 3rd International Conference on Smart BlockChain, 2020

Mechanical Analysis and Dynamic Simulation of Ship Micro In-pipe Robot.
Proceedings of the Artificial Intelligence and Security - 6th International Conference, 2020

ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Using reflections and questioning to engage and challenge online graduate learners in education.
Res. Pract. Technol. Enhanc. Learn., 2019

Automatic labeling of large amounts of handwritten characters with gate-guided dynamic deep learning.
Pattern Recognit. Lett., 2019

Curved scene text detection via transverse and longitudinal sequence connection.
Pattern Recognit., 2019

Weakly supervised precise segmentation for historical document images.
Neurocomputing, 2019

Exploring the Capacity of Sequential-free Box Discretization Network for Omnidirectional Scene Text Detection.
CoRR, 2019

Unsupervised Learning Grouping-Based Resampling for Particle Filters.
IEEE Access, 2019

Comparative Object Similarity Learning-Based Robust Visual Tracking.
IEEE Access, 2019

Detecting Diseases by Human-Physiological-Parameter-Based Deep Learning.
IEEE Access, 2019

Omnidirectional Scene Text Detection with Sequential-free Box Discretization.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

ICDAR 2019 Competition on Large-Scale Street View Text with Partial Labeling - RRC-LSVT.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text - RRC-ArT.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Aggregation Cross-Entropy for Sequence Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Tightness-Aware Evaluation Protocol for Scene Text Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

EnsNet: Ensconce Text in the Wild.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

DeRPN: Taking a Further Step toward More General Object Detection.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
A New CNN-Based Method for Multi-Directional Car License Plate Detection.
IEEE Trans. Intell. Transp. Syst., 2018

ICPR2018 Contest on Robust Reading for Multi-Type Web Images.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Feature Enhancement Network: A Refined Scene Text Detector.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Robust Adaptive Learning Control for Spacecraft Autonomous Proximity Maneuver.
Int. J. Pattern Recognit. Artif. Intell., 2017

Detecting Curve Text in the Wild: New Dataset and New Solution.
CoRR, 2017

Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Optimization of non-linear image registration in AFNI.
Proceedings of the XSEDE16 Conference on Diversity, 2016

Two intersections traffic signal control method based on ADHDP.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2016

2015
A Full-Brain, Bootstrapped Analysis of Diffusion Tensor Imaging Robustly Differentiates Parkinson Disease from Healthy Controls.
Neuroinformatics, 2015

An exploratory study of pre-service teachers' features related to their online behaviors and Problematic Internet Use in the United States.
Comput. Hum. Behav., 2015

Hard Drive Failure Prediction Using Big Data.
Proceedings of the 34th IEEE Symposium on Reliable Distributed Systems Workshop, 2015

fMRI image registration with AFNI's 3dQwarp.
Proceedings of the 6th ACM Conference on Bioinformatics, 2015

2014
A lightweight possession proof scheme for outsourced files in mobile cloud computing based on chameleon hash function.
Int. J. Comput. Sci. Eng., 2014

A rapid fluorescence lifetime image acquistion method based on time-gated fluorescence lifetime imaging microscopy.
Proceedings of the 2nd International Conference on Systems and Informatics, 2014

A practical indoor visible light communication system.
Proceedings of the 9th International Symposium on Communication Systems, 2014

2011
Action potential initial dynamical control of a minimum neuron model.
Proceedings of the 4th International Conference on Biomedical Engineering and Informatics, 2011

Influence of the twirling frequency on the firing patterns of the evoked spike trains.
Proceedings of the 4th International Conference on Biomedical Engineering and Informatics, 2011

Flash trajectory imaging of target 3D motion.
Proceedings of the Three-Dimensional Imaging, 2011

2010
Effect of the Twirling Frequency on Firing Patterns Evoked by Acupuncture.
Proceedings of the Life System Modeling and Intelligent Computing, 2010

Automatic irrigation system based on wireless network.
Proceedings of the 8th IEEE International Conference on Control and Automation, 2010

Quantum-behaved particle swarm optimization -ANN based identification method for typical power quality disturbance.
Proceedings of the 8th IEEE International Conference on Control and Automation, 2010

2009
Controlling Hopf bifurcation in Fluid Flow Model of Internet Congestion Control System.
Int. J. Bifurc. Chaos, 2009

An Improved Internet Congestion Control Algorithm.
Proceedings of the Fifth International Conference on Natural Computation, 2009

2008
Consensus Research of Modified Multiagent Networks with Time Delay.
Proceedings of the Fourth International Conference on Natural Computation, 2008

Consensus of Multiagent Networks with Time Delay.
Proceedings of the Fourth International Conference on Natural Computation, 2008

2007
Hopf bifurcation analysis in a dual model of Internet congestion control algorithm with communication delay
CoRR, 2007

Controlling Delay-induced Hopf bifurcation in Internet congestion control system
CoRR, 2007

2004
A preliminary study of the impact of online instruction on teachers' technology concerns.
Br. J. Educ. Technol., 2004

Application of a Wavelet Adaptive Filter Based on Neural Network to Minimize Distortion of the Pulsatile Spectrum.
Proceedings of the Advances in Neural Networks, 2004

2001
How Do Frequency and Duration of Messaging Affect Impression Development in Computer-Mediated Communication?
J. Univers. Comput. Sci., 2001


  Loading...