2025
Multi-robot Collaborative 3D Path Planning Based On Game Theory and Particle Swarm Optimization Hybrid Method.
J. Supercomput., February, 2025
High-Accuracy, Wide-Dynamic Range Continuous FBG Interrogator Based on an AWG.
IEEE Trans. Instrum. Meas., 2025
Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Sequential Learning Network With Residual Blocks: Incorporating Temporal Convolutional Information Into Recurrent Neural Networks.
IEEE Trans. Cogn. Dev. Syst., February, 2024
MaskOCR: Scene Text Recognition with Masked Vision-Language Pre-training.
Trans. Mach. Learn. Res., 2024
Irregular text block recognition via decoupling visual, linguistic, and positional information.
Pattern Recognit., 2024
FullAnno: A Data Engine for Enhancing Image Comprehension of MLLMs.
CoRR, 2024
Add-SD: Rational Generation without Manual Reference.
CoRR, 2024
OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer.
CoRR, 2024
Skim then Focus: Integrating Contextual and Fine-grained Views for Repetitive Action Counting.
CoRR, 2024
LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond.
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Towards Unified Multi-granularity Text Detection with Interactive Attention.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
FROSTER: Frozen CLIP is A Strong Teacher for Open-Vocabulary Action Recognition.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Textual Grounding for Open-Vocabulary Visual Information Extraction in Layout-Diversified Documents.
Proceedings of the Computer Vision - ECCV 2024, 2024
2023
CAE v2: Context Autoencoder with CLIP Latent Alignment.
,
,
,
,
,
,
,
,
,
,
,
,
Trans. Mach. Learn. Res., 2023
MataDoc: Margin and Text Aware Document Dewarping for Arbitrary Boundary.
CoRR, 2023
Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation.
CoRR, 2023
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Fast-StrucTexT: An Efficient Hourglass Transformer with Modality-guided Dynamic Token Merge for Document Understanding.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023
Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Determination of Students' Misconceptions using the Electric Circuit Concept Diagnostic (ECCD) Instrument.
Proceedings of the IEEE Frontiers in Education Conference, 2023
2022
Development of Sapphire Optical Temperature Sensing System Used in Harsh Environment Sensing.
IEEE Trans. Instrum. Meas., 2022
A GAN-based method for time-dependent cloud workload generation.
J. Parallel Distributed Comput., 2022
Real-time evaluation method of turbine rotor's thermal stress with governing stage partial admission condition at low-load.
J. Comput. Methods Sci. Eng., 2022
CAE v2: Context Autoencoder with CLIP Target.
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2022
Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2022
TRUST: An Accurate and End-to-End Table structure Recognizer Using Splitting-based Transformers.
CoRR, 2022
MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining.
CoRR, 2022
Decoupling Recognition from Detection: Single Shot Self-Reliant Scene Text Spotter.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Research on Recognition Algorithm of Industrial Instrument Based on Convolutional Neural Network.
Proceedings of the 6th International Symposium on Computer Science and Intelligent Control, 2022
ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval.
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
Fiber Vector Magnetometer Based on Balloon-Like Fiber Structure and Magnetic Fluid.
IEEE Trans. Instrum. Meas., 2021
Design and Analysis of a Combined FBG Sensor for the Measurement of Three Parameters.
IEEE Trans. Instrum. Meas., 2021
Data Mining Technology Application in False Text Information Recognition.
Mob. Inf. Syst., 2021
The depth utilization method of condensate system's energy-storage to improve large turbine generator units' load response characteristic.
J. Comput. Methods Sci. Eng., 2021
StrucTexT: Structured Text Understanding with Multi-Modal Transformers.
CoRR, 2021
Temperature and pressure dual-parameter sensing based on Fiber Bragg Grating.
Proceedings of the 16th IEEE International Conference on Nano/Micro Engineered and Molecular Systems, 2021
StrucTexT: Structured Text Understanding with Multi-Modal Transformers.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Design and load test method of 2×250kN large-span hoist.
Proceedings of the EEET 2021: 4th International Conference on Electronics and Electrical Engineering Technology, Nanjing, China, December 3, 2021
2020
Separation characteristics between time domain and frequency domain of wireless power communication signal in wind farm.
EURASIP J. Wirel. Commun. Netw., 2020
Molecular modeling studies to discover novel mIDH2 inhibitors with high selectivity for the primary and secondary mutants.
Comput. Biol. Chem., 2020
2019
Design and Analysis of a Combined Strain-Vibration-Temperature Sensor with Two Fiber Bragg Gratings and a Trapezoidal Beam.
Sensors, 2019
Robust Deep Feature Extraction Method for Acoustic Scene Classification.
Proceedings of the 19th IEEE International Conference on Communication Technology, 2019
Improving the Spectra Recovering of Bone-Conducted Speech via Structural SIMilarity Loss Function.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
2018
High Temperature High Sensitivity Multipoint Sensing System Based on Three Cascade Mach-Zehnder Interferometers.
Sensors, 2018
Modeling and Analysis of a Combined Stress-Vibration Fiber Bragg Grating Sensor.
Sensors, 2018
Ordinary Optical Fiber Sensor for Ultra-High Temperature Measurement Based on Infrared Radiation.
Sensors, 2018
Temperature Sensor Based on Multimode Fiber Bragg Grating.
Proceedings of the 13th IEEE Annual International Conference on Nano/Micro Engineered and Molecular Systems, 2018
2017
Application of implicit symplectic difference scheme in calculating propagation trajectory of wave in non-magnetized plasma.
J. Comput. Methods Sci. Eng., 2017
2014
PathSimExt: Revisiting PathSim in Heterogeneous Information Networks.
Proceedings of the Web-Age Information Management - 15th International Conference, 2014
2007
An Improved Multi-particle Swarm Co-evolution Algorithm.
Proceedings of the Third International Conference on Natural Computation, 2007
2006
Performance of MC-CDMA Based on Feedback Pre-equalization Method in Uplink Channel.
Proceedings of the First International Conference on Innovative Computing, Information and Control (ICICIC 2006), 30 August, 2006