2025
Decoupling and Interaction: task coordination in single-stage object detection.
Multim. Tools Appl., March, 2025
Data-Free Post-Training Quantization with Block-wise Enhanced Sample Generation.
Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025
Tool Playgrounds: A Comprehensive and Analyzable Benchmark for LLM Tool Invocation.
Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025
AtomNet: Designing Tiny Models from Operators Under Extreme MCU Constraints.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Improving Multi-Type License Plate Recognition via Learning Globally and Contrastively.
IEEE Trans. Intell. Transp. Syst., September, 2024
Sample Weighting with Hierarchical Equalization Loss for Dense Object Detection.
IEEE Trans. Multim., 2024
M3TTS: Multi-modal text-to-speech of multi-scale style control for dubbing.
Pattern Recognit. Lett., 2024
Improving license plate recognition via diverse stylistic plate generation.
Pattern Recognit. Lett., 2024
Integrated Recognition of Arbitrary-Oriented Multi-line Billet Number.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024
Irregular License Plate Recognition via Global Information Integration.
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024
Improving Small License Plate Detection with Bidirectional Vehicle-Plate Relation.
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024
Towards Low-resource License Plate Recognition via Feature Shuffling.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
HQOD: Harmonious Quantization for Object Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
Multi-task Learning for License Plate Recognition in Unconstrained Scenarios.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024
2023
Hypersphere guided embedding for masked face recognition.
Pattern Recognit. Lett., October, 2023
Self-supervised contrastive speaker verification with nearest neighbor positive instances.
Pattern Recognit. Lett., September, 2023
Faster SCDNet: Real-Time Semantic Segmentation Network with Split Connection and Flexible Dilated Convolution.
Sensors, March, 2023
Feature Enhancement and Reconstruction for Small Object Detection.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023
LiteHandNet: A Lightweight Hand Pose Estimation Network via Structural Feature Enhancement.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023
Rethinking Speech Recognition with A Multimodal Perspective via Acoustic and Semantic Cooperative Decoding.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Complex Glyph Enhancement for License Plate Generation.
Proceedings of the Image and Graphics - 12th International Conference, 2023
End-to-End Multi-line License Plate Recognition with Cascaded Perception.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023
Self-Convolution for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023
2022
Depth-Guided Progressive Network for Object Detection.
IEEE Trans. Intell. Transp. Syst., 2022
SCDNet: Real-time Semantic Segmentation Network with Split Connection and Flexible Dilated Convolution.
Proceedings of the IEEE Smartworld, 2022
Vertex Adjustment Loss for Multidirectional License Plate Detection and Recognition.
Proceedings of the IEEE Smartworld, 2022
Anchor-Free Location Refinement Network for Small License Plate Detection.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022
DANet: Dynamic Attention to Spoof Patterns for Face Anti-Spoofing.
Proceedings of the 26th International Conference on Pattern Recognition, 2022
Semi-Supervised Fine-Grained Classification with Web Data via Noisy Sample Selection.
Proceedings of the 26th International Conference on Pattern Recognition, 2022
Adaptive Rounding Compensation for Post-training Quantization.
Proceedings of the Neural Information Processing - 29th International Conference, 2022
Non-Autoregressive Transformer with Unified Bidirectional Decoder for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022
2021
Scale-Invariant Multidirectional License Plate Detection with the Network Combining Indirect and Direct Branches.
Sensors, 2021
End-to-end trainable network for degraded license plate detection via vehicle-plate relation mining.
Neurocomputing, 2021
An Efficient Temporal Model for Small-Footprint Keyword Spotting.
Proceedings of the 7th IEEE International Conference on Network Intelligence and Digital Content, 2021
Robust Chinese License Plate Generation via Foreground Text and Background Separation.
Proceedings of the Image and Graphics - 11th International Conference, 2021
Fast Recognition for Multidirectional and Multi-type License Plates with 2D Spatial Attention.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021
2020
Simultaneous End-to-End Vehicle and License Plate Detection With Multi-Branch Attention Neural Network.
IEEE Trans. Intell. Transp. Syst., 2020
Recurrent Graph Convolutional Network for Skeleton-Based Abnormal Driving Behavior Recognition.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020
Semantic Bilinear Pooling for Fine-Grained Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
2016
Bloody Image Classification with Global and Local Features.
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016