Yu Zhou
Orcid: 0000-0003-4188-9953Affiliations:
- Nankai University, Tianjin, China
- Chinese Academy of Sciences, Institute of Information Engineering, Beijing, China (former)
- Shanghai Jiao Tong University, Shanghai, China (former)
- Harbin Institute of Technology, Heilongjiang, China (former)
According to our database1,
Yu Zhou
authored at least 93 papers
between 2007 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
IEEE Trans. Multim., 2024
IEEE Signal Process. Lett., 2024
Char-SAM: Turning Segment Anything Model into Scene Text Segmentation Annotator with Character-level Visual Prompts.
CoRR, 2024
LDP: Generalizing to Multilingual Visual Information Extraction by Language Decoupled Pretraining.
CoRR, 2024
CoRR, 2024
Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition.
CoRR, 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model.
CoRR, 2024
Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing.
CoRR, 2024
Show Exemplars and Tell Me What You See: In-Context Learning with Frozen Large Language Models for TextVQA.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Robust Multimodal Sentiment Analysis of Image-Text Pairs by Distribution-Based Feature Recovery and Fusion.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Bridging Visual Affective Gap: Borrowing Textual Knowledge by Learning from Noisy Image-Text Pairs.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024
2023
IEEE Trans. Neural Networks Learn. Syst., December, 2023
Beyond OCR + VQA: Towards end-to-end reading and reasoning for robust and accurate textvqa.
Pattern Recognit., June, 2023
CoRR, 2023
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene Text Detector.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
EI<sup>2</sup>SR: Learning an Enhanced Intra-Instance Semantic Relationship for Arbitrary-Shaped Scene Text Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
RD-IOD: Two-Level Residual-Distillation-Based Triple-Network for Incremental Object Detection.
ACM Trans. Multim. Comput. Commun. Appl., 2022
ACM Trans. Multim. Comput. Commun. Appl., 2022
Deep collaborative multi-task network: A human decision process inspired model for hierarchical image classification.
Pattern Recognit., 2022
Pattern Recognit., 2022
CoRR, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
TPSNet: Reverse Thinking of Thin Plate Splines for Arbitrary Shape Scene Text Representation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
MaMiCo: Macro-to-Micro Semantic Correspondence for Self-supervised Video Representation Learning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022
Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2022, 2022
Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021
Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Mask is All You Need: Rethinking Mask R-CNN for Dense and Arbitrary-Shaped Scene Text Detection.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the 33rd IEEE International Conference on Tools with Artificial Intelligence, 2021
FC<sup>2</sup>RN: A Fully Convolutional Corner Refinement Network for Accurate Multi-Oriented Scene Text Detection.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2021, 2021
Which and Where to Focus: A Simple yet Accurate Framework for Arbitrary-Shaped Nearby Text Detection in Scene Images.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2021, 2021
2020
Two-Level Residual Distillation based Triple Network for Incremental Object Detection.
CoRR, 2020
CoRR, 2020
FC2RN: A Fully Convolutional Corner Refinement Network for Accurate Multi-Oriented Scene Text Detection.
CoRR, 2020
Video Playback Rate Perception for Self-supervisedSpatio-Temporal Representation Learning.
CoRR, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Video Playback Rate Perception for Self-Supervised Spatio-Temporal Representation Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Proceedings of the PRICAI 2019: Trends in Artificial Intelligence, 2019
Curved Text Detection in Natural Scene Images with Semi- and Weakly-Supervised Learning.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019
2016
IEEE/ACM Trans. Netw., 2016
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016
2015
Unsupervised adaptive sign language recognition based on hypothesis comparison guided cross validation and linguistic prior filtering.
Neurocomputing, 2015
Summarizing surveillance videos with local-patch-learning-based abnormality detection, blob sequence optimization, and type-based synopsis.
Neurocomputing, 2015
Semantics constrained dictionary learning for signer-independent sign language recognition.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015
Weakly Supervised Metric Learning towards Signer Adaptation for Sign Language Recognition.
Proceedings of the British Machine Vision Conference 2015, 2015
2014
Visual Similarity Based Anti-phishing with the Combination of Local and Global Features.
Proceedings of the 13th IEEE International Conference on Trust, 2014
Proceedings of the 15th International Conference on Parallel and Distributed Computing, 2014
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Text localization in natural scene images with stroke width histogram and superpixel.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
Improved human head and shoulder detection with local main gradient and tracklets-based feature.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014
2011
A new global-based video enhancement algorithm by fusing features of multiple region-of-interests.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011
2010
IEEE Signal Process. Lett., 2010
2008
Mahalanobis distance based Polynomial Segment Model for Chinese Sign Language Recogniton.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
2007
Signer Adaptation Based on Etyma for Large Vocabulary Chinese Sign Language Recognition.
Proceedings of the Advances in Multimedia Information Processing, 2007