2024

Inverse-Like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling.

[DOI]

Shi-Xue Zhang

Chun Yang

IEEE Trans. Image Process., 2024

HunyuanVideo: A Systematic Framework For Large Video Generative Models.

[DOI]

CoRR, 2024

Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation.

[DOI]

CoRR, 2024

Video-Language Alignment via Spatio-Temporal Graph Transformer.

[DOI]

CoRR, 2024

Follow-Your-Pose v2: Multiple-Condition Guided Character Image Animation for Stable Pose Control.

[DOI]

CoRR, 2024

Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts.

[DOI]

CoRR, 2024

Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation.

[DOI]

Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment.

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Unsupervised Cross-Modal Hashing With Modality-Interaction.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., September, 2023

Deep Cross-Modal Proxy Hashing.

[DOI]

IEEE Trans. Knowl. Data Eng., July, 2023

Unsupervised Hashing with Semantic Concept Mining.

[DOI]

Proc. ACM Manag. Data, 2023

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment.

[DOI]

CoRR, 2023

Global and Local Semantic Completion Learning for Vision-Language Pre-training.

[DOI]

CoRR, 2023

Img2Vec: A Teacher of High Token-Diversity Helps Masked AutoEncoders.

[DOI]

CoRR, 2023

MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model.

[DOI]

CoRR, 2022

Adaptive Perception Transformer for Temporal Action Localization.

[DOI]

CoRR, 2022

DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection.

[DOI]

CoRR, 2022

Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization.

[DOI]

CoRR, 2022

Egocentric Video-Language Pretraining @ Ego4D Challenge 2022.

[DOI]

CoRR, 2022

Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022.

[DOI]

CoRR, 2022

Egocentric Video-Language Pretraining.

[DOI]

CoRR, 2022

HunYuan_tvr for Text-Video Retrievial.

[DOI]

CoRR, 2022

Egocentric Video-Language Pretraining.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

CAliC: Accurate and Efficient Image-Text Retrieval via Contrastive Alignment and Visual Contexts Modeling.

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Deep Unsupervised Hashing with Latent Semantic Components.

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Detecting Text in Scene and Traffic Guide Panels With Attention Anchor Mechanism.

[DOI]

IEEE Trans. Intell. Transp. Syst., 2021

Correction to: Prediction of urban water accumulation points and water accumulation process based on machine learning.

[DOI]

Earth Sci. Informatics, 2021

Prediction of urban water accumulation points and water accumulation process based on machine learning.

[DOI]

Earth Sci. Informatics, 2021

Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection.

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

HAM: Hidden Anchor Mechanism for Scene Text Detection.

[DOI]

IEEE Trans. Image Process., 2020

Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection.

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Detecting Advertising Materials via Multi-Scale Instance Segmentation Network.

[DOI]

Aust. J. Intell. Inf. Process. Syst., 2019

2017

AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition.

[DOI]

CoRR, 2017

2014

Weighted Cache Location Problem with Identical Servers.

[DOI]

Hongfa Wang

Wei Ding

J. Appl. Math., 2014

2013

An Approach to Online Recommendation of Products with High Price-Performance Ratios Based on a Customized Price-Dominance Relationship.

[DOI]

Hongfa Wang

Chen Xing

J. Softw., 2013

On the 2-MRS Problem in a Tree with Unreliable Edges.

[DOI]

J. Appl. Math., 2013

2010

Genetic Algorithm-Based Evaluation Model of Teaching Quality.

[DOI]

Proceedings of the Third International Symposium on Intelligent Information Technology and Security Informatics, 2010

2008

Singular Points Detection Based on Zero-Pole Model in Fingerprint Images.

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2008