Inverse-Like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling.
IEEE Trans. Image Process., 2024
Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation.
CoRR, 2024
Video-Language Alignment via Spatio-Temporal Graph Transformer.
CoRR, 2024
Follow-Your-Pose v2: Multiple-Condition Guided Character Image Animation for Stable Pose Control.
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts.
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation.
,
,
,
,
,
,
,
,
,
,
Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment.
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Unsupervised Cross-Modal Hashing With Modality-Interaction.
IEEE Trans. Circuits Syst. Video Technol., September, 2023
Deep Cross-Modal Proxy Hashing.
IEEE Trans. Knowl. Data Eng., July, 2023
Unsupervised Hashing with Semantic Concept Mining.
Proc. ACM Manag. Data, 2023
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment.
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
Global and Local Semantic Completion Learning for Vision-Language Pre-training.
CoRR, 2023
Img2Vec: A Teacher of High Token-Diversity Helps Masked AutoEncoders.
CoRR, 2023
MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model.
CoRR, 2022
Adaptive Perception Transformer for Temporal Action Localization.
CoRR, 2022
DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection.
CoRR, 2022
Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization.
CoRR, 2022
Egocentric Video-Language Pretraining @ Ego4D Challenge 2022.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2022
Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022.
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2022
HunYuan_tvr for Text-Video Retrievial.
,
,
,
,
,
,
,
,
,
,
CoRR, 2022
Egocentric Video-Language Pretraining.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
CAliC: Accurate and Efficient Image-Text Retrieval via Contrastive Alignment and Visual Contexts Modeling.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Deep Unsupervised Hashing with Latent Semantic Components.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
Detecting Text in Scene and Traffic Guide Panels With Attention Anchor Mechanism.
IEEE Trans. Intell. Transp. Syst., 2021
Correction to: Prediction of urban water accumulation points and water accumulation process based on machine learning.
Earth Sci. Informatics, 2021
Prediction of urban water accumulation points and water accumulation process based on machine learning.
Earth Sci. Informatics, 2021
Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
HAM: Hidden Anchor Mechanism for Scene Text Detection.
IEEE Trans. Image Process., 2020
Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Detecting Advertising Materials via Multi-Scale Instance Segmentation Network.
Aust. J. Intell. Inf. Process. Syst., 2019
AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition.
CoRR, 2017
Weighted Cache Location Problem with Identical Servers.
J. Appl. Math., 2014
An Approach to Online Recommendation of Products with High Price-Performance Ratios Based on a Customized Price-Dominance Relationship.
J. Softw., 2013
On the 2-MRS Problem in a Tree with Unreliable Edges.
J. Appl. Math., 2013
Genetic Algorithm-Based Evaluation Model of Teaching Quality.
Proceedings of the Third International Symposium on Intelligent Information Technology and Security Informatics, 2010
Singular Points Detection Based on Zero-Pole Model in Fingerprint Images.
IEEE Trans. Pattern Anal. Mach. Intell., 2008