Jianlong Fu
Orcid: 0000-0002-1025-2012
According to our database1,
Jianlong Fu
authored at least 132 papers
between 2010 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
Proceedings of the MultiMedia Modeling, 2025
2024
ACM Trans. Multim. Comput. Commun. Appl., December, 2024
CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation.
CoRR, 2024
CoRR, 2024
Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024
Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the IEEE Gaming, Entertainment, and Media Conference, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Capture Concept Through Comparison: Vision-and-Language Representation Learning with Intrinsic Information Mining.
Proceedings of the Computer Vision - ACCV 2024, 2024
2023
Learning Degradation-Robust Spatiotemporal Frequency-Transformer for Video Super-Resolution.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023
IEEE Trans. Vis. Comput. Graph., July, 2023
Weakly-supervised pre-training for 3D human pose estimation via perspective knowledge.
Pattern Recognit., July, 2023
Int. J. Comput. Vis., May, 2023
IEEE Trans. Multim., 2023
IEEE Trans. Image Process., 2023
IEEE Trans. Image Process., 2023
IEEE Trans. Pattern Anal. Mach. Intell., 2023
Pave the Way to Grasp Anything: Transferring Foundation Models for Universal Pick-Place Robots.
CoRR, 2023
CoRR, 2023
VideoFactory: Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation.
CoRR, 2023
Unified Multi-Modal Latent Diffusion for Joint Subject and Text Conditional Image Generation.
CoRR, 2023
MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Learning Data-Driven Vector-Quantized Degradation Model for Animation Video Super-Resolution.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE International Conference on Consumer Electronics, 2023
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
Multi-Scale 2D Temporal Adjacency Networks for Moment Localization With Natural Language.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Guest Editorial: Introduction to the Special Section on Fine-Grained Visual Categorization.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Learning Spatiotemporal Frequency-Transformer for Low-Quality Video Super-Resolution.
CoRR, 2022
CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment.
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
GRIT-VLP: Grouped Mini-batch Sampling for Efficient Vision and Language Pre-training.
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the Computer Vision - ACCV 2022, 2022
2021
IEEE Trans. Circuits Syst. Video Technol., 2021
Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training.
CoRR, 2021
One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking.
CoRR, 2021
Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
A Picture is Worth a Thousand Words: A Unified System for Diverse Captions and Rich Images Generation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
MMPT'21: International Joint Workshop on Multi-Modal Pre-Training for Multimedia Understanding.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021
Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Seeing Out of the Box: End-to-End Pre-Training for Vision-Language Representation Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
Learning Rich Part Hierarchies With Progressive Attention Networks for Fine-Grained Image Recognition.
IEEE Trans. Image Process., 2020
IEEE Trans. Image Process., 2020
Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language.
CoRR, 2020
CoRR, 2020
360-Indoor: Towards Learning Real-World Objects in 360° Indoor Equirectangular Images.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Cream of the Crop: Distilling Prioritized Paths For One-Shot Neural Architecture Search.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Learning 2D Temporal Adjacent Networks for Moment Localization with Natural Language.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
DGCN: Dynamic Graph Convolutional Network for Efficient Multi-Person Pose Estimation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
ACM Trans. Multim. Comput. Commun. Appl., 2019
ACM Trans. Multim. Comput. Commun. Appl., 2019
Neurocomputing, 2019
CoRR, 2019
WSOD^2: Learning Bottom-up and Top-down Objectness Distillation for Weakly-supervised Object Detection.
CoRR, 2019
CoRR, 2019
CoRR, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
AI Coach: Deep Human Pose Estimation and Analysis for Personalized Athletic Training Assistance.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019
From Words to Sentences: A Progressive Learning Approach for Zero-resource Machine Translation with Visual Pivots.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Learning Recurrent Structure-Guided Attention Network for Multi-person Pose Estimation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019
WSOD2: Learning Bottom-Up and Top-Down Objectness Distillation for Weakly-Supervised Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
Looking for the Devil in the Details: Learning Trilinear Attention Sampling Network for Fine-Grained Image Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
2018
DA-GAN: Instance-level Image Translation by Deep Attention Generative Adversarial Networks (with Supplementary Materials).
CoRR, 2018
Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
What Dress Fits Me Best?: Fashion Recommendation on the Clothing Style for Personal Body Shape.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
Proceedings of the Computer Vision - ECCV 2018, 2018
DA-GAN: Instance-Level Image Translation by Deep Attention Generative Adversarial Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
Show, Reward and Tell: Automatic Generation of Narrative Paragraph From Photo Stream by Adversarial Training.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Searching Personal Photos on the Phone with Instant Visual Query Suggestion and Joint Text-Image Hashing.
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the Internet Multimedia Computing and Service, 2017
Learning Multi-attention Convolutional Neural Network for Fine-Grained Image Recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2017
Proceedings of the IEEE International Conference on Computer Vision, 2017
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-Grained Image Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
Let Your Photos Talk: Generating Narrative Paragraph for Photo Stream via Bidirectional Attention Recurrent Neural Networks.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
2016
Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network.
CoRR, 2016
Beyond Object Recognition: Visual Sentiment Analysis with Deep Coupled Adjective and Noun Neural Networks.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
2015
IEEE Trans. Circuits Syst. Video Technol., 2015
Finding logos in real-world images with point-context representation-based region search.
Multim. Syst., 2015
Proceedings of the 24th International Conference on World Wide Web, 2015
Relaxing from Vocabulary: Robust Weakly-Supervised Deep Learning for Vocabulary-Free Image Tagging.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015
Proceedings of the 8th Eurographics Workshop on 3D Object Retrieval, 2015
2014
Proceedings of the Computer Vision - ACCV 2014, 2014
2012
Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012
Proceedings of the Computer Vision, 2012
2010
Proceedings of the 18th International Conference on Multimedia 2010, 2010