Licheng Yu

Orcid: 0000-0002-4943-6732

According to our database1, Licheng Yu authored at least 77 papers between 2011 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Movie Gen: A Cast of Media Foundation Models.
CoRR, 2024

SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Diffusion Models.
CoRR, 2024

Animated Stickers: Bringing Stickers to Life with Video Diffusion.
CoRR, 2024

Ameli: Enhancing Multimodal Entity Linking with Fine-Grained Attributes.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Layout-Agnostic Scene Text Image Synthesis with Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

AVID: Any-Length Video Inpainting with Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression.
CoRR, 2023

Learning and Verification of Task Structure in Instructional Videos.
CoRR, 2023

Que2Engage: Embedding-based Retrieval for Relevant and Engaging Products at Facebook Marketplace.
Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

RoPAWS: Robust Semi-supervised Representation Learning from Uncurated Data.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

CiT: Curation in Training for Effective Vision-Language Data.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
GEB+: A benchmark for generic event boundary captioning, grounding and text-based retrieval.
CoRR, 2022

LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval.
CoRR, 2022

Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment.
CoRR, 2022

CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval.
CoRR, 2022

CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified Retrieval and Captioning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval.
Proceedings of the Computer Vision - ECCV 2022, 2022

FashionViL: Fashion-Focused Vision-and-Language Representation Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Unsupervised Vision-and-Language Pretraining via Retrieval-based Multi-Granular Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Assistive supernumerary grasping with the back of the hand.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Connecting What To Say With Where To Look by Modeling Human Attention Traces.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

What is More Likely to Happen Next? Video-and-Language Future Event Prediction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval.
Proceedings of the Computer Vision - ECCV 2020, 2020

UNITER: UNiversal Image-TExt Representation Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models.
Proceedings of the Computer Vision - ECCV 2020, 2020

Violin: A Large-Scale Dataset for Video-and-Language Inference.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

BachGAN: High-Resolution Image Synthesis From Salient Object Layout.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

TVQA+: Spatio-Temporal Grounding for Video Question Answering.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Question Answering, Grounding, and Generation for Vision and Language.
PhD thesis, 2019

UNITER: Learning UNiversal Image-TExt Representations.
CoRR, 2019

Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Multi-Target Embodied Question Answering.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
A Unified Framework for Manifold Landmarking.
IEEE Trans. Signal Process., 2018

Physics-Inspired Garment Recovery from a Single-View Image.
ACM Trans. Graph., 2018

From image to language and back again.
Nat. Lang. Eng., 2018

Last level cache layout remapping for heterogeneous systems.
J. Syst. Archit., 2018

TVQA: Localized, Compositional Video Question Answering.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

MAttNet: Modular Attention Network for Referring Expression Comprehension.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Enable back memory and global synchronization on LLC buffer.
J. Supercomput., 2017

Active manifold learning via a unified framework for manifold landmarking.
CoRR, 2017

Hierarchically-Attentive RNN for Album Summarization and Storytelling.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

A Joint Speaker-Listener-Reinforcer Model for Referring Expressions.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Architecture supported register stash for GPGPU.
J. Parallel Distributed Comput., 2016

Detailed Garment Recovery from a Single-View Image.
CoRR, 2016

Two Methods for Combining Original Memory Access Coalescing and Equivalent Memory Access Coalescing on GPGPU.
Proceedings of the 13th International Conference on Embedded Software and Systems, 2016

LLC Buffer for Arbitrary Data Sharing in Heterogeneous Systems.
Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016

WAP: The Warp Feature Aware Prefetching Method for LLC on CPU-GPU Heterogeneous Architecture.
Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016

Modeling Context in Referring Expressions.
Proceedings of the Computer Vision - ECCV 2016, 2016

2015
Vector Sparse Representation of Color Image Using Quaternion Matrix Analysis.
IEEE Trans. Image Process., 2015

MCMG simulator: A unified simulation framework for CPU and graphic GPU.
J. Comput. Syst. Sci., 2015

Visual Madlibs: Fill in the blank Image Generation and Question Answering.
CoRR, 2015

Analyzing Memory Access on CPU-GPGPU Shared LLC Architecture.
Proceedings of the 14th International Symposium on Parallel and Distributed Computing, 2015

Visual Madlibs: Fill in the Blank Description Generation and Question Answering.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Equidistant Memory Access Coalescing on GPGPU.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

Buffer Filter: A Last-Level Cache Management Policy for CPU-GPGPU Heterogeneous System.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

Dictionary Learning with Mutually Reinforcing Group-Graph Structures.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Improving branch divergence performance on GPGPU with a new PDOM stack and multi-level warp scheduling.
J. Syst. Archit., 2014

Buffer on Last Level Cache for CPU and GPGPU Data Sharing.
Proceedings of the 2014 IEEE International Conference on High Performance Computing and Communications, 2014

A quantitative quality control method of big data in cancer patients using artificial neural network.
Proceedings of the IEEE 3rd International Conference on Cloud Computing and Intelligence Systems, 2014

2013
Single image super-resolution via phase congruency analysis.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

Quaternion-based sparse representation of color image.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Self-example based super-resolution with fractal-based gradient enhancement.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

2012
Packet Triggered Prediction Based Task Migration for Network-on-Chip.
Proceedings of the 20th Euromicro International Conference on Parallel, 2012

A CPU-GPGPU Scheduler Based on Data Transmission Bandwidth of Workload.
Proceedings of the 13th International Conference on Parallel and Distributed Computing, 2012

A Software-hardware Collaborating Framework for Wear Leveling on Phase Change Memory.
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012

Improve GPGPU Latency Hiding with a Hybrid Recovery Stack and a Window Based Warp Scheduling Policy.
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012

Robust single image super-resolution based on gradient enhancement.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Leakage Aware Scheduling for Maximum Temperature Minimization.
Proceedings of the 12th International Conference on Parallel and Distributed Computing, 2011


  Loading...