Lu Jiang

Orcid: 0000-0003-0286-8439

Affiliations:
  • Google AI, Mountain View, CA, USA
  • Carnegie Mellon University, School of Computer Science, Pittsburgh, PA, USA


According to our database1, Lu Jiang authored at least 93 papers between 2011 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation.
CoRR, 2024


Language Model Beats Diffusion - Tokenizer is key to visual generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Photorealistic Video Generation with Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Auditing Gender Presentation Differences in Text-to-Image Models.
Proceedings of the 4th ACM Conference on Equity and Access in Algorithms, 2024

Text-Driven Image Editing via Learnable Regions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Switchable Novel Object Captioner.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Fine-grained Controllable Video Generation via Object Appearance and Context.
CoRR, 2023

VideoGLUE: Video General Understanding Evaluation of Foundation Models.
CoRR, 2023

StyleDrop: Text-to-Image Generation in Any Style.
CoRR, 2023

Learning Disentangled Prompts for Compositional Image Synthesis.
CoRR, 2023

SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

StyleDrop: Text-to-Image Synthesis of Any Style.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Muse: Text-To-Image Generation via Masked Generative Transformers.
Proceedings of the International Conference on Machine Learning, 2023

Discrete Predictor-Corrector Diffusion Models for Image Synthesis.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

MAGVIT: Masked Generative Video Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Visual Prompt Tuning for Generative Transfer Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Contrastive Adaptation Network for Single- and Multi-Source Domain Adaptation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Discrete Representations Strengthen Vision Transformer Robustness.
Proceedings of the Tenth International Conference on Learning Representations, 2022

ViTGAN: Training GANs with Vision Transformers.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Improved Masked Image Generation with Token-Critic.
Proceedings of the Computer Vision - ECCV 2022, 2022

BLT: Bidirectional Layout Transformer for Controllable Layout Generation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Pyramid Adversarial Training Improves ViT Performance.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MaskGIT: Masked Generative Image Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Confident Learning: Estimating Uncertainty in Dataset Labels.
J. Artif. Intell. Res., 2021

Controllable and Progressive Image Extrapolation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Text as Neural Operator: Image Manipulation by Text Instruction.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Self-supervised and Supervised Joint Training for Resource-rich Machine Translation.
Proceedings of the 38th International Conference on Machine Learning, 2021

Faster Meta Update Strategy for Noise-Robust Deep Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Regularizing Generative Adversarial Networks Under Limited Data.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Revisiting EmbodiedQA: A Simple Baseline and Beyond.
IEEE Trans. Image Process., 2020

SimAug: Learning Robust Representations from 3D Simulation for Pedestrian Trajectory Prediction in Unseen Cameras.
CoRR, 2020

Beyond Synthetic Noise: Deep Learning on Controlled Noisy Labels.
Proceedings of the 37th International Conference on Machine Learning, 2020

RetrieveGAN: Image Synthesis via Differentiable Patch Retrieval.
Proceedings of the Computer Vision - ECCV 2020, 2020

SimAug: Learning Robust Representations from Simulation for Trajectory Prediction.
Proceedings of the Computer Vision - ECCV 2020, 2020

Neural Design Network: Graphic Layout Generation with Constraints.
Proceedings of the Computer Vision - ECCV 2020, 2020

The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

AdvAug: Robust Adversarial Augmentation for Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Focal Visual-Text Attention for Memex Question Answering.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Neural Design Network: Graphic Layout Generation with Constraints.
CoRR, 2019

Feature Partitioning for Efficient Multi-Task Architectures.
CoRR, 2019

Let's Transfer Transformations of Shared Semantic Representations.
CoRR, 2019

Eidetic 3D LSTM: A Model for Video Prediction and Beyond.
Proceedings of the 7th International Conference on Learning Representations, 2019

Composing Text and Image for Image Retrieval - an Empirical Odyssey.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Peeking Into the Future: Predicting Future Person Activities and Locations in Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Contrastive Adaptation Network for Unsupervised Domain Adaptation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Robust Neural Machine Translation with Doubly Adversarial Inputs.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Decoupled Novel Object Captioner.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels.
Proceedings of the 35th International Conference on Machine Learning, 2018

Graph Distillation for Action Detection with Privileged Modalities.
Proceedings of the Computer Vision - ECCV 2018, 2018

Focal Visual-Text Attention for Visual Question Answering.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Revealing Event Saliency in Unconstrained Video Collection.
IEEE Trans. Image Process., 2017

A theoretical understanding of self-paced learning.
Inf. Sci., 2017

MentorNet: Regularizing Very Deep Neural Networks on Corrupted Labels.
CoRR, 2017

Graph Distillation for Action Detection with Privileged Information.
CoRR, 2017

MemexQA: Visual Memex Question Answering.
CoRR, 2017

Video Representation Learning and Latent Concept Mining for Large-scale Multi-label Video Classification.
CoRR, 2017

Delving Deep into Personal Photo and Video Search.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

Video Search via Ranking Network with Very Few Query Exemplars.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Leveraging Multi-modal Prior Knowledge for Large-scale Concept Learning in Noisy Web Data.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Temporal localization of audio events for conflict monitoring in social media.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Webly-Supervised Learning of Multimodal Video Detectors.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

An Event Reconstruction Tool for Conflict Monitoring Using Social Media.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Visual Memory QA: Your Personal Photo and Video Search Agent.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Text-to-video: a semantic search engine for internet videos.
Int. J. Multim. Inf. Retr., 2016

Strategies for Searching Video Content with Text Queries or Video Examples.
CoRR, 2016

Exploiting Multi-modal Curriculum in Noisy Web Data for Large-scale Concept Learning.
CoRR, 2016

Web-scale Multimedia Search for Internet Video Content.
Proceedings of the 25th International Conference on World Wide Web, 2016

Informedia @ TRECVID 2016.
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016

Learning to Detect Concepts from Webly-Labeled Video Data.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015

Early Implementation Experience with Wearable Cognitive Assistance Applications.
Proceedings of the 2015 workshop on Wearable Systems and Applications, 2015

Fast and Accurate Content-based Semantic Search in 100M Internet Videos.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Content-Based Video Search over 1 Million Videos with 1 Core in 1 Second.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Incremental Multimodal Query Construction for Video Search.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Bridging the Ultimate Semantic Gap: A Semantic Search Engine for Internet Videos.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

A Self-Paced Multiple-Instance Learning Framework for Co-Saliency Detection.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Self-Paced Learning for Matrix Factorization.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Self-Paced Curriculum Learning.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
E-LAMP: integration of innovative ideas for multimedia event detection.
Mach. Vis. Appl., 2014


Improvements to speaker adaptive training of deep neural networks.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Self-Paced Learning with Diversity.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Instructional Videos for Unsupervised Harvesting and Learning of Action Examples.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Easy Samples First: Self-paced Reranking for Zero-Example Multimedia Search.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Towards Efficient Learning of Optimal Spatial Bag-of-Words Representations.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Viral Video Style: A Closer Look at Viral Videos on YouTube.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Zero-Example Event Search using MultiModal Pseudo Relevance Feedback.
Proceedings of the International Conference on Multimedia Retrieval, 2014

A Novel Group-Sparsity-Optimization-Based Feature Selection Model for Complex Interaction Recognition.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013

2012

Leveraging high-level and low-level features for multimedia event detection.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

2011
Informedia@TRECVID 2011: Surveillance Event Detection.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011


  Loading...