Yizhe Zhang

Affiliations:
  • Apple MLR, USA
  • Meta AI, USA (former)
  • Microsoft Research, Redmond, WA, USA (former)


According to our database1, Yizhe Zhang authored at least 89 papers between 2002 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation.
CoRR, 2024

MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains.
CoRR, 2024

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents.
CoRR, 2024

Improving GFlowNets for Text-to-Image Diffusion Alignment.
CoRR, 2024

Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling.
CoRR, 2024

Many-to-many Image Generation with Auto-regressive Diffusion Models.
CoRR, 2024

How Far Are We from Intelligent Visual Deductive Reasoning?
CoRR, 2024

Executable Code Actions Elicit Better LLM Agents.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Data-free Distillation of Diffusion Models with Bootstrapping.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Matryoshka Diffusion Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Divide-or-Conquer? Which Part Should You Distill Your LLM?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Probing the Multi-turn Planning Capabilities of LLMs via 20 Question Games.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
KGLens: A Parameterized Knowledge Graph Solution to Assess What an LLM Does and Doesn't Know.
CoRR, 2023

The Entity-Deduction Arena: A playground for probing the conversational reasoning and planning capabilities of LLMs.
CoRR, 2023

BOOT: Data-free Distillation of Denoising Diffusion Models with Bootstrapping.
CoRR, 2023

PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Stabilizing Transformer Training by Preventing Attention Entropy Collapse.
Proceedings of the International Conference on Machine Learning, 2023

f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Interactive Text Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Towards More Efficient Insertion Transformer with Fractional Positional Encoding.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2022
Linearizing Transformer with Key-Value Memory Bank.
CoRR, 2022

Report from the NSF Future Directions Workshop on Automatic Evaluation of Dialog: Research Directions and Challenges.
CoRR, 2022

Linearizing Transformer with Key-Value Memory.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Bridging the Training-Inference Gap for Dense Phrase Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

A Token-level Reference-free Hallucination Detection Benchmark for Free-form Text Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

What Makes Good In-Context Examples for GPT-3?
Proceedings of Deep Learning Inside Out: The 3rd Workshop on Knowledge Extraction and Integration for Deep Learning Architectures, 2022

RetGen: A Joint Framework for Retrieval and Grounded Text Generation Modeling.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Joint Retrieval and Generation Training for Grounded Text Generation.
CoRR, 2021

An Adversarially-Learned Turing Test for Dialog Generation Models.
CoRR, 2021

SDA: Improving Text Generation with Self Data Augmentation.
CoRR, 2021

Contextualized Perturbation for Textual Adversarial Attack.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Finetuning Pretrained Transformers into RNNs.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Contrastive Multi-document Question Generation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Automatic Document Sketching: Generating Drafts from Analogous Texts.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

A Controllable Model of Grounded Response Generation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Data Augmentation for Abstractive Query-Focused Multi-Document Summarization.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Narrative Incoherence Detection.
CoRR, 2020

Weakly supervised cross-domain alignment with optimal transport.
CoRR, 2020

POINTER: Constrained Text Generation via Insertion-based Generative Pre-training.
CoRR, 2020

Contextual Re-Ranking with Behavior Aware Transformers.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Datasets and Benchmarks for Task-Oriented Log Dialogue Ranking Task.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Adaptive Correlated Monte Carlo for Contextual Categorical Sequence Generation.
Proceedings of the 8th International Conference on Learning Representations, 2020

POINTER: Constrained Progressive Text Generation via Insertion-based Generative Pre-training.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Improving Text Generation with Student-Forcing Optimal Transport.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Dialogue Response Ranking Training with Large-Scale Human Feedback Data.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Contextual Text Style Transfer.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Advancing weakly supervised cross-domain alignment with optimal transport.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

DIALOGPT : Large-Scale Generative Pre-training for Conversational Response Generation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

INSET: Sentence Infilling with INter-SEntential Transformer.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Improving Disentangled Text Representation Learning with Information-Theoretic Guidance.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Contrastively Smoothed Class Alignment for Unsupervised Domain Adaptation.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Complementary Auxiliary Classifiers for Label-Conditional Text Generation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Sequence Generation with Optimal-Transport-Enhanced Reinforcement Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
INSET: Sentence Infilling with Inter-sentential Generative Pre-training.
CoRR, 2019

Unsupervised Common Question Generation from Multiple Documents using Reinforced Contrastive Coordinator.
CoRR, 2019

Consistent Dialogue Generation with Self-supervised Feature Learning.
CoRR, 2019

A convergence analysis for a class of practical variance-reduction stochastic gradient MCMC.
Sci. China Inf. Sci., 2019

Unsupervised Dialogue Spectrum Generation for Log Dialogue Ranking.
Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue, 2019

Jointly Optimizing Diversity and Relevance in Neural Response Generation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Improving Sequence-to-Sequence Learning via Optimal Transport.
Proceedings of the 7th International Conference on Learning Representations, 2019

Domain Adaptive Text Style Transfer.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Structuring Latent Spaces for Stylized Response Generation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Generating a Common Question from Multiple Documents using Multi-source Encoder-Decoder Models.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019

Microsoft Icecaps: An Open-Source Toolkit for Conversation Modeling.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Towards Generating Long and Coherent Text with Multi-Level Latent Variable Models.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Improving Textual Network Embedding with Global Attention via Optimal Transport.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
A bird's-eye view on coherence, and a worm's-eye view on cohesion.
CoRR, 2018

Adversarial Text Generation via Feature-Mover's Distance.
CoRR, 2018

Generating Informative and Diverse Conversational Responses via Adversarial Information Maximization.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Adversarial Text Generation via Feature-Mover's Distance.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

JointGAN: Multi-Domain Joint Distribution Learning with Generative Adversarial Nets.
Proceedings of the 35th International Conference on Machine Learning, 2018

Joint Embedding of Words and Labels for Text Classification.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Zero-Shot Learning via Class-Conditioned Deep Generative Models.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Deconvolutional Latent-Variable Model for Text Sequence Matching.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Deconvolutional Paragraph Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Triangle Generative Adversarial Networks.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Adversarial Feature Matching for Text Generation.
Proceedings of the 34th International Conference on Machine Learning, 2017

Stochastic Gradient Monomial Gamma Sampler.
Proceedings of the 34th International Conference on Machine Learning, 2017

2016
Laplacian Hamiltonian Monte Carlo.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2016

Towards Unifying Hamiltonian Monte Carlo and Slice Sampling.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Stochastic Gradient MCMC with Stale Gradients.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Bayesian Dictionary Learning with Gaussian Processes and Sigmoid Belief Networks.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Dynamic Poisson Factor Analysis.
Proceedings of the IEEE 16th International Conference on Data Mining, 2016

Triply Stochastic Variational Inference for Non-linear Beta Process Factor Analysis.
Proceedings of the IEEE 16th International Conference on Data Mining, 2016

Learning a Hybrid Architecture for Sequence Regression and Annotation.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2002
Model-based statistical sensor fusion for unexploded ordnance detection.
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2002


  Loading...