Yuhuai Wu

According to our database1, Yuhuai Wu authored at least 58 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Solving olympiad geometry without human demonstrations.
Nat., January, 2024

Meta-Designing Quantum Experiments with Language Models.
CoRR, 2024

REFACTOR: Learning to Extract Theorems from Proofs.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Don't Trust: Verify - Grounding LLM Quantitative Reasoning with Autoformalization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Magnushammer: A Transformer-Based Approach to Premise Selection.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Holistic Evaluation of Language Models.
Trans. Mach. Learn. Res., 2023

Length Generalization in Arithmetic Transformers.
CoRR, 2023

Evaluating Language Models for Mathematics through Interactions.
CoRR, 2023

Lexinvariant Language Models.
CoRR, 2023

Magnushammer: A Transformer-based Approach to Premise Selection.
CoRR, 2023

Focused Transformer: Contrastive Training for Context Scaling.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Lexinvariant Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal Proofs.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Language Model Cascades.
CoRR, 2022

STaR: Bootstrapping Reasoning With Reasoning.
CoRR, 2022

STaR: Bootstrapping Reasoning With Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Insights into Pre-training via Simpler Synthetic Tasks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Autoformalization with Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Solving Quantitative Reasoning Problems with Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Thor: Wielding Hammers to Integrate Language Models and Automated Theorem Provers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Block-Recurrent Transformers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Exploring Length Generalization in Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Path Independent Equilibrium Models Can Better Exploit Test-Time Computation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Hierarchical Transformers Are More Efficient Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Memorizing Transformers.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Invariant Causal Representation Learning for Out-of-Distribution Generalization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Proof Artifact Co-Training for Theorem Proving with Language Models.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Discrete Equidecomposability and Ehrhart Theory of Polygons.
Discret. Comput. Geom., 2021

Learning to Give Checkable Answers with Prover-Verifier Games.
CoRR, 2021

Nonlinear Invariant Risk Minimization: A Causal Approach.
CoRR, 2021

Subgoal Search For Complex Reasoning Tasks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning.
Proceedings of the 38th International Conference on Machine Learning, 2021

Efficient Statistical Tests: A Neural Tangent Kernel Approach.
Proceedings of the 38th International Conference on Machine Learning, 2021

INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving.
Proceedings of the 9th International Conference on Learning Representations, 2021

IsarStep: a Benchmark for High-level Mathematical Reasoning.
Proceedings of the 9th International Conference on Learning Representations, 2021

Learning Branching Heuristics for Propositional Model Counting.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
The Scattering Compositional Learner: Discovering Objects, Attributes, Relationships in Analogical Reasoning.
CoRR, 2020

Learning Branching Heuristics for Propositional Model Counting.
CoRR, 2020

Modelling High-Level Mathematical Reasoning in Mechanised Declarative Proofs.
CoRR, 2020

OPtions as REsponses: Grounding behavioural hierarchies in multi-agent reinforcement learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Grandmaster level in StarCraft II using multi-agent reinforcement learning.
Nat., 2019

Options as responses: Grounding behavioural hierarchies in multi-agent RL.
CoRR, 2019

Concurrent Meta Reinforcement Learning.
CoRR, 2019

ACTRCE: Augmenting Experience via Teacher's Advice For Multi-Goal Reinforcement Learning.
CoRR, 2019

2018
Some Considerations on Learning to Explore via Meta-Reinforcement Learning.
CoRR, 2018

An Empirical Analysis of Proximal Policy Optimization with Kronecker-factored Natural Gradients.
CoRR, 2018

The Importance of Sampling inMeta-Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Understanding Short-Horizon Bias in Stochastic Meta-Optimization.
Proceedings of the 6th International Conference on Learning Representations, 2018

Backpropagation through the Void: Optimizing control variates for black-box gradient estimation.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
STDP-Compatible Approximation of Backpropagation in an Energy-Based Model.
Neural Comput., 2017

Sticking the Landing: An Asymptotically Zero-Variance Gradient Estimator for Variational Inference.
CoRR, 2017

Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Sticking the Landing: Simple, Lower-Variance Gradient Estimators for Variational Inference.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

On the Quantitative Analysis of Decoder-Based Generative Models.
Proceedings of the 5th International Conference on Learning Representations, 2017

2016
Architectural Complexity Measures of Recurrent Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

On Multiplicative Integration with Recurrent Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Path-Normalized Optimization of Recurrent Neural Networks with ReLU Activations.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016


  Loading...