Yuandong Tian
Orcid: 0000-0003-4202-4847Affiliations:
- Carnegie Mellon University
According to our database1,
Yuandong Tian
authored at least 158 papers
between 2006 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on cs.cmu.edu
On csauthors.net:
Bibliography
2024
Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces.
CoRR, 2024
CoRR, 2024
CoRR, 2024
From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients.
CoRR, 2024
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
CoRR, 2024
CoRR, 2024
TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding.
CoRR, 2024
CoRR, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
2023
Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization.
Trans. Mach. Learn. Res., 2023
CoRR, 2023
RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment.
CoRR, 2023
CoRR, 2023
H<sub>2</sub>O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
CoRR, 2023
Modeling Scattering Coefficients using Self-Attentive Complex Polynomials with Image-based Representation.
CoRR, 2023
Proceedings of the ACM SIGCOMM 2023 Conference, 2023
DyFormer : A Scalable Dynamic Graph Transformer with Provable Benefits on Generalization Ability.
Proceedings of the 2023 SIAM International Conference on Data Mining, 2023
Landscape Surrogate: Learning Decision Losses for Mathematical Optimization Under Partial Information.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2023
Proceedings of the Integration of Constraint Programming, Artificial Intelligence, and Operations Research, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
Sample-Efficient Neural Architecture Search by Learning Actions for Monte Carlo Tree Search.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
CoRR, 2022
AutoCAT: Reinforcement Learning for Automated Exploration of Cache Timing-Channel Attacks.
CoRR, 2022
Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems.
CoRR, 2022
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
NASViT: Neural Architecture Search for Efficient Vision Transformers with Gradient Conflict aware Supernet Training.
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2022
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022
Learning Bounded Context-Free-Grammar via LSTM and the Transformer: Difference and the Explanations.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
Q-gym: An Equality Saturation Framework for DNN Inference Exploiting Weight Repetition.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2022
2021
IEEE Robotics Autom. Lett., 2021
CoRR, 2021
CoRR, 2021
Proceedings of the ACM SIGCOMM 2021 Conference, Virtual Event, USA, August 23-27, 2021., 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Learn-to-Share: A Hardware-friendly Transfer Learning Framework Exploiting Computation and Parameter Sharing.
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021
2020
IEEE Micro, 2020
CoRR, 2020
CoRR, 2020
Learning Search Space Partition for Black-box Optimization using Monte Carlo Tree Search.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020
Student Specialization in Deep Rectified Networks With Finite Width and Input Dimension.
Proceedings of the 37th International Conference on Machine Learning, 2020
Playing the lottery with rewards and multiple languages: lottery tickets in RL and NLP.
Proceedings of the 8th International Conference on Learning Representations, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
CoRR, 2019
AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search.
CoRR, 2019
One ticket to win them all: generalizing lottery ticket initializations across datasets and optimizers.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Hierarchical Decision Making by Generating and Following Natural Language Instructions.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 7th International Conference on Learning Representations, 2019
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees.
Proceedings of the 7th International Conference on Learning Representations, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
2018
IEEE Trans. Games, 2018
Int. J. Comput. Vis., 2018
Mixed Precision Quantization of ConvNets via Differentiable Neural Architecture Search.
CoRR, 2018
Algorithmic Framework for Model-based Reinforcement Learning with Theoretical Guarantees.
CoRR, 2018
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018
Gradient Descent Learns One-hidden-layer CNN: Don't be Afraid of Spurious Local Minima.
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
2017
ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
An Analytical Formula of Population Gradient for two-layered ReLU network and its Applications in Convergence and Critical Point Analysis.
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 5th International Conference on Learning Representations, 2017
Symmetry-Breaking Convergence Analysis of Certain Two-layered Neural Networks with ReLU nonlinearity.
Proceedings of the 5th International Conference on Learning Representations, 2017
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
2016
Proceedings of the 4th International Conference on Learning Representations, 2016
Proceedings of the Computer Vision - ECCV 2016, 2016
2015
Theory and Practice of Hierarchical Data-driven Descent for Optimal Deformation Estimation.
Int. J. Comput. Vis., 2015
2013
Proceedings of the IEEE International Conference on Computer Vision, 2013
Integrating Perceptual Learning with External World Knowledge in a Simulated Student.
Proceedings of the Artificial Intelligence in Education - 16th International Conference, 2013
2012
Int. J. Comput. Vis., 2012
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012
Proceedings of the Computer Vision - ECCV 2012, 2012
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012
2011
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011
2010
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010
2009
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009
2008
Easytoon: an easy and quick tool to personalize a cartoon storyboard using family photo album.
Proceedings of the 16th International Conference on Multimedia 2008, 2008
2007
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007
EasyAlbum: an interactive photo annotation system based on face clustering and re-ranking.
Proceedings of the 2007 Conference on Human Factors in Computing Systems, 2007
2006
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006