2024

How Transformers Solve Propositional Logic Problems: A Mechanistic Analysis.

[DOI]

Guanzhe Hong

Nishanth Dikkala

CoRR, 2024

Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles.

[DOI]

CoRR, 2024

Jaxpruner: A Concise Library for Sparsity Research.

[DOI]

Proceedings of the Conference on Parsimony and Learning, 2024

2023

JaxPruner: A concise library for sparsity research.

[DOI]

CoRR, 2023

LayerNAS: Neural Architecture Search in Polynomial Complexity.

[DOI]

CoRR, 2023

The Power of External Memory in Increasing Predictive Model Capacity.

[DOI]

CoRR, 2023

Alternating Updates for Efficient Transformers.

[DOI]

CoRR, 2023

On the Benefits of Learning to Route in Mixture-of-Experts Models.

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Large Language Models with Controllable Working Memory.

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

Sketching based Representations for Robust Image Classification with Provable Guarantees.

[DOI]

Nishanth Dikkala

Sankeerth Rao Karingula

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Theoretical View on Sparsely Activated Networks.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes.

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

Green Technology Development and Adoption: Competition, Regulation, and Uncertainty - A Global Game Approach.

[DOI]

Xin Wang

Soo-Haeng Cho

Alan Scheller-Wolf

Manag. Sci., 2021

Sketch based Memory for Neural Networks.

[DOI]

Rina Panigrahy

Xin Wang

Manzil Zaheer

Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020

Improving Sampling Accuracy of Stochastic Gradient MCMC Methods via Non-uniform Subsampling of Gradients.

[DOI]

CoRR, 2020

2019

Back and forth error compensation and correction method for linear hyperbolic systems with application to the Maxwell's equations.

[DOI]

Xin Wang

Yingjie Liu

J. Comput. Phys. X, 2019

2017

Newsvendor Mergers.

[DOI]

Soo-Haeng Cho

Xin Wang

Manag. Sci., 2017