2025
Spectral-factorized Positive-definite Curvature Learning for NN Training.
CoRR, February, 2025

Training Data Attribution (TDA): Examining Its Adoption & Use Cases.
CoRR, January, 2025

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Influence Functions for Scalable Data Attribution in Diffusion Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Accelerating neural network training: An analysis of the AlgoPerf competition.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models.
CoRR, 2024

What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions.
CoRR, 2024

Training Data Attribution via Approximate Unrolled Differentiation.
CoRR, 2024

Training Data Attribution via Approximate Unrolling.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Using Large Language Models for Hyperparameter Optimization.
CoRR, 2023

Studying Large Language Model Generalization with Influence Functions.
CoRR, 2023

Benchmarking Neural Network Training Algorithms.
CoRR, 2023

Efficient Parametric Approximations of Neural Network Function Space Distance.
Proceedings of the International Conference on Machine Learning, 2023

Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Amortized Proximal Optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

If Influence Functions are the Answer, Then What is the Question?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Analyzing Monotonic Linear Interpolation in Neural Network Loss Landscapes.
CoRR, 2021

On Monotonic Linear Interpolation of Neural Network Parameters.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Delta-STN: Efficient Bilevel Optimization for Neural Networks using Structured Response Jacobians.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
Fast 6DOF Pose Estimation with Synthetic Textureless CAD Model for Mobile Applications.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

2018
Eigenvalue Corrected Noisy Natural Gradient.
CoRR, 2018

Learnable Pooling Methods for Video Classification.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Study on HDR/WCG Service Model for UHD Service.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2015
Robust visual tracking through deep learning-based confidence evaluation.
Proceedings of the 12th International Conference on Ubiquitous Robots and Ambient Intelligence, 2015

2014
Semi-online video stabilization using probabilistic keyframe update and inter-keyframe motion smoothing.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013
Background subtraction using edge cues and color difference for stabilized CMOS images.
Proceedings of the IEEE International Conference on Consumer Electronics, 2013