Spectral-factorized Positive-definite Curvature Learning for NN Training.
CoRR, February, 2025
Training Data Attribution (TDA): Examining Its Adoption & Use Cases.
CoRR, January, 2025
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Influence Functions for Scalable Data Attribution in Diffusion Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Accelerating neural network training: An analysis of the AlgoPerf competition.
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models.
CoRR, 2024
What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions.
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Training Data Attribution via Approximate Unrolled Differentiation.
CoRR, 2024
Training Data Attribution via Approximate Unrolling.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Using Large Language Models for Hyperparameter Optimization.
CoRR, 2023
Studying Large Language Model Generalization with Influence Functions.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
Efficient Parametric Approximations of Neural Network Function Space Distance.
Proceedings of the International Conference on Machine Learning, 2023
Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Amortized Proximal Optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
If Influence Functions are the Answer, Then What is the Question?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Analyzing Monotonic Linear Interpolation in Neural Network Loss Landscapes.
CoRR, 2021
On Monotonic Linear Interpolation of Neural Network Parameters.
Proceedings of the 38th International Conference on Machine Learning, 2021
Delta-STN: Efficient Bilevel Optimization for Neural Networks using Structured Response Jacobians.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Fast 6DOF Pose Estimation with Synthetic Textureless CAD Model for Mobile Applications.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Eigenvalue Corrected Noisy Natural Gradient.
CoRR, 2018
Learnable Pooling Methods for Video Classification.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018
Study on HDR/WCG Service Model for UHD Service.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Robust visual tracking through deep learning-based confidence evaluation.
Proceedings of the 12th International Conference on Ubiquitous Robots and Ambient Intelligence, 2015
Semi-online video stabilization using probabilistic keyframe update and inter-keyframe motion smoothing.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014
Background subtraction using edge cues and color difference for stabilized CMOS images.
Proceedings of the IEEE International Conference on Consumer Electronics, 2013