Mark Schmidt
Orcid: 0000-0003-1129-5273Affiliations:
- University of British Columbia, Department of Computer Science, Vancouver, Canada
- École Normale Supérieure, INRIA SIERRA project team, Paris, France
- University of Alberta, Department of Computing Science, Edmonton, Canada
According to our database1,
Mark Schmidt
authored at least 100 papers
between 2005 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
On csauthors.net:
Bibliography
2024
Why Line Search when you can Plane Search? SO-Friendly Neural Networks allow Per-Iteration Optimization of Learning and Momentum Rates for Every Layer.
CoRR, 2024
BlockLLM: Memory-Efficient Adaptation of LLMs by Selecting and Optimizing the Right Coordinate Blocks.
CoRR, 2024
CoRR, 2024
Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models.
CoRR, 2024
2023
Comput. Biol. Chem., June, 2023
Analyzing and Improving Greedy 2-Coordinate Updates for Equality-Constrained Optimization via Steepest Descent in the 1-Norm.
CoRR, 2023
Proceedings of the Uncertainty in Artificial Intelligence, 2023
Fast Convergence of Random Reshuffling Under Over-Parameterization and the Polyak-Łojasiewicz Condition.
Proceedings of the Machine Learning and Knowledge Discovery in Databases: Research Track, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Simplifying Momentum-based Positive-definite Submanifold Optimization with Applications to Deep Learning.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Noise Is Not the Main Factor Behind the Gap Between Sgd and Adam on Transformers, But Sign Descent Might Be.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
2022
Let's Make Block Coordinate Descent Converge Faster: Faster Greedy Rules, Message-Passing, Active-Set Complexity, and Superlinear Convergence.
J. Mach. Learn. Res., 2022
Homeomorphic-Invariance of EM: Non-Asymptotic Convergence in KL Divergence for Exponential Families via Mirror Descent (Extended Abstract).
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Proceedings of the Conference on Lifelong Learning Agents, 2022
2021
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Homeomorphic-Invariance of EM: Non-Asymptotic Convergence in KL Divergence for Exponential Families via Mirror Descent.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021
2020
Adaptive Gradient Methods Converge Faster with Over-Parameterization (and you can do a line-search).
CoRR, 2020
Regret Bounds without Lipschitz Continuity: Online Learning with Relative-Lipschitz Losses.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the IEEE International Conference on Image Processing, 2020
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020
2019
"Active-set complexity" of proximal gradient: How long does it take to find the sparsity pattern?
Optim. Lett., 2019
CoRR, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the International Joint Conference on Neural Networks, 2019
Fast and Simple Natural-Gradient Variational Inference with Mixture of Exponential-family Approximations.
Proceedings of the 36th International Conference on Machine Learning, 2019
Efficient Parameter Estimation for DNA Kinetics Modeled as Continuous-Time Markov Chains.
Proceedings of the DNA Computing and Molecular Programming - 25th International Conference, 2019
Proceedings of the 30th British Machine Vision Conference 2019, 2019
Proceedings of the 30th British Machine Vision Conference 2019, 2019
Fast and Faster Convergence of SGD for Over-Parameterized Models and an Accelerated Perceptron.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019
Distributed Maximization of "Submodular plus Diversity" Functions for Multi-label Feature Selection on Huge Datasets.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019
2018
Does Your Model Know the Digit 6 Is Not a Cat? A Less Biased Evaluation of "Outlier" Detectors.
CoRR, 2018
MASAGA: A Linearly-Convergent Stochastic First-Order Method for Optimization on Manifolds.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2018
SLANG: Fast Structured Covariance Approximations for Bayesian Deep Learning with Natural Gradient.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the Computer Vision - ECCV 2018, 2018
2017
Math. Program., 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Inferring Parameters for an Elementary Step Model of DNA Structure Kinetics with Locally Context-Dependent Arrhenius Rates.
Proceedings of the DNA Computing and Molecular Programming - 23rd International Conference, 2017
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017
2016
Convergence Rates for Greedy Kaczmarz Algorithms, and Faster Randomized Kaczmarz Rules Using the Orthogonality Graph.
CoRR, 2016
Convergence Rates for Greedy Kaczmarz Algorithms, and Randomized Kaczmarz Rules Using the Orthogonality Graph.
Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016
Faster Stochastic Variational Inference using Proximal-Gradient Methods with General Divergence Functions.
Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016
Linear Convergence of Gradient and Proximal-Gradient Methods Under the Polyak-Łojasiewicz Condition.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2016
Proceedings of the British Machine Vision Conference 2016, 2016
2015
Convergence of Proximal-Gradient Stochastic Variational Inference under Non-Decreasing Step-Size Sequence.
CoRR, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Coordinate Descent Converges Faster with the Gauss-Southwell Rule Than Random Selection.
Proceedings of the 32nd International Conference on Machine Learning, 2015
Non-Uniform Stochastic Average Gradient Method for Training Conditional Random Fields.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015
2014
Convex Optimization for Big Data: Scalable, randomized, and parallel algorithms for big data analytics.
IEEE Signal Process. Mag., 2014
2013
SIAM J. Sci. Comput., 2013
Proceedings of the 30th International Conference on Machine Learning, 2013
2012
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012
A simpler approach to obtaining an O(1/t) convergence rate for the projected stochastic subgradient method
CoRR, 2012
A Stochastic Gradient Method with an Exponential Convergence Rate for Strongly-Convex Optimization with Finite Training Sets
CoRR, 2012
A Stochastic Gradient Method with an Exponential Convergence Rate for Finite Training Sets.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012
2011
Generalized Fast Approximate Energy Minimization via Graph Cuts: Alpha-Expansion Beta-Shrink Moves
CoRR, 2011
Generalized Fast Approximate Energy Minimization via Graph Cuts: a-Expansion b-Shrink Moves.
Proceedings of the UAI 2011, 2011
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011
2010
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010
Proceedings of the Causality: Objectives and Assessment (NIPS 2008 Workshop), 2010
2009
Optimizing Costly Functions with Simple Constraints: A Limited-Memory Projected Quasi-Newton Algorithm.
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009
Proceedings of the UAI 2009, 2009
Increased discrimination in level set methods with embedded conditional random fields.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009
2008
Proceedings of the Advances in Neural Information Processing Systems 21, 2008
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008
2007
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007
Fast Optimization Methods for L1 Regularization: A Comparative Study and Two New Approaches.
Proceedings of the Machine Learning: ECML 2007, 2007
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007
2006
Proceedings of the Machine Learning, 2006
Proceedings of the Advances in Artificial Intelligence, 2006
2005
Proceedings of the Knowledge Discovery in Databases: PKDD 2005, 2005
Proceedings of the Fourth International Conference on Machine Learning and Applications, 2005
Proceedings of the Computer Vision for Biomedical Image Applications, 2005