Yoram Singer

Affiliations:
  • Google
  • Hebrew University of Jerusalem, Israel


According to our database, Yoram Singer authored at least 133 papers between 1992 and 2022.


Bibliography

2022
Are All Layers Created Equal?
J. Mach. Learn. Res., 2022

2020
Second Order Optimization Made Practical.
CoRR, 2020

Identity Crisis: Memorization and Generalization Under Extreme Overparameterization.
Proceedings of the 8th International Conference on Learning Representations, 2020

Exponentiated Gradient Meets Gradient Descent.
Proceedings of the Algorithmic Learning Theory, 2020

Proximity Preserving Binary Code Using Signed Graph-Cut.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Convolutional Bipartite Attractor Networks.
CoRR, 2019

Identity Crisis: Memorization and Generalization under Extreme Overparameterization.
CoRR, 2019

Memory-Efficient Adaptive Optimization for Large-Scale Learning.
CoRR, 2019

Memory Efficient Adaptive Optimization.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
The Well-Tempered Lasso.
Proceedings of the 35th International Conference on Machine Learning, 2018

Shampoo: Preconditioned Stochastic Tensor Optimization.
Proceedings of the 35th International Conference on Machine Learning, 2018

Learning a neural response metric for retinal prosthesis.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
A Unified Approach to Adaptive Regularization in Online and Stochastic Optimization.
CoRR, 2017

Random Features for Compositional Kernels.
CoRR, 2017

Short and Deep: Sketching and Neural Networks.
Proceedings of the 5th International Conference on Learning Representations, 2017

2016
A Stochastic Quasi-Newton Method for Large-Scale Optimization.
SIAM J. Optim., 2016

LLORMA: Local Low-Rank Matrix Approximation.
J. Mach. Learn. Res., 2016

Sketching and Neural Networks.
CoRR, 2016

Toward Deeper Understanding of Neural Networks: The Power of Initialization and a Dual View on Expressivity.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Train faster, generalize better: Stability of stochastic gradient descent.
Proceedings of the 33rd International Conference on Machine Learning, 2016

2014
Zero-Shot Learning by Convex Combination of Semantic Embeddings.
Proceedings of the 2nd International Conference on Learning Representations, 2014

Local collaborative ranking.
Proceedings of the 23rd International World Wide Web Conference, 2014

2013
Matrix Approximation under Local Low-Rank Assumption.
Proceedings of the 1st International Conference on Learning Representations, 2013

The Maximum Entropy Relaxation Path.
CoRR, 2013

Using Web Co-occurrence Statistics for Improving Image Categorization.
CoRR, 2013

Parallel Boosting with Momentum.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013

Local Low-Rank Matrix Approximation.
Proceedings of the 30th International Conference on Machine Learning, 2013

Efficient Learning of Sparse Ranking Functions.
Proceedings of the Empirical Inference - Festschrift in Honor of Vladimir N. Vapnik, 2013

2011
Pegasos: primal estimated sub-gradient solver for SVM.
Math. Program., 2011

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization.
J. Mach. Learn. Res., 2011

Entire Relaxation Path for Maximum Entropy Problems.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

2010
On the equivalence of weak learnability and linear separability: new relaxations and efficient boosting algorithms.
Mach. Learn., 2010

Composite Objective Mirror Descent.
Proceedings of the COLT 2010, 2010

2009
Individual sequence prediction using memory-efficient context trees.
IEEE Trans. Inf. Theory, 2009

Efficient Online and Batch Learning Using Forward Backward Splitting.
J. Mach. Learn. Res., 2009

Efficient Learning using Forward-Backward Splitting.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Group Sparse Coding.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Boosting with structural sparsity.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

2008
The Forgetron: A Kernel-Based Perceptron on a Budget.
SIAM J. Comput., 2008

Online Learning of Complex Prediction Problems Using Simultaneous Projections.
J. Mach. Learn. Res., 2008

Efficient projections onto the ℓ1-ball for learning in high dimensions.
Proceedings of the Machine Learning, 2008

2007
A Large Margin Algorithm for Speech-to-Phoneme and Music-to-Score Alignment.
IEEE Trans. Speech Audio Process., 2007

A primal-dual perspective of online learning algorithms.
Mach. Learn., 2007

A Unified Algorithmic Approach for Efficient Online Label Ranking.
Proceedings of the Eleventh International Conference on Artificial Intelligence and Statistics, 2007

Online Learning of Multiple Tasks with a Shared Loss.
J. Mach. Learn. Res., 2007

A Boosting Algorithm for Label Covering in Multilabel Problems.
Proceedings of the Eleventh International Conference on Artificial Intelligence and Statistics, 2007

Pegasos: Primal Estimated sub-GrAdient SOlver for SVM.
Proceedings of the Machine Learning, 2007

Learning Globally-Consistent Local Distance Functions for Shape-Based Image Retrieval and Classification.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

2006
Efficient Learning of Label Ranking by Soft Projections onto Polyhedra.
J. Mach. Learn. Res., 2006

Online Passive-Aggressive Algorithms.
J. Mach. Learn. Res., 2006

Convex Repeated Games and Fenchel Duality.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Image Retrieval and Classification Using Local Distance Functions.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Support Vector Machines on a Budget.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Online Classification for Complex Problems Using Simultaneous Projections.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Discriminative kernel-based phoneme sequence recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Online multiclass learning by interclass hypothesis sharing.
Proceedings of the Machine Learning, 2006

Online Learning Meets Optimization in the Dual.
Proceedings of the Learning Theory, 19th Annual Conference on Learning Theory, 2006

Online Multitask Learning.
Proceedings of the Learning Theory, 19th Annual Conference on Learning Theory, 2006

2005
Spikernels: Predicting Arm Movements by Embedding Population Spike Rate Patterns in Inner-Product Spaces.
Neural Comput., 2005

Online Ranking by Projecting.
Neural Comput., 2005

Smooth epsilon-Insensitive Regression by Loss Symmetrization.
J. Mach. Learn. Res., 2005

The Forgetron: A Kernel-Based Perceptron on a Fixed Budget.
Proceedings of the Advances in Neural Information Processing Systems 18, 2005

Data-Driven Online to Batch Conversions.
Proceedings of the Advances in Neural Information Processing Systems 18, 2005

Phoneme alignment based on discriminative learning.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

A New Perspective on an Old Perceptron Algorithm.
Proceedings of the Learning Theory, 18th Annual Conference on Learning Theory, 2005

Loss Bounds for Online Category Ranking.
Proceedings of the Learning Theory, 18th Annual Conference on Learning Theory, 2005

2004
A Temporal Kernel-Based Model for Tracking Hand Movements from Neural Activities.
Proceedings of the Advances in Neural Information Processing Systems 17, 2004

The Power of Selective Memory: Self-Bounded Learning of Prediction Suffix Trees.
Proceedings of the Advances in Neural Information Processing Systems 17, 2004

An Online Algorithm for Hierarchical Phoneme Classification.
Proceedings of the Machine Learning for Multimodal Interaction, 2004

Learning to Align Polyphonic Music.
Proceedings of the ISMIR 2004, 2004

Online and batch learning of pseudo-metrics.
Proceedings of the Machine Learning, 2004

Leveraging the margin more carefully.
Proceedings of the Machine Learning, 2004

Large margin hierarchical classification.
Proceedings of the Machine Learning, 2004

2003
An Efficient Boosting Algorithm for Combining Preferences.
J. Mach. Learn. Res., 2003

A Family of Additive Online Algorithms for Category Ranking.
J. Mach. Learn. Res., 2003

Ultraconservative Online Algorithms for Multiclass Problems.
J. Mach. Learn. Res., 2003

Protein Family Classification Using Sparse Markov Transducers.
J. Comput. Biol., 2003

Online Passive-Aggressive Algorithms.
Proceedings of the Advances in Neural Information Processing Systems 16, 2003

Log-Linear Models for Label Ranking.
Proceedings of the Advances in Neural Information Processing Systems 16, 2003

Online Classification on a Budget.
Proceedings of the Advances in Neural Information Processing Systems 16, 2003

Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Smooth epsilon-Insensitive Regression by Loss Symmetrization.
Proceedings of the Computational Learning Theory and Kernel Machines, 2003

Learning Algorithm for Enclosing Points in Bregmanian Spheres.
Proceedings of the Computational Learning Theory and Kernel Machines, 2003

2002
On the Learnability and Design of Output Codes for Multiclass Problems.
Mach. Learn., 2002

Logistic Regression, AdaBoost and Bregman Distances.
Mach. Learn., 2002

Using Substitution Matrices to Estimate Probability Distributions for Biological Sequences.
J. Comput. Biol., 2002

Robust temporal and spectral modeling for query by melody.
Proceedings of the SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

A new family of online algorithms for category ranking.
Proceedings of the SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

Spikernels: Embedding Spiking Neurons in Inner-Product Spaces.
Proceedings of the Advances in Neural Information Processing Systems 15, 2002

Multiclass Learning by Probabilistic Embeddings.
Proceedings of the Advances in Neural Information Processing Systems 15, 2002

Kernel Design Using Boosting.
Proceedings of the Advances in Neural Information Processing Systems 15, 2002

Discriminative Binaural Sound Localization.
Proceedings of the Advances in Neural Information Processing Systems 15, 2002

An Efficient PAC Algorithm for Reconstructing a Mixture of Lines.
Proceedings of the Algorithmic Learning Theory, 13th International Conference, 2002

2001
Guest Editor's Introduction.
Mach. Learn., 2001

On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines.
J. Mach. Learn. Res., 2001

Pranking with Ranking.
Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001

Using mixtures of common ancestors for estimating the probabilities of discrete events in biological sequences.
Proceedings of the Ninth International Conference on Intelligent Systems for Molecular Biology, 2001

2000
BoosTexter: A Boosting-based System for Text Categorization.
Mach. Learn., 2000

Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers.
J. Mach. Learn. Res., 2000

Improved Output Coding for Classification Using Continuous Relaxation.
Proceedings of the Advances in Neural Information Processing Systems 13, 2000

State-based Classification of Finger Gestures from Electromyographic Signals.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

Boosting for Document Routing.
Proceedings of the 2000 ACM CIKM International Conference on Information and Knowledge Management, 2000

1999
Context-Sensitive Learning Methods for Text Categorization.
ACM Trans. Inf. Syst., 1999

Improved Boosting Algorithms Using Confidence-rated Predictions.
Mach. Learn., 1999

An Efficient Extension to Mixture Techniques for Prediction and Decision Trees.
Mach. Learn., 1999

Learning to Order Things.
J. Artif. Intell. Res., 1999

Leveraged Vector Machines.
Proceedings of the Advances in Neural Information Processing Systems 12, [NIPS Conference, Denver, Colorado, USA, November 29, 1999

Unsupervised Models for Named Entity Classification.
Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 1999

Boosting Applied to Tagging and PP Attachment.
Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 1999

A Simple, Fast, and Effective Rule Learner.
Proceedings of the Sixteenth National Conference on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence, 1999

1998
The Hierarchical Hidden Markov Model: Analysis and Applications.
Mach. Learn., 1998

On the Learnability and Usage of Acyclic Probabilistic Finite Automata.
J. Comput. Syst. Sci., 1998

Boosting and Rocchio Applied to Text Filtering.
Proceedings of the SIGIR '98: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1998

Batch and On-Line Parameter Estimation of Gaussian Mixtures Based on the Joint Entropy.
Proceedings of the Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30, 1998

Efficient Bayesian Parameter Estimation in Large Discrete Domains.
Proceedings of the Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30, 1998

1997
Adaptive Mixtures of Probabilistic Transducers.
Neural Comput., 1997

A Comparison of New and Old Algorithms for a Mixture Estimation Problem.
Mach. Learn., 1997

Switching Portfolios.
Int. J. Neural Syst., 1997

Update Rules for Parameter Estimation in Bayesian Networks.
Proceedings of the UAI '97: Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence, 1997

Using and Combining Predictors That Specialize.
Proceedings of the Twenty-Ninth Annual ACM Symposium on the Theory of Computing, 1997

Shared Context Probabilistic Transducers.
Proceedings of the Advances in Neural Information Processing Systems 10, 1997

1996
The Power of Amnesia: Learning Probabilistic Automata with Variable Memory Length.
Mach. Learn., 1996

Training Algorithms for Hidden Markov Models using Entropy Based Distance Functions.
Proceedings of the Advances in Neural Information Processing Systems 9, 1996

On-Line Portfolio Selection Using Multiplicative Updates.
Proceedings of the Machine Learning, 1996

1995
"What has been will be again": a machine learning approach to the analysis of natural language.
PhD thesis, 1995

Adaptive Mixture of Probabilistic Transducers.
Proceedings of the Advances in Neural Information Processing Systems 8, 1995

Beyond Word N-Grams.
Proceedings of the Third Workshop on Very Large Corpora, 1995

1994
Dynamical encoding of cursive handwriting.
Biol. Cybern., 1994

Learning Probabilistic Automata with Variable Memory Length.
Proceedings of the Seventh Annual ACM Conference on Computational Learning Theory, 1994

Part-of-Speech Tagging using a Variable Memory Markov Model.
Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, 1994

1993
Decoding Cursive Scripts.
Proceedings of the Advances in Neural Information Processing Systems 6, 1993

The Power of Amnesia.
Proceedings of the Advances in Neural Information Processing Systems 6, 1993

1992
Learning class probabilities from labeled data.
Proceedings of the 11th IAPR International Conference on Pattern Recognition, 1992

