# Yoram Singer

According to our database

Collaborative distances:

^{1}, Yoram Singer authored at least 145 papers between 1993 and 2018.Collaborative distances:

## Timeline

#### Legend:

Book In proceedings Article PhD thesis Other## Links

#### Homepages:

#### On csauthors.net:

## Bibliography

2018

The Well Tempered Lasso.

CoRR, 2018

Shampoo: Preconditioned Stochastic Tensor Optimization.

CoRR, 2018

The Well-Tempered Lasso.

Proceedings of the 35th International Conference on Machine Learning, 2018

Shampoo: Preconditioned Stochastic Tensor Optimization.

Proceedings of the 35th International Conference on Machine Learning, 2018

2017

A Unified Approach to Adaptive Regularization in Online and Stochastic Optimization.

CoRR, 2017

Random Features for Compositional Kernels.

CoRR, 2017

2016

A Stochastic Quasi-Newton Method for Large-Scale Optimization.

SIAM Journal on Optimization, 2016

LLORMA: Local Low-Rank Matrix Approximation.

Journal of Machine Learning Research, 2016

Sketching and Neural Networks.

CoRR, 2016

Toward Deeper Understanding of Neural Networks: The Power of Initialization and a Dual View on Expressivity.

CoRR, 2016

Toward Deeper Understanding of Neural Networks: The Power of Initialization and a Dual View on Expressivity.

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Train faster, generalize better: Stability of stochastic gradient descent.

Proceedings of the 33nd International Conference on Machine Learning, 2016

2015

Train faster, generalize better: Stability of stochastic gradient descent.

CoRR, 2015

2014

A Stochastic Quasi-Newton Method for Large-Scale Optimization.

CoRR, 2014

Local collaborative ranking.

Proceedings of the 23rd International World Wide Web Conference, 2014

2013

Update Rules for Parameter Estimation in Bayesian Networks

CoRR, 2013

Switching Portfolios

CoRR, 2013

Matrix Approximation under Local Low-Rank Assumption

CoRR, 2013

Zero-Shot Learning by Convex Combination of Semantic Embeddings.

CoRR, 2013

The Maximum Entropy Relaxation Path.

CoRR, 2013

Using Web Co-occurrence Statistics for Improving Image Categorization.

CoRR, 2013

Parallel Boosting with Momentum.

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013

Local Low-Rank Matrix Approximation.

Proceedings of the 30th International Conference on Machine Learning, 2013

Efficient Learning of Sparse Ranking Functions.

Proceedings of the Empirical Inference - Festschrift in Honor of Vladimir N. Vapnik, 2013

2011

Pegasos: primal estimated sub-gradient solver for SVM.

Math. Program., 2011

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization.

Journal of Machine Learning Research, 2011

Learning to Order Things

CoRR, 2011

Entire Relaxation Path for Maximum Entropy Problems.

Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

2010

On the equivalence of weak learnability and linear separability: new relaxations and efficient boosting algorithms.

Machine Learning, 2010

Composite Objective Mirror Descent.

Proceedings of the COLT 2010, 2010

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization.

Proceedings of the COLT 2010, 2010

2009

Individual sequence prediction using memory-efficient context trees.

IEEE Trans. Information Theory, 2009

Efficient Online and Batch Learning Using Forward Backward Splitting.

Journal of Machine Learning Research, 2009

Efficient Learning using Forward-Backward Splitting.

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Group Sparse Coding.

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Boosting with structural sparsity.

Proceedings of the 26th Annual International Conference on Machine Learning, 2009

2008

The Forgetron: A Kernel-Based Perceptron on a Budget.

SIAM J. Comput., 2008

Online Learning of Complex Prediction Problems Using Simultaneous Projections.

Journal of Machine Learning Research, 2008

Proceedings of the Machine Learning, 2008

On the Equivalence of Weak Learnability and Linear Separability: New Relaxations and Efficient Boosting Algorithms.

Proceedings of the 21st Annual Conference on Learning Theory, 2008

2007

A Large Margin Algorithm for Speech-to-Phoneme and Music-to-Score Alignment.

IEEE Trans. Audio, Speech & Language Processing, 2007

A primal-dual perspective of online learning algorithms.

Machine Learning, 2007

A Unified Algorithmic Approach for Efficient Online Label Ranking.

Proceedings of the Eleventh International Conference on Artificial Intelligence and Statistics, 2007

Online Learning of Multiple Tasks with a Shared Loss.

Journal of Machine Learning Research, 2007

A Boosting Algorithm for Label Covering in Multilabel Problems.

Proceedings of the Eleventh International Conference on Artificial Intelligence and Statistics, 2007

Pegasos: Primal Estimated sub-GrAdient SOlver for SVM.

Proceedings of the Machine Learning, 2007

Learning Globally-Consistent Local Distance Functions for Shape-Based Image Retrieval and Classification.

Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

2006

Efficient Learning of Label Ranking by Soft Projections onto Polyhedra.

Journal of Machine Learning Research, 2006

Online Passive-Aggressive Algorithms.

Journal of Machine Learning Research, 2006

Convex Repeated Games and Fenchel Duality.

Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Image Retrieval and Classification Using Local Distance Functions.

Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Support Vector Machines on a Budget.

Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Online Classification for Complex Problems Using Simultaneous Projections.

Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Discriminative kernel-based phoneme sequence recognition.

Proceedings of the INTERSPEECH 2006, 2006

Online multiclass learning by interclass hypothesis sharing.

Proceedings of the Machine Learning, 2006

Online Learning Meets Optimization in the Dual.

Proceedings of the Learning Theory, 19th Annual Conference on Learning Theory, 2006

Online Multitask Learning.

Proceedings of the Learning Theory, 19th Annual Conference on Learning Theory, 2006

2005

Spikernels: Predicting Arm Movements by Embedding Population Spike Rate Patterns in Inner-Product Spaces.

Neural Computation, 2005

Online Ranking by Projecting.

Neural Computation, 2005

Smooth epsiloon-Insensitive Regression by Loss Symmetrization.

Journal of Machine Learning Research, 2005

The Forgetron: A Kernel-Based Perceptron on a Fixed Budget.

Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005

Data-Driven Online to Batch Conversions.

Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005

Phoneme alignment based on discriminative learning.

Proceedings of the INTERSPEECH 2005, 2005

A New Perspective on an Old Perceptron Algorithm.

Proceedings of the Learning Theory, 18th Annual Conference on Learning Theory, 2005

Loss Bounds for Online Category Ranking.

Proceedings of the Learning Theory, 18th Annual Conference on Learning Theory, 2005

2004

A Temporal Kernel-Based Model for Tracking Hand Movements from Neural Activities.

Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

The Power of Selective Memory: Self-Bounded Learning of Prediction Suffix Trees.

Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

An Online Algorithm for Hierarchical Phoneme Classification.

Proceedings of the Machine Learning for Multimodal Interaction, 2004

Learning to Align Polyphonic Music.

Proceedings of the ISMIR 2004, 2004

Online and batch learning of pseudo-metrics.

Proceedings of the Machine Learning, 2004

Leveraging the margin more carefully.

Proceedings of the Machine Learning, 2004

Large margin hierarchical classification.

Proceedings of the Machine Learning, 2004

2003

An Efficient Boosting Algorithm for Combining Preferences.

Journal of Machine Learning Research, 2003

A Family of Additive Online Algorithms for Category Ranking.

Journal of Machine Learning Research, 2003

Ultraconservative Online Algorithms for Multiclass Problems.

Journal of Machine Learning Research, 2003

Protein Family Classification Using Sparse Markov Transducers.

Journal of Computational Biology, 2003

Online Passive-Aggressive Algorithms.

Proceedings of the Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, 2003

Log-Linear Models for Label Ranking.

Proceedings of the Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, 2003

Online Classification on a Budget.

Proceedings of the Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, 2003

Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network.

Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Smooth e-Intensive Regression by Loss Symmetrization.

Proceedings of the Computational Learning Theory and Kernel Machines, 2003

Learning Algorithm for Enclosing Points in Bregmanian Spheres.

Proceedings of the Computational Learning Theory and Kernel Machines, 2003

2002

On the Learnability and Design of Output Codes for Multiclass Problems.

Machine Learning, 2002

Logistic Regression, AdaBoost and Bregman Distances.

Machine Learning, 2002

Using Substitution Matrices to Estimate Probability Distributions for Biological Sequences.

Journal of Computational Biology, 2002

Robust temporal and spectral modeling for query By melody.

Proceedings of the SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

A new family of online algorithms for category ranking.

Proceedings of the SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

Spikernels: Embedding Spiking Neurons in Inner-Product Spaces.

Proceedings of the Advances in Neural Information Processing Systems 15 [Neural Information Processing Systems, 2002

Multiclass Learning by Probabilistic Embeddings.

Proceedings of the Advances in Neural Information Processing Systems 15 [Neural Information Processing Systems, 2002

Kernel Design Using Boosting.

Proceedings of the Advances in Neural Information Processing Systems 15 [Neural Information Processing Systems, 2002

Discriminative Binaural Sound Localization.

An Efficient PAC Algorithm for Reconstructing a Mixture of Lines.

Proceedings of the Algorithmic Learning Theory, 13th International Conference, 2002

2001

Guest Editor's Introduction.

Machine Learning, 2001

On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines.

Journal of Machine Learning Research, 2001

Pranking with Ranking.

Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001

Using mixtures of common ancestors for estimating the probabilities of discrete events in biological sequences.

Proceedings of the Ninth International Conference on Intelligent Systems for Molecular Biology, 2001

Ultraconservative Online Algorithms for Multiclass Problems.

Proceedings of the Computational Learning Theory, 2001

2000

BoosTexter: A Boosting-based System for Text Categorization.

Machine Learning, 2000

Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers.

Journal of Machine Learning Research, 2000

Improved Output Coding for Classification Using Continuous Relaxation.

Proceedings of the Advances in Neural Information Processing Systems 13, 2000

Protein Family Classification Using Sparse Markov Transducers.

Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology, 2000

State-based Classification of Finger Gestures from Electromyographic Signals.

Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers.

Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

On the Learnability and Design of Output Codes for Multiclass Problems.

Proceedings of the Thirteenth Annual Conference on Computational Learning Theory (COLT 2000), June 28, 2000

Logistic Regression, AdaBoost and Bregman Distances.

Proceedings of the Thirteenth Annual Conference on Computational Learning Theory (COLT 2000), June 28, 2000

Boosting for Document Routing.

Proceedings of the 2000 ACM CIKM International Conference on Information and Knowledge Management, 2000

1999

Context-Sensitive Learning Methods for Text Categorization.

ACM Trans. Inf. Syst., 1999

Improved Boosting Algorithms Using Confidence-rated Predictions.

Machine Learning, 1999

An Efficient Extension to Mixture Techniques for Prediction and Decision Trees.

Machine Learning, 1999

Learning to Order Things.

J. Artif. Intell. Res., 1999

Leveraged Vector Machines.

Proceedings of the Advances in Neural Information Processing Systems 12, [NIPS Conference, Denver, Colorado, USA, November 29, 1999

Unsupervised Models for Named Entity Classification.

Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 1999

Boosting Applied to Tagging and PP Attachment.

Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 1999

A Simple, Fast, and Effictive Rule Learner.

Proceedings of the Sixteenth National Conference on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence, 1999

1998

The Hierarchical Hidden Markov Model: Analysis and Applications.

Machine Learning, 1998

On the Learnability and Usage of Acyclic Probabilistic Finite Automata.

J. Comput. Syst. Sci., 1998

Switching Portfolios.

Proceedings of the UAI '98: Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, 1998

Boosting and Rocchio Applied to Text Filtering.

Proceedings of the SIGIR '98: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1998

Batch and On-Line Parameter Estimation of Gaussian Mixtures Based on the Joint Entropy.

Proceedings of the Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30, 1998

Efficient Bayesian Parameter Estimation in Large Discrete Domains.

Proceedings of the Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30, 1998

An Efficient Boosting Algorithm for Combining Preferences.

Proceedings of the Fifteenth International Conference on Machine Learning (ICML 1998), 1998

Improved Boosting Algorithms using Confidence-Rated Predictions.

Proceedings of the Eleventh Annual Conference on Computational Learning Theory, 1998

1997

Adaptive Mixtures of Probabilistic Transducers.

Neural Computation, 1997

A Comparison of New and Old Algorithms for a Mixture Estimation Problem.

Machine Learning, 1997

Switching Portfolios.

Int. J. Neural Syst., 1997

Update Rules for Parameter Estimation in Bayesian Networks.

Proceedings of the UAI '97: Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence, 1997

Using and Combining Predictors That Specialize.

Proceedings of the Twenty-Ninth Annual ACM Symposium on the Theory of Computing, 1997

Learning to Order Things.

Proceedings of the Advances in Neural Information Processing Systems 10, 1997

Shared Context Probabilistic Transducers.

Proceedings of the Advances in Neural Information Processing Systems 10, 1997

An Efficient Extension to Mixture Techniques for Prediction and Decision Trees.

Proceedings of the Tenth Annual Conference on Computational Learning Theory, 1997

1996

The Power of Amnesia: Learning Probabilistic Automata with Variable Memory Length.

Machine Learning, 1996

Beyond Word N-Grams

CoRR, 1996

Context-sensitive Learning Methods for Text Categorization.

Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1996

Training Algorithms for Hidden Markov Models using Entropy Based Distance Functions.

Proceedings of the Advances in Neural Information Processing Systems 9, 1996

On-Line Portfolio Selection Using Multiplicative Updates.

Proceedings of the Machine Learning, 1996

1995

Adaptive Mixture of Probabilistic Transducers.

Proceedings of the Advances in Neural Information Processing Systems 8, 1995

On the Learnability and Usage of Acyclic Probabilistic Finite Automata.

Proceedings of the Eigth Annual Conference on Computational Learning Theory, 1995

A Comparison of New and Old Algorithms for a Mixture Estimation Problem.

Proceedings of the Eigth Annual Conference on Computational Learning Theory, 1995

Beyond Word N-Grams.

Proceedings of the Third Workshop on Very Large Corpora, 1995

1994

Dynamical encoding of cursive handwriting.

Biological Cybernetics, 1994

Learning Probabilistic Automata with Variable Memory Length.

Proceedings of the Seventh Annual ACM Conference on Computational Learning Theory, 1994

Part-of-Speech Tagging using a Variable Memory Markov Model.

Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, 1994

1993

Decoding Cursive Scripts.

Proceedings of the Advances in Neural Information Processing Systems 6, 1993

The Power of Amnesia.

Proceedings of the Advances in Neural Information Processing Systems 6, 1993

Dynamical encoding of cursive handwriting.

Proceedings of the Conference on Computer Vision and Pattern Recognition, 1993