Martin Jaggi

According to our database, Martin Jaggi authored at least 82 papers between 2009 and 2020.

Bibliography

2020
Masking as an Efficient Alternative to Finetuning for Pretrained Language Models.
CoRR, 2020

Data Parallelism in Training Sparse Neural Networks.
CoRR, 2020

A Unified Theory of Decentralized SGD with Changing Topology and Local Updates.
CoRR, 2020

Evaluating The Search Phase of Neural Architecture Search.
Proceedings of the 8th International Conference on Learning Representations, 2020

Don't Use Large Mini-batches, Use Local SGD.
Proceedings of the 8th International Conference on Learning Representations, 2020

Dynamic Model Pruning with Feedback.
Proceedings of the 8th International Conference on Learning Representations, 2020

Decentralized Deep Learning with Arbitrary Communication Compression.
Proceedings of the 8th International Conference on Learning Representations, 2020

On the Relationship between Self-Attention and Convolutional Layers.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Unsupervised robust nonparametric learning of hidden community properties.
Math. Found. Comput., 2019

Robust Cross-lingual Embeddings from Parallel Sentences.
CoRR, 2019

Advances and Open Problems in Federated Learning.
CoRR, 2019

On the Tunability of Optimizers in Deep Learning.
CoRR, 2019

Model Fusion via Optimal Transport.
CoRR, 2019

Correlating Twitter Language with Community-Level Health Outcomes.
CoRR, 2019

SysML: The New Frontier of Machine Learning Systems.
CoRR, 2019

Structure Tree-LSTM: Structure-aware Attentional Document Encoders.
CoRR, 2019

Forecasting intracranial hypertension using multi-scale waveform metrics.
CoRR, 2019

Crosslingual Document Embedding as Reduced-Rank Ridge Regression.
Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, 2019

PowerSGD: Practical Low-Rank Gradient Compression for Distributed Optimization.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Unsupervised Scalable Representation Learning for Multivariate Time Series.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Better Word Embeddings by Disentangling Contextual n-Gram Information.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Open-Vocabulary Keyword Spotting with Audio and Text Embeddings.
Proceedings of the Interspeech 2019, 2019

Decentralized Stochastic Optimization and Gossip Algorithms with Compressed Communication.
Proceedings of the 36th International Conference on Machine Learning, 2019

Error Feedback Fixes SignSGD and other Gradient Compression Schemes.
Proceedings of the 36th International Conference on Machine Learning, 2019

Overcoming Multi-model Forgetting.
Proceedings of the 36th International Conference on Machine Learning, 2019

Context Mover's Distance & Barycenters: Optimal transport of contexts for building representations.
Proceedings of the Deep Generative Models for Highly Structured Data, 2019

On Linear Learning with Manycore Processors.
Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019

Efficient Greedy Coordinate Descent for Composite Problems.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Optimal Affine-Invariant Smooth Minimization Algorithms.
SIAM J. Optimization, 2018

Wasserstein is all you need.
CoRR, 2018

Don't Use Large Mini-Batches, Use Local SGD.
CoRR, 2018

COLA: Communication-Efficient Decentralized Linear Learning.
CoRR, 2018

Global linear convergence of Newton's method without strong-convexity or Lipschitz gradients.
CoRR, 2018

End-to-End DNN Training with Block Floating Point Arithmetic.
CoRR, 2018

Revisiting First-Order Convex Optimization Over Linear Spaces.
CoRR, 2018

EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings.
CoRR, 2018

Sparsified SGD with Memory.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

COLA: Decentralized Linear Learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Training DNNs with Hybrid Block Floating Point.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Unsupervised Learning of Sentence Embeddings Using Compositional n-Gram Features.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

On Matching Pursuit and Coordinate Descent.
Proceedings of the 35th International Conference on Machine Learning, 2018

A Distributed Second-Order Algorithm You Can Trust.
Proceedings of the 35th International Conference on Machine Learning, 2018

Simple Unsupervised Keyphrase Extraction using Sentence Embeddings.
Proceedings of the 22nd Conference on Computational Natural Language Learning, 2018

Adaptive balancing of gradient and update computation times using global geometry and approximate subproblems.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017
Learning Aerial Image Segmentation From Online Maps.
IEEE Trans. Geosci. Remote. Sens., 2017

Distributed optimization with arbitrary local solvers.
Optimization Methods and Software, 2017

CoCoA: A General Framework for Communication-Efficient Distributed Optimization.
J. Mach. Learn. Res., 2017

An Accelerated Communication-Efficient Primal-Dual Optimization Framework for Structured Machine Learning.
CoRR, 2017

Efficient Use of Limited-Memory Resources to Accelerate Linear Learning.
CoRR, 2017

Unsupervised robust nonparametric learning of hidden community properties.
CoRR, 2017

Leveraging Large Amounts of Weakly Supervised Data for Multi-Language Sentiment Classification.
Proceedings of the 26th International Conference on World Wide Web, 2017

Safe Adaptive Importance Sampling.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Greedy Algorithms for Cone Constrained Optimization with Convergence Guarantees.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Efficient Use of Limited-Memory Accelerators for Linear Learning on Heterogeneous Systems.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Approximate Steepest Coordinate Descent.
Proceedings of the 34th International Conference on Machine Learning, 2017

Faster Coordinate Descent via Adaptive Importance Sampling.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

A Unified Optimization View on Generalized Matching Pursuit and Frank-Wolfe.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

Generating Steganographic Text with LSTMs.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Screening Rules for Convex Problems.
CoRR, 2016

Pursuits in Structured Non-Convex Matrix Factorizations.
CoRR, 2016

SwissCheese at SemEval-2016 Task 4: Sentiment Classification Using an Ensemble of Convolutional Neural Networks with Distant Supervision.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Primal-Dual Rates and Certificates.
Proceedings of the 33rd International Conference on Machine Learning, 2016

Audio Based Bird Species Identification using Deep Learning Techniques.
Proceedings of the Working Notes of CLEF 2016, 2016

2015
L1-Regularized Distributed Optimization: A Communication-Efficient Primal-Dual Framework.
CoRR, 2015

Swiss-Chocolate: Combining Flipout Regularization and Random Forests with Artificially Built Subsystems to Boost Text-Classification for Sentiment.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

On the Global Linear Convergence of Frank-Wolfe Optimization Variants.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Adding vs. Averaging in Distributed Primal-Dual Optimization.
Proceedings of the 32nd International Conference on Machine Learning, 2015

2014
Swiss-Chocolate: Sentiment Detection using Sparse SVMs and Part-Of-Speech n-Grams.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

Communication-Efficient Distributed Dual Coordinate Ascent.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

2013
An Equivalence between the Lasso and Support Vector Machines.
CoRR, 2013

Block-Coordinate Frank-Wolfe Optimization for Structural SVMs.
Proceedings of the 30th International Conference on Machine Learning, 2013

Revisiting Frank-Wolfe: Projection-Free Sparse Convex Optimization.
Proceedings of the 30th International Conference on Machine Learning, 2013

2012
Approximating parameterized convex optimization problems.
ACM Trans. Algorithms, 2012

An Exponential Lower Bound on the Complexity of Regularization Paths.
JoCG, 2012

Regularization Paths with Guarantees for Convex Semidefinite Optimization.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Stochastic Block-Coordinate Frank-Wolfe Optimization for Structural SVMs.
CoRR, 2012

Optimizing over the Growing Spectrahedron.
Proceedings of the Algorithms - ESA 2012, 2012

2011
Convex Optimization without Projection Steps.
CoRR, 2011

2010
A Simple Algorithm for Nuclear Norm Regularized Problems.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

2009
A Combinatorial Algorithm to Compute Regularization Paths.
CoRR, 2009

An Exponential Lower Bound on the Complexity of Regularization Paths.
CoRR, 2009

Coresets for polytope distance.
Proceedings of the 25th ACM Symposium on Computational Geometry, 2009

