Rahul Mazumder

Andrés Gómez

CoRR, April, 2026

Robust Batch-Level Query Routing for Large Language Models under Cost and Capacity Constraints.

Jelena Markovic-Voronov

CoRR, March, 2026

DuaLip-GPU Technical Report.

CoRR, March, 2026

Robust Batch-Level Query Routing for Large Language Models under Cost and Capacity Constraints.

Jelena Markovic-Voronov

Proceedings of the ACM Conference on AI and Agentic Systems, 2026

2025

Theoretical Compression Bounds for Wide Multilayer Perceptrons.

Houssam El Cheairi

David Gamarnik

CoRR, December, 2025

Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction.

CoRR, September, 2025

Extracting Interpretable Models from Tree Ensembles: Computational and Statistical Perspectives.

CoRR, June, 2025

TSENOR: Highly-Efficient Algorithm for Finding Transposable N:M Sparse Masks.

Mehdi Makni

CoRR, May, 2025

An Optimization Framework for Differentially Private Sparse Fine-Tuning.

CoRR, March, 2025

Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications.

CoRR, February, 2025

HASSLE-free: A unified Framework for Sparse plus Low-Rank Matrix Decomposition for LLMs.

CoRR, February, 2025

Nonparametric Finite Mixture Models with Possible Shape Constraints: A Cubic Newton Approach.

SIAM J. Math. Data Sci., 2025

Randomization Can Reduce Both Bias and Variance: A Case Study in Random Forests.

J. Mach. Learn. Res., 2025

Efficient Algorithms for Leveraging LLMs for Generative and Predictive Recommender Systems.

Proceedings of the Companion Proceedings of the ACM on Web Conference 2025, 2025

Differentially Private High-dimensional Variable Selection via Integer Programming.

Petros Prastakos

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

TSENOR: Highly-Efficient Algorithm for Finding Transposable N:M Sparse Masks.

Mehdi Makni

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

3BASiL: An Algorithmic Framework for Sparse plus Low-Rank Compression of LLMs.

Mehdi Makni

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

SPARTA: An Optimization Framework for Differentially Private Sparse Fine-Tuning.

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

MOSS: Multi-Objective Optimization for Stable Rule Sets.

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

Preserving Deep Representations in One-Shot Pruning: A Hessian-Free Second-Order Optimization Framework.

Ryan Lucas

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Scaling Down, Serving Fast: Compressing and Deploying Efficient LLMs for Recommendation Systems.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

A unified framework for Sparse plus Low-Rank Matrix Decomposition for LLMs.

Proceedings of the Conference on Parsimony and Learning, 2025

2024

A new computational framework for log-concave density estimation.

Wenyu Chen

Richard J. Samworth

Math. Program. Comput., June, 2024

PolyCD: Optimization via Cycling through the Vertices of a Polytope.

SIAM J. Optim., 2024

Subgradient Regularized Multivariate Convex Regression at Scale.

Wenyu Chen

SIAM J. Optim., 2024

Sparse NMF with Archetypal Regularization: Computational and Robustness Properties.

J. Mach. Learn. Res., 2024

Efficient user history modeling with amortized inference for deep learning recommendation models.

CoRR, 2024

FFSplit: Split Feed-Forward Network For Optimizing Accuracy-Efficiency Trade-off in Language Model Inference.

Zirui Liu

Qingquan Song

Qiang Charles Xiao

Xia Hu

CoRR, 2024

ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

FAST: An Optimization Framework for Fast Additive Segmentation in Transparent ML.

Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

OSSCAR: One-Shot Structured Pruning in Vision and Language Models with Combinatorial Optimization.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

FALCON: FLOP-Aware Combinatorial Optimization for Neural Network Pruning.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

End-to-end Feature Selection Approach for Learning Skinny Trees.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023

Linear regression with partially mismatched data: local search with theoretical guarantees.

Math. Program., February, 2023

Subset Selection with Shrinkage: Sparse Linear Modeling When the SNR Is Low.

Oper. Res., January, 2023

L0Learn: A Scalable Package for Sparse Learning using L0 Regularization.

Tim Nonet

J. Mach. Learn. Res., 2023

QuantEase: Optimization-based Quantization for Language Models - An Efficient and Intuitive Algorithm.

Ayan Acharya

CoRR, 2023

Sparse Gaussian Graphical Models with Discrete Optimization: Computational and Statistical Perspectives.

Wenyu Chen

CoRR, 2023

Matrix Completion from General Deterministic Sampling Patterns.

CoRR, 2023

Sharpness-Aware Minimization: An Implicit Regularization Perspective.

CoRR, 2023

mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization.

CoRR, 2023

Promoting Inactive Members in Edge-Building Marketplace.

Parag Agrawal

Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

On the Convergence of CART under Sufficient Impurity Decrease Condition.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

GRAND-SLAMIN' Interpretable Additive Modeling with Structural Constraints.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

FIRE: An Optimization Approach for Fast Interpretable Rule Extraction.

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

COMET: Learning Cardinality Constrained Mixture of Experts with Trees and Local Search.

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Practical Design of Performant Recommender Systems using Large-scale Linear Programming-based Global Inference.

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Fast as CHITA: Neural Network Pruning with Combinatorial Optimization.

Proceedings of the International Conference on Machine Learning, 2023

Dyn-GWN: Time-Series Forecasting using Time-varying Graphs with Applications to Finance and Traffic Prediction.

Max Tell

Proceedings of the 4th ACM International Conference on AI in Finance, 2023

Dynamic Covariance Estimation under Structural Assumptions via a Joint Optimization Approach.

Proceedings of the 4th ACM International Conference on AI in Finance, 2023

Optimizing for Member Value in an Edge Building Marketplace.

Parag Agrawal

Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

ForestPrune: Compact Depth-Pruned Tree Ensembles.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022

Frank-Wolfe Methods with an Unbounded Feasible Region and Applications to Structured Learning.

Haihao Lu

SIAM J. Optim., December, 2022

Sparse regression at scale: branch-and-bound rooted in first-order optimization.

Ali Saab

Math. Program., 2022

Solving L1-regularized SVMs and Related Linear Programs: Revisiting the Effectiveness of Column and Constraint Generation.

J. Mach. Learn. Res., 2022

Using ℓ1-Relaxation and Integer Programming to Obtain Dual Bounds for Sparse PCA.

Santanu S. Dey

Guanyi Wang

Oper. Res., 2022

Improved Deep Neural Network Generalization Using m-Sharpness-Aware Minimization.

CoRR, 2022

ForestPrune: Compact Depth-Controlled Tree Ensembles.

CoRR, 2022

Newer is Not Always Better: Rethinking Transferability Metrics, Their Peculiarities, Stability and Performance.

Natalia Ponomareva

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

Pushing the limits of fairness impossibility: Who's the fairest of them all?

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Flexible Modeling and Multitask Learning using Differentiable Tree Ensembles.

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Quant-BnB: A Scalable Branch-and-Bound Method for Optimal Decision Trees with Continuous Features.

Proceedings of the International Conference on Machine Learning, 2022

Knowledge Graph Guided Simultaneous Forecasting and Network Learning for Multivariate Financial Time Series.

Proceedings of the 3rd ACM International Conference on AI in Finance, 2022

2021

Learning Sparse Classifiers: Continuous and Mixed Integer Optimization Perspectives.

J. Mach. Learn. Res., 2021

Optimal Ensemble Construction for Multi-Study Prediction with Applications to COVID-19 Excess Mortality Estimation.

CoRR, 2021

Predicting Census Survey Response Rates via Interpretable Nonparametric Additive Models with Structured Interactions.

CoRR, 2021

Grouped Variable Selection with Discrete Optimization: Computational and Statistical Perspectives.

CoRR, 2021

Archetypal Analysis for Sparse Nonnegative Matrix Factorization: Robustness Under Misspecification.

CoRR, 2021

DSelect-k: Differentiable Selection in the Mixture of Experts with Applications to Multi-Task Learning.

Zhe Zhao

Aakanksha Chowdhery

Maheswaran Sathiamoorthy

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Linear Regression with Mismatched Data: A Provably Optimal Local Search Algorithm.

Proceedings of the Integer Programming and Combinatorial Optimization, 2021

2020

Randomized Gradient Boosting Machine.

Haihao Lu

SIAM J. Optim., 2020

Matrix completion with nonconvex regularization: spectral operators and scalable algorithms.

Diego Saldana

Haolei Weng

Stat. Comput., 2020

Fast Best Subset Selection: Coordinate Descent and Local Combinatorial Optimization Algorithms.

Oper. Res., 2020

The Tree Ensemble Layer: Differentiability meets Conditional Computation.

Proceedings of the 37th International Conference on Machine Learning, 2020

ECLIPSE: An Extreme-Scale Linear Program Solver for Web-Applications.

Proceedings of the 37th International Conference on Machine Learning, 2020

Learning Hierarchical Interactions at Scale: A Convex Optimization Approach.

Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019

Learning a Mixture of Gaussians via Mixed-Integer Optimization.

Hari Bandi

Dimitris Bertsimas

INFORMS J. Optim., July, 2019

Computation of the maximum likelihood estimator in low-rank factor analysis.

Koulik Khamaru

Math. Program., 2019

Solving large-scale L1-regularized SVMs and cousins: the surprising effectiveness of column and constraint generation.

CoRR, 2019

2018

Condition Number Analysis of Logistic Regression, and its Implications for Standard First-Order Solution Methods.

CoRR, 2018

Hierarchical Modeling and Shrinkage for User Session Length Prediction in Media Streaming.

CoRR, 2018

Hierarchical Modeling and Shrinkage for User Session Length Prediction in Media Streaming.

Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

2017

The Discrete Dantzig Selector: Estimating Sparse Linear Models via Mixed Integer Linear Optimization.

IEEE Trans. Inf. Theory, 2017

An Extended Frank-Wolfe Method with "In-Face" Directions, and Its Application to Low-Rank Matrix Completion.

SIAM J. Optim., 2017

Certifiably Optimal Low Rank Factor Analysis.

Dimitris Bertsimas

Martin S. Copenhaver

J. Mach. Learn. Res., 2017

2015

Matrix completion and low-rank SVD via fast alternating least squares.

J. Mach. Learn. Res., 2015

A New Perspective on Boosting in Linear Regression via Subgradient Optimization and Relatives.

CoRR, 2015

2013

AdaBoost and Forward Stagewise Regression are First-Order Convex Optimization Methods.

CoRR, 2013

Non-negative matrix completion for bandwidth extension: A convex optimization approach.

Dennis L. Sun

Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

2012

Exact Covariance Thresholding into Connected Components for Large-Scale Graphical Lasso.

Trevor Hastie

J. Mach. Learn. Res., 2012

2011

The Graphical Lasso: New Insights and Alternatives.

Trevor Hastie

CoRR, 2011

2010

Spectral Regularization Algorithms for Learning Large Incomplete Matrices.
