Mengdi Wang
Orcid: 0000-0002-2101-9507
According to our database1,
Mengdi Wang
authored at least 162 papers
between 2014 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
CoRR, September, 2025
CoRR, July, 2025
CoRR, June, 2025
CoRR, June, 2025
Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution.
CoRR, May, 2025
NoWag: A Unified Framework for Shape Preserving Compression of Large Language Models.
CoRR, April, 2025
MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations.
CoRR, February, 2025
Deep Reinforcement Learning for Efficient and Fair Allocation of Healthcare Resources.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
Efficient Reinforcement Learning With Impaired Observability: Learning to Act With Delayed and Missing State Observations.
IEEE Trans. Inf. Theory, October, 2024
Redefining the Game: MVAE-DFDPnet's Low-Dimensional Embeddings for Superior Drug-Protein Interaction Predictions.
IEEE J. Biomed. Health Informatics, July, 2024
IEEE Trans. Mob. Comput., May, 2024
Boosting the Convergence of Reinforcement Learning-Based Auto-Pruning Using Historical Data.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., February, 2024
Trans. Mach. Learn. Res., 2024
Author Correction: A 5′ UTR language model for decoding untranslated regions of mRNA and function predictions.
Nat. Mac. Intell., 2024
A 5′ UTR language model for decoding untranslated regions of mRNA and function predictions.
Nat. Mac. Intell., 2024
On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control.
J. Mach. Learn. Res., 2024
Unveil Conditional Diffusion Models with Classifier-free Guidance: A Sharp Statistical Theory.
CoRR, 2024
MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences.
CoRR, 2024
Deep reinforcement learning identifies personalized intermittent androgen deprivation therapy for prostate cancer.
Briefings Bioinform., 2024
Nonparametric Classification on Low Dimensional Manifolds using Overparameterized Convolutional Residual Networks.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024
Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Primal-Dual First-Order Methods for Affinely Constrained Multi-block Saddle Point Problems.
SIAM J. Optim., June, 2023
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023
Learning Good State and Action Representations for Markov Decision Process via Tensor Decomposition.
J. Mach. Learn. Res., 2023
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning.
J. Mach. Learn. Res., 2023
CoRR, 2023
Sample Complexity of Preference-Based Nonparametric Off-Policy Evaluation with Deep Networks.
CoRR, 2023
Sample Complexity of Neural Policy Mirror Descent for Policy Optimization on Low-Dimensional Manifolds.
CoRR, 2023
Deep Reinforcement Learning for Efficient and Fair Allocation of Health Care Resources.
CoRR, 2023
Provable Benefits of Policy Learning from Human Preferences in Contextual Bandit Problems.
CoRR, 2023
Nonparametric Classification on Low Dimensional Manifolds using Overparameterized Convolutional Residual Networks.
CoRR, 2023
Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data.
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Efficient RL with Impaired Observability: Learning to Act with Delayed and Missing State Observations.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
STEERING : Stein Information Directed Exploration for Model-Based Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023
2022
Energy system digitization in the era of AI: A three-layered approach toward carbon neutrality.
Patterns, 2022
CoRR, 2022
Energy System Digitization in the Era of AI: A Three-Layered Approach towards Carbon Neutrality.
CoRR, 2022
Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization.
CoRR, 2022
CoRR, 2022
Sample Complexity of Nonparametric Off-Policy Evaluation on Low-Dimensional Manifolds using Deep Networks.
CoRR, 2022
Proceedings of the Uncertainty in Artificial Intelligence, 2022
Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning approach.
Proceedings of the International Conference on Machine Learning, 2022
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism.
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the 56th Annual Conference on Information Sciences and Systems, 2022
Multi-Agent Reinforcement Learning with General Utilities via Decentralized Shadow Reward Actor-Critic.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
IEEE J. Sel. Areas Inf. Theory, 2021
IEEE Internet Things J., 2021
You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Nature Gradient.
CoRR, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the IEEE International Symposium on Information Theory, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), 2021
Beyond Cumulative Returns via Reinforcement Learning over State-Action Occupancy Measures.
Proceedings of the 2021 American Control Conference, 2021
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021
2020
SIAM J. Matrix Anal. Appl., 2020
A Single Timescale Stochastic Approximation Method for Nested Stochastic Optimization.
SIAM J. Optim., 2020
Randomized Linear Programming Solves the Markov Decision Problem in Nearly Linear (Sometimes Sublinear) Time.
Math. Oper. Res., 2020
Bridging Exploration and General Function Approximation in Reinforcement Learning: Provably Efficient Kernel and Neural Value Iterations.
CoRR, 2020
Variational Policy Gradient Method for Reinforcement Learning with General Utilities.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Provably Efficient Reinforcement Learning with Kernel and Neural Function Approximations.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Middleware '20: 21st International Middleware Conference, 2020
Proceedings of the 2nd Annual Conference on Learning for Dynamics and Control, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
A History-Based Auto-Tuning Framework for Fast and High-Performance DNN Design on GPU.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020
Proceedings of the 2020 American Control Conference, 2020
Solving Discounted Stochastic Two-Player Games with Near-Optimal Time and Sample Complexity.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020
2019
SIAM J. Optim., 2019
Blessing of massive scale: spatial graphical model estimation with a total cardinality constraint approach.
Math. Program., 2019
Picasso: A Sparse Learning Library for High Dimensional Data Analysis in R and Python.
J. Mach. Learn. Res., 2019
J. Mach. Learn. Res., 2019
CoRR, 2019
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019
Learning low-dimensional state embeddings and metastable clusters from time series data.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the IEEE International Symposium on Information Theory, 2019
Proceedings of the IEEE International Symposium on Workload Characterization, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 57th Annual Allerton Conference on Communication, 2019
2018
Math. Program., 2018
CoRR, 2018
Diffusion Approximations for Online Principal Component Estimation and Global Convergence.
CoRR, 2018
CoRR, 2018
Variance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision Processes.
Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, 2018
Near-Optimal Time and Sample Complexities for Solving Markov Decision Processes with a Generative Model.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Dimensionality Reduction for Stationary Time Series via Stochastic Nonconvex Optimization.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018
2017
SIAM J. Optim., 2017
Stochastic compositional gradient descent: algorithms for minimizing compositions of expected-value functions.
Math. Program., 2017
Diffusion Approximations for Online Principal Component Estimation and Global Convergence.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017
2016
SIAM J. Optim., 2016
Proceedings of the Winter Simulation Conference, 2016
Proceedings of the 55th IEEE Conference on Decision and Control, 2016
2015
Math. Program., 2015
IEEE J. Sel. Top. Signal Process., 2015
Random Multi-Constraint Projection: Stochastic Gradient Methods for Convex Optimization with Many Constraints.
CoRR, 2015
Averaging random projection: A fast online solution for large-scale constrained stochastic optimization.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
Stabilization of Stochastic Iterative Methods for Singular and Nearly Singular Linear Systems.
Math. Oper. Res., 2014
Multi-task nonconvex optimization with total budget constraint: A distributed algorithm using Monte Carlo estimates.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014