Wenpin Tang

Orcid: 0000-0001-7228-1954

According to our database, Wenpin Tang authored at least 39 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Sample Complexity of Transfer Learning: An Optimal Transport Approach.
CoRR, May, 2026

Tweedie's Formulae and Diffusion Generative Models Beyond Gaussian.
CoRR, May, 2026

Improved techniques for fine-tuning flow models via adjoint matching: a deterministic control pipeline.
CoRR, May, 2026

Conditional Diffusion Guidance under Hard Constraint: A Stochastic Analysis Approach.
CoRR, February, 2026

ART for Diffusion Sampling: A Reinforcement Learning Approach to Timestep Schedule.
CoRR, January, 2026

2025
SOCRATES: Simulation Optimization with Correlated Replicas and Adaptive Trajectory Evaluations.
CoRR, November, 2025

Understanding Sampler Stochasticity in Training Diffusion Models for RLHF.
CoRR, October, 2025

DiFFPO: Training Diffusion LLMs to Reason Fast and Furious via Reinforcement Learning.
CoRR, October, 2025

Diffusion Generative Models Meet Compressed Sensing, with Applications to Imaging and Finance.
CoRR, September, 2025

Fine-Tuning Diffusion Generative Models via Rich Preference Optimization.
CoRR, March, 2025

The Convergence Rate of Vanishing Viscosity Approximations for Mean Field Games.
SIAM J. Math. Anal., 2025

Policy Iteration for the Deterministic Control Problems - A Viscosity Approach.
SIAM J. Control. Optim., 2025

Polynomial Voting Rules.
Math. Oper. Res., 2025

Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey.
J. Artif. Intell. Res., 2025

Score as Action: Fine Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MallowsPO: Fine-Tune Your LLM with Preference Dispersions.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Regret of exploratory policy improvement and q-learning.
CoRR, 2024

RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization.
CoRR, 2024

Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey.
CoRR, 2024

Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning.
CoRR, 2024

Mallows-DPO: Fine-Tune Your LLM with Preference Dispersions.
CoRR, 2024

Fine-tuning of diffusion models via stochastic control: entropy regularization and beyond.
CoRR, 2024

Score-based Diffusion Models via Stochastic Differential Equations - a Technical Tutorial.
CoRR, 2024

Contractive Diffusion Probabilistic Models.
CoRR, 2024

2023
Inference for Gaussian Processes with Matérn Covariogram on Compact Riemannian Manifolds.
J. Mach. Learn. Res., 2023

Transaction fee mechanism for Proof-of-Stake protocol.
CoRR, 2023

Policy iteration for the deterministic control problems - a viscosity approach.
CoRR, 2023

Policy Optimization for Continuous Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
Exploratory HJB Equations and Their Convergence.
SIAM J. Control. Optim., 2022

A Class of Stochastic Games and Moving Free Boundary Problems.
SIAM J. Control. Optim., 2022

Asset selection via correlation blockmodel clustering.
Expert Syst. Appl., 2022

2021
Arcsine laws for random walks generated from random permutations with applications to genomics.
J. Appl. Probab., 2021

2020
Learning an arbitrary mixture of two multinomial logits.
CoRR, 2020

Perturbed gradient descent with occupation time.
CoRR, 2020

The Buckley-Osthus model and the block preferential attachment model: statistical analysis and application.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Exponential ergodicity and convergence for generalized reflected Brownian motion.
Queueing Syst. Theory Appl., 2019

Consistency of the Buckley-Osthus model and the hierarchical preferential attachment model.
CoRR, 2019

Mallows ranking models: maximum likelihood estimate and regeneration.
Proceedings of the 36th International Conference on Machine Learning, 2019

