Tan M. Nguyen

Orcid: 0000-0002-6408-5416

Affiliations:
  • University of California, Los Angeles, CA, USA


According to our database1, Tan M. Nguyen authored at least 63 papers between 2018 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
The Blessing and Curse of Dimensionality in Safety Alignment.
CoRR, July, 2025

Revisiting Transformers with Insights from Image Filtering.
CoRR, June, 2025

Resolving Memorization in Empirical Diffusion Model for Manifold Data in High-Dimensional Spaces.
CoRR, May, 2025

Tree-Sliced Wasserstein Distance with Nonlinear Projection.
CoRR, May, 2025

Spherical Tree-Sliced Wasserstein Distance.
CoRR, March, 2025

MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling.
CoRR, March, 2025

Distance-Based Tree-Sliced Wasserstein Distance.
CoRR, March, 2025

CAMEx: Curvature-aware Merging of Experts.
CoRR, February, 2025

Learning and predicting dynamics of compositional multiphase mixtures using Graph Neural Networks.
J. Comput. Phys., 2025

Demystifying the Token Dynamics of Deep Selective State Space Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Equivariant Neural Functional Networks for Transformers.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Distance-Based Tree-Sliced Wasserstein Distance.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Spherical Tree-Sliced Wasserstein Distance.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MoLEx: Mixture of Layer Experts for Fine-tuning with Sparse Upcycling.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Tight Clusters Make Specialized Experts.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

CAMEx: Curvature-aware Merging of Experts.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Transformer Meets Twicing: Harnessing Unattended Residual Information.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
An Attention-based Framework for Fair Contrastive Learning.
CoRR, 2024

MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts.
CoRR, 2024

A Clifford Algebraic Approach to E(n)-Equivariant High-order Graph Neural Networks.
CoRR, 2024

Equivariant Polynomial Functional Networks.
CoRR, 2024

Equivariant Neural Functional Networks for Transformers.
CoRR, 2024

Demystifying the Token Dynamics of Deep Selective State Space Models.
CoRR, 2024

Monomial Matrix Group Equivariant Neural Functional Networks.
CoRR, 2024

A Primal-Dual Framework for Transformers and Neural Networks.
CoRR, 2024

Elliptical Attention.
CoRR, 2024

Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis.
CoRR, 2024

Tree-Sliced Wasserstein Distance on a System of Lines.
CoRR, 2024

PIDformer: Transformer Meets Control Theory.
CoRR, 2024

Revisiting Kernel Attention with Correlated Gaussian Process Representation.
Proceedings of the Uncertainty in Artificial Intelligence, 2024

Monomial Matrix Group Equivariant Neural Functional Networks.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

PIDformer: Transformer Meets Control Theory.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Neural Collapse for Cross-entropy Class-Imbalanced Learning with Unconstrained ReLU Features Model.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Beyond Vanilla Variational Autoencoders: Detecting Posterior Collapse in Conditional and Hierarchical Variational Autoencoders.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

From Coupled Oscillators to Graph Neural Networks: Reducing Over-smoothing via a Kuramoto Model-based Approach.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023
ARist: An effective API argument recommendation approach.
J. Syst. Softw., October, 2023

Mitigating Over-smoothing in Transformers via Regularized Nonlocal Functionals.
CoRR, 2023

p-Laplacian Transformer.
CoRR, 2023

Revisiting Over-smoothing and Over-squashing Using Ollivier-Ricci Curvature.
Proceedings of the International Conference on Machine Learning, 2023

Neural Collapse in Deep Linear Networks: From Balanced to Imbalanced Data.
Proceedings of the International Conference on Machine Learning, 2023

A Primal-Dual Framework for Transformers and Neural Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

A Probabilistic Framework for Pruning Transformers Via a Finite Admixture of Keys.
Proceedings of the IEEE International Conference on Acoustics, 2023

DeepGRAND: Deep Graph Neural Diffusion.
Proceedings of the 57th Asilomar Conference on Signals, Systems, and Computers, ACSSC 2023, Pacific Grove, CA, USA, October 29, 2023

2022
Scheduled Restart Momentum for Accelerated Stochastic Gradient Descent.
SIAM J. Imaging Sci., 2022

Robustify Transformers with Robust Kernel Density Estimation.
CoRR, 2022

Improving Generative Flow Networks with Path Regularization.
CoRR, 2022

Transformer with Fourier Integral Attentions.
CoRR, 2022

Momentum Transformer: Closing the Performance Gap Between Self-attention and Its Linearization.
Proceedings of the Mathematical and Scientific Machine Learning, 2022

Improving Transformers with Probabilistic Attention Keys.
Proceedings of the International Conference on Machine Learning, 2022

GRAND++: Graph Neural Diffusion with A Source Term.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Transformer with a Mixture of Gaussian Keys.
CoRR, 2021

How Does Momentum Benefit Deep Neural Networks Architecture Design? A Few Case Studies.
CoRR, 2021

Heavy Ball Neural Ordinary Differential Equations.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

FMMformer: Efficient and Flexible Transformer via Decomposed Near-field and Far-field Attention.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

API parameter recommendation based on language model and program analysis.
Proceedings of the 28th Asia-Pacific Software Engineering Conference, 2021

2020
Dual Dynamic Inference: Enabling More Efficient, Adaptive, and Controllable Deep Inference.
IEEE J. Sel. Top. Signal Process., 2020

Scheduled Restart Momentum for Accelerated Stochastic Gradient Descent.
CoRR, 2020

MomentumRNN: Integrating Momentum into Recurrent Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Neural Networks with Recurrent Generative Feedback.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
InfoCNF: An Efficient Conditional Continuous Normalizing Flow with Adaptive Solvers.
CoRR, 2019

Learning Near-optimal Convex Combinations of Basis Models with Generalization Guarantees.
CoRR, 2019

Out-of-Distribution Detection Using Neural Rendering Generative Models.
CoRR, 2019

2018
Neural Rendering Model: Joint Generation and Prediction for Semi-Supervised Learning.
CoRR, 2018


  Loading...