Guang Dai

Orcid: 0000-0002-3529-9087

According to our database¹, Guang Dai authored at least 96 papers between 2003 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2026

Corrigendum to "MA-FSAR: Multimodal Adaptation of CLIP for few-shot action recognition" [Pattern Recognition 169 (2026) 111902].

Pattern Recognit., 2026

MA-FSAR: Multimodal Adaptation of CLIP for few-shot action recognition.

Pattern Recognit., 2026

2025

MSCR: Exploring the Vulnerability of LLMs' Mathematical Reasoning Abilities Using Multi-Source Candidate Replacement.

CoRR, November, 2025

Numerical Sensitivity and Robustness: Exploring the Flaws of Mathematical Reasoning in Large Language Models.

CoRR, November, 2025

Deforming Videos to Masks: Flow Matching for Referring Video Segmentation.

Liuzhuozheng Li

CoRR, October, 2025

TriCLIP-3D: A Unified Parameter-Efficient Framework for Tri-Modal 3D Visual Grounding based on CLIP.

CoRR, July, 2025

PSDiff: Diffusion Model for Person Search With Iterative and Collaborative Refinement.

IEEE Trans. Circuits Syst. Video Technol., June, 2025

TryOn-Adapter: Efficient Fine-Grained Clothing Identity Adaptation for High-Fidelity Virtual Try-On.

Int. J. Comput. Vis., June, 2025

FZOO: Fast Zeroth-Order Optimizer for Fine-Tuning Large Language Models towards Adam-Scale Speed.

CoRR, June, 2025

Privacy Leaks by Adversaries: Adversarial Iterations for Membership Inference Attack.

CoRR, June, 2025

Towards Understanding The Calibration Benefits of Sharpness-Aware Minimization.

CoRR, May, 2025

No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves.

Liuzhuozheng Li

CoRR, May, 2025

Breaking the Prompt Wall (I): A Real-World Case Study of Attacking ChatGPT via Lightweight Prompt Injection.

CoRR, April, 2025

AffordanceSAM: Segment Anything Once More in Affordance Grounding.

CoRR, April, 2025

InnerSelf: Designing Self-Deepfaked Voice for Emotional Well-being.

CoRR, March, 2025

Decentralized Optimization on Compact Submanifolds by Quantized Riemannian Gradient Tracking.

IEEE Trans. Signal Process., 2025

Disentangled Noisy Correspondence Learning.

IEEE Trans. Image Process., 2025

VidEvo: Evolving Video Editing through Exhaustive Temporal Modeling.

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Instructing Text-to-Image Diffusion Models via Classifier-Guided Semantic Optimization.

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

DynaMind: Reasoning over Abstract Video Dynamics for Embodied Decision-Making.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Second-Order Fine-Tuning without Pain for LLMs: A Hessian Informed Zeroth-Order Optimizer.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Manifold Constraint Reduces Exposure Bias in Accelerated Diffusion Sampling.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

ProAdvPrompter: A Two-Stage Journey to Effective Adversarial Prompting for LLMs.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Exploring Triple Knowledge Cues for Zero-Shot Human-Object Interaction Detection.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Density-aware and Depth-aware Visual Representation for Zero-Shot Object Counting.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Action Detail Matters: Refining Video Recognition with Local Action Queries.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Low-Biased General Annotated Dataset Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

IMOL: Incomplete-Modality-Tolerant Learning for Multi-Domain Fake News Video Detection.

Xiangzheng Kong

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

On the Risk of Evidence Pollution for Malicious Social Text Detection in the Era of LLMs.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

SpotActor: Training-Free Layout-Controlled Consistent Image Generation.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Disentangled Representation Learning With Transmitted Information Bottleneck.

IEEE Trans. Circuits Syst. Video Technol., December, 2024

Disentangled Generation With Information Bottleneck for Enhanced Few-Shot Learning.

IEEE Trans. Image Process., 2024

Unbiased General Annotated Dataset Generation.

CoRR, 2024

Visual Object Tracking across Diverse Data Modalities: A Review.

CoRR, 2024

How Do Social Bots Participate in Misinformation Spread? A Comprehensive Dataset and Analysis.

CoRR, 2024

Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models.

CoRR, 2024

AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention.

CoRR, 2024

DreamSalon: A Staged Diffusion Framework for Preserving Identity-Context in Editable Face Generation.

CoRR, 2024

M2-CLIP: A Multimodal, Multi-task Adapting Framework for Video Action Recognition.

CoRR, 2024

Exploring Perceptions of Children's Learning Stress for Stress Management.

Proceedings of the Intelligent Computing, 2024

OneActor: Consistent Subject Generation via Cluster-Conditioned Guidance.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Generating Action-conditioned Prompts for Open-vocabulary Video Action Recognition.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Can Gaussian Sketching Converge Faster on a Preconditioned Landscape?

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Double Variance Reduction: A Smoothing Trick for Composite Optimization Problems without First-Order Gradient.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Double Stochasticity Gazes Faster: Snap-Shot Decentralized Stochastic Gradient Tracking Methods.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Decentralized Riemannian Conjugate Gradient Method on the Stiefel Manifold.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Timestep-Aware Correction for Quantized Diffusion Models.

Proceedings of the Computer Vision - ECCV 2024, 2024

Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Stress Diffuser: A Biofeedback Agent for Stress Management in Children During Homework with Parent Involvement.

Emilia I. Barakova

Proceedings of the 23rd Annual ACM Interaction Design and Children Conference, 2024

A Multimodal, Multi-Task Adapting Framework for Video Action Recognition.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-Form Layout-to-Image Generation.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Disentangled Representation Learning with Transmitted Information Bottleneck.

CoRR, 2023

SUBP: Soft Uniform Block Pruning for 1xN Sparse CNNs Multithreading Acceleration.

CoRR, 2023

PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement.

CoRR, 2023

Multimodal Adaptation of CLIP for Few-Shot Action Recognition.

CoRR, 2023

SUBP: Soft Uniform Block Pruning for 1×N Sparse CNNs Multithreading Acceleration.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Boosting Few-shot Action Recognition with Graph-guided Hybrid Matching.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

Synchronization of complex-valued stochastic coupled systems with hybrid impulses via discrete-time state observations control.

Neural Comput. Appl., 2022

Pleasure and Improvement - A Study on Digital Artistic Expression of Health Data Generation for Professional Women.

Proceedings of the Design Studies and Intelligence Engineering, 2022

2017

A Balanced Assignment Mechanism for Online Taxi Recommendation.

Stephen Manko Wambura

Proceedings of the 18th IEEE International Conference on Mobile Data Management, 2017

2014

Multicategory large margin classification methods: Hinge losses vs. coherence functions.

Artif. Intell., 2014

2013

An iterative SVM approach to feature selection and classification in high-dimensional datasets.

Pattern Recognit., 2013

2012

Coherence functions with applications in large-margin classification methods.

Michael I. Jordan

J. Mach. Learn. Res., 2012

2011

Bayesian Generalized Kernel Mixed Models.

Michael I. Jordan

J. Mach. Learn. Res., 2011

A non-convex relaxation approach to sparse dictionary learning.

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010

A regularization framework for multiclass classification: A deterministic annealing approach.

Frederick H. Lochovsky

Pattern Recognit., 2010

Regularized Discriminant Analysis, Ridge Regression and Beyond.

Michael I. Jordan

J. Mach. Learn. Res., 2010

Bayesian Generalized Kernel Models.

Michael I. Jordan

Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

Matrix-Variate Dirichlet Process Mixture Models.

Michael I. Jordan

Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

Sparse Unsupervised Dimensionality Reduction Algorithms.

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2010

2009

A Flexible and Efficient Algorithm for Regularized Fisher Discriminant Analysis.

Michael I. Jordan

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2009

Optimal Scoring for Unsupervised Learning.

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Research on Magnetic Flux Leakage Signals Quantity Technology of Tank Floor Corrosion Defects Based on Artificial Neural Network.

Proceedings of the Fifth International Conference on Natural Computation, 2009

2008

A Scalable Kernel-Based Semisupervised Metric Learning Algorithm with Out-of-Sample Generalization Ability.

Neural Comput., 2008

2007

Learning the kernel matrix by maximizing a KFD-based class separability criterion.

Pattern Recognit., 2007

Face recognition using a kernel fractional-step discriminant analysis algorithm.

Pattern Recognit., 2007

A Scalable Kernel-Based Algorithm for Semi-Supervised Metric Learning.

Proceedings of the IJCAI 2007, 2007

Boosting Kernel Discriminant Analysis and Its Application on Tissue Classification of Gene Expression Data.

Proceedings of the IJCAI 2007, 2007

Kernel selection for semi-supervised kernel machines.

Proceedings of the Machine Learning, 2007

2006

Local Discriminant Embedding with Tensor Representation.

Proceedings of the International Conference on Image Processing, 2006

Extending Kernel Fisher Discriminant Analysis with the Weighted Pairwise Chernoff Criterion.

Proceedings of the Computer Vision, 2006

Tensor Embedding Methods.

Proceedings of the Proceedings, 2006

2005

Nonlinear dimensionality reduction for classification using kernel weighted subspace method.

Proceedings of the 2005 International Conference on Image Processing, 2005

2004

By normalizing to improve generalized Foley-Sammon transform in high-dimensional spaces - with application to face recognition.

Proceedings of the IEEE International Conference on Systems, 2004

Face recognition with the robust feature extracted by the generalized Foley-Sammon transform.

Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

An Advanced Segmental Semi-Markov Model Based Online Series Pattern Detection.

Proceedings of the 17th International Conference on Pattern Recognition, 2004

A Kernel Fractional-Step Nonlinear Discriminant Analysis for Pattern Recognition.

Proceedings of the 17th International Conference on Pattern Recognition, 2004

A Gabor direct fractional-step LDA algorithm for face recognition.

Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Kernel generalized nonlinear discriminant analysis algorithm for pattern recognition.

Proceedings of the 2004 International Conference on Image Processing, 2004

A robust feature extraction framework for face recognition.

Yasutoshi Otani

Proceedings of the 2004 International Conference on Image Processing, 2004

Modified kernel-based nonlinear feature extraction [face recognition example].

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Face Recognition Using Novel LDA-Based Algorithms.

Proceedings of the 16th European Conference on Artificial Intelligence, 2004

2003

Feature Extraction Method Based on the Generalized Weighted Foley-Sammon Transform in High Dimensional Spaces.

Proceedings of the 1st Indian International Conference on Artificial Intelligence, 2003
