Guang Dai

ORCID: 0000-0002-3529-9087

According to our database, Guang Dai authored at least 93 papers between 2003 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Corrigendum to "MA-FSAR: Multimodal Adaptation of CLIP for few-shot action recognition" [Pattern Recognition 169 (2026) 111902].
Pattern Recognit., 2026

MA-FSAR: Multimodal Adaptation of CLIP for few-shot action recognition.
Pattern Recognit., 2026

2025
Deforming Videos to Masks: Flow Matching for Referring Video Segmentation.
CoRR, October, 2025

TriCLIP-3D: A Unified Parameter-Efficient Framework for Tri-Modal 3D Visual Grounding based on CLIP.
CoRR, July, 2025

PSDiff: Diffusion Model for Person Search With Iterative and Collaborative Refinement.
IEEE Trans. Circuits Syst. Video Technol., June, 2025

TryOn-Adapter: Efficient Fine-Grained Clothing Identity Adaptation for High-Fidelity Virtual Try-On.
Int. J. Comput. Vis., June, 2025

FZOO: Fast Zeroth-Order Optimizer for Fine-Tuning Large Language Models towards Adam-Scale Speed.
CoRR, June, 2025

Privacy Leaks by Adversaries: Adversarial Iterations for Membership Inference Attack.
CoRR, June, 2025

Towards Understanding The Calibration Benefits of Sharpness-Aware Minimization.
CoRR, May, 2025

No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves.
CoRR, May, 2025

Breaking the Prompt Wall (I): A Real-World Case Study of Attacking ChatGPT via Lightweight Prompt Injection.
CoRR, April, 2025

AffordanceSAM: Segment Anything Once More in Affordance Grounding.
CoRR, April, 2025

InnerSelf: Designing Self-Deepfaked Voice for Emotional Well-being.
CoRR, March, 2025

Decentralized Optimization on Compact Submanifolds by Quantized Riemannian Gradient Tracking.
IEEE Trans. Signal Process., 2025

Disentangled Noisy Correspondence Learning.
IEEE Trans. Image Process., 2025

VidEvo: Evolving Video Editing through Exhaustive Temporal Modeling.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Instructing Text-to-Image Diffusion Models via Classifier-Guided Semantic Optimization.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Second-Order Fine-Tuning without Pain for LLMs: A Hessian Informed Zeroth-Order Optimizer.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Manifold Constraint Reduces Exposure Bias in Accelerated Diffusion Sampling.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

ProAdvPrompter: A Two-Stage Journey to Effective Adversarial Prompting for LLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Exploring Triple Knowledge Cues for Zero-Shot Human-Object Interaction Detection.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Density-aware and Depth-aware Visual Representation for Zero-Shot Object Counting.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Action Detail Matters: Refining Video Recognition with Local Action Queries.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Low-Biased General Annotated Dataset Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

IMOL: Incomplete-Modality-Tolerant Learning for Multi-Domain Fake News Video Detection.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

On the Risk of Evidence Pollution for Malicious Social Text Detection in the Era of LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

SpotActor: Training-Free Layout-Controlled Consistent Image Generation.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Disentangled Representation Learning With Transmitted Information Bottleneck.
IEEE Trans. Circuits Syst. Video Technol., December, 2024

Disentangled Generation With Information Bottleneck for Enhanced Few-Shot Learning.
IEEE Trans. Image Process., 2024

Unbiased General Annotated Dataset Generation.
CoRR, 2024

Visual Object Tracking across Diverse Data Modalities: A Review.
CoRR, 2024

How Do Social Bots Participate in Misinformation Spread? A Comprehensive Dataset and Analysis.
CoRR, 2024

Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models.
CoRR, 2024

AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention.
CoRR, 2024

DreamSalon: A Staged Diffusion Framework for Preserving Identity-Context in Editable Face Generation.
CoRR, 2024

M2-CLIP: A Multimodal, Multi-task Adapting Framework for Video Action Recognition.
CoRR, 2024

Exploring Perceptions of Children's Learning Stress for Stress Management.
Proceedings of the Intelligent Computing, 2024

OneActor: Consistent Subject Generation via Cluster-Conditioned Guidance.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Generating Action-conditioned Prompts for Open-vocabulary Video Action Recognition.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Can Gaussian Sketching Converge Faster on a Preconditioned Landscape?
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Double Variance Reduction: A Smoothing Trick for Composite Optimization Problems without First-Order Gradient.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Double Stochasticity Gazes Faster: Snap-Shot Decentralized Stochastic Gradient Tracking Methods.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Decentralized Riemannian Conjugate Gradient Method on the Stiefel Manifold.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Timestep-Aware Correction for Quantized Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Stress Diffuser: A Biofeedback Agent for Stress Management in Children During Homework with Parent Involvement.
Proceedings of the 23rd Annual ACM Interaction Design and Children Conference, 2024

A Multimodal, Multi-Task Adapting Framework for Video Action Recognition.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-Form Layout-to-Image Generation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Disentangled Representation Learning with Transmitted Information Bottleneck.
CoRR, 2023

SUBP: Soft Uniform Block Pruning for 1xN Sparse CNNs Multithreading Acceleration.
CoRR, 2023

PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement.
CoRR, 2023

Multimodal Adaptation of CLIP for Few-Shot Action Recognition.
CoRR, 2023

SUBP: Soft Uniform Block Pruning for 1×N Sparse CNNs Multithreading Acceleration.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Boosting Few-shot Action Recognition with Graph-guided Hybrid Matching.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Synchronization of complex-valued stochastic coupled systems with hybrid impulses via discrete-time state observations control.
Neural Comput. Appl., 2022

Pleasure and Improvement - A Study on Digital Artistic Expression of Health Data Generation for Professional Women.
Proceedings of the Design Studies and Intelligence Engineering, 2022

2017
A Balanced Assignment Mechanism for Online Taxi Recommendation.
Proceedings of the 18th IEEE International Conference on Mobile Data Management, 2017

2014
Multicategory large margin classification methods: Hinge losses vs. coherence functions.
Artif. Intell., 2014

2013
An iterative SVM approach to feature selection and classification in high-dimensional datasets.
Pattern Recognit., 2013

2012
Coherence functions with applications in large-margin classification methods.
J. Mach. Learn. Res., 2012

2011
Bayesian Generalized Kernel Mixed Models.
J. Mach. Learn. Res., 2011

A non-convex relaxation approach to sparse dictionary learning.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
A regularization framework for multiclass classification: A deterministic annealing approach.
Pattern Recognit., 2010

Regularized Discriminant Analysis, Ridge Regression and Beyond.
J. Mach. Learn. Res., 2010

Bayesian Generalized Kernel Models.
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

Matrix-Variate Dirichlet Process Mixture Models.
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

Sparse Unsupervised Dimensionality Reduction Algorithms.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2010

2009
A Flexible and Efficient Algorithm for Regularized Fisher Discriminant Analysis.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2009

Optimal Scoring for Unsupervised Learning.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Research on Magnetic Flux Leakage Signals Quantity Technology of Tank Floor Corrosion Defects Based on Artificial Neural Network.
Proceedings of the Fifth International Conference on Natural Computation, 2009

2008
A Scalable Kernel-Based Semisupervised Metric Learning Algorithm with Out-of-Sample Generalization Ability.
Neural Comput., 2008

2007
Learning the kernel matrix by maximizing a KFD-based class separability criterion.
Pattern Recognit., 2007

Face recognition using a kernel fractional-step discriminant analysis algorithm.
Pattern Recognit., 2007

A Scalable Kernel-Based Algorithm for Semi-Supervised Metric Learning.
Proceedings of the IJCAI 2007, 2007

Boosting Kernel Discriminant Analysis and Its Application on Tissue Classification of Gene Expression Data.
Proceedings of the IJCAI 2007, 2007

Kernel selection for semi-supervised kernel machines.
Proceedings of the Machine Learning, 2007

2006
Local Discriminant Embedding with Tensor Representation.
Proceedings of the International Conference on Image Processing, 2006

Extending Kernel Fisher Discriminant Analysis with the Weighted Pairwise Chernoff Criterion.
Proceedings of the Computer Vision, 2006

Tensor Embedding Methods.
Proceedings of the Proceedings, 2006

2005
Nonlinear dimensionality reduction for classification using kernel weighted subspace method.
Proceedings of the 2005 International Conference on Image Processing, 2005

2004
By normalizing to improve generalized Foley-Sammon transform in high-dimensional spaces - with application to face recognition.
Proceedings of the IEEE International Conference on Systems, 2004

Face recognition with the robust feature extracted by the generalized Foley-Sammon transform.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

An Advanced Segmental Semi-Markov Model Based Online Series Pattern Detection.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

A Kernel Fractional-Step Nonlinear Discriminant Analysis for Pattern Recognition.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

A Gabor direct fractional-step LDA algorithm for face recognition.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Kernel generalized nonlinear discriminant analysis algorithm for pattern recognition.
Proceedings of the 2004 International Conference on Image Processing, 2004

A robust feature extraction framework for face recognition.
Proceedings of the 2004 International Conference on Image Processing, 2004

Modified kernel-based nonlinear feature extraction [face recognition example].
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Face Recognition Using Novel LDA-Based Algorithms.
Proceedings of the 16th European Conference on Artificial Intelligence, 2004

2003
Feature Extraction Method Based on the Generalized Weighted Foley-Sammon Transform in High Dimensional Spaces.
Proceedings of the 1st Indian International Conference on Artificial Intelligence, 2003

