Shangqian Gao

Orcid: 0000-0001-9699-1790

According to our database1, Shangqian Gao authored at least 48 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
TreeDiff: AST-Guided Code Generation with Diffusion LLMs.
CoRR, August, 2025

ALTER: All-in-One Layer Pruning and Temporal Expert Routing for Efficient Diffusion Generation.
CoRR, May, 2025

Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector.
CoRR, May, 2025

Dynamic Noise Preference Optimization for LLM Self-Improvement via Synthetic Data.
CoRR, February, 2025

ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning.
CoRR, January, 2025

FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

MoDeGPT: Modular Decomposition for Large Language Model Compression.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
All-in-One Tuning and Structural Pruning for Domain-Specific LLMs.
CoRR, 2024

DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models.
CoRR, 2024

Token Fusion: Bridging the Gap between Token Pruning and Token Merging.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Adaptive Rank Selections for Low-Rank Approximation of Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Unlocking Memorization in Large Language Models with Dynamic Soft Prompting.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Auto- Train-Once: Controller Network Guided Automatic Network Pruning from Scratch.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

BilevelPruning: Unified Dynamic and Static Channel Pruning for Convolutional Neural Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Device-Wise Federated Network Pruning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Jointly Training and Pruning CNNs via Learnable Agent Guidance and Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Compressing Image-to-Image Translation GANs Using Local Density Structures on Their Learned Manifold.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Gradient Descent Ascent for Minimax Problems on Riemannian Manifolds.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Learning to Jointly Share and Prune Weights for Grounding Based Vision and Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Structural Alignment for Network Pruning through Partial Regularization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Dynamic Low-rank Estimation for Transformer-based Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

EffConv: Efficient Learning of Kernel Sizes for Convolution Layers of CNNs.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Riemannian gradient methods for stochastic composition problems.
Neural Networks, 2022

Accelerated Zeroth-Order and First-Order Momentum Methods from Mini to Minimax Optimization.
J. Mach. Learn. Res., 2022

Enhanced Bilevel Optimization via Bregman Distance.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Improving Social Network Embedding via New Second-Order Continuous Graph Neural Networks.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Bregman Gradient Policy Optimization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Recover Fair Deep Classification Models via Altering Pre-trained Structure.
Proceedings of the Computer Vision - ECCV 2022, 2022

Disentangled Differentiable Network Pruning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Interpretations Steered Network Pruning via Amortized Inferred Saliency Maps.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Black-Box Reductions for Zeroth-Order Gradient Algorithms to Achieve Lower Query Complexity.
J. Mach. Learn. Res., 2021

Exploration and Estimation for Model Compression.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Adversarial Attack on Deep Cross-Modal Hamming Retrieval.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Network Pruning via Performance Maximization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Gradient Descent Ascent for Min-Max Problems on Riemannian Manifold.
CoRR, 2020

Accelerated Zeroth-Order Momentum Methods from Mini to Minimax Optimization.
CoRR, 2020

Momentum-Based Policy Gradient Methods.
Proceedings of the 37th International Conference on Machine Learning, 2020

Discrete Model Compression With Resource Constraint for Deep Neural Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Nonconvex Zeroth-Order Stochastic ADMM Methods with Lower Function Query Complexity.
CoRR, 2019

Cross-Modal Learning with Adversarial Samples.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Zeroth-Order Stochastic Alternating Direction Method of Multipliers for Nonconvex Nonsmooth Optimization.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Cross Domain Model Compression by Structurally Weight Sharing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Action Prediction From Videos via Memorizing Hard-to-Predict Samples.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Discriminative Multi-instance Multitask Learning for 3D Action Recognition.
IEEE Trans. Multim., 2017

Video Recovery via Learning Variation and Consistency of Images.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017


  Loading...