Shengkun Tang

According to our database1, Shengkun Tang authored at least 20 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training.
CoRR, May, 2026

AIMER: Calibration-Free Task-Agnostic MoE Pruning.
CoRR, March, 2026

BiGain: Unified Token Compression for Joint Generation and Classification.
CoRR, March, 2026

EvoESAP: Non-Uniform Expert Pruning for Sparse MoE.
CoRR, March, 2026

Diff-ES: Stage-wise Structural Diffusion Pruning via Evolutionary Search.
CoRR, March, 2026

Sink-Aware Pruning for Diffusion Language Models.
CoRR, February, 2026

Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers.
CoRR, February, 2026

Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark.
J. Data-centric Mach. Learn. Res., 2026

2025
DarwinLM: Evolutionary Structured Pruning of Large Language Models.
CoRR, February, 2025

Bi-Mamba: Towards Accurate 1-Bit State Space Model.
Trans. Mach. Learn. Res., 2025

Human Texts Are Outliers: Detecting LLM-generated Texts via Out-of-distribution Detection.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

MosaicDiff: Training-free Structural Pruning for Diffusion Model Acceleration Reflecting Pretraining Dynamics.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
Bi-Mamba: Towards Accurate 1-Bit State Space Models.
CoRR, 2024

Improving Logits-based Detector without Logits from Black-box LLMs.
CoRR, 2024

DALD: Improving Logits-based Detector without Logits from Black-box LLMs.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

AdaDiff: Accelerating Diffusion Models Through Step-Wise Adaptive Computation.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
DeeDiff: Dynamic Uncertainty-Aware Early Exiting for Accelerating Diffusion Model Generation.
CoRR, 2023

You Need Multiple Exiting: Dynamic Early Exiting for Accelerating Unified Vision Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2021
Scale-Robust Deep-Supervision Network for Mapping Building Footprints From High-Resolution Remote Sensing Images.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2021

DDR-Net: Learning Multi-Stage Multi-View Stereo With Dynamic Depth Range.
CoRR, 2021


  Loading...