Han Shu

Orcid: 0009-0009-5555-8398

According to our database1, Han Shu authored at least 40 papers between 1995 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Thinking-while-speaking: A Controlled, Interleaved Reasoning Method for Real-Time Speech Generation.
CoRR, May, 2026

TinySAM 2: Extreme Memory Compression for Efficient Track Anything Model.
CoRR, May, 2026

Bifurcations and Dynamics in a Generalized KdV-mKdV-like System.
Int. J. Bifurc. Chaos, April, 2026

SJD-PAC: Accelerating Speculative Jacobi Decoding via Proactive Drafting and Adaptive Continuation.
CoRR, March, 2026

2025
GoodSpeed: Optimizing Fair Goodput with Adaptive Speculative Decoding in Distributed Edge Inference.
CoRR, December, 2025

VLM-Pruner: Buffering for Spatial Sparsity in an Efficient VLM Centrifugal Token Pruning Paradigm.
CoRR, December, 2025

Ada-MoGE: Adaptive Mixture of Gaussian Expert Model for Time Series Forecasting.
CoRR, December, 2025

Expert Merging: Model Merging with Unsupervised Expert Alignment and Importance-Guided Layer Chunking.
CoRR, September, 2025

MAEST: accurately spatial domain detection in spatial transcriptomics with graph masked autoencoder.
Briefings Bioinform., March, 2025

Bifurcations of the GKdV-mKdV-Like System with Medical Applications.
Int. J. Bifurc. Chaos, 2025

Federated Koopman-Reservoir Learning for Large-Scale Multivariate Time-Series Anomaly Detection.
Proceedings of the 2025 SIAM International Conference on Data Mining, 2025

ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

TinySAM: Pushing the Envelope for Efficient Segment Anything Model.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Accurately deciphering spatial domains for spatially resolved transcriptomics with stCluster.
Briefings Bioinform., July, 2024

iREPO: implicit Reward Pairwise Difference based Empirical Preference Optimization.
CoRR, 2024

Transmission Benefits and Cost Allocation under Ambiguity.
CoRR, 2024

FKD-Med: Privacy-Aware, Communication-Optimized Medical Image Segmentation via Federated Learning and Model Lightweighting Through Knowledge Distillation.
IEEE Access, 2024

ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Sequence-Integrated Radiology Report Generation Leveraging Positional Encoding for Text and Images.
Proceedings of the 2024 IEEE Cyber Science and Technology Congress (CyberSciTech), 2024

2022
Control Reconfiguration of Dynamical Systems for Improved Performance via Reverse- and Forward-Engineering.
IEEE Trans. Autom. Control., 2022

2021
Coarse-to-Fine Searching for Efficient Generative Adversarial Networks.
CoRR, 2021

Adder Attention for Vision Transformer.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
VEGA: Towards an End-to-End Configurable AutoML Pipeline.
CoRR, 2020

Control Reconfiguration of Dynamical Systems for Improved Performance via Reverse-engineering and Forward-engineering.
CoRR, 2020

Automatically Searching for U-Net Image Translator Architecture.
CoRR, 2020

Control Reconfiguration of Cyber-physical Systems for Improved Performance via Reverse-engineering and Accelerated First-order Algorithms.
Proceedings of the 11th ACM/IEEE International Conference on Cyber-Physical Systems, 2020

Optical Flow Distillation: Towards Efficient and Stable Video Style Transfer.
Proceedings of the Computer Vision - ECCV 2020, 2020

Frequency Domain Compact 3D Convolutional Neural Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Distilling Portable Generative Adversarial Networks for Image Translation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Attribute Aware Pooling for Pedestrian Attribute Recognition.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Co-Evolutionary Compression for Unpaired Image Translation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2017
A highly accurate facial region network for unconstrained face detection.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

2006
Multi-tape finite-state transducer for asynchronous multi-stream pattern recognition with application to speech.
PhD thesis, 2006

Flexible Multi-Stream Framework for Speech Recognition using Multi-Tape Finite-State Transducers.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Pronunciation modeling using a finite-state transducer representation.
Speech Commun., 2005

A probabilistic approach to unit selection for corpus-based speech synthesis.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2002
EM training of finite-state transducers and its application to pronunciation modeling.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2000
The BBN Byblos 2000 conversational Mandarin LVCSR system.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

The 2000 BBN Byblos LVCSR system.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1995
Duration modeling in large vocabulary speech recognition.
Proceedings of the 1995 International Conference on Acoustics, 1995


  Loading...