Lean Wang

According to our database¹, Lean Wang authored at least 14 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2026

Investigating Cross-Modal Skill Injection: Scenarios, Methods, and Hyperparameters.

CoRR, May, 2026

2025

mHC: Manifold-Constrained Hyper-Connections.

CoRR, December, 2025

Unveiling the Role of Learning Rate Schedules via Functional Scaling Laws.

CoRR, September, 2025

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention.

CoRR, February, 2025

DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning.

Nat., 2025

TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos.

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Temporal Reasoning Transfer from Text to Video.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts.

CoRR, 2024

DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models.

CoRR, 2024

Towards Codable Watermarking for Injecting Multi-Bits Information to LLMs.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Towards Codable Text Watermarking for Large Language Models.

CoRR, 2023

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022

Gradient Knowledge Distillation for Pre-trained Language Models.

Lean Wang, Lei Li, Xu Sun

CoRR, 2022
