Li Lyna Zhang

Orcid: 0000-0002-4465-1628

According to our database, Li Lyna Zhang authored at least 34 papers between 2017 and 2025.

Collaborative distances:

Dijkstra number of three.
Erdős number of four.

Bibliography

2025

LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts.

CoRR, October, 2025

rStar2-Agent: Agentic Reasoning Technical Report.

CoRR, August, 2025

rStar-Coder: Scaling Competitive Code Reasoning with a Large-Scale Verified Dataset.

CoRR, May, 2025

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs.

Authors include Abdelrahman Abouelenin, Vishrav Chaudhary, Abhishek Goswami, Mahmoud Khademi, Daniel Perez-Becker, Saksham Singhal, and Praneetha Vaddamanu.

CoRR, March, 2025

Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization.

CoRR, February, 2025

LongRoPE2: Near-Lossless LLM Context Window Scaling.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solver.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers.

CoRR, 2024

LitePred: Transferable and Scalable Latency Prediction for Hardware-Aware Neural Architecture Search.

Authors include Chengruidong Zhang.

Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens.

Authors include Chengruidong Zhang.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Fewer is More: Boosting Math Reasoning with Reinforced Context Pruning.

Authors include Kwang-Ting Cheng.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

Boosting LLM Reasoning: Push the Limits of Few-shot Learning with Reinforced In-Context Pruning.

Authors include Kwang-Ting Cheng.

CoRR, 2023

Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models.

CoRR, 2023

LUT-NN: Towards Unified Neural Network Inference by Table Lookup.

CoRR, 2023

On Modular Learning of Distributed Systems for Predicting End-to-End Latency.

Authors include Chieh-Jan Mike Liang.

Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

LUT-NN: Empower Efficient Neural Network Inference with Centroid Learning and Table Lookup.

Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, 2023

Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference.

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Accurate and Structured Pruning for Efficient Automatic Speech Recognition.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

Towards efficient vision transformer inference: a first study of transformers on mobile devices.

Proceedings of the HotMobile '22: The 23rd International Workshop on Mobile Computing Systems and Applications, Tempe, Arizona, USA, March 9, 2022

SwiftPruner: Reinforced Evolutionary Pruning for Efficient Ad Relevance.

Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021

nn-METER: Towards Accurate Latency Prediction of DNN Inference on Diverse Edge Devices.

GetMobile Mob. Comput. Commun., 2021

AceNAS: Learning to Rank Ace Neural Architectures with Weak Supervision of Weight Sharing.

CoRR, 2021

nn-Meter: towards accurate latency prediction of deep-learning model inference on diverse edge devices.

Proceedings of the MobiSys '21: The 19th Annual International Conference on Mobile Systems, Applications, and Services, Virtual Event, Wisconsin, USA, 24 June, 2021

Boosting Mobile CNN Inference through Semantic Memory.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

To Bridge Neural Network Design and Real-World Performance: A Behaviour Study for Neural Networks.

Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

2020

Fast Hardware-Aware Neural Architecture Search.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Hardware-aware One-Shot Neural Architecture Search in Coordinate Ascent Framework.

CoRR, 2019

2018

Characterizing Privacy Risks of Mobile Apps with Sensitivity Analysis.

Authors include Chieh-Jan Mike Liang.

IEEE Trans. Mob. Comput., 2018

2017

Towards A Contextual and Scalable Automated-testing Service for Mobile Apps.

Authors include Chieh-Jan Mike Liang.

Proceedings of the 18th International Workshop on Mobile Computing Systems and Applications, 2017

Systematically testing background services of mobile apps.

Authors include Chieh-Jan Mike Liang.

Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering, 2017
