Jianyu Wei

Orcid: 0000-0001-8183-4821

According to our database, Jianyu Wei authored at least 25 papers between 2013 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing.
CoRR, February, 2026

Fluid Antenna Array-Enabled AAV Covert Communications Against Active Warden.
IEEE Trans. Wirel. Commun., 2026

UAV Covert Communications in Interweave Cognitive Radio Network.
IEEE Trans. Cogn. Commun. Netw., 2026

Cognitive Jamming-Aided UAV Multi-User Covert Communication.
IEEE J. Sel. Areas Commun., 2026

Vec-LUT: Vector Table Lookup for Parallel Ultra-Low-Bit LLM Inference on Edge Devices.
Proceedings of the 24th Annual International Conference on Mobile Systems, 2026

Scaling LLM Test-Time Compute with Mobile NPU on Smartphones.
Proceedings of the 21st European Conference on Computer Systems, 2026

2025
Vec-LUT: Vector Table Lookup for Parallel Ultra-Low-Bit LLM Inference on Edge Devices.
CoRR, December, 2025

T-MAN: Enabling End-to-End Low-Bit LLM Inference on NPUs via Unified Table Lookup.
CoRR, November, 2025

AdaNav: Adaptive Reasoning with Uncertainty for Vision-Language Navigation.
CoRR, September, 2025

UAV-Enabled Multi-User Covert Communications Against Active Flying and Ground Wardens.
IEEE Trans. Veh. Technol., May, 2025

Blockchain-assisted distributed lightweight anonymous two-way authentication protocol for UAV-UAV communication.
Phys. Commun., 2025

Demo: EdgeMind-OS: A Plug-and-Play Embodied Intelligence System for Real-Time On-Device Deployment.
Proceedings of the 31st Annual International Conference on Mobile Computing and Networking, 2025

LUT Tensor Core: A Software-Hardware Co-Design for LUT-Based Low-Bit LLM Inference.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge.
Proceedings of the Twentieth European Conference on Computer Systems, 2025

Bitnet.cpp: Efficient Edge Inference for Ternary LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Rapid data collection and processing in dense urban edge computing networks with drone assistance.
Phys. Commun., 2024

LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration.
CoRR, 2024

Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

AFPQ: Asymmetric Floating Point Quantization for LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Autonomous confrontation strategy learning evolution mechanism of unmanned system group under actual combat in the loop.
Comput. Commun., September, 2023

NN-Stretch: Automatic Neural Network Branching for Parallel Inference on Heterogeneous Multi-Processors.
Proceedings of the 21st Annual International Conference on Mobile Systems, 2023

2021
nn-METER: Towards Accurate Latency Prediction of DNN Inference on Diverse Edge Devices.
GetMobile Mob. Comput. Commun., 2021

nn-Meter: towards accurate latency prediction of deep-learning model inference on diverse edge devices.
Proceedings of the MobiSys '21: The 19th Annual International Conference on Mobile Systems, Applications, and Services, Virtual Event, Wisconsin, USA, 24 June, 2021

2018
Distributed Multi-node of Fuzzy Control Considering Adjacent Node Effect for Temperature Control.
Proceedings of the Advances in Brain Inspired Cognitive Systems, 2018

2013
Facial Expression Recognition based on Independent Component Analysis.
J. Multim., 2013

