Ye Qiao

Orcid: 0000-0002-6877-5764

According to our database1, Ye Qiao authored at least 22 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
FASQ: Flexible Accelerated Subspace Quantization for Calibration-Free LLM Compression.
CoRR, May, 2026

Artifact Evaluation Repository for the FPGA'26 Paper: TeLLMe: An Efficient End-to-End Ternary LLM Prefill and Decode Accelerator with Table-Lookup Matmul on Edge FPGAs.
Dataset, January, 2026

TeLLMe: An Efficient End-to-End Ternary LLM Prefill and Decode Accelerator with Table-Lookup Matmul on Edge FPGAs.
Proceedings of the 2026 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, 2026

APEX-Q: Arbitrary-dimension Product-EXtension Quantization for Accelerated LLM Deployment (Student Abstract).
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Q-ROAR: Outlier-Aware Rescaling for RoPE Position Interpolation in Quantized Long-Context LLMs (Student Abstract).
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

HARK: Hierarchical Agentic Retrieval with Keyframing for Video Understanding (Student Abstract).
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
PD-Swap: Prefill-Decode Logic Swapping for End-to-End LLM Inference on Edge FPGAs via Dynamic Partial Reconfiguration.
CoRR, December, 2025

TeLLMe v2: An Efficient End-to-End Ternary LLM Prefill and Decode Accelerator with Table-Lookup Matmul on Edge FPGAs.
CoRR, October, 2025

Rethinking RoPE Scaling in Quantized LLM: Theory, Outlier, and Channel-Band Analysis with Weight Rescaling.
CoRR, October, 2025

Q-ROAR: Outlier-Aware Rescaling for RoPE Position Interpolation in Quantized Long-Context LLMs.
CoRR, September, 2025

TeLLMe: An Energy-Efficient Ternary LLM Accelerator for Prefilling and Decoding on Edge FPGAs.
CoRR, April, 2025

MONAS: Efficient Zero-Shot Neural Architecture Search for MCUs.
Proceedings of the International Joint Conference on Neural Networks, 2025

RSEND: Retinex-based Squeeze and Excitation Network with Dark Region Detection for Efficient Low Light Image Enhancement.
Proceedings of the International Joint Conference on Neural Networks, 2025

COBRA: Algorithm-Architecture Co-optimized Binary Transformer Accelerator for Edge Inference.
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2025

2024
Optimized Spatial Architecture Mapping Flow for Transformer Accelerators.
CoRR, 2024

TG-NAS: Leveraging Zero-Cost Proxies with Transformer and Graph Convolution Networks for Efficient Neural Architecture Search.
CoRR, 2024

MicroNAS: Zero-Shot Neural Architecture Search for MCUs.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2024

Generic and Scalable Detection of Risky Transactions Using Density Flows: Applications to Financial Networks.
Proceedings of the Web and Big Data - 8th International Joint Conference, 2024

2023
BNN An Ideal Architecture for Acceleration With Resistive in Memory Computation.
IEEE Trans. Emerg. Top. Comput., 2023

Support for Stock Trend Prediction Using Transformers and Sentiment Analysis.
CoRR, 2023

2022
A Two-Stage Efficient 3-D CNN Framework for EEG Based Emotion Recognition.
Proceedings of the IEEE International Conference on Industrial Technology, 2022

2019
Multi-atlas tool for automated segmentation of brain gray matter nuclei and quantification of their magnetic susceptibility.
NeuroImage, 2019


  Loading...