Yasuyuki Okoshi
Orcid: 0009-0005-8472-7841
According to our database1,
Yasuyuki Okoshi authored at least 19 papers
between 2022 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
AQPIM: Breaking the PIM Capacity Wall for LLMs with in-Memory Activation Quantization.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2026
A 22nm Continual Learning Accelerator for Autonomous Systems with 69.2TOPS/W Dynamic-Sparse-Weight-Updat and Dual-Mode Vector-Scaled-INT4 Processing.
Proceedings of the IEEE Custom Integrated Circuits Conference, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
Trans. Mach. Learn. Res., 2025
TicketLLM: Next-Generation Sparse and Low-bit Transformers with Supermask-based Method.
Trans. Mach. Learn. Res., 2025
WhiteDwarf: A Holistic Co-Design Approach to Ultra-Compact Neural Inference Acceleration.
IEEE Access, 2025
Binary Quadratic Quantization: Beyond First-Order Quantization for Real-Valued Matrix Compression.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
2024
Trans. Mach. Learn. Res., 2024
CoRR, 2024
Pianissimo: A Sub-mW Class DNN Accelerator With Progressively Adjustable Bit-Precision.
IEEE Access, 2024
Exploiting N:M Sparsity in Quantized-Folded ResNets: Signed Multicoat Supermasks and Iterative Pruning-Quantization.
Proceedings of the Twelfth International Symposium on Computing and Networking, 2024
WhiteDwarf: 12.24 TFLOPS/W 40 nm Versatile Neural Inference Engine for Ultra-Compact Execution of CNNs and MLPs Through Triple Unstructured Sparsity Exploitation and Triple Model Compression.
Proceedings of the IEEE Asian Solid-State Circuits Conference, 2024
2023
Pianissimo: A Sub-mW Class DNN Accelerator with Progressive Bit-by-Bit Datapath Architecture for Adaptive Inference at Edge.
Proceedings of the 2023 IEEE Symposium on VLSI Technology and Circuits (VLSI Technology and Circuits), 2023
Proceedings of the Learning on Graphs Conference, 27-30 November 2023, Virtual Event., 2023
2022
Hiddenite: 4K-PE Hidden Network Inference 4D-Tensor Engine Exploiting On-Chip Model Construction Achieving 34.8-to-16.0TOPS/W for CIFAR-100 and ImageNet.
Proceedings of the IEEE International Solid-State Circuits Conference, 2022
Proceedings of the International Conference on Machine Learning, 2022