Xin Fu

ACM Trans. Archit. Code Optim., December, 2024

Enhancing Neural Network Reliability: Insights From Hardware/Software Collaboration With Neuron Vulnerability Quantization.

[BibT_eX]

[DOI]

IEEE Trans. Computers, August, 2024

Efficient one-shot Neural Architecture Search with progressive choice freezing evolutionary search.

[BibT_eX]

[DOI]

Neurocomputing, 2024

HSAS: Efficient task scheduling for large scale heterogeneous systolic array accelerator cluster.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2024

WHALE-FL: Wireless and Heterogeneity Aware Latency Efficient Federated Learning over Mobile Devices via Adaptive Subnetwork Scheduling.

[BibT_eX]

[DOI]

CoRR, 2024

Safe Offline-to-Online Multi-Agent Decision Transformer: A Safety Conscious Sequence Modeling Approach.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

A New Routing Strategy to Improve Success Rates of Quantum Computers.

[BibT_eX]

[DOI]

Proceedings of the Great Lakes Symposium on VLSI 2024, 2024

Tuning Quantum Computing Privacy through Quantum Error Correction.

[BibT_eX]

[DOI]

Proceedings of the 2024 IEEE Global Communications Conference, 2024

2023

Enabling High-Efficient ReRAM-Based CNN Training Via Exploiting Crossbar-Level Insignificant Writing Elimination.

[BibT_eX]

[DOI]

IEEE Trans. Computers, November, 2023

Accelerating Convolutional Neural Network by Exploiting Sparsity on GPUs.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., September, 2023

Architectural Design Model Guided On-Demand Power Management of Energy-Efficient GPGPU for SLAM.

[BibT_eX]

[DOI]

J. Circuits Syst. Comput., September, 2023

Accelerating Reinforcement Learning-Based CCSL Specification Synthesis Using Curiosity-Driven Exploration.

[BibT_eX]

[DOI]

IEEE Trans. Computers, May, 2023

Energy and Reliability-Aware Task Scheduling for Cost Optimization of DVFS-Enabled Cloud Workflows.

[BibT_eX]

[DOI]

IEEE Trans. Cloud Comput., 2023

Efficient Federated Learning for AIoT Applications Using Knowledge Distillation.

[BibT_eX]

[DOI]

IEEE Internet Things J., 2023

Saca-FI: A microarchitecture-level fault injection framework for reliability analysis of systolic array based CNN accelerator.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2023

A Survey of AI-enabled Dynamic Manufacturing Scheduling: From Directed Heuristics to Autonomous Learning.

[BibT_eX]

[DOI]

ACM Comput. Surv., 2023

Tuning Quantum Computing Privacy through Quantum Error Correction.

[BibT_eX]

[DOI]

CoRR, 2023

CyclicFL: A Cyclic Model Pre-Training Approach to Efficient Federated Learning.

[BibT_eX]

[DOI]

CoRR, 2023

EEFL: High-Speed Wireless Communications Inspired Energy Efficient Federated Learning over Mobile Devices.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual International Conference on Mobile Systems, 2023

NAS-SE: Designing A Highly-Efficient In-Situ Neural Architecture Search Engine for Large-Scale Deployment.

[BibT_eX]

[DOI]

Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 2023

Workie-Talkie: Accelerating Federated Learning by Overlapping Computing and Communications via Contrastive Regularization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Post0-VR: Enabling Universal Realistic Rendering for Modern VR via Exploiting Architectural Similarity and Data Sharing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2023

DAFL: Delay Efficient Federated Learning over Mobile Devices via Device-to-Device Transmissions.

[BibT_eX]

[DOI]

Proceedings of the IEEE Global Communications Conference, 2023

2022

PervasiveFL: Pervasive Federated Learning for Heterogeneous IoT Systems.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

DynamAP: Architectural Support for Dynamic Graph Traversal on the Automata Processor.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2022

Enabling PIM-based AES encryption for online video streaming.

[BibT_eX]

[DOI]

J. Syst. Archit., 2022

BS-pFL: Enabling Low-Cost Personalized Federated Learning by Exploring Weight Gradient Sparsity.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2022

2021

A Collaborative and Sustainable Edge-Cloud Architecture for Object Tracking with Convolutional Siamese Networks.

[BibT_eX]

[DOI]

IEEE Trans. Sustain. Comput., 2021

Enabling Highly Efficient Capsule Networks Processing Through Software-Hardware Co-Design.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2021

Efficient Federated Learning for AIoT Applications Using Knowledge Distillation.

[BibT_eX]

[DOI]

CoRR, 2021

Shift-BNN: Highly-Efficient Probabilistic Bayesian Neural Network Training via Memory-Friendly Pattern Retrieving.

[BibT_eX]

[DOI]

Proceedings of the MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021

η-LSTM: Co-Designing Highly-Efficient Large LSTM Training via Exploiting Memory-Saving and Architectural Design Opportunities.

[BibT_eX]

[DOI]

Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

2020

Energy-Efficient GPU L2 Cache Design Using Instruction-Level Data Locality Similarity.

[BibT_eX]

[DOI]

ACM Trans. Design Autom. Electr. Syst., 2020

Toward Customized Hybrid Fuel-Cell and Battery-powered Mobile Device for Individual Users.

[BibT_eX]

[DOI]

ACM Trans. Embed. Comput. Syst., 2020

Statistical Model Checking-Based Evaluation and Optimization for Cloud Workflow Resource Allocation.

[BibT_eX]

[DOI]

IEEE Trans. Cloud Comput., 2020

Enabling Energy-Efficient and Reliable Neural Network via Neuron-Level Voltage Scaling.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2020

Fast-BCNN: Massive Neuron Skipping in Bayesian Convolutional Neural Networks.

[BibT_eX]

[DOI]

Qiyu Wan

Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

Enabling Highly Efficient Capsule Networks Processing Through A PIM-Based Architecture Design.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2020

2019

Bridging mobile device configuration to the user experience under budget constraint.

[BibT_eX]

[DOI]

Kaige Yan

Pervasive Mob. Comput., 2019

Improving energy efficiency of mobile devices by characterizing and exploring user behaviors.

[BibT_eX]

[DOI]

Kaige Yan

J. Syst. Archit., 2019

OO-VR: NUMA friendly object-oriented VR rendering framework for future NUMA-based multi-GPU systems.

[BibT_eX]

[DOI]

Proceedings of the 46th International Symposium on Computer Architecture, 2019

Enabling Energy-Efficient and Reliable Neural Network via Neuron-Level Voltage Scaling.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE International Conference on Parallel and Distributed Systems, 2019

Reliability Enhancement of Neural Networks via Neuron-Level Vulnerability Quantization.

[BibT_eX]

[DOI]

Proceedings of the Algorithms and Architectures for Parallel Processing, 2019

Reliability Aware Cost Optimization for Memory Constrained Cloud Workflows.

[BibT_eX]

[DOI]

Proceedings of the Algorithms and Architectures for Parallel Processing, 2019

PIM-VR: Erasing Motion Anomalies In Highly-Interactive Virtual Reality World with Customized Memory Cube.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

LoSCache: Leveraging Locality Similarity to Build Energy-Efficient GPU L2 Cache.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019

2018

Towards Memory Friendly Long-Short Term Memory Networks (LSTMs) on Mobile GPUs.

[BibT_eX]

[DOI]

Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture, 2018

Perception-Oriented 3D Rendering Approximation for Modern Graphics Processors.

[BibT_eX]

[DOI]

Chenhao Xie

Shuaiwen Song

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2018

2017

Dolphins First: Dolphin-Aware Communications in Multi-Hop Underwater Cognitive Acoustic Networks.

[BibT_eX]

[DOI]

IEEE Trans. Wirel. Commun., 2017

Interspike-Interval-Based Analog Spike-Time-Dependent Encoder for Neuromorphic Processors.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2017

On the Implication of NTC versus Dark Silicon on Emerging Scale-Out Workloads: The Multi-Core Architecture Perspective.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2017

Efficient Resource Constrained Scheduling Using Parallel Two-Phase Branch-and-Bound Heuristics.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2017

Exploring Energy-Efficient Cache Design in Emerging Mobile Platforms.

[BibT_eX]

[DOI]

ACM Trans. Design Autom. Electr. Syst., 2017

Emerging technology enabled energy-efficient GPGPUs register file.

[BibT_eX]

[DOI]

Microprocess. Microsystems, 2017

Fault-Tolerant Task Scheduling for Mixed-Criticality Real-Time Systems.

[BibT_eX]

[DOI]

J. Circuits Syst. Comput., 2017

GPU-Based Fluid Motion Estimation Using Energy Constraint.

[BibT_eX]

[DOI]

J. Circuits Syst. Comput., 2017

Three dimensional memristor-based neuromorphic computing system and its application to cloud robotics.

[BibT_eX]

[DOI]

Comput. Electr. Eng., 2017

Processing-in-Memory Enabled Graphics Processors for 3D Rendering.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Symposium on High Performance Computer Architecture, 2017

2016

Mitigating the Impact of Hardware Variability for GPGPUs Register File.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2016

Exploring Soft-Error Robust and Energy-Efficient Register File in GPGPUs using Resistive Memory.

[BibT_eX]

[DOI]

ACM Trans. Design Autom. Electr. Syst., 2016

Soft error resilience in Big Data kernels through modular analysis.

[BibT_eX]

[DOI]

J. Supercomput., 2016

Efficient Resource Constrained Scheduling Using Parallel Structure-Aware Pruning Techniques.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2016

Design space exploration for device and architectural heterogeneity in chip-multiprocessors.

[BibT_eX]

[DOI]

Microprocess. Microsystems, 2016

FPGA based spike-time dependent encoder and reservoir design in neuromorphic computing processors.

[BibT_eX]

[DOI]

Microprocess. Microsystems, 2016

Redefining QoS and customizing the power management policy to satisfy individual mobile users.

[BibT_eX]

[DOI]

Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

Exploring Variation-Aware Fault-Tolerant Cache under Near-Threshold Computing.

[BibT_eX]

[DOI]

Proceedings of the 45th International Conference on Parallel Processing, 2016

Combating the Reliability Challenge of GPU Register File at Low Supply Voltage.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016

2015

Aurora: A Cross-Layer Solution for Thermally Resilient Photonic Network-on-Chip.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2015

Characterizing, modeling, and improving the QoE of mobile devices with low battery level.

[BibT_eX]

[DOI]

Kaige Yan

Xingyao Zhang

Proceedings of the 48th International Symposium on Microarchitecture, 2015

Mitigating the Susceptibility of GPGPUs Register File to Process Variations.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

Soft-error reliability and power co-optimization for GPGPUS register file using resistive memory.

[BibT_eX]

[DOI]

Zhi Li

Proceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition, 2015

Variation-aware evaluation of MPSoC task allocation and scheduling strategies using statistical model checking.

[BibT_eX]

[DOI]

Proceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition, 2015

POSTER: A Hardware Fingerprint Using GPU Core Frequency Variations.

[BibT_eX]

[DOI]

Fengjun Li

Bo Luo

Proceedings of the 22nd ACM SIGSAC Conference on Computer and Communications Security, 2015

2014

Design configuration selection for hard-error reliable processors via statistical rules.

[BibT_eX]

[DOI]

Microprocess. Microsystems, 2014

2013

Modeling and characterizing GPGPU reliability in the presence of soft errors.

[BibT_eX]

[DOI]

Parallel Comput., 2013

Intelligent Spatial-based Resource Allocation Algorithms in NoC.

[BibT_eX]

[DOI]

J. Comput., 2013

Hybrid CMOS-TFET based register files for energy-efficient GPGPUs.

[BibT_eX]

[DOI]

Zhi Li

Proceedings of the International Symposium on Quality Electronic Design, 2013

Reliable Express-Virtual-Channel-based network-on-chip under the impact of technology scaling.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Quality Electronic Design, 2013

Lighting the dark silicon by exploiting heterogeneity on future processors.

[BibT_eX]

[DOI]

Proceedings of the 50th Annual Design Automation Conference 2013, 2013

Cost-effective soft-error protection for SRAM-based structures in GPGPUs.

[BibT_eX]

[DOI]

Zhi Li

Proceedings of the Computing Frontiers Conference, 2013

2012

Aurora: A thermally resilient photonic network-on-chip architecture.

[BibT_eX]

[DOI]

Proceedings of the 30th International IEEE Conference on Computer Design, 2012

RISE: improving the streaming processors reliability against soft errors in gpgpus.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012

2011

Analyzing soft-error vulnerability on GPGPU microarchitecture.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE International Symposium on Workload Characterization, 2011

2010

Architecting reliable multi-core network-on-chip for small scale processing technology.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE/IFIP International Conference on Dependable Systems and Networks, 2010

2009

Soft error vulnerability aware process variation mitigation.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on High-Performance Computer Architecture (HPCA-15 2009), 2009

2008

ORBIT: Effective Issue Queue Soft-Error Vulnerability Mitigation on Simultaneous Multithreaded Architectures Using Operand Readiness-Based Instruction Dispatch.

[BibT_eX]

[DOI]

Proceedings of the 20th International Symposium on Computer Architecture and High Performance Computing, 2008

NBTI tolerant microarchitecture design in the presence of process variation.

[BibT_eX]

[DOI]