Yanzhi Wang

Orcid: 0000-0002-3024-7990

According to our database, Yanzhi Wang authored at least 470 papers between 2010 and 2024.


Bibliography

2024
Real-Time Robust Video Object Detection System Against Physical-World Adversarial Attacks.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., January, 2024

TextCraftor: Your Text Encoder Can be Image Quality Controller.
CoRR, 2024

Efficient Pruning of Large Language Model with Adaptive Estimation Fusion.
CoRR, 2024

InstructGIE: Towards Generalizable Image Editing.
CoRR, 2024

DiffClass: Diffusion-Based Class Incremental Learning.
CoRR, 2024

Dynamic Gaussian Graph Operator: Learning parametric partial differential equations in arbitrary discrete mechanics problems.
CoRR, 2024

MPIPN: A Multi Physics-Informed PointNet for solving parametric acoustic-structure systems.
CoRR, 2024

EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge.
CoRR, 2024

EdgeOL: Efficient in-situ Online Learning on Edge Devices.
CoRR, 2024

E²GAN: Efficient Training of Efficient GANs for Image-to-Image Translation.
CoRR, 2024

Energy-Aware Tile Size Selection for Affine Programs on GPUs.
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2024

Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
AirNN: Over-the-Air Computation for Neural Networks via Reconfigurable Intelligent Surfaces.
IEEE/ACM Trans. Netw., December, 2023

DAIS: Automatic Channel Pruning via Differentiable Annealing Indicator Search.
IEEE Trans. Neural Networks Learn. Syst., December, 2023

ELIXIR: An Expedient Connection Paradigm for Self-Powered IoT Devices.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., November, 2023

Memristor-Based Spectral Decomposition of Matrices and Its Applications.
IEEE Trans. Computers, May, 2023

A Co-Scheduling Framework for DNN Models on Mobile and Edge Devices With Heterogeneous Hardware.
IEEE Trans. Mob. Comput., March, 2023

AI-Enabled Experience-Driven Networking: Vision, State-of-the-Art and Future Directions.
IEEE Netw., 2023

Survey: Exploiting Data Redundancy for Optimization of Deep Learning.
ACM Comput. Surv., 2023

A Life-Cycle Energy and Inventory Analysis of Adiabatic Quantum-Flux-Parametron Circuits.
CoRR, 2023

DualHSIC: HSIC-Bottleneck and Alignment for Continual Learning.
CoRR, 2023

Can Adversarial Examples Be Parsed to Reveal Victim Model Information?
CoRR, 2023

SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PackQViT: Faster Sub-8-bit Vision Transformers via Full and Packed Quantization on the Mobile.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

HotBEV: Hardware-oriented Transformer-based Multi-View 3D Detector for BEV Perception.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Design and Implementation of an FFT-Based Neural Network Accelerator Using Rapid Single-Flux-Quantum Technology.
Proceedings of the 21st IEEE Interregional NEWCAS Conference, 2023

SupeRBNN: Randomized Binary Neural Network Using Adiabatic Superconductor Josephson Devices.
Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 2023

RelKD 2023: International Workshop on Resource-Efficient Learning for Knowledge Discovery.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Data Level Lottery Ticket Hypothesis for Vision Transformers.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

StereoVoxelNet: Real-Time Obstacle Detection Based on Occupancy Voxels from a Stereo Camera Using Deep Neural Networks.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

SpeedDETR: Speed-aware Transformers for End-to-end Object Detection.
Proceedings of the International Conference on Machine Learning, 2023

DualHSIC: HSIC-Bottleneck and Alignment for Continual Learning.
Proceedings of the International Conference on Machine Learning, 2023

SmartFRZ: An Efficient Training Framework using Attention-Based Layer Freezing.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Self-Ensemble Protection: Training Checkpoints Are Good Data Protectors.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Unfairness in Distributed Graph Frameworks.
Proceedings of the IEEE International Conference on Data Mining, 2023

Rethinking Vision Transformers for MobileNet Size and Speed.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Fast and Fair Medical AI on the Edge Through Neural Architecture Search for Hybrid Vision Models.
Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

MOC: Multi-Objective Mobile CPU-GPU Co-Optimization for Power-Efficient DNN Inference.
Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

HeatViT: Hardware-Efficient Adaptive Token Pruning for Vision Transformers.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2023

ESRU: Extremely Low-Bit and Hardware-Efficient Stochastic Rounding Unit Design for Low-Bit DNN Training.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023

Invited: Algorithm-Software-Hardware Co-Design for Deep Learning Acceleration.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

Condense: A Framework for Device and Frequency Adaptive Neural Network Models on the Edge.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

Pruning Parameterization with Bi-level Optimization for Efficient Semantic Segmentation on the Edge.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

You Need Multiple Exiting: Dynamic Early Exiting for Accelerating Unified Vision Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Performance Assessment of an Extremely Energy-Efficient Binary Neural Network Using Adiabatic Superconductor Devices.
Proceedings of the 5th IEEE International Conference on Artificial Intelligence Circuits and Systems, 2023

Towards Real-Time Segmentation on the Edge.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Floating Gate Transistor-Based Accurate Digital In-Memory Computing for Deep Neural Networks.
Adv. Intell. Syst., December, 2022

Mobile or FPGA? A Comprehensive Evaluation on Energy Efficiency and a Unified Optimization Framework.
ACM Trans. Embed. Comput. Syst., September, 2022

Elastic Significant Bit Quantization and Acceleration for Deep Neural Networks.
IEEE Trans. Parallel Distributed Syst., 2022

Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration.
ACM Trans. Design Autom. Electr. Syst., 2022

StructADMM: Achieving Ultrahigh Efficiency in Structured Pruning for DNNs.
IEEE Trans. Neural Networks Learn. Syst., 2022

Non-Structured DNN Weight Pruning - Is It Beneficial in Any Platform?
IEEE Trans. Neural Networks Learn. Syst., 2022

Bridging the Gap Between Semantic Segmentation and Instance Segmentation.
IEEE Trans. Multim., 2022

ReCARL: Resource Allocation in Cloud RANs With Deep Reinforcement Learning.
IEEE Trans. Mob. Comput., 2022

Radio Frequency Fingerprinting on the Edge.
IEEE Trans. Mob. Comput., 2022

AntiDoteX: Attention-Based Dynamic Optimization for Neural Network Runtime Efficiency.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

PACA: A Pattern Pruning Algorithm and Channel-Fused High PE Utilization Accelerator for CNNs.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

GRIM: A General, Real-Time Deep Learning Inference Framework for Mobile Devices Based on Fine-Grained Structured Weight Sparsity.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

A survey for deep reinforcement learning in markovian cyber-physical systems: Common problems and solutions.
Neural Networks, 2022

Transfer learning based on improved stacked autoencoder for bearing fault diagnosis.
Knowl. Based Syst., 2022

Neural Network-Based OFDM Receiver for Resource Constrained IoT Devices.
IEEE Internet Things Mag., 2022

The Lottery Ticket Hypothesis for Vision Transformers.
CoRR, 2022

DeltaFS: Pursuing Zero Update Overhead via Metadata-Enabled Delta Compression for Log-structured File System on Mobile Devices.
CoRR, 2022

PIM-QAT: Neural Network Quantization for Processing-In-Memory (PIM) Systems.
CoRR, 2022

Understanding Time Variations of DNN Inference in Autonomous Driving.
CoRR, 2022

Continual Few-Shot Learning with Adversarial Class Storage.
CoRR, 2022

Student-AI Creative Writing: Pedagogical Strategies for Applying Natural Language Generation in Schools.
CoRR, 2022

CoCoPIE XGen: A Full-Stack AI-Oriented Optimizing Framework.
CoRR, 2022

EfficientFormer: Vision Transformers at MobileNet Speed.
CoRR, 2022

Deep neural network goes lighter: A case study of deep compression techniques on automatic RF modulation recognition for Beyond 5G networks.
CoRR, 2022

VAQF: Fully Automatic Software-hardware Co-design Framework for Low-bit Vision Transformer.
CoRR, 2022

Automated deep learning-based wide-band receiver.
Comput. Networks, 2022

Optimizing Data Layout for Training Deep Neural Networks.
Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022

Design and Implementation of Stochastic Neural Networks Using Superconductor Quantum-Flux-Parametron Devices.
Proceedings of the 35th IEEE International System-on-Chip Conference, 2022

Prophet: Realizing a Predictable Real-time Perception Pipeline for Autonomous Vehicles.
Proceedings of the IEEE Real-Time Systems Symposium, 2022

Brief Industry Paper: Enabling Level-4 Autonomous Driving on a Single $1k Off-the-Shelf Card.
Proceedings of the 28th IEEE Real-Time and Embedded Technology and Applications Symposium, 2022

Advancing Model Pruning via Bi-level Optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

EfficientFormer: Vision Transformers at MobileNet Speed.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SparCL: Sparse Continual Learning on the Edge.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

GCD²: A Globally Optimizing Compiler for Mapping DNNs to Mobile DSPs.
Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, 2022

BLCR: Towards Real-time DNN Execution with Block-based Reweighted Pruning.
Proceedings of the 23rd International Symposium on Quality Electronic Design, 2022

Reliability Improvement in RRAM-based DNN for Edge Computing.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2022

Pruning-as-Search: Efficient Neural Architecture Search via Channel Pruning and Structural Reparameterization.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Real-Time Portrait Stylization on the Edge.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Compact Multi-level Sparse Neural Networks with Input Independent Dynamic Rerouting.
Proceedings of the 34th IEEE International Conference on Tools with Artificial Intelligence, 2022

Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets.
Proceedings of the International Conference on Machine Learning, 2022

Effective Model Sparsification by Scheduled Grow-and-Prune Methods.
Proceedings of the Tenth International Conference on Learning Representations, 2022

F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Pruning Adversarially Robust Neural Networks without Adversarial Examples.
Proceedings of the IEEE International Conference on Data Mining, 2022

Quantum Neural Network Compression.
Proceedings of the 41st IEEE/ACM International Conference on Computer-Aided Design, 2022

All-in-One: A Highly Representative DNN Pruning Framework for Edge Devices with Dynamic Power Management.
Proceedings of the 41st IEEE/ACM International Conference on Computer-Aided Design, 2022

Hardware-Friendly Acceleration for Deep Neural Networks with Micro-Structured Compression.
Proceedings of the 30th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2022

You Already Have It: A Generator-Free Low-Precision DNN Training Framework Using Stochastic Rounding.
Proceedings of the Computer Vision - ECCV 2022, 2022

Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution.
Proceedings of the Computer Vision - ECCV 2022, 2022

SPViT: Enabling Faster Vision Transformers via Latency-Aware Soft Token Pruning.
Proceedings of the Computer Vision - ECCV 2022, 2022

TAAS: a timing-aware analytical strategy for AQFP-capable placement automation.
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

Hardware-efficient stochastic rounding unit design for DNN training: late breaking results.
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

DCT-RAM: A Driver-Free Process-In-Memory 8T SRAM Macro with Multi-Bit Charge-Domain Computation and Time-Domain Quantization.
Proceedings of the IEEE Custom Integrated Circuits Conference, 2022

NN-key: A Neural Network-Based Secret Key for Demapping OFDM Symbols.
Proceedings of the 19th IEEE Annual Consumer Communications & Networking Conference, 2022

What drives patients to choose a physician online? A study based on tree models and SHAP values.
Proceedings of the 18th IEEE International Conference on Automation Science and Engineering, 2022

2021
An Actor-Critic-Based Transfer Learning Framework for Experience-Driven Networking.
IEEE/ACM Trans. Netw., 2021

A Survey of Stochastic Computing Neural Networks for Machine Learning Applications.
IEEE Trans. Neural Networks Learn. Syst., 2021

NS-FDN: Near-Sensor Processing Architecture of Feature-Configurable Distributed Network for Beyond-Real-Time Always-on Keyword Spotting.
IEEE Trans. Circuits Syst. I Regul. Pap., 2021

STICKER-T: An Energy-Efficient Neural Network Processor Using Block-Circulant Algorithm and Unified Frequency-Domain Acceleration.
IEEE J. Solid State Circuits, 2021

CAP-RAM: A Charge-Domain In-Memory Computing 6T-SRAM for Accurate and Precision-Programmable CNN Inference.
IEEE J. Solid State Circuits, 2021

PnP-DRL: A Plug-and-Play Deep Reinforcement Learning Approach for Experience-Driven Networking.
IEEE J. Sel. Areas Commun., 2021

SPViT: Enabling Faster Vision Transformers via Soft Token Pruning.
CoRR, 2021

ILMPQ: An Intra-Layer Multi-Precision Deep Neural Network Quantization framework for FPGA.
CoRR, 2021

Enabling Level-4 Autonomous Driving on a Single $1k Off-the-Shelf Card.
CoRR, 2021

Achieving Real-Time Object Detection on Mobile Devices with Neural Pruning Search.
CoRR, 2021

Efficient Micro-Structured Weight Unification and Pruning for Neural Network Compression.
CoRR, 2021

Lottery Ticket Implies Accuracy Degradation, Is It a Desirable Phenomenon?
CoRR, 2021

CoCoPIE: enabling real-time AI on off-the-shelf mobile devices via compression-compilation co-design.
Commun. ACM, 2021

Sensor data-driven structural damage detection based on deep convolutional neural networks and continuous wavelet transform.
Appl. Intell., 2021

JAXED: Reverse Engineering DNN Architectures Leveraging JIT GEMM Libraries.
Proceedings of the 2021 International Symposium on Secure and Private Execution Environment Design (SEED), 2021

Brief Industry Paper: Towards Real-Time 3D Object Detection for Autonomous Vehicles with Pruning Search.
Proceedings of the 27th IEEE Real-Time and Embedded Technology and Applications Symposium, 2021

Work in Progress: Mobile or FPGA? A Comprehensive Evaluation on Energy Efficiency and a Unified Optimization Framework.
Proceedings of the 27th IEEE Real-Time and Embedded Technology and Applications Symposium, 2021

Brief Industry Paper: An Infrastructure-Aided High Definition Map Data Provisioning Service for Autonomous Driving.
Proceedings of the 27th IEEE Real-Time and Embedded Technology and Applications Symposium, 2021

DNNFusion: accelerating deep neural networks execution with advanced operator fusion.
Proceedings of the PLDI '21: 42nd ACM SIGPLAN International Conference on Programming Language Design and Implementation, 2021

MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Sanity Checks for Lottery Tickets: Does Your Winning Ticket Really Win the Jackpot?
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

ScaleCert: Scalable Certified Defense against Adversarial Patches with Sparse Superficial Layers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

EXTRA: An Experience-driven Control Framework for Distributed Stream Data Processing with a Variable Number of Threads.
Proceedings of the 29th IEEE/ACM International Symposium on Quality of Service, 2021

Improving DNN Fault Tolerance using Weight Pruning and Differential Crossbar Mapping for ReRAM-based Edge AI.
Proceedings of the 22nd International Symposium on Quality Electronic Design, 2021

MC²-RAM: An In-8T-SRAM Computing Macro Featuring Multi-Bit Charge-Domain Computing and ADC-Reduction Weight Encoding.
Proceedings of the IEEE/ACM International Symposium on Low Power Electronics and Design, 2021

FORMS: Fine-grained Polarized ReRAM-based In-situ Computation for Mixed-signal DNN Accelerator.
Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

Towards Fast and Accurate Multi-Person Pose Estimation on Mobile Devices.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

A Compression-Compilation Framework for On-mobile Real-time BERT Applications.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

ClickTrain: efficient and accurate end-to-end deep learning training via fine-grained architecture-preserving pruning.
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021

Lottery Ticket Preserves Weight Correlation: Is It Desirable or Not?
Proceedings of the 38th International Conference on Machine Learning, 2021

Investigating Digital Literacy Skills in Examination-Oriented Education System for the Post-Pandemic Era.
Proceedings of the ICETM 2021: 4th International Conference on Education Technology Management, Tokyo, Japan, December 17, 2021

Improving Neural Network Efficiency via Post-training Quantization with Adaptive Floating-Point.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

RMSMP: A Novel Deep Neural Network Quantization Framework with Row-wise Mixed Schemes and Multiple Precisions.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ScaleDNN: Data Movement Aware DNN Training on Multi-GPU.
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2021

Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021

TinyADC: Peripheral Circuit-aware Weight Pruning Framework for Mixed-signal DNN Accelerators.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2021

Towards AQFP-Capable Physical Design Automation.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2021

Neural Pruning Search for Real-Time Object Detection of Autonomous Vehicles.
Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

A Unified DNN Weight Pruning Framework Using Reweighted Optimization Methods.
Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

NPAS: A Compiler-Aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Teachers Do More Than Teach: Compressing Image-to-Image Models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

An Explainable Convolutional Neural Networks for Automatic Segmentation of the Left Ventricle in Cardiac MRI.
Proceedings of CECNet 2021, 2021

A High-Performance Infrared Imaging System with Adaptive Contrast Enhancement.
Proceedings of CECNet 2021, 2021

Real-Time Mobile Acceleration of DNNs: From Computer Vision to Medical Applications.
Proceedings of the ASPDAC '21: 26th Asia and South Pacific Design Automation Conference, 2021

Puncturing the memory wall: Joint optimization of network compression with approximate memory for ASR application.
Proceedings of the ASPDAC '21: 26th Asia and South Pacific Design Automation Conference, 2021

RT3D: Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

A Compression-Compilation Co-Design Framework Towards Real-Time Object Detection on Mobile Devices.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Memory Augmented Deep Recurrent Neural Network for Video Question Answering.
IEEE Trans. Neural Networks Learn. Syst., 2020

A High-Performance and Secure TRNG Based on Chaotic Cellular Automata Topology.
IEEE Trans. Circuits Syst., 2020

Guest Editors' Introduction to the Special Issue on Machine Learning Architectures and Accelerators.
IEEE Trans. Computers, 2020

Achieving Real-Time LiDAR 3D Object Detection on a Mobile Device.
CoRR, 2020

6.7ms on Mobile with over 78% ImageNet Accuracy: Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration.
CoRR, 2020

An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning.
CoRR, 2020

Simultaneous Relevance and Diversity: A New Recommendation Inference Approach.
CoRR, 2020

MSP: An FPGA-Specific Mixed-Scheme, Multi-Precision Deep Neural Network Quantization Framework.
CoRR, 2020

Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization.
CoRR, 2020

ESMFL: Efficient and Secure Models for Federated Learning.
CoRR, 2020

Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices.
CoRR, 2020

AUSN: Approximately Uniform Quantization by Adaptively Superimposing Non-uniform Distribution for Deep Neural Networks.
CoRR, 2020

A Unified DNN Weight Compression Framework Using Reweighted Optimization Methods.
CoRR, 2020

CoCoPIE: Making Mobile AI Sweet As PIE -Compression-Compilation Co-Design Goes a Long Way.
CoRR, 2020

A Privacy-Preserving DNN Pruning and Mobile Acceleration Framework.
CoRR, 2020

RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition.
CoRR, 2020

SS-Auto: A Single-Shot, Automatic Structured Weight Pruning Framework of DNNs with Ultra-High Efficiency.
CoRR, 2020

BLK-REW: A Unified Block-based DNN Pruning Framework using Reweighted Regularization Method.
CoRR, 2020

An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices.
CoRR, 2020

Editorial for the special issue on disruptive computing technologies.
CCF Trans. High Perform. Comput., 2020

3D Capsule Networks for Object Classification With Weight Pruning.
IEEE Access, 2020

One for Many: Transfer Learning for Building HVAC Control.
Proceedings of the BuildSys '20: The 7th ACM International Conference on Systems for Energy-Efficient Buildings, 2020

SympleGraph: distributed graph processing with precise loop-carried dependency guarantee.
Proceedings of the 41st ACM SIGPLAN International Conference on Programming Language Design and Implementation, 2020

Towards Ultra-Efficient DNN Inference Acceleration on Edge Devices for Wellbeing Applications.
Proceedings of the HealthDL@MobiSys 2020, 2020

Efficient Training of Deep Convolutional Neural Networks by Augmentation in Embedding Space.
Proceedings of the 21st International Symposium on Quality Electronic Design, 2020

Characterizing the I/O Pipeline in the Deployment of CNNs on Commercial Accelerators.
Proceedings of the IEEE International Conference on Parallel & Distributed Processing with Applications, 2020

NS-KWS: joint optimization of near-sensor processing architecture and low-precision GRU for always-on keyword spotting.
Proceedings of the ISLPED '20: ACM/IEEE International Symposium on Low Power Electronics and Design, 2020

Accurate and Energy-Efficient Implementation of Non-Linear Adder in Parallel Stochastic Computing using Sorting Network.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2020

Towards Real-Time DNN Inference on Mobile Platforms with Model Pruning and Compiler Optimization.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

CSB-RNN: a faster-than-realtime RNN acceleration framework with compressed structured blocks.
Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020

Learn-Prune-Share for Lifelong Learning.
Proceedings of the 20th IEEE International Conference on Data Mining, 2020

Concurrent Weight Encoding-based Detection for Bit-Flip Attack on Neural Network Accelerators.
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2020

ASAP: An Analytical Strategy for AQFP Placement.
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2020

Robust Sparse Regularization: Defending Adversarial Attacks Via Regularized Sparse Network.
Proceedings of the GLSVLSI '20: Great Lakes Symposium on VLSI 2020, 2020

A Privacy-Preserving-Oriented DNN Pruning and Mobile Acceleration Framework.
Proceedings of the GLSVLSI '20: Great Lakes Symposium on VLSI 2020, 2020

Adversarial T-Shirt! Evading Person Detectors in a Physical World.
Proceedings of the Computer Vision - ECCV 2020, 2020

An Image Enhancing Pattern-Based Sparsity for Real-Time Inference on Mobile Devices.
Proceedings of the Computer Vision - ECCV 2020, 2020

When Sorting Network Meets Parallel Bitstreams: A Fault-Tolerant Parallel Ternary Neural Network Accelerator based on Stochastic Computing.
Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020

AntiDote: Attention-based Dynamic Optimization for Neural Network Runtime Efficiency.
Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020

PCNN: Pattern-based Fine-Grained Regular Pruning Towards Optimizing CNN Accelerators.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

PIM-Prune: Fine-Grain DCNN Pruning for Crossbar-Based Process-In-Memory Architecture.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning.
Proceedings of the ASPLOS '20: Architectural Support for Programming Languages and Operating Systems, 2020

Tiny but Accurate: A Pruned, Quantized and Optimized Memristor Crossbar Framework for Ultra Efficient DNN Implementation.
Proceedings of the 25th Asia and South Pacific Design Automation Conference, 2020

Database and Benchmark for Early-stage Malicious Activity Detection in 3D Printing.
Proceedings of the 25th Asia and South Pacific Design Automation Conference, 2020

DARB: A Density-Adaptive Regular-Block Pruning for Deep Neural Networks.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-Time Execution on Mobile Devices.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Embedding Compression with Isotropic Iterative Quantization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

AutoCompress: An Automatic DNN Structured Pruning Framework for Ultra-High Compression Rates.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Accelerating Sparse CNN Inference on GPUs with Performance-Aware Weight Pruning.
Proceedings of the PACT '20: International Conference on Parallel Architectures and Compilation Techniques, 2020

2019
Reduced-Complexity Deep Neural Networks Design Using Multi-Level Compression.
IEEE Trans. Sustain. Comput., 2019

Distributed Graph Processing System and Processing-in-memory Architecture with Precise Loop-carried Dependency Guarantee.
ACM Trans. Comput. Syst., 2019

HEIF: Highly Efficient Stochastic Computing-Based Inference Framework for Deep Neural Networks.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2019

Experience-Driven Congestion Control: When Multi-Path TCP Meets Deep Reinforcement Learning.
IEEE J. Sel. Areas Commun., 2019

Normalization and dropout for stochastic computing-based deep convolutional neural networks.
Integr., 2019

DARB: A Density-Aware Regular-Block Pruning for Deep Neural Networks.
CoRR, 2019

Evading Real-Time Person Detectors by Adversarial T-shirt.
CoRR, 2019

Reweighted Proximal Pruning for Large-Scale Language Representation.
CoRR, 2019

A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron Superconducting Technology.
CoRR, 2019

AutoSlim: An Automatic DNN Structured Pruning Framework for Ultra-High Compression Rates.
CoRR, 2019

Non-structured DNN Weight Pruning Considered Harmful.
CoRR, 2019

Robust Sparse Regularization: Simultaneously Optimizing Neural Network Robustness and Compactness.
CoRR, 2019

Brain-inspired reverse adversarial examples.
CoRR, 2019

Toward Extremely Low Bit and Lossless Accuracy in DNNs with Progressive ADMM.
CoRR, 2019

26ms Inference Time for ResNet-50: Towards Real-Time Execution of all DNNs on Smartphone.
CoRR, 2019

ResNet Can Be Pruned 60x: Introducing Network Purification and Unused Path Removal (P-RM) after Weight Pruning.
CoRR, 2019

Second Rethinking of Network Pruning in the Adversarial Setting.
CoRR, 2019

Progressive DNN Compression: A Key to Achieve Ultra-High Weight Pruning and Quantization Rates using ADMM.
CoRR, 2019

CircConv: A Structured Convolution with Low Complexity.
CoRR, 2019

Autonomous UAV with Learned Trajectory Generation and Control.
Proceedings of the 2019 IEEE International Workshop on Signal Processing Systems, 2019

Fast and Accurate Trajectory Tracking for Unmanned Aerial Vehicles based on Deep Reinforcement Learning.
Proceedings of the 25th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, 2019

ResNet Can Be Pruned 60×: Introducing Network Purification and Unused Path Removal (P-RM) after Weight Pruning.
Proceedings of the IEEE/ACM International Symposium on Nanoscale Architectures, 2019

GraphQ: Scalable PIM-Based Graph Processing.
Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, 2019

Deep Compressed Pneumonia Detection for Low-Power Embedded Devices.
Proceedings of the Large-Scale Annotation of Biomedical Data and Expert Label Synthesis and Hardware Aware Learning for Medical Imaging and Computer Assisted Intervention, 2019

IDE Development, Logic Synthesis and Buffer/Splitter Insertion Framework for Adiabatic Quantum-Flux-Parametron Superconducting Circuits.
Proceedings of the 2019 IEEE Computer Society Annual Symposium on VLSI, 2019

A 65nm 0.39-to-140.3TOPS/W 1-to-12b Unified Neural Network Processor Using Block-Circulant-Enabled Transpose-Domain Acceleration with 8.1 × Higher TOPS/mm<sup>2</sup> and 6T HBST-TRAM-Based 2D Data-Reuse Architecture.
Proceedings of the IEEE International Solid-State Circuits Conference, 2019

A General Framework to Map Neural Networks onto Neuromorphic Processor.
Proceedings of the 20th International Symposium on Quality Electronic Design, 2019

An Ultra-Efficient Memristor-Based DNN Framework with Structured Weight Pruning and Quantization Using ADMM.
Proceedings of the 2019 IEEE/ACM International Symposium on Low Power Electronics and Design, 2019

A stochastic-computing based deep learning framework using adiabatic quantum-flux-parametron superconducting technology.
Proceedings of the 46th International Symposium on Computer Architecture, 2019

Interpreting and Evaluating Neural Network Robustness.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Protecting Neural Networks with Hierarchical Random Switching: Towards Better Robustness-Accuracy Trade-off for Stochastic Defenses.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Structured Adversarial Attack: Towards General Implementation and Better Interpretability.
Proceedings of the 7th International Conference on Learning Representations, 2019

Evaluating Fault Resiliency of Compressed Deep Neural Networks.
Proceedings of the 15th IEEE International Conference on Embedded Software and Systems, 2019

Efficient Cloud Resource Management using Neuromorphic Modeling and Prediction for Virtual Machine Resource Utilization.
Proceedings of the 15th IEEE International Conference on Embedded Software and Systems, 2019

Generation of Low Distortion Adversarial Attacks via Convex Programming.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Adversarial Robustness vs. Model Compression, or Both?
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

A Buffer and Splitter Insertion Framework for Adiabatic Quantum-Flux-Parametron Superconducting Circuits.
Proceedings of the 37th IEEE International Conference on Computer Design, 2019

E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs.
Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

SPEC2: SPECtral SParsE CNN Accelerator on FPGAs.
Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019

HSIM-DNN: Hardware Simulator for Computation-, Storage- and Power-Efficient Deep Neural Networks.
Proceedings of the 2019 on Great Lakes Symposium on VLSI, 2019

ADMM-based Weight Pruning for Real-Time Deep Learning Acceleration on Mobile Devices.
Proceedings of the 2019 on Great Lakes Symposium on VLSI, 2019

A Majority Logic Synthesis Framework for Adiabatic Quantum-Flux-Parametron Superconducting Circuits.
Proceedings of the 2019 on Great Lakes Symposium on VLSI, 2019

REQ-YOLO: A Resource-Aware, Efficient Quantization Framework for Object Detection on FPGAs.
Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2019

Fault Sneaking Attack: a Stealthy Framework for Misleading Deep Neural Networks.
Proceedings of the 56th Annual Design Automation Conference 2019, 2019

A Fault-Tolerant Neural Network Architecture.
Proceedings of the 56th Annual Design Automation Conference 2019, 2019

Multi-Channel Attention Selection GAN With Cascaded Semantic Guidance for Cross-View Image Translation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Machine Vision Guided 3D Medical Image Compression for Efficient Transmission and Accurate Segmentation in the Clouds.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Feature Distillation: DNN-Oriented JPEG Compression Against Adversarial Examples.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

A Sparse-Adaptive CNN Processor with Area/Performance balanced N-Way Set-Associate PE Arrays Assisted by a Collision-Aware Scheduler.
Proceedings of the IEEE Asian Solid-State Circuits Conference, 2019

ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Methods of Multipliers.
Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 2019

ADMM attack: an enhanced adversarial attack for deep neural networks with undetectable distortions.
Proceedings of the 24th Asia and South Pacific Design Automation Conference, 2019

A system-level perspective to understand the vulnerability of deep learning systems.
Proceedings of the 24th Asia and South Pacific Design Automation Conference, 2019

Universal Approximation Property and Equivalence of Stochastic Computing-Based Neural Networks and Binary Neural Networks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Dynamic Reconfiguration of Thermoelectric Generators for Vehicle Radiators Energy Harvesting Under Location-Dependent Temperature Variations.
IEEE Trans. Very Large Scale Integr. Syst., 2018

An Exploration of Applying Gate-Length-Biasing Techniques to Deeply-Scaled FinFETs Operating in Multiple Voltage Regimes.
IEEE Trans. Emerg. Top. Comput., 2018

A Stochastic Computational Multi-Layer Perceptron with Backward Propagation.
IEEE Trans. Computers, 2018

Model-free Control for Distributed Stream Data Processing using Deep Reinforcement Learning.
Proc. VLDB Endow., 2018

Deep reinforcement learning: Algorithm, applications, and ultra-low-power implementation.
Nano Commun. Networks, 2018

A low-computation-complexity, energy-efficient, and high-performance linear program solver based on primal-dual interior point method using memristor crossbars.
Nano Commun. Networks, 2018

Modular Spiking Neural Circuits for Mapping Long Short-Term Memory on a Neurosynaptic Processor.
IEEE J. Emerg. Sel. Topics Circuits Syst., 2018

An Energy-Efficient Online-Learning Stochastic Computational Deep Belief Network.
IEEE J. Emerg. Sel. Topics Circuits Syst., 2018

Reconfigurable Photovoltaic Systems for Electric Vehicles.
IEEE Des. Test, 2018

ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers.
CoRR, 2018

A Unified Framework of DNN Weight Pruning and Weight Clustering/Quantization Using ADMM.
CoRR, 2018

Progressive Weight Pruning of Deep Neural Networks using ADMM.
CoRR, 2018

Interpreting Adversarial Robustness: A View from Decision Surface in Input Space.
CoRR, 2018

Structured Adversarial Attack: Towards General Implementation and Better Interpretability.
CoRR, 2018

ADAM-ADMM: A Unified, Systematic Framework of Structured Weight Pruning for DNNs.
CoRR, 2018

Adversarial Meta-Learning.
CoRR, 2018

Towards Robust Training of Neural Networks by Regularizing Adversarial Gradients.
CoRR, 2018

Feature Distillation: DNN-Oriented JPEG Compression Against Adversarial Examples.
CoRR, 2018

An ADMM-Based Universal Framework for Adversarial Attacks on Deep Neural Networks.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

CSE: Parallel Finite State Machines with Convergence Set Enumeration.
Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture, 2018

A Fast and Effective Memristor-Based Method for Finding Approximate Eigenvalues and Eigenvectors of Non-negative Matrices.
Proceedings of the 2018 IEEE Computer Society Annual Symposium on VLSI, 2018

Towards Budget-Driven Hardware Optimization for Deep Convolutional Neural Networks Using Stochastic Computing.
Proceedings of the 2018 IEEE Computer Society Annual Symposium on VLSI, 2018

An area and energy efficient design of domain-wall memory-based deep convolutional neural networks using stochastic computing.
Proceedings of the 19th International Symposium on Quality Electronic Design, 2018

Experience-driven Networking: A Deep Reinforcement Learning based Approach.
Proceedings of the 2018 IEEE Conference on Computer Communications, 2018

Learning Topics Using Semantic Locality.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Systematic Weight Pruning of DNNs using Alternating Direction Method of Multipliers.
Proceedings of the 6th International Conference on Learning Representations, 2018

Efficient Recurrent Neural Networks using Structured Matrices in FPGAs.
Proceedings of the 6th International Conference on Learning Representations, 2018

Design automation methodology and tools for superconductive electronics.
Proceedings of the International Conference on Computer-Aided Design, 2018

Low Power and Trusted Machine Learning.
Proceedings of the 2018 on Great Lakes Symposium on VLSI, 2018

Structured Weight Matrices-Based Hardware Accelerators in Deep Neural Networks: FPGAs and ASICs.
Proceedings of the 2018 on Great Lakes Symposium on VLSI, 2018

Reinforced Adversarial Attacks on Deep Neural Networks Using ADMM.
Proceedings of the 2018 IEEE Global Conference on Signal and Information Processing, 2018

C-LSTM: Enabling Efficient LSTM using Structured Compression Techniques on FPGAs.
Proceedings of the 2018 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2018

A Systematic DNN Weight Pruning Framework Using Alternating Direction Method of Multipliers.
Proceedings of the Computer Vision - ECCV 2018, 2018

An energy-efficient stochastic computational deep belief network.
Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

FFT-based deep learning deployment in embedded systems.
Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

DeepN-JPEG: a deep neural network favorable JPEG-based image compression framework.
Proceedings of the 55th Annual Design Automation Conference, 2018

PrinTracker: Fingerprinting 3D Printers using Commodity Scanners.
Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security, 2018

VIBNN: Hardware Acceleration of Bayesian Neural Networks.
Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018

A deep reinforcement learning framework for optimizing fuel economy of hybrid electric vehicles.
Proceedings of the 23rd Asia and South Pacific Design Automation Conference, 2018

Security analysis and enhancement of model compressed deep learning systems under adversarial attacks.
Proceedings of the 23rd Asia and South Pacific Design Automation Conference, 2018

Towards Ultra-High Performance and Energy Efficiency of Deep Learning Systems: An Algorithm-Hardware Co-Optimization Framework.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
TEI-power: Temperature Effect Inversion-Aware Dynamic Thermal Management.
ACM Trans. Design Autom. Electr. Syst., 2017

Optimal Control of PEVs with a Charging Aggregator Considering Regulation Service Provisioning.
ACM Trans. Cyber Phys. Syst., 2017

Fully-Parallel Area-Efficient Deep Neural Network Design Using Stochastic Computing.
IEEE Trans. Circuits Syst. II Express Briefs, 2017

An optimal energy co-scheduling framework for smart buildings.
Integr., 2017

Hierarchical resource allocation and consolidation framework in a multi-core server cluster using a Markov decision process model.
IET Cyber-Phys. Syst.: Theory & Appl., 2017

Multisource Indoor Energy Harvesting for Nonvolatile Processors.
IEEE Des. Test, 2017

A Memristor-Based Optimization Framework for AI Applications.
CoRR, 2017

Deep Reinforcement Learning: Framework, Applications, and Embedded Implementations.
CoRR, 2017

CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-Circulant Weight Matrices.
CoRR, 2017

Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank.
CoRR, 2017

pgRNAFinder: a web-based tool to design distance independent paired-gRNA.
Bioinform., 2017

Memristor crossbar-based ultra-efficient next-generation baseband processors.
Proceedings of the IEEE 60th International Midwest Symposium on Circuits and Systems, 2017

CirCNN: accelerating and compressing deep neural networks using block-circulant weight matrices.
Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, 2017

A lightweight progress maximization scheduler for non-volatile processor under unstable energy harvesting.
Proceedings of the 18th ACM SIGPLAN/SIGBED Conference on Languages, 2017

Data center power management for regulation service using neural network-based power prediction.
Proceedings of the 18th International Symposium on Quality Electronic Design, 2017

Fast and energy-aware resource provisioning and task scheduling for cloud systems.
Proceedings of the 18th International Symposium on Quality Electronic Design, 2017

Reconfigurable thermoelectric generators for vehicle radiators energy harvesting.
Proceedings of the 2017 IEEE/ACM International Symposium on Low Power Electronics and Design, 2017

Spatiotemporal modeling and prediction in cellular networks: A big data enabled deep learning approach.
Proceedings of the 2017 IEEE Conference on Computer Communications, 2017

Stable spike-timing dependent plasticity rule for multilayer unsupervised and supervised learning.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Hardware-driven nonlinear activation for stochastic computing based deep convolutional neural networks.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Improving contour accuracy of a 2-DOF planar parallel kinematic machine by smart structure based compensation method.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank.
Proceedings of the 34th International Conference on Machine Learning, 2017

Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data.
Proceedings of the 2017 IEEE International Conference on Healthcare Informatics, 2017

A Hierarchical Framework of Cloud Resource Allocation and Power Management Using Deep Reinforcement Learning.
Proceedings of the 37th IEEE International Conference on Distributed Computing Systems, 2017

Hardware Acceleration of Bayesian Neural Networks Using RAM Based Linear Feedback Gaussian Random Number Generators.
Proceedings of the 2017 IEEE International Conference on Computer Design, 2017

A spike-based long short-term memory on a neurosynaptic processor.
Proceedings of the 2017 IEEE/ACM International Conference on Computer-Aided Design, 2017

Energy-efficient, high-performance, highly-compressed deep neural network design using block-circulant matrices.
Proceedings of the 2017 IEEE/ACM International Conference on Computer-Aided Design, 2017

Deep reinforcement learning: Framework, applications, and embedded implementations: Invited paper.
Proceedings of the 2017 IEEE/ACM International Conference on Computer-Aided Design, 2017

A deep reinforcement learning based framework for power-efficient resource allocation in cloud RANs.
Proceedings of the IEEE International Conference on Communications, 2017

Ultra-fast robust compressive sensing based on memristor crossbars.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Softmax Regression Design for Stochastic Computing Based Deep Convolutional Neural Networks.
Proceedings of the on Great Lakes Symposium on VLSI 2017, 2017

Deadline-Aware Joint Optimization of Sleep Transistor and Supply Voltage for FinFET Based Embedded Systems.
Proceedings of the on Great Lakes Symposium on VLSI 2017, 2017

Structural design optimization for deep convolutional neural networks using stochastic computing.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2017

Deep Reinforcement Learning for Building HVAC Control.
Proceedings of the 54th Annual Design Automation Conference, 2017

SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing.
Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, 2017

Algorithm-hardware co-optimization of the memristor-based framework for solving SOCP and homogeneous QCQP problems.
Proceedings of the 22nd Asia and South Pacific Design Automation Conference, 2017

Towards acceleration of deep convolutional neural networks using stochastic computing.
Proceedings of the 22nd Asia and South Pacific Design Automation Conference, 2017

Algorithm accelerations for luminescent solar concentrator-enhanced reconfigurable onboard photovoltaic system.
Proceedings of the 22nd Asia and South Pacific Design Automation Conference, 2017

2016
Area-Efficient Scaling-Free DFT/FFT Design Using Stochastic Computing.
IEEE Trans. Circuits Syst. II Express Briefs, 2016

Concurrent Task Scheduling and Dynamic Voltage and Frequency Scaling in a Real-Time Embedded System With Energy Harvesting.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2016

Model-Free Reinforcement Learning and Bayesian Classification in System-Level Power Management.
IEEE Trans. Computers, 2016

A Four Quadrature Signals' Generator with Precise Phase Adjustment.
J. Electr. Comput. Eng., 2016

Properness-based blind mitigation of feedback impairments in digital pre-distortion system.
Proceedings of the 8th International Conference on Wireless Communications & Signal Processing, 2016

Standard cell library based layout characterization and power analysis for 10nm gate-all-around (GAA) transistors.
Proceedings of the 29th IEEE International System-on-Chip Conference, 2016

Design of high-speed low-power polar BP decoder using emerging technologies.
Proceedings of the 29th IEEE International System-on-Chip Conference, 2016

A low-computation-complexity, energy-efficient, and high-performance linear program solver using memristor crossbars.
Proceedings of the 29th IEEE International System-on-Chip Conference, 2016

High-Accuracy FIR Filter Design Using Stochastic Computing.
Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2016

Memristor-Based Discrete Fourier Transform for Improving Performance and Energy Efficiency.
Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2016

System Design for In-Hardware STDP Learning and Spiking Based Probabilistic Inference.
Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2016

Maximizing the performance of NoC-based MPSoCs under total power and power density constraints.
Proceedings of the 17th International Symposium on Quality Electronic Design, 2016

Negotiation-based resource provisioning and task scheduling algorithm for cloud systems.
Proceedings of the 17th International Symposium on Quality Electronic Design, 2016

Area-efficient scaling-free DFT/FFT design using stochastic computing.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2016

Multi-source in-door energy harvesting for non-volatile processors.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2016

Designing reconfigurable large-scale deep learning systems using stochastic computing.
Proceedings of the IEEE International Conference on Rebooting Computing, 2016

Power-aware virtual machine mapping in the data-center-on-a-chip paradigm.
Proceedings of the 34th IEEE International Conference on Computer Design, 2016

DSCNN: Hardware-oriented optimization for Stochastic Computing based Deep Convolutional Neural Networks.
Proceedings of the 34th IEEE International Conference on Computer Design, 2016

Luminescent solar concentrator-based photovoltaic reconfiguration for hybrid and plug-in electric vehicles.
Proceedings of the 34th IEEE International Conference on Computer Design, 2016

Dynamic converter reconfiguration for near-threshold non-volatile processors using in-door energy harvesting.
Proceedings of the 34th IEEE International Conference on Computer Design, 2016

A Reinforcement Learning-Based Power Management Framework for Green Computing Data Centers.
Proceedings of the 2016 IEEE International Conference on Cloud Engineering, 2016

Area-Efficient Error-Resilient Discrete Fourier Transformation Design using Stochastic Computing.
Proceedings of the 26th edition on Great Lakes Symposium on VLSI, 2016

Neural Network-based Prediction Algorithms for In-Door Multi-Source Energy Harvesting System for Non-Volatile Processors.
Proceedings of the 26th edition on Great Lakes Symposium on VLSI, 2016

Charging state aware optimal auction design for sensor selection in crowdsourcing based sensor networks.
Proceedings of the 19th International Conference on Information Fusion, 2016

Optimal energy allocation and storage control for distributed estimation with sensor collaboration.
Proceedings of the 2016 Annual Conference on Information Science and Systems, 2016

Optimal co-scheduling of HVAC control and battery management for energy-efficient buildings considering state-of-health degradation.
Proceedings of the 21st Asia and South Pacific Design Automation Conference, 2016

A Profit Optimization Framework of Energy Storage Devices in Data Centers: Hierarchical Structure and Hybrid Types.
Proceedings of the 9th IEEE International Conference on Cloud Computing, 2016

2015
Task Scheduling with Dynamic Voltage and Frequency Scaling for Energy Minimization in the Mobile Cloud Computing Environment.
IEEE Trans. Serv. Comput., 2015

Performance Comparisons Between 7-nm FinFET and Conventional Bulk CMOS Standard Cell Libraries.
IEEE Trans. Circuits Syst. II Express Briefs, 2015

Optimizing a Reconfigurable Power Distribution Network in a Multicore Platform.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2015

Hierarchical power management of a system with autonomously power-managed components using reinforcement learning.
Integr., 2015

Optimizing fuel economy of hybrid electric vehicles using a Markov decision process model.
Proceedings of the 2015 IEEE Intelligent Vehicles Symposium, 2015

Design optimization of sense amplifiers using deeply-scaled FinFET devices.
Proceedings of the Sixteenth International Symposium on Quality Electronic Design, 2015

Optimal choice of FinFET devices for energy minimization in deeply-scaled technologies.
Proceedings of the Sixteenth International Symposium on Quality Electronic Design, 2015

Reconfigurable three dimensional photovoltaic panel architecture for solar-powered time extension.
Proceedings of the IEEE/ACM International Symposium on Low Power Electronics and Design, 2015

Design and optimization of a reconfigurable power delivery network for large-area, DVS-enabled OLED displays.
Proceedings of the IEEE/ACM International Symposium on Low Power Electronics and Design, 2015

Multi-source energy harvesting management and optimization for non-volatile processors.
Proceedings of the Sixth International Green and Sustainable Computing Conference, 2015

Layout Characterization and Power Density Analysis for Shorted-Gate and Independent-Gate 7nm FinFET Standard Cells.
Proceedings of the 25th edition on Great Lakes Symposium on VLSI, GLVLSI 2015, Pittsburgh, PA, USA, May 20, 2015

Analyzing the Dark Silicon Phenomenon in a Many-Core Chip Multi-Processor under Deeply-Scaled Process Technologies.
Proceedings of the 25th edition on Great Lakes Symposium on VLSI, GLVLSI 2015, Pittsburgh, PA, USA, May 20, 2015

Efficiency-driven design time optimization of a hybrid energy storage system with networked charge transfer interconnect.
Proceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition, 2015

Event-driven and sensorless photovoltaic system reconfiguration for electric vehicles.
Proceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition, 2015

Leakage power reduction for deeply-scaled FinFET circuits operating in multiple voltage regimes using fine-grained gate-length biasing technique.
Proceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition, 2015

Joint automatic control of the powertrain and auxiliary systems to enhance the electromobility in hybrid electric vehicles.
Proceedings of the 52nd Annual Design Automation Conference, 2015

Optimal control of PEVs for energy cost minimization and frequency regulation in the smart grid accounting for battery state-of-health degradation.
Proceedings of the 52nd Annual Design Automation Conference, 2015

Reinforcement learning-based control of residential energy storage systems for electric bill minimization.
Proceedings of the 12th Annual IEEE Consumer Communications and Networking Conference, 2015

A cross-layer framework for designing and optimizing deeply-scaled FinFET-based SRAM cells under process variations.
Proceedings of the 20th Asia and South Pacific Design Automation Conference, 2015

Negotiation-based task scheduling and storage control algorithm to minimize user's electric bills under dynamic prices.
Proceedings of the 20th Asia and South Pacific Design Automation Conference, 2015

Hierarchical Deployment and Control of Energy Storage Devices in Data Centers.
Proceedings of the 8th IEEE International Conference on Cloud Computing, 2015

A Joint Optimization Framework for Request Scheduling and Energy Storage Management in a Data Center.
Proceedings of the 8th IEEE International Conference on Cloud Computing, 2015

2014
Single-Source, Single-Destination Charge Migration in Hybrid Electrical Energy Storage Systems.
IEEE Trans. Very Large Scale Integr. Syst., 2014

Adaptive Control for Energy Storage Systems in Households With Photovoltaic Modules.
IEEE Trans. Smart Grid, 2014

Architecture and Control Algorithms for Combating Partial Shading in Photovoltaic Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2014

Optimizing the Power Delivery Network in a Smartphone Platform.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2014

Designing soft-edge flip-flop-based linear pipelines operating in multiple supply voltage regimes.
Integr., 2014

Designing Fault-Tolerant Photovoltaic Systems.
IEEE Des. Test, 2014

Negotiation-based task scheduling to minimize user's electricity bills under dynamic energy prices.
Proceedings of the IEEE Online Conference on Green Communications, 2014

5nm FinFET Standard Cell Library Optimization and Circuit Synthesis in Near- and Super-Threshold Voltage Regimes.
Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2014

FinCACTI: Architectural Analysis and Modeling of Caches with Deeply-Scaled FinFET Devices.
Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2014

Stack sizing analysis and optimization for FinFET logic cells and circuits operating in the sub/near-threshold regime.
Proceedings of the Fifteenth International Symposium on Quality Electronic Design, 2014

An improved logical effort model and framework applied to optimal sizing of circuits operating in multiple supply voltage regimes.
Proceedings of the Fifteenth International Symposium on Quality Electronic Design, 2014

An efficient semi-analytical current source model for FinFET devices in near/sub-threshold regime considering multiple input switching and stack effect.
Proceedings of the Fifteenth International Symposium on Quality Electronic Design, 2014

Dynamic thermal management for FinFET-based circuits exploiting the temperature effect inversion phenomenon.
Proceedings of the International Symposium on Low Power Electronics and Design, 2014

Fast photovoltaic array reconfiguration for partial solar powered vehicles.
Proceedings of the International Symposium on Low Power Electronics and Design, 2014

Coordination of the smart grid and distributed data centers: A nested game-based optimization framework.
Proceedings of the IEEE PES Innovative Smart Grid Technologies Conference, 2014

An electricity trade model for microgrid communities in smart grid.
Proceedings of the IEEE PES Innovative Smart Grid Technologies Conference, 2014

Model-free learning-based online management of hybrid electrical energy storage systems in electric vehicles.
Proceedings of the IECON 2014 - 40th Annual Conference of the IEEE Industrial Electronics Society, Dallas, TX, USA, October 29, 2014

Resource allocation optimization in a data center with energy storage devices.
Proceedings of the IECON 2014 - 40th Annual Conference of the IEEE Industrial Electronics Society, Dallas, TX, USA, October 29, 2014

Variation-aware joint optimization of the supply voltage and sleep transistor size for the 7nm FinFET technology.
Proceedings of the 32nd IEEE International Conference on Computer Design, 2014

Low write-energy STT-MRAMs using FinFET-based access transistors.
Proceedings of the 32nd IEEE International Conference on Computer Design, 2014

Power supply and consumption co-optimization of portable embedded systems with hybrid power supply.
Proceedings of the 32nd IEEE International Conference on Computer Design, 2014

Reinforcement learning based power management for hybrid electric vehicles.
Proceedings of the IEEE/ACM International Conference on Computer-Aided Design, 2014

Optimal offloading control for a mobile device based on a realistic battery model and semi-markov decision process.
Proceedings of the IEEE/ACM International Conference on Computer-Aided Design, 2014

An optimization framework for data centers to minimize electric bill under day-ahead dynamic energy prices while providing regulation services.
Proceedings of the International Green Computing Conference, 2014

7nm FinFET standard cell layout characterization and power density prediction in near- and super-threshold voltage regimes.
Proceedings of the International Green Computing Conference, 2014

Optimal power switch design methodology for ultra dynamic voltage scaling with a limited number of power rails.
Proceedings of the Great Lakes Symposium on VLSI 2014, GLSVLSI '14, Houston, TX, USA - May 21, 2014

Energy optimal sizing of FinFET standard cells operating in multiple voltage regimes using adaptive independent gate control.
Proceedings of the Great Lakes Symposium on VLSI 2014, GLSVLSI '14, Houston, TX, USA - May 21, 2014

Optimal design and management of a smart residential PV and energy storage system.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2014

Minimizing state-of-health degradation in hybrid electrical energy storage systems with arbitrary source and load profiles.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2014

VRCon: Dynamic reconfiguration of voltage regulators in a multicore platform.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2014

FEPMA: Fine-grained event-driven power meter for Android smartphones based on device driver layer event monitoring.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2014

An energy-aware fault tolerant scheduling framework for soft error resilient cloud computing systems.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2014

Concurrent placement, capacity provisioning, and request flow control for a distributed cloud infrastructure.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2014

Cost-effective design of a hybrid electrical energy storage system for electric vehicles.
Proceedings of the 2014 International Conference on Hardware/Software Codesign and System Synthesis, 2014

Prediction and control of bursty cloud workloads: A fractal framework.
Proceedings of the 2014 International Conference on Hardware/Software Codesign and System Synthesis, 2014

Trace-Based Analysis and Prediction of Cloud Computing User Behavior Using the Fractal Modeling Technique.
Proceedings of the 2014 IEEE International Congress on Big Data, Anchorage, AK, USA, June 27, 2014

Semi-analytical current source modeling of FinFET devices operating in near/sub-threshold regime with independent gate control and considering process variation.
Proceedings of the 19th Asia and South Pacific Design Automation Conference, 2014

Energy and Performance-Aware Task Scheduling in a Mobile Cloud Computing Environment.
Proceedings of the 2014 IEEE 7th International Conference on Cloud Computing, Anchorage, AK, USA, June 27, 2014

2013
Charge Allocation in Hybrid Electrical Energy Storage Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2013

Accurate Modeling of the Delay and Energy Overhead of Dynamic Voltage and Frequency Scaling in Modern Microprocessors.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2013

Computer-Aided Design and Optimization of Hybrid Energy Storage Systems.
Found. Trends Electron. Des. Autom., 2013

A Nested Two Stage Game-Based Optimization Framework in Mobile Cloud Computing System.
Proceedings of the Seventh IEEE International Symposium on Service-Oriented System Engineering, 2013

A nested game-based optimization framework for electricity retailers in the smart grid with residential users and PEVs.
Proceedings of the IEEE Online Conference on Green Communications, OnlineGreenComm 2013, 2013

Hierarchical dynamic power management using model-free reinforcement learning.
Proceedings of the International Symposium on Quality Electronic Design, 2013

Resource allocation and consolidation in a multi-core server cluster using a Markov decision process model.
Proceedings of the International Symposium on Quality Electronic Design, 2013

SIMES: A simulator for hybrid electrical energy storage systems.
Proceedings of the International Symposium on Low Power Electronics and Design (ISLPED), 2013

Maximum power transfer tracking in a solar USB charger for smartphones.
Proceedings of the International Symposium on Low Power Electronics and Design (ISLPED), 2013

A framework of concurrent task scheduling and dynamic voltage and frequency scaling in real-time embedded systems with energy harvesting.
Proceedings of the International Symposium on Low Power Electronics and Design (ISLPED), 2013

A sequential game perspective and optimization of the smart grid with distributed data centers.
Proceedings of the IEEE PES Innovative Smart Grid Technologies Conference, 2013

A game-theoretic price determination algorithm for utility companies serving a community in smart grid.
Proceedings of the IEEE PES Innovative Smart Grid Technologies Conference, 2013

Semi-analytical current source modeling of near-threshold operating logic cells considering process variations.
Proceedings of the 2013 IEEE 31st International Conference on Computer Design, 2013

Dynamic thermal management in mobile devices considering the thermal coupling between battery and application processor.
Proceedings of the IEEE/ACM International Conference on Computer-Aided Design, 2013

Joint sizing and adaptive independent gate control for FinFET circuits operating in multiple voltage regimes using the logical effort method.
Proceedings of the IEEE/ACM International Conference on Computer-Aided Design, 2013

Variability-aware design of energy-delay optimal linear pipelines operating in the near-threshold regime and above.
Proceedings of the Great Lakes Symposium on VLSI 2013 (part of ECRC), 2013

A semi-Markovian decision process based control method for offloading tasks from mobile devices to the cloud.
Proceedings of the 2013 IEEE Global Communications Conference, 2013

Reinforcement Learning-Based Dynamic Power Management of a Battery-Powered System Supplying Multiple Active Modes.
Proceedings of the Seventh UKSim/AMSS European Modelling Symposium, 2013

Optimal control of a grid-connected hybrid electrical energy storage system for homes.
Proceedings of the Design, Automation and Test in Europe, 2013

Capital cost-aware design and partial shading-aware architecture optimization of a reconfigurable photovoltaic system.
Proceedings of the Design, Automation and Test in Europe, 2013

A new paradigm for trading off yield, area and performance to enhance performance per wafer.
Proceedings of the Design, Automation and Test in Europe, 2013

Designing a residential hybrid electrical energy storage system based on the energy buffering strategy.
Proceedings of the International Conference on Hardware/Software Codesign and System Synthesis, 2013

An energy and deadline aware resource provisioning, scheduling and optimization framework for cloud systems.
Proceedings of the International Conference on Hardware/Software Codesign and System Synthesis, 2013

An optimal control policy in a mobile cloud computing system based on stochastic data.
Proceedings of the IEEE 2nd International Conference on Cloud Networking, 2013

Maximizing return on investment of a grid-connected hybrid electrical energy storage system.
Proceedings of the 18th Asia and South Pacific Design Automation Conference, 2013

An efficient scheduling algorithm for multiple charge migration tasks in hybrid electrical energy storage systems.
Proceedings of the 18th Asia and South Pacific Design Automation Conference, 2013

Online estimation of the remaining energy capacity in mobile systems considering system-wide power consumption and battery characteristics.
Proceedings of the 18th Asia and South Pacific Design Automation Conference, 2013

2012
Dynamic Power Management of a Computer with Self Power-Managed Components.
Proceedings of the Integrated Circuit and System Design. Power and Timing Modeling, 2012

Profit maximization for utility companies in an oligopolistic energy market with dynamic prices.
Proceedings of the IEEE Online Conference on Green Communications, 2012

Enhancing efficiency and robustness of a photovoltaic power system under partial shading.
Proceedings of the Thirteenth International Symposium on Quality Electronic Design, 2012

Dynamic reconfiguration of photovoltaic energy harvesting system in hybrid electric vehicles.
Proceedings of the International Symposium on Low Power Electronics and Design, 2012

Battery management for grid-connected PV systems with a battery.
Proceedings of the International Symposium on Low Power Electronics and Design, 2012

Power conversion efficiency characterization and optimization for smartphones.
Proceedings of the International Symposium on Low Power Electronics and Design, 2012

Reinforcement learning based dynamic power management with a hybrid power supply.
Proceedings of the 30th International IEEE Conference on Computer Design, 2012

Online fault detection and tolerance for photovoltaic energy harvesting systems.
Proceedings of the 2012 IEEE/ACM International Conference on Computer-Aided Design, 2012

State of health aware charge management in hybrid electrical energy storage systems.
Proceedings of the 2012 Design, Automation & Test in Europe Conference & Exhibition, 2012

Multiple-source and multiple-destination charge migration in hybrid electrical energy storage systems.
Proceedings of the 2012 Design, Automation & Test in Europe Conference & Exhibition, 2012

Near-optimal, dynamic module reconfiguration in a photovoltaic system to combat partial shading effects.
Proceedings of the 49th Annual Design Automation Conference 2012, 2012

Networked architecture for hybrid electrical energy storage systems.
Proceedings of the 49th Annual Design Automation Conference 2012, 2012

Charge replacement in hybrid electrical energy storage systems.
Proceedings of the 17th Asia and South Pacific Design Automation Conference, 2012

2011
Charge migration efficiency optimization in hybrid electrical energy storage (HEES) systems.
Proceedings of the 2011 International Symposium on Low Power Electronics and Design, 2011

Versatile high-fidelity photovoltaic module emulation system.
Proceedings of the 2011 International Symposium on Low Power Electronics and Design, 2011

Balanced reconfiguration of storage banks in a hybrid electrical energy storage system.
Proceedings of the 2011 IEEE/ACM International Conference on Computer-Aided Design, 2011

Battery-supercapacitor hybrid system for high-rate pulsed load applications.
Proceedings of the Design, Automation and Test in Europe, 2011

Deriving a near-optimal power management policy using model-free reinforcement learning and Bayesian classification.
Proceedings of the 48th Design Automation Conference, 2011

Synchronization of the Fractional Order Finance Systems with Activation Feedback Control.
Proceedings of the Artificial Intelligence and Computational Intelligence, 2011

2010
Multi-resolution recognition of 3D objects based on visual resolution limits.
Pattern Recognit. Lett., 2010

Hybrid electrical energy storage systems.
Proceedings of the 2010 International Symposium on Low Power Electronics and Design, 2010

Maximum power transfer tracking for a photovoltaic-supercapacitor energy system.
Proceedings of the 2010 International Symposium on Low Power Electronics and Design, 2010

Dynamics analysis of fractional order three-dimensional Hopfield neural network.
Proceedings of the Sixth International Conference on Natural Computation, 2010

