Duo Liu

Orcid: 0000-0002-3040-2065

Affiliations:
  • Chongqing University, Chongqing, China


According to our database, Duo Liu authored at least 199 papers between 2009 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
SAHChain: A Hybrid Storage Blockchain System Supporting Semantic Expressiveness and Retrieval.
IEEE Trans. Computers, May, 2026

Latency Optimization in Hybrid Memory System for GNNs.
IEEE Trans. Computers, March, 2026

LIO-HKDT: Fast and Accurate LiDAR-Inertial Odometry With Hash K-D Tree.
IEEE Robotics Autom. Lett., March, 2026

D²Prune: Sparsifying Large Language Models via Dual Taylor Expansion and Attention Distribution Awareness.
CoRR, January, 2026

CD-ANN: Scalable Approximate Nearest Neighbor search on client-side devices.
J. Syst. Archit., 2026

RefineDedup: efficient deduplication for mobile systems via application-wise learning.
Sci. China Inf. Sci., 2026

D²Prune: Sparsifying Large Language Models via Dual Taylor Expansion and Attention Distribution Awareness.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
LAShards: Low-Overhead and Self-Adaptive MRC Construction for Non-Stack Algorithms.
IEEE Trans. Computers, October, 2025

CMCache: An Adaptive Cross-Level Data Placement Method for Multilevel Cache.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., August, 2025

DSAV: A Deep Sparse Acceleration Framework for Voxel-Based 3-D Object Detection.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., February, 2025

GNNBoost: Accelerating sampling-based GNN training on large scale graph by optimizing data preparation.
J. Syst. Archit., 2025

Gravity-Constrained Simultaneous Localization and Mapping for suppressing map warping in complex large-scale environments.
Integr. Comput. Aided Eng., 2025

PIM-IoT: Enabling hierarchical, heterogeneous, and agile Processing-in-Memory in IoT systems.
Future Gener. Comput. Syst., 2025

FASP: A Fast and Accurate Framework for Schedule Performance Evaluation.
Proceedings of the 31st IEEE International Conference on Parallel and Distributed Systems, 2025

RobTrack: A Robust 3D Multi-object Tracking Method for Edge Devices.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

Co-GNN: A Co-optimization Framework for Memory and Computation in Sampling-Based GNN Training.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

CAST: An Efficient Framework for Schedules Performance Prediction Based on Compact ASTs.
Proceedings of the 43rd IEEE International Conference on Computer Design, 2025

MPNAS: Multimodal Sentiment Analysis Pruning via Neural Architecture Search.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Cocache: An Accurate and Low-Overhead Dynamic Caching Method for GNNs.
Proceedings of the Euro-Par 2025: Parallel Processing, 2025

CoSF: A Co-Optimization Framework for Operator Splitting and Fusion.
Proceedings of the Euro-Par 2025: Parallel Processing, 2025

LIO-DPC: Accurate and Fast LiDAR-Inertial Odometry with Dynamic Pose Chain.
Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

RAN: Accelerating Data Repair with Available Nodes in Erasure-Coded Storage.
Proceedings of the IEEE International Conference on Cluster Computing, 2025

2024
FreePrune: An Automatic Pruning Framework Across Various Granularities Based on Training-Free Evaluation.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., November, 2024

LightFS: A Lightweight Host-CSD Coordinated File System Optimizing for Heavy Small File Accesses.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., November, 2024

Fair-ZNS: Enhancing Fairness in ZNS SSDs Through Self-Balancing I/O Scheduling.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., July, 2024

Optimizing the Performance of Consistency-Aware Deduplication Using Persistent Memory.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., June, 2024

A Fast Location-Aware Repair Strategy for Mobile Grouped Storage Clusters.
IEEE Internet Things J., June, 2024

WA-Zone: Wear-Aware Zone Management Optimization for LSM-Tree on ZNS SSDs.
ACM Trans. Archit. Code Optim., March, 2024

Wear-leveling-aware buddy-like memory allocator for persistent memory file systems.
Future Gener. Comput. Syst., January, 2024

BGS: Accelerate GNN training on multiple GPUs.
J. Syst. Archit., 2024

ZNS-Cleaner: Enhancing lifespan by reducing empty erase in ZNS SSDs.
J. Syst. Archit., 2024

CEIU: Consistent and Efficient Incremental Update mechanism for mobile systems on flash storage.
J. Syst. Archit., 2024

Trustworthy Self-Attention: Enabling the Network to Focus Only on the Most Relevant References.
CoRR, 2024

YOIO: You Only Iterate Once by mining and fusing multiple necessary global information in the optical flow estimation.
CoRR, 2024

DPC: DPU-accelerated High-Performance File System Client.
Proceedings of the 53rd International Conference on Parallel Processing, 2024

Hi-ZNS: High Space Efficiency and Zero-Copy LSM-Tree Based Stores on ZNS SSDs.
Proceedings of the 53rd International Conference on Parallel Processing, 2024

RACI: A Resource-Aware Cooperative Inference Framework on Heterogeneous Edge Devices.
Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024

FinerDedup: Sifting Fingerprints for Efficient Data Deduplication on Mobile Devices.
Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

Rethinking Literary Plagiarism in LLMs through the Lens of Copyright Laws.
Proceedings of the Asian Conference on Machine Learning, 2024

VIFA: An Efficient Visible and Infrared Image Fusion Architecture for Multi-task Applications via Continual Learning.
Proceedings of the Computer Vision - ACCV 2024, 2024

2023
ADAR: Application-Specific Data Allocation and Reprogramming Optimization for 3-D TLC Flash Memory.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., June, 2023

An efficient wear-leveling-aware multi-grained allocator for persistent memory file systems.
Frontiers Inf. Technol. Electron. Eng., May, 2023

V-WAFA: An Endurance Variation Aware Fine-Grained Allocator for Persistent Memory.
IEEE Trans. Computers, April, 2023

FedMDS: An Efficient Model Discrepancy-Aware Semi-Asynchronous Clustered Federated Learning Framework.
IEEE Trans. Parallel Distributed Syst., March, 2023

LFPR: A Lazy Fast Predictive Repair Strategy for Mobile Distributed Erasure Coded Cluster.
IEEE Internet Things J., 2023

Optimizing the Incremental Update Mechanism by Inlaying File Indexes on Flash Storage.
Proceedings of the 12th Non-Volatile Memory Systems and Applications Symposium, 2023

RadarSSD: A Computational Storage for Radar Signal Processing.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

Data-Quality-Driven Federated Learning for Optimizing Communication Costs.
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023

Re-compact: Structured Pruning and SpMM Kernel Co-design for Accelerating DNNs on GPUs.
Proceedings of the 41st IEEE International Conference on Computer Design, 2023

An Efficient Scheduling Algorithm for Multi-mode Tasks on Near-Data Processing SSDs.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2023

HBP: Hierarchically Balanced Pruning and Accelerator Co-Design for Efficient DNN Inference.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

Optimizing the Performance of NDP Operations by Retrieving File Semantics in Storage.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

IFHE: Intermediate-Feature Heterogeneity Enhancement for Image Synthesis in Data-Free Knowledge Distillation.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

2022
Improving Fairness for SSD Devices through DRAM Over-Provisioning Cache Management.
IEEE Trans. Parallel Distributed Syst., 2022

Flexible Clustered Federated Learning for Client-Level Data Distribution Shift.
IEEE Trans. Parallel Distributed Syst., 2022

SENTunnel: Fast Path for Sensor Data Access on Automotive Embedded Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

Self-Adapting Channel Allocation for Multiple Tenants Sharing SSD Devices.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

Horae: A Hybrid I/O Request Scheduling Technique for Near-Data Processing-Based SSD.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

eRDAC: Efficient and Reliable Remote Direct Access and Control for Embedded Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

FRL: Fast and Reconfigurable Accelerator for Distributed Sound Source Localization.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

ELOFS: An Extensible Low-Overhead Flash File System for Resource-Scarce Embedded Devices.
IEEE Trans. Computers, 2022

Federated learning with workload-aware client scheduling in heterogeneous systems.
Neural Networks, 2022

Efficient persistent memory file systems using virtual superpages with multi-level allocator.
J. Syst. Archit., 2022

Towards highly-concurrent leaderless state machine replication for distributed systems.
J. Syst. Archit., 2022

CoDiscard: A revenue model based cross-layer cooperative discarding mechanism for flash memory devices.
J. Syst. Archit., 2022

Towards the Design of Efficient TCN-based Prefetcher for Hybrid NVM-DRAM Memory.
Proceedings of the International Joint Conference on Neural Networks, 2022

CADedup: High-performance Consistency-aware Deduplication Based on Persistent Memory.
Proceedings of the IEEE 40th International Conference on Computer Design, 2022

VEA: An FPGA-Based Voxel Encoding Accelerator for 3D Object Detection with LiDAR.
Proceedings of the IEEE 40th International Conference on Computer Design, 2022

3DS: An Efficient DPDK-based Data Distribution Service for Distributed Real-time Applications.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

Weak Network Oriented Mobile Distributed Storage: A Hybrid Fault-Tolerance Scheme Based on Potential Replicas.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

Optimizing CoW-based File Systems on Open-Channel SSDs with Persistent Memory.
Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition, 2022

GATLB: A Granularity-Aware TLB to Support Multi-Granularity Pages in Hybrid Memory System.
Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition, 2022

SAPredictor: a simple and accurate self-adaptive predictor for hierarchical hybrid memory system.
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

Lazy repair with temporary redundancy (LRTR): reducing repair network traffic in erasure-coded storage.
Proceedings of the CF '22: 19th ACM International Conference on Computing Frontiers, Turin, Italy, May 17, 2022

2021
Improving the Performance of Deduplication-Based Storage Cache via Content-Driven Cache Management Methods.
IEEE Trans. Parallel Distributed Syst., 2021

Self-Balancing Federated Learning With Global Imbalanced Data in Mobile Systems.
IEEE Trans. Parallel Distributed Syst., 2021

Bridging Mismatched Granularity Between Embedded File Systems and Flash Memory.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2021

Making Frequent-Pattern Mining Scalable, Efficient, and Compact on Nonvolatile Memories.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2021

MobileRE: A replicas prioritized hybrid fault tolerance strategy for mobile distributed system.
J. Syst. Archit., 2021

A machine learning assisted data placement mechanism for hybrid storage systems.
J. Syst. Archit., 2021

Flexible Clustered Federated Learning for Client-Level Data Distribution Shift.
CoRR, 2021

FedGroup: Efficient Federated Learning via Decomposed Similarity-Based Clustering.
Proceedings of the 2021 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), New York City, NY, USA, September 30, 2021

CSAFL: A Clustered Semi-Asynchronous Federated Learning Framework.
Proceedings of the International Joint Conference on Neural Networks, 2021

FedSAE: A Novel Self-Adaptive Federated Learning Framework in Heterogeneous Systems.
Proceedings of the International Joint Conference on Neural Networks, 2021

Forseti: An Efficient Basic-block-level Sensitivity Analysis Framework Towards Multi-bit Faults.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2021

DFShards: effective construction of MRCs online for non-stack algorithms.
Proceedings of the CF '21: Computing Frontiers Conference, 2021

AIR Cache: A Variable-Size Block Cache Based on Fine-Grained Management Method.
Proceedings of the Web and Big Data - 5th International Joint Conference, 2021

2020
APMigration: Improving Performance of Hybrid Memory Performance via An Adaptive Page Migration Method.
IEEE Trans. Parallel Distributed Syst., 2020

Downsizing Without Downgrading: Approximated Dynamic Time Warping on Nonvolatile Memories.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

Separable Binary Convolutional Neural Network on Embedded Systems.
IEEE Trans. Computers, 2020

Optimizing synchronization mechanism for block-based file systems using persistent memory.
Future Gener. Comput. Syst., 2020

FedGroup: Ternary Cosine Similarity-based Clustered Federated Learning Framework toward High Accuracy in Heterogeneous Data.
CoRR, 2020

An Efficient and Wear-Leveling-Aware Frequent-Pattern Mining on Non-Volatile Memory.
CoRR, 2020

SSDKeeper: Self-Adapting Channel Allocation to Improve the Performance of SSD Devices.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

Themis: Malicious Wear Detection and Defense for Persistent Memory File Systems.
Proceedings of the 26th IEEE International Conference on Parallel and Distributed Systems, 2020

WMAlloc: A Wear-Leveling-Aware Multi-Grained Allocator for Persistent Memory File Systems.
Proceedings of the 26th IEEE International Conference on Parallel and Distributed Systems, 2020

Unified-TP: A Unified TLB and Page Table Cache Structure for Efficient Address Translation.
Proceedings of the 38th IEEE International Conference on Computer Design, 2020

MobileRE: A Hybrid Fault Tolerance Strategy Combining Erasure Codes and Replicas for Mobile Distributed Cluster.
Proceedings of the 22nd IEEE International Conference on High Performance Computing and Communications; 18th IEEE International Conference on Smart City; 6th IEEE International Conference on Data Science and Systems, 2020

Optimizing Performance of Persistent Memory File Systems using Virtual Superpages.
Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020

LOFFS: A Low-Overhead File System for Large Flash Memory on Embedded Devices.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

Efficient Multi-Grained Wear Leveling for Inodes of Persistent Memory File Systems.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

2019
Towards Fast and Lightweight Checkpointing for Mobile Virtualization Using NVRAM.
IEEE Trans. Parallel Distributed Syst., 2019

DCR: Deterministic Crash Recovery for NAND Flash Storage Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2019

HiNextApp: A context-aware and adaptive framework for app prediction in mobile systems.
Sustain. Comput. Informatics Syst., 2019

FitCNN: A cloud-assisted and low-cost framework for updating CNNs on IoT devices.
Future Gener. Comput. Syst., 2019

Wear-aware Memory Management Scheme for Balancing Lifetime and Performance of Multiple NVM Slots.
Proceedings of the 35th Symposium on Mass Storage Systems and Technologies, 2019

CDAC: Content-Driven Deduplication-Aware Storage Cache.
Proceedings of the 35th Symposium on Mass Storage Systems and Technologies, 2019

Towards Efficient NVDIMM-based Heterogeneous Storage Hierarchy Management for Big Data Workloads.
Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, 2019

Power-Aware Virtual Machine Placement for Mobile Edge Computing.
Proceedings of the 2019 International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, 2019

Optimizing the Data Transmission Scheme for Edge-Based Automatic Driving.
Proceedings of the 15th IEEE International Conference on Embedded Software and Systems, 2019

Archivist: A Machine Learning Assisted Data Placement Mechanism for Hybrid Storage Systems.
Proceedings of the 37th IEEE International Conference on Computer Design, 2019

Astraea: Self-Balancing Federated Learning for Improving Classification Accuracy of Mobile Deep Learning Applications.
Proceedings of the 37th IEEE International Conference on Computer Design, 2019

Reducing Write Amplification for Inodes of Journaling File System using Persistent Memory.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019

UIMigrate: Adaptive Data Migration for Hybrid Non-Volatile Memory Systems.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019

Tumbler: Energy Efficient Task Scheduling for Dual-Channel Solar-Powered Sensor Nodes.
Proceedings of the 56th Annual Design Automation Conference 2019, 2019

Routing in optical network-on-chip: minimizing contention with guaranteed thermal reliability.
Proceedings of the 24th Asia and South Pacific Design Automation Conference, 2019

2018
A Novel ReRAM-Based Processing-in-Memory Architecture for Graph Traversal.
ACM Trans. Storage, 2018

Hardware/Software Adaptive Cryptographic Acceleration for Big Data Processing.
Secur. Commun. Networks, 2018

F2FS Aware Mapping Cache Design on Solid State Drives.
Proceedings of the IEEE 7th Non-Volatile Memory Systems and Applications Symposium, 2018

TaiJiNet: Towards Partial Binarized Convolutional Neural Network for Embedded Systems.
Proceedings of the 2018 IEEE Computer Society Annual Symposium on VLSI, 2018

Puppet: Energy Efficient Task Mapping For Storage-Less and Converter-Less Solar-Powered Non-Volatile Sensor Nodes.
Proceedings of the 36th IEEE International Conference on Computer Design, 2018

In-Situ AI: Towards Autonomous and Incremental Deep Learning for IoT Systems.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2018

2017
Building NVRAM-Aware Swapping Through Code Migration in Mobile Devices.
IEEE Trans. Parallel Distributed Syst., 2017

Durable Address Translation in PCM-Based Flash Storage Systems.
IEEE Trans. Parallel Distributed Syst., 2017

Durable and Energy Efficient In-Memory Frequent-Pattern Mining.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2017

vFlash: Virtualized Flash for Optimizing the I/O Performance in Mobile Devices.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2017

Non-Volatile Memory Based Page Swapping for Building High-Performance Mobile Devices.
IEEE Trans. Computers, 2017

Heating Dispersal for Self-Healing NAND Flash Memory.
IEEE Trans. Computers, 2017

A workload-aware flash translation layer enhancing performance and lifespan of TLC/SLC dual-mode flash memory in embedded systems.
Microprocess. Microsystems, 2017

Fine grained, direct access file system support for storage class memory.
J. Syst. Archit., 2017

An energy-efficient encryption mechanism for NVM-based main memory in mobile systems.
J. Syst. Archit., 2017

Revisiting swapping in mobile systems with SwapBench.
Future Gener. Comput. Syst., 2017

HiNextApp: A Context-Aware and Adaptive Framework for App Prediction in Mobile Systems.
Proceedings of the 2017 IEEE Trustcom/BigDataSE/ICESS, Sydney, Australia, August 1-4, 2017, 2017

FitCNN: A cloud-assisted lightweight convolutional neural network framework for mobile devices.
Proceedings of the 23rd IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, 2017

Downsampling of time-series data for approximated dynamic time warping on nonvolatile memories.
Proceedings of the IEEE 6th Non-Volatile Memory Systems and Applications Symposium, 2017

SmartSwap: High-Performance and User Experience Friendly Swapping in Mobile Systems.
Proceedings of the 54th Annual Design Automation Conference, 2017

Scalable frequent-pattern mining on nonvolatile memories.
Proceedings of the 22nd Asia and South Pacific Design Automation Conference, 2017

2016
Image-Content-Aware I/O Optimization for Mobile Virtualization.
ACM Trans. Embed. Comput. Syst., 2016

Energy-Efficient In-Memory Paging for Smartphones.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2016

Retention Trimming for Lifetime Improvement of Flash Memory Storage Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2016

Morphable Resistive Memory Optimization for Mobile Virtualization.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2016

A compiler assisted wear leveling for morphable PCM in embedded systems.
J. Syst. Archit., 2016

PADS: A Reliable Pothole Detection System Using Machine Learning.
Proceedings of the Smart Computing and Communication, 2016

A Quantitative Approach for Memory Fragmentation in Mobile Systems.
Proceedings of the Smart Computing and Communication, 2016

Making In-Memory Frequent Pattern Mining Durable and Energy Efficient.
Proceedings of the 45th International Conference on Parallel Processing, 2016

FLIC: Fast, lightweight checkpointing for mobile virtualization using NVRAM.
Proceedings of the 2016 Design, Automation & Test in Europe Conference & Exhibition, 2016

2015
Towards Write-Activity-Aware Page Table Management for Non-volatile Main Memories.
ACM Trans. Embed. Comput. Syst., 2015

On-Demand Block-Level Address Mapping in Large-Scale NAND Flash Storage Systems.
IEEE Trans. Computers, 2015

MobiLock: an energy-aware encryption mechanism for NVRAM-based mobile devices.
Proceedings of the IEEE Non-Volatile Memory System and Applications Symposium, 2015

Mixer: software enabled wear leveling for morphable PCM in embedded systems.
Proceedings of the IEEE Non-Volatile Memory System and Applications Symposium, 2015

File system-independent block device support for storage class memory.
Proceedings of the 2015 IEEE Conference on Computer Communications Workshops, 2015

SwapBench: The Easy Way to Demystify Swapping in Mobile Systems.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

NV-CFS: NVRAM-Assisted Scheduling Optimization for Virtualized Mobile Systems.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

TLC-FTL: Workload-Aware Flash Translation Layer for TLC/SLC Dual-Mode Flash Memory in Embedded Systems.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

Traffic-Aware Application Mapping for Network-on-Chip Based Multiprocessor System-on-Chip.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

Realistic Task Parallelization of the H.264 Decoding Algorithm for Multiprocessors.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

A Hierarchical Resource Allocation Game for Heterogeneous Networks with Relays.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

SmartBackup: An Efficient and Reliable Backup Strategy for Solid State Drives with Backup Capacitors.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

Virtual Machine Image Content Aware I/O Optimization for Mobile Virtualization.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

nCode: limiting harmful writes to emerging mobile NVRAM through code swapping.
Proceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition, 2015

Balloonfish: Utilizing morphable resistive memory in mobile virtualization.
Proceedings of the 20th Asia and South Pacific Design Automation Conference, 2015

Unified non-volatile memory and NAND flash memory architecture in smartphones.
Proceedings of the 20th Asia and South Pacific Design Automation Conference, 2015

2014
Memory-Aware Task Scheduling with Communication Overhead Minimization for Streaming Applications on Bus-Based Multiprocessor System-on-Chips.
IEEE Trans. Parallel Distributed Syst., 2014

Application-Specific Wear Leveling for Extending Lifetime of Phase Change Memory in Embedded Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2014

Loop scheduling with memory access reduction subject to register constraints for DSP applications.
Softw. Pract. Exp., 2014

A space allocation and reuse strategy for PCM-based embedded systems.
J. Syst. Archit., 2014

A Partition-based Mechanism for Reducing Energy in Phase Change Memory.
J. Comput., 2014

Contention-aware task and communication co-scheduling for network-on-chip based Multiprocessor System-on-Chip.
Proceedings of the 2014 IEEE 20th International Conference on Embedded and Real-Time Computing Systems and Applications, 2014

Enhancing lifetime of NVM-based main memory with bit shifting and flipping.
Proceedings of the 2014 IEEE 20th International Conference on Embedded and Real-Time Computing Systems and Applications, 2014

Virtual-machine metadata optimization for I/O traffic reduction in mobile virtualization.
Proceedings of the IEEE Non-Volatile Memory Systems and Applications Symposium, 2014

An Improved Thermal Model for Static Optimization of Application Mapping and Scheduling in Multiprocessor System-on-Chip.
Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2014

DR. Swap: energy-efficient paging for smartphones.
Proceedings of the International Symposium on Low Power Electronics and Design, 2014

Building high-performance smartphones via non-volatile memory: The swap approach.
Proceedings of the 2014 International Conference on Embedded Software, 2014

Deterministic Crash Recovery for NAND Flash Based Storage Systems.
Proceedings of the 51st Annual Design Automation Conference 2014, 2014

2013
Optimally Removing Intercore Communication Overhead for Streaming Applications on MPSoCs.
IEEE Trans. Computers, 2013

A content-aware writing mechanism for reducing energy on non-volatile memory based embedded storage systems.
Des. Autom. Embed. Syst., 2013

A space-based wear leveling for PCM-based embedded systems.
Proceedings of the 2013 IEEE 19th International Conference on Embedded and Real-Time Computing Systems and Applications, 2013

FTL²: a hybrid flash translation layer with logging for write reduction in flash memory.
Proceedings of the SIGPLAN/SIGBED Conference on Languages, 2013

Curling-PCM: Application-specific wear leveling for phase change memory based embedded systems.
Proceedings of the 18th Asia and South Pacific Design Automation Conference, 2013

2012
A Space Reuse Strategy for Flash Translation Layers in SLC NAND Flash Memory Storage Systems.
IEEE Trans. Very Large Scale Integr. Syst., 2012

Optimally Maximizing Iteration-Level Loop Parallelism.
IEEE Trans. Parallel Distributed Syst., 2012

Real-Time Flash Translation Layer for NAND Flash Memory Storage Systems.
Proceedings of the 2012 IEEE 18th Real Time and Embedded Technology and Applications Symposium, 2012

Efficient Task Assignment on Heterogeneous Multicore Systems Considering Communication Overhead.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2012

A block-level flash memory management scheme for reducing write activities in PCM-based embedded systems.
Proceedings of the 2012 Design, Automation & Test in Europe Conference & Exhibition, 2012

Write-activity-aware page table management for PCM-based embedded systems.
Proceedings of the 17th Asia and South Pacific Design Automation Conference, 2012

2011
Overhead-aware energy optimization for real-time streaming applications on multiprocessor System-on-Chip.
ACM Trans. Design Autom. Electr. Syst., 2011

On Improving Real-Time Interrupt Latencies of Hybrid Operating Systems with Two-Level Hardware Interrupts.
IEEE Trans. Computers, 2011

PCM-FTL: A Write-Activity-Aware NAND Flash Memory Management Scheme for PCM-Based Embedded Systems.
Proceedings of the 32nd IEEE Real-Time Systems Symposium, 2011

A Two-Level Caching Mechanism for Demand-Based Page-Level Address Mapping in NAND Flash Memory Storage Systems.
Proceedings of the 17th IEEE Real-Time and Embedded Technology and Applications Symposium, 2011

An endurance-enhanced Flash Translation Layer via reuse for NAND flash memory storage systems.
Proceedings of the Design, Automation and Test in Europe, 2011

MNFTL: an efficient flash translation layer for MLC NAND flash memory storage systems.
Proceedings of the 48th Design Automation Conference, 2011

2010
Compiler-assisted leakage-aware loop scheduling for embedded VLIW DSP processors.
J. Syst. Softw., 2010

Memory-Aware Optimal Scheduling with Communication Overhead Minimization for Streaming Applications on Chip Multiprocessors.
Proceedings of the 31st IEEE Real-Time Systems Symposium, 2010

Optimal Task Scheduling by Removing Inter-Core Communication Overhead for Streaming Applications on MPSoC.
Proceedings of the 16th IEEE Real-Time and Embedded Technology and Applications Symposium, 2010

RNFTL: a reuse-aware NAND flash translation layer for flash memory.
Proceedings of the ACM SIGPLAN/SIGBED 2010 conference on Languages, 2010

Demand-based block-level address mapping in large-scale NAND flash storage systems.
Proceedings of the 8th International Conference on Hardware/Software Codesign and System Synthesis, 2010

2009
Loop scheduling with memory access reduction under register constraints for DSP applications.
Proceedings of the IEEE Workshop on Signal Processing Systems, 2009

Improving the Reliability of Embedded Systems with Cache and SPM.
Proceedings of the IEEE 6th International Conference on Mobile Adhoc and Sensor Systems, 2009

Optimal loop parallelization for maximizing iteration-level parallelism.
Proceedings of the 2009 International Conference on Compilers, 2009

