Rong Chen

Orcid: 0000-0002-6115-8130

Affiliations:
  • Shanghai Jiao Tong University, Institute of Parallel and Distributed Systems, China
  • Fudan University, Shanghai, China (PhD 2011)


According to our database, Rong Chen authored at least 59 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
XGNN: Boosting Multi-GPU GNN Training via Global GNN Memory Store.
Proc. VLDB Endow., January, 2024

Characterizing Network Requirements for GPU API Remoting in AI Applications.
CoRR, 2024

Serialization/Deserialization-free State Transfer in Serverless Workflows.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

2023
Transactional Indexes on (RDMA or CXL-based) Disaggregated Memory with Repairable Transaction.
CoRR, 2023

Bridging the Gap between Relational OLTP and Graph-based OLAP.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023

UGACHE: A Unified GPU Cache for Embedding-based Deep Learning.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

No Provisioned Concurrency: Fast RDMA-codesigned Remote Fork for Serverless Computing.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Characterizing Off-path SmartNIC for Accelerating Distributed Systems.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Automated Verification of Idempotence for Stateful Serverless Applications.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

DArray: A High Performance RDMA-Based Distributed Array.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

Microsecond-scale Datacenter Computing with Modern Networking.
Proceedings of the ACM Turing Award Celebration Conference - China 2023, 2023

2022
Wukong+G: Fast and Concurrent RDF Query Processing Using RDMA-Assisted GPU Graph Exploration.
IEEE Trans. Parallel Distributed Syst., 2022

DrTM+B: Replication-Driven Live Reconfiguration for Fast and General Distributed Transaction Processing.
IEEE Trans. Parallel Distributed Syst., 2022

Reducing Transaction Processing Latency in Hardware Transactional Memory-based Database with Non-volatile Memory.
Int. J. Softw. Informatics, 2022

A Comprehensive Study on Off-path SmartNIC.
CoRR, 2022

Booting 10K Serverless Functions within One Second via RDMA-based Remote Fork.
CoRR, 2022

KRCORE: A Microsecond-scale RDMA Control Plane for Elastic Computing.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

Microsecond-scale Preemption for Concurrent GPU-accelerated DNN Inferences.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022

FNotify: A Low-Latency and Scalable Publish/Subscribe System using RDMA.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

GNNLab: a factored system for sample-based GNN training over GPUs.
Proceedings of the EuroSys '22: Seventeenth European Conference on Computer Systems, Rennes, France, April 5, 2022

2021
XStore: Fast RDMA-Based Ordered Key-Value Store Using Remote Learned Cache.
ACM Trans. Storage, 2021

Wukong: A Distributed Framework for Fast and Concurrent Graph Querying.
ACM SIGOPS Oper. Syst. Rev., 2021

Characterizing and Optimizing Remote Persistent Memory with RDMA and NVM.
Proceedings of the 2021 USENIX Annual Technical Conference, 2021

Retrofitting High Availability Mechanism to Tame Hybrid Transaction/Analytical Processing.
Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation, 2021

Unifying Timestamp with Transaction Ordering for MVCC with Decentralized Scalar Timestamp.
Proceedings of the 18th USENIX Symposium on Networked Systems Design and Implementation, 2021

FlexGraph: a flexible and efficient distributed framework for GNN training.
Proceedings of the EuroSys '21: Sixteenth European Conference on Computer Systems, 2021

Fast and Accurate Optimizer for Query Processing over Knowledge Graphs.
Proceedings of the SoCC '21: ACM Symposium on Cloud Computing, 2021

2020
Fast RDMA-based Ordered Key-Value Store using Remote Learned Cache.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

2019
Pragh: Locality-preserving Graph Traversal with Split Live Migration.
Proceedings of the 2019 USENIX Annual Technical Conference, 2019

2018
Replication-Based Fault-Tolerance for Large-Scale Graph Processing.
IEEE Trans. Parallel Distributed Syst., 2018

PowerLyra: Differentiated Graph Computation and Partitioning on Skewed Graphs.
ACM Trans. Parallel Comput., 2018

Asymmetric virtual machine replication for low latency and high available service.
Sci. China Inf. Sci., 2018

Fast and Concurrent RDF Queries using RDMA-assisted GPU Graph Exploration.
Proceedings of the 2018 USENIX Annual Technical Conference, 2018

Deconstructing RDMA-enabled Distributed Transactions: Hybrid is Better!
Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

Analysis and Improvement of Optimizer for Query Processing on Graph Store.
Proceedings of the 9th Asia-Pacific Workshop on Systems, 2018

2017
Fast In-Memory Transaction Processing Using RDMA and HTM.
ACM Trans. Comput. Syst., 2017

Replication-driven Live Reconfiguration for Fast Distributed Transaction Processing.
Proceedings of the 2017 USENIX Annual Technical Conference, 2017

Sub-millisecond Stateful Stream Querying over Fast-evolving Linked Data.
Proceedings of the 26th Symposium on Operating Systems Principles, 2017

Extracting More Intra-transaction Parallelism with Work Stealing for OLTP Workloads.
Proceedings of the 8th Asia-Pacific Workshop on Systems, Mumbai, India, September 2, 2017

2016
Fast and Concurrent RDF Queries with RDMA-Based Distributed Graph Exploration.
Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation, 2016

Fast and general distributed transactions using RDMA and HTM.
Proceedings of the Eleventh European Conference on Computer Systems, 2016

A Case for Virtualizing Persistent Memory.
Proceedings of the Seventh ACM Symposium on Cloud Computing, 2016

2015
Bipartite-Oriented Distributed Graph Partitioning for Big Learning.
J. Comput. Sci. Technol., 2015

Fast in-memory transaction processing using RDMA and HTM.
Proceedings of the 25th Symposium on Operating Systems Principles, 2015

NUMA-aware graph-structured analytics.
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

SYNC or ASYNC: time to fuse for distributed graph-parallel computation.
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

PowerLyra: differentiated graph computation and partitioning on skewed graphs.
Proceedings of the Tenth European Conference on Computer Systems, 2015

2014
Computation and communication efficient graph processing with distributed immutable view.
Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing, 2014

Greedy map generalization by iterative point removal.
Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2014

Replication-Based Fault-Tolerance for Large-Scale Graph Processing.
Proceedings of the 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2014

Bipartite-oriented distributed graph partitioning for big learning.
Proceedings of the Asia-Pacific Workshop on Systems, 2014

2013
Tiled-MapReduce: Efficient and Flexible MapReduce Processing on Multicore with Tiling.
ACM Trans. Archit. Code Optim., 2013

2012
Mercury: Combining Performance with Dependability Using Self-Virtualization.
J. Comput. Sci. Technol., 2012

2011
A case for scaling applications to many-core with OS clustering.
Proceedings of the European Conference on Computer Systems, 2011

2010
Tiled-MapReduce: optimizing resource usages of data-parallel applications on multicore with tiling.
Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, 2010

2009
Evaluating SPLASH-2 Applications Using MapReduce.
Proceedings of the Advanced Parallel Processing Technologies, 8th International Symposium, 2009

2008
Corey: An Operating System for Many Cores.
Proceedings of the 8th USENIX Symposium on Operating Systems Design and Implementation, 2008

2007
POLUS: A POwerful Live Updating System.
Proceedings of the 29th International Conference on Software Engineering (ICSE 2007), 2007

2006
Live updating operating systems using virtualization.
Proceedings of the 2nd International Conference on Virtual Execution Environments, 2006

