Rong Gu

Orcid: 0000-0002-1565-9997

Affiliations:
  • Nanjing University, State Key Laboratory for Novel Software Technology, China (PhD 2016)


According to our database1, Rong Gu authored at least 87 papers between 2012 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Parallel Wormhole Filters: High-Performance Approximate Membership Query Data Structures for Persistent Memory.
IEEE Trans. Parallel Distributed Syst., November, 2025

Chameleon: Taming Dynamic Operator Sequences for Memory-Intensive LLM Training.
CoRR, September, 2025

Accelerating Mixture-of-Experts Inference by Hiding Offloading Latency with Speculative Decoding.
CoRR, August, 2025

Odyssey: Adaptive Policy Selection for Resilient Distributed Training.
CoRR, August, 2025

BOASF: A Unified Framework for Speeding up Automatic Machine Learning via Adaptive Successive Filtering.
CoRR, July, 2025

Chordless Structure: A Pathway to Simple and Expressive GNNs.
CoRR, May, 2025

FlashForge: Ultra-Efficient Prefix-Aware Attention for LLM Decoding.
CoRR, May, 2025

Echo: Efficient Co-Scheduling of Hybrid Online-Offline Tasks for Large Language Model Serving.
CoRR, April, 2025

VEGA: An Active-tuning Learned Index with Group-Wise Learning Granularity.
Proc. ACM Manag. Data, February, 2025

Hourglass: An Adaptive Range Filter with Lightweight Hybrid Encoding.
Proc. ACM Manag. Data, 2025

HotPrefix: Hotness-Aware KV Cache Scheduling for Efficient Prefix Sharing in LLM Inference Systems.
Proc. ACM Manag. Data, 2025

Towards Efficient Serverless MapReduce Computing on Cloud-Native Platforms.
Big Data Min. Anal., 2025

Local-to-Cloud Database Synchronization via Fine-Grained Hybrid Compression.
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

2024
Fluid-Shuttle: Efficient Cloud Data Transmission Based on Serverless Computing Compression.
IEEE/ACM Trans. Netw., December, 2024

Bamboo Filters: Make Resizing Smooth and Adaptive.
IEEE/ACM Trans. Netw., October, 2024

Joint Deployment of Truck-Drone Systems for Camera-Based Object Monitoring.
IEEE Trans. Mob. Comput., October, 2024

A Generic Framework for Finding Special Quadratic Elements in Data Streams.
IEEE/ACM Trans. Netw., August, 2024

Placing Wireless Chargers With Multiple Antennas.
IEEE Trans. Mob. Comput., June, 2024

A Survey of Multi-Dimensional Indexes: Past and Future Trends.
IEEE Trans. Knowl. Data Eng., 2024

Revisiting SLO and Goodput Metrics in LLM Serving.
CoRR, 2024

ColdPurge: Effecient Metadata Cache Cleaning via Accurate Online Data Hotness Tracking.
Proceedings of the 25th International Middleware Conference Industrial Track, 2024

ACER: Accelerating Complex Event Recognition via Two-Phase Filtering under Range Bitmap-Based Indexes.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

The Reinforcement Cuckoo Filter.
Proceedings of the IEEE INFOCOM 2024, 2024

Wormhole Filters: Caching Your Hash on Persistent Memory.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

2023
Seesaw Counting Filter: A Dynamic Filtering Framework for Vulnerable Negative Keys.
IEEE Trans. Knowl. Data Eng., December, 2023

High-Level Data Abstraction and Elastic Data Caching for Data-Intensive AI Applications on Cloud-Native Platforms.
IEEE Trans. Parallel Distributed Syst., November, 2023

SAFE: Service Availability via Failure Elimination Through VNF Scaling.
IEEE/ACM Trans. Netw., October, 2023

Coral: federated query join order optimization based on deep reinforcement learning.
World Wide Web (WWW), September, 2023

Placing Wireless Chargers With Limited Mobility.
IEEE Trans. Mob. Comput., June, 2023

A Pareto optimal Bloom filter family with hash adaptivity.
VLDB J., May, 2023

ShadowAQP: Efficient Approximate Group-by and Join Query via Attribute-oriented Sample Size Allocation and Data Generation.
Proc. VLDB Endow., 2023

Adaptive Online Cache Capacity Optimization via Lightweight Working Set Size Estimation at Scale.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023

Magpie: Efficient Big Data Query System Parameter Optimization based on Pre-selection and Search Pruning Approach.
Proceedings of the 31st IEEE/ACM International Symposium on Quality of Service, 2023

Time and Cost-Efficient Cloud Data Transmission based on Serverless Computing Compression.
Proceedings of the IEEE INFOCOM 2023, 2023

MoonKV: Optimizing Update-intensive Workloads for NVM-based Key-value Stores.
Proceedings of the IEEE International Conference on Data Mining, 2023

Variable-length Encoding Framework: A Generic Framework for Enhancing the Accuracy of Approximate Membership Queries.
Proceedings of the IEEE International Conference on Data Mining, 2023

Raven: Benchmarking Monetary Expense and Query Efficiency of OLAP Engines on the Cloud.
Proceedings of the Database Systems for Advanced Applications, 2023

Distantly Supervised Entity Linking with Selection Consistency Constraint.
Proceedings of the Database Systems for Advanced Applications, 2023

2022
Liquid: Intelligent Resource Estimation and Network-Efficient Scheduling for Deep Learning Jobs on Distributed GPU Clusters.
IEEE Trans. Parallel Distributed Syst., 2022

VFChain: Enabling Verifiable and Auditable Federated Learning via Blockchain Systems.
IEEE Trans. Netw. Sci. Eng., 2022

Octopus-DF: Unified DataFrame-based cross-platform data analytic system.
Parallel Comput., 2022

Seesaw Counting Filter: An Efficient Guardian for Vulnerable Negative Keys During Dynamic Filtering.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Meces: Latency-efficient Rescaling via Prioritized State Migration for Stateful Distributed Stream Processing Systems.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

Placing Wireless Chargers with Multiple Antennas.
Proceedings of the 19th Annual IEEE International Conference on Sensing, 2022

DUET: Joint Deployment of Trucks and Drones for Object Monitoring.
Proceedings of the 30th IEEE/ACM International Symposium on Quality of Service, 2022

Bamboo Filters: Make Resizing Smooth.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Fluid: Dataset Abstraction and Elastic Acceleration for Cloud-native Deep Learning Training Jobs.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Efficient. Scalable and Robust Data Shuffle Service for Distributed MapReduce Computing on Cloud.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

Towards Efficient Reverse-time Migration Imaging Computation by Pipeline and Fine-grained Execution Parallelization.
Proceedings of the 25th IEEE International Conference on Computational Science and Engineering, 2022

2021
Towards Efficient Distributed Subgraph Enumeration Via Backtracking-Based Framework.
IEEE Trans. Parallel Distributed Syst., 2021

Towards Efficient Large-Scale Interprocedural Program Static Analysis on Distributed Data-Parallel Computation.
IEEE Trans. Parallel Distributed Syst., 2021

Alchemy: Distributed financial quantitative analysis system with high-level programming model.
Softw. Pract. Exp., 2021

Improving in-memory file system reading performance by fine-grained user-space cache mechanisms.
J. Syst. Archit., 2021

VSIM: Distributed local structural vertex similarity calculation on big graphs.
J. Parallel Distributed Comput., 2021

SparkDQ: Efficient generic big data quality management on distributed data-parallel computation.
J. Parallel Distributed Comput., 2021

Empirical analysis of performance bottlenecks in graph neural network training and inference with GPUs.
Neurocomputing, 2021

Hash Adaptive Bloom Filter.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

2020
Distributed Subgraph Enumeration via Backtracking-based Framework.
CoRR, 2020

Semi-supervised Embedding Learning for High-dimensional Bayesian Optimization.
CoRR, 2020

2019
Efficient and Scalable Functional Dependency Discovery on Distributed Data-Parallel Platforms.
IEEE Trans. Parallel Distributed Syst., 2019

DGST: Efficient and scalable suffix tree construction on distributed data-parallel platforms.
Parallel Comput., 2019

ForestLayer: Efficient training of deep forests on distributed task-parallel platforms.
J. Parallel Distributed Comput., 2019

Adaptive cache policy scheduling for big data applications on distributed tiered storage system.
Concurr. Comput. Pract. Exp., 2019

BigSpa: An Efficient Interprocedural Static Analysis Engine in the Cloud.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

SAFE: Service Availability via Failure Elimination Through VNF Scaling.
Proceedings of the 48th International Conference on Parallel Processing, 2019

Push-Based Network-efficient Hadoop YARN Scheduling Mechanism for In-Memory Computing.
Proceedings of the 25th IEEE International Conference on Parallel and Distributed Systems, 2019

HyMJ: A Hybrid Structure-Aware Approach to Distributed Multi-way Join Query.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

BENU: Distributed Subgraph Enumeration with Backtracking-Based Framework.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

2018
Penguin: Efficient Query-Based Framework for Replaying Large Scale Historical Data.
IEEE Trans. Parallel Distributed Syst., 2018

Parallelizing Machine Learning Optimization Algorithms on Distributed Data-Parallel Platforms with Parameter Server.
Proceedings of the 24th IEEE International Conference on Parallel and Distributed Systems, 2018

Seal: Efficient Training Large Scale Statistical Machine Translation Models on Spark.
Proceedings of the 24th IEEE International Conference on Parallel and Distributed Systems, 2018

2017
Improving Execution Concurrency of Large-Scale Matrix Multiplication on Distributed Data-Parallel Platforms.
IEEE Trans. Parallel Distributed Syst., 2017

AutoMJ: Towards Efficient Multi-way Join Query on Distributed Data-Parallel Platform.
Proceedings of the 23rd IEEE International Conference on Parallel and Distributed Systems, 2017

2016
Accelerating Big Data Applications on Tiered Storage System with Various Eviction Policies.
Proceedings of the 2016 IEEE Trustcom/BigDataSE/ISPA, 2016

A Time-Cost Based Automatic Scheduling Framework for Matrix Computation on Various Distributed Computing Platforms.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

2015
Cichlid: Efficient Large Scale RDFS/OWL Reasoning with Spark.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

iPLAR: Towards Interactive Programming with Parallel Linear Algebra in R.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2015

Parallel Training GBRT Based on KMeans Histogram Approximation for Big Data.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2015

Unified Programming Model and Software Framework for Big Data Machine Learning and Data Analytics.
Proceedings of the 39th Annual Computer Software and Applications Conference, 2015

Efficient large scale distributed matrix computation with spark.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

2014
SHadoop: Improving MapReduce performance by optimizing job execution mechanism in Hadoop clusters.
J. Parallel Distributed Comput., 2014

YAFIM: A Parallel Frequent Itemset Mining Algorithm with Spark.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Training Large Scale Deep Neural Networks on the Intel Xeon Phi Many-Core Coprocessor.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Rainbow: A distributed and hierarchical RDF triple store with dynamic scalability.
Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

2013
Large Scale Nearest Neighbors Search Based on Neighborhood Graph.
Proceedings of the International Conference on Advanced Cloud and Big Data, 2013

A parallel computing platform for training large scale neural networks.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

2012
Performance Optimization for Short MapReduce Job Execution in Hadoop.
Proceedings of the 2012 Second International Conference on Cloud and Green Computing, 2012


  Loading...