Da Zheng

Orcid: 0000-0001-8115-5415

Affiliations:
  • Ant Group, Hangzhou, China


According to our database1, Da Zheng authored at least 70 papers between 2012 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Executable Knowledge Graphs for Replicating AI Research.
CoRR, October, 2025

dInfer: An Efficient Inference Framework for Diffusion Language Models.
CoRR, October, 2025

EvolProver: Advancing Automated Theorem Proving by Evolving Formalized Problems via Symmetry and Difficulty.
CoRR, October, 2025

LLaDA-MoE: A Sparse MoE Diffusion Language Model.
CoRR, September, 2025

Reflective Paper-to-Code Reproduction Enabled by Fine-Grained Verification.
CoRR, August, 2025

Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study.
CoRR, June, 2025

AutoMind: Adaptive Knowledgeable Agent for Automated Data Science.
CoRR, June, 2025

Right Is Not Enough: The Pitfalls of Outcome Supervision in Training LLMs for Math Reasoning.
CoRR, June, 2025

Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey.
CoRR, May, 2025

LightThinker: Thinking Step-by-Step Compression.
CoRR, February, 2025

OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2025, 2025

2024
TraverseNet: Unifying Space and Time in Message Passing for Traffic Forecasting.
IEEE Trans. Neural Networks Learn. Syst., February, 2024

Hierarchical Compression of Text-Rich Graphs via Large Language Models.
CoRR, 2024

GraphStorm: all-in-one graph machine learning framework for industry applications.
CoRR, 2024

Parameter-Efficient Tuning Large Language Models for Graph Representation Learning.
CoRR, 2024

GraphStorm: All-in-one Graph Machine Learning Framework for Industry Applications.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

NetInfoF Framework: Measuring and Exploiting Network Usable Information.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Revisit Orthogonality in Graph-Regularized MLPs.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Hector: An Efficient Programming and Compilation Framework for Implementing Relational Graph Neural Networks in GPU Architectures.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

RAP: Resource-aware Automated GPU Sharing for Multi-GPU Recommendation Model Training and Input Preprocessing.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
Tango: rethinking quantization for graph neural network training on GPUs.
CoRR, 2023

OrthoReg: Improving Graph-regularized MLPs via Orthogonality Regularization.
CoRR, 2023

PIGEON: Optimizing CUDA Code Generator for End-to-End Training and Inference of Relational Graph Neural Networks.
CoRR, 2023

PaGE-Link: Path-based Graph Neural Network Explanation for Heterogeneous Link Prediction.
Proceedings of the ACM Web Conference 2023, 2023

DistTGL: Distributed Memory-Based Temporal Graph Neural Network Training.
Proceedings of the International Conference for High Performance Computing, 2023

TANGO: re-thinking quantization for graph neural network training on GPUs.
Proceedings of the International Conference for High Performance Computing, 2023

DSP: Efficient GNN Training with Multiple GPUs.
Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2023

Train Your Own GNN Teacher: Graph-Aware Distillation on Textual Graphs.
Proceedings of the Machine Learning and Knowledge Discovery in Databases: Research Track, 2023

GraphStorm an Easy-to-use and Scalable Graph Neural Network Framework: From Beginners to Heroes.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Optimizing Irregular Dense Operators of Heterogeneous GNN Models on GPU.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

2022
TGL: A General Framework for Temporal GNN Training onBillion-Scale Graphs.
Proc. VLDB Endow., 2022

From Local to Global: Spectral-Inspired Graph Neural Networks.
CoRR, 2022

Efficient and effective training of language and graph neural network models.
CoRR, 2022

TGL: A General Framework for Temporal GNN Training on Billion-Scale Graphs.
CoRR, 2022

Distributed Hybrid CPU and GPU training for Graph Neural Networks on Billion-Scale Heterogeneous Graphs.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Nimble GNN Embedding with Tensor-Train Decomposition.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

2021
Distributed Hybrid CPU and GPU training for Graph Neural Networks on Billion-Scale Graphs.
CoRR, 2021

TraverseNet: Unifying Space and Time in Message Passing.
CoRR, 2021

Scalable Graph Neural Networks with Deep Graph Library.
Proceedings of the WSDM '21, 2021

Learning over Families of Sets - Hypergraph Representation Learning for Higher Order Tasks.
Proceedings of the 2021 SIAM International Conference on Data Mining, 2021

Dr. Top-k: delegate-centric Top-k on GPUs.
Proceedings of the International Conference for High Performance Computing, 2021

Global Neighbor Sampling for Mixed CPU-GPU Training on Giant Graphs.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Schema-Aware Deep Graph Convolutional Networks for Heterogeneous Graphs.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

2020
PanRep: Universal node embeddings for heterogeneous graphs.
CoRR, 2020

Few-shot link prediction via graph neural networks for Covid-19 drug-repurposing.
CoRR, 2020

Collective Multi-type Entity Alignment Between Knowledge Graphs.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Learning Graph Neural Networks with Deep Graph Library.
Proceedings of the Companion of The 2020 Web Conference 2020, 2020

DGL-KE: Training Knowledge Graph Embeddings at Scale.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

DistDGL: Distributed Graph Neural Network Training for Billion-Scale Graphs.
Proceedings of the 10th IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms, 2020

FeatGraph: a flexible and efficient backend for graph neural network systems.
Proceedings of the International Conference for High Performance Computing, 2020

Scalable Graph Neural Networks with Deep Graph Library.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

2019
Deep Graph Library: Towards Efficient and Scalable Deep Learning on Graphs.
CoRR, 2019

Graphyti: A Semi-External Memory Graph Library for FlashGraph.
CoRR, 2019

clusterNOR: A NUMA-Optimized Clustering Framework.
CoRR, 2019

2018
Challenges Towards Deploying Data Intensive Scientific Applications on Extreme Heterogeneity Supercomputers.
CoRR, 2018

FlashR: parallelize and scale R for machine learning using SSDs.
Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2018

2017
Semi-External Memory Sparse Matrix Multiplication for Billion-Node Graphs.
IEEE Trans. Parallel Distributed Syst., 2017

knor: A NUMA-Optimized In-Memory, Distributed and Semi-External-Memory k-means Library.
Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing, 2017

2016
FlashMatrix: Parallel, Scalable Data Analysis with Generalized Matrix Operations using Commodity SSDs.
CoRR, 2016

Semi-External Memory Sparse Matrix Multiplication on Billion-node Graphs in a Multicore Architecture.
CoRR, 2016

An SSD-based eigensolver for spectral analysis on billion-node graphs.
CoRR, 2016

NUMA-optimized In-memory and Semi-external-memory Parameterized Clustering.
CoRR, 2016

2015
Optimize Unsynchronized Garbage Collection in an SSD Array.
CoRR, 2015

FlashGraph: Processing Billion-Node Graphs on an Array of Commodity SSDs.
Proceedings of the 13th USENIX Conference on File and Storage Technologies, 2015

2014
Hadoop in Low-Power Processors.
CoRR, 2014

FlashGraph: Processing Billion-Node Graphs on an Array of Commodity SSDs.
CoRR, 2014

Active Community Detection in Massive Graphs.
CoRR, 2014

2013
Toward millions of file system IOPS on low-cost, commodity hardware.
Proceedings of the International Conference for High Performance Computing, 2013

2012
A Parallel Page Cache: IOPS and Caching for Multicore Systems.
Proceedings of the 4th USENIX Workshop on Hot Topics in Storage and File Systems, 2012


  Loading...