Feng Zhang

Orcid: 0000-0003-1983-7321

Affiliations:
  • Renmin University of China, Key Laboratory of Data Engineering and Knowledge Engineering, Beijing, China
  • Tsinghua University, Department of Computer Science and Technology, Beijing, China (former)


According to our database1, Feng Zhang authored at least 62 papers between 2015 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Optimizing DNNs With Partially Equivalent Transformations and Automated Corrections.
IEEE Trans. Computers, December, 2023

Homomorphic Compression: Making Text Processing on Compression Unlimited.
Proc. ACM Manag. Data, December, 2023

Enabling Efficient Random Access to Hierarchically Compressed Text Data on Diverse GPU Platforms.
IEEE Trans. Parallel Distributed Syst., October, 2023

Expanding the Edge: Enabling Efficient Winograd CNN Inference With Deep Reuse on Edge Device.
IEEE Trans. Knowl. Data Eng., October, 2023

BladeDISC: Optimizing Dynamic Shape Machine Learning Workloads via Compiler Approach.
Proc. ACM Manag. Data, September, 2023

CompressGraph: Efficient Parallel Graph Analytics with Rule-Based Compression.
Proc. ACM Manag. Data, 2023

Binary stochasticity enabled highly efficient neuromorphic deep learning achieves better-than-software accuracy.
CoRR, 2023

CompressStreamDB: Fine-Grained Adaptive Stream Processing without Decompression.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

EdgeNN: Efficient Neural Network Inference for CPU-GPU Integrated Edge Devices.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Efficient Anomaly Detection in Property Graphs.
Proceedings of the Database Systems for Advanced Applications, 2023

RECom: A Compiler Approach to Accelerating Recommendation Model Inference with Massive Embedding Columns.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

Space-Efficient TREC for Enabling Deep Learning on Microcontrollers.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
YuenyeungSpTRSV.
Dataset, May, 2022

Payment behavior prediction on shared parking lots with TR-GCN.
VLDB J., 2022

POCLib: A High-Performance Framework for Enabling Near Orthogonal Processing on Compression.
IEEE Trans. Parallel Distributed Syst., 2022

Detecting Performance Variance for Parallel Applications Without Source Code.
IEEE Trans. Parallel Distributed Syst., 2022

Leveraging Code Snippets to Detect Variations in the Performance of HPC Systems.
IEEE Trans. Parallel Distributed Syst., 2022

Exploring Data Analytics Without Decompression on Embedded GPU Systems.
IEEE Trans. Parallel Distributed Syst., 2022

G-SLIDE: A GPU-Based Sub-Linear Deep Learning Engine via LSH Sparsification.
IEEE Trans. Parallel Distributed Syst., 2022

Exploring Query Processing on CPU-GPU Integrated Edge Device.
IEEE Trans. Parallel Distributed Syst., 2022

Periodic Weather-Aware LSTM With Event Mechanism for Parking Behavior Prediction.
IEEE Trans. Knowl. Data Eng., 2022

Efficient Load-Balanced Butterfly Counting on GPU.
Proc. VLDB Endow., 2022

An Adaptive Elastic Multi-model Big Data Analysis and Information Extraction System.
Data Sci. Eng., 2022

DREW: Efficient Winograd CNN Inference with Deep Reuse.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

An In-depth Analysis of Subflow Degradation for Multi-path TCP on High Speed Rails.
Proceedings of the 23rd IEEE International Symposium on a World of Wireless, 2022

CompressDB: Enabling Efficient Compressed Data Direct Processing for Various Databases.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Optimizing Random Access to Hierarchically-Compressed Data on GPU.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Taming the Big Data Monster: Managing Petabytes of Data with Multi-Model Databases.
Proceedings of the 2022 IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), 2022

2021
TADOC: Text analytics directly on compression.
VLDB J., 2021

Fine-Grained Multi-Query Stream Processing on Integrated Architectures.
IEEE Trans. Parallel Distributed Syst., 2021

iMLBench: A Machine Learning Benchmark Suite for CPU-GPU Integrated Architectures.
IEEE Trans. Parallel Distributed Syst., 2021

YuenyeungSpTRSV: A Thread-Level and Warp-Level Fusion Synchronization-Free Sparse Triangular Solve.
IEEE Trans. Parallel Distributed Syst., 2021

An Efficient Parallel Secure Machine Learning Framework on GPUs.
IEEE Trans. Parallel Distributed Syst., 2021

DTransE: Distributed Translating Embedding for Knowledge Graph.
IEEE Trans. Parallel Distributed Syst., 2021

Automatic Irregularity-Aware Fine-Grained Workload Partitioning on Integrated Architectures.
IEEE Trans. Knowl. Data Eng., 2021

Preface.
J. Comput. Sci. Technol., 2021

Exploring deep reuse in winograd CNN inference.
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

G-TADOC: Enabling Efficient GPU-Based Text Analytics without Decompression.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

FineQuery: Fine-Grained Query Processing on CPU-GPU Integrated Architectures.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
Modeling Analysis and Cost-Performance Ratio Optimization of Virtual Machine Scheduling in Cloud Computing.
IEEE Trans. Parallel Distributed Syst., 2020

FineStream: Fine-Grained Window-Based Stream Processing on CPU-GPU Integrated Architectures.
Proceedings of the 2020 USENIX Annual Technical Conference, 2020

Payment Behavior Prediction and Statistical Analysis for Shared Parking Lots.
Proceedings of the Network and Parallel Computing, 2020

PewLSTM: Periodic LSTM with Weather-Aware Gating Mechanism for Parking Behavior Prediction.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

CapelliniSpTRSV: A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs.
Proceedings of the ICPP 2020: 49th International Conference on Parallel Processing, 2020

ParSecureML: An Efficient Parallel Secure Machine Learning Framework on GPUs.
Proceedings of the ICPP 2020: 49th International Conference on Parallel Processing, 2020

Towards Concurrent Stateful Stream Processing on Multicore Processors.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Enabling Efficient Random Access to Hierarchically-Compressed Data.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Exploration of TransE in a Distributed Environment.
Proceedings of the 40th IEEE International Conference on Distributed Computing Systems, 2020

2019
Hardware-Conscious Stream Processing: A Survey.
SIGMOD Rec., 2019

Guest Editorial: Special Issue on Network and Parallel Computing for Emerging Architectures and Applications.
Int. J. Parallel Program., 2019

Scaling Stream Processing with Transactional State Management on Multicores.
CoRR, 2019

Performance evaluation and analysis of sparse matrix and graph kernels on heterogeneous processors.
CCF Trans. High Perform. Comput., 2019

Statistical Analysis and Prediction of Parking Behavior.
Proceedings of the Network and Parallel Computing, 2019

Distributed Join Algorithms on Multi-CPU Clusters with GPUDirect RDMA.
Proceedings of the 48th International Conference on Parallel Processing, 2019

Parallel Hybrid Join Algorithm on GPU.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019

2018
An adaptive breadth-first search algorithm on integrated architectures.
J. Supercomput., 2018

Efficient Document Analytics on Compressed Data: Method, Challenges, Algorithms, Insights.
Proc. VLDB Endow., 2018

Zwift: A Programming Framework for High Performance Text Analytics on Compressed Data.
Proceedings of the 32nd International Conference on Supercomputing, 2018

2017
Understanding Co-Running Behaviors on Integrated CPU/GPU Architectures.
IEEE Trans. Parallel Distributed Syst., 2017

FinePar: irregularity-aware fine-grained workload partitioning on integrated architectures.
Proceedings of the 2017 International Symposium on Code Generation and Optimization, 2017

2016
Characterizing and optimizing TPC-C workloads on large-scale systems using SSD arrays.
Sci. China Inf. Sci., 2016

2015
To Co-run, or Not to Co-run: A Performance Study on Integrated Architectures.
Proceedings of the 23rd IEEE International Symposium on Modeling, 2015


  Loading...