Use of interpretable machine learning approaches for quantificationally understanding the performance of steel fiber-reinforced recycled aggregate concrete: From the perspective of compressive strength and splitting tensile strength.

Shuyuan Zhang

Wenguang Chen

Jinjun Xu

Tianyu Xie

Eng. Appl. Artif. Intell., 2024

Compression for Better: A General and Stable Lossless Compression Framework.

CoRR, 2024

KAG: Boosting LLMs in Professional Domains via Knowledge Augmented Generation.

CoRR, 2024

Building a High-Performance Graph Storage on Top of Tree-Structured Key-Value Stores.

Big Data Min. Anal., 2024

Contrastive Learning Enhanced Diffusion Model for Improving Tropical Cyclone Intensity Estimation with Test-Time Adaptation.

Proceedings of the Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track, 2024

Ranking Enhanced Supervised Contrastive Learning for Regression.

Proceedings of the Advances in Knowledge Discovery and Data Mining, 2024

MatFactory: A Framework for High-Performance Matrix Factorization on FPGAs.

Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024

Coral: Maliciously Secure Computation Framework for Packed and Mixed Circuits.

Proceedings of the 2024 on ACM SIGSAC Conference on Computer and Communications Security, 2024

A Context-Sensitive Pointer Analysis Framework for Rust and Its Application to Call Graph Construction.

Proceedings of the 33rd ACM SIGPLAN International Conference on Compiler Construction, 2024

AdaPipe: Optimizing Pipeline Parallelism with Adaptive Recomputation and Partitioning.

Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023

A Comprehensive Survey on Distributed Training of Graph Neural Networks.

Proc. IEEE, December, 2023

Mat2Stencil: A Modular Matrix-Based DSL for Explicit and Implicit Matrix-Free PDE Solvers on Structured Grid.

Proc. ACM Program. Lang., October, 2023

Toward 6G TKμ Extreme Connectivity: Architecture, Key Technologies and Experiments.

IEEE Wirel. Commun., June, 2023

Gas Plume Target Detection in Multibeam Water Column Image Using Deep Residual Aggregation Structure and Attention Mechanism.

Remote. Sens., 2023

PUMA: Secure Inference of LLaMA-7B in Five Minutes.

CoRR, 2023

An Improved IPOS LLC Resonant Converter With Sub-Module Output Voltage Sharing in Low Ripple High-Voltage Applications.

IEEE Access, 2023

A Physics-guided NN-based Approach for Tropical Cyclone Intensity Estimation.

Proceedings of the 2023 SIAM International Conference on Data Mining, 2023

GraphSet: High Performance Graph Mining through Equivalent Set Transformations.

Proceedings of the International Conference for High Performance Computing, 2023

Canvas: Isolated and Adaptive Swapping for Multi-Applications on Remote Memory.

Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

GLM-130B: An Open Bilingual Pre-trained Model.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Lasa: Abstraction and Specialization for Productive and Performant Linear Algebra on FPGAs.

Proceedings of the 31st IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2023

G-Sparse: Compiler-Driven Acceleration for Generalized Sparse Computation for Graph Neural Networks on Modern GPUs.

Proceedings of the 32nd International Conference on Parallel Architectures and Compilation Techniques, 2023

2022

Detecting Performance Variance for Parallel Applications Without Source Code.

IEEE Trans. Parallel Distributed Syst., 2022

Leveraging Code Snippets to Detect Variations in the Performance of HPC Systems.

IEEE Trans. Parallel Distributed Syst., 2022

TC-Stream: Large-Scale Graph Triangle Counting on a Single Machine Using GPUs.

IEEE Trans. Parallel Distributed Syst., 2022

GLM-130B: An Open Bilingual Pre-trained Model.

CoRR, 2022

Toward 6G TKμ Extreme Connectivity: Architecture, Key Technologies and Experiments.

CoRR, 2022

Mixed-Precision Inference Quantization: Radically Towards Faster inference speed, Lower Storage requirement, and Lower Loss.

Daning Cheng

Wenguang Chen

CoRR, 2022

Programming Matrices as Staged Sparse Rows to Generate Efficient Matrix-free Differential Equation Solver.

CoRR, 2022

Canvas: Isolated and Adaptive Swapping for Multi-Applications on Remote Memory.

CoRR, 2022

Quantization in Layer's Input is Matter.

Daning Cheng

Wenguang Chen

CoRR, 2022

Scaling Graph 500 SSSP to 140 Trillion Edges with over 40 Million Cores.

Proceedings of the SC22: International Conference for High Performance Computing, 2022

Vapro: performance variance detection and diagnosis for production-run parallel applications.

Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

BaGuaLu: targeting brain scale pretrained models with over 37 million cores.

Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

Scaling graph traversal to 281 trillion edges with 40 million cores.

Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

TriCache: A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs.

Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022

2021

TADOC: Text analytics directly on compression.

VLDB J., 2021

Automatic Irregularity-Aware Fine-Grained Workload Partitioning on Integrated Architectures.

IEEE Trans. Knowl. Data Eng., 2021

A Fast Lock for Explicit Message Passing Architectures.

IEEE Trans. Computers, 2021

Taking the Pulse of Financial Activities with Online Graph Processing.

ACM SIGOPS Oper. Syst. Rev., 2021

Chukonu: A Fully-Featured Big Data Processing System by Efficiently Integrating a Native Compute Engine into Spark.

Proc. VLDB Endow., 2021

GraphTheta: A Distributed Graph Neural Network Learning System With Flexible Training Strategy.

CoRR, 2021

Processing extreme-scale graphs on China's supercomputers.

Yiming Zhang

Kai Lu

Wenguang Chen

Commun. ACM, 2021

AIPerf: Automated machine learning as an AI-HPC benchmark.

Big Data Min. Anal., 2021

LotusSQL: SQL engine for high-performance big data systems.

Big Data Min. Anal., 2021

RisGraph: A Real-Time Streaming System for Evolving Graphs to Support Sub-millisecond Per-update Analysis at Millions Ops/s.

Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

DFOGraph: an I/O- and communication-efficient system for distributed fully-out-of-core graph processing.

Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

Encouraging Compiler Optimization Practice for Undergraduate Students through Competition.

Proceedings of the ITiCSE '21: 26th ACM Conference on Innovation and Technology in Computer Science Education V.1, Virtual Event, Germany, June 26, 2021

Sparker: Efficient Reduction for More Scalable Machine Learning with Spark.

Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021

2020

Survey of external memory large-scale graph processing on a multi-core system.

J. Supercomput., 2020

LiveGraph: A Transactional Graph Storage System with Purely Sequential Adjacency List Scans.

Proc. VLDB Endow., 2020

RisGraph: A Real-Time Streaming System for Evolving Graphs.

CoRR, 2020

2019

ANG: a combination of Apriori and graph computing techniques for frequent itemsets mining.

J. Supercomput., 2019

LiveGraph: A Transactional Graph Storage System with Purely Sequential Adjacency List Scans.

CoRR, 2019

Auxo: a temporal graph management system.

Big Data Min. Anal., 2019

Spread-n-share: improving application performance and cluster throughput with resource-aware job placement.

Proceedings of the International Conference for High Performance Computing, 2019

Pimiento: A Vertex-Centric Graph-Processing Framework on a Single Machine.

Proceedings of the Algorithms and Architectures for Parallel Processing, 2019

T2S-Tensor: Productively Generating High-Performance Spatial Hardware for Dense Tensor Computations.

Nitish Kumar Srivastava

Christopher J. Hughes

Timothy G. Mattson

Pradeep Dubey

Proceedings of the 27th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2019

HiWayLib: A Software Framework for Enabling High Performance Communications for Heterogeneous Pipeline Computations.

Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 2019

pLock: A Fast Lock for Architectures with Explicit Inter-core Message Passing.

Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 2019

2018

An Efficient In-Memory Checkpoint Method and its Practice on Fault-Tolerant HPL.

IEEE Trans. Parallel Distributed Syst., 2018

Efficient Document Analytics on Compressed Data: Method, Challenges, Algorithms, Insights.

Proc. VLDB Endow., 2018

Bridging the Gap Between Neural Networks and Neuromorphic Hardware with A Neural Network Compiler.

CoRR, 2018

The future of artificial intelligence in China.

Commun. ACM, 2018

Will supercomputers be super-data and super-AI machines?

Commun. ACM, 2018

Welcome to the China region special section.

Wenguang Chen

Xiang-Yang Li

Commun. ACM, 2018

Spindle: Informed Memory Access Monitoring.

Proceedings of the 2018 USENIX Annual Technical Conference, 2018

ShenTu: processing multi-trillion edge graphs on millions of cores in seconds.

Proceedings of the International Conference for High Performance Computing, 2018

vSensor: leveraging fixed-workload snippets of programs for performance variance detection.

Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2018

Zwift: A Programming Framework for High Performance Text Analytics on Compressed Data.

Proceedings of the 32nd International Conference on Supercomputing, 2018

Bridge the Gap between Neural Networks and Neuromorphic Hardware with a Neural Network Compiler.

Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018

2017

Understanding Co-Running Behaviors on Integrated CPU/GPU Architectures.

IEEE Trans. Parallel Distributed Syst., 2017

Congestion control and energy-balanced scheme based on the hierarchy for WSNs.

Wenguang Chen

Yugang Niu

Yuanyuan Zou

IET Wirel. Sens. Syst., 2017

Self-Checkpoint: An In-Memory Checkpoint Method Using Less Space and Its Practice on Fault-Tolerant HPL.

Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017

Versapipe: a versatile programming framework for pipelined computing on GPU.

Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, 2017

Scalable Graph Traversal on Sunway TaihuLight with Ten Million Cores.

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Performance evaluation and optimization of HBM-Enabled GPU for data-intensive applications.

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2017

What Decides the Dropout in MOOCs?

Proceedings of the Database Systems for Advanced Applications, 2017

FinePar: irregularity-aware fine-grained workload partitioning on integrated architectures.

Proceedings of the 2017 International Symposium on Code Generation and Optimization, 2017

SaberLDA: Sparsity-Aware Learning of Topic Models on GPUs.

Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, 2017

POSTER: Bridge the Gap Between Neural Networks and Neuromorphic Hardware.

Proceedings of the 26th International Conference on Parallel Architectures and Compilation Techniques, 2017

2016

Building Semi-Elastic Virtual Clusters for Cost-Effective HPC Cloud Resource Provisioning.

IEEE Trans. Parallel Distributed Syst., 2016

DRDDR: a lightweight method to detect data races in Linux kernel.

J. Supercomput., 2016

Performance Prediction for Large-Scale Parallel Applications Using Representative Replay.

IEEE Trans. Computers, 2016

WarpLDA: a Cache Efficient O(1) Algorithm for Latent Dirichlet Allocation.

Proc. VLDB Endow., 2016

NestedMP: Enabling cache-aware thread mapping for nested parallel shared memory applications.

Jiangzhou He

Wenguang Chen

Zhizhong Tang

Parallel Comput., 2016

Data adapter for querying and transformation between SQL and NoSQL database.

Future Gener. Comput. Syst., 2016

A survey of cloud resource management for complex engineering applications.

Frontiers Comput. Sci., 2016

Characterizing and optimizing TPC-C workloads on large-scale systems using SSD arrays.

Sci. China Inf. Sci., 2016

Refactoring and optimizing the community atmosphere model (CAM) on the sunway taihulight supercomputer.

Proceedings of the International Conference for High Performance Computing, 2016

Gemini: A Computation-Centric Distributed Graph Processing System.

Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation, 2016

NEUTRAMS: Neural network transformation and co-design under neuromorphic hardware constraints.

Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

Neural network transformation under hardware constraints.

Proceedings of the 2016 International Conference on Compilers, 2016

2015

Automatic Cloud I/O Configurator for I/O Intensive Parallel Applications.

IEEE Trans. Parallel Distributed Syst., 2015

ImmortalGraph: A System for Storage and Analysis of Temporal Graphs.

ACM Trans. Storage, 2015

Extending Conditional Dependencies with Built-in Predicates.

IEEE Trans. Knowl. Data Eng., 2015

Optimizing seam carving on multi-GPU systems for real-time content-aware image resizing.

J. Supercomput., 2015

WarpLDA: a Simple and Efficient O(1) Algorithm for Latent Dirichlet Allocation.

CoRR, 2015

GridGraph: Large-Scale Graph Processing on a Single Machine Using 2-Level Hierarchical Partitioning.

Xiaowei Zhu

Wentao Han

Wenguang Chen

Proceedings of the 2015 USENIX Annual Technical Conference, 2015

BiFennel: Fast Bipartite Graph Partitioning Algorithm for Big Data.

Proceedings of the 2015 IEEE International Conference on Smart City/SocialCom/SustainCom/DataCom/SC2 2015, 2015

To Co-run, or Not to Co-run: A Performance Study on Integrated Architectures.

Proceedings of the 23rd IEEE International Symposium on Modeling, 2015

AsHES Introduction and Committees.

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Cost-Effective Resource Configuration for Cloud Video Streaming Services.

Yunyun Jiang

Xiaosong Ma

Wenguang Chen

Proceedings of the 21st IEEE International Conference on Parallel and Distributed Systems, 2015

A Power-Conserving Online Scheduling Scheme for Video Streaming Services.

Proceedings of the Algorithms and Architectures for Parallel Processing, 2015

Distributed Metaserver Mechanism and Recovery Mechanism Support in Quantcast File System.

Proceedings of the 39th IEEE Annual Computer Software and Applications Conference, 2015

Weibo, and a Tale of Two Worlds.

Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2015

2014

CYPRESS: Combining Static and Dynamic Analysis for Top-Down Communication Trace Compression.

Proceedings of the International Conference for High Performance Computing, 2014

Cybertron: pushing the limit on I/O reduction in data-parallel programs.

Proceedings of the 2014 ACM International Conference on Object Oriented Programming Systems Languages & Applications, 2014

Nondeterminism in MapReduce considered harmful? an empirical study on non-commutative aggregators in MapReduce programs.

Proceedings of the 36th International Conference on Software Engineering, 2014

NestedMP: Taming Complex Configuration Space of Degree of Parallelism for Nested-Parallel Programs.

Jiangzhou He

Wenguang Chen

Zhizhong Tang

Proceedings of the 43rd International Conference on Parallel Processing Workshops, 2014

Optimizing Seam Carving on multi-GPU systems for real-time image resizing.

Proceedings of the 20th IEEE International Conference on Parallel and Distributed Systems, 2014

Chronos: a graph engine for temporal graph analysis.

Proceedings of the Ninth Eurosys Conference 2014, 2014

Kernel data race detection using debug register in Linux.

Proceedings of the 2014 IEEE Symposium on Low-Power and High-Speed Chips, 2014

2013

Taming Hardware Event Samples for Precise and Versatile Feedback Directed Optimizations.

IEEE Trans. Computers, 2013

Improving cis-regulatory elements modeling by consensus scaffolded mixture models.

Sci. China Inf. Sci., 2013

Cost-effective cloud HPC resource provisioning by building semi-elastic virtual clusters.

Proceedings of the International Conference for High Performance Computing, 2013

ACIC: automatic cloud I/O configurator for HPC applications.

Proceedings of the International Conference for High Performance Computing, 2013

ACIC: automatic cloud I/O configurator for parallel applications.

Proceedings of the 22nd International Symposium on High-Performance Parallel and Distributed Computing, 2013

Shall I Use Heterogeneous Data Centers? - A Case Study on Video on Demand Systems.

Shi Feng

Hong Zhang

Wenguang Chen

Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing, 2013

2012

SMILE: streaming management of applications and data for mobile terminals.

Int. J. Cloud Comput., 2012

CUDA-Zero: a framework for porting shared memory GPU applications to multi-GPUs.

Dehao Chen

Wenguang Chen

Weimin Zheng

Sci. China Inf. Sci., 2012

Acolyte: An In-Memory Social Network Query System.

Proceedings of the Web Information Systems Engineering - WISE 2012, 2012

Employing Checkpoint to Improve Job Scheduling in Large-Scale Systems.

Proceedings of the Job Scheduling Strategies for Parallel Processing, 2012

Parameter estimation of Conditional Random Fields model based on cloud computing.

Proceedings of the 2012 IEEE International Conference on Granular Computing, 2012

2011

Efficiently Acquiring Communication Traces for Large-Scale Parallel Applications.

IEEE Trans. Parallel Distributed Syst., 2011

ASLOP: A field-access affinity-based structure data layout optimizer.

Sci. China Inf. Sci., 2011

Cloud versus in-house cluster: evaluating Amazon cluster compute instances for running MPI applications.

Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2011

An SSA-based algorithm for optimal speculative code motion under an execution profile.

Hucheng Zhou

Wenguang Chen

Fred C. Chow

Proceedings of the 32nd ACM SIGPLAN Conference on Programming Language Design and Implementation, 2011

RACEZ: a lightweight and non-invasive race detection tool for production applications.

Proceedings of the 33rd International Conference on Software Engineering, 2011

One optimized I/O configuration per HPC application: leveraging the configurability of cloud.

Proceedings of the APSys '11 Asia Pacific Workshop on Systems, 2011

OpenMDSP: Extending OpenMP to Program Multi-Core DSP.

Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011

2010

PHANTOM: predicting performance of parallel applications on large-scale parallel machines using a single node.

Jidong Zhai

Wenguang Chen

Weimin Zheng

Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2010

Do I use the wrong definition?: DeFuse: definition-use invariants for detecting concurrency and sequential bugs.

Proceedings of the 25th Annual ACM SIGPLAN Conference on Object-Oriented Programming, 2010

How OpenMP Applications Get More Benefit from Many-Core Era.

Proceedings of the Beyond Loop Level Parallelism in OpenMP: Accelerators, 2010

Taming hardware event samples for FDO compilation.

Proceedings of the CGO 2010, 2010

MapCG: writing parallel program portable between CPU and GPU.

Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, 2010

2009

Incorporating cardinality constraints and synonym rules into conditional functional dependencies.

Wenguang Chen

Wenfei Fan

Shuai Ma

Inf. Process. Lett., 2009

LogGPO: An accurate communication model for performance prediction of MPI programs.

Sci. China Ser. F Inf. Sci., 2009

FACT: fast communication trace collection for parallel applications through program slicing.

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009

MPIWiz: subgroup reproducible replay of mpi applications.

Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009

Improving Dense Linear Equation Solver on Hybrid CPU-GPU System.

Proceedings of the 10th International Symposium on Pervasive Systems, 2009

Process Mapping for MPI Collective Communications.

Proceedings of the Euro-Par 2009 Parallel Processing, 2009

Analyses and Validation of Conditional Dependencies with Built-in Predicates.

Wenguang Chen

Wenfei Fan

Shuai Ma

Proceedings of the Database and Expert Systems Applications, 20th International Conference, 2009

Extracting Maximal Degenerate Motifs Based on a Suffix Tree.

Proceedings of the International Conference on Bioinformatics & Computational Biology, 2009

Cache Sharing Management for Performance Fairness in Chip Multiprocessors.

Xing Zhou

Wenguang Chen

Weimin Zheng

Proceedings of the PACT 2009, 2009

2008

Exploring the Emerging Applications for Transactional Memory.

Proceedings of the Ninth International Conference on Parallel and Distributed Computing, 2008

CprFS: a user-level file system to support consistent file states for checkpoint and restart.

Ruini Xue

Wenguang Chen

Weimin Zheng

Proceedings of the 22nd Annual International Conference on Supercomputing, 2008

Maotai: View-Oriented Parallel Programming on CMT Processors.

Proceedings of the 2008 International Conference on Parallel Processing, 2008

Parallelization and Characterization of Probabilistic Latent Semantic Analysis.

Proceedings of the 2008 International Conference on Parallel Processing, 2008

Parallelization of spectral clustering algorithm on multi-core processors and GPGPU.

Proceedings of the 13th Asia-Pacific Computer Systems Architecture Conference, 2008

2007

OpenUH: an optimizing, portable OpenMP compiler.

Concurr. Comput. Pract. Exp., 2007

PBB: a parallel bioinformatics benchmark suite for shared memory multiprocessors.

Wenguang Chen

Chuntao Hong

Proceedings of the CHINA HPC 2007, 2007

Performance Evaluation of View-Oriented Parallel Programming on Cluster of Computers.

Proceedings of the High Performance Computing and Communications, 2007

History Based User Interest Modeling in WWW Access.

Shuang Han

Wenguang Chen

Heng Wang

Proceedings of the Human-Computer Interaction. HCI Intelligent Multimodal Interaction Environments, 2007

Revisit of View-Oriented Parallel Programming.

Zhiyi Huang

Wenguang Chen

Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2007), 2007

2006

Parallelization of module network structure learning and performance tuning on SMP.

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Parallel implementation and performance characterization of MUSCLE.

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Tree partition based parallel frequent pattern mining on shared memory systems.

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

MPIPP: an automatic profile-guided parallel process placement toolset for SMP clusters and multiclusters.

Proceedings of the 20th Annual International Conference on Supercomputing, 2006

Distributed File Streamer: A Framework for Distributed Application Data Coupling.

Proceedings of the 7th IEEE/ACM International Conference on Grid Computing (GRID 2006), 2006

VODCA: View-Oriented, Distributed, Cluster-Based Approach to Parallel Computing.

Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2006), 2006

2005

Thckpt: Transparent Checkpointing of Linux Processes Under IA-64.

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2005

Parallel Implementation of SEMPHY - a Structural EM Algorithm for Phylogenetic Reconstruction.

Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

Parallel Module Network Learning on Distributed Memory Multiprocessors.

Proceedings of the 34th International Conference on Parallel Processing Workshops (ICPP 2005 Workshops), 2005

A Dynamic Energy Conservation Scheme for Clusters in Computing Centers.

Proceedings of the Embedded Software and Systems, Second International Conference, 2005

Hierarchical Parallel Simulated Annealing and Its Applications.

Proceedings of the Distributed and Parallel Computing, 2005

2004

Parallelization of Bayesian Network based SNPs Pattern Analysis and Performance Characterization on SMP/HT.

Proceedings of the 10th International Conference on Parallel and Distributed Systems, 2004

A Single Thread Discrete Event Simulation Toolkit for Java: STSimJ.

Wenguang Chen

Dingxing Wang

Weimin Zheng

Proceedings of the Computational Science, 2004

2003

On the Malicious Participants Problem in Computational Grid.

Wenguang Chen

Weimin Zheng

Guangwen Yang

Proceedings of the Grid and Cooperative Computing, Second International Workshop, 2003

Wenguang Chen
