Cho-Li Wang

Orcid: 0000-0002-4629-7175

According to our database1, Cho-Li Wang authored at least 156 papers between 1992 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Performance modeling on DaVinci AI core.
J. Parallel Distributed Comput., May, 2023

SelB-k-NN: A Mini-Batch K-Nearest Neighbors Algorithm on AI Processors.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Embedding Communication for Federated Graph Neural Networks with Privacy Guarantees.
Proceedings of the 43rd IEEE International Conference on Distributed Computing Systems, 2023

2022
MIPD: An Adaptive Gradient Sparsification Framework for Distributed DNNs Training.
IEEE Trans. Parallel Distributed Syst., 2022

SaPus: Self-Adaptive Parameter Update Strategy for DNN Training on Multi-GPU Clusters.
IEEE Trans. Parallel Distributed Syst., 2022

Momentum-driven adaptive synchronization model for distributed DNN training on HPC clusters.
J. Parallel Distributed Comput., 2022

Compiler-Directed Incremental Checkpointing for Low Latency GPU Preemption.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

Efficient exact K-nearest neighbor graph construction for billion-scale datasets using GPUs with tensor cores.
Proceedings of the ICS '22: 2022 International Conference on Supercomputing, Virtual Event, June 28, 2022

KAFL: Achieving High Training Efficiency for Fast-K Asynchronous Federated Learning.
Proceedings of the 42nd IEEE International Conference on Distributed Computing Systems, 2022

Optimizing Aggregate Computation of Graph Neural Networks with on-GPU Interpreter-Style Programming.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2022

2021
FedSCR: Structure-Based Communication Reduction for Federated Learning.
IEEE Trans. Parallel Distributed Syst., 2021

CTXBack: Enabling Low Latency GPU Context Switching via Context Flashback.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

Accelerating DBSCAN Algorithm with AI Chips for Large Datasets.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021

Collaborative GPU Preemption via Spatial Multitasking for Efficient GPU Sharing.
Proceedings of the Euro-Par 2021: Parallel Processing, 2021

2020
Probabilistic Consistency Guarantee in Partial Quorum-Based Data Store.
IEEE Trans. Parallel Distributed Syst., 2020

A Model-Based Software Solution for Simultaneous Multiple Kernels on GPUs.
ACM Trans. Archit. Code Optim., 2020

On-GPU thread-data remapping for nested branch divergence.
J. Parallel Distributed Comput., 2020

Uranus: Simple, Efficient SGX Programming and its Applications.
Proceedings of the ASIA CCS '20: The 15th ACM Asia Conference on Computer and Communications Security, 2020

2019
Efficient low-latency packet processing using On-GPU Thread-Data Remapping.
J. Parallel Distributed Comput., 2019

Spectral Graph Theory Based Topology Analysis for Reconfigurable Data Center Networks.
Proceedings of the 15th International Conference on Mobile Ad-Hoc and Sensor Networks, 2019

FluentPS: A Parameter Server Design with Low-frequency Synchronization for Distributed Deep Learning.
Proceedings of the 2019 IEEE International Conference on Cluster Computing, 2019

EC-Shuffle: Dynamic Erasure Coding Optimization for Efficient and Reliable Shuffle in Spark.
Proceedings of the 19th IEEE/ACM International Symposium on Cluster, 2019

2018
Confluence: Speeding Up Iterative Distributed Operations by Key-Dependency-Aware Partitioning.
IEEE Trans. Parallel Distributed Syst., 2018

SIMPO: A Scalable In-Memory Persistent Object Framework Using NVRAM for Reliable Big Data Computing.
ACM Trans. Archit. Code Optim., 2018

On-GPU Thread-Data Remapping for Branch Divergence Reduction.
ACM Trans. Archit. Code Optim., 2018

NVCL: Exploiting NVRAM in Cache-Line Granularity Differential Logging.
Proceedings of the IEEE 7th Non-Volatile Memory Systems and Applications Symposium, 2018

2017
Scalable Adaptive NUMA-Aware Lock.
IEEE Trans. Parallel Distributed Syst., 2017

PoweRock: Power Modeling and Flexible Dynamic Power Management for Many-Core Architectures.
IEEE Syst. J., 2017

2016
Scalable adaptive NUMA-aware lock: combining local locking and remote locking for efficient concurrency.
Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016

Lightweight Dependency Checking for Parallelizing Loops with Non-Deterministic Dependency on GPU.
Proceedings of the 22nd IEEE International Conference on Parallel and Distributed Systems, 2016

2015
Latency-aware DVFS for efficient power state transitions on many-core architectures.
J. Supercomput., 2015

Optimization of Composite Cloud Service Processing with Virtual Machines.
IEEE Trans. Computers, 2015

Cloud, grid, P2P and internet computing: Recent trends and future directions.
Peer-to-Peer Netw. Appl., 2015

Cache Affinity Optimization Techniques for Scaling Software Transactional Memory Systems on Multi-CMP Architectures.
Proceedings of the 14th International Symposium on Parallel and Distributed Computing, 2015

2014
Computational awareness towards green environments.
J. Supercomput., 2014

Adaptive Algorithm for Minimizing Cloud Task Length with Prediction Errors.
IEEE Trans. Cloud Comput., 2014

A Power Modelling Approach for Many-Core Architectures.
Proceedings of the 2014 10th International Conference on Semantics, 2014

Rhymes: A shared virtual memory system for non-coherent tiled many-core architectures.
Proceedings of the 20th IEEE International Conference on Parallel and Distributed Systems, 2014

Resource Allocation in Cloud Environment: A Model Based on Double Multi-attribute Auction Mechanism.
Proceedings of the IEEE 6th International Conference on Cloud Computing Technology and Science, 2014

Adaptive Live VM Migration over a WAN: Modeling and Implementation.
Proceedings of the 2014 IEEE 7th International Conference on Cloud Computing, Anchorage, AK, USA, June 27, 2014

2013
Error-Tolerant Resource Allocation and Payment Minimization for Cloud System.
IEEE Trans. Parallel Distributed Syst., 2013

Dynamic Optimization of Multiattribute Resource Allocation in Self-Organizing Clouds.
IEEE Trans. Parallel Distributed Syst., 2013

Network performance isolation for latency-sensitive cloud applications.
Future Gener. Comput. Syst., 2013

Ex-post efficient resource allocation for Self-organizing Cloud.
Comput. Electr. Eng., 2013

Optimization of cloud task processing with checkpoint-restart mechanism.
Proceedings of the International Conference for High Performance Computing, 2013

Optimization and stabilization of composite service processing in a cloud system.
Proceedings of the 21st IEEE/ACM International Symposium on Quality of Service, 2013

Java with Auto-parallelization on Graphics Coprocessing Architecture.
Proceedings of the 42nd International Conference on Parallel Processing, 2013

PVTCP: Towards practical and effective congestion control in virtualized datacenters.
Proceedings of the 2013 21st IEEE International Conference on Network Protocols, 2013

Minimization of cloud task execution length with workload prediction errors.
Proceedings of the 20th Annual International Conference on High Performance Computing, 2013

GPU-TLS: An Efficient Runtime for Speculative Loop Parallelization on GPUs.
Proceedings of the 13th IEEE/ACM International Symposium on Cluster, 2013

Towards Payment-Bound Analysis in Cloud Systems with Task-Prediction Errors.
Proceedings of the 2013 IEEE Sixth International Conference on Cloud Computing, Santa Clara, CA, USA, June 28, 2013

2012
Decentralized proactive resource allocation for maximizing throughput of P2P Grid.
J. Parallel Distributed Comput., 2012

Grid transaction management and an efficient development kit.
Comput. Syst. Sci. Eng., 2012

An efficient deadlock prevention approach for service oriented transaction processing.
Comput. Math. Appl., 2012

Mobile Edutainment with Interactive Augmented Reality Using Adaptive Marker Tracking.
Proceedings of the 18th IEEE International Conference on Parallel and Distributed Systems, 2012

SmartShadow-K: an practical knowledge network for joint context inference in everyday life.
Proceedings of the 2012 ACM Conference on Ubiquitous Computing, 2012

vBalance: using interrupt load balance to improve I/O performance for SMP virtual machines.
Proceedings of the ACM Symposium on Cloud Computing, SOCC '12, 2012

Lightweight Application-Level Task Migration for Mobile Cloud Computing.
Proceedings of the IEEE 26th International Conference on Advanced Information Networking and Applications, 2012

2011
A pipeline-based approach for long transaction processing in web service environments.
Int. J. Web Grid Serv., 2011

Defeating Network Jitter for Virtual Machines.
Proceedings of the IEEE 4th International Conference on Utility and Cloud Computing, 2011

WAVNet: Wide-Area Network Virtualization Technique for Virtual Private Cloud.
Proceedings of the International Conference on Parallel Processing, 2011

Probabilistic Best-Fit Multi-dimensional Range Query in Self-Organizing Cloud.
Proceedings of the International Conference on Parallel Processing, 2011

TrC-MC: Decentralized Software Transactional Memory for Multi-multicore Computers.
Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

Towards Context-Aware Ubiquitous Transaction Processing: A Model and Algorithm.
Proceedings of IEEE International Conference on Communications, 2011

eXCloud: Transparent runtime support for scaling mobile applications in cloud.
Proceedings of the 2011 International Conference on Cloud and Service Computing, 2011

Social-optimized win-win resource allocation for Self-organizing Cloud.
Proceedings of the 2011 International Conference on Cloud and Service Computing, 2011

2010
GPS Calibrated Ad-Hoc Localization for Geosocial Networking.
Proceedings of the Ubiquitous Intelligence and Computing - 7th International Conference, 2010

Adaptive sampling-based profiling techniques for optimizing the distributed JVM runtime.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

A Stack-on-Demand Execution Model for Elastic Computing.
Proceedings of the 39th International Conference on Parallel Processing, 2010

Dual-Phase Just-in-Time Workflow Scheduling in P2P Grid Systems.
Proceedings of the 39th International Conference on Parallel Processing, 2010

Optimizing data acquisition by sensor-channel co-allocation in wireless sensor networks.
Proceedings of the 2010 International Conference on High Performance Computing, 2010

Conflict-minimizing dynamic load balancing for P2P desktop Grid.
Proceedings of the 2010 11th IEEE/ACM International Conference on Grid Computing, 2010

BetterLife 2.0: Large-Scale Social Intelligence Reasoning on Cloud.
Proceedings of the Cloud Computing, Second International Conference, 2010

2009
Towards pervasive instant messaging and presence awareness.
Int. J. Pervasive Comput. Commun., 2009

SmartShadow: Modeling A User-centric Mobile Virtual Space.
Proceedings of the Seventh Annual IEEE International Conference on Pervasive Computing and Communications, 2009

Path-Analytic Distributed Object Prefetching.
Proceedings of the 10th International Symposium on Pervasive Systems, 2009

A Case-Based Component Selection Framework for Mobile Context-Aware Applications.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2009

A Semantic Context Management Framework on Mobile Device.
Proceedings of the International Conference on Embedded Software and Systems, 2009

2008
Object co-location and memory reuse for Java programs.
ACM Trans. Archit. Code Optim., 2008

Implementation of an Intelligent Urban Traffic Management System Based on a City Grid Infrastructure.
J. Inf. Sci. Eng., 2008

Handoff Performance Comparison of Mobile IP, Fast Handoff and mSCTP in Mobile Wireless Networks.
Proceedings of the 9th International Symposium on Parallel Architectures, 2008

Lightweight process migration and memory prefetching in openMosix.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Scalable group-based checkpoint/restart for large-scale message-passing systems.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Process reassignment with reduced migration cost in grid load rebalancing.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

A Performance Study of Clustering Web Application Servers with Distributed JVM.
Proceedings of the 14th International Conference on Parallel and Distributed Systems, 2008

2007
A Receiver-Coordinated Approach for Throughput Aggregation in High Bandwidth Multicast.
Proceedings of the INFOCOM 2007. 26th IEEE International Conference on Computer Communications, 2007

GPS-Based Location Extraction and Presence Management for Mobile Instant Messenger.
Proceedings of the Embedded and Ubiquitous Computing, International Conference, 2007

2006
Code-on-Demand and Code Adaptation for Mobile Computing.
Proceedings of the Handbook of Mobile Middleware., 2006

An architecture to support scalable distributed virtual environment systems on grid.
J. Supercomput., 2006

G-PASS: an instance-oriented security infrastructure for Grid travelers.
Concurr. Comput. Pract. Exp., 2006

A segment-based DSM supporting large shared object space.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Smart Instant Messenger in Pervasive Computing Environments.
Proceedings of the Advances in Grid and Pervasive Computing, 2006

An Adaptive Multipath Protocol for Efficient IP Handoff in Mobile Wireless Networks.
Proceedings of the 20th International Conference on Advanced Information Networking and Applications (AINA 2006), 2006

2005
On The Cooperation of Web Clients and Proxy Caches.
Proceedings of the 11th International Conference on Parallel and Distributed Systems, 2005

Smart Retrieval and Sharing of Information Resources Based on Contexts of User-Information Relationships.
Proceedings of the 19th International Conference on Advanced Information Networking and Applications (AINA 2005), 2005

2004
Cyclone: A High-Performance Cluster-Based Web Server with Socket Cloning.
Clust. Comput., 2004

Gamelet: A Mobile Service Component for Building Multi-server Distributed Virtual Environment on Grid.
Proceedings of the Parallel and Distributed Processing and Applications, 2004

Petri-Net-Based Coordination Algorithms for Grid Transactions.
Proceedings of the Parallel and Distributed Processing and Applications, 2004

State-On-Demand Execution for Adaptive Component-based Mobile Agent Systems.
Proceedings of the 10th International Conference on Parallel and Distributed Systems, 2004

G-PASS: Security Infrastructure for Grid Travelers.
Proceedings of the Grid and Cooperative Computing, 2004

InstantGrid: A Framework for On-Demand Grid Point Construction.
Proceedings of the Grid and Cooperative Computing, 2004

Design of an OGSA-Based MetaService Architecture.
Proceedings of the Grid and Cooperative Computing, 2004

Grid Computing in Hong Kong: Research and Development.
Proceedings of the 10th IEEE International Workshop on Future Trends of Distributed Computing Systems (FTDCS 2004), 2004

Context-Aware State Management for Ubiquitous Applications.
Proceedings of the Embedded and Ubiquitous Computing, 2004

Ontology Mapping in Pervasive Computing Environment.
Proceedings of the Embedded and Ubiquitous Computing, 2004

A Collaborative and Semantic Data Management Framework for Ubiquitous Computing Environment.
Proceedings of the Embedded and Ubiquitous Computing, 2004

A novel adaptive home migration protocol in home-based DSM.
Proceedings of the 2004 IEEE International Conference on Cluster Computing (CLUSTER 2004), 2004

LOTS: a software DSM supporting large object space.
Proceedings of the 2004 IEEE International Conference on Cluster Computing (CLUSTER 2004), 2004

PAT: a postmortem object access pattern analysis and visualization tool.
Proceedings of the 4th IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 2004), 2004

Exploiting Java Objects Behavior for Memory Management and Optimizations.
Proceedings of the Programming Languages and Systems: Second Asian Symposium, 2004

A Grid-enabled Multi-server Network Game Architecture.
Proceedings of the 3rd International Conference on Application and Development of Computer Games (ADCOG 2004) held on 26-27 April 2004 in City University of Hong Kong, 2004

2003
Solving irregularly structured problems based on distributed object model.
Parallel Comput., 2003

On the design of global object space for efficient multi-threading Java computing on clusters.
Parallel Comput., 2003

Document replication and distribution in extensible geographically distributed web servers.
J. Parallel Distributed Comput., 2003

A Grid Middleware for Distributed Java Computing with MPI Binding and Process Migration Supports.
J. Comput. Sci. Technol., 2003

p-Jigsaw: a cluster-based Web server with cooperative caching support.
Concurr. Comput. Pract. Exp., 2003

Contention-Aware Communication Schedule for High-Speed Communication.
Clust. Comput., 2003

Functionality Adaptation: A Context-Aware Service Code Adaptation for Pervasive Computing Environments.
Proceedings of the 2003 IEEE / WIC International Conference on Web Intelligence, 2003

Lightweight Transparent Java Thread Migration for Distributed JVM.
Proceedings of the 32nd International Conference on Parallel Processing (ICPP 2003), 2003

Dynamic Component Composition for Functionality Adaptation in Pervasive Environments.
Proceedings of the 9th IEEE International Workshop on Future Trends of Distributed Computing Systems (FTDCS 2003), 2003

2002
Special section on Industrial information systems: progresses and perspectives in Pacific Rim.
J. Syst. Softw., 2002

Portable and Scalable Algorithm for Irregular All-to-All Communication.
J. Parallel Distributed Comput., 2002

Migrating-Home Protocol for Software Distributed Shared Memory.
J. Inf. Sci. Eng., 2002

Directed Point: a communication subsystem for commodity supercomputing with Gigabit Ethernet.
Future Gener. Comput. Syst., 2002

A New Asynchronous Parallel Evolutionary Algorithm for Function Optimization.
Proceedings of the Parallel Problem Solving from Nature, 2002

Workshop Introduction.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Load Balancing in Distributed Web Server Systems with Partial Document Replication.
Proceedings of the 31st International Conference on Parallel Processing (ICPP 2002), 2002

Efficient Global Object Space Support for Distributed JVM on Cluster.
Proceedings of the 31st International Conference on Parallel Processing (ICPP 2002), 2002

JESSICA2: A Distributed Java Virtual Machine with Transparent Thread Migration Support.
Proceedings of the 2002 IEEE International Conference on Cluster Computing (CLUSTER 2002), 2002

Socket Cloning for Cluster-Based Web Servers.
Proceedings of the 2002 IEEE International Conference on Cluster Computing (CLUSTER 2002), 2002

M-JavaMPI: A Java-MPI Binding with Process Migration Support.
Proceedings of the 2nd IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2002), 2002

2001
Distributed particle simulation method on adaptive collaborative system.
Future Gener. Comput. Syst., 2001

A Distributed Object Model for Solving Irregularly Structured Problems on Cluster.
Proceedings of the 2001 IEEE International Conference on Cluster Computing (CLUSTER 2001), 2001

Building a Scalable Web Server with Global Object Space Support on Heterogeneous Clusters.
Proceedings of the 2001 IEEE International Conference on Cluster Computing (CLUSTER 2001), 2001

Document Distribution Algorithm for Load Balancing on an Extensible Web Server Architecture.
Proceedings of the First IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2001), 2001

2000
JESSICA: Java-Enabled Single-System-Image Computing Architecture.
J. Parallel Distributed Comput., 2000

Delta Execution: A preemptive Java thread migration mechanism.
Clust. Comput., 2000

JUMP-DP: A Software DSM System with Low-Latency Communication Support.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2000

Contention-free Complete Exchange Algorithm on Clusters.
Proceedings of the 2000 IEEE International Conference on Cluster Computing (CLUSTER 2000), November 28th, 2000

1999
Resource Scaling Effects on MPP Performance: The STAP Benchmark Implications.
IEEE Trans. Parallel Distributed Syst., 1999

Designing SSI clusters with hierarchical checkpointing and single I/O space.
IEEE Concurr., 1999

JESSICA: Java-Enabled Single-System-Image Computing Architecture.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1999

A Migrating-Home Protocol for Implementing Scope Consistency Model on a Cluster of Workstations.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1999

Realistic Communication Model for Parallel Computing on Cluster.
Proceedings of the International Workshop on Cluster Computing (IWCC '99), 1999

ClusterProbe: An Open, Flexible and Scalable Cluster Monitoring Tool.
Proceedings of the International Workshop on Cluster Computing (IWCC '99), 1999

Push-Pull Messaging: A High-Performance Communication Mechanism for Commodity SMP Clusters.
Proceedings of the International Conference on Parallel Processing 1999, 1999

A Distributed Object-Oriented Method for Particle Simulations on Clusters.
Proceedings of the High-Performance Computing and Networking, 7th International Conference, 1999

1998
Parallel Algorithms for Perceptual Grouping on Distributed Memory Machines.
J. Parallel Distributed Comput., 1998

1997
Evaluating MPI Collective Communication on the SP2, T3D, and Paragon Multicomputers.
Proceedings of the 3rd IEEE Symposium on High-Performance Computer Architecture (HPCA '97), 1997

1996
High-performance computing for vision.
Proc. IEEE, 1996

Portable Message Passing Algorithms for Irregular All-to-all Communication.
Proceedings of the 16th International Conference on Distributed Computing Systems, 1996

1994
Scalable Data Parallel Implementations of Object Recognition Using Geometric Hashing.
J. Parallel Distributed Comput., 1994

Scalable parallel implementations of perceptual grouping on connection machine CM-5.
Proceedings of the 12th IAPR International Conference on Pattern Recognition, 1994

Scalable Data Parallel Implementations of Object Recognition on Connection Machine CM-.
Proceedings of the 27th Annual Hawaii International Conference on System Sciences (HICSS-27), 1994

1993
Heterogeneous Computing: Challenges and Opportunities.
Computer, 1993

1992
An architecture for tree search based vector quantization for single chip implementation.
Proceedings of the Application Specific Array Processors, 1992


  Loading...