Bingsheng He
According to our database, Bingsheng He authored at least 246 papers between 1986 and 2019.
Homepage: ihome.ust.hk
Bibliography
2019
Big Data and Exascale Computing.
Proceedings of the Encyclopedia of Big Data Technologies, 2019
Search and Query Accelerators.
Proceedings of the Encyclopedia of Big Data Technologies, 2019
GPU-Based Hardware Platforms.
Proceedings of the Encyclopedia of Big Data Technologies, 2019
Storage Technologies for Big Data.
Proceedings of the Encyclopedia of Big Data Technologies, 2019
Emerging Hardware Technologies.
Proceedings of the Encyclopedia of Big Data Technologies, 2019
Fairness-Efficiency Allocation of CPU-GPU Heterogeneous Resources.
IEEE Trans. Services Computing, 2019
Privacy Regulation Aware Process Mapping in Geo-Distributed Cloud Data Centers.
IEEE Trans. Parallel Distrib. Syst., 2019
CGraph: A Distributed Storage and Processing System for Concurrent Iterative Graph Analysis Jobs.
TOS, 2019
Guest Editors' Introduction: Special Issue on Big Data Systems on Emerging Architectures.
IEEE Trans. Big Data, 2019
Supporting Superpages and Lightweight Page Migration in Hybrid Memory Systems.
TACO, 2019
A Survey on Graph Processing Accelerators: Challenges and Opportunities.
J. Comput. Sci. Technol., 2019
BriskStream: Scaling Data Stream Processing on Shared-Memory Multicore Architectures.
Proceedings of the 2019 International Conference on Management of Data, 2019
Incorporating Probabilistic Optimizations for Resource Provisioning of Data Processing Workflows.
Proceedings of the 48th International Conference on Parallel Processing, 2019
Efficient Multi-Class Probabilistic SVMs on GPUs.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019
Aucher: Multimodal Queries on Live Audio Streams in Real-Time.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019
Revisiting Hash Join on Graphics Processors: A Decade Later.
Proceedings of the 35th IEEE International Conference on Data Engineering Workshops, 2019
DiGraph: An Efficient Path-based Iterative Directed Graph Processing System on Multiple GPUs.
Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 2019
2018
Fair Resource Allocation for Data-Intensive Computing in the Cloud.
IEEE Trans. Services Computing, 2018
Efficient Disk-Based Directed Graph Processing: A Strongly Connected Component Approach.
IEEE Trans. Parallel Distrib. Syst., 2018
Scalable GPU Virtualization with Dynamic Sharing of Graphics Memory Space.
IEEE Trans. Parallel Distrib. Syst., 2018
Long-Term Multi-Resource Fairness for Pay-as-you Use Computing Systems.
IEEE Trans. Parallel Distrib. Syst., 2018
Frog: Asynchronous Graph Processing on GPU with Hybrid Coloring Model.
IEEE Trans. Knowl. Data Eng., 2018
Towards Efficient Resource Allocation for Heterogeneous Workloads in IaaS Clouds.
IEEE Trans. Cloud Computing, 2018
JouleMR: Towards Cost-Effective and Green-Aware Data Processing Frameworks.
IEEE Trans. Big Data, 2018
Layer-Centric Memory Reuse and Data Migration for Extreme-Scale Deep Learning on Many-Core Architectures.
TACO, 2018
Many-core needs fine-grained scheduling: A case study of query processing on Intel Xeon Phi processors.
J. Parallel Distrib. Comput., 2018
ThunderSVM: A Fast SVM Library on GPUs and CPUs.
J. Mach. Learn. Res., 2018
Database Architectures for Modern Hardware (Dagstuhl Seminar 18251).
Dagstuhl Reports, 2018
A class of ADMM-based algorithms for three-block separable convex programming.
Comp. Opt. and Appl., 2018
gMig: Efficient GPU Live Migration Optimized by Software Dirty Page for Full Virtualization.
Proceedings of the 14th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, 2018
CGraph: A Correlations-aware Approach for Efficient Concurrent Iterative Graph Processing.
Proceedings of the 2018 USENIX Annual Technical Conference, 2018
vSensor: leveraging fixed-workload snippets of programs for performance variance detection.
Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2018
G-NET: Effective GPU Sharing in NFV Systems.
Proceedings of the 15th USENIX Symposium on Networked Systems Design and Implementation, 2018
Efficient Gradient Boosted Decision Tree Training on GPUs.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018
Energy-Efficient Speculative Execution using Advanced Reservation for Heterogeneous Clusters.
Proceedings of the 47th International Conference on Parallel Processing, 2018
GLP4NN: A Convergence-invariant and Network-agnostic Lightweight Parallelization Framework for Deep Neural Networks on Modern GPUs.
Proceedings of the 47th International Conference on Parallel Processing, 2018
Query Processing on OpenCL-Based FPGAs: Challenges and Opportunities.
Proceedings of the 24th IEEE International Conference on Parallel and Distributed Systems, 2018
RTSI: An Index Structure for Multi-Modal Real-Time Search on Live Audio Streaming Services.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018
Hebe: An Order-Oblivious and High-Performance Execution Scheme for Conjunctive Predicates.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018
FCN-engine: accelerating deconvolutional layers in classic CNN processors.
Proceedings of the International Conference on Computer-Aided Design, 2018
Efficient Support Vector Machine Training Algorithm on GPUs.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
An efficient graph accelerator with parallel data conflict management.
Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques, 2018
Towards concurrency race debugging: an integrated approach for constraint solving and dynamic slicing.
Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques, 2018
2017
A distributed in-memory key-value store system on heterogeneous CPU-GPU cluster.
VLDB J., 2017
Multikernel Data Partitioning With Channel on OpenCL-Based FPGAs.
IEEE Trans. VLSI Syst., 2017
A Variation-Aware Adaptive Fuzzy Control System for Thermal Management of Microprocessors.
IEEE Trans. VLSI Syst., 2017
A Declarative Optimization Engine for Resource Provisioning of Scientific Workflows in Geo-Distributed Clouds.
IEEE Trans. Parallel Distrib. Syst., 2017
Understanding Co-Running Behaviors on Integrated CPU/GPU Architectures.
IEEE Trans. Parallel Distrib. Syst., 2017
Building an Efficient Put-Intensive Key-Value Store with Skip-Tree.
IEEE Trans. Parallel Distrib. Syst., 2017
Analysis of Minimum Interaction Time for Continuous Distributed Interactive Computing.
IEEE Trans. Parallel Distrib. Syst., 2017
QoS-Aware Resource Allocation for Video Transcoding in Clouds.
IEEE Trans. Circuits Syst. Video Techn., 2017
A Hybrid Logic Block Architecture in FPGA for Holistic Efficiency.
IEEE Trans. on Circuits and Systems, 2017
Network Performance Aware Optimizations on IaaS Clouds.
IEEE Trans. Computers, 2017
Accelerating Dynamic Graph Analytics on GPUs.
PVLDB, 2017
Convergence Rate Analysis for the Alternating Direction Method of Multipliers with a Substitution Procedure for Separable Convex Programming.
Math. Oper. Res., 2017
On the Iteration Complexity of Some Projection Methods for Monotone Linear Variational Inequalities.
J. Optimization Theory and Applications, 2017
An Algorithmic Framework of Generalized Primal-Dual Hybrid Gradient Methods for Saddle Point Problems.
Journal of Mathematical Imaging and Vision, 2017
Efficient process mapping in geo-distributed cloud data centers.
Proceedings of the International Conference for High Performance Computing, 2017
Dynamic module partitioning for library based placement on heterogeneous FPGAs.
Proceedings of the 23rd IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, 2017
Hardware/software cooperative caching for hybrid DRAM/NVM memory architectures.
Proceedings of the International Conference on Supercomputing, 2017
Multi-objective Optimizations in Geo-Distributed Data Analytics Systems.
Proceedings of the 23rd IEEE International Conference on Parallel and Distributed Systems, 2017
Multi-Query Optimization for Complex Event Processing in SAP ESP.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017
DIDO: Dynamic Pipelines for In-Memory Key-Value Stores on Coupled CPU-GPU Architectures.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017
Revisiting the Design of Data Stream Processing Systems on Multi-Core Processors.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017
AdaStorm: Resource Efficient Storm with Adaptive Configuration.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017
Data Management Systems on Future Hardware: Challenges and Opportunities.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017
On Achieving Efficient Data Transfer for Graph Processing in Geo-Distributed Datacenters.
Proceedings of the 37th IEEE International Conference on Distributed Computing Systems, 2017
COMBA: A comprehensive model-based analysis framework for high level synthesis of real applications.
Proceedings of the 2017 IEEE/ACM International Conference on Computer-Aided Design, 2017
A novel two-stage modular multiplier based on racetrack memory for asymmetric cryptography.
Proceedings of the 2017 IEEE/ACM International Conference on Computer-Aided Design, 2017
Dynamic Partitioning for Library based Placement on Heterogeneous FPGAs (Abstract Only).
Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2017
Dynamic Module Partitioning for Library Based Placement on Heterogeneous FPGAs.
Proceedings of the 25th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2017
A Study of Main-Memory Hash Joins on Manycore Processor: A Case with Intel Knights Landing Architecture.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017
FinePar: irregularity-aware fine-grained workload partitioning on integrated architectures.
Proceedings of the 2017 International Symposium on Code Generation and Optimization, 2017
2016
Decentralized Thermal-Aware Task Scheduling for Large-Scale Many-Core Systems.
IEEE Trans. VLSI Syst., 2016
Dynamic Job Ordering and Slot Configurations for MapReduce Workloads.
IEEE Trans. Services Computing, 2016
Melia: A MapReduce Framework on OpenCL-Based FPGAs.
IEEE Trans. Parallel Distrib. Syst., 2016
F2C: Enabling Fair and Fine-Grained Resource Sharing in Multi-Tenant IaaS Clouds.
IEEE Trans. Parallel Distrib. Syst., 2016
A Performance Debugging Framework for Unnecessary Lock Contentions with Record/Replay Techniques.
IEEE Trans. Parallel Distrib. Syst., 2016
Library-Based Placement and Routing in FPGAs with Support of Partial Reconfiguration.
ACM Trans. Design Autom. Electr. Syst., 2016
Monetary Cost Optimizations for Hosting Workflow-as-a-Service in IaaS Clouds.
IEEE Trans. Cloud Computing, 2016
Rotated Logging Storage Architectures for Data Centers: Models and Optimizations.
IEEE Trans. Computers, 2016
NV-Tree: A Consistent and Workload-Adaptive Tree Structure for Non-Volatile Memory.
IEEE Trans. Computers, 2016
Rank-Aware Dynamic Migrations and Adaptive Demotions for DRAM Power Management.
IEEE Trans. Computers, 2016
Convergence Study on the Symmetric Version of ADMM with Larger Step Sizes.
SIAM J. Imaging Sciences, 2016
The direct extension of ADMM for multi-block convex minimization problems is not necessarily convergent.
Math. Program., 2016
On the Proximal Jacobian Decomposition of ALM for Multiple-Block Separable Convex Minimization Problems and Its Relationship to ADMM.
J. Sci. Comput., 2016
gScale: Scaling up GPU Virtualization with Dynamic Sharing of Graphics Memory Space.
Proceedings of the 2016 USENIX Annual Technical Conference, 2016
GPL: A GPU-based Pipelined Query Processing Engine.
Proceedings of the 2016 International Conference on Management of Data, 2016
Efficient Query Processing on Manycore Architectures: A Case Study with Intel Xeon Phi Processor.
Proceedings of the 2016 International Conference on Management of Data, 2016
A Study of Sorting Algorithms on Approximate Memory.
Proceedings of the 2016 International Conference on Management of Data, 2016
Elastic multi-resource fairness: balancing fairness and efficiency in coupled CPU-GPU architectures.
Proceedings of the International Conference for High Performance Computing, 2016
VColor: A practical vertex-cut based approach for coloring large graphs.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016
Not All Joules are Equal: Towards Energy-Efficient and Green-Aware Data Processing Frameworks.
Proceedings of the 2016 IEEE International Conference on Cloud Engineering, 2016
A Study of Big Data Computing Platforms: Fairness and Energy Consumption.
Proceedings of the 2016 IEEE International Conference on Cloud Engineering Workshop, 2016
A performance analysis framework for optimizing OpenCL applications on FPGAs.
Proceedings of the 2016 IEEE International Symposium on High Performance Computer Architecture, 2016
Modular Placement for Interposer based Multi-FPGA Systems.
Proceedings of the 26th edition on Great Lakes Symposium on VLSI, 2016
Relational query processing on OpenCL-based FPGAs.
Proceedings of the 26th International Conference on Field Programmable Logic and Applications, 2016
Accelerating Database Query Processing on OpenCL-based FPGAs (Abstract Only).
Proceedings of the 2016 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2016
A discrete thermal controller for chip-multiprocessors.
Proceedings of the 2016 Design, Automation & Test in Europe Conference & Exhibition, 2016
A racetrack memory based in-memory booth multiplier for cryptography application.
Proceedings of the 21st Asia and South Pacific Design Automation Conference, 2016
2015
A Combined SDC-SDF Architecture for Normal I/O Pipelined Radix-2 FFT.
IEEE Trans. VLSI Syst., 2015
MrPhi: An Optimized MapReduce Framework on Intel Xeon Phi Coprocessors.
IEEE Trans. Parallel Distrib. Syst., 2015
Hotplug or Ballooning: A Comparative Study on Dynamic Memory Management Techniques for Virtual Machines.
IEEE Trans. Parallel Distrib. Syst., 2015
VMbuddies: Coordinating Live Migration of Multi-Tier Applications in Cloud Environments.
IEEE Trans. Parallel Distrib. Syst., 2015
Willow: Saving Data Center Network Energy for Network-Limited Flows.
IEEE Trans. Parallel Distrib. Syst., 2015
Improving Update-Intensive Workloads on Flash Disks through Exploiting Multi-Chip Parallelism.
IEEE Trans. Parallel Distrib. Syst., 2015
Network Performance Aware MPI Collective Communication Operations in the Cloud.
IEEE Trans. Parallel Distrib. Syst., 2015
Sensor Placement and Measurement of Wind for Water Quality Studies in Urban Reservoirs.
TOSN, 2015
PCMLogging: Optimizing Transaction Logging and Recovery Performance with PCM.
IEEE Trans. Knowl. Data Eng., 2015
Guest Editors' Introduction: Special Issue on Economics and Market Mechanisms for Cloud Computing.
IEEE Trans. Cloud Computing, 2015
Synergy of Dynamic Frequency Scaling and Demotion on DRAM Power Management: Models and Optimizations.
IEEE Trans. Computers, 2015
On Full Jacobian Decomposition of the Augmented Lagrangian Method for Separable Convex Programming.
SIAM Journal on Optimization, 2015
Improving Main Memory Hash Joins on Intel Xeon Phi Processors: An Experimental Approach.
PVLDB, 2015
On non-ergodic convergence rate of Douglas-Rachford alternating direction method of multipliers.
Numerische Mathematik, 2015
Generalized alternating direction method of multipliers: new theoretical insights and applications.
Math. Program. Comput., 2015
On the convergence rate of Douglas-Rachford operator splitting method.
Math. Program., 2015
Monetary cost optimizations for MPI-based HPC applications on Amazon clouds: checkpoints and replicated execution.
Proceedings of the International Conference for High Performance Computing, 2015
Optimization of asynchronous graph processing on GPU with hybrid coloring model.
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015
Hierarchical Library Based Power Estimator for Versatile FPGAs.
Proceedings of the IEEE 9th International Symposium on Embedded Multicore/Manycore Systems-on-Chip, 2015
To Corun, or Not to Corun: A Performance Study on Integrated Architectures.
Proceedings of the 23rd IEEE International Symposium on Modeling, 2015
Real-Time In-Memory Checkpointing for Future Hybrid Memory Systems.
Proceedings of the 29th ACM on International Conference on Supercomputing, 2015
A Declarative Optimization Engine for Resource Provisioning of Scientific Workflows in IaaS Clouds.
Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, 2015
A study of data partitioning on OpenCL-based FPGAs.
Proceedings of the 25th International Conference on Field Programmable Logic and Applications, 2015
Improving Data Partitioning Performance on OpenCL-Based FPGAs.
Proceedings of the 23rd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2015
NV-Tree: Reducing Consistency Cost for NVM-based Single Level Systems.
Proceedings of the 13th USENIX Conference on File and Storage Technologies, 2015
Fast Subgraph Matching on Large Graphs using Graphics Processors.
Proceedings of the Database Systems for Advanced Applications, 2015
Energy-Efficient Query Processing on Embedded CPU-GPU Architectures.
Proceedings of the 11th International Workshop on Data Management on New Hardware, 2015
Gemini: An Adaptive Performance-Fairness Scheduler for Data-Intensive Cluster Computing.
Proceedings of the 7th IEEE International Conference on Cloud Computing Technology and Science, 2015
On performance debugging of unnecessary lock contentions on multicore processors: a replay-based approach.
Proceedings of the 13th Annual IEEE/ACM International Symposium on Code Generation and Optimization, 2015
2014
Medusa: Simplified Graph Processing on GPUs.
IEEE Trans. Parallel Distrib. Syst., 2014
Kernelet: High-Throughput GPU Kernel Executions with Dynamic Slicing and Scheduling.
IEEE Trans. Parallel Distrib. Syst., 2014
Transformation-Based Monetary Cost Optimizations for Workflows in the Cloud.
IEEE Trans. Cloud Computing, 2014
DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters.
IEEE Trans. Cloud Computing, 2014
FD-Buffer: A Cost-Based Adaptive Buffer Replacement Algorithm for Flash-Memory Devices.
IEEE Trans. Computers, 2014
Medusa: A Parallel Graph Processing System on Graphics Processors.
SIGMOD Record, 2014
A Strictly Contractive Peaceman-Rachford Splitting Method for Convex Programming.
SIAM Journal on Optimization, 2014
On the Convergence of Primal-Dual Hybrid Gradient Algorithm.
SIAM J. Imaging Sciences, 2014
In-Cache Query Co-Processing on Coupled CPU-GPU Architectures.
PVLDB, 2014
When Data Management Systems Meet Approximate Hardware: Challenges and Opportunities.
PVLDB, 2014
Inexact Alternating-Direction-Based Contraction Methods for Separable Linearly Constrained Convex Optimization.
J. Optimization Theory and Applications, 2014
A Survey of Resource Management in Multi-Tier Web Applications.
IEEE Communications Surveys and Tutorials, 2014
Customized proximal point algorithms for linearly constrained convex minimization and saddle-point problems: a unified approach.
Comp. Opt. and Appl., 2014
On the O(1/t) convergence rate of the projection and contraction methods for variational inequalities with Lipschitz continuous monotone operators.
Comp. Opt. and Appl., 2014
Demo Abstract: Wind measurements for water quality studies in urban reservoirs.
Proceedings of the Eleventh Annual IEEE International Conference on Sensing, 2014
Reciprocal Resource Fairness: Towards Cooperative Multiple-Resource Fair Sharing in IaaS Clouds.
Proceedings of the International Conference for High Performance Computing, 2014
Finding Constant from Change: Revisiting Network Performance Aware Optimizations on IaaS Clouds.
Proceedings of the International Conference for High Performance Computing, 2014
Thermal-aware task scheduling for 3D-network-on-chip: A Bottom-to-Top scheme.
Proceedings of the 2014 International Symposium on Integrated Circuits (ISIC), 2014
Optimal sensor placement and measurement of wind for water quality studies in urban reservoirs.
Proceedings of the IPSN'14, 2014
Pipelined Compaction for the LSM-Tree.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014
Long-term resource fairness: towards economic fairness on pay-as-you-use computing systems.
Proceedings of the 2014 International Conference on Supercomputing, 2014
Towards multi-resource physical machine provisioning for IaaS clouds.
Proceedings of the IEEE International Conference on Communications, 2014
A novel authenticated multiparty key agreement for private cloud.
Proceedings of the IEEE International Conference on Communications, 2014
Towards automatic partial reconfiguration in FPGAs.
Proceedings of the 2014 International Conference on Field-Programmable Technology, 2014
Simplified Resource Provisioning for Workflows in IaaS Clouds.
Proceedings of the IEEE 6th International Conference on Cloud Computing Technology and Science, 2014
Towards Economic Fairness for Big Data Processing in Pay-as-You-Go Cloud Computing.
Proceedings of the IEEE 6th International Conference on Cloud Computing Technology and Science, 2014
GPU-Accelerated Cloud Computing for Data-Intensive Applications.
Proceedings of the Cloud Computing for Data-Intensive Applications, 2014
Network Performance Aware Graph Partitioning for Large Graph Processing Systems in the Cloud.
Proceedings of the Large Scale and Big Data - Processing and Management, 2014
2013
Parallel Graph Processing on Graphics Processors Made Easy.
PVLDB, 2013
OmniDB: Towards Portable and Efficient Query Processing on Parallel CPU/GPU Architectures.
PVLDB, 2013
Revisiting Co-Processing for Hash Joins on the Coupled CPU-GPU Architecture.
PVLDB, 2013
Handling partitioning skew in MapReduce using LEEN.
Peer-to-Peer Networking and Applications, 2013
Forward-backward-based descent methods for composite variational inequalities.
Optimization Methods and Software, 2013
A customized proximal point algorithm for convex minimization with linear constraints.
Comp. Opt. and Appl., 2013
Simulation studies of viral advertisement diffusion on multi-GPU.
Proceedings of the Winter Simulations Conference: Simulation Making Decisions in a Complex World, 2013
A Framework for Analyzing Monetary Cost of Database Systems in the Cloud.
Proceedings of the Web-Age Information Management - 14th International Conference, 2013
Brief announcement: on minimum interaction time for continuous distributed interactive computing.
Proceedings of the ACM Symposium on Principles of Distributed Computing, 2013
Spectral Decomposition for Optimal Graph Index Prediction.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2013
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads.
Proceedings of the Euro-Par 2013 Parallel Processing, 2013
Simulation of Information Propagation over Complex Networks: Performance Studies on Multi-GPU.
Proceedings of the 17th IEEE/ACM International Symposium on Distributed Simulation and Real Time Applications, 2013
Dynamic slot allocation technique for MapReduce clusters.
Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013
Towards GPU-Accelerated Large-Scale Graph Processing in the Cloud.
Proceedings of the IEEE 5th International Conference on Cloud Computing Technology and Science, 2013
Green Databases Through Integration of Renewable Energy.
Proceedings of the CIDR 2013, 2013
Optimizing the MapReduce framework on Intel Xeon Phi coprocessor.
Proceedings of the 2013 IEEE International Conference on Big Data, 2013
2012
Flag Commit: Supporting Efficient Transaction Recovery in Flash-Based DBMSs.
IEEE Trans. Knowl. Data Eng., 2012
On the O(1/n) Convergence Rate of the Douglas-Rachford Alternating Direction Method.
SIAM J. Numerical Analysis, 2012
Alternating Direction Method with Gaussian Back Substitution for Separable Convex Programming.
SIAM Journal on Optimization, 2012
Convergence Analysis of Primal-Dual Algorithms for a Saddle-Point Problem: From Contraction Perspective.
SIAM J. Imaging Sciences, 2012
HPC Simulations of Information Propagation Over Social Networks.
Proceedings of the International Conference on Computational Science, 2012
An Accelerated Inexact Proximal Point Algorithm for Convex Minimization.
J. Optimization Theory and Applications, 2012
Proximal-like contraction methods for monotone variational inequalities in a unified framework II: general methods and numerical experiments.
Comp. Opt. and Appl., 2012
Proximal-like contraction methods for monotone variational inequalities in a unified framework I: Effective quadruplet and primary methods.
Comp. Opt. and Appl., 2012
RAMZzz: rank-aware dram power management with dynamic migrations and demotions.
Proceedings of the SC Conference on High Performance Computing Networking, 2012
An overview of Medusa: simplified graph processing on GPUs.
Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012
An overview of CMPI: network performance aware MPI in the cloud.
Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012
Speedup for Multi-Level Parallel Computing.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012
GPGPU for Real-Time Data Analytics.
Proceedings of the 18th IEEE International Conference on Parallel and Distributed Systems, 2012
Green-aware workload scheduling in geographically distributed data centers.
Proceedings of the 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings, 2012
Improving large graph processing on partitioned graphs in the cloud.
Proceedings of the ACM Symposium on Cloud Computing, SOCC '12, 2012
A MapReduce Based Framework for Heterogeneous Processing Element Cluster Environments.
Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012
Maestro: Replica-Aware Map Scheduling for MapReduce.
Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012
2011
Mars: Accelerating MapReduce with Graphics Processors.
IEEE Trans. Parallel Distrib. Syst., 2011
Solving Large-Scale Least Squares Semidefinite Programming by Alternating Direction Methods.
SIAM J. Matrix Analysis Applications, 2011
High-throughput transaction executions on graphics processors.
PVLDB, 2011
GPU-Assisted Buffer Management.
Proceedings of the International Conference on Computational Science, 2011
GViewer: GPU-Accelerated Graph Visualization and Mining.
Proceedings of the Social Informatics - Third International Conference, SocInfo 2011, 2011
Operation-aware buffer management in flash-based systems.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011
Adaptive Disk I/O Scheduling for MapReduce in Virtualized Environment.
Proceedings of the International Conference on Parallel Processing, 2011
PCMLogging: reducing transaction logging overhead with PCM.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011
Towards Pay-As-You-Consume Cloud Computing.
Proceedings of the IEEE International Conference on Services Computing, 2011
2010
Tree Indexing on Solid State Drives.
PVLDB, 2010
Database Compression on Graphics Processors.
PVLDB, 2010
Steplengths in the extragradient type methods.
J. Computational Applied Mathematics, 2010
Solving a class of constrained 'black-box' inverse variational inequalities.
European Journal of Operational Research, 2010
Large graph processing in the cloud.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010
Distributed Systems Meet Economics: Pricing in the Cloud.
Proceedings of the 2nd USENIX Workshop on Hot Topics in Cloud Computing, 2010
Supporting extended precision on graphics processors.
Proceedings of the Sixth International Workshop on Data Management on New Hardware, 2010
LEEN: Locality/Fairness-Aware Key Partitioning for MapReduce in the Cloud.
Proceedings of the Cloud Computing, Second International Conference, 2010
Comet: batched stream processing for data intensive distributed computing.
Proceedings of the 1st ACM Symposium on Cloud Computing, 2010
FD-buffer: a buffer manager for databases on flash disks.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010
2009
Relational query coprocessing on graphics processors.
ACM Trans. Database Syst., 2009
Self-adaptive projection method for co-coercive variational inequalities.
European Journal of Operational Research, 2009
Parallel splitting augmented Lagrangian methods for monotone structured variational inequalities.
Comp. Opt. and Appl., 2009
Stack-based parallel recursion on graphics processors.
Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009
Tree Indexing on Flash Disks.
Proceedings of the 25th International Conference on Data Engineering, 2009
Wave Computing in the Cloud.
Proceedings of HotOS'09: 12th Workshop on Hot Topics in Operating Systems, 2009
A Uniform Framework for Ad-Hoc Indexes to Answer Reachability Queries on Large Graphs.
Proceedings of the Database Systems for Advanced Applications, 2009
Frequent itemset mining on graphics processors.
Proceedings of the Fifth International Workshop on Data Management on New Hardware, 2009
2008
Cache-oblivious databases: Limitations and opportunities.
ACM Trans. Database Syst., 2008
Relational joins on graphics processors.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008
Mars: a MapReduce framework on graphics processors.
Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques, 2008
2007
Adaptive Index Utilization in Memory-Resident Structural Joins.
IEEE Trans. Knowl. Data Eng., 2007
An inexact logarithmic-quadratic proximal augmented Lagrangian method for a class of constrained variational inequalities.
Math. Meth. of OR, 2007
EaseDB: a cache-oblivious in-memory query processor.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007
GPUQP: query coprocessing using graphics processors.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007
Efficient gather and scatter operations on graphics processors.
Proceedings of the ACM/IEEE Conference on High Performance Networking and Computing, 2007
In-memory grid files on graphics processors.
Proceedings of the Workshop on Data Management on New Hardware, 2007
A general framework for improving query processing performance on multilevel memory hierarchies.
Proceedings of the Workshop on Data Management on New Hardware, 2007
Cache-Oblivious Query Processing.
Proceedings of the CIDR 2007, 2007
2006
A Logarithmic-Quadratic Proximal Prediction-Correction Method for Structured Monotone Variational Inequalities.
Comp. Opt. and Appl., 2006
A Quantitative Summary of XML Structures.
Proceedings of the Conceptual Modeling, 2006
Cache-oblivious nested-loop joins.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006
2005
A Relaxed Approximate Proximal Point Algorithm.
Annals OR, 2005
Cache-Conscious Automata for XML Filtering.
Proceedings of the 21st International Conference on Data Engineering, 2005
2004
A modified augmented Lagrangian method for a class of monotone variational inequalities.
European Journal of Operational Research, 2004
Comparison of Two Kinds of Prediction-Correction Methods for Monotone Variational Inequalities.
Comp. Opt. and Appl., 2004
The HKUST Frog Pond - A Case Study of Sensory Data Analysis.
Proceedings of the Network and Parallel Computing, IFIP International Conference, 2004
Accurate Emulation of Wireless Sensor Networks.
Proceedings of the Network and Parallel Computing, IFIP International Conference, 2004
MEADOWS: modeling, emulation, and analysis of data of wireless sensor networks.
Proceedings of the 1st Workshop on Data Management for Sensor Networks, 2004
2003
Self-adaptive operator splitting methods for monotone variational inequalities.
Numerische Mathematik, 2003
2002
A new inexact alternating directions method for monotone variational inequalities.
Math. Program., 2002
2000
A neural network model for monotone linear asymmetric variational inequalities.
IEEE Trans. Neural Netw. Learning Syst., 2000
A modified alternating direction method for convex minimization problems.
Appl. Math. Lett., 2000
1999
Inexact implicit methods for monotone general variational inequalities.
Math. Program., 1999
1998
Some convergence properties of a method of multipliers for linearly constrained monotone variational inequalities.
Oper. Res. Lett., 1998
1994
A new method for a class of linear variational inequalities.
Math. Program., 1994
1986
Grosse nichtlineare Optimierungsprobleme mit "Box-Constraints" und Gleichungsrestriktionen.
PhD thesis, 1986