Douglas Thain
Orcid: 0000-0001-5218-1956
According to our database, Douglas Thain authored at least 156 papers between 2000 and 2026.
  
  
Bibliography
  2026
  2025
xGFabric: Coupling Sensor Networks and HPC Facilities with Private 5G Wireless Networks for Real-Time Digital Agriculture.
    
  
    CoRR, September, 2025
    
  
Scaling Up Throughput-oriented LLM Inference Applications on Heterogeneous Opportunistic GPU Clusters with Pervasive Context Management.
    
  
    CoRR, September, 2025
    
  
Liberating the Data Aware Scheduler to Achieve Locality in Layered Scientific Workflow Systems.
    
  
    Proceedings of the IEEE International Conference on eScience, 2025
    
  
Backpacks for Notebooks: Enabling Containerized Notebook Workflows in Distributed Environments.
    
  
    Proceedings of the IEEE International Conference on eScience, 2025
    
  
  2024
Reshaping High Energy Physics Applications for Near-Interactive Execution Using TaskVine.
    
  
    Proceedings of the International Conference for High Performance Computing, 2024
    
  
Shepherd: Seamless Integration of Service Workflows into Task-Based Workflows through Log Monitoring.
    
  
    Proceedings of the SC24-W: Workshops of the International Conference for High Performance Computing, 2024
    
  
Adaptive Task-Oriented Resource Allocation for Large Dynamic Workflows on Opportunistic Resources.
    
  
    Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024
    
  
Accelerating Function-Centric Applications by Discovering, Distributing, and Retaining Reusable Context in Workflow Systems.
    
  
    Proceedings of the 33rd International Symposium on High-Performance Parallel and Distributed Computing, 2024
    
  
  2023
    IEEE Trans. Parallel Distributed Syst., May, 2023
    
  
    Comput. Sci. Eng., 2023
    
  
    Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
    
  
    Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
    
  
    Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, 2023
    
  
  2022
Dynamic Task Shaping for High Throughput Data Analysis Applications in High Energy Physics.
    
  
    Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022
    
  
PONCHO: Dynamic Package Synthesis for Distributed and Serverless Python Applications.
    
  
    Proceedings of the HiPS@HPDC 2022: Proceedings of the 2nd Workshop on High Performance Serverless Computing, 2022
    
  
    Proceedings of the 18th IEEE International Conference on e-Science, 2022
    
  
  2021
Workflows Community Summit: Advancing the State-of-the-art of Scientific Workflows Management Systems Research and Development.
    
  
    CoRR, 2021
    
  
    CoRR, 2021
    
  
    Proceedings of the 2021 IEEE Workshop on Workflows in Support of Large-Scale Science (WORKS), 2021
    
  
Not All Tasks Are Created Equal: Adaptive Resource Allocation for Heterogeneous Tasks in Dynamic Workflows.
    
  
    Proceedings of the 2021 IEEE Workshop on Workflows in Support of Large-Scale Science (WORKS), 2021
    
  
Emerging Frameworks for Advancing Scientific Workflows Research, Development, and Education.
    
  
    Proceedings of the 2021 IEEE Workshop on Workflows in Support of Large-Scale Science (WORKS), 2021
    
  
Lightweight Function Monitors for Fine-Grained Management in Large Scale Python Applications.
    
  
    Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021
    
  
An Empirical Study of Package Dependencies and Lifetimes in Binder Python Containers.
    
  
    Proceedings of the 17th IEEE International Conference on eScience, 2021
    
  
  2020
    Proceedings of the PEARC '20: Practice and Experience in Advanced Research Computing, 2020
    
  
    Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020
    
  
    Proceedings of the IEEE International Conference on Cluster Computing, 2020
    
  
  2019
    Proceedings of the Practice and Experience in Advanced Research Computing on Rise of the Machines (learning), 2019
    
  
    Proceedings of the 15th International Conference on eScience, 2019
    
  
  2018
    IEEE Trans. Parallel Distributed Syst., 2018
    
  
Combining Static and Dynamic Storage Management for Data Intensive Scientific Workflows.
    
  
    IEEE Trans. Parallel Distributed Syst., 2018
    
  
    Proceedings of the Practice and Experience on Advanced Research Computing, 2018
    
  
    Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018
    
  
    Proceedings of the International Conference for High Performance Computing, 2018
    
  
    Proceedings of the 2018 IEEE International Conference on Cloud Engineering, 2018
    
  
    Proceedings of the 2018 IEEE International Conference on Cloud Engineering, 2018
    
  
    Proceedings of the 9th Workshop on Scientific Cloud Computing, 2018
    
  
    Proceedings of the 9th Workshop on Scientific Cloud Computing, 2018
    
  
    Proceedings of the 14th IEEE International Conference on e-Science, 2018
    
  
    Proceedings of the 14th IEEE International Conference on e-Science, 2018
    
  
    Proceedings of the ACM Symposium on Cloud Computing, 2018
    
  
  2017
Designing Self-Tuning Split-Map-Merge Applications for High Cost-Efficiency in the Cloud.
    
  
    IEEE Trans. Cloud Comput., 2017
    
  
    Concurr. Comput. Pract. Exp., 2017
    
  
Balancing push and pull in Confuga, an active storage cluster file system for scientific workflows.
    
  
    Concurr. Comput. Pract. Exp., 2017
    
  
    Proceedings of the 2nd Joint International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems, 2017
    
  
    Proceedings of the 37th IEEE International Conference on Distributed Computing Systems, 2017
    
  
Facilitating the Reproducibility of Scientific Workflows with Execution Environment Specifications.
    
  
    Proceedings of the International Conference on Computational Science, 2017
    
  
Deploying High Throughput Scientific Workflows on Container Schedulers with Makeflow and Mesos.
    
  
    Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017
    
  
  2016
DiaPro: Unifying Dynamic Impact Analyses for Improved and Variable Cost-Effectiveness.
    
  
    ACM Trans. Softw. Eng. Methodol., 2016
    
  
    Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering, 2016
    
  
Conducting reproducible research with Umbrella: Tracking, creating, and preserving execution environments.
    
  
    Proceedings of the 12th IEEE International Conference on e-Science, 2016
    
  
    Proceedings of the 12th IEEE International Conference on e-Science, 2016
    
  
  2015
Accelerating Comparative Genomics Workflows in a Distributed Environment with Optimized Data Partitioning and Workflow Fusion.
    
  
    Scalable Comput. Pract. Exp., 2015
    
  
    J. Comput. Sci., 2015
    
  
    Comput. Sci. Eng., 2015
    
  
    Comput. Sci. Eng., 2015
    
  
    Proceedings of the 2nd Workshop on Visual Performance Analysis, 2015
    
  
Techniques for Preserving Scientific Software Executions: Preserve the Mess or Encourage Cleanliness?
    
  
    Proceedings of the 12th International Conference on Digital Preservation, 2015
    
  
Integrating Containers into Workflows: A Case Study Using Makeflow, Work Queue, and Docker.
    
  
    Proceedings of the 8th International Workshop on Virtualization Technologies in Distributed Computing, 2015
    
  
Umbrella: A Portable Environment Creator for Reproducible Computing on Clusters, Clouds, and Grids.
    
  
    Proceedings of the 8th International Workshop on Virtualization Technologies in Distributed Computing, 2015
    
  
Scaling Up Bioinformatics Workflows with Dynamic Job Expansion: A Case Study Using Galaxy and Makeflow.
    
  
    Proceedings of the 11th IEEE International Conference on e-Science, 2015
    
  
Scaling Data Intensive Physics Applications to 10k Cores on Non-dedicated Clusters with Lobster.
    
  
    Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015
    
  
    Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015
    
  
Balancing Thread-Level and Task-Level Parallelism for Data-Intensive Workloads on Clusters and Clouds.
    
  
    Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015
    
  
    Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
    
  
  2014
    J. Chem. Inf. Model., 2014
    
  
    Int. J. Bioinform. Res. Appl., 2014
    
  
Lessons Learned from an Experiment in Crowdsourcing Complex Citizen Engineering Tasks with Amazon Mechanical Turk.
    
  
    CoRR, 2014
    
  
    Concurr. Comput. Pract. Exp., 2014
    
  
    Proceedings of the 14th IEEE/ACM International Symposium on Cluster, 2014
    
  
Expanding Tasks of Logical Workflows Into Independent Workflows for Improved Scalability.
    
  
    Proceedings of the 14th IEEE/ACM International Symposium on Cluster, 2014
    
  
Accelerating Comparative Genomics Workflows in a Distributed Environment with Optimized Data Partitioning.
    
  
    Proceedings of the 14th IEEE/ACM International Symposium on Cluster, 2014
    
  
    Proceedings of the 2014 IEEE International Congress on Big Data, Anchorage, AK, USA, June 27, 2014
    
  
  2013
    Proceedings of WORKS 2013: 8th Workshop On Workflows in Support of Large-Scale Science, 2013
    
  
Automated packaging of bioinformatics workflows for portability and durability using makeflow.
    
  
    Proceedings of WORKS 2013: 8th Workshop On Workflows in Support of Large-Scale Science, 2013
    
  
    Proceedings of the 2013 International Workshop on Data-Intensive Scalable Computing Systems, 2013
    
  
    Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013
    
  
    Proceedings of the 13th IEEE/ACM International Symposium on Cluster, 2013
    
  
  2012
    Proceedings of the Handbook of Energy-Aware and Green Computing - Two Volume Set., 2012
    
  
    IEEE Trans. Parallel Distributed Syst., 2012
    
  
    Distributed Parallel Databases, 2012
    
  
    Concurr. Comput. Pract. Exp., 2012
    
  
Makeflow: a portable abstraction for data intensive computing on clusters, clouds, and grids.
    
  
    Proceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies, 2012
    
  
Shifting the bioinformatics computing paradigm: A case study in parallelizing genome annotation using MAKER and Work Queue.
    
  
    Proceedings of the IEEE 2nd International Conference on Computational Advances in Bio and Medical Sciences, 2012
    
  
A system for management of Computational Fluid Dynamics simulations for civil engineering.
    
  
    Proceedings of the 8th IEEE International Conference on E-Science, 2012
    
  
    Proceedings of the 8th IEEE International Conference on E-Science, 2012
    
  
    Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012
    
  
    Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012
    
  
  2011
    Concurr. Comput. Pract. Exp., 2011
    
  
    Proceedings of the IEEE Ninth International Conference on Dependable, 2011
    
  
    Proceedings of the IEEE 3rd International Conference on Cloud Computing Technology and Science, 2011
    
  
  2010
    IEEE Trans. Parallel Distributed Syst., 2010
    
  
    Concurr. Comput. Pract. Exp., 2010
    
  
Harnessing parallelism in multicore clusters with the All-Pairs, Wavefront, and Makeflow abstractions.
    
  
    Clust. Comput., 2010
    
  
    Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, 2010
    
  
Weaver: integrating distributed computing abstractions into scientific workflows using Python.
    
  
    Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, 2010
    
  
    Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, 2010
    
  
    Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, 2010
    
  
Environmentally Opportunistic Computing transforming the data center for economic and environmental sustainability.
    
  
    Proceedings of the International Green Computing Conference 2010, 2010
    
  
    Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010
    
  
    Proceedings of the Cloud Computing, Second International Conference, 2010
    
  
    Proceedings of the Cloud Computing, Second International Conference, 2010
    
  
  2009
    Softw. Pract. Exp., 2009
    
  
    J. Grid Comput., 2009
    
  
Experience with BXGrid: a data repository and computing grid for biometrics research.
    
  
    Clust. Comput., 2009
    
  
    Proceedings of the 2nd Workshop on Many-Task Computing on Grids and Supercomputers, 2009
    
  
Harnessing parallelism in multicore clusters with the all-pairs and wavefront abstractions.
    
  
    Proceedings of the 18th ACM International Symposium on High Performance Distributed Computing, 2009
    
  
    Proceedings of the 18th ACM International Symposium on High Performance Distributed Computing, 2009
    
  
Scheduling Grid workloads on multicore clusters to minimize energy and maximize performance.
    
  
    Proceedings of the 2009 10th IEEE/ACM International Conference on Grid Computing, 2009
    
  
    Proceedings of the Ad Hoc Networks, First International Conference, 2009
    
  
    Proceedings of the Scientific Data Management - Challenges, Technology, and Deployment., 2009
    
  
  2008
Biomolecular committor probability calculation enabled by processing in network storage.
    
  
    Parallel Comput., 2008
    
  
    Future Gener. Comput. Syst., 2008
    
  
    Proceedings of the 22nd Large Installation System Administration Conference, 2008
    
  
    Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008
    
  
    Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008
    
  
    Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008
    
  
    Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008
    
  
    Proceedings of the 17th International Symposium on High-Performance Distributed Computing (HPDC-17 2008), 2008
    
  
    Proceedings of the 9th IEEE/ACM International Conference on Grid Computing (Grid 2008), Tsukuba, Japan, September 29, 2008
    
  
    Proceedings of the Fourth International Conference on e-Science, 2008
    
  
    Proceedings of the Fourth International Conference on e-Science, 2008
    
  
  2007
    Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007
    
  
    Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007
    
  
    Proceedings of the 8th IEEE/ACM International Conference on Grid Computing (GRID 2007), 2007
    
  
  2006
    Concurr. Comput. Pract. Exp., 2006
    
  
    Concurr. Comput. Pract. Exp., 2006
    
  
    Proceedings of the 2006 ACM Workshop On Storage Security And Survivability, 2006
    
  
    Proceedings of the 12th International Conference on Parallel and Distributed Systems, 2006
    
  
    Proceedings of the 15th IEEE International Symposium on High Performance Distributed Computing, 2006
    
  
    Proceedings of the 7th IEEE/ACM International Conference on Grid Computing (GRID 2006), 2006
    
  
    Proceedings of the 7th IEEE/ACM International Conference on Grid Computing (GRID 2006), 2006
    
  
    Proceedings of the 7th IEEE/ACM International Conference on Grid Computing (GRID 2006), 2006
    
  
    Proceedings of the 2006 IEEE International Conference on Cluster Computing, 2006
    
  
    Proceedings of the ARCS 2006, 2006
    
  
  2005
    Scalable Comput. Pract. Exp., 2005
    
  
    Proceedings of the 3rd International IEEE Security in Storage Workshop (SISW 2005), 2005
    
  
    Proceedings of the ACM/IEEE SC2005 Conference on High Performance Networking and Computing, 2005
    
  
    Proceedings of the ACM/IEEE SC2005 Conference on High Performance Networking and Computing, 2005
    
  
    Proceedings of the 14th IEEE International Symposium on High Performance Distributed Computing, 2005
    
  
    Proceedings of the 14th IEEE International Symposium on High Performance Distributed Computing, 2005
    
  
  2004
    Proceedings of the 1st Symposium on Networked Systems Design and Implementation (NSDI 2004), 2004
    
  
    Proceedings of the Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition, 2004
    
  
  2003
    Proceedings of the 12th International Symposium on High-Performance Distributed Computing (HPDC-12 2003), 2003
    
  
    Proceedings of the 12th International Symposium on High-Performance Distributed Computing (HPDC-12 2003), 2003
    
  
    Proceedings of the 3rd IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2003), 2003
    
  
  2002
    Proceedings of the 9th Pacific Rim International Symposium on Dependable Computing (PRDC 2002), 2002
    
  
    Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing (HPDC-11 2002), 2002
    
  
  2001
    Clust. Comput., 2001
    
  
    Proceedings of the 2001 ACM/IEEE conference on Supercomputing, 2001
    
  
    Proceedings of the 10th IEEE International Symposium on High Performance Distributed Computing (HPDC-10 2001), 2001
    
  
  2000
    Proceedings of the Ninth IEEE International Symposium on High Performance Distributed Computing, 2000