Beth Plale

Orcid: 0000-0003-2164-8132

According to our database1, Beth Plale authored at least 153 papers between 1998 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Creating intelligent cyberinfrastructure for democratizing AI.
AI Mag., 2024

2023
Cybershuttle: An End-to-End Cyberinfrastructure Continuum to Accelerate Discovery in Science and Engineering.
Proceedings of the Practice and Experience in Advanced Research Computing, 2023

CKN: An Edge AI Distributed Framework.
Proceedings of the 19th IEEE International Conference on e-Science, 2023

Democratization of AI: Challenges of AI Cyberinfrastructure and Software Research.
Proceedings of the 19th IEEE International Conference on e-Science, 2023

CCGRID 2023: A Holistic Approach to Inclusion and Belonging.
Proceedings of the 23rd IEEE/ACM International Symposium on Cluster, 2023

Knowledge Enhanced Digital Objects: a Data Lake Approach.
Proceedings of the 23rd IEEE/ACM International Symposium on Cluster, 2023

2021
Transparency and Reproducibility Practice in Large-Scale Computational Science: A Preface to the Special Section.
IEEE Trans. Parallel Distributed Syst., 2021

Fostering Interdisciplinary Data Cultures through Early Career Development: The RDA/US Data Share Fellowship.
Data Sci. J., 2021

Reproducibility Practice in High-Performance Computing: Community Survey Results.
Comput. Sci. Eng., 2021

Campaign Knowledge Network: Building Knowledge for Campaign Efficiency.
CoRR, 2021

Towards System for Knowledge Representation of Campaign Experimentation.
Proceedings of the 17th IEEE International Conference on eScience, 2021

2020
Reliable access to massive restricted texts: Experience-based evaluation.
Concurr. Comput. Pract. Exp., 2020

2019
Safe Open Science for Restricted Data.
Data Inf. Manag., 2019

Pilot evaluation of Collection API with PID Kernel Information.
CoRR, 2019

Intelligent systems for geosciences: an essential research agenda.
Commun. ACM, 2019

Transparency by Design in eScience Research.
Proceedings of the 15th International Conference on eScience, 2019

2018
Big Provenance Stream Processing for Data Intensive Computations.
Proceedings of the 14th IEEE International Conference on e-Science, 2018

2017
Identification and characterization of information-networks in long-tail data collections.
Environ. Model. Softw., 2017

Mining lake time series using symbolic representation.
Ecol. Informatics, 2017

Pacific Rim Applications and Grid Middleware Assembly (PRAGMA): International clouds for data science.
Concurr. Comput. Pract. Exp., 2017

Enhancing Access to Digital Media: The Language Application Grid in the HTRC Data Capsule.
Proceedings of the Practice and Experience in Advanced Research Computing 2017: Sustainability, 2017

Towards Publishing Secure Capsule-Based Analysis.
Proceedings of the 2017 ACM/IEEE Joint Conference on Digital Libraries, 2017

Provenance Enriched PID Kernel Information as OAI-ORE Map Replacement for SEAD Research Objects.
Proceedings of the 2017 ACM/IEEE Joint Conference on Digital Libraries, 2017

2016
Argus: A Multi-tenancy NoSQL store with workload-aware resource reservation.
Parallel Comput., 2016

A Multi-tenant Fair Share Approach to Full-text Search Engine.
Proceedings of the Seventh International Workshop on Data-Intensive Computing in the Clouds, 2016

SamzaSQL: Scalable Fast Data Management with Streaming SQL.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

Provenance as Essential Infrastructure for Data Lakes.
Proceedings of the Provenance and Annotation of Data and Processes, 2016

Analysis of Memory Constrained Live Provenance.
Proceedings of the Provenance and Annotation of Data and Processes, 2016

Crossing analytics systems: A case for integrated provenance in data lakes.
Proceedings of the 12th IEEE International Conference on e-Science, 2016

A hybrid approach to population construction for agricultural agent-based simulation.
Proceedings of the 12th IEEE International Conference on e-Science, 2016

Horme: Random Access Big Data Analytics.
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016

KVLight: A Lightweight Key-Value Store for Distributed Access in Cloud.
Proceedings of the IEEE/ACM 16th International Symposium on Cluster, 2016

2015
Provenance Quality Assessment Methodology and Framework.
ACM J. Data Inf. Qual., 2015

Fast Data Management with Distributed Streaming SQL.
CoRR, 2015

Towards Sustainable Curation and Preservation: The SEAD Project's Data Services Approach.
Proceedings of the 11th IEEE International Conference on e-Science, 2015

Towards Building a Lightweight Key-Value Store on Parallel File System.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

Workload-Aware Resource Reservation for Multi-tenant NoSQL.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

Big Data Provenance Analysis and Visualization.
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015

ProvErr: System Level Statistical Fault Diagnosis Using Dependency Model.
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015

2014
Temporal representation for mining scientific data provenance.
Future Gener. Comput. Syst., 2014

Synthesis of Working Group and Interest Group Activity One Year into the Research Data Alliance.
D Lib Mag., 2014

Hierarchical MapReduce: towards simplified cross-domain data processing.
Concurr. Comput. Pract. Exp., 2014

TextRWeb: Large-Scale Text Analytics with R on the Web.
Proceedings of the Annual Conference of the Extreme Science and Engineering Discovery Environment, 2014

Regenerating and Quantifying Quality of Benchmarking Data Using Static and Dynamic Provenance.
Proceedings of the Provenance and Annotation of Data and Processes, 2014

Cloud computing data capsules for non-consumptiveuse of texts.
Proceedings of the ScienceCloud'14, 2014

Study in Usefulness of Middleware-Only Provenance.
Proceedings of the 10th IEEE International Conference on e-Science, 2014

Large-scale text analysis through the HathiTrust Research Center.
Proceedings of the 9th Annual International Conference of the Alliance of Digital Humanities Organizations, 2014

Multi-tenant fair share in NoSQL data stores.
Proceedings of the 2014 IEEE International Conference on Cluster Computing, 2014

Parallel and quantitative sequential pattern mining for large-scale interval-based temporal data.
Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

Author gender metadata augmentation of hathitrust digital library.
Proceedings of the Connecting Collections, Cultures, and Communities, 2014

Executing Storm Surge Ensembles on PAAS Cloud.
Proceedings of the Cloud Computing for Data-Intensive Applications, 2014

2013
Provenance Capture and Use in a Satellite Data Processing Pipeline.
IEEE Trans. Geosci. Remote. Sens., 2013

SEAD Virtual Archive: Building a Federation of Institutional Repositories for Long-Term Data Preservation in Sustainability Science.
Int. J. Digit. Curation, 2013

Exploiting MapReduce and data compression for data-intensive applications.
Proceedings of the Extreme Science and Engineering Discovery Environment: Gateway to Discovery, 2013

Storm surge simulation and load balancing in Azure cloud.
Proceedings of the 2013 Spring Simulation Multiconference, SpringSim '13, 2013

Static compiler analysis for workflow provenance.
Proceedings of WORKS 2013: 8th Workshop On Workflows in Support of Large-Scale Science, 2013

Automatic performance evaluation of dewarping methods in large scale digitization of historical documents.
Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital Libraries, 2013

HathiTrust research center: computational access for digital humanities and beyond.
Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital Libraries, 2013

The SEAD DataNet prototype: data preservation services for sustainability science.
Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital Libraries, 2013

Modeling heterogeneous data resources for social-ecological research: a data-centric perspective.
Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital Libraries, 2013

Provenance from log files: a BigData problem.
Proceedings of the Joint 2013 EDBT/ICDT Conferences, 2013

Data Pipeline in MapReduce.
Proceedings of the 9th IEEE International Conference on eScience, 2013

Dependency Provenance in Agent Based Modeling.
Proceedings of the 9th IEEE International Conference on eScience, 2013

Architecture to enable large-scale computational analysis of millions of volumes.
Proceedings of the 8th Annual International Conference of the Alliance of Digital Humanities Organizations, 2013

DEM Generation with SAR Interferometry Based on Weighted Wavelet Phase Unwrapping.
Proceedings of the Fourth International Conference on Computing for Geospatial Research and Application, 2013

Big data opportunities and challenges for IR, text mining and NLP.
Proceedings of the 2013 international workshop on Mining unstructured big data using natural language processing, 2013

Milieu: Lightweight and Configurable Big Data Provenance for Science.
Proceedings of the IEEE International Congress on Big Data, 2013

2012
Effectiveness of Hybrid Workflow Systems for Computational Science.
Proceedings of the International Conference on Computational Science, 2012

Sigiri: uniform resource abstraction for grids and clouds.
Concurr. Comput. Pract. Exp., 2012

Managing the long tail of science: data and communities.
Proceedings of the 1st Conference of the Extreme Science and Engineering Discovery Environment, 2012

Poster: Visualizing Large Scale Scientific Data Provenance.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: Visualizing Large Scale Scientific Data Provenance.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Visualization of network data provenance.
Proceedings of the 19th International Conference on High Performance Computing, 2012

Generalized representation and mapping for social-ecological data: Freeing data from the database.
Proceedings of the 8th IEEE International Conference on E-Science, 2012

Temporal representation for scientific data provenance.
Proceedings of the 8th IEEE International Conference on E-Science, 2012

Provenance analysis: Towards quality provenance.
Proceedings of the 8th IEEE International Conference on E-Science, 2012

Hierarchical MapReduce Programming Model and Scheduling Algorithms.
Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012

Mining classifications from social-ecological databases.
Proceedings of the Information, Interaction, Innovation: Celebrating the Past, Constructing the Present and Creating the Future, 2012

From metadata to ontology representation: A case of converting severe weather forecast metadata to an ontology.
Proceedings of the Information, Interaction, Innovation: Celebrating the Past, Constructing the Present and Creating the Future, 2012

2011
Using Provenance for Personalized Quality Ranking of Scientific Datasets.
Int. J. Comput. Their Appl., 2011

The Open Provenance Model core specification (v1.1).
Future Gener. Comput. Syst., 2011

Programming Abstraction for Resource Aware Stream Processing for Scientific Workflows.
Proceedings of the IEEE 7th International Conference on E-Science, 2011

Hybrid programming abstraction for e-science workflows and event processing.
Proceedings of the Fifth ACM International Conference on Distributed Event-Based Systems, 2011

A Noisy 10GB Provenance Database.
Proceedings of the Business Process Management Workshops, 2011

2010
Implementation, performance, and science results from a 30.7 TFLOPS IBM BladeCenter cluster.
Concurr. Comput. Pract. Exp., 2010

What is cyberinfrastructure.
Proceedings of the ACM SIGUCCS Fall Conference on User Services 2010, Norfolk, VA, USA, 2010

Versioning for workflow evolution.
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, 2010

Trading Consistency for Scalability in Scientific Metadata.
Proceedings of the Sixth International Conference on e-Science, 2010

Usage Patterns to Provision for Scientific Experimentation in Clouds.
Proceedings of the Cloud Computing, Second International Conference, 2010

WORKEM: Representing and Emulating Distributed Scientific Workflow Execution State.
Proceedings of the 10th IEEE/ACM International Conference on Cluster, 2010

Streamflow Programming Model for Data Streaming in Scientific Workflows.
Proceedings of the 10th IEEE/ACM International Conference on Cluster, 2010

2009
CBR Based Workflow Composition Assistant.
Proceedings of the 2009 IEEE Congress on Services, Part I, 2009

Provenance Information Model of Karma Version 3.
Proceedings of the 2009 IEEE Congress on Services, Part I, 2009

Semantically Annotated Provenance in the Life Science Grid.
Proceedings of the First International Workshop on the role of Semantic Web in Provenance Management (SWPM 2009), 2009

Application of Management Frameworks to Manage Workflow-Based Systems: A Case Study on a Large Scale E-science Project.
Proceedings of the IEEE International Conference on Web Services, 2009

2008
Karma2: Provenance Management for Data-Driven Workflows.
Int. J. Web Serv. Res., 2008

Real-time storm detection and weather forecast activation through data mining and events processing.
Earth Sci. Informatics, 2008

Query capabilities of the Karma provenance framework.
Concurr. Comput. Pract. Exp., 2008

Special Issue: The First Provenance Challenge.
Concurr. Comput. Pract. Exp., 2008

Using Characteristics of Computational Science Schemas for Workflow Metadata Management.
Proceedings of the 2008 IEEE Congress on Services, Part I, 2008

A Roadmap for caGrid, an Enterprise Grid Architecture for Biomedical Research.
Proceedings of the Global Healthgrid: e-Science Meets Biomedical Informatics, 2008

Riding the Geoscience Cyberinfrastructure Wave of Data: Real Time Data Use in Education Workshop.
Proceedings of the Fourth International Conference on e-Science, 2008

Schema-Independent and Schema-Friendly Scientific Metadata Management.
Proceedings of the Fourth International Conference on e-Science, 2008

Provenance Collection in an Industry Biochemical Discovery Cyberinfrastructure.
Proceedings of the Fourth International Conference on e-Science, 2008

2007
Service Architectures for e-Science Grid Gateways: Opportunities and Challenges.
Proceedings of the On the Move to Meaningful Internet Systems 2007: CoopIS, 2007

Realization of Dynamically Adaptive Weather Analysis and Forecasting in LEAD: Four Years Down the Road.
Proceedings of the Computational Science, 2007

Dynamic, Adaptive Workflows for Mesoscale Meteorology.
Proceedings of the Workflows for e-Science, Scientific Workflows for Grids., 2007

2006
CASA and LEAD: Adaptive Cyberinfrastructure for Real-Time Multiscale Weather Forecasting.
Computer, 2006

Multi-model Based Optimization for Stream Query Processing.
Proceedings of the Eighteenth International Conference on Software Engineering & Knowledge Engineering (SEKE'2006), 2006

Query Optimization for Distributed Data Streams.
Proceedings of the 15th International Conference on Software Engineering and Data Engineering (SEDE-2006), 2006

Poster reception - A meta-provenance service to infer context from provenance data of distributed entities.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Bandwidth challenge - All in a day's work: advancing data-intensive research with the data capacitor.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Towards Low Overhead Provenance Tracking in Near Real-Time Stream Filtering.
Proceedings of the Provenance and Annotation of Data, 2006

Performance Evaluation of the Karma Provenance Framework for Scientific Workflows.
Proceedings of the Provenance and Annotation of Data, 2006

Dynamic Filtering and Mining Triggers in Mesoscale Meteorology Forecasting.
Proceedings of the IEEE International Geoscience & Remote Sensing Symposium, 2006

Data Management in Dynamic Environment-driven Computational Science.
Proceedings of the Grid-Based Problem Solving Environments, 2006

A Framework for Collecting Provenance in Data-Centric Scientific Workflows.
Proceedings of the 2006 IEEE International Conference on Web Services (ICWS 2006), 2006

TrustCell: Towards the End-to-End Trustworthiness in Data-Oriented Scientific Computing.
Proceedings of the 2006 International Conference on Parallel Processing Workshops (ICPP Workshops 2006), 2006

A Hybrid XML-Relational Grid Metadata Catalog.
Proceedings of the 2006 International Conference on Parallel Processing Workshops (ICPP Workshops 2006), 2006

Towards a Quality Model for Effective Data Selection in Collaboratories.
Proceedings of the 22nd International Conference on Data Engineering Workshops, 2006

Building e-Science Portals: A Service Oriented Architecture.
Proceedings of the High Performance Computing and Grids in Action, 2006

Personal Workspace for Large-Scale Data-Driven Computational Experiment.
Proceedings of the 7th IEEE/ACM International Conference on Grid Computing (GRID 2006), 2006

Stream processing in data-driven computational science.
Proceedings of the 7th IEEE/ACM International Conference on Grid Computing (GRID 2006), 2006

Calder Query Grid Service: Insights and Experimental Evaluation.
Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2006), 2006

End-to-End Trustworthy Data Access in Data-Oriented Scientific Computing.
Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2006), 2006

2005
A survey of data provenance in e-science.
SIGMOD Rec., 2005

Building Grid Portal Applications From a Web Service Component Architecture.
Proc. IEEE, 2005

Active Management of Scientific Data.
IEEE Internet Comput., 2005

Cooperating services for data-driven computational experimentation.
Comput. Sci. Eng., 2005

Service-oriented environments for dynamically interacting with mesoscale weather.
Comput. Sci. Eng., 2005

Evaluation of Rate-Based Adaptivity in Asynchronous Data Stream Joins.
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

Service Oriented Architectures for Science Gateways on Grid Systems.
Proceedings of the Service-Oriented Computing, 2005

Towards Dynamically Adaptive Weather Analysis and Forecasting in LEAD.
Proceedings of the Computational Science, 2005

Calder: enabling grid access to data streams.
Proceedings of the 14th IEEE International Symposium on High Performance Distributed Computing, 2005

Distributed streaming query planner in Calder system.
Proceedings of the 14th IEEE International Symposium on High Performance Distributed Computing, 2005

Structure, sharing and preservation of scientific experiment data.
Proceedings of the 2005 Challenges of Large Applications in Distributed Environments, 2005

Monitoring Access to Stateful Resources in Grid Environments.
Proceedings of the 2005 IEEE International Conference on Services Computing (SCC 2005), 2005

2004
Framework for bringing data streams to the grid.
Sci. Program., 2004

Performance Evaluation of Rate-Based Join Window Sizing for Asynchronous Data Streams.
Proceedings of the 13th International Symposium on High-Performance Distributed Computing (HPDC-13 2004), 2004

Building Grid Applications and Portals: An Approach Based on Components, Web Services and Workflow Tools.
Proceedings of the Euro-Par 2004 Parallel Processing, 2004

Using Global Snapshots to Access Data Streams on the Grid.
Proceedings of the Grid Computing, 2004

Understanding Grid resource information management through a synthetic database benchmark/workload.
Proceedings of the 4th IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 2004), 2004

2003
Dynamic Querying of Streaming Data with the dQUOB System.
IEEE Trans. Parallel Distributed Syst., 2003

2002
Leveraging Run Time Knowledge about Event Rates to Improve Memory Utilization in Wide Area Data Stream Filtering.
Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing (HPDC-11 2002), 2002

2001
Taking the Step From Meta-Information to Communication Middleware in Computational Data Streams.
Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

Optimizations Enabled by Relational Data Model View to Querying Data Streams.
Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

2000
Operational Information Systems: An Example from the Airline Industry.
Proceedings of the First Workshop on Industrial Experiences with Systems Software, 2000

dQUOB: Managing Large Data Flows using Dynamic Embedded Queries.
Proceedings of the Ninth IEEE International Symposium on High Performance Distributed Computing, 2000

1999
Run-time Detection in Parallel and Distributed Systems: Application to Safety-Critical Systems.
Proceedings of the 19th International Conference on Distributed Computing Systems, Austin, TX, USA, May 31, 1999

Steering Data Streams in Distributed Computational Laboratories.
Proceedings of the Eighth IEEE International Symposium on High Performance Distributed Computing, 1999

1998
DataExchange: High Performance Communications in Distributed Laboratories.
Parallel Comput., 1998

From interactive applications to distributed laboratories.
IEEE Concurr., 1998

Multi-level steering in distributed laboratories.
Proceedings of the SIGMETRICS Symposium on Parallel and Distributed Tools, 1998


  Loading...