Tanu Malik

Orcid: 0009-0007-9656-727X

Affiliations:
  • Johns Hopkins University, USA


According to our database1, Tanu Malik authored at least 70 papers between 2002 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Comparing containerization-based approaches for reproducible computational modeling of environmental systems.
Environ. Model. Softw., September, 2023

IOSPReD: I/O Specialized Packaging of Reduced Datasets and Data-Intensive Applications for Efficient Reproducibility.
IEEE Access, 2023

Querying Container Provenance.
Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

Reproducible eScience: The Data Containerization Challenge.
Proceedings of the 19th IEEE International Conference on e-Science, 2023

Efficient Differencing of System-level Provenance Graphs.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022
CHEX: Multiversion Replay with Ordered Checkpoints.
Proc. VLDB Endow., 2022

Theory and Practice of Provenance.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Provenance-based Workflow Diagnostics Using Program Specification.
Proceedings of the 29th IEEE International Conference on High Performance Computing, 2022

Reproducible Notebook Containers using Application Virtualization.
Proceedings of the 18th IEEE International Conference on e-Science, 2022

2021
Special issue on Data-driven Science.
Distributed Parallel Databases, 2021

Reproducibility Practice in High-Performance Computing: Community Survey Results.
Comput. Sci. Eng., 2021

On Lowering Merge Costs of an LSM Tree.
Proceedings of the SSDBM 2021: 33rd International Conference on Scientific and Statistical Database Management, 2021

LDI: Learned Distribution Index for Column Stores.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

2020
DF-Toolkit: Interacting with Low-Level Database Storage.
Proc. VLDB Endow., 2020

A taxonomy for reproducible and replicable research in environmental modelling.
Environ. Model. Softw., 2020

Efficient Provenance Alignment in Reproduced Executions.
Proceedings of the 12th International Workshop on Theory and Practice of Provenance, 2020

PROV-CRT: Provenance Support for Container Runtimes.
Proceedings of the 12th International Workshop on Theory and Practice of Provenance, 2020

MiDas: Containerizing Data-Intensive Applications with I/O Specialization.
Proceedings of the 3rd International Workshop on Practical Reproducible Evaluation of Computer Systems, 2020

Content-defined Merkle Trees for Efficient Container Delivery.
Proceedings of the 27th IEEE International Conference on High Performance Computing, 2020

ODSA: Open Database Storage Access.
Proceedings of the 23rd International Conference on Extending Database Technology, 2020

2019
PLI $$^+$$ + : efficient clustering of cloud databases.
Distributed Parallel Databases, 2019

Documenting Computing Environments for Reproducible Experiments.
Proceedings of the Parallel Computing: Technology Trends, 2019

SciInc: A Container Runtime for Incremental Recomputation.
Proceedings of the 15th International Conference on eScience, 2019

2018
Report on the First International Workshop on Incremental Re-computation: Provenance and Beyond.
SIGMOD Rec., 2018

Utilizing Provenance in Reusable Research Objects.
Informatics, 2018

Integrating scientific cyberinfrastructures to improve reproducibility in computational hydrology: Example for HydroShare and GeoTrust.
Environ. Model. Softw., 2018

Using Provenance for Generating Automatic Citations.
Proceedings of the 10th USENIX Workshop on the Theory and Practice of Provenance, 2018

Where Provenance in Database Storage.
Proceedings of the Provenance and Annotation of Data and Processes, 2018

Detecting Database File Tampering through Page Carving.
Proceedings of the 21st International Conference on Extending Database Technology, 2018

2017
PLI: Augmenting Live Databases with Custom Clustered Indexes.
Proceedings of the 29th International Conference on Scientific and Statistical Database Management, 2017

Sciunits: Reusable Research Objects.
Proceedings of the 13th IEEE International Conference on e-Science, 2017

Database Forensic Analysis with DBCarver.
Proceedings of the 8th Biennial Conference on Innovative Data Systems Research, 2017

2016
Rediscovering EarthCube: Collaborate. Or collaborate not. There is no I.
Digit. Libr. Perspect., 2016

Ontology-based urban data exploration.
Proceedings of the 2nd ACM SIGSPATIAL Workshop on Smart Cities and Urban Analytics, 2016

Interactive provenance summaries for reproducible science.
Proceedings of the 12th IEEE International Conference on e-Science, 2016

2015
Sharing and Reproducing Database Applications.
Proc. VLDB Endow., 2015

An invariant framework for conducting reproducible computational science.
J. Comput. Sci., 2015

PDACS: A Portal for Data Analysis Services for Cosmological Simulations.
Comput. Sci. Eng., 2015

GEN: a database interface generator for HPC programs.
Proceedings of the 27th International Conference on Scientific and Statistical Database Management, 2015

LDV: Light-weight database virtualization.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

2014
Plenario: An Open Data Discovery and Exploration Platform for Urban Science.
IEEE Data Eng. Bull., 2014

Auditing and Maintaining Provenance in Software Packages.
Proceedings of the Provenance and Annotation of Data and Processes, 2014

Benchmarking cloud-based tagging services.
Proceedings of the Workshops Proceedings of the 30th International Conference on Data Engineering Workshops, 2014

2013
Using Provenance for Repeatability.
Proceedings of the 5th Workshop on the Theory and Practice of Provenance, 2013

Proactive Support for Large-Scale Data Exploration.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Lens: A Faceted Browser for Research Networking Platforms.
Proceedings of the 9th IEEE International Conference on eScience, 2013

Distributed data provenance for large-scale data-intensive computing.
Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013

2012
SOLE: Linking Research Papers with Science Objects.
Proceedings of the Provenance and Annotation of Data and Processes, 2012

Addressing data access needs of the long-tail distribution of geoscientists.
Proceedings of the 2012 IEEE International Geoscience and Remote Sensing Symposium, 2012

2011
Policy-Based Integration of Provenance Metadata.
Proceedings of the POLICY 2011, 2011

Improving the efficiency of subset queries on raster images.
Proceedings of the 2011 Second International Workshop on High Performance and Distributed Geographic Information Systems, 2011

2010
JAWS: Job-Aware Workload Scheduling for the Exploration of Turbulence Simulations.
Proceedings of the Conference on High Performance Computing Networking, 2010

A Dynamic Data Middleware Cache for Rapidly-Growing Scientific Repositories.
Proceedings of the Middleware 2010 - ACM/IFIP/USENIX 11th International Middleware Conference, Bangalore, India, November 29, 2010

Efficient querying of distributed provenance stores.
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, 2010

Tracking and Sketching Distributed Data Provenance.
Proceedings of the Sixth International Conference on e-Science, 2010

Providing Scalable Data Services in Ubiquitous Networks.
Proceedings of the Database Systems for Advanced Applications, 2010

2009
Adaptive Physical Design for Curated Archives.
Proceedings of the Scientific and Statistical Database Management, 2009

LifeRaft: Data-Driven, Batch Processing for the Exploration of Scientific Databases.
Proceedings of the Fourth Biennial Conference on Innovative Data Systems Research, 2009

2008
Automated physical design in database caches.
Proceedings of the 24th International Conference on Data Engineering Workshops, 2008

Rule-Based Classification Systems for Informatics.
Proceedings of the Fourth International Conference on e-Science, 2008

Workload-Aware Histograms for Remote Applications.
Proceedings of the Data Warehousing and Knowledge Discovery, 10th International Conference, 2008

2007
A Workload-Driven Unit of Cache Replacement for Mid-Tier Database Caching.
Proceedings of the Advances in Databases: Concepts, 2007

A Black-Box Approach to Query Cardinality Estimation.
Proceedings of the Third Biennial Conference on Innovative Data Systems Research, 2007

2006
Data management and query - Estimating query result sizes for proxy caching in scientific database federations.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

2005
Practical Passive Lossy Link Inference.
Proceedings of the Passive and Active Network Measurement, 6th International Workshop, 2005

Bypass Caching: Making Scientific Databases Good Network Citizens.
Proceedings of the 21st International Conference on Data Engineering, 2005

2003
SkyQuery: A Web Service Approach to Federate Databases.
Proceedings of the First Biennial Conference on Innovative Data Systems Research, 2003

2002
Web Services for the Virtual Observatory
CoRR, 2002

SkyQuery: A WebService Approach to Federate Databases
CoRR, 2002

The SDSS skyserver: public access to the sloan digital sky server data.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002


  Loading...