Juliana Freire

According to our database, Juliana Freire authored at least 174 papers between 1994 and 2019.

Collaborative distances:
  • Dijkstra number of three.
  • Erdős number of two.

Awards

ACM Fellow

ACM Fellow 2014, "For contributions to provenance management research and technology, and computational reproducibility."

Bibliography

2019
A Survey on Collecting, Managing, and Analyzing Provenance from Scripts.
ACM Comput. Surv., 2019

Bootstrapping Domain-Specific Content Discovery on the Web.
Proceedings of the World Wide Web Conference, 2019

A Topic-Agnostic Approach for Identifying Fake News Pages.
Proceedings of the Companion of The 2019 World Wide Web Conference, 2019

Debugging Machine Learning Pipelines.
Proceedings of the 3rd International Workshop on Data Management for End-to-End Machine Learning, 2019

Data Debugging and Exploration with Vizier.
Proceedings of the 2019 International Conference on Management of Data, 2019

A large-scale study about quality and reproducibility of Jupyter notebooks.
Proceedings of the 16th International Conference on Mining Software Repositories, 2019

2018
XML Selectivity Estimation.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Biological Resource Discovery.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Provenance in Workflows.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Provenance and Reproducibility.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

XML Storage.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Provenance and the Different Flavors of Reproducibility.
IEEE Data Eng. Bull., 2018

Time Lattice: A Data Structure for the Interactive Visual Analysis of Large Time Series.
Comput. Graph. Forum, 2018

Spatio-Temporal Urban Data Analysis: A Visual Analytics Perspective.
IEEE Computer Graphics and Applications, 2018

ARIES: Enabling Visual Exploration and Organization of Art Image Collections.
IEEE Computer Graphics and Applications, 2018

Learning to Discover Domain-Specific Web Content.
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018

Interactive Visual Exploration of Spatio-Temporal Urban Data Sets using Urbane.
Proceedings of the 2018 International Conference on Management of Data, 2018

2017
STaRS: Simulating Taxi Ride Sharing at Scale.
IEEE Trans. Big Data, 2017

Data Quality: The Role of Empiricism.
SIGMOD Record, 2017

GPU Rasterization for Real-Time Spatial Aggregation over Arbitrary Polygons.
PVLDB, 2017

noWorkflow: a Tool for Collecting, Analyzing, and Managing Provenance from Python Scripts.
PVLDB, 2017

Real-time understanding of humanitarian crises via targeted information retrieval.
IBM Journal of Research and Development, 2017

Connecting Visualization and Data Management Research (Dagstuhl Seminar 17461).
Dagstuhl Reports, 2017

Querying and Exploring Polygamous Relationships in Urban Spatio-Temporal Data Sets.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

2016
Finding seeds to bootstrap focused crawlers.
World Wide Web, 2016

Visually Exploring Transportation Schedules.
IEEE Trans. Vis. Comput. Graph., 2016

ReproZip: The Reproducibility Packer.
J. Open Source Software, 2016

A collaborative approach to computational reproducibility.
Inf. Syst., 2016

Exploring What not to Clean in Urban Data: A Study Using New York City Taxi Trips.
IEEE Data Eng. Bull., 2016

Reproducibility of Data-Oriented Experiments in e-Science (Dagstuhl Seminar 16041).
Dagstuhl Reports, 2016

A First Study on Temporal Dynamics of Topics on the Web.
Proceedings of the 25th International Conference on World Wide Web, 2016

The exception that improves the rule.
Proceedings of the Workshop on Human-In-the-Loop Data Analytics, 2016

ReproZip: Computational Reproducibility With Ease.
Proceedings of the 2016 International Conference on Management of Data, 2016

Data Polygamy: The Many-Many Relationships among Urban Spatio-Temporal Data Sets.
Proceedings of the 2016 International Conference on Management of Data, 2016

Understanding Website Behavior based on User Agent.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Fine-Grained Provenance Collection over Scripts Through Program Slicing.
Proceedings of the Provenance and Annotation of Data and Processes, 2016

Tracking and Analyzing the Evolution of Provenance from Scripts.
Proceedings of the Provenance and Annotation of Data and Processes, 2016

Prov Viewer: A Graph-Based Visualization Tool for Interactive Exploration of Provenance Data.
Proceedings of the Provenance and Annotation of Data and Processes, 2016

A GPU-based index to support interactive spatio-temporal queries over historical data.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

Virtual lightweight snapshots for consistent analytics in NoSQL stores.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

Anonymizing NYC Taxi Data: Does It Matter?
Proceedings of the 2016 IEEE International Conference on Data Science and Advanced Analytics, 2016

A Unified Index for Spatio-Temporal Keyword Queries.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Predicting taxi demand at high spatial resolution: Approaching the limit of predictability.
Proceedings of the 2016 IEEE International Conference on Big Data, 2016

2015
Exploring Traffic Dynamics in Urban Environments Using Vector-Valued Functions.
Comput. Graph. Forum, 2015

An Urban Data Profiler.
Proceedings of the 24th International Conference on World Wide Web Companion, 2015

Reproducibility Made Easy.
Proceedings of the EuroVis Workshop on Reproducibility, 2015

Collecting and Analyzing Provenance on Interactive Notebooks: When IPython Meets noWorkflow.
Proceedings of the 7th USENIX Workshop on the Theory and Practice of Provenance, 2015

Visualizing the Evolution of Module Workflows.
Proceedings of the 19th International Conference on Information Visualisation, 2015

A scalable approach for data-driven taxi ride-sharing simulation.
Proceedings of the 2015 IEEE International Conference on Big Data, 2015

2014
Using Topological Analysis to Support Event-Guided Exploration in Urban Data.
IEEE Trans. Vis. Comput. Graph., 2014

Riding from Urban Data to Insight Using New York City Taxis.
IEEE Data Eng. Bull., 2014

Reorganizing Workflow Evolution Provenance.
Proceedings of the 6th Workshop on the Theory and Practice of Provenance, 2014

Should we all be teaching "intro to data science" instead of "intro to databases"?
Proceedings of the International Conference on Management of Data, 2014

Towards Understanding Real-Estate Ownership in New York City: Opportunities and Challenges.
Proceedings of the International Workshop on Data Science for Macro-Modeling, 2014

noWorkflow: Capturing and Analyzing Provenance of Scripts.
Proceedings of the Provenance and Annotation of Data and Processes, 2014

2013
Visual Exploration of Big Spatio-Temporal Urban Data: A Study of New York City Taxi Trips.
IEEE Trans. Vis. Comput. Graph., 2013

Letter from the Special Issue Editor.
IEEE Data Eng. Bull., 2013

A Computational Reproducibility Benchmark.
IEEE Data Eng. Bull., 2013

ReproZip: Using Provenance to Support Computational Reproducibility.
Proceedings of the 5th Workshop on the Theory and Practice of Provenance, 2013

Packing experiments for sharing and publication.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

VisTrails provenance traces for benchmarking.
Proceedings of the Joint 2013 EDBT/ICDT Conferences, 2013

Visual summaries for graph collections.
Proceedings of the IEEE Pacific Visualization Symposium, 2013

2012
Letter from the Associate Editors.
PVLDB, 2012

Letter from the Special Issue Editors.
IEEE Data Eng. Bull., 2012

Making Computations and Publications Reproducible with VisTrails.
Computing in Science and Engineering, 2012

Computational reproducibility: state-of-the-art, challenges, and database research opportunities.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Designing a Provenance-Based Climate Data Analysis Application.
Proceedings of the Provenance and Annotation of Data and Processes, 2012

Towards Integrating Workflow and Database Provenance.
Proceedings of the Provenance and Annotation of Data and Processes, 2012

Clustering Wikipedia infoboxes to discover their types.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
BirdVis: Visualizing and Understanding Bird Populations.
IEEE Trans. Vis. Comput. Graph., 2011

Repeatability and workability evaluation of SIGMOD 2011.
SIGMOD Record, 2011

Multilingual Schema Matching for Wikipedia Infoboxes.
PVLDB, 2011

Synthesizing Products for Online Catalogs.
PVLDB, 2011

Exploring the Coming Repositories of Reproducible Experiments: Challenges and Opportunities.
PVLDB, 2011

A Provenance-Based Infrastructure to Support the Life Cycle of Executable Papers.
Proceedings of the International Conference on Computational Science, 2011

Guest Editors' Introduction: Provenance in Web Applications.
IEEE Internet Computing, 2011

The Open Provenance Model core specification (v1.1).
Future Generation Comp. Syst., 2011

Proppian random walks in Z.
Discrete Mathematics, 2011

XML Management for Bioinformatics Applications.
Computing in Science and Engineering, 2011

CrowdLabs: Social Analysis and Visualization for the Sciences.
Proceedings of the Scientific and Statistical Database Management, 2011

Parallel visualization on large clusters using MapReduce.
Proceedings of the IEEE Symposium on Large Data Analysis and Visualization, 2011

2010
Indexing Web Form Constraints.
JIDM, 2010

Siphoning Hidden-Web Data through Keyword-Based Interfaces: Retrospective.
JIDM, 2010

Querying Wikipedia Documents and Relationships.
Proceedings of the 13th International Workshop on the Web and Databases 2010, 2010

Using Latent-Structure to Detect Objects on the Web.
Proceedings of the 13th International Workshop on the Web and Databases 2010, 2010

Bridging Workflow and Data Provenance Using Strong Links.
Proceedings of the Scientific and Statistical Database Management, 2010

Creating and exploring web form repositories.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Provenance-Enabled Data Exploration and Visualization with VisTrails.
Proceedings of the 23rd SIBGRAPI Conference on Graphics, Patterns and Images Tutorials, 2010

The Provenance of Workflow Upgrades.
Proceedings of the Provenance and Annotation of Data and Processes, 2010

Using VisTrails and Provenance for Teaching Scientific Visualization.
Proceedings of the Eurographics 2010 - Education Papers, Norrköping, Sweden, May 3-7, 2010, 2010

Indexing relations on the web.
Proceedings of the EDBT 2010, 2010

Provenance Management for Data Exploration.
Proceedings of the Data Integration in the Life Sciences, 7th International Conference, 2010

PruSM: a prudent schema matching approach for web forms.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2009
XML Selectivity Estimation.
Proceedings of the Encyclopedia of Database Systems, 2009

Biological Resource Discovery.
Proceedings of the Encyclopedia of Database Systems, 2009

XML Storage.
Proceedings of the Encyclopedia of Database Systems, 2009

On Finding Templates on Web Collections.
World Wide Web, 2009

VisMashup: Streamlining the Creation of Custom Visualization Applications.
IEEE Trans. Vis. Comput. Graph., 2009

Using Workflow Medleys to Streamline Exploratory Tasks.
Proceedings of the Scientific and Statistical Database Management, 2009

A first study on strategies for generating workflow snippets.
Proceedings of the First International Workshop on Keyword Search on Structured Data, 2009

Using Mediation to Achieve Provenance Interoperability.
Proceedings of the 2009 IEEE Congress on Services, Part I, 2009

Introduction.
Proceedings of the First International Workshop on the role of Semantic Web in Provenance Management (SWPM 2009), 2009

Enabling Advanced Visualization Tools in a Web-Based Simulation Monitoring System.
Proceedings of the Fifth International Conference on e-Science, 2009

Provenance Management: Challenges and Opportunities.
Proceedings of the Datenbanksysteme in Business, 2009

Scientific Process Automation and Workflow Management.
Proceedings of the Scientific Data Management - Challenges, Technology, and Deployment, 2009

2008
VisComplete: Automating Suggestions for Visualization Pipelines.
IEEE Trans. Vis. Comput. Graph., 2008

Learning to extract form labels.
PVLDB, 2008

Provenance for Computational Tasks: A Survey.
Computing in Science and Engineering, 2008

Scientific Exploration in the Era of Ocean Observatories.
Computing in Science and Engineering, 2008

Tackling the Provenance Challenge one layer at a time.
Concurrency and Computation: Practice and Experience, 2008

Special Issue: The First Provenance Challenge.
Concurrency and Computation: Practice and Experience, 2008

Examining Statistics of Workflow Evolution Provenance: A First Study.
Proceedings of the Scientific and Statistical Database Management, 2008

Querying and re-using workflows with VisTrails.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

Provenance and scientific workflows: challenges and opportunities.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

A First Study on Clustering Collections of Workflow Graphs.
Proceedings of the Provenance and Annotation of Data and Processes, 2008

The Open Provenance Model: An Overview.
Proceedings of the Provenance and Annotation of Data and Processes, 2008

Using Provenance to Support Real-Time Collaborative Design of Workflows.
Proceedings of the Provenance and Annotation of Data and Processes, 2008

Towards Provenance-Enabling ParaView.
Proceedings of the Provenance and Annotation of Data and Processes, 2008

Querying structured information sources on the web.
Proceedings of the iiWAS'2008, 2008

Automatically Extracting Form Labels.
Proceedings of the 24th International Conference on Data Engineering, 2008

End-to-End eScience: Integrating Workflow, Query, Visualization, and Provenance at an Ocean Observatory.
Proceedings of the Fourth International Conference on e-Science, 2008

Using Mediation to Achieve Provenance Interoperability (Extended Abstract).
Proceedings of the Fourth International Conference on e-Science, 2008

Siphon++: a hidden-web crawler for keyword-based interfaces.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

2007
Querying and Creating Visualizations by Analogy.
IEEE Trans. Vis. Comput. Graph., 2007

Provenance in Scientific Workflow Systems.
IEEE Data Eng. Bull., 2007

Provenance for Visualizations: Reproducibility and Beyond.
Computing in Science and Engineering, 2007

An adaptive crawler for locating hidden-web entry points.
Proceedings of the 16th International Conference on World Wide Web, 2007

Combining classifiers to identify online databases.
Proceedings of the 16th International Conference on World Wide Web, 2007

Provenance Management: Challenges and Opportunities.
Proceedings of the XXII Simpósio Brasileiro de Banco de Dados, 2007

Organizing Hidden-Web Databases by Clustering Visible Web Documents.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Automatically Constructing a Directory of Molecular Biology Databases.
Proceedings of the Data Integration in the Life Sciences, 4th International Workshop, 2007

2006
Integrated Scientific Workflow Management for the Emulab Network Testbed.
Proceedings of the 2006 USENIX Annual Technical Conference, Boston, MA, USA, May 30, 2006

VisTrails: visualization meets data management.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2006

Managing Rapidly-Evolving Scientific Workflows.
Proceedings of the Provenance and Annotation of Data, 2006

Managing the Evolution of Dataflows with VisTrails.
Proceedings of the 22nd International Conference on Data Engineering Workshops, 2006

A fast and robust method for web page template detection and removal.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

Automatically constructing collections of online database directories.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

2005
Looking at both the present and the past to efficiently update replicas of web content.
Proceedings of the Seventh ACM International Workshop on Web Information and Data Management (WIDM 2005), 2005

Searching for Hidden-Web Databases.
Proceedings of the Eighth International Workshop on the Web & Databases (WebDB 2005), 2005

Designing Information-Preserving Mapping Schemes for XML.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

VisTrails: Enabling Interactive Multiple-View Visualizations.
Proceedings of the 16th IEEE Visualization Conference, 2005

IMAX: The Big Picture of Dynamic XML Statistics.
Proceedings of the 21st International Conference on Data Engineering, 2005

2004
Information Preservation in XML-to-Relational Mappings.
Proceedings of the Database and XML Technologies, 2004

A comprehensive solution to the XML-to-relational mapping problem.
Proceedings of the Sixth ACM CIKM International Workshop on Web Information and Data Management (WIDM 2004), 2004

ShreX: Managing XML Documents in Relational Databases.
(e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

Siphoning Hidden-Web Data through Keyword-Based Interfaces.
Proceedings of the XIX Simpósio Brasileiro de Bancos de Dados, 2004

A Flexible Infrastructure for Gathering XML Statistics and Estimating Query Cardinality.
Proceedings of the 20th International Conference on Data Engineering, 2004

Supporting Exploratory Queries in Databases.
Proceedings of the Database Systems for Advanced Applications, 2004

2003
Searching for Efficient XML-to-Relational Mappings.
Proceedings of the Database and XML Technologies, 2003

Capturing both Types and Constraints in Data Integration.
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, 2003

Bridging the XML Relational Divide with LegoDB.
Proceedings of the 19th International Conference on Data Engineering, 2003

2002
Integrating network devices in a meta-directory: the MetaComm experience.
Inf. Syst., 2002

LegoDB: Customizing Relational Storage for XML Documents.
Proceedings of 28th International Conference on Very Large Data Bases, 2002

StatiX: making XML count.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

From XML Schema to Relations: A Cost-Based Approach to XML Storage.
Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, February 26, 2002

Adaptive XML Shredding: Architecture, Implementation, and Challenges.
Proceedings of the Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web, 2002

2001
WebViews: accessing personalized web content and services.
Proceedings of the Tenth International World Wide Web Conference, 2001

Efficient Acquisition of Web Data through Restricted Query Interfaces.
Proceedings of the Poster Proceedings of the Tenth International World Wide Web Conference, 2001

2000
Automating Web navigation with the WebVCR.
Computer Networks, 2000

Web Services and Information Delivery for Diverse Environments.
Proceedings of the Workshop on Technologies for E-Services, 2000

MetaComm: A Meta-Directory for Telecommunications.
Proceedings of the 16th International Conference on Data Engineering, San Diego, California, USA, February 28, 2000

1999
Making LDAP Active with the LTAP Gateway; Case Study in Providing Telecom Integration and Enhanced Services.
Proceedings of the Databases in Telecommunications, 1999

A Layered Architecture for Querying Dynamic Web Content.
Proceedings of the SIGMOD 1999, 1999

Personalizing the Web Using Site Descriptions.
Proceedings of the 10th International Workshop on Database & Expert Systems Applications, 1999

1998
Beyond Depth-First Strategies: Improving Tabled Logic Programs through Alternative Scheduling.
Journal of Functional and Logic Programming, 1998

Scheduling in SLG Revisited.
Proceedings of the 1st Workshop on Tabulation in Parsing and Deduction, 1998

Practical Problems in Coupling Deductive Engines with Relational Databases (Abstract).
Proceedings of the 5th International Workshop on Knowledge Representation Meets Databases (KRDB '98): Innovative Application Programming and Query Interfaces, 1998

1997
Controlling the Search in Tabled Evaluations.
Proceedings of the Logic Programming, 1997

XSB: A System for Efficiently Computing WFS.
Proceedings of the Logic Programming and Nonmonotonic Reasoning, 1997

Taking I/O Seriously: Resolution Reconsidered for Disk.
Proceedings of the Logic Programming, 1997

1996
Beyond Depth-First: Improving Tabled Logic Programs through Alternative Scheduling Strategies.
Proceedings of the Programming Languages: Implementations, 1996

Logic Programming and Databases Integrated at Last? (Poster Abstract).
Proceedings of the Logic Programming, 1996

1995
Exploiting Parallelism in Tabled Evaluations.
Proceedings of the Programming Languages: Implementations, 1995

1994
Parallelizing Tabled Evaluations (Extended Abstract).
Proceedings of the ILPS 94 Workshop on Design and Implementation of Parallel Logic Programming Systems, 1994

