Valter Crescenzi

Affiliations:
  • Roma Tre University, Rome, Italy


According to our database1, Valter Crescenzi authored at least 55 papers between 1998 and 2022.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2022
OpenTRIAGE: Entity Linkage for Detail Webpages.
Proceedings of the 30th Italian Symposium on Advanced Database Systems, 2022

2021
The Smallest Extraction Problem.
Proc. VLDB Endow., 2021

Alaska: A Flexible Benchmark for Data Integration Tasks.
CoRR, 2021

NOAH: Creating Data Integration Pipelines over Continuously Extracted Web Data.
Proceedings of the Workshops of the EDBT/ICDT 2021 Joint Conference, 2021

2019
Hybrid Crowd-Machine Wrapper Inference.
ACM Trans. Knowl. Discov. Data, 2019

RED: Redundancy-Driven Data Extraction from Result Pages?
Proceedings of the World Wide Web Conference, 2019

2018
Big Data Integration for Product Specifications.
IEEE Data Eng. Bull., 2018

Big Data Linkage for Product Specification Pages.
Proceedings of the 2018 International Conference on Management of Data, 2018


Lessons Learned and Research Agenda for Big Data Integration of Product Specifications.
Proceedings of the 26th Italian Symposium on Advanced Database Systems, 2018

2017
Crowdsourcing for data management.
Knowl. Inf. Syst., 2017

2015
Web Content Extraction: a MetaAnalysis of its Past and Thoughts on its Future.
SIGKDD Explor., 2015

Crowdsourcing large scale wrapper inference.
Distributed Parallel Databases, 2015

Web Content Extraction - a Meta-Analysis of its Past and Thoughts on its Future.
CoRR, 2015

2014
Web-Scale Extension of RDF Knowledge Bases from Templated Websites.
Proceedings of the Semantic Web - ISWC 2014, 2014

2013
Extraction and Integration of Partially Overlapping Web Sources.
Proc. VLDB Endow., 2013

ALFRED: crowd assisted data extraction.
Proceedings of the 22nd International World Wide Web Conference, 2013

A framework for learning web wrappers from the crowd.
Proceedings of the 22nd International World Wide Web Conference, 2013

Wrapper Generation Supervised by a Noisy Crowd.
Proceedings of the First VLDB Workshop on Databases and Crowdsourcing, 2013

2012
Minimizing the Costs of the Training Data for Learning Web Wrappers.
Proceedings of the Second International Workshop on Searching and Integrating New Web Data Sources, 2012

Web Data Reconciliation: Models and Experiences.
Proceedings of the Search Computing - Broadening Web Search, 2012

2011
Characterizing the uncertainty of web data: models and experiences.
Proceedings of the 2011 Joint WICOW/AIRWeb Workshop on Web Quality, 2011

Automatically building probabilistic databases from the web.
Proceedings of the 20th International Conference on World Wide Web, 2011

Wrapper Generation for Overlapping Web Sources.
Proceedings of the 2011 IEEE/WIC/ACM International Conference on Web Intelligence, 2011

Contextual Data Extraction and Instance-Based Integration.
Proceedings of the First International Workshop on Searching and Integrating New Web Data Sources, 2011

2010
Exploiting information redundancy to wring out structured data from the web.
Proceedings of the 19th International Conference on World Wide Web, 2010

Redundancy-Driven Web Data Extraction and Integration.
Proceedings of the 13th International Workshop on the Web and Databases 2010, 2010

Probabilistic Reconciliation of Records from Inaccurate Web Sources (Extended Abstract).
Proceedings of the Eighteenth Italian Symposium on Advanced Database Systems, 2010

Probabilistic Models to Reconcile Complex Data from Inaccurate Data Sources.
Proceedings of the Advanced Information Systems Engineering, 22nd International Conference, 2010

2009
Data Extraction and Integration from Imprecise Web Sources.
Proceedings of the Seventeenth Italian Symposium on Advanced Database Systems, 2009

2008
Structure and Semantics of Data-IntensiveWeb Pages: An Experimental Study on their Relationships.
J. Univers. Comput. Sci., 2008

Wrapper Inference for Ambiguous Web Pages.
Appl. Artif. Intell., 2008

Supporting the automatic construction of entity aware search engines.
Proceedings of the 10th ACM International Workshop on Web Information and Data Management (WIDM 2008), 2008

Searching Entities on the Web by Sample.
Proceedings of the Sixteenth Italian Symposium on Advanced Database Systems, 2008

Crawling programs for wrapper-based applications.
Proceedings of the IEEE International Conference on Information Reuse and Integration, 2008

Flint: Google-basing the Web.
Proceedings of the EDBT 2008, 2008

2006
Efficient Techniques for Effective Wrapper Induction.
Proceedings of the 22nd International Conference on Data Engineering Workshops, 2006

2005
Clustering Web pages based on their structure.
Data Knowl. Eng., 2005

Efficiently Locating Collections of Web Pages to Wrap.
Proceedings of the WEBIST 2005, 2005

Harvesting Structurally Similar Pages.
Proceedings of the Thirteenth Italian Symposium on Advanced Database Systems, 2005

2004
Automatic information extraction from large websites.
J. ACM, 2004

An Automatic Data Grabber for Large Web Sites.
Proceedings of the (e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

Improving the expressiveness of ROADRUNNER.
Proceedings of the Twelfth Italian Symposium on Advanced Database Systems, 2004

2003
Fine-grain web site structure discovery.
Proceedings of the Fifth ACM CIKM International Workshop on Web Information and Data Management (WIDM 2003), 2003

Automatic annotation of data extracted from large Web sites.
Proceedings of the International Workshop on Web and Databases, 2003

2002
RoadRunner: automatic data extraction from data-intensive web sites.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

Back to Gold's Age: Bridging the Gap Between Traditional Grammar Inference and Web Information Extraction.
Proceedings of the Decimo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, 2002

Wrapping-oriented classification of web pages.
Proceedings of the 2002 ACM Symposium on Applied Computing (SAC), 2002

2001
RoadRunner: Towards Automatic Data Extraction from Large Web Sites.
Proceedings of the VLDB 2001, 2001

The RoadRunner Web Data Extraction System.
Proceedings of the Nono Convegno Nazionale Sistemi Evoluti per Basi di Dati, 2001

Automatic Web Information Extraction in the ROADRUNNER System.
Proceedings of the ER 2001 Workshops, 2001

2000
Experiences in XML data management.
Proceedings of the Ottavo Convegno Nazionale su Sistemi Evoluti per Basi di Dati, 2000

1999
The (Short) Araneus Guide to Web-Site Development.
Proceedings of the ACM SIGMOD Workshop on The Web and Databases, 1999

The ARANEUS Guide to Web-Site Development.
Proceedings of the Atti del Settimo Convegno Nazionale Sistemi Evoluti per Basi di Dati, 1999

1998
Grammars Have Exceptions.
Inf. Syst., 1998


  Loading...