Altigran Soares da Silva

According to our database1, Altigran Soares da Silva authored at least 124 papers between 1996 and 2018.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2018
Lathe: light-Weight Keyword Query Processing over Multiple Databases.
Proceedings of the XXXIII Simpósio Brasileiro de Banco de Dados: Demos e WTDBD, 2018

Match-Based Candidate Network Generation for Keyword Queries over Relational Databases.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

2017
Color and texture applied to a signature-based bag of visual words method for image retrieval.
Multimedia Tools Appl., 2017

Waves: a fast multi-tier top-k query processing algorithm.
Inf. Retr. Journal, 2017

2016
Finding seeds to bootstrap focused crawlers.
World Wide Web, 2016

Fast top-k preserving query processing using two-tier indexes.
Inf. Process. Manage., 2016

LCA-based algorithms for efficiently processing multiple keyword queries over XML streams.
Data Knowl. Eng., 2016

Towards the Effective Linking of Social Media Contents to Products in E-Commerce Catalogs.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

2015
Removing DUST Using Multiple Alignment of Sequences.
IEEE Trans. Knowl. Data Eng., 2015

A signature-based bag of visual words method for image indexing and search.
Pattern Recognition Letters, 2015

Heuristics to Improve the BMW Method and Its Variants.
JIDM, 2015

A genetic programming framework to schedule webpage updates.
Inf. Retr. Journal, 2015

Heurísticas para Aprimorar o Método BMW e suas Variantes.
Proceedings of the XXX Simpósio Brasileiro de Banco de Dados, 2015

Using active learning techniques for improving database schema matching methods.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

Ranking Candidate Networks of relations to improve keyword search over relational databases.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

A Self-training CRF Method for Recognizing Product Model Mentions in Web Forums.
Proceedings of the Advances in Information Retrieval, 2015

Finding Similar Products in E-commerce Sites Based on Attributes.
Proceedings of the 9th Alberto Mendelzon International Workshop on Foundations of Data Management, Lima, Peru, May 6, 2015

2014
Learning to expand queries using entities.
JASIST, 2014

MKStream: An Efficient Algorithm for Processing Multiple Keyword Queries over XML Streams.
Proceedings of the Conceptual Modeling - 33rd International Conference, 2014

2013
Unsupervised Information Extraction by Text Segmentation.
Springer Briefs in Computer Science, Springer, ISBN: 978-3-319-02597-1, 2013

An evolutionary approach to complex schema matching.
Inf. Syst., 2013

FS-NER: a lightweight filter-stream approach to named entity recognition on twitter data.
Proceedings of the 22nd International World Wide Web Conference, 2013

Learning to Schedule Webpage Updates Using Genetic Programming.
Proceedings of the String Processing and Information Retrieval, 2013

Learning URL Normalization Rules Using Multiple Alignment of Sequences.
Proceedings of the String Processing and Information Retrieval, 2013

Fast document-at-a-time query processing using two-tier indexes.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

Filter-Stream Named Entity Recognition: A Case Study at the #MSM2013 Concept Extraction Challenge.
Proceedings of the Concept Extraction Challenge at the Workshop on 'Making Sense of Microposts', 2013

2012
A Genetic Programming Approach to Record Deduplication.
IEEE Trans. Knowl. Data Eng., 2012

Using Taxonomies for Product Recommendation.
JIDM, 2012

LePrEF: Learn to precompute evidence fusion for efficient query evaluation.
JASIST, 2012

Sorted dominant local color for searching large and heterogeneous image databases.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Methods and Techniques for Information Extraction by Text Segmentation.
Proceedings of the 6th Alberto Mendelzon International Workshop on Foundations of Data Management, 2012

Named Entity Disambiguation in Streaming Data.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Information Retrieval Research at UFMG.
JIDM, 2011

On Using Wikipedia to Build Knowledge Bases for Information Extraction by Text Segmentation.
JIDM, 2011

The Database and Information Retrieval Research Group at UFAM.
JIDM, 2011

Lightweight methods for large-scale product categorization.
JASIST, 2011

A New Approach for Verifying URL Uniqueness in Web Crawlers.
Proceedings of the String Processing and Information Retrieval, 2011

Joint unsupervised structure discovery and information extraction.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

A site oriented method for segmenting web pages.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

A source independent framework for research paper recommendation.
Proceedings of the 2011 Joint International Conference on Digital Libraries, 2011


Semi-supervised genetic programming for classification.
Proceedings of the 13th Annual Genetic and Evolutionary Computation Conference, 2011

Multiple keyword-based queries over XML streams.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

2010
A Probabilistic Approach for Automatically Filling Form-Based Web Interfaces.
PVLDB, 2010

Adaptive and Flexible Blocking for Record Linkage Tasks.
JIDM, 2010

Using structural information to improve search in Web collections.
JASIST, 2010

Information Systems Special Issue on SBBD 2007.
Inf. Syst., 2010

Exploring features for the automatic identification of user goals in web search.
Inf. Process. Manage., 2010

A Self-Supervised Approach for Extraction of Attribute-Value Pairs from Wikipedia Articles.
Proceedings of the String Processing and Information Retrieval, 2010

ONDUX: on-demand unsupervised learning for information extraction.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Automatically Generating Structured Queries in XML Keyword Search.
Proceedings of the Comparative Evaluation of Focused Retrieval, 2010

Active Learning Genetic programming for record deduplication.
Proceedings of the IEEE Congress on Evolutionary Computation, 2010

2009
On Finding Templates on Web Collections.
World Wide Web, 2009

A Genre-Aware Approach to Focused Crawling.
World Wide Web, 2009

A flexible approach for extracting metadata from bibliographic citations.
JASIST, 2009

An evolutionary approach for combining different sources of evidence in search engines.
Inf. Syst., 2009

A strategy for allowing meaningful and comparable scores in approximate matching.
Inf. Syst., 2009

Automatically filling form-based web interfaces with free text inputs.
Proceedings of the 18th International Conference on World Wide Web, 2009

Blocagem Adaptativa e Flexível para o Pareamento Aproximado de Registros.
Proceedings of the XXIV Simpósio Brasileiro de Banco de Dados, 2009

2008
Locality-Based pruning methods for web search.
ACM Trans. Inf. Syst., 2008

Structure-Based Crawling in the Hidden Web.
J. UCS, 2008

Replica identification using genetic programming.
Proceedings of the 2008 ACM Symposium on Applied Computing (SAC), 2008

The impact of term selection in genre-aware focused crawling.
Proceedings of the 2008 ACM Symposium on Applied Computing (SAC), 2008

Cooperative Research on Web Data Management at UFMG and UFAM - A Brief Report.
Proceedings of the Latin American Web Conference, 2008

Siphon++: a hidden-webcrawler for keyword-based interfaces.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

2007
LABRADOR: Efficiently publishing relational databases on the web by using keyword-based query interfaces.
Inf. Process. Manage., 2007

A cost-effective method for detecting web site replicas on search engine databases.
Data Knowl. Eng., 2007

An approach to XML path matching.
Proceedings of the 9th ACM International Workshop on Web Information and Data Management (WIDM 2007), 2007

FleDEx: flexible data exchange.
Proceedings of the 9th ACM International Workshop on Web Information and Data Management (WIDM 2007), 2007

Exploiting Genre in Focused Crawling.
Proceedings of the String Processing and Information Retrieval, 2007

A Scalable Parallel Deduplication Algorithm.
Proceedings of the 19th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2007), 2007

Labeling Data Extracted from the Web.
Proceedings of the On the Move to Meaningful Internet Systems 2007: CoopIS, 2007

FLUX-CIM: flexible unsupervised extraction of citation metadata.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2007

Organizing Hidden-Web Databases by Clustering Visible Web Documents.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Computing block importance for searching on web sites.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

A strategy for allowing meaningful and comparable scores in approximate matching.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

2006
GoGetIt!: a tool for generating structure-driven web crawlers.
Proceedings of the 15th international conference on World Wide Web, 2006

Structure-driven crawler generation by example.
Proceedings of the SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2006

Extração de Dados e Metadados em Textos Semi-estruturados usando HMMs.
Proceedings of the XXI Simpósio Brasileiro de Banco de Dados, 2006

Extracting and Searching Useful Information Available on Web FAQs.
Proceedings of the XXI Simpósio Brasileiro de Banco de Dados, 2006

Learning to deduplicate.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2006

A fast and robust method for web page template detection and removal.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

2005
Improving Web search efficiency via a locality based static pruning method.
Proceedings of the 14th international conference on World Wide Web, 2005

Detecção de Réplicas Utilizando Conteúdo e Estrutura.
Proceedings of the 20° Simpósio Brasileiro de Bancos de Dados, 2005

Integrating Web Data and Geographic Knowledge into Spatial Databases.
Proceedings of the Spatial Databases: Technologies, Techniques and Trends, 2005

2004
A Bayesian network approach to searching Web databases through keyword-based queries.
Inf. Process. Manage., 2004

Automatic generation of agents for collecting hidden Web pages for data extraction.
Data Knowl. Eng., 2004

Automatic web news extraction using tree edit distance.
Proceedings of the 13th international conference on World Wide Web, 2004

Measuring similarity between collection of values.
Proceedings of the Sixth ACM CIKM International Workshop on Web Information and Data Management (WIDM 2004), 2004

The effectiveness of automatically structured queries in digital libraries.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2004

Information Retrieval Aware Web Site Modelling and Generation.
Proceedings of the Conceptual Modeling, 2004

2003
Finding similar identities among objects from multiple web sources.
Proceedings of the Fifth ACM CIKM International Workshop on Web Information and Data Management (WIDM 2003), 2003

Verificação Automática da Qualidade de Dados Extraídos da Web.
Proceedings of the XVIII Simpósio Brasileiro de Bancos de Dados, 2003

The Web-DL Environment for Building Digital Libraries from the Web.
Proceedings of the ACM/IEEE 2003 Joint Conference on Digital Libraries (JCDL 2003), 2003

The Web as a Data Source for Spatial Databases.
Proceedings of the Anais GeoInfo 2003, 2003

Keyword-Based Queries Over Web Databases.
Effective Databases for Text & Document Management, 2003

2002
Estratégias baseadas em exemplos para extração de dados semi-estruturados da web.
PhD thesis, 2002

A Brief Survey of Web Data Extraction Tools.
SIGMOD Record, 2002

The Debye Environment for Web Data Management.
IEEE Internet Computing, 2002

DEByE - Data Extraction By Example.
Data Knowl. Eng., 2002

Collecting hidden web pages for data extraction.
Proceedings of the Fourth ACM CIKM International Workshop on Web Information and Data Management (WIDM 2002), 2002

A Framework for Generating Attribute Extractors for Web Data Sources.
Proceedings of the String Processing and Information Retrieval, 2002

Consultando Bancos de Dados Disponíveis na Web Usando Palavras-Chave.
Proceedings of the XVII Simpósio Brasileiro de Banco de Dados, 2002

Structuring keyword-based queries for web databases.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2002

Representing and Querying Semistructured Web Data Using Nested Tables with Structural Variants.
Proceedings of the Conceptual Modeling, 2002

Searching web databases by structuring keyword-based queries.
Proceedings of the 2002 ACM CIKM International Conference on Information and Knowledge Management, 2002

Web-DL: an experience in building digital libraries from the web.
Proceedings of the 2002 ACM CIKM International Conference on Information and Knowledge Management, 2002

Using Nested Tables for Representing and Querying Semistructured Web Data.
Proceedings of the Advanced Information Systems Engineering, 14th International Conference, 2002

2001
Querying Semistructured Data By Example: The QSByE Interface.
Proceedings of the International Workshop on Information Integration on the Web, 2001

An Environment for Building and Maintaining Web Views.
Proceedings of the International Workshop on Information Integration on the Web, 2001

Storing Semistructured Data in Relational Databases.
Proceedings of the Eighth International Symposium on String Processing and Information Retrieval, 2001

Uma Abordagem para Armazenamento de Dados Semi-Estruturados em Bancos de Dados Relacionais.
Proceedings of the XVI Simpósio Brasileiro de Banco de Dados, 2001

Managing Web Data through Views.
Proceedings of the Electronic Commerce and Web Technologies, 2001

Bootstrapping for Example-Based Data Extraction.
Proceedings of the 2001 ACM CIKM International Conference on Information and Knowledge Management, 2001

2000
On the relational representation of complex specialization structures.
Inf. Syst., 2000

Uma Interface Gráfica para Consulta a Fontes de Dados XML.
Proceedings of the XV Simpósio Brasileiro de Banco de Dados, 2000

ASByE: uma Ferramenta Baseada em Exemplos para Especificação de Agentes para Coleta de Documentos Web.
Proceedings of the XV Simpósio Brasileiro de Banco de Dados, 2000

An Example-Based Environment for Wrapper Generation.
Proceedings of the Conceptual Modeling for E-Business and the Web, 2000

Representing Web Data as Complex Objects.
Proceedings of the Electronic Commerce and Web Technologies, 2000

1999
CoBWeb - A Crawler for the Brazilian Web.
Proceedings of the Sixth International Symposium on String Processing and Information Retrieval and Fifth International Workshop on Groupware, 1999

Top-down Extraction of Semi-Structured Data.
Proceedings of the Sixth International Symposium on String Processing and Information Retrieval and Fifth International Workshop on Groupware, 1999

DEByE - Uma ferramenta para Extração de Dados Semi-Estruturados.
Proceedings of the XIV Simpósio Brasileiro de Banco de Dados, 1999

Extracting Semi-Structured Data Through Examples.
Proceedings of the 1999 ACM CIKM International Conference on Information and Knowledge Management, 1999

1996
An Approach to Maintaining Optimized Relational Representations of Entity-Relationship Schemas.
Proceedings of the Conceptual Modeling, 1996


  Loading...