Kyuseok Shim

According to our database, Kyuseok Shim authored at least 88 papers between 1992 and 2019.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Awards

ACM Fellow

ACM Fellow 2013, "For contributions to scalable data mining and query processing."

Bibliography

2019
Crowdsourced Truth Discovery in the Presence of Hierarchies for Knowledge Fusion.
Proceedings of the Advances in Database Technology, 2019

2018
Preface to the special issue on advances in Spatio-temporal data analysis and management.
GeoInformatica, 2018

2017
Preface to the special issue on big data search and mining.
World Wide Web, 2017

Efficient Processing of Skyline Queries Using MapReduce.
IEEE Trans. Knowl. Data Eng., 2017

Special Section on the International Conference on Data Engineering 2015.
IEEE Trans. Knowl. Data Eng., 2017

Efficient Haar+ Synopsis Construction for the Maximum Absolute Error Measure.
PVLDB, 2017

Integration of graphs from different data sources using crowdsourcing.
Inf. Sci., 2017

Latent ranking analysis using pairwise comparisons in crowdsourcing platforms.
Inf. Syst., 2017

2016
Parallel computation of k-nearest neighbor joins using MapReduce.
Proceedings of the 2016 IEEE International Conference on Big Data, 2016

2015
Processing of Probabilistic Skyline Queries Using MapReduce.
PVLDB, 2015

Aggregate query processing in the presence of duplicates in wireless sensor networks.
Inf. Sci., 2015

Supporting set-valued joins in NoSQL using MapReduce.
Inf. Syst., 2015

2014
TWINS: Efficient time-windowed in-network joins for sensor networks.
Inf. Sci., 2014

DBCURE-MR: An efficient density-based clustering algorithm for large data using MapReduce.
Inf. Syst., 2014

TWILITE: A recommendation system for Twitter using a probabilistic model based on latent Dirichlet allocation.
Inf. Syst., 2014

Latent Ranking Analysis Using Pairwise Comparisons.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

2013
Parallel Computation of Skyline and Reverse Skyline Queries Using MapReduce.
PVLDB, 2013

Efficient processing of substring match queries with inverted variable-length gram indexes.
Inf. Sci., 2013

DIGTOBI: a recommendation system for Digg articles using probabilistic modeling.
Proceedings of the 22nd International World Wide Web Conference, 2013

Efficient top-k algorithms for approximate substring matching.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

A remote cardiac monitoring system for preventive care.
Proceedings of the IEEE International Conference on Consumer Electronics, 2013

MapReduce Algorithms for Big Data Analysis.
Proceedings of the Databases in Networked Information Systems - 8th International Workshop, 2013

2012
Parallel Top-K Similarity Join Algorithms Using MapReduce.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Data Management Challenges and Opportunities in Cloud Computing.
Proceedings of the Database Systems for Advanced Applications, 2012

HotDigg: Finding Recent Hot Topics from Digg.
Proceedings of the Database Systems for Advanced Applications, 2012

A breast tumor classification method based on ultrasound BI-RADS data mining.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
TEXT: Automatic Template Extraction from Heterogeneous Web Pages.
IEEE Trans. Knowl. Data Eng., 2011

Similarity Join Size Estimation using Locality Sensitive Hashing.
PVLDB, 2011

CATCH: A detecting algorithm for coalition attacks of hit inflation in internet advertising.
Inf. Syst., 2011

TWITOBI: A Recommendation System for Twitter Using Probabilistic Modeling.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

2010
Approximate algorithms with generalizing attribute values for k-anonymity.
Inf. Syst., 2010

Efficient processing of substring match queries with inverted q-gram indexes.
Proceedings of the 26th International Conference on Data Engineering, 2010

2009
Power-Law Based Estimation of Set Similarity Join Size.
PVLDB, 2009

FAST: Flash-aware external sorting for mobile database systems.
Journal of Systems and Software, 2009

Approximate substring selectivity estimation.
Proceedings of the EDBT 2009, 2009

2008
Wavelet synopsis for hierarchical range queries with workloads.
VLDB J., 2008

2007
A Note on Linear Time Algorithms for Maximum Error Histograms.
IEEE Trans. Knowl. Data Eng., 2007

Extending Q-Grams to Estimate Selectivity of String Matching with Low Edit Distance.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Approximate algorithms for K-anonymity.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

2006
Approximation and streaming algorithms for histogram construction problems.
ACM Trans. Database Syst., 2006

Erratum to: "An adaptive path index for XML data using the query workload": [Information Systems 30(6) (2005) 467-487].
Inf. Syst., 2006

2005
An adaptive path index for XML data using the query workload.
Inf. Syst., 2005

Offline and Data Stream Algorithms for Efficient Computation of Synopsis Structures.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

2004
Recent Advances in Histogram Construction Algorithms.
Proceedings of the Advances in Web-Age Information Management: 5th International Conference, 2004

REHIST: Relative Error Histogram Construction Algorithms.
(e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

XWAVE: Approximate Extended Wavelets for Streaming Data.
(e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

SQUIRE: Sequential Pattern Mining with Quantities.
Proceedings of the 20th International Conference on Data Engineering, 2004

Storing XML (with XSD) in SQL Databases: Interplay of Logical and Physical Designs.
Proceedings of the 20th International Conference on Data Engineering, 2004

2003
DTD Inference from XML Documents: The XTRACT Approach.
IEEE Data Eng. Bull., 2003

Building Decision Trees with Constraints.
Data Min. Knowl. Discov., 2003

XTRACT: Learning Document Type Descriptors from XML Document Collections.
Data Min. Knowl. Discov., 2003

Storage and Retrieval of XML Data using Relational Databases.
Proceedings of the 19th International Conference on Data Engineering, 2003

Techniques for Clustering Massive Data Sets.
Proceedings of the Clustering and Information Retrieval, 2003

2002
Mining Sequential Patterns with Regular Expression Constraints.
IEEE Trans. Knowl. Data Eng., 2002

Reminiscences on Influential Papers.
SIGMOD Record, 2002

APEX: an adaptive path index for XML data.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

2001
Storage and Retrieval of XML Data Using Relational Databases.
Proceedings of the VLDB 2001, 2001

Data-streams and histograms.
Proceedings of the 33rd Annual ACM Symposium on Theory of Computing, 2001

2000
Workshop Report: 1999 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery.
SIGKDD Explorations, 2000

Editorial.
SIGKDD Explorations, 2000

Approximate Query Processing Using Wavelets.
Proceedings of the VLDB 2000, 2000

Efficient Algorithms for Mining Outliers from Large Data Sets.
Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, 2000

XTRACT: A System for Extracting Document Type Descriptors from XML Documents.
Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, 2000

Efficient algorithms for constructing decision trees with constraints.
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000

1999
Optimization of Queries with User-Defined Predicates.
ACM Trans. Database Syst., 1999

Data Mining and the Web: Past, Present and Future.
Proceedings of the ACM CIKM'99 2nd Workshop on Web Information and Data Management (WIDM'99), 1999

SPIRIT: Sequential Pattern Mining with Regular Expression Constraints.
Proceedings of the VLDB'99, 1999

WALRUS: A Similarity Retrieval Algorithm for Image Databases.
Proceedings of the SIGMOD 1999, 1999

Of Crawlers, Portals, Mice and Men: Is there more to Mining the Web? (Panel).
Proceedings of the SIGMOD 1999, 1999

Scalable Algorithms for Mining Large Databases.
Tutorial Notes for ACM SIGKDD 1999 International Conference on Knowledge Discovery and Data Mining, 1999

Mining Optimized Gain Rules for Numeric Attributes.
Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1999

Mining Optimized Support Rules for Numeric Attributes.
Proceedings of the 15th International Conference on Data Engineering, 1999

ROCK: A Robust Clustering Algorithm for Categorical Attributes.
Proceedings of the 15th International Conference on Data Engineering, 1999

1998
PUBLIC: A Decision Tree Classifier that Integrates Building and Pruning.
Proceedings of the VLDB'98, 1998

CURE: An Efficient Clustering Algorithm for Large Databases.
Proceedings of the SIGMOD 1998, 1998

Mining Optimized Association Rules with Categorical and Numeric Attributes.
Proceedings of the Fourteenth International Conference on Data Engineering, 1998

A Constraint-Based Spatial Extension to SQL.
Proceedings of the ACM-GIS '98, 1998

1997
High-Dimensional Similarity Joins.
Proceedings of the Thirteenth International Conference on Data Engineering, 1997

1996
Optimization of Queries with User-defined Predicates.
Proceedings of the VLDB'96, 1996

Developing Tightly-Coupled Data Mining Applications on a Relational Database System.
Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (KDD-96), 1996

Optimizing Queries with Aggregate Views.
Proceedings of the Advances in Database Technology, 1996

1995
An Overview of Cost-based Optimization of Queries with Aggregates.
IEEE Data Eng. Bull., 1995

Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases.
Proceedings of the VLDB'95, 1995

Optimizing Queries with Materialized Views.
Proceedings of the Eleventh International Conference on Data Engineering, 1995

1994
Improvements on a Heuristic Algorithm for Multiple-Query Optimization.
Data Knowl. Eng., 1994

Including Group-By in Query Optimization.
Proceedings of the VLDB'94, 1994

1993
Query Optimization in the Presence of Foreign Functions.
Proceedings of the 19th International Conference on Very Large Data Bases, 1993

1992
Parametric Query Optimization.
Proceedings of the 18th International Conference on Very Large Data Bases, 1992

