Dennis E. Shasha

Orcid: 0000-0002-7036-3312

Affiliations:
  • New York University, Courant Institute of Mathematical Sciences


According to our database, Dennis E. Shasha authored at least 308 papers between 1984 and 2024.

Awards

ACM Fellow

ACM Fellow 2013, "For technical and literary contributions over a broad range of data management topics."

Bibliography

2024
Bankruptcy prediction with low-quality financial information.
Expert Syst. Appl., March, 2024

2023
EnsInfer: a simple ensemble approach to network inference outperforms any single method.
BMC Bioinform., December, 2023

Tidy Towers.
Commun. ACM, October, 2023

Sampling the Goods.
Commun. ACM, July, 2023

Forgetful Forests: Data Structures for Machine Learning on Streaming Data under Concept Drift.
Algorithms, June, 2023

Correction to: BugDoc Iterative debugging and explanation of pipeline executions.
VLDB J., March, 2023

BugDoc.
VLDB J., January, 2023

Life Science Workflow Services (LifeSWS): Motivations and Architecture.
Trans. Large Scale Data Knowl. Centered Syst., 2023

Data Structures for Data-Intensive Applications: Tradeoffs and Design Guidelines.
Found. Trends Databases, 2023

On the calibration of compartmental epidemiological models.
CoRR, 2023

Cooperation in the Commons.
Commun. ACM, 2023

Maximal Cocktails.
Commun. ACM, 2023

Planning Multiple Epidemic Interventions with Reinforcement Learning.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

2022
AcX: System, Techniques, and Experiments for Acronym Expansion.
Proc. VLDB Endow., 2022

Forgetful Forests: high performance learning data structures for streaming data under concept drift.
CoRR, 2022

Card Nim.
Commun. ACM, 2022

Exclusivity probes.
Commun. ACM, 2022

Beating the house.
Commun. ACM, 2022

Orbit design.
Commun. ACM, 2022

2021
Automated Verification of Concurrent Search Structures
Synthesis Lectures on Computer Science, Morgan & Claypool Publishers, ISBN: 978-3-031-01806-0, 2021

Statistics is Easy: Case Studies on Real Scientific Datasets
Synthesis Lectures on Mathematics & Statistics, Morgan & Claypool Publishers, ISBN: 978-3-031-02433-7, 2021

SafePredict: A Meta-Algorithm for Machine Learning That Uses Refusals to Guarantee Correctness.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Verifying concurrent multicopy search structures.
Proc. ACM Program. Lang., 2021

BestNeighbor: efficient evaluation of kNN queries on large time series databases.
Knowl. Inf. Syst., 2021

EpiPolicy: a tool for combating epidemics.
XRDS, 2021

Classification Under Ambiguity: When Is Average-K Better Than Top-K?
CoRR, 2021

Pi-Radio v1: Calibration techniques to enable fully-digital beamforming at 60 GHz.
Comput. Networks, 2021

Randomower.
Commun. ACM, 2021

String me along.
Commun. ACM, 2021

Roulette Angel.
Commun. ACM, 2021

Stay in balance.
Commun. ACM, 2021

Pheniqs 2.0: accurate, high-performance Bayesian decoding and confidence estimation for combinatorial barcode indexing.
BMC Bioinform., 2021

Planning Epidemic Interventions with EpiPolicy.
Proceedings of the UIST '21: The 34th Annual ACM Symposium on User Interface Software and Technology, 2021

Acronym Expander at SDU@AAAI-21: an Acronym Disambiguation Module.
Proceedings of the Workshop on Scientific Document Understanding co-located with 35th AAAI Conference on Artificial Intelligence, 2021

2020
Robotic Room Traversal using Optical Range Finding.
CoRR, 2020

MultiRI: Fast Subgraph Matching in Labeled Multigraphs.
CoRR, 2020

Privacy-preserving polling.
Commun. ACM, 2020

Strategic paddling.
Commun. ACM, 2020

Optimal chimes.
Commun. ACM, 2020

Stopping tyranny.
Commun. ACM, 2020

Feedback for foxes.
Commun. ACM, 2020

BugDoc: A System for Debugging Computational Pipelines.
Proceedings of the 2020 International Conference on Management of Data, 2020

BugDoc: Algorithms to Debug Computational Processes.
Proceedings of the 2020 International Conference on Management of Data, 2020

Verifying concurrent search structure templates.
Proceedings of the 41st ACM SIGPLAN International Conference on Programming Language Design and Implementation, 2020

Fully-digital beamforming demonstration with Pi-Radio mmWave SDR platform.
Proceedings of the Mobihoc '20: The Twenty-first ACM International Symposium on Theory, 2020

Calibrating a 4-channel Fully-Digital 60 GHz SDR.
Proceedings of the WiNTECH@MobiCom 2020: Proceedings of the 14th International Workshop on Wireless Network Testbeds, 2020

2019
VersionClimber: Version Upgrades Without Tears.
Comput. Sci. Eng., 2019

Fast methods for finding significant motifs on labelled multi-relational networks.
J. Complex Networks, 2019

Dust wars.
Commun. ACM, 2019

Opioid games.
Commun. ACM, 2019

Fighting for lava.
Commun. ACM, 2019

Randomized anti-counterfeiting.
Commun. ACM, 2019

TACITuS: transcriptomic data collector, integrator, and selector on big data platform.
BMC Bioinform., 2019

Debugging Machine Learning Pipelines.
Proceedings of the 3rd International Workshop on Data Management for End-to-End Machine Learning, 2019

Distributed Algorithms to Find Similar Time Series.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

Deferred Runtime Pipelining for contentious multicore software transactions.
Proceedings of the Fourteenth EuroSys Conference 2019, Dresden, Germany, March 25-28, 2019, 2019

2018
Transaction Chopping.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Tuning Concurrency Control.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Schema Tuning.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Physical Layer Tuning.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Performance Monitoring Tools.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Index Tuning.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Database Benchmarks.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Data Generation.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Benchmark Frameworks.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Application-Level Tuning.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Administration Wizards.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Go with the flow: compositional abstractions for concurrent data structures.
Proc. ACM Program. Lang., 2018

ParCorr: efficient parallel methods to identify similar time series pairs across sliding windows.
Data Min. Knowl. Discov., 2018

Fast analytical methods for finding significant labeled graph motifs.
Data Min. Knowl. Discov., 2018

Bounce blockchain.
Commun. ACM, 2018

String wars.
Commun. ACM, 2018

Finding October.
Commun. ACM, 2018

Polychromatic choreography.
Commun. ACM, 2018

SuperNoder: a tool to discover over-represented modular structures in networks.
BMC Bioinform., 2018

Scientific Data Analysis Using Data-Intensive Scalable Computing: The SciDISC Project.
Proceedings of the Latin America Data Science Workshop co-located with 44th International Conference on Very Large Data Bases (VLDB 2018), 2018

Point pattern search in big data.
Proceedings of the 30th International Conference on Scientific and Statistical Database Management, 2018

Reducing Errors by Refusing to Guess (Occasionally).
Proceedings of the XXXIII Simpósio Brasileiro de Banco de Dados, 2018

Constellation Queries over Big Data.
Proceedings of the XXXIII Simpósio Brasileiro de Banco de Dados, 2018

Simple Pattern-only Heuristics Lead to Fast Subgraph Matching Strategies on Very Large Networks.
Proceedings of the Practical Applications of Computational Biology and Bioinformatics, 2018

Improving Tourism Prediction Models Using Climate and Social Media Data: A Fine-Grained Approach.
Proceedings of the Twelfth International Conference on Web and Social Media, 2018

Spark-parSketch: A Massively Distributed Indexing of Time Series Datasets.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

gLabTrie: A Data Structure for Motif Discovery with Constraints.
Proceedings of the Graph Data Management, Fundamental Issues and Recent Developments, 2018

2017
Crowdsourcing Thousands of Specialized Labels: A Bayesian Active Training Approach.
IEEE Trans. Multim., 2017

Go with the Flow: Compositional Abstractions for Concurrent Data Structures (Extended Version).
CoRR, 2017

Partitioned peace.
Commun. ACM, 2017

Ruby risks.
Commun. ACM, 2017

Stacking the deck.
Commun. ACM, 2017

Open field tic-tac-toe.
Commun. ACM, 2017

RadiusSketch: Massively Distributed Indexing of Time Series.
Proceedings of the 2017 IEEE International Conference on Data Science and Advanced Analytics, 2017

Pre-processing and Indexing Techniques for Constellation Queries in Big Data.
Proceedings of the Big Data Analytics and Knowledge Discovery, 2017

2016
ReproZip: The Reproducibility Packer.
J. Open Source Softw., 2016

A collaborative approach to computational reproducibility.
Inf. Syst., 2016

Find me quickly.
Commun. ACM, 2016

Upstart Puzzles: Chair Games.
Commun. ACM, 2016

Upstart Puzzles: Ice Trap.
Commun. ACM, 2016

Conjugate Conformal Prediction for Online Binary Classification.
Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016

ReproZip: Computational Reproducibility With Ease.
Proceedings of the 2016 International Conference on Management of Data, 2016

A Course on Programming and Problem Solving.
Proceedings of the 47th ACM Technical Symposium on Computing Science Education, 2016

ThePlantGame: Actively Training Human Annotators for Domain-specific Crowdsourcing.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Alphacodes: Usable, Secure Transactions with Untrusted Providers using Human Computable Puzzles.
Proceedings of the 7th Annual Symposium on Computing for Development, 2016

Fast Methods for Statistical Arbitrage.
Proceedings of the Data Stream Management - Processing High-Speed Data Streams, 2016

2015
NetMatchStar: an enhanced Cytoscape network querying app.
F1000Research, 2015

Upstart Puzzles: Auction Triplets.
Commun. ACM, 2015

Upstart puzzles.
Commun. ACM, 2015

Upstart Puzzles: Strategic Friendship.
Commun. ACM, 2015

Upstart Puzzles: Take Your Seats.
Commun. ACM, 2015

Quiet: Faster Belief Propagation for Images and Related Applications.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Positive-Unlabeled Learning in the Face of Labeling Bias.
Proceedings of the IEEE International Conference on Data Mining Workshop, 2015

2014
Negative Example Selection for Protein Function Prediction: The NoGO Database.
PLoS Comput. Biol., 2014

A model project for reproducible papers: critical temperature for the Ising model on a square lattice.
CoRR, 2014

Upstart Puzzles: Proving without Teaching/Teaching without Proving.
Commun. ACM, 2014

Tuning Database Design for High Performance.
Proceedings of the Computing Handbook, 2014

2013
A Computational Reproducibility Benchmark.
IEEE Data Eng. Bull., 2013

Locality Optimization for Data Parallel Programs
CoRR, 2013

A subgraph isomorphism algorithm and its application to biochemical data.
BMC Bioinform., 2013

Parametric Bayesian priors and better choice of negative examples improve protein function prediction.
Bioinform., 2013

ReproZip: Using Provenance to Support Computational Reproducibility.
Proceedings of the 5th Workshop on the Theory and Practice of Provenance, 2013

Packing experiments for sharing and publication.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Tuning in action.
Proceedings of the Joint 2013 EDBT/ICDT Conferences, 2013

AppSleuth: a tool for database tuning at the application level.
Proceedings of the Joint 2013 EDBT/ICDT Conferences, 2013

2012
Network Inference in Molecular Biology - A Hands-on Framework
Springer Briefs in Electrical and Computer Engineering, Springer, ISBN: 978-1-4614-3113-8, 2012

Fast Elastic Peak Detection for Mass Spectrometry Data Mining.
IEEE Trans. Knowl. Data Eng., 2012

Future of computing: inspiration from nature.
XRDS, 2012

miR-EdiTar: a database of predicted A-to-I edited miRNA target sites.
Bioinform., 2012

JustMyFriends: full SQL, full transactional amenities, and access privacy.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Computational reproducibility: state-of-the-art, challenges, and database research opportunities.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Parakeet: A Just-In-Time Parallel Accelerator for Python.
Proceedings of the 4th USENIX Workshop on Hot Topics in Parallelism, 2012

2011
Storing Clocked Programs Inside DNA: A Simplifying Framework for Nanocomputing
Synthesis Lectures on Computer Science, Morgan & Claypool Publishers, ISBN: 978-3-031-01797-1, 2011

Repeatability and workability evaluation of SIGMOD 2011.
SIGMOD Rec., 2011

Exploring the Coming Repositories of Reproducible Experiments: Challenges and Opportunities.
Proc. VLDB Endow., 2011

Behind Efficient Algorithms to Search in Graphs.
Proceedings of the Graph Data Management: Techniques and Applications, 2011

2010
Statistics is Easy! Second Edition
Synthesis Lectures on Mathematics and Statistics, Morgan & Claypool Publishers, ISBN: 978-3-031-02400-9, 2010

An interview with Michael Rabin.
Commun. ACM, 2010

SING: Subgraph search In Non-homogeneous Graphs.
BMC Bioinform., 2010

Enhancing Graph Database Indexing by Suffix Tree Structure.
Proceedings of the Pattern Recognition in Bioinformatics, 2010

2009
Transaction Chopping.
Proceedings of the Encyclopedia of Database Systems, 2009

Tuning Concurrency Control.
Proceedings of the Encyclopedia of Database Systems, 2009

Schema Tuning.
Proceedings of the Encyclopedia of Database Systems, 2009

Physical Layer Tuning.
Proceedings of the Encyclopedia of Database Systems, 2009

Performance Monitoring Tools.
Proceedings of the Encyclopedia of Database Systems, 2009

Index Tuning.
Proceedings of the Encyclopedia of Database Systems, 2009

Application-Level Tuning.
Proceedings of the Encyclopedia of Database Systems, 2009

Administration Wizards.
Proceedings of the Encyclopedia of Database Systems, 2009

Foreword to TODS SIGMOD/PODS 2008 special issue.
ACM Trans. Database Syst., 2009

Repeatability & workability evaluation of SIGMOD 2009.
SIGMOD Rec., 2009

A Systems Approach Uncovers Restrictions for Signal Interactions Regulating Genome-wide Responses to Nutritional Cues in Arabidopsis.
PLoS Comput. Biol., 2009

DNA Hash Pooling and its Applications.
Int. J. Nanotechnol. Mol. Comput., 2009

Revelation on demand.
Distributed Parallel Databases, 2009

miRò: a miRNA knowledge base.
Database J. Biol. Databases Curation, 2009

The Blind Stone Tablet: Outsourcing Durability to Untrusted Parties.
Proceedings of the Network and Distributed System Security Symposium, 2009

2008
Statistics is Easy!
Synthesis Lectures on Mathematics and Statistics, Morgan & Claypool Publishers, ISBN: 978-3-031-02393-4, 2008

The repeatability experiment of SIGMOD 2008.
SIGMOD Rec., 2008

Graphclust: a Method for Clustering Database of Graphs.
J. Inf. Knowl. Manag., 2008

An integrated genetic, genomic and systems approach defines gene networks regulated by the interaction of light and carbon signaling pathways in Arabidopsis.
BMC Syst. Biol., 2008

GraphFind: enhancing graph searching by low support data mining techniques.
BMC Bioinform., 2008

Biocomputational puzzles: data, algorithms, and visualization.
Proceedings of the EDBT 2008, 2008

2007
Sungear: interactive visualization and functional analysis of genomic datasets.
Bioinform., 2007

NetMatch: a Cytoscape plugin for searching biological networks.
Bioinform., 2007

GhostDB: Hiding Data from Prying Eyes.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

GhostDB: querying visible and hidden data without leaks.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

Homology search for genes.
Proceedings of the 15th International Conference on Intelligent Systems for Molecular Biology (ISMB) & 6th European Conference on Computational Biology (ECCB), 2007

Querying and Aggregating Visible and Hidden Data Without Leaks.
Proceedings of the 23èmes Journées Bases de Données Avancées, 2007

2006
Better Burst Detection.
Proceedings of the 22nd International Conference on Data Engineering, 2006

StrangerDB: Safe Data Management with Untrusted Servers.
Proceedings of the 13th International Conference on Management of Data, 2006

The puzzler's elusion - a tale of fraud, pursuit, and the art of logic.
Thunder's Mouth Press, ISBN: 978-1-56025-831-5, 2006

2005
MetricMap: an embedding technique for processing distance-based queries in metric spaces.
IEEE Trans. Syst. Man Cybern. Part B, 2005

Making snapshot isolation serializable.
ACM Trans. Database Syst., 2005

Antipole Tree Indexing to Support Range Search and K-Nearest Neighbor Search in Metric Spaces.
IEEE Trans. Knowl. Data Eng., 2005

Computing for biologists: lessons from some successful case studies.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2005

Fast window correlations over uncooperative time series.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

Incremental Methods for Simple Problems in Time Series: Algorithms and Experiments.
Proceedings of the Ninth International Database Engineering and Applications Symposium (IDEAS 2005), 2005

Introduction to Data Mining in Bioinformatics.
Proceedings of the Data Mining in Bioinformatics, 2005

AntiClustAl: Multiple Sequence Alignment by Antipole Clustering.
Proceedings of the Data Mining in Bioinformatics, 2005

Puzzling adventures - tales of strategy, logic, and mathematical skill.
W. W. Norton & Company, ISBN: 978-0-393-32663-5, 2005

2004
High Performance Discovery in Time Series - Techniques and Case Studies
Monographs in Computer Science, Springer, ISBN: 978-1-4757-4046-2, 2004

Scheduling Overloaded Real-Time Systems with Competitive/Worst Case Guarantees.
Proceedings of the Handbook of Scheduling - Algorithms, Models, and Performance Analysis, 2004

Editorial.
Inf. Syst., 2004

Fast Algorithms for Time Series with applications to Finance, Physics, Music, Biology, and other Suspects.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2004

Secure Untrusted Data Repository (SUNDR).
Proceedings of the 6th Symposium on Operating System Design and Implementation (OSDI 2004), 2004

Unordered Tree Mining with Applications to Phylogeny.
Proceedings of the 20th International Conference on Data Engineering, 2004

Dr. Ecco - mathematical detective.
Dover Publications, ISBN: 978-0-486-43552-7, 2004

2003
Fast Clustering and Minimum Weight Matching Algorithms for Very Large Mobile Backbone Wireless Networks.
Int. J. Found. Comput. Sci., 2003

The Virtues and Challenges of Ad Hoc + Streams Querying in Finance.
IEEE Data Eng. Bull., 2003

AQuery: Query Language for Ordered Data, Optimization Techniques, and Experiments.
Proceedings of 29th International Conference on Very Large Data Bases, 2003

TreeRank: A Similarity Measure for Nearest Neighbor Searching in Phylogenetic Databases.
Proceedings of the 15th International Conference on Scientific and Statistical Database Management (SSDBM 2003), 2003

Query by Humming - in Action with its Technology Revealed.
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, 2003

Warping Indexes with Envelope Transforms for Query by Humming.
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, 2003

Activist Data Mining for Computational Science: Tools and Applications.
Proceedings of the XVIII Simpósio Brasileiro de Bancos de Dados, 2003

Database Tuning: Principles, Experiments, and Guidance.
Proceedings of the XVIII Simpósio Brasileiro de Bancos de Dados, 2003

Efficient elastic burst detection in data streams.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

ANTICLUSTAL: Multiple Sequence Alignment by Antipole Clustering and Linear Approximate 1-Median Computation.
Proceedings of the 2nd IEEE Computer Society Bioinformatics Conference, 2003

2002
Finding Patterns in Three-Dimensional Graphs: Algorithms and Applications to Scientific Data Mining.
IEEE Trans. Knowl. Data Eng., 2002

Finding approximate patterns in undirected acyclic graphs.
Pattern Recognit., 2002

StatStream: Statistical Monitoring of Thousands of Data Streams in Real Time.
Proceedings of 28th International Conference on Very Large Data Bases, 2002

Database Tuning: Principles, Experiments, and Troubleshooting Techniques.
Proceedings of 28th International Conference on Very Large Data Bases, 2002

ATreeGrep: Approximate Searching in Unordered Trees.
Proceedings of the 14th International Conference on Scientific and Statistical Database Management, 2002

A Structure-Based Search Engine for Phylogenetic Databases.
Proceedings of the 14th International Conference on Scientific and Statistical Database Management, 2002

Database tuning: principles, experiments, and troubleshooting techniques (part I).
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

Database tuning: principles, experiments, and troubleshooting techniques (part II).
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

Algorithmics and Applications of Tree and Graph Searching.
Proceedings of the Twenty-first ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, 2002

Building secure file systems out of Byzantine storage.
Proceedings of the Twenty-First Annual ACM Symposium on Principles of Distributed Computing, 2002

GraphGrep: A Fast and Universal Method for Querying Graphs.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Database Tuning - Principles, Experiments, and Troubleshooting Techniques.
Elsevier, ISBN: 978-1-55860-753-8, 2002

2001
DNA sequence classification via an expectation maximization algorithm and neural networks: a case study.
IEEE Trans. Syst. Man Cybern. Part C, 2001

Efficient data reconciliation.
Inf. Sci., 2001

New techniques for extracting features from protein sequences.
IBM Syst. J., 2001

WebFilter: A High-throughput XML-based Publish and Subscribe System.
Proceedings of the VLDB 2001, 2001

Declarative Data Cleaning: Language, Model, and Algorithms.
Proceedings of the VLDB 2001, 2001

Lots o' Ticks: Real-Time High Performance Time Series Queries on Billions of Trades and Quotes.
Proceedings of the 2001 ACM SIGMOD international conference on Management of data, 2001

Filtering Algorithms and Implementation for Very Fast Publish/Subscribe.
Proceedings of the 2001 ACM SIGMOD international conference on Management of data, 2001

Don't Trust your File Server.
Proceedings of HotOS-VIII: 8th Workshop on Hot Topics in Operating Systems, 2001

Improving Data Cleaning Quality Using a Data Lineage Facility.
Proceedings of the 3rd Intl. Workshop on Design and Management of Data Warehouses, 2001

2000
An Index Structure for Data Mining and Clustering.
Knowl. Inf. Syst., 2000

Message from the Editors-in-Chief.
Inf. Syst., 2000

Publish/Subscribe on the Web at Extreme Speed.
Proceedings of the VLDB 2000, 2000

An Approximate Search Engine for Structural Databases.
Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, 2000

AJAX: An Extensible Data Cleaning Tool
Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, 2000

Application of neural networks to biological data mining: a case study in protein sequence classification.
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000

An Extensible Framework for Data Cleaning.
Proceedings of the 16th International Conference on Data Engineering, San Diego, California, USA, February 28, 2000

Algorithms and Experience in Increasing the Intelligibility and Hygiene of Access Control in Large Organizations.
Proceedings of the Data and Application Security, 2000

Efficient Matching for Web-Based Publish/Subscribe Systems.
Proceedings of the Cooperative Information Systems, 7th International Conference, 2000

Declaratively Cleaning your Data with AJAX.
Proceedings of the 16èmes Journées Bases de Données Avancées, 2000

1999
FinTime - A Financial Time Series Benchmark.
SIGMOD Rec., 1999

New Techniques for DNA Sequence Classification.
J. Comput. Biol., 1999

Review - Efficient Locking for Concurrent Operations on B-Trees.
ACM SIGMOD Digit. Rev., 1999

Review - Ripple Joins for Online Aggregation.
ACM SIGMOD Digit. Rev., 1999

Review - On Random Sampling over Joins.
ACM SIGMOD Digit. Rev., 1999

Review - Join Synopses for Approximate Query Answering.
ACM SIGMOD Digit. Rev., 1999

Review - WALRUS: A Similarity Retrieval Algorithm for Image Databases.
ACM SIGMOD Digit. Rev., 1999

Tuning Time Series Queries in Finance: Case Studies and Recommendations.
IEEE Data Eng. Bull., 1999

Some Approaches to Index Design for Cube Forests.
IEEE Data Eng. Bull., 1999

Evaluating a Class of Distance-Mapping Algorithms for Data Mining and Clustering.
Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1999

Queryable Acyclic Production Systems.
Proceedings of the 1999 ACM CIKM International Conference on Information and Knowledge Management, 1999

Pattern Discovery and Classification in Biosequences.
Proceedings of the Pattern Discovery in Biomolecular Data: Tools, 1999

A Framework for Biological Pattern Discovery on Networks of Workstations.
Proceedings of the Pattern Discovery in Biomolecular Data: Tools, 1999

1998
An Algorithm for Finding the Largest Approximately Common Substructures of Two Trees.
IEEE Trans. Pattern Anal. Mach. Intell., 1998

Free Parallel Data Mining.
Proceedings of the SIGMOD 1998, 1998

An Approximate Oracle for Distance in Metric Spaces.
Proceedings of the Combinatorial Pattern Matching, 9th Annual Symposium, 1998

Out of Their Minds: The Lives and Discoveries of 15 Great Computer Scientists.
Copernicus Books, an imprint of Springer-Verlag, ISBN: 0-387-98269-8, 1998

1997
Some Approaches to Index Design for Cube Forest.
IEEE Data Eng. Bull., 1997

Structural Matching and Discovery in Document Databases.
Proceedings of the SIGMOD 1997, 1997

Lessons from Wall Street: Case Studies in Configuration, Tuning, and Distribution (Tutorial).
Proceedings of the SIGMOD 1997, 1997

Automated Discovery of Active Motifs in Three Dimensional Molecules.
Proceedings of the Third International Conference on Knowledge Discovery and Data Mining (KDD-97), 1997

An Approach to Fault-Tolerant Parallel Processing on Intermittently Idle, Heterogeneous Workstations.
Proceedings of the Digest of Papers: FTCS-27, 1997

Tree pattern matching.
Proceedings of the Pattern Matching Algorithms, 1997

Tuning Database Design for High Performance.
Proceedings of the Computer Science and Engineering Handbook, 1997

1996
On the Editing Distance Between Undirected Acyclic Graphs.
Int. J. Found. Comput. Sci., 1996

Tuning Databases for High Performance.
ACM Comput. Surv., 1996

Thinksheet: A Tool for Tailoring Complex Documents.
Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, 1996

The Dangers of Replication and a Solution.
Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, 1996

Automated Discovery of Active Motifs in Multiple RNA Secondary Structures.
Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (KDD-96), 1996

1995
Transaction Chopping: Algorithms and Performance Studies
ACM Trans. Database Syst., 1995

D^over: An Optimal On-Line Scheduling Algorithm for Overloaded Uniprocessor Real-Time Systems.
SIAM J. Comput., 1995

Pattern Matching and Pattern Discovery in Scientific, Program, and Document Databases.
Proceedings of the 1995 ACM SIGMOD International Conference on Management of Data, 1995

An Approach To Handling Overloaded Systems That Allow Skips.
Proceedings of the 16th IEEE Real-Time Systems Symposium, 1995

On the Editing Distance between Undirected Acyclic Graphs and Related Problems.
Proceedings of the Combinatorial Pattern Matching, 6th Annual Symposium, 1995

1994
Exact and approximate algorithms for unordered tree matching.
IEEE Trans. Syst. Man Cybern., 1994

A System for Approximate Tree Matching.
IEEE Trans. Knowl. Data Eng., 1994

MOCA: A Multiprocessor On-Line Competitive Algorithm for Real-Time System Scheduling.
Theor. Comput. Sci., 1994

Approximate Tree Matching in the Presence of Variable Length Don't Cares.
J. Algorithms, 1994

The new Editorial Board of Information Systems.
Inf. Syst., 1994

Information Systems takes a new direction.
Inf. Syst., 1994

2Q: A Low Overhead High Performance Buffer Management Replacement Algorithm.
Proceedings of the VLDB'94, 1994

PLinda 2.0: A Transactional/Checkpointing Approach to Fault Tolerant Linda.
Proceedings of the 13th Symposium on Reliable Distributed Systems, 1994

Combinatorial Pattern Discovery for Scientific Data: Some Preliminary Results.
Proceedings of the 1994 ACM SIGMOD International Conference on Management of Data, 1994

1993
The Performance of Current B-Tree Algorithms.
ACM Trans. Database Syst., 1993

B-Trees with Inserts and Deletes: Why Free-at-Empty Is Better Than Merge-at-Half.
J. Comput. Syst. Sci., 1993

MOCA: A multiprocessor on-line competitive algorithm for real-time system scheduling.
Proceedings of the Real-Time Systems Symposium. Raleigh-Durham, NC, USA, December 1993, 1993

1992
On the Competitiveness of On-Line Real-Time Task Scheduling.
Real Time Syst., 1992

On the Editing Distance Between Unordered Labeled Trees.
Inf. Process. Lett., 1992

The Many Faces of Consensus in Distributed Systems.
Computer, 1992

Database Tuning.
Proceedings of the 18th International Conference on Very Large Data Bases, 1992

Simple Rational Guidance for Chopping Up Transactions.
Proceedings of the 1992 ACM SIGMOD International Conference on Management of Data, 1992

D^over: an optimal on-line scheduling algorithm for overloaded real-time systems.
Proceedings of the Real-Time Systems Symposium, 1992

Locking without Blocking: Making Lock Based Concurrent Data Structure Algorithms Nonblocking.
Proceedings of the Eleventh ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, 1992

Pattern Matching in Unordered Trees.
Proceedings of the Fourth International Conference on Tools with Artificial Intelligence, 1992

Fast Serial and Parallel Algorithms for Approximate Tree Matching with VLDC's.
Proceedings of the Combinatorial Pattern Matching, Third Annual Symposium, 1992

Database Tuning - A Principled Approach
Prentice-Hall, ISBN: 0-13-205246-6, 1992

1991
Optimizing Equijoin Queries In Distributed Databases Where Relations Are Hash Partitioned.
ACM Trans. Database Syst., 1991

Information Search with Dynamic Text vs Paper Text: An Empirical Comparison.
Int. J. Man Mach. Stud., 1991

A Framework for Automating Physical Database Design.
Proceedings of the 17th International Conference on Very Large Data Bases, 1991

A tool for tree pattern matching.
Proceedings of the Third International Conference on Tools for Artificial Intelligence, 1991

Object Versioning in Ode.
Proceedings of the Seventh International Conference on Data Engineering, 1991

Persistant Linda: Linda + Transactions + Query Processing.
Proceedings of the Research Directions in High-Level Parallel Programming Languages, 1991

On-line Scheduling in the Presence of Overload
Proceedings of the 32nd Annual Symposium on Foundations of Computer Science, 1991

Rationale and Design of BULK.
Proceedings of the Database Programming Languages: Bulk Types and Persistent Data. 3rd International Workshop, 1991

Promises versus assumptions in database fault tolerance.
Proceedings of the VIIèmes Journées Bases de Données Avancées, 1991

1990
New Techniques for Best-Match Retrieval.
ACM Trans. Inf. Syst., 1990

Performance and Architectural Issues for String Matching.
IEEE Trans. Computers, 1990

Fast Algorithms for the Unit Cost Editing Distance Between Trees.
J. Algorithms, 1990

Query Processing for Distance Metrics.
Proceedings of the 16th International Conference on Very Large Data Bases, 1990

A Framework for the Performance Analysis of Concurrent B-tree Algorithms.
Proceedings of the Ninth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, 1990

1989
Simple Fast Algorithms for the Editing Distance Between Trees and Related Problems.
SIAM J. Comput., 1989

Using a Relational System On Wall Street: The Good, The Bad, The Ugly, And The Ideal.
Commun. ACM, 1989

Fast Parallel Algorithms for the Unit Cost Editing Distance Between Trees.
Proceedings of the ACM Symposium on Parallel Algorithms and Architectures, 1989

Utilization of B-trees with Inserts, Deletes and Modifies.
Proceedings of the Eighth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, 1989

1988
Efficient and Correct Execution of Parallel Programs that Share Memory.
ACM Trans. Program. Lang. Syst., 1988

Concurrent Search Structure Algorithms.
ACM Trans. Database Syst., 1988

Concurrent Set Manipulation without Locking.
Proceedings of the Seventh ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, 1988

1987
Fast Parallel Algorithms for Processing of Joins.
Proceedings of the Supercomputing, 1987

1986
Distributed Office By Example (D-OBE).
Proceedings of the Second International Conference on Data Engineering, 1986

A Symmetric Concurrent B-Tree Algorithm.
Proceedings of the Fall Joint Computer Conference, November 2-6, 1986, Dallas, Texas, USA, 1986

When Does Non-Linear Text Help?
Proceedings of the Expert Database Systems, 1986

1985
What Good are Concurrent Search Structure Algorithms for databases Anyway?
IEEE Database Eng. Bull., 1985

NetBook - a Data Model to Support Knowledge Exploration.
Proceedings of the VLDB'85, 1985

Semantically-based Concurrency Control for Search Structures.
Proceedings of the Fourth ACM SIGACT-SIGMOD Symposium on Principles of Database Systems, 1985

1984
Temporal Verification of Carrier-Sense Local Area Network Protocols.
Proceedings of the Conference Record of the Eleventh Annual ACM Symposium on Principles of Programming Languages, 1984

