Matthias Boehm

Orcid: 0000-0003-1344-3663

Affiliations:
  • TU Berlin, Germany
  • Graz University of Technology, Austria (former)
  • IBM Research, Almaden (former)
  • Dresden University of Technology (former)


According to our database, Matthias Boehm authored at least 74 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
POLAR: Adaptive and Non-invasive Join Order Selection via Plans of Least Resistance.
Proc. VLDB Endow., February, 2024

2023
SAGA: A Scalable Framework for Optimizing Data Cleaning Pipelines for Machine Learning Applications.
Proc. ACM Manag. Data, September, 2023

Front Matter.
Proc. VLDB Endow., 2023

GIO: Generating Efficient Matrix and Frame Readers for Custom Data Formats by Example.
Proc. ACM Manag. Data, 2023

AWARE: Workload-aware, Redundancy-exploiting Linear Algebra.
Proc. ACM Manag. Data, 2023

Optimizing Tensor Computations: From Applications to Compilation and Runtime Techniques.
Proceedings of the Companion of the 2023 International Conference on Management of Data, 2023

Seventh Workshop on Data Management for End-to-End Machine Learning (DEEM).
Proceedings of the Companion of the 2023 International Conference on Management of Data, 2023

A Tutorial Workshop on ML for Systems and Systems for ML.
Proceedings of the Datenbanksysteme für Business, 2023

Enabling Integrated Data Analysis Pipelines on Heterogeneous Hardware through Holistic Extensibility.
Proceedings of the Datenbanksysteme für Business, 2023

2022
UPLIFT: Parallelization Strategies for Feature Transformations in Machine Learning Workloads.
Proc. VLDB Endow., 2022

DEEM'22: Data Management for End-to-End Machine Learning.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Federated Data Preparation, Learning, and Debugging in Apache SystemDS.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022


2021
SliceLine: Fast, Linear-Algebra-based Slice Finding for ML Model Debugging.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

LIMA: Fine-grained Lineage Tracing and Reuse in Machine Learning Systems.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021


2020
Technical Perspective: Declarative Recursive Computation on an RDBMS.
SIGMOD Rec., 2020

Learning Explainable Linguistic Expressions with Neural Inductive Logic Programming for Sentence Classification.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

SystemDS: A Declarative Machine Learning System for the End-to-End Data Science Lifecycle.
Proceedings of the 10th Conference on Innovative Data Systems Research, 2020

2019
Data Management in Machine Learning Systems.
Synthesis Lectures on Data Management, Morgan & Claypool Publishers, ISBN: 978-3-031-01869-5, 2019

Apache SystemML.
Encyclopedia of Big Data Technologies, 2019

SystemDS: A Declarative Machine Learning System for the End-to-End Data Science Lifecycle.
CoRR, 2019

Compressed linear algebra for declarative large-scale machine learning.
Commun. ACM, 2019

MNC: Structure-Exploiting Sparsity Estimation for Matrix Expressions.
Proceedings of the 2019 International Conference on Management of Data, 2019

Efficient Data-Parallel Cumulative Aggregates for Large-Scale Machine Learning.
Proceedings of the Datenbanksysteme für Business, 2019

2018
On Optimizing Operator Fusion Plans for Large-Scale Machine Learning in SystemML.
Proc. VLDB Endow., 2018

Deep Learning with Apache SystemML.
CoRR, 2018

On Optimizing Operator Fusion Plans for Large-Scale Machine Learning in SystemML.
CoRR, 2018

2017
Scaling Machine Learning via Compressed Linear Algebra.
SIGMOD Rec., 2017

Data Management in Machine Learning: Challenges, Techniques, and Systems.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

SPOOF: Sum-Product Optimization and Operator Fusion for Large-Scale Machine Learning.
Proceedings of the 8th Biennial Conference on Innovative Data Systems Research, 2017

2016
Compressed Linear Algebra for Large-Scale Machine Learning.
Proc. VLDB Endow., 2016

SystemML: Declarative Machine Learning on Spark.
Proc. VLDB Endow., 2016

Declarative Machine Learning - A Classification of Basic Properties and Types.
CoRR, 2016

2015
Costing Generated Runtime Execution Plans for Large-Scale Machine Learning Programs.
CoRR, 2015

Resource Elasticity for Large-Scale Machine Learning.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

On optimizing machine learning workloads via kernel fusion.
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

2014
Hybrid Parallelization Strategies for Large-Scale Machine Learning in SystemML.
Proc. VLDB Endow., 2014

On-demand re-optimization of integration flows.
Inf. Syst., 2014

SystemML's Optimizer: Plan Generation for Large-Scale Machine Learning Programs.
IEEE Data Eng. Bull., 2014

Large Scale Discriminative Metric Learning.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

2013
Towards Integrated Data Analytics: Time Series Forecasting in DBMS.
Datenbank-Spektrum, 2013

Compiling machine learning algorithms with SystemML.
Proceedings of the ACM Symposium on Cloud Computing, SOCC '13, 2013

2012
A high-throughput in-memory index, durable on flash-based SSD: insights into the winning solution of the SIGMOD programming contest 2011.
SIGMOD Rec., 2012

Optimizing Notifications of Subscription-Based Forecast Queries.
Proceedings of the Scientific and Statistical Database Management, 2012

Partitioning and Multi-core Parallelization of Multi-equation Forecast Models.
Proceedings of the Scientific and Statistical Database Management, 2012

Data management in the MIRABEL smart grid system.
Proceedings of the 2012 Joint EDBT/ICDT Workshops, Berlin, Germany, March 30, 2012

Efficient Integration of External Information into Forecast Models from the Energy Domain.
Proceedings of the Advances in Databases and Information Systems, 2012

2011
Cost-based optimization of integration flows.
PhD thesis, 2011

Resiliency-Aware Data Management.
Proc. VLDB Endow., 2011

Cost-based vectorization of instance-based integration processes.
Inf. Syst., 2011

Context-Aware Parameter Estimation for Forecast Models in the Energy Domain.
Proceedings of the Scientific and Statistical Database Management, 2011

Next Generation Database Programming and Execution Environment.
Proceedings of the Database Programming Languages, 2011

Offline Design Tuning for Hierarchies of Forecast Models.
Proceedings of the Datenbanksysteme für Business, 2011

Efficient In-Memory Indexing with Generalized Prefix Trees.
Proceedings of the Datenbanksysteme für Business, 2011

Forecasting Evolving Time Series of Energy Demand and Supply.
Proceedings of the Advances in Databases and Information Systems, 2011

2010
Indexing forecast models for matching and maintenance.
Proceedings of the Fourteenth International Database Engineering and Applications Symposium (IDEAS 2010), 2010

Multi-flow Optimization via Horizontal Message Queue Partitioning.
Proceedings of the Enterprise Information Systems - 12th International Conference, 2010

Multi-process Optimization Via Horizontal Message Queue Partitioning.
Proceedings of the ICEIS 2010 - Proceedings of the 12th International Conference on Enterprise Information Systems, Volume 1, DISI, Funchal, Madeira, Portugal, June 8, 2010

2009
Vectorizing Instance-Based Integration Processes.
Proceedings of the Enterprise Information Systems, 11th International Conference, 2009

Invisible Deployment of Integration Processes.
Proceedings of the Enterprise Information Systems, 11th International Conference, 2009

Anfragegetriebene Indizierung räumlicher Daten.
Proceedings of the 39. Jahrestagung der Gesellschaft für Informatik, Im Focus das Leben, INFORMATIK 2009, Lübeck, Germany, September 28, 2009

GCIP: exploiting the generation and optimization of integration processes.
Proceedings of the EDBT 2009, 2009

Systemübergreifende Kostennormalisierung für Integrationsprozesse.
Proceedings of the Datenbanksysteme in Business, 2009

2008
Model-Driven Development of Complex and Data-Intensive Integration Processes.
Proceedings of the Model-Based Software and Data Integration, 2008

Message Indexing for Document-Oriented Integration Processes.
Proceedings of the ICEIS 2008, 2008

Model-Driven Generation and Optimization of Complex Integration Processes.
Proceedings of the ICEIS 2008, 2008

DIPBench: An independent benchmark for Data-Intensive Integration Processes.
Proceedings of the 24th International Conference on Data Engineering Workshops, 2008

DIPBench Toolsuite: A Framework for Benchmarking Integration Systems.
Proceedings of the 24th International Conference on Data Engineering, 2008

Workload-based optimization of integration processes.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

Improving Data Independence, Efficiency and Functional Flexibility of Integration Platforms.
Proceedings of the Forum at the CAiSE'08 Conference, Montpellier, France, June 18-20, 2008

An Advanced Transaction Model for Recovery Processing of Integration Processes.
Proceedings of the Advances in Databases and Information Systems, 2008

2007
Ein Nachrichtentransformationsmodell für komplexe Transformationsprozesse in datenzentrischen Anwendungsszenarien.
Proceedings of the Datenbanksysteme in Business, 2007

Towards Self-Optimization of Message Transformation Processes.
Proceedings of the Communications of the Eleventh East-European Conference on Advances in Databases and Information Systems, Varna, Bulgaria, September 29, 2007

