Jiaheng Lu

Orcid: 0000-0003-2067-454X

Affiliations:
  • University of Helsinki, Finland
  • Renmin University of China, School of Information / MOE Key Laboratory of Data Engineering and Knowledge Engineering, China (former)
  • University of California at Irvine, Department of Computer Science, CA, USA (former)
  • National University of Singapore, Singapore (PhD 2007)
  • Shanghai Jiaotong University, China (former)


According to our database1, Jiaheng Lu authored at least 123 papers between 1999 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Multi-model query languages: taming the variety of big data.
Distributed Parallel Databases, March, 2024

SimCost: cost-effective resource provision prediction and recommendation for spark workloads.
Distributed Parallel Databases, March, 2024

MLM-WR: A Swarm-Intelligence-Based Cloud-Edge-Terminal Collaboration Data Collection Scheme in the Era of AIoT.
IEEE Internet Things J., January, 2024

2023
Performance models of data parallel DAG workflows for large scale data analytics.
Distributed Parallel Databases, September, 2023

Join Order Selection with Deep Reinforcement Learning: Fundamentals, Techniques, and Challenges.
Proc. VLDB Endow., 2023

A Survey on Mapping Semi-Structured Data and Graph Data to Relational Data.
ACM Comput. Surv., 2023

Quantum Computing for Databases: A Short Survey and Vision.
Proceedings of the Joint Proceedings of Workshops at the 49th International Conference on Very Large Data Bases (VLDB 2023), Vancouver, Canada, August 28, 2023

Preface QDSM.
Proceedings of the Joint Proceedings of Workshops at the 49th International Conference on Very Large Data Bases (VLDB 2023), Vancouver, Canada, August 28, 2023

Quantum Machine Learning: Foundation, New Techniques, and Opportunities for Database Research.
Proceedings of the Companion of the 2023 International Conference on Management of Data, 2023

Quantum Annealing Method for Dynamic Virtual Machine and Task Allocation in Cloud Infrastructures from Sustainability Perspective.
Proceedings of the 39th IEEE International Conference on Data Engineering, ICDE 2023, 2023

2022
$d$d-Simplexed: Adaptive Delaunay Triangulation for Performance Modeling and Prediction on Big Data Analytics.
IEEE Trans. Big Data, 2022

Cross-Model Conjunctive Queries over Relation and Tree-structured Data (Extended).
CoRR, 2022

Self-Adapting Design and Maintenance of Multi-Model Databases.
Proceedings of the IDEAS'22: International Database Engineered Applications Symposium, Budapest, Hungary, August 22, 2022

Automatic Performance Tuning for Distributed Data Stream Processing Systems.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Effective Generation of Relational Schema from Multi-Model Data with Reinforcement Learning.
Proceedings of the Conceptual Modeling - 41st International Conference, 2022

Cross-Model Conjunctive Queries over Relation and Tree-Structured Data.
Proceedings of the Database Systems for Advanced Applications, 2022

2021
MultiCategory: Multi-model Query Processing Meets Category Theory and Functional Programming.
Proc. VLDB Endow., 2021

Holistic evaluation in multi-model databases benchmarking.
Distributed Parallel Databases, 2021

A Survey on Automatic Parameter Tuning for Big Data Processing Systems.
ACM Comput. Surv., 2021

Multi-model Query Processing Meets Category Theory and Functional Programming.
Proceedings of the 2nd Workshop on Search, 2021

Automatic View Selection in Graph Databases.
Proceedings of the SSDBM 2021: 33rd International Conference on Scientific and Statistical Database Management, 2021

Workload-Aware Performance Tuning for Autonomous DBMSs.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

MORTAL: A Tool of Automatically Designing Relational Storage Schemas for Multi-model Data through Reinforcement Learning.
Proceedings of the ER Demos and Posters 2021 co-located with 40th International Conference on Conceptual Modeling (ER 2021), 2021

A Formal Category Theoretical Framework for Multi-model Data Transformations.
Proceedings of the Heterogeneous Data Management, Polystores, and Analytics for Healthcare, 2021

Quantum-Inspired Keyword Search on Multi-model Databases.
Proceedings of the Database Systems for Advanced Applications, 2021

Storing Multi-model Data in RDBMSs based on Reinforcement Learning.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

2020
Multiple Set Matching with Bloom Matrix and Bloom Vector.
ACM Trans. Knowl. Discov. Data, 2020

Neural Conversation Generation with Auxiliary Emotional Supervised Models.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2020

One size does not fit all: accelerating OLAP workloads with GPUs.
Distributed Parallel Databases, 2020

Selectivity Estimation for Relation-Tree Joins.
Proceedings of the SSDBM 2020: 32nd International Conference on Scientific and Statistical Database Management, 2020

Multi-Model Data Query Languages and Processing Paradigms.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

2019
Fusion OLAP: Fusing the Pros of MOLAP and ROLAP Together for In-Memory OLAP.
IEEE Trans. Knowl. Data Eng., 2019

PivotE: Revealing and Visualizing the Underlying Entity Structures for Exploration.
Proc. VLDB Endow., 2019

Towards a Unified Framework for String Similarity Joins.
Proc. VLDB Endow., 2019

Speedup Your Analytics: Automatic Parameter Tuning for Databases and Big Data Systems.
Proc. VLDB Endow., 2019

Main-memory foreign key joins on advanced processors: design and re-evaluations for OLAP workloads.
Distributed Parallel Databases, 2019

Multi-model Databases: A New Journey to Handle the Variety of Data.
ACM Comput. Surv., 2019

Multiple Set Matching and Pre-Filtering with Bloom Multifilters.
CoRR, 2019

Fusion OLAP: Fusing the Pros of MOLAP and ROLAP Together for In-memory OLAP (Extended Abstract).
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Unified Management of Multi-model Data - (Vision Paper).
Proceedings of the Conceptual Modeling - 38th International Conference, 2019

Synergy of Database Techniques and Machine Learning Models for String Similarity Search and Join.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Cost-effective Resource Provisioning for Spark Workloads.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
Optimal algorithms for selecting top-k combinations of attributes: theory and applications.
VLDB J., 2018

Hierarchical Clustering of Complex Symbolic Data and Application for Emitter Identification.
J. Comput. Sci. Technol., 2018

Multi-model Database Management Systems - A Look Forward.
Proceedings of the Heterogeneous Data Management, Polystores, and Analytics for Healthcare, 2018

UniBench: A Benchmark for Multi-model Database Management Systems.
Proceedings of the Performance Evaluation and Benchmarking for the Era of Artificial Intelligence, 2018

UDBMS: Road to Unification for Multi-model Data Management.
Proceedings of the Advances in Conceptual Modeling, 2018

Crowd-Type: A Crowdsourcing-Based Tool for Type Completion in Knowledge Bases.
Proceedings of the Advances in Conceptual Modeling, 2018

Efficient Taxonomic Similarity Joins with Adaptive Overlap Constraint.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Multi-model Databases and Tightly Integrated Polystores: Current Practices, Comparisons, and Open Challenges.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Using Crowdsourcing for Fine-Grained Entity Type Completion in Knowledge Bases.
Proceedings of the Web and Big Data - Second International Joint Conference, 2018

2017
ProbeSim: Scalable Single-Source and Top-k SimRank Computations on Dynamic Graphs.
Proc. VLDB Endow., 2017

Using hybrid algorithmic-crowdsourcing methods for academic knowledge acquisition.
Clust. Comput., 2017

Location-sensitive Query Auto-completion.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Distributed Algorithms on Exact Personalized PageRank.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

Multi-model Data Management: What's New and What's Next?
Proceedings of the 20th International Conference on Extending Database Technology, 2017

Top-k String Auto-Completion with Synonyms.
Proceedings of the Database Systems for Advanced Applications, 2017

Towards Benchmarking Multi-Model Databases.
Proceedings of the 8th Biennial Conference on Innovative Data Systems Research, 2017

2016
Incremental Hierarchical Clustering of Stochastic Pattern-Based Symbolic Data.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2016

PANDA: A platform for academic knowledge discovery and acquisition.
Proceedings of the 2016 International Conference on Big Data and Smart Computing, 2016

2015
Boosting the Quality of Approximate String Matching by Synonyms.
ACM Trans. Database Syst., 2015

Best Keyword Cover Search.
IEEE Trans. Knowl. Data Eng., 2015

Towards Maximum Independent Sets on Massive Graphs.
Proc. VLDB Endow., 2015

Incremental Distributed Weighted Class Discriminant Analysis on Interval-Valued Emitter Parameters.
Proceedings of the Knowledge Science, Engineering and Management, 2015

PandaSearch: A fine-grained academic search engine for research documents.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

Efficient algorithms for answering reverse spatial-keyword nearest neighbor queries.
Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2015

Incremental Class Discriminant Analysis on Interval-Valued Emitter Signal Parameters.
Proceedings of the Database Systems for Advanced Applications, 2015

2014
Efficient Algorithms and Cost Models for Reverse Spatial-Keyword <i>k</i>-Nearest Neighbor Search.
ACM Trans. Database Syst., 2014

MRTuner: A Toolkit to Enable Holistic Optimization for MapReduce Jobs.
Proc. VLDB Endow., 2014

A Study of SQL-on-Hadoop Systems.
Proceedings of the Big Data Benchmarks, Performance Optimization, and Emerging Hardware, 2014

A Skylining Approach to Optimize Influence and Cost in Location Selection.
Proceedings of the Database Systems for Advanced Applications, 2014

Object Semantics for XML Keyword Search.
Proceedings of the Database Systems for Advanced Applications, 2014

2013
Optimal and efficient generalized twig pattern processing: a combination of preorder and postorder filterings.
VLDB J., 2013

Big data challenge: a data management perspective.
Frontiers Comput. Sci., 2013

HmSearch: an efficient hamming distance query processing algorithm.
Proceedings of the Conference on Scientific and Statistical Database Management, 2013

String similarity measures and joins with synonyms.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

From Structure-Based to Semantics-Based: Towards Effective XML Keyword Search.
Proceedings of the Conceptual Modeling - 32th International Conference, 2013

2012
Information Extraction From Microblogs: A Survey.
Int. J. Softw. Informatics, 2012

Effective Keyword Search with Synonym Rules over XML Document.
Proceedings of the Web-Age Information Management, 2012

Optimal top-k generation of attribute combinations based on ranked lists.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

New assessment criteria for query suggestion.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

LotusX: A Position-Aware XML Graphical Search System with Auto-Completion.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Processing XML Twig Pattern Query with Wildcards.
Proceedings of the Database and Expert Systems Applications, 2012

Energy Efficiency for MapReduce Workloads: An In-depth Study.
Proceedings of the Twenty-Third Australasian Database Conference, 2012

2011
Clustering Web video search results based on integration of multiple features.
World Wide Web, 2011

Extended XML Tree Pattern Matching: Theories and Algorithms.
IEEE Trans. Knowl. Data Eng., 2011

A MovingObject Index for Efficient Query Processing with Peer-Wise Location Privacy.
Proc. VLDB Endow., 2011

Improving performance by creating a native join-index for OLAP.
Frontiers Comput. Sci. China, 2011

Indexing and querying XML using extended Dewey labeling scheme.
Data Knowl. Eng., 2011

XML Query Processing Using Views.
Proceedings of the Web-Age Information Management, 2011

Synthesizing Routes for Low Sampling Trajectories with Absorbing Markov Chains.
Proceedings of the Web-Age Information Management - 12th International Conference, 2011

Reverse spatial and textual k nearest neighbor search.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

A Survey on XML Keyword Search.
Proceedings of the Web Technologies and Applications - 13th Asia-Pacific Web Conference, 2011

Preface of the 2nd International Workshop on XML Data Management.
Proceedings of the Web Technologies and Applications - 13th Asia-Pacific Web Conference, 2011

2010
Towards an Effective XML Keyword Search.
IEEE Trans. Knowl. Data Eng., 2010

Report on the first international workshop on cloud data management (CloudDB 2009).
SIGMOD Rec., 2010

Efficient evaluation of query rewriting plan over materialized XML view.
J. Syst. Softw., 2010

Bucket-based authentication for outsourced databases.
Concurr. Comput. Pract. Exp., 2010

FlexTable: Using a Dynamic Relation Model to Store RDF Data.
Proceedings of the Database Systems for Advanced Applications, 2010

Benchmarking Holistic Approaches to XML Tree Pattern Query Processing - (Extended Abstract of Invited Talk).
Proceedings of the Database Systems for Advanced Applications, 2010

An Effective Object-Level XML Keyword Search.
Proceedings of the Database Systems for Advanced Applications, 2010

Report on the second international workshop on cloud data management (CloudDB 2010).
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

XReal: an interactive XML keyword searching.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2009
XML keyword query refinement.
Proceedings of the First International Workshop on Keyword Search on Structured Data, 2009

Efficient Algorithm for Computing Link-Based Similarity in Real World Networks.
Proceedings of the ICDM 2009, 2009

Space-Constrained Gram-Based Indexing for Efficient Approximate String Search.
Proceedings of the 25th International Conference on Data Engineering, 2009

Effective XML Keyword Search with Relevance Oriented Ranking.
Proceedings of the 25th International Conference on Data Engineering, 2009

Demonstrating Effective Ranked XML Keyword Search with Meaningful Result Display.
Proceedings of the Database Systems for Advanced Applications, 2009

An efficient multi-dimensional index for cloud data management.
Proceedings of the First International CIKM Workshop on Cloud Data Management, 2009

Efficient algorithms for approximate member extraction using signature-based inverted lists.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
Efficient Merging and Filtering Algorithms for Approximate String Searches.
Proceedings of the 24th International Conference on Data Engineering, 2008

Exploiting ID References for Effective Keyword Search in XML Documents.
Proceedings of the Database Systems for Advanced Applications, 2008

SemanticTwig: A Semantic Approach to Optimize XML Query Processing.
Proceedings of the Database Systems for Advanced Applications, 2008

2006
TwigStackList-: A Holistic Twig Join Algorithm for Twig Query with Not-Predicates on XML Data.
Proceedings of the Database Systems for Advanced Applications, 2006

Effective Keyword Search in XML Documents Based on MIU.
Proceedings of the Database Systems for Advanced Applications, 2006

2005
TJFast: effective processing of XML twig pattern matching.
Proceedings of the 14th international conference on World Wide Web, 2005

From Region Encoding To Extended Dewey: On Efficient Processing of XML Twig Pattern Matching.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

On Boosting Holism in XML Twig Pattern Matching using Structural Indexing Techniques.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2005

Efficient Processing of Ordered XML Twig Pattern.
Proceedings of the Database and Expert Systems Applications, 16th International Conference, 2005

On reducing redundancy and improving efficiency of XML labeling schemes.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

2004
Efficient processing of XML twig patterns with parent child edges: a look-ahead approach.
Proceedings of the 2004 ACM CIKM International Conference on Information and Knowledge Management, 2004

Labeling and Querying Dynamic XML Trees.
Proceedings of the Advanced Web Technologies and Applications, 2004

1999
A Equivalent Object-Oriented Schema Evolution Approach Using the Path-Independence Language.
Proceedings of the TOOLS 1999: 31st International Conference on Technology of Object-Oriented Languages and Systems, 1999


  Loading...