Ju Fan

ORCID: 0000-0003-4729-9903

According to our database, Ju Fan authored at least 85 papers between 2008 and 2024.

Bibliography

2024
MisDetect: Iterative Mislabel Detection using Early Loss.
Proc. VLDB Endow., February, 2024

Tabular data synthesis with generative adversarial networks: design space and optimizations.
VLDB J., 2024

CodeS: Towards Building Open-source Language Models for Text-to-SQL.
CoRR, 2024

VerifAI: Verified Generative AI.
Proceedings of the 14th Conference on Innovative Data Systems Research, 2024

DINGO: Towards Diverse and Fine-Grained Instruction-Following Evaluation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Front Matter.
Proc. VLDB Endow., 2023

Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration.
Proc. ACM Manag. Data, 2023

Few-shot Text-to-SQL Translation using Structure and Content Prompt Learning.
Proc. ACM Manag. Data, 2023

HAIPipe: Combining Human-generated and Machine-generated Pipelines for Data Preparation.
Proc. ACM Manag. Data, 2023

GoodCore: Data-effective and Data-efficient Machine Learning through Coreset Selection over Incomplete Data.
Proc. ACM Manag. Data, 2023

Cost-Effective In-Context Learning for Entity Resolution: A Design Space Exploration.
CoRR, 2023

SEED: Simple, Efficient, and Effective Data Management via Large Language Models.
CoRR, 2023

VerifAI: Verified Generative AI.
CoRR, 2023

Interleaving Pre-Trained Language Models and Large Language Models for Zero-Shot NL2SQL Generation.
CoRR, 2023

ChatPipe: Orchestrating Data Preparation Program by Optimizing Human-ChatGPT Interactions.
CoRR, 2023

Pay "Attention" to Chart Images for What You Read on Text.
Proceedings of the Companion of the 2023 International Conference on Management of Data, 2023

Demystifying Artificial Intelligence for Data Preparation.
Proceedings of the Companion of the 2023 International Conference on Management of Data, 2023

Symphony: Towards Natural Language Query Answering over Multi-modal Data Lakes.
Proceedings of the 13th Conference on Innovative Data Systems Research, 2023

2022
DADER: Hands-Off Entity Resolution with Domain Adaptation.
Proc. VLDB Endow., 2022

Interpretable MOOC recommendation: a multi-attention network for personalized learning behavior analysis.
Internet Res., 2022

Preface to Special Issue on New Technologies of Database Systems.
Int. J. Softw. Informatics, 2022

Contextual Expressive Text-to-Speech.
CoRR, 2022

Domain Adaptation for Deep Entity Resolution.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

OpenTFV: An Open Domain Table-Based Fact Verification System.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Semantics Driven Embedding Learning for Effective Entity Alignment.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Local Clustering over Labeled Graphs: An Index-Free Approach.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

PASTA: Table-Operations Aware Fact Verification via Sentence-Table Cloze Pre-training.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

A Two-Phase Approach for Recognizing Tables with Complex Structures.
Proceedings of the Database Systems for Advanced Applications, 2022

2021
A survey of typical attributed graph queries.
World Wide Web, 2021

CrowdChart: Crowdsourced Data Extraction From Visualization Charts.
IEEE Trans. Knowl. Data Eng., 2021

RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation.
Proc. VLDB Endow., 2021

Adaptive Data Augmentation for Supervised Learning over Missing Data.
Proc. VLDB Endow., 2021

Preface.
J. Comput. Sci. Technol., 2021

TFV: A Framework for Table-Based Fact Verification.
IEEE Data Eng. Bull., 2021

A Human-in-the-loop Approach to Social Behavioral Targeting.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

2020
A game-based framework for crowdsourced data labeling.
VLDB J., 2020

Relational Data Synthesis using Generative Adversarial Networks: A Design Space Exploration.
Proc. VLDB Endow., 2020

Relational Pretrained Transformers towards Democratizing Data Preparation [Vision].
CoRR, 2020

BiANE: Bipartite Attributed Network Embedding.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Crowdsourcing-based Data Extraction from Visualization Charts.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Social Influence Does Matter: User Action Prediction for In-Feed Advertising.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Distribution-Aware Crowdsourced Entity Collection.
IEEE Trans. Knowl. Data Eng., 2019

CrowdGame: A Game-Based Crowdsourcing System for Cost-Effective Data Labeling.
Proceedings of the 2019 International Conference on Management of Data, 2019

Maximizing Multifaceted Network Influence.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Crowdsourcing Database Systems: Overview and Challenges.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

2018
Crowd Database Systems.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Location-aware Influence Maximization over Dynamic Social Streams.
ACM Trans. Inf. Syst., 2018

Influence Maximization on Social Graphs: A Survey.
IEEE Trans. Knowl. Data Eng., 2018

Trajectory Simplification: An Experimental Study and Quality Analysis.
Proc. VLDB Endow., 2018

Cost-Effective Data Annotation using Game-Based Crowdsourcing.
Proc. VLDB Endow., 2018

CDB: A Crowd-Powered Database System.
Proc. VLDB Endow., 2018

Human-in-the-loop Rule Learning for Data Integration.
IEEE Data Eng. Bull., 2018

Crowd-Powered Data Mining.
CoRR, 2018

Influential User Subscription on Time-Decaying Social Streams.
CoRR, 2018

Fine-grained Concept Linking using Neural Networks in Healthcare.
Proceedings of the 2018 International Conference on Management of Data, 2018

OCTOPUS: An Online Topic-Aware Influence Analysis System for Social Networks.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Incentive-Based Entity Collection Using Crowdsourcing.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Mining Rules with Constants from Large Scale Knowledge Bases.
Proceedings of the Conceptual Modeling - 37th International Conference, 2018

Crowd-Type: A Crowdsourcing-Based Tool for Type Completion in Knowledge Bases.
Proceedings of the Advances in Conceptual Modeling, 2018

Using Crowdsourcing for Fine-Grained Entity Type Completion in Knowledge Bases.
Proceedings of the Web and Big Data - Second International Joint Conference, 2018

2017
Processing Long Queries Against Short Text: Top-k Advertisement Matching in News Stream Applications.
ACM Trans. Inf. Syst., 2017

Using hybrid algorithmic-crowdsourcing methods for academic knowledge acquisition.
Clust. Comput., 2017

Crowdsourced Data Management: Overview and Challenges.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

Discovering Your Selling Points: Personalized Social Influential Tags Exploration.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

CDB: Optimizing Queries with Crowd-Based Selections and Joins.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

2016
Linking News and Tweets.
Proceedings of the Databases Theory and Applications, 2016

2015
Competence-Based Song Recommendation: Matching Songs to One's Singing Skill.
IEEE Trans. Multim., 2015

CrowdOp: Query Optimization for Declarative Crowdsourcing Systems.
IEEE Trans. Knowl. Data Eng., 2015

Online Topic-Aware Influence Maximization.
Proc. VLDB Endow., 2015

iCrowd: An Adaptive Crowdsourcing Framework.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

2014
GEMINI: An Integrative Healthcare Analytics System.
Proc. VLDB Endow., 2014

Song Recommendation for Social Singing Community.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

A hybrid machine-crowdsourcing system for matching web tables.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

2013
A User-Friendly Patent Search Paradigm.
IEEE Trans. Knowl. Data Eng., 2013

TsingNUS: a location-based service system towards live city.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Crowdsourcing-Assisted Query Structure Interpretation.
Proceedings of the IJCAI 2013, 2013

2012
SEAL: Spatio-Textual Similarity Search.
Proc. VLDB Endow., 2012

Location-aware instant search.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Interactive SQL query suggestion: Making databases user-friendly.
Proceedings of the 27th International Conference on Data Engineering, 2011

An Effective Approach for Searching Closest Sentence Translations from the Web.
Proceedings of the Database Systems for Advanced Applications, 2011

DBease: Making Databases User-Friendly and Easily Accessible.
Proceedings of the Fifth Biennial Conference on Innovative Data Systems Research, 2011

Measuring Similarity of Chinese Web Databases Based on Category Hierarchy.
Proceedings of the Web Technologies and Applications - 13th Asia-Pacific Web Conference, 2011

2010
Personalizing Web Page Recommendation via Collaborative Filtering and Topic-Aware Markov Model.
Proceedings of the ICDM 2010, 2010

Suggesting Topic-Based Query Terms as You Type.
Proceedings of the Advances in Web Technologies and Applications, 2010

2008
SESQ: A Model-Driven Method for Building Object Level Vertical Search Engines.
Proceedings of the Conceptual Modeling, 2008

