Yuren Mao

Orcid: 0000-0003-0550-3072

According to our database1, Yuren Mao authored at least 42 papers between 2020 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
LASER: A Data-Centric Method for Low-Cost and Efficient SQL Rewriting based on SQL-GRPO.
CoRR, April, 2026

OpenHospital: A Thing-in-itself Arena for Evolving and Benchmarking LLM-based Collective Intelligence.
CoRR, March, 2026

Unbiased Multimodal Reranking for Long-Tail Short-Video Search.
Proceedings of the ACM Web Conference 2026, 2026

Can We Predict Before Executing Machine Learning Agents?
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
Agent-Kernel: A MicroKernel Multi-Agent System Framework for Adaptive Social Simulation Powered by LLMs.
CoRR, December, 2025

SQL-Factory: A Multi-Agent Framework for High-Quality and Large-Scale SQL Generation.
Proc. VLDB Endow., November, 2025

A survey on LoRA of large language models.
Frontiers Comput. Sci., July, 2025

LOFTune: A Low-Overhead and Flexible Approach for Spark SQL Configuration Tuning.
IEEE Trans. Knowl. Data Eng., June, 2025

Snoopy: Effective and Efficient Semantic Join Discovery via Proxy Columns.
IEEE Trans. Knowl. Data Eng., May, 2025

CT-Agent: A Multimodal-LLM Agent for 3D CT Radiology Question Answering.
CoRR, May, 2025

Text-to-Pipeline: Bridging Natural Language and Data Preparation Pipelines.
CoRR, May, 2025

Boosting GNN-Based Link Prediction via PU-AUC Optimization.
IEEE Trans. Knowl. Data Eng., April, 2025

scAgent: Universal Single-Cell Annotation via a LLM Agent.
CoRR, April, 2025

FIT-RAG: Black-Box RAG with Factual Information and Token Reduction.
ACM Trans. Inf. Syst., March, 2025

BIRDIE: Natural Language-Driven Table Discovery Using Differentiable Search Index.
Proc. VLDB Endow., March, 2025

DAgent: A Relational Database-Driven Data Analysis Report Generation Agent.
CoRR, March, 2025

G-Boost: Boosting Private SLMs with General LLMs.
CoRR, March, 2025

scRAG: an Efficient Retrieval Augmented Generation System for scRNA-seq Data Analysis.
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

KnowTrans: Boosting Transferability of Data Preparation LLMs via Knowledge Augmentation.
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

2024
LEAP: A Low-cost Spark SQL Query Optimizer using Pairwise Comparison.
Proc. VLDB Endow., November, 2024

Class-Imbalanced-Aware Distantly Supervised Named Entity Recognition.
IEEE Trans. Neural Networks Learn. Syst., September, 2024

UniView: A Unified Autonomous Materialized View Management System for Various Databases.
Proc. VLDB Endow., August, 2024

Dynamic Graph Embedding via Meta-Learning.
IEEE Trans. Knowl. Data Eng., July, 2024

FusionQuery: On-demand Fusion Queries over Multi-source Heterogeneous Data.
Proc. VLDB Endow., February, 2024

FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis.
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

SparDL: Distributed Deep Learning Training with Efficient Sparse Communication.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

MultiEM: Efficient and Effective Unsupervised Multi-Table Entity Matching.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

2023
Task Variance Regularized Multi-Task Learning.
IEEE Trans. Knowl. Data Eng., August, 2023

C3: Zero-shot Text-to-SQL with ChatGPT.
CoRR, 2023

SparDL: Distributed Deep Learning Training with Efficient Sparse Communication.
CoRR, 2023

Knowledge-refined Denoising Network for Robust Recommendation.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

CampER: An Effective Framework for Privacy-Aware Deep Entity Resolution.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Towards Explainable Table Interpretation Using Multi-view Explanations.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

2022
PromptEM: Prompt-tuning for Low-resource Generalized Entity Matching.
Proc. VLDB Endow., 2022

SoLar: Sinkhorn Label Refinery for Imbalanced Partial-Label Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Less-forgetting Multi-lingual Fine-tuning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MetaWeighting: Learning to Weight Tasks in Multi-Task Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Towards Improving Generalization of Multi-Task Learning.
PhD thesis, 2021

Robust Deep Multi-task Learning Framework for Cancer Survival Analysis.
Proceedings of the International Joint Conference on Neural Networks, 2021

BanditMTL: Bandit-based Multi-task Learning for Text Classification.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Adaptive Adversarial Multi-task Representation Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

Tchebycheff Procedure for Multi-task Text Classification.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020


  Loading...