Junbo Zhao

ORCID: 0000-0002-8498-9666

Affiliations:
  • Zhejiang University, Zhejiang, China


According to our database, Junbo Zhao authored at least 72 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
Reinforcement Learning with Rubric Anchors.
CoRR, August, 2025

SPA++: Generalized Graph Spectral Alignment for Versatile Domain Adaptation.
CoRR, August, 2025

Toward Real-World Table Agents: Capabilities, Workflows, and Design Principles for LLM-based Table Intelligence.
CoRR, July, 2025

FLoE: Fisher-Based Layer Selection for Efficient Sparse Adaptation of Low-Rank Experts.
CoRR, June, 2025

ALPS: Attention Localization and Pruning Strategy for Efficient Alignment of Large Language Models.
CoRR, May, 2025

LeTS: Learning to Think-and-Search via Process-and-Outcome Reward Hybridization.
CoRR, May, 2025

CaCOM: customizing text-to-image diffusion models in the wild via continual active selection.
Mach. Learn., March, 2025

Category-free Out-of-Distribution Node Detection with Feature Resonance.
CoRR, February, 2025

Shift guided active learning.
Mach. Learn., January, 2025

Towards Robust Incremental Learning under Ambiguous Supervision.
CoRR, January, 2025

Jailbreaking Prompt Attack: A Controllable Adversarial Attack against Diffusion Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

DataMan: Data Manager for Pre-training Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Bridging the Semantic Gap Between Text and Table: A Case Study on NL2SQL.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

AIGT: AI Generative Table Based on Prompt.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

D.Va: Validate Your Demonstration First Before You Use It.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

RealHiTBench: A Comprehensive Realistic Hierarchical Table Benchmark for Evaluating LLM-Based Table Analysis.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

ALPS: Attention Localization and Pruning Strategy for Efficient Adaptation of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
When Quantum Computing Meets Database: A Hybrid Sampling Framework for Approximate Query Processing.
IEEE Trans. Knowl. Data Eng., December, 2024

CORAL: Collaborative Automatic Labeling System based on Large Language Models.
Proc. VLDB Endow., August, 2024

PiCO+: Contrastive Label Disambiguation for Robust Partial Label Learning.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

From Laws to Motivation: Guiding Exploration through Law-Based Reasoning and Rewards.
CoRR, 2024

TableGPT2: A Large Multimodal Model with Tabular Data Integration.
CoRR, 2024

A Comparative Study on Reasoning Patterns of OpenAI's o1 Model.
CoRR, 2024

Jailbreaking Prompt Attack: A Controllable Adversarial Attack against Diffusion Models.
CoRR, 2024

Pre-Trained Model Recommendation for Downstream Fine-tuning.
CoRR, 2024

Towards Cross-Table Masked Pretraining for Web Data Mining.
Proceedings of the ACM on Web Conference 2024, 2024

Locating What You Need: Towards Adapting Diffusion Models to OOD Concepts In-the-Wild.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Unbiased Multi-Label Learning from Crowdsourced Annotations.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Energy-based Automated Model Evaluation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Embedding and Gradient Say Wrong: A White-Box Method for Hallucination Detection.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Targeted Representation Alignment for Open-World Semi-Supervised Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Positive-Unlabeled Learning by Latent Group-Aware Meta Disambiguation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

RECOST: External Knowledge Guided Data-efficient Instruction Tuning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Data Contamination Calibration for Black-box LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Learning Geometry-Aware Representations for New Intent Discovery.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

DORY: Deliberative Prompt Recovery for LLM.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

A Separation and Alignment Framework for Black-Box Domain Adaptation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
SmartLite: A DBMS-based Serving System for DNN Inference in Resource-constrained Environments.
Proc. VLDB Endow., November, 2023

Fuse feeds as one: cross-modal framework for general identification of AMPs.
Briefings Bioinform., September, 2023

Towards Domain-Specific Features Disentanglement for Domain Generalization.
CoRR, 2023

TableGPT: Towards Unifying Tables, Nature Language and Commands into One GPT.
CoRR, 2023

CT-BERT: Learning Better Tabular Representations Through Cross-Table Pre-training.
CoRR, 2023

Assessing Hidden Risks of LLMs: An Empirical Study on Robustness, Consistency, and Credibility.
CoRR, 2023

Maybe Only 0.5% Data is Needed: A Preliminary Exploration of Low Training Data Instruction Tuning.
CoRR, 2023

Better Sign Language Translation with Monolingual Data.
CoRR, 2023

Controllable Textual Inversion for Personalized Text-to-Image Generation.
CoRR, 2023

Catch: Collaborative Feature Set Search for Automated Feature Engineering.
Proceedings of the ACM Web Conference 2023, 2023

SPA: A Graph Spectral Alignment Perspective for Domain Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Debiased and Denoised Entity Recognition from Distant Supervision.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ProMix: Combating Label Noise via Maximizing Clean Sample Utility.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Latent Processes Identification From Multi-View Time Series.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Towards Controlled Data Augmentations for Active Learning.
Proceedings of the International Conference on Machine Learning, 2023

Learning a Data-Driven Policy Network for Pre-Training Automated Feature Engineering.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

CAME: Contrastive Automated Model Evaluation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

iDAG: Invariant DAG Searching for Domain Generalization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Revisiting the Knowledge Injection Frameworks.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Dynamic Index Construction with Deep Reinforcement Learning.
Data Sci. Eng., 2022

Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation.
CoRR, 2022

ProMix: Combating Label Noise via Maximizing Clean Sample Utility.
CoRR, 2022

A Sampling-based Learning Framework for Big Databases.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

SoLar: Sinkhorn Label Refinery for Imbalanced Partial-Label Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

HybridVocab: Towards Multi-Modal Machine Translation via Multi-Aspect Alignment.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Deps-SAN: Neural Machine Translation with Dependency-Scaled Self-Attention Network.
Proceedings of the Neural Information Processing - 29th International Conference, 2022

Multi-label Learning with Data Self-augmentation.
Proceedings of the Neural Information Processing - 29th International Conference, 2022

PiCO: Contrastive Label Disambiguation for Partial Label Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

A Comparative Study of in-Database Inference Approaches.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Towards Unifying the Label Space for Aspect- and Sentence-based Sentiment Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Boosting Neural Machine Translation with Dependency-Scaled Self-Attention Network.
CoRR, 2021

Joining datasets via data augmentation in the label space for neural networks.
Proceedings of the 38th International Conference on Machine Learning, 2021

