Heli Qi

Orcid: 0000-0001-9512-7140

According to our database, Heli Qi authored at least 19 papers between 2022 and 2026.

Bibliography

2026
Code-Switching Information Retrieval: Benchmarks, Analysis, and the Limits of Current Retrievers.
CoRR, April, 2026

The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents.
CoRR, January, 2026

Toward Global Large Language Models in Medicine.
CoRR, January, 2026

2025
TeamPath: Building MultiModal Pathology Experts with Reasoning AI Copilots.
CoRR, November, 2025

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search.
CoRR, September, 2025

Position: The Hidden Costs and Measurement Gaps of Reinforcement Learning with Verifiable Rewards.
CoRR, September, 2025

VeriGUI: Verifiable Long-Chain GUI Dataset.
CoRR, August, 2025

DisasterM3: A Remote Sensing Vision-Language Dataset for Disaster Damage Assessment and Response.
CoRR, May, 2025

DynamicVL: Benchmarking Multimodal Large Language Models for Dynamic City Understanding.
CoRR, May, 2025

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation.
CoRR, March, 2025

Seeing is Believing, but How Much? A Comprehensive Analysis of Verbalized Calibration in Vision-Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
Segment Anything with Multiple Modalities.
CoRR, 2024

Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model.
CoRR, 2024

CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
SpeeChain: A Speech Toolkit for Large-Scale Machine Speech Chain.
CoRR, 2023

2022
USB: A Unified Semi-supervised Learning Benchmark.
CoRR, 2022

USB: A Unified Semi-supervised Learning Benchmark for Classification.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

