Zhenting Qi

According to our database1, Zhenting Qi authored at least 30 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Scaling Reward Modeling without Human Supervision.
CoRR, March, 2026

MoCo: A One-Stop Shop for Model Collaboration Research.
CoRR, January, 2026

DSGym: A Holistic Framework for Evaluating and Training Data Science Agents.
CoRR, January, 2026

2025
Confucius Code Agent: Scalable Agent Scaffolding for Real-World Codebases.
CoRR, December, 2025

EvoLM: In Search of Lost Language Model Training Dynamics.
CoRR, June, 2025

Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering.
CoRR, May, 2025

Learning to Rank Chain-of-Thought: An Energy-Based Approach with Outcome Supervision.
CoRR, May, 2025

Measuring the Faithfulness of Thinking Drafts in Large Reasoning Models.
CoRR, May, 2025

ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning.
CoRR, April, 2025

Generalizing Trust: Weak-to-Strong Trustworthiness in Language Models.
CoRR, January, 2025

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solver.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Quantifying Generalization Complexity for Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MuTIS: Enhancing Reasoning Efficiency through Multi Turn Intervention Sampling in Reinforcement Learning.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers.
CoRR, 2024

Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024


2023
QTSumm: A New Benchmark for Query-Focused Table Summarization.
CoRR, 2023

Self-Criticism: Aligning Large Language Models with their Understanding of Helpfulness, Honesty, and Harmlessness.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

PILLOW: Enhancing Efficient Instruction Fine-tuning via Prompt Matching.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

QTSumm: Query-Focused Summarization over Tabular Data.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

OpenRT: An Open-source Framework for Reasoning Over Tabular Data.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), 2023

SaFER: A Robust and Efficient Framework for Fine-tuning BERT-based Classifier with Noisy Labels.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track), 2023

2022
FOLIO: Natural Language Reasoning with First-Order Logic.
CoRR, 2022

Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model.
Proceedings of the 34th IEEE International Conference on Tools with Artificial Intelligence, 2022

ReasTAP: Injecting Table Reasoning Skills During Pre-training via Synthetic Reasoning Examples.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022


  Loading...