Jiahao Qiu

ORCID: 0009-0000-7752-4169

According to our database, Jiahao Qiu authored at least 25 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence.
CoRR, July, 2025

ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs.
CoRR, June, 2025

AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes.
CoRR, June, 2025

Toward a Theory of Agents as Tool-Use Decision-Makers.
CoRR, June, 2025

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution.
CoRR, May, 2025

On Path to Multimodal Historical Reasoning: HistBench and HistAgent.
CoRR, May, 2025

Shallow Preference Signals: Large Language Model Aligns Even Better with Truncated Data?
CoRR, May, 2025

OTC: Optimal Tool Calls via Reinforcement Learning.
CoRR, April, 2025

EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental Health Safety.
CoRR, April, 2025

Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models.
CoRR, March, 2025

Temporal Consistency for LLM Reasoning Process Error Identification.
CoRR, March, 2025

Collab: Controlled Decoding using Mixture of Agents for LLM Alignment.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling.
CoRR, 2024

MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences.
CoRR, 2024

Fast Best-of-N Decoding via Speculative Rejection.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

MaxMin-RLHF: Alignment with Diverse Human Preferences.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

L2 Normalization and Geodesic Distance for Enhanced Information Preservation in Visualizing High-dimensional Single-cell Sequencing Data.
Proceedings of the 15th ACM International Conference on Bioinformatics, 2024

Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Inverse kinematic solution of a 7-DOF robot with a telescopic forearm based on Joint Limit and inertia Matrix fluctuation.
Int. J. Robotics Autom., 2023

Hardware Circuit Design of Multifunctional Intelligent Table Lamp Based on STM32.
Proceedings of the 2023 International Conference on Power, 2023

2022
Efficient Model Assisted Probability of Detection Estimations in Eddy Current NDT with ACA-SVD Based Forward Solver.
Sensors, 2022

Multi-core parallel architecture design and experiment for deep learning model training.
Multim. Tools Appl., 2022

FusionNet: Coarse-to-Fine Extrinsic Calibration Network of LiDAR and Camera with Hierarchical Point-pixel Fusion.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Hidden State Variability of Pretrained Language Models Can Guide Computation Reduction for Transfer Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Inverse Kinematics of a 7-DOF Spray-Painting Robot with a Telescopic Forearm.
Proceedings of the Intelligent Robotics and Applications - 14th International Conference, 2021
