Hongyi Guo

Orcid: 0009-0006-0129-7856

According to our database1, Hongyi Guo authored at least 25 papers between 2016 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2025
Lightweight intelligent detection algorithm for surface defects in printed circuit board.
Comput. Ind. Eng., 2025

BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Toward Optimal LLM Alignments Using Two-Player Games.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

LogWhisperer: Multi-log Semantic Similarity Analysis Based Intelligent Vehicle Anomaly Detection Without Log Template.
Proceedings of the Information Security and Cryptology - 21st International Conference, 2025

2024
Landslide Hazard Prediction Based on Small Baseline Subset-Interferometric Synthetic-Aperture Radar Technology Combined with Land-Use Dynamic Change and Hydrological Conditions (Sichuan, China).
Remote. Sens., August, 2024

Diverse randomized value functions: A provably pessimistic approach for offline reinforcement learning.
Inf. Sci., 2024

Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning.
CoRR, 2024

Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards.
CoRR, 2024

Can Large Language Models Play Games? A Case Study of A Self-Play Approach.
CoRR, 2024

Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting.
CoRR, 2024

Human-Instruction-Free LLM Self-Alignment with Limited Samples.
CoRR, 2024

Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency.
CoRR, 2023

Behavior Contrastive Learning for Unsupervised Skill Discovery.
Proceedings of the International Conference on Machine Learning, 2023

2022
Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes.
Proceedings of the International Conference on Machine Learning, 2022

2021
Policy Learning Using Weak Supervision.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Decentralized Single-Timescale Actor-Critic on Zero-Sum Two-Player Stochastic Games.
Proceedings of the 38th International Conference on Machine Learning, 2021

Signal Instructed Coordination in Cooperative Multi-agent Reinforcement Learning.
Proceedings of the Distributed Artificial Intelligence - Third International Conference, 2021

2020
An improved localization method in cyber-social environments with obstacles.
Comput. Electr. Eng., 2020

Peer Loss Functions: Learning from Noisy Labels without Knowing Noise Rates.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Signal Instructed Coordination in Team Competition.
CoRR, 2019

Life Assistants for the Elderly Based on Mobile Devices.
Proceedings of the 2019 IEEE Intl Conf on Dependable, 2019

2017
ChildGuard: A Child-Safety Monitoring System.
IEEE Multim., 2017

2016
Automatic Threshold Calculation Based Label Propagation Algorithm for Overlapping Community.
Proceedings of the IEEE First International Conference on Data Science in Cyberspace, 2016


  Loading...