Hongyi Guo

According to our database, Hongyi Guo authored at least 16 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards.
CoRR, 2024

Can Large Language Models Play Games? A Case Study of A Self-Play Approach.
CoRR, 2024

Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting.
CoRR, 2024

Human-Instruction-Free LLM Self-Alignment with Limited Samples.
CoRR, 2024

2023
Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency.
CoRR, 2023

Behavior Contrastive Learning for Unsupervised Skill Discovery.
Proceedings of the International Conference on Machine Learning, 2023

2022
Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes.
Proceedings of the International Conference on Machine Learning, 2022

2021
Policy Learning Using Weak Supervision.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Decentralized Single-Timescale Actor-Critic on Zero-Sum Two-Player Stochastic Games.
Proceedings of the 38th International Conference on Machine Learning, 2021

Signal Instructed Coordination in Cooperative Multi-agent Reinforcement Learning.
Proceedings of the Distributed Artificial Intelligence - Third International Conference, 2021

2020
An improved localization method in cyber-social environments with obstacles.
Comput. Electr. Eng., 2020

Peer Loss Functions: Learning from Noisy Labels without Knowing Noise Rates.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Signal Instructed Coordination in Team Competition.
CoRR, 2019

Life Assistants for the Elderly Based on Mobile Devices.
Proceedings of the 2019 IEEE Intl Conf on Dependable, 2019

2017
ChildGuard: A Child-Safety Monitoring System.
IEEE Multim., 2017

2016
Automatic Threshold Calculation Based Label Propagation Algorithm for Overlapping Community.
Proceedings of the IEEE First International Conference on Data Science in Cyberspace, 2016

