Yihao Feng

According to our database, Yihao Feng authored at least 43 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
xLAM: A Family of Large Action Models to Empower AI Agent Systems.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Longhorn: State Space Models are Amortized Online Learners.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Text2Data: Low-Resource Data Generation with Textual Control.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding.
CoRR, 2024

xLAM: A Family of Large Action Models to Empower AI Agent Systems.
CoRR, 2024

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents.
CoRR, 2024

APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets.
CoRR, 2024

AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning.
CoRR, 2024

Text2Data: Low-Resource Data Generation with Textual Control.
CoRR, 2024

APIGen: Automated PIpeline for Generating Verifiable and Diverse Function-Calling Datasets.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Demand Prediction by Incorporating Internet-of-Things Data: A Case of Automobile Repair and Maintenance Service.
Proceedings of the 57th Hawaii International Conference on System Sciences, 2024

xGen-VideoSyn-1: High-Fidelity Text-to-Video Synthesis with Compressed Representations.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

HIVE: Harnessing Human Feedback for Instructional Visual Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents.
CoRR, 2023

Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization.
CoRR, 2023

REX: Rapid Exploration and eXploitation for AI Agents.
CoRR, 2023

Preference-grounded Token-level Guidance for Language Model Fine-tuning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

FAMO: Fast Adaptive Multitask Optimization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Metric Residual Network for Sample Efficient Goal-Conditioned Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Metric Residual Networks for Sample Efficient Goal-conditioned Reinforcement Learning.
CoRR, 2022

A Regularized Implicit Policy for Offline Reinforcement Learning.
CoRR, 2022

Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning.
CoRR, 2022

A Unified Framework for Alternating Offline Model Training and Policy Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

2021
Incremental Few-shot Text Classification with Multi-round New Classes: Formulation, Dataset and System.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds.
Proceedings of the 9th International Conference on Learning Representations, 2021

Unsupervised Out-of-Domain Detection via Pre-trained Transformers.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Off-Policy Interval Estimation with Lipschitz Value Iteration.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Accountable Off-Policy Evaluation With Kernel Bellman Statistics.
Proceedings of the 37th International Conference on Machine Learning, 2020

Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
A Kernel Loss for Solving the Bellman Equation.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
Knowledge-guided Semantic Computing Network.
CoRR, 2018

Action-dependent Control Variates for Policy Optimization via Stein Identity.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
ARShop: A Cloud-based Augmented Reality System for Shopping.
Proc. VLDB Endow., 2017

Sample-efficient Policy Optimization with Stein Control Variate.
CoRR, 2017

Learning to Draw Samples with Amortized Stein Variational Gradient Descent.
Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence, 2017

