Hao Chen

Orcid: 0000-0002-1960-4803

Affiliations:
  • Carnegie Mellon University, PA, USA


According to our database1, Hao Chen authored at least 68 papers between 2018 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Learning from Online Videos at Inference Time for Computer-Use Agents.
CoRR, November, 2025

SkipSR: Faster Super Resolution with Token Skipping.
CoRR, October, 2025

KnowledgeSmith: Uncovering Knowledge Updating in LLMs with Model Editing and Unlearning.
CoRR, October, 2025

ImageDoctor: Diagnosing Text-to-Image Generation via Grounded Image Reasoning.
CoRR, October, 2025

Latent Visual Reasoning.
CoRR, September, 2025

TrustJudge: Inconsistencies of LLM-as-a-Judge and How to Alleviate Them.
CoRR, September, 2025

APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation.
CoRR, September, 2025

Image Tokenizer Needs Post-Training.
CoRR, September, 2025

Impact of Noisy Supervision in Foundation Model Learning.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2025

Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation.
CoRR, June, 2025

Unleashing Hour-Scale Video Training for Long Video-Language Understanding.
CoRR, June, 2025

Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation.
CoRR, May, 2025

Topological Structure Learning Should Be A Research Priority for LLM-Based Multi-Agent Systems.
CoRR, May, 2025

ALPS: Attention Localization and Pruning Strategy for Efficient Alignment of Large Language Models.
CoRR, May, 2025

LeTS: Learning to Think-and-Search via Process-and-Outcome Reward Hybridization.
CoRR, May, 2025

CAARMA: Class Augmentation with Adversarial Mixup Regularization.
CoRR, March, 2025

Robust Latent Matters: Boosting Image Generation with Sampling Error Synthesis.
CoRR, March, 2025

On the robustness of multimodal language model towards distractions.
CoRR, February, 2025

Understanding and Mitigating the Bias Inheritance in LLM-based Data Augmentation on Downstream Tasks.
CoRR, February, 2025

On Fairness of Unified Multimodal Large Language Model for Image Generation.
CoRR, February, 2025

Rethinking the Bias of Foundation Model under Long-tailed Distribution.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Masked Autoencoders Are Effective Tokenizers for Diffusion Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

ImageFolder: Autoregressive Image Generation with Folded Tokens.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

ALPS: Attention Localization and Pruning Strategy for Efficient Adaptation of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
A Survey on Evaluation of Large Language Models.
ACM Trans. Intell. Syst. Technol., June, 2024

Exploring Vision-Language Models for Imbalanced Learning.
Int. J. Comput. Vis., January, 2024

PromptBench: A Unified Library for Evaluation of Large Language Models.
J. Mach. Learn. Res., 2024

On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective.
IEEE Data Eng. Bull., 2024

XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation.
CoRR, 2024

TableGPT2: A Large Multimodal Model with Tabular Data Integration.
CoRR, 2024

On the Diversity of Synthetic Data and its Impact on Training Large Language Models.
CoRR, 2024

Automatic Dataset Construction (ADC): Sample Collection, Data Curation, and Beyond.
CoRR, 2024

Efficient Autoregressive Audio Modeling via Next-Scale Prediction.
CoRR, 2024

ControlVAR: Exploring Controllable Visual Autoregressive Modeling.
CoRR, 2024

RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection.
CoRR, 2024

Learning with Noisy Foundation Models.
CoRR, 2024

On Catastrophic Inheritance of Large Foundation Models.
CoRR, 2024

Metric from Human: Zero-shot Monocular Metric Depth Estimation via Test-time Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Slight Corruption in Pre-training Data Makes Better Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

CompeteAI: Understanding the Competition Dynamics of Large Language Model-based Agents.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Completing Visual Objects via Bridging Generation and Segmentation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

A General Framework for Learning from Weak Supervision.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

AgentReview: Exploring Peer Review Dynamics with LLM Agents.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

R<sup>2</sup>-Bench: Benchmarking the Robustness of Referring Perception Models Under Perturbations.
Proceedings of the Computer Vision - ECCV 2024, 2024

Conv-Adapter: Exploring Parameter Efficient Transfer Learning for ConvNets.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
CompeteAI: Understanding the Competition Behaviors in Large Language Model-based Agents.
CoRR, 2023

Towards Domain-Specific Features Disentanglement for Domain Generalization.
CoRR, 2023

PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization.
CoRR, 2023

Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations.
CoRR, 2023

Maybe Only 0.5% Data is Needed: A Preliminary Exploration of Low Training Data Instruction Tuning.
CoRR, 2023

SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning.
CoRR, 2023

FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

SoftMatch: Addressing the Quantity-Quality Tradeoff in Semi-supervised Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Boosting Transductive Few-Shot Fine-tuning with Margin-based Uncertainty Weighting and Probability Regularization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
3D Human Pose, Shape and Texture From Low-Resolution Images and Videos.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Conv-Adapter: Exploring Parameter Efficient Transfer Learning for ConvNets.
CoRR, 2022

USB: A Unified Semi-supervised Learning Benchmark.
CoRR, 2022

FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning.
CoRR, 2022

USB: A Unified Semi-supervised Learning Benchmark for Classification.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Unitail: Detecting, Reading, and Matching in Retail Scene.
Proceedings of the Computer Vision - ECCV 2022, 2022

2020
3D Human Shape and Pose from a Single Low-Resolution Image with Self-Supervised Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Blind Inpainting of Large-scale Masks of Thin Structures with Adversarial and Reinforcement Learning.
CoRR, 2019

Adversarial Large-Scale Root Gap Inpainting.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
Root Gap Correction with a Deep Inpainting Model.
Proceedings of the British Machine Vision Conference 2018, 2018


  Loading...