Haoning Wu

This is a disambiguation page: it lists papers by multiple people with the same or a similar name.

Bibliography

2026
PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning.
CoRR, March, 2026

OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams.
CoRR, March, 2026

WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models.
CoRR, February, 2026

Towards Pixel-Level VLM Perception via Simple Points Prediction.
CoRR, January, 2026

BabyVision: Visual Reasoning Beyond Language.
CoRR, January, 2026

MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026

2025
SoccerMaster: A Vision Foundation Model for Soccer Understanding.
CoRR, December, 2025

Robot Learning from a Physical World Model.
CoRR, November, 2025

The path of hyperinterpolation: A survey.
CoRR, October, 2025

Towards the Datasets Used in Requirements Engineering of Mobile Apps: Preliminary Findings from a Systematic Mapping Study.
CoRR, September, 2025

A Survey on the Techniques and Tools for Automated Requirements Elicitation and Analysis of Mobile Apps.
CoRR, September, 2025

SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass.
CoRR, August, 2025

VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?
CoRR, May, 2025

MPAM: Dual-Transformer for Millimeter-Wave Sensing Based Multi-person Activity Monitoring System.
Proceedings of the Wireless Artificial Intelligent Computing Systems and Applications, 2025

MRGen: Segmentation Data Engine for Underrepresented MRI Modalities.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

LIME: Less Is More for MLLM Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Generative Frame Sampler for Long Video Understanding.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Marcinkiewicz-Zygmund inequalities for scattered data on polygons.
CoRR, 2024

Aria: An Open Multimodal Native Mixture-of-Experts Model.
CoRR, 2024

LIME: Less Is More for MLLM Evaluation.
CoRR, 2024

MMRA: A Benchmark for Multi-granularity Multi-image Relational Association.
CoRR, 2024

LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

AARR-Net: An Attention Assistance Feature Fusion and Model Recursive Recovery Network for Category-Level 6D Object Pose Estimation.
Proceedings of the Neural Information Processing - 31st International Conference, 2024

2020
Utilizing online stochastic optimization on scheduling of intensity-modulate radiotherapy therapy (IMRT).
J. Biomed. Informatics, 2020

2019
Data Locality Optimization of Depthwise Separable Convolutions for CNN Inference Accelerators.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019

