Chen Zhao

Affiliations:
  • New York University Shanghai, Shanghai, China


According to our database1, Chen Zhao authored at least 36 papers between 2016 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks.
CoRR, July, 2025

SUCEA: Reasoning-Intensive Retrieval for Adversarial Fact-checking through Claim Decomposition and Editing.
CoRR, June, 2025

Inter-Passage Verification for Multi-evidence Multi-answer QA.
CoRR, June, 2025

Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective.
CoRR, May, 2025

Reasoning Models Know When They're Right: Probing Hidden States for Self-Verification.
CoRR, April, 2025

MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search.
CoRR, March, 2025

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding.
CoRR, January, 2025

Are Multimodal LLMs Robust Against Adversarial Perturbations? RoMMath: A Systematic Evaluation on Multimodal Math Reasoning.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Physics: Benchmarking Foundation Models on University-Level Physics Problem Solving.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Inter-Passage Verification for Multi-evidence Multi-answer QA.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Two Failures of Self-Consistency in the Multi-Step Reasoning of LLMs.
Trans. Mach. Learn. Res., 2024

MRAG: A Modular Retrieval Framework for Time-Sensitive Question Answering.
CoRR, 2024

FinDVer: Explainable Claim Verification over Long and Hybrid-Content Financial Documents.
CoRR, 2024

Large Language Models Help Humans Verify Truthfulness - Except When They Are Convincingly Wrong.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Do Models Explain Themselves? Counterfactual Simulatability of Natural Language Explanations.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

SynTQA: Synergistic Table-based Question Answering via Mixture of Text-to-SQL and E2E TQA.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

FinDVer: Explainable Claim Verification over Long and Hybrid-content Financial Documents.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Your Co-Workers Matter: Evaluating Collaborative Capabilities of Language Models in Blocks World.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Parallel Structures in Pre-training Data Yield In-Context Learning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

KnowledgeFMath: A Knowledge-Intensive Math Reasoning Dataset in Finance Domains.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

TaPERA: Enhancing Faithfulness and Interpretability in Long-Form Table QA by Content Planning and Execution-based Reasoning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
KnowledgeMath: Knowledge-Intensive Math Word Problem Solving in Finance Domains.
CoRR, 2023

Mixture of Prompt Experts for Generalizable and Interpretable Question Answering.
CoRR, 2023

Getting MoRE out of Mixture of Language Model Reasoning Experts.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Revisiting Calibration for Question Answering.
CoRR, 2022

Re-Examining Calibration: The Case of Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Distantly-Supervised Evidence Retrieval Enables Question Answering without Evidence Annotation.
CoRR, 2021

Multi-Step Reasoning Over Unstructured Text with Beam Dense Retrieval.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Distantly-Supervised Dense Retrieval Enables Open-Domain Question Answering without Evidence Annotation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

What's in a Name? Answer Equivalence For Open-Domain Question Answering.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Complex Factoid Question Answering with a Free-Text Knowledge Graph.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

2018
A dataset and baselines for sequential open-domain question answering.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2016
Atom Decomposition with Adaptive Basis Selection Strategy for Matrix Completion.
ACM Trans. Multim. Comput. Commun. Appl., 2016


  Loading...