Gengxu Li
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
Can Large Multimodal Models Actively Recognize Faulty Inputs? A Systematic Evaluation Framework of Their Input Scrutiny Ability.
CoRR, August, 2025
Refining Critical Thinking in LLM Code Generation: A Faulty Premise-based Evaluation Framework.
CoRR, August, 2025
Don't Take the Premise for Granted: Evaluating the Premise Critique Ability of Large Language Models.
CoRR, May, 2025
CoRR, February, 2025
CoRR, February, 2025