Gengxu Li
Bibliography
  2025
Can Large Multimodal Models Actively Recognize Faulty Inputs? A Systematic Evaluation Framework of Their Input Scrutiny Ability.
    CoRR, August, 2025
    
  
Refining Critical Thinking in LLM Code Generation: A Faulty Premise-based Evaluation Framework.
    CoRR, August, 2025
    
  
Don't Take the Premise for Granted: Evaluating the Premise Critique Ability of Large Language Models.
    CoRR, May, 2025
    
  
    CoRR, February, 2025
    
  
    CoRR, February, 2025