Yilong Wu

Orcid: 0009-0008-0497-6904

According to our database1, Yilong Wu authored at least 18 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
LLMEval-3: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models.
CoRR, August, 2025

SpeechRole: A Large-Scale Dataset and Benchmark for Evaluating Speech Role-Playing Agents.
CoRR, August, 2025

A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models.
CoRR, May, 2025

STDArm: Transferring Visuomotor Policies From Static Data Training to Dynamic Robot Manipulation.
CoRR, April, 2025

MT-PCR: Leveraging Modality Transformation for Large-Scale Point Cloud Registration with Limited Overlap.
CoRR, March, 2025

OG-Gaussian: Occupancy Based Street Gaussians for Autonomous Driving.
CoRR, February, 2025

ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use.
CoRR, 2024

Rotation Initialization and Stepwise Refinement for Universal LiDAR Calibration.
CoRR, 2024

ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios.
CoRR, 2024

TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
A Spatial Information Extraction Method Based on Multi-Modal Social Media Data: A Case Study on Urban Inundation.
ISPRS Int. J. Geo Inf., September, 2023

Energy consumption optimisation for machining processes based on numerical control programs.
Adv. Eng. Informatics, August, 2023

Urban Flood Dynamic Risk Assessment Based on Typhoon Rainfall Process: A Case Study of Typhoon "Lupit" (2109) in Fuzhou, China.
Remote. Sens., June, 2023

2022
RoadFormer: Pyramidal deformable vision transformers for road network extraction with remote sensing images.
Int. J. Appl. Earth Obs. Geoinformation, 2022


  Loading...