Junyang Wang

Orcid: 0000-0002-3204-6607

Affiliations:
  • Beijing Jiaotong University, School of Computer and Information Technology, Beijing Key Lab of Traffic Data Analysis and Mining, China


According to our database1, Junyang Wang authored at least 19 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Mobile-Agent-v3: Fundamental Agents for GUI Automation.
CoRR, August, 2025

Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation.
CoRR, June, 2025

Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration.
CoRR, February, 2025

PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC.
CoRR, February, 2025

Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks.
CoRR, January, 2025

2024
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception.
CoRR, 2024

Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation.
CoRR, 2023

Evaluation and Analysis of Hallucination in Large Vision-Language Models.
CoRR, 2023

Overlap Bias Matching is Necessary for Point Cloud Registration.
CoRR, 2023

mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality.
CoRR, 2023

Benign Shortcut for Debiasing: Fair Visual Recognition via Intervention with Shortcut Features.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

mPLUG-Octopus: The Versatile Assistant Empowered by A Modularized End-to-End Multimodal LLM.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

From Association to Generation: Text-only Captioning by Unsupervised Cross-modal Mapping.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Improved Visual Fine-tuning with Natural Language Supervision.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Zero-shot Image Captioning by Anchor-augmented Vision-Language Space Alignment.
CoRR, 2022

Fair Visual Recognition via Intervention with Proxy Features.
CoRR, 2022

FairCLIP: Social Bias Elimination based on Attribute Prototype Learning and Representation Neutralization.
CoRR, 2022

Counterfactually Measuring and Eliminating Social Bias in Vision-Language Pre-training Models.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022


  Loading...