Jian Wang

Orcid: 0000-0003-4144-1753

Affiliations:
  • Ant Group, Hangzhou, China


According to our database1, Jian Wang authored at least 45 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
CTRL-RAG: Contrastive Likelihood Reward Based Reinforcement Learning for Context-Faithful RAG Models.
CoRR, March, 2026

LiveClin: A Live Clinical Benchmark without Leakage.
CoRR, February, 2026

WebClipper: Efficient Evolution of Web Agents with Graph-based Trajectory Pruning.
CoRR, February, 2026

ClinAlign: Scaling Healthcare Alignment from Clinician Preference.
CoRR, February, 2026

MLB: A Scenario-Driven Benchmark for Evaluating Large Language Models in Clinical Applications.
CoRR, January, 2026

TOOL-CURE: Tool Selection via Curriculum-Enhanced Reinforcement Learning with Sample Screening for LLMs.
Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining, 2026

PulseMind: A Multi-Modal Medical Model for Real-World Clinical Diagnosis.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

PRGB Benchmark: A Robust Placeholder-Assisted Algorithm for Benchmarking Retrieval-Augmented Generation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

MHB: Medical Hallucination Benchmark for Large Language Models in Complex Clinical Tasks.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Perplexity-Aware Data Scaling Law: Perplexity Landscapes Predict Performance for Continual Pre-training.
CoRR, December, 2025

GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning.
CoRR, November, 2025

EHR-R1: A Reasoning-Enhanced Foundational Language Model for Electronic Health Record Analysis.
CoRR, October, 2025

Evolving Diagnostic Agents in a Virtual Clinical Environment.
CoRR, October, 2025

GAPS: A Clinically Grounded, Automated Benchmark for Evaluating AI Clinicians.
CoRR, October, 2025

Self-Rewarding Rubric-Based Reinforcement Learning for Open-Ended Reasoning.
CoRR, September, 2025

HANRAG: Heuristic Accurate Noise-resistant Retrieval-Augmented Generation for Multi-hop Question Answering.
CoRR, September, 2025

DIVER: A Multi-Stage Approach for Reasoning-intensive Information Retrieval.
CoRR, August, 2025

Learning to Align, Aligning to Learn: A Unified Approach for Self-Optimized Alignment.
CoRR, August, 2025

PRGB Benchmark: A Robust Placeholder-Assisted Algorithm for Benchmarking Retrieval-Augmented Generation.
CoRR, July, 2025

POLYRAG: Integrating Polyviews into Retrieval-Augmented Generation for Medical Applications.
CoRR, April, 2025

ADMIRE: ADaptive method to enhance Multiple Image REsolutions in text-rich multi-image understanding.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

HIRAG: Hierarchical-Thought Instruction-Tuning Retrieval-Augmented Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Training Object Detectors from Scratch: An Empirical Study in the Era of Vision Transformer.
Int. J. Comput. Vis., August, 2024

RuleAlign: Making Large Language Models Better Physicians with Diagnostic Rule Alignment.
CoRR, 2024

SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding.
CoRR, 2024

Enhancing DETRs Variants through Improved Content Query and Similar Query Aggregation.
CoRR, 2024

M<sub>2</sub>-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining.
CoRR, 2024

OrchMoE: Efficient Multi-Adapter Learning with Task-Skill Synergy.
CoRR, 2024

Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Parameter-Efficient Complementary Expert Learning for Long-Tailed Visual Recognition.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

POA: Pre-training Once for Models of All Sizes.
Proceedings of the Computer Vision - ECCV 2024, 2024

EcoMatcher: Efficient Clustering Oriented Matcher for Detector-Free Image Matching.
Proceedings of the Computer Vision - ECCV 2024, 2024

SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Better Vision-Inspired Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Learning Implicit Entity-object Relations by Bidirectional Generative Alignment for Multimodal NER.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Wall-to-Wall Above-Ground Biomass Estimation with Alos-2 Palsar-2 L-Band SAR Data and GEDI.
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2023

Uncertainty-guided Learning for Improving Image Manipulation Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Simultaneously Short- and Long-Term Temporal Modeling for Semi-Supervised Video Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
CRET: Cross-Modal Retrieval Transformer for Efficient Text-Video Retrieval.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Hierarchical Memory Learning for Fine-Grained Scene Graph Generation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Training Object Detectors from Scratch: An Empirical Study in the Era of Vision Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
GilBERT: Generative Vision-Language Pre-Training for Image-Text Retrieval.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

2020
Farmland Parcel Mapping in Mountain Areas Using Time-Series SAR Data and VHR Optical Images.
Remote. Sens., 2020

Automatic Car Damage Assessment System: Reading and Understanding Videos as Professional Insurance Inspectors.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020


  Loading...