Jipeng Zhang

This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.

Known people with the same name:

Bibliography

2026
Cross-Supervision Similarity Network for Medical Image Classification on Imbalanced Small Datasets.
IEEE Trans. Medical Imaging, May, 2026

RS-HyRe-R1: A Hybrid Reward Mechanism to Overcome Perceptual Inertia for Remote Sensing Images Understanding.
CoRR, April, 2026

FarmMind: Reasoning-Query-Driven Dynamic Segmentation for Farmland Remote Sensing Images.
CoRR, January, 2026

MMedExpert-R1: Strengthening Multimodal Medical Reasoning via Domain-Specific Adaptation and Clinical Guideline Reinforcement.
CoRR, January, 2026

Layer-Order Inversion: Rethinking Latent Multi-Hop Reasoning in Large Language Models.
CoRR, January, 2026

An adversarial training framework for improving zero-shot robustness of deep reinforcement learning-based autonomous driving policies.
Displays, 2026

MarineEval: Assessing the Marine Intelligence of Vision-Language Models.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

Faithful in Steps: Improving Generalization and Citation in RAG via Query Decomposition.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
LongVideoAgent: Multi-Agent Reasoning with Long Videos.
CoRR, December, 2025

Skin-R1: Toward Trustworthy Clinical Reasoning for Dermatological Diagnosis.
CoRR, November, 2025

RIDE: Difficulty Evolving Perturbation with Item Response Theory for Mathematical Reasoning.
CoRR, November, 2025

MARS-SQL: A multi-agent reinforcement learning framework for Text-to-SQL.
CoRR, November, 2025

Med-RewardBench: Benchmarking Reward Models and Judges for Medical Multimodal Large Language Models.
CoRR, August, 2025

Remote Sensing Image Intelligent Interpretation with the Language-Centered Perspective: Principles, Methods and Challenges.
CoRR, August, 2025

VL-GenRM: Enhancing Vision-Language Verification via Vision Experts and Iterative Training.
CoRR, June, 2025

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly.
CoRR, May, 2025

DIDS: Domain Impact-aware Data Sampling for Large Language Model Training.
CoRR, April, 2025

Benchmarking Multi-National Value Alignment for Large Language Models.
CoRR, April, 2025

MA-LoT: Multi-Agent Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving.
CoRR, March, 2025

Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training.
CoRR, February, 2025

LHADRO: A Robust Control Framework for Autonomous Vehicles Under Cyber-Physical Attacks.
IEEE Trans. Inf. Forensics Secur., 2025

STGAN-CR: A Semantics-Aware Cloud Removal Network Integrating Swin Transformer and GANs for Remote Sensing Applications.
Int. J. Softw. Eng. Knowl. Eng., 2025

Vectorized Falcon-Sign Implementations using SSE2, AVX2, AVX-512F, NEON, and RVV.
IACR Cryptol. ePrint Arch., 2025

ODCCMamba-Unet: A Mamba-Unet Based Model with Omnidirectional Divide-and-Conquer Scanning Mechanism for Remote Sensing Image Change Detection.
Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Multi-contrastive Regularization for Single Image Portrait Relighting.
Proceedings of the 7th ACM International Conference on Multimedia in Asia, 2025

MA-LoT: Model-Collaboration Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Personalized Visual Instruction Tuning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

AlignGuard: Scalable Safety Alignment for Text-to-Image Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

ExeSQL: Self-Taught Text-to-SQL Models with Execution-Driven Bootstrapping for SQL Dialects.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

DIDS: Domain Impact-aware Data Sampling for Large Language Model Training.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Pointing to a Llama and Call it a Camel: On the Sycophancy of Multimodal Large Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Local-Global Context Encoding Architecture for the Insulin Prescription Recommendation.
Proceedings of the 28th International Conference on Computer Supported Cooperative Work in Design, 2025

Bridge-Coder: Transferring Model Capabilities from High-Resource to Low-Resource Programming Language.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

LegalReasoner: Step-wised Verification-Correction for Legal Judgment Reasoning.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Making RALM Robust to Irrelevant Contexts via Layer Knowledge Guided Attention.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Benchmarking Multi-National Value Alignment for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

EAGLE: Expert-Guided Self-Enhancement for Preference Alignment in Pathology Large Vision-Language Model.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
RAP Vol: Robust Adversary Populations With Volume Diversity Measure.
IEEE Trans. Neural Networks Learn. Syst., December, 2024

A Multi-Teacher Policy Distillation Framework for Enhancing Zero-Shot Generalization of Autonomous Driving Policies.
IEEE Trans. Veh. Technol., July, 2024

X$^{2}$2-VLM: All-in-One Pre-Trained Model for Vision-Language Tasks.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

DAInfer: Inferring API Aliasing Specifications from Library Documentation via Neurosymbolic Optimization.
Proc. ACM Softw. Eng., 2024

SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation.
CoRR, 2024

WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image.
CoRR, 2024

Fox-1 Technical Report.
CoRR, 2024

Alopex: A Computational Framework for Enabling On-Device Function Calls with LLMs.
CoRR, 2024

Bridge-Coder: Unlocking LLMs' Potential to Overcome Language Gaps in Low-Resource Code.
CoRR, 2024

PolyRouter: A Multi-LLM Querying System.
CoRR, 2024

ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting.
CoRR, 2024

Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions.
CoRR, 2024

The Instinctive Bias: Spurious Images lead to Hallucination in MLLMs.
CoRR, 2024

MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance.
CoRR, 2024

Exploring Boundary of GPT-4V on Marine Analysis: A Preliminary Case Study.
CoRR, 2024

PipeNet: Question Answering with Semantic Pruning over Knowledge Graphs.
Proceedings of the 13th Joint Conference on Lexical and Computational Semantics, 2024

STGAN-CR: A Swin Transformer-Enhanced GAN Framework for Effective Cloud Removal in Satellite Imagery.
Proceedings of the 36th International Conference on Software Engineering and Knowledge Engineering, 2024

Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: System Demonstrations, 2024

TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

TensorOpera Router: A Multi-Model Router for Efficient LLM Inference.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Mitigating the Alignment Tax of RLHF.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

The Instinctive Bias: Spurious Images lead to Illusion in MLLMs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization.
Proceedings of the Computer Vision - ECCV 2024, 2024

PerceptionGPT: Effectively Fusing Visual Perception Into LLM.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Plum: Prompt Learning using Metaheuristics.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2024

2023
A Deep Reinforcement Learning Algorithm Suitable for Autonomous Vehicles: Double Bootstrapped Soft-Actor-Critic-Discrete.
IEEE Trans. Cogn. Dev. Syst., December, 2023

A Portable Three-Layer Compton Camera for Wide-Energy-Range Gamma-ray Imaging: Design, Simulation and Preliminary Testing.
Sensors, November, 2023

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment.
Trans. Mach. Learn. Res., 2023

Plum: Prompt Learning using Metaheuristic.
CoRR, 2023

MarineGPT: Unlocking Secrets of Ocean to the Public.
CoRR, 2023

Mitigating the Alignment Tax of RLHF.
CoRR, 2023

Dynamic Prediction using Time-Dependent Cox Survival Neural Network.
CoRR, 2023

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment.
CoRR, 2023

Don't be Blind to Questions: Question-Oriented Math Word Problem Solving.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

Toward Building General Foundation Models for Language, Vision, and Vision-Language Understanding Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

DetGPT: Detect What You Need via Reasoning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

UniMath: A Foundational and Multimodal Mathematical Reasoner.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Non-Autoregressive Sentence Ordering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Compositional Mathematical Encoding for Math Word Problems.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Generalizing Math Word Problem Solvers via Solution Diversification.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Action-Centric Relation Transformer Network for Video Question Answering.
IEEE Trans. Circuits Syst. Video Technol., 2022

Multimodal Genotype and Phenotype Data Integration to Improve Partial Data-Based Longitudinal Prediction.
J. Comput. Biol., 2022

X<sup>2</sup>-VLM: All-In-One Pre-trained Model For Vision-Language Tasks.
CoRR, 2022

Execution-based Evaluation for Data Science Code Generation Models.
CoRR, 2022

Multi-modal Genotype and Phenotype Mutual Learning to Enhance Single-Modal Input Based Longitudinal Outcome Prediction.
Proceedings of the Research in Computational Molecular Biology, 2022

MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Non-Autoregressive Cross-Modal Coherence Modelling.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Analogical Math Word Problems Solving with Enhanced Problem-Solution Association.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
Urban Traffic Control in Software Defined Internet of Things via a Multi-Agent Deep Reinforcement Learning Approach.
IEEE Trans. Intell. Transp. Syst., 2021

Dynamic analysis and chaos control of the switched-inductor boost converter with the memristive load.
Int. J. Circuit Theory Appl., 2021

MWP-BERT: A Strong Baseline for Math Word Problems.
CoRR, 2021

2020
Deep learning-based edge caching for multi-cluster heterogeneous networks.
Neural Comput. Appl., 2020

Teacher-Student Networks with Multiple Decoders for Solving Math Word Problem.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Graph-to-Tree Learning for Solving Math Word Problems.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Speeding up LDPC Decoder by Inter-Frame Pipeline for Wireless Laser Communications.
Proceedings of the 2019 IEEE/CIC International Conference on Communications in China, 2019

Modeling Intra-Relation in Math Word Problems with Different Functional Multi-Head Attentions.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Template-Based Math Word Problem Solvers with Recursive Neural Networks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019


  Loading...