Jiahui Geng

Orcid: 0009-0009-3630-3023

According to our database1, Jiahui Geng authored at least 60 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
CodeMMR: Bridging Natural Language, Code, and Image for Unified Retrieval.
CoRR, April, 2026

LongSpeech: A Scalable Benchmark for Transcription, Translation and Understanding in Long Speech.
CoRR, January, 2026

Spurious Rewards Paradox: Mechanistically Understanding How RLVR Activates Memorization Shortcuts in LLMs.
CoRR, January, 2026

The CLEF-2026 FinMMEval Lab: Multilingual and Multimodal Evaluation of Financial AI Systems.
Proceedings of the Advances in Information Retrieval, 2026

2025
M4FC: a Multimodal, Multilingual, Multicultural, Multitask Real-World Fact-Checking Dataset.
CoRR, October, 2025

Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language Models.
CoRR, July, 2025

CoQuIR: A Comprehensive Benchmark for Code Quality-Aware Information Retrieval.
CoRR, June, 2025

Con Instruction: Universal Jailbreaking of Multimodal Large Language Models via Non-Textual Modalities.
CoRR, June, 2025

CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation.
CoRR, May, 2025

A Comprehensive Survey of Machine Unlearning Techniques for Large Language Models.
CoRR, March, 2025

Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI.
CoRR, February, 2025

Internal Activation Revision: Safeguarding Vision Language Models Without Parameter Update.
CoRR, January, 2025

Development and validation of multimodal deep learning algorithms for detecting pulmonary hypertension.
npj Digit. Medicine, 2025

Federated Large Domain Model System.
Blockchain Res. Appl., 2025

On the Taxonomy, Tasks, and Open-Challenges for Multimodal Large Language Models.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2025

FIRE: Fact-checking with Iterative Retrieval and Verification.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

SAUCE: Selective Concept Unlearning in Vision-Language Models with Sparse Autoencoders.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025


OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs.
Proceedings of the 31st International Conference on Computational Linguistics, 2025


Overview of the "Voight-Kampff" Generative AI Authorship Verification Task at PAN and ELOQUENT 2025.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum, 2025

Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

\mathsfCon Instruction: Universal Jailbreaking of Multimodal Large Language Models via Non-Textual Modalities.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

VSCBench: Bridging the Gap in Vision-Language Model Safety Calibration.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Shaping the Safety Boundaries: Understanding and Defending Against Jailbreaks in Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

HD-NDEs: Neural Differential Equations for Hallucination Detection in LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Internal Activation Revision: Safeguarding Vision Language Models Without Parameter Update.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Improved Gradient Inversion Attacks and Defenses in Federated Learning.
IEEE Trans. Big Data, December, 2024

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark.
CoRR, 2024

OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs.
CoRR, 2024

Multimodal Large Language Models to Support Real-World Fact-Checking.
CoRR, 2024

Simultaneous state and fault estimation: A prescribed-time unknown input observer approach.
Appl. Math. Comput., 2024

PrivAuditor: Benchmarking Data Protection Vulnerabilities in LLM Adaptation Techniques.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

A Survey of Confidence Estimation and Calibration in Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Towards Trustworthy Dataset Distillation: A Benchmark of Privacy, Fairness and Robustness.
Proceedings of the International Joint Conference on Neural Networks, 2024

Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024


Reference-free Hallucination Detection for Large Vision-Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Taking Computation to Data: Integrating Privacy-preserving AI techniques and Blockchain Allowing Secure Analysis of Sensitive Data on Premise.
PhD thesis, 2023

Factcheck-GPT: End-to-End Fine-Grained Document-Level Fact-Checking and Correction of LLM Output.
CoRR, 2023

A Survey of Language Model Confidence Estimation and Calibration.
CoRR, 2023

A Comprehensive Study on Dataset Distillation: Performance, Privacy, Robustness and Fairness.
CoRR, 2023

Learning Parameterized ODEs From Data.
IEEE Access, 2023

pFedV: Mitigating Feature Distribution Skewness via Personalized Federated Learning with Variational Distribution Constraints.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2023

Solving Nonlinear Conservation Laws of Partial Differential Equations Using Graph Neural Networks.
Proceedings of the 2023 Northern Lights Deep Learning Workshop, 2023

A Survey on Dataset Distillation: Approaches, Applications and Future Directions.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

2022
OpenIaC: open infrastructure as code - the network is my computer.
J. Cloud Comput., 2022

An Image Region-Based Image Stitching Algorithm of Smartphones.
Proceedings of the 14th IEEE International Conference on Advanced Infocomm Technology, 2022

Blockchain Empowered and Self-sovereign Access Control System.
Proceedings of the IEEE International Conference on Cloud Computing Technology and Science, 2022

Managing Digital Objects with Decentralised Identifiers based on NFT-like schema.
Proceedings of the IEEE International Conference on Cloud Computing Technology and Science, 2022

Blockchain-based Cross-organizational Workflow Platform.
Proceedings of the IEEE International Conference on Cloud Computing Technology and Science, 2022

NFT as a proof of Digital Ownership-reward system integrated to a Secure Distributed Computing Blockchain Framework.
Proceedings of the IEEE International Conference on Cloud Computing Technology and Science, 2022

2021
Towards General Deep Leakage in Federated Learning.
CoRR, 2021

Optimized Federated Learning on Class-Biased Distributed Data Sources.
Proceedings of the Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2021

DID-eFed: Facilitating Federated Learning as a Service with Decentralized Identities.
Proceedings of the EASE 2021: Evaluation and Assessment in Software Engineering, 2021

2018
Efficient Patch-Wise Semantic Segmentation for Large-Scale Remote Sensing Images.
Sensors, 2018

The RWTH Aachen University English-German and German-English Unsupervised Neural Machine Translation Systems for WMT 2018.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Improving Unsupervised Word-by-Word Translation with Language Model and Denoising Autoencoder.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018


  Loading...