Antoine Bosselut

Orcid: 0000-0001-8968-9649

According to our database1, Antoine Bosselut authored at least 130 papers between 2015 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Diversity Matters: Revisiting Test-Time Compute in Vision-Language Models.
CoRR, May, 2026

Do LLMs Game Formalization? Evaluating Faithfulness in Logical Reasoning.
CoRR, April, 2026

An Engineering Journey Training Large Language Models at Scale on Alps: The Apertus Experience.
CoRR, April, 2026

Large Language Models Align with the Human Brain during Creative Thinking.
CoRR, April, 2026

CresOWLve: Benchmarking Creative Problem-Solving Over Real-World Knowledge.
CoRR, April, 2026

AI Meets Mathematics Education: A Case Study on Supporting an Instructor in a Large Mathematics Class with Context-Aware AI.
CoRR, March, 2026

Brittlebench: Quantifying LLM robustness via prompt sensitivity.
CoRR, March, 2026

Helpful to a Fault: Measuring Illicit Assistance in Multi-Turn, Multilingual LLM Agents.
CoRR, February, 2026

Buy versus Build an LLM: A Decision Framework for Governments.
CoRR, February, 2026

ConLID: Supervised Contrastive Learning for Low-Resource Language Identification.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

Tracking the Limits of Knowledge Propagation: How LLMs Fail at Multi-Step Reasoning with Conflicting Knowledge.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

DRIVINGVQA: A Dataset for Interleaved Visual Chain-of-Thought in Real-World Driving Scenarios.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026

AI meets Mathematics Education: Supporting Instructors in Large Mathematics Classes with Context-Aware AI.
Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems, 2026

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
Measuring what Matters: Construct Validity in Large Language Model Benchmarks.
CoRR, November, 2025

Revisiting Multilingual Data Mixtures in Language Model Pretraining.
CoRR, October, 2025

GaLLoP: Gradient-based Sparse Learning on Low-Magnitude Parameters.
CoRR, October, 2025

PERK: Long-Context Reasoning as Parameter-Efficient Test-Time Learning.
CoRR, July, 2025

ConLID: Supervised Contrastive Learning for Low-Resource Language Identification.
CoRR, June, 2025

Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization.
CoRR, June, 2025

AbstRaL: Augmenting LLMs' Reasoning by Reinforcing Abstract Thinking.
CoRR, June, 2025

Positional Fragility in LLMs: How Offset Effects Reshape Our Understanding of Memorization Risks.
CoRR, May, 2025

Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs.
CoRR, April, 2025

Enhancing Procedural Writing Through Personalized Example Retrieval: A Case Study on Cooking Recipes.
Int. J. Artif. Intell. Educ., March, 2025

DRIVINGVQA: Analyzing Visual Chain-of-Thought Reasoning of Vision Language Models in Real-World Scenarios with Driving Theory Tests.
CoRR, January, 2025

Positional Fragility in LLMs: How Offset Effects Reshape Our Understanding of Memorization Risks.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

For Better or for Worse, Transformers Seek Patterns for Memorization.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

A Logical Fallacy-Informed Framework for Argument Generation.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

PICLe: Pseudo-annotations for In-Context Learning in Low-Resource Named Entity Detection.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Evaluating Morphological Compositional Generalization in Large Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

The LLM Language Network: A Neuroscientific Approach for Identifying Causally Task-Relevant Units.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025


GeoExplorer: Active Geo-Localization with Curiosity-Driven Exploration.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Reliable Evaluation and Benchmarks for Statement Autoformalization.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

RLMEval: Evaluating Research-Level Neural Theorem Proving.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Creative Preference Optimization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

CAVE : Detecting and Explaining Commonsense Anomalies in Visual Environments.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

From Language to Cognition: How LLMs Outgrow the Human Language Network.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

VinaBench: Benchmark for Faithful and Consistent Visual Narratives.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Efficient Tool Use with Chain-of-Abstraction Reasoning.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Challenges for AI in Multimodal STEM Assessments: a Human-AI Comparison.
Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications, 2025

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation.
CoRR, 2024

Creativity in AI: Progresses and Challenges.
CoRR, 2024

LLMs Are In-Context Reinforcement Learners.
CoRR, 2024

Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2024

Brain-Like Language Processing via a Shallow Untrained Multihead Attention Network.
CoRR, 2024

ComperDial: Commonsense Persona-grounded Dialogue Dataset and Benchmark.
CoRR, 2024

Improving Autoformalization using Type Checking.
CoRR, 2024

Rethinking Skill Extraction in the Job Market Domain using Large Language Models.
CoRR, 2024

JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching.
CoRR, 2024

Evaluating Language Model Agency through Negotiations.
CoRR, 2024

δ-CAUSAL: Exploring Defeasibility in Causal Reasoning.
CoRR, 2024

Course Recommender Systems Need to Consider the Job Market.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Training Visual Language Models with Object Detection: Grounded Change Descriptions in Satellite Images.
Proceedings of the IGARSS 2024, 2024

Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

"Flex Tape Can't Fix That": Bias and Misinformation in Edited Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Let Me Teach You: Pedagogical Foundations of Feedback for Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Discovering Knowledge-Critical Subnetworks in Pretrained Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

ConGeo: Robust Cross-View Geo-Localization Across Ground View Variations.
Proceedings of the Computer Vision - ECCV 2024, 2024

REFINER: Reasoning Feedback on Intermediate Representations.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024


EPFL-MAKE at "Discharge Me!": An LLM System for Automatically Generating Discharge Summaries of Clinical Electronic Health Record.
Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, 2024

DiffuCOMET: Contextual Commonsense Knowledge Diffusion.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Complex Reasoning over Logical Queries on Commonsense Knowledge Graphs.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Exploring Defeasibility in Causal Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

ConVQG: Contrastive Visual Question Generation with Multimodal Guidance.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Instruction-tuning Aligns LLMs to the Human Brain.
CoRR, 2023

MEDITRON-70B: Scaling Medical Pretraining for Large Language Models.
CoRR, 2023

RECKONING: Reasoning through Dynamic Knowledge Encoding.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

CAR: Conceptualization-Augmented Reasoner for Zero-Shot Commonsense Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

CRAB: Assessing the Strength of Causal Relationships Between Real-world Events.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Breaking the Language Barrier: Improving Cross-Lingual Reasoning with Structured Self-Attention.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

kogito: A Commonsense Knowledge Inference Toolkit.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. EACL 2023, 2023

PeaCoK: Persona Commonsense Knowledge for Consistent and Engaging Narratives.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Mitigating Label Biases for In-context Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

DISCO: Distilling Counterfactuals with Large Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
End-to-End Task-Oriented Dialog Modeling With Semi-Structured Knowledge Management.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

DISCO: Distilling Phrasal Counterfactuals with Large Language Models.
CoRR, 2022

GreaseLM: Graph REASoning Enhanced Language Models for Question Answering.
CoRR, 2022

Deep Bidirectional Language-Knowledge Graph Pretraining.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Memory-Based Model Editing at Scale.
Proceedings of the International Conference on Machine Learning, 2022

Fast Model Editing at Scale.
Proceedings of the Tenth International Conference on Learning Representations, 2022

GreaseLM: Graph REASoning Enhanced Language Models.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Conditional set generation using Seq2seq models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

ComFact: A Benchmark for Linking Contextual Commonsense Knowledge.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Discovering Language-neutral Sub-networks in Multilingual Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Synthetic Disinformation Attacks on Automated Fact Verification Systems.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
On the Opportunities and Risks of Foundation Models.
CoRR, 2021

The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics.
CoRR, 2021

On-the-Fly Attention Modularization for Neural Generation.
CoRR, 2021

Understanding Few-Shot Commonsense Knowledge Models.
CoRR, 2021

QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

"I'm Not Mad": Commonsense Implications of Negation and Contradiction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Conversational Multi-Hop Reasoning with Neural Commonsense Knowledge and Symbolic Logic Rules.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Discourse Understanding and Factual Consistency in Abstractive Summarization.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Analyzing Commonsense Emergence in Few-shot Knowledge Models.
Proceedings of the 3rd Conference on Automated Knowledge Base Construction, 2021

On-the-Fly Attention Modulation for Neural Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Edited Media Understanding Frames: Reasoning About the Intent and Implications of Visual Misinformation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

(Comet-) Atomic 2020: On Symbolic and Neural Commonsense Knowledge Graphs.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Dynamic Neuro-Symbolic Knowledge Graph Construction for Zero-shot Commonsense Question Answering.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Understanding Natural Language with Commonsense Knowledge Representation, Reasoning, and Simulation.
PhD thesis, 2020

Edited Media Understanding: Reasoning About Implications of Manipulated Images.
CoRR, 2020

Back to the Future: Unsupervised Backprop-based Decoding for Counterfactual and Abductive Commonsense Reasoning.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

The Amazing World of Neural Language Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Tutorial Abstracts, 2020

Procedural Reading Comprehension with Attribute-Aware Context Flow.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

Commonsense Reasoning for Natural Language Processing.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2020

Commonsense Knowledge Base Completion with Structural and Semantic Context.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Dynamic Knowledge Graph Construction for Zero-shot Commonsense Question Answering.
CoRR, 2019

Exploiting Structural and Semantic Context for Commonsense Knowledge Base Completion.
CoRR, 2019

Cooperative Generator-Discriminator Networks for Abstractive Summarization with Narrative Flow.
CoRR, 2019

Efficient Adaptation of Pretrained Transformers for Abstractive Summarization.
CoRR, 2019

Be Consistent! Improving Procedural Text Comprehension using Label Consistency.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

WIQA: A dataset for "What if..." reasoning over procedural text.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Counterfactual Story Reasoning and Generation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Everything Happens for a Reason: Discovering the Purpose of Actions in Procedural Text.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

COMET: Commonsense Transformers for Automatic Knowledge Graph Construction.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Deep Communicating Agents for Abstractive Summarization.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Discourse-Aware Neural Rewards for Coherent Text Generation.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Simulating Action Dynamics with Neural Process Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

Reasoning about Actions and State Changes by Injecting Commonsense Knowledge.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Modeling Naive Psychology of Characters in Simple Commonsense Stories.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Learning to Write with Cooperative Discriminators.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2016
Learning Prototypical Event Structure from Photo Albums.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Thermo-mechanical effects in drilling using metal working fluids and cryogenic cooling and their impact in tool performance.
Prod. Eng., 2015


  Loading...