Kellin Pelrine

According to our database, Kellin Pelrine authored at least 32 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Jailbreak-Tuning: Models Efficiently Learn Jailbreak Susceptibility.
CoRR, July, 2025

Veracity: An Open-Source AI Fact-Checking System.
CoRR, June, 2025

It's the Thought that Counts: Evaluating the Attempts of Frontier LLMs to Persuade on Harmful Topics.
CoRR, June, 2025

Accidental Misalignment: Fine-Tuning Language Models Induces Unexpected Vulnerability.
CoRR, May, 2025

From Intuition to Understanding: Using AI Peers to Overcome Physics Misconceptions.
CoRR, April, 2025

Online Influence Campaigns: Strategies and Vulnerabilities.
CoRR, January, 2025

The Structural Safety Generalization Problem.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Can Go AIs Be Adversarially Robust?
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Scaling Trends for Data Poisoning in LLMs.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Epistemic Integrity in Large Language Models.
CoRR, 2024

A Guide to Misinformation Detection Datasets.
CoRR, 2024

A Simulation System Towards Solving Societal-Scale Manipulation.
CoRR, 2024

Emerging Vulnerabilities in Frontier Models: Multi-Turn Jailbreak Attacks.
CoRR, 2024

Web Retrieval Agents for Evidence-Based Misinformation Detection.
CoRR, 2024

Scaling Laws for Data Poisoning in LLMs.
CoRR, 2024

Regional and Temporal Patterns of Partisan Polarization during the COVID-19 Pandemic in the United States and Canada.
CoRR, 2024

Combining Confidence Elicitation and Sample-based Methods for Uncertainty Quantification in Misinformation Mitigation.
CoRR, 2024

Comparing GPT-4 and Open-Source Language Models in Misinformation Mitigation.
CoRR, 2024

Uncertainty Resolution in Misinformation Detection.
CoRR, 2024

Party Prediction for Twitter.
Proceedings of the Eighteenth International AAAI Conference on Web and Social Media, 2024

2023
Exploiting Novel GPT-4 APIs.
CoRR, 2023

Open, Closed, or Small Language Models for Text Classification?
CoRR, 2023

Adversarial Policies Beat Superhuman Go AIs.
Proceedings of the International Conference on Machine Learning, 2023

Towards Reliable Misinformation Mitigation: Generalization, Uncertainty, and GPT-4.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

SWEET - Weakly Supervised Person Name Extraction for Fighting Human Trafficking.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Better Bridges Between Model and Real World.
Proceedings of the 36th Canadian Conference on Artificial Intelligence, 2023

2022
Towards Better Evaluation for Dynamic Link Prediction.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Active Keyword Selection to Track Evolving Topics on Twitter.
Proceedings of the IEEE International Conference on Data Mining Workshops, 2022

Extracting Person Names from User Generated Text: Named-Entity Recognition for Combating Human Trafficking.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
The Surprising Performance of Simple Baselines for Misinformation Detection.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Online Partisan Polarization of COVID-19.
Proceedings of the 2021 International Conference on Data Mining, 2021

2020
ComplexDataLab at W-NUT 2020 Task 2: Detecting Informative COVID-19 Tweets by Attending over Linked Documents.
Proceedings of the Sixth Workshop on Noisy User-generated Text, 2020

