Christopher Parisien

According to our database1, Christopher Parisien authored at least 22 papers between 2008 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Training a General Purpose Automated Red Teaming Model.
CoRR, April, 2026

2025
Pluralistic Behavior Suite: Stress-Testing Multi-Turn Adherence to Custom Behavioral Policies.
CoRR, November, 2025

SafeSteer: Interpretable Safety Steering with Refusal-Evasion in LLMs.
CoRR, June, 2025

AEGIS2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Safety Through Reasoning: An Empirical Study of Reasoning Guardrail Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

A Simple Yet Effective Method for Non-Refusing Context Relevant Fine-grained Safety Steering in LLMs.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Guardrails and Security for LLMs: Safe, Secure and Controllable Steering of LLM Applications.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 5: Tutorial Abstracts), 2025

2024
Towards Inference-time Category-wise Safety Steering for Large Language Models.
CoRR, 2024

AEGIS: Online Adaptive AI Content Safety Moderation with Ensemble of LLM Experts.
CoRR, 2024

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues.
CoRR, 2024

Unsupervised Extraction of Dialogue Policies from Conversations.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
NeMo Guardrails: A Toolkit for Controllable and Safe LLM Applications with Programmable Rails.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
A large language model for electronic health records.
npj Digit. Medicine, 2022

Prompt Learning for Domain Adaptation in Task-Oriented Dialogue.
CoRR, 2022

GatorTron: A Large Clinical Language Model to Unlock Patient Information from Unstructured Electronic Health Records.
CoRR, 2022

2012
Hierarchical Bayesian Models of Verb Learning in Children.
PhD thesis, 2012

2011
Generalizing between form and meaning using learned verb classes.
Proceedings of the 33th Annual Meeting of the Cognitive Science Society, 2011

Incorporating Coercive Constructions into a Verb Lexicon.
Proceedings of the ACL 2011 Workshop on Relational Models of Semantics, 2011

2008
Solving the Problem of Negative Synaptic Weights in Cortical Models.
Neural Comput., 2008

Robosemantics: How Stanley the Volkswagen Represents the World.
Minds Mach., 2008

An Incremental Bayesian Model for Learning Syntactic Categories.
Proceedings of the Twelfth Conference on Computational Natural Language Learning, 2008


  Loading...