Liwei Jiang

This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.

Bibliography

2026
Active Manifolds, Stratifications, and Convergence to Local Minima in Nonsmooth Optimization.
Found. Comput. Math., April, 2026

Useless but Safe? Benchmarking Utility Recovery with User Intent Clarification in Multi-Turn Conversations.
CoRR, April, 2026

ODESteer: A Unified ODE-Based Steering Framework for LLM Alignment.
CoRR, February, 2026

2025
Pluralistic Behavior Suite: Stress-Testing Multi-Turn Adherence to Custom Behavioral Policies.
CoRR, November, 2025

Artificial Hivemind: The Open-Ended Homogeneity of Language Models (and Beyond).
CoRR, October, 2025

Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability.
CoRR, October, 2025

Preconditioned subgradient method for composite optimization: overparameterization and fast convergence.
CoRR, September, 2025

A Local Nearly Linearly Convergent First-Order Method for Nonsmooth Functions with Quadratic Growth.
Found. Comput. Math., June, 2025

Chasing Moving Targets with Online Self-Play Reinforcement Learning for Safer Language Models.
CoRR, June, 2025

AI Debate Aids Assessment of Controversial Claims.
CoRR, June, 2025

X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents.
CoRR, April, 2025

PolyGuard: A Multilingual Safety Moderation Tool for 17 Languages.
CoRR, April, 2025

Political Neutrality in AI is Impossible- But Here is How to Approximate it.
CoRR, March, 2025

Enhancement of Electron-Spin Polarization Using Magneto-Optical Synchronous Modulation.
IEEE Trans. Instrum. Meas., 2025

Investigating machine moral judgement through the Delphi experiment.
Nat. Mac. Intell., 2025

SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Position: Political Neutrality in AI Is Impossible - But Here Is How to Approximate It.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

A Validation Approach to Over-parameterized Matrix and Image Recovery.
Proceedings of the Conference on Parsimony and Learning, 2025

Online Covariance Estimation in Nonsmooth Stochastic Approximation.
Proceedings of the Thirty Eighth Annual Conference on Learning Theory, 2025

Guardrails and Security for LLMs: Safe, Secure and Controllable Steering of LLM Applications.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 5: Tutorial Abstracts), 2025

Can Language Models Reason about Individualistic Human Values and Preferences?
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

CulturalBench: A Robust, Diverse and Challenging Benchmark for Measuring LMs' Cultural Knowledge Through Human-AI Red-Teaming.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

To Err Is AI: A Case Study Informing LLM Flaw Reporting Practices.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Suppression of Heading Error in Bell-Bloom Atomic Magnetometer by Controlling RF Magnetic Field.
IEEE Trans. Instrum. Meas., 2024

In Situ Measurement of High-Frequency Magnetic Field Produced by Electric Heaters in Atomic Sensors.
IEEE Trans. Instrum. Meas., 2024

SafetyAnalyst: Interpretable, transparent, and steerable LLM safety moderation.
CoRR, 2024

CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs.
CoRR, 2024

Gradient descent with adaptive stepsize converges (nearly) linearly under fourth-order growth.
CoRR, 2024

HAICOSYSTEM: An Ecosystem for Sandboxing Safety Risks in Human-AI Interactions.
CoRR, 2024

WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries.
CoRR, 2024

CULTURE-GEN: Revealing Global Cultural Perception in Language Models through Natural Language Prompting.
CoRR, 2024

CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge.
CoRR, 2024

Information-Theoretic Distillation for Reference-less Summarization.
CoRR, 2024

A Roadmap to Pluralistic Alignment.
CoRR, 2024

Ridged Structure of Coaxial Cavity for Raising the Multipactor Threshold of Microwave Switch.
IEEE Access, 2024

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

WildGuard: Open One-stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Impossible Distillation for Paraphrasing and Summarization: How to Make High-quality Lemonade out of Small, Low-quality Model.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

JAMDEC: Unsupervised Authorship Obfuscation using Constrained Decoding over Small Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Position: A Roadmap to Pluralistic Alignment.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

The Generative AI Paradox: "What It Can Create, It May Not Understand".
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MigraineTracker: Examining Patient Experiences with Goal-Directed Self-Tracking for a Chronic Health Condition.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024

Particip-AI: A Democratic Surveying Framework for Anticipating Future AI Use Cases, Harms and Benefits.
Proceedings of the Seventh AAAI/ACM Conference on AI, Ethics, and Society (AIES-24) - Full Archival Papers, October 21-23, 2024, San Jose, California, USA, 2024

Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Algorithmic Regularization in Model-Free Overparametrized Asymmetric Matrix Factorization.
SIAM J. Math. Data Sci., September, 2023

Comprehensive Structural Parameter Optimal Design of Electric Heaters With Magnetic Field Suppression for Atomic Sensors.
IEEE Trans. Instrum. Meas., 2023

Accurate Determination of Spin-Exchange Relaxation in a Self-Compensated Atomic Comagnetometer.
IEEE Trans. Instrum. Meas., 2023

Impossible Distillation: from Low-Quality Model to High-Quality Dataset & Model for Summarization and Paraphrasing.
CoRR, 2023

Faith and Fate: Limits of Transformers on Compositionality.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

BiasX: "Thinking Slow" in Toxic Content Moderation with Explanations of Implied Social Biases.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

What Makes it Ok to Set a Fire? Iterative Self-distillation of Contexts and Rationales for Disambiguating Defeasible Social and Moral Situations.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Commonsense Norms.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

ClarifyDelphi: Reinforced Clarification Questions with Defeasibility Rewards for Social and Moral Situations.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Sensitivity of Green-Up Date to Meteorological Indicators in Hulun Buir Grasslands of China.
Remote. Sens., 2022

SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization.
CoRR, 2022

Reinforced Clarification Question Generation with Defeasibility Rewards for Disambiguating Social and Moral Situations.
CoRR, 2022

QUARK: Controllable Text Generation with Reinforced Unlearning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Symbolic Knowledge Distillation: from General Language Models to Commonsense Models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Aligning to Social Norms and Values in Interactive Narratives.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

ProsocialDialog: A Prosocial Backbone for Conversational Agents.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
Examination of Spin-Exchange Relaxation in the Alkali Metal-Noble Gas Comagnetometer With a Large Electron Magnetic Field.
IEEE Trans. Instrum. Meas., 2021

Delphi: Towards Machine Ethics and Norms.
CoRR, 2021

Subgradient methods near active manifolds: saddle point avoidance, local convergence, and asymptotic normality.
CoRR, 2021

Estimation of muscle activation during ankle rehabilitation.
Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2021

Configuration Synthesis and Structure Design of a Reconfigurable Robot for Muscle Strength Training.
Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2021

Rank Overspecified Robust Matrix Recovery: Subgradient Method and Exact Recovery.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

"I'm Not Mad": Commonsense Implications of Negation and Contradiction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

EnglishBot: An AI-Powered Conversational System for Second Language Learning.
Proceedings of the IUI '21: 26th International Conference on Intelligent User Interfaces, 2021

2020
A Fast Measurement for Relaxation Rates and Fermi-Contact Fields in Spin-Exchange Relaxation-Free Comagnetometers.
IEEE Trans. Instrum. Meas., 2020

Radiology, Mobile Devices, and Internet of Things (IoT).
J. Digit. Imaging, 2020

Improving Positive Unlabeled Learning: Practical AUL Estimation and New Training Method for Extremely Imbalanced Data Sets.
CoRR, 2020

2019
BookBuddy: Turning Digital Materials Into Interactive Foreign Language Lessons Through a Voice Chatbot.
Proceedings of the Sixth ACM Conference on Learning @ Scale, 2019

QuizBot: A Dialogue-based Adaptive Learning System for Factual Knowledge.
Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, 2019

2018
Hypercube-Based Crowding Differential Evolution with Neighborhood Mutation for Multimodal Optimization.
Int. J. Swarm Intell. Res., 2018


  Loading...