Liwei Jiang
This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.
Bibliography
2026
Active Manifolds, Stratifications, and Convergence to Local Minima in Nonsmooth Optimization.
Found. Comput. Math., April, 2026
Useless but Safe? Benchmarking Utility Recovery with User Intent Clarification in Multi-Turn Conversations.
CoRR, April, 2026
CoRR, February, 2026
2025
Pluralistic Behavior Suite: Stress-Testing Multi-Turn Adherence to Custom Behavioral Policies.
CoRR, November, 2025
CoRR, October, 2025
Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability.
CoRR, October, 2025
Preconditioned subgradient method for composite optimization: overparameterization and fast convergence.
CoRR, September, 2025
A Local Nearly Linearly Convergent First-Order Method for Nonsmooth Functions with Quadratic Growth.
Found. Comput. Math., June, 2025
Chasing Moving Targets with Online Self-Play Reinforcement Learning for Safer Language Models.
CoRR, June, 2025
CoRR, April, 2025
CoRR, March, 2025
Enhancement of Electron-Spin Polarization Using Magneto-Optical Synchronous Modulation.
IEEE Trans. Instrum. Meas., 2025
Nat. Mac. Intell., 2025
SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior.
Proceedings of the Forty-second International Conference on Machine Learning, 2025
Position: Political Neutrality in AI Is Impossible - But Here Is How to Approximate It.
Proceedings of the Forty-second International Conference on Machine Learning, 2025
AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Conference on Parsimony and Learning, 2025
Proceedings of the Thirty Eighth Annual Conference on Learning Theory, 2025
Guardrails and Security for LLMs: Safe, Secure and Controllable Steering of LLM Applications.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 5: Tutorial Abstracts), 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
CulturalBench: A Robust, Diverse and Challenging Benchmark for Measuring LMs' Cultural Knowledge Through Human-AI Red-Teaming.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025
2024
Suppression of Heading Error in Bell-Bloom Atomic Magnetometer by Controlling RF Magnetic Field.
IEEE Trans. Instrum. Meas., 2024
In Situ Measurement of High-Frequency Magnetic Field Produced by Electric Heaters in Atomic Sensors.
IEEE Trans. Instrum. Meas., 2024
CoRR, 2024
CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs.
CoRR, 2024
Gradient descent with adaptive stepsize converges (nearly) linearly under fourth-order growth.
CoRR, 2024
CoRR, 2024
WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries.
CoRR, 2024
CULTURE-GEN: Revealing Global Cultural Perception in Language Models through Natural Language Prompting.
CoRR, 2024
CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge.
CoRR, 2024
Ridged Structure of Coaxial Cavity for Raising the Multipactor Threshold of Microwave Switch.
IEEE Access, 2024
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
WildGuard: Open One-stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
Impossible Distillation for Paraphrasing and Summarization: How to Make High-quality Lemonade out of Small, Low-quality Model.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
JAMDEC: Unsupervised Authorship Obfuscation using Constrained Decoding over Small Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
MigraineTracker: Examining Patient Experiences with Goal-Directed Self-Tracking for a Chronic Health Condition.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024
Particip-AI: A Democratic Surveying Framework for Anticipating Future AI Use Cases, Harms and Benefits.
Proceedings of the Seventh AAAI/ACM Conference on AI, Ethics, and Society (AIES-24) - Full Archival Papers, October 21-23, 2024, San Jose, California, USA, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Algorithmic Regularization in Model-Free Overparametrized Asymmetric Matrix Factorization.
SIAM J. Math. Data Sci., September, 2023
Comprehensive Structural Parameter Optimal Design of Electric Heaters With Magnetic Field Suppression for Atomic Sensors.
IEEE Trans. Instrum. Meas., 2023
Accurate Determination of Spin-Exchange Relaxation in a Self-Compensated Atomic Comagnetometer.
IEEE Trans. Instrum. Meas., 2023
Impossible Distillation: from Low-Quality Model to High-Quality Dataset & Model for Summarization and Paraphrasing.
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
BiasX: "Thinking Slow" in Toxic Content Moderation with Explanations of Implied Social Biases.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
What Makes it Ok to Set a Fire? Iterative Self-distillation of Contexts and Rationales for Disambiguating Defeasible Social and Moral Situations.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Commonsense Norms.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
ClarifyDelphi: Reinforced Clarification Questions with Defeasibility Rewards for Social and Moral Situations.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
Sensitivity of Green-Up Date to Meteorological Indicators in Hulun Buir Grasslands of China.
Remote. Sens., 2022
CoRR, 2022
Reinforced Clarification Question Generation with Defeasibility Rewards for Disambiguating Social and Moral Situations.
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
2021
Examination of Spin-Exchange Relaxation in the Alkali Metal-Noble Gas Comagnetometer With a Large Electron Magnetic Field.
IEEE Trans. Instrum. Meas., 2021
Subgradient methods near active manifolds: saddle point avoidance, local convergence, and asymptotic normality.
CoRR, 2021
Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2021
Configuration Synthesis and Structure Design of a Reconfigurable Robot for Muscle Strength Training.
Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Proceedings of the IUI '21: 26th International Conference on Intelligent User Interfaces, 2021
2020
A Fast Measurement for Relaxation Rates and Fermi-Contact Fields in Spin-Exchange Relaxation-Free Comagnetometers.
IEEE Trans. Instrum. Meas., 2020
Improving Positive Unlabeled Learning: Practical AUL Estimation and New Training Method for Extremely Imbalanced Data Sets.
CoRR, 2020
2019
BookBuddy: Turning Digital Materials Into Interactive Foreign Language Lessons Through a Voice Chatbot.
Proceedings of the Sixth ACM Conference on Learning @ Scale, 2019
Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, 2019
2018
Hypercube-Based Crowding Differential Evolution with Neighborhood Mutation for Multimodal Optimization.
Int. J. Swarm Intell. Res., 2018