Kai-Wei Chang

This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.

Known people with the same name:

Bibliography

2025
HoneyBee: Data Recipes for Vision-Language Reasoners.
CoRR, October, 2025

BILLY: Steering Large Language Models via Merging Persona Vectors for Creative Generation.
CoRR, October, 2025

Dynamic Generation of Multi-LLM Agents Communication Topologies with Graph Diffusion Models.
CoRR, October, 2025

Towards Unsupervised Speech Recognition at the Syllable-Level.
CoRR, October, 2025

Game-Time: Evaluating Temporal Dynamics in Spoken Language Models.
CoRR, September, 2025

Which Cultural Lens Do Models Adopt? On Cultural Positioning Bias and Agentic Mitigation in LLMs.
CoRR, September, 2025

OpenThoughts: Data Recipes for Reasoning Models.
CoRR, June, 2025

Designing AI Tools for Clinical Care Teams to Support Serious Illness Conversations with Older Adults in the Emergency Department.
CoRR, June, 2025

Visualized Text-to-Image Retrieval.
CoRR, May, 2025

LaViDa: A Large Diffusion Language Model for Multimodal Understanding.
CoRR, May, 2025

CompAlign: Improving Compositional Text-to-Image Generation with a Complex Benchmark and Fine-Grained Feedback.
CoRR, May, 2025

X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents.
CoRR, April, 2025

On The Landscape of Spoken Language Models: A Comprehensive Survey.
CoRR, April, 2025

Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries.
CoRR, February, 2025

Analyzing Mitigation Strategies for Catastrophic Forgetting in End-to-End Training of Spoken Language Models.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Controllable Generation via Locally Constrained Resampling.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

A Sparse Dynamic Programming Algorithm for Solving the Coding Sequence Design Problem.
Proceedings of the Computing and Combinatorics, 2025

Magnet: Multi-turn Tool-use Data Synthesis and Distillation via Graph Translation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

The Male CEO and the Female Assistant: Evaluation and Mitigation of Gender Biases in Text-To-Image Generation of Dual Subjects.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

White Men Lead, Black Women Help? Benchmarking and Mitigating Language Agency Social Biases in LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Tree-managed network ensembles for video prediction.
Mach. Vis. Appl., July, 2024

SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.
CoRR, 2024

Towards a Holistic Framework for Multimodal Large Language Models in Three-dimensional Brain CT Report Generation.
CoRR, 2024

Semantic Loss Functions for Neuro-Symbolic Structured Prediction.
CoRR, 2024

White Men Lead, Black Women Help: Uncovering Gender, Racial, and Intersectional Bias in Language Agency.
CoRR, 2024

LLMs in Biomedicine: A study on clinical Named Entity Recognition.
CoRR, 2024

Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought.
CoRR, 2024

Survey of Bias In Text-to-Image Generation: Definition, Evaluation, and Mitigation.
CoRR, 2024

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
CoRR, 2024

Towards audio language modeling - an overview.
CoRR, 2024

EMO-SUPERB: An In-depth Look at Speech Emotion Recognition.
CoRR, 2024

The Male CEO and the Female Assistant: Probing Gender Biases in Text-To-Image Models Through Paired Stereotype Test.
CoRR, 2024

Codec-Superb @ SLT 2024: A Lightweight Benchmark For Neural Audio Codec Models.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Open-Emotion: A Reproducible EMO-Superb For Speech Emotion Recognition Systems.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Exploring In-Context Learning of Textless Speech Language Model for Speech Classification Tasks.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension.
Proceedings of the Forty-first International Conference on Machine Learning, 2024


CoBIT: A Contrastive Bi-directional Image-Text Generation Model.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Dynamic-Superb: Towards a Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark For Speech.
Proceedings of the IEEE International Conference on Acoustics, 2024

The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MATHVERSE: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
Proceedings of the Computer Vision - ECCV 2024, 2024

The Hard Positive Truth About Vision-Language Compositionality.
Proceedings of the Computer Vision - ECCV 2024, 2024

Understanding and Mitigating Spurious Correlations in Text Classification with Neighborhood Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Empower Typed Descriptions by Large Language Models for Speech Emotion Recognition.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2024

Agent Lumos: Unified and Modular Training for Open-Source Language Agents.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

KPEval: Towards Fine-Grained Semantic-Based Keyphrase Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Codec-SUPERB: An In-Depth Analysis of Sound Codec Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Semantic Loss Functions for Neuro-Symbolic Structured Prediction.
Proceedings of the Compendium of Neurosymbolic Artificial Intelligence, 2023

Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty.
CoRR, 2023

Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs.
CoRR, 2023

An Exploration of In-Context Learning for Speech Language Model.
CoRR, 2023

Will the Prince Get True Love's Kiss? On the Model Sensitivity to Gender Perturbation over Fairytale Texts.
CoRR, 2023

SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts.
CoRR, 2023

Text encoders are performance bottlenecks in contrastive vision-language models.
CoRR, 2023

Understanding and Mitigating Spurious Correlations in Text Classification.
CoRR, 2023

Factoring the Matrix of Domination: A Critical Review and Reimagination of Intersectionality in AI Fairness.
CoRR, 2023

SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks.
CoRR, 2023

A Pseudo-Semantic Loss for Autoregressive Models with Logical Constraints.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

On the Paradox of Learning to Reason from Data.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

CleanCLIP: Mitigating Data Poisoning Attacks in Multimodal Contrastive Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Ensemble Knowledge Distillation of Self-Supervised Speech Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

What's "up" with vision-language models? Investigating their struggle with spatial reasoning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Text encoders bottleneck compositionality in contrastive vision-language models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Retrieval Enhanced Data Augmentation for Question Answering on Privacy Policies.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Summarize and Generate to Back-translate: Unsupervised Translation of Programming Languages.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

GIVL: Improving Geographical Inclusivity of Vision-Language Models with Pre-Training Methods.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Minisuperb: Lightweight Benchmark for Self-Supervised Speech Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Towards General-Purpose Text-Instruction-Guided Voice Conversion.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Prompting and Adapter Tuning For Self-Supervised Encoder-Decoder Speech Model.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Semantic Strengthening of Neuro-Symbolic Learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Efficient Shapley Values Estimation by Amortization for Text Classification.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

PIP: Parse-Instructed Prefix for Syntactically Controlled Paraphrase Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

MetaVL: Transferring In-Context Learning Ability From Language Models to Vision-Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Symbolic Chain-of-Thought Distillation: Small Models Can Also "Think" Step-by-Step.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

PLUE: Language Understanding Evaluation Benchmark for Privacy Policies in English.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

AVATAR: A Parallel Corpus for Java-Python Program Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
MAGIC: Mask-Guided Image Synthesis by Inverting a Quasi-Robust Classifier.
CoRR, 2022

Neuro-symbolic entropy regularization.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Superb @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Semantic Probabilistic Layers for Neuro-Symbolic Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

BERTHop: An Effective Vision-and-Language Model for Chest X-ray Disease Diagnosis.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Toward Degradation-Robust Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2022

GeoMLAMA: Geo-Diverse Commonsense Probing on Multilingual Pre-Trained Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Conditional Supervised Contrastive Learning for Fair Text Classification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
Integrating topic modeling and word embedding to characterize violent deaths.
CoRR, 2021

SpeechNet: A Universal Modularized Model for Speech Processing Tasks.
CoRR, 2021

Leveraging Unlabeled Data for Entity-Relation Extraction through Probabilistic Constraint Satisfaction.
CoRR, 2021

Pylon: A PyTorch Framework for Learning with Constraints.
Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021

Adapting Coreference Resolution for Processing Violent Death Narratives.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Evaluating the Values of Sources in Transfer Learning.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Disentangling Semantics and Syntax in Sentence Embeddings with Pre-trained Language Models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Unified Pre-training for Program Understanding and Generation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

An Integer Linear Programming Framework for Mining Constraints from Data.
Proceedings of the 38th International Conference on Machine Learning, 2021

BERTHop: An Effective Vision-and-Language Model for Chest X-ray Disease Diagnosis.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Retrieval Augmented Code Generation and Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Harms of Gender Exclusivity and Challenges in Non-Binary Representation in Language Technologies.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Syntax-augmented Multilingual BERT for Cross-lingual Transfer.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Intent Classification and Slot Filling for Privacy Policies.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Select, Extract and Generate: Neural Keyphrase Generation with Layer-wise Coverage Attention.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Select, Extract and Generate: Neural Keyphrase Generation with Syntactic Guidance.
CoRR, 2020

XAlgo: Explaining the Internal States of Algorithms via Question Answering.
CoRR, 2020

Generating Sports News from Live Commentary: A Chinese Dataset for Sports Game Summarization.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Dynamically Expanded CNN Array for Video Coding.
Proceedings of the ICIGP 2020: 3rd International Conference on Image and Graphics Processing, 2020

PolicyQA: A Reading Comprehension Dataset for Privacy Policies.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

SentiBERT: A Transferable Transformer-Based Architecture for Compositional Sentiment Semantics.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

On the Robustness of Language Encoders against Grammatical Errors.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

A Transformer-based Approach for Source Code Summarization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Learning Directional Sentence-Pair Embedding for Natural Language Reasoning (Student Abstract).
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Word and Sentence Embedding Tools to Measure Semantic Similarity of Gene Ontology Terms by Their Definitions.
J. Comput. Biol., 2019

Context Attentive Document Ranking and Query Suggestion.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Visualizing Trends of Key Roles in News Articles.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018
Analysis of the Sensing Properties of a Highly Stable and Reproducible Ozone Gas Sensor Based on Amorphous In-Ga-Zn-O Thin Film.
Sensors, 2018

Building a Robust Text Classifier on a Test-Time Budget.
CoRR, 2018

A Cloud-based Protection approach against JavaScript-based attacks to browsers.
Comput. Electr. Eng., 2018

Handover: A mechanism to improve the reliability and availability of network services for clients behind a network address translator.
Comput. Electr. Eng., 2018

Intent-aware Query Obfuscation for Privacy Protection in Personalized Web Search.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

A Corpus of Drug Usage Guidelines Annotated with Type of Advice.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

A Corpus to Learn Refer-to-as Relations for Nominals.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Counterexamples for Robotic Planning Explained in Structured Language.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Multi-Task Learning for Document Ranking and Query Suggestion.
Proceedings of the 6th International Conference on Learning Representations, 2018

Generating Natural Language Adversarial Examples.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Building Language Models for Text with Named Entities.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2016
A Credit Assignment Compiler for Joint Prediction.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Learning from Explicit and Implicit Supervision Jointly For Algebra Word Problems.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015
Selective algorithms for large-scale classification and structured learning
PhD thesis, 2015

Learning to Search for Dependencies.
CoRR, 2015

Hands-on Learning to Search for Structured Prediction.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Learning to Search Better than Your Teacher.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Real-time data compression for thermal-controlled three-dimensional DRAM systems.
Proceedings of the IEEE 4th Global Conference on Consumer Electronics, 2015

2014
Typed Tensor Decomposition of Knowledge Bases for Relation Extraction.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

2013
Multi-Relational Latent Semantic Analysis.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013


  Loading...