William Chen

This is a disambiguation page: it lists multiple papers by different people with the same or a similar name.

Bibliography

2025
CAST: Counterfactual Labels Improve Instruction Following in Vision-Language-Action Models.
CoRR, August, 2025

Addressing the ML Domain Adaptation Problem for Networking: Realistic and Controllable Training Data Generation with NetReplica.
CoRR, July, 2025

OpusLM: A Family of Open Unified Speech Language Models.
CoRR, June, 2025

OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning.
CoRR, June, 2025

VulBinLLM: LLM-powered Vulnerability Detection for Stripped Binaries.
CoRR, May, 2025

Training Strategies for Efficient Embodied Reasoning.
CoRR, May, 2025

Measuring General Intelligence with Generated Games.
CoRR, May, 2025

ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems.
CoRR, March, 2025

ESPnet-SpeechLM: An Open Speech Language Model Toolkit.
CoRR, February, 2025

OWLS: Scaling Laws for Multilingual Speech Recognition and Translation Models.
CoRR, February, 2025

Vision-Language Models Provide Promptable Representations for Reinforcement Learning.
Trans. Mach. Learn. Res., 2025

Proactive Privacy Amnesia for Large Language Models: Safeguarding PII with Negligible Impact on Model Utility.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Bridging Speech and Text Foundation Models with ReShape Attention.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Indoor and Outdoor 3D Scene Graph Generation Via Language-Enabled Spatial Ontologies.
IEEE Robotics Autom. Lett., June, 2024

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.
CoRR, 2024

Findings of the IWSLT 2024 Evaluation Campaign.
CoRR, 2024

CMU's IWSLT 2024 Simultaneous Speech Translation System.
CoRR, 2024

Nollywood: Let's Go to the Movies!
CoRR, 2024

AugSumm: towards generalizable speech summarization using synthetic labels from large language model.
CoRR, 2024

ESPnet-EZ: Python-Only ESPnet For Easy Fine-Tuning And Integration.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs For Audio, Music, and Speech.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Floras 50: A Massively Multilingual Multitask Benchmark for Long-Form Conversational Speech.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

EFFUSE: Efficient Self-Supervised Feature Fusion for E2E ASR in Low Resource and Multilingual Scenarios.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

AugSumm: Towards Generalizable Speech Summarization Using Synthetic Labels from Large Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

Train Long and Test Long: Leveraging Full Document Contexts in Speech Processing.
Proceedings of the IEEE International Conference on Acoustics, 2024

Towards Robust Speech Representation Learning for Thousands of Languages.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Robotic Control via Embodied Chain-of-Thought Reasoning.
Proceedings of the Conference on Robot Learning, 6-9 November 2024, Munich, Germany, 2024

Evaluating Self-Supervised Speech Representations for Indigenous American Languages.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

On the Evaluation of Speech Foundation Models for Spoken Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond.
CoRR, 2023

EFFUSE: Efficient Self-Supervised Feature Fusion for E2E ASR in Multilingual and Low Resource Scenarios.
CoRR, 2023

LaMPP: Language Models as Probabilistic Priors for Perception and Action.
CoRR, 2023

CMU's IWSLT 2023 Simultaneous Speech Translation System.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

QUESPA Submission for the IWSLT 2023 Dialect and Low-resource Speech Translation Tasks.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ML-SUPERB: Multilingual Speech Universal PERformance Benchmark.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Improving Massively Multilingual ASR with Auxiliary CTC Objectives.
Proceedings of the IEEE International Conference on Acoustics, 2023

Poster: Mujaz: A Summarization-based Approach for Normalized Vulnerability Description.
Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications Security, 2023

Findings of the 2023 ML-Superb Challenge: Pre-Training And Evaluation Over More Languages And Beyond.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Yodas: Youtube-Oriented Dataset for Audio and Speech.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Summarize While Translating: Universal Model With Parallel Decoding for Summarization and Translation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Joint Prediction and Denoising for Large-Scale Multilingual Self-Supervised Learning.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Leveraging Large Language Models for Robot 3D Scene Understanding.
CoRR, 2022

Extracting Zero-shot Common Sense from Large Language Models for Robot 3D Scene Understanding.
CoRR, 2022

The Colour of Horror.
Proceedings of the European Conference on Visual Media Production, 2022

Benchmarking Azerbaijani Neural Machine Translation.
Proceedings of the ALTNLP The International Conference and workshop on Agglutinative Language Technologies as a challenge of Natural Language Processing, 2022

2021
Genetic Algorithms For Extractive Summarization.
CoRR, 2021

In silico model for miRNA-mediated regulatory network in cancer.
Briefings Bioinform., 2021

Analysis of Negative Electricity Price to Identify Demand Management Opportunity for Consumers in Renewable-rich Power Systems.
Proceedings of the 2021 IEEE PES Innovative Smart Grid Technologies, 2021

Longitudinal Data of Cancer Patients with Prior Mental Health Diagnoses Show Differences in Demographics, Emergency Visits, and Suicidality Rates.
Proceedings of the AMIA 2021, American Medical Informatics Association Annual Symposium, San Diego, CA, USA, October 30, 2021, 2021

2020
Weakly Supervised Deep Learning for Segmentation of Remote Sensing Imagery.
Remote. Sens., 2020

Audrey: A Personalized Open-Domain Conversational Bot.
CoRR, 2020

An automatic approach to establish clinically desired final dental occlusion for one-piece maxillary orthognathic surgery.
Int. J. Comput. Assist. Radiol. Surg., 2020

2018
Some Results on Tight Stationarity, University of California, Los Angeles, USA, 2016. Supervised by Itay Neeman.
Bull. Symb. Log., 2018

2017
Synergistic drug combinations from electronic health records and gene expression.
J. Am. Medical Informatics Assoc., 2017

2015
Tight stationarity and tree-like scales.
Ann. Pure Appl. Log., 2015

Square principles with tail-end agreement.
Arch. Math. Log., 2015

2014
Osiris: accessible and reproducible phylogenetic and phylogenomic analyses within the Galaxy workflow management system.
BMC Bioinform., 2014

Analyzing Abstract Factory and Strategy Design Patterns using Design Structure Matrix: A Role-Playing Game Case.
Proceedings of the Intelligent Systems and Applications, 2014

Minimizing expected loss for risk-avoiding reinforcement learning.
Proceedings of the International Conference on Data Science and Advanced Analytics, 2014

2010
An analogue of the Gallai-Edmonds Structure Theorem for non-zero roots of the matching polynomial.
J. Comb. Theory B, 2010

2006
Visualization of Remote Hyperspectral Image Data Using Google Earth.
Proceedings of the IEEE International Geoscience & Remote Sensing Symposium, 2006

2002
Fast and memory efficient algorithm for DCT-domain inverse motion compensation of low bitrate video.
Proceedings of the 14th International Conference on Digital Signal Processing, 2002

Design of an Auxiliary Power Distribution Network for an Electric Vehicle.
Proceedings of the 1st IEEE International Workshop on Electronic Design, 2002

1991
3-D camera calibration using vanishing point concept.
Pattern Recognit., 1991

