Yang Zhang

Chuchu Fan

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Code-as-Symbolic-Planner: Foundation Model-Based Robot Planning via Symbolic Code Generation.

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing.

Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025

CommVQ: Commutative Vector Quantization for KV Cache Compression.

Junyan Li

Muhammad Yusuf Hassan

Proceedings of the Forty-second International Conference on Machine Learning, 2025

A Hitchhiker's Guide to Scaling Law Estimation.

Leshem Choshen

Jacob Andreas

Proceedings of the Forty-second International Conference on Machine Learning, 2025

CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming.

Yilun Hao

Chuchu Fan

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for MLLMs.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Augment before You Try: Knowledge-Enhanced Table Question Answering via Table Expansion.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

VSP: Assessing the dual challenges of perception and reasoning in spatial planning tasks for VLMs.

CoRR, 2024

Towards Unsupervised Speech Recognition Without Pronunciation Models.

Chang D. Yoo

CoRR, 2024

Large Language Models Can Plan Your Travels Rigorously with Formal Verification Tools.

CoRR, 2024

PRompt Optimization in Multi-Step Tasks (PROMST): Integrating Human Feedback and Preference Alignment.

CoRR, 2024

Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Paraphrase and Solve: Exploring and Exploiting the Impact of Surface Form on Mathematical Reasoning in Large Language Models.

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Advancing the Robustness of Large Language Models through Self-Denoised Smoothing.

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

Scalable Multi-Robot Collaboration with Large Language Models: Centralized or Decentralized Systems?

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

AutoTAMP: Autoregressive Task and Motion Planning with LLMs as Translators and Checkers.

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Speech Self-Supervised Learning Using Diffusion Model Synthetic Data.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

PRompt Optimization in Multi-Step Tasks (PROMST): Integrating Human Feedback and Heuristic-based Sampling.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Correcting Diffusion Generation Through Resampling.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Certified Robustness for Large Language Models with Self-Denoising.

CoRR, 2023

Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning.

CoRR, 2023

AutoTAMP: Autoregressive Task and Motion Planning with LLMs as Translators and Checkers.

CoRR, 2023

Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models.

Proceedings of the International Conference on Machine Learning, 2023

Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning.

Proceedings of the International Conference on Machine Learning, 2023

PromptBoosting: Black-Box Text Classification with Ten Forward Passes.

Proceedings of the International Conference on Machine Learning, 2023

TextGrad: Advancing Robustness Evaluation in NLP by Gradient-Driven Optimization.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

NL2TL: Transforming Natural Languages to Temporal Logics using Large Language Models.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Audio-Visual Neural Syntax Acquisition.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

Domain Generalization for Language-Independent Automatic Speech Recognition.

Frontiers Artif. Intell., 2022

Barrier functions enable safety-conscious force-feedback control.

CoRR, 2022

Improving Self-Supervised Speech Representations by Disentangling Speakers.

CoRR, 2022

SpeechSplit 2.0: Unsupervised Speech Disentanglement for Voice Conversion Without Tuning Autoencoder Bottlenecks.

Chak Ho Chan

Kaizhi Qian

CoRR, 2022

Topogivity: A Machine-Learned Chemical Rule for Discovering Topological Materials.

CoRR, 2022

Fairness Reprogramming.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings.

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

WavPrompt: Towards Few-Shot Spoken Language Understanding with Frozen Language Models.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers.

Proceedings of the International Conference on Machine Learning, 2022

Data-Efficient Double-Win Lottery Tickets from Robust Pre-training.

Proceedings of the International Conference on Machine Learning, 2022

Linking Emergent and Natural Languages via Corpus Transfer.

Shunyu Yao

Mo Yu

Karthik R. Narasimhan

Joshua B. Tenenbaum

Chuang Gan

Proceedings of the Tenth International Conference on Learning Representations, 2022

Adversarial Support Alignment.

Proceedings of the Tenth International Conference on Learning Representations, 2022

On the Interplay between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

SpeechSplit2.0: Unsupervised Speech Disentanglement for Voice Conversion without Tuning Autoencoder Bottlenecks.

Chak Ho Chan

Kaizhi Qian

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Knowledge Graph Guided Simultaneous Forecasting and Network Learning for Multivariate Financial Time Series.

Proceedings of the 3rd ACM International Conference on AI in Finance, 2022

An Adversarial Framework for Generating Unseen Images by Activation Maximization.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Global Rhythm Style Transfer Without Text Transcriptions.

CoRR, 2021

Understanding Interlocking Dynamics of Cooperative Rationalization.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Drawing Robust Scratch Tickets: Subnetworks with Inborn Robustness Are Found within Randomly Initialized Networks.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Speech Denoising with Auditory Models.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Zero-Shot Cross-Lingual Phonetic Recognition with External Language Embedding.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Global Prosody Style Transfer Without Text Transcriptions.

Proceedings of the 38th International Conference on Machine Learning, 2021

Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators.

Proceedings of the 38th International Conference on Machine Learning, 2021

SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-powered Intelligent PhlatCam.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Continuous CNN for Nonuniform Time Series.

Jishen Zhao

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

Probabilistic framework for modeling event shocks to financial time series.

Proceedings of the ICAIF'21: 2nd ACM International Conference on AI in Finance, Virtual Event, November 3, 2021

The Lottery Tickets Hypothesis for Supervised and Self-Supervised Pre-Training in Computer Vision Models.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Generating Visually Aligned Sound From Videos.

IEEE Trans. Image Process., 2020

Deep Network Perceptual Losses for Speech Denoising.

CoRR, 2020

The Lottery Ticket Hypothesis for Pre-trained BERT Networks.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Unsupervised Speech Decomposition via Triple Information Bottleneck.

Kaizhi Qian

David D. Cox

Proceedings of the 37th International Conference on Machine Learning, 2020

Invariant Rationalization.

Proceedings of the 37th International Conference on Machine Learning, 2020

2019

An Efficient and Margin-Approaching Zero-Confidence Adversarial Attack.

CoRR, 2019

Zero-Shot Voice Style Transfer with Only Autoencoder Loss.

CoRR, 2019

A Game Theoretic Approach to Class-wise Selective Rationalization.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss.

Proceedings of the 36th International Conference on Machine Learning, 2019

Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control.

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Grounding Spoken Words in Unlabeled Video.

Rogério Schmidt Feris

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018

Deep Learning Based Speech Beamforming.

Dinei A. F. Florêncio

Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

Geometry-Aware Traffic Flow Analysis by Detection and Tracking.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017

Application of generative models in speech processing tasks.

PhD thesis, 2017

A multidisciplinary approach to designing and evaluating Electronic Medical Record portal messages that support patient self-care.

Daniel G. Morrow

Renato Ferreira Leitão Azevedo

William Schuh

Kuangxiao Gu

Bidisha Roy

Rocío García-Retamero

J. Biomed. Informatics, 2017

Streaming Recommender Systems.

Proceedings of the 26th International Conference on World Wide Web, 2017

Dilated Recurrent Neural Networks.

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Glottal Model Based Speech Beamforming for ad-hoc Microphone Arrays.

Dinei Florêncio

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Speech Enhancement Using Bayesian Wavenet.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Fast Generation for Convolutional Autoregressive Models.

Roy H. Campbell

Renato Ferreira Leitão Azevedo

Proceedings of the 5th International Conference on Learning Representations, 2017

Dr. Babel Fish: A Machine Translator to Simplify Providers' Language.

Tarek Sakakini

Renato Ferreira Leitão Azevedo

Ann M. Willemsen-Dunlap

Donald J. Halpin

James F. Graumlich

Proceedings of the AMIA 2017, 2017

Using Computer Agents to Explain Clinical Test Results.

Proceedings of the AMIA 2017, 2017

2016

Fast Wavenet Generation Algorithm.

CoRR, 2016

Use of particle filtering and MCMC for inference in Probabilistic Acoustic Tube model.

Ruobai Wang

Proceedings of the IEEE Statistical Signal Processing Workshop, 2016

Positive-Unlabeled Learning in Streaming Networks.

Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

2015

Incorporating AM-FM effect in voiced speech for probabilistic acoustic tube model.

Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Multichannel transient acoustic signal classification using task-driven dictionary with joint sparsity and beamforming.

Nasser M. Nasrabadi

Proceedings of the 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, 2015

2014

An iterative approach to decision tree training for context dependent speech synthesis.

Xiayu Chen

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Improvement of Probabilistic Acoustic Tube model for speech decomposition.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2014

2012

Probabilistic acoustic tube: a probabilistic generative model of speech for speech analysis/synthesis.
