Yuan Gong

Orcid: 0000-0002-4537-0078

Affiliations:
  • Massachusetts Institute of Technology, Cambridge, MA, USA


According to our database, Yuan Gong authored at least 32 papers between 2017 and 2023.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2023
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning.
CoRR, 2023

Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers.
CoRR, 2023

SAIL: Search-Augmented Instruction Learning.
CoRR, 2023

Listen, Think, and Understand.
CoRR, 2023

Contrastive Audio-Visual Masked Autoencoder.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Search Augmented Instruction Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Joint Audio and Speech Understanding.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
UAVM: Towards Unifying Audio and Visual Models.
IEEE Signal Process. Lett., 2022

UAVM: A Unified Model for Audio-Visual Learning.
CoRR, 2022

CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification.
CoRR, 2022

Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment.
Proceedings of the IEEE International Conference on Acoustics, 2022

Detecting Dementia from Long Neuropsychological Interviews.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

SSAST: Self-Supervised Audio Spectrogram Transformer.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

PSLA: Improving Audio Event Classification with Pretraining, Sampling, Labeling, and Aggregation.
CoRR, 2021

AST: Audio Spectrogram Transformer.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2020
ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems.
Dataset, June, 2020

Detecting Replay Attacks Using Multi-Channel Audio: A Neural Network-Based Method.
IEEE Signal Process. Lett., 2020

2019
Second-order Non-local Attention Networks for Person Re-identification.
CoRR, 2019

ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems.
Proceedings of the Interspeech 2019, 2019

Real-Time Adversarial Attacks.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Second-Order Non-Local Attention Networks for Person Re-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Towards Learning Fine-Grained Disentangled Representations from Speech.
CoRR, 2018

An Overview of Vulnerabilities of Voice Controlled Systems.
CoRR, 2018

Impact of Aliasing on Deep CNN-Based End-to-End Acoustic Models.
Proceedings of the Interspeech 2018, 2018

Protecting Voice Controlled Systems Using Sound Source Identification Based on Acoustic Cues.
Proceedings of the 27th International Conference on Computer Communication and Networks, 2018

Automatic Autism Spectrum Disorder Detection Using Everyday Vocalizations Captured by Smart Devices.
Proceedings of the 2018 ACM International Conference on Bioinformatics, 2018

Improving LIWC Using Soft Word Matching.
Proceedings of the 2018 ACM International Conference on Bioinformatics, 2018

2017
Crafting Adversarial Examples For Speech Paralinguistics Applications.
CoRR, 2017

Topic Modeling Based Multi-modal Depression Detection.
Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, CA, USA, October 23, 2017

Continuous Assessment of Children's Emotional States Using Acoustic Analysis.
Proceedings of the 2017 IEEE International Conference on Healthcare Informatics, 2017

