Yuchen Hu

Orcid: 0000-0002-0696-6434

According to our database1, Yuchen Hu authored at least 36 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators.
CoRR, 2024

It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition.
CoRR, 2024

Large Language Models are Efficient Learners of Noise-Robust Speech Recognition.
CoRR, 2024

Cross-Modality and Within-Modality Regularization for Audio-Visual DeepFake Detection.
CoRR, 2024

Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
A zero-watermarking scheme based on spatial topological relations for vector dataset.
Expert Syst. Appl., September, 2023

A GNN-based model for capturing spatio-temporal changes in locomotion behaviors of aging <i>C. elegans</i>.
Comput. Biol. Medicine, March, 2023

Improved Generalized IHS Based on Total Variation for Pansharpening.
Remote. Sens., 2023

Generative error correction for code-switching speech recognition using large language models.
CoRR, 2023

Rep2wav: Noise Robust text-to-speech Using self-supervised representations.
CoRR, 2023

Noise-aware Speech Enhancement using Diffusion Probabilistic Model.
CoRR, 2023

A Neural State-Space Model Approach to Efficient Speech Separation.
CoRR, 2023

Noise-aware Speech Separation with Contrastive Learning.
CoRR, 2023

HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Unsupervised Noise Adaptation Using Data Simulation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Metric-Oriented Speech Enhancement Using Diffusion Probabilistic Model.
Proceedings of the IEEE International Conference on Acoustics, 2023

UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
The Second Place Solution for The 4th Large-scale Video Object Segmentation Challenge-Track 3: Referring Video Object Segmentation.
CoRR, 2022

Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning.
CoRR, 2022

Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition.
CoRR, 2022

Interactive Auido-text Representation for Automated Audio Captioning with Contrastive Learning.
Proceedings of the Interspeech 2022, 2022

Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Noise-Robust Speech Recognition With 10 Minutes Unparalleled In-Domain Data.
Proceedings of the IEEE International Conference on Acoustics, 2022

Self-Critical Sequence Training for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Off-Policy Evaluation in Partially Observed Markov Decision Processes.
CoRR, 2021

The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

2019
Study on Prediction of Power Allocation for the Double-Wheel Trench Cutter Control System Based on Extreme Learning Machine Method.
Proceedings of the 4th IEEE International Conference on Advanced Robotics and Mechatronics, 2019

2017
Additive manufacturing technology in spare parts supply chain: a comparative study.
Int. J. Prod. Res., 2017

2016
Gearbox fault diagnosis based on LMD and cyclostationary demodulation.
Proceedings of the 13th International Conference on Ubiquitous Robots and Ambient Intelligence, 2016


  Loading...