Puyuan Peng
According to our database, Puyuan Peng authored at least 28 papers between 2020 and 2025.
Bibliography
2025
TalkLess: Blending Extractive and Abstractive Speech Summarization for Editing Speech to Preserve Content and Style.
CoRR, July, 2025
VoiceStar: Robust Zero-Shot Autoregressive TTS with Duration Control and Extrapolation.
CoRR, May, 2025
CoRR, May, 2025
CoRR, April, 2025
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
2024
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.
CoRR, 2024
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
SpeechCLIP+: Self-Supervised Multi-Task Representation Learning for Speech Via CLIP and Speech-Image Data.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Integrating Self-Supervised Speech Model with Pseudo Word-Level Targets from Visually-Grounded Speech Model.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
CoRR, 2023
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model.
CoRR, 2023
Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Style-transfer based Speech and Audio-visual Scene understanding for Robot Action Sequence Acquisition from Videos.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
Self-Supervised Representation Learning for Speech Using Visual Grounding and Masked Language Modeling.
CoRR, 2022
Proceedings of the Transfer Learning for Natural Language Processing Workshop, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
2020
CoRR, 2020