Jun-Kun Chen
According to our database, Jun-Kun Chen authored at least 36 papers between 2018 and 2025.
Bibliography
2025
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs.
CoRR, March, 2025
Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation.
CoRR, February, 2025
2024
Isochrony-Controlled Speech-to-Text Translation: A study on translating from Sino-Tibetan to Indo-European Languages.
CoRR, 2024
CoRR, 2024
Proceedings of the IEEE Spoken Language Technology Workshop, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Soft Language Identification for Language-Agnostic Many-to-One End-to-End Speech Translation.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024
Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Contrastive Learning Relies More on Spatial Inductive Bias Than Supervised Learning: An Empirical Study.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech.
CoRR, 2022
CoRR, 2022
A³T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing.
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
2021
SpecRec: An Alternative Solution for Improving End-to-End Speech-to-Text Translation via Spectrogram Reconstruction.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation.
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Improving Simultaneous Translation by Incorporating Pseudo-References with Fewer Reorderings.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
2020
2019
CoRR, 2019
VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
2018
Same Representation, Different Attentions: Shareable Sentence Representation Learning from Multiple Tasks.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018