Jun-Kun Chen

According to our database1, Jun-Kun Chen authored at least 36 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
V2Edit: Versatile Video Diffusion Editor for Videos and 3D Scenes.
CoRR, March, 2025

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs.
CoRR, March, 2025

Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation.
CoRR, February, 2025

2024
Isochrony-Controlled Speech-to-Text Translation: A study on translating from Sino-Tibetan to Indo-European Languages.
CoRR, 2024

Proto-OOD: Enhancing OOD Object Detection with Prototype Feature Similarity.
CoRR, 2024

Investigating Neural Audio Codecs For Speech Language Model-Based Speech Generation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

SceneCraft: Layout-Guided 3D Scene Generation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Soft Language Identification for Language-Agnostic Many-to-One End-to-End Speech Translation.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Diarist: Streaming Speech Translation with Speaker Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2024

Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Contrastive Learning Relies More on Spatial Inductive Bias Than Supervised Learning: An Empirical Study.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech.
CoRR, 2022

Is Self-Supervised Learning More Robust Than Supervised Learning?
CoRR, 2022

PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit.
CoRR, 2022

Data-Driven Adaptive Simultaneous Machine Translation.
CoRR, 2022

TorchDrug: A Powerful and Flexible Machine Learning Platform for Drug Discovery.
CoRR, 2022

A<sup>3</sup>T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing.
Proceedings of the International Conference on Machine Learning, 2022

PointTree: Transformation-Robust Point Cloud Encoder with Relaxed K-D Trees.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
SpecRec: An Alternative Solution for Improving End-to-End Speech-to-Text Translation via Spectrogram Reconstruction.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation.
Proceedings of the 38th International Conference on Machine Learning, 2021

RNNLogic: Learning Logic Rules for Reasoning on Knowledge Graphs.
Proceedings of the 9th International Conference on Learning Representations, 2021

Improving Simultaneous Translation by Incorporating Pseudo-References with Fewer Reorderings.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation.
CoRR, 2020

Improving Simultaneous Translation with Pseudo References.
CoRR, 2020

2019
DropAttention: A Regularization Method for Fully-Connected Self-Attention Networks.
CoRR, 2019

VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Exploring Shared Structures and Hierarchies for Multiple NLP Tasks.
CoRR, 2018

Same Representation, Different Attentions: Shareable Sentence Representation Learning from Multiple Tasks.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Meta Multi-Task Learning for Sequence Modeling.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018


  Loading...