Jun-Kun Chen

According to our database¹, Jun-Kun Chen authored at least 38 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Virtual Fitting Room: Generating Arbitrarily Long Videos of Virtual Try-On from a Single Image - Technical Preview.

[BibT_eX]

[DOI]

CoRR, September, 2025

Dress&Dance: Dress up and Dance as You Like It - Technical Preview.

[BibT_eX]

[DOI]

CoRR, August, 2025

V2Edit: Versatile Video Diffusion Editor for Videos and 3D Scenes.

[BibT_eX]

[DOI]

CoRR, March, 2025

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs.

[BibT_eX]

[DOI]

Abdelrahman Abouelenin

CoRR, March, 2025

Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation.

[BibT_eX]

[DOI]

Aswin Shanmugam Subramanian

CoRR, February, 2025

2024

Isochrony-Controlled Speech-to-Text Translation: A study on translating from Sino-Tibetan to Indo-European Languages.

[BibT_eX]

[DOI]

CoRR, 2024

Proto-OOD: Enhancing OOD Object Detection with Prototype Feature Similarity.

[BibT_eX]

[DOI]

CoRR, 2024

Investigating Neural Audio Codecs For Speech Language Model-Based Speech Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

SceneCraft: Layout-Guided 3D Scene Generation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing.

[BibT_eX]

[DOI]

Jun-Kun Chen

Yu-Xiong Wang

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Soft Language Identification for Language-Agnostic Many-to-One End-to-End Speech Translation.

[BibT_eX]

[DOI]

Aswin Shanmugam Subramanian

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Diarist: Streaming Speech Translation with Speaker Diarization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion.

[BibT_eX]

[DOI]

Linzhan Mou

Jun-Kun Chen

Yu-Xiong Wang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Contrastive Learning Relies More on Spatial Inductive Bias Than Supervised Learning: An Empirical Study.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds.

[BibT_eX]

[DOI]

Jun-Kun Chen

Jipeng Lyu

Yu-Xiong Wang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech.

[BibT_eX]

[DOI]

CoRR, 2022

Is Self-Supervised Learning More Robust Than Supervised Learning?

[BibT_eX]

[DOI]

CoRR, 2022

PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit.

[BibT_eX]

[DOI]

CoRR, 2022

Data-Driven Adaptive Simultaneous Machine Translation.

[BibT_eX]

[DOI]

CoRR, 2022

TorchDrug: A Powerful and Flexible Machine Learning Platform for Drug Discovery.

[BibT_eX]

[DOI]

Louis-Pascal A. C. Xhonneux

Meng Qu

Jian Tang

CoRR, 2022

A<sup>3</sup>T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

PointTree: Transformation-Robust Point Cloud Encoder with Relaxed K-D Trees.

[BibT_eX]

[DOI]

Jun-Kun Chen

Yu-Xiong Wang

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

SpecRec: An Alternative Solution for Improving End-to-End Speech-to-Text Translation via Spectrogram Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

RNNLogic: Learning Logic Rules for Reasoning on Knowledge Graphs.

[BibT_eX]

[DOI]

Meng Qu

Jun-Kun Chen

Louis-Pascal A. C. Xhonneux

Yoshua Bengio

Jian Tang

Proceedings of the 9th International Conference on Learning Representations, 2021

Improving Simultaneous Translation by Incorporating Pseudo-References with Fewer Reorderings.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation.

[BibT_eX]

[DOI]

CoRR, 2020

Improving Simultaneous Translation with Pseudo References.

[BibT_eX]

[DOI]

CoRR, 2020

2019

DropAttention: A Regularization Method for Fully-Connected Self-Attention Networks.

[BibT_eX]

[DOI]

CoRR, 2019

VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018

Exploring Shared Structures and Hierarchies for Multiple NLP Tasks.

[BibT_eX]

[DOI]

CoRR, 2018

Same Representation, Different Attentions: Shareable Sentence Representation Learning from Multiple Tasks.

[BibT_eX]

[DOI]

Renjie Zheng

Jun-Kun Chen

Xipeng Qiu

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Meta Multi-Task Learning for Sequence Modeling.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Jun-Kun Chen

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...