Chan-Jan Hsu

According to our database1, Chan-Jan Hsu authored at least 18 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2025
Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity.
CoRR, May, 2025

BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation - Challenges and Insights.
CoRR, January, 2025

The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities.
CoRR, January, 2025

Enhancing Function-Calling Capabilities in LLMs: Strategies for Prompt Formats, Data Integration, and Multilingual Translation.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

A Self-Refining Framework for Enhancing ASR Using TTS-Synthesized Data.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Robust and Instruction-Aware ASR and OCR.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
FineWeb-zhtw: Scalable Curation of Traditional Chinese Text Data from the Web.
CoRR, 2024

Building a Taiwanese Mandarin Spoken Language Model: A First Attempt.
CoRR, 2024

Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Multi-modal Text Recognition.
CoRR, 2024

Breeze-7B Technical Report.
CoRR, 2024

FineWeb-zhtw: Scalable Curation of Traditional Chinese Text Data from the Web.
Proceedings of the 36th Conference on Computational Linguistics and Speech Processing, 2024

2023
Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite.
CoRR, 2023

Extending the Pre-Training of BLOOM for Improved Support of Traditional Chinese: Models, Methods and Results.
CoRR, 2023

Bridging Speech and Textual Pre-Trained Models With Unsupervised ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023

T5lephone: Bridging Speech and Text Self-Supervised Models for Spoken Language Understanding Via Phoneme Level T5.
Proceedings of the IEEE International Conference on Acoustics, 2023

Zero-Shot Domain-Sensitive Speech Recognition with Prompt-Conditioning Fine-Tuning.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Analyzing The Robustness of Unsupervised Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

XDBERT: Distilling Visual Information to BERT from Cross-Modal Systems to Improve Language Understanding.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022


  Loading...