Songjun Cao
According to our database, Songjun Cao authored at least 24 papers between 2020 and 2026.
Bibliography
2026
CoRR, April, 2026
Controllable Spoken Dialogue Generation: An LLM-Driven Grading System for K-12 Non-Native English Learners.
CoRR, April, 2026
Thinking with Constructions: A Benchmark and Policy Optimization for Visual-Text Interleaved Geometric Reasoning.
CoRR, March, 2026
Leveraging large multimodal models for audio-video deepfake detection: a pilot study.
CoRR, February, 2026
Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
CoRR, June, 2025
CoRR, January, 2025
SonarGuard2: Ultrasonic Face Liveness Detection Based on Adaptive Doppler Effect Feature Extraction.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025
Monotonic Attention for Robust Text-to-Speech Synthesis in Large Language Model Frameworks.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025
DiffCSS: Diverse and Expressive Conversational Speech Synthesis with Diffusion Models.
Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025
Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025
2024
A Transcription Prompt-based Efficient Audio Large Language Model for Robust Speech Recognition.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
2023
2022
A practical framework for multi-domain speech recognition and an instance sampling method to neural language modeling.
CoRR, 2022
Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Improving CTC-Based Speech Recognition Via Knowledge Transferring from Pre-Trained Language Models.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022
2021
Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model.
CoRR, 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-Supervised Learning.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Improving Streaming Transformer Based ASR Under a Framework of Self-Supervised Learning.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Improving Hybrid CTC/Attention End-to-End Speech Recognition with Pretrained Acoustic and Language Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020