Xuanjun Chen

Orcid: 0009-0002-5930-3797

According to our database1, Xuanjun Chen authored at least 33 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
CodaRAG: Connecting the Dots with Associativity Inspired by Complementary Learning.
CoRR, April, 2026

Joint Fullband-Subband Modeling for High-Resolution SingFake Detection.
CoRR, April, 2026

Training-Efficient Text-to-Music Generation with State-Space Modeling.
CoRR, January, 2026

CIMNet: Joint Search for Neural Network and Computing-in-Memory Architectures.
IEEE Micro, 2026

2025
A Preliminary Study of RAG for Taiwanese Historical Archives.
CoRR, November, 2025

How Does Instrumental Music Help SingFake Detection?
CoRR, September, 2025

Localizing Audio-Visual Deepfakes via Hierarchical Boundary Modeling.
CoRR, August, 2025

Exploring State-Space-Model based Language Model in Music Generation.
CoRR, July, 2025

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment.
CoRR, July, 2025

A Preliminary Exploration with GPT-4o Voice Mode.
CoRR, February, 2025

CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset.
CoRR, January, 2025

Codec-Based Deepfake Source Tracing via Neural Audio Codec Taxonomy.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Towards Generalized Source Tracing for Codec-Based Deepfake Speech.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

2024
Building a Taiwanese Mandarin Spoken Language Model: A First Attempt.
CoRR, 2024

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.
CoRR, 2024

Singing Voice Graph Modeling for SingFake Detection.
CoRR, 2024

Towards audio language modeling - an overview.
CoRR, 2024

Codec-Superb @ SLT 2024: A Lightweight Benchmark For Neural Audio Codec Models.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Singer Separation for Karaoke Content Generation.
Proceedings of the 27th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2024

PointCIM: A Computing-in-Memory Architecture for Accelerating Deep Point Cloud Analytics.
Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024

Singing Voice Graph Modeling for SingFake Detection.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Neural Codec-based Adversarial Sample Detection for Speaker Verification.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Multimodal Transformer Distillation for Audio-Visual Synchronization.
Proceedings of the IEEE International Conference on Acoustics, 2024

Codec-SUPERB: An In-Depth Analysis of Sound Codec Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Intelligent Directing System for Music Concert Scene Based on Visual and Auditory Information.
Proceedings of the 2023 ACM International Conference on Interactive Media Experiences Workshops, 2023

Unified Agile Accuracy Assessment in Computing-in-Memory Neural Accelerators by Layerwise Dynamical Isometry.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

2022
Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

2021
Singer separation for karaoke content generation.
CoRR, 2021

ezGeno: an automatic model selection package for genomic data analysis.
Bioinform., 2021

2020
Enterprise financial management information system based on cloud computing in big data environment.
J. Intell. Fuzzy Syst., 2020


  Loading...