A study on hand gesture recognition algorithm realized with the aid of efficient feature extraction method and convolution neural networks: design and its application to VR environment.

Zhen Wang

Sung-Hoon Yoo

Soft Comput., February, 2026

Mitigating scale imbalance and conflicting gradients in deep multi-task learning.

Frontiers Comput. Sci., February, 2026

2025

SoulX-Podcast: Towards Realistic Long-form Podcasts with Dialectal and Paralinguistic Diversity.

CoRR, October, 2025

MeanVC: Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows.

CoRR, October, 2025

A comprehensive benchmarking for evaluating TCR embeddings in modeling TCR-epitope interactions.

Briefings Bioinform., January, 2025

REF-VC: Robust, Expressive and Fast Zero-Shot Voice Conversion with Diffusion Transformers.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

DiffRhythm+: Controllable and Flexible Full-Length Song Generation with Preference Optimization.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

Drop the Beat! Freestyler for Accompaniment Conditioned Rapping Voice Generation.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Unlocking T-cell receptor-epitope insights with structural analysis.

Miaozhe Huo

Yuepeng Jiang

Shuai Cheng Li

Nat. Comput. Sci., July, 2024

WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model Benchmark.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Towards Expressive Zero-Shot Speech Synthesis with Hierarchical Prosody Modeling.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion.

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Validation of MODIS Temperature and Emissivity Products Based on Ground-Based Mid-Wave Hyperspectral Imaging Measurement in the Northwestern Plateau Region of Qinghai, China.

Remote. Sens., August, 2023

Deep autoregressive generative models capture the intrinsics embedded in T-cell receptor repertoires.

Yuepeng Jiang

Shuai Cheng Li