Hybrid high-dimensional vine copula-Bayesian network framework for flood risk analysis in reservoir-lake systems: Addressing multisource uncertainties.

[BibT_eX]

[DOI]

Xuesong Yang

Bin Xu

Environ. Model. Softw., 2026

Spatiotemporal correction of decision variables using XGBoost for multi-objective intelligent scheduling rule extraction model in reservoir-lake flood control systems.

[BibT_eX]

[DOI]

Environ. Model. Softw., 2026

LLaVA-UHD v2: Exploiting Hierarchical Vision Granularity in MLLMs via Inverse Semantic Pyramid.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

MM-UAVBench: How Well Do Multimodal Large Language Models See, Think, and Plan in Low-Altitude UAV Scenarios?

[BibT_eX]

[DOI]

CoRR, December, 2025

Align2Speak: Improving TTS for Low Resource Languages via ASR-Guided Online Preference Optimization.

[BibT_eX]

[DOI]

CoRR, September, 2025

Frame-Stacked Local Transformers For Efficient Multi-Codebook Speech Generation.

[BibT_eX]

[DOI]

Ryan Langman Jaehyeon Kim

Subhankar Ghosh

Shehzeen Hussain

Jason Li

CoRR, September, 2025

Computer learning career path optimisation utilising multi-modal large models and privacy-preserving collaborative computing.

[BibT_eX]

[DOI]

Xuesong Yang

Int. J. Inf. Commun. Technol., 2025

Evaluation of teaching quality in database courses based on domain-adaptive transfer learning.

[BibT_eX]

[DOI]

Xuesong Yang

Int. J. Inf. Commun. Technol., 2025

Enhancing empathy of medical students in clinical training: a narrative-driven virtual reality experience for understanding undiagnosed chronic pain.

[BibT_eX]

[DOI]

Wenjie Xu

Xuesong Yang

Frontiers Virtual Real., 2025

Unveiling the molecular mechanisms of Haitang-Xiaoyin Mixture in psoriasis treatment based on bioinformatics, network pharmacology, machine learning, and molecular docking verification.

[BibT_eX]

[DOI]

Comput. Biol. Chem., 2025

HiFiTTS-2: A Large-Scale High Bandwidth Speech Dataset.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

VoiceNoNG: Robust High-Quality Speech Editing Model without Hallucinations.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

NanoCodec: Towards High-Quality Ultra Fast Speech LLM Inference.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Koel-TTS: Enhancing LLM based Speech Generation with Preference Alignment and Classifier Free Guidance.

[BibT_eX]

[DOI]

Shehzeen Samarah Hussain

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

NeKo: Cross-Modality Post-Recognition Error Correction with Tasks-Guided Mixture-of-Experts Language Model.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), 2025

2024

Reservoir Computing Based on Memristor Arrays in Random States.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. I Regul. Pap., July, 2024

Depth asynchronous time delay reservoir for nonlinear time series forecasting task.

[BibT_eX]

[DOI]

Inf. Sci., January, 2024

The Smart City Waste Classification Management System: Strategies and Applications Based on Computer Vision.

[BibT_eX]

[DOI]

J. Organ. End User Comput., 2024

LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer.

[BibT_eX]

[DOI]

CoRR, 2024

NeKo: Toward Post Recognition Generative Correction Large Language Models with Task-Oriented Experts.

[BibT_eX]

[DOI]

CoRR, 2024

Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Pudica: Toward Near-Zero Queuing Delay in Congestion Control for Cloud Gaming.

[BibT_eX]

[DOI]

Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

2023

LibFewShot: A Comprehensive Library for Few-Shot Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Optimizing Electromagnetic Cigarette Heaters Using PSO-NSGA II Algorithm: An Effective Strategy to Improve Temperature Control and Production Rate.

[BibT_eX]

[DOI]

Appl. Artif. Intell., December, 2023

Trip-ROMA: Self-Supervised Learning with Triplets and Random Mappings.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

SLN-RED: Regularization by Simultaneous Local and Nonlocal Denoising for Image Restoration.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2023

2022

A Unified Framework for Contrastive Learning from a Perspective of Affinity Matrix.

[BibT_eX]

[DOI]

CoRR, 2022

NDGGNET-A Node Independent Gate based Graph Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2022

2021

Triplet is All You Need with Random Mappings for Unsupervised Visual Representation Learning.

[BibT_eX]

[DOI]

CoRR, 2021

REDAT: Accent-Invariant Representation for End-To-End ASR by Domain Adversarial Training with Relabeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

2019

Dealing with linguistic mismatches for automatic speech recognition

[BibT_eX]

[DOI]

Xuesong Yang

PhD thesis, 2019

Zero-Shot Voice Style Transfer with Only Autoencoder Loss.

[BibT_eX]

[DOI]

Mark Hasegawa-Johnson

CoRR, 2019

Bayesian Estimation Method for Storage Reliability Based on Drift Brownian Motion.

[BibT_eX]

[DOI]

Xuesong Yang

Shunong Zhang

Honglin Wang

Proceedings of the 2019 IEEE International Conference on Industrial Engineering and Engineering Management, 2019

Wearable Iridium Oxide pH Sensors for Sweat pH Measurements.

[BibT_eX]

[DOI]

Xuesong Yang

Khengdauliu Chawang

Jung-Chih Chiao

Proceedings of the 2019 IEEE SENSORS, Montreal, QC, Canada, October 27-30, 2019, 2019

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss.

[BibT_eX]

[DOI]

Mark Hasegawa-Johnson

Proceedings of the 36th International Conference on Machine Learning, 2019

When CTC Training Meets Acoustic Landmarks.

[BibT_eX]

[DOI]

Mark Hasegawa-Johnson

Deming Chen

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Feature extraction using convolutional neural networks for multi-atlas based image segmentation.

[BibT_eX]

[DOI]

Xuesong Yang

Yong Fan

Proceedings of the Medical Imaging 2018: Image Processing, 2018

Coupled dictionary learning for joint MR image restoration and segmentation.

[BibT_eX]

[DOI]

Xuesong Yang

Yong Fan

Proceedings of the Medical Imaging 2018: Image Processing, 2018

Improved ASR for Under-resourced Languages through Multi-task Learning with Acoustic Landmarks.

[BibT_eX]

[DOI]

Di He

Boon Pang Lim

Xuesong Yang

Mark Hasegawa-Johnson

Deming Chen

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Integrated pH and Sodium Sensor Array Based on Iridium Oxide Film.

[BibT_eX]

[DOI]

Xuesong Yang

Jung-Chih Chiao

Proceedings of the 2018 IEEE SENSORS, New Delhi, India, October 28-31, 2018, 2018

Wireless Iridium Oxide-Based pH Sensing Systems.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE SENSORS, New Delhi, India, October 28-31, 2018, 2018

Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition.

[BibT_eX]

[DOI]

Mark Hasegawa-Johnson

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Deep Learning Based Speech Beamforming.

[BibT_eX]

[DOI]

Dinei A. F. Florêncio

Mark Hasegawa-Johnson

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Metric Learning for Multi-atlas based Segmentation of Hippocampus.

[BibT_eX]

[DOI]

Neuroinformatics, 2017

Acoustic Landmarks Contain More Information About the Phone String than Other Frames.

[BibT_eX]

[DOI]

Di He

Boon Pang Lim

Xuesong Yang

Mark Hasegawa-Johnson

Deming Chen

CoRR, 2017

Speech Enhancement Using Bayesian Wavenet.

[BibT_eX]

[DOI]

Mark Hasegawa-Johnson

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

End-to-end joint learning of natural language understanding and dialogue manager.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A study on landmark detection based on CTC and its application to pronunciation error detection.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

Landmark-based consonant voicing detection on multilingual corpora.

[BibT_eX]

[DOI]

Xiang Kong

Xuesong Yang

Mark Hasegawa-Johnson

Jeung-Yoon Choi

Stefanie Shattuck-Hufnagel

CoRR, 2016

Metric learning for label fusion in multi-atlas based image segmentation.

[BibT_eX]

[DOI]

Proceedings of the 13th IEEE International Symposium on Biomedical Imaging, 2016

2015

Sol-Gel Deposition of Iridium Oxide for Biomedical Micro-Devices.

[BibT_eX]

[DOI]

Sensors, 2015

2014

Machine learning approaches to improving pronunciation error detection on an imbalanced corpus.

[BibT_eX]

[DOI]

Xuesong Yang

Anastassia Loukina

Keelan Evanini

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

2011

Sound source localization for mobile robot based on time difference feature and space grid matching.

[BibT_eX]

[DOI]

Xiaofei Li

Hong Liu

Xuesong Yang

Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Improvement of Segmental Mispronunciation Detection with Prior Knowledge Extracted from Large L2 Speech Corpus.

[BibT_eX]

[DOI]

Dean Luo

Xuesong Yang

Lan Wang

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Xuesong Yang

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...