Jian Xue
This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.
Bibliography
2025
Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation.
CoRR, February, 2025
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025
Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
2024
Remote. Sens., July, 2024
Comput. Biol. Medicine, March, 2024
MambaDETR: Query-based Temporal Modeling using State Space Model for Multi-View 3D Object Detection.
CoRR, 2024
Isochrony-Controlled Speech-to-Text Translation: A study on translating from Sino-Tibetan to Indo-European Languages.
CoRR, 2024
CoRR, 2024
Soft Language Identification for Language-Agnostic Many-to-One End-to-End Speech Translation.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
LLMNDC: A Novel Approach for Network Device Configuration based on Fine-tuned Large Language Models.
Proceedings of the 5th International Conference on Computer Engineering and Intelligent Control, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
LAMASSU: A Streaming Language-Agnostic Multilingual Speech Recognition and Translation Model Using Neural Transducers.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Signal Processing, 2023
Fast and Accurate Factorized Neural Transducer for Text Adaption of End-to-End Speech Recognition Models.
Proceedings of the IEEE International Conference on Acoustics, 2023
A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Building High-Accuracy Multilingual ASR With Gated Language Experts and Curriculum Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition.
CoRR, 2022
LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers.
CoRR, 2022
Streaming, Fast and Accurate on-Device Inverse Text Normalization for Automatic Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the 5th IEEE International Conference on Multimedia Information Processing and Retrieval, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
IEEE Trans. Instrum. Meas., 2021
The Influence of Substituting Prices, Product Returns, and Service Quality on Repurchase Intention.
Complex., 2021
Improving Multilingual Transformer Transducer Models by Reducing Language Confusions.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2019
IEEE Trans. Ind. Electron., 2019
Modelling of gene signal attribute reduction based on neighbourhood granulation and rough approximation.
Int. J. Model. Identif. Control., 2019
A Markerless Body Motion Capture System for Character Animation Based on Multi-view Cameras.
Proceedings of the IEEE International Conference on Acoustics, 2019
2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 11th Asian Control Conference, 2017
2015
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015
Investigating online low-footprint speaker adaptation using generalized linear regression and click-through data.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Harmonizing model with transfer tax on water pollution across regional boundaries in a China's lake basin.
Eur. J. Oper. Res., 2013
The IBM speech-to-speech translation system for smartphone: Improvements for resource-constrained tasks.
Comput. Speech Lang., 2013
Restructuring of deep neural network acoustic models with singular value decomposition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Investigations on hessian-free optimization for cross-entropy training of deep neural networks.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
2012
Hidden Markov Acoustic Modeling With Bootstrap and Restructuring for Low-Resourced Languages.
IEEE Trans. Speech Audio Process., 2012
Oper. Res. Lett., 2012
Research based on improved fuzzy immune PID algorithm optimized copper electrolysis rectifier system.
Proceedings of the 2nd IEEE International Conference on Cloud Computing and Intelligence Systems, 2012
2011
Towards High Performance LVCSR in Speech-to-Speech Translation System on Smart Phones.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the Seventh International Conference on Fuzzy Systems and Knowledge Discovery, 2010
2009
A study of bootstrapping with multiple acoustic features for improved automatic speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Semi-tied covariance matrices for acoustic models based on random forests of phonetic decision trees.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
2008
Random Forests of Phonetic Decision Trees for Acoustic Modeling in Conversational Speech Recognition.
IEEE Trans. Speech Audio Process., 2008
High-performance low-latency speech recognition via multi-layered feature streaming and fast Gaussian computation.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
Proceedings of the Computer And Computing Technologies In Agriculture, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Random Forests-Based Confidence Annotation Using Novel Features from Confusion Network.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
Incremental largest margin linear regression and MAP adaptation for speech separation in telemedicine applications.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
1997
A novel approach to the optimal biorthogonal analysis window sequence of the discrete Gabor expansion.
Signal Process., 1997