Yike Zhang

This is a disambiguation page: it contains multiple papers by different people with the same or a similar name.

Bibliography

2025
LeanK: Learnable K Cache Channel Pruning for Efficient Decoding.
CoRR, August, 2025

TriangleMix: A Lossless and Efficient Attention Pattern for Long Context Prefilling.
CoRR, July, 2025

Efficient Attention Mechanisms for Large Language Models: A Survey.
CoRR, July, 2025

Weakly-supervised Mamba-Based Mastoidectomy Shape Prediction for Cochlear Implant Surgery Using 3D T-Distribution Loss.
CoRR, May, 2025

Unscented recursive three-step filter based unbiased minimum-variance estimation for a class of nonlinear systems.
Int. J. Syst. Sci., January, 2025

LUCY: Linguistic Understanding and Control Yielding Early Stage of Her.
CoRR, January, 2025

Maximum correntropy recursive three-step filter.
Syst. Control. Lett., 2025

An advanced multi-source data fusion method utilizing deep learning techniques for fire detection.
Eng. Appl. Artif. Intell., 2025

M-MoE: Mixture of Mixture-of-Expert Model for CTC-based Streaming Multilingual ASR.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

FocusLLM: Precise Understanding of Long Context by Dynamic Condensing.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Maximum correntropy unbiased minimum-variance filter.
Signal Process., 2024

FreeCodec: A disentangled neural speech codec with fewer tokens.
CoRR, 2024

Molecular Dynamics and Machine Learning Unlock Possibilities in Beauty Design - A Perspective.
CoRR, 2024

FocusLLM: Scaling LLM's Context by Parallel Decoding.
CoRR, 2024

A Transcription Prompt-based Efficient Audio Large Language Model for Robust Speech Recognition.
CoRR, 2024

A Transcription Prompt-based Efficient Audio Large Language Model for Robust Speech Recognition.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023
Evaluation of skin sympathetic nervous activity for classification of intracerebral hemorrhage and outcome prediction.
Comput. Biol. Medicine, November, 2023

The Objective Dementia Severity Scale Based on MRI with Contrastive Learning: A Whole Brain Neuroimaging Perspective.
Sensors, August, 2023

Two Stage Contextual Word Filtering for Context Bias in Unified Streaming and Non-streaming Transducer.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

As with Wine, Life Gets Better with Age. Redefining Mobile User Interface (UI) Components in the Age-Friendly Design Transformation.
Proceedings of the Cross-Cultural Design, 2023

2022
A practical framework for multi-domain speech recognition and an instance sampling method to neural language modeling.
CoRR, 2022

Design and evaluation of an autonomic nerve monitoring system based on skin sympathetic nerve activity.
Biomed. Signal Process. Control., 2022

Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Conversational Speech Recognition by Learning Conversation-Level Characteristics.
Proceedings of the IEEE International Conference on Acoustics, 2022

Improving CTC-Based Speech Recognition Via Knowledge Transferring from Pre-Trained Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model.
CoRR, 2021

Improving Speech Recognition Accuracy of Local POI Using Geographical Models.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Improving Streaming Transformer Based ASR Under a Framework of Self-Supervised Learning.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Improving Hybrid CTC/Attention End-to-End Speech Recognition with Pretrained Acoustic and Language Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2019
Tailoring an Interpretable Neural Language Model.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Multiple Temporal Scales Based Speaker Embeddings Learning for Text-dependent Speaker Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Research on Optimization Algorithm of Deep Learning (深度学习优化算法研究).
Computer Science (计算机科学), 2018

Evaluating Modeling Units and Sub-word Features in Language Models for Turkish ASR.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Improving Language Modeling with an Adversarial Critic for Automatic Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
An improved lexicon generation method for mandarin speech recognition.
Proceedings of the 13th International Conference on Natural Computation, 2017

2016
An unsupervised vocabulary selection technique for Chinese automatic speech recognition.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

2012
Single Event and Scenario Generation Based on Advertising Rhetorical Techniques Using the Conceptual Dictionary in Narrative Generation System.
Proceedings of the 2012 IEEE Fourth International Conference On Digital Game And Intelligent Toy Enhanced Learning, 2012

The Rhetoric of Defamiliarization for Narrative Generation using the Constraints in a Conceptual Dictionary.
Proceedings of the 34th Annual Meeting of the Cognitive Science Society, 2012

2011
An advertising rhetorical mechanism for single event combined with conceptual dictionary in narrative generation system.
Proceedings of the 7th International Conference on Natural Language Processing and Knowledge Engineering, 2011

