Jyh-Shing Roger Jang

Orcid: 0000-0002-7319-9095

According to our database1, Jyh-Shing Roger Jang authored at least 154 papers between 1990 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
EMO-SUPERB: An In-depth Look at Speech Emotion Recognition.
CoRR, 2024

2023
Training a Singing Transcription Model Using Connectionist Temporal Classification Loss and Cross-Entropy Loss.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Applications of Large Language Models in Data Processing: Innovative Approaches to Segmenting and Renewing Information.
CoRR, 2023

Adapting pretrained speech model for Mandarin lyrics transcription and alignment.
CoRR, 2023

WC-SBERT: Zero-Shot Text Classification via SBERT with Self-Training for Wikipedia Categories.
CoRR, 2023

Multi-behavior Recommendation with Action Pattern-aware Networks.
Proceedings of the IEEE International Conference on Web Intelligence and Intelligent Agent Technology, 2023

Adapting Pretrained Speech Model for Mandarin Lyrics Transcription and Alignment.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Zero-Shot Singing Voice Synthesis from Musical Score.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Personalized Audio Quality Preference Prediction.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Multimodal Transformer Distillation for Audio-Visual Synchronization.
CoRR, 2022

Use of multimodal dataset in AI for detecting glaucoma based on fundus photographs assessed with OCT: focus group study on high prevalence of myopia.
BMC Medical Imaging, 2022

Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

CrowNER at Rocling 2022 Shared Task: NER using MacBERT and Adversarial Training.
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing, 2022

A unified model for zero-shot singing voice conversion and synthesis.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Ensemble And Re-Ranking Based On Language Models To Improve ASR.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Improving ASR in Reverberant Environments.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Towards Automatic Transcription of Polyphonic Electric Guitar Music: A New Dataset and a Multi-Loss Transformer Model.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Artificial Intelligence-Assisted Early Detection of Retinitis Pigmentosa - the Most Common Inherited Retinal Degeneration.
J. Digit. Imaging, 2021

Singer separation for karaoke content generation.
CoRR, 2021

On the Preparation and Validation of a Large-Scale Dataset of Singing Transcription.
Proceedings of the IEEE International Conference on Acoustics, 2021

Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Backpropagation With $N$ -D Vector-Valued Neurons Using Arbitrary Bilinear Products.
IEEE Trans. Neural Networks Learn. Syst., 2020

AutoRhythm: A Music Game With Automatic Hit-Timing Generation and Percussion Identification.
IEEE Trans. Games, 2020

Fast Tensor Factorization for Large-Scale Context-Aware Recommendation from Implicit Feedback.
IEEE Trans. Big Data, 2020

Data-driven Feature Selection for Long Longitudinal Breadth and High Dimensional Dataset: Empirical Studies of Metabolic Syndrome Prediction.
Proceedings of the ICMLC 2020: 2020 12th International Conference on Machine Learning and Computing, 2020

2019
An effective method for audio-to-score alignment using onsets and modified constant Q spectra.
Multim. Tools Appl., 2019

Machine Learning Based Early Detection System of Cardiac Arrest.
Proceedings of the 2019 International Conference on Technologies and Applications of Artificial Intelligence, 2019

Predicting Neurodegenerative Diseases Using a Novel Blood Biomarkers-based Model by Machine Learning.
Proceedings of the 2019 International Conference on Technologies and Applications of Artificial Intelligence, 2019

Applying Machine Learning to Design for Reliability Coverage.
Proceedings of the IEEE International Reliability Physics Symposium, 2019

Deep Cyclic Group Networks.
Proceedings of the International Joint Conference on Neural Networks, 2019

Early Detecting In-Hospital Cardiac Arrest Based on Machine Learning on Imbalanced Data.
Proceedings of the 2019 IEEE International Conference on Healthcare Informatics, 2019

Learning to Match Transient Sound Events Using Attentional Similarity for Few-shot Sound Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

K-Same-Siamese-GAN: K-Same Algorithm with Generative Adversarial Network for Facial Image De-identification with Hyperparameter Tuning and Mixed Precision Training.
Proceedings of the 16th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2019

Improving ResNet-based Feature Extractor for Face Recognition via Re-ranking and Approximate Nearest Neighbor.
Proceedings of the 16th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2019

Multi-task Learning for Acoustic Modeling Using Articulatory Attributes.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
A hierarchical linguistic information-based model of English prosody: L2 data analysis and implications for computer-assisted language learning.
Comput. Speech Lang., 2018

Singing Style Transfer Using Cycle-Consistent Boundary Equilibrium Generative Adversarial Networks.
CoRR, 2018

Adaptive Generation of Structured Medical Report Using NER Regarding Deep Learning.
Proceedings of the Conference on Technologies and Applications of Artificial Intelligence, 2018

Using Machine Learning Algorithms in Medication for Cardiac Arrest Early Warning System Construction and Forecasting.
Proceedings of the Conference on Technologies and Applications of Artificial Intelligence, 2018

A Syllable Structure Approach to Spoken Language Recognition.
Proceedings of the Statistical Language and Speech Processing, 2018

使用性別資訊於語者驗證系統之研究與實作 (A study and implementation on Speaker Verification System using Gender Information) [In Chinese].
Proceedings of the 30th Conference on Computational Linguistics and Speech Processing, 2018

基於數字文本相關之語者驗證系統的研究與實作 (Study and Implementation on Digit-related Speaker Verification) [In Chinese].
Proceedings of the 30th Conference on Computational Linguistics and Speech Processing, 2018

Learning to Recognize Transient Sound Events using Attentional Supervision.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

SVSGAN: Singing Voice Separation Via Generative Adversarial Network.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Music Signal Processing Using Vector Product Neural Networks.
CoRR, 2017

基於i-vector與PLDA並使用GMM-HMM強制對位之自動語者分段標記系統 (Speaker Diarization based on I-vector PLDA Scoring and using GMM-HMM Forced Alignment) [In Chinese].
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

Conditional preference nets for user and item cold start problems in music recommendation.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

2016
Addressing Cold Start for Next-song Recommendation.
Proceedings of the 10th ACM Conference on Recommender Systems, 2016

An efficient method for polyphonic audio-to-score alignment using onset detection and constant Q transform.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Singing Voice Separation and Pitch Extraction from Monaural Polyphonic Audio Music via DNN and Adaptive Pitch Tracking.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

Evaluation of singing enthusiasm for songs with multiple phrases.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Combining Acoustic and Multilevel Visual Features for Music Genre Classification.
ACM Trans. Multim. Comput. Commun. Appl., 2015

Audio Musical Dice Game: A User-Preference-Aware Medley Generating System.
ACM Trans. Multim. Comput. Commun. Appl., 2015

Improving Query-by-Singing/Humming by Combining Melody and Lyric Information.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Automatic Pronunciation Scoring with Score Combination by Learning to Rank and Class-Normalized DP-Based Quantization.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Automatic Music Mood Classification Based on Timbre and Modulation Features.
IEEE Trans. Affect. Comput., 2015

Bridging Music Using Sound-Effect Insertion.
IEEE Multim., 2015

AutoRhythm: A music game with automatic hit-time generation and percussion identification.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Toward commercial application of audio fingerprinting technology.
Proceedings of the IEEE International Conference on Consumer Electronics - Taiwan, 2015

Function and speed portability of audio fingerprint extraction across computing platforms.
Proceedings of the IEEE International Conference on Consumer Electronics - Taiwan, 2015

Vocal activity informed singing voice separation with the iKala dataset.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Some Prosodic Characteristics of Taiwan English Accent.
Int. J. Comput. Linguistics Chin. Lang. Process., 2014

A supervised learning method for tempo estimation of musical audio.
Proceedings of the 22nd Mediterranean Conference on Control and Automation, 2014

Phone Boundary Annotation in Conversational Speech.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Gender Identification and Age Estimation of Users Based on Music Metadata.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Improving Query by Tapping via Tempo Alignment.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Bridging Music via Sound Effects.
Proceedings of the 2014 IEEE International Symposium on Multimedia, 2014

On the way to ambient media for sheet music by techniques of information retrieval.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

An Architecture for Optical Music Recognition of Numbered Music Notation.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

Improved score-performance alignment algorithms on polyphonic music.
Proceedings of the IEEE International Conference on Acoustics, 2014

Speeding up audio fingerprinting over GPUs.
Proceedings of the International Conference on Audio, 2014

An effective re-ranking method based on learning to rank for improving audio fingerprinting.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

GPU and Cloud Computing for Two Paradigms of Music Information Retrieval.
Proceedings of the Cloud Computing and Digital Media, 2014

2013
Using Speech Assessment Technique for the Validation of Taiwanese Speech Corpus.
Int. J. Comput. Linguistics Chin. Lang. Process., 2013

Learning to Find Translations and Transliterations on the Web based on Conditional Random Fields.
Int. J. Comput. Linguistics Chin. Lang. Process., 2013

Using Tangible Companions for Enhancing Learning English Conversation.
J. Educ. Technol. Soc., 2013

使用語音評分技術輔助台語語料的驗證 (Using Speech Assessment Technique for the Validation of Taiwanese Speech Corpus) [In Chinese].
Proceedings of the 25th Conference on Computational Linguistics and Speech Processing, 2013

A two-stage query by singing/humming system on GPU.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Discovering Time-Constrained Sequential Patterns for Music Genre Classification.
IEEE Trans. Speech Audio Process., 2012

A Tandem Algorithm for Singing Pitch Extraction and Voice Separation From Music Accompaniment.
IEEE Trans. Speech Audio Process., 2012

台語關鍵詞辨識之實作與比較 (Implementation and Comparison of Keyword Spotting for Taiwanese) [In Chinese].
Proceedings of the 24th Conference on Computational Linguistics and Speech Processing, 2012

台語文字與語音語料庫之建置 (Development of a Taiwanese Speech and Text Corpus) [In Chinese].
Proceedings of the 24th Conference on Computational Linguistics and Speech Processing, 2012

台語朗讀資料庫之自動切音技術應用於音文同步有聲書之建立 (Automatic Time Alignment for a Taiwanese Read Speech Corpus and its Application to Constructing Audiobooks with Text-Speech Synchronization) [In Chinese].
Proceedings of the 24th Conference on Computational Linguistics and Speech Processing, 2012

Improvement in Automatic Pronunciation Scoring using Additional Basic Scores and Learning to Rank.
Proceedings of the INTERSPEECH 2012, 2012

Conceptualization and Significance Study of a New Appliation CS-MIR.
Proceedings of the Artificial Intelligence Applications and Innovations, 2012

Improving Query by Singing/Humming Systems over GPUs.
Proceedings of the 41st International Conference on Parallel Processing Workshops, 2012

A hybrid approach to singing pitch extraction based on trend estimation and hidden Markov models.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Accelerating query by singing/humming on GPU: Optimization for web deployment.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Learning to Find Translations and Transliterations on the Web.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
A Kernel Framework for Content-Based Artist Recommendation System in Music.
IEEE Trans. Multim., 2011

A Two-Fold Dynamic Programming Approach to Beat Tracking for Audio Music with Time-Varying Tempo.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Combining Visual and Acoustic Features for Music Genre Classification.
Proceedings of the 10th International Conference on Machine Learning and Applications and Workshops, 2011

Time-constrained sequential pattern discovery for music genre classification.
Proceedings of the IEEE International Conference on Acoustics, 2011

A trend estimation algorithm for singing pitch detection in musical recordings.
Proceedings of the IEEE International Conference on Acoustics, 2011

Support of software framework for embedded multi-core systems with Android environments.
Proceedings of the 9th IEEE Symposium on Embedded Systems for Real-Time Multimedia, 2011

2010
On the Improvement of Singing Voice Separation for Monaural Recordings Using the MIR-1K Dataset.
IEEE Trans. Speech Audio Process., 2010

Exploring perceptions of integrating tangible learning companions in learning English conversation.
Br. J. Educ. Technol., 2010

An Improved Query by Singing/Humming System Using Melody and Lyrics Information.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Singing Pitch Extraction by Voice Vibrato / Tremolo Estimation and Instrument Partial Deletion.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Music Genre Classification via Compressive Sampling.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Coping imbalanced prosodic unit boundary detection with linguistically-motivated prosodic features.
Proceedings of the INTERSPEECH 2010, 2010

Automatic pronunciation scoring using learning to rank and DP-based score segmentation.
Proceedings of the INTERSPEECH 2010, 2010

On the use of sequential patterns mining as temporal features for music genre classification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Using Tangible Learning Companions in English Education.
Proceedings of the ICALT 2010, 2010

Support of Android lab modules for embedded system curriculum.
Proceedings of the 2010 Workshop on Embedded Systems Education, 2010

2009
On the Use of Anti-Word Models for Audio Music Annotation and Retrieval.
IEEE Trans. Speech Audio Process., 2009

Singing Pitch Extraction from Monaural Polyphonic Songs by Contextual Audio Modeling and Singing Harmonic Enhancement.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Evaluation of Tangible Learning Companion/Robot for English Language Learning.
Proceedings of the 9th IEEE International Conference on Advanced Learning Technologies, 2009

2008
A General Framework of Progressive Filtering and Its Application to Query by Singing/Humming.
IEEE Trans. Speech Audio Process., 2008

TRUES: Tone Recognition Using Extended Segments.
ACM Trans. Asian Lang. Inf. Process., 2008

Minimum phone error discriminative training for Mandarin Chinese speaker adaptation.
Proceedings of the INTERSPEECH 2008, 2008

2007
Automatic Phonetic Segmentation by Score Predictive Model for the Corpora of Mandarin Singing Voices.
IEEE Trans. Speech Audio Process., 2007

Automatic Pronunciation Assessment for Mandarin Chinese: Approaches and System Overview.
Int. J. Comput. Linguistics Chin. Lang. Process., 2007

An effective initial/final duration prediction method for corpus-based singing voice synthesis of Mandarin Chinese.
Proceedings of the INTERSPEECH 2007, 2007

2006
Alignment of bilingual named entities in parallel corpora using statistical models and multiple knowledge sources.
ACM Trans. Asian Lang. Inf. Process., 2006

Extraction of transliteration pairs from parallel corpora using a statistical transliteration model.
Inf. Sci., 2006

Admission control schemes for proportional differentiated services enabled internet servers using machine learning techniques.
Expert Syst. Appl., 2006

An Initial Study on Progressive Filtering Based on Dynamic Programming for Query-by-Singing/Humming.
Proceedings of the Advances in Multimedia Information Processing, 2006

Automatic phonetic segmentation by using a SPM-based approach for a Mandarin singing voice corpus.
Proceedings of the INTERSPEECH 2006, 2006

Formant-based English vowel assessment for Chinese in Taiwan.
Proceedings of the INTERSPEECH 2006, 2006

2005
Automatic Segmentation and Labeling for Mandarin Chinese Speech Corpora for Concatenation-based TTS.
Int. J. Comput. Linguistics Chin. Lang. Process., 2005

A corpus-based singing voice synthesis system for mandarin Chinese.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Continuous HMM and Its Enhancement for Singing/Humming Query Retrieval.
Proceedings of the ISMIR 2005, 2005

A hybrid approach to automatic segmentation and labeling for Mandarin Chinese speech corpus.
Proceedings of the INTERSPEECH 2005, 2005

2004
Research and developments of a multi-modal MIR engine for commercial applications in East Asia.
J. Assoc. Inf. Sci. Technol., 2004

基於反轉檔查找與最佳片段選取演算法的中文語音合成系統 (A Mandarin Text-to-speech System based on Inverted File Indexing and Unit Selection) [In Chinese].
Proceedings of the 16th Conference on Computational Linguistics and Speech Processing, 2004

以語音辨識與評分輔助口說英文學習 (Spoken English Learning Based on Speech Recognition and Assessment) [In Chinese].
Proceedings of the 16th Conference on Computational Linguistics and Speech Processing, 2004

Computer assisted spoken English learning for Chinese in Taiwan.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

A two-phase pitch marking method for TD-PSOLA synthesis.
Proceedings of the INTERSPEECH 2004, 2004

i-Ring: a system for humming transcription and chord generation.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Automatic pronunciation assessment for Mandarin Chinese.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

2003
盲胞有聲書語音查詢系統 (A Speech-enabled Talking Book Retrieval System for the Blind) [In Chinese].
Proceedings of the 15th Conference on Computational Linguistics and Speech Processing, 2003

線上新聞語音檢索系統 (Online New Retrieval Based on Speech Input) [In Chinese].
Proceedings of the 15th Conference on Computational Linguistics and Speech Processing, 2003

A Statistical Approach to Chinese-to-English Back-Transliteration.
Proceedings of the 17th Pacific Asia Conference on Language, Information and Computation, 2003

An automatic singing voice rectifier design.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

Microcontroller implementation of melody recognition: a prototype.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

New refinement schemes for voice conversion.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

2002
An On-the-Fly Mandarin Singing Voice Synthesis System.
Proceedings of the Advances in Multimedia Information Processing, 2002

2001
Query by Tapping: A New Paradigm for Content-Based Music Retrieval from Acoustic Input.
Proceedings of the Advances in Multimedia Information Processing, 2001

Super MBox: an efficient/effective content-based music retrieval system.
Proceedings of the 9th ACM International Conference on Multimedia 2001, Ottawa, Ontario, Canada, September 30, 2001

Hierarchical filtering method for content-based music retrieval via acoustic input.
Proceedings of the 9th ACM International Conference on Multimedia 2001, Ottawa, Ontario, Canada, September 30, 2001

Content-based Music Retrieval Using Linear Scaling and Branch-and-bound Tree Search.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

2000
Evolving color recipes.
IEEE Trans. Syst. Man Cybern. Part C, 2000

1998
Author's Reply.
IEEE Trans. Neural Networks, 1998

1997
Neuro-Fuzzy and Soft Computing-A Computational Approach to Learning and Machine Intelligence [Book Review].
IEEE Trans. Autom. Control., 1997

1995
Neuro-fuzzy modeling and control.
Proc. IEEE, 1995

Coactive neuro-fuzzy modelling for colour recipe prediction.
Proceedings of International Conference on Neural Networks (ICNN'95), Perth, WA, Australia, November 27, 1995

Coactive neural fuzzy modeling.
Proceedings of International Conference on Neural Networks (ICNN'95), Perth, WA, Australia, November 27, 1995

1993
ANFIS: adaptive-network-based fuzzy inference system.
IEEE Trans. Syst. Man Cybern., 1993

Functional equivalence between radial basis function networks and fuzzy inference systems.
IEEE Trans. Neural Networks, 1993

Using Genetic Algorithms in Structuring a Fuzzy Rulebase.
Proceedings of the 5th International Conference on Genetic Algorithms, 1993

1992
Self-learning fuzzy controllers based on temporal backpropagation.
IEEE Trans. Neural Networks, 1992

1991
Fuzzy Modeling Using Generalized Neural Networks and Kalman Filter Algorithm.
Proceedings of the 9th National Conference on Artificial Intelligence, 1991

1990
A hierarchical approach to designing approximate reasoning-based controllers for dynamic physical systems.
Proceedings of the UAI '90: Proceedings of the Sixth Annual Conference on Uncertainty in Artificial Intelligence, 1990


  Loading...