Jia-Ching Wang

Orcid: 0000-0003-0024-6732

According to our database1, Jia-Ching Wang authored at least 207 papers between 1999 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Multi-view and multi-augmentation for self-supervised visual representation learning.
Appl. Intell., January, 2024

Attention-Guided Prototype Mixing: Diversifying Minority Context on Imbalanced Whole Slide Images Classification Learning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

2023
Electrocardiogram Heartbeat Classification for Arrhythmias and Myocardial Infarction.
Sensors, March, 2023

Anti-aliasing convolution neural network of finger vein recognition for virtual reality (VR) human-robot equipment of metaverse.
J. Supercomput., 2023

Cyclic Transfer Learning for Mandarin-English Code-Switching Speech Recognition.
IEEE Signal Process. Lett., 2023

Diffusion to Confusion: Naturalistic Adversarial Patch Generation Based on Diffusion Model for Object Detector.
CoRR, 2023

Ensemble Learning Technique with A Novelty Multi‑Source Information for Stock Price Movements.
Proceedings of the 12th International Symposium on Information and Communication Technology, 2023

Zero-Shot Voice Conversion Based on Speaker Embedding Domain Generalization.
Proceedings of the International Conference on Computing and Communication Technologies, 2023

A 3mW 2.7GS/s 8b Subranging ADC with Multiple-Reference-Reference-Embedded Comparators.
Proceedings of the IEEE International Solid- State Circuits Conference, 2023

3D Face Reconstruction Based on Weakly-Supervised Learning Morphable Face Model.
Proceedings of the IEEE International Conference on Image Processing, 2023

Mask Generation with Meta-Learning Classifier Weight Transformer Network for Few-Shot Image Segmentation.
Proceedings of the International Conference on Consumer Electronics - Taiwan, 2023

Selinet: A Lightweight Model for Single Channel Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Discriminative Vector Learning with Application to Single Channel Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

CNEG-VC: Contrastive Learning Using Hard Negative Example In Non-Parallel Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2023

Dense Adversarial Transfer Learning Based On Class-Invariance.
Proceedings of the IEEE International Conference on Acoustics, 2023

Code-Switching Speech Synthesis Based on Self-Supervised Learning and Domain Adaptive Speaker Encoder.
Proceedings of the IEEE International Conference on Acoustics, 2023

EMIX: A Data Augmentation Method for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

On the Optimal Self-Supervised Multi-Fault Detector for Temperature Sensor Data.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

STUA-Net: A Fingerprint Reconstruction with Swin Transformer and Soft Collective Attention.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Question Answering System Based on Pre-Training Model and Retrieval Reranking for Industry 4.0.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Fast Gated Recurrent Network for Speech Synthesis.
IEICE Trans. Inf. Syst., September, 2022

Spectral-Temporal Receptive Field-Based Descriptors and Hierarchical Cascade Deep Belief Network for Guitar Playing Technique Classification.
IEEE Trans. Cybern., 2022

Heuristic Attention Representation Learning for Self-Supervised Pretraining.
Sensors, 2022

Self-Supervised Learning Framework toward State-of-the-Art Iris Image Segmentation.
Sensors, 2022

Convolutional Blur Attention Network for Cell Nuclei Segmentation.
Sensors, 2022

A 72-dB SNDR 130-MS/s 0.8-mW Pipelined-SAR ADC Using a Distributed Averaging Correlated Level Shifting Ring Amplifier.
IEEE J. Solid State Circuits, 2022

A Novel Self-Knowledge Distillation Approach with Siamese Representation Learning for Action Recognition.
CoRR, 2022

Speech Separation Using Augmented-Discrimination Learning on Squash-Norm Embedding Vector and Node Encoder.
IEEE Access, 2022

A 9.8-fJ/conv.-step FoMW 8b 2.5-GS/s Single-Channel CDAC-Assisted Subranging ADC with Reference-Embedded Comparators.
Proceedings of the IEEE Symposium on VLSI Technology and Circuits (VLSI Technology and Circuits 2022), 2022

A Comparative Study of Cross-Model Universal Adversarial Perturbation for Face Forgery.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

NCU1415 at ROCLING 2022 Shared Task: A light-weight transformer-based approach for Biomedical Name Entity Recognition.
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing, 2022

Low-Resource Speech Recognition Based on Transfer Learning.
Proceedings of the RIVF International Conference on Computing and Communication Technologies, 2022

A Wise Matrix Factorization Model for Image Representation.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2022

A 0.82mW 14b 130MS/S Pipelined-SAR ADC With a Distributed Averaging Correlated Level Shifting (DACLS) Ringamp and Bypass-Window Backend.
Proceedings of the IEEE International Solid-State Circuits Conference, 2022

Lightweight End-To-End Deep Learning Model For Music Source Separation.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

(2+1)D Distilled ShuffleNet: A Lightweight Unsupervised Distillation Network for Human Action Recognition.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Fingerprint Liveness Detection Using Denoised-Bayes Shrink Wavelet and Aggregated Local Spatial and Frequency Features.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2022

Single-Channel Target Speaker Extraction System with Attention Enhancement.
Proceedings of the IEEE International Conference on Consumer Electronics - Taiwan, 2022

Selective Mutual Learning: An Efficient Approach for Single Channel Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2022

Code-switched Text Data Augmentation for Chinese-English Mixed Speech Recognition.
Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022

Fingerprint Liveness Detection Using Handcrafted Feature Descriptors and Neural Network.
Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022

2021
Self-Supervised Learning via multi-Transformation Classification for Action Recognition.
CoRR, 2021

Teaching Yourself: A Self-Knowledge Distillation Approach to Action Recognition.
IEEE Access, 2021

A Novel Self-Knowledge Distillation Approach with Siamese Representation Learning for Action Recognition.
Proceedings of the International Conference on Visual Communications and Image Processing, 2021

Occluded Face Recognition Using Sparse Complex Matrix Factorization with Ridge Regularization.
Proceedings of the International Symposium on Intelligent Signal Processing and Communication Systems, 2021

Dual-Masking Wind Noise Reduction System Based on Recurrent Neural Network.
Proceedings of the International Symposium on Intelligent Signal Processing and Communication Systems, 2021

Deep Residual and Deep Dense Attentions in English Chinese Translation.
Proceedings of the IEEE International Conference on Consumer Electronics-Taiwan, 2021

Evaluation of Attention Mechanisms on Text to Speech.
Proceedings of the IEEE International Conference on Consumer Electronics-Taiwan, 2021

Modified Attention Spatial Convolution Model for Skin Lesion Segmentation.
Proceedings of the IEEE International Conference on Consumer Electronics-Taiwan, 2021

Sound Event Localization and Detection Based on Time-Frequency Separable Convolutional Compression Network.
Proceedings of the 10th IEEE Global Conference on Consumer Electronics, 2021

Single Channel Speech Separation using Enhanced Learning on Embedding Features.
Proceedings of the 10th IEEE Global Conference on Consumer Electronics, 2021

Facial Expression Recognition Using Sparse Complex Matrix Factorization with Ridge Term Regularization.
Proceedings of the 10th IEEE Global Conference on Consumer Electronics, 2021

A Fusion Methodology of AKAZE and Neural Network for Fingerprint Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Face Anti-Spoofing Using Multi-Branch CNN.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Partial Fingerprint on Combined Evaluation using Deep Learning and Feature Descriptor.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Sound Events Recognition and Retrieval Using Multi-Convolutional-Channel Sparse Coding Convolutional Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

A Calibration-Free 14-b 0.7-mW 100-MS/s Pipelined-SAR ADC Using a Weighted- Averaging Correlated Level Shifting Technique.
IEEE J. Solid State Circuits, 2020

Embedded draw-down constraint using ensemble learning for stock trading.
J. Intell. Fuzzy Syst., 2020

Learning to Remember Beauty Products.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

16.4 A Calibration-Free 71.7dB SNDR 100MS/s 0.7mW Weighted-Averaging Correlated Level Shifting Pipelined SAR ADC with Speed-Enhancement Scheme.
Proceedings of the 2020 IEEE International Solid- State Circuits Conference, 2020

Two-Phase Instance Segmentation for Whiteleg Shrimp Larvae Counting.
Proceedings of the 2020 IEEE International Conference on Consumer Electronics (ICCE), 2020

Transfer Learning for Gender and Age Prediction.
Proceedings of the IEEE International Conference on Consumer Electronics - Taiwan, 2020

Encoder-Recurrent Decoder Network for Single Image Dehazing.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
A Deep Neural Network for Real-Time Driver Drowsiness Detection.
IEICE Trans. Inf. Syst., 2019

Locality Preserved Joint Nonnegative Matrix Factorization for Speech Emotion Recognition.
IEICE Trans. Inf. Syst., 2019

Large Basic Cone and Sparse Subspace Constrained Nonnegative Matrix Factorization With Kullback-Leibler Divergence for Data Representation.
IEEE Intell. Syst., 2019

Deep learning and SURF for automated classification and detection of calcaneus fractures in CT images.
Comput. Methods Programs Biomed., 2019

A 15-bit 20 MS/s SHA-Less Pipelined ADC Achieving 73.7 dB SNDR with Averaging Correlated Level Shifting Technique.
Proceedings of the International Symposium on VLSI Design, Automation and Test, 2019

Bone-Conducted Speech Enhancement Using Hierarchical Extreme Learning Machine.
Proceedings of the Increasing Naturalness and Flexibility in Spoken Dialogue Interaction, 2019

Comparative Study of Masking and Mapping Based on Hierarchical Extreme Learning Machine for Speech Enhancement.
Proceedings of the 2019 International Symposium on Intelligent Signal Processing and Communication Systems, 2019

Sentiment Analysis Using Residual Learning with Simplified CNN Extractor.
Proceedings of the IEEE International Symposium on Multimedia, 2019

Deep Learning Based Vietnamese Diacritics Restoration.
Proceedings of the IEEE International Symposium on Multimedia, 2019

Object Bounding Transformed Network for End-to-End Semantic Segmentation.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

CoNet: Compact and Low-Cost CNN for Image Classification.
Proceedings of the IEEE International Conference on Consumer Electronics - Taiwan, 2019

Video Captioning Based on Joint Image-Audio Deep Learning Techniques.
Proceedings of the 9th IEEE International Conference on Consumer Electronics, 2019

Speaker Characterization Using TDNN-LSTM Based Speaker Embedding.
Proceedings of the IEEE International Conference on Acoustics, 2019

Audio-Visual Speech Enhancement using Hierarchical Extreme Learning Machine.
Proceedings of the 27th European Signal Processing Conference, 2019

Age and Gender Recognition Using Multi-task CNN.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Convolutional Attention Model for Retinal Edema Segmentation.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Compressed Multimodal Hierarchical Extreme Learning Machine for Speech Enhancement.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Sound Event Recognition Using Auditory-Receptive-Field Binary Pattern and Hierarchical-Diving Deep Belief Network.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Predicting the Probability Density Function of Music Emotion Using Emotion Space Mapping.
IEEE Trans. Affect. Comput., 2018

Projective complex matrix factorization for facial expression recognition.
EURASIP J. Adv. Signal Process., 2018

Single-Channel Speech Separation Based on Gaussian Process Regression.
Proceedings of the 2018 IEEE International Symposium on Multimedia, 2018

Learning a Hierarchical Latent Semantic Model for Multimedia Data.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Locality Preserving Discriminative Complex-Valued Latent Variable Model.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Playing Technique Classification Based on Deep Collaborative Learning of Variational Auto-Encoder and Gaussian Process.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Depth Human Action Recognition Based on Convolution Neural Networks and Principal Component Analysis.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Fast-LSTM acoustic model for distant speech recognition.
Proceedings of the IEEE International Conference on Consumer Electronics, 2018

Acoustic scene classification using convolutional neural networks and multi-scale multi-feature extraction.
Proceedings of the IEEE International Conference on Consumer Electronics, 2018

Complex-Valued Gaussian Process Latent Variable Model for Phase-Incorporating Speech Enhancement.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Locality-Preserving Complex-Valued Gaussian Process Latent Variable Model for Robust Face Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Image Representation Using Supervised and Unsupervised Learning Methods on Complex Domain.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Speaker Identification Using Discriminative Features and Sparse Representation.
IEEE Trans. Inf. Forensics Secur., 2017

Spectral-temporal receptive fields and MFCC balanced feature extraction for robust speaker recognition.
Multim. Tools Appl., 2017

Program Guardian: screening system with a novel speaker recognition approach for smart TV.
Multim. Tools Appl., 2017

Music emotion recognition using PSO-based fuzzy hyper-rectangular composite neural networks.
IET Signal Process., 2017

Maximum Volume Constrained Graph Nonnegative Matrix Factorization for Facial Expression Recognition.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017

A New Approach of Matrix Factorization on Complex Domain for Data Representation.
IEICE Trans. Inf. Syst., 2017

Single channel source separation using graph sparse NMF and adaptive dictionary learning.
Intell. Data Anal., 2017

Self-Gated Recurrent Neural Networks for Human Activity Recognition on Wearable Devices.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Discriminative Training of Complex-valued Deep Recurrent Neural Network for Singing Voice Separation.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Hierarchical Representation Based on Bayesian Nonparametric Tree-Structured Mixture Model for Playing Technique Classification.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Asymmetrie Kernel Convolutional Neural Network for acoustic scenes classification.
Proceedings of the IEEE International Symposium on Consumer Electronics, 2017

Recognition and retrieval of sound events using sparse coding convolutional neural network.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Source separation using dictionary learning and deep recurrent neural network with locality preserving constraint.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Improved convolutional neural network based scene classification using long short-term memory and label relations.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Hand gesture recognition based on Bayesian sensing hidden Markov models and Bhattacharyya divergence.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Hierarchical joint-guided networks for semantic image segmentation.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Dynamic tracking attention model for action recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Fully complex deep neural network for phase-incorporating monaural source separation.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Multi-pitch streaming of interwoven streams.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Exemplar-embed complex matrix factorization for facial expression recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Kernel weighted Fisher sparse analysis on multiple maps for audio event recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Acoustic scene classification using self-determination convolutional neural network.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Automatic vehicle classification using center strengthened convolutional neural network.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
VLSI Design for Convolutive Blind Source Separation.
IEEE Trans. Circuits Syst. II Express Briefs, 2016

Compressive Sensing-Based Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Complex Matrix Factorization for Face Recognition.
CoRR, 2016

Phase-incorporating Speech Enhancement Based on Complex-valued Gaussian Process Latent Variable Model.
CoRR, 2016

多通道之多重音頻串流方法之研究(Multi-channel Source Clustering of Polyphonic Music) [In Chinese].
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, 2016

Transportation Mode Detection on Mobile Devices Using Recurrent Nets.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Locality-preserving K-SVD Based Joint Dictionary and Classifier Learning for Object Recognition.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Incorporating local environment information with ensemble neural networks to robust automatic speech recognition.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Spatial dispersion constrained NMF for monaural source separation.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Improving iris image segmentation in unconstrained environments using NMF-based approach.
Proceedings of the IEEE International Conference on Consumer Electronics-Taiwan, 2016

A novel approach for single channel source separation.
Proceedings of the IEEE International Conference on Consumer Electronics-Taiwan, 2016

NMF-based image segmentation.
Proceedings of the IEEE International Conference on Consumer Electronics-Taiwan, 2016

Robust face verification via Bayesian sparse representation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Speech emotion classification using multiple kernel Gaussian process.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
VLSI Design for SVM-Based Speaker Verification System.
IEEE Trans. Very Large Scale Integr. Syst., 2015

Speech Emotion Verification Using Emotion Variance Modeling and Discriminant Scale-Frequency Maps.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Robust Environmental Sound Recognition With Fast Noise Suppression for Home Automation.
IEEE Trans Autom. Sci. Eng., 2015

Speaker Identification With Whispered Speech for the Access Control System.
IEEE Trans Autom. Sci. Eng., 2015

Hierarchical Dirichlet Process Mixture Model for Music Emotion Recognition.
IEEE Trans. Affect. Comput., 2015

結合β距離與圖形正規限制式之非負矩陣分解應用於單通道訊號源分離(Monaural Source Separation Using Nonnegative Matrix Factorization with Graph Regularization Constraint) [In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015

類神經網路訓練結合環境群集及專家混合系統於強健性語音辨識(Automatic Speech Recognition using Neural Network based Acoustic Model with the Environment Clustering and Mixture of Experts Algorithms) [In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015

MediaEval 2015: Recurrent Neural Network Approach to Emotion in Music Tack.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

Lip-based visual speech recognition system.
Proceedings of the International Carnahan Conference on Security Technology, 2015

Latent dirichlet allocation based blog analysis for criminal intention detection system.
Proceedings of the International Carnahan Conference on Security Technology, 2015

Automatic recognition of audio event using dynamic local binary patterns.
Proceedings of the IEEE International Conference on Consumer Electronics - Taiwan, 2015

News topics categorization using latent Dirichlet allocation and sparse representation classifier.
Proceedings of the IEEE International Conference on Consumer Electronics - Taiwan, 2015

Liver segmentation from 3D abdominal CT images.
Proceedings of the IEEE International Conference on Consumer Electronics - Taiwan, 2015

Kernel Sparse Representation Classifier with Center Enhanced SPM for Vehicle Classification.
Proceedings of the 39th IEEE Annual Computer Software and Applications Conference, 2015

Single Channel Source Separation Using Sparse NMF and Graph Regularization.
Proceedings of the ASE BigData & SocialInformatics 2015, 2015

Bayesian Sensing Hidden Markov Model for Hand Gesture Recognition.
Proceedings of the ASE BigData & SocialInformatics 2015, 2015

Music emotion recognition using deep Gaussian process.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Genre based emotion annotation for music in noisy environment.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014
Mixed Sound Event Verification on Wireless Sensor Network for Home Automation.
IEEE Trans. Ind. Informatics, 2014

Gabor-Based Nonuniform Scale-Frequency Map for Environmental Sound Classification in Home Automation.
IEEE Trans Autom. Sci. Eng., 2014

基於稀疏表示之語者識別 (Sparse Representation Based Speaker Identification) [In Chinese].
Proceedings of the 26th Conference on Computational Linguistics and Speech Processing, 2014

Spectral-temporal receptive fields and MFCC balanced feature extraction for noisy speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2D semi-NMF of scale-frequency map for environmental sound classification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Robust emotion recognition in live music using noise suppression and a hierarchical sparse representation classifier.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
A Novel Fast Mode Decision Algorithm for H.264/AVC Using Particle Swarm Optimization.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2013

混合聲音事件驗證在家庭自動化之應用 (Home environmental sound recognition) [In Chinese].
Proceedings of the 25th Conference on Computational Linguistics and Speech Processing, 2013

Blind Signal Separation with Speech Enhancement.
Proceedings of the Advanced Technologies, Embedded and Multimedia for Human-centric Computing, 2013

A Framework Design for Human-Robot Interaction.
Proceedings of the Advanced Technologies, Embedded and Multimedia for Human-centric Computing, 2013

Novel Mutual Information Analysis of Attentive Motion Entropy Algorithm for Sports Video Summarization.
Proceedings of the Advanced Technologies, Embedded and Multimedia for Human-centric Computing, 2013

Happiness detection in music using hierarchical SVMs with dual types of kernels.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
VLSI Design of an SVM Learning Core on Sequential Minimal Optimization Algorithm.
IEEE Trans. Very Large Scale Integr. Syst., 2012

Fast Mode Decision for H.264/AVC Based on Rate-Distortion Clustering.
IEEE Trans. Multim., 2012

A new hybrid and dynamic fusion of multiple experts for intelligent porch system.
Expert Syst. Appl., 2012

基於稀疏成份分析之旋積盲訊號源分離方法 (Convolutive Blind Source Separation Based on Sparse Component Analysis) [In Chinese].
Proceedings of the 24th Conference on Computational Linguistics and Speech Processing, 2012

SVM-Based Sound Classification Based on MPEG-7 Audio LLDs and Related Enhanced Features.
Proceedings of the Convergence and Hybrid Information Technology, 2012

2011
Hardware/software co-design for fast-trainable speaker identification system based on SMO.
Proceedings of the IEEE International Conference on Systems, 2011

2010
Dynamic Fixed-Point Arithmetic Design of Embedded SVM-Based Speaker Identification System.
Proceedings of the Advances in Neural Networks, 2010

2009
A Novel Video Summarization Based on Mining the Story-Structure and Semantic Relations Among Concept Entities.
IEEE Trans. Multim., 2009

Personal Spoken Sentence Retrieval Using Two-Level Feature Matching and MPEG-7 Audio LLDs.
J. Inf. Sci. Eng., 2009

Hardware-software co-design of a speech translation embedded system.
J. Embed. Comput., 2009

VLSI Design of Sequential Minimal Optimization Algorithm for SVM Learning.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

Video Knowledge Augmentation based on Summarized Contents and Online Media.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

Stress Detection Based on Multi-class Probabilistic Support Vector Machines for Accented English Speech.
Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009

2008
Design and Implementation of Subspace-Based Speech Enhancement Under In-Car Noisy Environments.
IEEE Trans. Veh. Technol., 2008

Intensity Gradient Technique for Efficient Intra-Prediction in H.264/AVC.
IEEE Trans. Circuits Syst. Video Technol., 2008

The design of a speech interactivity embedded module and its applications for mobile consumer devices.
IEEE Trans. Consumer Electron., 2008

Robust Environmental Sound Recognition for Home Automation.
IEEE Trans Autom. Sci. Eng., 2008

Motion Entropy Feature and Its Applications to Event-Based Segmentation of Sports Video.
EURASIP J. Adv. Signal Process., 2008

An Embedded System Design for Speech Command Recognition using Improved AMDF-based Pitch Features.
Proceedings of the 2008 International Conference on Embedded Systems & Applications, 2008

2007
A Fast Mode Decision Algorithm and Its VLSI Design for H.264/AVC Intra-Prediction.
IEEE Trans. Circuits Syst. Video Technol., 2007

An ARM-Based System-on-a-Programmable-Chip Architecture for Spoken Language Translation.
IEEE Trans. Circuits Syst. II Express Briefs, 2007

Unsupervised Speaker Change Detection Using SVM Training Misclassification Rate.
IEEE Trans. Computers, 2007

A Block-Based Architecture for Lifting Scheme Discrete Wavelet Transform.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2007

Critical Band Subspace-Based Speech Enhancement Using SNR and Auditory Masking Aware Technique.
IEICE Trans. Inf. Syst., 2007

Robust Speaker Identification and Verification.
IEEE Comput. Intell. Mag., 2007

Event-Based Segmentation of Sports Video Using Motion Entropy.
Proceedings of the Ninth IEEE International Symposium on Multimedia, 2007

Efficient Intra Prediction in H.264 Based on Intensity Gradient Approach.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

2006
Efficient news video querying and browsing based on distributed news video servers.
IEEE Trans. Multim., 2006

Multiband Subspace Tracking Speech Enhancement for In-Car Human Computer Speech Interaction.
J. Inf. Sci. Eng., 2006

Projection Based Adaptive Window Size Selection for Efficient Motion Estimation in H.264/AVC.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2006

A novel fast algorithm for intra mode decision in H.264/AVC encoders.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

Environmental Sound Classification using Hybrid SVM/KNN Classifier and MPEG-7 Audio Low-Level Descriptor.
Proceedings of the International Joint Conference on Neural Networks, 2006

Content-Based Audio Classification Using Support Vector Machines and Independent Component Analysis.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Robust Speaker Recognition using SNR-Aware Subspace-Based Enhancement and Probabilistic SVMs.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

An ARM-Based Embedded System Design for Speech-to-Speech Translation.
Proceedings of the Embedded and Ubiquitous Computing, International Conference, 2006

2005
以支援向量機為基礎之新穎語者切換偵測演算法 (A Novel Algorithm for Speaker Change Detection Based on Support Vector Machine) [In Chinese].
Proceedings of the 17th Conference on Computational Linguistics and Speech Processing, 2005

Translation Divergence Analysis and Processing for Mandarin-English Parallel Text Exploitation.
Proceedings of the 17th Conference on Computational Linguistics and Speech Processing, 2005

VLSI design of a very low bit rate speech decoder.
Proceedings of the Third IASTED International Conference on Circuits, 2005

2004
Efficient Coding Translation of GSM and G.729 Speech Coders across Mobile and IP Networks.
IEICE Trans. Inf. Syst., 2004

2002
Chip design of portable speech memopad suitable for persons with visual disabilities.
IEEE Trans. Speech Audio Process., 2002

Chip design of MFCC extraction for speech recognition.
Integr., 2002

VLSI Architecture and Implementation for Speech Recognizer Based on Discriminative Bayesian Neural Network.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2002

2001
A voicing-driven packet loss recovery algorithm for analysis-by-synthesis predictive speech coders over Internet.
IEEE Trans. Multim., 2001

A programmable application-specific VLSI architecture for speech recognition.
Proceedings of the 2001 8th IEEE International Conference on Electronics, 2001

2000
Single chip implementation of the 1.6 kbps speech vocoder.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2000

Chip design of mel frequency cepstral coefficients for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
WWW Interface Design for Computerized Service Supporting System.
Proceedings of the Human-Computer Interaction: Ergonomics and User Interfaces, 1999


  Loading...