Brian Kan-Wing Mak

Orcid: 0000-0001-6787-5555

According to our database1, Brian Kan-Wing Mak authored at least 118 papers between 1991 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Towards Online Sign Language Recognition and Translation.
CoRR, 2024

A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars.
CoRR, 2024

2023
Bayesian Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

On the Audio-visual Synchronization for Lip-to-Speech Synthesis.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Natural Language-Assisted Sign Language Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Hardware-Control: Instrument control and automation package.
J. Open Source Softw., 2022

Improving Continuous Sign Language Recognition with Consistency Constraints and Signer Removal.
CoRR, 2022

Two-Stream Network for Sign Language Recognition and Translation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Local Context-aware Self-attention for Continuous Sign Language Recognition.
Proceedings of the Interspeech 2022, 2022

Synthesizing Near Native-accented Speech for a Non-native Speaker by Imitating the Pronunciation and Prosody of a Native Speaker.
Proceedings of the Interspeech 2022, 2022

C<sup>2</sup>SLR: Consistency-enhanced Continuous Sign Language Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Access on Demand: Real-time, Multi-modal Accessibility for the Deaf and Hard-of-Hearing based on Augmented Reality.
Proceedings of the 24th International ACM SIGACCESS Conference on Computers and Accessibility, 2022

2021
Non-Parallel Many-To-Many Voice Conversion by Knowledge Transfer from a Text-To-Speech Model.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Comparative Study of Acoustic and Linguistic Features Classification for Alzheimer's Disease Detection.
Proceedings of the IEEE International Conference on Acoustics, 2021

On-The-Fly Data Augmentation for Text-to-Speech Style Transfer.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Transformer based Multilingual document Embedding model.
CoRR, 2020

Orthogonality Regularizations for End-to-End Speaker Verification.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Multi-Lingual Multi-Speaker Text-to-Speech Synthesis for Voice Cloning with Online Speaker Enrollment.
Proceedings of the Interspeech 2020, 2020

Orthogonal Training for Text-Independent Speaker Verification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Stochastic Fine-Grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Mixup Learning Strategies for Text-Independent Speaker Verification.
Proceedings of the Interspeech 2019, 2019

Recurrent Poisson Process Unit for Speech Recognition.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Denoised Senone I-Vectors for Robust Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

DNN-Based Score Calibration With Multitask Learning for Noise Robust Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Fast derivation of neural network based document vectors with distance constraint and negative sampling.
CoRR, 2018

Domain Adaptation of End-to-end Speech Recognition in Low-Resource Settings.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Subspace Based Sequence Discriminative Training of LSTM Acoustic Models with Feed-Forward Layers.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

WaveNet MH-SRU: Deep and Wide Multiple-history Simple Recurrent Unit for Speech Recognition.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Multi-Head Attention for End-to-End Neural Machine Translation.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification.
Proceedings of the Interspeech 2018, 2018

Fast Derivation of Cross-lingual Document Vectors from Self-attentive Neural Machine Translation Model.
Proceedings of the Interspeech 2018, 2018

learning Effective Factorized Hidden Layer Bases Using Student-Teacher Training for LSTM Acoustic Model Adaptation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

End-To-End Low-Resource Lip-Reading with Maxout Cnn and Lstm.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Learning Factorized Transforms for Unsupervised Adaptation of LSTM-RNN Acoustic Models.
Proceedings of the Interspeech 2017, 2017

To Improve the Robustness of LSTM-RNN Acoustic Models Using Higher-Order Feedback from Multiple Histories.
Proceedings of the Interspeech 2017, 2017

Speeding up softmax computations in DNN-based large vocabulary speech recognition by senone weight vector selection.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

An investigation into learning effective speaker subspaces for robust unsupervised DNN adaptation.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Derivation of Document Vectors from Adaptation of LSTM Language Model.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Unsupervised adaptation of student DNNS learned from teacher RNNS for improved ASR performance.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
An investigation of adaptation techniques for building acoustic models for hearing-impaired children in a CAPT application.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Senone I-vectors for robust speaker verification.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

2015
Multitask Learning of Deep Neural Networks for Low-Resource Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Distinct triphone acoustic modeling using deep neural networks.
Proceedings of the INTERSPEECH 2015, 2015

The harp of light: a musical string projection mapping.
Proceedings of the 12th International Conference on Advances in Computer Entertainment Technology, 2015

2014
Eigentrigraphemes for under-resourced languages.
Speech Commun., 2014

Modeling inter-cluster and intra-cluster discrimination among triphones.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Joint sequence training of phone and grapheme acoustic model based on multi-task learning deep neural networks.
Proceedings of the INTERSPEECH 2014, 2014

Subspace Gaussian mixture model with state-dependent subspace dimensions.
Proceedings of the IEEE International Conference on Acoustics, 2014

Joint acoustic modeling of triphones and trigraphemes by multi-task learning deep neural networks for low-resource speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Eigentriphones for Context-Dependent Acoustic Modeling.
IEEE Trans. Speech Audio Process., 2013

Distinct triphone modeling by reference model weighting.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Speaker-ensemble hidden Markov modeling for automatic speech recognition.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Welcome message from the technical program chairs.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Transition probabilities are more important than we once thought.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Derivation of eigentriphones by weighted principal component analysis.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Subspace high-density discrete hidden Markov model for automatic speech recognition.
Proceedings of the 20th European Signal Processing Conference, 2012

2011
A Fully Automated Derivation of State-Based Eigentriphones for Triphone Modeling with No Tied States Using Regularization.
Proceedings of the INTERSPEECH 2011, 2011

Eigentriphones: A basis for context-dependent acoustic modeling.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Subvector-quantized high-density discrete hidden Markov model and its re-estimation.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Problems of modeling phone deletion in conversational speech for speech recognition.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

The use of subvector quantization and discrete densities for fast GMM computation for speaker verification.
Proceedings of the INTERSPEECH 2010, 2010

Improving speech recognition by explicit modeling of phone deletions.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Maximum Penalized Likelihood Kernel Regression for Fast Adaptation.
IEEE Trans. Speech Audio Process., 2009

Fast GMM computation for speaker verification using scalar quantization and discrete densities.
Proceedings of the INTERSPEECH 2009, 2009

Automatic estimation of decoding parameters using large-margin iterative linear programming.
Proceedings of the INTERSPEECH 2009, 2009

2008
Min-max discriminative training of decoding parameters using iterative linear programming.
Proceedings of the INTERSPEECH 2008, 2008

Robust speaker verification using short-time frequency with long-time window and fusion of multi-resolutions.
Proceedings of the INTERSPEECH 2008, 2008

Discriminative training by iterative linear programming optimization.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Kernel Eigenspace-Based MLLR Adaptation.
IEEE Trans. Speech Audio Process., 2007

Boosting with anti-models for automatic language identification.
Proceedings of the INTERSPEECH 2007, 2007

A model-based estimation of phonotactic language verification performance.
Proceedings of the INTERSPEECH 2007, 2007

Robustness of several kernel-based fast adaptation methods on noisy LVCSR.
Proceedings of the INTERSPEECH 2007, 2007

2006
Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting.
IEEE Trans. Speech Audio Process., 2006

Minimization of Utterance Verification Error Rate as a Constrained Optimization Problem.
IEEE Signal Process. Lett., 2006

Joint Optimization of the Frequency-Domain and Time-Domain Transformations in Deriving Generalized Static and Dynamic MFCCs.
IEEE Signal Process. Lett., 2006

Unsupervised Speaker Adaptation Using Reference Speaker Weighting.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Automatic Audio Indexing and Audio Playback Speed Control as Tools for Language Learning.
Proceedings of the Advances in Web Based Learning, 2006

Fast Speaker Adaption Via Maximum Penalized Likelihood Kernel Regression.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Improving Reference Speaker Weighting Adaptation by the Use of Maximum-Likelihood Reference Speakers.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

A Comparison of Various Adaptation Methods for Speaker Verification With Limited Enrollment Data.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Kernel Eigenvoice Speaker Adaptation.
IEEE Trans. Speech Audio Process., 2005

Pruning Hidden Markov Models With Optimal Brain Surgeon.
IEEE Trans. Speech Audio Process., 2005

High-density discrete HMM with the use of scalar quantization indexing.
Proceedings of the INTERSPEECH 2005, 2005

A comparative study of two kernel eigenspace-based speaker adaptation methods on large vocabulary continuous speech recognition.
Proceedings of the INTERSPEECH 2005, 2005

Various Reference Speakers Determination Methods for Embedded Kernel Eigenvoice Speaker Adaptation.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Kernel Eigenspace-based MLLR Adaptation Using Multiple Regression Classes.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Discriminative auditory-based features for robust speech recognition.
IEEE Trans. Speech Audio Process., 2004

An Acoustic-Phonetic and a Model-Theoretic Analysis of Subspace Distribution Clustering Hidden Markov Models.
Int. J. Speech Technol., 2004

Speedup of kernel eigenvoice speaker adaptation by embedded kernel PCA.
Proceedings of the INTERSPEECH 2004, 2004

Improving eigenspace-based MLLR adaptation by kernel PCA.
Proceedings of the INTERSPEECH 2004, 2004

A study of various composite kernels for kernel eigenvoice speaker adaptation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Discriminative feature transformation by guided discriminative training.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Eigenvoice Speaker Adaptation via Composite Kernel PCA.
Proceedings of the Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, 2003

Pruning transitions in a hidden Markov model with optimal brain surgeon.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Joint estimation of thresholds in a bi-threshold verification problem.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Discriminative training of auditory filters of different shapes for robust speech recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
A mathematical relationship between full-band and multiband mel-frequency cepstral coefficients.
IEEE Signal Process. Lett., 2002

Performance of discriminatively trained auditory features on Aurora2 and Aurora3.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

An alternative approach of finding competing hypotheses for better minimum classification error training.
Proceedings of the IEEE International Conference on Acoustics, 2002

Discriminative auditory features for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Direct training of subspace distribution clustering hidden Markov model.
IEEE Trans. Speech Audio Process., 2001

Subspace distribution clustering hidden Markov model.
IEEE Trans. Speech Audio Process., 2001

Rapid speaker adaptation using MLLR and subspace regression classes.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Development of an asynchronous multi-band system for continuous speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Optimization of sub-band weights using simulated noisy speech in multi-band speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Asynchrony with trained transition probabilities improves performance in multi-band speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Pruning of state-tying tree using bayesian information criterion with multiple mixtures.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

MAP adaptation with subspace regression classes and tying.
Proceedings of the IEEE International Conference on Acoustics, 2000

1998
Training of context-dependent subspace distribution clustering hidden Markov model.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Training of subspace distribution clustering hidden Markov model.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Subspace distribution clustering for continuous observation density hidden Markov models.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Combining ANNs to improve phone recognition.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
Phone clustering using the bhattacharyya distance.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

The contribution of consonants versus vowels to word recognition in fluent speech.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
Tone recognition of isolated Cantonese syllables.
IEEE Trans. Speech Audio Process., 1995

1994
A robust algorithm for word boundary detection in the presence of noise.
IEEE Trans. Speech Audio Process., 1994

1992
A robust speech/non-speech detection algorithm using time and frequency-based features.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
A study of endpoint detection algorithms in adverse conditions: incidence on a DTW and HMM recognizer.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991


  Loading...