Chung-Cheng Chiu

Orcid: 0000-0001-9729-4778

According to our database1, Chung-Cheng Chiu authored at least 88 papers between 2004 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation.
CoRR, 2024

2023
SLM: Bridge the thin gap between speech and text foundation models.
CoRR, 2023

Efficient Adapters for Giant Speech Models.
CoRR, 2023

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages.
CoRR, 2023

Textless Direct Speech-to-Speech Translation with Discrete Speech Representation.
Proceedings of the IEEE International Conference on Acoustics, 2023

SLM: Bridge the Thin Gap Between Speech and Text Foundation Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition.
IEEE J. Sel. Top. Signal Process., 2022

Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data.
CoRR, 2022

Self-supervised learning with random-projection quantizer for speech recognition.
Proceedings of the International Conference on Machine Learning, 2022


2021
Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models.
CoRR, 2021

RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Pushing the Limits of Non-Autoregressive Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Bridging the Gap Between Streaming and Non-Streaming ASR Systems by Distilling Ensembles of CTC and RNN-T Models.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Dual-mode ASR: Unify and Improve Streaming ASR with Full-context Modeling.
Proceedings of the 9th International Conference on Learning Representations, 2021

FastEmit: Low-Latency Streaming ASR with Sequence-Level Emission Regularization.
Proceedings of the IEEE International Conference on Acoustics, 2021

Efficient Knowledge Distillation for RNN-Transducer Models.
Proceedings of the IEEE International Conference on Acoustics, 2021

Cascaded Encoders for Unifying Streaming and Non-Streaming ASR.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Better and Faster end-to-end Model for Streaming ASR.
Proceedings of the IEEE International Conference on Acoustics, 2021

Improving Streaming Automatic Speech Recognition with Non-Streaming Model Distillation on Unsupervised Data.
Proceedings of the IEEE International Conference on Acoustics, 2021

Cross-Attention Conformer for Context Modeling in Speech Enhancement for ASR.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

w2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition.
CoRR, 2020

Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling.
CoRR, 2020

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency.
CoRR, 2020

Improved Noisy Student Training for Automatic Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition.
Proceedings of the Interspeech 2020, 2020

ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context.
Proceedings of the Interspeech 2020, 2020

Conformer: Convolution-augmented Transformer for Speech Recognition.
Proceedings of the Interspeech 2020, 2020

An Attention-Based Joint Acoustic and Text on-Device End-To-End Model.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020


Specaugment on Large Scale Datasets.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Speech Sentiment Analysis via Pre-Trained Features from End-to-End ASR Models.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.
CoRR, 2019


SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition.
Proceedings of the Interspeech 2019, 2019

Leveraging Weakly Supervised Data to Improve End-to-end Speech-to-text Translation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Edge Detection Algorithm Based on Texture Blocks.
Proceedings of the IEEE 4th International Conference on Computer and Communication Systems, 2019

Recognizing Long-Form Speech Using Streaming End-to-End Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

A Comparison of End-to-End Models for Long-Form Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Monotonic Infinite Lookback Attention for Simultaneous Machine Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Compression of End-to-End Models.
Proceedings of the Interspeech 2018, 2018


Monotonic Chunkwise Attention.
Proceedings of the 6th International Conference on Learning Representations, 2018

No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Improving the Performance of Online Neural Transducer Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Minimum Word Error Rate Training for Attention-Based Sequence-to-Sequence Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Learning Hard Alignments with Variational Inference.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

State-of-the-Art Speech Recognition with Sequence-to-Sequence Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

CaLcs: Continuously Approximating Longest Common Subsequence for Sequence Level Optimization.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
An online sequence-to-sequence model for noisy speech recognition.
CoRR, 2017

A Robust Vision-Based Skyline Detection Algorithm Under Different Weather Conditions.
IEEE Access, 2017

Learning online alignments with continuous rewards policy gradient.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Contrast Enhancement Algorithm Based on Gap Adjustment for Histogram Equalization.
Sensors, 2016

A skyline detection algorithm for use in different weather and environmental conditions.
Proceedings of the 2016 IEEE International Conference on Electro Information Technology, 2016

2015
Visual Contrast Enhancement Algorithm Based on Histogram Equalization.
Sensors, 2015

Monocular Vision System for Fixed Altitude Flight of Unmanned Aerial Vehicles.
Sensors, 2015

Block-Based Connected-Component Labeling Algorithm Using Binary Decision Trees.
Sensors, 2015

Predicting Co-verbal Gestures: A Deep and Temporal Modeling Approach.
Proceedings of the Intelligent Virtual Agents - 15th International Conference, 2015

2014
Acting the part: the role of gesture on avatar identity.
Proceedings of the Seventh International Conference on Motion in Games, Playa Vista, CA, USA, November 06, 2014

An efficient scan algorithm for block-based connected component labeling.
Proceedings of the 22nd Mediterranean Conference on Control and Automation, 2014

Gesture generation with low-dimensional embeddings.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

2012
Subjective Optimization.
Proceedings of the Intelligent Virtual Agents - 12th International Conference, 2012

Personal identification by extracting SIFT features from laser speckle patterns.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Vision-Only Automatic Flight Control for Small UAVs.
IEEE Trans. Veh. Technol., 2011

Vision-based Automatic Flight Control for Small UAVs.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2011), 2011

Histogram Enhancement Using Adaptive Segmentation Algorithm.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2011), 2011

How to Train Your Avatar: A Data Driven Approach to Gesture Generation.
Proceedings of the Intelligent Virtual Agents - 11th International Conference, 2011

A style controller for generating virtual human behaviors.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

2010
A Robust Object Segmentation System Using a Probability-Based Background Extraction Algorithm.
IEEE Trans. Circuits Syst. Video Technol., 2010

Automatic Traffic Surveillance System for Vision-Based Vehicle Recognition and Tracking.
J. Inf. Sci. Eng., 2010

Real-Time Front Vehicle Detection Algorithm for an Asynchronous Binocular System.
J. Inf. Sci. Eng., 2010

Automatic Complexity Reduction in Reinforcement Learning.
Comput. Intell., 2010

Analysis of adverse drug reactions using drug and drug target interactions and graph-based methods.
Artif. Intell. Medicine, 2010

2009
Asynchronous stereo vision system for front-vehicle detection.
Proceedings of the IEEE International Conference on Acoustics, 2009

On the Construction of Initial Basis Function for Efficient Value Function Approximation.
Proceedings of the 2009 International Conference on Artificial Intelligence, 2009

2008
Classifying Proteins Related to Adverse Drug Reactions from Drug Targets Using Support Vector Machines.
Proceedings of the International Conference on Bioinformatics & Computational Biology, 2008

2007
Motorcycle Detection and Tracking System with Occlusion Segmentation.
Proceedings of the Eighth International Workshop on Image Analysis for Multimedia Interactive Services, 2007

Subgoal Identification for Reinforcement Learning and Planning in Multiagent Problem Solving.
Proceedings of the Multiagent System Technologies, 5th German Conference, 2007

AI-RPG Toolkit: Towards A Deep Model Implementation for Improvisational Virtual Drama.
Proceedings of the Intelligent Virtual Agents, 7th International Conference, 2007

Probability Analysis on Associations of Adverse Drug Events with Drug-Drug Interactions.
Proceedings of the 7th IEEE International Conference on Bioinformatics and Bioengineering, 2007

2006
A real-time wavelet-based video compression approach to intelligent video surveillance systems.
Int. J. Comput. Appl. Technol., 2006

2005
Multi-layer segmentation of complex document images.
Int. J. Pattern Recognit. Artif. Intell., 2005

A new region-based segmentation method for complex document image analysis.
Int. J. Comput. Sci. Eng., 2005

A Discriminant Analysis Based Recursive Automatic Thresholding Approach for Image Segmentation.
IEICE Trans. Inf. Syst., 2005

2004
Complex document image segmentation using localized histogram analysis with multi-layer matching and clustering.
Proceedings of the IEEE International Conference on Systems, 2004


  Loading...