Chunxi Liu

According to our database1, Chunxi Liu authored at least 51 papers between 2006 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch.
CoRR, 2023

Multi-Head State Space Model for Speech Recognition.
CoRR, 2023

Learning ASR Pathways: A Sparse Multilingual ASR Model.
Proceedings of the IEEE International Conference on Acoustics, 2023

TorchAudio 2.1: Advancing Speech Recognition, Self-Supervised Learning, and Audio Processing Components for Pytorch.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Learning ASR pathways: A sparse multilingual ASR model.
CoRR, 2022

Learning a Dual-Mode Speech Recognition Model VIA Self-Pruning.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Conformer-Based Self-Supervised Learning For Non-Speech Audio Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2022

Streaming Transformer Transducer based Speech Recognition Using Non-Causal Convolution.
Proceedings of the IEEE International Conference on Acoustics, 2022

Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Reconfigurable Antenna Array Direction Finding System Based on a Fast Search Algorithm.
Sensors, 2021

Benchmarking LF-MMI, CTC And RNN-T Criteria For Streaming ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Dual Application of Speech Enhancement for Automatic Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Improving RNN Transducer Based ASR with Auxiliary Tasks.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

2020
Fast, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces.
CoRR, 2020

Multilingual Graphemic Hybrid ASR with Massive Data Augmentation.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces.
Proceedings of the Interspeech 2020, 2020

Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model.
Proceedings of the Interspeech 2020, 2020

Transformer-Based Acoustic Modeling for Hybrid Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

DEJA-VU: Double Feature Presentation and Iterated Loss in Deep Transformer Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Deja-vu: Double Feature Presentation in Deep Transformer Networks.
CoRR, 2019

Multilingual ASR with Massive Data Augmentation.
CoRR, 2019

Pretraining by Backtranslation for End-to-End ASR in Low-Resource Settings.
Proceedings of the Interspeech 2019, 2019

Bottom-Up Unsupervised Word Discovery via Acoustic Units.
Proceedings of the 2019 IEEE Global Conference on Signal and Information Processing, 2019

2018
Low Resource Multi-modal Data Augmentation for End-to-end ASR.
CoRR, 2018

The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection.
CoRR, 2018

Low-Resource Contextual Topic Identification on Speech.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages.
Proceedings of the Interspeech 2018, 2018

2017
ASR for Under-Resourced Languages From Probabilistic Transcription.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Topic Identification for Speech Without ASR.
Proceedings of the Interspeech 2017, 2017

An empirical evaluation of zero resource acoustic unit discovery.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Adapting ASR for under-resourced languages using mismatched transcriptions.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Context-dependent point process models for keyword search and detection-based ASR.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Deep contextual language understanding in spoken dialogue systems.
Proceedings of the INTERSPEECH 2015, 2015

2014
Web video thumbnail recommendation with content-aware analysis and query-sensitive matching.
Multim. Tools Appl., 2014

A keyword search system using open source software.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Low-resource open vocabulary keyword search using point process models.
Proceedings of the INTERSPEECH 2014, 2014

2013
Research and Application of Variable Rate Fertilizer Applicator System Based on a DC Motor.
Proceedings of the Computer and Computing Technologies in Agriculture VII, 2013

2012
An effective multi-clue fusion approach for web video topic detection.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

A Novel Framework for Web Video Thumbnail Generation.
Proceedings of the Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2012

Motion Based Perceptual Distortion and Rate Optimization for Video Coding.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Cross community news event summary generation based on collaborative ranking.
Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

2011
News video story sentiment classification and ranking.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Visual perception based Lagrangian rate distortion optimization for video coding.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Query sensitive dynamic web video thumbnail generation.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

2010
The third eye: mining the visual cognition across multi-language communities.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Event based news video people classification and ranking using multimodality features.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

2009
A framework for flexible summarization of racquet sports video using multiple modalities.
Comput. Vis. Image Underst., 2009

Personalized online video recommendation by neighborhood score propagation based global ranking.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

2008
Naming faces in broadcast news video by image google.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

2006
JDL at TRECVID 2006 Shot Boundary Detection.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Extracting Story Units in Sports Video Based on Unsupervised Video Scene Clustering.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006


  Loading...