Frank Zhang

Affiliations:
  • Facebook AI, USA


According to our database, Frank Zhang authored at least 20 papers between 2019 and 2021.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.



Bibliography

2021
Scaling ASR Improves Zero and Few Shot Learning.
CoRR, 2021

Improved Language Identification Through Cross-Lingual Self-Supervised Learning.
CoRR, 2021

Benchmarking LF-MMI, CTC And RNN-T Criteria For Streaming ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Streaming Attention-Based Models with Augmented Memory for End-To-End Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Improving RNN Transducer Based ASR with Auxiliary Tasks.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Transformer in Action: A Comparative Study of Transformer-Based Acoustic Models for Large Scale Speech Recognition Applications.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021

Emformer: Efficient Memory Transformer Based Acoustic Model for Low Latency Streaming Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021

On Lattice-Free Boosted MMI Training of HMM and CTC-Based Full-Context ASR Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition.
CoRR, 2020

Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces.
CoRR, 2020

Multilingual Graphemic Hybrid ASR with Massive Data Augmentation.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces.
Proceedings of Interspeech, 2020

Streaming Transformer-Based Acoustic Models Using Self-Attention with Augmented Memory.
Proceedings of Interspeech, 2020

Weak-Attention Suppression for Transformer Based Speech Recognition.
Proceedings of Interspeech, 2020

Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model.
Proceedings of Interspeech, 2020

Transformer-Based Acoustic Modeling for Hybrid Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020

DEJA-VU: Double Feature Presentation and Iterated Loss in Deep Transformer Networks.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020

Training ASR Models By Generation of Contextual Information.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020

2019
Deja-vu: Double Feature Presentation in Deep Transformer Networks.
CoRR, 2019

Multilingual ASR with Massive Data Augmentation.
CoRR, 2019
