Shankar Kumar

Orcid: 0009-0001-4307-8102

According to our database1, Shankar Kumar authored at least 61 papers between 1995 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Measuring Re-identification Risk.
Proc. ACM Manag. Data, 2023

Heterogeneous Federated Learning Using Knowledge Codistillation.
CoRR, 2023

Towards an On-device Agent for Text Rewriting.
CoRR, 2023

Semantic Segmentation with Bidirectional Language Models Improves Long-form ASR.
CoRR, 2023

Fast Text Generation with Text-Editing Models.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Multi-Output RNN-T Joint Networks for Multi-Task Learning of ASR and Auxiliary Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2023

Long-Form Speech Translation through Segmentation with Finite-State Decoding Constraints on Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
Improved Long-Form Spoken Language Translation with Large Language Models.
CoRR, 2022

Conciseness: An Overlooked Language Task.
CoRR, 2022

Simple and Effective Gradient-Based Tuning of Sequence-to-Sequence Models.
CoRR, 2022

Text Generation with Text-Editing Models.
CoRR, 2022

Scaling Language Model Size in Cross-Device Federated Learning.
CoRR, 2022

Transformer-based Models of Text Normalization for Speech Applications.
CoRR, 2022

Jam or Cream First? Modeling Ambiguity in Neural Machine Translation with SCONES.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition.
Proceedings of the Interspeech 2022, 2022

Capitalization Normalization for Language Modeling with an Accurate and Efficient Hierarchical RNN Model.
Proceedings of the IEEE International Conference on Acoustics, 2022

Uncertainty Determines the Adequacy of the Mode and the Tractability of Decoding in Sequence-to-Sequence Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Position-Invariant Truecasing with a Word-and-Character Hierarchical Recurrent Neural Network.
CoRR, 2021

Lookup-Table Recurrent Language Models for Long Tail Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Synthetic Data Generation for Grammatical Error Correction with Tagged Corruption Models.
Proceedings of the 16th Workshop on Innovative Use of NLP for Building Educational Applications, 2021

Data Strategies for Low-Resource Grammatical Error Correction.
Proceedings of the 16th Workshop on Innovative Use of NLP for Building Educational Applications, 2021

2020
Data Weighted Training Strategies for Grammatical Error Correction.
Trans. Assoc. Comput. Linguistics, 2020

Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus.
Proceedings of the Interspeech 2020, 2020

Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Seq2Edits: Sequence Transduction Using Span-level Edit Operations.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
Neural Language Modeling with Visual Features.
CoRR, 2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.
CoRR, 2019

Corpora Generation for Grammatical Error Correction.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

2018
Weakly Supervised Grammatical Error Correction using Iterative Decoding.
CoRR, 2018

No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Modeling Non-Linguistic Contextual Signals in LSTM Language Models Via Domain Adaptation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

RADMM: Recurrent Adaptive Mixture Model with Applications to Domain Robust Language Modeling.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Conversational Neural Language Model for Speech Recognition in Digital Assistants.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Approaches for Neural-Network Language Model Adaptation.
Proceedings of the Interspeech 2017, 2017

Lattice rescoring strategies for long short term memory language models in speech recognition.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
NN-Grams: Unifying Neural Network and n-Gram Language Models for Speech Recognition.
Proceedings of the Interspeech 2016, 2016

2015
Multilingual Open Relation Extraction Using Cross-lingual Projection.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

2012
Large Scale Language Modeling in Automatic Speech Recognition
CoRR, 2012

2010
Model Combination for Machine Translation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Expected Sequence Similarity Maximization.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

2009
Design validation of service delivery platform using modeling and simulation.
IEEE Commun. Mag., 2009

Efficient Minimum Error Rate Training and Minimum Bayes-Risk Decoding for Translation Hypergraphs and Lattices.
Proceedings of the ACL 2009, 2009

2008
Video suggestion and discovery for youtube: taking random walks through the view graph.
Proceedings of the 17th International Conference on World Wide Web, 2008

Lattice Minimum Bayes-Risk Decoding for Statistical Machine Translation.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

2007
Segmentation and alignment of parallel text for statistical machine translation.
Nat. Lang. Eng., 2007

Improving Word Alignment with Bridge Languages.
Proceedings of the EMNLP-CoNLL 2007, 2007

2006
Corrections to "Segmental minimum Bayes-risk decoding for automatic speech recognition".
IEEE Trans. Speech Audio Process., 2006

A weighted finite state transducer translation template model for statistical machine translation.
Nat. Lang. Eng., 2006

2005
Local Phrase Reordering Models for Statistical Machine Translation.
Proceedings of the HLT/EMNLP 2005, 2005

2004
Segmental minimum Bayes-risk decoding for automatic speech recognition.
IEEE Trans. Speech Audio Process., 2004

A Smorgasbord of Features for Statistical Machine Translation.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004

Minimum Bayes-Risk Decoding for Statistical Machine Translation.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004

2003
A Weighted Finite State Transducer Implementation of the Alignment Template Model for Statistical Machine Translation.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

2002
Risk based lattice cutting for segmental minimum Bayes-risk decoding.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Minimum Bayes-Risk Word Alignments of Bilingual Texts.
Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, 2002

2001
Normalization of non-standard words.
Comput. Speech Lang., 2001

Confidence based lattice segmentation and minimum Bayes-risk decoding.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Unifying HMM and phone-pair segment models.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Segmental minimum Bayes-risk ASR voting strategies.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1996
Method for free-energy calculations using iterative techniques.
J. Comput. Chem., 1996

1995
Multidimensional Free-Energy Calculations Using the Weighted Histogram Analysis Method.
J. Comput. Chem., 1995


  Loading...