Sachin Kumar

Affiliations:
  • Carnegie Mellon University, Pittsburgh, PA, USA


According to our database, Sachin Kumar authored at least 27 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
RewardBench: Evaluating Reward Models for Language Modeling.
CoRR, 2024

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research.
CoRR, 2024

2023
What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization.
CoRR, 2023

Gen-Z: Generative Zero-Shot Text Classification with Contextualized Label Descriptions.
CoRR, 2023

SSD-2: Scaling and Inference-time Fusion of Diffusion Language Models.
CoRR, 2023

Assessing Language Model Deployment with Risk Cards.
CoRR, 2023

Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

On the Blind Spots of Model-Based Evaluation Metrics for Text Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Constrained Sampling from Language Models via Langevin Dynamics in Embedding Spaces.
CoRR, 2022


Referee: Reference-Free Sentence Summarization with Sharper Controllability through Symbolic Knowledge Distillation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Gradient-based Constrained Sampling from Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
Improving the Diversity of Unsupervised Paraphrasing with Embedding Outputs.
CoRR, 2021

An Exploration of Data Augmentation Techniques for Improving English to Tigrinya Translation.
Proceedings of the 2nd AfricaNLP Workshop Proceedings, AfricaNLP@EACL 2021, Virtual Event, 2021

Controlled Text Generation as Continuous Optimization with Multiple Constraints.
Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Machine Translation into Low-resource Language Varieties.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Neural Abstractive Summarization with Structural Attention.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

End-to-End Differentiable GANs for Text Generation.
Proceedings of the "I Can't Believe It's Not Better!" at NeurIPS Workshops, 2020

A Deep Reinforced Model for Zero-Shot Cross-Lingual Summarization with Bilingual Semantic Similarity Rewards.
Proceedings of the Fourth Workshop on Neural Generation and Translation, 2020

2019
Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs.
Proceedings of the 7th International Conference on Learning Representations, 2019

Topics to Avoid: Demoting Latent Confounds in Text Classification.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

A Margin-based Loss with Synthetic Negative Samples for Continuous-output Machine Translation.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019

2018
Mining & Summarizing E-petitions for Enhanced Understanding of Public Opinion.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

2017
Earth Mover's Distance Pooling over Siamese LSTMs for Automatic Short Answer Grading.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

