Madian Khabsa

According to our database1, Madian Khabsa authored at least 75 papers between 2010 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations.
CoRR, 2023

MART: Improving LLM Safety with Multi-round Automatic Red-Teaming.
CoRR, 2023

On the Equivalence of Graph Convolution and Mixup.
CoRR, 2023

Effective Long-Context Scaling of Foundation Models.
CoRR, 2023

The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants.
CoRR, 2023

Llama 2: Open Foundation and Fine-Tuned Chat Models.
CoRR, 2023

MMViT: Multiscale Multiview Vision Transformers.
CoRR, 2023

SVT: Supertoken Video Transformer for Efficient Video Understanding.
CoRR, 2023

Progressive Prompts: Continual Learning for Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Generating Hashtags for Short-form Videos with Guided Signals.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

MixPAVE: Mix-Prompt Tuning for Few-shot Product Attribute Value Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

MUSTIE: Multimodal Structural Transformer for Web Information Extraction.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Residual Prompt Tuning: improving prompt tuning with residual reparameterization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Logical Satisfiability of Counterfactuals for Faithful Explanations in NLI.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Uniform Masking Prevails in Vision-Language Pretraining.
CoRR, 2022

Sparse Distillation: Speeding Up Text Classification by Using Bigger Student Models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Quantifying Adaptability in Pre-trained Language Models with 500 Tasks.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

SMARTAVE: Structured Multimodal Transformer for Product Attribute Value Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision.
CoRR, 2021

Sparse Distillation: Speeding Up Text Classification by Using Bigger Models.
CoRR, 2021

Entailment as Few-Shot Learner.
CoRR, 2021

Towards Few-Shot Fact-Checking via Perplexity.
CoRR, 2021

On Unifying Misinformation Detection.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

On the Influence of Masking Policies in Intermediate Pre-training.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Studying Strategically: Learning to Mask for Closed-book QA.
CoRR, 2020

CLEAR: Contrastive Learning for Sentence Representation.
CoRR, 2020

To Pretrain or Not to Pretrain: Examining the Benefits of Pretraining on Resource Rich Tasks.
CoRR, 2020

Linformer: Self-Attention with Linear Complexity.
CoRR, 2020

Language Models as Fact Checkers?
CoRR, 2020

To Pretrain or Not to Pretrain: Examining the Benefits of Pretrainng on Resource Rich Tasks.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Keeping Notes: Conditional Natural Language Generation with a Scratchpad Mechanism.
CoRR, 2019

Keeping Notes: Conditional Natural Language Generation with a Scratchpad Encoder.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Adversarial Training for Community Question Answer Selection Based on Multi-Scale Matching.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Adversarial Training for Community Question Answer Selection Based on Multi-scale Matching.
CoRR, 2018

Identifying Task Boundaries in Digital Assistants.
Proceedings of the Companion of the The Web Conference 2018 on The Web Conference 2018, 2018

Characterizing and Supporting Question Answering in Human-to-Human Communication.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

2017
Actionable Email Intent Modeling with Reparametrized RNNs.
CoRR, 2017

User Interaction Sequences for Search Satisfaction Prediction.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Building Natural Language Interfaces to Web APIs.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Deep Sequential Models for Task Satisfaction Prediction.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

2016
Learning to identify relevant studies for systematic reviews using random forest and external information.
Mach. Learn., 2016

Random Forest DBSCAN for USPTO Inventor Name Disambiguation.
CoRR, 2016

Detecting Good Abandonment in Mobile Search.
Proceedings of the 25th International Conference on World Wide Web, 2016

Is This Your Final Answer?: Evaluating the Effect of Answers on Good Abandonment in Mobile Search.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Identifying Earmarks in Congressional Bills.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Inventor Name Disambiguation for a Patent Database Using a Random Forest and DBSCAN.
Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries, 2016

Towards Better Understanding of Academic Search.
Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries, 2016

Learning to Account for Good Abandonment in Search Success Metrics.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

2015
The CHEMDNER corpus of chemicals and drugs and its annotation principles.
J. Cheminformatics, 2015

Chemical entity extraction using CRF and an ensemble of extractors.
J. Cheminformatics, 2015

CiteSeerX: AI in a Digital Library Search Engine.
AI Mag., 2015

Big Scholarly Data in CiteSeerX: Information Extraction from the Web.
Proceedings of the 24th International Conference on World Wide Web Companion, 2015

Automatically Generating a Concept Hierarchy with Graphs.
Proceedings of the 15th ACM/IEEE-CE Joint Conference on Digital Libraries, 2015

Online Person Name Disambiguation with Constraints.
Proceedings of the 15th ACM/IEEE-CE Joint Conference on Digital Libraries, 2015

2014
Towards building a scholarly big data platform: Challenges, lessons and opportunities.
Proceedings of the IEEE/ACM Joint Conference on Digital Libraries, 2014

The feasibility of investing in manual correction of metadata for a large-scale digital library.
Proceedings of the IEEE/ACM Joint Conference on Digital Libraries, 2014

A Web Service for Scholarly Big Data Information Extraction.
Proceedings of the 2014 IEEE International Conference on Web Services, 2014

Scholarly big data information extraction and integration in the CiteSeer<sup>χ</sup> digital library.
Proceedings of the Workshops Proceedings of the 30th International Conference on Data Engineering Workshops, 2014

Migrating a Digital Library to a Private Cloud.
Proceedings of the 2014 IEEE International Conference on Cloud Engineering, 2014

Utility-Based Control Feedback in a Digital Library Search Engine: Cases in CiteSeerX.
Proceedings of the 9th International Workshop on Feedback Computing, 2014

The impact of user corrections on a crawl-based digital library: A CiteSeerX perspective.
Proceedings of the 10th IEEE International Conference on Collaborative Computing: Networking, 2014

Large scale author name disambiguation in digital libraries.
Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

CiteSeerX: AI in a Digital Library Search Engine.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Graph-based Approach to Automatic Taxonomy Generation (GraBTax).
CoRR, 2013

2012
Specialized Research Datasets in the CiteSeer<sup>x</sup> Digital Library.
D Lib Mag., 2012

Web crawler middleware for search engine digital libraries: a case study for citeseerX.
Proceedings of the Twelfth International Workshop on Web Information and Data Management, 2012

A Framework for Bridging the Gap Between Open Source Search Tools.
Proceedings of the SIGIR 2012 Workshop on Open Source Information Retrieval, 2012

Towards Building and Analyzing a Social Network of Acknowledgments in Scientific and Academic Documents.
Proceedings of the Social Computing, Behavioral - Cultural Modeling and Prediction, 2012

A system for indexing tables, algorithms and figures.
Proceedings of the 12th ACM/IEEE-CS Joint Conference on Digital Libraries, 2012

AckSeer: a repository and search engine for automatically extracted acknowledgments from digital libraries.
Proceedings of the 12th ACM/IEEE-CS Joint Conference on Digital Libraries, 2012

Entity resolution using search engine results.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2010
SeerSuite: Developing a Scalable and Reliable Application Framework for Building Digital Libraries by Crawling the Web.
Proceedings of the USENIX Conference on Web Application Development, 2010


  Loading...