Daniel Jurafsky

Orcid: 0000-0002-6459-7745

Affiliations:
  • Stanford University, USA


According to our database1, Daniel Jurafsky authored at least 271 papers between 1988 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Dialect prejudice predicts AI decisions about people's character, employability, and criminality.
CoRR, 2024

CausalGym: Benchmarking causal interpretability methods on linguistic tasks.
CoRR, 2024

How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis.
CoRR, 2024

Predicting positive transfer for improved low-resource speech recognition using acoustic pseudo-tokens.
CoRR, 2024

KTO: Model Alignment as Prospect Theoretic Optimization.
CoRR, 2024

AnthroScore: A Computational Linguistic Measure of Anthropomorphism.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

2023
Use large language models to promote equity.
CoRR, 2023

Multilingual self-supervised speech representations improve the speech recognition of low-resource African languages with codeswitching.
CoRR, 2023

Grounding or Guesswork? Large Language Models are Presumptive Grounders.
CoRR, 2023

A Benchmark for Learning to Translate a New Language from One Grammar Book.
CoRR, 2023

Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions.
CoRR, 2023

Learning the meanings of function words from grounded language using a visual question answering model.
CoRR, 2023

Othering and low prestige framing of immigrant cuisines in US restaurant reviews and large language models.
CoRR, 2023

Developing Speech Processing Pipelines for Police Accountability.
CoRR, 2023

string2string: A Modern Python Library for String-to-String Algorithms.
CoRR, 2023

Pretrain on just structure: Understanding linguistic inductive biases using transfer learning.
CoRR, 2023

Foundation Models and Fair Use.
CoRR, 2023

Navigating the Grey Area: Expressions of Overconfidence and Uncertainty in Language Models.
CoRR, 2023

Leveraging supplementary text data to kick-start automatic speech recognition system development with limited transcriptions.
CoRR, 2023

Ecosystem-level Analysis of Deployed Machine Learning Reveals Homogeneous Outcomes.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

When and Why Vision-Language Models Behave like Bags-Of-Words, and What to Do About It?
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale.
Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023

Navigating the Grey Area: How Expressions of Uncertainty and Overconfidence Affect Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Injecting structural hints: Using language models to study inductive biases in language learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Multilingual BERT has an accent: Evaluating English influences on fluency in multilingual models.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Mini But Mighty: Efficient Multilingual Pretraining with Linguistically-Informed Data Selection.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

When Do Pre-Training Biases Propagate to Downstream Tasks? A Case Study in Text Summarization.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Self-Destructing Models: Increasing the Costs of Harmful Dual Uses of Foundation Models.
Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 2023

Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Computationally Identifying Funneling and Focusing Questions in Classroom Discourse.
CoRR, 2022

How Human is Human Evaluation? Improving the Gold Standard for NLG with Utility Theory.
CoRR, 2022

Automated speech tools for helping communities process restricted-access corpora for language revival efforts.
CoRR, 2022

Picking on the Same Person: Does Algorithmic Monoculture lead to Outcome Homogenization?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Prompt-and-Rerank: A Method for Zero-Shot and Few-Shot Arbitrary Textual Style Transfer with Small Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

The Authenticity Gap in Human Evaluation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Richer Countries and Richer Representations.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Problems with Cosine as a Measure of Embedding Similarity for High Frequency Words.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Modular Domain Adaptation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Sensitivity as a Complexity Measure for Sequence Classification Tasks.
Trans. Assoc. Comput. Linguistics, 2021

On the Opportunities and Risks of Foundation Models.
CoRR, 2021

Frequency-based Distortions in Contextualized Word Embeddings.
CoRR, 2021

Leveraging neural representations for facilitating access to untranscribed speech from endangered languages.
CoRR, 2021

Causal Effects of Linguistic Properties.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Nearest Neighbor Machine Translation.
Proceedings of the 9th International Conference on Learning Representations, 2021

Focus on what matters: Applying Discourse Coherence Theory to Cross Document Coreference.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

The Emergence of the Shape Bias Results from Communicative Efficiency.
Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

SAD: A Stress Annotated Dataset for Recognizing Everyday Stressors in SMS-like Conversational Systems.
Proceedings of the CHI '21: CHI Conference on Human Factors in Computing Systems, 2021

Leveraging Pre-Trained Representations to Improve Access to Untranscribed Speech from Endangered Languages.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Attention Flows are Shapley Value Explanations.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Measuring Conversational Uptake: A Case Study on Student-Teacher Interactions.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Racial disparities in automated speech recognition.
Proc. Natl. Acad. Sci. USA, 2020

Assessing the accuracy of automatic speech recognition for psychotherapy.
npj Digit. Medicine, 2020

A Framework for the Computational Linguistic Analysis of Dehumanization.
Frontiers Artif. Intell., 2020

Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation.
CoRR, 2020

The Role of Verb Semantics in Hungarian Verb-Object Order.
CoRR, 2020

Pretraining on Non-linguistic Structure as a Tool for Analyzing Learning Bias in Language Models.
CoRR, 2020

Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning.
CoRR, 2020

Language Through a Prism: A Spectral Approach for Multiscale Language Representations.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Generalization through Memorization: Nearest Neighbor Language Models.
Proceedings of the 8th International Conference on Learning Representations, 2020

Learning Music Helps You Read: Using Transfer to Study Linguistic Structure in Language Models.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

DeSMOG: Detecting Stance in Media On Global Warming.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Utility is in the Eye of the User: A Critique of NLP Leaderboards.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

With Little Power Comes Great Responsibility.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Social Bias Frames: Reasoning about Social and Power Implications of Language.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Pretraining with Contrastive Sentence Objectives Improves Discourse Performance of Language Models.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Automatically Neutralizing Subjective Bias in Text.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Sample Efficient Text Summarization Using a Single Pre-Trained Transformer.
CoRR, 2019

Let's Make Your Request More Persuasive: Modeling Persuasive Strategies via Semi-Supervised Neural Nets on Crowdfunding Platforms.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Recursive Routing Networks: Learning to Compose Modules for Language Understanding.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Seekers, Providers, Welcomers, and Storytellers: Modeling Social Roles in Online Health Communities.
Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, 2019

From Insanely Jealous to Insanely Delicious: Computational Models for the Semantic Bleaching of English Intensifiers.
Proceedings of the 1st International Workshop on Computational Approaches to Historical Language Change, 2019

2018
Detecting Institutional Dialog Acts in Police Traffic Stops.
Trans. Assoc. Comput. Linguistics, 2018

Measuring the Evolution of a Scientific Field through Citation Frames.
Trans. Assoc. Comput. Linguistics, 2018

Word embeddings quantify 100 years of gender and ethnic stereotypes.
Proc. Natl. Acad. Sci. USA, 2018

Querying Complex Networks in Vector Space.
CoRR, 2018

Community Interaction and Conflict on the Web.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Embedding Logical Queries on Knowledge Graphs.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Noising and Denoising Natural Language: Diverse Backtranslation for Grammar Correction.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Deconfounded Lexicon Induction for Interpretable Social Science.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

RtGender: A Corpus for Studying Differential Responses to Gender.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

JESC: Japanese-English Subtitle Corpus.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Textual Analogy Parsing: What's Shared and What's Compared among Analogous Facts.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Framing and Agenda-Setting in Russian News: a Computational Analysis of Intricate Political Strategies.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

An Information-Theoretic Explanation of Adjective Ordering Preferences.
Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Automatic Detection of Incoherent Speech for Diagnosing Schizophrenia.
Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: From Keyboard to Clinic, 2018

2017
A scaffolding approach to coreference resolution integrating statistical and rule-based models.
Nat. Lang. Eng., 2017

Dialogism in the novel: A computational model of the dialogic nature of narration and quotations.
Digit. Scholarsh. Humanit., 2017

Building DNN acoustic models for large vocabulary speech recognition.
Comput. Speech Lang., 2017

JESC: Japanese-English Subtitle Corpus.
CoRR, 2017

Adversarial Learning for Neural Dialogue Generation.
CoRR, 2017

Data Distillation for Controlling Specificity in Dialogue Generation.
CoRR, 2017

Learning to Decode for Future Success.
CoRR, 2017

Writer Profiling Without the Writer's Text.
Proceedings of the Social Informatics, 2017

Predicting Sales from the Language of Product Descriptions.
Proceedings of the SIGIR 2017 Workshop On eCommerce co-located with the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Community Identity and User Engagement in a Multi-Community Landscape.
Proceedings of the Eleventh International Conference on Web and Social Media, 2017

Loyalty in Online Communities.
Proceedings of the Eleventh International Conference on Web and Social Media, 2017

Data Noising as Smoothing in Neural Network Language Models.
Proceedings of the 5th International Conference on Learning Representations, 2017

Adversarial Learning for Neural Dialogue Generation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Neural Net Models of Open-domain Discourse Coherence.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

A Two-stage Sieve Approach for Quote Attribution.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Incorporating Dialectal Variability for Socially Equitable Language Identification.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Neural Language Correction with Character-Based Attention.
CoRR, 2016

Understanding Neural Networks through Representation Erasure.
CoRR, 2016

A Simple, Fast Diverse Decoding Algorithm for Neural Generation.
CoRR, 2016

Mutual Information and Diverse Decoding Improve Neural Machine Translation.
CoRR, 2016

Citation Classification for Behavioral Analysis of a Scientific Field.
CoRR, 2016

Visualizing and Understanding Neural Models in NLP.
Proceedings of the NAACL HLT 2016, 2016

Between- and Within-Speaker Effects of Bilingualism on F0 Variation.
Proceedings of the Interspeech 2016, 2016

Ketchup, Interdisciplinarity, and the Spread of Innovation in Speech and Language Processing.
Proceedings of the Interspeech 2016, 2016

Deep Reinforcement Learning for Dialogue Generation.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Distinguishing Past, On-going, and Future Events: The EventStatus Corpus.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Cultural Shift or Linguistic Drift? Comparing Two Computational Measures of Semantic Change.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Inducing Domain-Specific Sentiment Lexicons from Unlabeled Corpora.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

The Dialogic Turn and the Performance of Gender: the English Canon 1782-2011.
Proceedings of the 11th Annual International Conference of the Alliance of Digital Humanities Organizations, 2016

Predicting the Rise and Fall of Scientific Topics from Trends in their Rhetorical Framing.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Learning multi-faceted representations of individuals from heterogeneous evidence using neural networks.
CoRR, 2015

When Are Tree Structures Necessary for Deep Learning of Representations?
CoRR, 2015

Lexicon-Free Conversational Speech Recognition with Neural Networks.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

The connection between smiling and GOAT fronting: Embodied affect in sociophonetic variation.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015

When Are Tree Structures Necessary for Deep Learning of Representations?
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Do Multi-Sense Embeddings Improve Natural Language Understanding?
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

The Users Who Say 'Ni': Audience Identification in Chinese-language Restaurant Reviews.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

A Hierarchical Neural Autoencoder for Paragraphs and Documents.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Narrative framing of consumer sentiment in online restaurant reviews.
First Monday, 2014

Increasing Deep Neural Network Acoustic Model Size for Large Vocabulary Continuous Speech Recognition.
CoRR, 2014

First-Pass Large Vocabulary Continuous Speech Recognition using Bi-Directional Recurrent DNNs.
CoRR, 2014

Inferring User Preferences by Probabilistic Logical Reasoning over Social Networks.
CoRR, 2014

Charles J. Fillmore.
Comput. Linguistics, 2014

Speaker-independent detection of child-directed speech.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Event Extraction Using Distant Supervision.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

On the Importance of Text Analysis for Stock Price Prediction.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

How to Ask for a Favor: A Case Study on the Success of Altruistic Requests.
Proceedings of the Eighth International Conference on Weblogs and Social Media, 2014

Learning to Reason Pragmatically with Cognitive Limitations.
Proceedings of the 36th Annual Meeting of the Cognitive Science Society, 2014

Easy does it: more usable CAPTCHAs.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2014

2013
Detecting friendly, flirtatious, awkward, and assertive speech in speed-dates.
Comput. Speech Lang., 2013

Deterministic Coreference Resolution Based on Entity-Centric, Precision-Ranked Rules.
Comput. Linguistics, 2013

No country for old members: user lifecycle and linguistic change in online communities.
Proceedings of the 22nd International World Wide Web Conference, 2013

Positive Diversity Tuning for Machine Translation System Combination.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Emergence of Gricean Maxims from Multi-Agent Decision Theory.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Same Referent, Different Words: Unsupervised Mining of Opaque Coreferent Mentions.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Breaking Out of Local Optima with Count Transforms and Model Recombination: A Study in Grammar Induction.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Tradition and Modernity in 20th Century Chinese Poetry.
Proceedings of the Workshop on Computational Linguistics for Literature, 2013

Implicatures and Nested Beliefs in Approximate Decentralized-POMDPs.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Generating Recommendation Dialogs by Extracting Information from User Reviews.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Linguistic Models for Analyzing and Detecting Biased Language.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

A computational approach to politeness with application to social factors.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Bootstrapping Dependency Grammar Inducers from Incomplete Sentence Fragments via Austere Models.
Proceedings of the Eleventh International Conference on Grammatical Inference, 2012

Citation-based bootstrapping for large-scale author disambiguation.
J. Assoc. Inf. Sci. Technol., 2012

Parsing Time: Learning to Interpret Time Expressions.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Learning the Central Events and Participants in Unlabeled Text.
Proceedings of the 29th International Conference on Machine Learning, 2012

Learning Attitudes and Attributes from Multi-aspect Reviews.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

Three Dependency-and-Boundary Models for Grammar Induction.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Joint Entity and Event Coreference Resolution across Documents.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Towards a Literary Machine Translation: The Role of Referential Cohesion.
Proceedings of the Workshop on Computational Linguistics for Literature, 2012

A Computational Analysis of Style, Affect, and Imagery in Contemporary Poetry.
Proceedings of the Workshop on Computational Linguistics for Literature, 2012

He Said, She Said: Gender in the ACL Anthology.
Proceedings of the Special Workshop on Rediscovering 50 Years of Discoveries@ACL 2012, 2012

Towards a Computational History of the ACL: 1980-2008.
Proceedings of the Special Workshop on Rediscovering 50 Years of Discoveries@ACL 2012, 2012

2011
Sex, food, and words: the hidden meanings behind everyday language.
Proceedings of the 24th Annual ACM Symposium on User Interface Software and Technology, 2011

A Study of Academic Collaborations in Computational Linguistics using a Latent Mixture of Authors Model.
Proceedings of the 5th ACL Workshop on Language Technology for Cultural Heritage, 2011

Using Query Patterns to Learn the Duration of Events.
Proceedings of the Ninth International Conference on Computational Semantics, 2011

LeadLag LDA: Estimating Topic Specific Leads and Lags of Information Outlets.
Proceedings of the Fifth International Conference on Weblogs and Social Media, 2011

Lateen EM: Unsupervised Training with Multiple Objectives, Applied to Dependency Grammar Induction.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Unsupervised Dependency Parsing without Gold Part-of-Speech Tags.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Punctuation: Making a Point in Unsupervised Dependency Parsing.
Proceedings of the Fifteenth Conference on Computational Natural Language Learning, 2011

Stanford's Multi-Pass Sieve Coreference Resolution System at the CoNLL-2011 Shared Task.
Proceedings of the Fifteenth Conference on Computational Natural Language Learning: Shared Task, 2011

Template-Based Information Extraction without the Templates.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010
Which words are hard to recognize? Prosodic, lexical, and disfluency factors that increase speech recognition error rates.
Speech Commun., 2010

The NXT-format Switchboard Corpus: a rich resource for investigating the syntax, semantics, pragmatics and prosody of dialogue.
Lang. Resour. Evaluation, 2010

How Good Are Humans at Solving CAPTCHAs? A Large Scale Evaluation.
Proceedings of the 31st IEEE Symposium on Security and Privacy, 2010

From Baby Steps to Leapfrog: How "Less is More" in Unsupervised Dependency Parsing.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

The Best Lexical Metric for Phrase-Based Statistical MT System Optimization.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Phrasal: A Statistical Machine Translation Toolkit for Exploring New Model Features.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, June 2, 2010, Los Angeles, California, USA, 2010

A Database of Narrative Schemas.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Parsing to Stanford Dependencies: Trade-offs between Speed and Accuracy.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

A Multi-Pass Sieve for Coreference Resolution.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Viterbi Training Improves Unsupervised Dependency Parsing.
Proceedings of the Fourteenth Conference on Computational Natural Language Learning, 2010

Who should I cite: learning literature search models from citation behavior.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Learning to Follow Navigational Directions.
Proceedings of the ACL 2010, 2010

Profiting from Mark-Up: Hyper-Text Annotations for Guided Parsing.
Proceedings of the ACL 2010, 2010

Improving the Use of Pseudo-Words for Evaluating Selectional Preferences.
Proceedings of the ACL 2010, 2010

Eye Spy: Improving Vision through Dialog.
Proceedings of the Dialog with Robots, 2010

2009
Measuring machine translation quality as semantic equivalence: A metric based on entailment features.
Mach. Transl., 2009

The effect of lexical frequency and Lombard reflex on tone hyperarticulation.
J. Phonetics, 2009

Machine Translation Evaluation with Textual Entailment Features.
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009

Disambiguating "DE" for Chinese-English Machine Translation.
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009

Stanford-UBC at TAC-KBP.
Proceedings of the Second Text Analysis Conference, 2009

Discriminative Reordering with Chinese Grammatical Relations Features.
Proceedings of the Third Workshop on Syntax and Structure in Statistical Translation, 2009

Extracting Social Meaning: Identifying Interactional Style in Spoken Conversation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

It's Not You, it's Me: Detecting Flirting and its Misperception in Speed-Dates.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Hidden Conditional Random Fields for phone recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

It's not you, it's me: Automatically extracting social meaning from speed dates.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Robust Machine Translation Evaluation with Entailment Features.
Proceedings of the ACL 2009, 2009

Distant supervision for relation extraction without labeled data.
Proceedings of the ACL 2009, 2009

Unsupervised Learning of Narrative Schemas and their Participants.
Proceedings of the ACL 2009, 2009

Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition.
Prentice Hall series in artificial intelligence, Prentice Hall, Pearson Education International, ISBN: 9780135041963, 2009

2008
Regularization and Search for Minimum Error Rate Training.
Proceedings of the Third Workshop on Statistical Machine Translation, 2008

Maximum conditional likelihood linear regression and maximum a posteriori for hidden conditional random fields speaker adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2008

Cheap and Fast - But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Studying the History of Ideas Using Topic Models.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Jointly Combining Implicit Constraints Improves Temporal Ordering.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Which Words Are Hard to Recognize? Prosodic, Lexical, and Disfluency Factors that Increase ASR Error Rates.
Proceedings of the ACL 2008, 2008

Unsupervised Learning of Narrative Event Chains.
Proceedings of the ACL 2008, 2008

2007
Resolving "You" in Multi-Party Dialog.
Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, 2007

To Memorize or to Predict: Prominence labeling in Conversational Speech.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Modelling prominence and emphasis improves unit-selection synthesis.
Proceedings of the INTERSPEECH 2007, 2007

Learning to Merge Word Senses.
Proceedings of the EMNLP-CoNLL 2007, 2007

Regularization, adaptation, and non-independent features improve hidden conditional random fields for phone classification.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Automatic detection of contrastive elements in spontaneous speech.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Disambiguating Between Generic and Referential "You" in Dialog.
Proceedings of the ACL 2007, 2007

Measuring Importance and Query Relevance in Topic-focused Multi-document Summarization.
Proceedings of the ACL 2007, 2007

Classifying Temporal Relations Between Events.
Proceedings of the ACL 2007, 2007

2006
A Dialectal Chinese Speech Recognition Framework.
J. Comput. Sci. Technol., 2006

The (Non)Utility of Linguistic Features for Predicting prominence in spontaneous speech.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

Have we met? MDP based speaker ID for robot dialogue.
Proceedings of the INTERSPEECH 2006, 2006

Limitations of MLLR adaptation with Spanish-accented English: an error analysis.
Proceedings of the INTERSPEECH 2006, 2006

Detection of word fragments in Mandarin telephone conversation.
Proceedings of the INTERSPEECH 2006, 2006

Semantic Taxonomy Induction from Heterogenous Evidence.
Proceedings of the ACL 2006, 2006

Extracting Opinion Propositions and Opinion Holders using Syntactic and Lexical Cues.
Proceedings of the Computing Attitude and Affect in Text: Theory and Applications, 2006

2005
Editorial.
Speech Commun., 2005

Support Vector Learning for Semantic Argument Classification.
Mach. Learn., 2005

Accent detection and speech recognition for Shanghai-accented Mandarin.
Proceedings of the INTERSPEECH 2005, 2005

Pitch accent prediction: effects of genre and speaker.
Proceedings of the INTERSPEECH 2005, 2005

The detection of emphatic words using acoustic and lexical features.
Proceedings of the INTERSPEECH 2005, 2005

A preliminary study of Mandarin filled pauses.
Proceedings of the ISCA Tutorial and Research Workshop (ITRW) on Disfluency in Spontaneous Speech, 2005

Semantic Role Chunking Combining Complementary Syntactic Views.
Proceedings of the Ninth Conference on Computational Natural Language Learning, 2005

Semantic Role Labeling Using Different Syntactic Views.
Proceedings of the ACL 2005, 2005

Morphological features help POS tagging of unknown words across language varieties.
Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing, 2005

A Conditional Random Field Word Segmenter for Sighan Bakeoff 2005.
Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing, 2005

2004
Learning Syntactic Patterns for Automatic Hypernym Discovery.
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

Shallow Semantc Parsing of Chinese.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004

Shallow Semantic Parsing using Support Vector Machines.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004

Parsing Arguments of Nominalizations in English and Chinese.
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

Automatic Tagging of Arabic Text: From Raw Text to Base Phrase Chunks.
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

Semantic Role Labeling by Tagging Syntactic Chunks.
Proceedings of the Eighth Conference on Computational Natural Language Learning, 2004

2003
Semantic Role Parsing: Adding Semantic Structure to Unstructured Text.
Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), 2003

The Effect of Rhythm on Structural Disambiguation in Chinese.
Proceedings of the Second Workshop on Chinese Language Processing, 2003

2002
Automatic Labeling of Semantic Roles.
Comput. Linguistics, 2002


2001
A Bayesian Model Predicts Human Parse Preference and Reading Times in Sentence Processing.
Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001

Knowledge-Free Induction of Inflectional Morphologies.
Proceedings of the Language Technologies 2001: The Second Meeting of the North American Chapter of the Association for Computational Linguistics, 2001

What kind of pronunciation variation is hard for triphones to model?
Proceedings of the IEEE International Conference on Acoustics, 2001

The effect of language model probability on pronunciation reduction.
Proceedings of the IEEE International Conference on Acoustics, 2001

Is Knowledge-Free Induction of Multiword Unit Dictionary Headwords a Solved Problem?
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2001

2000
Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech?
CoRR, 2000

Dialogue Act Modeling for Automatic Tagging and Recognition of Conversational Speech
CoRR, 2000

Dialog Act Modeling for Automatic Tagging and Recognition of Conversational Speech.
Comput. Linguistics, 2000

Knowledge-Free Induction of Morphology Using Latent Semantic Analysis.
Proceedings of the Fourth Conference on Computational Natural Language Learning, 2000

Speech and language processing - an introduction to natural language processing, computational linguistics, and speech recognition.
Prentice Hall series in artificial intelligence, Prentice Hall, ISBN: 978-0-13-095069-7, 2000

1998
An American national corpus: a proposal.
Proceedings of the First International Conference on Language Resources and Evaluation, 1998

Reduction of English function words in switchboard.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Towards better integration of semantic predictors in statistical language modeling.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

How Verb Subcategorization Frequencies Are Affected By Corpus Choice.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

1996
Learning Bias and Phonological-Rule Induction.
Comput. Linguistics, 1996

A Probabilistic Model of Lexical and Syntactic Access and Disambiguation.
Cogn. Sci., 1996

1995
Building multiple pronunciation models for novel words using exploratory computational phonology.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Using a stochastic context-free grammar as a language model for speech recognition.
Proceedings of the 1995 International Conference on Acoustics, 1995

Learning Phonological Rule Probabilities from Speech Corpora with Exploratory Computational Phonology.
Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, 1995

Automatic Induction of Finite State Transducers for Simple Phonological Rules.
Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, 1995

1994
The berkeley restaurant project.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

1992
An On-Line Computational Model of Human Sentence Interpretation.
Proceedings of the 10th National Conference on Artificial Intelligence, 1992

1990
Representing and Integrating Linguistic Knowledge.
Proceedings of the 13th International Conference on Computational Linguistics, 1990

1989
James Allen, Understanding Natural Language.
Artif. Intell., 1989

1988
Issues in Relating Syntax Semantics.
Proceedings of the 12th International Conference on Computational Linguistics, 1988


  Loading...