Izhak Shafran

According to our database1, Izhak Shafran authored at least 92 papers between 2000 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Learning to Learn Faster from Human Feedback with Language Model Predictive Control.
CoRR, 2024

Retrieval Augmented End-to-End Spoken Dialog Models.
CoRR, 2024

2023
RoboVQA: Multimodal Long-Horizon Reasoning for Robotics.
CoRR, 2023

SLM: Bridge the thin gap between speech and text foundation models.
CoRR, 2023

Efficient Adapters for Giant Speech Models.
CoRR, 2023

Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding.
CoRR, 2023

MUX-PLMs: Pre-training Language Models with Data Multiplexing.
Proceedings of the 8th Workshop on Representation Learning for NLP, 2023

Tree of Thoughts: Deliberate Problem Solving with Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ReAct: Synergizing Reasoning and Acting in Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

AnyTOD: A Programmable Task-Oriented Dialog System.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MUX-PLMs: Data Multiplexing for High-throughput Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

SLM: Bridge the Thin Gap Between Speech and Text Foundation Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Detecting Speech Abnormalities With a Perceiver-Based Sequence Classifier that Leverages a Universal Speech Model.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Speech Aware Dialog System Technology Challenge (DSTC11).
CoRR, 2022

RNN Transducers for Nested Named Entity Recognition with constraints on alignment for long sequences.
CoRR, 2022

Description-Driven Task-Oriented Dialog Modeling.
CoRR, 2022

Unsupervised Slot Schema Induction for Task-oriented Dialog.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

RNN Transducers for Named Entity Recognition with constraints on alignment for understanding medical conversations.
Proceedings of the Interspeech 2022, 2022

Knowledge-grounded Dialog State Tracking.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
R2D2: Relational Text Decoding with Transformers.
CoRR, 2021

Understanding Medical Conversations: Rich Transcription, Confidence Scores & Information Extraction.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Word-Level Confidence Estimation for RNN Transducers.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Google COVID-19 Search Trends Symptoms Dataset: Anonymization Process Description (version 1.0).
CoRR, 2020

The Medical Scribe: Corpus Development and Model Performance Analyses.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2019
Learning to Infer Entities, Properties and their Relations from Clinical Conversations.
CoRR, 2019

Audio De-identification - a New Entity Recognition Task.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Joint Speech Recognition and Speaker Diarization via Sequence Transduction.
Proceedings of the Interspeech 2019, 2019

Learning to Infer Entities, Properties and their Relations from Clinical Conversations.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Extracting Symptoms and their Status from Clinical Conversations.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Improvements to harmonic model for extracting better speech features in clinical applications.
Comput. Speech Lang., 2018

Complex Evolution Recurrent Neural Networks (ceRNNs).
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2017


Adaptive Multichannel Dereverberation for Automatic Speech Recognition.
Proceedings of the Interspeech 2017, 2017

Raw Multichannel Processing Using Deep Neural Networks.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
Fuzzy Statistical Matrices for Cell Classification.
CoRR, 2016

Complex Linear Projection (CLP): A Discriminative Approach to Joint Feature Extraction and Acoustic Modeling.
Proceedings of the Interspeech 2016, 2016

Reducing the Computational Complexity of Multimicrophone Acoustic Models with Integrated Feature Extraction.
Proceedings of the Interspeech 2016, 2016

Robust speech recognition using multivariate copula models.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Fully automated assessment of the severity of Parkinson's disease from speech.
Comput. Speech Lang., 2015

Context dependent phone models for LSTM RNN acoustic modelling.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Efficient and accurate multivariate class conditional densities using copula.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Inferring social nature of conversations from words: Experiments on a corpus of everyday telephone conversations.
Comput. Speech Lang., 2014

Applications of Lexicographic Semirings to Problems in Speech and Language Processing.
Comput. Linguistics, 2014

Inferring clinical depression from speech and spoken utterances.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2014

Inferring social contexts from audio recordings using deep neural networks.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2014

Discriminative pronunciation modeling for dialectal speech recognition.
Proceedings of the INTERSPEECH 2014, 2014

Automatic measurement of affective valence and arousal in speech.
Proceedings of the IEEE International Conference on Acoustics, 2014

Detecting vocalizations of individual monkeys in social groups.
Proceedings of the 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2014

Detecting Health Related Discussions in Everyday Telephone Conversations for Studying Medical Events in the Lives of Older Adults.
Proceedings of BioNLP, Baltimore, Maryland, USA, 2014

2013
Inferring functional connectivity in MRI using Bayesian network structure learning with a modified PC algorithm.
NeuroImage, 2013

Discriminative Joint Modeling of Lexical Variation and Acoustic Confusion for Automated Narrative Retelling Assessment.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Visual hull reconstruction for automated primate behavior observation.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

Improving the accuracy and the robustness of harmonic model for pitch estimation.
Proceedings of the INTERSPEECH 2013, 2013

Robust and accurate features for detecting and diagnosing autism spectrum disorders.
Proceedings of the INTERSPEECH 2013, 2013

Parsimonious multivariate copula model for density estimation.
Proceedings of the IEEE International Conference on Acoustics, 2013

Adaptive H-Extrema for Automatic Immunogold Particle Detection.
Proceedings of the Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, 2013

2012
Discriminative Language Modeling With Linguistic and Statistically Derived Features.
IEEE Trans. Speech Audio Process., 2012

Robust detection of voiced segments in samples of everyday conversations using unsupervised HMMS.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Hello, Who is Calling?: Can Words Reveal the Social Nature of Conversations?
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Hallucinating system outputs for discriminative language modeling.
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Interspeech Pathology Challenge: Investigations into Speaker and Sentence Specific Effects.
Proceedings of the INTERSPEECH 2012, 2012

Fully Automated Neuropsychological Assessment for Detecting Mild Cognitive Impairment.
Proceedings of the INTERSPEECH 2012, 2012





2011
Learning a Discriminative Weighted Finite-State Transducer for Speech Recognition.
IEEE Trans. Speech Audio Process., 2011

Discriminatively estimated discrete, parametric and smoothed-discrete duration models for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Supervised and unsupervised feature selection for inferring social nature of telephone conversations from their content.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

Efficient determinization of tagged word lattices using categorial and lexicographic semirings.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

Lexicographic Semirings for Exact Automata Encoding of Sequence Models.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010
Discriminatively estimated joint acoustic, duration, and language model for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Syntactic and sub-lexical features for Turkish discriminative language models.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Classifying clear and conversational speech based on acoustic features.
Proceedings of the INTERSPEECH 2009, 2009

2008
Discriminative n-gram language modeling for Turkish.
Proceedings of the INTERSPEECH 2008, 2008

2007
Multi-stream Fusion for Speaker Classification.
Proceedings of the Speaker Classification I: Fundamentals, Features, and Methods, 2007

The SRI/OGI 2006 spoken term detection system.
Proceedings of the INTERSPEECH 2007, 2007

Exploiting prosody for PCFGs with latent annotations.
Proceedings of the INTERSPEECH 2007, 2007

2006
SParseval: Evaluation Metrics for Parsing Speech.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Discriminative Classifiers for Language Recognition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Reranking for Sentence Boundary Detection in Conversational Speech.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Corrective Models for Speech Recognition of Inflected Languages.
Proceedings of the EMNLP 2006, 2006

Overview of the CLEF-2006 Cross-Language Speech Retrieval Track.
Proceedings of the Evaluation of Multilingual and Multi-modal Information Retrieval, 2006

PCFGs with Syntactic and Prosodic Indicators of Speech Repairs.
Proceedings of the ACL 2006, 2006

2005
Automatic Detection and Segmentation of Robot-Assisted Surgical Motions.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention, 2005

Accent detection and speech recognition for Shanghai-accented Mandarin.
Proceedings of the INTERSPEECH 2005, 2005

A Comparison of Classifiers for Detecting Emotion from Speech.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Task-specific minimum Bayes-risk decoding using learned edit distance.
Proceedings of the INTERSPEECH 2004, 2004

2003
Acoustic model clustering based on syllable structure.
Comput. Speech Lang., 2003

Robust speech detection and segmentation for real-time ASR applications.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2000
Use of higher level linguistic structure in acoustic modeling for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2000


  Loading...