Sandra M. Aluísio

Orcid: 0000-0001-5108-2630

According to our database1, Sandra M. Aluísio authored at least 107 papers between 1995 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
NILC-Metrix: assessing the complexity of written and spoken language in Brazilian Portuguese.
Lang. Resour. Evaluation, March, 2024

Portal NURC-SP: Design, Development, and Speech Processing Corpora Resources to Support the Public Dissemination of Portuguese Spoken Language.
Proceedings of the 16th International Conference on Computational Processing of Portuguese, 2024

Simple and Fast Automatic Prosodic Segmentation of Brazilian Portuguese Spontaneous Speech.
Proceedings of the 16th International Conference on Computational Processing of Portuguese, 2024

TTS applied to the generation of datasets for automatic speech recognition.
Proceedings of the 16th International Conference on Computational Processing of Portuguese, 2024

2023
CORAA ASR: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese.
Lang. Resour. Evaluation, September, 2023

Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person.
CoRR, 2023

2022
RastrOS Project: Natural Language Processing contributions to the development of an eye-tracking corpus with predictability norms for Brazilian Portuguese.
Lang. Resour. Evaluation, 2022

Text complexity of open educational resources in Portuguese: mixing written and spoken registers in a multi-task approach.
Lang. Resour. Evaluation, 2022

TTS-Portuguese Corpus: a corpus for speech synthesis in Brazilian Portuguese.
Lang. Resour. Evaluation, 2022

Interpretability Analysis of Deep Models for COVID-19 Detection.
CoRR, 2022

Bringing NURC/SP to Digital Life: the Role of Open-source Automatic Speech Recognition Models.
CoRR, 2022

A single speaker is almost all you need for automatic speech recognition.
CoRR, 2022

2021
CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese.
CoRR, 2021

Evaluating Semantic Similarity Methods to Build Semantic Predictability Norms of Reading Data.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

SC-GlowTTS: An Efficient Zero-Shot Multi-Speaker Text-To-Speech Model.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Using Natural Language Processing to Build Graphical Abstracts to be used in Studies Selection Activity in Secondary Studies.
Proceedings of the 47th Euromicro Conference on Software Engineering and Advanced Applications, 2021

Using Open Information Extraction to Extract Relations: An Extended Systematic Mapping.
Proceedings of the XLVII Latin American Computing Conference, 2021

Speech2Phone: A Novel and Efficient Method for Training Speaker Recognition Models.
Proceedings of the Intelligent Systems - 10th Brazilian Conference, 2021

Deep Learning against COVID-19: Respiratory Insufficiency Detection in Brazilian Portuguese Speech.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Identificação automática de unidades de informação em testes de reconto de narrativas usando métodos de similaridade semântica avaliação de métodos de similaridade semântica.
Linguamática, 2020

Adaptação Lexical Automática em Textos Informativos do Português Brasileiro para o Ensino Fundamental.
Linguamática, 2020

End-To-End Speech Synthesis Applied to Brazilian Portuguese.
CoRR, 2020

Speech2Phone: A Multilingual and Text Independent Speaker Identification Model.
CoRR, 2020

A Dataset for the Evaluation of Lexical Simplification in Portuguese for Children.
Proceedings of the Computational Processing of the Portuguese Language, 2020

Evaluating Sentence Segmentation in Different Datasets of Neuropsychological Language Tests in Brazilian Portuguese.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Using Eye-tracking Data to Predict the Readability of Brazilian Portuguese Sentences in Single-task, Multi-task and Sequential Transfer Learning Approaches.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
Automatic detection and correction of discourse marker errors made by Spanish native speakers in Portuguese academic writing.
Lang. Resour. Evaluation, 2019

Theoretical learning guarantees applied to acoustic modeling.
J. Braz. Comput. Soc., 2019

Robust Phoneme Recognition with Little Data.
Proceedings of the 8th Symposium on Languages, Applications and Technologies, 2019

2018
Sentence Segmentation and Disfluency Detection in Narrative Transcripts from Neuropsychological Tests.
Proceedings of the Computational Processing of the Portuguese Language, 2018

SIMPLEX-PB: A Lexical Simplification Database and Benchmark for Portuguese.
Proceedings of the Computational Processing of the Portuguese Language, 2018

Syntactic Knowledge for Natural Language Inference in Portuguese.
Proceedings of the Computational Processing of the Portuguese Language, 2018

A Nontrivial Sentence Corpus for the Task of Sentence Readability Assessment in Portuguese.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017
Discriminating between Similar Languages with Word-level Convolutional Neural Networks.
Proceedings of the Fourth Workshop on NLP for Similar Languages, 2017

A Lightweight Regression Method to Infer Psycholinguistic Properties for Brazilian Portuguese.
Proceedings of the Text, Speech, and Dialogue - 20th International Conference, 2017

Evaluating Word Embeddings for Sentence Boundary Detection in Speech Transcripts.
Proceedings of the 11th Brazilian Symposium in Information and Human Language Technology, 2017

Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks.
Proceedings of the 11th Brazilian Symposium in Information and Human Language Technology, 2017

The Coreference Annotation of the CSTNews Corpus.
Proceedings of the Second Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2017) co-located with 33th Conference of the Spanish Society for Natural Language Processing (SEPLN 2017), 2017

Sentence Segmentation in Narrative Transcripts from Neuropsychological Tests using Recurrent Convolutional Neural Networks.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Acoustic Modeling Using a Shallow CNN-HTSVM Architecture.
Proceedings of the 2017 Brazilian Conference on Intelligent Systems, 2017

MilkQA: A Dataset of Consumer Questions for the Task of Answer Selection.
Proceedings of the 2017 Brazilian Conference on Intelligent Systems, 2017

Enriching Complex Networks with Word Embeddings for Detecting Mild Cognitive Impairment from Speech Transcripts.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Visão Geral da Avaliação de Similaridade Semântica e Inferência Textual.
Linguamática, 2016

An MCDM Approach to the Selection of Novel Technologies for Innovative In-Vehicle Information Systems.
Int. J. Decis. Support Syst. Technol., 2016

Sentence Segmentation in Narrative Transcripts from Neuropsycological Tests using Recurrent Convolutional Neural Networks.
CoRR, 2016

Automatic Semantic Role Labeling on Non-revised Syntactic Trees of Journalistic Texts.
Proceedings of the Computational Processing of the Portuguese Language, 2016

Automatic Classification of the Complexity of Nonfiction Texts in Portuguese for Early School Years.
Proceedings of the Computational Processing of the Portuguese Language, 2016

Improving POS Tagging Across Portuguese Variants with Word Embeddings.
Proceedings of the Computational Processing of the Portuguese Language, 2016

Evaluating Progression of Alzheimer's Disease by Regression and Classification Methods in a Narrative Language Test in Portuguese.
Proceedings of the Computational Processing of the Portuguese Language, 2016

Evaluating Phonetic Spellers for User-Generated Content in Brazilian Portuguese.
Proceedings of the Computational Processing of the Portuguese Language, 2016

2015
Evaluating word embeddings and a revised corpus for part-of-speech tagging in Portuguese.
J. Braz. Comput. Soc., 2015

Portal Min@s: Uma Ferramenta Geral de Apoio ao Processamento de Córpus de Propósito Geral (Portal Min@s: A General Purpose Support Tool for Corpora Processing).
Proceedings of the 10th Brazilian Symposium in Information and Human Language Technology, 2015

Semi-Automatic Construction of a Textual Entailment Dataset: Selecting Candidates with Vector Space Models.
Proceedings of the 10th Brazilian Symposium in Information and Human Language Technology, 2015

Automatic Generation of a Lexical Resource to support Semantic Role Labeling in Portuguese.
Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics, 2015

A Deep Architecture for Non-Projective Dependency Parsing.
Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing, 2015

Automatic Proposition Extraction from Dependency Trees: Helping Early Prediction of Alzheimer's Disease from Narratives.
Proceedings of the 28th IEEE International Symposium on Computer-Based Medical Systems, 2015

2014
Using Cross-Linguistic Knowledge to Build VerbNet-Style Lexicons: Results for a (Brazilian) Portuguese VerbNet.
Proceedings of the Computational Processing of the Portuguese Language, 2014

GENERATING A LEXICON OF ERRORS IN PORTUGUESE TO SUPPORT AN ERROR IDENTIFICATION SYSTEM FOR SPANISH NATIVE LEARNERS.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

A Large Corpus of Product Reviews in Portuguese: Tackling Out-Of-Vocabulary Words.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Using a hybrid approach to build a pronunciation dictionary for Brazilian Portuguese.
Proceedings of the INTERSPEECH 2014, 2014

Some Issues on the Normalization of a Corpus of Products Reviews in Portuguese.
Proceedings of the 9th Web as Corpus Workshop, 2014

2013
Complex networks analysis of language complexity
CoRR, 2013

Approaches for Helping Brazilian Students Improve their Scientific Writings.
Proceedings of the 9th Brazilian Symposium in Information and Human Language Technology, 2013

An Evaluation of the Brazilian Portuguese LIWC Dictionary for Sentiment Analysis.
Proceedings of the 9th Brazilian Symposium in Information and Human Language Technology, 2013

Um repositório de verbos para a anotação de papéis semânticos disponível na web (A Verb Repository for Semantic Role Labeling Available in the Web) [in Portuguese].
Proceedings of the 9th Brazilian Symposium in Information and Human Language Technology, 2013

Identifying Pronominal Verbs: Towards Automatic Disambiguation of the Clitic 'se' in Portuguese.
Proceedings of the 9th Workshop on Multiword Expressions, 2013

2012
An architecture for multidimensional computer adaptive test with educational purposes.
Proceedings of the Brazilian Symposium on Multimedia and the Web, 2012

Propbank-Br: a Brazilian Treebank annotated with semantic role labels.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

2011
Automatic Question Categorization: a New Approach for Text Elaboration.
Proces. del Leng. Natural, 2011

Using machine learning methods to avoid the pitfall of cognates and false friends in Spanish-Portuguese word pairs.
Proceedings of the 8th Brazilian Symposium in Information and Human Language Technology, 2011

Características do jornalismo popular: avaliação da inteligibilidade e auxílio à descrição do gênero (Characteristics of Popular News: the Evaluation of Intelligibility and Support to the Genre Description) [in Portuguese].
Proceedings of the 8th Brazilian Symposium in Information and Human Language Technology, 2011

Propbank-Br: a Brazilian Portuguese corpus annotated with semantic role labels.
Proceedings of the 8th Brazilian Symposium in Information and Human Language Technology, 2011

Towards an on-demand Simple Portuguese Wikipedia.
Proceedings of the Second Workshop on Speech and Language Processing for Assistive Technologies, 2011

Identifying and Analyzing Brazilian Portuguese Complex Predicates.
Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World, 2011

2010
Adapting Web content for low-literacy readers by using lexical elaboration and named entities labeling.
New Rev. Hypermedia Multim., 2010

Análise da Inteligibilidade de textos via ferramentas de Processamento de Língua Natural: adaptando as métricas do Coh-Metrix para o Português.
Linguamática, 2010

Um panorama do Núcleo Interinstitucional de Linguística Computacional às vésperas de sua maioridade.
Linguamática, 2010

Challenging Choices for Text Simplification.
Proceedings of the Computational Processing of the Portuguese Language, 2010

SIMPLIFICA: a tool for authoring simplified texts in Brazilian Portuguese guided by readability assessments.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, June 2, 2010, Los Angeles, California, USA, 2010

Fostering Digital Inclusion and Accessibility: The PorSimples project for Simplification of Portuguese Texts.
Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas, 2010

Assigning Wh-Questions to Verbal Arguments: Annotation Tools Evaluation and Corpus Building.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Revisiting the Readability Assessment of Texts in Portuguese.
Proceedings of the Advances in Artificial Intelligence, 2010

2009
Building a Corpus-based Historical Portuguese Dictionary: Challenges and Opportunities.
Trait. Autom. des Langues, 2009

Facilita: helping the reading of texts available on the web.
Proceedings of the XV Brazilian Symposium on Multimedia and the Web, 2009

Simplifica: a simplified texts web authoring system.
Proceedings of the XV Brazilian Symposium on Multimedia and the Web, 2009

Facilita: reading assistance for low-literacy readers.
Proceedings of the 27th Annual International Conference on Design of Communication, 2009

Supporting the Adaptation of Texts for Poor Literacy Readers: a Text Simplification Editor for Brazilian Portuguese.
Proceedings of the Fourth Workshop on Innovative Use of NLP for Building Educational Applications, 2009

2008
Automatic summarization for text simplification: evaluating text understanding by poor readers.
Proceedings of the Companion Proceedings of the XIV Brazilian Symposium on Multimedia and the Web, 2008

Procorph: um sistema de apoio à criação de dicionários históricos.
Proceedings of the Companion Proceedings of the XIV Brazilian Symposium on Multimedia and the Web, 2008

OntoMethodus: a methodology to build domain-specific ontologies and its use in a system to support the generation of terminographic products.
Proceedings of the Companion Proceedings of the XIV Brazilian Symposium on Multimedia and the Web, 2008

A corpus analysis of simple account texts and the proposal of simplification strategies: first steps towards text simplification systems.
Proceedings of the 26th Annual International Conference on Design of Communication, 2008

Towards Brazilian Portuguese automatic text simplification systems.
Proceedings of the 2008 ACM Symposium on Document Engineering, 2008

2006
Developing strategies to produce better scientific papers: a Recipe for non-native users of English
CoRR, 2006

Argumentative Zoning Applied to Critiquing Novices' Scientific Abstracts.
Proceedings of the Computing Attitude and Affect in Text: Theory and Applications, 2006

2005
Evaluating Scientific Abstracts with a Genre-specific Rubric.
Proceedings of the Artificial Intelligence in Education, 2005

2004
Applying Argumentative Zoning in an Automatic Critiquer of Academic Writing.
Proceedings of the Advances in Artificial Intelligence - SBIA 2004, 17th Brazilian Symposium on Artificial Intelligence, São Luis, Maranhão, Brazil, September 29, 2004

The Lácio-Web: Corpora and Tools to Advance Brazilian Portuguese Language Investigations and Computational Linguistic Tools.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

What is my Style? Using Stylistic Features of Portuguese Web Texts to Classify Web Pages According to Users' Needs.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

A Learning Environment for English for Academic Purposes Based on Adaptive Tests and Task-Based Systems.
Proceedings of the Intelligent Tutoring Systems, 7th International Conference, 2004

2003
Assessing High-Order Skills with Partial Knowledge Evaluation: Lessons Learned from Using a Computer-based Proficiency Test of English for Academic Purposes.
J. Inf. Technol. Educ., 2003

An Account of the Challenge of Tagging a Reference Corpus for Brazilian Portuguese.
Proceedings of the Computational Processing of the Portuguese Language, 2003

An Initial Proposal for Cooperative Evaluation on Information Retrieval in Portuguese.
Proceedings of the Computational Processing of the Portuguese Language, 2003

2001
RaBeCa: A Hybrid Case-Based Reasoning Development Environment.
Proceedings of the 13th IEEE International Conference on Tools with Artificial Intelligence, 2001

How to Learn the Many Unwritten "Rules of the Game" of the Academic Discourse: A Hybrid Approach Based on Critiques and Cases to Support Scientific Writing.
Proceedings of the Proceedings IEEE International Conference on Advanced Learning Technology: Issues, 2001

2000
Combining Classifiers to Improve Part of Speech Tagging: A Case Study for Brazilian Portuguese.
Proceedings of the International Joint Conference, 2000

1995
A Case-Based Approach for Developing Writing Tools Aimed at Non-native English Users.
Proceedings of the Case-Based Reasoning Research and Development, 1995


  Loading...