Giampiero Salvi

Orcid: 0000-0002-3323-5311

According to our database1, Giampiero Salvi authored at least 64 papers between 1999 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
A step-by-step training method for multi generator GANs with application to anomaly detection and cybersecurity.
Neurocomputing, June, 2023

NAAQA: A Neural Architecture for Acoustic Question Answering.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video Prediction.
CoRR, 2023

Developing an AI-Assisted Low-Resource Spoken Language Learning App for Children.
IEEE Access, 2023

Improving Generalization of Norwegian ASR with Limited Linguistic Resources.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

A character-based analysis of impacts of dialects on end-to-end Norwegian ASR.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

Using Modified Adult Speech as Data Augmentation for Child Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
CLEAR: A Dataset for Compositional Language and Elementary Acoustic Reasoning.
Dataset, August, 2022

Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Semantically Meaningful Metrics for Norwegian ASR Systems.
Proceedings of the Interspeech 2022, 2022

wav2vec2-based Speech Rating System for Children with Speech Sound Disorder.
Proceedings of the Interspeech 2022, 2022

Hierarchical Residual Learning Based Vector Quantized Variational Autoencoder for Image Reconstruction and Generation.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
A DNN Based Speech Enhancement Approach to Noise Robust Acoustic-to-Articulatory Inversion.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2021

STEP-GAN: A One-Class Anomaly Detection Model with Applications to Power System Security.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Self-Supervised Vision-Based Detection of the Active Speaker as Support for Socially Aware Language Acquisition.
IEEE Trans. Cogn. Dev. Syst., 2020

Beyond the Self: Using Grounded Affordances to Interpret and Describe Others' Actions.
IEEE Trans. Cogn. Dev. Syst., 2020

STEP-GAN: A Step-by-Step Training for Multi Generator GANs with application to Cyber Security in Power Systems.
CoRR, 2020

Sequence-to-Sequence Articulatory Inversion Through Time Convolution of Sub-Band Frequency Signals.
Proceedings of the Interspeech 2020, 2020

Transfer Learning of Articulatory Information Through Phone Information.
Proceedings of the Interspeech 2020, 2020

Spatial Bias in Vision-Based Voice Activity Detection.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

2019
From Visual to Acoustic Question Answering.
CoRR, 2019

Effect of vowel context in cepstral and entropy analysis of pathological voices.
Biomed. Signal Process. Control., 2019

Active Mini-Batch Sampling Using Repulsive Point Processes.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
CLEAR: A Dataset for Compositional Language and Elementary Acoustic Reasoning.
CoRR, 2018

2017
Interactive Robot Learning of Gestures, Language and Affordances.
CoRR, 2017

Self-Supervised Vision-Based Detection of the Active Speaker as a Prerequisite for Socially-Aware Language Acquisition.
CoRR, 2017

Cepstral and Entropy Analyses in Vowels Excerpted from Continuous Speech of Dysphonic and Control Speakers.
Proceedings of the Interspeech 2017, 2017

2016
An Analysis of Shallow and Deep Representations of Speech Based on Unsupervised Classification of Isolated Words.
Proceedings of the Recent Advances in Nonlinear Speech Processing, 2016

Semi-supervised Learning with Sparse Autoencoders in Phone Classification.
CoRR, 2016

Optimising The Input Window Alignment in CD-DNN Based Phoneme Recognition for Low Latency Processing.
CoRR, 2016

2015
Detecting repetitions in spoken dialogue systems using phonetic distances.
Proceedings of the INTERSPEECH 2015, 2015

2014
Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

The WaveSurfer Automatic Speech Recognition Plugin.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Audio-visual classification and detection of human manipulation actions.
Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014

Pattern discovery in continuous speech using Block Diagonal Infinite HMM.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Semi-supervised methods for exploring the acoustics of simple productive feedback.
Speech Commun., 2013

On mispronunciation analysis of individual foreign speakers using auditory periphery models.
Speech Commun., 2013

A gaze-based method for relating group involvement to individual engagement in multimodal multiparty dialogue.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Robot anticipation of human intentions through continuous gesture recognition.
Proceedings of the 2013 International Conference on Collaboration Technologies and Systems, 2013

2012
Language Bootstrapping: Learning Word Meanings From Perception-Action Association.
IEEE Trans. Syst. Man Cybern. Part B, 2012

Word Discovery with Beta Process Factor Analysis.
Proceedings of the INTERSPEECH 2012, 2012

Auditory and Dynamic Modeling Paradigms to Detect L2 Mispronunciations.
Proceedings of the INTERSPEECH 2012, 2012

Biologically Inspired Methods for Automatic Speech Understanding.
Proceedings of the Biologically Inspired Cognitive Architectures 2012 - Proceedings of the Third Annual Meeting of the BICA Society, Palermo, Sicily, Italy, October 31, 2012

2011
Using Imitation to Learn Infant-Adult Acoustic Mappings.
Proceedings of the INTERSPEECH 2011, 2011

2010
Cluster analysis of differential spectral envelopes on emotional speech.
Proceedings of the INTERSPEECH 2010, 2010

2009
SynFace - Speech-Driven Facial Animation for Virtual Speech-Reading Support.
EURASIP J. Audio Speech Music. Process., 2009

Virtual speech reading support for hard of hearing in a domestic multi-media setting.
Proceedings of the INTERSPEECH 2009, 2009

Affordance based word-to-meaning association.
Proceedings of the 2009 IEEE International Conference on Robotics and Automation, 2009

Synface - verbal and non-verbal face animation from audio.
Proceedings of the Auditory-Visual Speech Processing, 2009

2008
Hearing at home - communication support in home environments for hearing impaired persons.
Proceedings of the INTERSPEECH 2008, 2008

2006
Mining Speech Sounds: Machine Learning Methods for Automatic Speech Recognition and Analysis.
PhD thesis, 2006

Segment boundary detection via class entropy measurements in connectionist phoneme recognition.
Speech Commun., 2006

Dynamic behaviour of connectionist speech recognition with strong latency constraints.
Speech Commun., 2006

User Evaluation of the SYNFACE Talking Head Telephone.
Proceedings of the Computers Helping People with Special Needs, 2006

2005
Segment Boundaries in Low Latency Phonetic Recognition.
Proceedings of the Nonlinear Analyses and Algorithms for Speech Processing, 2005

Advances in regional accent clustering in Swedish.
Proceedings of the INTERSPEECH 2005, 2005

Ecological language acquisition via incremental model-based clustering.
Proceedings of the INTERSPEECH 2005, 2005

2004
SYNFACE - A Talking Head Telephone for the Hearing-Impaired.
Proceedings of the Computers Helping People with Special Needs, 2004

2003
Truncation error and dynamics in very low latency phonetic recognition.
Proceedings of the ITRW on Non-Linear Speech Processing, 2003

Using accent information in ASR models for Swedish.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

SYNFACE - a talking face telephone.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2000
The COST 249 SpeechDat Multilingual Reference Recogniser.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

A Noise Robust Multilingual Reference Recogniser Based on Speechdat(II).
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Synthetic visual speech driven from auditory speech.
Proceedings of the Auditory-Visual Speech Processing, 1999


  Loading...