Philip J. B. Jackson

Orcid: 0000-0001-7933-5935

According to our database1, Philip J. B. Jackson authored at least 76 papers between 2000 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Leveraging Visual Supervision for Array-Based Active Speaker Detection and Localization.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

2023
Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection.
CoRR, 2023

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions.
CoRR, 2023

Audio Inputs for Active Speaker Detection and Localization Via Microphone Array.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Producing Personalised Object-Based Audio-Visual Experiences: an Ethnographic Study.
Proceedings of the 2023 ACM International Conference on Interactive Media Experiences, 2023

PAT: Position-Aware Transformer for Dense Multi-Label Action Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Immersive audio-visual scene reproduction using semantic scene reconstruction from 360 cameras.
Virtual Real., 2022

Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research.
Proceedings of the European Conference on Visual Media Production, 2022

2021
Acoustic Room Modelling Using 360 Stereo Cameras.
IEEE Trans. Multim., 2021

Naturalistic audio-visual volumetric sequences dataset of sounding actions for six degree-of-freedom interaction.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2021

Visually Supervised Speaker Detection and Localization via Microphone Array.
Proceedings of the 23rd International Workshop on Multimedia Signal Processing, 2021

2020
Immersive Virtual Reality Audio Rendering Adapted to the Listener and the Room.
Proceedings of the Adversarial and Uncertain Reasoning for Adaptive Cyber Defense, 2020

Audio-Visual Spatial Aligment Requirements of Central and Peripheral Object Events.
CoRR, 2020

Audio-Visual Spatial Alignment Requirements of Central and Peripheral Object Events.
Proceedings of the 2020 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2020

2019
Modeling the Comb Filter Effect and Interaural Coherence for Binaural Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

A Speech Synthesis Approach for High Quality Speech Separation and Generation.
IEEE Signal Process. Lett., 2019

Immersive Spatial Audio Reproduction for VR/AR Using Room Acoustic Modelling from 360° Images.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces, 2019

Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Generalisation in Environmental Sound Classification: The 'Making Sense of Sounds' Data Set and Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2019

Robust Full-sphere Binaural Sound Source Localization Using Interaural and Spectral Cues.
Proceedings of the IEEE International Conference on Acoustics, 2019

Six types of audio that DEFY reality!: A taxonomy of audio augmented reality with examples.
Proceedings of the 14th International Audio Mostly Conference: A Journey in Sound, 2019

2018
Multiple Speaker Tracking in Spatial Audio via PHD Filtering and Depth-Audio Fusion.
IEEE Trans. Multim., 2018

An Audio-Visual System for Object-Based Audio: From Recording to Listening.
IEEE Trans. Multim., 2018

An Audio-Visual Method for Room Boundary Estimation and Material Recognition.
Proceedings of the 2018 Workshop on Audio-Visual Scene Understanding for Immersive Multimedia, 2018

Acoustic Reflector Localization and Classification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Iterative Deep Neural Networks for Speaker-Independent Binaural Blind Speech Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Synthesis of Images by Two-Stage Generative Adversarial Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Robust Full-Sphere Binaural Sound Source Localization.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Perceptual Evaluation of Blind Source Separation in Object-Based Audio Production.
Proceedings of the Latent Variable Analysis and Signal Separation, 2018

Supporting Audiography: Design of a System for Sentimental Sound Recording, Classification and Playback.
Proceedings of the HCI International 2018, 2018

Robust median-plane binaural sound source localization.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

A Performance Evaluation of Several Deep Neural Networks for Reverberant Speech Separation.
Proceedings of the 52nd Asilomar Conference on Signals, Systems, and Computers, 2018

2017
Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Acoustic Reflector Localization: Novel Image Source Reversion and Direct Localization Methods.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Object-Based Audio Rendering.
CoRR, 2017

Speech reaction time measurements for the evaluation of audio-visual spatial coherence.
Proceedings of the Ninth International Conference on Quality of Multimedia Experience, 2017

Fast tagging of natural sounds using marginal co-regularization.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A perceptually-weighted deep neural network for monaural speech enhancement in various background noise conditions.
Proceedings of the 25th European Signal Processing Conference, 2017

Media Device Orchestration for Immersive Spatial Audio Reproduction.
Proceedings of the 12th International Audio Mostly Conference on Augmented and Participatory Sound and Music Experiences, 2017

3D Room Geometry Reconstruction Using Audio-Visual Sensors.
Proceedings of the 2017 International Conference on 3D Vision, 2017

2016
Fully Deep Neural Networks Incorporating Unsupervised Feature Learning for Audio Tagging.
CoRR, 2016

Predicting Binaural Speech Intelligibility from Signals Estimated by a Blind Source Separation Algorithm.
Proceedings of the Interspeech 2016, 2016

Fully DNN-Based Multi-Label Regression for Audio Tagging.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

2015
Person Tracking Using Audio and Depth Cues.
Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015

A 3D model for room boundary estimation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

IVA algorithms using a multivariate Student's t source prior for speech source separation in real room environments.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A source separation evaluation method in object-based spatial audio.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
Joint Mixing Vector and Binaural Model Based Stereo Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

2013
Source Separation of Convolutive and Noisy Mixtures Using Audio-Visual Dictionary Learning and Probabilistic Time-Frequency Masking.
IEEE Trans. Signal Process., 2013

Spatial and coherence cues based time-frequency masking for binaural reverberant speech separation.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Use of bimodal coherence to resolve the permutation problem in convolutive BSS.
Signal Process., 2012

Reverberant speech separation based on audio-visual dictionary learning and binaural cues.
Proceedings of the IEEE Statistical Signal Processing Workshop, 2012

2011
Source localization and separation using Random Sample Consensus with phase cues.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Integrating binaural cues and blind source separation method for separating reverberant speech mixtures.
Proceedings of the IEEE International Conference on Acoustics, 2011

Robust feature selection for scaling ambiguity reduction in audio-visual convolutive BSS.
Proceedings of the 19th European Signal Processing Conference, 2011

2010
Bimodal coherence based scale ambiguity cancellation for target speech extraction and enhancement.
Proceedings of the INTERSPEECH 2010, 2010

Use of Bimodal Coherence to Resolve Spectral Indeterminacy in Convolutive BSS.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010

2009
Statistical identification of articulation constraints in the production of speech.
Speech Commun., 2009

Model-Based Synthesis of Visual Speech Movements from 3D Video.
EURASIP J. Audio Speech Music. Process., 2009

Speaker-dependent audio-visual emotion recognition.
Proceedings of the Auditory-Visual Speech Processing, 2009

2008
Frication and Voicing Classification.
Proceedings of the Computational Processing of the Portuguese Language, 2008

Parallel model combination and word recognition in soccer audio.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Audio-visual feature selection and reduction for emotion classification.
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

Parameterisation of 3d speech lip movements.
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

2007
Visual analysis of lip coarticulation in VCV utterances.
Proceedings of the INTERSPEECH 2007, 2007

Statistical identification of critical, dependent and redundant articulators.
Proceedings of the INTERSPEECH 2007, 2007

Time-Frequency-Modulation Representation of Stochastic Signals.
Proceedings of the 15th International Conference on Digital Signal Processing, 2007

2006
Enhancement of harmonic content of speech based on a dynamic programming pitch tracking algorithm.
Proceedings of the INTERSPEECH 2006, 2006

2005
A multiple-level linear/linear segmental HMM with a formant-based intermediate layer.
Comput. Speech Lang., 2005

Amplitude modulation of frication noise by voicing saturates.
Proceedings of the INTERSPEECH 2005, 2005

2004
Speech-Driven Face Synthesis from 3D Video.
Proceedings of the 2nd International Symposium on 3D Data Processing, 2004

2003
The effect of an intermediate articulatory layer on the performance of a segmental HMM.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Covariation and weighting of harmonically decomposed streams for ASR.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Models of speech dynamics in a segmental-HMM recognizer using intermediate linear representations.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech.
IEEE Trans. Speech Audio Process., 2001

2000
Performance of the pitch-scaled harmonic filter and applications in speech analysis.
Proceedings of the IEEE International Conference on Acoustics, 2000


  Loading...