Yuhong Yang

Speech Commun., March, 2023

A comparative study of Grid and Natural sentences effects on Normal-to-Lombard conversion.

[BibT_eX]

[DOI]

CoRR, 2023

Mandarin Lombard Flavor Classification.

[BibT_eX]

[DOI]

CoRR, 2023

EMALG: An Enhanced Mandarin Lombard Grid Corpus with Meaningful Sentences.

[BibT_eX]

[DOI]

CoRR, 2023

PCNN: A Lightweight Parallel Conformer Neural Network for Efficient Monaural Speech Enhancement.

[BibT_eX]

[DOI]

CoRR, 2023

Exploring the Interactions between Target Positive and Negative Information for Acoustic Echo Cancellation.

[BibT_eX]

[DOI]

CoRR, 2023

A Snoring Sound Dataset for Body Position Recognition: Collection, Annotation, and Analysis.

[BibT_eX]

[DOI]

CoRR, 2023

CQNV: A combination of coarsely quantized bitstream and neural vocoder for low rate speech coding.

[BibT_eX]

[DOI]

CoRR, 2023

Leveraging Sound Local and Global Features for Language-Queried Target Sound Extraction.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 30th International Conference, 2023

ONEI: Unveiling Route and Phase of Breathing from Snoring Sounds.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 30th International Conference, 2023

PMMSD: Development of the Matrix Sentence Intelligibility Dataset for Mandarin with Lombard Effect.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Learning From Single-Expert Annotated Labels for Automatic Sleep Staging.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Acoustic Echo Cancellation by Mixing Speech Local and Global Features with Transformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Selector-Enhancer: Learning Dynamic Selection of Local and Non-local Attention Operation for Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Error model and simulation for multisource fusion indoor positioning.

[BibT_eX]

[DOI]

Int. J. Intell. Syst., 2022

Injecting Spatial Information for Monaural Speech Enhancement via Knowledge Distillation.

[BibT_eX]

[DOI]

CoRR, 2022

Mandarin Lombard Grid: a Lombard-grid-like corpus of Standard Chinese.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2022, 2022

Speaker- and Phone-aware Convolutional Transformer Network for Acoustic Echo Cancellation.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2022, 2022

CS-CTCSCONV1D: Small footprint speaker verification with channel split time-channel-time separable 1-dimensional convolution.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2022, 2022

2021

When Face Recognition Meets Occlusion: A New Benchmark.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Error Model of Radio Fingerprint and PDR Fusion Indoor Localization.

[BibT_eX]

[DOI]

CoRR, 2020

RNN-based signal classification for hybrid audio data compression.

[BibT_eX]

[DOI]

Computing, 2020

Constrained Ratio Mask for Speech Enhancement Using DNN.

[BibT_eX]

[DOI]

Hongjiang Yu

Wei-Ping Zhu

Proceedings of the Interspeech 2020, 2020

Attention-Guided Deraining Network Via Stage-Wise Learning.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Towards a real-time production of immersive spatial audio of high individuality with an RBF neural network.

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., 2019

Kullback-Leibler Divergence Frequency Warping Scale for Acoustic Scene Classification Using Convolutional Neural Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

An Improvement of the Degradation of Speaker Recognition in Continuous Cold Speech for Home Assistant.

[BibT_eX]

[DOI]

Proceedings of the Cyberspace Safety and Security - 11th International Symposium, 2019

2018

An RNN-Based Speech-Music Discrimination Used for Hybrid Audio Coder.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

The BLE Fingerprint Map Fast Construction Method for Indoor Localization.

[BibT_eX]

[DOI]

Proceedings of the Algorithms and Architectures for Parallel Processing, 2018

2017

3D Sound Field Reproduction at Non Central Point for NHK 22.2 System.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Sound physical property matching between non central listening point and central listening point for NHK 22.2 system reproduction.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

JND-based spatial parameter quantization of multichannel audio signals.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2016

Spatial Constrained Fine-Grained Color Name for Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Level Ratio Based Inter and Intra Channel Prediction with Application to Stereo Audio Frame Loss Concealment.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

2015

3D Panning Based Sound Field Enhancement Method for Ambisonics.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Physical Properties of Sound Field Based Estimation of Phantom Source in 3D.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Simplification of 3D Multichannel Sound System Based on Multizone Soundfield Reproduction.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Signal-Aware Parametric Quality Model for Audio and Speech over IP Networks.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

Azimuthal Perceptual Resolution Model Based Adaptive 3D Spatial Parameter Coding.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

A down-mixing method for 22.2 multichannel system reproduction.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Expanded three-channel mid/side coding for three-dimensional multichannel audio systems.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2014

A 3D audio coding technique based on extracting the distance parameter.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Three-dimensional panning by four loudspeakers and its solution.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Histograms of Salience for Pedestrian Detection.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

Auditory attention based mobile audio quality assessment.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

A spatial priority based scalable audio coding.

[BibT_eX]

[DOI]

Li Gao

Ruimin Hu

Proceedings of the IEEE International Conference on Acoustics, 2014

An Inter-frame Correlation Based Error Concealment of Immittance Spectral Coefficients for Mobile Speech and Audio Codecs.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on High Performance Computing and Communications, 2014

Joint speech/audio coding based scalable perceptual audio coding.

[BibT_eX]

[DOI]

Li Gao

Ruimin Hu