Yuhong Yang

ORCID: 0000-0003-3001-7957

Affiliations:
  • Wuhan University, School of Computer Science, National Engineering Research Center for Multimedia Software, China


According to our database, Yuhong Yang authored at least 53 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Auxiliary Information Guided Self-attention for Image Quality Assessment.
ACM Trans. Multim. Comput. Commun. Appl., April, 2024

2023
CASE-Net: Integrating local and non-local attention operations for speech enhancement.
Speech Commun., March, 2023

A comparative study of Grid and Natural sentences effects on Normal-to-Lombard conversion.
CoRR, 2023

Mandarin Lombard Flavor Classification.
CoRR, 2023

EMALG: An Enhanced Mandarin Lombard Grid Corpus with Meaningful Sentences.
CoRR, 2023

PCNN: A Lightweight Parallel Conformer Neural Network for Efficient Monaural Speech Enhancement.
CoRR, 2023

Exploring the Interactions between Target Positive and Negative Information for Acoustic Echo Cancellation.
CoRR, 2023

A Snoring Sound Dataset for Body Position Recognition: Collection, Annotation, and Analysis.
CoRR, 2023

CQNV: A combination of coarsely quantized bitstream and neural vocoder for low rate speech coding.
CoRR, 2023

Leveraging Sound Local and Global Features for Language-Queried Target Sound Extraction.
Proceedings of the Neural Information Processing - 30th International Conference, 2023

ONEI: Unveiling Route and Phase of Breathing from Snoring Sounds.
Proceedings of the Neural Information Processing - 30th International Conference, 2023

PMMSD: Development of the Matrix Sentence Intelligibility Dataset for Mandarin with Lombard Effect.
Proceedings of the IEEE International Conference on Acoustics, 2023

Learning From Single-Expert Annotated Labels for Automatic Sleep Staging.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Acoustic Echo Cancellation by Mixing Speech Local and Global Features with Transformer.
Proceedings of the IEEE International Conference on Acoustics, 2023

Selector-Enhancer: Learning Dynamic Selection of Local and Non-local Attention Operation for Speech Enhancement.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Error model and simulation for multisource fusion indoor positioning.
Int. J. Intell. Syst., 2022

Injecting Spatial Information for Monaural Speech Enhancement via Knowledge Distillation.
CoRR, 2022

Mandarin Lombard Grid: a Lombard-grid-like corpus of Standard Chinese.
Proceedings of the Interspeech 2022, 2022

Speaker- and Phone-aware Convolutional Transformer Network for Acoustic Echo Cancellation.
Proceedings of the Interspeech 2022, 2022

CS-CTCSCONV1D: Small footprint speaker verification with channel split time-channel-time separable 1-dimensional convolution.
Proceedings of the Interspeech 2022, 2022

2021
When Face Recognition Meets Occlusion: A New Benchmark.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Error Model of Radio Fingerprint and PDR Fusion Indoor Localization.
CoRR, 2020

RNN-based signal classification for hybrid audio data compression.
Computing, 2020

Constrained Ratio Mask for Speech Enhancement Using DNN.
Proceedings of the Interspeech 2020, 2020

Attention-Guided Deraining Network Via Stage-Wise Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Towards a real-time production of immersive spatial audio of high individuality with an RBF neural network.
J. Parallel Distributed Comput., 2019

Kullback-Leibler Divergence Frequency Warping Scale for Acoustic Scene Classification Using Convolutional Neural Network.
Proceedings of the IEEE International Conference on Acoustics, 2019

An Improvement of the Degradation of Speaker Recognition in Continuous Cold Speech for Home Assistant.
Proceedings of the Cyberspace Safety and Security - 11th International Symposium, 2019

2018
An RNN-Based Speech-Music Discrimination Used for Hybrid Audio Coder.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

The BLE Fingerprint Map Fast Construction Method for Indoor Localization.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2018

2017
3D Sound Field Reproduction at Non Central Point for NHK 22.2 System.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Sound physical property matching between non central listening point and central listening point for NHK 22.2 system reproduction.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
JND-based spatial parameter quantization of multichannel audio signals.
EURASIP J. Audio Speech Music. Process., 2016

Spatial Constrained Fine-Grained Color Name for Person Re-identification.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Level Ratio Based Inter and Intra Channel Prediction with Application to Stereo Audio Frame Loss Concealment.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

2015
3D Panning Based Sound Field Enhancement Method for Ambisonics.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Physical Properties of Sound Field Based Estimation of Phantom Source in 3D.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Simplification of 3D Multichannel Sound System Based on Multizone Soundfield Reproduction.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Signal-Aware Parametric Quality Model for Audio and Speech over IP Networks.
Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

Azimuthal Perceptual Resolution Model Based Adaptive 3D Spatial Parameter Coding.
Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

A down-mixing method for 22.2 multichannel system reproduction.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Expanded three-channel mid/side coding for three-dimensional multichannel audio systems.
EURASIP J. Audio Speech Music. Process., 2014

A 3D audio coding technique based on extracting the distance parameter.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Three-dimensional panning by four loudspeakers and its solution.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Histograms of Salience for Pedestrian Detection.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

Auditory attention based mobile audio quality assessment.
Proceedings of the IEEE International Conference on Acoustics, 2014

A spatial priority based scalable audio coding.
Proceedings of the IEEE International Conference on Acoustics, 2014

An Inter-frame Correlation Based Error Concealment of Immittance Spectral Coefficients for Mobile Speech and Audio Codecs.
Proceedings of the 2014 IEEE International Conference on High Performance Computing and Communications, 2014

Joint speech/audio coding based scalable perceptual audio coding.
Proceedings of the 2014 IEEE/ACIS 13th International Conference on Computer and Information Science, 2014

2013
A new mobile audio quality assessment using Jitter Distortion Measure approach.
Proceedings of the Fourth International Workshop on Quality of Multimedia Experience, 2013

Gain Factors Calibration in 3D Sound Reproduction Using VBAP.
Proceedings of the Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2013

Sound intensity and particle velocity based three-dimensional panning methods by five loudspeakers.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

2009
Surveillance Audio Attention Model Based on Spatial Audio Cues.
Proceedings of the Advances in Multimedia Information Processing, 2009

