Xiaochen Wang

Orcid: 0000-0003-4177-7580

Affiliations:
  • Wuhan University, Wuhan, China


According to our database, Xiaochen Wang authored at least 77 papers between 2011 and 2025.

Bibliography

2025
Optimal Illumination Distance Metrics for Person Re-Identification in Complex Lighting Conditions.
ACM Trans. Multim. Comput. Commun. Appl., January, 2025

Multimodal and multichannel speech separation using location-guided speech feature mapping network.
Neurocomputing, 2025

2024
Learning with noisy labels for robust fatigue detection.
Knowl. Based Syst., 2024

Adaptive subband partition encoding scheme for multiple audio objects using CNN and residual dense blocks mixture network.
Expert Syst. Appl., 2024

Optimal Illumination Distance Metrics for Person Re-identification.
Proceedings of the PRICAI 2024: Trends in Artificial Intelligence, 2024

2023
Multi-speaker DoA Estimation Using Audio and Visual Modality.
Neural Process. Lett., December, 2023

Multi-scale modeling temporal hierarchical attention for sequential recommendation.
Inf. Sci., September, 2023

A Dual Self-Attention mechanism for vehicle re-Identification.
Pattern Recognit., May, 2023

Cross-platform sequential recommendation with sharing item-level relevance data.
Inf. Sci., April, 2023

Single-channel Multi-speakers Speech Separation Based on Isolated Speech Segments.
Neural Process. Lett., February, 2023

From Collective Attribute Association of Groups to Precise Attribute Association of Individuals.
IEEE Trans. Multim., 2023

Perceptual Audio Object Coding Using Adaptive Subband Grouping with CNN and Residual Block.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Multi-speaker Direction of Arrival Estimation Using Audio and Visual Modalities with Convolutional Neural Network.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

2022
COVID-19 contact tracking by group activity trajectory recovery over camera networks.
Pattern Recognit., 2022

High Parameter Frequency Resolution Encoding Scheme for Spatial Audio Objects Using Stacked Sparse Autoencoder.
Neural Process. Lett., 2022

Distortion Reduction via CAE and DenseNet Mixture Network for Low Bitrate Spatial Audio Object Coding.
IEEE Multim., 2022

$(\alpha, \beta)$-AWCS: $(\alpha, \beta)$-Attributed Weighted Community Search on Bipartite Graphs.
Proceedings of the International Joint Conference on Neural Networks, 2022

2021
Intelligibility Enhancement Via Normal-to-Lombard Speech Conversion With Long Short-Term Memory Network and Bayesian Gaussian Mixture Model.
IEEE Trans. Multim., 2021

Trajectory Association for Person Re-identification.
Neural Process. Lett., 2021

Estimation of spherical harmonic coefficients in sound field recording using feed-forward neural networks.
Multim. Tools Appl., 2021

Optimization of sound fields reproduction based Higher-Order Ambisonics (HOA) using the Generative Adversarial Network (GAN).
Multim. Tools Appl., 2021

Audio object coding based on N-step residual compensating.
Multim. Tools Appl., 2021

Intelligent cloud computing platform for three-dimensional sound reproduction.
Concurr. Comput. Pract. Exp., 2021

Urban Hierarchical Open-up Schemes Based on Fine Regional Epidemic Data for the Lockdown in COVID-19.
Big Data Res., 2021

Stacked Sparse Autoencoder for Audio Object Coding.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021

Low Bitrates Audio Object Coding Using Convolutional Auto-Encoder and Densenet Mixture Model.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Efficient Multi-Step Audio Object Coding with Limited Residual Information.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Spatial Audio Object Coding Based on Time-Frequency Shifting and Scheduling.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

2020
A mapping model of spectral tilt in normal-to-Lombard speech conversion for intelligibility enhancement.
Multim. Tools Appl., 2020

Single Channel multi-speaker speech Separation based on quantized ratio mask and residual network.
Multim. Tools Appl., 2020

Loudspeaker triplet selection based on low distortion within head for multichannel conversion of smart 3D home theater.
Concurr. Comput. Pract. Exp., 2020

Fake Identity Attributes Detection Based on Analysis of Natural and Human Behaviors.
IEEE Access, 2020

HMM-Based Person Re-identification in Large-Scale Open Scenario.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Multi-step Coding Structure of Spatial Audio Object Coding.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Perceptual Localization of Virtual Sound Source Based on Loudspeaker Triplet.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

HRTF Representation with Convolutional Auto-encoder.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Normal-To-Lombard Speech Conversion by LSTM Network and BGMM for Intelligibility Enhancement of Telephone Speech.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Speech Intelligibility Enhancement Using Non-Parallel Speaking Style Conversion With Stargan And Dynamic Range Compression.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

2019
Audio object coding based on optimal parameter frequency resolution.
Multim. Tools Appl., 2019

A near-end listening enhancement system by RNN-based noise cancellation and speech modification.
Multim. Tools Appl., 2019

Spectral Tilt Estimation for Speech Intelligibility Enhancement Using RNN Based on All-Pole Model.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

Multi-speakers Speech Separation Based on Modified Attractor Points Estimation and GMM Clustering.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

2018
Individualization of Head Related Transfer Functions Based on Radial Basis Function Neural Network.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

2017
The Analysis for Binaural Signal's Characteristics of a Real Source and Corresponding Virtual Sound Image.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

An Efficient Method Using the Parameterized HRTFs for 3D Audio Real-Time Rendering on Mobile Devices.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Speech Intelligibility Enhancement in Strong Mechanical Noise Based on Neural Networks.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

3D Sound Field Reproduction at Non Central Point for NHK 22.2 System.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Frame-Independent and Parallel Method for 3D Audio Real-Time Rendering on Mobile Devices.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

The Perceptual Lossless Quantization of Spatial Parameter for 3D Audio Signals.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Sound physical property matching between non central listening point and central listening point for NHK 22.2 system reproduction.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
JND-based spatial parameter quantization of multichannel audio signals.
EURASIP J. Audio Speech Music. Process., 2016

Head Related Transfer Function Interpolation Based on Aligning Operation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Audio Bandwidth Extension Using Audio Super-Resolution.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Adaptive Multichannel Reduction Using Convex Polyhedral Loudspeaker Array.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Analysis and Comparison of Inter-Channel Level Difference and Interaural Level Difference.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Multichannel reduction based on sound field within two ears.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

2015
Multi-channel Object-Based Spatial Parameter Compression Approach for 3D Audio.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Real-Life Voice Activity Detection Based on Audio-Visual Alignment.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

3D Panning Based Sound Field Enhancement Method for Ambisonics.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Multichannel Simplification Based on Deviation of Loudspeaker Positions.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Physical Properties of Sound Field Based Estimation of Phantom Source in 3D.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Low Bitrates Audio Bandwidth Extension Using a Deep Auto-Encoder.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Simplification of 3D Multichannel Sound System Based on Multizone Soundfield Reproduction.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Unequal error protection for S³AC coding based on expanding window fountain codes.
Proceedings of the 2015 IEEE Symposium on Computers and Communication, 2015

Spatial perception reproduction of sound events based on sound property coincidences.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

A down-mixing method for 22.2 multichannel system reproduction.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Comments on "Algorithmic Aspects of Hardware/Software Partitioning: 1D Search Algorithms".
IEEE Trans. Computers, 2014

Expanded three-channel mid/side coding for three-dimensional multichannel audio systems.
EURASIP J. Audio Speech Music. Process., 2014

Reduction of Multichannel Sound System Based on Spherical Harmonics.
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014

Automatic Multichannel Simplification with Low Impacts on Sound Pressure at Ears.
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014

The Perceptual Characteristics of 3D Orientation.
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014

A 3D audio coding technique based on extracting the distance parameter.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Three-dimensional panning by four loudspeakers and its solution.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

AVS2 speech and audio coding scheme for high quality at low bitrates.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

2013
Gain Factors Calibration in 3D Sound Reproduction Using VBAP.
Proceedings of the Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2013

An expanded Mid/Side coding for 3D audio signal compression.
Proceedings of the IEEE International Conference on Acoustics, 2013

2011
An Even Grid Based Lattice Vector Quantization Algorithm for Mobile Audio Coding.
J. Multim., 2011

