Jiqing Han

Orcid: 0000-0002-4297-4300

Affiliations:
  • Harbin Institute of Technology, School of Computer Science and Technology, China


According to our database1, Jiqing Han authored at least 133 papers between 1997 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Capturing High-Level Semantic Correlations via Graph for Multimodal Sentiment Analysis.
IEEE Signal Process. Lett., 2024

Contrastive Loss Based Frame-wise Feature disentanglement for Polyphonic Sound Event Detection.
CoRR, 2024

2023
Task-driven common subspace learning based semantic feature extraction for acoustic event recognition.
Expert Syst. Appl., December, 2023

A Glance is Enough: Extract Target Sentence By Looking at A keyword.
CoRR, 2023

Spot keywords from very noisy and mixed speech.
CoRR, 2023

Patch-level contrastive embedding learning for respiratory sound classification.
Biomed. Signal Process. Control., 2023

Using Auxiliary Tasks In Multimodal Fusion of Wav2vec 2.0 And Bert for Multimodal Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Subband Dependency Modeling for Sound Event Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

Time-Weighted Frequency Domain Audio Representation with GMM Estimator for Anomalous Sound Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

Graph-Based Spectro-Temporal Dependency Modeling for Anti-Spoofing.
Proceedings of the IEEE International Conference on Acoustics, 2023

Sentiment Knowledge Enhanced Self-supervised Learning for Multimodal Sentiment Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Exploring Inter-Node Relations in CNNs for Environmental Sound Classification.
IEEE Signal Process. Lett., 2022

Contrastive Regularization for Multimodal Emotion Recognition Using Audio and Text.
CoRR, 2022

Word-wise Sparse Attention for Multimodal Sentiment Analysis.
Proceedings of the Interspeech 2022, 2022

Exploring Transformer's Potential on Automatic Piano Transcription.
Proceedings of the IEEE International Conference on Acoustics, 2022

CDMA: Cross-Domain Distance Metric Adaptation for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2022

Sparse Self-Attention for Semi-Supervised Sound Event Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Exploring attention mechanisms based on summary information for end-to-end automatic speech recognition.
Neurocomputing, 2021

Semantic feature extraction based on subspace learning with temporal constraints for acoustic event recognition.
Digit. Signal Process., 2021

Can We Trust Deep Speech Prior?
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Model-Agnostic Fast Adaptive Multi-Objective Balancing Algorithm for Multilingual Automatic Speech Recognition Model Training.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Multimodal Sentiment Analysis with Temporal Modality Attention.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Gradient Regularization for Noise-Robust Speaker Verification.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Capturing Temporal Dependencies Through Future Prediction for CNN-Based Audio Classifiers.
Proceedings of the IEEE International Conference on Acoustics, 2021

Contrastive Embeddind Learning Method for Respiratory Sound Classification.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Pyramidal Temporal Pooling With Discriminative Mapping for Audio Classification.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Nonnegative Matrix Factorization Based Transfer Subspace Learning for Cross-Corpus Speech Emotion Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

A Joint Framework of Denoising Autoencoder and Generative Vocoder for Monaural Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Learning Temporal Relations from Semantic Neighbors for Acoustic Scene Classification.
IEEE Signal Process. Lett., 2020

Task-Driven Variability Model for Speaker Verification.
Circuits Syst. Signal Process., 2020

Toward the pre-cocktail party problem with TasTas+.
CoRR, 2020

La Furca: Iterative Context-Aware End-to-End Monaural Speech Separation Based on Dual-Path Deep Parallel Inter-Intra Bi-LSTM with Attention.
CoRR, 2020

FurcaNeXt: End-to-End Monaural Speech Separation with Dynamic Gated Dilated Temporal Convolutional Networks.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

ATReSN-Net: Capturing Attentive Temporal Relations in Semantic Neighborhood for Acoustic Scene Classification.
Proceedings of the Interspeech 2020, 2020

Speech Separation Based on Multi-Stage Elaborated Dual-Path Deep BiLSTM with Auxiliary Identity Loss.
Proceedings of the Interspeech 2020, 2020

Self-Supervised Adversarial Multi-Task Learning for Vocoder-Based Monaural Speech Enhancement.
Proceedings of the Interspeech 2020, 2020

Double Adversarial Network Based Monaural Speech Enhancement for Robust Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Structured Sparse Attention for end-to-end Automatic Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Pan: Phoneme-Aware Network for Monaural Speech Enhancement.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

TDMF: Task-Driven Multilevel Framework for End-to-End Speaker Verification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
A bilevel framework for joint optimization of session compensation and classification for speaker identification.
Digit. Signal Process., 2019

A Multi-Task Learning Framework for Overcoming the Catastrophic Forgetting in Automatic Speech Recognition.
CoRR, 2019

Hard Sample Mining for the Improved Retraining of Automatic Speech Recognition.
CoRR, 2019

FurcaNeXt: End-to-end monaural speech separation with dynamic gated dilated temporal convolutional networks.
CoRR, 2019

FurcaNet: An end-to-end deep gated convolutional, long short-term memory, deep neural networks for single channel speech separation.
CoRR, 2019

Is CQT more suitable for monaural speech separation than STFT? an empirical study.
CoRR, 2019

Abnormal heart sound detection using temporal quasi-periodic features and long short-term memory without segmentation.
Biomed. Signal Process. Control., 2019

Trace Ratio Criterion Based Large Margin Subspace Learning for Feature Selection.
IEEE Access, 2019

Acoustic Scene Classification by Implicitly Identifying Distinct Sound Events.
Proceedings of the Interspeech 2019, 2019

Deep Attention Gated Dilated Temporal Convolutional Networks with Intra-Parallel Convolutional Modules for End-to-End Monaural Speech Separation.
Proceedings of the Interspeech 2019, 2019

End-to-End Monaural Speech Separation with Multi-Scale Dynamic Weighted Gated Dilated Convolutional Pyramid Network.
Proceedings of the Interspeech 2019, 2019

Subspace Pooling Based Temporal Features Extraction for Audio Event Recognition.
Proceedings of the Interspeech 2019, 2019

Cross-Corpus Speech Emotion Recognition Using Semi-Supervised Transfer Non-Negative Matrix Factorization with Adaptation Regularization.
Proceedings of the Interspeech 2019, 2019

Convolutional Grid Long Short-Term Memory Recurrent Neural Network for Automatic Speech Recognition.
Proceedings of the Neural Information Processing - 26th International Conference, 2019

Furcax: End-to-end Monaural Speech Separation Based on Deep Gated (De)convolutional Neural Networks with Adversarial Example Training.
Proceedings of the IEEE International Conference on Acoustics, 2019

Investigation of Monaural Front-End Processing for Robust Speech Recognition Without Retraining or Joint-Training.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Efficient general sparse denoising with non-convex sparse constraint and total variation regularization.
Digit. Signal Process., 2018

Investigation of Monaural Front-End Processing for Robust ASR without Retraining or Joint-Training.
CoRR, 2018

Adaptive overlapping-group sparse denoising for heart sound signals.
Biomed. Signal Process. Control., 2018

Unsupervised Temporal Feature Learning Based on Sparse Coding Embedded BoAW for Acoustic Event Recognition.
Proceedings of the Interspeech 2018, 2018

A Compact and Discriminative Feature Based on Auditory Summary Statistics for Acoustic Scene Classification.
Proceedings of the Interspeech 2018, 2018

Deep Neural Network Based Discriminative Training for I-Vector/PLDA Speaker Verification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Heart sound classification based on scaled spectrogram and tensor decomposition.
Expert Syst. Appl., 2017

Heart sound classification based on scaled spectrogram and partial least squares regression.
Biomed. Signal Process. Control., 2017

Speaker Verification via Estimating Total Variability Space Using Probabilistic Partial Least Squares.
Proceedings of the Interspeech 2017, 2017

Learning Deep Neural Network Based Kernel Functions for Small Sample Size Classification.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

Towards Heart Sound Classification Without Segmentation Using Convolutional Neural Network.
Proceedings of the Computing in Cardiology, 2017

2016
Signal Periodic Decomposition With Conjugate Subspaces.
IEEE Trans. Signal Process., 2016

Sparse Decomposition for Signal Periodic Model Over Complex Exponential Dictionary.
IEEE Signal Process. Lett., 2016

Speaker Verification via Modeling Kurtosis Using Sparse Coding.
Int. J. Pattern Recognit. Artif. Intell., 2016

Optimization of learned dictionary for sparse coding in speech processing.
Neurocomputing, 2016

Towards heart sound classification without segmentation via autocorrelation feature and diffusion maps.
Future Gener. Comput. Syst., 2016

Towards optimal vlad for human action recognition from still images.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Realistic human action recognition: When deep learning meets VLAD.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Abnormal Heart Sounds detection based on the Scaled Time-Frequency Representation and Feature Selection.
Proceedings of the Computing in Cardiology, CinC 2016, Vancouver, 2016

2015
Soft Margin Based Low-Rank Audio Signal Classification.
Neural Process. Lett., 2015

Dictionary evaluation and optimization for sparse coding based speech processing.
Inf. Sci., 2015

Spectrum enhancement with sparse coding for robust speech recognition.
Digit. Signal Process., 2015

Ramanujan subspace pursuit for signal periodic decomposition.
CoRR, 2015

Noise-robust speaker recognition based on morphological component analysis.
Proceedings of the INTERSPEECH 2015, 2015

2014
Confidence Measure Based on Context Consistency Using Word Occurrence Probability and Topic Adaptation for Spoken Term Detection.
IEICE Trans. Inf. Syst., 2014

A new framework for robust speech recognition in complex channel environments.
Digit. Signal Process., 2014

Sparse Representation with Optimized Learned Dictionary for Robust Voice Activity Detection.
Circuits Syst. Signal Process., 2014

Evaluation of dictionary for sparse coding in speech processing.
Proceedings of the INTERSPEECH 2014, 2014

Learning semantic kernels for scene classification.
Proceedings of the IEEE International Conference on Acoustics, 2014

Robust minimum statistics project coefficients feature for acoustic environment recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Audio classification with low-rank matrix representation features.
ACM Trans. Intell. Syst. Technol., 2013

Identification of Objectionable Audio Segments Based on Pseudo and Heterogeneous Mixture Models.
IEEE Trans. Speech Audio Process., 2013

Audio Segment Classification Using Online Learning Based Tensor Representation Feature Discrimination.
IEEE Trans. Speech Audio Process., 2013

Statistical voice activity detection based on sparse representation over learned dictionary.
Digit. Signal Process., 2013

Guarantees of Augmented Trace Norm Models in Tensor Recovery.
Proceedings of the IJCAI 2013, 2013

Case based reasoning solution to the problem of sustained learning in keyword spotting.
Proceedings of the IEEE International Conference on Acoustics, 2013

Upper and lower bounds for approximation of the Kullback-Leibler divergence between Hidden Markov models.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Sparse-Based auditory Model for robust speaker Recognition.
Int. J. Pattern Recognit. Artif. Intell., 2012

Likelihood ratio sign test for voice activity detection.
IET Signal Process., 2012

Identifiability of multivariate logistic mixture models
CoRR, 2012

Guarantees of Augmented Trace Norm Models in Tensor Recovery
CoRR, 2012

Low-rank Audio Signal Classification Under Soft Margin and Trace Norm Constraints.
Proceedings of the INTERSPEECH 2012, 2012

A Novel Confidence Measure Based on Context Consistency for Spoken Term Detection.
Proceedings of the INTERSPEECH 2012, 2012

Sparse power spectrum based robust voice activity detector.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A solution to residual noise in speech denoising with sparse representation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Gaussian Specific Compensation for Channel Distortion in Speech Recognition.
IEEE Signal Process. Lett., 2011

MAP-based Audio Coding Compensation for Speaker Recognition.
J. Signal Inf. Process., 2011

Voice activity detection based on conjugate subspace matching pursuit and likelihood ratio test.
EURASIP J. Audio Speech Music. Process., 2011

Online Learning for Classification of Low-rank Representation Features and Its Applications in Audio Segment Classification
CoRR, 2011

Trace Norm Regularized Tensor Classification and Its Online Learning Approaches
CoRR, 2011

Heterogeneous mixture models using sparse representation features for applause and laugh detection.
Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

Real-World Speech/Non-Speech Audio Classification Based on Sparse Representation Features and GPCs.
Proceedings of the INTERSPEECH 2011, 2011

AUC Optimization Based Confidence Measure for Keyword Spotting.
Proceedings of the INTERSPEECH 2011, 2011

A Novel Framework Based on Trace Norm Minimization for Audio Event Detection.
Proceedings of the Neural Information Processing - 18th International Conference, 2011

A cochlear neuron based robust feature for speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Compensation of partly reliable components for band-limited speech recognition with missing data techniques.
Proceedings of the IEEE International Conference on Acoustics, 2011

A modified MAP criterion based on hidden Markov model for voice activity detecion.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Particle-based realistic simulation of fluid-solid interaction.
Comput. Animat. Virtual Worlds, 2010

Study on the Recognition of Objectionable Audio.
Int. J. Pattern Recognit. Artif. Intell., 2010

Compensation of signal with erasures via sparse representation into its significant subspace.
Proceedings of the 10th International Conference on Information Sciences, 2010

Model synthesis for band-limited speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Robust statistical voice activity detection using a likelihood ratio sign test.
Proceedings of the INTERSPEECH 2010, 2010

Voice Activity Detection Based on Complex Exponential Atomic Decomposition and Likelihood Ratio Test.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

2009
Speaker identification and verification from audio coded speech in matched and mismatched conditions.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2009

A Fast Audio Retrieval Method Based on Negativity Judgment.
Proceedings of the Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2009), 2009

2008
Text-independent Speaker Identification Based on MAP Channel Compensation and Pitch-dependent Features.
Proceedings of the 2008 International Conference on Information & Knowledge Engineering, 2008

2007
Automatic conversion from lexical words to prosodic words for mandarin text-to-speech system.
Int. J. Speech Technol., 2007

2006
Automatic Music Transcription Based on Harmonic Structure Information.
J. Comput. Res. Dev., 2006

Improved Mandarin Speech Recognition by Lattice Rescoring with Enhanced Tone Models.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

A multi-space distribution (MSD) approach to speech recognition of tonal languages.
Proceedings of the INTERSPEECH 2006, 2006

2005
Modifying Spectral Envelope to Synthetically Adjust Voice Quality and Articulation Parameters for Emotional Speech Synthesis.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

2002
Sharpe Ratio-Oriented Active Trading: A Learning Approach.
Proceedings of the MICAI 2002: Advances in Artificial Intelligence, 2002

2001
Robust Speech Recognition Method Based on Discriminative Environment Feature Extraction.
J. Comput. Sci. Technol., 2001

2000
An environment model-based robust speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Robust telephone speech recognition based on channel compensation.
Pattern Recognit., 1999

1998
Discriminative learning of additive noise and channel distortions for robust speech recognition.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Relative mel-frequency cepstral coefficients compensation for robust telephone speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997


  Loading...