Jianwu Dang

Orcid: 0000-0002-9237-4821

Affiliations:
  • Tianjin University, Tianjin Key Laboratory of Cognitive Computing and Application, College of Intelligence and Computing, China
  • Institute of Communication Parlee, ICP, Center of National Research Scientific, France (2002-2003)
  • Japan Advanced Institute of Science and Technology, JAIST, Japan
  • Shizuoka University, Japan (PhD 1992)


According to our database1, Jianwu Dang authored at least 302 papers between 1994 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Disordered speech recognition considering low resources and abnormal articulation.
Speech Commun., November, 2023

TMS: Temporal multi-scale in time-delay neural network for speaker verification.
Appl. Intell., November, 2023

Meta-Generalization for Domain-Invariant Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

CFDRN: A Cognition-Inspired Feature Decomposition and Recombination Network for Dysarthric Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

A CIF-Based Speech Segmentation Method for Streaming E2E ASR.
IEEE Signal Process. Lett., 2023

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations.
CoRR, 2023

A Refining Underlying Information Framework for Monaural Speech Enhancement.
CoRR, 2023

High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models.
CoRR, 2023

Learning Speech Representation From Contrastive Token-Acoustic Pretraining.
CoRR, 2023

Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding.
CoRR, 2023

Rethinking the visual cues in audio-visual speaker extraction.
CoRR, 2023

Local and Global Context Modeling with Relation Matching Task for Dialog Act Recognition.
Proceedings of the International Joint Conference on Neural Networks, 2023

Commonsense Knowledge Enhanced Sentiment Dependency Graph for Sarcasm Detection.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Multi-Modal Sarcasm Detection Based on Cross-Modal Composition of Inscribed Entity Relations.
Proceedings of the 35th IEEE International Conference on Tools with Artificial Intelligence, 2023

Noise-Disentanglement Metric Learning for Robust Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2023

Time-Domain Speech Enhancement Assisted by Multi-Resolution Frequency Encoder and Decoder.
Proceedings of the IEEE International Conference on Acoustics, 2023

Speech and Noise Dual-Stream Spectrogram Refine Network With Speech Distortion Loss For Robust Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Cross-Modal Audio-Visual Co-Learning for Text-Independent Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2023

Leveraging Positional-Related Local-Global Dependency for Synthetic Speech Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

VF-Taco2: Towards Fast and Lightweight Synthesis for Autoregressive Models with Variation Autoencoder and Feature Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Brain Network Features Differentiate Intentions from Different Emotional Expressions of the Same Text.
Proceedings of the IEEE International Conference on Acoustics, 2023

Self-Supervised Audio-Visual Speaker Representation with Co-Meta Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

Augmenting Affective Dependency Graph via Iterative Incongruity Graph Learning for Sarcasm Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Intrinsic Representation Mining for Zero-Shot Slot Filling.
IEICE Trans. Inf. Syst., November, 2022

Constructing Accurate and Efficient Deep Spiking Neural Networks With Double-Threshold and Augmented Schemes.
IEEE Trans. Neural Networks Learn. Syst., 2022

Toward Efficient Processing and Learning With Spikes: New Approaches for Multispike Learning.
IEEE Trans. Cybern., 2022

One-shot emotional voice conversion based on feature separation.
Speech Commun., 2022

Learning affective representations based on magnitude and dynamic relative phase information for speech emotion recognition.
Speech Commun., 2022

Emotion Recognition With Multimodal Transformer Fusion Framework Based on Acoustic and Lexical Information.
IEEE Multim., 2022

Context- and Knowledge-Aware Graph Convolutional Network for Multimodal Emotion Recognition.
IEEE Multim., 2022

Detection of Brain Network Communities During Natural Speech Comprehension From Functionally Aligned EEG Sources.
Frontiers Comput. Neurosci., 2022

Improving low-resource Tibetan end-to-end ASR by multilingual and multilevel unit modeling.
EURASIP J. Audio Speech Music. Process., 2022

MIMO-DBnet: Multi-channel Input and Multiple Outputs DOA-aware Beamforming Network for Speech Separation.
CoRR, 2022

Monolingual Recognizers Fusion for Code-switching Speech Recognition.
CoRR, 2022

Heterogeneous Graph Neural Networks using Self-supervised Reciprocally Contrastive Learning.
CoRR, 2022

TMS: A Temporal Multi-scale Backbone Design for Speaker Embedding.
CoRR, 2022

Community Detection in Social Networks Considering Social Behaviors.
IEEE Access, 2022

Deep Spectro-temporal Artifacts for Detecting Synthesized Speech.
Proceedings of the DDAM@MM 2022: Proceedings of the 1st International Workshop on Deepfake Detection for Audio Multimedia, 2022

Reconstruction of speech spectrogram based on non-invasive EEG signal.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Adaptive Attention Network with Domain Adversarial Training for Multi-Accent Speech Recognition.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Dialogue scenario classification based on social factors.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Deep Multi-task Cascaded Acoustic Echo Cancellation and Noise Suppression.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources.
Proceedings of the Interspeech 2022, 2022

Self-Distillation Based on High-level Information Supervision for Compressing End-to-End ASR Model.
Proceedings of the Interspeech 2022, 2022

Hierarchical Tagger with Multi-task Learning for Cross-domain Slot Filling.
Proceedings of the Interspeech 2022, 2022

TopicKS: Topic-driven Knowledge Selection for Knowledge-grounded Dialogue Generation.
Proceedings of the Interspeech 2022, 2022

Language-specific Characteristic Assistance for Code-switching Speech Recognition.
Proceedings of the Interspeech 2022, 2022

Monaural Speech Enhancement Based on Spectrogram Decomposition for Convolutional Neural Network-sensitive Feature Extraction.
Proceedings of the Interspeech 2022, 2022

Finer-grained Modeling units-based Meta-Learning for Low-resource Tibetan Speech Recognition.
Proceedings of the Interspeech 2022, 2022

Global Signal-to-noise Ratio Estimation Based on Multi-subband Processing Using Convolutional Neural Network.
Proceedings of the Interspeech 2022, 2022

VCSE: Time-Domain Visual-Contextual Speaker Extraction Network.
Proceedings of the Interspeech 2022, 2022

Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection.
Proceedings of the Interspeech 2022, 2022

Improve emotional speech synthesis quality by learning explicit and implicit representations with semi-supervised training.
Proceedings of the Interspeech 2022, 2022

Iterative Sound Source Localization for Unknown Number of Sources.
Proceedings of the Interspeech 2022, 2022

Dual-stream Speech Dereverberation Network Using Long-term and Short-term Cues.
Proceedings of the International Joint Conference on Neural Networks, 2022

An Improved Stimulus Reconstruction Method for EEG-Based Short-Time Auditory Attention Detection.
Proceedings of the Neural Information Processing - 29th International Conference, 2022

Improving Dialogue Generation via Proactively Querying Grounded Knowledge.
Proceedings of the IEEE International Conference on Acoustics, 2022

Learning Domain-Invariant Transformation for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2022

Multi-Stage Graph Representation Learning for Dialogue-Level Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Cache: Modeling Contribution-Aware Context Hierarchically for Long-Range Dialogue State Tracking.
Proceedings of the IEEE International Conference on Acoustics, 2022

Compressing Transformer-Based ASR Model by Task-Driven Loss and Attention-Based Multi-Level Feature Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2022

Using Multiple Reference Audios and Style Embedding Constraints for Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2022

L-SpEx: Localized Target Speaker Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2022

Domain-Invariant Feature Learning for Cross Corpus Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Relationship Between Speakers' Physiological Structure and Acoustic Speech Signals: Data-Driven Study Based on Frequency-Wise Attentional Neural Network.
Proceedings of the 30th European Signal Processing Conference, 2022

2021
Robust Environmental Sound Recognition With Sparse Key-Point Encoding and Efficient Multispike Learning.
IEEE Trans. Neural Networks Learn. Syst., 2021

Robust Detection of Link Communities With Summary Description in Social Networks.
IEEE Trans. Knowl. Data Eng., 2021

A Tibetan Language Model That Considers the Relationship Between Suffixes and Functional Words.
IEEE Signal Process. Lett., 2021

Efficient learning with augmented spikes: A case study with image classification.
Neural Networks, 2021

Multi-resolution modulation-filtered cochleagram feature for LSTM-based dimensional emotion recognition from speech.
Neural Networks, 2021

Replay attack detection using variable-frequency resolution phase and magnitude features.
Comput. Speech Lang., 2021

Using multiple reference audios and style embedding constraints for speech synthesis.
CoRR, 2021

Exploring Deep Learning for Joint Audio-Visual Lip Biometrics.
CoRR, 2021

Exploiting Explicit and Inferred Implicit Personas for Multi-turn Dialogue Generation.
Proceedings of the Natural Language Processing and Chinese Computing, 2021

A Sentiment Similarity-Oriented Attention Model with Multi-task Learning for Text-Based Emotion Recognition.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021

Dialogue Act Recognition using Branch Architecture with Attention Mechanism for Imbalanced Data.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Spoken Language Understanding with Sememe Knowledge as Domain Knowledge.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

An Eye-tracking Study of Transposed-letter Effect in English Word Recognition by Mandarin Speakers.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Frequency-specific Brain Network Dynamics during Perceiving Real Words and Pseudowords.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Order-aware Pairwise Intoxication Detection.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Joint Feature Enhancement and Speaker Recognition with Multi-Objective Task-Oriented Network.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Domain-Specific Multi-Agent Dialog Policy Learning in Multi-Domain Task-Oriented Scenarios.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Time-Frequency Representation Learning with Graph Convolutional Network for Dialogue-Level Speech Emotion Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

TacoLPCNet: Fast and Stable TTS by Conditioning LPCNet on Mel Spectrogram Predictions.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Metric Learning Based Feature Representation with Gated Fusion Model for Speech Emotion Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Multi-Modal Emotion Recognition Based On deep Learning Of EEG And Audio Signals.
Proceedings of the International Joint Conference on Neural Networks, 2021

Simultaneous Progressive Filtering-Based Monaural Speech Enhancement.
Proceedings of the Neural Information Processing - 28th International Conference, 2021

Speech Dereverberation Based on Scale-Aware Mean Square Error Loss.
Proceedings of the Neural Information Processing - 28th International Conference, 2021

Exploring Effective Speech Representation via ASR for High-Quality End-to-End Multispeaker TTS.
Proceedings of the Neural Information Processing - 28th International Conference, 2021

Semantic and Acoustic-Prosodic Entrainment of Dialogues in Service Scenarios.
Proceedings of the ICMI '21 Companion: Companion Publication of the 2021 International Conference on Multimodal Interaction, Montreal, QC, Canada, October 18, 2021

CONSK-GCN: Conversational Semantic- and Knowledge-Oriented Graph Convolutional Network for Multimodal Emotion Recognition.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Meta-Learning for Cross-Channel Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2021

Replay-Attack Detection Using Features With Adaptive Spectro-Temporal Resolution.
Proceedings of the IEEE International Conference on Acoustics, 2021

Multimodal Emotion Recognition with Capsule Graph Convolutional Based Representation Fusion.
Proceedings of the IEEE International Conference on Acoustics, 2021

Robust Voice Activity Detection Using a Masked Auditory Encoder Based Convolutional Neural Network.
Proceedings of the IEEE International Conference on Acoustics, 2021

Representation Learning with Spectro-Temporal-Channel Attention for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Improving Naturalness and Controllability of Sequence-to-Sequence Speech Synthesis by Learning Local Prosody Representations.
Proceedings of the IEEE International Conference on Acoustics, 2021

Multi-Stage Speaker Extraction with Utterance and Frame-Level Reference Signals.
Proceedings of the IEEE International Conference on Acoustics, 2021

Domain-Adversarial Autoencoder with Attention Based Feature Level Fusion for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

DeepLip: A Benchmark for Deep Learning-Based Audio-Visual Lip Biometrics.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Learning Language and Speaker Information for Code-Switch Speech Synthesis with Limited Data.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Spectrograms Fusion-based End-to-end Robust Automatic Speech Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Study on Simultaneous Estimation of Glottal Source and Vocal Tract Parameters by ARMAX-LF Model for Speech Analysis/Synthesis.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Hierarchical Prosody Analysis Improves Categorical and Dimensional Emotion Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Zero-shot Domain Adaptation with Inference Relation Paths for Spoken Language Understanding.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Towards Efficient Processing and Learning with Spikes: New Approaches for Multi-Spike Learning.
CoRR, 2020

Speech Emotion Recognition Using 3D Convolutions and Attention-Based Sliding Recurrent Networks With Auditory Front-Ends.
IEEE Access, 2020

Relation Modeling with Graph Convolutional Networks for Facial Action Unit Detection.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Speaker-Aware Speech Emotion Recognition by Fusing Amplitude and Phase Information.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Neural Entrainment to Natural Speech Envelope Based on Subject Aligned EEG Signals.
Proceedings of the Interspeech 2020, 2020

Dynamic Margin Softmax Loss for Speaker Verification.
Proceedings of the Interspeech 2020, 2020

Cortical Oscillatory Hierarchy for Natural Sentence Processing.
Proceedings of the Interspeech 2020, 2020

EEG-Based Short-Time Auditory Attention Detection Using Multi-Task Deep Learning.
Proceedings of the Interspeech 2020, 2020

Singing Voice Extraction with Attention-Based Spectrograms Fusion.
Proceedings of the Interspeech 2020, 2020

Dimensional Emotion Prediction Based on Interactive Context in Conversation.
Proceedings of the Interspeech 2020, 2020

Temporal Attention Convolutional Network for Speech Emotion Recognition with Latent Representation.
Proceedings of the Interspeech 2020, 2020

Staged Knowledge Distillation for End-to-End Dysarthric Speech Recognition and Speech Attribute Transcription.
Proceedings of the Interspeech 2020, 2020

Segment-Level Effects of Gender, Nationality and Emotion Information on Text-Independent Speaker Verification.
Proceedings of the Interspeech 2020, 2020

SpEx+: A Complete Time Domain Speaker Extraction Network.
Proceedings of the Interspeech 2020, 2020

Deep Discriminative Embedding with Ranked Weight for Speaker Verification.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

Hierarchical Interactive Matching Network for Multi-turn Response Selection in Retrieval-Based Chatbots.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

Adversarial Shared-Private Attention Network for Joint Slot Filling and Intent Detection.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

Investigation of Effectively Synthesizing Code-Switched Speech Using Highly Imbalanced Mix-Lingual Data.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

Integrating Group Homophily and Individual Personality of Topics Can Better Model Network Communities.
Proceedings of the 20th IEEE International Conference on Data Mining, 2020

Amplitude Consistent Enhancement for Speech Dereverberation.
Proceedings of the ICCAI '20: 2020 6th International Conference on Computing and Artificial Intelligence, 2020

A Hierarchical Model for Dialog Act Recognition Considering Acoustic and Lexical Context Information.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Spectrograms Fusion with Minimum Difference Masks Estimation for Monaural Speech Dereverberation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Speech Emotion Recognition with Local-Global Aware Deep Representation Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

End-to-End Articulatory Modeling for Dysarthric Articulatory Attribute Detection.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Temporal-Spatial-Spectral Investigation of Brain Network Dynamics in Human Speech Perception.
Proceedings of the Brain Informatics - 13th International Conference, 2020

A Multi-subject Temporal-spatial Hyper-alignment Method for EEG-based Neural Entrainment to Speech.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

A Pitch-aware Speaker Extraction Serial Network.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Topic Enhanced Sentiment Spreading Model in Social Networks Considering User Interest.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Working Memory-Driven Neural Networks with a Novel Knowledge Enhancement Paradigm for Implicit Discourse Relation Recognition.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Multi-kernel SVM based depression recognition using social media data.
Int. J. Mach. Learn. Cybern., 2019

Story co-segmentation of Chinese broadcast news using weakly-supervised semantic similarity.
Neurocomputing, 2019

Scalable Community Identification with Manifold Learning on Speaker I-Vector Space.
IEICE Trans. Inf. Syst., 2019

Combination of links and node contents for community discovery using a graph regularization approach.
Future Gener. Comput. Syst., 2019

Replay attack detection with auditory filter-based relative phase features.
EURASIP J. Audio Speech Music. Process., 2019

Robust Environmental Sound Recognition with Sparse Key-point Encoding and Efficient Multi-spike Learning.
CoRR, 2019

Morphological Verb-Aware Tibetan Language Model.
IEEE Access, 2019

Exploration of Complementary Features for Speech Emotion Recognition Based on Kernel Extreme Learning Machine.
IEEE Access, 2019

Implicit Discourse Relation Recognition via a BiLSTM-CNN Architecture With Dynamic Chunk-Based Max Pooling.
IEEE Access, 2019

An integrated system for robust gender classification with convolutional restricted Boltzmann machine and spiking neural network.
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2019

A Matching Pursuit Approach for Image Classification with Spiking Neural Networks.
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2019

CNN-BLSTM Based Question Detection from Dialogs Considering Phase and Context Information.
Proceedings of the Interspeech 2019, 2019

Environment-Dependent Attention-Driven Recurrent Convolutional Neural Network for Robust Speech Enhancement.
Proceedings of the Interspeech 2019, 2019

Acoustic and Articulatory Study of Ewe Vowels: A Comparative Study of Male and Female.
Proceedings of the Interspeech 2019, 2019

A Spiking Neural Network with Distributed Keypoint Encoding for Robust Sound Recognition.
Proceedings of the International Joint Conference on Neural Networks, 2019

Time-Frequency Deep Representation Learning for Speech Emotion Recognition Integrating Self-attention.
Proceedings of the Neural Information Processing - 26th International Conference, 2019

A Fast Convolutional Self-attention Based Speech Dereverberation Method for Robust Speech Recognition.
Proceedings of the Neural Information Processing - 26th International Conference, 2019

Interaction Process Label Recognition in Group Discussion.
Proceedings of the International Conference on Multimodal Interaction, 2019

Lingual and Acoustic Differences in EWE Oral and Nasal Vowels.
Proceedings of the 2019 3rd International Conference on Digital Signal Processing, 2019

NVSRN: A Neural Variational Scaling Reasoning Network for Initiative Response Generation.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Replay Attack Detection Using Magnitude and Phase Information with Attention-based Adaptive Filters.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Multi-spike Approach for Robust Sound Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Robust Sound Event Classification with Local Time-Frequency Information and Convolutional Neural Networks.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2019: Text and Time Series, 2019

Emotional Contagion-Based Social Sentiment Mining in Social Networks by Introducing Network Communities.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Dimensional Emotion Recognition from Speech Using Modulation Spectral Features and Recurrent Neural Networks.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Effective Training End-to-End ASR systems for Low-resource Lhasa Dialect of Tibetan Language.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Investigation of speech-planning mechanism based on eye movement and EEG.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Acoustic Attributes of Citation Tones in Standard Chinese Produced by Prelingually Deaf Adults.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Community Detection in Social Networks Considering Topic Correlations.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Watermarking Based on Compressive Sensing for Digital Speech Detection and Recovery.
Sensors, 2018

Phase and reverberation aware DNN for distant-talking speech enhancement.
Multim. Tools Appl., 2018

Unsupervised measure of Chinese lexical semantic similarity using correlated graph model for news story segmentation.
Neurocomputing, 2018

Incorporating network structure with node contents for community detection on large networks using deep learning.
Neurocomputing, 2018

Autoencoder Based Community Detection with Adaptive Integration of Network Topology and Node Contents.
Proceedings of the Knowledge Science, Engineering and Management, 2018

Investigation of the Comprehension Process during Silent Reading based on Eye Movements.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Manifold-based incremental community detection method for online speaker identification.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Replay Attacks Detection Using Phase and Magnitude Features with Various Frequency Resolutions.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Distant-talking Speech Recognition Based on Multi-objective Learning using Phase and Magnitude-based Feature.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Pitch Synchronized Relative Phase with Peak Error Detection For Noise-robust Speaker Recognition.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Revealing Spatiotemporal Brain Dynamics of Speech Production Based on EEG and Eye Movement.
Proceedings of the Interspeech 2018, 2018

Multiple Phase Information Combination for Replay Attacks Detection.
Proceedings of the Interspeech 2018, 2018

Speech Emotion Recognition by Combining Amplitude and Phase Information Using Convolutional Neural Network.
Proceedings of the Interspeech 2018, 2018

Convolutional Neural Network with Spectrogram and Perceptual Features for Speech Emotion Recognition.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

Efficient Multi-spike Learning with Tempotron-Like LTP and PSD-Like LTD.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

Auditory-Inspired End-to-End Speech Emotion Recognition Using 3D Convolutional Recurrent Neural Networks Based on Spectral-Temporal Representation.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

A Feature Fusion Method Based on Extreme Learning Machine for Speech Emotion Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Gender-Aware CNN-BLSTM for Speech Emotion Recognition.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2018, 2018

Interaction-Aware Topic Model for Microblog Conversations through Network Embedding and User Attention.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Implicit Discourse Relation Recognition using Neural Tensor Network with Interactive Attention and Sparse Learning.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Robust Detection of Link Communities in Large Social Networks by Exploiting Link Semantics.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Twitter summarization with social-temporal context.
World Wide Web, 2017

Multimodal sensory fusion for soccer robot self-localization based on long short-term memory recurrent neural network.
J. Ambient Intell. Humaniz. Comput., 2017

Simulation of heat conduction in fluids on GPU with particle method.
Comput. Syst. Sci. Eng., 2017

Interactions Between Modal and Amodal Semantic Areas in Spoken Word Comprehension.
Proceedings of the Studies on Speech Production - 11th International Seminar, 2017

Prediction of F0 Based on Articulatory Features Using DNN.
Proceedings of the Studies on Speech Production - 11th International Seminar, 2017

A Hybrid Method for Acoustic Analysis of the Vocal Tract During Vowel Production.
Proceedings of the Studies on Speech Production - 11th International Seminar, 2017

Particle Interaction Adaptivity and Absorbing Boundary Conditions in the Lagrangian Particle Aeroacoustic Model.
Proceedings of the Studies on Speech Production - 11th International Seminar, 2017

Global Monitoring of Dynamic Functional Interactions in the Brain During Chinese Verbs Perception.
Proceedings of the Studies on Speech Production - 11th International Seminar, 2017

Investigation of Speech-Planning Mechanism Based on Eye Movement.
Proceedings of the Studies on Speech Production - 11th International Seminar, 2017

Speech Emotion Recognition Considering Local Dynamic Features.
Proceedings of the Studies on Speech Production - 11th International Seminar, 2017

A Neuro-Experimental Evidence for the Motor Theory of Speech Perception.
Proceedings of the Interspeech 2017, 2017

Identification of Generalized Communities with Semantics in Networks with Content.
Proceedings of the 29th IEEE International Conference on Tools with Artificial Intelligence, 2017

Phonemic Restoration Based on the Movement Continuity of Articulation.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

Neuronal Classifier for both Rate and Timing-Based Spike Patterns.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

Exploiting the Tibetan Radicals in Recurrent Neural Network for Low-Resource Language Models.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

Stochastic Sequential Minimal Optimization for Large-Scale Linear SVM.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

Phase aware deep neural network for noise robust voice activity detection.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Pseudo-pitch-synchronized phase information extraction and its application for robust speaker recognition.
Proceedings of the IEEE 6th Global Conference on Consumer Electronics, 2017

Speech emotion recognition using multichannel parallel convolutional recurrent neural networks based on gammatone auditory filterbank.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

The acoustic characteristics of tone 3 in standard chinese produced by prelingually deaf adults.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

A study of high level tone in standard chinese produced by prelingually deaf adults.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
A Novel Method for Constructing 3D Geometric Articulatory Models.
J. Signal Process. Syst., 2016

Multi-modal recording and modeling of vocal tract movements.
Multim. Tools Appl., 2016

Mapping ultrasound-based articulatory images and vowel sounds with a deep neural network framework.
Multim. Tools Appl., 2016

Sketch4Image: a novel framework for sketch-based image retrieval based on product quantization with coding residuals.
Multim. Tools Appl., 2016

Audio-visual speech recognition integrating 3D lip information obtained from the Kinect.
Multim. Syst., 2016

A Study on Detection and Recovery of Speech Signal Tampering.
Proceedings of the 2016 IEEE Trustcom/BigDataSE/ISPA, 2016

Tongue performance in articulating Mandarin apical syllables by prelingual deaf adults using ultrasonic technology: Two case studies.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

EEG evidence for a three-phase recurrent process during spoken word processing.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Multi-channel feature adaptation for robust speech recognition.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Spatial co-variation of lip and tongue at strong and weak syllables.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Relationship between perception and production of English vowels by Chinese English learners.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Investigation of the spatiotemporal dynamics of the brain during perceiving words.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Voice activity detection based on sequential Gaussian mixture model with maximum likelihood criterion.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Exploring tonal information for Lhasa dialect acoustic modeling.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

The singing voice before and after vocal warm-up by students of Chinese national singing.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

A New Model for Acoustic Wave Propagation and Scattering in the Vocal Tract.
Proceedings of the Interspeech 2016, 2016

Effects of Subglottal-Coupling and Interdental-Space on Formant Trajectories During Front-to-Back Vowel Transitions in Chinese.
Proceedings of the Interspeech 2016, 2016

Investigations into vowel and consonant structures in articulatory and auditory spaces using Laplacian eigenmaps.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

The MFCC Vowel Space of [ɤ] in Grammatical and Lexical Word in Standard Chinese.
Proceedings of the Chinese Lexical Semantics - 17th Workshop, 2016

Investigation of noun-verb dissociation based on EEG source reconstruction.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Tibetan vowel analysis with a multi-modal Mandarin-Tibetan speech corpus.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

A new method of acceleration measurement for observing tongue movement in ultrasound image during speech production.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Mandarin citation tone patterns of prelingual Chinese deaf adults.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Detect Overlapping Communities via Ranking Node Popularities.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Strength of syllabic influences on articulation in Mandarin Chinese and French: Insights from a motor control approach.
J. Phonetics, 2015

An empirical study of phonetic transfer in English monophthong learning by Tibetan (Lhasa) speakers.
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

An articulatory analysis of apical syllables in Standard Chinese.
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

Community detection with manifold learning on speaker i-vector space for Chinese.
Proceedings of the INTERSPEECH 2015, 2015

Measuring oral and nasal airflow in production of Chinese plosive.
Proceedings of the INTERSPEECH 2015, 2015

Combined cine- and tagged-MRI for tracking landmarks on the tongue surface.
Proceedings of the INTERSPEECH 2015, 2015

Perception of Mandarin tones by native tibetan speakers.
Proceedings of the INTERSPEECH 2015, 2015

The perception of English vowel contrasts by Chinese EFL learners and native English speakers.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015

A lip protrusion mechanism examined by magnetic resonance imaging and finite element modeling.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015

Vocal responses to frequency modulated composite sinewaves via auditory and vibrotactile pathways.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Vowel normalization by articulatory information.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Chinese opera genre classification based on multi-feature fusion and extreme learning machine.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Influences of auditory and vibrotactile information on vocal F0 responses.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Investigation of learning trajectory of Mandarin for Tibetan speakers.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Investigation of relation between speech perception and production based on EEG source reconstruction.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Detection of speaker individual information using a phoneme effect suppression method.
Speech Commun., 2014

Image decomposing for inpainting using compressed sensing in DCT domain.
Frontiers Comput. Sci., 2014

Mapping between ultrasound and vowel speech using DNN framework.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Visualization of mandarin articulation driven by ultrasound data.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Investigation on articulatory and acoustic characteristics of dysarthria.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Automatic speech recognition under robot ego noises.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Locality Preserving Hashing Method for Image Retrieval.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

Acoustic features of Mandarin monophthongs by Tibetan speakers.
Proceedings of the 2014 International Conference on Asian Language Processing, 2014

An acoustic analysis of English monophthongs by Tibetan speakers.
Proceedings of the 2014 International Conference on Asian Language Processing, 2014

2013
An MRI-based acoustic study of Mandarin vowels.
Proceedings of the INTERSPEECH 2013, 2013

An anisotropic diffusion filter based on multidirectional separability.
Proceedings of the INTERSPEECH 2013, 2013

Dialog Act classification in Chinese spoken language.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2013

Emotional McGurk Effect? A Cross-Cultural Investigation on Emotion Expression under Vocal and Facial Conflict.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2013

Individual variation of morphological and acoustic effects of the nasal tract.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

Morphological personalization of a physiological articulatory model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Morphological normalization: A study of vowels for Mandarin and Japanese.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Emotional intonation modeling: A cross-language study on Chinese and Japanese.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Visualization of Mandarin articulation by using a physiological articulatory model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

A neural understanding of speech motor learning.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Efficient feature extraction of speaker identification using phoneme mean F-ratio for Chinese.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Detailed morphological analysis of mandarin sustained steady vowels.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Reconstruction of vocal tract based on multi-source image information.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Tongue shape synthesis based on Active Shape Model.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

More targets? Simulating emotional intonation of mandarin with PENTA.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Acoustic and articulatory analysis on Japanese vowels in emotional speech.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

A method of speaker identification based on phoneme mean F-ratio contribution.
Proceedings of the INTERSPEECH 2012, 2012

Noise estimation using a constrained sequential HMM IN log-spectral domain.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Mandarin vowel synthesis based on 2D and 3D vocal tract model by finite-difference time-domain method.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

An anisotropic diffusion filter for reducing speckle noise of ultrasound images based on separability.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

An investigation of dependencies between frequency components and speaker characteristics based on phoneme mean F-ratio contribution.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Voice Activity Detection Based on an Unsupervised Learning Framework.
IEEE ACM Trans. Audio Speech Lang. Process., 2011

Emotional Intonation in a Tone Language: Experimental Evidence from Chinese.
Proceedings of the 17th International Congress of Phonetic Sciences, 2011

2010
Vowel Production Manifold: Intrinsic Factor Analysis of Vowel Articulation.
IEEE Trans. Speech Audio Process., 2010

Investigation of muscle activation in speech production based on an articulatory model.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Acoustic and articulatory analysis on Mandarin Chinese vowels in emotional speech.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Morphological normalization of vocal tract shape.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Speech Enhancement Based on Noise Eigenspace Projection.
IEICE Trans. Inf. Syst., 2009

Feedforward control of a 3d physiological articulatory model for vowel production.
Proceedings of the INTERSPEECH 2009, 2009

2008
An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification.
Speech Commun., 2008

A model based investigation of activation patterns of the tongue muscles for vowel production.
Proceedings of the INTERSPEECH 2008, 2008

2007
A Model-Based Learning Process for Modeling Coarticulation of Human Speech.
IEICE Trans. Inf. Syst., 2007

Dimension reduction for speaker identification based on mutual information.
Proceedings of the INTERSPEECH 2007, 2007

Physiological Feature Extraction for Text Independent Speaker Identification using Non-Uniform Subband Processing.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Communication Between Speech Production and Perception Within the Brain-Observation and Simulation.
J. Comput. Sci. Technol., 2006

A Robust Voice Activity Detection Based on Noise Eigenspace Projection.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Auditory Contrast Spectrum for Robust Speech Recognition.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Speech Synthesis Based on a Physiological Articulatory Model.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

A simulation based parameter optimization for a coarticulation model.
Proceedings of the INTERSPEECH 2006, 2006

2005
Investigation and modeling of coarticulation during speech.
Proceedings of the INTERSPEECH 2005, 2005

2004
Investigation and modeling of coarticulation in speech production.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

2003
Consideration of muscle co-contraction in a physiological articulatory model.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Estimation of vocal tract shapes from speech sounds with a physiological articulatory model.
J. Phonetics, 2002

Investigation of coarticulation based on electromagnetic articulographic data.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2000
Improvement of a physiological articulatory model for synthesis of vowel sequences.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1998
Speech production of vowel sequences using a physiological articulatory model.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1996
An improved vocal tract model of vowel production implementing piriform resonance and transvelar nasal coupling.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1994
A physiological model of speech production and the implication of tongue-larynx interaction.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Investigation of the acoustic characteristics of the velum for vowels.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994


  Loading...