Jia Jia

According to our database1, Jia Jia authored at least 92 papers between 2005 and 2019.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2019
Inferring Emotions From Large-Scale Internet Voice Data.
IEEE Trans. Multimedia, 2019

Towards Discriminative Representation Learning for Speech Emotion Recognition.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Modality Attention for End-to-end Audio-visual Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Compact Framework for Voice Conversion Using Wavenet Conditioned on Phonetic Posteriorgrams.
Proceedings of the IEEE International Conference on Acoustics, 2019

Dilated Residual Network with Multi-head Self-attention for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Learning Discriminative Features from Spectrograms Using Center Loss for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Analyzing and Predicting Emoji Usages in Social Media.
Proceedings of the Companion of the The Web Conference 2018 on The Web Conference 2018, 2018

Inferring User Emotive State Changes in Realistic Human-Computer Conversational Dialogs.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Emphasis Detection for Voice Dialogue Applications Using Multi-channel Convolutional Bidirectional Long Short-Term Memory Network.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Speech Super-Resolution Using Parallel WaveNet.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Emotion Recognition from Variable-Length Speech Segments Using Deep Learning on Spectrograms.
Proceedings of the Interspeech 2018, 2018

Cross-Domain Depression Detection via Harvesting Social Media.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Emphatic Speech Generation with Conditioned Input Layer and Bidirectional LSTMS for Expressive Speech Synthesis.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Inferring Emotion from Conversational Voice Data: A Semi-Supervised Multi-Path Generative Neural Network Approach.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Inferring Emotional Tags From Social Images With User Demographics.
IEEE Trans. Multimedia, 2017

Detecting Stress Based on Social Interactions in Social Networks.
IEEE Trans. Knowl. Data Eng., 2017

Analyzing and Identifying Teens' Stressful Periods and Stressor Events From a Microblog.
IEEE J. Biomedical and Health Informatics, 2017

Multi-scale Context Based Attention for Dynamic Music Emotion Prediction.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

PIC2DISH: A Customized Cooking Assistant System.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Speech Emotion Recognition with Emotion-Pair Based Framework Considering Emotion Distribution Information in Dimensional Emotion Space.
Proceedings of the Interspeech 2017, 2017

Depression Detection via Harvesting Social Media: A Multimodal Dictionary Learning Solution.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Inferring emotions from heterogeneous social media data: A Cross-media Auto-Encoder solution.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A systematic approach to compute perceptual distribution of monosyllables.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Learning cross-lingual knowledge with multilingual BLSTM for emphasis detection with limited training data.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Multi-Task Deep Learning for User Intention Understanding in Speech Interaction Systems.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Towards Better Understanding the Clothing Fashion Styles: A Multimodal Deep Learning Approach.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

SenseRun: Real-Time Running Routes Recommendation towards Providing Pleasant Running Experiences.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

A Virtual Personal Fashion Consultant: Learning from the Personal Preference of Fashion.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Learning robust uniform features for cross-media social data by using cross autoencoders.
Knowl.-Based Syst., 2016

A systematic exploration of the micro-blog feature space for teens stress detection.
Health Inf. Sci. Syst., 2016

Analysis of Teens' Chronic Stress on Micro-blog.
Proceedings of the Web Information Systems Engineering - WISE 2016, 2016

Magic Mirror: A Virtual Fashion Consultant.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

THear: Development of a mobile multimodal audiometry application on a cross-platform framework.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Expressive Speech Driven Talking Avatar Synthesis with DBLSTM Using Limited Amount of Emotional Bimodal Data.
Proceedings of the Interspeech 2016, 2016

Phoneme Embedding and its Application to Speech Driven Talking Avatar Synthesis.
Proceedings of the Interspeech 2016, 2016

What Does Social Media Say about Your Stress?.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Inferring users' emotions for human-mobile voice dialogue applications.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Low level descriptors based DBLSTM bottleneck feature for speech driven talking avatar.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Social Role-Aware Emotion Contagion in Image Social Networks.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Moodee: An Intelligent Mobile Companion for Sensing Your Stress from Your Social Media Postings.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Learning to Appreciate the Aesthetic Effects of Clothing.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Modeling Emotion Influence in Image Social Networks.
IEEE Trans. Affective Computing, 2015

Expressive talking avatar synthesis and animation.
Multimedia Tools Appl., 2015

Generating emphatic speech with hidden Markov model for expressive speech synthesis.
Multimedia Tools Appl., 2015

Using tilt for automatic emphasis detection with Bayesian networks.
Proceedings of the INTERSPEECH 2015, 2015

Teenagers' Stress Detection Based on Time-Sensitive Micro-blog Comment/Response Actions.
Proceedings of the Artificial Intelligence in Theory and Practice IV, 2015

Release Adolescent Stress by Virtual Chatting.
Proceedings of the Engineering the Web in the Big Data Era - 15th International Conference, 2015

MPHA: A Personal Hearing Doctor Based on Mobile Devices.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Understanding the emotions behind social images: Inferring with user demographics.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

HMM-based emphatic speech synthesis for corrective feedback in computer-aided pronunciation training.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

TeenChat: A Chatterbot System for Sensing and Releasing Adolescents' Stress.
Proceedings of the Health Information Science - 4th International Conference, 2015

Understanding speaking styles of internet speech data with LSTM and low-resource training.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014
Synthesizing English emphatic speech for multimodal corrective feedback in computer-aided pronunciation training.
Multimedia Tools Appl., 2014

Head and facial gestures synthesis using PAD model for an expressive talking avatar.
Multimedia Tools Appl., 2014

Grading the Severity of Mispronunciations in CAPT Based on Statistical Analysis and Computational Speech Perception.
J. Comput. Sci. Technol., 2014

A computational cognition model of perception, memory, and judgment.
SCIENCE CHINA Information Sciences, 2014

Inferring Emotions from Social Images Leveraging Influence Analysis.
Proceedings of the Social Media Processing - Third National Conference, 2014

Learning to Infer Public Emotions from Large-Scale Networked Voice Data.
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014

User-level psychological stress detection from social media using deep neural network.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Automatic speech data clustering with human perception based weighted distance.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Algorithm of pure tone audiometry based on multiple judgment.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Using conditional random fields to predict focus word pair in spontaneous spoken English.
Proceedings of the INTERSPEECH 2014, 2014

Acoustics, content and geo-information based sentiment prediction from large-scale networked voice data.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Psychological stress detection from cross-media microblog data using Deep Sparse Neural Network.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Helping Teenagers Relieve Psychological Pressures: A Micro-blog Based System.
Proceedings of the 17th International Conference on Extending Database Technology, 2014

How Do Your Friends on Social Media Disclose Your Emotions?
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Affective image adjustment with a single word.
The Visual Computer, 2013

WeCard: a multimodal solution for making personalized electronic greeting cards.
Proceedings of the ACM Multimedia Conference, 2013

SNR estimation for clipped audio based on amplitude distribution.
Proceedings of the Ninth International Conference on Natural Computation, 2013

Interpretable aesthetic features for affective image classification.
Proceedings of the IEEE International Conference on Image Processing, 2013

TalkingAndroid: An interactive, multimodal and real-time talking avatar application on mobile phones.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Comparing feature dimension reduction algorithms for GMM-SVM based speech emotion recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Affective Image Colorization.
J. Comput. Sci. Technol., 2012

Comparison of adaptation methods for GMM-SVM based speech emotion recognition.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Understanding the emotional impact of images.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Can we understand van gogh's mood?: learning to infer affects from images in social networks.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Adaptive named entity recognition based on conditional random fields with automatic updated dynamic gazetteers.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

A real-time tone enhancement method for continuous Mandarin speeches.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Perceptual clustering based unit selection optimization for concatenative text-to-speech synthesis.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Analysis on mispronunciations in CAPT based on computational speech perception.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Hierarchical English Emphatic Speech Synthesis Based on HMM with Limited Training Data.
Proceedings of the INTERSPEECH 2012, 2012

Intention understanding based on multi-source information integration for Chinese Mandarin spoken commands.
Proceedings of the 9th International Conference on Fuzzy Systems and Knowledge Discovery, 2012

Image Colorization with an Affective Word.
Proceedings of the Computational Visual Media - First International Conference, 2012

Modeling the correlation between modality semantics and facial expressions.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Emotional Audio-Visual Speech Synthesis Based on PAD.
IEEE Trans. Audio, Speech & Language Processing, 2011

2010
Emotional talking agent: System and evaluation.
Proceedings of the Sixth International Conference on Natural Computation, 2010

Facial expression synthesis based on motion patterns learned from face database.
Proceedings of the International Conference on Image Processing, 2010

2008
Analysis and Modeling of Affective Audio Visual Speech Based on PAD Emotion Space.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

2007
Fingerprint matching based on weighting method and the SVM.
Neurocomputing, 2007

Fake Finger Detection Based on Time-Series Fingerprint Image Analysis.
Proceedings of the Advanced Intelligent Computing Theories and Applications. With Aspects of Theoretical and Methodological Issues, 2007

A New Approach to Fake Finger Detection Based on Skin Elasticity Analysis.
Proceedings of the Advances in Biometrics, International Conference, 2007

2005
A TSVM-Based Minutiae Matching Approach for Fingerprint Verification.
Proceedings of the Advances in Biometric Person Authentication, 2005


  Loading...