Shuwu Zhang

Orcid: 0000-0002-6013-6351

According to our database, Shuwu Zhang authored at least 88 papers between 1996 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
GesGPT: Speech Gesture Synthesis With Text Parsing From ChatGPT.
IEEE Robotics Autom. Lett., 2024

DiffMAC: Diffusion Manifold Hallucination Correction for High Generalization Blind Face Restoration.
CoRR, 2024

2023
Robust Texture-Aware Local Adaptive Image Watermarking With Perceptual Guarantee.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Skeleton-aware implicit function for single-view human reconstruction.
CAAI Trans. Intell. Technol., June, 2023

Movie Scene Event Extraction with Graph Attention Network Based on Argument Correlation Information.
Sensors, February, 2023

GesGPT: Speech Gesture Synthesis With Text Parsing from GPT.
CoRR, 2023

Gesture Motion Graphs for Few-Shot Speech-Driven Gesture Reenactment.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

Adversarial Audio Watermarking: Embedding Watermark into Deep Feature.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Learning Video Localization on Segment-Level Video Copy Detection with Transformer.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2023, 2023

2022
ECSS: High-Embedding-Capacity Audio Watermarking with Diversity Reception.
Entropy, December, 2022

From general to specific: Online updating for blind super-resolution.
Pattern Recognit., 2022

Heterogeneous Avatar Synthesis Based on Disentanglement of Topology and Rendering.
Proceedings of the Computer Vision - ACCV 2022, 2022

2021
Detection of Fake Reviews Using Group Model.
Mob. Networks Appl., 2021

Learning to predict more accurate text instances for scene text detection.
Neurocomputing, 2021

Approaching the Limit of Image Rescaling via Flow Guidance.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Single shot multi-oriented text detection based on local and non-local features.
Int. J. Document Anal. Recognit., 2020

IBN-STR: A Robust Text Recognizer for Irregular Text in Natural Scenes.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

2019
Enhancing Image Watermarking With Adaptive Embedding Parameter and PSNR Guarantee.
IEEE Trans. Multim., 2019

Adaptive Attention Annotation Model: Optimizing the Prediction Path through Dependency Fusion.
KSII Trans. Internet Inf. Syst., 2019

Learning to Predict More Accurate Text Instances for Scene Text Detection.
CoRR, 2019

Inductive Zero-Shot Image Annotation via Embedding Graph.
IEEE Access, 2019

Improving Relation Extraction with Relation-Based Gated Convolutional Selector.
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

2018
Aspect-Level Sentiment Classification with Conv-Attention Mechanism.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

Word Semantic Similarity Calculation Based on Word2vec.
Proceedings of the 2018 International Conference on Control, 2018

2017
A cascaded method for text detection in natural scene images.
Neurocomputing, 2017

SIFT Matching with CNN Evidences for Particular Object Retrieval.
Neurocomputing, 2017

Region based image retrieval with query-adaptive feature fusion.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

2016
Adaptive bit allocation product quantization.
Neurocomputing, 2016

Scene text detection with extremal region based cascaded filtering.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Region matching and similarity enhancing for image retrieval.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Adaptive bit allocation hashing for approximate nearest neighbor search.
Neurocomputing, 2015

Efficient Location-Based Event Detection in Social Text Streams.
Proceedings of the Intelligence Science and Big Data Engineering. Big Data and Machine Learning Techniques, 2015

Transmitting informative components of fisher codes for mobile visual search.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A Practical Keyword Recommendation Method Based on Probability in Digital Publication Domain.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2015

2014
Multi-feature hierarchical topic models for human behavior recognition.
Sci. China Inf. Sci., 2014

Individualized matching based on logo density for scalable logo recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Real-Time Event Detection Based on Geo Extraction and Temporal Analysis.
Proceedings of the Advanced Data Mining and Applications - 10th International Conference, 2014

2013
Large Scale Image Retrieval with Practical Spatial Weighting for Bag-of-Visual-Words.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Chinese Short Text Classification Based on Domain Knowledge.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Adaptive bit allocation hashing for approximate nearest neighbor search.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Image classification using weighted sparsity induced neighbors and label embeddings learning.
Proceedings of the IEEE International Conference on Electro-Information Technology, 2013

2012
Spatial connected component pre-locating algorithm for rapid logo detection.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Global and Local Features based topic model for scene recognition.
Proceedings of the IEEE International Conference on Systems, 2011

Video Semantic Mining Based on Dense Sub Graph Finding.
Proceedings of the Seventh International Conference on Signal-Image Technology and Internet-Based Systems, 2011

Automatic behavior model selection by iterative learning and abnormality recognition.
Proceedings of the 2011 IEEE International Conference on Intelligence and Security Informatics, 2011

Evaluation of Global Descriptors for Large Scale Image Retrieval.
Proceedings of the Image Analysis and Processing - ICIAP 2011, 2011

A Chinese Character Localization Method Based on Intergrating Structure and CC-Clustering for Advertising Images.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

A Novel Italic Detection and Rectification Method for Chinese Advertising Images.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

A hierarchical generative model for Generic Audio Document Categorization.
Proceedings of the IEEE International Conference on Acoustics, 2011

Hierarchical Latent Dirichlet Allocation models for realistic action recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Affine Resilient Image Watermarking Based on Trace Transform.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

A Fast Image Inpainting Method Based on Hybrid Similarity-Distance.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Similarity-based image classification via kernelized sparse representation.
Proceedings of the International Conference on Image Processing, 2010

A local appearance contextual descriptor for object matching.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
A novel approach to musical genre classification using probabilistic latent semantic analysis model.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Part-Based Object Detection Using Cascades of Boosted Classifiers.
Proceedings of the Computer Vision, 2009

2008
A Novel Video Classification Method Based on Hybrid Generative/Discriminative Models.
Proceedings of the Structural, 2008

A Graph Based Subspace Semi-supervised Learning Framework for Dimensionality Reduction.
Proceedings of the Computer Vision, 2008

2006
Robust Target Speaker Tracking in Broadcast TV Streams.
Int. J. Comput. Linguistics Chin. Lang. Process., 2006

A quality measure method using Gaussian mixture models and divergence measure for speaker identification.
Proceedings of the INTERSPEECH 2006, 2006

Fast SVM training based on the choice of effective samples for audio classification.
Proceedings of the INTERSPEECH 2006, 2006

A Two-level Method for Unsupervised Speaker-based Audio Segmentation.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

A Mongolian Speech Recognition System Based on HMM.
Proceedings of the Computational Intelligence, 2006

A Comparative Study of Feature and Score Normalization for Speaker Verification.
Proceedings of the Advances in Biometrics, International Conference, 2006

A Question Answering System on Special Domain and the Implementation of Speech Interface.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2006

2005
A Histogram Algorithm for Fast Audio Retrieval.
Proceedings of the ISMIR 2005, 2005

A Hierarchical Approach for Audio Stream Segmentation and Classification.
Proceedings of the ISMIR 2005, 2005

Optimal model order selection based on regression tree in speaker identification.
Proceedings of the INTERSPEECH 2005, 2005

2004
Cross-Language Acoustic Modeling in Large Vocabulary Continuous Speech Recognition.
J. Chin. Lang. Comput., 2004

Hand-Free Speech Recognition in Adverse Environment with Microphone Arrays.
J. Chin. Lang. Comput., 2004

A Novel Polyspectra-Based End Point Detector In Noisy Environments.
J. Chin. Lang. Comput., 2004

Tone Modeling for Continuous Mandarin Speech Recognition.
Int. J. Speech Technol., 2004

Improvement of Speaker Identification by Combining Prosodic Features with Acoustic Features.
Proceedings of the Advances in Biometric Person Authentication, 2004

Text-independent speaker identification using GMM-UBM and frame level likelihood normalization.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Robust speaker recognition integrating pitch and Wiener filter.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Multi-layer structure MLLR adaptation algorithm with subspace regression classes and tying.
Proceedings of the INTERSPEECH 2004, 2004

Combining agglomerative and tree-based state clustering for high accuracy acoustic modeling.
Proceedings of the INTERSPEECH 2004, 2004

A novel target-driven generalized JMAP adaptation algorithm.
Proceedings of the INTERSPEECH 2004, 2004

Chinese-English bilingual phone modeling for cross-language speech recognition.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Statistical speech-to-speech translation with multilingual speech recognition and bilingual-chunk parsing.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Discriminative optimization of large vocabulary Mandarin conversational speech recognition system.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

A vector statistical piecewise polynomial approximation algorithm for environment compensation in telephone LVCSR.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Comparison and study of some variants of partially tied covariance modeling.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2001
A hybrid approach to enhance task portability of acoustic models in Chinese speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
An embedded knowledge integration for hybrid language modelling.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Improving n-gram modeling using distance-related unit association maximum entropy language modeling.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1997
An integrated language modeling with n-gram model and WA model for speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996
Speaker-independent dictation of Chinese speech with 32k vocabulary.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

