Shuang Xu

ORCID: 0000-0002-5882-020X

According to our database, Shuang Xu authored at least 179 papers between 2002 and 2024.

Bibliography

2024
APGVAE: Adaptive disentangled representation learning with the graph-based structure information.
Inf. Sci., February, 2024

A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction.
Mach. Intell. Res., February, 2024

Integrating reaction pathways and downstream separation network for optimal sustainable process route selection.
Comput. Chem. Eng., February, 2024

Fast Thick Cloud Removal for Multi-Temporal Remote Sensing Imagery via Representation Coefficient Total Variation.
Remote. Sens., January, 2024

Distributed and multi-layer hierarchical controller placement in software-defined satellite-terrestrial network.
Trans. Emerg. Telecommun. Technol., January, 2024

MobileVLM V2: Faster and Stronger Baseline for Vision Language Model.
CoRR, 2024

2023
MBIAN: Multi-level bilateral interactive attention network for multi-modal image processing.
Expert Syst. Appl., November, 2023

Predicting Scientist Collaboration by Multiple Motif Features.
IEEE Trans. Comput. Soc. Syst., August, 2023

An integrated framework for sustainable process design by hybrid and intensified equipment.
Comput. Chem. Eng., August, 2023

Research on high-precision positioning method of robot based on laser tracker.
Intell. Serv. Robotics, July, 2023

Enhanced Semantic Representation Learning for Sarcasm Detection by Integrating Context-Aware Attention and Fusion Network.
Entropy, June, 2023

Hyperspectral Denoising Using Asymmetric Noise Modeling Deep Image Prior.
Remote. Sens., April, 2023

Multi-agent deep reinforcement learning algorithm with trend consistency regularization for portfolio management.
Neural Comput. Appl., March, 2023

Modified Dynamic Routing Convolutional Neural Network for Pan-Sharpening.
Remote. Sens., 2023

Long Short-Term Memory Networks with Multiple Variables for Stock Market Prediction.
Neural Process. Lett., 2023

An exploration of ethnic minorities' needs for multilingual information access of public digital cultural services.
J. Documentation, 2023

VLP: A Survey on Vision-language Pre-training.
Int. J. Autom. Comput., 2023

Local-to-Global Causal Reasoning for Cross-Document Relation Extraction.
IEEE CAA J. Autom. Sinica, 2023

MobileVLM: A Fast, Strong and Open Vision Language Assistant for Mobile Devices.
CoRR, 2023

ReFusion: Learning Image Fusion from Reconstruction with Learnable Loss via Meta-Learning.
CoRR, 2023

RobustCalib: Robust Lidar-Camera Extrinsic Calibration with Consistency Learning.
CoRR, 2023

Neural Gradient Regularizer.
CoRR, 2023

HIDFlowNet: A Flow-Based Deep Network for Hyperspectral Image Denoising.
CoRR, 2023

ViLaS: Integrating Vision and Language into Automatic Speech Recognition.
CoRR, 2023

Equivariant Multi-Modality Image Fusion.
CoRR, 2023

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages.
CoRR, 2023

Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation.
CoRR, 2023

GoogLeNet based on residual network and attention mechanism identification of rice leaf diseases.
Comput. Electron. Agric., 2023

Nonlinear Analytical Model of Linear Switched Reluctance Motor With Segmented Secondary Considering Iron Saturation and End Effect.
IEEE Access, 2023

Make Spoken Document Readable: Leveraging Graph Attention Networks for Chinese Document-Level Spoken-to-Written Simplification.
Proceedings of the Neural Information Processing - 30th International Conference, 2023

DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Spherical Space Feature Decomposition for Guided Depth Map Super-Resolution.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Matching-Based Term Semantics Pre-Training for Spoken Patient Query Understanding.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Novel Sensor Method for Dietary Detection.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2023

Deep Convolutional Sparse Coding Networks for Interpretable Image Fusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Exploring Spatial Correlation for Light Field Saliency Detection: Expansion From a Single View.
IEEE Trans. Image Process., 2022

MD³Net: Integrating Model-Driven and Data-Driven Approaches for Pansharpening.
IEEE Trans. Geosci. Remote. Sens., 2022

Hyperspectral Image Denoising by Asymmetric Noise Modeling.
IEEE Trans. Geosci. Remote. Sens., 2022

Hybrid Loss-Guided Coarse-to-Fine Model for Seismic Data Consecutively Missing Trace Reconstruction.
IEEE Trans. Geosci. Remote. Sens., 2022

EDChannel: channel prediction of backscatter communication network based on encoder-decoder.
Telecommun. Syst., 2022

Efficient and Model-Based Infrared and Visible Image Fusion via Algorithm Unrolling.
IEEE Trans. Circuits Syst. Video Technol., 2022

Hyperspectral image denoising by low-rank models with hyper-Laplacian total variation prior.
Signal Process., 2022

ExHIBit: Breath-based augmentative and alternative communication solution using commercial RFID devices.
Inf. Sci., 2022

A model-driven network for guided image denoising.
Inf. Fusion, 2022

Seismic fault detection using convolutional neural networks with focal loss.
Comput. Geosci., 2022

Two-Level Supervised Contrastive Learning for Response Selection in Multi-Turn Dialogue.
CoRR, 2022

Predicting Ca²⁺ and Mg²⁺ ligand binding sites by deep neural network algorithm.
BMC Bioinform., 2022

Research on the Restoration Perception Evaluation of Historical Blocks Along the Inner Mongolia Section of the Middle East Railway Based on Network Text Big Data.
Proceedings of the Mobile Networks and Management - 12th EAI International Conference, 2022

PreyNet: Preying on Camouflaged Objects.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Joint Modeling of Document and Label with Clause Interaction Hypergraph for ICD Medical Code Assignment.
Proceedings of the International Joint Conference on Neural Networks, 2022

Electrical Engineering Specialty Construction Based on "Four-dimensional Practical Teaching, Multimedia Integration, System Reconfiguration" Multi-Innovation Methods.
Proceedings of the 2022 5th International Conference on Education Technology Management, 2022

Improving Cross-Modal Understanding in Visual Dialog Via Contrastive Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

A Multi Domain Knowledge Enhanced Matching Network for Response Selection in Retrieval-Based Dialogue Systems.
Proceedings of the IEEE International Conference on Acoustics, 2022

Discrete Cosine Transform Network for Guided Depth Map Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
MFIF-GAN: A new generative adversarial network for multi-focus image fusion.
Signal Process. Image Commun., 2021

CondenseNet with exclusive lasso regularization.
Neural Comput. Appl., 2021

ReType: Your Breath Tells Your Mind!
IEEE Internet Things J., 2021

Research on ceramic tile defect detection based on YOLOv3.
Int. J. Wirel. Mob. Comput., 2021

Discrete Cosine Transform Network for Guided Depth Map Super-Resolution.
CoRR, 2021

Counterfactual Supporting Facts Extraction for Explainable Medical Record Based Diagnosis with Graph Network.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Deep Convolutional Sparse Coding Network For Pansharpening With Guidance Of Side Information.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

FGF-GAN: A Lightweight Generative Adversarial Network for Pansharpening via Fast Guided Filter.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Deep Gradient Projection Networks for Pan-sharpening.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Flexibly Distributional Representation for Low-quality 3D Face Recognition.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Listen, Understand and Translate: Triple Supervision Decouples End-to-end Speech-to-text Translation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Consecutive Decoding for Speech-to-text Translation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
LFNet: Light Field Fusion Network for Salient Object Detection.
IEEE Trans. Image Process., 2020

HAM-MFN: Hyperspectral and Multispectral Image Multiscale Fusion Network With RAP Loss.
IEEE Trans. Geosci. Remote. Sens., 2020

Towards Reducing Severe Defocus Spread Effects for Multi-Focus Image Fusion via an Optimization Based Strategy.
IEEE Trans. Computational Imaging, 2020

Inverse Projection Representation and Category Contribution Rate for Robust Tumor Recognition.
IEEE ACM Trans. Comput. Biol. Bioinform., 2020

BLAS: Broadcast Relative Localization and Clock Synchronization for Dynamic Dense Multiagent Systems.
IEEE Trans. Aerosp. Electron. Syst., 2020

Robust CP Tensor Factorization With Skew Noise.
IEEE Signal Process. Lett., 2020

Bayesian fusion for infrared and visible images.
Signal Process., 2020

PercepPan: Towards Unsupervised Pan-Sharpening Based on Perceptual Loss.
Remote. Sens., 2020

Weighted-capsule routing via a fuzzy gaussian model.
Pattern Recognit. Lett., 2020

Adaptive quantile low-rank matrix factorization.
Pattern Recognit., 2020

Bayesian deep matrix factorization network for multiple images denoising.
Neural Networks, 2020

Partial label metric learning by collapsing classes.
Int. J. Mach. Learn. Cybern., 2020

Variational Bayesian weighted complex network reconstruction.
Inf. Sci., 2020

DUT-LFSaliency: Versatile Dataset and Light Field-to-RGB Saliency Detection.
CoRR, 2020

SDST: Successive Decoding for Speech-to-text Translation.
CoRR, 2020

TED: Triple Supervision Decouples End-to-end Speech-to-text Translation.
CoRR, 2020

When Image Decomposition Meets Deep Learning: A Novel Infrared and Visible Image Fusion Method.
CoRR, 2020

A Comparison of Label-Synchronous and Frame-Synchronous End-to-End Models for Speech Recognition.
CoRR, 2020

Deep Convolutional Sparse Coding Networks for Image Fusion.
CoRR, 2020

Efficient and Interpretable Infrared and Visible Image Fusion Via Algorithm Unrolling.
CoRR, 2020

MFFW: A new dataset for multi-focus image fusion.
CoRR, 2020

Advanced Variable Switching Frequency Control for Improving Weighted Efficiency of Distributed Renewable Generation Systems.
IEEE Access, 2020

DIDFuse: Deep Image Decomposition for Infrared and Visible Image Fusion.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Identification of Electrical Equipment Based on Faster LSTM-CNN Network.
Proceedings of the IEEE International Conference on Networking, Sensing and Control, 2020

User Profiling and Behavior Evaluation Based on Improved Logistics Algorithm.
Proceedings of the IEEE International Conference on Networking, Sensing and Control, 2020

Bridging the Gap between Prior and Posterior Knowledge Selection for Knowledge-Grounded Dialogue Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Asymmetric Two-Stream Architecture for Accurate RGB-D Saliency Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

Knowledge Aware Emotion Recognition in Textual Conversations via Multi-Task Incremental Transformer.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
Spectral Learning Algorithm Reveals Propagation Capability of Complex Networks.
IEEE Trans. Cybern., 2019

Hybrid Attention for Chinese Character-Level Neural Machine Translation.
Neurocomputing, 2019

A novel variational Bayesian method for variable selection in logistic regression models.
Comput. Stat. Data Anal., 2019

BLAS: Broadcast Relative Localization and Clock Synchronization for Dynamic Dense Multi-Agent Systems.
CoRR, 2019

Hand gesture recognition based on convolution neural network.
Clust. Comput., 2019

Gesture recognition based on modified adaptive orthogonal matching pursuit algorithm.
Clust. Comput., 2019

Gesture recognition based on binocular vision.
Clust. Comput., 2019

Gesture recognition based on an improved local sparse representation classification algorithm.
Clust. Comput., 2019

Dynamic Gesture Recognition in the Internet of Things.
IEEE Access, 2019

How Users Gaze and Experience on Digital Humanities Platform?: A Model of Usability Evaluation.
Proceedings of the Information in Contemporary Society - 14th International Conference, 2019

Review: Genetic Algorithm Application in Gesture Recognition.
Proceedings of the 2019 3rd International Conference on Digital Signal Processing, 2019

Image Segmentation Technology Based on Genetic Algorithm.
Proceedings of the 2019 3rd International Conference on Digital Signal Processing, 2019

Users' visual attention flow on the search result page of digital cultural heritage collection.
Proceedings of the Information... Anyone, Anywhere, Any Time, Any Way, 2019

Adapting Translation Models for Transcript Disfluency Detection.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Software Defined Space-Terrestrial Integrated Networks: Architecture, Challenges, and Solutions.
IEEE Netw., 2018

10-bit Single-Slope ADC with error quantification and double reset technique for CMOS image sensor.
Microelectron. J., 2018

Security of Intelligent Building Network Based on Wireless Sensor Network.
Int. J. Online Eng., 2018

Tactile sensing and feedback in SEMG hand.
Int. J. Comput. Sci. Math., 2018

Modular and deep QoE/QoS mapping for multimedia services over satellite networks.
Int. J. Commun. Syst., 2018

Quantifying the Effects of Topology and Weight for Link Prediction in Weighted Complex Networks.
Entropy, 2018

Variational Bayesian Complex Network Reconstruction.
CoRR, 2018

Multilingual End-to-End Speech Recognition with A Single Transformer on Low-Resource Languages.
CoRR, 2018

Software-Defined Next-Generation Satellite Networks: Architecture, Challenges, and Solutions.
IEEE Access, 2018

Controller Placement in Software-Defined Satellite Networks.
Proceedings of the 14th International Conference on Mobile Ad-Hoc and Sensor Networks, 2018

Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese.
Proceedings of the Interspeech 2018, 2018

Single-channel Speech Dereverberation via Generative Adversarial Training.
Proceedings of the Interspeech 2018, 2018

Syllable-Based Acoustic Modeling with CTC for Multi-Scenarios Mandarin speech recognition.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Convolutional Neural Network Based Traffic Sign Recognition System.
Proceedings of the 5th International Conference on Systems and Informatics, 2018

Recurrent Neural Network Based Small-footprint Wake-up-word Speech Recognition System with a Score Calibration Method.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Compression of Acoustic Model via Knowledge Distillation and Pruning.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

A Comparison of Modeling Units in Sequence-to-Sequence Speech Recognition with the Transformer on Mandarin Chinese.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

Electromechanical Control Based On Artificial Neural Network.
Proceedings of the 2018 International Conference on Machine Learning and Cybernetics, 2018

Image Segmentation Algorithm Based On Clustering.
Proceedings of the 2018 International Conference on Machine Learning and Cybernetics, 2018

Research On Image Compression Technology Based On Bp Neural Network.
Proceedings of the 2018 International Conference on Machine Learning and Cybernetics, 2018

Pretreatment of sEMG Using Wavelet Threshold Method.
Proceedings of the 2018 International Conference on Machine Learning and Cybernetics, 2018

CBLDNN-Based Speaker-Independent Speech Separation Via Generative Adversarial Training.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Speech-Transformer: A No-Recurrence Sequence-to-Sequence Model for Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Conducting Cost-Effective User Research in China Remotely.
Proceedings of the HCI in Business, Government, and Organizations, 2018

A study on QoE-QoS relationship for multimedia services in satellite networks.
Proceedings of the 22nd IEEE International Conference on Computer Supported Cooperative Work in Design, 2018

Semi-Supervised Disfluency Detection.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Generalized Transport-of-Intensity Equation Based Space-Frequency Image Signal Analysis.
Proceedings of the 11th International Congress on Image and Signal Processing, 2018

Manchu Word Recognition Based on Convolutional Neural Network with Spatial Pyramid Pooling.
Proceedings of the 11th International Congress on Image and Signal Processing, 2018

2017
A Methodology of Evaluating Service Value based on the Service Field Concept and Its Application to Evaluation of Attractiveness in Sightseeing.
Int. J. Knowl. Syst. Sci., 2017

A Routing Scheme for Software-Defined Satellite Network.
Proceedings of the 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC), 2017

Multilingual Recurrent Neural Networks with Residual Learning for Low-Resource Speech Recognition.
Proceedings of the Interspeech 2017, 2017

A class-specific copy network for handling the rare word problem in neural machine translation.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Word-Level Permutation and Improved Lower Frame Rate for RNN-Based Acoustic Modeling.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

A panoramic survey method based on gesture recognition.
Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

Towards Compact and Fast Neural Machine Translation Using a Combined Method.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2016
Investigating gated recurrent neural networks for acoustic modeling.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Multidimensional Residual Learning Based on Recurrent Neural Networks for Acoustic Modeling.
Proceedings of the Interspeech 2016, 2016

First Step Towards End-to-End Parametric TTS Synthesis: Generating Spectral Parameters with Neural Attention.
Proceedings of the Interspeech 2016, 2016

Gating recurrent mixture density networks for acoustic modeling in statistical parametric speech synthesis.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Where Is Siri? The Accessibility Design Challenges for Enterprise Touchscreen Interfaces.
Proceedings of the HCI in Business, Government, and Organizations: Information Systems, 2016

2015
Development and Usability Evaluation of the Mobile Delirium Assessment App Based on Confusion Assessment Method for Intensive Care Unit (CAM-ICU).
Proceedings of the MEDINFO 2015: eHealth-enabled Health, 2015

Capacity analysis method for MLSN based on improved DGA.
Proceedings of the 11th International Conference on Natural Computation, 2015

Improving Accessibility Design on Touchscreens.
Proceedings of the Universal Access in Human-Computer Interaction. Access to Interaction, 2015

Computing the color complexity of images.
Proceedings of the 12th International Conference on Fuzzy Systems and Knowledge Discovery, 2015

2013
Design Touch Feedback for Blind Users.
Proceedings of the HCI International 2013 - Posters' Extended Abstracts, 2013

2012
Palmprint Image Processing and Linear Discriminant Analysis Method.
J. Multim., 2012

Sea Clutter Constant False-Alarm Processing Technology Research Based on Wavelet Transform.
Proceedings of the Information Computing and Applications - Third International Conference, 2012

2011
Improved linear discriminant analysis based on two-dimensional Gabor for palmprint recognition.
Proceedings of the Third International Conference of Soft Computing and Pattern Recognition, 2011

Usability Issues in Introducing Capacitive Interaction into Mobile Navigation.
Proceedings of the Human Interface and the Management of Information. Interacting with Information, 2011

2010
A bi-directional compressed 2DPCA for palmprint recognition based on Gabor wavelets.
Proceedings of the Sixth International Conference on Natural Computation, 2010

2009
Development of a dual-modal information presentation of sequential relationship.
Int. J. Mob. Learn. Organisation, 2009

Understanding users' perception of speech recognition errors in mobile communication.
Int. J. Mob. Learn. Organisation, 2009

Automatic pronunciation error detection based on linguistic knowledge and pronunciation space.
Proceedings of the IEEE International Conference on Acoustics, 2009

Study of Spectrum Analysis Based on EMD Adaptive Filter.
Proceedings of the 2009 International Conference on Computational Intelligence and Security, 2009

2008
Development of a Dual-Modal Presentation of Texts for Small Screens.
Int. J. Hum. Comput. Interact., 2008

A New Algorithm for Speech Enhancement Using Wavelet Packet Transform Based on Auditory Model.
Proceedings of the International Conference on Computer Science and Software Engineering, 2008

2007
Facial Expression Analysis on Semantic Neighborhood Preserving Embedding.
Proceedings of the Advances in Neural Networks, 2007

An Empirical Study on Users' Acceptance of Speech Recognition Errors in Text-Messaging.
Proceedings of the Human-Computer Interaction. HCI Intelligent Multimodal Interaction Environments, 2007

User Expectations from Dictation on Mobile Devices.
Proceedings of the Human-Computer Interaction. Interaction Platforms and Techniques, 2007

Discriminant Clustering Embedding for Face Recognition with Image Sets.
Proceedings of the Computer Vision, 2007

2006
Moderating Effects of Task Type on Wireless Technology Acceptance.
J. Manag. Inf. Syst., 2006

A Study of the Feasibility and Effectiveness of Dual-Modal Information Presentations.
Int. J. Hum. Comput. Interact., 2006

Coding Facial Expression with Oriented Steerable Filters.
Proceedings of the International Conference on Image Processing, 2006

2005
A Dual Modal Presentation of Network Relationships in Texts.
Proceedings of the A Conference on a Human Scale. 11th Americas Conference on Information Systems, 2005

Development of Dual-Modal Presentations of Textual Information.
Proceedings of the A Conference on a Human Scale. 11th Americas Conference on Information Systems, 2005

2004
User-Centered Guidelines for Design of Mobile Applications.
Proceedings of the Fourth International Conference on Electronic Business, 2004

An Empirical Study of Dual-Modal Information Presentation.
Proceedings of the 10th Americas Conference on Information Systems, 2004

2002
Usability for Mobile Commerce Across Multiple Form Factors.
J. Electron. Commer. Res., 2002

A Direct Method for Positioning the Arms of a Human Model.
Proceedings of the Graphics Interface 2002 Conference, 2002

