Jiahui Yu

ORCID: 0000-0003-1314-2481

According to our database, Jiahui Yu authored at least 115 papers between 2016 and 2024.

Bibliography

2024
A localization method of manipulator towards achieving more precision control.
Comput. Intell., February, 2024

Fast Object Detection Leveraging Global Feature Fusion in Boundary-Aware Convolutional Networks.
Inf., 2024

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation.
CoRR, 2024

2023
Seamless Wireless Communication Platform for Internet of Things Applications.
IEEE Wirel. Commun., December, 2023

Pyramid Attention Network for Image Restoration.
Int. J. Comput. Vis., December, 2023

Marrying Global-Local Spatial Context for Image Patches in Computer-Aided Assessment.
IEEE Trans. Syst. Man Cybern. Syst., November, 2023

Combined scaling for zero-shot transfer learning.
Neurocomputing, October, 2023

Deep learning-based prediction framework of temperature control time for wide-thick slab hot rolling production.
Expert Syst. Appl., October, 2023

Can molecular dynamics simulations improve predictions of protein-ligand binding affinity with machine learning?
Briefings Bioinform., March, 2023

Intelligent Decision-Making and Human Language Communication Based on Deep Reinforcement Learning in a Wargame Environment.
IEEE Trans. Hum. Mach. Syst., 2023

A Novel Motor Structure with Extended Particle Swarm Optimization for Space Robot Control.
Sensors, 2023

Gemini: A Family of Highly Capable Multimodal Models.
CoRR, 2023

IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers.
CoRR, 2023

Towards an Automatic AI Agent for Reaction Condition Recommendation in Chemical Synthesis.
CoRR, 2023

De-Diffusion Makes Text a Strong Cross-Modal Interface.
CoRR, 2023

AudioPaLM: A Large Language Model That Can Speak and Listen.
CoRR, 2023

Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR.
CoRR, 2023

CoBIT: A Contrastive Bi-directional Image-Text Generation Model.
CoRR, 2023

Noise2Music: Text-conditioned Music Generation with Diffusion Models.
CoRR, 2023

Local-to-global spatial learning for whole-slide image representation and classification.
Comput. Medical Imaging Graph., 2023

Module-wise Adaptive Distillation for Multimodality Foundation Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Two-BranchTGNet: A Two-Branch Neural Network for Breast Cancer Subtype Classification.
Proceedings of the 2023 4th International Symposium on Artificial Intelligence for Medicine Science, 2023

Examining the Impact of Muscle-Electrode Distance in sEMG Based Hand Motion Recognition.
Proceedings of the Intelligent Robotics and Applications - 16th International Conference, 2023

Adversarial Attacks on Skeleton-Based Sign Language Recognition.
Proceedings of the Intelligent Robotics and Applications - 16th International Conference, 2023

Combating Label Ambiguity with Smooth Learning for Facial Expression Recognition.
Proceedings of the Intelligent Robotics and Applications - 16th International Conference, 2023

Robust Subgraph Augmentation for Graph Convolutional Networks with Few Labeled Nodes.
Proceedings of the International Conference on Advanced Robotics and Mechatronics, 2023

VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Scale Enhancement Pyramid Network for Small Object Detection from UAV Images.
Entropy, November, 2022

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation.
Trans. Mach. Learn. Res., 2022

CoCa: Contrastive Captioners are Image-Text Foundation Models.
Trans. Mach. Learn. Res., 2022

Deep Object Detector With Attentional Spatiotemporal LSTM for Space Human-Robot Interaction.
IEEE Trans. Hum. Mach. Syst., 2022

Deep Temporal Model-Based Identity-Aware Hand Detection for Space Human-Robot Interaction.
IEEE Trans. Cybern., 2022

Efficient Trustworthiness Management for Malicious User Detection in Big Data Collection.
IEEE Trans. Big Data, 2022

Spatial Cognition-Driven Deep Learning for Car Detection in Unmanned Aerial Vehicle Imagery.
IEEE Trans. Cogn. Dev. Syst., 2022

Adaptive Spatiotemporal Representation Learning for Skeleton-Based Human Action Recognition.
IEEE Trans. Cogn. Dev. Syst., 2022

BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition.
IEEE J. Sel. Top. Signal Process., 2022

Organic Compound Synthetic Accessibility Prediction Based on the Graph Attention Mechanism.
J. Chem. Inf. Model., 2022

From theory to experiment: transformer-based generation enables rapid discovery of novel reactions.
J. Cheminformatics, 2022

VGSC-DB: an online database of voltage-gated sodium channels.
J. Cheminformatics, 2022

Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners.
CoRR, 2022

Exploiting Category Names for Few-Shot Classification with Vision-Language Models.
CoRR, 2022

Deep object detection for waterbird monitoring using aerial imagery.
CoRR, 2022

Normalization effects on deep neural networks.
CoRR, 2022

View-Robust Neural Networks for Unseen Human Action Recognition in Videos.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2022

A Prediction Method of Order Completion Time Based on BP-SVR.
Proceedings of the 15th International Symposium on Computational Intelligence and Design, 2022

A Multi-modal Virtual-Real Fusion System for Multi-task Human-Computer Interaction.
Proceedings of the IEEE International Conference on Networking, Sensing and Control, 2022

Deep object detection for waterbird monitoring using aerial imagery.
Proceedings of the 21st IEEE International Conference on Machine Learning and Applications, 2022

Self-supervised learning with random-projection quantizer for speech recognition.
Proceedings of the International Conference on Machine Learning, 2022

Vector-quantized Image Modeling with Improved VQGAN.
Proceedings of the Tenth International Conference on Learning Representations, 2022

SimVLM: Simple Visual Language Model Pretraining with Weak Supervision.
Proceedings of the Tenth International Conference on Learning Representations, 2022

An End-to-End Object Detector with Spatiotemporal Context Learning for Machine-Assisted Rehabilitation.
Proceedings of the Intelligent Robotics and Applications - 15th International Conference, 2022

2021
A Pre-Authentication Approach to Proxy Re-Encryption in Big Data Context.
IEEE Trans. Big Data, 2021

Generative Adversarial Networks for Image and Video Synthesis: Algorithms and Applications.
Proc. IEEE, 2021

Co-training Transformer with Videos and Images Improves Action Recognition.
CoRR, 2021

Asymmetric Convolution View Adaptation Networks for Skeleton-Based Human Action Recognition.
Proceedings of the Advances in Computational Intelligence Systems, 2021

Finding the global optimal solution in Dynamic multiple TSPTW with data-driven ACO.
Proceedings of the 2021 IEEE SmartWorld, 2021

A One-stage Temporal Detector with Attentional LSTM for Video Object Detection.
Proceedings of the 27th International Conference on Mechatronics and Machine Vision in Practice, 2021

An Efficient Skeleton-based Action Recognition Approach with View Transformation.
Proceedings of the 27th International Conference on Mechatronics and Machine Vision in Practice, 2021

Self-Cure Network with Two-Stage Method for Facial Expression Recognition.
Proceedings of the 27th International Conference on Mechatronics and Machine Vision in Practice, 2021

An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

A Multi-sensor Gesture Interaction System for Human-robot Cooperation.
Proceedings of the IEEE International Conference on Networking, Sensing and Control, 2021

Dual-mode ASR: Unify and Improve Streaming ASR with Full-context Modeling.
Proceedings of the 9th International Conference on Learning Representations, 2021

FastEmit: Low-Latency Streaming ASR with Sequence-Level Emission Regularization.
Proceedings of the IEEE International Conference on Acoustics, 2021

Dynamic Sparsity Neural Networks for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Cascaded Encoders for Unifying Streaming and Non-Streaming ASR.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Better and Faster end-to-end Model for Streaming ASR.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Distributed Approach to Energy Efficiency in Seamless IoT Communications.
Proceedings of the IEEE Global Communications Conference, 2021

2020
Towards efficient, on-demand and automated deep learning.
PhD thesis, 2020

Coordinated Optimal Control of Secondary Cooling and Final Electromagnetic Stirring for Continuous Casting Billets.
J. Control. Sci. Eng., 2020

Normalization effects on shallow neural networks and related asymptotic expansions.
CoRR, 2020

Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling.
CoRR, 2020

Cross-Supervised Object Detection.
CoRR, 2020

Pyramid Attention Networks for Image Restoration.
CoRR, 2020

A Discriminative Deep Model With Feature Fusion and Temporal Attention for Human Action Recognition.
IEEE Access, 2020

Measurement of Simulated Lunar Soil Information Using Rutting Images.
IEEE Access, 2020

Neural Sparse Representation for Image Restoration.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context.
Proceedings of the Interspeech 2020, 2020

Conformer: Convolution-augmented Transformer for Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Smart Factory Production and Operation Management Methods based on HCPS.
Proceedings of the IEEE International Conference on Networking, Sensing and Control, 2020

FSNet: Compression of Deep Convolutional Neural Networks by Filter Summary.
Proceedings of the 8th International Conference on Learning Representations, 2020

BigNAS: Scaling up Neural Architecture Search with Big Single-Stage Models.
Proceedings of the Computer Vision - ECCV 2020, 2020

Scale-Wise Convolution for Image Restoration.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Privacy-preserving Data Aggregation Computing in Cyber-Physical Social Systems.
ACM Trans. Cyber Phys. Syst., 2019

Novel Circuit Implementation Method for Pulse Signal Finite Rate of Innovation Sparse Sampling Based on an Improved Exponential Reproducing Kernel.
Circuits Syst. Signal Process., 2019

Adversarial-Based Knowledge Distillation for Multi-Model Ensemble and Noisy Data Refinement.
CoRR, 2019

Network Slimming by Slimmable Networks: Towards One-Shot Architecture Search for Channel Numbers.
CoRR, 2019

Performance Analysis for the Magnetically Coupled Resonant Wireless Energy Transmission System.
Complex., 2019

Fast Proximal Gradient Descent for A Class of Non-convex and Non-smooth Sparse Learning Problems.
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

Research on Multi-modal Interactive Control for Quadrotor UAV.
Proceedings of the 16th IEEE International Conference on Networking, Sensing and Control, 2019

Slimmable Neural Networks.
Proceedings of the 7th International Conference on Learning Representations, 2019

KPCA-Based Visual Fault Diagnosis for Nonlinear Industrial Process.
Proceedings of the Intelligent Robotics and Applications - 12th International Conference, 2019

Free-Form Image Inpainting With Gated Convolution.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Universally Slimmable Networks and Improved Training Techniques.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

User Equipment Localization and Victim Estimation with Next-Generation PSC in Emergency Response.
Proceedings of the 2019 IEEE Global Communications Conference, 2019

Foreground-Aware Image Inpainting.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

An Empirical Investigation of Efficient Spatio-Temporal Modeling in Video Restoration.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Wide Activation for Efficient Image and Video Super-Resolution.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

Improving Object Detection from Scratch via Gated Feature Reuse.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
A Simple Non-i.i.d. Sampling Approach for Efficient Training and Better Generalization.
CoRR, 2018

Wide Activation for Efficient and Accurate Image Super-Resolution.
CoRR, 2018

Smooth Path Planning for Robot Docking in Unknown Environment with Obstacles.
Complex., 2018

A Novel Convolutional Neural Network for Facial Expression Recognition.
Proceedings of the Cognitive Systems and Signal Processing - 4th International Conference, 2018

A Hierarchical Approach to Encrypted Data Packet Classification in Smart Home Gateways.
Proceedings of the 2018 IEEE 16th Intl Conf on Dependable, 2018

Generative Image Inpainting With Contextual Attention.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Wide-activated Deep Residual Networks based Restoration for BPG-compressed Images.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017
Neighborhood Regularized l^1-Graph.
Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence, 2017

Support Regularized Sparse Coding and Its Fast Encoder.
Proceedings of the 5th International Conference on Learning Representations, 2017

Balanced Two-Stage Residual Networks for Image Super-Resolution.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Depth map extracting based on geometric perspective: An applicable 2D to 3D conversion technology.
Proceedings of the 10th International Congress on Image and Signal Processing, 2017

2016
UnitBox: An Advanced Object Detection Network.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

