Bowen Shi

Orcid: 0000-0002-9169-7055

According to our database, Bowen Shi authored at least 70 papers between 2015 and 2024.

Bibliography

2024
PTCAS: Prompt tuning with continuous answer search for relation extraction.
Inf. Sci., February, 2024

XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception.
CoRR, 2024

Towards Privacy-Aware Sign Language Translation at Scale.
CoRR, 2024

2023
Domain-relevance of influence: characterizing variations in online influence across multiple domains on social media.
J. Big Data, December, 2023

An automatic model management system and its implementation for AIOps on microservice platforms.
J. Supercomput., July, 2023

Increasing acceptance of medical AI: The role of medical staff participation in AI development.
Int. J. Medical Informatics, July, 2023

Audiobox: Unified Audio Generation with Natural Language Prompts.
CoRR, 2023

Finite elements for symmetric and traceless tensors in three dimensions.
CoRR, 2023

AiluRus: A Scalable ViT Framework for Dense Prediction.
CoRR, 2023

Generative Pre-training for Speech with Flow Matching.
CoRR, 2023

Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning.
CoRR, 2023

Toward American Sign Language Processing in the Real World: Data, Tasks, and Methods.
CoRR, 2023

EXPRESSO: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis.
CoRR, 2023

Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners.
CoRR, 2023

Prompt to GPT-3: Step-by-Step Thinking Instructions for Humor Generation.
CoRR, 2023

Scaling Speech Technology to 1,000+ Languages.
CoRR, 2023

Rethinking Visual Prompt Learning as Masked Visual Token Modeling.
CoRR, 2023

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation.
CoRR, 2023

Visual Story Generation Based on Emotion and Keywords.
CoRR, 2023

Novel End-Winding Hybrid Flux Machine.
IEEE Access, 2023

TTIC's Submission to WMT-SLT 23.
Proceedings of the Eighth Conference on Machine Translation, 2023

AiluRus: A Scalable ViT Framework for Dense Prediction.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

VioLET: Vision-Language Efficient Tuning with Collaborative Multi-modal Gradients.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

SEGA: Structural Entropy Guided Anchor View for Graph Contrastive Learning.
Proceedings of the International Conference on Machine Learning, 2023

ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Comparative Layer-Wise Analysis of Self-Supervised Speech Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

Adapting Shortcut with Normalizing Flow: An Efficient Tuning Framework for Visual Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Regeneration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Pose-Oriented Transformer with Uncertainty-Guided Refinement for 2D-to-3D Human Pose Estimation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Interprovincial Joint Prevention and Control of Open Straw Burning in Northeast China: Implications for Atmospheric Environment Management.
Remote. Sens., 2022

Characterizing usages, updates and risks of third-party libraries in Java projects.
Empir. Softw. Eng., 2022

Behavior Variations and Their Implications for Popularity Promotions: From Elites to Mass on Weibo.
Entropy, 2022

ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement.
CoRR, 2022

A Single Self-Supervised Model for Many Speech Modalities Enables Zero-Shot Modality Transfer.
CoRR, 2022

TTIC's WMT-SLT 22 Sign Language Translation System.
Proceedings of the Seventh Conference on Machine Translation, 2022

u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT.
Proceedings of the Interspeech 2022, 2022

Robust Self-Supervised Audio-Visual Speech Recognition.
Proceedings of the Interspeech 2022, 2022

Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Open-Domain Sign Language Translation Learned from Online Video.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

A Transformer-Based Decoder for Semantic Segmentation with Multi-level Context Mining.
Proceedings of the Computer Vision - ECCV 2022, 2022

Searching for fingerspelled content in American Sign Language.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
An Improved Hybrid Field Model for Calculating On-Load Performance of Interior Permanent-Magnet Motors.
IEEE Trans. Ind. Electron., 2021

Multi-dataset Pretraining: A Unified Model for Semantic Segmentation.
CoRR, 2021

Whole-Word Segmental Speech Recognition with Acoustic Word Embeddings.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Fingerspelling Detection in American Sign Language.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Hierarchical Graph Networks for 3D Human Pose Estimation.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Latency-Aware Differentiable Neural Architecture Search.
CoRR, 2020

Interactive, effort-aware library version harmonization.
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020

A Cross-Task Analysis of Text Span Representations.
Proceedings of the 5th Workshop on Representation Learning for NLP, 2020

VIMES: A Wearable Memory Assistance System for Automatic Information Retrieval.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

A Joint Framework for Audio Tagging and Weakly Supervised Acoustic Event Detection Using DenseNet with Global Average Pooling.
Proceedings of the Interspeech 2020, 2020

An Empirical Study of Usages, Updates and Risks of Third-Party Libraries in Java Projects.
Proceedings of the IEEE International Conference on Software Maintenance and Evolution, 2020

Tiny-Hourglassnet: An Efficient Design For 3d Human Pose Estimation.
Proceedings of the IEEE International Conference on Image Processing, 2020

Super-Resolution Reconstruction of Electric Power Inspection Images Based on Very Deep Network Super Resolution.
Proceedings of the Artificial Intelligence and Security - 6th International Conference, 2020

Few-Shot Acoustic Event Detection Via Meta Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
ITS-Frame: A Framework for Multi-Aspect Analysis in the Field of Intelligent Transportation Systems.
IEEE Trans. Intell. Transp. Syst., 2019

Compression of Acoustic Event Detection Models with Low-rank Matrix Factorization and Quantization Training.
CoRR, 2019

Research on Recognition Method of Electrical Components Based on YOLO V3.
IEEE Access, 2019

Compression of Acoustic Event Detection Models with Quantized Distillation.
Proceedings of the Interspeech 2019, 2019

On the Contributions of Visual and Textual Supervision in Low-Resource Semantic Speech Retrieval.
Proceedings of the Interspeech 2019, 2019

Fingerspelling Recognition in the Wild With Iterative Visual Attention.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Semi-supervised Acoustic Event Detection Based on Tri-training.
Proceedings of the IEEE International Conference on Acoustics, 2019

Deep Neural Network-Based Algorithm Approximation via Multivariate Polynomial Regression.
Proceedings of the 2019 IEEE Global Communications Conference, 2019

2018
American Sign Language Fingerspelling Recognition in the Wild.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

2017
Towards robust models of food flows and their role in invasive species spread.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

Multitask training with unlabeled data for end-to-end sign language fingerspelling recognition.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2015
Offloading Guidelines for Augmented Reality Applications on Wearable Devices.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Accuracy improvement of carrier signal injection sensorless control for IPMSM in consideration of inverter nonlinearity.
Proceedings of the IECON 2015, 2015

