Shihao Chen
According to our database1,
Shihao Chen authored at least 31 papers
between 2009 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
HAFM: Hierarchical Autoregressive Foundation Model for Music Accompaniment Generation.
CoRR, April, 2026
3D-DCASphereNet: 3D dynamic convolutional attention network with spherical representation for high heterogeneity in lung nodule detection.
Expert Syst. Appl., 2026
2025
CoRR, October, 2025
VectorLLM: Human-like Extraction of Structured Building Contours vis Multimodal LLMs.
CoRR, July, 2025
CoRR, June, 2025
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025
A Lightweight and Real-Time Binaural Speech Enhancement Model with Spatial Cues Preservation.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Sinba: Singing-To-Accompaniment Generation With Pitch Guidance Via Mamba-Based Language Model.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025
CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025
2024
A Lightweight and Real-Time Binaural Speech Enhancement Model with Spatial Cues Preservation.
CoRR, 2024
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition.
CoRR, 2024
Intelligent Energy-Efficient and Fair Resource Scheduling for UAV-Assisted Space-Air-Ground Integrated Networks Under Jamming Attacks.
Proceedings of the 99th IEEE Vehicular Technology Conference, 2024
LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024
LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
A Study of Multichannel Spatiotemporal Features and Knowledge Distillation on Robust Target Speaker Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
STPose: 6D object pose estimation network based on sparse attention and cross-layer connection.
Proceedings of the 35th British Machine Vision Conference, 2024
2023
Proceedings of the 20th International Conference on Spoken Language Translation, 2023
2022
Design for Operation in Two Frequency Bands by Division of the Coupled Region in a Waveguide 2-Plane Coupler.
IEICE Trans. Electron., December, 2022
Nanoporous Graphene Oxide-Based Quartz Crystal Microbalance Gas Sensor with Dual-Signal Responses for Trimethylamine Detection.
Sensors, 2022
2020
Proceedings of the 2020 International Conference on Wireless Communications and Signal Processing (WCSP), 2020
Proceedings of the 31st IEEE Annual International Symposium on Personal, 2020
2018
Proceedings of the 2018 IEEE International Conference on Software Maintenance and Evolution, 2018
2009
Proceedings of the Fifth International Conference on Image and Graphics, 2009
Proceedings of the 2009 International Conference on Environmental Science and Information Application Technology, 2009
Proceedings of the 2nd International Conference on BioMedical Engineering and Informatics, 2009