Siqi Zheng

ORCID: 0009-0002-6787-4223

According to our database, Siqi Zheng authored at least 42 papers between 2018 and 2024.

Bibliography

2024
UbiPhysio: Support Daily Functioning, Fitness, and Rehabilitation with Action Understanding and Feedback in Natural Language.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., March, 2024

An Embarrassingly Simple Approach for LLM with Strong ASR Capacity.
CoRR, 2024

2023
Loss Masking Is Not Needed in Decoder-only Transformer for Discrete-token Based ASR.
CoRR, 2023

LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT.
CoRR, 2023

Improving Speaker Diarization using Semantic Information: Joint Pairwise Constraints Propagation.
CoRR, 2023

FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec.
CoRR, 2023

Self-Distillation Network with Ensemble Prototypes: Learning Robust Speaker Representations without Supervision.
CoRR, 2023

Improving BERT with Hybrid Pooling Network and Drop Mask.
CoRR, 2023

3D-Speaker: A Large-Scale Multi-Device, Multi-Distance, and Multi-Dialect Corpus for Speech Representation Disentanglement.
CoRR, 2023

An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification.
CoRR, 2023

CAM++: A Fast and Efficient Network for Speaker Verification Using Context-Aware Masking.
CoRR, 2023

Pushing the Limits of Self-Supervised Speaker Verification using Regularized Distillation Framework.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

A Two-Layer Human-in-the-Loop Optimization Framework for Customizing Lower-Limb Exoskeleton Assistance.
Proceedings of the American Control Conference, 2023

DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker Diarization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Multi-Source Time Series Remote Sensing Feature Selection and Urban Forest Extraction Based on Improved Artificial Bee Colony.
Remote. Sens., 2022

Contextual Expressive Text-to-Speech.
CoRR, 2022

Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis.
CoRR, 2022

Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios.
CoRR, 2022

Deep Representation Decomposition for Rate-Invariant Speaker Verification.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification.
Proceedings of the Interspeech 2022, 2022

Label-Dividing Gated Graph Neural Network for Hierarchical Text Classification.
Proceedings of the International Joint Conference on Neural Networks, 2022

PoNet: Pooling Network for Efficient Token Mixing in Long Sequences.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Reformulating Speaker Diarization As Community Detection With Emphasis On Topological Structure.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual Information.
CoRR, 2021

BeamTransformer: Microphone Array-based Overlapping Speech Detection.
CoRR, 2021

Measuring daily-life fear perception change: a computational study in the context of COVID-19.
CoRR, 2021

Estimating air quality co-benefits of energy transition using machine learning.
CoRR, 2021

Investigation of Spatial-Acoustic Features for Overlapping Speech Detection in Multiparty Meetings.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

A Real-Time Speaker Diarization System Based on Spatial Spectrum.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

CAM: Context-Aware Masking for Robust Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

2020
Phonetically-Aware Coupled Network For Short Duration Text-Independent Speaker Verification.
Proceedings of the Interspeech 2020, 2020

2019
Time-resolved protein activation by proximal decaging in living systems.
Nat., 2019

Autoencoder-Based Semi-Supervised Curriculum Learning for Out-of-Domain Speaker Verification.
Proceedings of the Interspeech 2019, 2019

Towards a Fault-Tolerant Speaker Verification System: A Regularization Approach to Reduce the Condition Number.
Proceedings of the Interspeech 2019, 2019

Factors Influencing University Students' Intention to Redeem Digital Takeaway Coupons - Analysis Based on A Survey in China.
Proceedings of the ICIT 2019, 2019

2018
A Noise-Robust Self-Adaptive Multitarget Speaker Detection System.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

