Yongkang Yin
According to our database1,
Yongkang Yin authored at least 6 papers
between 2023 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
WhisperDiari: A Whisper-Based Speaker Diarization Framework in Token Space Leveraging Semantic and Speaker Information for Better Text Adaptability.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
CoRR, September, 2025
FoleyMaster: High-Quality Video-to-Audio Synthesis via MLLM-Augmented Prompt Tuning and Joint Semantic-Temporal Adaptation.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025
Audio-Faces Intra-Frame Alignment with Graph Attention Networks for Active Speaker Detection.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
2024
AFL-Net: Integrating Audio, Facial, and Lip Modalities with a Two-step Cross-attention for Robust Speaker Diarization in the Wild.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
2023
AFL-Net: Integrating Audio, Facial, and Lip Modalities with Cross-Attention for Robust Speaker Diarization in the Wild.
CoRR, 2023