Saki Mizuno

According to our database, Saki Mizuno authored at least 15 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of five.

Bibliography

2025
Data stream-pairwise bottleneck transformer for engagement estimation from video conversation.
Frontiers Artif. Intell., 2025

ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Multimodal Fine-Grained Apparent Personality Trait Recognition: Joint Modeling of Big Five and Questionnaire Item-level Scores.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Participant-Pair-Wise Bottleneck Transformer for Engagement Estimation from Video Conversation.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Learning from Multiple Annotator Biased Labels in Multimodal Conversation.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Unified Multi-Talker ASR with and without Target-speaker Enrollment.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Talking Face Generation for Impression Conversion Considering Speech Semantics.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

End-to-End Joint Target and Non-Target Speakers ASR.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Transcribing Speech as Spoken and Written Dual Text Using an Autoregressive Model.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Audio-Visual Praise Estimation for Conversational Video based on Synchronization-Guided Multimodal Transformer.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Next-Speaker Prediction Based on Non-Verbal Information in Multi-Party Video Conversation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Modeling Lead-Lag Structure in Facial Expression Synchrony for Social-Psychological Outcome Prediction from Negotiation Interaction.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Multimodal Negotiation Corpus with Various Subjective Assessments for Social-Psychological Outcome Prediction from Non-Verbal Cues.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

End-to-End Joint Modeling of Conversation History-Dependent and Independent ASR Systems with Multi-History Training.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

