Koichi Saito

Orcid: 0000-0002-8563-9262

According to our database, Koichi Saito authored at least 31 papers between 1993 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Echoes Over Time: Unlocking Length Generalization in Video-to-Audio Generation Models.
CoRR, February, 2026

2025
Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal.
CoRR, December, 2025

FoleyBench: A Benchmark For Video-to-Audio Models.
CoRR, November, 2025

TalkCuts: A Large-Scale Dataset for Multi-Shot Human Speech Video Generation.
CoRR, October, 2025

SoundReactor: Frame-level Online Video-to-Audio Generation.
CoRR, October, 2025

Music Arena: Live Evaluation for Text-to-Music.
CoRR, July, 2025

Equation of State for Hyperonic Neutron-Star Matter in SU(3) Flavor Symmetry.
Symmetry, 2025

Schrödinger Bridge Consistency Trajectory Models for Speech Enhancement.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2025

VERSA: A Versatile Evaluation Toolkit for Speech, Audio, and Music.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Aligning Text-to-Music Evaluation with Human Preferences.
Proceedings of the 26th International Society for Music Information Retrieval Conference, 2025

SoundCTM: Unifying Score-based and Consistency Models for Full-band Text-to-Sound Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Dyadic Mamba: Long-term Dyadic Human Motion Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

2024
DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation.
CoRR, 2024

SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond.
CoRR, 2024

SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation.
CoRR, 2024

Analysis of 50 Years of Health Food Research Trends Using Natural Language Processing and Generative AI.
Knowledge-Based and Intelligent Information & Engineering Systems: Proceedings of the 28th International Conference KES-2024, 2024

SpecMaskGIT: Masked Generative Modeling of Audio Spectrogram for Efficient Audio Synthesis and Beyond.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024

VRDMG: Vocal Restoration via Diffusion Posterior Sampling with Multiple Guidance.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

2023
GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration.
Proceedings of the International Conference on Machine Learning, 2023

Unsupervised Vocal Dereverberation with Diffusion-Based Generative Models.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

2022
Sampling-Frequency-Independent Convolutional Layer and its Application to Audio Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

GPS Positioning Errors Improvements by The Pseudo Range Correction with An Actually Measured Parameter in Shinjuku Area.
Proceedings of the 13th International Conference on Information and Communication Technology Convergence, 2022

2021
Training Speech Enhancement Systems with Noisy Speech Datasets.
CoRR, 2021

GPS Pseudo Range Correction by the Number of Reflections and Incident Angle Estimations.
Proceedings of the International Conference on Information and Communication Technology Convergence, 2021

Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method.
Proceedings of the 29th European Signal Processing Conference, 2021

2019
Abnormal axon guidance signals and reduced interhemispheric connection via anterior commissure in neonates of marmoset ASD model.
NeuroImage, 2019

2017
Multiple output variable overdrive voltage CMOS current mirror.
Proceedings of the International SoC Design Conference, 2017

2016
Thermal safety through Limp Home Mode for intelligent Rear View Camera Systems.
Proceedings of the 2016 IEEE Symposium in Low-Power and High-Speed Chips, 2016

2006
Adaptive Clock Recovery Method Utilizing Proportional-Integral-Derivative (PID) Control for Circuit Emulation.
IEICE Trans. Commun., 2006

Common technical specification of the G-PON system among major worldwide access carriers.
IEEE Commun. Mag., 2006

1993
Distributed Multilink System for Very-High-Speed Data Link Control.
IEEE J. Sel. Areas Commun., 1993
