Takashi Shibuya
Orcid: 0000-0002-4277-0164Affiliations:
- Sony Corporation, Tokyo, Japan
- University of Tsukuba, Japan
- University of Tokyo, Japan (former)
According to our database1,
Takashi Shibuya
authored at least 47 papers
between 2009 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on linkedin.com
-
on orcid.org
-
on github.com
On csauthors.net:
Bibliography
2025
Stereo Sound Event Localization and Detection with Onscreen/offscreen Classification.
CoRR, July, 2025
Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry.
CoRR, June, 2025
Efficiency without Compromise: CLIP-aided Text-to-Image GANs with Increased Diversity.
CoRR, June, 2025
CoRR, April, 2025
CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation.
CoRR, January, 2025
SoundCTM: Unifying Score-based and Consistency Models for Full-band Text-to-Sound Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
MMDisCo: Multi-Modal Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
Dataset, April, 2024
Trans. Mach. Learn. Res., 2024
CoRR, 2024
Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning.
CoRR, 2024
A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation.
CoRR, 2024
SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond.
CoRR, 2024
MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
SpecMaskGIT: Masked Generative Modeling of Audio Spectrogram for Efficient Audio Synthesis and Beyond.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
Dataset, September, 2023
Dataset, July, 2023
Dataset, July, 2023
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
XMD: An End-to-End Framework for Interactive Explanation-Based Debugging of NLP Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2023
2022
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization.
Proceedings of the International Conference on Machine Learning, 2022
Good Examples Make A Faster Learner: Simple Demonstration-based Learning for Low-resource NER.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
2020
Trans. Assoc. Comput. Linguistics, 2020
2013
Audio fingerprinting robust against reverberation and noise based on quantification of sinusoidality.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013
2010
Proceedings of the Tenth International Conference on Epigenetic Robotics (EpiRob 2010), 2010
2009
Causality quantification and its applications: structuring and modeling of multivariate time series.
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28, 2009