Manjie Xu

Orcid: 0000-0001-9220-3662

According to our database1, Manjie Xu authored at least 18 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Hearing from Silence: Reasoning Audio Descriptions from Silent Videos via Vision-Language Model.
CoRR, May, 2025

Learning to Plan with Personalized Preferences.
CoRR, February, 2025

STA-V2A: Video-to-Audio Generation with Semantic and Temporal Alignment.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Towards Diverse and Efficient Audio Captioning via Diffusion Models.
CoRR, 2024

Video-to-Audio Generation with Hidden Alignment.
CoRR, 2024

Context-Aware Head-and-Eye Motion Generation with Diffusion Model.
Proceedings of the IEEE Conference Virtual Reality and 3D User Interfaces, 2024

Prompt-guided Precise Audio Editing with Diffusion Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

SRC-gAudio: Sampling-Rate-Controlled Audio Generation.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2024

2023
MEWL: Few-shot multimodal word learning with referential uncertainty.
Dataset, June, 2023

Active Reasoning in an Open-World Environment.
CoRR, 2023

Interactive Visual Reasoning under Uncertainty.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Active Reasoning in an Open-World Environment.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Evaluating and Inducing Personality in Pre-trained Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On the Complexity of Bayesian Generalization.
Proceedings of the International Conference on Machine Learning, 2023

MEWL: Few-shot multimodal word learning with referential uncertainty.
Proceedings of the International Conference on Machine Learning, 2023

2022
To think inside the box, or to think out of the box? Scientific discovery via the reciprocation of insights and concepts.
CoRR, 2022

EST: Evaluating Scientific Thinking in Artificial Agents.
CoRR, 2022

MPI: Evaluating and Inducing Personality in Pre-trained Language Models.
CoRR, 2022


  Loading...