Tianyi Bai

Orcid: 0009-0009-5057-7100

According to our database¹, Tianyi Bai authored at least 28 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents.

[BibT_eX]

[DOI]

CoRR, May, 2026

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models.

[BibT_eX]

[DOI]

CoRR, April, 2026

TAG: Thinking with Action Unit Grounding for Facial Expression Recognition.

[BibT_eX]

[DOI]

CoRR, February, 2026

Synthesizing Multimodal Geometry Datasets from Scratch and Enabling Visual Alignment via Plotting Code.

[BibT_eX]

[DOI]

CoRR, February, 2026

Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks.

[BibT_eX]

[DOI]

CoRR, February, 2026

From Completion to Editing: Unlocking Context-Aware Code Infilling via Search-and-Replace Instruction Tuning.

[BibT_eX]

[DOI]

CoRR, January, 2026

2025

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI.

[BibT_eX]

[DOI]

CoRR, December, 2025

Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation.

[BibT_eX]

[DOI]

CoRR, December, 2025

From Pixels to Feelings: Aligning MLLMs with Human Cognitive Perception of Images.

[BibT_eX]

[DOI]

CoRR, November, 2025

LAST: LeArning to Think in Space and Time for Generalist Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, November, 2025

VADE: Variance-Aware Dynamic Sampling via Online Sample-Level Difficulty Estimation for Multimodal RL.

[BibT_eX]

[DOI]

CoRR, November, 2025

UltraLLaDA: Scaling the Context Length to 128K for Diffusion Large Language Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification.

[BibT_eX]

[DOI]

CoRR, June, 2025

TAH-QUANT: Effective Activation Quantization in Pipeline Parallelism over Slow Network.

[BibT_eX]

[DOI]

CoRR, June, 2025

Unsupervised Topic Models are Data Mixers for Pre-training Language Models.

[BibT_eX]

[DOI]

CoRR, February, 2025

Fast, Secure, Adaptable: LionsOS Design, Implementation and Performance.

[BibT_eX]

[DOI]

CoRR, January, 2025

Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Harnessing Diversity for Important Data Selection in Pretraining Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Efficient Pretraining Data Selection for Language Models via Multi-Actor Collaboration.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.

[BibT_eX]

[DOI]

CoRR, 2024

Harnessing Diversity for Important Data Selection in Pretraining Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

KeyVideoLLM: Towards Large-scale Video Keyframe Selection.

[BibT_eX]

[DOI]

CoRR, 2024

A Survey of Multimodal Large Language Model from A Data-centric Perspective.

[BibT_eX]

[DOI]

CoRR, 2024

2023

Transfer Learning for Bayesian Optimization: A Survey.

[BibT_eX]

[DOI]

CoRR, 2023

2022

Transfer Learning based Search Space Design for Hyperparameter Tuning.

[BibT_eX]

[DOI]

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Tianyi Bai

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...