Amro Abbas

According to our database, Amro Abbas authored at least 12 papers between 2023 and 2026.

Bibliography

2026
The Finetuner's Fallacy: When to Pretrain with Your Finetuning Data.
CoRR, March, 2026

ÜberWeb: Insights from Multilingual Curation for a 20-Trillion-Token Dataset.
CoRR, February, 2026

DatBench: Discriminative, Faithful, and Efficient VLM Evaluations.
CoRR, January, 2026

2025
Luxical: High-Speed Lexical-Dense Text Embeddings.
CoRR, December, 2025

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining.
CoRR, August, 2025

A comparison between humans and AI at recognizing objects in unusual poses.
Trans. Mach. Learn. Res., 2025

2024
Humans Beat Deep Networks at Recognizing Objects in Unusual Poses, Given Enough Time.
CoRR, 2024


Effective pruning of web-scale datasets based on complexity of concept clusters.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Sieve: Multimodal Dataset Pruning Using Image Captioning Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
SemDeDup: Data-efficient learning at web-scale through semantic deduplication.
CoRR, 2023

Progress and Limitations of Deep Networks to Recognize Objects in Unusual Poses.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

