Pedro Cuenca's picture

Pedro Cuenca

pcuenq

AI & ML interests

None yet

Recent Activity

reacted to andito's post with ๐Ÿš€ about 17 hours ago
Finally, our new paper is out! "๐—™๐—ถ๐—ป๐—ฒ๐—ฉ๐—ถ๐˜€๐—ถ๐—ผ๐—ป: ๐—ข๐—ฝ๐—ฒ๐—ป ๐——๐—ฎ๐˜๐—ฎ ๐—œ๐˜€ ๐—”๐—น๐—น ๐—ฌ๐—ผ๐˜‚ ๐—ก๐—ฒ๐—ฒ๐—ฑ"! ๐Ÿฅณ https://huggingface.co/papers/2510.17269 If you've ever trained a VLM, you know this problem: nobody shares their data mixtures. It's a black box, making replicating SOTA work impossible. We wanted to change that. FineVision unifies 200 sources into 24 million samples. With 17.3 million images and 9.5 billion answer tokens, it's the largest open resource of its kind. In the paper, we share how we built it: ๐Ÿ” finding and cleaning data at scale ๐Ÿงน removing excessive duplicates across sources ๐Ÿค— decontaminating against 66 public benchmarks My favorite part is Figure 6 (in the video!). It's our visual diversity analysis. It shows that FineVision isn't just bigger; it's more balanced and conceptually richer than other open datasets. NVIDIA's Eagle 2 paper highlighted just how critical this visual diversity is, and our results confirm it: models trained on FineVision consistently outperform those trained on any other open dataset on 11 benchmarks! ๐ŸŽ‰ To celebrate the paper, Iโ€™m also releasing a concatenated and shuffled version of the full dataset! ๐Ÿ‘‰`HuggingFaceM4/FineVision_full_shuffled` Itโ€™s ready to stream, so you can start training your own models right away: from datasets import load_dataset d = load_dataset("HuggingFaceM4/FineVision_full_shuffled", split="train", streaming=True) print(next(iter(d))) A big shoutout to the first authors: Luis Wiedmann and Orr Zohar. They are rockstars!
View all activity

Organizations

Hugging Face's profile picture Google's profile picture Sentence Transformers's profile picture ๐ŸงจDiffusers's profile picture PyTorch Image Models's profile picture Hugging Face Internal Testing Organization's profile picture Flax Community's profile picture DALLE mini's profile picture ControlNet 1.1 Preview's profile picture I Hackathon Somos NLP: PLN en Espaรฑol's profile picture SomosNLP's profile picture Huggingface.js's profile picture HuggingFaceM4's profile picture Apple's profile picture Open-Source AI Meetup's profile picture (De)fusing's profile picture Huggingface Projects's profile picture CompVis's profile picture Hugging Face OSS Metrics's profile picture CompVis Community's profile picture Diffusers Pipelines Library for Stable Diffusion's profile picture Core ML Projects's profile picture LocalCodeLLMs's profile picture Code Llama's profile picture UniverseTBD's profile picture Hands-On Generative AI with Transformers and Diffusion Models's profile picture Diffusers Demo at ICCV 2023's profile picture Hugging Face Smol Models Research's profile picture Core ML Files's profile picture huggingPartyParis's profile picture adept-hf-collab's profile picture Enterprise Explorers's profile picture Latent Consistency's profile picture TTS Eval (OLD)'s profile picture ggml.ai's profile picture kotol's profile picture LocalLLaMA's profile picture gg-hf's profile picture Mistral AI EAP's profile picture Llzama's profile picture MLX Community's profile picture Hugging Face Assignments's profile picture IBM Granite's profile picture On-device Squad's profile picture TTS AGI's profile picture Social Post Explorers's profile picture Apple CoreNet Models 's profile picture hsramall's profile picture gg-tt's profile picture LLHF's profile picture SLLHF's profile picture Hugging Quants's profile picture lbhf's profile picture Meta Llama's profile picture kmhf's profile picture nltpt's profile picture s0409's profile picture Mt Metrics's profile picture nltpt-q's profile picture dummyosan's profile picture Test Org's profile picture metavision's profile picture mv's profile picture H company's profile picture Bert ... but new's profile picture qrias's profile picture open/ acc's profile picture wut?'s profile picture DDUF's profile picture kernels-community's profile picture Hackathon SomosNLP 2025's profile picture None yet's profile picture Hugging Face Agents Course's profile picture LiteRT Community (FKA TFLite)'s profile picture s0225's profile picture gg-hf-g's profile picture hf-private-mlx's profile picture llrehf's profile picture Transformers Community's profile picture Inference Endpoints Images's profile picture gg-hf-gm's profile picture Hugging Face MCP Course's profile picture yofo's profile picture yorgllre's profile picture MLX Community โ€“ Staging's profile picture TRELLIS Community's profile picture beep boop's profile picture Plop's profile picture hffma's profile picture CruciVerbi's profile picture Temporary Org's profile picture