Igor Gromov 's picture

Igor Gromov

Transformator

·

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

baidu/ERNIE-4.5-VL-28B-A3B-Thinking

liked a model 3 days ago

google/videoprism-lvt-large-f8r288

upvoted a collection 3 days ago

View all activity

Organizations

None yet

upvoted a collection 3 days ago

Olmo 3

Artifacts for the Olmo 3 release. • 9 items • Updated 3 days ago • 102

upvoted a paper 3 days ago

ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents

Paper • 2511.07685 • Published 13 days ago • 7

upvoted a collection 6 days ago

The Bestiary

Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 6 items • Updated 7 days ago • 58

upvoted a paper 10 days ago

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published 11 days ago • 98

upvoted a collection 12 days ago

MDGA

Make Diffusion Great Again. The resource list for Super Data Learners, Quokka, and OpenMoE 2. • 16 items • Updated 19 days ago • 7

upvoted a paper 12 days ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published 18 days ago • 116

upvoted a paper 15 days ago

Contamination Detection for VLMs using Multi-Modal Semantic Perturbation

Paper • 2511.03774 • Published 18 days ago • 12

upvoted a paper 23 days ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published 25 days ago • 45

upvoted a paper about 1 month ago

Building a Foundational Guardrail for General Agentic Systems via Synthetic Data

Paper • 2510.09781 • Published Oct 10 • 26

upvoted a collection 7 months ago

Health AI Developer Foundations (HAI-DEF)

Groups models released for use in health AI by Google. Read more about HAI-DEF at https://developers.google.com/health-ai-developer-foundations • 15 items • Updated Jul 10 • 113

upvoted 2 papers 8 months ago

Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published Mar 20 • 95

Aligning Multimodal LLM with Human Preference: A Survey

Paper • 2503.14504 • Published Mar 18 • 26

upvoted an article 8 months ago

Article

Vision Language Models Explained

Apr 11, 2024

•

491

upvoted 3 papers 9 months ago

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published Mar 10 • 88

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published Mar 10 • 101

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

Paper • 2502.14296 • Published Feb 20 • 45

upvoted an article 9 months ago

Article

Open Source Developers Guide to the EU AI Act

Dec 2, 2024

•

47

upvoted 3 papers 9 months ago

IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval

Paper • 2503.04644 • Published Mar 6 • 21

Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs

Paper • 2503.02846 • Published Mar 4 • 18

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

Paper • 2503.01935 • Published Mar 3 • 29