Edoardo's picture

6 5 13

Edoardo

e-zorzi

·

AI & ML interests

RL, Embodied AI, Vision, Vision-Language model, Robotics

Recent Activity

published a model 1 day ago

e-zorzi/sft-model

updated a dataset 5 days ago

e-zorzi/multimodal

published a dataset 5 days ago

e-zorzi/multimodal

View all activity

Organizations

upvoted a paper 2 months ago

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27 • 84

upvoted a collection 5 months ago

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13 • 167

upvoted a paper 7 months ago

VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model

Paper • 2504.07615 • Published Apr 10 • 33

upvoted a paper 11 months ago

Collaborative Instance Navigation: Leveraging Agent Self-Dialogue to Minimize User Input

Paper • 2412.01250 • Published Dec 2, 2024 • 5

upvoted a collection 11 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Apr 30 • 308