-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 311 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
Collections
Discover the best community collections!
Collections including paper arxiv:2512.22615
-
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone
Paper • 2512.22615 • Published • 39 -
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
Paper • 2512.20557 • Published • 48 -
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
Paper • 2512.16093 • Published • 90
-
ReXGroundingCT: A 3D Chest CT Dataset for Segmentation of Findings from Free-Text Reports
Paper • 2507.22030 • Published -
Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decode
Paper • 2508.04107 • Published • 4 -
Phrase-grounded Fact-checking for Automatically Generated Chest X-ray Reports
Paper • 2509.21356 • Published -
Learning Segmentation from Radiology Reports
Paper • 2507.05582 • Published • 1
-
Gemini Robotics: Bringing AI into the Physical World
Paper • 2503.20020 • Published • 29 -
Magma: A Foundation Model for Multimodal AI Agents
Paper • 2502.13130 • Published • 58 -
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper • 2311.05437 • Published • 51 -
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
Paper • 2410.23218 • Published • 49
-
openai/gpt-oss-120b
Text Generation • 120B • Updated • 3.57M • • 4.31k -
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning
Paper • 2512.20605 • Published • 59 -
Nested Browser-Use Learning for Agentic Information Seeking
Paper • 2512.23647 • Published • 17 -
TimeBill: Time-Budgeted Inference for Large Language Models
Paper • 2512.21859 • Published • 19
-
Tracking Any Object Amodally
Paper • 2312.12433 • Published • 12 -
Flow-GRPO: Training Flow Matching Models via Online RL
Paper • 2505.05470 • Published • 86 -
CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models
Paper • 2503.18886 • Published • 24 -
Diffusion Models without Classifier-free Guidance
Paper • 2502.12154 • Published • 8
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 311 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone
Paper • 2512.22615 • Published • 39 -
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
Paper • 2512.20557 • Published • 48 -
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
Paper • 2512.16093 • Published • 90
-
ReXGroundingCT: A 3D Chest CT Dataset for Segmentation of Findings from Free-Text Reports
Paper • 2507.22030 • Published -
Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decode
Paper • 2508.04107 • Published • 4 -
Phrase-grounded Fact-checking for Automatically Generated Chest X-ray Reports
Paper • 2509.21356 • Published -
Learning Segmentation from Radiology Reports
Paper • 2507.05582 • Published • 1
-
openai/gpt-oss-120b
Text Generation • 120B • Updated • 3.57M • • 4.31k -
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning
Paper • 2512.20605 • Published • 59 -
Nested Browser-Use Learning for Agentic Information Seeking
Paper • 2512.23647 • Published • 17 -
TimeBill: Time-Budgeted Inference for Large Language Models
Paper • 2512.21859 • Published • 19
-
Gemini Robotics: Bringing AI into the Physical World
Paper • 2503.20020 • Published • 29 -
Magma: A Foundation Model for Multimodal AI Agents
Paper • 2502.13130 • Published • 58 -
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper • 2311.05437 • Published • 51 -
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
Paper • 2410.23218 • Published • 49
-
Tracking Any Object Amodally
Paper • 2312.12433 • Published • 12 -
Flow-GRPO: Training Flow Matching Models via Online RL
Paper • 2505.05470 • Published • 86 -
CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models
Paper • 2503.18886 • Published • 24 -
Diffusion Models without Classifier-free Guidance
Paper • 2502.12154 • Published • 8