arxiv:2412.05467
Massimo Caccia
optimass
AI & ML interests
None yet
Recent Activity
liked
a model
about 1 month ago
ServiceNow-AI/Apriel-1.5-15b-Thinker
upvoted
a
paper
about 2 months ago
A Survey of Reinforcement Learning for Large Reasoning Models
upvoted
a
paper
about 2 months ago
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey