Isadora White's picture

8 1

Isadora White

izzcw

·

https://icwhite.github.io/website/

AI & ML interests

LLMs, Reinforcement Learning, agents, embodiment, multi-agent collaboration

Recent Activity

upvoted a paper about 1 month ago

Steering Autoregressive Music Generation with Recursive Feature Machines

upvoted a paper 4 months ago

Group Sequence Policy Optimization

published a model 6 months ago

izzcw/dpo_model_3.1_8k

View all activity

Organizations

upvoted a paper about 1 month ago

Steering Autoregressive Music Generation with Recursive Feature Machines

Paper • 2510.19127 • Published Oct 21 • 7

upvoted a paper 4 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 310

upvoted 3 papers 6 months ago

lmgame-Bench: How Good are LLMs at Playing Games?

Paper • 2505.15146 • Published May 21 • 20

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

Paper • 2505.15277 • Published May 21 • 104

Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity

Paper • 2505.11107 • Published May 16 • 29

upvoted 3 papers 7 months ago

DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models

Paper • 2504.02882 • Published Apr 2 • 7

Communicate to Play: Pragmatic Reasoning for Efficient Cross-Cultural Communication in Codenames

Paper • 2408.04900 • Published Aug 9, 2024 • 1

Collaborating Action by Action: A Multi-agent LLM Framework for Embodied Reasoning

Paper • 2504.17950 • Published Apr 24 • 5