AI & ML interests

Software Engineering, AI Evaluation

Software Engineering Arena is an open-source initiative to transparently evaluate and track AI coding agents across real-world software engineering tasks. We provide interactive platforms, tracking systems, and novel metrics to advance the field of AI-assisted software development.

The easier it is to verify a solution, the faster an AI system can learn to master the task. — Jason Wei¹

¹ https://www.jasonwei.net/blog/asymmetry-of-verification-and-verifiers-law

We welcome collaboration from research labs, independent contributors, and the broader SE community!
