Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games • Paper • 2506.05309 • Published Jun 5 • 15
Debatable Intelligence: Benchmarking LLM Judges via Debate Speech Evaluation • Paper • 2506.05062 • Published Jun 5 • 15
CHIMERA: A Knowledge Base of Idea Recombination in Scientific Literature • Paper • 2505.20779 • Published May 27 • 15
WHISTRESS: Enriching Transcriptions with Sentence Stress Detection • Paper • 2505.19103 • Published May 25 • 13
Follow the Flow: On Information Flow Across Textual Tokens in Text-to-Image Models • Paper • 2504.01137 • Published Apr 1 • 21
More Documents, Same Length: Isolating the Challenge of Multiple Documents in RAG • Paper • 2503.04388 • Published Mar 6 • 17
Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights • Paper • 2502.09619 • Published Feb 13 • 35
Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models • Paper • 2501.06751 • Published Jan 12 • 32