The Role of Computing Resources in Publishing Foundation Model Research Paper • 2510.13621 • Published Oct 15 • 16
Building a Foundational Guardrail for General Agentic Systems via Synthetic Data Paper • 2510.09781 • Published Oct 10 • 26
Breaking Focus: Contextual Distraction Curse in Large Language Models Paper • 2502.01609 • Published Feb 3 • 1
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective Paper • 2502.14296 • Published Feb 20 • 45
Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge Paper • 2410.02736 • Published Oct 3, 2024 • 1
AutoBench-V: Can Large Vision-Language Models Benchmark Themselves? Paper • 2410.21259 • Published Oct 28, 2024 • 1
Breaking Focus: Contextual Distraction Curse in Large Language Models Paper • 2502.01609 • Published Feb 3 • 1
Preference Leakage: A Contamination Problem in LLM-as-a-judge Paper • 2502.01534 • Published Feb 3 • 40