MLLM as a UI Judge: Benchmarking Multimodal LLMs for Predicting Human Perception of User Interfaces Paper • 2510.08783 • Published 14 days ago • 4
Learning to Route LLMs from Bandit Feedback: One Policy, Many Trade-offs Paper • 2510.07429 • Published 16 days ago • 3
Optimizing Data Delivery: Insights from User Preferences on Visuals, Tables, and Text Paper • 2411.07451 • Published Nov 12, 2024
MODS: Moderating a Mixture of Document Speakers to Summarize Debatable Queries in Document Collections Paper • 2502.00322 • Published Feb 1
A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations Paper • 2505.14106 • Published May 20
The Photographer Eye: Teaching Multimodal Large Language Models to See and Critique like Photographers Paper • 2509.18582 • Published Sep 23 • 2
mSCoRe: a $M$ultilingual and Scalable Benchmark for $S$kill-based $Co$mmonsense $Re$asoning Paper • 2508.10137 • Published Aug 13 • 2
Lizard: An Efficient Linearization Framework for Large Language Models Paper • 2507.09025 • Published Jul 11 • 18
A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality Paper • 2507.07202 • Published Jul 9 • 22
FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback Paper • 2307.10867 • Published Jul 20, 2023
Forecasting Time Series with LLMs via Patch-Based Prompting and Decomposition Paper • 2506.12953 • Published Jun 15 • 2
MS4UI: A Dataset for Multi-modal Summarization of User Interface Instructional Videos Paper • 2506.12623 • Published Jun 14 • 2
LaMP-Cap: Personalized Figure Caption Generation With Multimodal Figure Profiles Paper • 2506.06561 • Published Jun 6 • 2
Follow the Flow: Fine-grained Flowchart Attribution with Neurosymbolic Agents Paper • 2506.01344 • Published Jun 2 • 5
A Graph Perspective to Probe Structural Patterns of Knowledge in Large Language Models Paper • 2505.19286 • Published May 25 • 3