Churro Collection Dataset and model for handwritten and print text recognition in historical documents • 3 items • Updated Sep 27, 2025 • 2
Article ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases Nov 5, 2025 • 58
SauerkrautLM-Multilingual-(Reason)-ColBERT Collection SauerkrautLM ColBERT is a suite of Late-Interaction retrieval models built with PyLate’s ColBERT architecture and tuned for seven European languages. • 7 items • Updated Aug 3, 2025 • 20
Article System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience Jun 2, 2025 • 23
Article Introducing Synthetic Data Workshop: Your Gateway to Easy Synthetic Dataset Creation Jun 20, 2024 • 12
Article Post-OCR-Correction: 1 billion words dataset of automated OCR correction by LLM Apr 26, 2024 • 17