PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models Paper • 2109.05093 • Published Sep 10, 2021 • 1
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models Paper • 2201.05966 • Published Jan 16, 2022 • 1
Unifying Autoregressive and Diffusion-Based Sequence Generation Paper • 2504.06416 • Published Apr 8 • 3
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published 20 days ago • 32
Unifying Autoregressive and Diffusion-Based Sequence Generation Paper • 2504.06416 • Published Apr 8 • 3
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published 20 days ago • 32
view article Article SyGra: The One-Stop Framework for Building Data for LLMs and SLMs By ServiceNow-AI and 3 others • Sep 22 • 11
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks Paper • 2412.04626 • Published Dec 5, 2024 • 14