PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models Paper • 2109.05093 • Published Sep 10, 2021 • 1
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models Paper • 2201.05966 • Published Jan 16, 2022 • 1
Unifying Autoregressive and Diffusion-Based Sequence Generation Paper • 2504.06416 • Published Apr 8 • 3
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published 17 days ago • 32
Optimizing What Matters: AUC-Driven Learning for Robust Neural Retrieval Paper • 2510.00137 • Published 26 days ago • 2
DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation Paper • 2509.25716 • Published 26 days ago • 3
GRAFT: GRaPH and Table Reasoning for Textual Alignment -- A Benchmark for Structured Instruction Following and Visual Reasoning Paper • 2508.15690 • Published Aug 21 • 8
Modular Techniques for Synthetic Long-Context Data Generation in Language Model Training and Evaluation Paper • 2509.01185 • Published Sep 1
SyGra: A Unified Graph-Based Framework for Scalable Generation, Quality Tagging, and Management of Synthetic Data Paper • 2508.15432 • Published Aug 21 • 7
SyGra: A Unified Graph-Based Framework for Scalable Generation, Quality Tagging, and Management of Synthetic Data Paper • 2508.15432 • Published Aug 21 • 7
SyGra: A Unified Graph-Based Framework for Scalable Generation, Quality Tagging, and Management of Synthetic Data Paper • 2508.15432 • Published Aug 21 • 7
SyGra: A Unified Graph-Based Framework for Scalable Generation, Quality Tagging, and Management of Synthetic Data Paper • 2508.15432 • Published Aug 21 • 7
SyGra: A Unified Graph-Based Framework for Scalable Generation, Quality Tagging, and Management of Synthetic Data Paper • 2508.15432 • Published Aug 21 • 7
GRAFT: GRaPH and Table Reasoning for Textual Alignment -- A Benchmark for Structured Instruction Following and Visual Reasoning Paper • 2508.15690 • Published Aug 21 • 8
Modular Techniques for Synthetic Long-Context Data Generation in Language Model Training and Evaluation Paper • 2509.01185 • Published Sep 1
Fine-Tune an SLM or Prompt an LLM? The Case of Generating Low-Code Workflows Paper • 2505.24189 • Published May 30 • 5
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge Paper • 2411.19799 • Published Nov 29, 2024 • 14