BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Paper • 2412.04626 • Published • 14
TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar
General-Reasoner: Advancing LLM Reasoning Across All Domains