Self-Supervised Learning with Lie Symmetries for Partial Differential Equations Paper • 2307.05432 • Published Jul 11, 2023
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only Paper • 2306.01116 • Published Jun 1, 2023
The Pile: An 800GB Dataset of Diverse Text for Language Modeling Paper • 2101.00027 • Published Dec 31, 2020
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models Paper • 2403.13372 • Published Mar 20, 2024
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published Aug 5, 2025
CoulGAT: An Experiment on Interpretability of Graph Attention Networks Paper • 1912.08409 • Published Dec 18, 2019
Power Law Graph Transformer for Machine Translation and Representation Learning Paper • 2107.02039 • Published Jun 27, 2021
PLDR-LLM: Large Language Model from Power Law Decoder Representations Paper • 2410.16703 • Published Oct 22, 2024
PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference Paper • 2502.13502 • Published Feb 19, 2025