YaRN: Efficient Context Window Extension of Large Language Models Paper • 2309.00071 • Published Aug 31, 2023 • 77
GraphGPT: Generative Pre-trained Graph Eulerian Transformer Paper • 2401.00529 • Published Dec 31, 2023 • 1
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated 3 days ago • 350