- Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws
  Paper • 2401.00448 • Published • 30
- Improving Text Embeddings with Large Language Models
  Paper • 2401.00368 • Published • 82
- E^2-LLM: Efficient and Extreme Length Extension of Large Language Models
  Paper • 2401.06951 • Published • 26
- The Unreasonable Ineffectiveness of the Deeper Layers
  Paper • 2403.17887 • Published • 82
allthingsdisaggregated
lastweek
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
Qwen3-Omni Technical Report
upvoted a paper 3 months ago
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference
upvoted a paper 5 months ago
Inference-Time Hyper-Scaling with KV Cache Compression
Organizations
None yet