-
CLEX: Continuous Length Extrapolation for Large Language Models
Paper • 2310.16450 • Published • 10 -
E^2-LLM: Efficient and Extreme Length Extension of Large Language Models
Paper • 2401.06951 • Published • 26 -
Data Engineering for Scaling Language Models to 128K Context
Paper • 2402.10171 • Published • 25
Juan Herrera
juampahc
AI & ML interests
None yet
Recent Activity
liked
a model
9 days ago
jinaai/jina-embeddings-v4
liked
a model
7 months ago
nomic-ai/nomic-embed-text-v2-moe
liked
a model
8 months ago
tomg-group-umd/huginn-0125