Text Generation
Transformers
Safetensors
liger_gsa

Liger-GLA-8B

[πŸ“‚ GitHub] [πŸ“œ Liger] [πŸ“‘ arXiv]

We introduce Liger-GSA-8B, a gated linear recurrent model linearized from Transformer-based LLM.

Our Liger framework is compatible with various linear recurrent models with gating structures:

Model Name Base Model Linear Structure HF Link
Liger-GLA-8B Llama-3-8B GLA πŸ€— link
Liger-GSA-8B Llama-3-8B GSA πŸ€— link

Citation

If you find this repo useful, please cite and star our work:

@article{lan2025liger,
  title={Liger: Linearizing Large Language Models to Gated Recurrent Structures},
  author={Lan, Disen and Sun, Weigao and Hu, Jiaxi and Du, Jusen and Cheng, Yu},
  journal={arXiv preprint arXiv:2503.01496},
  year={2025}
}
Downloads last month
1
Safetensors
Model size
8B params
Tensor type
BF16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Collection including linear-moe-hub/Liger-GSA-8B