AI & ML interests

Large Language Models, Transformers, Natural Language Processing

FromtheskyResearchLabs 's collections 3

PLDR-LLMs with KVG cache (Pytorch/Transformers)
Pretrained PLDR-LLMs from paper titled "PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference"