Zeyu Mi's picture

3 4 9

Zeyu Mi

yzmizeyu

·

https://ipads.se.sjtu.edu.cn/pub/members/zeyu_mi

AI & ML interests

None yet

Organizations

authored 2 papers 5 months ago

PipeLLM: Fast and Confidential Large Language Model Services with Speculative Pipelined Encryption

Paper • 2411.03357 • Published Nov 4, 2024

SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

Paper • 2507.20984 • Published Jul 28 • 57

authored 3 papers over 1 year ago

ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs

Paper • 2402.03804 • Published Feb 6, 2024 • 4

Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters

Paper • 2406.05955 • Published Jun 10, 2024 • 27

PowerInfer-2: Fast Large Language Model Inference on a Smartphone

Paper • 2406.06282 • Published Jun 10, 2024 • 38

authored a paper about 2 years ago

PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU

Paper • 2312.12456 • Published Dec 16, 2023 • 44