Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
MElHuseyni 's Collections
Arabic Models (LLM, VLM, Multimodel)
Instance Segmentation
SmolVLM 🚐
Image Segmentation Models 🍪
OCR Models 👀️📃
Object Detection Models 🍉
Visual Embedding Models 🖼️
Vision Language Leader-boards 📈
Speech Models 🎧
LLM Inference 🚀

LLM Inference 🚀

updated Aug 9
Upvote
1

  • DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference

    Paper • 2401.08671 • Published Jan 9, 2024 • 15

  • NanoFlow: Towards Optimal Large Language Model Serving Throughput

    Paper • 2408.12757 • Published Aug 22, 2024 • 19

  • richard-park/llama3-deepspeed-v1.0

    Text Generation • 8B • Updated Jul 4, 2024 • • 1
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs