Update README.md Add vLLM deployment info.
#31
by
QscQ
- opened
README.md
CHANGED
|
@@ -231,7 +231,17 @@ generated_ids = [
|
|
| 231 |
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
|
| 232 |
```
|
| 233 |
|
| 234 |
-
## 5.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 235 |
|
| 236 |
```
|
| 237 |
@misc{minimax2025minimax01scalingfoundationmodels,
|
|
@@ -245,8 +255,8 @@ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
|
|
| 245 |
}
|
| 246 |
```
|
| 247 |
|
| 248 |
-
##
|
| 249 |
For general use and evaluation, we provide a [Chatbot](https://chat.minimax.io/) with online search capabilities and the [online API](https://www.minimax.io/platform) for developers. For general use and evaluation, we provide the [MiniMax MCP Server](https://github.com/MiniMax-AI/MiniMax-MCP) with video generation, image generation, speech synthesis, and voice cloning for developers.
|
| 250 |
|
| 251 |
-
##
|
| 252 |
Contact us at [[email protected]](mailto:[email protected]).
|
|
|
|
| 231 |
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
|
| 232 |
```
|
| 233 |
|
| 234 |
+
## 5. Deployment Guide
|
| 235 |
+
For production deployment, we recommend using [vLLM](https://docs.vllm.ai/en/latest/) to serve MiniMax-Text-01. vLLM provides excellent performance for serving large language models with the following features:
|
| 236 |
+
|
| 237 |
+
🔥 Outstanding service throughput performance
|
| 238 |
+
⚡ Efficient and intelligent memory management
|
| 239 |
+
📦 Powerful batch request processing capability
|
| 240 |
+
⚙️ Deeply optimized underlying performance
|
| 241 |
+
|
| 242 |
+
For detailed deployment instructions, please refer to our [vLLM Deployment Guide](https://github.com/MiniMax-AI/MiniMax-01/blob/main/docs/vllm_deployment_guild.md).
|
| 243 |
+
|
| 244 |
+
## 6. Citation
|
| 245 |
|
| 246 |
```
|
| 247 |
@misc{minimax2025minimax01scalingfoundationmodels,
|
|
|
|
| 255 |
}
|
| 256 |
```
|
| 257 |
|
| 258 |
+
## 7. Chatbot & API
|
| 259 |
For general use and evaluation, we provide a [Chatbot](https://chat.minimax.io/) with online search capabilities and the [online API](https://www.minimax.io/platform) for developers. For general use and evaluation, we provide the [MiniMax MCP Server](https://github.com/MiniMax-AI/MiniMax-MCP) with video generation, image generation, speech synthesis, and voice cloning for developers.
|
| 260 |
|
| 261 |
+
## 8. Contact Us
|
| 262 |
Contact us at [[email protected]](mailto:[email protected]).
|