Update performance dashboard (#2)
- Update performance dashboard (bd55be6d4447db6dcf0db8f2eeed73af443c1749)
Co-authored-by: Xiaoming Hu <[email protected]>
README.md CHANGED
@@ -22,7 +22,25 @@ Please follow the [instructions](https://github.com/google-ai-edge/mediapipe-sam
 
 Note that all benchmark stats are from a Samsung S24 Ultra.
 
-
-
-
-
+<table border="1">
+<tr>
+<th>Model</th>
+<td colspan="2">DeepSeek-R1-Distill-Qwen-1.5B (Int8 quantized)</td>
+</tr>
+<tr>
+<th>Params</th>
+<td colspan="2">1.78 B</td>
+</tr>
+<tr>
+<th></th>
+<td><b>Prefill 512 tokens</b></td><td><b>Decode 128 tokens</b></td>
+</tr>
+<tr>
+<th>LiteRT tk/s (XNNPACK, 4 threads)</th>
+<td>260.95</td><td>23.126</td>
+</tr>
+<tr>
+<th>GGML tk/s (CPU, 4 threads)</th>
+<td>64.66</td><td>23.85</td>
+</tr>
+</table>
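The table reports throughput in tokens per second. A rough way to read it is to convert each figure into wall-clock time for the stated workload (512 prefill tokens, 128 decode tokens). The sketch below does that arithmetic with the numbers copied from the table. The helper names (`latency_seconds`, `summarize`) are hypothetical and not part of LiteRT or GGML. It also assumes throughput is constant over the whole run, which real measurements may not satisfy.

```python
# Throughput figures (tokens/s) copied from the benchmark table.
# Workload sizes come from the table's column headers.
PREFILL_TOKENS = 512
DECODE_TOKENS = 128

throughput = {
    "LiteRT (XNNPACK, 4 threads)": {"prefill": 260.95, "decode": 23.126},
    "GGML (CPU, 4 threads)": {"prefill": 64.66, "decode": 23.85},
}


def latency_seconds(tokens: int, tokens_per_second: float) -> float:
    """Time to process `tokens` at a constant rate (an assumption)."""
    return tokens / tokens_per_second


def summarize(stats: dict) -> dict:
    """Hypothetical helper: derive prefill, decode, and total time."""
    prefill_s = latency_seconds(PREFILL_TOKENS, stats["prefill"])
    decode_s = latency_seconds(DECODE_TOKENS, stats["decode"])
    return {
        "prefill_s": prefill_s,
        "decode_s": decode_s,
        "total_s": prefill_s + decode_s,
    }


results = {name: summarize(stats) for name, stats in throughput.items()}

for name, r in results.items():
    print(
        f"{name}: prefill {r['prefill_s']:.2f} s, "
        f"decode {r['decode_s']:.2f} s, total {r['total_s']:.2f} s"
    )

# Ratio of prefill throughput between the two rows of the table.
prefill_speedup = (
    throughput["LiteRT (XNNPACK, 4 threads)"]["prefill"]
    / throughput["GGML (CPU, 4 threads)"]["prefill"]
)
print(f"Prefill throughput ratio (LiteRT / GGML): {prefill_speedup:.2f}x")
```

Read this way, the two rows differ mainly in prefill: decode throughput is nearly the same for both, so the decode phase takes about the same time in either case.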