theharshithh
/

lmsys-vicuna-7b-v1.5-medusa

Model card Files Files and versions

theharshithh commited on May 28

Commit

a086178

·

verified ·

1 Parent(s): 50b73b9

Update README.md

Files changed (1) hide show

README.md +7 -0

README.md CHANGED Viewed

@@ -1,3 +1,10 @@
 # Speculative Decoding
 This model implements Medusa, an efficient speculative decoding approach that can achieve up to 3x faster inference for large language models. The implementation consists of the base Vicuna-7B model augmented with specialized prediction heads that enable parallel token generation.

+---
+license: mit
+language:
+- en
+base_model:
+- lmsys/vicuna-7b-v1.5
+---
 # Speculative Decoding
 This model implements Medusa, an efficient speculative decoding approach that can achieve up to 3x faster inference for large language models. The implementation consists of the base Vicuna-7B model augmented with specialized prediction heads that enable parallel token generation.