e5 efederici/e5-base-multilingual-4096 Sentence Similarity • Updated Aug 7, 2023 • 99 • 16 prometheus-eval/prometheus-7b-v1.0 Text Generation • Updated Oct 14, 2023 • 6 • 30 SparQ Attention: Bandwidth-Efficient LLM Inference Paper • 2312.04985 • Published Dec 8, 2023 • 40
paper SparQ Attention: Bandwidth-Efficient LLM Inference Paper • 2312.04985 • Published Dec 8, 2023 • 40
e5 efederici/e5-base-multilingual-4096 Sentence Similarity • Updated Aug 7, 2023 • 99 • 16 prometheus-eval/prometheus-7b-v1.0 Text Generation • Updated Oct 14, 2023 • 6 • 30 SparQ Attention: Bandwidth-Efficient LLM Inference Paper • 2312.04985 • Published Dec 8, 2023 • 40
paper SparQ Attention: Bandwidth-Efficient LLM Inference Paper • 2312.04985 • Published Dec 8, 2023 • 40