
ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no
ggml_cuda_init: found 2 CUDA devices:
  Device 0: NVIDIA GeForce RTX 3090, compute capability 8.6, VMM: yes
  Device 1: NVIDIA GeForce RTX 3090, compute capability 8.6, VMM: yes

| model | size | params | backend | ngl | test | t/s |
| --- | ---: | ---: | --- | ---: | ---: | ---: |
| seed_oss 36B Q5_K - Medium | 23.83 GiB | 36.15 B | CUDA | 35 | pp8 | 22.04 ± 0.37 |
| seed_oss 36B Q5_K - Medium | 23.83 GiB | 36.15 B | CUDA | 35 | tg128 | 4.20 ± 0.00 |

build: 92bb442ad (7040)