AI & ML interests

AI inference, AI in the cloud, AI on edge, software acceleration of AI workloads on hardware, efficient AI deployments, GPU-Free AI inference, AI model optimization.

Recent Activity

jangrzybek  updated a collection about 1 month ago
GPT-OSS
jangrzybek  updated a model about 1 month ago
AmpereComputing/gpt-oss-20b-gguf
jangrzybek  published a model about 1 month ago
AmpereComputing/gpt-oss-20b-gguf
View all activity

AmpereComputing 's collections 21

DeepSeek R1
Ampere's quantization formats (Q4_K_4 / Q8R16) require Ampere optimized llama.cpp available here: https://hub.docker.com/r/amperecomputingai/llama.cpp
DeepSeek R1
Ampere's quantization formats (Q4_K_4 / Q8R16) require Ampere optimized llama.cpp available here: https://hub.docker.com/r/amperecomputingai/llama.cpp