SINQ Collection This collection contains the models quantized with the SINQ quantization method. • 15 items • Updated 4 days ago • 10
AC-LoRA: (Almost) Training-Free Access Control-Aware Multi-Modal LLMs Paper • 2505.11557 • Published May 15 • 8
AC-LoRA: (Almost) Training-Free Access Control-Aware Multi-Modal LLMs Paper • 2505.11557 • Published May 15 • 8
AcceLLM: Accelerating LLM Inference using Redundancy for Load Balancing and Data Locality Paper • 2411.05555 • Published Nov 8, 2024 • 6
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights Paper • 2509.22944 • Published Sep 26 • 76
AcceLLM: Accelerating LLM Inference using Redundancy for Load Balancing and Data Locality Paper • 2411.05555 • Published Nov 8, 2024 • 6