Spaces:

fuvty
/

C2C_demo

Sleeping

Apply for community grant: Academic project (gpu and storage)

by fuvty - opened 19 days ago

Owner 19 days ago

Cache-to-Cache (C2C) enables Large Language Models to communicate directly through their KV-Caches, bypassing text generation. By projecting and fusing KV-Caches between models, C2C achieves 8.5–10.5% higher accuracy than individual models and 3.0–5.0% better performance than text-based communication, with 2.0× speedup in latency. Thank you so much for your help and support!
It earns much attention on X: https://x.com/jiqizhixin/status/1985219136000299215

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment