Collection of State-of-the-art FP8 Block Quantized Models
NM Testing
company
AI & ML interests
None defined yet.
Recent Activity
View all activity
models
484
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Static-Asym-e2e
1B
•
Updated
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Asym-e2e
1B
•
Updated
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-e2e
0.4B
•
Updated
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16_channel-e2e
0.4B
•
Updated
nm-testing/TinyLlama-1.1B-Chat-v1.0-w4a16-sym-awq-e2e
0.3B
•
Updated
nm-testing/TinyLlama-1.1B-Chat-v1.0-w4a16-asym-awq-e2e
0.3B
•
Updated
nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-e2e
0.3B
•
Updated
nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16_channel-e2e
0.3B
•
Updated
nm-testing/TinyLlama-1.1B-Chat-v1.0-actorder-weight-e2e
0.3B
•
Updated
nm-testing/TinyLlama-1.1B-Chat-v1.0-actorder-group-e2e
0.3B
•
Updated