HTW-KI-Werkstatt/gpt_and_prejudice
Safetensors
gpt_and_prejudice
interpretability
explainability
bias
fairness
GPT
sparse_auto_encoders
SAEs
language_model
custom_code
arxiv:2510.01252
main
gpt_and_prejudice/multi_head_attention.py
Commit History
refactoring
b41eca1
mariamkhmahran committed on Sep 22