YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Combined Task Vector Model

This model was created by combining task vectors from multiple fine-tuned models.

Task Vector Computation

t_1 = TaskVector("meta-llama/Llama-2-7b-chat-hf", "coastalcph/Llama-2-7b-chat-gsm8k_bs8_2e-4")
t_2 = TaskVector("meta-llama/Llama-2-7b-chat-hf", "coastalcph/Llama-2-7b-harmful-af-refuse")
t_2 = TaskVector("meta-llama/Llama-2-7b-chat-hf", "None")
t_combined = 1.0 * t_1 + 6.0 * t_2
new_model = t_combined.apply_to("meta-llama/Llama-2-7b-chat-hf", scaling_coef=1.0)

Models Used

Base Model: https://huggingface.co/meta-llama/Llama-2-7b-chat-hf
Fine-tuned Model 1: https://huggingface.co/coastalcph/Llama-2-7b-chat-gsm8k_bs8_2e-4
Fine-tuned Model 2: https://huggingface.co/coastalcph/Llama-2-7b-harmful-af-refuse
Fine-tuned Model 3: https://huggingface.co/None

Technical Details

Creation Script Git Hash: c0d48d558d9a3976b3769438b8148fd782b5168e
Task Vector Method: Additive combination
Args: { "pretrained_model": "meta-llama/Llama-2-7b-chat-hf", "finetuned_model1": "coastalcph/Llama-2-7b-chat-gsm8k_bs8_2e-4", "finetuned_model2": "coastalcph/Llama-2-7b-harmful-af-refuse", "finetuned_model3": null, "apply_to_diff_model_architecure": null, "output_model_name": "cfierro/Llama-2-7b-chat-1t_gsm8k-6t_af_refuse", "output_dir": "/projects/nlp/data/constanzam/weight-interp/task-vectors/math_non_sycophant_12Aug", "scaling_coef": 1.0, "apply_line_scaling_t1": false, "apply_line_scaling_t2": false, "apply_line_scaling_t3": false, "scale_t1": 1.0, "scale_t2": 6.0, "scale_t3": null }

Downloads last month: 4

Safetensors

Model size

7B params

Tensor type

F32

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support