Update README.md
Browse files
README.md
CHANGED
|
@@ -293,5 +293,7 @@ datasets:
|
|
| 293 |
|
| 294 |
# Model Card for Model ID
|
| 295 |
|
| 296 |
-
deberta-v3-base with context length of 1280 fine-tuned on tasksource for 150k steps. I oversampled tasks
|
| 297 |
-
Training data include helpsteer v1/v2, logical reasoning tasks (FOLIO, FOL-nli, LogicNLI...), OASST, hh/rlhf, linguistics oriented NLI tasks, tasksource-dpo, fact verification tasks.
|
|
|
|
|
|
|
|
|
| 293 |
|
| 294 |
# Model Card for Model ID
|
| 295 |
|
| 296 |
+
deberta-v3-base with context length of 1280 fine-tuned on tasksource for 150k steps. I oversampled long NLI tasks (ConTRoL, doc-nli).
|
| 297 |
+
Training data include helpsteer v1/v2, logical reasoning tasks (FOLIO, FOL-nli, LogicNLI...), OASST, hh/rlhf, linguistics oriented NLI tasks, tasksource-dpo, fact verification tasks.
|
| 298 |
+
|
| 299 |
+
This model is suitable for long context NLI or and as a backbone for RLHF fine-tuning.
|