Safetensors
English
llama

Model Card for Model ID

This model is trained by lora for Retrospex based on AgentInstruct and ShareGPT datasets. The base model is Llama-3-8B-Instruct.

Model Details

Model Description

  • Developed by: Convai NJU
  • Shared by [optional]: Convai NJU
  • Model type: Llama model
  • Language(s) (NLP): en
  • License: llama3
  • Finetuned from model [optional]: Llama-3-8B-Instruct

Model Sources

Training Details

Training Data

AgentInstruct: https://huggingface.co/datasets/THUDM/AgentInstruct

ShareGPT: https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered

Training Hyperparameters

  • fp16: True
  • lr: 2e-5
  • batch size: 8
  • lora r: 16
  • lora alpha: 64
Downloads last month
-
Safetensors
Model size
8B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for AronXiang/RetrospexLLaMA3

Quantizations
2 models

Datasets used to train AronXiang/RetrospexLLaMA3