taciturn999 committed
Commit 16060a8 · verified · 1 Parent(s): 074ea11

add fine-tune model
README.md CHANGED
@@ -1,3 +1,119 @@
- ---
- license: unknown
- ---
+ ---
+ base_model: mistralai/Mistral-7B-Instruct-v0.1
+ library_name: peft
+ ---
+
+ # Model Card for Orapi Maintenance Chatbot
+
+ <!-- Provide a quick summary of what the model is/does. -->
+
+ The Orapi Maintenance Chatbot is a fine-tuned language model designed to assist users in selecting appropriate maintenance products from the Orapi catalog. It provides recommendations based on user queries about tasks (e.g., cleaning, assembling) and specific conditions (e.g., surface type, or constraints such as grease type). The model is fine-tuned from Mistral-7B-Instruct-v0.1 using the PEFT library with LoRA for efficient adaptation.
+
+ ## Model Details
+
+ ### Model Description
+
+ <!-- Provide a longer summary of what this model is. -->
+
+ This model is a fine-tuned version of `mistralai/Mistral-7B-Instruct-v0.1`, a 7-billion-parameter language model developed by Mistral AI. It has been adapted using the Parameter-Efficient Fine-Tuning (PEFT) library with Low-Rank Adaptation (LoRA) to specialize in recommending maintenance products from the Orapi catalog; a sketch of the adapter setup follows the list below. The fine-tuning dataset consists of approximately 1,000 product entries, including product names, codes, functions, and claims in French. The model answers user queries conversationally, providing detailed product recommendations tailored to specific maintenance tasks and conditions.
+
+ - **Developed by:** Jeremy Indelicato (taciturn999 on Hugging Face)
+ - **Funded by [optional]:** [More Information Needed]
+ - **Shared by [optional]:** [More Information Needed]
+ - **Model type:** Causal Language Model (fine-tuned with LoRA)
+ - **Language(s) (NLP):** French (the fine-tuning dataset consists of French product descriptions and claims)
+ - **License:** [More Information Needed] (the base model is released under Apache 2.0, though access to its Hugging Face repository may be gated; you may need to specify your own license for the fine-tuned model)
+ - **Finetuned from model [optional]:** mistralai/Mistral-7B-Instruct-v0.1
+
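+ The training script itself is not included in this repository, but the adapter hyperparameters can be read from the `adapter_config.json` shipped in this commit. Below is a minimal sketch of the corresponding PEFT setup; the configuration values mirror that file, while the base-model loading and wrapping are assumptions rather than the author's actual script:
+
+ ```python
+ from peft import LoraConfig, get_peft_model
+ from transformers import AutoModelForCausalLM
+
+ # Values below mirror adapter_config.json in this repository.
+ lora_config = LoraConfig(
+     r=16,                                  # LoRA rank
+     lora_alpha=32,                         # scaling factor
+     lora_dropout=0.05,
+     bias="none",
+     target_modules=["q_proj", "v_proj"],   # attention projections only
+     task_type="CAUSAL_LM",
+ )
+
+ # Assumed wiring: wrap the base model so only the LoRA adapters are trained.
+ base = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-Instruct-v0.1")
+ model = get_peft_model(base, lora_config)
+ model.print_trainable_parameters()  # a small fraction of the 7B weights
+ ```
+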
+ ### Model Sources [optional]
+
+ <!-- Provide the basic links for the model. -->
+
+ - **Repository:** [More Information Needed] (e.g., a GitHub repository or Hugging Face model hub link if you upload the model)
+ - **Paper [optional]:** Not applicable
+ - **Demo [optional]:** [More Information Needed] (e.g., a link to a deployed demo if you host the chatbot online)
+
+ ## Uses
+
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+
+ ### Direct Use
+
+ <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+
+ The Orapi Maintenance Chatbot is intended for direct use by maintenance professionals, technicians, or anyone seeking product recommendations for industrial maintenance tasks. Users can interact with the chatbot through a web interface, asking questions such as "Quel produit recommandez-vous pour nettoyer des graisses sur un moteur ?" ("Which product do you recommend for cleaning grease off an engine?"). The model responds with a detailed recommendation, including the product name, code, function, and advantages.
+
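+ The web interface itself is not part of this repository. Purely as an illustration, such an interface could be wired up with Gradio (a hypothetical sketch; `generate_response` is the function defined in the "How to Get Started" section below):
+
+ ```python
+ import gradio as gr
+
+ # Hypothetical wiring: one text box in, one text box out.
+ demo = gr.Interface(fn=generate_response, inputs="text", outputs="text",
+                     title="Orapi Maintenance Chatbot")
+ demo.launch()
+ ```
+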
+ ### Downstream Use [optional]
+
+ <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
+
+ The model can be integrated into larger systems, such as a customer support platform for Orapi, to provide automated product recommendations. It could also be further fine-tuned for additional tasks, such as generating product documentation or answering more complex technical queries.
+
+ ### Out-of-Scope Use
+
+ <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+
+ The model is not designed for general-purpose conversation or for tasks outside the domain of Orapi maintenance products. It should not be used for critical decision-making without human oversight, as it may not account for all safety or regulatory considerations. Misuse could include applying the model to unrelated domains (e.g., medical advice) or attempting to generate harmful content.
+
+ ## Bias, Risks, and Limitations
+
+ <!-- This section is meant to convey both technical and sociotechnical limitations. -->
+
+ - **Technical Limitations**:
+   - The model was fine-tuned on a relatively small dataset (about 1,000 products), which may limit its ability to generalize to edge cases or to products that are under-represented in the dataset.
+   - The model relies on the quality of the input data (e.g., product descriptions and claims). Inaccurate or incomplete data may lead to suboptimal recommendations.
+   - Performance may degrade if the user query is ambiguous or poorly formatted.
+   - The model requires significant computational resources (a CUDA-capable GPU for efficient inference with 4-bit quantization, or substantial RAM for CPU inference).
+
+ - **Sociotechnical Risks**:
+   - The model may reflect biases present in the training data, such as over-recommending products that are over-represented in the dataset.
+   - Users may rely on the model's recommendations without verifying them, potentially leading to incorrect product usage.
+
+ ### Recommendations
+
+ <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+
+ Users (both direct and downstream) should be made aware of the risks, biases, and limitations of the model. Recommendations should be cross-checked against official Orapi documentation or a human expert, especially for critical maintenance tasks. To mitigate biases, the training dataset could be expanded to cover a more diverse set of products and use cases. Users should also be encouraged to provide clear and specific queries to improve the quality of recommendations.
+
+ ## How to Get Started with the Model
+
+ Use the code below to get started with the model.
+
+ ```python
+ from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
+ from peft import PeftModel
+ import torch
+
+ # 4-bit quantization config (requires a CUDA-capable GPU)
+ bnb_config = BitsAndBytesConfig(
+     load_in_4bit=True,
+     bnb_4bit_quant_type="nf4",
+     bnb_4bit_compute_dtype=torch.float16
+ )
+
+ # Your Hugging Face token (required to access the base model)
+ hf_token = "hf_xxxxxxxxxxxxxxxxxxxxxxxxxx"  # Replace with your own token
+
+ # Local paths
+ base_model_name = "mistralai/Mistral-7B-Instruct-v0.1"
+ model_path = "./fine_tuned_model"  # Path to the fine-tuned model folder
+
+ # Load the tokenizer saved alongside the adapters
+ tokenizer = AutoTokenizer.from_pretrained(model_path)
+
+ # Load the base model and apply the LoRA adapters
+ base_model = AutoModelForCausalLM.from_pretrained(
+     base_model_name,
+     quantization_config=bnb_config,
+     device_map="auto",
+     token=hf_token
+ )
+ model = PeftModel.from_pretrained(base_model, model_path)
+
+ # Generate a response for a user question
+ def generate_response(question):
+     input_text = f"[INST] {question} [/INST]"
+     inputs = tokenizer(input_text, return_tensors="pt").to("cuda")
+     # do_sample=True is needed for temperature to take effect
+     outputs = model.generate(**inputs, max_new_tokens=200,
+                              temperature=0.7, do_sample=True)
+     response = tokenizer.decode(outputs[0], skip_special_tokens=True)
+     return response.split("[/INST]")[1].strip()
+ ```
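+
+ For example (assuming the model and adapters loaded above; the question is illustrative):
+
+ ```python
+ print(generate_response("Quel produit recommandez-vous pour nettoyer des graisses sur un moteur ?"))
+ # Expected: a recommendation with the product name, code, function, and advantages.
+ ```
+
+ Note that `generate_response` builds the `[INST] ... [/INST]` prompt by hand; the tokenizer shipped in this repo also carries a chat template, so `tokenizer.apply_chat_template` would produce the same format (see the note after `tokenizer_config.json` below).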
adapter_config.json ADDED
@@ -0,0 +1,32 @@
+ {
+   "alpha_pattern": {},
+   "auto_mapping": null,
+   "base_model_name_or_path": "mistralai/Mistral-7B-Instruct-v0.1",
+   "bias": "none",
+   "eva_config": null,
+   "exclude_modules": null,
+   "fan_in_fan_out": false,
+   "inference_mode": true,
+   "init_lora_weights": true,
+   "layer_replication": null,
+   "layers_pattern": null,
+   "layers_to_transform": null,
+   "loftq_config": {},
+   "lora_alpha": 32,
+   "lora_bias": false,
+   "lora_dropout": 0.05,
+   "megatron_config": null,
+   "megatron_core": "megatron.core",
+   "modules_to_save": null,
+   "peft_type": "LORA",
+   "r": 16,
+   "rank_pattern": {},
+   "revision": null,
+   "target_modules": [
+     "q_proj",
+     "v_proj"
+   ],
+   "task_type": "CAUSAL_LM",
+   "use_dora": false,
+   "use_rslora": false
+ }
adapter_model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c365c5818933cbe4b12150bc6557f3d0c96a351352bd7b6db8706dc92c7799f0
+ size 27280152
special_tokens_map.json ADDED
@@ -0,0 +1,24 @@
+ {
+   "bos_token": {
+     "content": "<s>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "eos_token": {
+     "content": "</s>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "pad_token": "</s>",
+   "unk_token": {
+     "content": "<unk>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   }
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer.model ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:dadfd56d766715c61d2ef780a525ab43b8e6da4de6865bda3d95fdef5e134055
+ size 493443
tokenizer_config.json ADDED
@@ -0,0 +1,45 @@
+ {
+   "add_bos_token": true,
+   "add_eos_token": false,
+   "add_prefix_space": null,
+   "added_tokens_decoder": {
+     "0": {
+       "content": "<unk>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "1": {
+       "content": "<s>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "2": {
+       "content": "</s>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     }
+   },
+   "additional_special_tokens": [],
+   "bos_token": "<s>",
+   "chat_template": "{%- if messages[0]['role'] == 'system' %}\n    {%- set system_message = messages[0]['content'] %}\n    {%- set loop_messages = messages[1:] %}\n{%- else %}\n    {%- set loop_messages = messages %}\n{%- endif %}\n\n{{- bos_token }}\n{%- for message in loop_messages %}\n    {%- if (message['role'] == 'user') != (loop.index0 % 2 == 0) %}\n        {{- raise_exception('After the optional system message, conversation roles must alternate user/assistant/user/assistant/...') }}\n    {%- endif %}\n    {%- if message['role'] == 'user' %}\n        {%- if loop.first and system_message is defined %}\n            {{- ' [INST] ' + system_message + '\\n\\n' + message['content'] + ' [/INST]' }}\n        {%- else %}\n            {{- ' [INST] ' + message['content'] + ' [/INST]' }}\n        {%- endif %}\n    {%- elif message['role'] == 'assistant' %}\n        {{- ' ' + message['content'] + eos_token}}\n    {%- else %}\n        {{- raise_exception('Only user and assistant roles are supported, with the exception of an initial optional system message!') }}\n    {%- endif %}\n{%- endfor %}\n",
+   "clean_up_tokenization_spaces": false,
+   "eos_token": "</s>",
+   "extra_special_tokens": {},
+   "legacy": false,
+   "model_max_length": 1000000000000000019884624838656,
+   "pad_token": "</s>",
+   "sp_model_kwargs": {},
+   "spaces_between_special_tokens": false,
+   "tokenizer_class": "LlamaTokenizer",
+   "unk_token": "<unk>",
+   "use_default_system_prompt": false
+ }
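
The `chat_template` above implements the standard Mistral instruct format (`[INST] ... [/INST]`). A minimal sketch of applying it through `transformers` (assuming the files are in the `./fine_tuned_model` path used in the README example):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("./fine_tuned_model")

messages = [{"role": "user",
             "content": "Quel produit recommandez-vous pour nettoyer des graisses sur un moteur ?"}]
# tokenize=False returns the formatted prompt string instead of token IDs.
prompt = tokenizer.apply_chat_template(messages, tokenize=False)
print(prompt)  # "<s> [INST] Quel produit ... [/INST]"
```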