Triangle104 commited on
Commit
25b20b1
·
verified ·
1 Parent(s): 569a31e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +62 -0
README.md CHANGED
@@ -16,7 +16,69 @@ This model was converted to GGUF format from [`ibm-granite/granite-3.2-2b-instru
16
  Refer to the [original model card](https://huggingface.co/ibm-granite/granite-3.2-2b-instruct) for more details on the model.
17
 
18
  ---
 
 
 
 
 
 
 
19
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
20
  model = AutoModelForCausalLM.from_pretrained(
21
  model_path,
22
  device_map=device,
 
16
  Refer to the [original model card](https://huggingface.co/ibm-granite/granite-3.2-2b-instruct) for more details on the model.
17
 
18
  ---
19
+ Model Summary:
20
+ -
21
+ Granite-3.2-2B-Instruct is an 2-billion-parameter, long-context AI model fine-tuned for thinking capabilities. Built on top of Granite-3.1-2B-Instruct,
22
+ it has been trained using a mix of permissively licensed open-source
23
+ datasets and internally generated synthetic data designed for reasoning
24
+ tasks. The model allows controllability of its thinking capability,
25
+ ensuring it is applied only when required.
26
 
27
+
28
+ Developers: Granite Team, IBM
29
+ Website: Granite Docs
30
+ Release Date: February 26th, 2025
31
+ License: Apache 2.0
32
+
33
+
34
+ Supported Languages:
35
+ -
36
+ English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech,
37
+ Italian, Korean, Dutch, and Chinese. However, users may finetune this
38
+ Granite model for languages beyond these 12 languages.
39
+
40
+
41
+ Intended Use:
42
+ -
43
+ This model is designed to handle general instruction-following tasks and
44
+ can be integrated into AI assistants across various domains, including
45
+ business applications.
46
+
47
+
48
+ Capabilities
49
+ -
50
+ Thinking
51
+ Summarization
52
+ Text classification
53
+ Text extraction
54
+ Question-answering
55
+ Retrieval Augmented Generation (RAG)
56
+ Code related tasks
57
+ Function-calling tasks
58
+ Multilingual dialog use cases
59
+ Long-context tasks including long document/meeting summarization, long document QA, etc.
60
+
61
+
62
+ Generation:
63
+ -
64
+ This is a simple example of how to use Granite-3.2-2B-Instruct model.
65
+
66
+
67
+ Install the following libraries:
68
+ -
69
+
70
+ pip install torch torchvision torchaudio
71
+ pip install accelerate
72
+ pip install transformers
73
+
74
+ Then, copy the snippet from the section that is relevant for your use case.
75
+
76
+
77
+ from transformers import AutoModelForCausalLM, AutoTokenizer, set_seed
78
+ import torch
79
+
80
+ model_path="ibm-granite/granite-3.2-2b-instruct"
81
+ device="cuda"
82
  model = AutoModelForCausalLM.from_pretrained(
83
  model_path,
84
  device_map=device,