rafiaa committed on
Commit
81148b8
·
verified ·
1 Parent(s): fa76515

Upload folder using huggingface_hub

README.md ADDED
@@ -0,0 +1,295 @@
+ ---
+ library_name: peft
+ base_model: codellama/CodeLlama-7b-Instruct-hf
+ tags:
+ - terraform
+ - terraform-configuration
+ - infrastructure-as-code
+ - iac
+ - hashicorp
+ - codellama
+ - lora
+ - qlora
+ - peft
+ - code-generation
+ - devops
+ - cloud
+ - aws
+ - azure
+ - gcp
+ - multi-cloud
+ - automation
+ - configuration-management
+ - cloud-infrastructure
+ license: apache-2.0
+ language:
+ - en
+ pipeline_tag: text-generation
+ ---
+
+ # terraform-cloud-codellama-7b
+
+ **RECOMMENDED MODEL**: an advanced LoRA fine-tuned model for Terraform infrastructure-as-code generation across multiple cloud providers (AWS, Azure, GCP). It generates Terraform configurations, HCL code, and multi-cloud infrastructure automation scripts.
+
+ ## Model Description
+
+ This is the **enhanced model**: an advanced version of terraform-codellama-7b that has been additionally trained on public AWS, Azure, and GCP documentation. It offers stronger performance for multi-cloud Terraform development, with a deeper understanding of provider-specific resources and best practices.
+
+ ### Key Features
+
+ - **Multi-Cloud Support**: Trained on AWS, Azure, and GCP documentation
+ - **Enhanced Performance**: Improves on the base terraform-codellama-7b model
+ - **Production Ready**: Optimized for real-world multi-cloud infrastructure development
+ - **Comprehensive Coverage**: Handles complex cloud provider-specific configurations
+ - **Efficient Training**: Uses QLoRA (4-bit quantization + LoRA) for memory efficiency
+
+ ## Model Details
+
+ - **Developed by**: Rafi Al Attrach, Patrick Schmitt, Nan Wu, Helena Schneider, Stefania Saju (TUM + IBM Research Project)
+ - **Model type**: LoRA fine-tuned CodeLlama (Enhanced)
+ - **Language(s)**: English
+ - **License**: Apache 2.0
+ - **Finetuned from**: [codellama/CodeLlama-7b-Instruct-hf](https://huggingface.co/codellama/CodeLlama-7b-Instruct-hf)
+ - **Training method**: QLoRA (4-bit quantization + LoRA)
+ - **Base Model**: Built on [rafiaa/terraform-codellama-7b](https://huggingface.co/rafiaa/terraform-codellama-7b)
+
+ ### Technical Specifications
+
+ - **Base Model**: CodeLlama-7b-Instruct-hf
+ - **LoRA Rank**: 64
+ - **LoRA Alpha**: 16
+ - **Target Modules**: q_proj, v_proj
+ - **Training Epochs**: 3 (Stage 1) + additional training (Stage 2)
+ - **Max Sequence Length**: 512
+ - **Quantization**: 4-bit (fp4)
+
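+ The hyperparameters above map one-to-one onto the `adapter_config.json` shipped in this repository. A minimal sketch of the equivalent `peft`/`bitsandbytes` objects (the compute dtype is an assumption, since the card does not specify it):
+
+ ```python
+ import torch
+ from peft import LoraConfig
+ from transformers import BitsAndBytesConfig
+
+ # LoRA settings, mirroring the shipped adapter_config.json
+ lora_config = LoraConfig(
+     r=64,
+     lora_alpha=16,
+     lora_dropout=0.0,
+     target_modules=["q_proj", "v_proj"],
+     bias="none",
+     task_type="CAUSAL_LM",
+ )
+
+ # 4-bit fp4 quantization as listed above
+ bnb_config = BitsAndBytesConfig(
+     load_in_4bit=True,
+     bnb_4bit_quant_type="fp4",
+     bnb_4bit_compute_dtype=torch.float16,  # assumption; not specified in the card
+ )
+ ```
+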
+ ## Uses
+
+ ### Direct Use
+
+ This model is designed for:
+ - **Multi-cloud Terraform development**
+ - **AWS resource configuration** (EC2, S3, RDS, Lambda, etc.)
+ - **Azure resource management** (Virtual Machines, Storage Accounts, App Services, etc.)
+ - **GCP resource deployment** (Compute Engine, Cloud Storage, Cloud SQL, etc.)
+ - **Complex infrastructure orchestration**
+ - **Cloud provider-specific best practices**
+
+ ### Example Use Cases
+
+ ```python
+ # Generate AWS multi-service infrastructure
+ prompt = "Create a Terraform configuration for an AWS application with VPC, EC2, RDS, and S3"
+ ```
+
+ ```python
+ # Generate Azure App Service with database
+ prompt = "Create a Terraform configuration for an Azure App Service with PostgreSQL database"
+ ```
+
+ ```python
+ # Generate GCP Kubernetes cluster
+ prompt = "Create a Terraform configuration for a GCP GKE cluster with node pools"
+ ```
+
+ ```python
+ # Generate multi-cloud setup
+ prompt = "Create a Terraform configuration for a hybrid cloud setup using AWS and Azure"
+ ```
+
+ ## How to Get Started
+
+ ### Installation
+
+ ```bash
+ pip install transformers torch peft accelerate bitsandbytes
+ ```
+
+ ### Loading the Model
+
+ #### GPU Usage (Recommended)
+ ```python
+ import torch
+ from transformers import AutoTokenizer, AutoModelForCausalLM, BitsAndBytesConfig
+ from peft import PeftModel
+
+ # Load base model with 4-bit quantization (GPU);
+ # BitsAndBytesConfig replaces the deprecated load_in_4bit kwarg
+ base_model = "codellama/CodeLlama-7b-Instruct-hf"
+ model = AutoModelForCausalLM.from_pretrained(
+     base_model,
+     quantization_config=BitsAndBytesConfig(load_in_4bit=True),
+     torch_dtype=torch.float16,
+     device_map="auto"
+ )
+
+ # Load LoRA adapter
+ model = PeftModel.from_pretrained(model, "rafiaa/terraform-cloud-codellama-7b")
+ tokenizer = AutoTokenizer.from_pretrained(base_model)
+
+ # Set pad token
+ if tokenizer.pad_token is None:
+     tokenizer.pad_token = tokenizer.eos_token
+ ```
+
+ #### CPU Usage (Alternative)
+ ```python
+ import torch
+ from transformers import AutoTokenizer, AutoModelForCausalLM
+ from peft import PeftModel
+
+ # Load base model (CPU compatible)
+ base_model = "codellama/CodeLlama-7b-Instruct-hf"
+ model = AutoModelForCausalLM.from_pretrained(
+     base_model,
+     torch_dtype=torch.float32,
+     device_map="cpu"
+ )
+
+ # Load LoRA adapter
+ model = PeftModel.from_pretrained(model, "rafiaa/terraform-cloud-codellama-7b")
+ tokenizer = AutoTokenizer.from_pretrained(base_model)
+
+ # Set pad token
+ if tokenizer.pad_token is None:
+     tokenizer.pad_token = tokenizer.eos_token
+ ```
+
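+ To serve the model without a `peft` dependency, the adapter can optionally be merged into the base weights. A minimal sketch, assuming the full- or half-precision load above (merging is most reliable on a non-quantized base); the output directory name is illustrative:
+
+ ```python
+ # Merge the LoRA weights into the base model and save a standalone copy
+ merged = model.merge_and_unload()
+ merged.save_pretrained("terraform-cloud-codellama-7b-merged")  # illustrative path
+ tokenizer.save_pretrained("terraform-cloud-codellama-7b-merged")
+ ```
+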
+ ### Usage Example
+
+ ```python
+ def generate_terraform(prompt, max_length=512):
+     # Move inputs to the model's device (matters when device_map="auto")
+     inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
+
+     with torch.no_grad():
+         outputs = model.generate(
+             **inputs,
+             max_length=max_length,
+             temperature=0.7,
+             do_sample=True,
+             pad_token_id=tokenizer.eos_token_id
+         )
+
+     return tokenizer.decode(outputs[0], skip_special_tokens=True)
+
+ # Example: Multi-cloud infrastructure
+ prompt = """
+ Create a Terraform configuration for a multi-cloud setup:
+ - AWS: VPC with public/private subnets, EC2 instances
+ - Azure: Storage account and App Service
+ - GCP: Cloud SQL database
+ """
+
+ result = generate_terraform(prompt)
+ print(result)
+ ```
+
+ ### Advanced Usage
+
+ ```python
+ # Cloud-specific prompts
+ aws_prompt = "Create a Terraform configuration for AWS EKS cluster with managed node groups"
+ azure_prompt = "Create a Terraform configuration for Azure Kubernetes Service (AKS)"
+ gcp_prompt = "Create a Terraform configuration for GCP Cloud Run service"
+
+ # Generate configurations
+ aws_config = generate_terraform(aws_prompt)
+ azure_config = generate_terraform(azure_prompt)
+ gcp_config = generate_terraform(gcp_prompt)
+ ```
+
+ ## Training Details
+
+ ### Training Data
+
+ - **Stage 1**: Public Terraform Registry documentation
+ - **Stage 2**: Additional training on:
+   - **AWS Documentation**: EC2, S3, RDS, Lambda, VPC, IAM, etc.
+   - **Azure Documentation**: Virtual Machines, Storage Accounts, App Services, Key Vault, etc.
+   - **GCP Documentation**: Compute Engine, Cloud Storage, Cloud SQL, GKE, etc.
+
+ ### Training Procedure
+
+ - **Method**: QLoRA (4-bit quantization + LoRA)
+ - **Two-Stage Training**:
+   1. Terraform Registry documentation
+   2. Cloud provider documentation (AWS, Azure, GCP)
+ - **LoRA Rank**: 64
+ - **LoRA Alpha**: 16
+ - **Target Modules**: q_proj, v_proj
+ - **Training Epochs**: 3 (Stage 1) + additional training (Stage 2)
+ - **Max Sequence Length**: 512
+ - **Quantization**: 4-bit (fp4)
+
+ ### Training Hyperparameters
+
+ - **Training regime**: 4-bit mixed precision
+ - **LoRA Dropout**: 0.0
+ - **Learning Rate**: Optimized for QLoRA training
+ - **Batch Size**: Optimized for memory efficiency
+
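+ Because Stage 2 continues from the Stage 1 adapter rather than starting fresh, continued training can be wired up by reloading that adapter with trainable weights. A rough sketch of the idea, not the authors' actual training script (data loading and optimizer settings are not published and are omitted here):
+
+ ```python
+ from transformers import AutoModelForCausalLM, BitsAndBytesConfig
+ from peft import PeftModel, prepare_model_for_kbit_training
+
+ # 4-bit fp4 base model, as in Stage 1
+ model = AutoModelForCausalLM.from_pretrained(
+     "codellama/CodeLlama-7b-Instruct-hf",
+     quantization_config=BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_quant_type="fp4"),
+     device_map="auto",
+ )
+ model = prepare_model_for_kbit_training(model)
+
+ # Resume from the Stage 1 adapter with trainable weights for Stage 2
+ model = PeftModel.from_pretrained(
+     model, "rafiaa/terraform-codellama-7b", is_trainable=True
+ )
+ model.print_trainable_parameters()
+ ```
+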
+ ## Performance Comparison
+
+ | Model | Terraform Knowledge | AWS Support | Azure Support | GCP Support | Multi-Cloud Capability |
+ |-------|---------------------|-------------|---------------|-------------|------------------------|
+ | terraform-codellama-7b | Excellent | Limited | Limited | Limited | Basic |
+ | **terraform-cloud-codellama-7b** | Excellent | Excellent | Excellent | Excellent | Advanced |
+
+ ## Limitations and Bias
+
+ ### Known Limitations
+
+ - **Context Length**: Limited to 512 tokens due to training configuration (see the snippet after this list)
+ - **Domain Specificity**: Optimized for Terraform and cloud infrastructure
+ - **Base Model Limitations**: Inherits limitations from CodeLlama-7b-Instruct-hf
+ - **Cloud Provider Updates**: May not include the latest cloud provider features
+
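+ Because of the 512-token limit, it is safest to truncate prompts explicitly at tokenization time, for example:
+
+ ```python
+ # Keep prompts within the 512-token training context
+ inputs = tokenizer(prompt, truncation=True, max_length=512, return_tensors="pt")
+ ```
+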
+ ### Recommendations
+
+ - Use for Terraform and cloud infrastructure tasks
+ - Validate generated configurations before deployment (see the sketch after this list)
+ - Consider the 512-token context limit for complex configurations
+ - For production use, always review and test generated code
+ - Stay updated with cloud provider documentation for the latest features
+
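+ That validation step can be automated against the Terraform CLI itself, assuming `terraform` is installed locally; `validate_terraform` below is an illustrative helper, not part of this repository:
+
+ ```python
+ import pathlib
+ import subprocess
+ import tempfile
+
+ def validate_terraform(hcl: str) -> bool:
+     """Run `terraform validate` on a generated configuration."""
+     with tempfile.TemporaryDirectory() as tmp:
+         pathlib.Path(tmp, "main.tf").write_text(hcl)
+         # -backend=false skips remote state, so no credentials are needed
+         subprocess.run(["terraform", "init", "-backend=false"],
+                        cwd=tmp, check=True, capture_output=True)
+         result = subprocess.run(["terraform", "validate"],
+                                 cwd=tmp, capture_output=True)
+         return result.returncode == 0
+ ```
+
+ Note that the decoded output includes the prompt text, so extract just the HCL before validating.
+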
+ ## Environmental Impact
+
+ - **Training Method**: QLoRA reduces computational requirements significantly
+ - **Hardware**: Trained using efficient 4-bit quantization
+ - **Carbon Footprint**: Reduced compared to full fine-tuning due to QLoRA efficiency
+ - **Two-Stage Approach**: Efficient incremental training
+
+ ## Citation
+
+ If you use this model in your research, please cite:
+
+ ```bibtex
+ @misc{terraform-cloud-codellama-7b,
+   title={terraform-cloud-codellama-7b: A Multi-Cloud LoRA Fine-tuned Model for Terraform Code Generation},
+   author={Rafi Al Attrach and Patrick Schmitt and Nan Wu and Helena Schneider and Stefania Saju},
+   year={2024},
+   url={https://huggingface.co/rafiaa/terraform-cloud-codellama-7b}
+ }
+ ```
+
+ ## Related Models
+
+ - **Base Model**: [codellama/CodeLlama-7b-Instruct-hf](https://huggingface.co/codellama/CodeLlama-7b-Instruct-hf)
+ - **Stage 1 Model**: [rafiaa/terraform-codellama-7b](https://huggingface.co/rafiaa/terraform-codellama-7b)
+ - **This Model**: [rafiaa/terraform-cloud-codellama-7b](https://huggingface.co/rafiaa/terraform-cloud-codellama-7b) (Recommended)
+
+ ## Model Card Contact
+
+ - **Author**: rafiaa
+ - **Model Repository**: [HuggingFace Model](https://huggingface.co/rafiaa/terraform-cloud-codellama-7b)
+ - **Issues**: Please report issues through the HuggingFace model page
+
+ ## Acknowledgments
+
+ - **Research Project**: Early 2024 research project at TUM + IBM
+ - **Training Data**: Public documentation from Terraform Registry, AWS, Azure, and GCP
+ - **Base Model**: Meta's CodeLlama-7b-Instruct-hf
+ - **Training Method**: QLoRA for efficient fine-tuning
+
+ ---
+
+ *This model represents the culmination of a two-stage fine-tuning approach, combining Terraform expertise with comprehensive cloud provider knowledge for superior infrastructure-as-code generation.*
adapter_config.json ADDED
@@ -0,0 +1,26 @@
+ {
+   "alpha_pattern": {},
+   "auto_mapping": null,
+   "base_model_name_or_path": "codellama/CodeLlama-7b-Instruct-hf",
+   "bias": "none",
+   "fan_in_fan_out": false,
+   "inference_mode": true,
+   "init_lora_weights": true,
+   "layers_pattern": null,
+   "layers_to_transform": null,
+   "loftq_config": {},
+   "lora_alpha": 16,
+   "lora_dropout": 0.0,
+   "megatron_config": null,
+   "megatron_core": "megatron.core",
+   "modules_to_save": null,
+   "peft_type": "LORA",
+   "r": 64,
+   "rank_pattern": {},
+   "revision": null,
+   "target_modules": [
+     "v_proj",
+     "q_proj"
+   ],
+   "task_type": "CAUSAL_LM"
+ }
adapter_model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5ceed0c4c770992a0c222605208186092c792755c980836cab1628bf9988bee0
+ size 134235048
gitattributes ADDED
@@ -0,0 +1,35 @@
+ *.7z filter=lfs diff=lfs merge=lfs -text
+ *.arrow filter=lfs diff=lfs merge=lfs -text
+ *.bin filter=lfs diff=lfs merge=lfs -text
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
+ *.ftz filter=lfs diff=lfs merge=lfs -text
+ *.gz filter=lfs diff=lfs merge=lfs -text
+ *.h5 filter=lfs diff=lfs merge=lfs -text
+ *.joblib filter=lfs diff=lfs merge=lfs -text
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
+ *.model filter=lfs diff=lfs merge=lfs -text
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
+ *.npy filter=lfs diff=lfs merge=lfs -text
+ *.npz filter=lfs diff=lfs merge=lfs -text
+ *.onnx filter=lfs diff=lfs merge=lfs -text
+ *.ot filter=lfs diff=lfs merge=lfs -text
+ *.parquet filter=lfs diff=lfs merge=lfs -text
+ *.pb filter=lfs diff=lfs merge=lfs -text
+ *.pickle filter=lfs diff=lfs merge=lfs -text
+ *.pkl filter=lfs diff=lfs merge=lfs -text
+ *.pt filter=lfs diff=lfs merge=lfs -text
+ *.pth filter=lfs diff=lfs merge=lfs -text
+ *.rar filter=lfs diff=lfs merge=lfs -text
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
+ *.tar filter=lfs diff=lfs merge=lfs -text
+ *.tflite filter=lfs diff=lfs merge=lfs -text
+ *.tgz filter=lfs diff=lfs merge=lfs -text
+ *.wasm filter=lfs diff=lfs merge=lfs -text
+ *.xz filter=lfs diff=lfs merge=lfs -text
+ *.zip filter=lfs diff=lfs merge=lfs -text
+ *.zst filter=lfs diff=lfs merge=lfs -text
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
special_tokens_map.json ADDED
@@ -0,0 +1,30 @@
+ {
+   "additional_special_tokens": [
+     "▁<PRE>",
+     "▁<MID>",
+     "▁<SUF>",
+     "▁<EOT>"
+   ],
+   "bos_token": {
+     "content": "<s>",
+     "lstrip": false,
+     "normalized": true,
+     "rstrip": false,
+     "single_word": false
+   },
+   "eos_token": {
+     "content": "</s>",
+     "lstrip": false,
+     "normalized": true,
+     "rstrip": false,
+     "single_word": false
+   },
+   "pad_token": "</s>",
+   "unk_token": {
+     "content": "<unk>",
+     "lstrip": false,
+     "normalized": true,
+     "rstrip": false,
+     "single_word": false
+   }
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
tokenizer_config.json ADDED
@@ -0,0 +1,81 @@
+ {
+   "added_tokens_decoder": {
+     "0": {
+       "content": "<unk>",
+       "lstrip": false,
+       "normalized": true,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "1": {
+       "content": "<s>",
+       "lstrip": false,
+       "normalized": true,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "2": {
+       "content": "</s>",
+       "lstrip": false,
+       "normalized": true,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "32007": {
+       "content": "▁<PRE>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "32008": {
+       "content": "▁<SUF>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "32009": {
+       "content": "▁<MID>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "32010": {
+       "content": "▁<EOT>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     }
+   },
+   "additional_special_tokens": [
+     "▁<PRE>",
+     "▁<MID>",
+     "▁<SUF>",
+     "▁<EOT>"
+   ],
+   "bos_token": "<s>",
+   "clean_up_tokenization_spaces": false,
+   "eos_token": "</s>",
+   "eot_token": "▁<EOT>",
+   "fill_token": "<FILL_ME>",
+   "legacy": null,
+   "middle_token": "▁<MID>",
+   "model_max_length": 1000000000000000019884624838656,
+   "pad_token": "</s>",
+   "prefix_token": "▁<PRE>",
+   "sp_model_kwargs": {},
+   "suffix_token": "▁<SUF>",
+   "tokenizer_class": "CodeLlamaTokenizer",
+   "unk_token": "<unk>",
+   "use_default_system_prompt": false
+ }
training_args.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9d6d743a5ea448e88622f69d8ee718438fcd05eddbd451faf76bb807b13a295a
+ size 4600