LEGIT-BART
					Collection
				
This collection includes all LEGIT-BART models: Italian legal pre-trained models with varying context lengths utilizing LSG.
					β’ 
				8 items
				β’ 
				Updated
					
				
The LEGIT-BART models are a family of pre-trained transformer-based models for Italian legal text processing. 
They build upon BART-IT (morenolq/bart-it) and are further pre-trained on Italian legal corpora.
π‘ Key features:
β οΈ This specific model is pre-trained on general-purpose Italian text! Please select the best model from the table below.
| Model | Description | Link | 
|---|---|---|
| LEGIT-BART | Continued pre-training of morenolq/bart-iton Italian legal texts | π Link | 
| LEGIT-BART-LSG-4096 | Continued pre-training of morenolq/bart-it, supporting 4,096 tokens | π Link | 
| LEGIT-BART-LSG-16384 | Continued pre-training of morenolq/bart-it, supporting 16,384 tokens | π Link | 
| LEGIT-SCRATCH-BART | Trained from scratch on Italian legal texts | π Link | 
| LEGIT-SCRATCH-BART-LSG-4096 | Trained from scratch with LSG attention, supporting 4,096 tokens | π Link | 
| LEGIT-SCRATCH-BART-LSG-16384 | Trained from scratch with LSG attention, supporting 16,384 tokens | π Link | 
| BART-IT-LSG-4096 | morenolq/bart-itwith LSG attention, supporting 4,096 tokens (β οΈ no legal adaptation) | π Link | 
| BART-IT-LSG-16384 | morenolq/bart-itwith LSG attention, supporting 16,384 tokens (β οΈ no legal adaptation) | π Link | 
πΉ Architecture
morenolq/bart-itπΉ Training Data
joelniklaus/Multi_Legal_Pilefrom transformers import BartForConditionalGeneration, AutoTokenizer
# Load tokenizer and model
model_name = "morenolq/BART-IT-LSG-16384"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = BartForConditionalGeneration.from_pretrained(model_name)
# Example input
input_text = "<mask> 1234: Il contratto si intende concluso quando..."
inputs = tokenizer(input_text, return_tensors="pt", max_length=16384, truncation=True)
# Generate summary
summary_ids = model.generate(inputs.input_ids, max_length=150, num_beams=4, early_stopping=True)
summary = tokenizer.decode(summary_ids[0], skip_special_tokens=True)
print("π Summary:", summary)
β οΈ Limitations & Ethical Considerations
The paper presenting LEGIT-BART models is currently under review and will be updated here once published.
@article{benedetto2025legitbart,
    title        = {LegItBART: a summarization model for Italian legal documents},
    author       = {Benedetto, Irene and La Quatra, Moreno and Cagliero, Luca},
    year         = 2025,
    journal      = {Artificial Intelligence and Law},
    publisher    = {Springer},
    pages        = {1--31},
    doi          = {10.1007/s10506-025-09436-y},
    url          = {doi.org/10.1007/s10506-025-09436-y}
}
Base model
morenolq/bart-it