BARTpho Vietnamese Summarization Model

This model is a fine-tuned version of vinai/bartpho-syllable for Vietnamese text summarization.

Model Details

  • Base Model: vinai/bartpho-syllable
  • Task: Text Summarization
  • Language: Vietnamese
  • Training Dataset: Vietnamese News Dataset

Usage

from transformers import BartForConditionalGeneration, AutoTokenizer

model_name = "YOUR_USERNAME/bartpho-vietnamese-summarization"
# Use AutoTokenizer for BARTpho (automatically loads BartphoTokenizer)
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = BartForConditionalGeneration.from_pretrained(model_name)

# Example usage
text = "Your Vietnamese news article text here..."
inputs = tokenizer(text, return_tensors="pt", max_length=1024, truncation=True)
summary_ids = model.generate(inputs["input_ids"], max_length=128, num_beams=4, early_stopping=True)
summary = tokenizer.decode(summary_ids[0], skip_special_tokens=True)
print(summary)

Training Details

  • Training Framework: Hugging Face Transformers
  • GPU: NVIDIA P100 16GB
  • Batch Size: 8 per device
  • Gradient Accumulation: 2 steps
  • Learning Rate: 2e-5
  • Epochs: 3
Downloads last month
20
Safetensors
Model size
0.4B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Evaluation results