LLM-Essay Detector (RoBERTa-base Finetuned)
This model is a fine-tuned version of FacebookAI/roberta-base for detecting LLM-generated student essays.
It was developed as part of the research paper:
Lukas Gehring & Benjamin Paaßen (2025)
Assessing LLM Text Detection in Educational Contexts: Does Human Contribution Affect Detection?
arXiv:2508.08096
Model Details
- Model type: Transformer-based text classifier (binary classification)
- Language: English
- Base model:
FacebookAI/roberta-base - Labels:
["human-written (LABEL_0)", "LLM-generated (LABEL_1)"]
Training Data
Human-written Essays
The human-written essays are from the BAWE dataset (Hilary et al. [1]),
a corpus of argumentative student essays originally designed for persuasive writing research.
LLM-generated Essays
Synthetic essays were generated following the methodology described in the cited paper (Gehring & Paaßen, 2025),
to simulate LLM-produced student writing.
These examples were paired with the human-written essays to create a balanced binary classification dataset.
Related Models
We also provide two additional fine-tuned versions of this model:
https://huggingface.co/lgehring/roberta-base-gede-aae https://huggingface.co/lgehring/roberta-base-gede-persuade
Intended Use
- Intended for: Research on LLM text detection, model interpretability, and fairness in educational contexts.
- Out-of-scope use: Automated essay grading, plagiarism detection, or disciplinary decisions in education.
Limitations
- May produce false positives (classifying human text as LLM-generated) or false negatives.
- Performance depends on domain similarity between training data and input text.
- Should not be used to make high-stakes decisions about students or authorship.
Example Usage
from transformers import pipeline
detector = pipeline("text-classification", model="lgehring/roberta-base-gede-bawe")
text = "This essay argues that artificial intelligence will reshape education."
print(detector(text))
Cite
If you use this model, please cite:
@misc{gehring2025assessingllmtextdetection,
title={Assessing LLM Text Detection in Educational Contexts: Does Human Contribution Affect Detection?},
author={Lukas Gehring and Benjamin Paaßen},
year={2025},
eprint={2508.08096},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2508.08096},
}
References
[1] Hilary Nesi, Sheena Gardner, Paul Thompson, and Paul Wickens. British academic written english corpus, 2008. URL http://hdl.handle.net/20.500.14106/2539. Literary and Linguistic Data Service.
- Downloads last month
- 2
Model tree for lgehring/roberta-base-gede-bawe
Base model
FacebookAI/roberta-base