---
license: mit
language: en
tags:
- Pre-CoFactv3
- Question Answering
datasets:
- FACTIFY5WQA
metrics:
- bleu
pipeline_tag: question-answering
library_name: transformers
base_model: microsoft/deberta-v3-large
widget:
- text: "Who spent an entire season at aston vila without playing a single game?"
  context: "Micah Richards spent an entire season at Aston Vila without playing a single game."
  example_title: "Claim"
- text: "Who spent an entire season at aston vila without playing a single game?"
  context: "Despite speculation that Richards would leave Aston Villa before the transfer deadline for the 2018~19 season , he remained at the club , although he is not being considered for first team selection."
  example_title: "Evidence"
---

# Pre-CoFactv3-Question-Answering

## Model description

This is a Question Answering model for the **AAAI 2024 Workshop paper: “Team Trifecta at Factify5WQA: Setting the Standard in Fact Verification with Fine-Tuning”**.

Its inputs are a question and a context, and its output is the answer derived from the context. It is fine-tuned on the **FACTIFY5WQA** dataset, based on the [**microsoft/deberta-v3-large**](https://huggingface.co/microsoft/deberta-v3-large) model.

For more details, see our **paper** or the [**GitHub**](https://github.com/AndyChiangSH/Pre-CoFactv3) repository.

## How to use?

1. Download the model with Hugging Face Transformers.
```python
from transformers import AutoModelForQuestionAnswering, AutoTokenizer, pipeline

model = AutoModelForQuestionAnswering.from_pretrained("AndyChiang/Pre-CoFactv3-Question-Answering")
tokenizer = AutoTokenizer.from_pretrained("AndyChiang/Pre-CoFactv3-Question-Answering")
```

2. Create a pipeline.
```python
QA = pipeline("question-answering", model=model, tokenizer=tokenizer)
```

3. Use the pipeline to answer the question from the context.
```python
QA_input = {
    'context': "Micah Richards spent an entire season at Aston Vila without playing a single game.",
    'question': "Who spent an entire season at aston vila without playing a single game?",
}
answer = QA(QA_input)
print(answer)
```
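
The pipeline returns a dictionary with `score`, `start`, `end`, and `answer` keys. The snippet below only illustrates the shape of that output; the values are invented for demonstration, not actual model predictions.
```python
# Illustrative output shape of the question-answering pipeline;
# the values below are invented for demonstration, not real predictions.
{'score': 0.98, 'start': 0, 'end': 14, 'answer': 'Micah Richards'}
```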

## Dataset

We use the FACTIFY5WQA dataset provided by the AAAI-24 Workshop Factify 3.0.

This dataset is designed for fact verification: the task is to determine the veracity of a claim based on the given evidence. Each example contains the following fields (an illustrative record is sketched after the split table below):

- **claim:** the statement to be verified.
- **evidence:** the facts used to verify the claim.
- **question:** the questions generated from the claim by the 5W framework (who, what, when, where, and why).
- **claim_answer:** the answers derived from the claim.
- **evidence_answer:** the answers derived from the evidence.
- **label:** the veracity of the claim given the evidence, one of three categories: Support, Neutral, or Refute.

|         | Training | Validation | Testing | Total |
| ------- | -------- | ---------- | ------- | ----- |
| Support | 3500     | 750        | 750     | 5000  |
| Neutral | 3500     | 750        | 750     | 5000  |
| Refute  | 3500     | 750        | 750     | 5000  |
| Total   | 10500    | 2250       | 2250    | 15000 |

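As a concrete illustration, a single record can be pictured as the dictionary below. The values are adapted from the widget example above and the label is invented for demonstration; the real dataset files may use a different serialization.
```python
# Hypothetical FACTIFY5WQA-style record, for illustration only
# (values adapted from the widget example above; not an actual dataset entry).
example = {
    "claim": "Micah Richards spent an entire season at Aston Vila without playing a single game.",
    "evidence": "Despite speculation that Richards would leave Aston Villa before the transfer deadline for the 2018~19 season , he remained at the club , although he is not being considered for first team selection.",
    "question": ["Who spent an entire season at aston vila without playing a single game?"],
    "claim_answer": ["Micah Richards"],
    "evidence_answer": ["Richards"],
    "label": "Support",
}
```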

## Fine-tuning

Fine-tuning is conducted with the Hugging Face Trainer API on the [Question Answering](https://huggingface.co/docs/transformers/tasks/question_answering) task.

### Training hyperparameters

The following hyperparameters were used during training (an illustrative mapping onto the Trainer API is sketched after the list):

- Pre-trained language model: [microsoft/deberta-v3-large](https://huggingface.co/microsoft/deberta-v3-large)
- Optimizer: Adam
- Learning rate: 0.00001
- Max input length: 3200
- Batch size: 4
- Epochs: 3
- Device: NVIDIA RTX A5000

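The sketch below shows one plausible way to express these hyperparameters with `TrainingArguments`. It is an illustration, not the exact training script; the `output_dir` value and the tokenization call in the comment are assumptions.
```python
# A minimal sketch, assuming a standard Hugging Face Trainer setup;
# hyperparameter values come from the list above, everything else is illustrative.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="pre-cofactv3-qa",   # hypothetical output path
    learning_rate=1e-5,             # Learning rate: 0.00001
    per_device_train_batch_size=4,  # Batch size: 4
    num_train_epochs=3,             # Epochs: 3
)

# The max input length (3200) would be applied when tokenizing question/context pairs, e.g.:
# tokenizer(question, context, max_length=3200, truncation="only_second")
```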

## Testing

We compute BLEU scores for both the claim answers and the evidence answers, and take the average of the two as the final metric.

| Claim Answer | Evidence Answer | Average |
| ------------ | --------------- | ------- |
| 0.5248       | 0.3963          | 0.4605  |

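The sketch below shows how such an averaged BLEU score could be computed with the Hugging Face `evaluate` package. It illustrates the metric described above and is not necessarily the exact evaluation script; the function name and argument layout are assumptions.
```python
# A minimal sketch of the averaged BLEU metric, assuming the `evaluate` package;
# not necessarily the exact evaluation script used for the reported numbers.
import evaluate

bleu = evaluate.load("bleu")

def averaged_bleu(claim_preds, claim_refs, evidence_preds, evidence_refs):
    # BLEU of predicted claim answers against gold claim answers
    claim_score = bleu.compute(
        predictions=claim_preds, references=[[r] for r in claim_refs]
    )["bleu"]
    # BLEU of predicted evidence answers against gold evidence answers
    evidence_score = bleu.compute(
        predictions=evidence_preds, references=[[r] for r in evidence_refs]
    )["bleu"]
    # The reported metric is the mean of the two scores
    return (claim_score + evidence_score) / 2
```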

## Other models

[AndyChiang/Pre-CoFactv3-Text-Classification](https://huggingface.co/AndyChiang/Pre-CoFactv3-Text-Classification)

## Citation