DARE: Data Augmented Relation Extraction with GPT-2
Abstract
Data Augmented Relation Extraction (DARE) fine-tunes GPT-2 to generate training data for specific relation types, improving a BERT-based classifier's performance by up to 11 F1 score points over a baseline and setting a new state-of-the-art on biomedical RE datasets.
Real-world Relation Extraction (RE) tasks are challenging to deal with, either due to limited training data or class imbalance issues. In this work, we present Data Augmented Relation Extraction(DARE), a simple method to augment training data by properly fine-tuning GPT-2 to generate examples for specific relation types. The generated training data is then used in combination with the gold dataset to train a BERT-based RE classifier. In a series of experiments we show the advantages of our method, which leads in improvements of up to 11 F1 score points against a strong base-line. Also, DARE achieves new state of the art in three widely used biomedical RE datasets surpassing the previous best results by 4.7 F1 points on average.
Get this paper in your agent:
hf papers read 2004.13845 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 2
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper