Skip to content
/ ArGPT Public

ArGPT: A LLM-based dataset for Argumentation

Notifications You must be signed in to change notification settings

C4AI/ArGPT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

ArGPT: A LLM-based dataset for Argumentation

The ArGPT dataset contains a set of argumentative essays generated using ChatGPT 3.5 and annotated for:

  • Argument Mining: as defined by three different sub-tasks, i.e. Span Detection, Component Classification and Relation Classification;
  • Automatic Essay Scoring: using real-world argumentative essay's correction criteria;
  • Argument Quality: defined as good if the essay defends a true claim using a sound argumentation, bad if the argumentation is flawed or ugly if the argumentation is sound, but the claim it justifies is false. The evaluation criteria were:
    • criteria_0: Clearly states a major claim;
    • criteria_1: Introduces the theme;
    • criteria_2: Develops the arguments throughout the text;
    • criteria_3: Recapitulates the arguments in the conclusion;
    • criteria_4: Adherence to standard language norms;
    • criteria_5: Correct use of argumentative connectives;
    • criteria_6: Adherence to the theme;
    • criteria_7: No repetition of arguments;
    • criteria_8: No contradictions;
    • criteria_9: No beating around the bush;
    • criteria_10: States true or plausible arguments.

ArGPT v1:

Contains 168 texts annotated by a single annotator

ArGPT v2:

Contains 172 texts with the resulting annotation being the consensus of two different annotators.

License

If you are using this dataset for research purposes, please cite the following paper:

Rocha, V.H.N., Silveira, I.C., Pirozelli, P., Mauá, D.D., Cozman, F.G.: Assessing good, bad and ugly arguments generated by chatgpt: a new dataset, its methodology and associated tasks. In: Moniz, N., Vale, Z., Cascalho, J., Silva, C., Sebastião, R. (eds.) Progress in Artificial Intelligence. pp. 428–440. Springer Nature Switzerland, Cham (2023)

About

ArGPT: A LLM-based dataset for Argumentation

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published