Research Data Leeds Repository

Annotated Corpus of Arabic Al-Quran Question and Answer

Alqahtani, Mohammad and Atwell, Eric (2018) Annotated Corpus of Arabic Al-Quran Question and Answer. University of Leeds. [Dataset] https://doi.org/10.5518/356

Dataset description

AQQAC is a collection of approximately 2224 questions and answers about Al-Al-Quran. Each question and answer is annotated with the question ID, question word (particles), chapter number, verse number, question topic, question type, Al-Quran ontology concepts (Alqahtani & Atwell, 2018) and question source. The aim of this corpus is to provide a Question-Answering taxonomy for questions about Al-Quran. Additionally, this corpus might be used as a data set for testing and evaluating Islamic IR systems. The text of Al-Quran questions and answers were extracted from trusted two islamic sources: (1000 Su'al Wa Jawab Fi ALKORAN) was compiled by the famous Islamic scholar Ashur (2001). This book contains 1000 questions and answers about Al-Quran written in the Arabic language. Islam – Al-Quran and Tafseer is a website about Al-Quran that includes a description and a translation of Al-Quran and the reciting rules, the “Tajweed”. Additionally, this website has approximately 1224 questions and answers about Al-Quran in the Arabic language extracted from the Altabari Tafseer. Currently, this dataset contains 1224 annotated question-answers and the missing data that hasn’t been shared is due to copyright concerns.

Keywords: quran, Q&A, corpus, NLP
Subjects: I000 - Computer sciences > I400 - Artificial intelligence
Divisions: Faculty of Engineering > School of Computing
License: Creative Commons Attribution 4.0 International (CC BY 4.0)
Date deposited: 03 Dec 2018 16:08
URI: https://archive.researchdata.leeds.ac.uk/id/eprint/464

Files

Data

Research Data Leeds Repository is powered by EPrints
Copyright © 2021 University of Leeds