Building a causation annotated corpus : the Salford Arabic Causal Bank-proclitics

Sadek, J and Meziane, F ORCID: 2018, Building a causation annotated corpus : the Salford Arabic Causal Bank-proclitics , in: The 3rd Workshop on Open-Source Arabic Corpora and Processing Tools, 8 May 2018, Miyazaki, Japan.

PDF - Accepted Version
Available under License Creative Commons Attribution Non-commercial 4.0.

Download (229kB) | Preview


We introduce the Salford Arabic Causal Bank (SACB) corpus, a new corpus dedicated to Arabic Causal relations. Causality as a linguistic phenomenon can be expressed using different elements and grammatical expressions. In Arabic language, causal particles – Purpose Lām, Causation Fa’a, Causation Ba’a- are frequently prefixed to words; they play a key role in indicating causality. However, these particles give different meanings according to their position in the text. In fact, these meanings can be interpreted according to the context in which they occur. This ambiguity emphasizes the high demand for a large-scale corpus in which instances of these particles are annotated. In this paper, we present the first stage of building the SACB, which includes a collection of annotated sentences each of which contains an instance of a causal particle. The sentences were carefully examined by two specialist annotators to give an accurate account for each annotated instance. Arabic is a less–resourced language and we hope this corpus would help in building better Information Extraction systems.

Item Type: Conference or Workshop Item (Paper)
Schools: Schools > School of Computing, Science and Engineering > Salford Innovation Research Centre
Journal or Publication Title: 11th Edition of the Language Resources and Evaluation Conference
Related URLs:
Depositing User: Prof Farid Meziane
Date Deposited: 03 May 2018 08:45
Last Modified: 16 Feb 2022 18:47

Actions (login required)

Edit record (repository staff only) Edit record (repository staff only)


Downloads per month over past year