Corpus Construction for Arabic Question Answering Subjectivity Classification

SOUFFI, Soumia; BOUAMEUR, Mounia

dc.contributor.author	SOUFFI, Soumia
dc.contributor.author	BOUAMEUR, Mounia
dc.date.accessioned	2023-09-20T09:17:03Z
dc.date.available	2023-09-20T09:17:03Z
dc.date.issued	2023
dc.identifier.uri	https://dspace.univ-ghardaia.edu.dz/xmlui/handle/123456789/6419
dc.description.abstract	Subjectivity and sentiment analysis, have gained significant attention in the field of Natural Language Processing (NLP) due to their ability to extract and classify subjective information expressed in textual data. Although, extensive research has been conducted on major languages such as English, Arabic with its dialectal variations lacks sufficient resources and research in this domain. This study aims to overcome the scarcity of resources in Arabic subjectivity analysis by constructing an extensive Arabic Question-Answering (QA) corpus specifically designed for subjectivity analysis. The corpus construction involves the following steps: data collection through web scraping, and data cleaning to ensure quality, followed by the annotation process by affecting subjectivity labels using two models that we developed utilizing the fine-tuning technique with two pre-trained models, XLM-RoBERTa and AraBERT. The availability of this corpus stimulates further research, drives advancements in Arabic NLP, and contributes to various applications in sentiment analysis and opinion mining.	EN_en
dc.publisher	university ghardaia	EN_en
dc.subject	Subjectivity analysis, sentiment analysis, fine-tuning, AraBERT, XLMRoBERTa.	EN_en
dc.title	Corpus Construction for Arabic Question Answering Subjectivity Classification	EN_en
dc.type	Thesis	EN_en