Skip to the content

Sentiment classification in Persian: Introducing a mutual information-based method for feature selection

Bagheri, A, Saraee, MH and de Jong, F 2013, Sentiment classification in Persian: Introducing a mutual information-based method for feature selection , in: 21st Iranian Conference on Electrical Engineering (ICEE), 2013, 14-16 May 2013, Mashhad, Iran.

[img] PDF - Published Version
Restricted to Repository staff only

Download (750kB) | Request a copy

Abstract

With the enormous growth of online reviews in Internet, sentiment analysis has received more and more attention in information retrieval and natural language processing community. Up to now there are very few researches conducted on sentiment analysis for Persian documents. This paper considers the problem of sentiment classification for online customer reviews in Persian language. One of the challenges of Persian language is using of a wide variety of declensional suffixes. Another common problem of Persian text is word spacing. In Persian in addition to white space as interwords space, an intra-word space called pseudo-space separates word's part. One more noticeable challenge in customer reviews in Persian language is that of utilizing many informal or colloquial words in text. In this paper we study these challenges by proposing a model for sentiment classification of Persian review documents. The proposed model is based on a lemmatization approach for Persian language and is employed Naive Bayes learning algorithm for classification. Additionally we present a new feature selection method based on the mutual information method to extract the best feature collection from the initial extracted features. Finally we evaluate the performance of the model on a manually gathered collection of cellphone reviews, where the results show the effectiveness of the proposed model.

Item Type: Conference or Workshop Item (Paper)
Themes: Memory, Text and Place
Schools: Schools > School of Computing, Science and Engineering
Schools > School of Computing, Science and Engineering > Salford Innovation Research Centre (SIRC)
Journal or Publication Title: Electrical Engineering (ICEE)
Publisher: IEEE
Refereed: Yes
Related URLs:
Funders: Non funded research
Depositing User: Dr Mo Saraee
Date Deposited: 27 Nov 2013 17:06
Last Modified: 30 Nov 2015 23:54
URI: http://usir.salford.ac.uk/id/eprint/30648

Actions (login required)

Edit record (repository staff only) Edit record (repository staff only)

Downloads

Downloads per month over past year