<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE ArticleSet PUBLIC "-//NLM//DTD PubMed 2.7//EN" "https://dtd.nlm.nih.gov/ncbi/pubmed/in/PubMed.dtd">
<ArticleSet>
		<Article>
		<Journal>
			<PublisherName>Majlesi Journal of Electrical Engineering</PublisherName>
			<JournalTitle>An Investigation of Term Weighting and Feature Selection Methods for Sentiment Analysis</JournalTitle>
			<Issn></Issn>
			<Volume>Volume 12 (2018)</Volume>
			<Issue>Issue 2, June 2018</Issue>
			<PubDate PubStatus="epublish">
                <Year>2024</Year>
                <Month>02</Month>
                <Day>15</Day>
			</PubDate>
		</Journal>
		<ArticleTitle>An Investigation of Term Weighting and Feature Selection Methods for Sentiment Analysis</ArticleTitle>
		<VernacularTitle></VernacularTitle>
		<FirstPage></FirstPage>
		<LastPage></LastPage>
		<ELocationID EIdType="doi"></ELocationID>
		<Language>EN</Language>
		<AuthorList>
            			<Author>
                				<FirstName>Tuba</FirstName>
				<LastName>Parlar</LastName>
				<Affiliation>Department of Mathematics, Mustafa Kemal University, 31060, Hatay, Turkiye</Affiliation>
				<Identifier Source="ORCID"></Identifier>
			</Author>
            			<Author>
                				<FirstName>Selma</FirstName>
				<LastName>Ayşe Özel</LastName>
				<Affiliation>Department of Computer Engineering, Cukurova University, 01330, Adana, Turkiye</Affiliation>
				<Identifier Source="ORCID"></Identifier>
			</Author>
            		</AuthorList>
		<PublicationType>Journal Article</PublicationType>
		<History>
			<PubDate PubStatus="received">
				<Year>2024</Year>
				<Month>02</Month>
				<Day>15</Day>
			</PubDate>
		</History>
		<Abstract>Sentiment analysis automatically classifies the opinions, which are expressed in a document, usually as positive or negative. A review document in general, reflects its author’s opinion about the objects mentioned in the text. Therefore, it can have many useful applications such as opinionated web search and automatic analysis of reviews. Although sentiment analysis is a kind of text classification problem, structures of review documents are different from texts like news, articles, or web pages; so that techniques applied for text classification are needed to be re-experimented for the sentiment analysis. Assigning appropriate weights to features is important to the performance of sentiment analysis so that important features can receive higher weights for the feature vectors. Feature selection reduces feature vector size by eliminating redundant or irrelevant features to improve classification accuracy. In this study, our aim is to examine the effects of term weighting methods on newly proposed Query Expansion Ranking (QER) feature selection method and also compare the classification results with one of the well-known feature selection method namely Chi-square statistic. We use three popular term weighting methods (i.e., term presence, term frequency, term frequency and inverse document frequency-tf*idf) and perform experiments using multinomial Naïve Bayes classifier. The experimental results show that when QER feature selection method is used with tf*idf term weighting method, the classification performance improves in terms of F-score.</Abstract>
		<ObjectList>
            			<Object Type="keyword">
				<Param Name="value">Text Classification</Param>
			</Object>
						<Object Type="keyword">
				<Param Name="value">OpenCV</Param>
			</Object>
						<Object Type="keyword">
				<Param Name="value">Feature Selection</Param>
			</Object>
						<Object Type="keyword">
				<Param Name="value">Sentiment Analysis</Param>
			</Object>
						<Object Type="keyword">
				<Param Name="value">Term Weighting</Param>
			</Object>
					</ObjectList>
	</Article>
	</ArticleSet>
