Undergraduate Thesis (Skripsi)
SENTIMENT ANALYSIS OF MOVIE REVIEWS ON IMDB USING THE INFORMATION GAIN FEATURE SELECTION METHOD AND THE SUPPORT VECTOR MACHINE (SVM) ALGORITHM
IMDb is a website that provides information about movies, including user-generated movie reviews. The reviews take the form of free-text comments. However, the large number of features in the reviews makes the textual data ambiguous, which complicates sentiment analysis. To address this challenge, this research applies the Information Gain feature selection method to reduce the high feature dimensionality in sentiment analysis of IMDb movie reviews. The test results indicate that combining Information Gain feature selection with a linear-kernel SVM and a parameter C value of 1 yields the highest performance, with accuracy, precision, recall, and f-measure of 0.88, 0.88, 0.87, and 0.87, respectively. Feature selection also reduces the number of features from 21,989 to 5,869 and the computation time to only 0.12 seconds. In contrast, the SVM algorithm without feature selection performed worse: using all 21,989 features, it reached an accuracy of 0.83, precision of 0.84, recall of 0.84, and f-measure of 0.83, with a computation time of 2.25 seconds. These outcomes indicate that appropriate parameter selection together with Information Gain feature selection can improve both the accuracy and the efficiency of sentiment analysis. This study seeks to improve sentiment analysis methods for text data with a large number of features.
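As a rough illustration of the feature selection step described in the abstract, the sketch below scores each term by information gain, treating "term occurs in the review" as a binary feature, and keeps the top-scoring terms. It is a minimal sketch, not the thesis's implementation: the toy reviews, the function names (`entropy`, `information_gain`, `select_top_k`), and the cutoff `k` are all assumptions made for illustration. In a full pipeline the selected terms would then be passed to the classifier (a linear-kernel SVM in the abstract).

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def information_gain(docs, labels, term):
    """Information gain of the binary feature 'term occurs in the document'."""
    present = [y for d, y in zip(docs, labels) if term in d]
    absent = [y for d, y in zip(docs, labels) if term not in d]
    total = len(labels)
    # Class entropy remaining after splitting the documents on the term.
    conditional = (
        (len(present) / total) * entropy(present)
        + (len(absent) / total) * entropy(absent)
    )
    return entropy(labels) - conditional


def select_top_k(docs, labels, k):
    """Return the k terms with the highest information gain."""
    vocab = sorted(set().union(*docs))
    ranked = sorted(
        vocab,
        key=lambda t: information_gain(docs, labels, t),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical toy data, not taken from the IMDb dataset used in the thesis.
reviews = [
    ("great movie with a great cast", "pos"),
    ("a great story", "pos"),
    ("boring movie with a weak plot", "neg"),
    ("a boring story", "neg"),
]
docs = [set(text.split()) for text, _ in reviews]
labels = [label for _, label in reviews]

top_terms = select_top_k(docs, labels, k=2)
print(top_terms)
```

On this toy data, "great" and "boring" each split the classes perfectly and so receive the highest score, while terms such as "movie" that appear equally in both classes score zero and are dropped.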
Inventory Code | Barcode | Call Number | Location | Status |
---|---|---|---|---|
2307006420 | T130885 | T1308852023 | Central Library (Referens) | Available |