Text
KLASIFIKASI UJARAN KEBENCIAN DI MEDIA SOSIAL APLIKASI X MENGGUNAKAN RANDOM FOREST DAN SYNTHETIC MINORITY OVERSAMPLING TECHNIQUE (SMOTE)
Classification is one of the fields of Natural Language Processing that automates the classification of text into one or more appropriate categories based on its content by building a model using training data. This study aims to classify hate speech tweets on application X using Random Forest and Synthetic Minority Oversampling Technique (SMOTE). The Random Forest approach is used because of its ability to handle classification problems on complex datasets and the SMOTE technique to overcome the imbalance of the majority and minority classes in a dataset of 3,300 data, namely 2000 hate tweets and 1300 nothate tweets. The highest accuracy results occurred in the Random Forest + SMOTE algorithm with an accuracy of 89,4%, a precision of 95,0%, a recall of 74,7%, and an f1-score of 83,6%. The influence of the SMOTE technique on the performance results of the Random Forest algorithm in classifying can be seen in the accuracy level of the best model without using SMOTE 81% while the best model using SMOTE 89.4% increased by 10.37%.
Inventory Code | Barcode | Call Number | Location | Status |
---|---|---|---|---|
2507002130 | T170335 | T1703352025 | Central Library (Reference) | Available but not for loan - Not for Loan |
No other version available