Search by :

ALL Author Subject ISBN/ISSN Advanced Search

Last search:

Image of KLASIFIKASI SPAM PADA EMAIL BERBAHASA INDONESIA MENGGUNAKAN FASTTEXT DAN BERNOULLI NAÏVE BAYES

Skripsi

KLASIFIKASI SPAM PADA EMAIL BERBAHASA INDONESIA MENGGUNAKAN FASTTEXT DAN BERNOULLI NAÏVE BAYES

Putri, Zatun Aulia - Personal Name;

Penilaian

0,0

dari 5

Indonesia ranks sixth globally in terms of the number of spam senders. Numerous studies have been conducted on spam detection and filtering, with Bayesian algorithms being among the most commonly used approaches. This study aims to classify Indonesian-language email messages into spam and non-spam categories. A secondary dataset consisting of 2,604 messages was used, comprising 1,362 spam messages and 1,242 non-spam messages. Word representation was performed using FastText with an n-gram approach to capture sub-word level information, while classification was carried out using the Bernoulli Naïve Bayes algorithm based on binary values. The experiments compared the performance of the Bernoulli Naïve Bayes algorithm with and without the use of FastText. Evaluation was conducted using accuracy, confusion matrix, and classification report metrics, with a 70:30 data split. The results showed that both models, with and without FastText, achieved 95% accuracy. However, the model incorporating FastText demonstrated more balanced performance across classes and higher recall in detecting spam. In contrast, the model without FastText achieved perfect precision and recall for spam but showed decreased performance for non-spam. Therefore, the use of FastText contributes to improving the sensitivity and balance of spam email classification in the Indonesian language

Availability

Inventory Code	Barcode	Call Number	Location	Status
2507005406	T182739	T1827392025	Central Library (Reference)	Available but not for loan - Not for Loan

Detail Information

Series Title: -
Call Number: T1827392025
Publisher: Indralaya : Prodi Teknik Informatika, Fakultas Ilmu Komputer Universitas Sriwijaya., 2025
Collation: xvi, 125 hlm.; ilus.; tab.; 29 cm.
Language: Indonesia
ISBN/ISSN: -
Classification: 005.138 07
Content Type: Text
Media Type: unmediated
Carrier Type: -
Edition: -
Subject(s): bahasa Indonesia
Prodi Teknik Informatika
Specific Detail Info: -
Statement of Responsibility: KA

Other version/related

No other version available

File Attachment

KLASIFIKASI SPAM PADA EMAIL BERBAHASA INDONESIA MENGGUNAKAN FASTTEXT DAN BERNOULLI NAÏVE BAYES

Comments

You must be logged in to post a comment