Skripsi
DETEKSI MALWARE PADA FILE PORTABLE DOCUMENT FORMAT (PDF) DENGAN BYTE FREQUENCY DISTRIBUTION (BFD) DAN PENDEKATAN SUPPORT VECTOR MACHINE (SVM)
Portable Document Format (PDF) files as well as files in several other formats such as (.docx, .hwp and .jpg) are often used to conduct cyber attacks. According to VirusTotal, PDF ranks fourth among document files that are frequently used to spread malware in 2020. Malware detection is challenging partly because of its ability to stay hidden and adapt its own code and thus requiring new smarter methods to detect. Therefore, outdated detection and classification methods become less effective. Nowadays, one of such methods that can be used to detect PDF files infected with malware is a machine learning approach. In this research, the Support Vector Machine (SVM) algorithm was used to detect PDF malware because of its ability to process non-linear data, and in some studies, SVM produces the best accuracy. In the process, the file was converted into byte format and then presented in Byte Frequency Distribution (BFD). To reduce the dimensions of the features, the Sequential Forward Selection (SFS) method was used. After the features are selected, the next stage is SVM to train the model. The performance obtained using the proposed method was quite good, as evidenced by the accuracy obtained in this study, which was 95.58% with an F1 score of 97.47%. The contributions of this research are new approaches to detect PDF malware which is using BFD and SVM algorithm, and using SFS to perform feature selection with the purpose of improving model performance. To this end, this proposed system can be an alternative to detect PDF malware.
Inventory Code | Barcode | Call Number | Location | Status |
---|---|---|---|---|
2407001480 | T138577 | T1385772024 | Central Library (Referens) | Available but not for loan - Not for Loan |
No other version available