Text
IMPLEMENTASI MULTI-LABEL TEXT CLASSIFICATION MENGGUNAKAN INDOBERT UNTUK KLASIFIKASI GENRE FILM BERDASARKAN SINOPSIS BERBAHASA INDONESIA
This research is motivated by the rapid growth of the Indonesian film industry and the increasing need for an accurate film genre classification system to help viewers understand film content more precisely. Synopsis is often the main reference in identifying film genres, but the complexity and diversity of languages make the classification process a challenge. To answer this challenge, this study carries IndoBERT as the main method in classifying film genres in a multi-label manner based on Indonesian synopses. IndoBERT was chosen because of its ability to understand the context and structure of the Indonesian language in depth. The dataset used consists of 1,738 films with five main genres: Drama, Comedy, Horror, Action, and Romance. As a complement, this study also applies two non-core optimization techniques, namely Dynamic Thresholding and Per-class Performance Tracking, to overcome data imbalances between genres and improve prediction accuracy. As a result, the IndoBERT model managed to achieve an accuracy of 79.23% and a Macro F1-score of 57.15%, an increase of 6.21% from the initial baseline of 73.03%. The horror genre recorded the best performance among the other genres with an accuracy of 88.89% and an F1-score of 77.17%, supported by distinctive and consistent linguistic features. The model was integrated into a Streamlit-based web application with an average response time of under 0.25 seconds, proving that IndoBERT is effective and efficient for movie genre classification in real-world scenarios.
Inventory Code | Barcode | Call Number | Location | Status |
---|---|---|---|---|
2507002609 | T172178 | T1721782025 | Central Library (Reference) | Available but not for loan - Not for Loan |
No other version available