ANALISIS SENTIMEN BERBASIS ASPEK TERHADAP TOKOH POLITIK PADA TWITTER

LUTHFIL KHAIRI

LUTHFIL KHAIRI, Drs. Edi Winarko, M.Sc., Ph.D.

2021 | Tesis | MAGISTER ILMU KOMPUTER

Abstrak
File Pdf

Banyaknya jumlah perbincangan terhadap tokoh politik pada twitter dapat digunakan untuk melakukan analisis sentimen untuk mengekstrak informasi dari komentar tweet tersebut. Analisis sentimen pada level dokumen dan kalimat tidak cukup mewakili opini secara spesifik karena polaritas sentimen diukur berdasarkan keseluruhan kalimat atau dokumen sehingga untuk analisis secara menyeluruh dibutuhkan untuk menemukan aspek-aspek dan menentukan sentimen positif atau negatif terhadap aspek tersebut. Untuk mengekstrak dokumen secara detail seperti ini diperlukan aspect based sentiment analysis (ABSA). Oleh karena itu, penelitian ini dilakukan pembuatan model analisis sentimen berbasis aspek terhadap tokoh politik pada twitter. Klasifikasi sentimen dilakukan dengan menggunakan metode Long Short-Term Memory (LSTM) dan Naive Bayes sebagai pembandingnya dalam melakukan klasifikasi sentimen. Pelabelan data menggunakan metode binary relevance. Metode ini mengubah masalah multi-label menjadi binary problems dengan cara memisahkan tiap kelas. Tahapan utama dalam penelitian ini adalah melakukan pengumpulan data, pelabelan data, preparasi data, modeling dan melakukan pengujian terhadap model. Tahapan pelabelan data dilakukan oleh pelabel yang dianggap memiliki kapabilitas dalam melakukan pelabelan. Data yang telah dilakukan preprocessing akan masuk ke tahapan word embedding untuk merepresentasikan kamus kata yang telah dibuat kedalam bentuk vektor. Nilai vektor diambil dari hasil training korpus Wikipedia Indonesia menggunakan word2vec. Sedangkan, pada metode Naive Bayes ekstraksi fitur yang digunakan adalah Tf-IDF. Penelitian ini akan melakukan evaluasi terhadap model dengan empat kriteria: akurasi, presisi, recall, dan F1-Measure. Berdasarkan hasil pengujian memperlihatkan bahwa metode LSTM lebih unggul pada pengujian kategori aspek dengan mendapatkan rata-rata akurasi sebesar 84.68%. Sedangkan, rata-rata akurasi yang didapatkan dengan menggunakan metode Naive Bayes adalah 84.38%. Hasil pengujian klasifikasi sentimen dengan metode LSTM mendapatkan rata-rata akurasi sedikit lebih rendah dari metode Naive Bayes yaitu 54.92% dan rata-rata akurasi yang dapatkan dengan menggunakan metode Naive Bayes adalah 67,41%. Sedangkan, hasil pengujian klasifikasi kategori aspek dan sentimen yang dilakukan secara sekuensial dengan menggunakan metode LSTM mendapatkan nilai rata-rata akurasi sebesar 78,51% dan rata-rata akurasi untuk pengujian dengan metode Naive Bayes sebesar 89,44%.

The large number of conversations with political figures on twitter can be used to conduct sentiment analysis to extract information from the tweet comments. Sentiment analysis at the document and sentence levels is not sufficient to represent specific opinions because the polarity of sentiment is measured based on the whole sentence or document, so a thorough analysis is needed to find aspects and determine positive or negative sentiments towards these aspects. To extract the document in detail as is necessary aspect based sentiment analysis (ABSA). Therefore, this study was conducted to create an aspect-based sentiment analysis model for political figures on Twitter. Sentiment classification is carried out using the Long Short-Term Memory (LSTM) method and Naive Bayes as a comparison in classifying sentiment. Data labeling uses the binary relevance method. This method converts multi-label problems into binary problems by separating each class. The main stages in this research are data collection, data labeling, data preparation, modeling and testing the model. The data labeling stage is carried out by the labeler who is considered to have the capability to label. Data that has been preprocessed will enter the word embedding stage to represent the dictionary of words that have been made into vector form. The vector values are taken from the training results of the Indonesian Wikipedia corpus using word2vec. Meanwhile, the Naive Bayes feature extraction method used is Tf-IDF. This study will evaluate the model with four criteria: accuracy, precision, recall, and F1-Measure. Based on the test results show that the LSTM method is superior in testing the aspect category by getting an average accuracy of 84.68%. Meanwhile, the average accuracy obtained using the Naive Bayes method is 84.38%. The results of the sentiment classification test using the LSTM method get an average accuracy slightly lower than the Naive Bayes method, which is 54.92% and the average accuracy obtained using the Naive Bayes method is 67.41%. Meanwhile, the results of testing the classification of aspects and sentiment categories carried out sequentially using the LSTM method obtained an average accuracy value of 78.51% and an average accuracy for testing using the Naive Bayes method of 89.44%.

Kata Kunci : Aspect-Based Sentiment Analysis, LSTM, NVB, Political Figures

S2-2021-433782-abstract .pdf
S2-2021-433782-bibliography.pdf
S2-2021-433782-tableofcontent .pdf
S2-2021-433782-title.pdf

LAYANAN

E-Resources

Quick Access