Analisis Sentimen Transportasi Online Menggunakan Ekstraksi Fitur Model Word2vec Text Embedding Dan Algoritma Support Vector Machine (SVM)

Emi Suryati, Styawati Styawati, Ahmad Ari Aldino

Abstract


In the era of society 5.0, information technology is growing rapidly, one of which is in the field of transportation. The phenomenon of online transportation services is becoming increasingly popular among the public. With this phenomenon, many people have opinions about online transportation services, both positive and negative comments. The purpose of this study was to conduct a sentiment analysis of comments or users of online transportation service applications on gojek and grab on the Google Play Store. The stages of this research process are data collection, data labeling, data preprocessing, feature extraction and sentiment classification using the Support Vector Machine (SVM) algorithm. Data collection is done by web scraping. Dataset labeling is divided into two classes, namely positive sentiment and negative sentiment. Word2Vec Text Embedding is used as a feature extraction model to represent words in vector form. The architecture of the word2vec model used is the skip-gram model. The Support Vector Machine (SVM) algorithm is used for the data classification process to determine the level of accuracy of the data sentiment used. The results of tests carried out on the classification of sentiment analysis in online transportation applications show quite good performance results, namely for the Gojek application to get an accuracy rate of 87%, a precision of 93%, and a re-call of 84%. While the Grab application gets an accuracy rate of 82%, precision of 89%, and re-call of 83%.

Full Text:

PDF

References


Ahmadi, M. I. et al. (2020) ‘Sentiment Analysis Online Shop on the Play Store Using Method Support Vector Machine (Svm)’, Seminar Nasional …, 2020(Semnasif), pp. 196–203. Available at: http://jurnal.upnyk.ac.id/index.php/semnasif/article/view/4101.

Bamler, R. and Mandt, S. (2017) ‘Dynamic word embeddings’, arXiv. arXiv.

Boeing, G. and Waddell, P. (2017) ‘New Insights into Rental Housing Markets across the United States: Web Scraping and Analyzing Craigslist Rental Listings’, Journal of Planning Education and Research, 37(4), pp. 457–476. doi: 10.1177/0739456X16664789.

Budiman, I., Faisal, M. R. and Nugrahadi, D. T. (2020) ‘Studi Ekstraksi Fitur Berbasis Vektor Word2Vec pada Pembentukan Fitur Berdimensi Rendah’, Jurnal Komputasi, 8(1), pp. 62–69. doi: 10.23960/komputasi.v8i1.2517.

Fauzi, M. A. (2019) ‘Word2Vec model for sentiment analysis of product reviews in Indonesian language’, International Journal of Electrical and Computer Engineering (IJECE), 9(1), p. 525. doi: 10.11591/ijece.v9i1.pp525-530.

Fikria, N. (2018) ‘Analisis Klasifikasi Sentimen Review Aplikasi E-Ticketing Menggunakan Metode Support Vector Machine Dan Asosiasi’. Available at: https://dspace.uii.ac.id/handle/123456789/7717.

Liu, B. (2012) ‘Sentiment analysis and opinion mining’, Synthesis Lectures on Human Language Technologies, 5(1), pp. 1–184. doi: 10.2200/S00416ED1V01Y201204HLT016.

Mikolov, T. et al. (2013) ‘Efficient estimation of word representations in vector space’, 1st International Conference on Learning Representations, ICLR 2013 - Workshop Track Proceedings, pp. 1–12.

Octaviani, P. A., Yuciana Wilandari and Ispriyanti, D. (2014) ‘Penerapan Metode Klasifikasi Support Vector Machine (SVM) pada Data Akreditasi Sekolah Dasar (SD) di Kabupaten Magelang’, Jurnal Gaussian, 3(8), pp. 811–820. Available at: http://download.portalgaruda.org/article.php?article=286497&val=4706&title=PENERAPAN METODE KLASIFIKASI SUPPORT VECTOR MACHINE (SVM) PADA DATA AKREDITASI SEKOLAH DASAR (SD) DI KABUPATEN MAGELANG.

Olive, I. et al. (2020) ‘Implementasi Text Mining untuk Analisis Layanan Transportasi Online dengan Analisis Faktor’, Jurnal SimanteC, 8(2).

Pratama, Y. T., Bachtiar, F. A. and Setiawan, N. Y. (2018) ‘Analisis Sentimen Opini Pelanggan Terhadap Aspek Pariwisata Pantai Malang Selatan Menggunakan TF-IDF dan Support Vector Machine’, Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer (J-PTIIK) Universitas Brawijaya, 2(12), pp. 6244–6252.

Priansya, S. (2017) ‘Normalisasi Teks Media Sosial Menggunakan Word2vec, Levenshtein Distance, dan Jaro-Winkler Distance’, Institut Teknologi Sepuluh Nopember, p. Tugas Akhir-KS141501.

Raffi, A. et al. (2019) ‘Analisis Model Word2vec dalam Penyelesaian Soal Analogi pada Bahasa Indonesia’, 6(2), pp. 8513–8519.

Rong, X. (2014) ‘word2vec Parameter Learning Explained’, arXiv:1411.2738v1, pp. 1–20. Available at: http://arxiv.org/abs/1411.2738.

Rusli, M. (2020) ‘Ekstraksi Fitur Menggunakan Model Word2Vec Pada Sentiment Analysis Kolom Komentar Kuisioner Evaluasi Dosen Oleh Mahasiswa’, Klik - Kumpulan Jurnal Ilmu Komputer, 7(1), p. 35. doi: 10.20527/klik.v7i1.296.




DOI: https://doi.org/10.33365/jtsi.v4i1.2445

Refbacks

  • There are currently no refbacks.


Copyright (c) © 2023 : Emi Suryati, Styawati Styawati, Ahmad Ari Aldino

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.


JURNAL Teknologi dan Sistem Informasi (JTSI)
Published by Universitas Teknokrat Indonesia
Organized by Prodi S1 Sistem Informasi FTIK Universitas Teknokrat Indonesia

W: http://jim.teknokrat.ac.id/index.php/sisteminformasi
E : jtsi@teknokrat.ac.id.
Jl. Zainal Abidin Pagaralam, No.9-11, Labuhan Ratu, Bandarlampung

Creative Commons License
Jurnal Teknologi dan Sistem Informasi is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.