Search Results

Found 2 documents matching the query

Alvin Subakti

Analisis kinerja BERT sebagai metode representasi teks untuk text clustering = Performance analysis of BERT as a text representation method for text clustering.

"Text clustering is the task of grouping texts so that texts in the same group are more similar to each other than to texts in other groups. Grouping texts manually requires a significant amount of time and resources, so machine learning is used to automate the process. A text representation must be extracted before the text is fed into a machine learning model. The method commonly used to extract representations of text data is TF-IDF. However, TF-IDF does not take into account the position and context in which a word is used. BERT is a model that can produce word representations that depend on the position and context of a word in a sentence. This research analyzes the performance of BERT as a text representation method by comparing it with TF-IDF. In addition, it implements and compares different feature extraction and normalization methods applied to the text representations produced by BERT. The feature extraction methods used are max and mean pooling, and the feature normalization methods used are identity, layer, standard, and min-max normalization. The resulting text representations are fed into four clustering algorithms: k-means clustering, eigenspace-based fuzzy c-means, deep embedded clustering, and improved deep embedded clustering. Performance is evaluated using clustering accuracy, normalized mutual information, and adjusted Rand index. Simulation results show that the text representations produced by BERT outperform those produced by TF-IDF on 28 out of 36 metrics. Furthermore, different feature extraction and normalization methods yield different performance, and the choice must be adapted to the clustering algorithm used."
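The pooling and normalization steps named in the abstract can be illustrated with a minimal sketch. The abstract does not give the thesis's exact definitions, so the ones below are assumptions based on common usage: layer normalization standardizes each text vector across its own dimensions, standard normalization z-scores each feature across texts, and min-max normalization rescales each feature to [0, 1]. The function names `pool` and `normalize` and the `eps` parameter are hypothetical, and plain Python lists stand in for real BERT outputs.

```python
from statistics import fmean, pstdev


def pool(token_vectors, mask, method="mean"):
    """Reduce per-token vectors to one text vector, skipping masked (padding) tokens."""
    valid = [vec for vec, keep in zip(token_vectors, mask) if keep]
    if not valid:
        raise ValueError("mask removes every token")
    columns = list(zip(*valid))  # one tuple of values per embedding dimension
    if method == "mean":
        return [fmean(col) for col in columns]
    if method == "max":
        return [max(col) for col in columns]
    raise ValueError(f"unknown pooling method: {method}")


def _zscore(values, eps):
    # Zero mean, unit (population) standard deviation.
    mu, sd = fmean(values), pstdev(values)
    return [(v - mu) / (sd + eps) for v in values]


def _minmax(values, eps):
    # Rescale to the range [0, 1].
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo + eps) for v in values]


def _by_column(rows, fn, eps):
    # Apply fn to each feature (column), then restore the row layout.
    cols = [fn(list(col), eps) for col in zip(*rows)]
    return [list(row) for row in zip(*cols)]


def normalize(rows, method="identity", eps=1e-12):
    """Normalize a list of text vectors (assumed definitions, see lead-in)."""
    rows = [[float(v) for v in row] for row in rows]
    if method == "identity":
        return rows  # features are used unchanged
    if method == "layer":
        return [_zscore(row, eps) for row in rows]  # per text vector
    if method == "standard":
        return _by_column(rows, _zscore, eps)  # per feature, across texts
    if method == "min-max":
        return _by_column(rows, _minmax, eps)  # per feature, across texts
    raise ValueError(f"unknown normalization method: {method}")


# Toy example: three token vectors, the last one is padding.
tokens = [[1.0, 2.0], [3.0, 6.0], [100.0, 100.0]]
mask = [1, 1, 0]
text_vector = pool(tokens, mask, method="mean")
features = normalize([text_vector, [0.0, 8.0], [4.0, 0.0]], method="min-max")
```

In this sketch the normalized vectors would then be passed to a clustering algorithm; which pooling and normalization combination works best is, per the abstract, dependent on the algorithm.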

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2021

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Habib Saputra

Analisis Ulasan Pengguna Aplikasi Mobile JKN dengan Analisis Sentimen dan Pendeteksian Topik = Analysis of JKN Mobile Application User Reviews with Sentiment Analysis and Topic Detection

"In the current digital era, the Mobile Jaminan Kesehatan Nasional (JKN, National Health Insurance) application has become an important tool for facilitating access to and management of public healthcare services. However, to improve service quality and user satisfaction, user reviews need to be analyzed to understand the sentiments and topics they contain. This research aims to develop a sentiment analysis model using Bidirectional Encoder Representations from Transformers (BERT) and to detect topics using Latent Dirichlet Allocation (LDA) on user reviews of the Mobile JKN application. The dataset consists of user reviews of the Mobile JKN application collected from the Play Store. The developed BERT model achieved an accuracy of 90% in sentiment analysis of these reviews. Of the 54,000 reviews analyzed, the model classified 14,748 as positive, 3,950 as neutral, and 35,302 as negative. Using LDA, the research also identified six main topics in the reviews, with a coherence value of 0.466131: (1) Mobile JKN services, (2) changes to participant data, (3) payment of contributions, (4) mobile phone number verification, (5) updating and logging in to the application, and (6) online registration. The per-topic sentiment results show that topics 1, 2, and 3 have more positive than negative reviews, whereas topics 4, 5, and 6 have more negative than positive reviews. Therefore, mobile phone number verification, updating and logging in to the application, and online registration should be evaluated in order to improve the Mobile JKN application in the future."
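As a quick consistency check on the sentiment counts reported in the abstract, the short sketch below confirms that the three classes add up to the stated 54,000 reviews and converts them to shares. The variable names are illustrative only; the numbers are taken directly from the abstract.

```python
# Sentiment counts as reported in the abstract.
counts = {"positive": 14_748, "neutral": 3_950, "negative": 35_302}

total = sum(counts.values())  # matches the 54,000 reviews stated in the abstract
shares = {label: n / total for label, n in counts.items()}

for label, share in shares.items():
    print(f"{label}: {share:.1%}")
# prints:
# positive: 27.3%
# neutral: 7.3%
# negative: 65.4%
```

Roughly two thirds of the reviews are therefore negative overall, which is the class distribution behind the per-topic comparison in the abstract.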

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2023

S-pdf

UI - Skripsi Membership Universitas Indonesia Library
