PERBANDINGAN METODE CLUSTERING MENGGUNAKAN HIERARCHICAL CLUSTERING DAN PARTITIONAL CLUSTERING UNTUK MENGELOMPOKKAN DOKUMEN BERITA

Authors

  • Yunita Kusumawardani Teknik Informatika, IST AKPRIND Yogyakarta
  • Amir Hamzah Teknik Informatika, IST AKPRIND Yogyakarta
  • Suraya - Teknik Informatika, IST AKPRIND Yogyakarta

Keywords:

News of the text, K-means Clustering, Agglomerative Hierarchical Clustering, accuracy

Abstract

The development of the internet in Indonesia increasingly advanced and sophisticated. Similarly mass media that has been a lot of venturing into the media online to facilitate users in accessing the news or information via online. And here began of growing and abudant news or information in the text. So make the news has not categorized properly. Then cause difficulties in searching appropriate news categories and should spend a lot of time in searching the abundant news text on the internet.

Research to classsifying this news of the text, using 2 clustering method that is Hierarchical Clustering by using Agglomerative Hierarchical Clustering and Partitional Clustering method using K-Means Clustering. Two clustering methods are used, with the aim of knowing and comparing which methods are better and resulting in higher accuracy in grouping of news texts, as well as faster time in grouping processing.

This research use data totaling 30 news data. From 30 news data is divided into 8 news data for news document collection 1, 10 news data for news document collection 2, and 12 news data for news document collection 3. The result of research using agglomerative hierarchical clustering obtained accuracy value on news document collection 1 that is 62 , 5%, news document collection 2 is 50%, news document collection 3 is 33,33%, for k-means clustering is obtained result of highest accuracy value in news document collection 1 is 62,5%, news document collection 2 is 60 %, News document collection 3 is 33,33%. While the average time obtained with agglomerative hierarchical clustering is 29,5 s and k-means clustering is 30,19 s.

References

Romli, A. S. M. 2012. Jurnalistik online: Panduan Mengelola Media Online. Nuansa. Bandung.
Hidayat, T., dan Istiadah, N. 2011. Panduan Lengkap SPSS 19 untuk Mengolah Data Statistik Penelitian. Media Kita. Jakarta.
Atmajaya, D. 2016. K-means Algorithm. Fakultas Ilmu Komputer UMI. Makassar.
Kusumawardani, Y. 2016. Klasifikasi Opini Menggunakan Metode K-mean Clustering. Kerja Praktek. IST Akprind Yogyakarta
Husni., Negara, Y. D. P., dan Muhammad, S. 2015. Clusterisasi Dokumen Web (Berita) Bahasa Indonesia Menggunakan Algoritma K-means. Jurnal Simantec vol. 4, no. 3. Madura.
Alfina, T., Santosa, B., dan Barakbah, A, R. 2012. Analisa Perbandingan Metode Hierarchical Clustering, K-means dan Gabungan Keduanya dalam Cluster Data (Studi Kasus: Problem Kerja PraktekJurusan Teknik Industri ITS). Jurnal Teknik ITS Vol. 1. Surabaya.
Nofriansyah, D. 2014. Konsep Data Mining Sistem Pendukung Keputusan. Deepublish. Yogyakarta.
Putra, D. 2010. Pengolahan Citra Digital. Andi Offset. Yogyakarta.
Supianto, A. 2014. Pengenalan Pola Hierarchical Clustering. Materi PPT PTIIK. Universitas Brawijaya.
Sarwono, Y. T. Aplikasi Model Jaringan Syaraf Tiruan Dengan Radial Basis Function Untuk Mendeteksi Kelainan Otak (Stroke Infark). Jurnal Sistem Informasi

Downloads

Published

2018-06-02

Issue

Section

Articles