METODE COSINE SIMILARITY UNTUK KLASIFIKASI PENGARSIPAN DATA PEGAWAI DI PT PLN DISJAYA

SITOMPUL, RIFKA RAHMA YANI and Yudho, Satrio and Siregar, Riki Ruli Affandi (2019) METODE COSINE SIMILARITY UNTUK KLASIFIKASI PENGARSIPAN DATA PEGAWAI DI PT PLN DISJAYA. Diploma thesis, ITPLN.

[thumbnail of Penulisan Cosine Similarity.pdf]

Text
Penulisan Cosine Similarity.pdf
Restricted to Registered users only
Download (5MB)

Abstract

Archiving is the process of storing and managing archived documents according to a particular filling system. At PT PLN Disjaya employee data archiving is carried out in 3 stages: file sorting, file scanning and file number input. The problem that arises in the archiving of data is that employee data that has been entered into the system is only stored just like that into a folder available on the desktop without any direct distribution process into the predetermined category classification, the number of employee data that must be archived in a day there are 100 data even more. Time needs in the process of archiving employee data that must be archived spend 5-15 minutes for one employee data. The purpose of this study is that the input data can be distributed directly and properly classified into 6 categories, namely: application categories, job categories, family categories, education categories, reward or punishment categories and other categories. The method used is Cosine Similarity which serves to show the similarity between test documents and training documents. This method goes through the process of processing first to get the value of TF ID weighting, then the calculation of the similarity results of each employee's data. Accuracy results obtained from the Cosine Similarity method are as follows: the similarity is 70%.

Pengarsipan adalah proses menyimpan dan mengelola dokumen arsip menurut sistem pengarsipan tertentu. Pada PT PLN Disjaya pengarsipan data pegawai dilakukan 3 tahapan yaitu: sortir berkas, scan berkas dan input nomor berkas. Masalah yang muncul pada pengarsipan data adalah data pegawai yang telah dimasukkan ke dalam sistem hanya disimpan begitu saja kedalam folder yang tersedia di dekstop tanpa adanya proses distribusi langsung ke dalam klasifikasi kategori yang telah ditentukan, jumlah data pegawai yang harus diarsipkan dalam sehari ada 100 data bahkan lebih. Kebutuhan waktu dalam proses pengarsipan data pegawai yang harus diarsipkan menghabiskan waktu 5 sampai 15 menit untuk satu data pegawai. Tujuan penelitian ini adalah agar data yang di input dapat terdistribusi langsung dan terklasifikasi dengan baik ke dalam 6 kategori yaitu: kategori lamaran, kategori jabatan, kategori keluarga, kategori pendidikan, kategori penghargaan/hukuman dan kategori lain – lain. Metode yang digunakan adalah Cosine Similarity yang berfungsi untuk menunjukkan kemiripan antara dokumen uji dan dokumen datalatih. Metode ini melalui tahapan prosessing terlebih dahulu untuk mendapatkan nilai pembobotan TF IDF kemudian penghitungan hasil kemiripan terhadap masing - masing data pegawai. Hasil akurasi yang diperoleh dari metode Cosine Similarity adalah 70%.

Item Type:	Thesis (Diploma)
Uncontrolled Keywords:	Cosine Similarity, archive, TF IDF, Text Processing Cosine Similarity , arsip, TF IDF, Text Processing
Subjects:	Skripsi Bidang Keilmuan > Teknik Informatika
Divisions:	Fakultas Telematika Energi > S1 Teknik Informatika
Depositing User:	Sudarman
Date Deposited:	19 Sep 2025 08:26
Last Modified:	19 Sep 2025 08:26
URI:	https://repository.itpln.ac.id/id/eprint/1333

Actions (login required)

: View Item