FINAL PROJECT
DESIGN AND IMPLEMENTATION OF A WEB-BASED BOOK SEARCH SYSTEM FOR THE UDAYANA COMPUTER SCIENCE READING ROOM USING THE BM25 METHOD
SOFTWARE ENGINEERING (RPL) CONCENTRATION
MICHAEL SENNA SAPUTRA NIM. 1008605062
INFORMATICS ENGINEERING STUDY PROGRAM, DEPARTMENT OF COMPUTER SCIENCE, FACULTY OF MATHEMATICS AND NATURAL SCIENCES, UDAYANA UNIVERSITY, 2014
Design and Implementation of a Web-Based Book Search System for the Udayana Computer Science Reading Room Using the BM25 Method
SOFTWARE ENGINEERING CONCENTRATION [UNDERGRADUATE THESIS] Submitted as a requirement for the degree of Sarjana Komputer (Bachelor of Computer Science) at the Faculty of Mathematics and Natural Sciences, Udayana University
This work is the result of research that has not previously been published.
MICHAEL SENNA SAPUTRA NIM. 1008605062
Supervisor I: Ngurah Agus Sanjaya ER, S.Kom, M.Kom, NIP. 197803212005011001
Supervisor II: I Made Widiartha, S.Si, M.Kom, NIP. 198212202008011008
FINAL PROJECT APPROVAL SHEET
Title: Design and Implementation of a Web-Based Book Search System for the Udayana Computer Science Reading Room Using the BM25 Method
Concentration: Software Engineering (RPL)
Name: Michael Senna Saputra
NIM: 1008605062
Seminar Date: 30 September 2014
Approved by:
Supervisor I: Ngurah Agus Sanjaya ER, S.Kom, M.Kom, NIP. 19800616 2005011001
Supervisor II: I Made Widiartha, S.Si, M.Kom, NIP. 198212202008011008
Examiner I: Dra. Luh Gede Astuti, M.Kom, NIP. 196401141994022001
Examiner II: I Gede Santi Astawa, S.T, M.Cs, NIP. 198012062006041003
Examiner III: Ida Bagus Made Mahendra, S.Kom, M.Kom, NIP. 198006212008121002
Acknowledged by the Head of the Department of Computer Science, FMIPA UNUD: Drs. I Wayan Santiyasa, M.Si, NIP. 196704141992031002
Title: Design and Implementation of a Web-Based Book Search System for the Udayana Computer Science Reading Room Using the BM25 Method
Name: Michael Senna Saputra
Supervisors: 1. Ngurah Agus Sanjaya ER, S.Kom, M.Kom 2. I Made Widiartha, S.Si, M.Kom
ABSTRACT
The problem faced by visitors to the Udayana Computer Science reading room is the difficulty of finding books that match a desired topic. One solution is to implement an information retrieval system (Sistem Temu Kembali Informasi, STKI). Such systems generally only count how often a term is repeated. BM25 additionally takes document length into account, with parameter k1 governing term repetition within a document, parameter k3 governing term repetition within the user's query, and parameter b governing normalization against the average document length. The values of these three parameters can be adjusted to maximize the quality of the relevance scores. The system was developed using a dataset of the books in the Udayana University Computer Science reading room. Testing showed that the information retrieval system was successfully implemented using PHP, HTML, and SQL. The system achieved a precision of 50.2% and a recall of 96%. This indicates that the system is able to produce lists of books that match the requested topic.
Keywords: STKI, Precision, Recall, BM25
Supervisor I: Ngurah Agus Sanjaya ER, S.Kom, M.Kom, NIP. 197803212005011001
Supervisor II: I Made Widiartha, S.Si, M.Kom, NIP. 198212202008011008
Title: The Design and Implementation of a Web-Based Book Search System for the Reading Room of Udayana University Computer Science Using the BM25 Method
Name: Michael Senna Saputra
Supervisors: 1. Ngurah Agus Sanjaya ER, S.Kom, M.Kom 2. I Made Widiartha, S.Si, M.Kom
ABSTRACT
The problem experienced by visitors to the Computer Science reading room at Udayana is the difficulty of finding books that match a desired topic. One solution to this problem is to implement an information retrieval system (Sistem Temu Kembali Informasi, STKI). An STKI generally only counts the number of times a word is repeated. BM25 adds the length of the document, the parameter k1 for weighting word repetition within a document, the parameter k3 for word repetition within the user's input, and the parameter b for normalization against the average document length. The values of these three parameters can be adjusted to maximize the quality of the relevance calculation. The system was developed using a dataset of the books in the Computer Science reading room at Udayana University. The test results showed that the information retrieval system was successfully implemented using PHP, HTML, and SQL. The system had a precision of 50.2% and a recall of 96%. This shows that the system is able to generate lists of books that match the requested topics.
Keywords: STKI, Precision, Recall, BM25
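The scoring and evaluation described in the abstract can be illustrated with a minimal Python sketch. This is not the thesis's PHP implementation: it assumes a common textbook BM25 formulation with an inverse document frequency of log(N/df), and the function names, the default values k1=1.2, b=0.75, k3=8.0, and the toy documents are all hypothetical, chosen only for illustration.

```python
import math
from collections import Counter


def bm25_scores(query_terms, docs, k1=1.2, b=0.75, k3=8.0):
    """Score each tokenised document against the query.

    Assumption: textbook BM25 with idf = log(N / df); the variant used
    in the thesis may differ.
    """
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: number of documents that contain each term.
    df = Counter(term for d in docs for term in set(d))
    query_tf = Counter(query_terms)

    scores = []
    for d in docs:
        tf = Counter(d)
        # b controls how strongly length relative to the average is penalised.
        length_norm = (1 - b) + b * len(d) / avg_len
        score = 0.0
        for term, qtf in query_tf.items():
            if tf[term] == 0:
                continue  # term absent from this document
            idf = math.log(n_docs / df[term])
            # k1 saturates the effect of repeated terms in the document.
            doc_part = (k1 + 1) * tf[term] / (k1 * length_norm + tf[term])
            # k3 saturates the effect of repeated terms in the query.
            query_part = (k3 + 1) * qtf / (k3 + qtf)
            score += idf * doc_part * query_part
        scores.append(score)
    return scores


def precision_recall(retrieved, relevant):
    """Return (precision, recall) for a result list and a relevant set."""
    hits = len(set(retrieved) & set(relevant))
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical toy collection: each book is a list of preprocessed terms.
docs = [
    ["web", "search", "system"],
    ["database", "design"],
    ["web", "design", "web", "programming"],
]
scores = bm25_scores(["web", "search"], docs)
print(scores)
print(precision_recall(["b1", "b2", "b3", "b4"], {"b1", "b2"}))
```

In this toy collection the first document ranks highest because it contains both query terms, the third scores lower because it matches only one term and is longer than average, and the second scores zero. The precision and recall helper shows the same pattern the abstract reports: a result list can recover every relevant book (high recall) while still containing many irrelevant ones (moderate precision).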
PREFACE
The author gives thanks to God Almighty, by whose grace and blessing this Final Project Proposal, entitled "Perancangan Dan Implementasi Sistem Pencarian Buku Berbasis Web Dengan Metode BM25" (Design and Implementation of a Web-Based Book Search System Using the BM25 Method), could be completed on time. The author particularly wishes to thank and acknowledge the parties who assisted with this proposal, namely:
1. Mr. Ngurah Agus Sanjaya ER, S.Kom, M.Kom., as Supervisor I, who was willing to critique, assist with, review, and refine this final project;
2. Mr. I Made Widiartha, S.Si, M.Kom., as Supervisor II, who was willing to critique, assist with, review, and refine this final project;
3. The lecturers and staff of the Department of Computer Science, who took the time to offer suggestions and input for improving this proposal;
4. Fellow students, who provided support, motivation, encouragement, and cooperation in preparing this final project proposal;
5. Family, relatives, and everyone else who lent their support so that this report could be completed within the allotted time.
Finally, given the author's limitations, the author hopes that this Final Project Proposal can be improved further, and warmly welcomes constructive criticism and suggestions toward a better result.
Bukit Jimbaran, September 2014
The Author
TABLE OF CONTENTS
FINAL PROJECT APPROVAL SHEET
ABSTRACT (Indonesian)
ABSTRACT (English)
PREFACE
TABLE OF CONTENTS
LIST OF TABLES
LIST OF FIGURES
CHAPTER I INTRODUCTION
1.1 Background
1.2 Problem Statement
1.3 Research Objectives
1.4 Scope and Limitations
1.5 Research Benefits
1.6 Research Methodology
1.6.1 Research Design
1.6.2 Data Collection
1.6.3 Initial Data Processing
1.6.4 Method Used
1.6.5 Experiments and Method Testing
1.6.6 Evaluation and Validation of Results
1.6.7 Activity Schedule
CHAPTER II LITERATURE REVIEW
2.1 Information Retrieval Systems
2.2 BM25 (Best Matching 25)
2.3 Stemming
2.3.1 Porter Stemming
2.3.2 Tala Stemming
2.4 HTML5
2.5 System Testing Techniques
2.5.1 Static Testing
2.5.2 Blackbox Testing
2.5.3 Whitebox Testing
2.5.4 Performance Testing
2.6 Related Studies
CHAPTER III ANALYSIS AND DESIGN
3.1 Requirements Definition
3.1.1 Functional Requirements
3.1.2 Non-Functional Requirements
3.2 ERD
3.3 System DFD
3.3.1 Context Diagram
3.3.2 DFD Level 0
3.3.3 DFD Level 1 Preprocessing
3.3.4 DFD Level 1 Search
3.3.5 DFD Level 1 Data Management
3.4 Interface Design
3.4.1 Initial View
3.4.2 Search View
3.4.3 Admin Page
3.4.4 Data Management Page
3.4.5 Add Page (Book/Setting)
CHAPTER IV RESULTS AND DISCUSSION
4.1 Software Environment and Usage
4.2 Database Implementation
4.3 Interface Implementation
4.4 System Implementation
4.4.1 Main Controller
4.4.2 Admin Controller
4.4.3 Database Model
4.4.4 Stemming Model
4.5 System Testing
4.5.1 Static Testing
4.5.2 Whitebox Testing
4.5.3 Blackbox Testing
4.5.4 Performance Testing
CHAPTER V CONCLUSIONS AND SUGGESTIONS
5.1 Conclusions
5.2 Suggestions
REFERENCES
LIST OF TABLES
Table 1.1 Blackbox Table Design
Table 1.2 Whitebox Table Design
Table 1.3 Activity Schedule
Table 2.1 Porter Rule 1a
Table 2.2 Porter Rule 1b
Table 2.3 Porter Additional Rule 1b
Table 2.4 Porter Rule 1c
Table 2.5 Porter Rule 2
Table 2.6 Porter Rule 3
Table 2.7 Porter Rule 4
Table 2.8 Porter Rule 5a
Table 2.9 Porter Rule 5b
Table 2.10 DCG Calculation
Table 3.1 System Requirements Definition
Table 4.1 Static Testing Results
Table 4.2 Whitebox Testing Results
Table 4.3 Precision and Recall Calculation Results
Table 4.4 DCG Calculation Results
Table 4.5 Experiment with 1 Query
Table 4.6 Experiment with 2 Queries
Table 4.7 Experiment with 3 Queries
Table 4.8 Preprocessing Time
Table 4.9 Stress Test Results
LIST OF FIGURES
Figure 2.1 Illustration of an Information Retrieval System
Figure 2.2 Document Indexing
Figure 3.1 System ERD
Figure 3.2 Context Diagram
Figure 3.3 DFD Level 0
Figure 3.4 DFD Level 1 Preprocessing
Figure 3.5 DFD Level 1 Search
Figure 3.6 DFD Level 1 Data Management
Figure 3.7 Initial Page Design
Figure 3.8 Search Page Design
Figure 3.9 Admin Page
Figure 3.10 Management Page
Figure 3.11 Add Page Design
Figure 4.1 Book Table
Figure 4.2 Term Table
Figure 4.3 Detail Table
Figure 4.4 df Table
Figure 4.5 Initial System Page
Figure 4.6 Login Function
Figure 4.7 Login Error Message
Figure 4.8 Admin Page
Figure 4.9 Logout Information
Figure 4.10 Add Page
Figure 4.11 Duplicate ID Error Message
Figure 4.12 Admin Manage Page
Figure 4.13 Delete Data Dialog Box
Figure 4.14 Settings Page
Figure 4.15 Search Results Page
Figure 4.16 Book Location Information
Figure 4.18 Login Code
Figure 4.19 Search Code
Figure 4.20 Download Code
Figure 4.21 Book Data Management Code
Figure 4.23 Add Book Code
Figure 4.25 Stopword Removal Code
Figure 4.27 BM25 Method Code
Figure 4.28 Book Data Update Code
Figure 4.30 Stemming Code, Stage 2
Figure 4.31 Stemming Code, Stage 3
Figure 4.33 Stemming Code, Stage 5
Figure 4.34 Preprocessing Flow
Figure 4.35 Data Search Flow
Figure 4.36 Time Increase Graph
Figure 4.37 Preprocessing Time Graph