Development of smart web crawler by applying Breadth-First algorithm and vector space model

Ariyanto, Irvan (2017) Development of smart web crawler by applying Breadth-First algorithm and vector space model. Undergraduate thesis, Universitas Islam Negeri Maulana Malik Ibrahim.

Text (Fulltext)
13650104.pdf - Accepted Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Abstract

ENGLISH:

An information retrieval system is a system used to find information relevant to the needs of its users. The retrieval process calculates the similarity between a query and each document; this similarity is computed with the vector space model by finding the cosine similarity value. Running the retrieval process directly on the original queries gave poor results, so the researchers added an expansion step for the query. Query expansion is done by searching for words related to the query terms based on a thesaurus. The results showed that expanding the query increased precision by 7% and accuracy by 0.6%.
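As an illustration of the two ideas named in the abstract, the following is a minimal sketch, not the thesis implementation. It assumes raw term-frequency vectors and whitespace tokenization (the abstract does not state the weighting scheme), and the thesaurus entries and the function names expand_query, cosine_similarity and rank are hypothetical. Cosine similarity is the dot product of the query and document vectors divided by the product of their lengths.

```python
import math
from collections import Counter

# Hypothetical toy thesaurus: the thesaurus used in the thesis is not
# described in the abstract, so these entries are illustrative only.
THESAURUS = {
    "car": ["automobile"],
    "fast": ["quick"],
}


def tokenize(text):
    """Lowercase and split on whitespace (an assumed, simplistic tokenizer)."""
    return text.lower().split()


def expand_query(terms, thesaurus):
    """Append thesaurus-related words to the query terms, skipping duplicates."""
    expanded = list(terms)
    for term in terms:
        for related in thesaurus.get(term, []):
            if related not in expanded:
                expanded.append(related)
    return expanded


def cosine_similarity(terms_a, terms_b):
    """Cosine of the angle between two raw term-frequency vectors."""
    vec_a, vec_b = Counter(terms_a), Counter(terms_b)
    dot = sum(vec_a[t] * vec_b[t] for t in vec_a)
    norm_a = math.sqrt(sum(c * c for c in vec_a.values()))
    norm_b = math.sqrt(sum(c * c for c in vec_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty vector is treated as matching nothing
    return dot / (norm_a * norm_b)


def rank(query, documents, thesaurus=None):
    """Score every document against the (optionally expanded) query."""
    terms = tokenize(query)
    if thesaurus:
        terms = expand_query(terms, thesaurus)
    scored = [(cosine_similarity(terms, tokenize(doc)), doc) for doc in documents]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


if __name__ == "__main__":
    docs = ["automobile repair guide", "fast car race", "cooking recipes"]
    print(rank("car", docs))             # original query
    print(rank("car", docs, THESAURUS))  # expanded query
```

In this toy collection, the document that only mentions "automobile" scores zero against the original query "car" and receives a non-zero score once the query is expanded, which is the kind of effect query expansion is meant to produce.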

Item Type: Thesis (Undergraduate)
Supervisor: Crysdian, Cahyo and Hariyadi, M. Amin
Keywords: Web crawler; Information Retrieval; Breadth-First Algorithm; Vector Space Model
Department: Fakultas Sains dan Teknologi > Jurusan Teknik Informatika
Depositing User: Arsyadillah Arsyadillah
Date Deposited: 23 May 2018 03:45
Last Modified: 23 May 2018 03:45
URI: http://etheses.uin-malang.ac.id/id/eprint/10622
