Ariyanto, Irvan (2017) Development of smart web crawler by applying Breadth-First algorithm and vector space model. Undergraduate thesis, Universitas Islam Negeri Maulana Malik Ibrahim.
|
Text (Fulltext)
13650104.pdf - Accepted Version Available under License Creative Commons Attribution Non-commercial No Derivatives. Download (11MB) | Preview |
Abstract
ENGLISH:
Information retrieval system is a system used to find information relevant to the needs of its users. Retrieval process is a process to calculate the resemblance of query to the document, the calculation of similarity using the concept of vector space model by looking for cosine similarity value. Queries performed by the retrieval process directly result in less good, so the researchers make the process of expansion in the query. Query expansions are done by searching for a word connection in a query based on a thesaurus. The results showed that by expanding the query can increase the precision by 7% and the accuracy of 0.6%.
INDONESIA:
Information retrieval system merupakan sistem yang digunakan untuk menemukan informasi yang relevan dengan kebutuhan dari penggunanya.. Proses retrieval adalah proses untuk menghitung kemiripan query terhadap dokumen, perhitungan kemiripan menggunakan konsep vector space model dengan mencari nilai cosine similarity. Query yang dilakukan proses retrieval secara langsung membarikan hasil yang kurang bagus, sehingga peneliti melakukan proses ekspansi pada query. Ekspansi query dilakukan dengan mencari keterkaitan kata dalam query berdasarkan thesaurus. Hasil penelitian menunjukan dengan melakukan ekspansi pada query dapat meningkatkan presisi sebesar 7% dan akurasi sebesar 0.6%.
Item Type: | Thesis (Undergraduate) | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
Supervisor: | Crysdian, Cahyo and Hariyadi, M. Amin | |||||||||
Contributors: |
|
|||||||||
Keywords: | Web crawler; Information Retrieval; Breadth-First Algorithm; Vector Space Model | |||||||||
Departement: | Fakultas Sains dan Teknologi > Jurusan Teknik Informatika | |||||||||
Depositing User: | Arsyadillah Arsyadillah | |||||||||
Date Deposited: | 23 May 2018 10:45 | |||||||||
Last Modified: | 23 May 2018 10:45 | |||||||||
URI: | http://etheses.uin-malang.ac.id/id/eprint/10622 |
Downloads
Downloads per month over past year
Actions (login required)
View Item |