Deteksi situs Phishing berbasis Neural Network

Suarti, Isa (2024) Deteksi situs Phishing berbasis Neural Network. Masters thesis, Universitas Islam Negeri Maulana Malik Ibrahim.

Preview

Text (Fulltext)
220605210007.pdf - Accepted Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.
(5MB) | Preview

Abstract

ABSTRAK

Phishing adalah kejahatan digital yang bertujuan mencuri data pribadi dengan membuat tautan atau situs web palsu yang meniru situs asli. Jenis serangan siber ini dapat dipicu oleh pemberitahuan melalui pesan teks, email, atau panggilan telepon. Salah satu tindakan penanggulangan yang efektif terhadap phishing adalah deteksi dini situs yang berpotensi berbahaya, terutama dengan menganalisis fitur kode sumber. Hal ini memerlukan pemeriksaan konten halaman web, yang bisa menjadi proses yang lambat karena intervensi pihak ketiga. Meskipun penggunaan teknologi canggih dalam deteksi dini phishing, masih terdapat kebutuhan yang signifikan terhadap rekayasa fitur manual agar dapat mendeteksi ancaman phishing baru dengan andal. Salah satu pendekatannya melibatkan pelatihan pada neural network menggunakan kumpulan data phishing yang diketahui dan URL yang sah. Penelitian dilakukan dengan 200 sampel data yang dibagi menjadi set pelatihan dan pengujian. Hasilnya menunjukkan bahwa penggunaan fungsi aktivasi tanh menghasilkan lebih sedikit iterasi dan kesalahan dibandingkan dengan fungsi aktivasi logistik. Pada pengujian deteksi situs phishing menggunakan neural network dengan 2 strategi yang melibatkan 1 hidden layer dan 2 hidden layer menunjukkan hasil terbaik dengan lapisan tersembunyi sebanyak 1 dan 2 neuron, hasilnya menunjukkan akurasi yang baik. Pilihan fungsi aktivasi memiliki dampak signifikan terhadap performa model jaringan saraf, terutama untuk arsitektur dan konfigurasi tertentu. Meskipun kedua fungsi aktivasi bekerja dengan baik, fungsi aktivasi tanh jelas melampaui fungsi aktivasi logistik dalam hal akurasi dan skor F1 untuk arsitektur jaringan saraf khusus ini.

ABSTRACT

Phishing is a digital crime aimed at stealing personal data by creating fake links or websites that mimic the original site. This type of cyber attack can be triggered by notifications via text messages, emails, or phone calls. One effective countermeasure against phishing is the early detection of potentially harmful sites, especially by analyzing source code features. This requires examining the web page content, which can be a slow process due to third-party intervention. Despite the use of advanced technology in early phishing detection, there is still a significant need for manual feature engineering to reliably detect new phishing threats. One approach involves training a neural network using known phishing datasets and legitimate URLs. The research was conducted with 200 data samples divided into training and testing sets. The results showed that using the tanh activation function resulted in fewer iterations and errors compared to the logistic activation function. In testing phishing site detection using neural networks with two strategies involving 1 hidden layer and 2 hidden layers, the best results were shown with a hidden layer of 1 and 2 neurons, demonstrating good accuracy. The choice of activation function has a significant impact on the performance of the neural network model, especially for certain architectures and configurations. Although both activation functions performed well, the tanh activation function clearly outperformed the logistic activation function in terms of accuracy and F1 score for this specific neural network architecture.

مستخلص البحث

التصيّد الاحتيالي هو جريمة رقمية تهدف إلى سرقة البيانات الشخصية من خلال إنشاء روابط أو مواقع ويب مزيفة تحاكي الموقع الأصلي. يمكن أن يتم تفعيل هذا النوع من الهجمات الإلكترونية عبر إشعارات من خلال الرسائل النصية أو رسائل البريد الإلكتروني أو المكالمات الهاتفية. أحد الإجراءات الفعّالة لمواجهة التصيّد الاحتيالي هو الكشف المبكر عن المواقع التي قد تكون ضارة، خاصة من خلال تحليل ميزات كود المصدر. يتطلب ذلك فحص محتوى صفحة الويب، مما قد يكون عملية بطيئة بسبب تدخل الطرف الثالث. على الرغم من استخدام التكنولوجيا المتقدمة في الكشف المبكر عن التصيّد الاحتيالي، لا يزال هناك حاجة كبيرة إلى هندسة الميزات اليدوية للكشف بشكل موثوق عن التهديدات التصيّدية الجديدة. أحد النهج يتضمن تدريب شبكة عصبية باستخدام مجموعات بيانات التصيّد الاحتيالي المعروفة وعناوين URL الشرعية. أُجريت الدراسة باستخدام 200 عينة بيانات تم تقسيمها إلى مجموعات تدريب واختبار. أظهرت النتائج أن استخدام دالة التنشيط tanh أدى إلى تقليل عدد التكرارات والأخطاء مقارنة بدالة التنشيط اللوجيستية. في اختبار كشف مواقع التصيّد الاحتيالي باستخدام الشبكات العصبية مع استراتيجيتين تتضمنان طبقة خفية واحدة وطبقتين خفيتين، أظهرت أفضل النتائج بطبقة خفية واحدة واثنين من الخلايا العصبية، مما أظهر دقة جيدة. إن اختيار دالة التنشيط له تأثير كبير على أداء نموذج الشبكة العصبية، خاصة بالنسبة لهندسة وتكوينات معينة. على الرغم من أداء كلتا دالتي التنشيط بشكل جيد، فإن دالة التنشيط tanh تفوقت بوضوح على دالة التنشيط اللوجيستية من حيث الدقة ودرجة F1 لهذه الهندسة المعينة للشبكة العصبية.

Item Type:	Thesis (Masters)
Supervisor:	Crysdian, Cahyo and Chamidy, Totok
Keywords:	Phishing, Jaringan Syaraf Tiruan, Situs Web, URL Phishing,Neural Network,Web Site,URL التصيد الاحتيالي، الشبكة العصبية الاصطناعية، موقع الويب، عنوان ، URL
Subjects:	08 INFORMATION AND COMPUTING SCIENCES > 0801 Artificial Intelligence and Image Processing > 080108 Neural, Evolutionary and Fuzzy Computation 08 INFORMATION AND COMPUTING SCIENCES > 0802 Computation Theory and Mathematics > 080201 Analysis of Algorithms and Complexity
Departement:	Fakultas Sains dan Teknologi > Jurusan Magister Tehnik Informatika
Depositing User:	ISA SUARTI
Date Deposited:	02 Jul 2024 14:07
Last Modified:	02 Jul 2024 14:07
URI:	http://etheses.uin-malang.ac.id/id/eprint/65644

Downloads

Downloads per month over past year

Actions (login required)

View Item

Link Terkait

Media Sosial

Alamat

Gedung Abdurrahman Wahid

Jl. Gajayana No.50, Dinoyo, Lowokwaru, Malang,

Jawa Timur 65149, Indonesia

Email: csc@uin-malang.ac.id