Analisis Long Short-Term Memory untuk memprediksi cuaca berdasarkan penyinaran Matahari

Faiza, Nor (2026) Analisis Long Short-Term Memory untuk memprediksi cuaca berdasarkan penyinaran Matahari. Masters thesis, Universitas Islam Negeri Maulana Malik Ibrahim.

Text (Fulltext)
240605210010.pdf - Accepted Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.
(2MB)

Abstract

ABSTRAK

Prediksi parameter meteorologi merupakan tantangan yang kompleks dalam bidang informatika karena tingginya variabilitas dan sifat non-linier data atmosfer. Permasalahan utama dalam studi ini adalah fluktuasi data penyinaran matahari yang sulit diidentifikasi oleh model konvensional, serta adanya risiko degradasi performa model akibat konfigurasi arsitektur dan pembagian data yang tidak optimal. Penelitian ini bertujuan untuk mengevaluasi efektivitas arsitektur jaringan Long Short-Term Memory (LSTM) dalam memprediksi kondisi cuaca harian melalui variabel Solar Radiation Average (SR Avg), Solar Radiation Maximum (SR Max), dan durasi penyinaran. Metode eksperimen dilakukan dengan mengevaluasi sembilan skenario arsitektur LSTM melalui proses hyperparameter tuning dan variasi komposisi data latih (70%, 80%, dan 90%) dengan keluaran 1 lapisan Dense. Proses tuning difokuskan pada optimasi kombinasi jumlah neurons (3 hingga 30 unit), durasi iterasi (epoch 300 dan 500), serta ukuran komputasi (batch size 50 hingga 200). Kinerja estimasi diukur menggunakan indikator Mean Absolute Error (MAE), Root Mean Square Error (RMSE) dan Koefisien Determinasi (R²_Score). Hasil uji global skala 0–1 menunjukkan nilai MAE total 0.1095, RMSE 0.1392, dan R²_Score 0.3594 (menjelaskan 35,94% variabilitas). Secara komparatif, Model A1 (data latih 70%) mencatatkan kapabilitas generalisasi terbaik dengan R² tertinggi 0.4142 dan RMSE terkecil 0.1193. Sementara itu, Model B2 (data latih 80%) menghasilkan penyimpangan absolut terendah dengan MAE terkecil 0.0911. Sebaliknya, porsi data latih 90% memicu overfitting yang menurunkan performa model. Pada batas toleransi eror < 0.05, akurasi kategorial menghasilkan prediksi "Benar" 28,55% (207 hari) dan "Salah" 71,45% (518 hari). Rendahnya persentase kategorial aman ini dipicu oleh smoothing effect pada LSTM, model akurat pada fase stabil (Berawan), namun mengalami regresi menuju nilai rata-rata (underestimate) dan gagal mengejar nilai puncak saat transisi ekstrem ke cuaca Cerah.

ABSTRACT

Forecasting meteorological parameters is a complex challenge in the field of informatics due to the high variability and non-linear nature of atmospheric data. The main problems in this study are the fluctuations of solar radiation data that are difficult to identify by conventional models, as well as the risk of degradation of model performance due to architectural configuration and suboptimal data sharing. This study aims to evaluate the effectiveness of the Long Short-Term Memory (LSTM) network architecture in predicting daily weather conditions through the variables of Solar Radiation Average (SR Avg), Solar Radiation Maximum (SR Max), and duration of irradiation. The experimental method was carried out by evaluating nine LSTM architecture scenarios through a hyperparameter tuning process and variations in the composition of the training data (70%, 80%, and 90%) with the output of 1 Dense layer. The tuning process was focused on optimizing the combination of the number of neurons (3 to 30 units), the duration of the iteration (epoch 300 and 500), and the computational size (batch size 50 to 200). The estimated performance was measured using the Mean Absolute Error (MAE), Root Mean Square Error (RMSE) and Coefficient of Determination (R²_Score) indicators. The results of the global test on a scale of 0–1 showed a total MAE value of 0.1095, RMSE 0.1392, and R²_Score 0.3594 (explaining 35.94% variability). Comparatively, Model A1 (70% training data) recorded the best generalization capabilities with the highest R² of 0.4142 and the smallest RMSE of 0.1193. Meanwhile, Model B2 (80% training data) produced the lowest absolute deviation with the smallest MAE of 0.0911. In contrast, 90% of the training data portion triggers overfitting that degrades model performance. At an error tolerance limit of < 0.05, categorical accuracy results in a "True" prediction of 28.55% (207 days) and "False" of 71.45% (518 days). This low percentage of safe categorizations is triggered by the smoothing effect on LSTMs, which are accurate in the stable phase (Cloudy), but regress to mean values (underestimate) and fail to catch peak values during extreme transitions to sunny weather.

الملخص

التنبؤ بمعايير الأرصاد الجوية يمثل تحديا معقدا في مجال المعلوماتية بسبب التغيرات العالية والطبيعة غير الخطية لبيانات الغلاف الجوي. المشاكل الرئيسية في هذه الدراسة هي تقلبات بيانات الإشعاع الشمسي التي يصعب تحديدها بواسطة النماذج التقليدية، بالإضافة إلى خطر تدهور أداء النماذج بسبب التكوين المعماري ومشاركة البيانات غير المثالية. تهدف هذه الدراسة إلى تقييم فعالية بنية شبكة الذاكرة طويلة المدى قصيرة المدى (LSTM) في التنبؤ بظروف الطقس اليومية من خلال متغيرات متوسط الإشعاع الشمسي (SR AVG)، وأقصى إشعاع شمسي (SR Max)، ومدة الإشعاع. تم تنفيذ الطريقة التجريبية من خلال تقييم تسعة سيناريوهات لبنية LSTM من خلال عملية ضبط المعلمات الفائقة وتغيرات في تركيب بيانات التدريب (70٪، 80٪، و90٪) مع إنتاج طبقة كثيفة واحدة. ركزت عملية الضبط على تحسين الجمع بين عدد الخلايا العصبية (من 3 إلى 30 وحدة)، ومدة التكرار (العصر 300 و500)، والحجم الحسابي (حجم الدفعة من 50 إلى 200). تم قياس أداء التقدير باستخدام مؤشرات متوسط الخطأ المطلق (MAE)، والجذر متوسط التربيع (RMSE)، ومعامل التحديد (R²_Score). أظهرت نتائج الاختبار العالمي على مقياس 0–1 قيمة MAE إجمالية 0.1095، RMSE 0.1392، وR²_Score 0.3594 (مما يفسر تباين 35.94٪). بالمقارنة، سجل النموذج A1 (70٪ من بيانات التدريب) أفضل قدرات تعميم بأعلى R² 0.4142 وأصغر RMSE 0.1193. وفي الوقت نفسه، أنتج النموذج B2 (80٪ بيانات تدريب) أدنى انحراف مطلق مع أصغر مؤشر متوسط متوسط متوسط بلغ 0.0911. على النقيض من ذلك، 90٪ من جزء بيانات التدريب يسبب إفراط في التركيب يؤثر على أداء النموذج. عند حد تحمل الخطأ < 0.05، تؤدي الدقة التصنيفية إلى توقع "صحيح" بنسبة 28.55٪ (207 أيام) و"خاطئ" بنسبة 71.45٪ (518 يوما). هذه النسبة المنخفضة من التصنيفات الآمنة تفعل تأثير التنعيم على نماذج LSTM، التي تكون دقيقة في الطور المستقر (الغائم)، لكنها تتراجع إلى القيم المتوسطة (تقلل من التقدير) وتفشل في التقاط قيم الذروة خلال الانتقالات القصوى إلى الطقس المشمس.

Item Type:	Thesis (Masters)
Supervisor:	Almais, Agung Teguh Wibowo and Pagalay, Usman
Keywords:	Prediksi Cuaca; LSTM; Penyinaran Matahari; MAE; RMSE; Optimasi Hiperparameter; Deep Learning. Weather Prediction; LSTM; Solar Radiation; MAE; RMSE; Hyperparameter Optimization; Deep Learning. التنبؤ بالطقس; LSTM; الإشعاع الشمسي; MAE; RMSE; تحسين المعلمات الفائق ; التعلم العميق
Subjects:	08 INFORMATION AND COMPUTING SCIENCES > 0801 Artificial Intelligence and Image Processing > 080108 Neural, Evolutionary and Fuzzy Computation
Departement:	Fakultas Sains dan Teknologi > Jurusan Magister Tehnik Informatika
Depositing User:	Nor Faiza
Date Deposited:	26 Jun 2026 14:54
Last Modified:	26 Jun 2026 14:54
URI:	http://etheses.uin-malang.ac.id/id/eprint/86799

Downloads

Downloads per month over past year

Actions (login required)

View Item

Link Terkait

Media Sosial

Alamat

Gedung Abdurrahman Wahid

Jl. Gajayana No.50, Dinoyo, Lowokwaru, Malang,

Jawa Timur 65149, Indonesia

Email: csc@uin-malang.ac.id