Prediction of Potential Fishing Zones Using K-Means Clustering and Random Forest in Batam Waters
DOI:
https://doi.org/10.30595/juita.v14i1.29679Keywords:
Potential fishing areas, oceanographic data, weather data, K-Means clustering, Random Forest.Abstract
Identification of potential fishing zones remains a significant challenge in fisheries management, particularly in coastal and island waters characterized by high spatial and temporal environmental variability. In Batam waters, fishing activities are still dominated by fishermen's experience and heuristic judgment, while existing studies often focus on a single prediction model or limited environmental parameters. This indicates a research gap, namely the lack of an integrated framework that simultaneously captures environmental heterogeneity and improves prediction accuracy using a data-driven approach. To address this gap, this study proposes a hybrid data mining framework that explicitly integrates unsupervised environmental zoning and supervised classification for predicting fishing potential. Weather and oceanographic variables—including sea surface temperature, chlorophyll-a concentration, wind speed, ocean current speed, and salinity—are used in conjunction with historical fish catch data. K-Means clustering is first used to identify homogeneous marine environmental zones, which are then incorporated as contextual features into a Random Forest classification model. Model performance is then evaluated using accuracy, precision, recall, F1 score, and confusion matrix analysis. The results show that the proposed hybrid approach achieves an accuracy of 89.2% and an F1 score of 89.1%, representing a quantitative improvement of approximately 5.6% in accuracy and 5.0% in F1 score compared to the baseline Random Forest model without clustering. This comparison clearly demonstrates that the integration of clustering information significantly improves classification performance. Furthermore, feature importance analysis confirms that sea surface temperature and chlorophyll-a concentration are the most influential predictors, while cluster labels contribute indirectly by improving the model's contextual understanding of complex environmental conditions. The novelty of this research is articulated through the integration of unsupervised marine environmental zoning with supervised machine learning in a local fisheries context, which allows for improved predictive performance and enhanced model interpretability. Unlike conventional approaches that treat environmental variables independently, the proposed framework captures multidimensional environmental interactions in a structured manner. The implications of these findings are profound. The proposed model can support data-driven decision-making for fishermen by reducing search time and operational costs, while providing a scientific basis for fisheries managers for spatial planning and sustainable resource management. Therefore, this research contributes both methodologically and practically to the advancement of intelligent fisheries prediction systems in dynamic coastal environments such as Batam waters.
References
[1] R. M. C. S. Adiputra, E. Djunarsjah, F. W. Muharram, and A. P. Putra, “Spatial Analysis of Potential Fishing Zones (PFZ) for Tuna in Parangtritis Waters Based on Sea Surface Temperature, Chlorophyll-a, and Bathymetry,” J. Komput. Teknol. Inf. Sist. Inf., vol. 4, no. 2, pp. 531–545, 2025, doi: 10.62712/juktisi.v4i2.470.
[2] R. Fauzan, W. Widianingsih, and H. Endrawati, “Distribusi Klorofil-a dan Suhu Permukaan Laut terhadap Kelimpahan Ikan Cephalopholis argus dan Cephalopholis miniata Di Pulau Pieh, Sumatera Barat,” J. Mar. Res., vol. 13, no. 2, pp. 328–336, 2024, doi: 10.14710/jmr.v13i2.43988.
[3] D. J. Lestari, Lisna, and S. Heltria, “Analisis Pengaruh Angin, Curah Hujan dan Suhu Permukaan Laut Terhadap Hasil Tangkapan Handline di Pelabuhan Perikanan Samudera Bungus,” J. Mar. Coast. Sci., vol. 14, no. 3, pp. 198–211, 2025, doi: 10.20473/jmcs.v14i3.72571.
[4] D. Safitri, M. Tadjuddah, A. Mustafa, N. Alimina, and H. Arami, “Spatial and Temporal Patterns of Fishing Using Payang Nets in Staring Bay, South Konawe District,” Aquasains, vol. 12, no. 3, pp. 1528–1537, 2024, doi: 10.23960/aqs.v12i3.p1528-1537.
[5] R. K. E. Rekarti, M. Wibowo, F. Mubarok, “Climate Change and Fisheries: Meta-Synthesis of Regional Vulnerabilities and Responses,” J. Lemhannas RI, vol. 13, no. 1, pp. 37–56, 2025, doi: 10.55960/jlri.v13i1.1096.
[6] D. A. Y. C. T. G. Fa’u, W. S. Pranowo, M. P. Suhana, S. Mujiasih, R. B. Hatmaja, H. I. Ratnawati, “Analysis of Wind Characteristics and Sea Surface Elevation Dynamics in Coastal Waters of Mantang Island, Bintan Regency, Indonesia,” Bul. Oseanografi Mar., vol. 14, no. 2, pp. 267–276, 2025, doi: 10.14710/buloma.v14i2.70748.
[7] H. M. M. M. Z. Lubis, G. Surya, D. S. Pamungkas, B. Subhan, “Characteristics of Waters during Transitional Season, Senimba Waters,” Trends Sci., vol. 19, no. 11, pp. 1–11, 2022, doi: 10.48048/tis.2022.4495.
[8] A. Nugroho, M. Abdul, and G. Al, “Comparative Performance Evaluation Of Machine Learning Algorithms For Sentinel-2 Benthic Habitat Classification Using Google Earth Engine,” vol. 18, no. 3, pp. 248–256, 2025, doi: http://doi.org/10.21107/jk.v18i3.32389.
[9] G. J. Jane, L. O. Alifatri, E. Tasriah, and S. Pramana, “Coastal Ecosystem Classification Using Satellite-Based Machine Learning Approaches,” Jambura J. Biomath., vol. 6, no. 2, pp. 142–153, 2025, doi: 10.37905/jjbm.v6i2.30466.
[10] M. D. Rivaldo, G. W. N. Wibowo, and H. Mulyo, “Implementasi Algoritma K-Means untuk Klasterisasi Data Hasil Tangkapan Ikan di Karimunjawa,” J. Minfo Polgan, vol. 13, no. 1, pp. 1045–1056, 2024, doi: 10.33395/jmp.v13i1.13928.
[11] S. M. Ulfa, R. K. Dinata, and R. Risawandi, “Clustering Coastal Areas Based on Aquaculture Productivity in North Aceh Regency Using K-Means Algorithm,” J. Appl. Informatics Comput., vol. 9, no. 5, pp. 2371–2381, 2025, doi: 10.30871/jaic.v9i5.10094.
[12] S. Dwiasnati, E. Eliyani, S. M. Arif, and R. Avrizal, “Pengelompokan wilayah produksi tuna, cakalang, tongkol dan udang di Indonesia menggunakan algoritma K-Means,” IT-Explore J. Penerapan Teknol. Inf. dan Komun., vol. 4, no. 2, pp. 128–137, 2025, doi: 10.24246/itexplore.v4i2.2025.pp128-137.
[13] W. S. Mulyani and A. Supriyanto, “Clustering Analysis Of Significant Wave Height Dynamics Using K-Means Algorithm In The Semarang–Demak Coastal Waters,” vol. 8, no. 3, pp. 285–293, 2025, doi: 10.33387/jiko.v8i3.10964.
[14] A. Kurnianto, I. S. Sitanggang, and M. K. D. Hardhienata, “Klasifikasi Daerah Penangkapan Ikan Menggunakan Algoritma Random Forest dan Support Vector Machine,” J. Ilmu Komput. dan Agri-Informatika, vol. 11, no. 2, pp. 100–110, 2024, doi: 10.29244/jika.11.2.100-110.
[15] A. P. J. F. P. Anugrahnu, E. Etika, Sumarni, N. Debataraja, Lestyowati, “A Strategy To Increase Exports Of Marine Products Through Measured Fisheries Policy Using The Random Forest Algorithm,” vol. 17, no. November, pp. 105–113, 2025, doi: http://dx.doi.org/10.15578/jkpi.17.2.2025.105-113.
[16] T. A. Nengsih, I. Wardhana, and M. N. M. Nazori Madjid, “Addressing Missing Data in Environmental Technologies: Economic and Environmental Optimizing Air Quality Monitoring with Random Forest and MissForest,” J. Ris. Teknol. Pencegah. Pencemaran Ind., vol. 16, no. 1, pp. 23–31, 2025, doi: 10.21771/jrtppi.2025.v16.no1.p23-31.
[17] F. D. Rahman, M. I. Z. Mulki, and A. Taryana, “Clustering Dan Klasifikasi Data Cuaca Cilacap Dengan Menggunakan Metode K-Means Dan Random Forest,” J. SINTA Sist. Inf. dan Teknol. Komputasi, vol. 1, no. 2, pp. 90–97, 2024, doi: 10.61124/sinta.v1i2.15.
[18] M. Muhammad, T. Sutikno, and I. Riadi, “A Comparative Study of K-Means and KNN Imputation for Handling Missing Data in Scholarship Applicant Datasets,” JUITA J. Inform., vol. 13, no. 3, pp. 245–254, 2025, doi: 10.30595/juita.v13i3.26502.
[19] M. F. Akbar and L. Zahrotun, “K-Means Centroid Optimization with Genetic Algorithm for Clustering Micro, Small, Medium Enterprises in Yogyakarta,” JUITA J. Inform., vol. 13, no. 2, pp. 87–97, 2025, doi: 10.30595/juita.v13i2.25480.
[20] E. Helmud, E. Helmud, F. Fitriyani, and P. Romadiana, “Classification Comparison Performance of Supervised Machine Learning Random Forest and Decision Tree Algorithms Using Confusion Matrix,” J. Sisfokom (Sistem Inf. dan Komputer), vol. 13, no. 1, pp. 92–97, 2024, doi: 10.32736/sisfokom.v13i1.1985.
[21] A. Muhariya, I. Riadi, and Y. Prayudi, “Cyberbullying Analysis on Instagram Using K-Means Clustering,” JUITA J. Inform., vol. 10, no. 2, p. 261, 2022, doi: 10.30595/juita.v10i2.14490.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Sarah Astiti, Alvendo Wahyu Aranski, Darmansah

This work is licensed under a Creative Commons Attribution 4.0 International License.

JUITA: Jurnal Informatika is licensed under a Creative Commons Attribution 4.0 International License.








