Bicluster Analysis of Cheng and Church's Algorithm to Identify Patterns of People's Welfare in Indonesia

Laradea Marifni, I Made Sumertajaya, Utami Dyah Syafitri

Abstract


Biclustering is a method for grouping numerical data in which rows and columns are clustered simultaneously. The Cheng and Church (CC) algorithm is a biclustering algorithm that searches for the largest bicluster whose values are highly coherent, where coherence is measured by the mean squared residue (MSR) and a lower MSR indicates higher similarity. A set of rows and columns is called a bicluster if its MSR is lower than a predetermined threshold (delta). Identifying patterns of people's welfare in Indonesia with biclustering is essential for obtaining an overview of the welfare characteristics of each province. The CC algorithm requires a threshold (delta), which is chosen with reference to the MSR of the actual data and must be smaller than that value. This study evaluates delta values of 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, and 0.8. After comparing these candidates by the resulting MSR values and the biclusters formed, the optimum delta is 0.1, which yields four biclusters.
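The MSR criterion described above can be illustrated with a minimal sketch. It assumes the standard Cheng and Church residue definition (each entry minus its row mean and column mean, plus the overall mean of the submatrix). The function names `mean_squared_residue` and `is_delta_bicluster` and the toy matrix are hypothetical, chosen only for illustration. They are not the authors' implementation or data, and the sketch covers only the MSR test, not the full CC search procedure.

```python
import numpy as np


def mean_squared_residue(data, rows, cols):
    """MSR of the submatrix of `data` selected by `rows` and `cols`."""
    sub = data[np.ix_(rows, cols)].astype(float)
    row_means = sub.mean(axis=1, keepdims=True)   # mean of each selected row
    col_means = sub.mean(axis=0, keepdims=True)   # mean of each selected column
    overall_mean = sub.mean()                     # mean of the whole submatrix
    residue = sub - row_means - col_means + overall_mean
    return float(np.mean(residue ** 2))


def is_delta_bicluster(data, rows, cols, delta):
    """True if the submatrix's MSR is below the threshold delta."""
    return mean_squared_residue(data, rows, cols) < delta


# Hypothetical toy matrix: the first three rows follow a perfectly additive
# pattern (row effect + column effect), the last row breaks that pattern.
toy = np.array([
    [1, 2, 3],
    [2, 3, 4],
    [4, 5, 6],
    [0, 9, 1],
])

coherent_msr = mean_squared_residue(toy, [0, 1, 2], [0, 1, 2])
full_msr = mean_squared_residue(toy, [0, 1, 2, 3], [0, 1, 2])
print(coherent_msr, full_msr)
print(is_delta_bicluster(toy, [0, 1, 2], [0, 1, 2], delta=0.1))
```

In this toy case the additive block has an MSR of essentially zero, so it passes a threshold of 0.1, while the full matrix does not. This mirrors the requirement that delta be smaller than the MSR of the complete data.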

Keywords


Bi-clustering; CC algorithm; MSR; Cheng and Church



DOI: 10.30595/juita.v11i2.17446



This work is licensed under a Creative Commons Attribution 4.0 International License.

ISSN: 2579-8901