Model Components Selection in Bayesian Model Averaging Using Occam's Window for Microarray Data

Ani Budi Astuti, Nur Iriawan, irhamah Irhamah, Heri Kuswanto


Microarray is an analysis for monitoring gene expression activity simultaneously. Microarray data are generated from microarray experiments having characteristics of very few number of samples where the shape of distribution is very complex and the number of measured variables is very large. Due to this specific characteristics, it requires special method to overcome this. Bayesian Model Averaging (BMA) is a Bayesian solution method that is capable to handle microarray data with a best single model constructed by combining all possible models in which the posterior distribution of all the best models will be averaged. There are several method that can be used to select the model components in Bayesian Model Averaging (BMA). One of the method that can be used is the Occam's Window method. The purpose of this study is to measure the performance of Occam's Window method in the selection of the best model components in the Bayesian Model Averaging (BMA). The data used in this study are some of the gene expression data as a result of microarray experiments used in the study of Sebastiani, Xie and Ramoni in 2006. The results showed that the Occam's Window method can reduce a number of models that may be formed as much as 65.7% so that the formation of a single model with Bayesian Model Averaging method (BMA) only involves the model of 34.3%.

Keywords— Bayesian Model Averaging, Microarray Data, Model Components Selection, Occam's Window Method.

Full Text:



Knudsen, S., “A Guide to Analysis of DNA Microarray Data”, Second Edition, John Wiley & Sons, Inc., New Jersey, Canada, 2004.

Muller, P., Parmigiani, G., Robert, C., and Rouseau, J., “Optimal Sample Size for Multiple Testing: the Case of Gene Expression Microarrays,” Tech. rep., University of Texas, M.D. Anderson Cancer Center, 2002.

Gosh, J. K., Delampady, M. and Samanta, T., “An Introduction to Bayesian Analysis Theory and Method”. Springer, New York, 2006.

Gelman, A., Carlin, J. B., Stern, H. S. and Rubin, D. B., “Bayesian Data Analysis”, Chapman & Hall, London, 1995.

Madigan, D. and Raftery, A. E., “Model Selection and Accounting for Model Uncertainty in Graphical Models Using Occam’s Window”, Journal of the American Statistical Association, Vol.89. 428: 1535-1546, 1994.

Hustianda, V. F., “Comparison of Bayesian Model Averaging and Multiple Linear Regression in Predicting Factors Affecting Number of Infant Death in East Java”, Thesis, Statistics Department, FMIPA-ITS, Surabaya, 2012.

Liang, F. M, Troung, Y, and Wong, W. H., “Automatic Bayesian Model Averaging for Linear Regression and Applications in Bayesian Curve Fitting”, Statistical Science, 11(4): 1005-1029, 2001.

Brown, P.J., Vannucci, M. and Fearn, T., “Bayesian Model Averaging with Selection of Regressors”, J. R. Statist. Soc. B Part 3. 519–536, 2002.

Sebastiani, P., Xie H. and Ramoni, M.F., “Bayesian Analysis of Comparative Microarray Experiments By Model Averaging”, International Society For Bayesian Analysis 1, number 4, pp. 707-732, 2006.

Purnamasari, R., “The use of Bayesian Model Averaging (BMA) method with Markov Chain Monte Carlo (MCMC) approach for Wind Speed Daily Averages Forecasting in Juanda Meteorological Station”, Thesis, Statistics Department, FMIPA-ITS, Surabaya, 2011.

Kuswanto, H. and Sari, M. R., “Bayesian Model Averaging with Markov Chain Monte Carlo for Calibrating Temperature Forecast from Combination of Time Series Models”, (on Review), 2013.

Shoemaker, J. S. and Lin, S. M., “Method of Microarray Analysis IV”, Springer, New York, 2005.

Duggan, J. D., Bittner, M., Chen, Y., Meltzer, P. and Trent, J. M., “Expression Profiling Using CDNA Microarrays. Nature Genetics”, 21: 10- 14, 1999.

Box, G. E. P. and Tiao, “Bayesian Inference in Statistical Analysis”, MA: Addison-Wesley, Massachusetts, 1973.

Zellner, A., “An Introduction to Bayesian Inference in Econometrics”, John Wiley, New York. 1971.

Iriawan, N., “Simulation Technique”, Teaching Modules, ITS, Surabaya, 2003.

Gamerman, D., “Markov Chain Monte Carlo”, Chapman & Hall, London, 1997.


  • There are currently no refbacks.