Microarray analysis, pre-processing. Quality in the selection of differentiatedly expressed genes

Authors

  • Nuria Ruiz Ruiz Departamento de Estadística e Investigación Operativa.
  • Andrés Redchuk Departamento de Estadística e Investigación Operativa.
  • Javier M. Moguerza Universidad Autónoma de Chile.

Keywords:

microarrays, process optimization, quality improvement

Abstract

Following the success of microarray technology, in the literature there is a large number of experiments made with them. However, the problems of standardization and the many sources of variability make it necessary posteriori validation techniques. For this reason, we have tried to study how the selection of genes influence some key preprocessing techniques. Many of the studies conducted to compare these techniques have been carried out on experiments which optimal results are known a priori to try to determine which has greater accuracy. In our case, we do not know the correct result a priori and what has been accomplished is a comparative analysis of the results obtained in each case in order to predict a priori the behavior of each of the techniques discussed in terms of various factors as initial data distribution, expression patterns object of interest, the presence of outliers, and so on.

Three techniques have been applied on the preprocessing on a microarray experiment. The techniques are GCRMA, MAS5 and MBEI. In our data there have been found mainly three patterns of expression in those genes expressed and have been shown statistically that there is an association between the pre-processing technique used and the predominant pattern in it. This trend is related to efficiency in the detection of outliers and the magnitude of change detected with each of them. So far, it has not been able to establish a statistically significant in confirming the agreement between the three methods after the selection of differentially expressed genes.

Downloads

Download data is not yet available.

References

AFFYMETRIX, (2002): “STATISTICAL ALGORITHMS DESCRIPTION DOCUMENT”. Technical Report, Affymetrix, Santa Clara, CA.

ARIAS-CASTRO, E. CANDES, E. J., AND PLAN, Y., (2010): “GLOBAL TESTING AND SPARSE ALTERNATIVES: ANOVA, MULTIPLE COMPARISONS AND THE HIGHER CRITICISM”. AR-XIV:1007.1434.

BREMER, M., HIMELBLAU, E. AND MADLUNG, A., (2010): “STATISTICAL METHODS IN MOLECULAR BIOLOGY”. METHODS IN MOLECULAR BIOLOGY, 620(3):287-313.

BUTTE, A. J., (2009): “BIOINFORMATIC AND COMPUTATIONAL ANALYSIS FOR GENOMIC MEDICINE. ESSENTIALS OF GENOMIC AND PERSONALIZED MEDICINE, CHAP. 10”. HUNTINGTON F. WILLARD AND GEO_REY S. GINSBURG, ELSEVIER, ACADEMIC PRESS.

CALZA, S., VALENTINI, D., AND PAWITAN, Y., (2008): “NORMALIZATION OF OLIGONUCLEOTIDE ARRAYS BASED ON THE LEAST-VARIANT SET

OF GENES”. BMC BIOINFORMATICS, 9(1):140.

DALGAARD, P., (2008): “INTRODUCTORY STATISTICS WITH R”. SPRINGER, 2ND EDITION.

DE BIN, R. AND RISSO, D., (2011). “A NOVEL APPROACH TO THE CLUSTERING OF MICROARRAY DATA VIA NONPARAMETRIC DENSITY ESTIMATION”. BMC BIOINFORMATICS, 12:49.

DE HAAN, J.R., WEHRENS, R., BAUERSCHMIDT, S., PIEK, E., VAN SCHAIK, R. AND BUYDENS, L.M.C. (2007): “INTERPRETATION OF

ANOVA MODELS FOR MICROARRAY DATA USING PCA”. BIOINFORMATICS 2007, 23:184-190.

ELASHOFF, M., ALVARES, C. AND LAUREN, P., (2009): “METHOD AND SYSTEM FOR MANAGING AND QUERYING GENE EXPRESSION DATA

ACCORDING TO QUALITY”. US PATENT 7,558, 411, 2009.

GAN, G., MA, C. AND WU, J. (2009): “DATA CLUSTERING: THEORY, ALGORITHMS, AND APPLICATIONS”. SIAM, SOCIETY FOR INDUSTRIAL

AND APPLIED MATHEMATICS.

IRIZARRY, R.A., WU, Z. AND JA_EE, H.A., (2006): “COMPARISON OF AFFYMETRIX GENECHIP EXPRESSION MEASURES”. BIOINFORMATICS

22, 789-794.

LI, C. AND WONG, W. H., (2001A): “MODEL-BASED ANALYSIS OF OLIGONUCLEOTIDE ARRAYS: EXPRESSION INDEX COMPUTATION

AND OUTLIER DETECTION”. PROC NATL ACAD SCI U S A, 98(1):31-36.

LI, C. AND WONG, W. H., (2001B): “MODEL-BASED ANALYSIS OF OLIGONUCLEOTIDE ARRAYS: MODEL VALIDATION, DESIGN ISSUES

AND STANDARD ERROR APPLICATION”. GENOME BIOL, 2(8):RESEARCH0032.

MUIR, W.M. ROSA, G.J.M., PITTENDRIGH, B.R., XU, Z., RIDER, S.D. , FOUNTAIN, M. AND OGAS J., (2009): “A MIXTURE MODEL APPROACH

FOR THE ANALYSIS OF SMALL EXPLORATORY MICROARRAY EXPERIMENTS.COMPUTATIONAL”. STATISTICS AND DATA ANALYSIS, 53:1566-1576.

NAEF, F., AND MAGNASCO, M. O., (2003): “SOLVING THE RIDDLE OF THE BRIGHT MISMATCHES: LABELING AND ELECTIVE BINDING IN OLIGONUCLEOTIDE ARRAYS”. PHYSICAL REVIEW E 68, 011906.

NEPOMUCENO, J.A., TRONCOSO, A., AGUILAR-RUIZ, J.S., (2011): “BICLUSTERING OF GENE EXPRESSION DATA BY CORRELATIONBASED

SCATTER SEARCH”. BIODATA MIN., 4(1):3.

RAUCH, G., GEISTANGER, A., TIMM, J., (2011): “A NEW OUTLIER IDENTI_CATION TEST FOR METHOD COMPARISON STUDIES BASED ON ROBUST REGRESSION”. JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 1520-5711, 21(1):151-169.

SORIN DRAGHICI, (2003): “DATA ANALYSIS TOOLS FOR DNA MICROARRAYS, 2A ED”. CHAPMAN & HALL/CRC, 2003.

WU, Z., IRIZARRY, R. A., GENTLEMAN, R., MURILLO, F. M., AND SPENCER, F., (2003): “A MODEL BASED BACKGROUND ADJUSTMENT FOR OLIGONUCLEOTIDE EXPRESSION ARRAYS”. TECHNICAL REPORT WORKING PAPER1, JOHNS HOPKINS UNIVERSITY, DEPT. OF BIOSTATISTICS WORKING PAPERS.

Downloads

Published

2018-06-18

Issue

Section

Artículos Científicos

How to Cite

Microarray analysis, pre-processing. Quality in the selection of differentiatedly expressed genes. (2018). Revista De La Escuela De Perfeccionamiento En Investigación Operativa, 20(33), 72-88. https://revistas.psi.unc.edu.ar/index.php/epio/article/view/20345