![]() |
||
|
|
/ Home / Analysis Tools / Normalization
This tutorial could be downloaded as a PDF file (1.1 Mb). The older version of this tutorial using Varan is available using this link. The GenePix software enabled you to convert your microarray image pixels to an intensity digital data for each spot. The next step consists in normalizing data within each experiment to eliminate systematic errors and scaling all of them so that they can be compared. The R language: R is a statistical and graphic programming language close to the S+ language. It is available freely on the Windows, MacOS and Linux platform. R functions could be improved and are based on the contributions of a developer community. R has a growing room in biology, a large number of modules have been developed to answer specifically high-throughput biology problems, including the microarray domain. So, an independent R project, Bioconductor has been dedicated to microarray. R modules installation is quite simple, particularly under Windows operating system. An executable installer installs R automatically and modules are loaded from the graphic interface using the item “Packages” from the user interface menu bar. In order to normalize our microarray data, we will use a R script developed locally, which allow to call a set of R functions and to obtain a file of normalised data and a lot of graphical outputs to control each step of the normalisation process. More information is available on the script web pages. Launching the “Goulphar.R” R script:
Script execution:
Graphical outputs created: Details on each graphics created are available on the Goulphar online user manual. Filtered spots depend of the choices made in the parameter file. They combine spots excluded because of their smaller diameter, saturating intensities or GenePix flags. They will be excluded form the normalisation process and from most of the graphical outputs. They will be all there instead, if not any filter is selected. Normalised data: R script graphical outputs are automatically stored in the working directory, the one from which the “Goulphar.R” script is launched. A tabulated text file is created at the end of the script (“GPR_start_file_norm.txt”). The R, G, Rb and Gb columns correspond to raw foreground and background intensities in Cy5 and Cy3 respectively. They are not modified by the normalisation process, as the A column (corresponding to the log2 mean intensities). The Mnorm, alone, includes normalized data. The Rb and Gb columns, background measurements in Cy5 and Cy3, are found only if the subtracted background option has been selected in the parameter file. Cells labelled “NA” are cells where values are none available, discarded after the filtration process.
Useful links:
|
| This page is also available in french | Last page update: 9/7/2011 - 11:28 For any questions or comments send an e-mail to the webmaster |