论文部分内容阅读
Oligonucleotide arrays such as Affymetrix GeneChips use multiple probes, or a probe set, to measure the abundance of mRNA of every gene of interest.Some anal ysis methods attempt to summarize the multiple observations into one single score before conducting further analysis such as detecting differentially expressed genes (DEG), clustering and classification.However, there is a risk of losing a significant amount of information and consequently reaching inaccurate or even incorrect con clusions during this data reduction.We developed a novel statistical method called robustified multivariate analysis of variance (MANOVA) based on the traditional MANOVA model and permutation test to detect DEG for both one-way and twoway cases.It can be extended to detect some special patterns of gene expression through profile analysis across k populations.The method utilizes probe level data and requires no assumptions about the distribution of the data set.We also pro pose a method of estimating the null distribution using quantile normalization in contrast to the pooling method.Monte Carlo simulation and real data analysis are conducted to demonstrate the performance of the proposed method comparing with the pooling method and the usual ANOVA test based on the summarized scores.It is found that the new method successfully detects DEG under desired false discovery rate and is more powerful than the competing method cspecially whcn the number of groups is small.