R code and data for

Modeling tissue contamination to improve molecular identification of the primary tumor site of metastases

Martin Vincent, Katharina Perell, Finn Cilius Nielsen, Gedske Daugaard and Niels Richard Hansen

R code

To obtain the R code used in the preparation of the paper "Modeling tissue contamination to improve molecular identification of the primary tumor site of metastases", submitted to Bioinformatics, download the zip archive

R_Scripts.zip
and unzip it. The README.pdf file (included in the zip archive) contains information on all the R scripts included.

Data

The data used in the paper are available from the Gene Expression Omnibus under accession number GSE51429. The data set can also be downloaded as the Bioconductor ExpressionSet exprs:

expression_set_data.rd
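
For readers who prefer the Gene Expression Omnibus route mentioned above, the following is a minimal sketch of fetching the series by its accession number. It assumes the Bioconductor package GEOquery is installed. The helper name fetch_from_geo is hypothetical and is not part of the R scripts in the zip archive. The objects returned by GEO may be organized or processed differently from the expression_set_data.rd file, so this is a starting point rather than a drop-in replacement.

```r
# Hypothetical helper (not part of R_Scripts.zip): fetch a GEO series
# as a list of ExpressionSet objects using the GEOquery package.
fetch_from_geo <- function(accession = "GSE51429") {
  if (!requireNamespace("GEOquery", quietly = TRUE)) {
    stop("Install GEOquery first: BiocManager::install(\"GEOquery\")")
  }
  # With GSEMatrix = TRUE, getGEO() returns a list of ExpressionSet
  # objects (one per series matrix file).
  GEOquery::getGEO(accession, GSEMatrix = TRUE)
}

# Only run the download in an interactive session, since it needs network access.
if (interactive()) {
  gse <- fetch_from_geo()
  eset <- gse[[1]]               # first ExpressionSet in the list
  print(dim(Biobase::exprs(eset)))  # features x samples
}
```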


The data can be loaded directly into R using

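# Open a connection to the hosted data file and load its objects into the workspace
con <- url("http://www.math.ku.dk/~richard/msgl/expression_set_data.rd")
load(con)
close(con)  # release the connection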
Note that the load_data.R script in the zip archive requires the expression_set_data.rd file to be in the working directory.
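
Since load_data.R expects a local copy of the file, one way to satisfy that requirement is to download the file into the working directory once and reuse it afterwards. The sketch below assumes the URL given above is still reachable. The helper name fetch_expression_data is hypothetical, and the names of the objects stored in the file are not assumed; they are reported after loading.

```r
# Hypothetical helper (not part of R_Scripts.zip): download the data file
# into the working directory if it is missing, then load it into a
# separate environment so nothing in the workspace is overwritten.
fetch_expression_data <- function(
    data_url = "http://www.math.ku.dk/~richard/msgl/expression_set_data.rd",
    dest = "expression_set_data.rd") {
  if (!file.exists(dest)) {
    # mode = "wb" keeps the binary file intact on Windows
    download.file(data_url, destfile = dest, mode = "wb")
  }
  env <- new.env()
  loaded <- load(dest, envir = env)  # load() returns the restored object names
  message("Loaded objects: ", paste(loaded, collapse = ", "))
  env
}

# Only run the download in an interactive session, since it needs network access.
if (interactive()) {
  data_env <- fetch_expression_data()
  print(ls(data_env))
}
```

After this has run once, expression_set_data.rd sits in the working directory, which is where load_data.R looks for it.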