Opened 17 years ago

Last modified 17 years ago

#486 closed task

Import raw data from the Illumina platform — at Version 4

Reported by: Nicklas Nordborg Owned by: Nicklas Nordborg
Priority: minor Milestone: BASE 2.4
Component: coreplugins Version:
Keywords: Cc:

Description (last modified by Nicklas Nordborg)

A discussion about importing raw data from the Illumina platform was started on the mailing list. See:

http://www.mail-archive.com/basedb-users@lists.sourceforge.net/msg00500.html

Since it seems that the file format is not compatible with the existing import plugin a new plugin or something else is needed. One working solution (by Jeremy Davis-Turak) is to split the data file into one file for each hybridization with an R script and then use the regular import plugin.

Does anybody have any real and up-to-date Illumina files? We need some that we can do tests with. Without test data we are not able to create an integrated plug-in for the import.

Change History (11)

comment:1 by Jari Häkkinen, 17 years ago

Milestone: BASE 2.x+

by base, 17 years ago

Attachment: Illumina conversion.txt added

R script to convert a multi-array Illumina .csv output file into multiple single-array files.

by base, 17 years ago

Attachment: extended-properties.xml added

Extended properties file edited by Jeremy Davis-Turak to include reporter columns used by Illumina

comment:2 by Jari Häkkinen, 17 years ago

Milestone: BASE 2.x+BASE 2.4

comment:3 by Nicklas Nordborg, 17 years ago

Description: modified (diff)

comment:4 by Nicklas Nordborg, 17 years ago

Description: modified (diff)

by base, 17 years ago

Attachment: gene_profile_Rat_sample.csv added

by base, 17 years ago

Sample rat data.

by base, 17 years ago

Sample rat annotation file.

by base, 17 years ago

Sample human annotation file. Note the different number of columns per array (2) compared to the rat file, and the different number of arrays per chip (6, where rat has 12).

by base, 17 years ago

Attachment: 1401771138_A.sample.txt added

Example of a file resulting from the R script. Note that the column names are general.

Note: See TracTickets for help on using tickets.