1 | Summary of changes and additions for supporting sequencing |
---|
2 | ========================================================== |
---|
3 | |
---|
4 | 1. Remove [LabeledExtract] and [Label]. Existing labeled |
---|
5 | extracts are converted to [Extract] items with a subtype. |
---|
6 | Existing labels are converted to [Tag] items with a subtype. |
---|
7 | We define two subtypes for tags ('Label' and 'Barcode'), |
---|
8 | and two subtypes for extracts ('LabeledExtract' and 'Library'). |
---|
9 | Extracts can be tagged with a [Tag]. |
---|
10 | |
---|
11 | |
---|
12 | What about protocol types? What do we do with the current |
---|
13 | LABELING protocol type? It would be useful to define more |
---|
14 | protocol types, for example, 'Library preparation'. But |
---|
15 | how do we know which protocol type is the correct one? |
---|
16 | Can we add a link between [ItemSubtype] and [ProtcolType] |
---|
17 | which means that when an item of the given subtype is |
---|
18 | created the protocol should be from the linked protocol |
---|
19 | type? |
---|
20 | |
---|
21 | |
---|
22 | Do we need some kind of 'mode' setting in the GUI so that it uses |
---|
23 | the correct terminology as much as possible? The 'mode' setting |
---|
24 | could also control parts of the 'Validate' functionality in the |
---|
25 | 'Item overview' which may need to work differently. Particularly |
---|
26 | all rules for 'number of channels' which are only needed for microarray |
---|
27 | experiments. |
---|
28 | |
---|
29 | |
---|
30 | 2. Changes in [BioMaterialEvent] |
---|
31 | |
---|
32 | A new entity class [BioMaterialParent] is introduced instead of |
---|
33 | the "anonymous" link-table 'BioMaterialEventSources'. |
---|
34 | This should make it possible to get rid of the [UsedQuantity] |
---|
35 | "fulhack" that was used to support multi-array slides. |
---|
36 | Existing information can easily be moved to the new tables. |
---|
37 | |
---|
38 | The parent and pooled properties are modified so that the |
---|
39 | parent may hold a SINGLE biomaterial of the same type or |
---|
40 | of the parent type. The pooled flag should only be used when |
---|
41 | there are two or more parents (of the same type). |
---|
42 | |
---|
43 | The 'Hybridization' event type is changed to 'BioAssayCreation'. |
---|
44 | The link between [BioMaterialEvent] and [Hybridization] is replaced |
---|
45 | with a link to [PhysicalBioAssay]. |
---|
46 | |
---|
47 | 3. New entity class [PhysicalBioAssay] that replaces [Hybridization] |
---|
48 | |
---|
49 | Existing hybridizations are converted to [PhysicalBioAssay] items |
---|
50 | with a subtype. We define two subtypes: 'Hybridization' and '???'. |
---|
51 | |
---|
52 | The [PhysicalBioAssay] should implement [FileStoreEnabled] so |
---|
53 | that we can link files to it. |
---|
54 | |
---|
55 | New [ProtocolType]: 'Sequencing' |
---|
56 | New [HardwareType]: 'Sequencing station' |
---|
57 | New [Hardware]: 'HiSeq 2000' |
---|
58 | |
---|
59 | See also discussion above about protocol types and item subtypes. |
---|
60 | |
---|
61 | 4. Changes for [FileSet] and related classes |
---|
62 | |
---|
63 | [FileSetMember] is made into an [Annotatable] item so that |
---|
64 | we can add annotations on files. |
---|
65 | |
---|
66 | [FileSet] is modified so that it becomes possible to add more |
---|
67 | than one file for each [DataFileType]. But this is controlled |
---|
68 | by a flag ('allowMultiple'). |
---|
69 | |
---|
70 | See ticket #1604 for more information about this. |
---|
71 | |
---|
72 | 5. New entity classes [BioAssayEvent], [DerivedBioAssaySet] and |
---|
73 | [DerivedBioAssay] that replaces [Scan] and [Image] |
---|
74 | |
---|
75 | The [BioAssayEvent] and [DerivedBioAssaySet] makes up a loop |
---|
76 | that is started from a [PhysicalBioAssay] and ends with a |
---|
77 | [RawBioAssay]. This loop is similar to the loop with [Transformation] |
---|
78 | and [BioAssaySet] in the existing analysis section. |
---|
79 | |
---|
80 | Existing [Scan] and [Image] data is moved into a single "iteration" |
---|
81 | of that loop. The scan data is split between the bioassay event |
---|
82 | (protocol, scanner, date) and the derived bioassay set |
---|
83 | (name, description, owner). One or more derived bioassays are also |
---|
84 | created with links back to the biomaterial they are related to. |
---|
85 | Image data is moved to the file set of the derived bioassay set |
---|
86 | with the properties (jpeg, tiff, etc.) added as annotations. |
---|
87 | |
---|
88 | The new classes can be used to represent the multiple steps |
---|
89 | that are required before sequenced data can be boiled down to |
---|
90 | something that is similar to expression data. |
---|
91 | |
---|
92 | A bioassay event can be linked with [Job], [Protocol], |
---|
93 | [Hardware] and [Software]. It should be possible to |
---|
94 | create iterations both manually and with plug-ins. |
---|
95 | |
---|
96 | The existing 'Analysis' [PluginType] is re-used. Since plug-ins are |
---|
97 | required to implement context-checking there shouldn't be any risk |
---|
98 | of mixing things up. |
---|
99 | |
---|
100 | For gui things we probably need a new plug-in similar to the |
---|
101 | 'Manual transform' plug-in that exists for the experiment analysis |
---|
102 | section. Maybe we can provide configurations for some of the |
---|
103 | software mentioned below. |
---|
104 | |
---|
105 | New [Software]: BCL Converter, Casava, Bowtie, Myrna, Tophat, |
---|
106 | Cufflinks, and more??? |
---|
107 | |
---|
108 | New [DataFileType]: bcl, cif, fastq, qseq, bam, sam, and more??? |
---|
109 | Which ones do we really want to store/reference through BASE? |
---|
110 | |
---|
111 | New [FileType]: ???? |
---|
112 | |
---|
113 | New [Platform]/[PlatformVariant]: ??? |
---|
114 | |
---|
115 | Or should most of this be in an extensions package similar to |
---|
116 | the existing Illumina package? |
---|
117 | |
---|