1 | Summary of changes and additions for supporting sequencing
|
---|
2 | ==========================================================
|
---|
3 |
|
---|
4 | 1. Remove [LabeledExtract] and [Label]. Existing labeled
|
---|
5 | extracts are converted to [Extract] items with a subtype.
|
---|
6 | Existing labels are converted to [Tag] items with a subtype.
|
---|
7 | We define two subtypes for tags ('Label' and 'Barcode'),
|
---|
8 | and two subtypes for extracts ('LabeledExtract' and 'Library').
|
---|
9 | Extracts can be tagged with a [Tag].
|
---|
10 |
|
---|
11 |
|
---|
12 | What about protocol types? What do we do with the current
|
---|
13 | LABELING protocol type? It would be useful to define more
|
---|
14 | protocol types, for example, 'Library preparation'. But
|
---|
15 | how do we know which protocol type is the correct one?
|
---|
16 | Can we add a link between [ItemSubtype] and [ProtcolType]
|
---|
17 | which means that when an item of the given subtype is
|
---|
18 | created the protocol should be from the linked protocol
|
---|
19 | type?
|
---|
20 |
|
---|
21 |
|
---|
22 | Do we need some kind of 'mode' setting in the GUI so that it uses
|
---|
23 | the correct terminology as much as possible? The 'mode' setting
|
---|
24 | could also control parts of the 'Validate' functionality in the
|
---|
25 | 'Item overview' which may need to work differently. Particularly
|
---|
26 | all rules for 'number of channels' which are only needed for microarray
|
---|
27 | experiments.
|
---|
28 |
|
---|
29 |
|
---|
30 | 2. Changes in [BioMaterialEvent]
|
---|
31 |
|
---|
32 | A new entity class [BioMaterialParent] is introduced instead of
|
---|
33 | the "anonymous" link-table 'BioMaterialEventSources'.
|
---|
34 | This should make it possible to get rid of the [UsedQuantity]
|
---|
35 | "fulhack" that was used to support multi-array slides.
|
---|
36 | Existing information can easily be moved to the new tables.
|
---|
37 |
|
---|
38 | The parent and pooled properties are modified so that the
|
---|
39 | parent may hold a SINGLE biomaterial of the same type or
|
---|
40 | of the parent type. The pooled flag should only be used when
|
---|
41 | there are two or more parents (of the same type).
|
---|
42 |
|
---|
43 | The 'Hybridization' event type is changed to 'BioAssayCreation'.
|
---|
44 | The link between [BioMaterialEvent] and [Hybridization] is replaced
|
---|
45 | with a link to [PhysicalBioAssay].
|
---|
46 |
|
---|
47 | 3. New entity class [PhysicalBioAssay] that replaces [Hybridization]
|
---|
48 |
|
---|
49 | Existing hybridizations are converted to [PhysicalBioAssay] items
|
---|
50 | with a subtype. We define two subtypes: 'Hybridization' and '???'.
|
---|
51 |
|
---|
52 | The [PhysicalBioAssay] should implement [FileStoreEnabled] so
|
---|
53 | that we can link files to it.
|
---|
54 |
|
---|
55 | New [ProtocolType]: 'Sequencing'
|
---|
56 | New [HardwareType]: 'Sequencing station'
|
---|
57 | New [Hardware]: 'HiSeq 2000'
|
---|
58 |
|
---|
59 | See also discussion above about protocol types and item subtypes.
|
---|
60 |
|
---|
61 | 4. Changes for [FileSet] and related classes
|
---|
62 |
|
---|
63 | [FileSetMember] is made into an [Annotatable] item so that
|
---|
64 | we can add annotations on files.
|
---|
65 |
|
---|
66 | [FileSet] is modified so that it becomes possible to add more
|
---|
67 | than one file for each [DataFileType]. But this is controlled
|
---|
68 | by a flag ('allowMultiple').
|
---|
69 |
|
---|
70 | See ticket #1604 for more information about this.
|
---|
71 |
|
---|
72 | 5. New entity classes [BioAssayEvent], [DerivedBioAssaySet] and
|
---|
73 | [DerivedBioAssay] that replaces [Scan] and [Image]
|
---|
74 |
|
---|
75 | The [BioAssayEvent] and [DerivedBioAssaySet] makes up a loop
|
---|
76 | that is started from a [PhysicalBioAssay] and ends with a
|
---|
77 | [RawBioAssay]. This loop is similar to the loop with [Transformation]
|
---|
78 | and [BioAssaySet] in the existing analysis section.
|
---|
79 |
|
---|
80 | Existing [Scan] and [Image] data is moved into a single "iteration"
|
---|
81 | of that loop. The scan data is split between the bioassay event
|
---|
82 | (protocol, scanner, date) and the derived bioassay set
|
---|
83 | (name, description, owner). One or more derived bioassays are also
|
---|
84 | created with links back to the biomaterial they are related to.
|
---|
85 | Image data is moved to the file set of the derived bioassay set
|
---|
86 | with the properties (jpeg, tiff, etc.) added as annotations.
|
---|
87 |
|
---|
88 | The new classes can be used to represent the multiple steps
|
---|
89 | that are required before sequenced data can be boiled down to
|
---|
90 | something that is similar to expression data.
|
---|
91 |
|
---|
92 | A bioassay event can be linked with [Job], [Protocol],
|
---|
93 | [Hardware] and [Software]. It should be possible to
|
---|
94 | create iterations both manually and with plug-ins.
|
---|
95 |
|
---|
96 | The existing 'Analysis' [PluginType] is re-used. Since plug-ins are
|
---|
97 | required to implement context-checking there shouldn't be any risk
|
---|
98 | of mixing things up.
|
---|
99 |
|
---|
100 | For gui things we probably need a new plug-in similar to the
|
---|
101 | 'Manual transform' plug-in that exists for the experiment analysis
|
---|
102 | section. Maybe we can provide configurations for some of the
|
---|
103 | software mentioned below.
|
---|
104 |
|
---|
105 | New [Software]: BCL Converter, Casava, Bowtie, Myrna, Tophat,
|
---|
106 | Cufflinks, and more???
|
---|
107 |
|
---|
108 | New [DataFileType]: bcl, cif, fastq, qseq, bam, sam, and more???
|
---|
109 | Which ones do we really want to store/reference through BASE?
|
---|
110 |
|
---|
111 | New [FileType]: ????
|
---|
112 |
|
---|
113 | New [Platform]/[PlatformVariant]: ???
|
---|
114 |
|
---|
115 | Or should most of this be in an extensions package similar to
|
---|
116 | the existing Illumina package?
|
---|
117 |
|
---|