Ticket #1153: sequencing-draft-2.txt

File sequencing-draft-2.txt, 4.9 KB (added by Nicklas Nordborg, 12 years ago)
Summary of changes and additions for supporting sequencing
==========================================================

1. Remove [LabeledExtract] and [Label]. Existing labeled
   extracts are converted to [Extract] items with a subtype.
   Existing labels are converted to [Tag] items with a subtype.
   We define two subtypes for tags ('Label' and 'Barcode'),
   and two subtypes for extracts ('LabeledExtract' and 'Library').
   Extracts can be tagged with a [Tag].
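
   The conversion described above could look roughly like the
   following Python sketch. It is only an illustration of the idea,
   not BASE code: the class fields and the two convert_* helpers
   are hypothetical names, and only the subtype names come from
   this draft.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Tag:
    name: str
    subtype: str  # 'Label' or 'Barcode'


@dataclass
class Extract:
    name: str
    subtype: Optional[str] = None  # 'LabeledExtract', 'Library' or no subtype
    tag: Optional[Tag] = None      # an extract can be tagged with a Tag


def convert_label(label_name: str) -> Tag:
    """Hypothetical helper: a legacy label becomes a Tag of subtype 'Label'."""
    return Tag(name=label_name, subtype="Label")


def convert_labeled_extract(name: str, label_name: str) -> Extract:
    """Hypothetical helper: a legacy labeled extract becomes a tagged Extract."""
    return Extract(name=name, subtype="LabeledExtract", tag=convert_label(label_name))


# Example data is made up for illustration.
converted = convert_labeled_extract("Sample A.e1.cy3", "cy3")
```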

   What about protocol types? What do we do with the current
   LABELING protocol type? It would be useful to define more
   protocol types, for example, 'Library preparation'. But
   how do we know which protocol type is the correct one?
   Can we add a link between [ItemSubtype] and [ProtocolType]
   so that when an item of a given subtype is created, the
   protocol should be of the linked protocol type?
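
   One possible shape for such a link, as a Python sketch. This
   assumes the link is a single optional reference from the
   subtype to a protocol type, and that a subtype without a link
   accepts any protocol. Both are assumptions; the draft leaves
   the question open, and protocol_matches() is a hypothetical name.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ProtocolType:
    name: str


@dataclass(frozen=True)
class ItemSubtype:
    name: str
    protocol_type: Optional[ProtocolType] = None  # the proposed link (assumption)


@dataclass(frozen=True)
class Protocol:
    name: str
    protocol_type: ProtocolType


def protocol_matches(subtype: ItemSubtype, protocol: Protocol) -> bool:
    """True if the protocol is of the type linked to the subtype.

    A subtype without a linked protocol type accepts any protocol.
    """
    if subtype.protocol_type is None:
        return True
    return protocol.protocol_type == subtype.protocol_type


LABELING = ProtocolType("Labeling")
LIBRARY_PREP = ProtocolType("Library preparation")
library = ItemSubtype("Library", protocol_type=LIBRARY_PREP)
```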

   Do we need some kind of 'mode' setting in the GUI so that it uses
   the correct terminology as much as possible? The 'mode' setting
   could also control parts of the 'Validate' functionality in the
   'Item overview', which may need to work differently. In particular,
   the rules for 'number of channels' are only needed for microarray
   experiments.
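
   A minimal sketch of how a 'mode' could switch validation rules
   on and off. Everything here (the Mode enum, the rule functions
   and the RULES table) is hypothetical and only illustrates that
   the 'number of channels' rule is skipped outside microarray mode.

```python
from enum import Enum
from typing import Callable, Dict, List, Optional, Set, Tuple


class Mode(Enum):
    MICROARRAY = "microarray"
    SEQUENCING = "sequencing"


Item = Dict[str, object]
Rule = Callable[[Item], Optional[str]]  # returns an error message or None


def check_has_name(item: Item) -> Optional[str]:
    return None if item.get("name") else "Missing name"


def check_channels(item: Item) -> Optional[str]:
    channels = item.get("channels", 0)
    ok = isinstance(channels, int) and channels > 0
    return None if ok else "Number of channels must be > 0"


# Each rule is paired with the modes in which it applies.
RULES: List[Tuple[Rule, Set[Mode]]] = [
    (check_has_name, {Mode.MICROARRAY, Mode.SEQUENCING}),
    (check_channels, {Mode.MICROARRAY}),
]


def validate(item: Item, mode: Mode) -> List[str]:
    """Run only the rules that apply in the given mode."""
    errors = []
    for rule, modes in RULES:
        if mode in modes:
            message = rule(item)
            if message is not None:
                errors.append(message)
    return errors
```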

2. Changes in [BioMaterialEvent]

   A new entity class [BioMaterialParent] is introduced instead of
   the "anonymous" link-table 'BioMaterialEventSources'.
   This should make it possible to get rid of the [UsedQuantity]
   "ugly hack" that was used to support multi-array slides.
   Existing information can easily be moved to the new tables.

   The parent and pooled properties are modified so that the
   parent may hold a SINGLE biomaterial of the same type or
   of the parent type. The pooled flag should only be used when
   there are two or more parents (of the same type).
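
   The parent/pooled rule can be read in more than one way. The
   Python sketch below shows one reading: a non-pooled item has at
   most one parent, of its own type or of its parent type, and a
   pooled item has two or more parents of its own type. The
   PARENT_TYPE table, the used_quantity field and check_parents()
   are assumptions made for this illustration only.

```python
from dataclasses import dataclass
from typing import List, Optional

# Assumed type hierarchy: a Sample comes from a BioSource, an Extract from a Sample.
PARENT_TYPE = {"Sample": "BioSource", "Extract": "Sample"}


@dataclass
class BioMaterial:
    name: str
    item_type: str


@dataclass
class BioMaterialParent:
    """Explicit link entity instead of an anonymous link table."""
    parent: BioMaterial
    used_quantity: Optional[float] = None


def check_parents(child: BioMaterial, parents: List[BioMaterialParent],
                  pooled: bool) -> List[str]:
    """Return a list of rule violations (empty if the setup is valid)."""
    errors = []
    if pooled:
        if len(parents) < 2:
            errors.append("pooled requires two or more parents")
        if any(p.parent.item_type != child.item_type for p in parents):
            errors.append("pooled parents must be of the same type as the child")
    else:
        if len(parents) > 1:
            errors.append("a non-pooled item may have only a single parent")
        allowed = {child.item_type, PARENT_TYPE.get(child.item_type)}
        if any(p.parent.item_type not in allowed for p in parents):
            errors.append("parent must be of the same type or of the parent type")
    return errors
```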

   The 'Hybridization' event type is changed to 'BioAssayCreation'.
   The link between [BioMaterialEvent] and [Hybridization] is replaced
   with a link to [PhysicalBioAssay].

3. New entity class [PhysicalBioAssay] that replaces [Hybridization]

   Existing hybridizations are converted to [PhysicalBioAssay] items
   with a subtype. We define two subtypes: 'Hybridization' and '???'.

   The [PhysicalBioAssay] should implement [FileStoreEnabled] so
   that we can link files to it.

   New [ProtocolType]: 'Sequencing'
   New [HardwareType]: 'Sequencing station'
   New [Hardware]: 'HiSeq 2000'

   See also discussion above about protocol types and item subtypes.

4. Changes for [FileSet] and related classes

   [FileSetMember] is made into an [Annotatable] item so that
   we can add annotations on files.

   [FileSet] is modified so that it becomes possible to add more
   than one file for each [DataFileType]. This is controlled
   by a flag ('allowMultiple').

   See ticket #1604 for more information about this.
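
   As an illustration of the 'allowMultiple' idea, here is a small
   Python sketch. It assumes the flag lives on the data file type
   and that adding a second file of a single-file type is rejected
   with an error. Neither detail is decided in this draft, and
   add_member() is a hypothetical method name.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass(frozen=True)
class DataFileType:
    name: str
    allow_multiple: bool = False  # assumed location of the 'allowMultiple' flag


@dataclass
class FileSet:
    # Maps a data file type name to the paths registered for that type.
    members: Dict[str, List[str]] = field(default_factory=dict)

    def add_member(self, file_type: DataFileType, path: str) -> None:
        """Add a file, refusing a second file unless the type allows it."""
        files = self.members.setdefault(file_type.name, [])
        if files and not file_type.allow_multiple:
            raise ValueError(
                "File type '%s' allows only one file" % file_type.name
            )
        files.append(path)


fastq = DataFileType("fastq", allow_multiple=True)
bam = DataFileType("bam")
```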

5. New entity classes [BioAssayEvent], [DerivedBioAssaySet] and
   [DerivedBioAssay] that replace [Scan] and [Image]

   The [BioAssayEvent] and [DerivedBioAssaySet] make up a loop
   that starts from a [PhysicalBioAssay] and ends with a
   [RawBioAssay]. This loop is similar to the loop with [Transformation]
   and [BioAssaySet] in the existing analysis section.

   Existing [Scan] and [Image] data is moved into a single "iteration"
   of that loop. The scan data is split between the bioassay event
   (protocol, scanner, date) and the derived bioassay set
   (name, description, owner). One or more derived bioassays are also
   created with links back to the biomaterial they are related to.
   Image data is moved to the file set of the derived bioassay set
   with the properties (jpeg, tiff, etc.) added as annotations.
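
   The split of scan and image data could be sketched in Python as
   below. The dictionary keys used for the legacy scan and image
   records, the 'format' annotation name and migrate_scan() are
   all hypothetical. The derived bioassays with their biomaterial
   links are left out to keep the sketch short.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class BioAssayEvent:
    protocol: str
    hardware: str    # the scanner
    event_date: str


@dataclass
class DerivedBioAssaySet:
    name: str
    description: str
    owner: str
    created_by: BioAssayEvent
    # File set: file path -> annotations on that file.
    files: Dict[str, Dict[str, str]] = field(default_factory=dict)


def migrate_scan(scan: Dict[str, str], images: List[Dict[str, str]]) -> DerivedBioAssaySet:
    """Move one legacy scan and its images into a single loop iteration."""
    event = BioAssayEvent(
        protocol=scan["protocol"],
        hardware=scan["scanner"],
        event_date=scan["date"],
    )
    result = DerivedBioAssaySet(
        name=scan["name"],
        description=scan["description"],
        owner=scan["owner"],
        created_by=event,
    )
    for image in images:
        # Image properties become annotations on the file set member.
        result.files[image["path"]] = {"format": image["format"]}
    return result
```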

   The new classes can be used to represent the multiple steps
   that are required before sequenced data can be boiled down to
   something that is similar to expression data.

   A bioassay event can be linked with [Job], [Protocol],
   [Hardware] and [Software]. It should be possible to
   create iterations both manually and with plug-ins.
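
   To illustrate how several iterations could chain together, here
   is a small Python sketch. It assumes each derived bioassay set
   keeps a reference to the event that created it and to the set
   it was derived from. The field names, the iterations() helper
   and the example chain are made up for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class BioAssayEvent:
    # All links are optional; a manual step may have none of them.
    job: Optional[str] = None
    protocol: Optional[str] = None
    hardware: Optional[str] = None
    software: Optional[str] = None


@dataclass
class DerivedBioAssaySet:
    name: str
    created_by: BioAssayEvent
    # None means the set was derived directly from the physical bioassay.
    parent: Optional["DerivedBioAssaySet"] = None


def iterations(last: DerivedBioAssaySet) -> List[DerivedBioAssaySet]:
    """Return the chain of derived sets, first iteration first."""
    chain = []
    node: Optional[DerivedBioAssaySet] = last
    while node is not None:
        chain.append(node)
        node = node.parent
    chain.reverse()
    return chain


# Hypothetical three-step chain.
raw = DerivedBioAssaySet("Raw reads", BioAssayEvent(protocol="Sequencing", hardware="HiSeq 2000"))
reads = DerivedBioAssaySet("Converted reads", BioAssayEvent(software="Casava"), parent=raw)
aligned = DerivedBioAssaySet("Aligned reads", BioAssayEvent(software="Bowtie"), parent=reads)
```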

   The existing 'Analysis' [PluginType] is re-used. Since plug-ins are
   required to implement context-checking there shouldn't be any risk
   of mixing things up.

   For the GUI we probably need a new plug-in similar to the
   'Manual transform' plug-in that exists for the experiment analysis
   section. Maybe we can provide configurations for some of the
   software mentioned below.

   New [Software]: BCL Converter, Casava, Bowtie, Myrna, Tophat,
   Cufflinks, and more???

   New [DataFileType]: bcl, cif, fastq, qseq, bam, sam, and more???
   Which ones do we really want to store/reference through BASE?

   New [FileType]: ????

   New [Platform]/[PlatformVariant]: ???

   Or should most of this be in an extensions package similar to
   the existing Illumina package?