+

WO2013007996A1 - Salmonella detection assay - Google Patents

Salmonella detection assay Download PDF

Info

Publication number
WO2013007996A1
WO2013007996A1 PCT/GB2012/051614 GB2012051614W WO2013007996A1 WO 2013007996 A1 WO2013007996 A1 WO 2013007996A1 GB 2012051614 W GB2012051614 W GB 2012051614W WO 2013007996 A1 WO2013007996 A1 WO 2013007996A1
Authority
WO
WIPO (PCT)
Prior art keywords
seq
probe
nucleotide
bind
complement
Prior art date
Application number
PCT/GB2012/051614
Other languages
French (fr)
Inventor
John Wain
Mark Achtman
Original Assignee
Health Protection Agency
University College Cork - National University Of Ireland
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Health Protection Agency, University College Cork - National University Of Ireland filed Critical Health Protection Agency
Publication of WO2013007996A1 publication Critical patent/WO2013007996A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/156Polymorphic or mutational markers

Definitions

  • the present invention relates to a method for the detection of Hazard Group 3 (HG3) Salmonella enterica serovar Typhi (S. Typhi), S. Paratyphi A, S. Paratyphi B, and S. Paratyphi C). Also provided are corresponding probes and primers.
  • HG3 Salmonella enterica serovar Typhi S. Typhi
  • S. Paratyphi A Salmonella enterica serovar Typhi
  • S. Paratyphi B S. Paratyphi B
  • S. Paratyphi C S. Paratyphi C
  • probes and primers also provided.
  • HG3 refers to a classification system introduced by the Advisory Committee on Dangerous Pathogens (http://www.dh.gov.uk/ab/ACDP/index.htm).
  • the Advisory Committee on Dangerous Pathogens (ACDP) is a UK-wide advisory non-departmental public body. It was established in 1981 , and the terms of reference were revised in 1991 and 201 1 to allow for a wider remit.
  • S. Typhi and S. Paratyphi are invasive, life-threatening human bacterial pathogens, causing the systemic diseases enteric fever which continues to pose significant threats for public health, especially in many developing countries.
  • a population based study estimated that the global disease burdens associated with typhoid and paratyphoid fever were 21.6 million cases (with 216,000 deaths) and 5.4 million cases, respectively - Crump, J.A. et al. (2004) Bull World Hith Org 82, pp. 346-53.
  • clinical samples are screened for the presence or absence of HG3 Salmonella enterica.
  • HG3 bacterium If a HG3 bacterium is detected the sample in question must be processed in a high-level containment laboratory, containment level 3 (CL-3) laboratory, under stringent, high-level containment procedures, and by laboratory personnel skilled in handling HG3 organisms. In contrast, if no HG3 salmonella bacterium is detected (for example, only HG2 salmonella bacteria such as Enteritidis, Typhimuruim, or Choleraesuis are detected), the sample in question may be processed in a low-level containment laboratory, with associated less stringent operating procedures, and without the need for highly skilled HG3 personnel.
  • CL-3 containment level 3
  • HG3 containment The costs involved with HG3 containment are significant, and considerably higher (in terms of laboratory equipment and consumables, increased processing time due to added exposure risk, and use of highly-skilled personnel) than are the corresponding costs associated with low-level containment.
  • HG3 salmonella detection systems typically rely on a combination of antiserum tests and biochemical tests, which are not only time-consuming (typically taking
  • HG3 detection systems are sub-optimal, and result in the generation of undesirable false-positives (i.e. salmonella assigned to HG3, when, in fact, they should have been assigned to a lower category such as HG2).
  • the present invention addresses one or more of the above-described problems.
  • a first aspect of the present invention provides a method of screening a sample containing nucleic acid for the presence or absence of HG3 salmonella nucleic acid, said method comprising:
  • probe A wherein probe A binds to:
  • probe A does not bind to the nucleotide at position 328 of SEQ ID NO: 2 when the nucleotide at position 328 is C; ii. probe B, wherein probe B binds to:
  • probe B does not bind to the nucleotide at position 136 of SEQ ID NO: 4 when the nucleotide at position 136 is G;
  • probe C wherein probe C binds to:
  • probe C does not bind to the nucleotide at position 378 of SEQ ID NO: 6 when the nucleotide at position 378 is A;
  • probe D wherein probe D binds to:
  • probe E binds to:
  • probe F wherein probe F binds to: i. position 69 of target SEQ ID NO: 1 1 when the nucleotide at position 69 of SEQ ID NO: 1 1 is A, and wherein probe F does not bind to position 69 of SEQ ID NO: 1 1 when the nucleotide at position 69 is G; or
  • probe F does not bind to the nucleotide at position 69 of SEQ I D NO: 1 1 when the nucleotide at position 69 is C;
  • probe G wherein probe G binds to:
  • G does not bind to the nucleotide at position 204 of SEQ ID NO:
  • probe H wherein probe H binds to:
  • probe I wherein probe I binds to:
  • probe I does not bind to the nucleotide at position 300 of SEQ ID NO: 18 when the nucleotide at position 300 is A;
  • probe J binds to: i. position 342 of target SEQ ID NO: 19 when the nucleotide at position 342 of SEQ ID NO: 19 is C, and wherein probe J does not bind to position 342 of SEQ ID NO: 19 when the nucleotide at position 342 is A or T; or
  • probe K wherein probe K binds to:
  • probe K does not bind to the nucleotide at position 252 of SEQ ID NO: 22 when the nucleotide at position 252 is C;
  • probe L wherein probe L binds to:
  • probe L does not bind to the nucleotide at position 9 of SEQ ID NO: 24 when the nucleotide at position 9 is A;
  • the method of present invention provides a highly accurate (e.g. greater than 99%) and/ or rapid (e.g. within 48, 36 or 24 hours) screening system for confirming the presence or absence of HG3 salmonella. This represents a significant improvement over existing HG3 salmonella detection systems.
  • all of the probes A-L are added substantially simultaneously and/ or contact the nucleic acid sample substantially simultaneously.
  • This multiplex embodiment has the advantage of minimising the overall time required to complete the method of the invention.
  • the method is performed in a cyclical manner, each cycle comprising the introduction of one or more of said probe(s) A-L.
  • each of said probes A-L may be added prior to, simultaneously with, or subsequent to one another.
  • the method may include a washing step (eg. between steps B and C) to remove unbound probe(s).
  • the detection step C may be performed on bound probe and/ or on unbound probe.
  • the detection step C may be performed on unbound probe.
  • the initially bound probe may be released (eg. by changing an assay parameter such as temperature and/ or pH), and the released (unbound) probe then detected.
  • the method may further comprise use of probe M, wherein probe M binds to:
  • probe M does not bind to the nucleotide at position 67 of SEQ ID NO: 26 when the nucleotide at position 67 is T, and wherein probe M does not bind to the nucleotide at position 67 of SEQ ID NO: 26 when the nucleotide at position 67 is C;
  • the binding of probe M to its target nucleic acid sequence (or the complement thereof) when in combination with the binding of any one of probes C-L to their respective target nucleic acid sequences (or the complement thereof), provides additional confirmation of the presence of HG3 salmonella nucleic acid.
  • the method of the present invention may further comprise confirming the presence or absence of HG2 salmonella nucleic acid.
  • Said HG2 detection method may be performed prior to, simultaneously with, or subsequent to any one of the HG3 detection method steps.
  • HG2 nucleic acid is confirmed by the binding (to sample nucleic acid) of one or more probe selected from the group consisting of probes N-U, wherein:
  • probe N does not bind to the nucleotide at position 385 of SEQ ID NO: 28 when the nucleotide at position 385 is G;
  • nucleotide at position 501 when the nucleotide at position 501 is G, and wherein probe O does not bind to the nucleotide at position 501 of SEQ ID NO: 30 when the nucleotide at position 501 is A;
  • probe P binds to
  • probe Q does not bind to the nucleotide at position 492 of SEQ ID NO: 34 when the nucleotide at position 492 is T;
  • probe R does not bind to the nucleotide at position 271 of SEQ ID NO: 36 when the nucleotide at position 271 is C;
  • probe S does not bind to the nucleotide at position 100 of SEQ ID NO: 38 when the nucleotide at position 100 is C;
  • probe T does not bind to the nucleotide at position 210 of SEQ ID NO: 40 when the nucleotide at position 210 is T;
  • probe U binds to
  • probe U does not bind to the nucleotide at position 66 of SEQ ID NO: 42 when the nucleotide at position 66 is G.
  • binding of one or more of probes A-B to sample nucleic acid confirms the presence of HG3 S. paratyphi A nucleic acid.
  • binding of one or more of probes C-L (or probe M when in combination with any of probes C-L) to sample nucleic acid confirms the presence of HG3 S. paratyphi B/ Java nucleic acid.
  • binding of one or more of probes J-K to sample nucleic acid confirms the presence of HG3 S. paratyphi C nucleic acid. In one embodiment, binding of probe L to sample nucleic acid confirms the presence of HG3 S. typhi nucleic acid.
  • binding of one or more of probes N-P to sample nucleic acid confirms the presence of HG2 S. enteritidis nucleic acid.
  • binding of one or more of probes Q-T to sample nucleic acid confirms the presence of HG2 S. typhimurium nucleic acid.
  • binding of probe U to sample nucleic acid confirms the presence of HG2 S. choleraesuis nucleic acid.
  • the present invention includes a method of 'serotyping' salmonella bacterial species. In more detail, the aforementioned methods may be performed to confirm (or deny) the presence of one or more specific types of salmonella selected from HG3 S. paratyphi A nucleic acid (via probe A and/ or probe B), HG3 S. paratyphi B/ Java (via one or more, such as all, of probes C-L), HG3 S. paratyphi C nucleic acid (via probe J and/ or probe K), HG3 S.
  • HG2 S. enteritidis nucleic acid via one or more, such as all, of probes N-P
  • HG2 S. typhimurium nucleic acid via one or more, such as all, of probes Q-T
  • HG2 S. choleraesuis via probe U.
  • probe A and/ or probe B may be used to serotype S. paratyphi A; one or more, such as all, of probes C-L may be used to serotype HG3 S. paratyphi B/ Java; probe J and/ or probe K may be used to serotype HG3 S. paratyphi C; probe L may be used to serotype HG3 S. typhi; one or more, such as all, of probes N-P may be used to serotype HG2 S. enteritidis; one or more, such as all, of probes Q-T may be used to serotype HG2 S.
  • probe U may be used to serotype HG2 S. choleraesuis.
  • the probes A-U of the present invention bind to a defined region on the target nucleic acid SEQ ID NOs (or complement thereof) as specified above, wherein said defined region includes the nucleic acid residue at the particular position as specified above.
  • the probe A binds to a region on SEQ ID NO: 1 that includes the nucleic acid residue A at position 328. However, the same probe A does not bind to the same region on SEQ ID NO: 1 when the nucleic acid residue at position 328 is G. Similarly, the probe A binds to a region on SEQ I D NO: 2 (ie. the complement of SEQ ID NO: 1 ) that includes the nucleic acid residue T at position 328. However, the same probe A does not bind to the same region on SEQ ID NO: 2 when the nucleic acid residue at position 328 is C.
  • probe B-U binds to a region on SEQ ID NO: 3 that includes the nucleic acid residue T at position 136.
  • the same probe B does not bind to the same region on SEQ ID NO: 3 when the nucleic acid residue at position 136 is C.
  • the probe B binds to a region on SEQ ID NO: 4 (ie. the complement of SEQ ID NO: 3) that includes the nucleic acid residue A at position 136.
  • the same probe B does not bind to the same region on SEQ ID NO: 4 when the nucleic acid residue at position 136 is G.
  • Each of probes A-U typically comprises a sequence of 9-30 (or longer) consecutive nucleotides. It is this sequence that binds the probe to the defined region on the respective target nucleic acid SEQ ID NO (or complement thereof) as specified above.
  • the probes may contain additional nucleotides, though said additional nucleotides need not necessarily contribute to the binding of the probe to its target nucleic acid (or complement thereof). In this regard, additional nucleotides may be included to help provide stability to the probe. Similarly, additional nucleotides may be added (typically to the 5' end) to provide a labelling means for downstream detection.
  • each of probes A-U may comprise a sequence of at least 9, at least 12, at least 15, at least 18, at least 21 , at least 24, or at least 27 consecutive nucleotides that bind the probe to its target nucleic acid (or complement thereof).
  • each of the probes A-U may comprise at most 30, at most 27, at most 24, at most 21 , at most 18, at most 15, or at most 12 consecutive nucleotides that bind the probe to its target nucleic acid (or complement thereof).
  • Each of probes A-U binds to a defined region on its respective target nucleic acid SEQ ID NO (or complement thereof) as specified above. In this regard, it is self-explanatory that the length of the defined region typically matches the length of the probe.
  • said probe will typically bind to a corresponding defined region of 20 consecutive nucleotides on the target sequence (or complement thereof).
  • Each of probes A-U binds to its respective target nucleic acid SEQ ID NO (or complement thereof) as specified above. In order to do so, each of said probes is complementary in sequence to its respective target nucleic acid SEQ ID NO (or complement thereof).
  • each of probes A-U includes a key nucleotide that forms a natural base-pairing with the particular nucleotide (at the particular position) as specified above for each target nucleic acid SEQ ID NO (or complement thereof).
  • Reference to a natural base-pairing means that, when the specified nucleotide on the target SEQ ID NO (or complement thereof) is A, then the probe contains a key nucleotide T that binds the probe to the A nucleotide present on the target SEQ ID NO (or complement thereof). Similarly, when the specified nucleotide on the target SEQ ID NO (or complement thereof) is T, then the probe will contain a key nucleotide A that binds the probe to the T nucleotide present on the target SEQ ID NO (or complement thereof).
  • the probe when the specified nucleotide on the target SEQ ID NO (or complement thereof) is G, then the probe will contain a key nucleotide C that binds the probe to the G nucleotide present on the target SEQ ID NO (or complement thereof). Similarly, when the specified nucleotide on the target SEQ ID NO (or complement thereof) is C, then the probe will contain a key nucleotide G that binds the probe to the C nucleotide present on the target SEQ ID NO (or complement thereof).
  • Probes A-U have a 5'-to-3' orientation.
  • the key nucleotide may be located at any position within the probe.
  • the key nucleotide may be located within 5 nucleotides of the 3' end of the probe.
  • the key nucleotide may be located at the 3' end (ie. the extreme 3' nucleotide position) of the probe.
  • the key nucleotide may be located 1 , 2, 3, or 4 nucleotides in (ie. in a 5' direction) from the extreme 3' nucleotide position of the probe.
  • each of the probes A-U comprises (or consists of) a nucleotide sequence having at least 80% identity to (eg.
  • nucleotide sequence selected from SEQ ID NOs: 395-436 (as shown in Table 1 ), though with the proviso that the key nucleotide (underlined) is not changed. Conservative substitutions are preferred.
  • the present invention also embraces shorter probe sequences thereof (as described above), though with the proviso that the key nucleotide (underlined) is not changed.
  • each of the probes A-U comprises (or consists of) a nucleotide sequence as shown in Table 1 , though differing at no more than 5 nucleotide positions (for example at no more than 4, 3, 2 or 1 nucleotide position), though again with the proviso that the key nucleotide (underlined) is not changed. Conservative substitutions are preferred.
  • the present invention also embraces shorter probe sequences thereof (as described above), though with the proviso that the key nucleotide (underlined) is not changed.
  • the probes of the invention bind to critical nucleotides (present at specific positions) on genes of HG3 (and/ or HG2) salmonella.
  • probe A binds to 328A of SEQ ID NO: 1 , but not when the nucleotide at position 328 is G.
  • SEQ ID NO: 1 corresponds to a S. paratyphi A aroC gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 43-52.
  • probe A also binds to 328A of any of SEQ ID NOs: 43-52, but not when the nucleotide at position 328 is G.
  • probe A also binds to 328T of any of SEQ ID NOs: 53- 62, but not when the nucleotide at position 328 is C.
  • Probe B binds to 136T of SEQ ID NO: 3, but not when the nucleotide at position 136 is C.
  • SEQ ID NO: 3 corresponds to a S. paratyphi A thr A gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 63-72.
  • probe B also binds to 136T of any of SEQ ID NOs: 63-72, but not when the nucleotide at position 136 is C.
  • SEQ ID NO: 3 ie. SEQ ID NO: 4
  • SEQ ID NOs: 73-82 represent strain variants of SEQ ID NO: 4.
  • probe B also binds to 136A of any of SEQ ID NOs: 73-82, but not when the nucleotide at position 136 is G.
  • Probe C binds to 378C of SEQ ID NO: 5, but not when the nucleotide at position 378 is T.
  • SEQ ID NO: 5 corresponds to a S. paratyphi B/ java dnaN gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 83-92.
  • probe C also binds to 378C of any of SEQ ID NOs: 83-92, but not when the nucleotide at position 378 is T.
  • SEQ ID NO: 6 ie. SEQ ID NO: 5
  • SEQ ID NOs: 93-102 represent strain variants of SEQ ID NO: 5.
  • probe C also binds to 378G of any of SEQ ID NOs: 93-102, but not when the nucleotide at position 378 is A.
  • Probe D binds to 57C of SEQ ID NO: 7, but not when the nucleotide at position 57 is T.
  • SEQ ID NO: 7 corresponds to a S. paratyphi B/ java dnaN gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 103- 1 12.
  • probe D also binds to 57C of any of SEQ ID NOs: 103-1 12, but not when the nucleotide at position 57 is T.
  • SEQ ID NO: 7 ie. SEQ ID NO: 8
  • SEQ ID NOs: 1 13-122 represent strain variants of SEQ ID NO: 7.
  • probe D also binds to 57G of any of SEQ ID NOs: 1 13-122, but not when the nucleotide at position 57 is A.
  • Probe E binds to 339C of SEQ ID NO: 9, but not when the nucleotide at position 339 is T.
  • SEQ ID NO: 9 corresponds to a S. paratyphi B/ java hisD gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 123- 132.
  • probe E also binds to 339C of any of SEQ ID NOs: 123-132, but not when the nucleotide at position 339 is T.
  • SEQ ID NO: 9 ie. SEQ ID NO: 10
  • SEQ ID NOs: 133-142 represent strain variants of SEQ ID NO: 10.
  • probe E also binds to 339G of any of SEQ ID NOs: 133-142, but not when the nucleotide at position 339 is A.
  • Probe F binds to 69A of SEQ ID NO: 1 1 , but not when the nucleotide at position 69 is G.
  • SEQ ID NO: 1 1 corresponds to a S. paratyphi B/ java hisD gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 143- 152.
  • probe F also binds to 60A of any of SEQ ID NOs: 143-152, but not when the nucleotide at position 69 is G.
  • SEQ ID NO: 1 1 ie. SEQ ID NO: 12
  • SEQ ID NOs: 153-162 represent strain variants of SEQ ID NO: 12.
  • probe F also binds to 69T of any of SEQ ID NOs: 153-162, but not when the nucleotide at position 69 is C.
  • Probe G binds to 204G of SEQ ID NO: 13, but not when the nucleotide at position 204 is A.
  • SEQ ID NO: 13 corresponds to a S. paratyphi B/ java hisD gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 163- 172.
  • probe G also binds to 204G of any of SEQ ID NOs: 163-172, but not when the nucleotide at position 204 is A.
  • SEQ ID NO: 14 the complement of SEQ ID NO: 13 (ie. SEQ ID NO: 14) in that SEQ ID NOs: 173-182 represent strain variants of SEQ ID NO: 14.
  • probe G also binds to 204C of any of SEQ ID NOs: 173-182, but not when the nucleotide at position 204 is T.
  • Probe H binds to 255A of SEQ I D NO: 15, but not when the nucleotide at position 255 is G.
  • SEQ ID NO: 15 corresponds to a S. paratyphi B/ java sucA gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 183- 192.
  • probe H also binds to 255A of any of SEQ I D NOs: 183-192, but not when the nucleotide at position 255 is G.
  • SEQ ID NO: 15 ie. SEQ ID NO: 16
  • SEQ ID NOs: 193-202 represent strain variants of SEQ ID NO: 16.
  • probe H also binds to 255T of any of SEQ ID NOs: 193-202, but not when the nucleotide at position 255 is C.
  • Probe I binds to 300C of SEQ ID NO: 17, but not when the nucleotide at position 300 is T.
  • SEQ ID NO: 17 corresponds to a S. paratyphi B/ java sucA gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 203- 212.
  • probe I also binds to 300C of any of SEQ ID NOs: 203-212, but not when the nucleotide at position 300 is T.
  • SEQ ID NO: 18 the complement of SEQ ID NO: 17 (ie. SEQ ID NO: 18) in that SEQ ID NOs: 213-222 represent strain variants of SEQ ID NO: 18.
  • probe I also binds to 300G of any of SEQ ID NOs: 213-222, but not when the nucleotide at position 300 is A.
  • Probe J binds to 342C of SEQ ID NO: 19, but not when the nucleotide at position 342 is A or T.
  • SEQ ID NO: 19 corresponds to a S. paratyphi C dnaN gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 223- 232.
  • probe J also binds to 342C of any of SEQ ID NOs: 223-232, but not when the nucleotide at position 342 is A or T.
  • SEQ ID NO: 19 ie. SEQ ID NO: 20
  • SEQ ID NOs: 233-242 represent strain variants of SEQ I D NO: 20.
  • probe J also binds to 342G of any of SEQ ID NOs: 233-242, but not when the nucleotide at position 342 is T or A.
  • Probe K binds to 252T of SEQ ID NO: 21 , but not when the nucleotide at position 252 is G.
  • SEQ ID NO: 21 corresponds to a S. paratyphi C sucA gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 243- 252.
  • probe K also binds to 352T of any of SEQ ID NOs: 243-252, but not when the nucleotide at position 252 is G.
  • SEQ ID NO: 21 ie. SEQ ID NO: 22
  • SEQ ID NOs: 253-262 represent strain variants of SEQ ID NO: 22.
  • probe K also binds to 252A of any of SEQ ID NOs: 253-262, but not when the nucleotide at position 252 is C.
  • Probe L binds to 9C of SEQ ID NO: 23, but not when the nucleotide at position 9 is T.
  • SEQ ID NO: 23 corresponds to a S. paratyphi C aroC gene sequence, and strain variants of said sequence are known in the art.
  • probe L also binds to 9C of any of said strain variant SEQ ID NOs, but not when the nucleotide at position 9 is T
  • SEQ ID NO: 24 the complement of SEQ ID NO: 23 (ie. SEQ ID NO: 24) in that strain variants of SEQ ID NO: 24 are known in the art.
  • probe L also binds to 9G of any of said variant SEQ ID NOs, but not when the nucleotide at position 9 is A.
  • Probe M binds to 67A of SEQ ID NO: 25, but not when the nucleotide at position 67 is G.
  • SEQ ID NO: 25 corresponds to a S. paratyphi B tartrate gene sequence, and strain variants of said sequence are illustrated, for example, in SEQ ID NO: 263.
  • probe M also binds to 67A of any of said variant SEQ ID NOs, but not when the nucleotide at position 67 is G.
  • probe M also binds to 67T of any of strain variant SEQ ID NOs, but not when the nucleotide at position 67 is C.
  • Probe N binds to 385T of SEQ ID NO: 27, but not when the nucleotide at position 385 is C.
  • SEQ ID NO: 27 corresponds to a S. enteritidis hisD gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 265- 267.
  • probe N also binds to 385T of any of SEQ ID NOs: 265-267, but not when the nucleotide at position 385 is C.
  • SEQ ID NO: 28 the complement of SEQ ID NO: 27 (ie. SEQ ID NO: 28) in that SEQ ID NOs: 268-270 represent strain variants of SEQ ID NO: 28.
  • probe N also binds to 385A of any of SEQ ID NOs: 268-270, but not when the nucleotide at position 385 is G.
  • Probe O binds to 501 C of SEQ ID NO: 29, but not when the nucleotide at position 501 is T.
  • SEQ ID NO: 29 corresponds to a S. enteritidis hisD gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 271 - 280.
  • probe O also binds to 501 C of any of SEQ ID NOs: 271 -280, but not when the nucleotide at position 501 is T.
  • the same applies to the complement of SEQ ID NO: 29 ie.
  • SEQ ID NO: 30 in that SEQ ID NOs: 281 -290 represent strain variants of SEQ ID NO: 30.
  • probe O also binds to 501 G of any of SEQ ID NOs: 281 -290, but not when the nucleotide at position 501 is A.
  • Probe P binds to 271 G of SEQ I D NO: 31 , but not when the nucleotide at position 271 is T.
  • SEQ ID NO: 31 corresponds to a S. enteritidis purE gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 291 - 300.
  • probe P also binds to 271 G of any of SEQ ID NOs: 291 -300, but not when the nucleotide at position 271 is T.
  • SEQ ID NO: 32 the complement of SEQ ID NO: 31 (ie. SEQ ID NO: 32) in that SEQ ID NOs: 301 -310 represent strain variants of SEQ ID NO: 32.
  • probe P also binds to 271 C of any of SEQ ID NOs: 301 -310, but not when the nucleotide at position 271 is A.
  • Probe Q binds to 492G of SEQ ID NO: 33, but not when the nucleotide at position 492 is A.
  • SEQ ID NO: 33 corresponds to a S. typhimurium aroC gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 31 1 - 320.
  • probe Q also binds to 492G of any of SEQ ID NOs: 31 1 -320, but not when the nucleotide at position 492 is A.
  • the same applies to the complement of SEQ ID NO: 33 ie.
  • SEQ ID NO: 34 in that SEQ ID NOs: 321 -330 represent strain variants of SEQ ID NO: 34.
  • probe Q also binds to 492C of any of SEQ ID NOs: 321 -330, but not when the nucleotide at position 492 is T.
  • Probe R binds to 271 T of SEQ ID NO: 35, but not when the nucleotide at position 271 is G.
  • SEQ ID NO: 35 corresponds to a S. typhimurium dnaN gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 331 - 338.
  • probe R also binds to 271 T of any of SEQ ID NOs: 331 -338, but not when the nucleotide at position 271 is G.
  • SEQ ID NO: 35 ie. SEQ ID NO: 36
  • SEQ ID NOs: 339-346 represent strain variants of SEQ ID NO: 36.
  • probe R also binds to 271A of any of SEQ ID NOs: 339-346, but not when the nucleotide at position 271 is C.
  • Probe S binds to 100T of SEQ ID NO: 37, but not when the nucleotide at position 100 is G.
  • SEQ ID NO: 37 corresponds to a S. typhimurium hemD gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 347- 356.
  • probe S also binds to 100T of any of SEQ ID NOs: 347-356, but not when the nucleotide at position 100 is G.
  • SEQ ID NO: 38 the complement of SEQ ID NO: 37 (ie. SEQ ID NO: 38) in that SEQ ID NOs: 357-366 represent strain variants of SEQ ID NO: 38.
  • probe S also binds to 100A of any of SEQ ID NOs: 357-366, but not when the nucleotide at position 100 is C.
  • Probe T binds to 210T of SEQ ID NO: 39, but not when the nucleotide at position 210 is A.
  • SEQ ID NO: 39 corresponds to a S. typhimurium thrA gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 367- 376.
  • probe T also binds to 210T of any of SEQ ID NOs: 367-376, but not when the nucleotide at position 201 is A.
  • SEQ ID NO: 40 the complement of SEQ ID NO: 39 (ie. SEQ ID NO: 40) in that SEQ ID NOs: 377-386 represent strain variants of SEQ ID NO: 40.
  • probe T also binds to 210A of any of SEQ ID NOs: 377-386, but not when the nucleotide at position 210 is T.
  • Probe U binds to 66T of SEQ ID NO: 41 , but not when the nucleotide at position 66 is C.
  • SEQ ID NO: 41 corresponds to a S. cholera aroC gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 387-390.
  • probe U also binds to 66T of any of SEQ ID NOs: 387-390, but not when the nucleotide at position 66 is C.
  • probe U also binds to 66A of any of SEQ ID NOs: 391 -394, but not when the nucleotide at position 66 is G.
  • the probes of the invention are designed to hybridise to their target (or complement) sequence. It is preferred that the binding conditions are such that a high level of specificity is provided - ie. hybridisation of the probe occurs under "stringent conditions". In general, stringent conditions are selected to be about 5°C lower than the thermal melting point (T m ) for the specific sequence at a defined ionic strength and pH.
  • the T m is the temperature (under defined ionic strength and pH) at which 50% of the target (or complement) sequence hybridises to a perfectly matched probe.
  • the T m of probes of the present invention at a salt concentration of about 0.02M or less at pH 7, is for example above 60°C, such as about 70°C.
  • Premixed binding solutions are commercially available (eg. EXPRESSHYB Hybridisation Solution from CLONTECH Laboratories, Inc.), and hybridisation can be performed according to the manufacturer's instructions.
  • Preferred probes of the present invention are selected so as to have minimal homology with human DNA.
  • the selection process may involve comparing a candidate probe sequence with human DNA and rejecting the probe if the homology is greater than 50%.
  • the aim of this selection process is to reduce annealing of probe to contaminating human DNA sequences and hence allow improved specificity of the assay.
  • any of the probes described herein may comprise a tag and/ or label.
  • the tag and/ or label may, for example, be located (independently of one another) towards the middle or towards or at the 5' or 3' end of the herein described probes, for example at the 5' end.
  • the tag/ label is associated with the HG3 target nucleic acid (or complement thereof).
  • the probes may act as primers during the method of the invention and the tag/ label may therefore become incorporated into the amplification product as the primer is extended.
  • suitable labels include detectable labels such as radiolabels or fluorescent or coloured molecules, enzymatic markers or chromogenic markers - eg. dyes that produce a visible colour change upon hybridisation of the probe.
  • the label may be digoxygenin, fluorescein-isothiocyanate (FITC), R-phycoerythrin, Alexa 532 or Cy3.
  • the label may be a reporter molecule, which is detected directly, such as by exposure to photographic or X-ray film.
  • the label is not directly detectable, but may be detected indirectly, for example, in a two-phase system.
  • An example of indirect label detection is binding of an antibody to the label.
  • tags include "complement/ anti-complement pairs".
  • the term "complement/ anti-complement pair” denotes non-identical moieties that form a non- covalently associated, stable pair under appropriate conditions.
  • suitable tags include biotin and streptavidin (or avidin).
  • a biotin tag may be captured using streptavidin, which may be coated onto a substrate or support such as a bead (for example a magnetic bead) or membrane.
  • a streptavidin tag may be captured using biotin, which may be coated onto a substrate or support such as a bead (for example a magnetic bead) or membrane.
  • complement/ anti-complement pairs include receptor/ ligand pairs, antibody/ antigen (or hapten or epitope) pairs, and the like.
  • Another example is a nucleic acid sequence tag that binds to a complementary sequence. The latter may itself be pre-labelled, or may be attached to a surface (eg. a bead) which is separately labelled.
  • An example of the latter embodiment is the well-known Luminex R bead system.
  • Other exemplary pairs of tags and capture molecules include receptor/ ligand pairs and antibody/ antigen (or hapten or epitope) pairs. Where subsequent dissociation of the complement/ anti- complement pair is desirable, the complement/ anti-complement pair has a binding affinity of, for example, less than 10 9 M "1 .
  • probes A-U may be labelled with different labels or tags, thereby allowing separate identification of each probe when used in the method of the present invention.
  • one or more of probes A-U may be labelled with different nucleic acid sequences (aka 'sequence tags' or 'sequence labels').
  • Said different nucleic acid sequence labels may be attached, for example, to the 5' end of the defined sequence of the probe that binds to the target nucleic acid SEQ ID NO (or complement thereof), thereby allowing separate identification of each probe when used in the method of the present invention.
  • each of said nucleic acid tags is able to bind to a different, complementary nucleic acid sequence present on a surface (eg.
  • nucleic acid tags are illustrated in Table 2.
  • nucleic acid tags Any conventional method may be employed to attach nucleic acid tags to a probe of the present invention (eg. to the 5' end of the defined binding region of the probe).
  • nucleic acid probes of the invention may be constructed by commercial providers. Examples of probes of the invention to which tags illustrated in Table 2 have been attached are illustrated in Table 3.
  • the sample is for example a clinical sample (or is derived from a clinical sample) such as: faeces or blood, sputum, bronchoalveolar lavage, tracheal aspirate, lung tissue samples, cerebrospinal fluid, archaeological samples.
  • a clinical sample or is derived from a clinical sample
  • faeces or blood such as: faeces or blood, sputum, bronchoalveolar lavage, tracheal aspirate, lung tissue samples, cerebrospinal fluid, archaeological samples.
  • an amplification step may be introduced.
  • amplification may be carried out using methods and platforms known in the art, for example PCR, such as real-time PCR, block-based PCR, ligase chain reaction, glass capillaries, isothermal amplification methods including loop-mediated isothermal amplification, rolling circle amplification transcription mediated amplification, nucleic acid sequence-based amplification, signal mediated amplification of RNA technology, strand displacement amplification, isothermal multiple displacement amplification, helicase-dependent amplification, single primer isothermal amplification, and circular helicase-dependent amplification.
  • PCR such as real-time PCR, block-based PCR, ligase chain reaction, glass capillaries
  • isothermal amplification methods including loop-mediated isothermal amplification, rolling circle amplification transcription mediated amplification, nucleic acid sequence-based amplification, signal mediated amplification of RNA technology, strand displacement amplification, isothermal multiple displacement amplification,
  • amplification can be carried using any amplification platform - as such, an advantage of this embodiment of the assay is that it is platform independent and not tied to any particular instrument.
  • a general amplification step eg. pre-detection
  • PCR amplification primers are typically employed to amplify approximately 100-400 base pair regions of the target/ complementary nucleic acid that contain the HG3 (and/ or HG2) nucleotide targets of the present invention.
  • forward and reverse primers are extended in a 5' to 3' direction, thereby initiating the synthesis of new nucleic acid strands that are complementary to the individual strands of the target/ complementary HG3 nucleic acid.
  • the primers thereby drive amplification of HG3 (and/ or HG2) nucleic acid sequences, thereby generating amplification products comprising said HG3 (and/ or HG2) nucleic acid sequences.
  • an amplification step may be employed in which the probes of the present invention act as primers.
  • the probes act as primers
  • the probes are extended from their 3' ends (ie. in a 5'-to-'3') direction.
  • the resulting amplification products typically comprise 100-400 base pair regions of the target/ complementary nucleic acid.
  • This embodiment may be employed in conjunction with a general amplification step, such as the one described above.
  • the detection step may be carried out by any known means.
  • the probe or amplification product may be tagged and/ or labelled, and the detection method may therefore comprise detecting said tag and/ or label.
  • the probe(s) may comprise a tag and/ or label.
  • the tag/ label becomes associated with the target (or complement) nucleic acid.
  • the assay may comprise detecting the tag/ label and correlating presence of tag/ label with presence of HG3.
  • tag and/ or label may be incorporated during extension of the probe(s).
  • the amplification product(s) become tagged/ labelled, and the assay may therefore comprise detecting the tag/ label and correlating presence of tag/ label with presence of amplification product, and hence the presence of HG3 (and/ or HG2).
  • the amplification product may incorporate a tag/ label (eg. via a tagged/ labelled dNTP such as biotin-dNTP) as part of the amplification process, and the assay may further comprise the use of a binding partner complementary to said tag (eg. streptavidin) that includes a detectable tag/ label (eg. a fluorescent label, such as R-phycoerythrin).
  • a detectable tag/ label eg. a fluorescent label, such as R-phycoerythrin
  • the amplified product incorporates a detectable tag/ label (eg. a fluorescent label, such as R-phycoerythrin).
  • the probe(s) and/ or the amplification product(s) may include a further tag/ label (as the complement component) to allow capture of the amplification product(s).
  • a “complement/ anti-complement” pairing may be employed in which an anti-complement capture component binds to said further tag/ label (complement component) and thereby permits capture of the probe(s) and/ or amplification product(s).
  • suitable "complement/ anti-complement” partners have been described earlier in this specification, such as a complementary pair of nucleic acid sequences, a complementary antibody-antigen pair, etc.
  • the anti-complement capture component may be attached (eg. coated) on to a substrate or solid support - examples of suitable substrates/ supports include membranes and/ or beads (eg. a magnetic or fluorescent bead). Capture methods are well known in the art. For example, Luminex R beads may be employed. Alternatively, the use of magnetic beads may be advantageous because the beads (plus captured, tagged/ labelled amplification product) can easily be concentrated and separated from the sample, using conventional techniques known in the art.
  • Immobilisation provides a physical location for the anti-complement capture component (or probes), and may serve to fix the capture component/ probe at a desired location and/ or facilitate recovery or separation of probe.
  • the support may be a rigid solid support made from, for example, glass or plastic, such as a bead (for example a fluorescent or magnetic bead).
  • the support may be a membrane, such as nylon or nitrocellulose membrane.
  • 3D matrices are also suitable supports for use with the present invention - eg. polyacrylamide or PEG gels.
  • Immobilisation to a support/ platform may be achieved by a variety of conventional means.
  • immobilisation onto a support such as a nylon membrane may be achieved by UV cross-linking.
  • biotin-labelled molecules may be bound to streptavidin-coated substrates (and vice-versa), and molecules prepared with amino linkers may be immobilised on to silanised surfaces.
  • Another means of immobilisation is via a poly-T tail or a poly-C tail, for example at the 3' or 5' end.
  • probes A-L (optionally also probe M; optionally also probes N-U with or without probe M) of the invention comprise a nucleic acid sequence tag/ label (eg. attached to each probe at the 5' end of the defined sequence of the probe that binds to target/ complement nucleic acid).
  • each of the probes is provided with a different nucleic acid sequence tag/ label, wherein each of said tags/ labels (specifically) binds to a complementary nucleic acid sequence present on the surface of a bead.
  • Each of the different tags/ labels binds to its complementary sequence counterpart (and not to any of the complementary sequence counterparts of the other tags), which is located on a uniquely identifiable bead.
  • the beads are uniquely identifiable, for example by means of fluorescence at a specific wavelength.
  • probes of the invention bind to target/ complement nucleic acid (if present in the sample). Thereafter, (only) the bound probes may be extended (in the 3' direction) in the presence of one or more labelled dNTP (eg. biotin labelled dNTPs, such as biotin- dCTPs).
  • the extended primers may be contacted with a binding partner counterpart to the labelled dNTPs (eg. a streptavidin labelled flurophore, such as streptavidin labelled R- phycoerythrin), which binds to those labelled dNTPs that have become incorporated into the extended primers.
  • a binding partner counterpart to the labelled dNTPs eg. a streptavidin labelled flurophore, such as streptavidin labelled R- phycoerythrin
  • the labelled extended primers may be identified by allowing them to bind to their nucleic acid counterparts present on the uniquely identifiable beads. The latter may then be "called” (eg. to determine the type of bead present by wavelength emission) and the nature of the primer extension (and thus the type of target/ complement nucleic acid present) may be determined.
  • a simple schematic of one such embodiment is illustrated in Figure 1 .
  • HG3 detection probes For simplicity only two HG3 detection probes are illustrated - for illustrative purposes only, the first probe has a 3' nucleotide residue designated G and the second probe has a 3' nucleotide residue designated A. Each of the probes has a different 5' nucleic acid sequence tag.
  • Figure 1 illustrates a scenario in which only the second probe binds to salmonella nucleic acid present in the test sample.
  • the first probe does not bind to any nucleic acid present in the test sample.
  • the bound second probe is then extended in a 3' direction, and fluorescent label is incorporated as the probe becomes extended.
  • the present invention (a method of serotyping by PCR used for the detection of Hazard Group 3 (HG3) Salmonella) has been tested for use in the Laboratory for Gastrointestinal Pathogens at the HPA Colindale. Specific examples will now be given of the use of the invention in that laboratory. These are exemplary; they are not limiting in any way to the scope of the claims or to the invention. These examples represent the best practice for the using the invention as currently envisaged.
  • Example 1 The invention was tested for an ability to screen all Salmonella to allow release of HG2 Salmonella from the high level containment laboratory (CL). HG3 Salmonella must remain in the CL.
  • Salmonella Typhi (HG3): 150 tested and 150 correctly identified by the invention.
  • Salmonella Paratyphi A (HG3) 76 tested and 76 correctly identified by the invention;
  • Salmonella Paratyphi C (HG3) 34 tested and 34 correctly identified by the invention; of 260 HG3 isolates all were correctly identified.
  • Salmonella Enteritidis 25 tested and 25 correctly identified by the invention (1 not assigned as either HG3 or HG2).
  • the invention has an accuracy of 100% for the positive identification of 260 HG3 Salmonella and an accuracy of 106/106 for the positive identification of HG2 Salmonella.
  • the serious error rate (identification HG3 as HG2 Salmonella) was zero.
  • Example 2 The invention was tested using 356 unknown isolates as they were received by the Laboratory for Gastrointestinal Pathogens.
  • Salmonella Typhi (HG3): 17 tested and 17 correctly identified by the invention.
  • Salmonella Paratyphi A (HG3) 5 tested and 5 correctly identified by the invention; of 22 HG3 isolates all were correctly identified.
  • Other HG2 Salmonella 221 tested and 221 correctly identified by the invention; of 334 HG2 Salmonella tested 333 were correctly identified.
  • the invention when the operator is blind to the expected result of the test, has an accuracy of 22/22 for the positive identification of HG3 Salmonella and an accuracy of 333/334 for the positive identification of HG2 Salmonella.
  • the serious error rate (identification HG3 as HG2 Salmonella) was zero.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Engineering & Computer Science (AREA)
  • Immunology (AREA)
  • Pathology (AREA)
  • Analytical Chemistry (AREA)
  • Zoology (AREA)
  • Genetics & Genomics (AREA)
  • Wood Science & Technology (AREA)
  • Physics & Mathematics (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Hospice & Palliative Care (AREA)
  • Biophysics (AREA)
  • Oncology (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The present invention relates to a method for the detection of Hazard Group 3 (HG3) Salmonella enterica serovar Typhi(S.Typhi), S.Paratyphi A, S.Paratyphi B, and S. Paratyphi C. Also provided are corresponding probes and primers.

Description

Salmonella detection assay
The present invention relates to a method for the detection of Hazard Group 3 (HG3) Salmonella enterica serovar Typhi (S. Typhi), S. Paratyphi A, S. Paratyphi B, and S. Paratyphi C). Also provided are corresponding probes and primers.
HG3 refers to a classification system introduced by the Advisory Committee on Dangerous Pathogens (http://www.dh.gov.uk/ab/ACDP/index.htm). The Advisory Committee on Dangerous Pathogens (ACDP) is a UK-wide advisory non-departmental public body. It was established in 1981 , and the terms of reference were revised in 1991 and 201 1 to allow for a wider remit.
S. Typhi and S. Paratyphi are invasive, life-threatening human bacterial pathogens, causing the systemic diseases enteric fever which continues to pose significant threats for public health, especially in many developing countries. In 2000, a population based study estimated that the global disease burdens associated with typhoid and paratyphoid fever were 21.6 million cases (with 216,000 deaths) and 5.4 million cases, respectively - Crump, J.A. et al. (2004) Bull World Hith Org 82, pp. 346-53. As part of existing health and safely procedures, clinical samples are screened for the presence or absence of HG3 Salmonella enterica. If a HG3 bacterium is detected the sample in question must be processed in a high-level containment laboratory, containment level 3 (CL-3) laboratory, under stringent, high-level containment procedures, and by laboratory personnel skilled in handling HG3 organisms. In contrast, if no HG3 salmonella bacterium is detected (for example, only HG2 salmonella bacteria such as Enteritidis, Typhimuruim, or Choleraesuis are detected), the sample in question may be processed in a low-level containment laboratory, with associated less stringent operating procedures, and without the need for highly skilled HG3 personnel.
The costs involved with HG3 containment are significant, and considerably higher (in terms of laboratory equipment and consumables, increased processing time due to added exposure risk, and use of highly-skilled personnel) than are the corresponding costs associated with low-level containment.
Existing HG3 salmonella detection systems typically rely on a combination of antiserum tests and biochemical tests, which are not only time-consuming (typically taking
l approximately 20 days), but also often inconclusive and/ or inaccurate. Thus, existing HG3 detection systems are sub-optimal, and result in the generation of undesirable false-positives (i.e. salmonella assigned to HG3, when, in fact, they should have been assigned to a lower category such as HG2).
Moreover, the risks associated with incorrect assignment of a salmonella HG3 bacterium to a lower Hazard Group category (e.g. HG2), are highly undesirable. First, the handling of a HG3 salmonella bacterium in a low level containment facility inherently leads to an increased risk of contamination and spread. Secondly, the processing of a HG3 bacterium in a non-CL3 containment laboratory mandates a detailed, written incident report.
The above-described factors combine to place an ever-increasing demand on the use of CL-3 facilities and on the use of highly-skilled HG3-trained personnel. Both CL-3 facilities and HG3-trained personnel are expensive commodities and a limited resource.
There is therefore a need in the art for a method and/ or laboratory tools, which address one or more of the above-mentioned problems. For example, there is a need for more accurate and/ or more rapid diagnosis of HG3 salmonella.
The present invention addresses one or more of the above-described problems.
A first aspect of the present invention provides a method of screening a sample containing nucleic acid for the presence or absence of HG3 salmonella nucleic acid, said method comprising:
A. contacting the sample nucleic acid with a series of probes, said series of probes comprising:
i. probe A, wherein probe A binds to:
i. position 328 of target SEQ ID NO: 1 when the nucleotide at position 328 of SEQ ID NO: 1 is A, and wherein probe A does not bind to position 328 of SEQ ID NO: 1 when the nucleotide at position 328 is G; or
ii. position 328 of SEQ ID NO: 2 (the complement of SEQ ID NO:
1 ) when the nucleotide at position 328 is T, and wherein probe A does not bind to the nucleotide at position 328 of SEQ ID NO: 2 when the nucleotide at position 328 is C; ii. probe B, wherein probe B binds to:
i. position 136 of target SEQ ID NO: 3 when the nucleotide at position 136 of SEQ ID NO: 3 is T, and wherein probe B does not bind to position 136 of SEQ ID NO: 3 when the nucleotide at position 136 is C; or
ii. position 136 of SEQ ID NO: 4 (the complement of SEQ ID NO:
3) when the nucleotide at position 136 is A, and wherein probe B does not bind to the nucleotide at position 136 of SEQ ID NO: 4 when the nucleotide at position 136 is G;
iii. probe C, wherein probe C binds to:
i. position 378 of target SEQ ID NO: 5 when the nucleotide at position 378 of SEQ ID NO: 5 is C, and wherein probe C does not bind to position 378 of SEQ ID NO: 5 when the nucleotide at position 378 is T; or
ii. position 378 of SEQ ID NO: 6 (the complement of SEQ ID NO:
5) when the nucleotide at position 378 is G, and wherein probe C does not bind to the nucleotide at position 378 of SEQ ID NO: 6 when the nucleotide at position 378 is A;
iv. probe D, wherein probe D binds to:
i. position 57 of target SEQ ID NO: 7 when the nucleotide at position 57 of SEQ ID NO: 7 is C, and wherein probe D does not bind to position 57 of SEQ ID NO: 7 when the nucleotide at position 57 is T; or
ii. position 57 of SEQ ID NO: 8 (the complement of SEQ ID NO: 7) when the nucleotide at position 57 is G, and wherein probe D does not bind to the nucleotide at position 57 of SEQ ID NO: 8 when the nucleotide at position 57 is A;
v. probe E, wherein probe E binds to:
i. position 339 of target SEQ ID NO: 9 when the nucleotide at position 339 of SEQ ID NO: 9 is C, and wherein probe E does not bind to position 339 of SEQ ID NO: 9 when the nucleotide at position 339 is T; or
ii. position 339 of SEQ ID NO: 10 (the complement of SEQ I D NO:
9) when the nucleotide at position 339 is G, and wherein probe E does not bind to the nucleotide at position 339 of SEQ ID NO: 10 when the nucleotide at position 339 is A;
vi. probe F, wherein probe F binds to: i. position 69 of target SEQ ID NO: 1 1 when the nucleotide at position 69 of SEQ ID NO: 1 1 is A, and wherein probe F does not bind to position 69 of SEQ ID NO: 1 1 when the nucleotide at position 69 is G; or
ii. position 69 of SEQ ID NO: 12 (the complement of SEQ ID NO:
1 1 ) when the nucleotide at position 69 is T, and wherein probe F does not bind to the nucleotide at position 69 of SEQ I D NO: 1 1 when the nucleotide at position 69 is C;
vii. probe G, wherein probe G binds to:
i. position 204 of target SEQ ID NO: 13 when the nucleotide at position 204 of SEQ I D NO: 13 is G, and wherein probe G does not bind to position 204 of SEQ ID NO: 13 when the nucleotide at position 204 is A; or
ii. position 204 of SEQ ID NO: 14 (the complement of SEQ I D NO:
13) when the nucleotide at position 204 is C, and wherein probe
G does not bind to the nucleotide at position 204 of SEQ ID NO:
14 when the nucleotide at position 204 is T;
viii. probe H, wherein probe H binds to:
i. position 255 of target SEQ ID NO: 15 when the nucleotide at position 255 of SEQ ID NO: 15 is A, and wherein probe H does not bind to position 255 of SEQ ID NO: 15 when the nucleotide at position 255 is G; or
ii. position X of SEQ ID NO: 16 (the complement of SEQ ID NO:
15) when the nucleotide at position 255 is T, and wherein probe H does not bind to the nucleotide at position 255 of SEQ ID NO:
15 when the nucleotide at position 255 is C;
ix. probe I, wherein probe I binds to:
i. position 300 of target SEQ ID NO: 17 when the nucleotide at position 300 of SEQ ID NO: 17 is C, and wherein probe I does not bind to position 300 of SEQ ID NO: 17 when the nucleotide at position 300 is T; or
ii. position 300 of SEQ ID NO: 18 (the complement of SEQ I D NO:
17) when the nucleotide at position 300 is G, and wherein probe I does not bind to the nucleotide at position 300 of SEQ ID NO: 18 when the nucleotide at position 300 is A;
x. probe J, wherein probe J binds to: i. position 342 of target SEQ ID NO: 19 when the nucleotide at position 342 of SEQ ID NO: 19 is C, and wherein probe J does not bind to position 342 of SEQ ID NO: 19 when the nucleotide at position 342 is A or T; or
ii. position 342 of SEQ ID NO: 20 (the complement of SEQ I D NO:
19) when the nucleotide at position 342 is G, and wherein probe J does not bind to the nucleotide at position 342 of SEQ ID NO: 20 when the nucleotide at position 342 is T or A; xi. probe K, wherein probe K binds to:
i. position 252 of target SEQ ID NO: 21 when the nucleotide at position 252 of SEQ ID NO: 22 is T, and wherein probe K does not bind to position 252 of SEQ ID NO: 21 when the nucleotide at position 252 is G; or
ii. position 252 of SEQ ID NO: 22 (the complement of SEQ I D NO:
21 ) when the nucleotide at position 252 is A, and wherein probe K does not bind to the nucleotide at position 252 of SEQ ID NO: 22 when the nucleotide at position 252 is C;
xii. probe L, wherein probe L binds to:
i. position 9 of target SEQ ID NO: 23 when the nucleotide at position 9 of SEQ ID NO: 23 is C, and wherein probe L does not bind to position 9 of SEQ ID NO: 23 when the nucleotide at position 9 is T; or
ii. position 9 of SEQ ID NO: 24 (the complement of SEQ ID NO:
23) when the nucleotide at position 9 is G, and wherein probe L does not bind to the nucleotide at position 9 of SEQ ID NO: 24 when the nucleotide at position 9 is A;
allowing said series of probes to bind to their respective target nucleic acid sequence (or the complement thereof),
i. wherein the binding of one or more of said probes to its respective target nucleic acid sequence (or the complement thereof) indicates the presence of HG3 salmonella nucleic acid; and
ii. wherein the absence of binding of said series of probes to their respective target nucleic acid sequence (or the complement thereof) indicates the absence of HG3 salmonella nucleic acid; and
confirming the presence of HG3 salmonella nucleic acid by detecting the presence of one or more of said probes, or confirming the absence of HG3 salmonella nucleic acid by detecting the absence of said series to probes. Thus, the method of present invention provides a highly accurate (e.g. greater than 99%) and/ or rapid (e.g. within 48, 36 or 24 hours) screening system for confirming the presence or absence of HG3 salmonella. This represents a significant improvement over existing HG3 salmonella detection systems.
In one embodiment, all of the probes A-L are added substantially simultaneously and/ or contact the nucleic acid sample substantially simultaneously. This multiplex embodiment has the advantage of minimising the overall time required to complete the method of the invention.
In another embodiment, the method is performed in a cyclical manner, each cycle comprising the introduction of one or more of said probe(s) A-L. In this regard, each of said probes A-L may be added prior to, simultaneously with, or subsequent to one another.
The method may include a washing step (eg. between steps B and C) to remove unbound probe(s). In one embodiment, the detection step C may be performed on bound probe and/ or on unbound probe.
In one embodiment, the detection step C may be performed on unbound probe. By way of example, following initial probe binding and subsequent washing to remove unbound probe, the initially bound probe may be released (eg. by changing an assay parameter such as temperature and/ or pH), and the released (unbound) probe then detected.
The method may further comprise use of probe M, wherein probe M binds to:
i. position 67 of target SEQ I D NO: 25 when the nucleotide at position 67 of SEQ ID NO: 25 is A, and wherein probe M does not bind to position 67 of SEQ ID NO: 25 when the nucleotide at position 166 is G; or ii. position 67 of SEQ ID NO: 26 (the complement of SEQ ID NO: 25) when the nucleotide at position 67 is T, and wherein probe M does not bind to the nucleotide at position 67 of SEQ ID NO: 26 when the nucleotide at position 67 is C; In this regard, the binding of probe M to its target nucleic acid sequence (or the complement thereof), when in combination with the binding of any one of probes C-L to their respective target nucleic acid sequences (or the complement thereof), provides additional confirmation of the presence of HG3 salmonella nucleic acid.
The method of the present invention may further comprise confirming the presence or absence of HG2 salmonella nucleic acid. Said HG2 detection method may be performed prior to, simultaneously with, or subsequent to any one of the HG3 detection method steps.
In one embodiment, the presence of HG2 nucleic acid is confirmed by the binding (to sample nucleic acid) of one or more probe selected from the group consisting of probes N-U, wherein:
i. probe N binds to
i. position 385 of target SEQ ID NO: 27 when the nucleotide at position 385 of SEQ ID NO: 27 is T, and wherein probe N does not bind to position 385 of SEQ ID NO: 27 when the nucleotide at position 385 is C; or
ii. position 385 of SEQ I D NO: 28 (the complement of SEQ ID NO:
27) when the nucleotide at position 385 is A, and wherein probe N does not bind to the nucleotide at position 385 of SEQ ID NO: 28 when the nucleotide at position 385 is G;
ii. probe O binds to
i. position 501 of target SEQ ID NO: 29 when the nucleotide at position 501 of SEQ ID NO: 29 is C, and wherein probe O does not bind to position 501 of SEQ ID NO: 29 when the nucleotide at position 501 is T; or
ii. position 501 of SEQ ID NO: 30 (the complement of SEQ I D NO:
29) when the nucleotide at position 501 is G, and wherein probe O does not bind to the nucleotide at position 501 of SEQ ID NO: 30 when the nucleotide at position 501 is A;
iii. probe P binds to
i. position 271 of target SEQ ID NO: 31 when the nucleotide at position 271 of SEQ ID NO: 31 is G, and wherein probe P does not bind to position 271 of SEQ ID NO: 31 when the nucleotide at position 271 is T; or ii. position 271 of SEQ ID NO: 32 (the complement of SEQ ID NO: 31 ) when the nucleotide at position 271 is C, and wherein probe P does not bind to the nucleotide at position 271 of SEQ ID NO: 32 when the nucleotide at position 271 is A;
probe Q binds to
i. position 492 of target SEQ ID NO: 33 when the nucleotide at position 492 of SEQ ID NO: 33 is G, and wherein probe Q does not bind to position 492 of SEQ ID NO: 33 when the nucleotide at position 492 is A; or
ii. position 492 of SEQ ID NO: 34 (the complement of SEQ ID NO:
33) when the nucleotide at position 492 is C, and wherein probe Q does not bind to the nucleotide at position 492 of SEQ ID NO: 34 when the nucleotide at position 492 is T;
probe R binds to
i. position 271 of target SEQ ID NO: 35 when the nucleotide at position 271 of SEQ ID NO: 35 is T, and wherein probe R does not bind to position 271 of SEQ ID NO: 35 when the nucleotide at position 271 is G; or
ii. position 271 of SEQ ID NO: 36 (the complement of SEQ ID NO:
35) when the nucleotide at position 271 is A, and wherein probe R does not bind to the nucleotide at position 271 of SEQ ID NO: 36 when the nucleotide at position 271 is C;
probe S binds to
i. position 100 of target SEQ ID NO: 37 when the nucleotide at position 100 of SEQ ID NO: 37 is T, and wherein probe S does not bind to position 100 of SEQ ID NO: 37 when the nucleotide at position 100 is G; or
ii. position 100 of SEQ ID NO: 38 (the complement of SEQ ID NO:
37) when the nucleotide at position 100 is A, and wherein probe S does not bind to the nucleotide at position 100 of SEQ ID NO: 38 when the nucleotide at position 100 is C;
probe T binds to
i. position 210 of target SEQ ID NO: 39 when the nucleotide at position 210 of SEQ ID NO: 39 is T, and wherein probe T does not bind to position 210 of SEQ ID NO: 39 when the nucleotide at position 210 is A; or ii. position 210 of SEQ ID NO: 40 (the complement of SEQ ID NO:
39) when the nucleotide at position 210 is A, and wherein probe T does not bind to the nucleotide at position 210 of SEQ ID NO: 40 when the nucleotide at position 210 is T;
viii. probe U binds to
i. position 66 of target SEQ ID NO: 41 when the nucleotide at position 66 of SEQ ID NO: 41 is T, and wherein probe U does not bind to position 66 of SEQ ID NO: 41 when the nucleotide at position 66 is C; or
ii. position 66 of SEQ ID NO: 42 (the complement of SEQ ID NO:
41 ) when the nucleotide at position 66 is A, and wherein probe U does not bind to the nucleotide at position 66 of SEQ ID NO: 42 when the nucleotide at position 66 is G. In one embodiment, binding of one or more of probes A-B to sample nucleic acid confirms the presence of HG3 S. paratyphi A nucleic acid.
In one embodiment, binding of one or more of probes C-L (or probe M when in combination with any of probes C-L) to sample nucleic acid confirms the presence of HG3 S. paratyphi B/ Java nucleic acid.
In one embodiment, binding of one or more of probes J-K to sample nucleic acid confirms the presence of HG3 S. paratyphi C nucleic acid. In one embodiment, binding of probe L to sample nucleic acid confirms the presence of HG3 S. typhi nucleic acid.
In one embodiment, binding of one or more of probes N-P to sample nucleic acid confirms the presence of HG2 S. enteritidis nucleic acid.
In one embodiment, binding of one or more of probes Q-T to sample nucleic acid confirms the presence of HG2 S. typhimurium nucleic acid.
In one embodiment, binding of probe U to sample nucleic acid confirms the presence of HG2 S. choleraesuis nucleic acid. Accordingly, in a related aspect, the present invention includes a method of 'serotyping' salmonella bacterial species. In more detail, the aforementioned methods may be performed to confirm (or deny) the presence of one or more specific types of salmonella selected from HG3 S. paratyphi A nucleic acid (via probe A and/ or probe B), HG3 S. paratyphi B/ Java (via one or more, such as all, of probes C-L), HG3 S. paratyphi C nucleic acid (via probe J and/ or probe K), HG3 S. typhi nucleic acid (via probe L), HG2 S. enteritidis nucleic acid (via one or more, such as all, of probes N-P), HG2 S. typhimurium nucleic acid (via one or more, such as all, of probes Q-T) and/ or HG2 S. choleraesuis (via probe U).
In other words, the present invention provides probes for serotyping salmonella species/ sub-types. By way of example, probe A and/ or probe B may be used to serotype S. paratyphi A; one or more, such as all, of probes C-L may be used to serotype HG3 S. paratyphi B/ Java; probe J and/ or probe K may be used to serotype HG3 S. paratyphi C; probe L may be used to serotype HG3 S. typhi; one or more, such as all, of probes N-P may be used to serotype HG2 S. enteritidis; one or more, such as all, of probes Q-T may be used to serotype HG2 S. typhimurium; and/ or probe U may be used to serotype HG2 S. choleraesuis. The probes A-U of the present invention bind to a defined region on the target nucleic acid SEQ ID NOs (or complement thereof) as specified above, wherein said defined region includes the nucleic acid residue at the particular position as specified above.
Thus, for example, the probe A binds to a region on SEQ ID NO: 1 that includes the nucleic acid residue A at position 328. However, the same probe A does not bind to the same region on SEQ ID NO: 1 when the nucleic acid residue at position 328 is G. Similarly, the probe A binds to a region on SEQ I D NO: 2 (ie. the complement of SEQ ID NO: 1 ) that includes the nucleic acid residue T at position 328. However, the same probe A does not bind to the same region on SEQ ID NO: 2 when the nucleic acid residue at position 328 is C.
The same applies to probes B-U, though naturally with reference to the different SEQ ID NOs and nucleotide residues/ positions as specified above. Thus, for example, the probe B binds to a region on SEQ ID NO: 3 that includes the nucleic acid residue T at position 136. However, the same probe B does not bind to the same region on SEQ ID NO: 3 when the nucleic acid residue at position 136 is C. Similarly, the probe B binds to a region on SEQ ID NO: 4 (ie. the complement of SEQ ID NO: 3) that includes the nucleic acid residue A at position 136. However, the same probe B does not bind to the same region on SEQ ID NO: 4 when the nucleic acid residue at position 136 is G.
Each of probes A-U typically comprises a sequence of 9-30 (or longer) consecutive nucleotides. It is this sequence that binds the probe to the defined region on the respective target nucleic acid SEQ ID NO (or complement thereof) as specified above. The probes may contain additional nucleotides, though said additional nucleotides need not necessarily contribute to the binding of the probe to its target nucleic acid (or complement thereof). In this regard, additional nucleotides may be included to help provide stability to the probe. Similarly, additional nucleotides may be added (typically to the 5' end) to provide a labelling means for downstream detection.
By way of example, each of probes A-U may comprise a sequence of at least 9, at least 12, at least 15, at least 18, at least 21 , at least 24, or at least 27 consecutive nucleotides that bind the probe to its target nucleic acid (or complement thereof). Alternatively, each of the probes A-U may comprise at most 30, at most 27, at most 24, at most 21 , at most 18, at most 15, or at most 12 consecutive nucleotides that bind the probe to its target nucleic acid (or complement thereof). Each of probes A-U binds to a defined region on its respective target nucleic acid SEQ ID NO (or complement thereof) as specified above. In this regard, it is self-explanatory that the length of the defined region typically matches the length of the probe. Thus, for a probe comprising a sequence of say 20 consecutive nucleotides that bind the probe to the target nucleic acid sequence (or complement thereof), said probe will typically bind to a corresponding defined region of 20 consecutive nucleotides on the target sequence (or complement thereof).
Each of probes A-U binds to its respective target nucleic acid SEQ ID NO (or complement thereof) as specified above. In order to do so, each of said probes is complementary in sequence to its respective target nucleic acid SEQ ID NO (or complement thereof). In particular, each of probes A-U includes a key nucleotide that forms a natural base-pairing with the particular nucleotide (at the particular position) as specified above for each target nucleic acid SEQ ID NO (or complement thereof). Reference to a natural base-pairing means that, when the specified nucleotide on the target SEQ ID NO (or complement thereof) is A, then the probe contains a key nucleotide T that binds the probe to the A nucleotide present on the target SEQ ID NO (or complement thereof). Similarly, when the specified nucleotide on the target SEQ ID NO (or complement thereof) is T, then the probe will contain a key nucleotide A that binds the probe to the T nucleotide present on the target SEQ ID NO (or complement thereof). Also, when the specified nucleotide on the target SEQ ID NO (or complement thereof) is G, then the probe will contain a key nucleotide C that binds the probe to the G nucleotide present on the target SEQ ID NO (or complement thereof). Similarly, when the specified nucleotide on the target SEQ ID NO (or complement thereof) is C, then the probe will contain a key nucleotide G that binds the probe to the C nucleotide present on the target SEQ ID NO (or complement thereof).
Probes A-U have a 5'-to-3' orientation. In one embodiment, the key nucleotide may be located at any position within the probe.
In another embodiment, the key nucleotide may be located within 5 nucleotides of the 3' end of the probe. For example, the key nucleotide may be located at the 3' end (ie. the extreme 3' nucleotide position) of the probe. Alternatively, the key nucleotide may be located 1 , 2, 3, or 4 nucleotides in (ie. in a 5' direction) from the extreme 3' nucleotide position of the probe. In one embodiment, each of the probes A-U comprises (or consists of) a nucleotide sequence having at least 80% identity to (eg. at least 82, 83, 84, 85, 86, 87, 88, 89, 90 91 , 92, 93, 94, 95, 96, 97, 98, 99 or 100% identity to) a nucleotide sequence selected from SEQ ID NOs: 395-436 (as shown in Table 1 ), though with the proviso that the key nucleotide (underlined) is not changed. Conservative substitutions are preferred. The present invention also embraces shorter probe sequences thereof (as described above), though with the proviso that the key nucleotide (underlined) is not changed.
Table 1
Figure imgf000014_0001
398 B C CTCGGRCCACGATGATGA
399 C T TAACTCGTGTGGCGGTCC
400 C c ATTG AG C AC AC C G C C AG G
401 D T GAGCCRGGTGCTACTACC
402 D c CTCGGRCCACGATGATGG
403 E T TCCCAGGCTGAGCACGGC
404 E c AGGGTCCGACTCGTGCCG
405 F T GATGAAATCCTCTATGCGGCA
406 F c CTACTTTAG GAG AT AC G C C GT
407 G T GCCTTTGTAACCGAAGCCAAG
408 G c CGGAAACATTGGCTTCGGTTC
409 H T TCGACGAATTTGCCGGTAAA
410 H c AG CTG CTTAAAC G G C C ATTT
411 I T GACGTGAAGTATCACATGGGT
412 I c CTGCACTTCATAGTGTACCCA
413 J T TATGCTGTTTGAAACGGAAGGC
414 J c ATACGACAAACTTTGCCTTCCG
415 K T GTTCGACGAATTTGCCGGT
416 K c CAAGCTGCTTAAACGGCCA
417 L T GCGATTAAAGATGTTTTTCGC
418 L c C G CTAATTTCTAC AAAAAG C G
419 M T TCCATTCCCATTACCCAAGA
420 M c AGGTAAGGGTAATGGGTTCT 421 N T C CTGATG CTG AC ATTG CCT
422 N c GGACTACGACTGTAACGGA
423 0 T AAGATTTAGCGCAGTGCGTC
424 0 c TTCTAAATCGCGTCACGCAG
425 P T GGATAGCCTCTACTCCATCG
426 P c CCTATCGGAGATGAGGTAGC
427 Q T CAGAATCGCGATGAAATCACG
428 Q c GTCTTAGCGCTACTTTAGTGC
429 R T CGATGAAGCGCCTGATTGAAT
430 R c GCTACTTCGCG G ACTAACTTA
431 S T ACGAAACTGGCCTGYGT
432 S c TG CTTT G AC C G G AC YC A
433 T T CCTTCACCCTCGCACCATT
434 T c GGAAGTGGGAGCGTGGTAA
435 u T CTGCGCGATTACCGTGGT
436 u c GACGCGCTAATGGCACCA
Alternatively, each of the probes A-U comprises (or consists of) a nucleotide sequence as shown in Table 1 , though differing at no more than 5 nucleotide positions (for example at no more than 4, 3, 2 or 1 nucleotide position), though again with the proviso that the key nucleotide (underlined) is not changed. Conservative substitutions are preferred. The present invention also embraces shorter probe sequences thereof (as described above), though with the proviso that the key nucleotide (underlined) is not changed. The probes of the invention bind to critical nucleotides (present at specific positions) on genes of HG3 (and/ or HG2) salmonella. The sequences of said genes across the group of HG3 (and/ or HG2) salmonella bacteria exhibit minor strain-to-strain variations. However, the critical nucleotides targeted by the present invention remain constant across strains identified to-date. By way of example, probe A binds to 328A of SEQ ID NO: 1 , but not when the nucleotide at position 328 is G. In this regard, SEQ ID NO: 1 corresponds to a S. paratyphi A aroC gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 43-52. Thus, probe A also binds to 328A of any of SEQ ID NOs: 43-52, but not when the nucleotide at position 328 is G. The same applies to the complement of SEQ ID NO: 1 (ie. SEQ ID NO: 2) in that SEQ ID NOs: 53-62 represent strain variants of SEQ ID NO: 2. Thus, probe A also binds to 328T of any of SEQ ID NOs: 53- 62, but not when the nucleotide at position 328 is C.
Probe B binds to 136T of SEQ ID NO: 3, but not when the nucleotide at position 136 is C. In this regard, SEQ ID NO: 3 corresponds to a S. paratyphi A thr A gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 63-72. Thus, probe B also binds to 136T of any of SEQ ID NOs: 63-72, but not when the nucleotide at position 136 is C. The same applies to the complement of SEQ ID NO: 3 (ie. SEQ ID NO: 4) in that SEQ ID NOs: 73-82 represent strain variants of SEQ ID NO: 4. Thus, probe B also binds to 136A of any of SEQ ID NOs: 73-82, but not when the nucleotide at position 136 is G.
Probe C binds to 378C of SEQ ID NO: 5, but not when the nucleotide at position 378 is T. In this regard, SEQ ID NO: 5 corresponds to a S. paratyphi B/ java dnaN gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 83-92. Thus, probe C also binds to 378C of any of SEQ ID NOs: 83-92, but not when the nucleotide at position 378 is T. The same applies to the complement of SEQ ID NO: 6 (ie. SEQ ID NO: 5) in that SEQ ID NOs: 93-102 represent strain variants of SEQ ID NO: 5. Thus, probe C also binds to 378G of any of SEQ ID NOs: 93-102, but not when the nucleotide at position 378 is A.
Probe D binds to 57C of SEQ ID NO: 7, but not when the nucleotide at position 57 is T. In this regard, SEQ ID NO: 7 corresponds to a S. paratyphi B/ java dnaN gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 103- 1 12. Thus, probe D also binds to 57C of any of SEQ ID NOs: 103-1 12, but not when the nucleotide at position 57 is T. The same applies to the complement of SEQ ID NO: 7 (ie. SEQ ID NO: 8) in that SEQ ID NOs: 1 13-122 represent strain variants of SEQ ID NO: 7. Thus, probe D also binds to 57G of any of SEQ ID NOs: 1 13-122, but not when the nucleotide at position 57 is A.
Probe E binds to 339C of SEQ ID NO: 9, but not when the nucleotide at position 339 is T. In this regard, SEQ ID NO: 9 corresponds to a S. paratyphi B/ java hisD gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 123- 132. Thus, probe E also binds to 339C of any of SEQ ID NOs: 123-132, but not when the nucleotide at position 339 is T. The same applies to the complement of SEQ ID NO: 9 (ie. SEQ ID NO: 10) in that SEQ ID NOs: 133-142 represent strain variants of SEQ ID NO: 10. Thus, probe E also binds to 339G of any of SEQ ID NOs: 133-142, but not when the nucleotide at position 339 is A.
Probe F binds to 69A of SEQ ID NO: 1 1 , but not when the nucleotide at position 69 is G. In this regard, SEQ ID NO: 1 1 corresponds to a S. paratyphi B/ java hisD gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 143- 152. Thus, probe F also binds to 60A of any of SEQ ID NOs: 143-152, but not when the nucleotide at position 69 is G. The same applies to the complement of SEQ ID NO: 1 1 (ie. SEQ ID NO: 12) in that SEQ ID NOs: 153-162 represent strain variants of SEQ ID NO: 12. Thus, probe F also binds to 69T of any of SEQ ID NOs: 153-162, but not when the nucleotide at position 69 is C.
Probe G binds to 204G of SEQ ID NO: 13, but not when the nucleotide at position 204 is A. In this regard, SEQ ID NO: 13 corresponds to a S. paratyphi B/ java hisD gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 163- 172. Thus, probe G also binds to 204G of any of SEQ ID NOs: 163-172, but not when the nucleotide at position 204 is A. The same applies to the complement of SEQ ID NO: 13 (ie. SEQ ID NO: 14) in that SEQ ID NOs: 173-182 represent strain variants of SEQ ID NO: 14. Thus, probe G also binds to 204C of any of SEQ ID NOs: 173-182, but not when the nucleotide at position 204 is T.
Probe H binds to 255A of SEQ I D NO: 15, but not when the nucleotide at position 255 is G. In this regard, SEQ ID NO: 15 corresponds to a S. paratyphi B/ java sucA gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 183- 192. Thus, probe H also binds to 255A of any of SEQ I D NOs: 183-192, but not when the nucleotide at position 255 is G. The same applies to the complement of SEQ ID NO: 15 (ie. SEQ ID NO: 16) in that SEQ ID NOs: 193-202 represent strain variants of SEQ ID NO: 16. Thus, probe H also binds to 255T of any of SEQ ID NOs: 193-202, but not when the nucleotide at position 255 is C.
Probe I binds to 300C of SEQ ID NO: 17, but not when the nucleotide at position 300 is T. In this regard, SEQ ID NO: 17 corresponds to a S. paratyphi B/ java sucA gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 203- 212. Thus, probe I also binds to 300C of any of SEQ ID NOs: 203-212, but not when the nucleotide at position 300 is T. The same applies to the complement of SEQ ID NO: 17 (ie. SEQ ID NO: 18) in that SEQ ID NOs: 213-222 represent strain variants of SEQ ID NO: 18. Thus, probe I also binds to 300G of any of SEQ ID NOs: 213-222, but not when the nucleotide at position 300 is A.
Probe J binds to 342C of SEQ ID NO: 19, but not when the nucleotide at position 342 is A or T. In this regard, SEQ ID NO: 19 corresponds to a S. paratyphi C dnaN gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 223- 232. Thus, probe J also binds to 342C of any of SEQ ID NOs: 223-232, but not when the nucleotide at position 342 is A or T. The same applies to the complement of SEQ ID NO: 19 (ie. SEQ ID NO: 20) in that SEQ ID NOs: 233-242 represent strain variants of SEQ I D NO: 20. Thus, probe J also binds to 342G of any of SEQ ID NOs: 233-242, but not when the nucleotide at position 342 is T or A.
Probe K binds to 252T of SEQ ID NO: 21 , but not when the nucleotide at position 252 is G. In this regard, SEQ ID NO: 21 corresponds to a S. paratyphi C sucA gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 243- 252. Thus, probe K also binds to 352T of any of SEQ ID NOs: 243-252, but not when the nucleotide at position 252 is G. The same applies to the complement of SEQ ID NO: 21 (ie. SEQ ID NO: 22) in that SEQ ID NOs: 253-262 represent strain variants of SEQ ID NO: 22. Thus, probe K also binds to 252A of any of SEQ ID NOs: 253-262, but not when the nucleotide at position 252 is C.
Probe L binds to 9C of SEQ ID NO: 23, but not when the nucleotide at position 9 is T. In this regard, SEQ ID NO: 23 corresponds to a S. paratyphi C aroC gene sequence, and strain variants of said sequence are known in the art. Thus, probe L also binds to 9C of any of said strain variant SEQ ID NOs, but not when the nucleotide at position 9 is T The same applies to the complement of SEQ ID NO: 23 (ie. SEQ ID NO: 24) in that strain variants of SEQ ID NO: 24 are known in the art. Thus, probe L also binds to 9G of any of said variant SEQ ID NOs, but not when the nucleotide at position 9 is A. Probe M binds to 67A of SEQ ID NO: 25, but not when the nucleotide at position 67 is G. In this regard, SEQ ID NO: 25 corresponds to a S. paratyphi B tartrate gene sequence, and strain variants of said sequence are illustrated, for example, in SEQ ID NO: 263. Thus, probe M also binds to 67A of any of said variant SEQ ID NOs, but not when the nucleotide at position 67 is G. The same applies to the complement of SEQ ID NO: 25 (ie. SEQ ID NO: 26) in that strain variants of SEQ ID NO: 26 are known in the art, for example SEQ ID NO: 264. Thus, probe M also binds to 67T of any of strain variant SEQ ID NOs, but not when the nucleotide at position 67 is C.
Probe N binds to 385T of SEQ ID NO: 27, but not when the nucleotide at position 385 is C. In this regard, SEQ ID NO: 27 corresponds to a S. enteritidis hisD gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 265- 267. Thus, probe N also binds to 385T of any of SEQ ID NOs: 265-267, but not when the nucleotide at position 385 is C. The same applies to the complement of SEQ ID NO: 27 (ie. SEQ ID NO: 28) in that SEQ ID NOs: 268-270 represent strain variants of SEQ ID NO: 28. Thus, probe N also binds to 385A of any of SEQ ID NOs: 268-270, but not when the nucleotide at position 385 is G. Probe O binds to 501 C of SEQ ID NO: 29, but not when the nucleotide at position 501 is T. In this regard, SEQ ID NO: 29 corresponds to a S. enteritidis hisD gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 271 - 280. Thus, probe O also binds to 501 C of any of SEQ ID NOs: 271 -280, but not when the nucleotide at position 501 is T. The same applies to the complement of SEQ ID NO: 29 (ie. SEQ ID NO: 30) in that SEQ ID NOs: 281 -290 represent strain variants of SEQ ID NO: 30. Thus, probe O also binds to 501 G of any of SEQ ID NOs: 281 -290, but not when the nucleotide at position 501 is A.
Probe P binds to 271 G of SEQ I D NO: 31 , but not when the nucleotide at position 271 is T. In this regard, SEQ ID NO: 31 corresponds to a S. enteritidis purE gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 291 - 300. Thus, probe P also binds to 271 G of any of SEQ ID NOs: 291 -300, but not when the nucleotide at position 271 is T. The same applies to the complement of SEQ ID NO: 31 (ie. SEQ ID NO: 32) in that SEQ ID NOs: 301 -310 represent strain variants of SEQ ID NO: 32. Thus, probe P also binds to 271 C of any of SEQ ID NOs: 301 -310, but not when the nucleotide at position 271 is A. Probe Q binds to 492G of SEQ ID NO: 33, but not when the nucleotide at position 492 is A. In this regard, SEQ ID NO: 33 corresponds to a S. typhimurium aroC gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 31 1 - 320. Thus, probe Q also binds to 492G of any of SEQ ID NOs: 31 1 -320, but not when the nucleotide at position 492 is A. The same applies to the complement of SEQ ID NO: 33 (ie. SEQ ID NO: 34) in that SEQ ID NOs: 321 -330 represent strain variants of SEQ ID NO: 34. Thus, probe Q also binds to 492C of any of SEQ ID NOs: 321 -330, but not when the nucleotide at position 492 is T. Probe R binds to 271 T of SEQ ID NO: 35, but not when the nucleotide at position 271 is G. In this regard, SEQ ID NO: 35 corresponds to a S. typhimurium dnaN gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 331 - 338. Thus, probe R also binds to 271 T of any of SEQ ID NOs: 331 -338, but not when the nucleotide at position 271 is G. The same applies to the complement of SEQ ID NO: 35 (ie. SEQ ID NO: 36) in that SEQ ID NOs: 339-346 represent strain variants of SEQ ID NO: 36. Thus, probe R also binds to 271A of any of SEQ ID NOs: 339-346, but not when the nucleotide at position 271 is C.
Probe S binds to 100T of SEQ ID NO: 37, but not when the nucleotide at position 100 is G. In this regard, SEQ ID NO: 37 corresponds to a S. typhimurium hemD gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 347- 356. Thus, probe S also binds to 100T of any of SEQ ID NOs: 347-356, but not when the nucleotide at position 100 is G. The same applies to the complement of SEQ ID NO: 37 (ie. SEQ ID NO: 38) in that SEQ ID NOs: 357-366 represent strain variants of SEQ ID NO: 38. Thus, probe S also binds to 100A of any of SEQ ID NOs: 357-366, but not when the nucleotide at position 100 is C.
Probe T binds to 210T of SEQ ID NO: 39, but not when the nucleotide at position 210 is A. In this regard, SEQ ID NO: 39 corresponds to a S. typhimurium thrA gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 367- 376. Thus, probe T also binds to 210T of any of SEQ ID NOs: 367-376, but not when the nucleotide at position 201 is A. The same applies to the complement of SEQ ID NO: 39 (ie. SEQ ID NO: 40) in that SEQ ID NOs: 377-386 represent strain variants of SEQ ID NO: 40. Thus, probe T also binds to 210A of any of SEQ ID NOs: 377-386, but not when the nucleotide at position 210 is T. Probe U binds to 66T of SEQ ID NO: 41 , but not when the nucleotide at position 66 is C. In this regard, SEQ ID NO: 41 corresponds to a S. cholera aroC gene sequence, and strain variants of said sequence are illustrated in SEQ ID NOs: 387-390. Thus, probe U also binds to 66T of any of SEQ ID NOs: 387-390, but not when the nucleotide at position 66 is C. The same applies to the complement of SEQ ID NO: 41 (ie. SEQ ID NO: 42) in that SEQ ID NOs: 391 -394 represent strain variants of SEQ ID NO: 42. Thus, probe U also binds to 66A of any of SEQ ID NOs: 391 -394, but not when the nucleotide at position 66 is G. The probes of the invention are designed to hybridise to their target (or complement) sequence. It is preferred that the binding conditions are such that a high level of specificity is provided - ie. hybridisation of the probe occurs under "stringent conditions". In general, stringent conditions are selected to be about 5°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target (or complement) sequence hybridises to a perfectly matched probe. In this regard, the Tm of probes of the present invention, at a salt concentration of about 0.02M or less at pH 7, is for example above 60°C, such as about 70°C. Premixed binding solutions are commercially available (eg. EXPRESSHYB Hybridisation Solution from CLONTECH Laboratories, Inc.), and hybridisation can be performed according to the manufacturer's instructions.
It is preferable to screen the probes to minimise self-complementarity and dimer formation (probe-probe binding). Preferred probes of the present invention are selected so as to have minimal homology with human DNA. The selection process may involve comparing a candidate probe sequence with human DNA and rejecting the probe if the homology is greater than 50%. The aim of this selection process is to reduce annealing of probe to contaminating human DNA sequences and hence allow improved specificity of the assay.
Any of the probes described herein may comprise a tag and/ or label. The tag and/ or label may, for example, be located (independently of one another) towards the middle or towards or at the 5' or 3' end of the herein described probes, for example at the 5' end. Hence, following hybridisation of tagged/ labelled probe to target nucleic acid (or complement thereof), the tag/ label is associated with the HG3 target nucleic acid (or complement thereof). Alternatively (or in addition), the probes may act as primers during the method of the invention and the tag/ label may therefore become incorporated into the amplification product as the primer is extended.
Examples of suitable labels include detectable labels such as radiolabels or fluorescent or coloured molecules, enzymatic markers or chromogenic markers - eg. dyes that produce a visible colour change upon hybridisation of the probe.. By way of example, the label may be digoxygenin, fluorescein-isothiocyanate (FITC), R-phycoerythrin, Alexa 532 or Cy3. The label may be a reporter molecule, which is detected directly, such as by exposure to photographic or X-ray film. Alternatively, the label is not directly detectable, but may be detected indirectly, for example, in a two-phase system. An example of indirect label detection is binding of an antibody to the label.
Examples of suitable tags include "complement/ anti-complement pairs". The term "complement/ anti-complement pair" denotes non-identical moieties that form a non- covalently associated, stable pair under appropriate conditions. Examples of suitable tags include biotin and streptavidin (or avidin). By way of example, a biotin tag may be captured using streptavidin, which may be coated onto a substrate or support such as a bead (for example a magnetic bead) or membrane. Likewise, a streptavidin tag may be captured using biotin, which may be coated onto a substrate or support such as a bead (for example a magnetic bead) or membrane. Other exemplary complement/ anti-complement pairs include receptor/ ligand pairs, antibody/ antigen (or hapten or epitope) pairs, and the like. Another example is a nucleic acid sequence tag that binds to a complementary sequence. The latter may itself be pre-labelled, or may be attached to a surface (eg. a bead) which is separately labelled. An example of the latter embodiment is the well-known LuminexR bead system. Other exemplary pairs of tags and capture molecules include receptor/ ligand pairs and antibody/ antigen (or hapten or epitope) pairs. Where subsequent dissociation of the complement/ anti- complement pair is desirable, the complement/ anti-complement pair has a binding affinity of, for example, less than 109 M"1.
One or more of probes A-U may be labelled with different labels or tags, thereby allowing separate identification of each probe when used in the method of the present invention. By way of example, one or more of probes A-U may be labelled with different nucleic acid sequences (aka 'sequence tags' or 'sequence labels'). Said different nucleic acid sequence labels may be attached, for example, to the 5' end of the defined sequence of the probe that binds to the target nucleic acid SEQ ID NO (or complement thereof), thereby allowing separate identification of each probe when used in the method of the present invention. In one embodiment, each of said nucleic acid tags is able to bind to a different, complementary nucleic acid sequence present on a surface (eg. the surface of a bead). Thus, by use of discrete surfaces (eg. beads), which are uniquely identifiable (eg. via fluorescence at a specific wavelength), it is routine to identify the present of target (or complement) nucleic acid when bound to said surfaces (eg. beads). Examples of such nucleic acid tags (plus the complementary binding partner) are illustrated in Table 2.
Table 2
Figure imgf000024_0001
Any conventional method may be employed to attach nucleic acid tags to a probe of the present invention (eg. to the 5' end of the defined binding region of the probe). Alternatively, nucleic acid probes of the invention (with pre-attached nucleic acid tags) may be constructed by commercial providers. Examples of probes of the invention to which tags illustrated in Table 2 have been attached are illustrated in Table 3.
Figure imgf000024_0002
5'
443 CTATCTATCTAACTATCTATATCAGCGCGAAAGTGACGGTGA
3'
5'
444 TCATTCATATACATACCAATTCATTGCCGGACGCCAGGCTGC
3'
5'
445 TCATTTACCAATCTTTCTTTATACGAGCCRGGTGCTACTACC
3'
The sample is for example a clinical sample (or is derived from a clinical sample) such as: faeces or blood, sputum, bronchoalveolar lavage, tracheal aspirate, lung tissue samples, cerebrospinal fluid, archaeological samples.
If desired, an amplification step may be introduced. In this regard, amplification may be carried out using methods and platforms known in the art, for example PCR, such as real-time PCR, block-based PCR, ligase chain reaction, glass capillaries, isothermal amplification methods including loop-mediated isothermal amplification, rolling circle amplification transcription mediated amplification, nucleic acid sequence-based amplification, signal mediated amplification of RNA technology, strand displacement amplification, isothermal multiple displacement amplification, helicase-dependent amplification, single primer isothermal amplification, and circular helicase-dependent amplification.
In one embodiment, amplification can be carried using any amplification platform - as such, an advantage of this embodiment of the assay is that it is platform independent and not tied to any particular instrument. In one embodiment, a general amplification step (eg. pre-detection) may be employed to increase the amount of target/ complement nucleic acid present in the sample. In this embodiment, PCR amplification primers are typically employed to amplify approximately 100-400 base pair regions of the target/ complementary nucleic acid that contain the HG3 (and/ or HG2) nucleotide targets of the present invention. In the presence of a suitable polymerase and DNA precursors (dATP, dCTP, dGTP and dTTP), forward and reverse primers are extended in a 5' to 3' direction, thereby initiating the synthesis of new nucleic acid strands that are complementary to the individual strands of the target/ complementary HG3 nucleic acid. The primers thereby drive amplification of HG3 (and/ or HG2) nucleic acid sequences, thereby generating amplification products comprising said HG3 (and/ or HG2) nucleic acid sequences.
In one embodiment, an amplification step may be employed in which the probes of the present invention act as primers. In this embodiment, the probes (acting as primers) are extended from their 3' ends (ie. in a 5'-to-'3') direction. The resulting amplification products typically comprise 100-400 base pair regions of the target/ complementary nucleic acid. This embodiment may be employed in conjunction with a general amplification step, such as the one described above.
The detection step may be carried out by any known means. In this regard, the probe or amplification product may be tagged and/ or labelled, and the detection method may therefore comprise detecting said tag and/ or label. In one embodiment, the probe(s) may comprise a tag and/ or label. Thus, in one embodiment, following hybridisation of tagged/ labelled probe to target (or complement) nucleic acid, the tag/ label becomes associated with the target (or complement) nucleic acid. Thus, in one embodiment, the assay may comprise detecting the tag/ label and correlating presence of tag/ label with presence of HG3.
In one embodiment, tag and/ or label may be incorporated during extension of the probe(s). In doing so, the amplification product(s) become tagged/ labelled, and the assay may therefore comprise detecting the tag/ label and correlating presence of tag/ label with presence of amplification product, and hence the presence of HG3 (and/ or HG2).
By way of example, in one embodiment, the amplification product may incorporate a tag/ label (eg. via a tagged/ labelled dNTP such as biotin-dNTP) as part of the amplification process, and the assay may further comprise the use of a binding partner complementary to said tag (eg. streptavidin) that includes a detectable tag/ label (eg. a fluorescent label, such as R-phycoerythrin). In this way, the amplified product incorporates a detectable tag/ label (eg. a fluorescent label, such as R-phycoerythrin). In one embodiment, the probe(s) and/ or the amplification product(s) may include a further tag/ label (as the complement component) to allow capture of the amplification product(s).
By way of example, a "complement/ anti-complement" pairing may be employed in which an anti-complement capture component binds to said further tag/ label (complement component) and thereby permits capture of the probe(s) and/ or amplification product(s). Examples of suitable "complement/ anti-complement" partners have been described earlier in this specification, such as a complementary pair of nucleic acid sequences, a complementary antibody-antigen pair, etc.
The anti-complement capture component may be attached (eg. coated) on to a substrate or solid support - examples of suitable substrates/ supports include membranes and/ or beads (eg. a magnetic or fluorescent bead). Capture methods are well known in the art. For example, LuminexR beads may be employed. Alternatively, the use of magnetic beads may be advantageous because the beads (plus captured, tagged/ labelled amplification product) can easily be concentrated and separated from the sample, using conventional techniques known in the art.
Immobilisation provides a physical location for the anti-complement capture component (or probes), and may serve to fix the capture component/ probe at a desired location and/ or facilitate recovery or separation of probe. The support may be a rigid solid support made from, for example, glass or plastic, such as a bead (for example a fluorescent or magnetic bead). Alternatively, the support may be a membrane, such as nylon or nitrocellulose membrane. 3D matrices are also suitable supports for use with the present invention - eg. polyacrylamide or PEG gels.
Immobilisation to a support/ platform may be achieved by a variety of conventional means. By way of example, immobilisation onto a support such as a nylon membrane may be achieved by UV cross-linking. Alternatively, biotin-labelled molecules may be bound to streptavidin-coated substrates (and vice-versa), and molecules prepared with amino linkers may be immobilised on to silanised surfaces. Another means of immobilisation is via a poly-T tail or a poly-C tail, for example at the 3' or 5' end.
In one embodiment of the present invention, probes A-L (optionally also probe M; optionally also probes N-U with or without probe M) of the invention comprise a nucleic acid sequence tag/ label (eg. attached to each probe at the 5' end of the defined sequence of the probe that binds to target/ complement nucleic acid). In more detail, each of the probes is provided with a different nucleic acid sequence tag/ label, wherein each of said tags/ labels (specifically) binds to a complementary nucleic acid sequence present on the surface of a bead.
Each of the different tags/ labels binds to its complementary sequence counterpart (and not to any of the complementary sequence counterparts of the other tags), which is located on a uniquely identifiable bead. In this regard, the beads are uniquely identifiable, for example by means of fluorescence at a specific wavelength. Thus, in use, probes of the invention bind to target/ complement nucleic acid (if present in the sample). Thereafter, (only) the bound probes may be extended (in the 3' direction) in the presence of one or more labelled dNTP (eg. biotin labelled dNTPs, such as biotin- dCTPs).
The extended primers may be contacted with a binding partner counterpart to the labelled dNTPs (eg. a streptavidin labelled flurophore, such as streptavidin labelled R- phycoerythrin), which binds to those labelled dNTPs that have become incorporated into the extended primers. Thereafter, the labelled extended primers may be identified by allowing them to bind to their nucleic acid counterparts present on the uniquely identifiable beads. The latter may then be "called" (eg. to determine the type of bead present by wavelength emission) and the nature of the primer extension (and thus the type of target/ complement nucleic acid present) may be determined. A simple schematic of one such embodiment is illustrated in Figure 1 . In more detail, for simplicity only two HG3 detection probes are illustrated - for illustrative purposes only, the first probe has a 3' nucleotide residue designated G and the second probe has a 3' nucleotide residue designated A. Each of the probes has a different 5' nucleic acid sequence tag. When both probes are challenged with a test sample, Figure 1 illustrates a scenario in which only the second probe binds to salmonella nucleic acid present in the test sample. In contrast, the first probe does not bind to any nucleic acid present in the test sample. The bound second probe is then extended in a 3' direction, and fluorescent label is incorporated as the probe becomes extended. The different 5' nucleic acid sequence tags present on each probe are then relied on to bind the respective probes to different capture beads (by complementary binding partner sequences present ion the beads). Thereafter, presence of the extended second primer is detected, which confirms the presence of HG3 salmonella nucleic acid. SEQ ID NOs
SEQ ID NO: 1 (AROC1 )
GTTTTTCGCCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGCGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACTTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTT AACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 2 (AROC1 )
CAAAAAGCGGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCGCCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGAACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACGGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCT GACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGC ACCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 3 (THRA107)
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCTGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGACGCCAGGCTGTTGAAATCGATGTC CTACCAGGAAGCGATGGAGCTCTCTTACTTCGGCGCCAAAGTCCTTCACCCTCGCACCATAACGCCTATCGCCCA GTTCCAGATCCCCTGTCTGATTAAAAATACCGGCAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGA CGATGATAATCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATG AAAGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACC CAGTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC
SEQ ID NO: 4 (THRA107)
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGGCACGACCGACGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTGCGGTCCGACAACTTTAGCTACAG GATGGTCCTTCGCTACCTCGAGAGAATGAAGCCGCGGTTTCAGGAAGTGGGAGCGTGGTATTGCGGATAGCGGG TCAAGGTCTAGGGGACAGACTAATTTTTATGGCCGTTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCGC TGCTACTATTAGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTTA CTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAATG GGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG SEQ ID NO: 5 (DNAN101 )
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGCCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCATTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGTGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 6 (DNAN101 )
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCG GTGGCGGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTAAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCACCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 7 (DNAN1 )
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAGGCGTCTTTACCTAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGTGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 8 (DNAN1 )
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAACGCTGGCTGCCGG TGGCAGACCGCCACACGAGTTACGGCGACCTCCGCAGAAATGGATCGGTGAGCCACTAACACGGCGCATTTCCG CACTAACTTGACTACGCATACGAGCTGCCACCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 9 (HISD1 )
ATTGCGGGATGCCAGAAGGTGGTTCTGTGCTCGCCGCCACCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTGCTGGTGATCGCCGACAGCGGCGCAACACCG GATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGCCCGGATTCCCAGGTGATCCTGCTGACGCCGGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGGCACCGCCC GGCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 10 (HISD1 )
TAACGCCCTACGGTCTTCCACCAAGACACGAGCGGCGGTGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCAGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCACGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCGGGCCTAAGGGTCCACTAGGACGACTGCGGCCTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCCGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 1 1 (HISD102)
ATTGCGGGATGCCAGAACGTGATTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAACGCCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCGCAGGTGATCCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTTGCAGAACTGCCGCGCGCGGACACTGCCCGG CAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC SEQ ID NO: 12 (HISD102)
TAACGCCCTACGGTCTTGCACTAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTTGCGGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGCGTCCACTAGGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGAACGTCTTGACGGCGCGCGCCTGTGACGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 13 (HISD1 )
ATTGCGGGATGCCAGAAGGTGGTTCTGTGCTCGCCGCCACCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTGCTGGTGATCGCCGACAGCGGCGCAACACCG GATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGCCCGGATTCCCAGGTGATCCTGCTGACGCCGGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGGCACCGCCC GGCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 14 (HISD1 )
TAACGCCCTACGGTCTTCCACCAAGACACGAGCGGCGGTGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCAGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCACGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCGGGCCTAAGGGTCCACTAGGACGACTGCGGCCTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCCGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 15 (SUCA1 )
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTCGAGGGGGGAGATGCGCTGATACCTATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGTCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGTAAACATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACTGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGCCCGGTGGTGATGGGCTCCGTGCGCGCCCGTCTGGACCGACTGGACGAACCGAGCAGTAATAAAGTGC TGCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 16 (SUCA1 )
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAGCTCCCCCCTCTACGCGACTATGGATACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCAGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCATTTGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGACTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAACA CTCGGGCCACCACTACCCGAGGCACGCGCGGGCAGACCTGGCTGACCTGCTTGGCTCGTCATTATTTCACGACG GCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 17 (SUCA1 )
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTCGAGGGGGGAGATGCGCTGATACCTATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGTCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGTAAACATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACTGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGCCCGGTGGTGATGGGCTCCGTGCGCGCCCGTCTGGACCGACTGGACGAACCGAGCAGTAATAAAGTGC TGCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 18 (SUCA1 )
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAGCTCCCCCCTCTACGCGACTATGGATACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCAGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCATTTGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGACTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAACA CTCGGGCCACCACTACCCGAGGCACGCGCGGGCAGACCTGGCTGACCTGCTTGGCTCGTCATTATTTCACGACG GCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC SEQ ID NO: 19 (DNAN100)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGTGCTACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGTTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTGGCGGTGTGTTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TCATTGAACTGATGCGTATGCTCGACGGCGGTGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 20 (DNAN100)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCACGATGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACAACCACGC AAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCGC TTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTCCT ACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCGGT GGCAGACCGCCACACAAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCCGC AGTAACTTGACTACGCATACGAGCTGCCGCCACTTTTGGGCGACGCGCACGTC
SEQ ID NO: 21 (SUCA228)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCCGGTGCGAAA CGTTTCTCGCTGGAAGGGGGAGATGCTCTGGTACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAAGATCTGTTCGACGAGTTTGCCGGGAAGCATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCTCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGTCCGGTGGTGATGGGATCTGTGCGTGCCCGTCTGGATAGACTGGACGAACCGAGCAGCAACAAAGTGTT GCCGATCACCATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 22 (SUCA228)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGGCCACGCTTT GCAAAGAGCGACCTTCCCCCTCTACGAGACCATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTTCTAGACAAGCTGCTCAAACGGCCCTTCGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGAGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCAGGCCACCACTACCCTAGACACGCACGGGCAGACCTATCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGGTAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 23 (AROC1 )
GTTTTTCGCCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGCGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACTTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTT AACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 24 (AROC1 )
CAAAAAGCGGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCGCCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGAACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACGGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCT GACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGC ACCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NOs 25 (comp Tartrate)
GTGAAAAAGGGTATGTTTGTACTGCTATCGCTTGTGGTTATTTGTATTCCATTCCCATTACCCAAGATATAAATATTA ATAGGTTAGTTACTATATACTATAACATAGGTTACAACCGTCCCTCTTTAATAAGGGTATGTTTTTGATTCAGTTTAG CAAAGAGTCCTTTCTACGTCCTCACCCTAGATGTAGTTCTAGCACCAATCTAGCAATGACCTGCACTAATTATCTTA ACTTCTTAACCAACTTCGCGGACGCTACGAGTGCGTCCGCTTTTCTAGTCCGTCTTCCCAGTGGTTGTATCGCCCA GTCGTATAAGAGGTAACTCGCTTATTACACAAGCGCGTACGCGACCGCAC
SEQ ID NOs 26 (forward Tartrate)
CACTTTTTCCCATACAAACATGACGATAGCGAACACCAATAAACATAAGGTAAGGGTAATGGGTTCIATATTTATAAT
TATCCAATCAATGATATATGATATTGTATCCAATGTTGGCAGGGAGAAATTATTCCCATACAAAAACTAAGTCAAATC
GTTTCTCAGGAAAGATGCAGGAGTGGGATCTACATCAAGATCGTGGTTAGATCGTTACTGGACGTGATTAATAGAA
TTGAAGAATTGGTTGAAGCGCCTGCGATGCTCACGCAGGCGAAAAGATCAGGCAGAAGGGTCACCAAC
ATAGCGGGTCAGCATATTCTCCATTGAGCGAATAATGTGTTCGCGCATGCGCTGGCGTGC SEQ ID NO: 27 (HISD255)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAACGTCAGGTCAGCCAACGCCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCGCAGGTGATTCTGCTGACGCCTGATG CTGACATTGCCTGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCAGAACTGCCGCGCGCGGACACCGCCAG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTT
SEQ ID NO: 28 (HISD255)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTTGCAGTCCAGTCGGTTGCGGAG CTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCCT AAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGCGTCCACTAAGACGACTGCGGACTAC GACTGTAACGGACGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGTCTTGACGGCGCGCGCCTGTGGCGGTCC GTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAA
SEQ ID NO: 29 (HISD1 )
ATTGCGGGATGCCAGAAGGTGGTTCTGTGCTCGCCGCCACCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTGCTGGTGATCGCCGACAGCGGCGCAACACCG GATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGCCCGGATTCCCAGGTGATCCTGCTGACGCCGGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGGCACCGCCC GGCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 30 (HISD1 )
TAACGCCCTACGGTCTTCCACCAAGACACGAGCGGCGGTGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCAGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCACGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCGGGCCTAAGGGTCCACTAGGACGACTGCGGCCTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCCGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 31 (PURE1 )
AGCGACTGGGCTACCATGCAATTCGCCGCCGAAATTTTTGAAATTCTGGATGTCCCGCACCATGTAGAAGTGGTTT CCGCCCATCGCACCCCCGATAAACTGTTCAGCTTCGCCGAAACGGCGGAAGAGAACGGATATCAAGTGATTATTG CCGGCGCGGGCGGCGCGGCACACCTGCCGGGAATGATTGCGGCAAAAACGCTGGTCCCGGTACTCGGCGTGCC GGTACAAAGCGCTGCGCTCAGCGGCGTGGATAGCCTCTACTCCATCGTGCAGATGCCGCGCGGCATTCCGGTGG GTACGCTGGCGATCGGTAAAGCCGGGGCGGCGAACGCCGCACTGCTGGCAGCGCAAATTTTGGCTACGCATGAT AGCGCGCTGCATCGGCGCATCGCCGAC SEQ ID NO: 32 (PURE1 )
TCGCTGACCCGATGGTACGTTAAGCGGCGGCTTTAAAAACTTTAAGACCTACAGGGCGTGGTACATCTTCACCAAA GGCGGGTAGCGTGGGGGCTATTTGACAAGTCGAAGCGGCTTTGCCGCCTTCTCTTGCCTATAGTTCACTAATAAC GGCCGCGCCCGCCGCGCCGTGTGGACGGCCCTTACTAACGCCGTTTTTGCGACCAGGGCCATGAGCCGCACGG CCATGTTTCGCGACGCGAGTCGCCGCACCTATCGGAGATGAGGTAGCACGTCTACGGCGCGCCGTAAGGCCACC CATGCGACCGCTAGCCATTTCGGCCCCGCCGCTTGCGGCGTGACGACCGTCGCGTTTAAAACCGATGCGTACTAT CGCGCGACGTAGCCGCGTAGCGGCTG
SEQ ID NO: 33 (AROC1 )
GTTTTTCGCCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGCGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACTTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTT AACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 34 (AROC1 )
CAAAAAGCGGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCGCCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGAACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACGGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCT GACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGC ACCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 35 (DNAN126)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGTGCTACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGTTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAATCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAGGCGTCTTTACCCAGCCATTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGTGGCGAAAACCCGCTGCGCGTGCAG SEQ ID NO: 36 (DNAN126)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCACGATGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACAACCACGC AAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCGC TTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTAGCTGGGTCAAAAGCTACCGGGTAGTCCT ACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAGCGCTGGCTGCCGGT GGCAGACCGCCACACGAGTTACGGCGACCTCCGCAGAAATGGGTCGGTAAGCCACTAACACGGCGCATTTCCGC ACTAACTTGACTACGCATACGAGCTGCCACCGCTTTTGGGCGACGCGCACGTC SEQ ID NO: 37 (HEMD1 )
GCAACGCTGACGGAAAACGATCTGGTTTTTGCCCTTTCACAGCACTCCGTCGCCTTTGCTCACGCCCAGCTCCAGC GGGATGGACGAAACTGGCCTGCGTCGCCGCGCTATTTCTCGATTGGCCGCACTACGGCGCTCGCCCTTCATACCG TTAGCGGGTTCGATATTCGTTATCCATTGGATCGGGAAATCAGCGAAGCCTTGCTACAATTACCTGAATTACAAAAT ATTGCGGGCAAACGCGCGCTGATTTTGCGTGGCAATGGCGGCCGCGAACTGCTGGGCGAAACCCTGACAGTTCG CGGAGCCGAAGTCAGTTTTTGTGAATGTTATCAACGATGTGCGAAACATTACGATGGCGCGGAAGAAGCGATGCG CTGGCATACTCGCGGCGTAACAACGCTTGTTGTTACCAGCGGCGAGATGTTGCAA
SEQ ID NO: 38 (HEMD1 )
CGTTGCGACTGCCTTTTGCTAGACCAAAAACGGGAAAGTGTCGTGAGGCAGCGGAAACGAGTGCGGGTCGAGGT CGCCCTACCTGCTTTGACCGGACGCAGCGGCGCGATAAAGAGCTAACCGGCGTGATGCCGCGAGCGGGAAGTAT GGCAATCGCCCAAGCTATAAGCAATAGGTAACCTAGCCCTTTAGTCGCTTCGGAACGATGTTAATGGACTTAATGTT TTATAACGCCCGTTTGCGCGCGACTAAAACGCACCGTTACCGCCGGCGCTTGACGACCCGCTTTGGGACTGTCAA GCGCCTCGGCTTCAGTCAAAAACACTTACAATAGTTGCTACACGCTTTGTAATGCTACCGCGCCTTCTTCGCTACG CGACCGTATGAGCGCCGCATTGTTGCGAACAACAATGGTCGCCGCTCTACAACGTT
SEQ ID NO: 39 (THRA10)
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGACGCCAGGCTGCTGAAATCGATGTC CTACCAGGAAGCGATGGAACTCTCTTACTTCGGCGCCAAAGTTCTTCACCCTCGCACCATTACGCCCATCGCCCAG TTCCAGATCCCCTGTCTGATTAAAAATACCGGTAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGAC GATGATAATCTGCCGGTCAAAGGGATCTCTAACCTTAACAATATGGCGATGTTTAGCGTCTCCGGCCCGGGAATGA AAGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACC CAGTCCTCCTCTGAGTACAGCATCAGTTTCTGTGTGCCGCAGAGTGACTGC SEQ ID NO: 40 (THRA10)
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTGCGGTCCGACGACTTTAGCTACA GGATGGTCCTTCGCTACCTTGAGAGAATGAAGCCGCGGTTTCAAGAAGTGGGAGCGTGGTAATGCGGGTAGCGG GTCAAGGTCTAGGGGACAGACTAATTTTTATGGCCATTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCG CTGCTACTATTAGACGGCCAGTTTCCCTAGAGATTGGAATTGTTATACCGCTACAAATCGCAGAGGCCGGGCCCTT ACTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAAT GGGTCAGGAGGAGACTCATGTCGTAGTCAAAGACACACGGCGTCTCACTGACG
SEQ ID NO: 41 (AROC129)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGTGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGTGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTT AACGTAGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 42 (AROC129)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCACCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACGGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCACGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCTG ACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGCA TCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 43 (AROC10)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGTCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATTAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTTA ACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 44 (AROC100)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGTCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGTGAAGGATTTA ACGTAGTGGCGCTGCGTGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 45 (AROC101 )
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGATATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGTCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG CGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATTAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTTA ACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT SEQ ID NO: 46 (AROC102)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGTGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGATATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCAGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTTTTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTT AACGTAGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 47 (AROC103)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTT AACGTGGTGGCGCTGCGTGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 48 (AROC104)
GTTTTTCGTCCGGGACACGCGGATTATACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATTGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGTGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATTAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTTA ACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 49 (AROC105)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATTGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGATATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGTCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG CGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTTTTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTT AACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 50 (AROC106)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACCTGGCGGAAAAGTTCGG TATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCAGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTT AACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 51 (AROC107)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATTGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGTGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATTAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTTA ACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 52 (AROC108)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATTGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGATATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATTAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTTA ACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 53 (AROC10)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACAGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCT GACCTACGCCTGTAGCGGGTACGCGACTACTCGTAATTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGC ACCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 54 (AROC100)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACAGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCT GACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCACTTCCTAAATTGC ATCACCGCGACGCACCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA SEQ ID NO: 55 (AROC101 )
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTATAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAAT TAGGCAAGAAAACAGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCGCT GAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCTG ACCTACGCCTGTAGCGGGTACGCGACTACTCGTAATTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGCA CCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 56 (AROC102)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCACTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTATAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAAT TAGGCAAGAAAACGGGGCTACGTCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCACT GAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCAAAAACTGGCTG ACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGCA TCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 57 (AROC103)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACGGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCT GACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGC ACCACCGCGACGCACCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA SEQ ID NO: 58 (AROC104)
CAAAAAGCAGGCCCTGTGCGCCTAATATGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAACGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACGGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCACACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCTG ACCTACGCCTGTAGCGGGTACGCGACTACTCGTAATTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGCA CCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA SEQ ID NO: 59 (AROC105)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAACGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTATAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAAT TAGGCAAGAAAACAGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCGCT GAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCAAAAACTGGCTG ACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGCA CCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 60 (AROC106)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGGACCGCCTTTTCAAGCCA TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACGGGGCTACGTCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCT GACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGC ACCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 61 (AROC107)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAACGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACGGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCACACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCTG ACCTACGCCTGTAGCGGGTACGCGACTACTCGTAATTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGCA CCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 62 (AROC108)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAACGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTATAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAAT TAGGCAAGAAAACGGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCACT GAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCTG ACCTACGCCTGTAGCGGGTACGCGACTACTCGTAATTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGCA CCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 63 (THRA1 17)
GTGCTGGGGCGTAATGGCTCTGACTACTCTGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGGCTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGACGCCAGGCTGTTGAAATCGATGTC CTACCAGGAAGCGATGGAGCTCTCTTACTTCGGCGCCAAAGTCCTTCACCCTCGCACCATAACGCCTATCGCCCA GTTCCAGATCCCCTGTCTGATTAAAAATACCGGCAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGA CGATGATAATCTGCCAGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATG AAAGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCAGTGGTGCTCATTACC CAGTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC SEQ ID NO: 64 (THRA1 19)
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGACGCCAGGCTGTTGAAATCGATGTC CTACCAGGAAGCGATGGAGCTCTCTTACTTCGGCGCCAAAGTCCTTCACCCTCGCACCATAACGCCTATCGCCCA GTTCCAGATCCCCTGTCTGATTAAAAATACCGGCAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGA CGATGATAATCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATG AAAGGGATGATTGGGATGGCGGCACGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACC CAGTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC
SEQ ID NO: 65 (THRA120) GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGATGCCAGGCTGTTGAAATCGATGTCC TACCAGGAAGCGATGGAGCTCTCTTACTTCGGCGCCAAAGTCCTTCACCCTCGCACCATAACGCCTATCGCCCAGT TCCAGATCCCCTGTCTGATTAAAAATACCGGCAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGACG ATGATAATCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATGAA AGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACCC AGTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC
SEQ ID NO: 66 (THRA128)
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGACGCCAGGCTGTTGAAATCGATGTC CTACCAGGAAGCGATGGAACTCTCTTACTTCGGCGCCAAAGTTCTTCACCCTCGCACCATTACGCCCATCGCCCAG TTCCAGATCCCCTGTCTGATTAAAAATACCGGTAATCCACAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGAC GATGATAATCTGCCGGTCAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCTGGAATGA AAGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACC CAGTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC
SEQ ID NO: 67 (THRA135)
GTGCTGGGGCGTAATGGCTCTGACTACTCTGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGACGCCAGGCTGTTGAAATCGATGTC CTACCAGGAAGCGATGGAGCTCTCTTACTTCGGCGCCAAAGTCCTTCACCCTCGCACCATAACGCCTATCGCCCA GTTCCAGATCCCCTGTCTGATTAAAAATACCGGCAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGA CGATGATAATCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATG AAAGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCAGTGGTGCTCATTACC CAGTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC
SEQ ID NO: 68 (THRA142)
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCTGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGATGCCAGGCTGTTGAAATCGATGTCC TACCAGGAAGCGATGGAGCTCTCTTACTTCGGCGCCAAAGTCCTTCACCCTCGCACCATAACGCCTATCGCCCAGT TCCAGATCCCCTGTCTGATTAAAAATACCGGCAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGACG ATGATAATCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATGAA AGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCAGTGGTGCTCATTACCCA GTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC
SEQ ID NO: 69 (THRA146)
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGACGCCAGGCTGTTGAAATCGATGTC CTACCAGGAAGCGATGGAGCTCTCTTACTTCGGCGCCAAAGTCCTTCACCCTCGCACCATAACGCCTATCGCCCA GTTCCAGATCCCCTGTCTGATTAAAAATACCGGCAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGA CGATGATAATCTGCCGGTTAAAGGGATCTCTAACCTCAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATG AAAGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACC CAGTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC SEQ ID NO: 70 (THRA150)
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGACGCCAGGCTGTTGAAATCGATGTC CTACCAGGAAGCGATGGAGCTCTCTTACTTCGGCGCCAAAGTCCTTCACCCTCGCACCATAACGCCTATCGCCCA GTTCCAGATCCCCTGTCTGATTAAAAATACCGGTAATCCGCAGGCGCCAGGAACGCTGATCGGCGCATCCAGCGA CGATGATAATCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATG AAAGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACC CAGTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC
SEQ ID NO: 71 (THRA181 )
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCTGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGACGCCAGGCTGTTGAAATCGATGTC CTACCAGGAAGCGATGGAGCTCTCTTACTTCGGCGCCAAAGTCCTTCACCCTCGCACCATAACGCCTATCGCCCA GTTCCAGATCCCCTGTCTGATTAAAAATACCGGCAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGA CGATGATAATCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATG AAAGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCAGTGGTGCTCATTACC CAGTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC
SEQ ID NO: 72 (THRA191 )
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCTGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGACGCCAGGCTGTTGAAATCGATGTC CTACCAGGAAGCGATGGAGCTCTCTTACTTCGGCGCCAAAGTCCTTCACCCTCGCACCATAACGCCTATCGCCCA GTTCCAGATCCCCTGTCTGATTAAAAATACCGGCAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGA CGATGATAATCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATG AAAGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACC CAGTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC
SEQ ID NO: 73 (THRA1 17)
CACGACCCCGCATTACCGAGACTGATGAGACGGCGGCACGACCGGCGGACAAATGCGCGACCGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTGCGGTCCGACAACTTTAGCTACAG GATGGTCCTTCGCTACCTCGAGAGAATGAAGCCGCGGTTTCAGGAAGTGGGAGCGTGGTATTGCGGATAGCGGG TCAAGGTCTAGGGGACAGACTAATTTTTATGGCCGTTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCGC TGCTACTATTAGACGGTCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTTA CTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGTCACCACGAGTAATG GGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG SEQ ID NO: 74 (THRA1 19)
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTGCGGTCCGACAACTTTAGCTACAG GATGGTCCTTCGCTACCTCGAGAGAATGAAGCCGCGGTTTCAGGAAGTGGGAGCGTGGTATTGCGGATAGCGGG TCAAGGTCTAGGGGACAGACTAATTTTTATGGCCGTTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCGC TGCTACTATTAGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTTA CTTTCCCTACTAACCCTACCGCCGTGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAATG GGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG
SEQ ID NO: 75 (THRA120)
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTACGGTCCGACAACTTTAGCTACAG GATGGTCCTTCGCTACCTCGAGAGAATGAAGCCGCGGTTTCAGGAAGTGGGAGCGTGGTATTGCGGATAGCGGG TCAAGGTCTAGGGGACAGACTAATTTTTATGGCCGTTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCGC TGCTACTATTAGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTTA CTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAATG GGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG
SEQ ID NO: 76 (THRA128)
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTGCGGTCCGACAACTTTAGCTACAG GATGGTCCTTCGCTACCTTGAGAGAATGAAGCCGCGGTTTCAAGAAGTGGGAGCGTGGTAATGCGGGTAGCGGGT CAAGGTCTAGGGGACAGACTAATTTTTATGGCCATTAGGTGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCGCT GCTACTATTAGACGGCCAGTTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGACCTTAC TTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAATG GGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG
SEQ ID NO: 77 (THRA135)
CACGACCCCGCATTACCGAGACTGATGAGACGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTGCGGTCCGACAACTTTAGCTACAG GATGGTCCTTCGCTACCTCGAGAGAATGAAGCCGCGGTTTCAGGAAGTGGGAGCGTGGTATTGCGGATAGCGGG TCAAGGTCTAGGGGACAGACTAATTTTTATGGCCGTTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCGC TGCTACTATTAGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTTA CTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGTCACCACGAGTAATG GGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG
SEQ ID NO: 78 (THRA142)
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGACACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTACGGTCCGACAACTTTAGCTACAG GATGGTCCTTCGCTACCTCGAGAGAATGAAGCCGCGGTTTCAGGAAGTGGGAGCGTGGTATTGCGGATAGCGGG TCAAGGTCTAGGGGACAGACTAATTTTTATGGCCGTTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCGC TGCTACTATTAGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTTA CTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGTCACCACGAGTAATG GGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG SEQ ID NO: 79 (THRA146)
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTGCGGTCCGACAACTTTAGCTACAG GATGGTCCTTCGCTACCTCGAGAGAATGAAGCCGCGGTTTCAGGAAGTGGGAGCGTGGTATTGCGGATAGCGGG TCAAGGTCTAGGGGACAGACTAATTTTTATGGCCGTTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCGC TGCTACTATTAGACGGCCAATTTCCCTAGAGATTGGAGTTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTTA CTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAATG GGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG
SEQ ID NO: 80 (THRA150)
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTGCGGTCCGACAACTTTAGCTACAG GATGGTCCTTCGCTACCTCGAGAGAATGAAGCCGCGGTTTCAGGAAGTGGGAGCGTGGTATTGCGGATAGCGGG TCAAGGTCTAGGGGACAGACTAATTTTTATGGCCATTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGTAGGTCGC TGCTACTATTAGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTTA CTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAATG GGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG
SEQ ID NO: 81 (THRA181 )
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGACACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTGCGGTCCGACAACTTTAGCTACAG GATGGTCCTTCGCTACCTCGAGAGAATGAAGCCGCGGTTTCAGGAAGTGGGAGCGTGGTATTGCGGATAGCGGG TCAAGGTCTAGGGGACAGACTAATTTTTATGGCCGTTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCGC TGCTACTATTAGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTTA CTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGTCACCACGAGTAATG GGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG
SEQ ID NO: 82 (THRA191 )
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGACACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTGCGGTCCGACAACTTTAGCTACAG GATGGTCCTTCGCTACCTCGAGAGAATGAAGCCGCGGTTTCAGGAAGTGGGAGCGTGGTATTGCGGATAGCGGG TCAAGGTCTAGGGGACAGACTAATTTTTATGGCCGTTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCGC TGCTACTATTAGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTTA CTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAATG GGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG
SEQ ID NO: 83 (DNAN1 10)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCTCATCAGG ATGTGCGCTATTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGCCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGC GTGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG SEQ ID NO: 84 (DNAN1 1 1 )
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAGGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGCCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGC GTGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 85 (DNAN1 15)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGCCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGC GTGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 86 (DNAN1 17)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACACTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGCCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGC GTGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG SEQ ID NO: 87 (DNAN122)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAACCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGCCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCATTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGTGGCGAAAACCCGCTGCGCGTGCAG SEQ ID NO: 88 (DNAN124)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAAGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGCCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCATTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGTGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 89 (DNAN133)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGTGCTACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACACTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGCCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCATTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 90 (DNAN138)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACACTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGCCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCATTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAATCCGCTGCGCGTGCAG
SEQ ID NO: 91 (DNAN14)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGCCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCATTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGTGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 92 (DNAN15)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGCCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGC GTGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG SEQ ID NO: 93 (DNAN1 10)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGAGTAGTCC TACACGCGATAATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCGGT GGCGGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCCGC ACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 94 (DNAN1 1 1 )
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTCCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCG GTGGCGGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 95 (DNAN1 15)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCG GTGGCGGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 96 (DNAN1 17)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGTGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCG GTGGCGGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC SEQ ID NO: 97 (DNAN122)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTTGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCG GTGGCGGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTAAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCACCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 98 (DNAN124)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTTCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTCC TACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCGGT GGCGGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTAAGCCACTAACACGGCGCATTTCCGC ACTAACTTGACTACGCATACGAGCTGCCACCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 99 (DNAN133)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCACGATGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGTGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCG GTGGCGGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTAAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 100 (DNAN138)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGTGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTCC TACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCGGT GGCGGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTAAGCCACTAACACGGCGCATTTCCGC ACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTAGGCGACGCGCACGTC
SEQ ID NO: 101 (DNAN14)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCG GTGGCGGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTAAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCACCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 102 (DNAN15)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCG GTGGCGGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 103 (DNAN10)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACACTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 104 (DNAN100)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGTGCTACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGTTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTGGCGGTGTGTTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TCATTGAACTGATGCGTATGCTCGACGGCGGTGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 105 (DNAN101 )
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGCCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCATTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGTGGCGAAAACCCGCTGCGCGTGCAG SEQ ID NO: 106 (DNAN102)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCTCATCAGG ATGTGCGCTATTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGTGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 107 (DNAN103)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGTGCTACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACTCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 108 (DNAN104)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGTGCTACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGTTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAGGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGC GTGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG SEQ ID NO: 109 (DNAN105)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGTTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTCGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAGGCGTCTTTACCTAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGTGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 1 10 (DNAN106)
ATGGAGATGGTCGCGCGCGTGACGCTTTCTCAGCCGCATGAGCCGGGTGCTACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGTGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG SEQ ID NO: 1 1 1 (DNAN107)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGTTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAGGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGC GTGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 1 12 (DNAN108)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGTCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTAGCGGTGTGCTCAATGCCGCTGGAGACGTCTTTACCTAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGTGGCGAAAACCCGCTGCGAGTGCAG
SEQ ID NO: 1 13 (DNAN10)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGTGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCG GTGGCAGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 1 14 (DNAN100)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCACGATGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACAACCACGC AAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCGC TTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTCCT ACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCGGT GGCAGACCGCCACACAAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCCGC AGTAACTTGACTACGCATACGAGCTGCCGCCACTTTTGGGCGACGCGCACGTC
SEQ ID NO: 1 15 (DNAN101 )
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCG GTGGCGGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTAAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCACCGCTTTTGGGCGACGCGCACGTC SEQ ID NO: 1 16 (DNAN102) TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGAGTAGTCC TACACGCGATAATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCGGT GGCAGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCCGC ACTAACTTGACTACGCATACGAGCTGCCACCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 1 17 (DNAN103)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCACGATGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGAGTCAAAAGCTACCGGGTAGTCC TACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCGGT GGCAGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCCGC ACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 1 18 (DNAN104)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCACGATGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACAACCACGC AAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCGC TTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTCCT ACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCGGT GGCAGACCGCCACACGAGTTACGGCGACCTCCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCCGC ACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 1 19 (DNAN105)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCAACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAGCTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCG GTGGCAGACCGCCACACGAGTTACGGCGACCTCCGCAGAAATGGATCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCACCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 120 (DNAN106)
TACCTCTACCAGCGCGCGCACTGCGAAAGAGTCGGCGTACTCGGCCCACGATGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCACTAGCCTACGACCACGC AAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCGC TTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTCCT ACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCGGT GGCAGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCCGC ACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC SEQ ID NO: 121 (DNAN107)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCAACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCG GTGGCAGACCGCCACACGAGTTACGGCGACCTCCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 122 (DNAN108)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCAGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCG GTGGCAGATCGCCACACGAGTTACGGCGACCTCTGCAGAAATGGATCGGTGAGCCACTAACACGGCGCATTTCCG CACTAACTTGACTACGCATACGAGCTGCCACCGCTTTTGGGCGACGCTCACGTC
SEQ ID NO: 123 (HISD105)
ATTGCAGGATGCCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTGCTGGTGATCGCCGACAGCGGCGCAACGCCG GATTTTGTCGCATCTGACCTGCTCTCCCAGGCAGAACACGGCCCGGATTCCCAGGTGATTTTGCTGACGCCGGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCC GGCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 124 (HISD1 1 )
ATTGCGGGATGTCAGAACGTGGTTTTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTGCTGGTGATCGCCGACAGCGGCGCAACACCG GATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGCCCGGATTCCCAGGTGATCCTGCTGACGCCGGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCC GGCAGGCCCTGAGCGCCAGCCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC SEQ ID NO: 125 (HISD1 10)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCAGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGCCCGGATTCCCAGGTGATCCTGCTGACGCCGGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 126 (HISD1 16)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTGCTGGTGATCGCAGACAGCGGCGCAACACCG GATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGCCCGGATTCCCAGGTGATCCTGCTGACGCCGGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAGCTGCCGCGCGCGGACACCGCCC GGCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 127 (HISD1 19)
ATTGCGGGATGCCAGAAGGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTGCTGGTGATCGCCGACAGCGGCGCAACACCG GATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGCCCGGATTCCCAGGTGATCCTGCTGACGCCGGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCC GGCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 128 (HISD123)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTGCTGGTGATCGCCGACAGCGGCGCAACACCG GATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGCCCGGATTCCCAGGTGATCCTGCTGACGCCGGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCC GGCAGGCCCTGAGCGCCAGCCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 129 (HISD125)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGCCCGGATTCCCAGGTGATCCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC SEQ ID NO: 130 (HISD130)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCAGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGCCCGGATTCCCAGGTGATCCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 131 (HISD133)
ATTGCGGGATGCCAGAAGGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAACGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCAGACAGCGGCGCAACACTGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGCCCGGATTCCCAGGTGATCCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 132 (HISD14)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAACGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCAGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGCCCGGATTCCCAGGTGATCCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 133 (HISD105)
TAACGTCCTACGGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGGA GCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCACGACCACTAGCGGCTGTCGCCGCGTTGCGGCC TAAAACAGCGTAGACTGGACGAGAGGGTCCGTCTTGTGCCGGGCCTAAGGGTCCACTAAAACGACTGCGGCCTAC GACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGCC GTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 134 (HISD1 1 )
TAACGCCCTACAGTCTTGCACCAAAACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCAGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCACGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCGGGCCTAAGGGTCCACTAGGACGACTGCGGCCTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCGGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG SEQ ID NO: 135 (HISD1 10)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCAGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGTCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCGGGCCTAAGGGTCCACTAGGACGACTGCGGCCTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 136 (HISD1 16)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCAGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCACGACCACTAGCGTCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCGGGCCTAAGGGTCCACTAGGACGACTGCGGCCTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTCGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 137 (HISD1 19)
TAACGCCCTACGGTCTTCCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCAGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCACGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCGGGCCTAAGGGTCCACTAGGACGACTGCGGCCTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG SEQ ID NO: 138 (HISD123)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCAGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCACGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCGGGCCTAAGGGTCCACTAGGACGACTGCGGCCTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCGGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 139 (HISD125)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCAGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCGGGCCTAAGGGTCCACTAGGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG SEQ ID NO: 140 (HISD130)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCAGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGTCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCGGGCCTAAGGGTCCACTAGGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 141 (HISD133)
TAACGCCCTACGGTCTTCCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTTGCAGTCCAGTCGGTCGCAGAG CTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGTCTGTCGCCGCGTTGTGACCTA AAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCGGGCCTAAGGGTCCACTAGGACGACTGCGGACTACG ACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGCCG TCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 142 (HISD14)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTTGCAGTCCAGTCGGTCGCAGAG CTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGTCTGTCGCCGCGTTGTGGCCTA AAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCGGGCCTAAGGGTCCACTAGGACGACTGCGGACTACG ACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGCCG TCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 143 (HISD105)
ATTGCAGGATGCCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTGCTGGTGATCGCCGACAGCGGCGCAACGCCG GATTTTGTCGCATCTGACCTGCTCTCCCAGGCAGAACACGGCCCGGATTCCCAGGTGATTTTGCTGACGCCGGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCC GGCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 144 (HISD1 10)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCAGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGCCCGGATTCCCAGGTGATCCTGCTGACGCCGGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC SEQ ID NO: 145 (HISD1 1 1 ) ATTGCGGGATGTCAGAACGTGGTTCTGTGTTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGTCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCAGAACACGGTCCGGATTCGCAGGTGATCCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGAGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCACCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 146 (HISD1 16)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTGCTGGTGATCGCAGACAGCGGCGCAACACCG GATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGCCCGGATTCCCAGGTGATCCTGCTGACGCCGGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAGCTGCCGCGCGCGGACACCGCCC GGCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 147 (HISD1 17)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCAGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCCCAGGTGATCCTGCTAACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCAGAACTGCCGCGCGCGGACACCGCCAG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 148 (HISD123)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTGCTGGTGATCGCCGACAGCGGCGCAACACCG GATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGCCCGGATTCCCAGGTGATCCTGCTGACGCCGGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCC GGCAGGCCCTGAGCGCCAGCCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 149 (HISD125)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGTCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGCCCGGATTCCCAGGTGATCCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC SEQ ID NO: 150 (HISD129)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCCCAGGTGATTCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 151 (HISD134)
ATTGCCGGCTGTCAGAAAGTGGTTCTGTGCTCACCACCTCCTATCGCCGACGAAATCCTTTATGCGGCACAGCTGT GTGGCATAGAAGAAATCTTTAATGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAATCGGTTC CGAAAGTCGATAAAATTTTTGGCCCTGGCAACGCCTTTGTAACCGAAGCCAAACGTCAGGTCAGCCAGCGTCTCGA CGGCGCAGCTATCGATATGCCTGCCGGGCCATCTGAAGTGCTGGTGATCGCCGACAGCGGCGCAACGCCGGATT TTGTCGCATCTGACCTGCTCTCCCAGGCAGAACACGGTCCGGATTCCCAGGTGATTTTGCTGACGCCGGATGTTG ACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCAGAACTGCCGCGCGCGGACACCGCCCGGCA GGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 152 (HISD135)
ATTGCCGGCTGTCAGAAAGTGGTTCTGTGCTCACCACCTCCTATCGCCGACGAAATCCTTTATGCGGCACAGCTGT GTGGCATAGAAGAAATCTTTAATGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAATCGGTTC CGAAAGTCGATAAAATTTTTGGCCCTGGCAACGCCTTTGTAACCGAAGCCAAACGTCAGGTCAGCCAGCGTCTCGA CGGCGCAGCTATCGATATGCCTGCCGGGCCATCTGAAGTGCTGGTGATCGCCGACAGCGGCGCAACACCGGATT TTGTCGCATCTGACCTGCTCTCCCAGGCAGAACACGGTCCGGATTCCCAGGTGATTTTGCTGACGCCGGATGTTG ACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCAGAACTGCCGCGCGCGGACACCGCCCGGCA GGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 153 (HISD105)
TAACGTCCTACGGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGGA GCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCACGACCACTAGCGGCTGTCGCCGCGTTGCGGCC TAAAACAGCGTAGACTGGACGAGAGGGTCCGTCTTGTGCCGGGCCTAAGGGTCCACTAAAACGACTGCGGCCTAC GACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGCC GTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG SEQ ID NO: 154 (HISD1 10)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCAGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGTCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCGGGCCTAAGGGTCCACTAGGACGACTGCGGCCTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 155 (HISD1 1 1 )
TAACGCCCTACAGTCTTGCACCAAGACACAAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCAGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGGA GCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGTCTTGTGCCAGGCCTAAGCGTCCACTAGGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCTCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGTGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 156 (HISD1 16)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCAGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCACGACCACTAGCGTCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCGGGCCTAAGGGTCCACTAGGACGACTGCGGCCTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTCGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 157 (HISD1 17)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCAGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGTCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGGGTCCACTAGGACGATTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGTCTTGACGGCGCGCGCCTGTGGCGGTC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 158 (HISD123)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCAGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCACGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCGGGCCTAAGGGTCCACTAGGACGACTGCGGCCTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCGGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG SEQ ID NO: 159 (HISD125)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCAGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCGGGCCTAAGGGTCCACTAGGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 160 (HISD129)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGGA GCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGGGTCCACTAAGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 161 (HISD134)
TAACGGCCGACAGTCTTTCACCAAGACACGAGTGGTGGAGGATAGCGGCTGCTTTAGGAAATACGCCGTGTCGAC ACACCGTATCTTCTTTAGAAATTACAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTTAGCCAA GGCTTTCAGCTATTTTAAAAACCGGGACCGTTGCGGAAACATTGGCTTCGGTTTGCAGTCCAGTCGGTCGCAGAGC TGCCGCGTCGATAGCTATACGGACGGCCCGGTAGACTTCACGACCACTAGCGGCTGTCGCCGCGTTGCGGCCTA AAACAGCGTAGACTGGACGAGAGGGTCCGTCTTGTGCCAGGCCTAAGGGTCCACTAAAACGACTGCGGCCTACAA CTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGTCTTGACGGCGCGCGCCTGTGGCGGGCCGT CCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 162 (HISD135)
TAACGGCCGACAGTCTTTCACCAAGACACGAGTGGTGGAGGATAGCGGCTGCTTTAGGAAATACGCCGTGTCGAC ACACCGTATCTTCTTTAGAAATTACAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTTAGCCAA GGCTTTCAGCTATTTTAAAAACCGGGACCGTTGCGGAAACATTGGCTTCGGTTTGCAGTCCAGTCGGTCGCAGAGC TGCCGCGTCGATAGCTATACGGACGGCCCGGTAGACTTCACGACCACTAGCGGCTGTCGCCGCGTTGTGGCCTA AAACAGCGTAGACTGGACGAGAGGGTCCGTCTTGTGCCAGGCCTAAGGGTCCACTAAAACGACTGCGGCCTACAA CTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGTCTTGACGGCGCGCGCCTGTGGCGGGCCGT CCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 163 (HISD10)
ATTGCGGGATGCCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCGCAGGTGATCCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAGCTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC SEQ ID NO: 164 (HISD100)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTGCTGGTGATCGCCGACAGCGGCGCAACACCG GATTTCGTCGCTTCTGACCTGCTCTCCCAGGCAGAACACGGTCCGGATTCGCAGGTGATCCTGCTGACGCCTGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTTGCAGAACTGCCGCGCGCGGACACCGCCC GGCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 165 (HISD101 )
ATTGCGGGATGTCAGAACGTGGTTCTGTGCACGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCCCAGGTGATTCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAGCTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTAATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 166 (HISD102)
ATTGCGGGATGCCAGAACGTGATTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAACGCCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCGCAGGTGATCCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTTGCAGAACTGCCGCGCGCGGACACTGCCCGG CAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC SEQ ID NO: 167 (HISD104)
ATTGCGGGATGCCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATTCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAACGCCT CGAGGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCG GATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCGCAGGTGATCCTGCTGACGCCTGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTTGCAGAACTGCCGCGCGCGGACACCGCCC GGCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC SEQ ID NO: 168 (HISD105)
ATTGCAGGATGCCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTGCTGGTGATCGCCGACAGCGGCGCAACGCCG GATTTTGTCGCATCTGACCTGCTCTCCCAGGCAGAACACGGCCCGGATTCCCAGGTGATTTTGCTGACGCCGGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCC GGCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 169 (HISD106)
ATTGCGGGATGCCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCAGAACACGGTCCGGATTCGCAGGTGATCCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCAGAACTGCCGCGCGCGGACACCGCCAG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 170 (HISD107)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCCCAGGTGATTCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAGCTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 171 (HISD108)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCCCAGGTGATTCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGGGGTAGAACGTCAACTGGCGGAGCTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTAATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 172 (HISD109)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAACGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTGCTGGTGATCGCCGACAGCGGCGCAACACCG GATTTCGTCGCGTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCGCAGGTGATCCTGCTGACGCCGGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAGCTGCCGCGCGCGGACACCGCCC GGCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC SEQ ID NO: 173 (HISD10)
TAACGCCCTACGGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGA CACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGC ATGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGG AGCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGC CTAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGCGTCCACTAGGACGACTGCGGACT ACGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTCGACGGCGCGCGCCTGTGGCGGG CCGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 174 (HISD100)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGGA GCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCACGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGTCTTGTGCCAGGCCTAAGCGTCCACTAGGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGAACGTCTTGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 175 (HISD101 )
TAACGCCCTACAGTCTTGCACCAAGACACGTGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGGA GCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGGGTCCACTAAGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTCGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGATTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 176 (HISD102)
TAACGCCCTACGGTCTTGCACTAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTTGCGGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGCGTCCACTAGGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGAACGTCTTGACGGCGCGCGCCTGTGACGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG SEQ ID NO: 177 (HISD104)
TAACGCCCTACGGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAAGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTTGCGGA GCTCCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGCGTCCACTAGGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGAACGTCTTGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 178 (HISD105)
TAACGTCCTACGGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGGA GCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCACGACCACTAGCGGCTGTCGCCGCGTTGCGGCC TAAAACAGCGTAGACTGGACGAGAGGGTCCGTCTTGTGCCGGGCCTAAGGGTCCACTAAAACGACTGCGGCCTAC GACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGCC GTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 179 (HISD106)
TAACGCCCTACGGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGA CACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGC ATGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGG AGCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGC CTAAAGCAGCGAAGACTGGACGAGAGGGTCCGTCTTGTGCCAGGCCTAAGCGTCCACTAGGACGACTGCGGACT ACGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGTCTTGACGGCGCGCGCCTGTGGCGGT CCGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 180 (HISD107)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGGA GCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGGGTCCACTAAGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTCGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 181 (HISD108)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGGA GCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGGGTCCACTAAGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCCCCATCTTGCAGTTGACCGCCTCGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGATTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 182 (HISD109)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTTGCGGA GCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCACGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGCAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGCGTCCACTAGGACGACTGCGGCCTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTCGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 183 (SUCA10)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTTGAGGGGGGAGATGCGCTGATACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGTAAACATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGCCCGGTGGTGATGGGCTCCGTGCGCGCCCGTCTGGACAGACTGGACGAACCGAGCAGCAACAAAGTGT TGCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 184 (SUCA102)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTTGAGGGGGGAGATGCGCTGATACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGTAAACATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGCCCGGTGGTGATGGGCTCTGTGCGTGCCCGTCTGGACCGACTGGACGAACCGAGCAGCAACAAAGTGT TGCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 185 (SUCA126)
AAACGTTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGTGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTCGAGGGGGGAGATGCGCTGATACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGTAAACATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGCCCGGTGGTGATGGGCTCCGTGCGCGCCCGTCTGGACAGACTGGACGAACCGAGCAGCAACAAAGTGT TGCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 186 (SUCA130)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTTGAGGGGGGAGATGCGCTGATACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGTAAACATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCACCTGGAAA TTGTGAGCCCGGTGGTGATGGGCTCCGTGCGTGCCCGTCTGGACCGACTGGACGAACCGAGCAGCAACAAAGTG TTGCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTAGTTCAG SEQ ID NO: 187 (SUCA133)
AAACGCTTCCTGAACGAACTGACCGCTGCAGAAGGGCTGGAACGTTATCTGGGGGCAAAATTCCCTGGCGCGAAA CGTTTTTCGCTGGAAGGCGGCGATGCGTTAATTCCGATGCTCAAAGAGATGGTCCGCCATGCGGGCAACAGCGGC ACCCGCGAAGTGGTGTTGGGAATGGCGCACCGTGGTCGCCTGAACGTACTGGTCAACGTGCTGGGTAAAAAACCT CAGGATCTGTTTGACGAGTTTGCCGGTAAACATAAAGAACATTTGGGCACCGGCGACGTGAAGTACCATATGGGCT TCTCGTCGGATATCGAAACCGAAGGCGGACTGGTTCACCTGGCGCTGGCGTTTAACCCGTCACACCTGGAAATCG TCAGCCCGGTAGTGATGGGGTCTGTGCGCGCACGTCTCGACCGGCTCGACGAACCGAGCAGCAACAAAGTGTTG CCAATCACCATTCATGGTGATGCAGCAGTTACCGGGCAGGGCGTGGTTCAG
SEQ ID NO: 188 (SUCA134)
AAACGCTTCCTGAACGAACTGACCGCTGCAGAAGGGCTGGAACGTTATCTGGGGGCAAAATTCCCTGGCGCGAAA CGTTTTTCGCTGGAAGGCGGCGATGCGTTAATTCCGATGCTCAAAGAGATGGTCCGCCATGCGGGCAACAGCGGC ACCCGCGAAGTGGTGTTGGGAATGGCGCACCGTGGTCGCCTGAACCTACTGGTCAACGTGCTGGGTAAAAAACCT CAGGATCTGTTTGACGAGTTTGCCGGTAAACATAAAGAACATTTGGGCACCGGCGACGTGAAGTACCATATGGGCT TCTCGTCGGATATCGAAACCGAAGGCGGACTGGTTCACCTGGCGCTGGCGTTTAACCCGTCACACCTGGAAATCG TCAGCCCGGTAGTGATGGGGTCTGTGCGCGCACGTCTCGACCGGCTCGACGAACCGAGCAGCAACAAAGTGTTG CCAATCACCATTCATGGTGATGCAGCAGTTACCGGGCAGGGCGTGGTTCAG
SEQ ID NO: 189 (SUCA135)
AAACGCTTCCTGAACGAACTGACCGCTGCAGAAGGGCTGGAACGTTATCTGGGGGCAAAATTCCCTGGCGCGAAA CGTTTTTCGCTGGAAGGCGGCGATGCGTTAATTCCGATGCTCAAAGAGATGGTCCGCCATGCGGGCAACAGCGGC ACCCGCGAAGTGGTGTTGGGAATGGCGCACCGTGGTCGCCTGAACGTACTGGTCAACGTGCTGGGTAAAAAACCT CAGGATCTGTTTGACGAGTTTGCCGGTAAACATAAAGAACATTTGGGCACCGGCGACGTGAAGTACCATATGGGCT TCTCGTCGGATATCGAAACCGAAGGCGGACTGGTTCACCTGGCGCTGGCGTTTAACCCGTCGCACCTGGAAATCG TCAGCCCGGTAGTGATGGGGTCTGTGCGCGCACGTCTCGACCGGCTCGACGAACCGAGCAGCAACAAAGTGTTG CCAATCACCATTCATGGTGATGCAGCAGTTACCGGGCAGGGCGTGGTTCAG
SEQ ID NO: 190 (SUCA136)
AAACGCTTCCTGAACGAACTGACCGCTGCAGAAGGGCTGGAACGTTATCTGGGGGCAAAATTCCCTGGCGCGAAA CGTTTTTCGCTGGAAGGCGGCGATGCGTTAATTCCGATGCTCAAAGAGATGGTCCGCCATGCGGGCAACAGCGGC ACCCGCGAAGTGGTGTTGGGAATGGCGCACCGTGGTCGCCTGAACGTACTGGTCAACGTGCTGGGTAAAAAACCT CAGGATCTGTTTGACGAGTTTGCCGGTAAACATAAAGAACATTTGGGCACCGGCGACGTGAAGTACCATATGGGAT TCTCGTCGGATATCGAAACCGAAGGCGGACTGGTTCACCTGGCGCTGGCGTTTAACCCGTCGCACCTGGAAATCG TCAGCCCGGTAGTGATGGGGTCTGTGCGCGCACGTCTCGACCGGCTCGACGAACCGAGCAGCAACAAAGTGTTG CCAATCACCATTCATGGTGATGCAGCAGTTACCGGGCAGGGCGTGGTTCAG
SEQ ID NO: 191 (SUCA137)
AAACGCTTCCTGAACGAACTGACCGCTGCAGAAGGGCTGGAACGTTATCTGGGGGCAAAATTCCCTGGCGCGAAA CGTTTTTCGCTGGAAGGCGGCGATGCGTTAATTCCGATGCTCAAAGAGATGGTCCGCCATGCGGGCAACAGCGGC ACCCGCGAAGTGGTGTTGGGAATGGCGCACCGTGGTCGCCTGAACGTACTGGTCAACGTGCTGGGTAAAAAACCT CAGGATCTGTTTGACGAGTTTGCCGGTAAACATAAAGAACATTTGGGCACCGGCGACGTGAAGTACCATATGGGCT TCTCGTCGGATATCGAAACCGAAGGCGGACTGGTTCACCTGGCGCTGGCGTTTAACCCGTCGCACCTGGAAATCG TCAGCCCGGTAGTGATGGGGTCTGTGCGCGCACGTCTCGACCGTCTCGACGAACCGAGCAGCAACAAAGTGTTG CCAATCACCATTCATGGTGATGCAGCAGTTACCGGGCAGGGCGTGGTTCAG
SEQ ID NO: 192 (SUCA138)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTTGAGGGGGGAGATGCGCTGATACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGTAAACATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGTTGGCGTTTAACCCATCGCACCTGGAAAT TGTGAGCCCGGTGGTGATGGGCTCCGTGCGTGCCCGTCTGGACCGACTGGACGAACCGAGCAGCAACAAAGTGT TGCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG SEQ ID NO: 193 (SUCA10)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAACTCCCCCCTCTACGCGACTATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCATTTGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCGGGCCACCACTACCCGAGGCACGCGCGGGCAGACCTGTCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 194 (SUCA102)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAACTCCCCCCTCTACGCGACTATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCATTTGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCGGGCCACCACTACCCGAGACACGCACGGGCAGACCTGGCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 195 (SUCA126)
TTTGCAAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCACGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAGCTCCCCCCTCTACGCGACTATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCATTTGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCGGGCCACCACTACCCGAGGCACGCGCGGGCAGACCTGTCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC SEQ ID NO: 196 (SUCA130)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAACTCCCCCCTCTACGCGACTATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCATTTGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTGGACCTTTAAC ACTCGGGCCACCACTACCCGAGGCACGCACGGGCAGACCTGGCTGACCTGCTTGGCTCGTCGTTGTTTCACAAC GGCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCATCAAGTC SEQ ID NO: 197 (SUCA133)
TTTGCGAAGGACTTGCTTGACTGGCGACGTCTTCCCGACCTTGCAATAGACCCCCGTTTTAAGGGACCGCGCTTTG CAAAAAGCGACCTTCCGCCGCTACGCAATTAAGGCTACGAGTTTCTCTACCAGGCGGTACGCCCGTTGTCGCCGT GGGCGCTTCACCACAACCCTTACCGCGTGGCACCAGCGGACTTGCATGACCAGTTGCACGACCCATTTTTTGGAG TCCTAGACAAACTGCTCAAACGGCCATTTGTATTTCTTGTAAACCCGTGGCCGCTGCACTTCATGGTATACCCGAA GAGCAGCCTATAGCTTTGGCTTCCGCCTGACCAAGTGGACCGCGACCGCAAATTGGGCAGTGTGGACCTTTAGCA GTCGGGCCATCACTACCCCAGACACGCGCGTGCAGAGCTGGCCGAGCTGCTTGGCTCGTCGTTGTTTCACAACG GTTAGTGGTAAGTACCACTACGTCGTCAATGGCCCGTCCCGCACCAAGTC
SEQ ID NO: 198 (SUCA134)
TTTGCGAAGGACTTGCTTGACTGGCGACGTCTTCCCGACCTTGCAATAGACCCCCGTTTTAAGGGACCGCGCTTTG CAAAAAGCGACCTTCCGCCGCTACGCAATTAAGGCTACGAGTTTCTCTACCAGGCGGTACGCCCGTTGTCGCCGT GGGCGCTTCACCACAACCCTTACCGCGTGGCACCAGCGGACTTGGATGACCAGTTGCACGACCCATTTTTTGGAG TCCTAGACAAACTGCTCAAACGGCCATTTGTATTTCTTGTAAACCCGTGGCCGCTGCACTTCATGGTATACCCGAA GAGCAGCCTATAGCTTTGGCTTCCGCCTGACCAAGTGGACCGCGACCGCAAATTGGGCAGTGTGGACCTTTAGCA GTCGGGCCATCACTACCCCAGACACGCGCGTGCAGAGCTGGCCGAGCTGCTTGGCTCGTCGTTGTTTCACAACG GTTAGTGGTAAGTACCACTACGTCGTCAATGGCCCGTCCCGCACCAAGTC
SEQ ID NO: 199 (SUCA135)
TTTGCGAAGGACTTGCTTGACTGGCGACGTCTTCCCGACCTTGCAATAGACCCCCGTTTTAAGGGACCGCGCTTTG CAAAAAGCGACCTTCCGCCGCTACGCAATTAAGGCTACGAGTTTCTCTACCAGGCGGTACGCCCGTTGTCGCCGT GGGCGCTTCACCACAACCCTTACCGCGTGGCACCAGCGGACTTGCATGACCAGTTGCACGACCCATTTTTTGGAG TCCTAGACAAACTGCTCAAACGGCCATTTGTATTTCTTGTAAACCCGTGGCCGCTGCACTTCATGGTATACCCGAA GAGCAGCCTATAGCTTTGGCTTCCGCCTGACCAAGTGGACCGCGACCGCAAATTGGGCAGCGTGGACCTTTAGCA GTCGGGCCATCACTACCCCAGACACGCGCGTGCAGAGCTGGCCGAGCTGCTTGGCTCGTCGTTGTTTCACAACG GTTAGTGGTAAGTACCACTACGTCGTCAATGGCCCGTCCCGCACCAAGTC
SEQ ID NO: 200 (SUCA136)
TTTGCGAAGGACTTGCTTGACTGGCGACGTCTTCCCGACCTTGCAATAGACCCCCGTTTTAAGGGACCGCGCTTTG CAAAAAGCGACCTTCCGCCGCTACGCAATTAAGGCTACGAGTTTCTCTACCAGGCGGTACGCCCGTTGTCGCCGT GGGCGCTTCACCACAACCCTTACCGCGTGGCACCAGCGGACTTGCATGACCAGTTGCACGACCCATTTTTTGGAG TCCTAGACAAACTGCTCAAACGGCCATTTGTATTTCTTGTAAACCCGTGGCCGCTGCACTTCATGGTATACCCTAAG AGCAGCCTATAGCTTTGGCTTCCGCCTGACCAAGTGGACCGCGACCGCAAATTGGGCAGCGTGGACCTTTAGCAG TCGGGCCATCACTACCCCAGACACGCGCGTGCAGAGCTGGCCGAGCTGCTTGGCTCGTCGTTGTTTCACAACGGT TAGTGGTAAGTACCACTACGTCGTCAATGGCCCGTCCCGCACCAAGTC
SEQ ID NO: 201 (SUCA137)
TTTGCGAAGGACTTGCTTGACTGGCGACGTCTTCCCGACCTTGCAATAGACCCCCGTTTTAAGGGACCGCGCTTTG CAAAAAGCGACCTTCCGCCGCTACGCAATTAAGGCTACGAGTTTCTCTACCAGGCGGTACGCCCGTTGTCGCCGT GGGCGCTTCACCACAACCCTTACCGCGTGGCACCAGCGGACTTGCATGACCAGTTGCACGACCCATTTTTTGGAG TCCTAGACAAACTGCTCAAACGGCCATTTGTATTTCTTGTAAACCCGTGGCCGCTGCACTTCATGGTATACCCGAA GAGCAGCCTATAGCTTTGGCTTCCGCCTGACCAAGTGGACCGCGACCGCAAATTGGGCAGCGTGGACCTTTAGCA GTCGGGCCATCACTACCCCAGACACGCGCGTGCAGAGCTGGCAGAGCTGCTTGGCTCGTCGTTGTTTCACAACG GTTAGTGGTAAGTACCACTACGTCGTCAATGGCCCGTCCCGCACCAAGTC SEQ ID NO: 202 (SUCA138)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAACTCCCCCCTCTACGCGACTATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCATTTGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCAACCGCAAATTGGGTAGCGTGGACCTTTAAC ACTCGGGCCACCACTACCCGAGGCACGCACGGGCAGACCTGGCTGACCTGCTTGGCTCGTCGTTGTTTCACAAC GGCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 203 (SUCA10)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTTGAGGGGGGAGATGCGCTGATACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGTAAACATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGCCCGGTGGTGATGGGCTCCGTGCGCGCCCGTCTGGACAGACTGGACGAACCGAGCAGCAACAAAGTGT TGCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 204 (SUCA100)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCGGGGGCGAA ACGTTTCTCGCTCGAGGGGGGAGATGCGCTGATACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCG GCACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAA CCGCAGGATCTGTTCGACGAATTTGCCGGTAAGCATAAAGAGCATCTGGGTACCGGCGACGTGAAGTATCACATG GGCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAA ATTGTGAGCCCGGTGGTGATGGGCTCCGTGCGCGCCCGTCTGGACCGACTGGACGAACCGAGCAGTAATAAAGT GCTGCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 205 (SUCA101 )
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTCGAGGGGGGAGATGCGCTGATACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAATTTGCCGGTAAGCATAAAGAGCATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGCCCGGTGGTGATGGGCTCCGTGCGCGCCCGTCTGGACAGACTGGACGAACCGAGCAGCAACAAAGTGT TGCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG SEQ ID NO: 206 (SUCA102)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTTGAGGGGGGAGATGCGCTGATACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGTAAACATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGCCCGGTGGTGATGGGCTCTGTGCGTGCCCGTCTGGACCGACTGGACGAACCGAGCAGCAACAAAGTGT TGCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 207 (SUCA103)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTCGAGGGGGGAGATGCGCTGATACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTAGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAATTTGCCGGTAAGCATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGCCCGGTGGTGATGGGCTCCGTGCGCGCCCGTCTGGACAGACTGGACGAACCGAGCAGCAACAAAGTGT TGCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 208 (SUCA104)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTCGAGGGGGGAGATGCGCTGATACCTATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGGT ACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAACC GCAGGATCTGTTCGACGAGTTTGCCGGTAAGCATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGGG CTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAATT GTGAGCCCGGTGGTGATGGGCTCCGTGCGCGCCCGTCTGGATAGACTGGACGAACCGAGCAGCAACAAAGTGTT GCCGATCACTATTCACGGCGACGCTGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 209 (SUCA105)
AAACGCTTCCTGAACGAGCTGACCGCTGCTGAAGGGCTGGAACGTTATCTGGGTGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTCGAGGGGGGAGATGCGCTGATACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGTAAGCATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGCCCGGTGGTGATGGGCTCCGTGCGCGCCCGTCTGGACCGACTGGACGAACCGAGCAGCAACAAAGTGT TGCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 210 (SUCA106)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTCGAGGGGGGAGATGCGCTGATACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTACTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAATTTGCCGGTAAGCATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGCCCGGTGGTGATGGGCTCCGTGCGCGCCCGTCTGGACAGACTGGACGAACCGAGCAGCAACAAAGTGT TGCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 21 1 (SUCA107)
AAACGCTTCCTGAACGAACTGACCGCTGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTCGAGGGGGGAGATGCGCTGATACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAATTTGCCGGTAAGCATAAAGAGCATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGCCCGGTGGTGATGGGCTCCGTGCGTGCCCGTCTGGACCGACTGGACGAACCGAGCAGCAACAAAGTGT TGCCGATCACTATTCACGGCGACGCTGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 212 (SUCA108)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGTGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTCGAGGGGGGAGATGCGTTGATACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGTGGC ACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAACC GCAGGATCTGTTCGACGAATTTGCCGGTAAGCATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGGG CTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAATT GTGAGCCCGGTGGTGATGGGCTCCGTGCGCGCCCGTCTGGACAGACTGGACGAACCGAGCAGCAACAAAGTGTT GCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 213 (SUCA10)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAACTCCCCCCTCTACGCGACTATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCATTTGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCGGGCCACCACTACCCGAGGCACGCGCGGGCAGACCTGTCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 214 (SUCA100)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGCCCCCGCTTT GCAAAGAGCGAGCTCCCCCCTCTACGCGACTATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTTAAACGGCCATTCGTATTTCTCGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCGGGCCACCACTACCCGAGGCACGCGCGGGCAGACCTGGCTGACCTGCTTGGCTCGTCATTATTTCACGAC GGCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC SEQ ID NO: 215 (SUCA101 )
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAGCTCCCCCCTCTACGCGACTATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTTAAACGGCCATTCGTATTTCTCGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCGGGCCACCACTACCCGAGGCACGCGCGGGCAGACCTGTCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 216 (SUCA102)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAACTCCCCCCTCTACGCGACTATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCATTTGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCGGGCCACCACTACCCGAGACACGCACGGGCAGACCTGGCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 217 (SUCA103)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAGCTCCCCCCTCTACGCGACTATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGATCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTTAAACGGCCATTCGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCGGGCCACCACTACCCGAGGCACGCGCGGGCAGACCTGTCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC SEQ ID NO: 218 (SUCA104)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAGCTCCCCCCTCTACGCGACTATGGATACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCA TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCATTCGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCGGGCCACCACTACCCGAGGCACGCGCGGGCAGACCTATCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGATAAGTGCCGCTGCGACGCCACTGGCCGGTCCCGCACCAAGTC SEQ ID NO: 219 (SUCA105)
TTTGCGAAGGACTTGCTCGACTGGCGACGACTTCCCGACCTTGCAATAGACCCACGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAGCTCCCCCCTCTACGCGACTATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCATTCGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCGGGCCACCACTACCCGAGGCACGCGCGGGCAGACCTGGCTGACCTGCTTGGCTCGTCGTTGTTTCACAAC GGCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 220 (SUCA106)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAGCTCCCCCCTCTACGCGACTATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCATGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTTAAACGGCCATTCGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCGGGCCACCACTACCCGAGGCACGCGCGGGCAGACCTGTCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 221 (SUCA107)
TTTGCGAAGGACTTGCTTGACTGGCGACGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAGCTCCCCCCTCTACGCGACTATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTTAAACGGCCATTCGTATTTCTCGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCGGGCCACCACTACCCGAGGCACGCACGGGCAGACCTGGCTGACCTGCTTGGCTCGTCGTTGTTTCACAAC GGCTAGTGATAAGTGCCGCTGCGACGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 222 (SUCA108)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCACGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAGCTCCCCCCTCTACGCAACTATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCACCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTTAAACGGCCATTCGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCGGGCCACCACTACCCGAGGCACGCGCGGGCAGACCTGTCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 223 (DNAN104)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGTGCTACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGTTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAGGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGC GTGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG SEQ ID NO: 224 (DNAN105)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGTTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTCGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAGGCGTCTTTACCTAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGTGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 225 (DNAN106)
ATGGAGATGGTCGCGCGCGTGACGCTTTCTCAGCCGCATGAGCCGGGTGCTACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGTGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 226 (DNAN107)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGTTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAGGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGC GTGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 227 (DNAN108)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGTCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTAGCGGTGTGCTCAATGCCGCTGGAGACGTCTTTACCTAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGTGGCGAAAACCCGCTGCGAGTGCAG SEQ ID NO: 228 (DNAN109)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGTGCTACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCTCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACTGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAGGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGC GTGATTGAACTGATGCGTATGCTTGACGGTGGCGAAAACCCGTTGCGCGTGCAG
SEQ ID NO: 229 (DNAN1 1 )
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGTGCTACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGTGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAGGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGC GTGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 230 (DNAN1 12)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGTGCTACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 231 (DNAN12)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAGGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGC GTGATTGAACTGATGCGTATGCTTGACGGTGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 232 (DNAN125)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAGGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGTTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAAGCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTTGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAGGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGC GTGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 233 (DNAN104)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCACGATGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACAACCACGC AAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCGC TTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTCCT ACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCGGT GGCAGACCGCCACACGAGTTACGGCGACCTCCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCCGC ACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 234 (DNAN105)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCAACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAGCTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCG GTGGCAGACCGCCACACGAGTTACGGCGACCTCCGCAGAAATGGATCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCACCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 235 (DNAN106)
TACCTCTACCAGCGCGCGCACTGCGAAAGAGTCGGCGTACTCGGCCCACGATGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCACTAGCCTACGACCACGC AAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCGC TTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTCCT ACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCGGT GGCAGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCCGC ACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 236 (DNAN107)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCAACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCG GTGGCAGACCGCCACACGAGTTACGGCGACCTCCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC SEQ ID NO: 237 (DNAN108)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCAGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCG GTGGCAGATCGCCACACGAGTTACGGCGACCTCTGCAGAAATGGATCGGTGAGCCACTAACACGGCGCATTTCCG CACTAACTTGACTACGCATACGAGCTGCCACCGCTTTTGGGCGACGCTCACGTC
SEQ ID NO: 238 (DNAN109)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCACGATGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGAGTAGTCC TACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGACTGCCGGT GGCAGACCGCCACACGAGTTACGGCGACCTCCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCCGC ACTAACTTGACTACGCATACGAACTGCCACCGCTTTTGGGCAACGCGCACGTC
SEQ ID NO: 239 (DNAN1 1 )
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCACGATGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCACTAGCCTACGACCACGC AAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCGC TTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTCCT ACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCGGT GGCAGACCGCCACACGAGTTACGGCGACCTCCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCCGC ACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC SEQ ID NO: 240 (DNAN1 12)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCACGATGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCG GTGGCAGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 241 (DNAN12)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCG GTGGCAGACCGCCACACGAGTTACGGCGACCTCCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAACTGCCACCGCTTTTGGGCGACGCGCACGTC SEQ ID NO: 242 (DNAN125)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTCCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCAACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTCGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAACGCTGGCTGCCG GTGGCAGACCGCCACACGAGTTACGGCGACCTCCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 243 (SUCA246)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCCGGTGCGAAA CGTTTCTCGCTGGAAGGGGGAGATGCTCTGGTACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGTGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGGAAGCATAAAGAACATCTTGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGTCCGGTGGTGATGGGATCTGTGCGTGCCCGTCTGGATAGACTGGACGAACCGAGCAGCAACAAAGTGTT GCCGATCACCATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 244 (SUCA258)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCCGGTGCGAAA CGTTTCTCGCTGGAAGGGGGAGATGCTCTGGTACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGTGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGGAAGCATAAAGAACATCTTGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCTCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGTCCGGTGGTGATGGGATCTGTGCGTGCCCGTCTGGACAGACTGGACGAACCGAGCAGCAACAAAGTGTT GCCGATCACCATTCACGGCGACGCGGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 245 (SUCA261 )
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCCGGTGCGAAA CGTTTCTCGCTGGAAGGGGGAGATGCTCTGGTACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGTGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGGAAGCATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGTCCGGTGGTGATGGGATCTGTGCGTGCCCGTCTGGATAGACTGGACGAACCGAGCAGCAACAAAGTGTT GCCGATTACCATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 246 (SUCA262)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCCGGTGCGAAA CGTTTCTCGCTGGAAGGGGGAGATGCTCTGGTACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGGAAGCATAAAGAACATCTGGGTACCGGCGACGTGAAATATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCTCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGTCCGGTGGTGATGGGATCTGTGCGTGCCCGTCTGGATAGACTGGACGAACCGAGCAGCAACAAAGTGTT GCCGATCACCATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG SEQ ID NO: 247 (SUCA270) AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCCGGTGCGAAA CGTTTCTCGCTGGAAGGGGGAGATGCTCTGGTACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGTGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGGAAGCATAAAGAACATCTTGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGTCCGGTGGTGATGGGATCTGTGCGTGCCCGTCTGGACAGACTGGACGAACCGAGCAGCAACAAAGTGTT GCCGATCACCATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 248 (SUCA293)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCCGGTGCGAAA CGTTTCTCGCTGGAAGGGGGAGATGCTCTGGTACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGTGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGGAAGCATAAAGAACATCTTGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCTCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGTCCGGTGGTGATGGGATCTGTGCGTGCCCGTCTGGATAGACTGGACGAACCGAGCAGCAACAAAGTGTT GCCGATCACCATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 249 (SUCA302)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCCGGTGCGAAA CGTTTCTCGCTGGAAGGGGGAGATGCTCTGGTACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGTGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGGAAGCATAAAGAACATCTTGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCTCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGTCCGGTGGTGATGGGATCTGTGCGTGCCCGTCTGGATAGACTGGACGAACCGAGCAGCAACAAAGTGTT GCCGATCACCATTCACGGCGACGCGGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 250 (SUCA378)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGCGCCAAATTCCCCGGTGCGAAA CGTTTCTCGCTGGAAGGGGGAGATGCTCTGGTACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGTGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAGTTTGCCGGGAAGCATAAAGAACATCTTGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCTCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGTCCGGTGGTGATGGGATCTGTGCGTGCCCGTCTGGACAGACTGGACGAACCGAGCAGCAACAAAGTGTT GCCGATCACCATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 251 (SUCA40)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGTGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTCGAGGGGGGAGATGCGCTGATACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTGGGGATGGCGCACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAATTTGCCGGGAAGCATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGCCCGGTGGTGATGGGCTCCGTGCGCGCCCGTCTGGACAGACTGGACGAACCGAGCAGCAACAAAGTGT TGCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG SEQ ID NO: 252 (SUCA52)
AAACGCTTCCTGAACGAACTGACCGCCGCTGAAGGGCTGGAACGTTATCTGGGTGCCAAATTCCCGGGTGCGAAA CGTTTCTCGCTCGAGGGGGGAGATGCGCTGATACCCATGCTGAAAGAGATGGTTCGCCATGCGGGTAACAGCGG CACTCGCGAAGTGGTGCTGGGGATGGCACACCGCGGTCGCCTGAACGTGCTGATCAACGTACTGGGTAAAAAAC CGCAGGATCTGTTCGACGAATTTGCCGGGAAGCATAAAGAACATCTGGGTACCGGCGACGTGAAGTATCACATGG GCTTCTCGTCAGATATCGAAACCGAAGGCGGTCTGGTTCACCTGGCGCTGGCGTTTAACCCATCGCATCTGGAAAT TGTGAGCCCGGTGGTGATGGGCTCCGTGCGCGCCCGTCTGGACAGACTGGACGAACCGAGCAGCAACAAAGTGT TGCCGATCACTATTCACGGCGACGCCGCGGTGACCGGCCAGGGCGTGGTTCAG
SEQ ID NO: 253 (SUCA246)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGGCCACGCTTT GCAAAGAGCGACCTTCCCCCTCTACGAGACCATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCACTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCCTTCGTATTTCTTGTAGAACCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCAGGCCACCACTACCCTAGACACGCACGGGCAGACCTATCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGGTAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 254 (SUCA258)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGGCCACGCTTT GCAAAGAGCGACCTTCCCCCTCTACGAGACCATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCACTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCCTTCGTATTTCTTGTAGAACCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGAGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCAGGCCACCACTACCCTAGACACGCACGGGCAGACCTGTCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGGTAAGTGCCGCTGCGCCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 255 (SUCA261 )
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGGCCACGCTTT GCAAAGAGCGACCTTCCCCCTCTACGAGACCATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCACTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCCTTCGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCAGGCCACCACTACCCTAGACACGCACGGGCAGACCTATCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAATGGTAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC SEQ ID NO: 256 (SUCA262)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGGCCACGCTTT GCAAAGAGCGACCTTCCCCCTCTACGAGACCATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCCTTCGTATTTCTTGTAGACCCATGGCCGCTGCACTTTATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGAGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCAGGCCACCACTACCCTAGACACGCACGGGCAGACCTATCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGGTAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 257 (SUCA270)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGGCCACGCTTT GCAAAGAGCGACCTTCCCCCTCTACGAGACCATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCACTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCCTTCGTATTTCTTGTAGAACCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCAGGCCACCACTACCCTAGACACGCACGGGCAGACCTGTCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGGTAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 258 (SUCA293)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGGCCACGCTTT GCAAAGAGCGACCTTCCCCCTCTACGAGACCATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCACTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCCTTCGTATTTCTTGTAGAACCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGAGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCAGGCCACCACTACCCTAGACACGCACGGGCAGACCTATCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGGTAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 259 (SUCA302)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGGCCACGCTTT GCAAAGAGCGACCTTCCCCCTCTACGAGACCATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCACTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCCTTCGTATTTCTTGTAGAACCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGAGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCAGGCCACCACTACCCTAGACACGCACGGGCAGACCTATCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGGTAAGTGCCGCTGCGCCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 260 (SUCA378)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCGCGGTTTAAGGGGCCACGCTTT GCAAAGAGCGACCTTCCCCCTCTACGAGACCATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCACTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTCAAACGGCCCTTCGTATTTCTTGTAGAACCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGAGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCAGGCCACCACTACCCTAGACACGCACGGGCAGACCTGTCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGGTAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC SEQ ID NO: 261 (SUCA40)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCACGGTTTAAGGGCCCACGCTTT GCAAAGAGCGAGCTCCCCCCTCTACGCGACTATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG TGAGCGCTTCACCACGACCCCTACCGCGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC GTCCTAGACAAGCTGCTTAAACGGCCCTTCGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC ACTCGGGCCACCACTACCCGAGGCACGCGCGGGCAGACCTGTCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG GCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NO: 262 (SUCA52)
TTTGCGAAGGACTTGCTTGACTGGCGGCGACTTCCCGACCTTGCAATAGACCCACGGTTTAAGGGCCCACGCTTT
GCAAAGAGCGAGCTCCCCCCTCTACGCGACTATGGGTACGACTTTCTCTACCAAGCGGTACGCCCATTGTCGCCG
TGAGCGCTTCACCACGACCCCTACCGTGTGGCGCCAGCGGACTTGCACGACTAGTTGCATGACCCATTTTTTGGC
GTCCTAGACAAGCTGCTTAAACGGCCCTTCGTATTTCTTGTAGACCCATGGCCGCTGCACTTCATAGTGTACCCGA
AGAGCAGTCTATAGCTTTGGCTTCCGCCAGACCAAGTGGACCGCGACCGCAAATTGGGTAGCGTAGACCTTTAAC
ACTCGGGCCACCACTACCCGAGGCACGCGCGGGCAGACCTGTCTGACCTGCTTGGCTCGTCGTTGTTTCACAACG
GCTAGTGATAAGTGCCGCTGCGGCGCCACTGGCCGGTCCCGCACCAAGTC
SEQ ID NOs 263 (comp Tartrate)
GTGAAAAAGGGTATGTTTGTACTGCTATCGCTTGTGGTTATTTGTATTCCATTCCCATTACCCAAGGTATAAATATTA
ATAGGTTAGTTACTATATACTATAACATAGGTTACAACCGTCCCTCTTTAATAAGGGTATGTTTTTGATTCAGTTTAG
CAAAGAGTCCTTTCTACGTCCTCACCCTAGATGTAGTTCTAGCACCAATCTAGCAATGACCTGCACTAATTATCTTA
ACTTCTTAACCAACTTCGCGGACGCTACGAGTGCGTCCGCTTTTCTAGTCCGTCTTCCCAGTGGTTGTATCGCCCA
GTCGTATAAGAGGTAACTCGCTTATTACACAAGCGCGTACGCGACCGCAC SEQ ID NOs 264 (forward Tartrate)
CACTTTTTCCCATACAAACATGACGATAGCGAACACCAATAAACATAAGGTAAGGGTAATGGGTTCCATATTTATAAT TATCCAATCAATGATATATGATATTGTATCCAATGTTGGCAGGGAGAAATTATTCCCATACAAAAACTAAGTCAAATC GTTTCTCAGGAAAGATGCAGGAGTGGGATCTACATCAAGATCGTGGTTAGATCGTTACTGGACGTGATTAATAGAA TTGAAGAATTGGTTGAAGCGCCTGCGATGCTCACGCAGGCGAAAAGATCAGGCAGAAGGGTCACCAAC ATAGCGGGTCAGCATATTCTCCATTGAGCGAATAATGTGTTCGCGCATGCGCTGGCGTGC
SEQ ID NO: 265 (HISD59)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAACGTCAGGTCAGCCAACGCCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCGCAGGTGATTCTGCTGACGCCTGATG CTGACATTGCCTGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCAGAACTGCCGCGCGCGGACACCGCCAG GCAGGCACTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 266 (HISD61 )
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAACGTCAGGTCAGCCAACGCCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTTGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCGCAGGTGATTCTGCTGACGCCTGATG CTGACATTGCCTGCAAGGTGGAGGAGGCGGTAGAACGTCAACTGGCAGAACTGCCGCGCGCGGACACCGCCAGG CAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC SEQ ID NO: 267 (HISD7)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAACGTCAGGTCAGCCAACGCCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCGCAGGTGATTCTGCTGACGCCTGATG CTGACATTGCCTGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCAGAACTGCCGCGCGCGGACACCGCCAG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 268 (HISD59)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTTGCAGTCCAGTCGGTTGCGGAG CTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCCT AAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGCGTCCACTAAGACGACTGCGGACTAC GACTGTAACGGACGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGTCTTGACGGCGCGCGCCTGTGGCGGTCC GTCCGTGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG SEQ ID NO: 269 (HISD61 )
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTTGCAGTCCAGTCGGTTGCGGAG CTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCCT AAAGCAACGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGCGTCCACTAAGACGACTGCGGACTAC GACTGTAACGGACGTTCCACCTCCTCCGCCATCTTGCAGTTGACCGTCTTGACGGCGCGCGCCTGTGGCGGTCCG TCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 270 (HISD7)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTTGCAGTCCAGTCGGTTGCGGAG CTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCCT AAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGCGTCCACTAAGACGACTGCGGACTAC GACTGTAACGGACGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGTCTTGACGGCGCGCGCCTGTGGCGGTCC GTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG SEQ ID NO: 271 (HISD10)
ATTGCGGGATGCCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCGCAGGTGATCCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAGCTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 272 (HISD100)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTGCTGGTGATCGCCGACAGCGGCGCAACACCG GATTTCGTCGCTTCTGACCTGCTCTCCCAGGCAGAACACGGTCCGGATTCGCAGGTGATCCTGCTGACGCCTGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTTGCAGAACTGCCGCGCGCGGACACCGCCC GGCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 273 (HISD101 )
ATTGCGGGATGTCAGAACGTGGTTCTGTGCACGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCCCAGGTGATTCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAGCTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTAATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 274 (HISD102)
ATTGCGGGATGCCAGAACGTGATTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAACGCCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCGCAGGTGATCCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTTGCAGAACTGCCGCGCGCGGACACTGCCCGG CAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 275 (HISD103)
ATTGCGGGATGCCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGCGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCAGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAACGTCAGGTCAGCCAACGCCT CGACGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCAGAACACGGTCCGGATTCGCAGGTGATCCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC SEQ ID NO: 276 (HISD104) ATTGCGGGATGCCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATTCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAACGCCT CGAGGGCGCGGCTATCGATATGCCAGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCG GATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCGCAGGTGATCCTGCTGACGCCTGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTTGCAGAACTGCCGCGCGCGGACACCGCCC GGCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 277 (HISD105)
ATTGCAGGATGCCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCACAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTGCTGGTGATCGCCGACAGCGGCGCAACGCCG GATTTTGTCGCATCTGACCTGCTCTCCCAGGCAGAACACGGCCCGGATTCCCAGGTGATTTTGCTGACGCCGGAT GCTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAACTGCCGCGCGCGGACACCGCCC GGCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 278 (HISD106)
ATTGCGGGATGCCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCAGAACACGGTCCGGATTCGCAGGTGATCCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCAGAACTGCCGCGCGCGGACACCGCCAG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 279 (HISD107)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCCCAGGTGATTCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGCGGTAGAACGTCAACTGGCGGAGCTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTGATTGTGACCAAAGATTTAGCGCAGTGCGTC
SEQ ID NO: 280 (HISD108)
ATTGCGGGATGTCAGAACGTGGTTCTGTGCTCGCCGCCGCCCATCGCTGATGAAATCCTCTATGCGGCGCAACTG TGTGGCGTGCAGGAAATCTTTAACGTCGGCGGCGCGCAGGCGATTGCCGCTCTGGCCTTCGGCAGCGAGTCCGT ACCGAAAGTGGATAAAATTTTTGGCCCCGGCAACGCCTTTGTAACCGAAGCCAAGCGTCAGGTCAGCCAGCGCCT CGACGGCGCGGCTATCGATATGCCTGCCGGGCCGTCTGAAGTACTGGTGATCGCCGACAGCGGCGCAACACCGG ATTTCGTCGCTTCTGACCTGCTCTCCCAGGCTGAGCACGGTCCGGATTCCCAGGTGATTCTGCTGACGCCTGATG CTGACATTGCCCGCAAGGTGGCGGAGGGGGTAGAACGTCAACTGGCGGAGCTGCCGCGCGCGGACACCGCCCG GCAGGCCCTGAGCGCCAGTCGTCTAATTGTGACCAAAGATTTAGCGCAGTGCGTC SEQ ID NO: 281 (HISD10)
TAACGCCCTACGGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGA CACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGC ATGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGG AGCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGC CTAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGCGTCCACTAGGACGACTGCGGACT ACGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTCGACGGCGCGCGCCTGTGGCGGG CCGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 282 (HISD100)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGGA GCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCACGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGTCTTGTGCCAGGCCTAAGCGTCCACTAGGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGAACGTCTTGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 283 (HISD101 )
TAACGCCCTACAGTCTTGCACCAAGACACGTGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGGA GCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGGGTCCACTAAGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTCGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGATTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 284 (HISD102)
TAACGCCCTACGGTCTTGCACTAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTTGCGGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGCGTCCACTAGGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGAACGTCTTGACGGCGCGCGCCTGTGACGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 285 (HISD103)
TAACGCCCTACGGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGA CACGCCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGTCGAGACCGGAAGCCGTCGCTCAGGC ATGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTTGCAGTCCAGTCGGTTGCGGA GCTGCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGTCTTGTGCCAGGCCTAAGCGTCCACTAGGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG SEQ ID NO: 286 (HISD104)
TAACGCCCTACGGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAAGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTTGCGGA GCTCCCGCGCCGATAGCTATACGGTCGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGCGTCCACTAGGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGAACGTCTTGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 287 (HISD105)
TAACGTCCTACGGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGTGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGGA GCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCACGACCACTAGCGGCTGTCGCCGCGTTGCGGCC TAAAACAGCGTAGACTGGACGAGAGGGTCCGTCTTGTGCCGGGCCTAAGGGTCCACTAAAACGACTGCGGCCTAC GACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTTGACGGCGCGCGCCTGTGGCGGGCC GTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 288 (HISD106)
TAACGCCCTACGGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGA CACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGC ATGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGG AGCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGC CTAAAGCAGCGAAGACTGGACGAGAGGGTCCGTCTTGTGCCAGGCCTAAGCGTCCACTAGGACGACTGCGGACT ACGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGTCTTGACGGCGCGCGCCTGTGGCGGT CCGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 289 (HISD107)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGGA GCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGGGTCCACTAAGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCGCCATCTTGCAGTTGACCGCCTCGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGACTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 290 (HISD108)
TAACGCCCTACAGTCTTGCACCAAGACACGAGCGGCGGCGGGTAGCGACTACTTTAGGAGATACGCCGCGTTGAC ACACCGCACGTCCTTTAGAAATTGCAGCCGCCGCGCGTCCGCTAACGGCGAGACCGGAAGCCGTCGCTCAGGCA TGGCTTTCACCTATTTTAAAAACCGGGGCCGTTGCGGAAACATTGGCTTCGGTTCGCAGTCCAGTCGGTCGCGGA GCTGCCGCGCCGATAGCTATACGGACGGCCCGGCAGACTTCATGACCACTAGCGGCTGTCGCCGCGTTGTGGCC TAAAGCAGCGAAGACTGGACGAGAGGGTCCGACTCGTGCCAGGCCTAAGGGTCCACTAAGACGACTGCGGACTA CGACTGTAACGGGCGTTCCACCGCCTCCCCCATCTTGCAGTTGACCGCCTCGACGGCGCGCGCCTGTGGCGGGC CGTCCGGGACTCGCGGTCAGCAGATTAACACTGGTTTCTAAATCGCGTCACGCAG
SEQ ID NO: 291 (PURE10)
AGCGACTGGGCTACCATGCAATTCGCCGCCGAAATTTTTGAAATTCTGGATGTCCCGCACCATGTAGAAGTGGTTT CCGCCCATCGCACCCCCGATAAGCTGTTCAGCTTCGCCGAAACGGCGGAAGAGAACGGATATCAAGTGATTATTG CCGGCGCGGGCGGCGCGGCACATCTGCCGGGAATGATTGCGGCAAAAACGCTGGTCCCGGTACTCGGCGTACC GGTACAAAGCGCTGCGCTAAGCGGTGTGGATAGCCTCTACTCCATCGTGCAGATGCCGCGCGGCATTCCGGTGG GTACGCTGGCGATCGGTAAAGCCGGTGCCGCTAACGCCGCACTGCTGGCAGCGCAGATTCTGGCGCAACACGAC GCGGAACTGCATCAGCGCATTGCCGAC
SEQ ID NO: 292 (PURE100)
AGCGACTGGGCTACCATGCAATTCGCCGCCGAAATTTTTGAAATTCTGGATGTCCCGCACCATGTAGAAGTGGTTT CCGCCCATCGCACCCCCGATAAGCTGTTCAGCTTCGCCGAAACGGCGGAAGAGAACGGATATCAAGTGATTATTG CCGGCGCGGGCGGCGCGGCACACCTGCCGGGAATGATTGCGGCAAAAACGCTGGTCCCGGTACTCGGCGTACC GGTACAAAGCGCTGCGCTCAGCGGCGTGGATAGCCTCTACTCCATCGTGCAGATGCCGCGCGGCATTCCGGTGG GTACACTGGCGATCGGTAAAGCCGGTGCCGCTAACGCCGCCCTGCTCGCCGCGCAGATTCTGGCGCAACACGAC GCAGAACTGCATCAGCGCATCGCCGAC SEQ ID NO: 293 (PURE101 )
AGCGACTGGGCTACCATGCAATTCGCCGCCGAAATTTTTGAAATTCTGGATGTCCCGCACCATGTAGAAGTGGTTT CCGCCCATCGCACCCCCGATAAACTGTTCAGTTTCGCCGAAACGGCGGAAGAGAACGGATATCAAGTGATTATTG CCGGGGCGGGCGGCGCGGCACACCTGCCGGGAATGATTGCGGCAAAAACGCTGGTCCCGGTACTCGGCGTGCC GGTACAAAGCGCTGCGCTAAGCGGCGTGGATAGCCTCTACTCCATCGTGCAGATGCCGCGCGGCATTCCGGTGG GTACGCTGCGATCGGTAAAGCCGGTGCCGCTAACGCCGCCCTGCTCGCCGCGCAGATTCTGGCGCAACACGACG CGGAACTGCATCAGCGCATTGCCGAC
SEQ ID NO: 294 (PURE102)
AGCGACTGGGCTACCATGCAATTCGCCGCCGAAATTTTTGAAATTCTGGATGTCCCGCACCATGTAGAAGTGGTTT CCGCTCATCGCACCCCCGATAAGCTGTTCAGCTTCGCCGAAACGGCGGAAGAGAACGGATATCAAGTGATTATTG CCGGCGCGGGCGGCGCGGCGCACCTGCCGGGAATGATTGCGGCAAAAACGCTGGTCCCGGTACTCGGCGTACC GGTACAAAGCGCTGCGCTAAGCGGTGTGGATAGCCTCTACTCCATCGTGCAGATGCCGCGCGGCATTCCGGTGG GTACGCTGGCGATCGGTAAAGCCGGTGCCGCTAACGCCGCACTGCTGGCAGCGCAGATTCTGGCGCAACACGAC GCGGAACTGCATCAGCGCATTGCCGAC
SEQ ID NO: 295 (PURE103)
AGCGACTGGGCTACCATGCAATTCGCCGCCGAAATTTTTGAAATTCTGGATGTCCCGCACCATGTAGAAGTGGTTT CCGCCCATCGCACCCCCGATAAGCTGTTCAGCTTCGCCGAAACGGCGGAAGAGAACGGATATCAAGTGATTATTG CCGGCGCGGGCGGCGCGGCGCACCTGCCGGGAATGATTGCGGCAAAAACGCTGGTCCCGGTACTCGGCGTGCC GGTACAAAGCGCTGCGCTAAGCGGCGTGGATAGCCTCTACTCCATCGTGCAGATGCCGCGCGGCATTCCGGTGG GTACGCTGGCGATCGGTAAAGCCGGTGCCGCTAACGCCGCCCTGCTCGCCGCGCAGATTCTGGCGCAATACGAC GCGGAACTGCATCAGCGCATTGCCGAC
SEQ ID NO: 296 (PURE104)
AGCGACTGGGCTACCATGCAATTCGCCGCCGAAATTTTTGAAATTCTGGATGTCCCGCACCATGTAGAAGTGGTTT CCGCCCATCGCACCCCCGATAAGCTGTTCAGCTTCGCCGAAACGGCGGAAGAGAACGGATATCAAGTGATTATTG CCGGCGCGGGCGGCGCGGCACACCTGCCGGGAATGATTGCGGCAAAAACGCTGGTCCCGGTACTCGGCGTGCC GGTACAAAGCGCTGCGCTCAGCGGCGTGGATAGCCTCTACTCCATCGTGCAGATGCCGCGCGGCATTCCGGTGG GTACGCTGGCGATCGGTAAAGCCGGTGCCGCTAACGCCGCCCTGCTCGCCGCGCAGATTCTGGCGCAACGCGAC GCGGAACTGCATCAGCGCATCGCTGAC
SEQ ID NO: 297 (PURE105)
AGCGACTGGGCTACCATGCAATTCGCCGACGAAATTTTTGAAATTCTGGATGTCCCGCACCATGTAGAAGTGGTTT CCGCCCATCGCACCCCCGATAAACTGTTCAGCTTCGCCGAAACGGCGGAAGAGAACGGATATCAAGTGATTATTG CCGGCGCGGGCGGCGCGGCGCACCTGCCGGGAATGATTGCGGCAAAAACGCTGGTCCCGGTACTCGGCGTGCC GGTACAAAGCGCTGCGCTAAGCGGCGTGGATAGCCTCTACTCCATCGTGCAGATGCCGCGCGGCATTCCGGTGG GTACGCTGGCGATCGGTAAAGCCGGTGCCGCTAACGCCGCCCTGCTCGCCGCGCAGATTCTGGCGCAACACGAC GCGGAACTGCATCAGCGCATCGCCGAC SEQ ID NO: 298 (PURE106)
AGCGACTGGGCTACCATGCAATTCGCCGCCGAAATTTTTGAAATTCTGGATGTCCCGCACCATGTAGAAGTGGTTT CCGCCCATCGCACCCCCGATAAACTGTTCAGCTTCGCCGAAACGGCGGAAGAGAACGGATATCAAGTGATTATTG CCAGCGCGGGCGGCGCGGCGCACCTGCCGGGAATGATTGCGGCAAAAACGCTGGTCCCGGTACTCGGCGTGCC GGTACAAAGCGCTGCGCTCAGCGGCGTGGATAGCCTCTACTCCATCGTGCAGATGCCGCGCGGCATTCCGGTGG GTACGCTGGCGATCGGTAAAGCCGGTGCCGCTAACGCCGCCCTGCTCGCCGCGCAGATTCTGGCGCAACGCGAC GCGGAACTGCATCAGCGCATCGCTGAC SEQ ID NO: 299 (PURE107)
AGCGACTGGGCTACCATGCAATTCGCCGCCGAAATTTTTGAAATTCTGGATGTCCCGCACCATGTAGAAGTGGTTT CCGCCCATCGCACCCCCAATAAGCTGTTCAGCTTCGCCGAAACGGCGGAAGAGAACGGATATCAAGTGATTATTG CCGGCGCGGGCGGCGCGGCACACCTGCCGGGAATGATTGCGGCAAAAACGCTGGTCCCGGTACTCGGCGTGCC GGTACAAAGCGCTGCGCTAAGCGGCGTGGATAGCCTCTACTCCATCGTGCAGATGCCGCGCGGCATTCCGGTGG GTACGCTGGCGATCGGTAAAGCCGGTGCCGCTAACGCCGCCCTGCTCGCCGCGCAGATTCTGGCGCAACACGAC GCGGAACTGCATCAGCGCATTGCCGAC SEQ ID NO: 300 (PURE108)
AGCGACTGGGCTACCATGCAATTCGCCGCCGAAATTTTTGAAATTCTGGATGTCCCGCACCATGTAGAAGTGGTTT CCGCTCATCGCACCCCCGATAAACTGTTCAGCTTCGCCGAAACGGCGGAAGAGAACGGATATCAAGTGATTATTG CCGGCGCGGGCGGCGCGGCGCACCTGCCGGGAATGATTGCGGCAAAAACGCTGGTCCCGGTACTCGGCGTGCC GGTACAAAGCGCTGCGCTCAGCGGCGTGGATAGCCTCTACTCCATCGTGCAGATGCCGCGTGGCATTCCGGTGG GTACGCTGGCGATCGGTAAAGCCGGTGCCGCTAACGCCGCCCTGCTCGCCGCGCAGATTCTGGCGCAACGCGAC GCGGAACTGCATCAGCGCATCGCTGAC
SEQ ID NO: 301 (PURE10)
TCGCTGACCCGATGGTACGTTAAGCGGCGGCTTTAAAAACTTTAAGACCTACAGGGCGTGGTACATCTTCACCAAA GGCGGGTAGCGTGGGGGCTATTCGACAAGTCGAAGCGGCTTTGCCGCCTTCTCTTGCCTATAGTTCACTAATAAC GGCCGCGCCCGCCGCGCCGTGTAGACGGCCCTTACTAACGCCGTTTTTGCGACCAGGGCCATGAGCCGCATGGC CATGTTTCGCGACGCGATTCGCCACACCTATCGGAGATGAGGTAGCACGTCTACGGCGCGCCGTAAGGCCACCCA TGCGACCGCTAGCCATTTCGGCCACGGCGATTGCGGCGTGACGACCGTCGCGTCTAAGACCGCGTTGTGCTGCG CCTTGACGTAGTCGCGTAACGGCTG
SEQ ID NO: 302 (PURE100)
TCGCTGACCCGATGGTACGTTAAGCGGCGGCTTTAAAAACTTTAAGACCTACAGGGCGTGGTACATCTTCACCAAA GGCGGGTAGCGTGGGGGCTATTCGACAAGTCGAAGCGGCTTTGCCGCCTTCTCTTGCCTATAGTTCACTAATAAC GGCCGCGCCCGCCGCGCCGTGTGGACGGCCCTTACTAACGCCGTTTTTGCGACCAGGGCCATGAGCCGCATGGC CATGTTTCGCGACGCGAGTCGCCGCACCTATCGGAGATGAGGTAGCACGTCTACGGCGCGCCGTAAGGCCACCC ATGTGACCGCTAGCCATTTCGGCCACGGCGATTGCGGCGGGACGAGCGGCGCGTCTAAGACCGCGTTGTGCTGC GTCTTGACGTAGTCGCGTAGCGGCTG
SEQ ID NO: 303 (PURE101 )
TCGCTGACCCGATGGTACGTTAAGCGGCGGCTTTAAAAACTTTAAGACCTACAGGGCGTGGTACATCTTCACCAAA GGCGGGTAGCGTGGGGGCTATTTGACAAGTCAAAGCGGCTTTGCCGCCTTCTCTTGCCTATAGTTCACTAATAACG GCCCCGCCCGCCGCGCCGTGTGGACGGCCCTTACTAACGCCGTTTTTGCGACCAGGGCCATGAGCCGCACGGCC ATGTTTCGCGACGCGATTCGCCGCACCTATCGGAGATGAGGTAGCACGTCTACGGCGCGCCGTAAGGCCACCCAT GCGACGCTAGCCATTTCGGCCACGGCGATTGCGGCGGGACGAGCGGCGCGTCTAAGACCGCGTTGTGCTGCGC CTTGACGTAGTCGCGTAACGGCTG
SEQ ID NO: 304 (PURE102)
TCGCTGACCCGATGGTACGTTAAGCGGCGGCTTTAAAAACTTTAAGACCTACAGGGCGTGGTACATCTTCACCAAA GGCGAGTAGCGTGGGGGCTATTCGACAAGTCGAAGCGGCTTTGCCGCCTTCTCTTGCCTATAGTTCACTAATAAC GGCCGCGCCCGCCGCGCCGCGTGGACGGCCCTTACTAACGCCGTTTTTGCGACCAGGGCCATGAGCCGCATGG CCATGTTTCGCGACGCGATTCGCCACACCTATCGGAGATGAGGTAGCACGTCTACGGCGCGCCGTAAGGCCACC CATGCGACCGCTAGCCATTTCGGCCACGGCGATTGCGGCGTGACGACCGTCGCGTCTAAGACCGCGTTGTGCTG CGCCTTGACGTAGTCGCGTAACGGCTG SEQ ID NO: 305 (PURE103)
TCGCTGACCCGATGGTACGTTAAGCGGCGGCTTTAAAAACTTTAAGACCTACAGGGCGTGGTACATCTTCACCAAA GGCGGGTAGCGTGGGGGCTATTCGACAAGTCGAAGCGGCTTTGCCGCCTTCTCTTGCCTATAGTTCACTAATAAC GGCCGCGCCCGCCGCGCCGCGTGGACGGCCCTTACTAACGCCGTTTTTGCGACCAGGGCCATGAGCCGCACGG CCATGTTTCGCGACGCGATTCGCCGCACCTATCGGAGATGAGGTAGCACGTCTACGGCGCGCCGTAAGGCCACC CATGCGACCGCTAGCCATTTCGGCCACGGCGATTGCGGCGGGACGAGCGGCGCGTCTAAGACCGCGTTATGCTG CGCCTTGACGTAGTCGCGTAACGGCTG
SEQ ID NO: 306 (PURE104)
TCGCTGACCCGATGGTACGTTAAGCGGCGGCTTTAAAAACTTTAAGACCTACAGGGCGTGGTACATCTTCACCAAA GGCGGGTAGCGTGGGGGCTATTCGACAAGTCGAAGCGGCTTTGCCGCCTTCTCTTGCCTATAGTTCACTAATAAC GGCCGCGCCCGCCGCGCCGTGTGGACGGCCCTTACTAACGCCGTTTTTGCGACCAGGGCCATGAGCCGCACGG CCATGTTTCGCGACGCGAGTCGCCGCACCTATCGGAGATGAGGTAGCACGTCTACGGCGCGCCGTAAGGCCACC CATGCGACCGCTAGCCATTTCGGCCACGGCGATTGCGGCGGGACGAGCGGCGCGTCTAAGACCGCGTTGCGCTG CGCCTTGACGTAGTCGCGTAGCGACTG SEQ ID NO: 307 (PURE105)
TCGCTGACCCGATGGTACGTTAAGCGGCTGCTTTAAAAACTTTAAGACCTACAGGGCGTGGTACATCTTCACCAAA GGCGGGTAGCGTGGGGGCTATTTGACAAGTCGAAGCGGCTTTGCCGCCTTCTCTTGCCTATAGTTCACTAATAAC GGCCGCGCCCGCCGCGCCGCGTGGACGGCCCTTACTAACGCCGTTTTTGCGACCAGGGCCATGAGCCGCACGG CCATGTTTCGCGACGCGATTCGCCGCACCTATCGGAGATGAGGTAGCACGTCTACGGCGCGCCGTAAGGCCACC CATGCGACCGCTAGCCATTTCGGCCACGGCGATTGCGGCGGGACGAGCGGCGCGTCTAAGACCGCGTTGTGCTG CGCCTTGACGTAGTCGCGTAGCGGCTG
SEQ ID NO: 308 (PURE106)
TCGCTGACCCGATGGTACGTTAAGCGGCGGCTTTAAAAACTTTAAGACCTACAGGGCGTGGTACATCTTCACCAAA GGCGGGTAGCGTGGGGGCTATTTGACAAGTCGAAGCGGCTTTGCCGCCTTCTCTTGCCTATAGTTCACTAATAAC GGTCGCGCCCGCCGCGCCGCGTGGACGGCCCTTACTAACGCCGTTTTTGCGACCAGGGCCATGAGCCGCACGG CCATGTTTCGCGACGCGAGTCGCCGCACCTATCGGAGATGAGGTAGCACGTCTACGGCGCGCCGTAAGGCCACC CATGCGACCGCTAGCCATTTCGGCCACGGCGATTGCGGCGGGACGAGCGGCGCGTCTAAGACCGCGTTGCGCTG CGCCTTGACGTAGTCGCGTAGCGACTG
SEQ ID NO: 309 (PURE107)
TCGCTGACCCGATGGTACGTTAAGCGGCGGCTTTAAAAACTTTAAGACCTACAGGGCGTGGTACATCTTCACCAAA GGCGGGTAGCGTGGGGGTTATTCGACAAGTCGAAGCGGCTTTGCCGCCTTCTCTTGCCTATAGTTCACTAATAAC GGCCGCGCCCGCCGCGCCGTGTGGACGGCCCTTACTAACGCCGTTTTTGCGACCAGGGCCATGAGCCGCACGG CCATGTTTCGCGACGCGATTCGCCGCACCTATCGGAGATGAGGTAGCACGTCTACGGCGCGCCGTAAGGCCACC CATGCGACCGCTAGCCATTTCGGCCACGGCGATTGCGGCGGGACGAGCGGCGCGTCTAAGACCGCGTTGTGCTG CGCCTTGACGTAGTCGCGTAACGGCTG SEQ ID NO: 310 (PURE108)
TCGCTGACCCGATGGTACGTTAAGCGGCGGCTTTAAAAACTTTAAGACCTACAGGGCGTGGTACATCTTCACCAAA GGCGAGTAGCGTGGGGGCTATTTGACAAGTCGAAGCGGCTTTGCCGCCTTCTCTTGCCTATAGTTCACTAATAACG GCCGCGCCCGCCGCGCCGCGTGGACGGCCCTTACTAACGCCGTTTTTGCGACCAGGGCCATGAGCCGCACGGC CATGTTTCGCGACGCGAGTCGCCGCACCTATCGGAGATGAGGTAGCACGTCTACGGCGCACCGTAAGGCCACCC ATGCGACCGCTAGCCATTTCGGCCACGGCGATTGCGGCGGGACGAGCGGCGCGTCTAAGACCGCGTTGCGCTGC GCCTTGACGTAGTCGCGTAGCGACTG
SEQ ID NO: 31 1 (AROC10)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGTCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATTAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTTA ACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 312 (AROC100)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGTCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGTGAAGGATTTA ACGTAGTGGCGCTGCGTGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 313 (AROC101 )
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGATATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGTCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG CGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATTAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTTA ACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT SEQ ID NO: 314 (AROC102)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGTGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGATATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCAGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTTTTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTT AACGTAGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 315 (AROC103)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTT AACGTGGTGGCGCTGCGTGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 316 (AROC104)
GTTTTTCGTCCGGGACACGCGGATTATACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATTGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGTGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATTAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTTA ACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 317 (AROC105)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATTGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGATATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGTCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG CGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTTTTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTT AACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 318 (AROC106)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACCTGGCGGAAAAGTTCGG TATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCAGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTT AACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT SEQ ID NO: 319 (AROC107)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATTGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGTGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATTAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTTA ACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 320 (AROC108)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGCGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATTGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGATATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATTAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTTA ACGTGGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 321 (AROC10)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACAGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCT GACCTACGCCTGTAGCGGGTACGCGACTACTCGTAATTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGC ACCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA SEQ ID NO: 322 (AROC100)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACAGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCT GACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCACTTCCTAAATTGC ATCACCGCGACGCACCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA SEQ ID NO: 323 (AROC101 )
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTATAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAAT TAGGCAAGAAAACAGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCGCT GAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCTG ACCTACGCCTGTAGCGGGTACGCGACTACTCGTAATTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGCA CCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 324 (AROC102)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCACTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTATAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAAT TAGGCAAGAAAACGGGGCTACGTCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCACT GAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCAAAAACTGGCTG ACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGCA TCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 325 (AROC103)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACGGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCT GACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGC ACCACCGCGACGCACCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 326 (AROC104)
CAAAAAGCAGGCCCTGTGCGCCTAATATGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAACGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACGGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCACACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCTG ACCTACGCCTGTAGCGGGTACGCGACTACTCGTAATTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGCA CCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 327 (AROC105)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAACGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTATAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAAT TAGGCAAGAAAACAGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCGCT GAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCAAAAACTGGCTG ACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGCA CCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA SEQ ID NO: 328 (AROC106)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGGACCGCCTTTTCAAGCCA TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACGGGGCTACGTCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCT GACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGC ACCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 329 (AROC107)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAACGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACGGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCACACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCTG ACCTACGCCTGTAGCGGGTACGCGACTACTCGTAATTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGCA CCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 330 (AROC108)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCGCCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAACGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTATAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAAT TAGGCAAGAAAACGGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCACT GAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCTG ACCTACGCCTGTAGCGGGTACGCGACTACTCGTAATTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGCA CCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 331 (DNAN136)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAATCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGCAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGTCTGGCGGTATGCTCAATGCCGCTGGAGGCGTCTTTACCCAGCCATTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGTGGCGAAAACCCGCTGCGCGTGCAG SEQ ID NO: 332 (DNAN19)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAATCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCATGCAG
SEQ ID NO: 333 (DNAN210)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCGGGTGCTACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAATCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGCCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGC GTGATTGAACTGATGCGTATGCTCGACGGCGGTGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 334 (DNAN56)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCTAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAATCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 335 (DNAN58)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAATCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGCCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGC GTGATTGAACTGATGCGTATGCTCGACGGCGGTGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 336 (DNAN61 )
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAATCGACCCAGTTTTCGATGGCCCATCAAGA TGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCCA CCGTCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGCGT GATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 337 (DNAN62)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAATCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACCTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 338 (DNAN7)
ATGGAGATGGTCGCGCGCGTTACGCTTTCTCAGCCGCATGAGCCAGGCGCCACTACCGTGCCGGCGCGGAAATT CTTTGATATCTGCCGCGGCCTGCCGGAGGGCGCGGAGATTGCCGTTCAGTTGGAAGGCGATCGGATGCTGGTGC GTTCTGGCCGTAGCCGCTTCTCGCTGTCTACGCTGCCTGCCGCCGATTTCCCGAATCTTGACGACTGGCAAAGCG AAGTTGAATTTACGCTGCCGCAGGCCACGATGAAGCGCCTGATTGAATCGACCCAGTTTTCGATGGCCCATCAGG ATGTGCGCTACTACTTAAACGGTATGCTGTTTGAAACGGAAGGTAGCGAACTGCGCACTGTCGCGACCGACGGCC ACCGTCTGGCGGTGTGCTCAATGCCGCTGGAAGCGTCTTTACCCAGCCACTCGGTGATTGTGCCGCGTAAAGGCG TGATTGAACTGATGCGTATGCTCGACGGCGGCGAAAACCCGCTGCGCGTGCAG
SEQ ID NO: 339 (DNAN136)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTAGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCGTCGCTTGACGCGTGACAGCGCTGGCTGCCG GTGGCAGACCGCCATACGAGTTACGGCGACCTCCGCAGAAATGGGTCGGTAAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCACCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 340 (DNAN19)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTAGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCG GTGGCAGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGTACGTC SEQ ID NO: 341 (DNAN210)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGCCCACGATGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTAGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCG GTGGCGGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCGCCACTTTTGGGCGACGCGCACGTC
SEQ ID NO: 342 (DNAN56)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGATCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTAGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCG GTGGCAGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 343 (DNAN58)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTAGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCG GTGGCGGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCGCCACTTTTGGGCGACGCGCACGTC SEQ ID NO: 344 (DNAN61 )
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTAGCTGGGTCAAAAGCTACCGGGTAGTTC TACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCGGT GGCAGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCCGC ACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC SEQ ID NO: 345 (DNAN62)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTAGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGGATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCG GTGGCAGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 346 (DNAN7)
TACCTCTACCAGCGCGCGCAATGCGAAAGAGTCGGCGTACTCGGTCCGCGGTGATGGCACGGCCGCGCCTTTAA GAAACTATAGACGGCGCCGGACGGCCTCCCGCGCCTCTAACGGCAAGTCAACCTTCCGCTAGCCTACGACCACG CAAGACCGGCATCGGCGAAGAGCGACAGATGCGACGGACGGCGGCTAAAGGGCTTAGAACTGCTGACCGTTTCG CTTCAACTTAAATGCGACGGCGTCCGGTGCTACTTCGCGGACTAACTTAGCTGGGTCAAAAGCTACCGGGTAGTC CTACACGCGATGATGAATTTGCCATACGACAAACTTTGCCTTCCATCGCTTGACGCGTGACAGCGCTGGCTGCCG GTGGCAGACCGCCACACGAGTTACGGCGACCTTCGCAGAAATGGGTCGGTGAGCCACTAACACGGCGCATTTCC GCACTAACTTGACTACGCATACGAGCTGCCGCCGCTTTTGGGCGACGCGCACGTC
SEQ ID NO: 347 (HEMD10)
GCGACGCTGACGGAAAACGATCTGGTTTTTGCCCTTTCACAGCACGCCGTCGCCTTTGCTCACGCCCAGCTCCAG CGGGATGGCCGAAACTGGCCTGCGTCGCCGCGCTATTTCGCGATTGGCCGCACCACGGCGCTCGCCCTTCATAC CGTTAGCGGGTTCGATATTCGTTATCCATTGGATCGGGAAATCAGCGAAGCCTTGCTACAATTACCTGAATTACAAA ATATTGCGGGCAAACGCGCGCTGATTTTGCGTGGCAATGGCGGCCGCGAACTGCTGGGCGAAACCCTGACAGCG CGCGGAGCCGAAGTCAGTTTTTGTGAATGTTATCAACGATGTGCGAAACATTACGATGGCGCGGAAGAAGCGATG CGCTGGCATACTCGCGGCGTAACAACGCTTGTTGTTACCAGCGGCGAGATGTTGCAA
SEQ ID NO: 348 (HEMD100)
GCGACGTTGACGGAAAACGATCTGGTTTTTGCCCTTTCACAGCACGCCGTCGCCTTTGCCCACGCCCAACTCCAG CGAGATGGTCGAAACTGGCCTGCGTCGCCGCGCTATTTCGCGATTGGTCGCACTACGGCGCTCGCCCTTCATACC GTTAGCGGGTTCGATATTCGTTATCCATTGGATCGGGAAATCAGCGAAGTCTTGCTACAATTACCTGAATTACAAAA TATTGCGGGCAAACGCGCGCTGATTTTGCGTGGCAATGGCGGTCGCGAACTGCTGGGCGAAACCCTGACAGCTC GCGGAGCCGAAGTCAGTTTTTGTGAATGTTATCAACGAATTGCGAAACATTACGATGGCGCAGAAGAGGCGATGC GCTGGCATACTCGCGGCGTAACGACGCTTGTTGTCACCAGCGGCGAGATGTTGCAA
SEQ ID NO: 349 (HEMD102)
GCGACGCTCGCGGAAAACGATCTGGTTTTTGCCCTTTCACAGCACGCTGTCGCCTTTGCTCACGCCCAGCTCCAG CGGGATGGCCGAAACTGGCCTGCGTCGCCGCGCTATTTCGCGATTGGCCGCACCACGGCGCTCGCCCTTCATAC CGTTAGCGGGTTCGATATTCGTTATCCATTGGATCGGGAAATCAGCGAAGCCTTGCTACAATTACCTGAATTACAAA ATATTGCGGGCAAACGCGCGCTGATTTTGCGTGGCAATGGCGGCCGCGAACTGCTGGGCGAAACCCTGACAGCT CGCGGAGCCGAAGTCAGTTTTTGTGAATGTTATCAACGACGTGAGAAACATTACGATGGCGCGGAAGAAGCGATG CGCTGGCATACTCGCGGCGTAACAACGCTTGTTGTTACCAGCGGCGAGATGTTGCAA
SEQ ID NO: 350 (HEMD103)
GCGACACTGACGGAAAACGATCTGGTTTTTGCCCTTTCTCAGCACGCCGTCGCCTTCGCTCATGCCCGCCTTCAG CAGGATGGTCGGCACTGGCCTGCGTTACCGCGCTATTTTGCGATTGGCCGCACCACAGCGCTCGCTCTTCATACC GTTAGCGGGTTCGATATTCGTTATCCGTTGGATCGGGAAATCAGCGAAGTCTTGCTACAATTACCTGAATTACAAAA TATTGCGGGCAAACGCGCGCTGATTTTGCGTGGCAACGGTGGGCGCGAACTGTTGGGTGAAACCCTAACCGCTC GCGGAGCCGAAGTTAGTTTTTGTGAATGTTATCAACGAAGTGCGAAACATTACGATGGCGCAGAAGAAGCGATGC GCTGGCATGCTCGCGGCGTAACAACACTTGTTGTTACCAGCGGCGGAATGTTGCAG SEQ ID NO: 351 (HEMD104)
GCGACACTGACGGAAAACGATCTGGTTTTTGCCCTTTCTCAGCACGCCGTCGCCTTCGCTCATGCCCGCCTCCAG CAGGATGGTCGGCACTGGCCTGCGTTACCGCGCTATTTTGCGATTGGCCGCACCACAGCGCTCGCTCTTCATACC GTTAGCGGGTTCGATATTCGTTATCCGTTGGATCGGGAAATCAGCGAAGTCTTGCTACAATTACCTGAATTACAAAA TATTGCGGGCAAACGCGCGCTGATTTTGCGTGGCAACGGTGGGCGCGAACTGTTGGGTGAAACCCTAACCGCTC GCGGAGCCGAAGTCAGTTTTTGTGAATGTTATCAACGAAGTGAGAAACATTACGATGGCGCAGAAGAAGCGATGC GCTGGCATGCTCGCGGCGTAACAACACTTGTTGTTACCAGCGGCGAAATGTTGCAG SEQ ID NO: 352 (HEMD105)
GCGACACTGACGGAAAACGATCTGGTTTTTGCCCTTTCTCAGCACGCCGTCGCCTTTGCTCATGCCCGCCTCCAG CAGGATGGTCGGCACTGGCCTGCGTTACCGCGCTATTTTGCGATTGGCCGCACCACAGCGCTCGCTCTTCATACC GTTAGCGGGTTTGATATTCGTTATCCGTTGGATCGGGAAATCAGCGAAGTCTTGCTACAATTACCTGAATTACAAAA TATTGCGGGCAAACGCGCGCTGATTTTGCGTGGCAACGGTGGGCGCGAACTGTTGGGTGAAACCCTAACCGCTC GCGGAGCCGAAGTCAGTTTTTGTGAATGTTATCAACGAAGTGCGAAACATTACGATGGCGCAGAAGAAGCGATGC GCTGGCATGCTCGCGGCGTAACAACACTTGTTGTTACCAGCGGCGAAATGTTGCAG SEQ ID NO: 353 (HEMD106)
GCGACACTGACGGAAAACGATCTGGTTTTTGCCCTTTCTCAGCACGCCGTCGCCTTCGCTCATGCCCGCCTTCAG CAGGATGGTCGGCACTGGCCTGCGTTACCACGCTATTTTGCGATTGGCCGCACCACAGCGCTCGCTCTTCATACC GTTAGCGGGTTCGATATTCGTTATCCGTTGGATCGGGAAATCAGCGAAGTCTTGCTACAATTACCTGAATTACAAAA TATTGCGGGCAAACGCGCGCTGATTTTGCGTGGCAACGGTGGGCGCGAACTGTTGGGTGAAACCCTAACCGCTC GCGGAGCCGAAGTCAGTTTTTGTGAATGTTATCAACGAAGTGCGAAACATTACGATGGCGCAGAAGAAGCGATGC GCTGGCATGCTCGCGGCGTAACAACACTTGTTGTTACCAGCGGCGAAATGTTGCAG
SEQ ID NO: 354 (HEMD107)
GCGACACTGACGGAAAACGATCTGGTTTTTGCCCTTTCTCAGCATGCCGTCGCCTTCGCTCATGCCCGCCTCCAG CAGGATGGTCGGCACTGGCCTGCGTTACCGCGCTATTTTGCGATTGGCCGCACCACAGCGCTCGCTCTTCATACC GTTAGCGGGTTTGATATTCGTTATCCGTTGGATCGGGAAATCAGCGAAGTCTTGCTACAATTACCTGAATTACAAAA TATTGCGGGCAAACGCGCGCTGATTTTGCGTGGCAACGGTGGGCGCGAACTGTTGGGTGAAACCCTAACCGCTC GCGGAGCCGAAGTTAGTTTTTGTGAATGTTATCAACGAAGTGCGAAACATTACGATGGCGCAGAAGAAGCGATGC GCTGGCATGCTCGCGGCGTAACAACACTTGTTGTTACCAGCGGCGGAATGTTGCAG
SEQ ID NO: 355 (HEMD108)
GCGACGCTGACGGAAAACGATCTGGTTTTTGCCCTTTCACAGCACGCCGTCGCCTTTGCTCACGCCCAGCTCCAG CGGGATGGCCGAAACTGGCCTGCGTCGCCGCGCTATTTCGCGATTGGCCGCACCACGGCGCTCGCCCTTCATAC CGTTAGCGGGTTCGATATTCGTTATCCATTGGATCGGGAAATTAGCGAAGCCTTGCTACAATTACCTGAATTACAAA ATATTGCGGGCAAACGCGCGCTGATTTTGCGTGGCAATGGCGGCCGCGAACTGCTGGGCGAAACCCTGACAGCT CGCGGAGCCGAAGTCAGTTTTTGTGAATGTTATCAACGATGTGCGAAACATTACGATGGCGCGGAAGAAGCGATG CGCTGGCATACTCGCGGCGTAACAACGCTTGTTGTTACCAGCGGCGAGATGTTGCAA
SEQ ID NO: 356 (HEMD109)
GCGACGCTCGCGGAAAACGATCTGGTTTTTGCCCTTTCACAGCACGCCGTCGCCTTTGCTCACGCCCAGTTCCAG CGGGATGGCCGAAACTGGCCTGCGTCGCCGCGCTATTTCGCGATTGGCCGCACCACGGCGCTCGCCCTTCATAC CGTTAGCGGGTTCGATATTCGTTATCCATTGGATCGGGAAATCAGCGAAGCCTTGCTACAATTACCTGAATTACAAA ATATTGCGGGCAAACGCGCGCTGATTTTGCGTGGCAATGGCGGCCGCGAACTGCTGGGCGAAACCCTGACAGCT CGCGGAGCCGAAGTCAGTTTTTGTGAATGTTATCAACGATGTGCGAAACATTACGATGGCGCGGAAGAAGCGATG CGCTGGCATACTCGCGGCGTAACAACACTTGTTGTTACCAGCGGCGAGATGTTGCAA
SEQ ID NO: 357 (HEMD10)
CGCTGCGACTGCCTTTTGCTAGACCAAAAACGGGAAAGTGTCGTGCGGCAGCGGAAACGAGTGCGGGTCGAGGT CGCCCTACCGGCTTTGACCGGACGCAGCGGCGCGATAAAGCGCTAACCGGCGTGGTGCCGCGAGCGGGAAGTA TGGCAATCGCCCAAGCTATAAGCAATAGGTAACCTAGCCCTTTAGTCGCTTCGGAACGATGTTAATGGACTTAATGT TTTATAACGCCCGTTTGCGCGCGACTAAAACGCACCGTTACCGCCGGCGCTTGACGACCCGCTTTGGGACTGTCG CGCGCCTCGGCTTCAGTCAAAAACACTTACAATAGTTGCTACACGCTTTGTAATGCTACCGCGCCTTCTTCGCTAC GCGACCGTATGAGCGCCGCATTGTTGCGAACAACAATGGTCGCCGCTCTACAACGTT SEQ ID NO: 358 (HEMD100)
CGCTGCAACTGCCTTTTGCTAGACCAAAAACGGGAAAGTGTCGTGCGGCAGCGGAAACGGGTGCGGGTTGAGGT CGCTCTACCAGCTTTGACCGGACGCAGCGGCGCGATAAAGCGCTAACCAGCGTGATGCCGCGAGCGGGAAGTAT GGCAATCGCCCAAGCTATAAGCAATAGGTAACCTAGCCCTTTAGTCGCTTCAGAACGATGTTAATGGACTTAATGTT TTATAACGCCCGTTTGCGCGCGACTAAAACGCACCGTTACCGCCAGCGCTTGACGACCCGCTTTGGGACTGTCGA GCGCCTCGGCTTCAGTCAAAAACACTTACAATAGTTGCTTAACGCTTTGTAATGCTACCGCGTCTTCTCCGCTACGC GACCGTATGAGCGCCGCATTGCTGCGAACAACAGTGGTCGCCGCTCTACAACGTT
SEQ ID NO: 359 (HEMD102)
CGCTGCGAGCGCCTTTTGCTAGACCAAAAACGGGAAAGTGTCGTGCGACAGCGGAAACGAGTGCGGGTCGAGGT CGCCCTACCGGCTTTGACCGGACGCAGCGGCGCGATAAAGCGCTAACCGGCGTGGTGCCGCGAGCGGGAAGTA TGGCAATCGCCCAAGCTATAAGCAATAGGTAACCTAGCCCTTTAGTCGCTTCGGAACGATGTTAATGGACTTAATGT TTTATAACGCCCGTTTGCGCGCGACTAAAACGCACCGTTACCGCCGGCGCTTGACGACCCGCTTTGGGACTGTCG AGCGCCTCGGCTTCAGTCAAAAACACTTACAATAGTTGCTGCACTCTTTGTAATGCTACCGCGCCTTCTTCGCTACG CGACCGTATGAGCGCCGCATTGTTGCGAACAACAATGGTCGCCGCTCTACAACGTT
SEQ ID NO: 360 (HEMD103) CGCTGTGACTGCCTTTTGCTAGACCAAAAACGGGAAAGAGTCGTGCGGCAGCGGAAGCGAGTACGGGCGGAAGT CGTCCTACCAGCCGTGACCGGACGCAATGGCGCGATAAAACGCTAACCGGCGTGGTGTCGCGAGCGAGAAGTAT GGCAATCGCCCAAGCTATAAGCAATAGGCAACCTAGCCCTTTAGTCGCTTCAGAACGATGTTAATGGACTTAATGTT TTATAACGCCCGTTTGCGCGCGACTAAAACGCACCGTTGCCACCCGCGCTTGACAACCCACTTTGGGATTGGCGA GCGCCTCGGCTTCAATCAAAAACACTTACAATAGTTGCTTCACGCTTTGTAATGCTACCGCGTCTTCTTCGCTACGC GACCGTACGAGCGCCGCATTGTTGTGAACAACAATGGTCGCCGCCTTACAACGTC
SEQ ID NO: 361 (HEMD104)
CGCTGTGACTGCCTTTTGCTAGACCAAAAACGGGAAAGAGTCGTGCGGCAGCGGAAGCGAGTACGGGCGGAGGT CGTCCTACCAGCCGTGACCGGACGCAATGGCGCGATAAAACGCTAACCGGCGTGGTGTCGCGAGCGAGAAGTAT GGCAATCGCCCAAGCTATAAGCAATAGGCAACCTAGCCCTTTAGTCGCTTCAGAACGATGTTAATGGACTTAATGTT TTATAACGCCCGTTTGCGCGCGACTAAAACGCACCGTTGCCACCCGCGCTTGACAACCCACTTTGGGATTGGCGA GCGCCTCGGCTTCAGTCAAAAACACTTACAATAGTTGCTTCACTCTTTGTAATGCTACCGCGTCTTCTTCGCTACGC GACCGTACGAGCGCCGCATTGTTGTGAACAACAATGGTCGCCGCTTTACAACGTC
SEQ ID NO: 362 (HEMD105)
CGCTGTGACTGCCTTTTGCTAGACCAAAAACGGGAAAGAGTCGTGCGGCAGCGGAAACGAGTACGGGCGGAGGT CGTCCTACCAGCCGTGACCGGACGCAATGGCGCGATAAAACGCTAACCGGCGTGGTGTCGCGAGCGAGAAGTAT GGCAATCGCCCAAACTATAAGCAATAGGCAACCTAGCCCTTTAGTCGCTTCAGAACGATGTTAATGGACTTAATGTT TTATAACGCCCGTTTGCGCGCGACTAAAACGCACCGTTGCCACCCGCGCTTGACAACCCACTTTGGGATTGGCGA GCGCCTCGGCTTCAGTCAAAAACACTTACAATAGTTGCTTCACGCTTTGTAATGCTACCGCGTCTTCTTCGCTACGC GACCGTACGAGCGCCGCATTGTTGTGAACAACAATGGTCGCCGCTTTACAACGTC
SEQ ID NO: 363 (HEMD106)
CGCTGTGACTGCCTTTTGCTAGACCAAAAACGGGAAAGAGTCGTGCGGCAGCGGAAGCGAGTACGGGCGGAAGT CGTCCTACCAGCCGTGACCGGACGCAATGGTGCGATAAAACGCTAACCGGCGTGGTGTCGCGAGCGAGAAGTAT GGCAATCGCCCAAGCTATAAGCAATAGGCAACCTAGCCCTTTAGTCGCTTCAGAACGATGTTAATGGACTTAATGTT TTATAACGCCCGTTTGCGCGCGACTAAAACGCACCGTTGCCACCCGCGCTTGACAACCCACTTTGGGATTGGCGA GCGCCTCGGCTTCAGTCAAAAACACTTACAATAGTTGCTTCACGCTTTGTAATGCTACCGCGTCTTCTTCGCTACGC GACCGTACGAGCGCCGCATTGTTGTGAACAACAATGGTCGCCGCTTTACAACGTC
SEQ ID NO: 364 (HEMD107)
CGCTGTGACTGCCTTTTGCTAGACCAAAAACGGGAAAGAGTCGTACGGCAGCGGAAGCGAGTACGGGCGGAGGT CGTCCTACCAGCCGTGACCGGACGCAATGGCGCGATAAAACGCTAACCGGCGTGGTGTCGCGAGCGAGAAGTAT GGCAATCGCCCAAACTATAAGCAATAGGCAACCTAGCCCTTTAGTCGCTTCAGAACGATGTTAATGGACTTAATGTT TTATAACGCCCGTTTGCGCGCGACTAAAACGCACCGTTGCCACCCGCGCTTGACAACCCACTTTGGGATTGGCGA GCGCCTCGGCTTCAATCAAAAACACTTACAATAGTTGCTTCACGCTTTGTAATGCTACCGCGTCTTCTTCGCTACGC GACCGTACGAGCGCCGCATTGTTGTGAACAACAATGGTCGCCGCCTTACAACGTC SEQ ID NO: 365 (HEMD108)
CGCTGCGACTGCCTTTTGCTAGACCAAAAACGGGAAAGTGTCGTGCGGCAGCGGAAACGAGTGCGGGTCGAGGT CGCCCTACCGGCTTTGACCGGACGCAGCGGCGCGATAAAGCGCTAACCGGCGTGGTGCCGCGAGCGGGAAGTA TGGCAATCGCCCAAGCTATAAGCAATAGGTAACCTAGCCCTTTAATCGCTTCGGAACGATGTTAATGGACTTAATGT TTTATAACGCCCGTTTGCGCGCGACTAAAACGCACCGTTACCGCCGGCGCTTGACGACCCGCTTTGGGACTGTCG AGCGCCTCGGCTTCAGTCAAAAACACTTACAATAGTTGCTACACGCTTTGTAATGCTACCGCGCCTTCTTCGCTAC GCGACCGTATGAGCGCCGCATTGTTGCGAACAACAATGGTCGCCGCTCTACAACGTT
SEQ ID NO: 366 (HEMD109)
CGCTGCGAGCGCCTTTTGCTAGACCAAAAACGGGAAAGTGTCGTGCGGCAGCGGAAACGAGTGCGGGTCAAGGT CGCCCTACCGGCTTTGACCGGACGCAGCGGCGCGATAAAGCGCTAACCGGCGTGGTGCCGCGAGCGGGAAGTA TGGCAATCGCCCAAGCTATAAGCAATAGGTAACCTAGCCCTTTAGTCGCTTCGGAACGATGTTAATGGACTTAATGT TTTATAACGCCCGTTTGCGCGCGACTAAAACGCACCGTTACCGCCGGCGCTTGACGACCCGCTTTGGGACTGTCG AGCGCCTCGGCTTCAGTCAAAAACACTTACAATAGTTGCTACACGCTTTGTAATGCTACCGCGCCTTCTTCGCTAC GCGACCGTATGAGCGCCGCATTGTTGTGAACAACAATGGTCGCCGCTCTACAACGTT
SEQ ID NO: 367 (THRA103)
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGACGCCAGGCTGCTGAAATCGATGTC CTACCAGGAAGCGATGGAACTCTCTTACTTCGGCGCCAAAGTTCTTCACCCTCGCACCATTACGCCCATCGCCCAG TTCCAGATCCCCTGTCTGATTAAAAATACCGGTAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGAC GATGATAATCTGCCGGTCAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCTGGAATGA AAGGGATGATTGGGATGGCGGCACGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACCC AGTCCTCCTCTGAATACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC SEQ ID NO: 368 (THRA104) GTGCTGGGGCGTAATGGCTCTGACTACTCTGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGACGCCAGGCTGCTGAAATCGATGTC CTACCAGGAAGCGATGGAACTCTCTTACTTCGGCGCCAAAGTCCTTCACCCTCGCACCATTACGCCCATCGCCCA GTTCCAGATCCCCTGTCTGATTAAAAATACCGGTAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGA CGATGATAACCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATG AAAGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACC CAGTCCTCCTCTGAGTACAGCATCAGTTTCTGTGTGCCGCAGAGTGACTGC
SEQ ID NO: 369 (THRA105)
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGTCAGGTGCCGGACGCCAGGCTGCTGAAATCGATGTC CTACCAGGAAGCGATGGAACTCTCTTACTTCGGCGCCAAAGTCCTTCACCCTCGCACCATTACGCCCATCGCCCA GTTCCAGATCCCCTGTCTGATTAAAAATACCGGTAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGA CGATGATAACCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATG AAAGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACC CAGTCCTCCTCTGAGTACAGCATCAGTTTCTGTGTGCCGCAGAGTGACTGC
SEQ ID NO: 370 (THRA109)
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGTCAGGTGCCGGACGCCAGGCTGCTGAAATCGATGTC CTACCAGGAAGCGATGGAACTCTCTTACTTCGGCGCCAAAGTCCTTCACCCTCGCACCATTACGCCCATCGCCCA GTTCCAGATCCCCTGTCTGATTAAAAATACCGGCAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGA CGATGATAACCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATG AAAGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACC CAGTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC
SEQ ID NO: 371 (THRA1 1 )
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGACGCCAGGCTGCTGAAATCGATGTC CTACCAGGAAGCGATGGAACTCTCTTACTTCGGCGCCAAAGTTCTTCACCCTCGCACCATTACGCCCATCGCCCAG TTCCAGATCCCCTGTCTGATTAAAAATACCGGTAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGAC GATGATAACCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATGA AAGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACC CAGTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC
SEQ ID NO: 372 (THRA1 10)
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGACGCCAGGCTGCTGAAATCGATGTC CTACCAGGAAGCGATGGAACTCTCTTACTTCGGCGCCAAAGTTCTTCACCCTCGCACCATTACGCCTATCGCCCAG TTCCAGATCCCCTGTCTGATTAAAAATACCGGCAATCCGCAGGCACCAGGAACGCTGATCGGCGCGTCCAGCGAC GATGATAATCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATGA AAGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACC CAGTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC SEQ ID NO: 373 (THRA1 1 1 )
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCACGTCAGGTGCCGGACGCCAGGCTGCTGAAATCGATGTCC TACCAGGAAGCGATGGAACTCTCTTACTTCGGCGCCAAAGTCCTTCACCCTCGCACCATTACGCCTATCGCCCAGT TCCAGATCCCCTGTCTGATTAAAAATACCGGCAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGACG ATGATAATCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATGAA AGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACCC AGTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGATTGC
SEQ ID NO: 374 (THRA1 12)
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCACGTCAGGTGCCGGACGCCAGGCTGCTGAAATCGATGTCC TACCAGGAAGCGATGGAACTCTCTTACTTCGGCGCCAAAGTCCTTCACCCTCGCACCATTACGCCTATCGCCCAGT TCCAGATCCCCTGTCTGATTAAAAATACCGGTAATCCGCAGGCGCCAGGAACGCTGATCGGCGCATCCAGCGACG ATGATAATCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATGAA AGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACCC AGTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC
SEQ ID NO: 375 (THRA1 13)
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGACGCCAGGCTGCTGAAATCGATGTC CTACCAGGAAGCGATGGAACTCTCTTACTTCGGCGCCAAAGTTCTTCACCCTCGCACCATTACGCCCATCGCCCAG TTCCAGATCCCCTGTCTGATTAAAAATACCGGCAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGAC GATGATAATCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATGA AAGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACC CAGTCCTCCTCTGAGGACAGCATCAGTTTCTGTGTGCCGCAGAGTGACTGC
SEQ ID NO: 376 (THRA1 18)
GTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCGCCTGTTTACGCGCTGACTGCTGTGAAATC TGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGCCAGGTGCCGGACGCCAGGCTGCTGAAATCGATGTC CTACCAGGAAGCGATGGAGCTCTCTTACTTCGGTGCCAAAGTCCTTCATCCTCGCACCATTACGCCTATCGCCCAG TTCCAGATCCCCTGTCTGATTAAAAATACCGGCAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGAC GATGATAATCTGCCGGTTAAAGGGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATGA AAGGGATGATTGGGATGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCAGTGGTGCTCATTACCC AGTCCTCCTCTGAGTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGC SEQ ID NO: 377 (THRA103)
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTGCGGTCCGACGACTTTAGCTACA GGATGGTCCTTCGCTACCTTGAGAGAATGAAGCCGCGGTTTCAAGAAGTGGGAGCGTGGTAATGCGGGTAGCGG GTCAAGGTCTAGGGGACAGACTAATTTTTATGGCCATTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCG CTGCTACTATTAGACGGCCAGTTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGACCTT ACTTTCCCTACTAACCCTACCGCCGTGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAAT GGGTCAGGAGGAGACTTATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG
SEQ ID NO: 378 (THRA104)
CACGACCCCGCATTACCGAGACTGATGAGACGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTGCGGTCCGACGACTTTAGCTACA GGATGGTCCTTCGCTACCTTGAGAGAATGAAGCCGCGGTTTCAGGAAGTGGGAGCGTGGTAATGCGGGTAGCGG GTCAAGGTCTAGGGGACAGACTAATTTTTATGGCCATTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCG CTGCTACTATTGGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTT ACTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAAT GGGTCAGGAGGAGACTCATGTCGTAGTCAAAGACACACGGCGTCTCACTGACG
SEQ ID NO: 379 (THRA105)
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCAGTCCACGGCCTGCGGTCCGACGACTTTAGCTACAG GATGGTCCTTCGCTACCTTGAGAGAATGAAGCCGCGGTTTCAGGAAGTGGGAGCGTGGTAATGCGGGTAGCGGG TCAAGGTCTAGGGGACAGACTAATTTTTATGGCCATTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCGC TGCTACTATTGGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTTA CTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAATG GGTCAGGAGGAGACTCATGTCGTAGTCAAAGACACACGGCGTCTCACTGACG
SEQ ID NO: 380 (THRA109)
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCAGTCCACGGCCTGCGGTCCGACGACTTTAGCTACAG GATGGTCCTTCGCTACCTTGAGAGAATGAAGCCGCGGTTTCAGGAAGTGGGAGCGTGGTAATGCGGGTAGCGGG TCAAGGTCTAGGGGACAGACTAATTTTTATGGCCGTTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCGC TGCTACTATTGGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTTA CTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAATG GGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG
SEQ ID NO: 381 (THRA1 1 )
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTGCGGTCCGACGACTTTAGCTACA GGATGGTCCTTCGCTACCTTGAGAGAATGAAGCCGCGGTTTCAAGAAGTGGGAGCGTGGTAATGCGGGTAGCGG GTCAAGGTCTAGGGGACAGACTAATTTTTATGGCCATTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCG CTGCTACTATTGGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTT ACTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAAT GGGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG SEQ ID NO: 382 (THRA1 10)
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTGCGGTCCGACGACTTTAGCTACA GGATGGTCCTTCGCTACCTTGAGAGAATGAAGCCGCGGTTTCAAGAAGTGGGAGCGTGGTAATGCGGATAGCGG GTCAAGGTCTAGGGGACAGACTAATTTTTATGGCCGTTAGGCGTCCGTGGTCCTTGCGACTAGCCGCGCAGGTCG CTGCTACTATTAGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTT ACTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAAT GGGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG
SEQ ID NO: 383 (THRA1 1 1 )
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGTGCAGTCCACGGCCTGCGGTCCGACGACTTTAGCTACAG GATGGTCCTTCGCTACCTTGAGAGAATGAAGCCGCGGTTTCAGGAAGTGGGAGCGTGGTAATGCGGATAGCGGGT CAAGGTCTAGGGGACAGACTAATTTTTATGGCCGTTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCGCT GCTACTATTAGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTTAC TTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAATG GGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTAACG
SEQ ID NO: 384 (THRA1 12)
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGTGCAGTCCACGGCCTGCGGTCCGACGACTTTAGCTACAG GATGGTCCTTCGCTACCTTGAGAGAATGAAGCCGCGGTTTCAGGAAGTGGGAGCGTGGTAATGCGGATAGCGGGT CAAGGTCTAGGGGACAGACTAATTTTTATGGCCATTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGTAGGTCGCT GCTACTATTAGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTTAC TTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAATG GGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG
SEQ ID NO: 385 (THRA1 13)
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTGCGGTCCGACGACTTTAGCTACA GGATGGTCCTTCGCTACCTTGAGAGAATGAAGCCGCGGTTTCAAGAAGTGGGAGCGTGGTAATGCGGGTAGCGG GTCAAGGTCTAGGGGACAGACTAATTTTTATGGCCGTTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCG CTGCTACTATTAGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTT ACTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGCCACCACGAGTAAT GGGTCAGGAGGAGACTCCTGTCGTAGTCAAAGACACACGGCGTCTCACTGACG
SEQ ID NO: 386 (THRA1 18)
CACGACCCGGCATTACCAAGGCTGATAAGGCGGCGGCACGACCGGCGGACAAATGCGCGACTGACGACACTTTA GACCTGACTGCAGCTACCGCACATATGGACACTGGGCGCGGTCCACGGCCTGCGGTCCGACGACTTTAGCTACA GGATGGTCCTTCGCTACCTCGAGAGAATGAAGCCACGGTTTCAGGAAGTAGGAGCGTGGTAATGCGGATAGCGG GTCAAGGTCTAGGGGACAGACTAATTTTTATGGCCGTTAGGCGTCCGCGGTCCTTGCGACTAGCCGCGCAGGTCG CTGCTACTATTAGACGGCCAATTTCCCTAGAGATTGGAATTGTTGTACCGCTACAAATCGCAGAGGCCGGGCCCTT ACTTTCCCTACTAACCCTACCGCCGCGCACAAAAGCGGCGGTACAGAGCGCGGCCCTAGAGTCACCACGAGTAAT GGGTCAGGAGGAGACTCATGTCGTAGTCGAAGACACACGGCGTCTCACTGACG SEQ ID NO: 387 (AROC159)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGTGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATTGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGTCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTT AACGTAGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT
SEQ ID NO: 388 (AROC221 )
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGTGGCGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACCTGGCGGAAAAGTTTGG CATCGAAATCCGCGGCTGCCTGAGCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGGTGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCAGTATTTGACC GACTGGACGCGGATATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGTGTGGAGATCGGCGAAGGATTTA ACGTGGTGGCGCTGCGTGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGC
SEQ ID NO: 389 (AROC34)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGTGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATTGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTT AACGTAGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT SEQ ID NO: 390 (AROC36)
GTTTTTCGTCCGGGACACGCGGATTACACCTATGAGCAGAAATACGGCCTGCGCGATTACCGTGGTGGTGGACGT TCTTCCGCGCGTGAAACCGCGATGCGCGTAGCGGCAGGGGCGATCGCCAAGAAATACCTGGCGGAAAAGTTCGG CATCGAAATCCGCGGCTGCCTGACCCAGATGGGCGACATTCCGCTGGAGATTAAAGACTGGCGTCAGGTTGAGCT TAATCCGTTCTTTTGCCCCGATGCGGACAAACTTGACGCGCTGGACGAACTGATGCGCGCGCTGAAAAAAGAGGG TGACTCCATCGGCGCGAAAGTGACGGTGATGGCGAGCGGCGTGCCGGCAGGGCTTGGCGAACCGGTATTTGACC GACTGGATGCGGACATCGCCCATGCGCTGATGAGCATCAATGCGGTGAAAGGCGTGGAGATCGGCGAAGGATTT AACGTAGTGGCGCTGCGCGGCAGCCAGAATCGCGATGAAATCACGGCGCAGGGT SEQ ID NO: 391 (AROC159)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCACCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAACGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACAGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCT GACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGC ATCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 392 (AROC221 )
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCACCGCCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGGACCGCCTTTTCAAACCG TAGCTTTAGGCGCCGACGGACTCGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAAT TAGGCAAGAAAACGGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCACT GAGGTAGCCGCGCTTTCACTGCCACCACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGTCATAAACTGGCTG ACCTGCGCCTATAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCACACCTCTAGCCGCTTCCTAAATTGCA CCACCGCGACGCACCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCG
SEQ ID NO: 393 (AROC34)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCACCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAACGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACGGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCT GACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGC ATCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 394 (AROC36)
CAAAAAGCAGGCCCTGTGCGCCTAATGTGGATACTCGTCTTTATGCCGGACGCGCTAATGGCACCACCACCTGCA AGAAGGCGCGCACTTTGGCGCTACGCGCATCGCCGTCCCCGCTAGCGGTTCTTTATGGACCGCCTTTTCAAGCCG TAGCTTTAGGCGCCGACGGACTGGGTCTACCCGCTGTAAGGCGACCTCTAATTTCTGACCGCAGTCCAACTCGAA TTAGGCAAGAAAACGGGGCTACGCCTGTTTGAACTGCGCGACCTGCTTGACTACGCGCGCGACTTTTTTCTCCCAC TGAGGTAGCCGCGCTTTCACTGCCACTACCGCTCGCCGCACGGCCGTCCCGAACCGCTTGGCCATAAACTGGCT GACCTACGCCTGTAGCGGGTACGCGACTACTCGTAGTTACGCCACTTTCCGCACCTCTAGCCGCTTCCTAAATTGC ATCACCGCGACGCGCCGTCGGTCTTAGCGCTACTTTAGTGCCGCGTCCCA
SEQ ID NO: 395 GCGCGAAAGTGACGGTGA
SEQ ID NO: 396 CGCGCTTTCACTGCCACT
SEQ ID NO: 397 GAGCCRGGTGCTACTACT
SEQ ID NO: 398 CTCGGRCCACGATGATGA
SEQ ID NO: 399 TAACTCGTGTGGCGGTCC
SEQ ID NO: 400 ATTGAGCACACCGCCAGG
SEQ ID NO: 401 GAGCCRGGTGCTACTACC
SEQ ID NO: 402 CTCGGRCCACGATGATGG
SEQ ID NO: 403 TCCCAGGCTGAGCACGGC
SEQ ID NO: 404 AGGGTCCGACTCGTGCCG
SEQ ID NO: 405 GATGAAATCCTCTATGCGGCA SEQ ID NO: 406 CTACTTTAGGAGATACGCCGT SEQ ID NO: 407 GCCTTTGTAACCGAAGCCAAG
SEQ ID NO: 408 CGGAAACATTGGCTTCGGTTC
SEQ ID NO: 409 TCGACGAATTTGCCGGTAAA SEQ ID NO: 410 AGCTGCTTAAACGGCCATTT
SEQ ID NO: 41 1 GACGTGAAGTATCACATGGGT
SEQ ID NO: 412 CTGCACTTCATAGTGTACCCA SEQ ID NO: 413 TATGCTGTTTGAAACGGAAGGC
SEQ ID NO: 414 ATACGACAAACTTTGCCTTCCG SEQ ID NO: 415 GTTCGACGAATTTGCCGGT
SEQ ID NO: 416 CAAGCTGCTTAAACGGCCA
SEQ ID NO: 417 GCGATTAAAGATGTTTTTCGC SEQ ID NO: 418 CGCTAATTTCTACAAAAAGCG
SEQ ID NO: 419 TCCATTCCCATTACCCAAGA SEQ ID NO: 420 AGGTAAGGGTAATGGGTTCT
SEQ ID NO: 421 CCTGATGCTGACATTGCCT
SEQ ID NO: 422 GGACTACGACTGTAACGGA SEQ ID NO: 423 AAGATTTAGCGCAGTGCGTC
SEQ ID NO: 424 TTCTAAATCGCGTCACGCAG SEQ ID NO: 425 GGATAGCCTCTACTCCATCG
SEQ ID NO: 426 CCTATCGGAGATGAGGTAGC SEQ ID NO: 427 CAGAATCGCGATGAAATCACG SEQ ID NO: 428 GTCTTAGCGCTACTTTAGTGC
SEQ ID NO: 429 CGATGAAGCGCCTGATTGAAT SEQ ID NO: 430 GCTACTTCGCGGACTAACTTA
SEQ ID NO: 431 ACGAAACTGGCCTGYGT
SEQ ID NO: 432 TGCTTTGACCGGACYCA SEQ ID NO: 433 CCTTCACCCTCGCACCATT
SEQ ID NO: 434 GGAAGTGGGAGCGTGGTAA SEQ ID NO: 435 CTGCGCGATTACCGTGGT
SEQ ID NO: 436 GACGCGCTAATGGCACCA
SEQ ID NO: 437 CTATCTATCTAACTATCTATATCA SEQ ID NO: 438 GATAGATAGATTGATAGATATAGT SEQ ID NO: 439 TCATTCATATACATACCAATTCAT
SEQ ID NO: 440 AGTAAGTATATGTATGGTTAAGTA
SEQ ID NO: 441 TCATTTACCAATCTTTCTTTATAC
SEQ ID NO: 442 AGTAAATGGTTAGAAAGAAATATG
SEQ ID NO: 443 CTATCTATCTAACTATCTATATCAGCGCGAAAGTGACGGTGA
SEQ ID NO: 444 TCATTCATATACATACCAATTCATTGCCGGACGCCAGGCTGC
SEQ ID NO: 445 TCATTTACCAATCTTTCTTTATACGAGCCRGGTGCTACTACC
The present invention (a method of serotyping by PCR used for the detection of Hazard Group 3 (HG3) Salmonella) has been tested for use in the Laboratory for Gastrointestinal Pathogens at the HPA Colindale. Specific examples will now be given of the use of the invention in that laboratory. These are exemplary; they are not limiting in any way to the scope of the claims or to the invention. These examples represent the best practice for the using the invention as currently envisaged.
Example 1 : The invention was tested for an ability to screen all Salmonella to allow release of HG2 Salmonella from the high level containment laboratory (CL). HG3 Salmonella must remain in the CL.
Using 366 strains of bacteria from the archived collection of the HPA testing was carried out according to the description of the invention with the following results: Salmonella Typhi (HG3): 150 tested and 150 correctly identified by the invention. Salmonella Paratyphi A (HG3) 76 tested and 76 correctly identified by the invention; Salmonella Paratyphi C (HG3) 34 tested and 34 correctly identified by the invention; of 260 HG3 isolates all were correctly identified. Salmonella Typhimurium 81 tested and 81 correctly identified by the invention. Salmonella Enteritidis 25 tested and 25 correctly identified by the invention (1 not assigned as either HG3 or HG2). The invention has an accuracy of 100% for the positive identification of 260 HG3 Salmonella and an accuracy of 106/106 for the positive identification of HG2 Salmonella. The serious error rate (identification HG3 as HG2 Salmonella) was zero. Table 4. Analytical Validation
Figure imgf000085_0001
Example 2: The invention was tested using 356 unknown isolates as they were received by the Laboratory for Gastrointestinal Pathogens. Salmonella Typhi (HG3): 17 tested and 17 correctly identified by the invention. Salmonella Paratyphi A (HG3) 5 tested and 5 correctly identified by the invention; of 22 HG3 isolates all were correctly identified. Salmonella Typhimurium 74 tested and 74 correctly identified by the invention. Salmonella Enteritidis 39 tested and 38 correctly identified by the invention (1 not assigned as either HG3 or HG2). Other HG2 Salmonella 221 tested and 221 correctly identified by the invention; of 334 HG2 Salmonella tested 333 were correctly identified. The invention, when the operator is blind to the expected result of the test, has an accuracy of 22/22 for the positive identification of HG3 Salmonella and an accuracy of 333/334 for the positive identification of HG2 Salmonella. The serious error rate (identification HG3 as HG2 Salmonella) was zero.
Table 5. Diagnostic Validation
Figure imgf000085_0002

Claims

1 . A method of screening a sample containing nucleic acid for the presence or absence of HG3 salmonella nucleic acid, said method comprising:
A. contacting the sample with a series of probes, said series of probes comprising:
probe A, wherein probe A binds to:
i. position 328 of target SEQ ID NO: 1 when the nucleotide at position 328 of SEQ ID NO: 1 is A, and wherein probe A does not bind to position 328 of SEQ ID NO: 1 when the nucleotide at position 328 is G; or
ii. position 328 of SEQ ID NO: 2 (the complement of SEQ ID NO:
1 ) when the nucleotide at position 328 is T, and wherein probe A does not bind to the nucleotide at position 328 of SEQ ID NO: 2 when the nucleotide at position 328 is C;
probe B, wherein probe B binds to:
i. position 136 of target SEQ ID NO: 3 when the nucleotide at position 136 of SEQ ID NO: 3 is T, and wherein probe B does not bind to position 136 of SEQ ID NO: 3 when the nucleotide at position 136 is C; or
ii. position 136 of SEQ ID NO: 4 (the complement of SEQ ID NO:
3) when the nucleotide at position 136 is A, and wherein probe B does not bind to the nucleotide at position 136 of SEQ ID NO: 4 when the nucleotide at position 136 is G;
probe C, wherein probe C binds to:
i. position 378 of target SEQ ID NO: 5 when the nucleotide at position 378 of SEQ ID NO: 5 is C, and wherein probe C does not bind to position 378 of SEQ ID NO: 5 when the nucleotide at position 378 is T; or
ii. position 378 of SEQ ID NO: 6 (the complement of SEQ ID NO:
5) when the nucleotide at position 378 is G, and wherein probe C does not bind to the nucleotide at position 378 of SEQ ID NO: 6 when the nucleotide at position 378 is A;
probe D, wherein probe D binds to:
i. position 57 of target SEQ ID NO: 7 when the nucleotide at position 57 of SEQ ID NO: 7 is C, and wherein probe D does not bind to position 57 of SEQ ID NO: 7 when the nucleotide at position 57 is T; or
ii. position 57 of SEQ ID NO: 8 (the complement of SEQ ID NO: 7) when the nucleotide at position 57 is G, and wherein probe D does not bind to the nucleotide at position 57 of SEQ ID NO: 8 when the nucleotide at position 57 is A;
probe E, wherein probe E binds to:
i. position 339 of target SEQ ID NO: 9 when the nucleotide at position 339 of SEQ ID NO: 9 is C, and wherein probe E does not bind to position 339 of SEQ ID NO: 9 when the nucleotide at position 339 is T; or
ii. position 339 of SEQ ID NO: 10 (the complement of SEQ I D NO:
9) when the nucleotide at position 339 is G, and wherein probe E does not bind to the nucleotide at position 339 of SEQ ID NO: 10 when the nucleotide at position 339 is A;
probe F, wherein probe F binds to:
i. position 69 of target SEQ ID NO: 1 1 when the nucleotide at position 69 of SEQ ID NO: 1 1 is A, and wherein probe F does not bind to position 69 of SEQ ID NO: 1 1 when the nucleotide at position 69 is G; or
ii. position 69 of SEQ ID NO: 12 (the complement of SEQ ID NO:
1 1 ) when the nucleotide at position 69 is T, and wherein probe F does not bind to the nucleotide at position 69 of SEQ I D NO: 1 1 when the nucleotide at position 69 is C;
probe G, wherein probe G binds to:
i. position 204 of target SEQ ID NO: 13 when the nucleotide at position 204 of SEQ I D NO: 13 is G, and wherein probe G does not bind to position 204 of SEQ ID NO: 13 when the nucleotide at position 204 is A; or
ii. position 204 of SEQ ID NO: 14 (the complement of SEQ I D NO:
13) when the nucleotide at position 204 is C, and wherein probe G does not bind to the nucleotide at position 204 of SEQ ID NO: 14 when the nucleotide at position 204 is T;
probe H, wherein probe H binds to:
i. position 255 of target SEQ ID NO: 15 when the nucleotide at position 255 of SEQ ID NO: 15 is A, and wherein probe H does not bind to position 255 of SEQ ID NO: 15 when the nucleotide at position 255 is G; or
ii. position X of SEQ ID NO: 16 (the complement of SEQ ID NO:
15) when the nucleotide at position 255 is T, and wherein probe H does not bind to the nucleotide at position 255 of SEQ ID NO: 15 when the nucleotide at position 255 is C;
probe I, wherein probe I binds to:
i. position 300 of target SEQ ID NO: 17 when the nucleotide at position 300 of SEQ ID NO: 17 is C, and wherein probe I does not bind to position 300 of SEQ ID NO: 17 when the nucleotide at position 300 is T; or
ii. position 300 of SEQ ID NO: 18 (the complement of SEQ I D NO:
17) when the nucleotide at position 300 is G, and wherein probe I does not bind to the nucleotide at position 300 of SEQ ID NO: 18 when the nucleotide at position 300 is A;
probe J, wherein probe J binds to:
i. position 342 of target SEQ ID NO: 19 when the nucleotide at position 342 of SEQ ID NO: 19 is C, and wherein probe J does not bind to position 342 of SEQ ID NO: 19 when the nucleotide at position 342 is A or T; or
ii. position 342 of SEQ ID NO: 20 (the complement of SEQ I D NO:
19) when the nucleotide at position 342 is G, and wherein probe J does not bind to the nucleotide at position 342 of SEQ ID NO: 20 when the nucleotide at position 342 is T or A;
probe K, wherein probe K binds to:
i. position 252 of target SEQ ID NO: 21 when the nucleotide at position 252 of SEQ ID NO: 22 is T, and wherein probe K does not bind to position 252 of SEQ ID NO: 21 when the nucleotide at position 252 is G; or
ii. position 252 of SEQ ID NO: 22 (the complement of SEQ I D NO:
21 ) when the nucleotide at position 252 is A, and wherein probe K does not bind to the nucleotide at position 252 of SEQ ID NO: 22 when the nucleotide at position 252 is C;
probe L, wherein probe L binds to:
i. position 9 of target SEQ ID NO: 23 when the nucleotide at position 9 of SEQ ID NO: 23 is C, and wherein probe L does not bind to position 9 of SEQ ID NO: 23 when the nucleotide at position 9 is T; or
ii. position 9 of SEQ ID NO: 24 (the complement of SEQ ID NO:
23) when the nucleotide at position 9 is G, and wherein probe L does not bind to the nucleotide at position 9 of SEQ ID NO: 24 when the nucleotide at position 9 is A;
B. allowing said series of probes to bind to their respective target nucleic acid sequence (or the complement thereof),
i. wherein the binding of one or more of said probes to its respective target nucleic acid sequence (or the complement thereof) indicates the presence of HG3 salmonella nucleic acid; and
ii. wherein the absence of binding of said series of probes to their respective target nucleic acid sequence (or the complement thereof) indicates the absence of HG3 salmonella nucleic acid; and
C. confirming the presence of HG3 salmonella nucleic acid by detecting the presence of one or more of said probes, or confirming the absence of HG3 salmonella nucleic acid by detecting the absence of said series to probes.
2. The method according to Claim 1 , wherein each of the probes comprises a sequence of 9-30 consecutive nucleotides, and wherein said sequence of 9-30 consecutive nucleotides binds the probe(s) to the nucleotide and nucleotide position of the target nucleic acid sequence(s) (or complement thereof) as defined in Claim 1.
3. The method according to Claim 2, wherein said sequence of 9-30 consecutive nucleotides further comprises a complement capture moiety.
4. The method according to Claim 3, wherein said method comprises contacting said probes and said sample nucleic acid with an anti-complement capture moiety that binds to the complement capture moiety.
5. The method according to Claim 4, wherein said complement capture moiety and said anti-complement capture moiety comprise complementary nucleic acid sequences.
6. The method according to Claim 4 or Claim 5, wherein said anti-complement capture moiety is immobilised on a surface, for example on the surface of a bead.
7. The method according to any of Claims 4-6, wherein each complement capture moiety is unique to each probe and binds only to one of said anti-complement capture moieties.
8. The method according to any of Claims 4-7, wherein each anti-complement capture moiety is immobilised on to the surface of a different bead, where each of said different beads carries a unique identifier.
9. The method according to Claim 8, wherein said unique identifier comprises a dye or combination of dyes, and wherein each of the different beads emits light at a different wavelength.
10. The method according to any preceding claim, wherein the probe has a 3'-to-5' nucleic acid sequence orientation, and wherein following binding of the probe(s) to a target or complement nucleic acid sequence, said probe(s) is extended from the 3' end by sequential nucleotide addition to form a probe-containing amplification product.
1 1 . The method according to Claim 10, wherein said probe-extending reaction includes the presence of labelled dNTPs, which are thereby incorporated into the probe-containing amplification product.
12. The method according to any preceding claim, wherein said method further comprises confirming the presence or absence of HG2 nucleic acid, wherein the presence of HG2 nucleic acid is confirmed by the binding (to sample nucleic acid) of one or more probes selected from the group consisting of probes N-U and wherein the absence of HG2 nucleic acid is confirmed by the absence of binding (to sample nucleic acid) of one or more of said probes, wherein:
ix. probe N binds to
i. position 385 of target SEQ ID NO: 27 when the nucleotide at position 385 of SEQ ID NO: 27 is T, and wherein probe N does not bind to position 385 of SEQ ID NO: 27 when the nucleotide at position 385 is C; or
ii. position 385 of SEQ I D NO: 28 (the complement of SEQ ID NO:
27) when the nucleotide at position 385 is A, and wherein probe N does not bind to the nucleotide at position 385 of SEQ ID NO:
28 when the nucleotide at position 385 is G;
x. probe O binds to i. position 501 of target SEQ ID NO: 29 when the nucleotide at position 501 of SEQ ID NO: 29 is C, and wherein probe O does not bind to position 501 of SEQ ID NO: 29 when the nucleotide at position 501 is T; or
ii. position 501 of SEQ ID NO: 30 (the complement of SEQ ID NO:
29) when the nucleotide at position 501 is G, and wherein probe O does not bind to the nucleotide at position 501 of SEQ ID NO: 30 when the nucleotide at position 501 is A;
xi. probe P binds to
i. position 271 of target SEQ ID NO: 31 when the nucleotide at position 271 of SEQ ID NO: 31 is G, and wherein probe P does not bind to position 271 of SEQ ID NO: 31 when the nucleotide at position 271 is T; or
ii. position 271 of SEQ ID NO: 32 (the complement of SEQ ID NO:
31 ) when the nucleotide at position 271 is C, and wherein probe
P does not bind to the nucleotide at position 271 of SEQ ID NO: 32 when the nucleotide at position 271 is A;
xii. probe Q binds to
i. position 492 of target SEQ ID NO: 33 when the nucleotide at position 492 of SEQ ID NO: 33 is G, and wherein probe Q does not bind to position 492 of SEQ ID NO: 33 when the nucleotide at position 492 is A; or
ii. position 492 of SEQ ID NO: 34 (the complement of SEQ ID NO:
33) when the nucleotide at position 492 is C, and wherein probe Q does not bind to the nucleotide at position 492 of SEQ ID NO:
34 when the nucleotide at position 492 is T;
xiii. probe R binds to
i. position 271 of target SEQ ID NO: 35 when the nucleotide at position 271 of SEQ ID NO: 35 is T, and wherein probe R does not bind to position 271 of SEQ ID NO: 35 when the nucleotide at position 271 is G; or
ii. position 271 of SEQ ID NO: 36 (the complement of SEQ ID NO:
35) when the nucleotide at position 271 is A, and wherein probe R does not bind to the nucleotide at position 271 of SEQ ID NO: 36 when the nucleotide at position 271 is C;
xiv. probe S binds to i. position 100 of target SEQ ID NO: 37 when the nucleotide at position 100 of SEQ ID NO: 37 is T, and wherein probe S does not bind to position 100 of SEQ ID NO: 37 when the nucleotide at position 100 is G; or
ii. position 100 of SEQ ID NO: 38 (the complement of SEQ ID NO:
37) when the nucleotide at position 100 is A, and wherein probe S does not bind to the nucleotide at position 100 of SEQ ID NO: 38 when the nucleotide at position 100 is C;
xv. probe T binds to
i. position 210 of target SEQ ID NO: 39 when the nucleotide at position 210 of SEQ ID NO: 39 is T, and wherein probe T does not bind to position 210 of SEQ ID NO: 39 when the nucleotide at position 210 is A; or
ii. position 210 of SEQ ID NO: 40 (the complement of SEQ ID NO:
39) when the nucleotide at position 210 is A, and wherein probe T does not bind to the nucleotide at position 210 of SEQ ID NO: 40 when the nucleotide at position 210 is T;
xvi. probe U binds to
i. position 66 of target SEQ ID NO: 41 when the nucleotide at position 66 of SEQ ID NO: 41 is T, and wherein probe U does not bind to position 66 of SEQ ID NO: 41 when the nucleotide at position 66 is C; or
ii. position 66 of SEQ ID NO: 42 (the complement of SEQ ID NO:
41 ) when the nucleotide at position 66 is A, and wherein probe U does not bind to the nucleotide at position 66 of SEQ ID NO: 42 when the nucleotide at position 66 is G.
13. The method according to any preceding claim wherein the sample nucleic acid is contacted substantially simultaneously with all of probes A-L, or with all of probes A- L and N-U.
14. The method according to any preceding claim, wherein said series of probes comprises:
i) . at least one probe A;
ii) at least one probe B
iii) at least one probe C
iv) at least one probe D v) at least one probe E;
vi) at least one probe F;
vii) at least one probe G;
viii) at least one probe H;
ix) at least one probe 1;
x) at least one probe J;
xi) at least one probe K; and
xii) at least one probe L;
xiii) and optionally comprising at least one probe M;
xiv) and further optionally comprising one or more of
1 . at least one probe N;
2. at least one probe O;
3. at least one probe P;
4. at least one probe Q;
5. at least one probe R;
6. at least one probe S;
7. at least one probe T; and
8. at least one probe U;
wherein each one of the probes is independently selected from nucleic acid sequences having at least 80% sequence identity to the nucleic acid sequences listed in Table 1 (preferably having at least 80% sequence identity to the target sequence binding probes listed in Table 1 ) including fragments thereof that comprises at least 9 consecutive nucleotides of said sequences listed in Table 1 .
A kit of detection probes comprising one or more of:
i)- at least one probe A;
ϋ) at least one probe B;
xv) at least one probe C;
xvi) at least one probe D;
xvii) at least one probe E;
xviii) at least one probe F;
xix) at least one probe G;
xx) at least one probe H;
xxi) at least one probe I;
xxii) at least one probe J;
xxiii) at least one probe K; and/ or
xxiv) at least one probe L; and optionally comprising at least one probe M;
and further optionally comprising one or more of
1 . at least one probe N;
2. at least one probe O;
3. at least one probe P;
4 at least one probe Q;
5 at least one probe R;
6 at least one probe S;
7 at least one probe T; and
8 at least one probe U;
wherein said probes are independently selected from nucleic acid sequences having at least 80% sequence identity to the nucleic acid sequences listed in Table 1 (preferably having at least 80% sequence identity to the target sequence binding probes listed in Table 1 ) including fragments thereof that comprises at least 9 consecutive nucleotides of said sequences listed in Table 1.
16. Use of the kit of probes according to Claim 15, for serotyping one or more salmonella species present in a sample, wherein said salmonella species is selected from the group consisting of: S. paratyphi A (via probe A and/ or probe B), S. paratyphi B/ Java (via one or more, such as all, of probes C-L), S. paratyphi C (via probe J and/ or probe K), S. typhi (via probe L), S. enteritidis (via one or more, such as all, of probes N-P), S. typhimurium (via one or more, such as all, of probes Q-T) and/ or S. choleraesuis (via probe U).
PCT/GB2012/051614 2011-07-08 2012-07-09 Salmonella detection assay WO2013007996A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB1111723.1 2011-07-08
GBGB1111723.1A GB201111723D0 (en) 2011-07-08 2011-07-08 Salmonella detection assay

Publications (1)

Publication Number Publication Date
WO2013007996A1 true WO2013007996A1 (en) 2013-01-17

Family

ID=44544443

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2012/051614 WO2013007996A1 (en) 2011-07-08 2012-07-09 Salmonella detection assay

Country Status (2)

Country Link
GB (1) GB201111723D0 (en)
WO (1) WO2013007996A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105004857A (en) * 2015-08-19 2015-10-28 上海市长宁区疾病预防控制中心 Kit for salmonella paratyphi B and salmonella paratyphi B var Java, as well as preparation and using method of kit
CN109468394A (en) * 2018-11-30 2019-03-15 舟山出入境检验检疫局综合技术服务中心 A kind of five heavy PCR primers, kit and its application detecting four kinds of Salmonella serogroups
GB2612409A (en) * 2021-08-09 2023-05-03 Univ Jiangsu Indel molecular marker of ultrasonic mutagenesis salmonella typhimurium HISD gene and use thereof

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008025570A1 (en) * 2006-09-01 2008-03-06 Bio-Rad Pasteur Detection of salmonella by real-time multiplex pcr
WO2010133257A1 (en) * 2009-05-22 2010-11-25 Fondazione Parco Tecnologico Padano Method for detection and identification of bacterial strains belonging to the classes escherichia coli, salmonella, campylobacter and listeria

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008025570A1 (en) * 2006-09-01 2008-03-06 Bio-Rad Pasteur Detection of salmonella by real-time multiplex pcr
WO2010133257A1 (en) * 2009-05-22 2010-11-25 Fondazione Parco Tecnologico Padano Method for detection and identification of bacterial strains belonging to the classes escherichia coli, salmonella, campylobacter and listeria

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
CRUMP, J.A. ET AL., BULL WORLD HITH ORG, vol. 82, 2004, pages 346 - 53
H. CHART ET AL: "Serodiagnosis of Salmonella enterica serovar Typhi and S. enterica serovars Paratyphi A, B and C human infections", JOURNAL OF MEDICAL MICROBIOLOGY, vol. 56, no. 9, 1 September 2007 (2007-09-01), pages 1161 - 1166, XP055035890, ISSN: 0022-2615, DOI: 10.1099/jmm.0.47197-0 *
TOURON A ET AL: "Detection of Salmonella in environmental water and sediment by a nested-multiplex polymerase chain reaction assay", RESEARCH IN MICROBIOLOGY, ELSEVIER, AMSTERDAM, NL, vol. 156, no. 4, 1 May 2005 (2005-05-01), pages 541 - 553, XP027793129, ISSN: 0923-2508, [retrieved on 20050501] *
YAJUN SONG ET AL: "A multiplex single nucleotide polymorphism typing assay for detecting mutations that result in decreased fluoroquinolone susceptibility in Salmonella enterica serovars Typhi and Paratyphi A", JOURNAL OF ANTIMICROBIAL CHEMOTHERAPY, OXFORD UNIVERSITY PRESS, GB, vol. 65, no. 8, 1 January 2010 (2010-01-01), pages 1631 - 1641, XP002678832, ISSN: 0305-7453, Retrieved from the Internet <URL:http://jac.oxfordjournals.org/content/65/8/1631> [retrieved on 20100528], DOI: 10.1093/JAC/DKQ175 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105004857A (en) * 2015-08-19 2015-10-28 上海市长宁区疾病预防控制中心 Kit for salmonella paratyphi B and salmonella paratyphi B var Java, as well as preparation and using method of kit
CN109468394A (en) * 2018-11-30 2019-03-15 舟山出入境检验检疫局综合技术服务中心 A kind of five heavy PCR primers, kit and its application detecting four kinds of Salmonella serogroups
GB2612409A (en) * 2021-08-09 2023-05-03 Univ Jiangsu Indel molecular marker of ultrasonic mutagenesis salmonella typhimurium HISD gene and use thereof
EP4155414A4 (en) * 2021-08-09 2023-12-20 Jiangsu University Indel molecular marker of ultrasonic mutagenesis salmonella typhimurium hisd gene and use thereof
GB2612409B (en) * 2021-08-09 2024-08-21 Univ Jiangsu Indel molecular marker of ultrasonic mutagenesis salmonella typhimurium hisD gene and use thereof

Also Published As

Publication number Publication date
GB201111723D0 (en) 2011-08-24

Similar Documents

Publication Publication Date Title
CN107075569B (en) Biomarkers and combinations thereof for diagnosing tuberculosis
CN106399517B (en) A nucleic acid detection technology based on multi-cross isothermal amplification combined with gold nanobiosensing
JP2017501719A (en) Nucleic acid detection method and kit
CN107167602B (en) The methods for intersecting constant-temperature amplification combination gold nano bio-sensing detection comma bacillus more
US20160237479A1 (en) Methods and compositions for isothermal amplification and detection of mycoplasma pneumoniae
US9868995B2 (en) Method for detecting Helicobacter pylori DNA in a stool sample
JPWO2017212904A1 (en) Rapid detection method of African swine fever virus using LAMP method combining multiple primer sets
RU2009135270A (en) METHODS FOR DETECTING AN INFLAMMATORY INTESTINAL DISEASE
WO2017187178A1 (en) Method for detecting carbapenemase-producing bacteria
KR101922598B1 (en) Composition for diagnosing diseases caused by mutation of RNF213 gene using Real-Time PCR, and use thereof
CN105755134B (en) Endonuclease-mediated real-time multiple cross-displacement nucleic acid amplification technology and application
WO2013007996A1 (en) Salmonella detection assay
EP3497246A1 (en) Flavivirus diagnostic assay
EP1426448A1 (en) Method for lowering the effects of sequence variations in a diagnostic hybridization assay, probe for use in the assay and assay
US20200040390A1 (en) Methods for Sequencing Repetitive Genomic Regions
JP5750369B2 (en) Rapid detection of mycobacteria
KR102039178B1 (en) Primer for amplifying the gene of malaria protozoa for the diagnosis of infections of malaria protozoa and detection method of malaria protozoa using the same
CN103374627A (en) Congenital heart defect (CHD)-related gene polycystic kidney disease 1 like l (PKD1L1) and application thereof
KR20230057233A (en) Method of Detecting Target RNA Molecule and Target Protein Simultaneously and Kit for the Same Method
KR102076343B1 (en) Composition for detecting adenovirus type 55 using Real-time LAMP and uses thereof
KR100868883B1 (en) Detection of Neuroblastoma Cells
KR20200060145A (en) Loop Mediated Isothermal Amplification Primer Set for Detection of Dengue Virus Serotype 1 or 3 and Uses Thereof
KR100812795B1 (en) Primer for anthrax detection, probe and method for detecting anthrax using the same
JP2013220032A (en) Colon bacillus o-antigenic type inspection method using lamp method and inspection kit
JP2008161113A (en) Method for discriminating bacterium helicobacter pylori

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12735616

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 12735616

Country of ref document: EP

Kind code of ref document: A1

点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载