+

CN110914423A - 经修饰的Cas9蛋白及其用途 - Google Patents

经修饰的Cas9蛋白及其用途 Download PDF

Info

Publication number
CN110914423A
CN110914423A CN201880050453.1A CN201880050453A CN110914423A CN 110914423 A CN110914423 A CN 110914423A CN 201880050453 A CN201880050453 A CN 201880050453A CN 110914423 A CN110914423 A CN 110914423A
Authority
CN
China
Prior art keywords
lys
leu
ctg
aag
glu
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201880050453.1A
Other languages
English (en)
Other versions
CN110914423B (zh
Inventor
濡木理
西增弘志
平野央人
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Tokyo NUC
Original Assignee
University of Tokyo NUC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Tokyo NUC filed Critical University of Tokyo NUC
Publication of CN110914423A publication Critical patent/CN110914423A/zh
Application granted granted Critical
Publication of CN110914423B publication Critical patent/CN110914423B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22Ribonucleases [RNase]; Deoxyribonucleases [DNase]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/113Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/20Fusion polypeptide containing a tag with affinity for a non-protein ligand
    • C07K2319/21Fusion polypeptide containing a tag with affinity for a non-protein ligand containing a His-tag
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/50Fusion polypeptide containing protease site

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Chemical & Material Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • General Health & Medical Sciences (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Medicinal Chemistry (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

本发明的目的在于:提供一种在维持与向导RNA的结合能力的同时靶序列的限制得到缓解的经修饰的Cas9蛋白及其用途。一种蛋白等,该蛋白包含在SEQ ID NO:1中1335位的精氨酸突变成丙氨酸(R1335A)、异亮氨酸(R1335I)、蛋氨酸(R1335M)、苏氨酸(R1335T)或缬氨酸(R1335V)、1111位的亮氨酸突变成精氨酸(L1111R)、1135位的天冬氨酸突变成缬氨酸(D1135V)、1218位的甘氨酸突变成精氨酸(G1218R)、1219位的谷氨酸突变成苯丙氨酸(E1219F)、1322位的丙氨酸突变成精氨酸(A1322R)、1337位的苏氨酸突变成精氨酸(T1337R)而得到的氨基酸序列。

Description

经修饰的Cas9蛋白及其用途
技术领域
本发明涉及可靶的区域进一步扩张的、经修饰的Cas9蛋白及其用途。
背景技术
已知成簇的规律间隔短回文重复序列(Clustered Regularly InterspacedShort Palindromic Repeats:CRISPR)和Cas (CRISPR-相关)基因一起在细菌和古细菌中构成提供对侵入外来核酸的获得耐性的适应免疫系统。CRISPR往往是由噬菌体或质粒DNA引起的,由其间插入有大小类似的被称作间隔序列的独特可变DNA序列的24~48bp的短保守重复序列构成。另外,在重复和间隔序列的附近存在编码Cas蛋白家族的基因组。
在CRISPR-Cas系统中,外源性DNA被Cas蛋白家族切割成30bp左右的片段,并插入CRISPR中。作为Cas蛋白家族之一的Cas1和Cas2蛋白识别外源性DNA的被称作前间隔序列邻近基序(proto-spacer adjacent motif,PAM)的核苷酸序列(塩基配列),切取其上游,插入到宿主的CRISPR序列中,这成为细菌的免疫记忆。包含免疫记忆的CRISPR序列转录生成的RNA(称作pre-crRNA。)与一部分互补的RNA(反式激活crRNA (trans-activating crRNA:tracrRNA))配对,被摄入到作为Cas蛋白家族之一的Cas9蛋白中。摄入到Cas9中的pre-crRNA和tracrRNA被RNaseIII切割,成为包含外来序列(向导序列)的小的RNA片段(CRISPR-RNAs:crRNAs),形成Cas9-crRNA-tracrRNA复合物。Cas9-crRNA-tracrRNA复合物和与crRNA互补的外来侵入性DNA结合,作为切割DNA的酶(核酸酶)的Cas9蛋白切割外来侵入性DNA,从而抑制和排除从外侵入的DNA的功能。
Cas9蛋白识别外来侵入性DNA中的PAM序列,在其上游切割双链DNA使形成平滑末端。PAM序列的长度或核苷酸序列根据细菌种类而多种多样,在酿脓链球菌(Streptococcus pyogenes)(S.pyogenes)中识别“NGG”这3个碱基(塩基)。嗜热链球菌(Streptococcus thermophilus)(S.thermophilus)持有2个Cas9,分别将“NGGNG”或“NNAGAA”这5~6个碱基识别为PAM序列。在新凶手弗朗西丝氏菌(Francisella novicida)(F. novicida)中识别“NGR”这3个碱基。切割PAM序列上游的任何bp的位置还根据细菌种类而不同,但包含S. pyogenes的大部分的Cas9直系同源物切割PAM序列的3个碱基上游。
近年来,正在积极开发将细菌中的CRISPR-Cas系统应用于基因组编辑的技术。使crRNA与tracrRNA融合,作为tracrRNA-crRNA嵌合体(以下,称作向导RNA(guide RNA:gRNA)。)来表达,并进行有效利用。由此,称为核酸酶(RNA-向导的核酸酶:RGN),在目标部位(靶位点)切割基因组DNA。
CRISPR-Cas系统有I、II、III型,但在基因组编辑中专门使用II型CRISPR-Cas系统,在II型中使用Cas9蛋白作为RGN。由于来自S.pyogenes的Cas9蛋白识别NGG这3个碱基作为PAM序列,所以只要有2个鸟嘌呤并列的序列,即可切割其上游。
利用了CRISPR-Cas系统的方法不仅可以合成与目标DNA序列同源的短gRNA,还可以使用作为单一蛋白的Cas9蛋白进行基因组编辑。因此,不必像以往使用的锌指核酸酶(ZFN)或类反式激活因子激动剂(TALEN)那样合成每个DNA序列都不同的大的蛋白,即可简便且快速地进行基因组编辑。
专利文献1中公开了:有效利用了来自S.pyogenes的CRISPR-Cas系统的基因组编辑技术。
专利文献2中公开了:有效利用了来自S.thermophilus的CRISPR-Cas系统的基因组编辑技术。而且,专利文献2中还公开了:Cas9蛋白的D31A或N891A突变体起到仅其中一条DNA链带有切口的DNA切割酶即切口酶的作用。而且,在根据DNA切割后的修复机制容易发生插入缺失等突变的非同源末端结合的发生率少时,仍具有与野生型Cas9蛋白同等程度的同源重组效率。
非专利文献1中公开了:利用2个Cas9蛋白的D10A突变体和与该D10A突变体形成复合物的1对靶特异性向导RNA的双切口酶系统,其是使用了来自S.pyogenes的Cas9的CRISPR-Cas系统。各Cas9蛋白的D10A突变体和靶特异性向导RNA的复合物在与向导RNA互补的DNA链上仅制作1个切口。一对向导RNA有约20个碱基左右的错配(ずれ),仅识别位于靶DNA的相反链的靶序列。由各Cas9蛋白的D10A突变体和靶特异性向导RNA的复合物制作的2个切口形成模仿DNA双链切割(DNA double-strand break:DSB)的状态,通过使用一对向导RNA,在维持高水平的效率的同时,可以改善Cas9蛋白介导型基因编辑的特异性。
专利文献3中公开了来自S.pyogenes的Cas9蛋白的各种突变体,专利文献4中公开了来自F.novicida的Cas9蛋白的各种突变体。
现有技术文献
专利文献
专利文献1:WO2014/093661;
专利文献2:日本特表2015-510778号公报;
专利文献3:WO2016/141224;
专利文献4:WO2017/010543;
非专利文献
非专利文献1:Ran, F. A.等人, Double Nicking by RNA-Guided CRISPR Cas9 forEnhanced Genome Editing Specificity, Cell, 第154卷, 第1380-1389页, 2013。
发明内容
发明所要解决的课题
专利文献1中公开的来自S.pyogenes的Cas9 (在本说明书中也称作SpCas9)蛋白可识别的PAM序列是“NGG (N为任意碱基)”这2个碱基。另外,在非专利文献1所公开的双切口酶系统中使用了SpCas9蛋白,需要在靶序列内的有义链和反义链中各有1处可识别的PAM序列共计2处,因此可进一步编辑的靶序列受到限制。
如此,在现有的Cas9蛋白中,可识别的PAM序列有限制,所以存在着可编辑的靶序列受到限制的问题。
本发明的目的在于:提供一种在维持与向导RNA的结合能力的同时靶序列的限制得到缓解的经修饰的Cas9蛋白及其用途。
用于解决课题的手段
本发明人着眼于SpCas9蛋白作为Cas9蛋白,为了解决上述课题进行了深入研究。其结果,通过将SpCas9蛋白的规定位置的氨基酸取代成特定的氨基酸(导入突变),成功地在维持与向导RNA的结合能力的同时将作为现有NGG (N为任意碱基)的2个碱基的PAM序列转换成NG的1个碱基的序列,从而完成了本发明。
本说明书中,有时将导入突变前的Cas9蛋白称为野生型Cas9蛋白,而将导入突变后的Cas9蛋白称为经修饰的Cas9蛋白或突变型Cas9蛋白。
即,本发明如下。
[1] 蛋白,该蛋白由包含在SEQ ID NO: 1所表示的氨基酸序列中1335位的精氨酸被选自丙氨酸、甘氨酸、半胱氨酸、异亮氨酸、亮氨酸、蛋氨酸、苯丙氨酸、脯氨酸、缬氨酸、苏氨酸、天冬酰胺和天冬氨酸的1个氨基酸取代而得到的氨基酸序列的序列构成,并且具有与向导RNA的结合能力。
[2] 上述[1]所述的蛋白,其中,在SEQ ID NO: 1所表示的氨基酸序列中,进一步在1219位具有突变。
[3] 上述[1]或[2]所述的蛋白,其中,在SEQ ID NO: 1所表示的氨基酸序列中,进一步在1322位具有突变。
[4] 蛋白,该蛋白由包含在SEQ ID NO: 1所表示的氨基酸序列中1335位的精氨酸被选自丙氨酸、甘氨酸、半胱氨酸、异亮氨酸、亮氨酸、蛋氨酸、苯丙氨酸、脯氨酸、缬氨酸、苏氨酸、天冬酰胺和天冬氨酸的1个氨基酸取代、并进一步在1219位具有突变的氨基酸序列的序列构成,并且具有与向导RNA的结合能力。
[5] 蛋白,该蛋白由包含在SEQ ID NO: 1所表示的氨基酸序列中1335位的精氨酸被选自丙氨酸、甘氨酸、半胱氨酸、异亮氨酸、亮氨酸、蛋氨酸、苯丙氨酸、脯氨酸、缬氨酸、苏氨酸、天冬酰胺和天冬氨酸的1个氨基酸取代、并进一步在1322位具有突变的氨基酸序列的序列构成,并且具有与向导RNA的结合能力。
[6] 上述[1]~[5]中任一项所述的蛋白,其中,1335位的精氨酸的取代是取代成丙氨酸。
[7] 上述[1]~[5]中任一项所述的蛋白,其中,1335位的精氨酸的取代是取代成异亮氨酸、蛋氨酸、苏氨酸或缬氨酸。
[8] 上述[2]或[4]所述的蛋白,其中,1219位的突变是谷氨酸被取代成苯丙氨酸。
[9] 上述[3]或[5]所述的蛋白,其中,1322位的突变是丙氨酸被取代成精氨酸、组氨酸或赖氨酸。
[10] 上述[9]所述的蛋白,其中,1322位的突变是丙氨酸被取代成精氨酸。
[11] 上述[1]~[10]中任一项所述的蛋白,其中,在SEQ ID NO: 1所表示的氨基酸序列中,进一步在选自1111位、1135位、1218位和1337位的至少一个位置具有突变。
[12] 上述[11]所述的蛋白,其中,在SEQ ID NO: 1所表示的氨基酸序列中,进一步在选自1111位、1135位、1218位和1337位的至少2个位置具有突变。
[13] 上述[11]所述的蛋白,其中,在SEQ ID NO: 1所表示的氨基酸序列中,进一步在选自1111位、1135位、1218位和1337位的至少3个位置具有突变。
[14] 上述[11]所述的蛋白,其中,在SEQ ID NO: 1所表示的氨基酸序列中,进一步在1111位、1135位、1218位和1337位具有突变。
[15] 上述[11]~[14]中任一项所述的蛋白,其中,
1111位的突变是亮氨酸被取代成精氨酸、组氨酸或赖氨酸;
1135位的突变是天冬氨酸被取代成缬氨酸;
1218位的突变是甘氨酸被取代成精氨酸、组氨酸或赖氨酸;
1337位的突变是苏氨酸被取代成精氨酸、组氨酸或赖氨酸。
[16] 上述[1]~[15]中任一项所述的蛋白,其中,在SEQ ID NO: 1的施行了突变的位置以外的位点具有80%以上的同源性。
[17] 上述[1]~[15]中任一项所述的蛋白,其中,在SEQ ID NO: 1的施行了突变的位置以外的位点取代、缺失、插入和/或添加了1个~多个氨基酸。
[18] 上述[1]~[17]中任一项所述的蛋白,该蛋白具有RNA诱导性DNA核酸内切酶活性。
[19] 上述[1]~[16]中任一项所述的蛋白,其中,在SEQ ID NO: 1所表示的氨基酸序列中进一步具有缺失一部分或全部的核酸酶活性的突变。
[20] 上述[19]所述的蛋白,其中,缺失一部分或全部的核酸酶活性的突变是SEQID NO: 1所表示的氨基酸序列中的、(i)选自10位、762位、839位、983位和986位的至少1个位置或相当于此的位置、和/或(ii)选自840位和863位的位置或相当于此的位置的突变。
[21] 上述[20]所述的蛋白,其中,10位的天冬氨酸被取代成丙氨酸或天冬酰胺;或者
840位的组氨酸被取代成丙氨酸、天冬酰胺或酪氨酸。
[22] 上述[19]~[21]中任一项所述的蛋白,该蛋白连接有转录调控因子蛋白或结构域。
[23] 上述[22]所述的蛋白,其中,转录调控因子为转录激活因子。
[24] 上述[22]所述的蛋白,其中,转录调控因子为转录沉默子或转录抑制因子。
[25] 核酸,该核酸编码上述[1]~[24]中任一项所述的蛋白。
[26] 蛋白-RNA复合物,该复合物具备上述[1]~[24]中任一项所述的蛋白和向导RNA,所述向导RNA包含由与靶双链多核苷酸中的PAM(前间隔序列邻近基序,Proto-spacerAdjacent Motif)序列的从1个碱基上游到20个碱基以上且24个碱基以下上游的核苷酸序列互补的核苷酸序列构成的多核苷酸。
[27] 用于位点特异性地修饰靶双链多核苷酸的方法,该方法具备以下步骤:
将靶双链多核苷酸、蛋白和向导RNA混合并进行培养的步骤;以及
上述蛋白在位于PAM序列上游的结合位点修饰上述靶双链多核苷酸的步骤,
上述靶双链多核苷酸具有由NG(N是指任意碱基,G是指鸟嘌呤)构成的PAM序列,
上述蛋白为上述[1]~[24]中任一项所述的蛋白,
上述向导RNA包含由与上述靶双链多核苷酸中的上述PAM序列的从1个碱基上游到20个碱基以上且24个碱基以下上游的核苷酸序列互补的核苷酸序列构成的多核苷酸。
[28] 上述[27]所述的方法,其中,修饰是指靶双链多核苷酸的位点特异性切割。
[29] 上述[27]所述的方法,其中,修饰是指靶双链多核苷酸中的位点特异性的1个以上的核苷酸的取代、缺失和/或添加。
[30] 增加细胞的靶基因表达的方法,该方法包括:使上述[23]所述的蛋白和针对上述靶基因的1个或多个向导RNA在上述细胞内表达。
[31] 减少细胞的靶基因表达的方法,该方法包括:使上述[24]所述的蛋白和针对上述靶基因的1个或多个向导RNA在上述细胞内表达。
[32] 上述[30]或[31]所述的方法,其中,细胞为真核细胞。
[33] 上述[30]或[31]所述的方法,其中,细胞为酵母细胞、植物细胞或动物细胞。
发明效果
根据本发明,可以获得在保持与向导RNA的结合力的同时PAM序列的识别变得广泛的Cas9蛋白。另外,可以提供利用了上述Cas9蛋白的简便且快速的靶序列位点特异性的基因组编辑技术。
附图说明
[图1A]图1A是表示实施例1中的DNA切割活性测定试验的琼脂糖凝胶电泳的结果的图。使用“TGT”作为PAM序列,使用EcoRI作为限制酶。
[图1B]图1B是表示实施例1中的DNA切割活性测定试验的琼脂糖凝胶电泳的结果的图。使用“TGG”作为PAM序列,使用HindIII作为限制酶。
[图1C]图1C是表示实施例1中的DNA切割活性测定试验的琼脂糖凝胶电泳的结果的图。使用“TGNA”作为PAM序列,使用BamHI作为限制酶。
[图1D]图1D是表示实施例1中的DNA切割活性测定试验的琼脂糖凝胶电泳的结果的图。使用“TGN”作为PAM序列,使用BamHI作为限制酶。
[图2]图2是表示实施例2中的DNA切割活性测定试验的琼脂糖凝胶电泳的结果的图。
[图3]图3是表示实施例3中的DNA切割活性测定试验的结果的图。使用“TGA”作为PAM序列,使用BamHI作为限制酶。
[图4]图4是表示实施例4中的DNA切割活性测定试验的结果的图。
[图5]图5是表示实施例5中的DNA切割活性测定试验的结果的图。
具体实施方式
以下,对本发明进行说明。本说明书中使用的术语只要没有特别说明,则具有该领域通常使用的意义。
<PAM序列的识别变得广泛的Cas9蛋白>
本实施方式的蛋白是在保持与向导RNA的结合力的同时PAM序列的识别变得广泛的Cas9蛋白。根据本实施方式的蛋白,可以提供简便且快速、并且针对靶序列进行位点特异性的基因组编辑的技术。
在本说明书中,“向导RNA”是指模仿tracrRNA-crRNA的发夹结构的RNA,在其5’末端区域包含多核苷酸,所述多核苷酸由与靶双链多核苷酸中的PAM序列的从1个碱基上游到优选20个碱基以上且24个碱基以下、更优选22个碱基以上且24个碱基以下的核苷酸序列互补的核苷酸序列构成。该向导RNA可以进一步包含1个以上的多核苷酸,所述多核苷酸由可获得发夹结构的核苷酸序列构成,该核苷酸序列由与靶双链多核苷酸不互补的核苷酸序列构成,并排列成以一点为轴对称性地互补的序列。
向导RNA具有与本发明的突变型Cas9蛋白结合而将该蛋白引导至靶DNA的功能。向导RNA在其5’末端具有与靶DNA互补的序列,经由该互补序列与靶DNA结合,从而将本发明的突变型Cas9蛋白引导至靶DNA。在突变型Cas9蛋白起到DNA核酸内切酶的作用的情况下,可以在靶DNA存在的位点切割DNA,例如可以特异性地使靶DNA的功能丧失。
向导RNA是根据应该切割或修饰的靶DNA的序列信息设计、调制的。具体而言,可以列举如实施例中使用的序列。
在本说明书中,“核酸内切酶”是指切割核苷酸链的中间位置的酶。因此,具有核酸内切酶活性的、使本实施方式的PAM序列的识别变得广泛的Cas9蛋白具有通过向导RNA诱导而切割DNA链的中间位置的酶活性。
在本说明书中,“多肽”、“肽”和“蛋白”是指氨基酸残基的聚合物,且互换地使用。另外,还指下述的氨基酸聚合物:1个或多个氨基酸为天然存在的对应氨基酸的化学类似物或修饰衍生物。
在本说明书中,“序列”是指任意长度的核苷酸序列,为脱氧核糖核苷酸或核糖核苷酸,呈线状、环状或支链状,为单链或双链。
在本说明书中,“PAM序列”是指存在于靶双链多核苷酸中、且可由Cas9蛋白识别的序列,PAM序列的长度或核苷酸序列根据细菌种类而不同。可由本实施方式的PAM序列的识别变得广泛的Cas9蛋白识别的序列可用“5’-NG-3’”表示。
需要说明的是,在本说明书中,“N”是指选自腺嘌呤、胞嘧啶、胸腺嘧啶和鸟嘌呤的任意一种碱基,“A”是指腺嘌呤,“G”是指鸟嘌呤,“C”是指胞嘧啶,“T”是指胸腺嘧啶,“R”是指具有嘌呤骨架的碱基(腺嘌呤或鸟嘌呤),“Y”是指具有嘧啶骨架的碱基(胞嘧啶或胸腺嘧啶)。
在本说明书中,“多核苷酸”是指呈线状或环状构象、且为单链或双链形态的任一种形态的脱氧核糖核苷酸或核糖核苷酸聚合物,关于聚合物的长度没有限制性的解释。另外,还包含天然核苷酸的已知的类似物、以及在碱基部分、糖部分和磷酸部分中的至少一个部分被修饰的核苷酸(例如,硫代磷酸盐骨架)。通常,特定核苷酸的类似物具有相同的碱基配对特异性,例如,A的类似物与T进行碱基配对。
在一实施方式中,本发明提供蛋白(方案1),该蛋白由在SEQ ID NO: 1所表示的氨基酸序列中在1335位具有突变的氨基酸序列构成,并且具有与向导RNA的结合能力。此外,方案1的蛋白还具有RNA诱导性DNA核酸内切酶活性。
SEQ ID NO: 1是SpCas9蛋白的全长氨基酸序列。SpCas9蛋白中的PAM序列识别位点的序列是由SEQ ID NO: 1的第1097位~第1368位的271个残基构成的氨基酸序列。
具体而言,SEQ ID NO: 1的1335位的突变是指1335位的精氨酸被取代成选自丙氨酸、甘氨酸、半胱氨酸、异亮氨酸、亮氨酸、蛋氨酸、苯丙氨酸、脯氨酸、苏氨酸、缬氨酸、天冬酰胺和天冬氨酸的1个氨基酸。优选取代成丙氨酸。另外,1335位的另一优选突变是取代成异亮氨酸、蛋氨酸、苏氨酸或缬氨酸。
与PAM序列中的第3位的鸟嘌呤(5’-NG“G”-3’)形成的氢键因1335位的突变而消失,因此可使该蛋白的PAM序列的识别变得广泛。
在本发明的另一实施方案中,本发明提供蛋白(方案2),该蛋白除了具有上述方案1的突变以外,还进一步在1219位具有突变,并且具有与向导RNA的结合能力。此外,方案2的蛋白还具有RNA诱导性DNA核酸内切酶活性。
具体而言,该1219位的突变是指1219位的谷氨酸被取代成苯丙氨酸。
1219位的突变可有助于提高(维持)RNA诱导性DNA核酸内切酶活性的表达速度。
在本发明的又一实施方案中,本发明提供蛋白(方案3),该蛋白除了具有上述方案1或2的突变以外,还进一步在1322位具有突变,并且具有与向导RNA的结合能力。此外,方案3的蛋白还具有RNA诱导性DNA核酸内切酶活性。
具体而言,该1322位的突变是指1322位的丙氨酸被取代成精氨酸、组氨酸或赖氨酸。优选取代成精氨酸。
1322位的突变可有助于RNA诱导性DNA核酸内切酶活性的活性提高(活性维持)。
在本发明的又一实施方案中,本发明提供蛋白(方案4),该蛋白除了具有上述方案1、2或3的突变以外,还进一步在选自1111位、1135位、1218位和1337位的至少1个、优选2个、更优选3个、特别优选全部的4个位置具有突变,并且还具有与向导RNA的结合能力。方案4的蛋白具有RNA诱导性DNA核酸内切酶活性。
具体而言,该1111位的突变是指1111位的亮氨酸被取代成精氨酸、组氨酸或赖氨酸。优选取代成精氨酸。
具体而言,该1135位的突变是指1135位的天冬氨酸被取代成缬氨酸。
具体而言,该1218位的突变是指1218位的甘氨酸被取代成精氨酸、组氨酸或赖氨酸。优选取代成精氨酸。
具体而言,该1337位的突变是指1337位的苏氨酸被取代成精氨酸、组氨酸或赖氨酸。优选取代成精氨酸。
在本发明的又一实施方案中,本发明提供蛋白(方案5),该蛋白除了具有上述方案1、2、3或4的突变以外,还进一步在(i)选自10位、762位、839位、983位、986位的至少1个位置和/或(ii)选自840位和863位的位置具有突变,并且具有与向导RNA的结合能力。
具体而言,该10位的突变是指10位的天冬氨酸被取代成丙氨酸或天冬酰胺。
具体而言,该762位的突变是指762位的谷氨酸被取代成谷氨酰胺。
具体而言,该839位的突变是指839位的天冬氨酸被取代成丙氨酸或天冬酰胺。
具体而言,该983位的突变是指983位的组氨酸被取代成天冬酰胺或酪氨酸。
具体而言,该986位的突变是指986位的天冬氨酸被取代成天冬酰胺。
具体而言,该840位的突变是指840位的组氨酸被取代成丙氨酸、天冬酰胺或酪氨酸。
具体而言,该863位的突变是指863位的天冬酰胺被取代成天冬氨酸、丝氨酸或组氨酸。
作为方案5,优选为10位的天冬氨酸被取代成丙氨酸或天冬酰胺、或者840位的组氨酸被取代成丙氨酸、天冬酰胺或酪氨酸的蛋白。
具有(i)的突变或(ii)的突变的方案5的蛋白具有切口酶活性。
具有(i)的突变和(ii)的突变的方案5的蛋白虽然与向导RNA结合而被运送到靶DNA中,但丧失了核酸内切酶活性。
在本发明的又一实施方案中,本发明提供在功能上与上述方案1~5的蛋白同等的蛋白(方案6)。为了在功能上与上述方案1~5的蛋白同等,在SEQ ID NO: 1所表示的氨基酸序列中,在上述方案1~5中施行了突变的位置以外的位点具有80%以上的序列同源性,并且具有与向导RNA的结合能力。在通过突变而使氨基酸有所增减的情况下,该“施行了突变的位置以外的位点”可理解为“相当于施行了突变的位置的位置以外的位点”。作为所涉及的同源性,优选80%以上,更优选85%以上,进一步优选90%以上,特别优选95%以上,最优选99%以上。氨基酸序列同源性可通过自身已知的方法来确定。例如,氨基酸序列同源性(%)可以按照初期设定利用该领域所惯用的程序(例如BLAST、FASTA等)来确定。另一方面,同源性(%)可以利用该领域已知的任意算法、例如Needleman等人(1970) (J. Mol. Biol. 48:444-453)、Myers和Miller (CABIOS, 1988, 4: 11-17)的算法等来确定。Needleman等人的算法被整合到GCG软件包(可通过www.gcg.com获取)的GAP程序中,例如可以通过使用BLOSUM 62 matrix或PAM250 matrix、以及空位权重(加权)(gap weight): 16、14、12、10、8、6或4和长度权重(length weight): 1、2、3、4、5或6的任一者来确定同源性(%)。另外,Myers和Miller的算法被整合到作为GCG序列比对软件包的一部分的ALIGN程序中。在为了比较氨基酸序列而利用ALIGN程序的情况下,例如可以使用PAM120权重残基表(weightresidue table)、空位长度罚分(gap length penalty) 12、空位罚分(gap penalty) 4。
作为在功能上与上述方案1~5的蛋白同等的蛋白,提供下述蛋白(方案7):该蛋白在SEQ ID NO: 1所表示的氨基酸序列中在通过上述方案1~5施行了突变的位置以外的位点有1个~多个氨基酸被取代、缺失、插入和/或添加,并且具有与向导RNA的结合能力。在通过突变而使氨基酸有所增减的情况下,该“施行了突变的位置以外的位点”可以理解为“相当于施行了突变的位置的位置以外的位点”。
作为人为地进行“氨基酸的取代、缺失、插入和/或添加”的情形的手法,例如可以列举:对编码规定的氨基酸序列的DNA施行惯用的位点特异性突变导入,之后利用常规方法使该DNA表达的手法。这里,作为位点特异性突变导入法,例如可以列举:利用琥珀突变的方法(缺口双链体(gapped duplex)法、Nucleic Acids Res., 12, 9441-9456 (1984))、使用了突变导入用引物的PCR的方法等。
上述修饰的氨基酸的数目至少是1个残基,具体而言,是指1个或多个、或其以上。另外,在上述取代、缺失、插入或添加中,特别优选氨基酸的取代。该取代更优选取代成具有在疏水性、电荷、pK、立体结构上的特征等类似的性质的氨基酸。作为这样的取代,例如可以列举下述组内的取代:i)甘氨酸、丙氨酸;ii)缬氨酸、异亮氨酸、亮氨酸;iii)天冬氨酸、谷氨酸、天冬酰胺、谷氨酰胺;iv)丝氨酸、苏氨酸;v)赖氨酸、精氨酸;vi)苯丙氨酸、酪氨酸。
作为本发明的PAM序列的识别变得广泛的Cas9蛋白,优选列举包含下述氨基酸序列的蛋白:在SEQ ID NO: 1中1335位的精氨酸突变成丙氨酸(R1335A)、1111位的亮氨酸突变成精氨酸(L1111R)、1135位的天冬氨酸突变成缬氨酸(D1135V)、1218位的甘氨酸突变成精氨酸(G1218R)、1219位的谷氨酸突变成苯丙氨酸(E1219F)、1322位的丙氨酸突变成精氨酸(A1322R)、1337位的苏氨酸突变成精氨酸(T1337R)而得到的氨基酸序列(SEQ ID NO:18)。
另外,作为本发明的PAM序列的识别变得广泛的Cas9蛋白,还优选包含下述氨基酸序列的蛋白:在SEQ ID NO: 1中1335位的精氨酸突变成异亮氨酸(R1335I)、蛋氨酸(R1335M)、苏氨酸(R1335T)或缬氨酸(R1335V) (更优选R1335M和R1335V)、1111位的亮氨酸突变成精氨酸(L1111R)、1135位的天冬氨酸突变成缬氨酸(D1135V)、1218位的甘氨酸突变成精氨酸(G1218R)、1219位的谷氨酸突变成苯丙氨酸(E1219F)、1322位的丙氨酸突变成精氨酸(A1322R)、1337位的苏氨酸突变成精氨酸(T1337R)而得到的氨基酸序列。该蛋白相当于分别包含在SEQ ID NO: 18中1335位的丙氨酸突变成异亮氨酸、蛋氨酸、苏氨酸或缬氨酸而得到的氨基酸序列的蛋白。
本说明书中,表示到取代位点为止的氨基酸残基数的数字的左侧显示的字母(alphabet),表示取代前的氨基酸的单字母标记,而右侧显示的字母(alphabet),表示取代后的氨基酸的单字母标记。
本实施方式中的PAM识别变得广泛的Cas9蛋白例如可以通过如下所述的方法来制作。首先,使用包含编码上述PAM识别变得广泛的Cas9蛋白的核酸的载体转化宿主。然后,培养该宿主,使上述蛋白表达。培养基的组成、培养的温度、时间、诱导物质的添加等条件可由本领域技术人员按照已知的方法来确定,使转化体生长,高效率地产生上述蛋白。另外,例如在将抗生素抗性基因整合到表达载体中作为选择标志物的情况下,通过在培养基中加入抗生素,可以选择转化体。然后,通过将宿主所表达的上述蛋白按照自身已知的适当方法进行纯化,得到PAM识别变得广泛的Cas9蛋白。
对宿主没有特别限定,可以列举:动物细胞、植物细胞、昆虫细胞、或大肠杆菌、枯草杆菌、酵母等微生物。
<PAM序列的识别变得广泛的Cas9蛋白-向导RNA复合物>
在一实施方式中,本发明提供蛋白-RNA复合物,该复合物具备:上述的<PAM序列的识别变得广泛的Cas9蛋白>中所示的蛋白和向导RNA,所述向导RNA包含多核苷酸,该多核苷酸由与靶双链多核苷酸中的PAM (前间隔序列邻近基序,Proto-spacer Adjacent Motif)序列的从1个碱基上游到20个碱基以上且24个碱基以下上游的核苷酸序列互补的核苷酸序列构成的。
根据本实施方式的蛋白-RNA复合物,PAM序列变得广泛,可以简便且快速、并且针对靶序列进行位点特异性的靶双链多核苷酸的编辑。
上述蛋白和上述向导RNA通过在体外和体内、在温和的条件下混合,可以形成蛋白-RNA复合物。温和的条件表示蛋白不会分解或变性的程度的温度和pH,温度优选4℃以上且40℃以下,pH优选4以上且10以下。
另外,混合上述蛋白和上述向导RNA进行培养的时间优选0.5小时以上且1小时以下。由上述蛋白和上述向导RNA形成的复合物稳定,即使在室温下静置数小时也可保持稳定性。
<CRISPR-Cas载体系统>
在一实施方式中,本发明提供CRISPR-Cas载体系统,该载体系统具备第1载体和第2载体,所述第1载体包含编码上述的<PAM序列的识别变得广泛的Cas9蛋白>中所示的蛋白的基因,所述第2载体包含向导RNA,该向导RNA包含由与靶双链多核苷酸中的PAM序列的从1个碱基上游到20个碱基以上且24个碱基以下上游的核苷酸序列互补的核苷酸序列构成的多核苷酸。
根据本实施方式的CRISPR-Cas载体系统,PAM序列变得广泛,可以简便且快速、并且针对靶序列进行位点特异性的靶双链多核苷酸的编辑。
关于向导RNA,只要适当设计在5’末端区域包含多核苷酸的向导RNA即可,所述多核苷酸由与靶双链多核苷酸中的PAM序列的从1个碱基上游到优选20个碱基以上且24个碱基以下、更优选22个碱基以上且24个碱基以下的核苷酸序列互补的核苷酸序列构成。该向导RNA可以进一步包含1个以上的多核苷酸,所述多核苷酸由可获得发夹结构的核苷酸序列构成,该核苷酸序列由与靶双链多核苷酸不互补的核苷酸序列构成,并排列成以一点为轴对称性地互补的序列。
本实施方式的载体优选为表达载体。对表达载体没有特别限定,例如可以使用:pBR322、pBR325、pUC12、pUC13等来自大肠杆菌的质粒;pUB110、pTP5、pC194等来自枯草杆菌的质粒;pSH19、pSH15等来自酵母的质粒;λ噬菌体等噬菌体;腺病毒、腺相关病毒、慢病毒、牛痘病毒、杆状病毒等病毒;以及修饰这些载体而得到的载体等。
在上述的表达载体中,对用于表达上述Cas9蛋白和上述向导RNA的启动子没有特别限定,例如可以使用:EF1α启动子、SRα启动子、SV40启动子、LTR启动子、CMV(巨细胞病毒)启动子、HSV-tk启动子等用于在动物细胞中表达的启动子;花椰菜花叶病毒(CaMV)的35S启动子、REF(橡胶延伸因子,rubber elongation factor)启动子等用于在植物细胞中表达的启动子;多角体蛋白启动子、p10启动子等用于在昆虫细胞中表达的启动子等。这些启动子可以根据上述Cas9蛋白和上述向导RNA、或者根据表达上述Cas9蛋白和上述向导RNA的细胞的种类而适当选择。
上述的表达载体可以进一步具有多克隆位点、增强子、剪接信号、添加了聚A的信号、选择标志物、复制起点等。
<用于位点特异性地修饰靶双链多核苷酸的方法>
[第1实施方式]
在一实施方式中,本发明提供用于位点特异性地修饰靶双链多核苷酸的方法,该方法具备以下步骤:
将靶双链多核苷酸、蛋白和向导RNA混合进行培养的步骤;以及上述蛋白在位于PAM序列上游的结合位点修饰上述靶双链多核苷酸的步骤,
上述靶双链多核苷酸具有由NG(N是指任意碱基,G是指鸟嘌呤)构成的PAM序列,
上述蛋白为上述的<PAM序列的识别变得广泛的Cas9蛋白>中所示的蛋白,
上述向导RNA包含多核苷酸,所述多核苷酸由与上述靶双链多核苷酸中的上述PAM序列的从1个碱基上游到20个碱基以上且24个碱基以下上游的核苷酸序列互补的核苷酸序列构成。
根据本实施方式的方法,通过使用PAM序列变得广泛的突变型Cas9蛋白,可以简便且快速、并且针对靶序列进行位点特异性地修饰靶双链多核苷酸。
在本实施方式中,靶双链多核苷酸只要具有由NG(N是指任意碱基,G是指鸟嘌呤)构成的PAM序列即可,没有特别限定。
在本实施方式中,关于蛋白和向导RNA,如上述的<PAM序列的识别变得广泛的Cas9蛋白>中所示。
以下,对用于位点特异性地修饰靶双链多核苷酸的方法的细节进行说明。
首先,在温和的条件下混合上述蛋白和上述向导RNA,并进行培养。温和的条件如上所述。进行培养的时间优选0.5小时以上且1小时以下。由上述蛋白和上述向导RNA形成的复合物稳定,即使在室温下静置数小时也可保持稳定性。
接下来,上述蛋白和上述向导RNA在上述靶双链多核苷酸上形成复合物。上述蛋白识别由“ 5’-NG-3’ ”构成的PAM序列,在位于PAM序列上游的结合位点与上述靶双链多核苷酸结合。在上述蛋白具有核酸内切酶活性的情况下在该位点切割该多核苷酸。上述Cas9蛋白识别PAM序列,以PAM序列为起点,剥离上述靶双链多核苷酸的双螺旋结构,和上述向导RNA中的与上述靶双链多核苷酸互补的核苷酸序列退火,从而使上述靶双链多核苷酸的一部分双螺旋结构解开。此时,上述Cas9蛋白在位于PAM序列上游的切割位点和位于与PAM序列互补的序列的上游的切割位点,切割上述靶双链多核苷酸的磷酸二酯键。
[第2实施方式]
在本实施方式中,在培养步骤之前,可以进一步具备下述表达步骤:使用上述的CRISPR-Cas载体系统,使上述的<PAM序列的识别变得广泛的Cas9蛋白>中所示的蛋白和向导RNA表达。
在本实施方式的表达步骤中,首先,使用上述的CRISPR-Cas载体系统使Cas9蛋白和向导RNA表达。作为使之表达的具体方法,分别使用包含编码Cas9蛋白的基因的表达载体和包含向导RNA的表达载体来转化宿主。然后,培养该宿主,使Cas9蛋白和向导RNA表达。培养基的组成、培养的温度、时间、诱导物质的添加等条件可由本领域技术人员按照已知的方法确定,使转化体生长,高效率地产生融合蛋白。另外,例如在将抗生素抗性基因整合到表达载体中作为选择标志物的情况下,通过在培养基中加入抗生素,可以选择转化体。然后,通过利用适当的方法纯化宿主所表达的Cas9蛋白和向导RNA,获得Cas9蛋白和向导RNA。
<用于位点特异性地修饰靶双链核苷酸的方法>
[第1实施方式]
在一实施方式中,本发明提供用于位点特异性地修饰靶双链多核苷酸的方法,该方法具备以下步骤:
将靶双链多核苷酸、蛋白和向导RNA混合进行培养的步骤;上述蛋白在位于PAM序列上游的结合位点与上述靶双链多核苷酸结合的步骤;以及在通过上述向导RNA与上述靶双链多核苷酸的互补性结合确定的区域得到被修饰的上述靶双链多核苷酸的步骤,
上述靶双链多核苷酸具有由NG (N是指任意碱基,G是指鸟嘌呤)构成的PAM序列,
上述蛋白为上述的<PAM序列的识别变得广泛的Cas9蛋白>中所示的蛋白,
上述向导RNA包含多核苷酸,所述多核苷酸由与上述靶双链多核苷酸中的上述PAM序列的从1个碱基上游到20个碱基以上且24个碱基以下上游的核苷酸序列互补的核苷酸序列构成。
根据本实施方式的方法,通过使用PAM序列变得广泛的RNA诱导性DNA核酸内切酶,可以简便且快速、并且针对靶序列进行位点特异性地修饰靶双链多核苷酸。
在本实施方式中,关于靶双链多核苷酸、蛋白和向导RNA,如上述的<PAM序列的识别变得广泛的Cas9蛋白>和<用于位点特异性地修饰靶双链多核苷酸的方法>中所示。
以下,对用于位点特异性地修饰靶双链多核苷酸的方法的细节进行说明。直到与靶双链多核苷酸位点特异性地结合为止的步骤与上述的<用于位点特异性地切割靶双链多核苷酸的方法>中所示的步骤同样。然后,在通过上述向导RNA与上述双链多核苷酸的互补性结合确定的区域可以获得根据目的施行了修饰的靶双链多核苷酸。
在本说明书中,“修饰”是指靶双链多核苷酸的核苷酸序列发生变化。例如,除了通过靶双链多核苷酸的切割、切割后的外源性序列的插入(物理性插入或经由同源定向修复的复制进行的插入)而引起的靶双链多核苷酸的核苷酸序列的变化、切割后的非同源末端连接(NHEJ:通过切割生成的DNA末端彼此再次结合)以外,还可以列举通过添加功能性的蛋白或核苷酸序列而引起的靶双链多核苷酸的核苷酸序列的变化等。
通过本实施方式中的靶双链多核苷酸的修饰,可以向靶双链多核苷酸中导入突变,或者可以破坏、改变靶双链多核苷酸的功能。
[第2实施方式]
在本实施方式中,在培养步骤之前,可以进一步具备下述的表达步骤:使用上述的CRISPR-Cas载体系统,使上述的<PAM序列的识别变得广泛的Cas9蛋白>中所示的蛋白和向导RNA表达。
在本实施方式的表达步骤中,首先,使用上述的CRISPR-Cas载体系统使Cas9蛋白和向导RNA表达。作为使之表达的具体方法,与上述的<用于位点特异性地修饰靶双链多核苷酸的方法>的[第2实施方式]中例示的方法同样。
<用于在细胞内位点特异性地修饰靶双链多核苷酸的方法>
在一实施方式中,本发明提供用于在细胞内位点特异性地修饰靶双链多核苷酸的方法,该方法具备以下步骤:
将上述的CRISPR-Cas载体系统导入细胞中,使上述的<PAM序列的识别变得广泛的Cas9蛋白>中所示的蛋白和向导RNA表达的表达步骤;
上述蛋白在位于PAM序列上游的结合位点与上述靶双链多核苷酸结合的步骤;以及
在通过上述向导RNA与上述靶双链多核苷酸的互补性结合确定的区域得到被修饰的上述靶双链多核苷酸的步骤,
上述靶双链多核苷酸具有由NG (N是指任意碱基,G是指鸟嘌呤)构成的PAM序列,
上述向导RNA包含多核苷酸,所述多核苷酸由与上述靶双链多核苷酸中的上述PAM序列的从1个碱基上游到20个碱基以上且24个碱基以下上游的核苷酸序列互补的核苷酸序列构成。
在本实施方式的表达步骤中,首先,利用上述的CRISPR-Cas载体系统使Cas9蛋白和向导RNA在细胞内表达。
作为本实施方式的方法的应用对象的细胞的来源的生物,例如可以列举:原核生物、酵母、动物、植物、昆虫等。对上述动物没有特别限定,例如可以列举:人、猴、狗、猫、兔、猪、牛、小鼠、大鼠等,不限于这些。另外,作为细胞来源的生物的种类,可以根据所期望的靶双链多核苷酸的种类、目的等任意选择。
作为本实施方式的方法的应用对象的动物来源的细胞,例如可以列举:生殖细胞(精子、卵子等)、构成生物体的体细胞、干细胞、前体细胞、由生物体分离的癌细胞、由生物体分离并获得永生化能力而在体外稳定维持的细胞(细胞株)、由生物体分离且人为地进行了基因修饰的细胞、由生物体分离且人为地交换了核的细胞等,不限于这些。
作为构成生物体的体细胞,例如可以列举:由皮肤、肾脏、脾脏、腎上腺、肝脏、肺、卵巢、胰腺、子宫、胃、结肠、小肠、大肠、膀胱、前列腺、睾丸、胸腺、肌肉、结缔组织、骨、软骨、血管组织、血液、心脏、眼、脑、神经组织等任意的组织采集的细胞等,不限于这些。作为体细胞,更具体而言,例如可以列举:成纤维细胞、骨髄细胞、免疫细胞(例如,B淋巴细胞、T淋巴细胞、嗜中性粒细胞、巨噬细胞、单核细胞等)、红细胞、血小板、骨细胞、骨髄细胞、周皮细胞(周细胞)、树突状细胞、角质形成细胞、脂肪细胞、间充质细胞、上皮细胞、表皮细胞、内皮细胞、血管内皮细胞、淋巴管内皮细胞、肝细胞、胰岛细胞(例如,α细胞、β细胞、δ细胞、ε细胞、PP细胞等)、软骨细胞、卵丘细胞、胶质细胞、神经细胞(神经元)、少突胶质细胞、小胶质细胞、星形胶质细胞、心肌细胞、食道细胞、肌肉细胞(例如,平滑肌细胞、骨骼肌细胞等)、黑色素细胞、单核细胞等,不限于这些。
干细胞是指兼具自身复制能力和分化成其他多个系统的细胞的能力的细胞。作为干细胞,例如可以列举:胚胎干细胞(ES细胞)、胚胎肿瘤细胞、胚胎生殖干细胞、人工多能性干细胞(iPS细胞)、神经干细胞、造血干细胞、间充质干细胞、肝干细胞、胰干细胞、肌肉干细胞、生殖干细胞、肠干细胞、癌干细胞、毛囊干细胞等,但不限于这些。
癌细胞是指由体细胞衍生而获得无限增殖能力的细胞。作为癌细胞来源的癌,例如可以列举:乳腺癌(例如,浸润性乳腺管癌、非浸润性乳腺管癌、炎症性乳腺癌等)、前列腺癌(例如,激素依赖性前列腺癌、激素非依赖性前列腺癌等)、胰腺癌(例如,胰腺管癌等)、胃癌(例如,乳头状腺癌、粘液性腺癌、腺鳞癌等)、肺癌(例如,非小细胞肺癌、小细胞肺癌、恶性间皮瘤等)、结肠癌(例如,胃肠道间质肿瘤等)、直肠癌(例如,胃肠道间质肿瘤等)、大肠癌(例如,家族性大肠癌、遗传性非息肉病性大肠癌、胃肠道间质肿瘤等)、小肠癌(例如,非霍奇金氏淋巴瘤、胃肠道间质肿瘤等)、食道癌、十二指肠癌、舌癌、咽癌(例如、鼻咽癌(上咽癌)、口咽癌、下咽癌等)、头颈部癌、唾液腺癌、脑肿瘤(例如,松果体星形细胞瘤、毛细胞性星形细胞瘤、弥漫性星形细胞瘤、间变性星形细胞瘤等)、神经鞘瘤、肝癌(例如,原发性肝癌、肝外胆管癌等)、肾癌(例如,肾细胞癌、肾盂与输尿管的移行上皮癌等)、胆囊癌、胆管癌、胰腺癌、子宫内膜癌、子宫颈癌、卵巢癌(例如,上皮性卵巢癌、性腺外胚细胞肿瘤、卵巢性胚细胞肿瘤、卵巢低度恶性肿瘤等)、膀胱癌、尿道癌、皮肤癌(例如,眼内(眼)黑色素瘤、Merkel(梅克尔)细胞癌等)、血管瘤、恶性淋巴瘤(例如,网状细胞肉瘤、淋巴肉瘤、霍奇金病等)、黑色素瘤(恶性黑色素瘤)、甲状腺癌(例如,甲状腺髓样癌等)、甲状旁腺癌、鼻腔癌、鼻旁窦癌、骨肿瘤(例如,骨肉瘤、尤文氏肿瘤(尤文氏肉瘤)、子宫肉瘤、软组织肉瘤等)、转移性髓母细胞瘤、血管纤维瘤、隆突性皮肤纤维肉瘤、视网膜肉瘤、阴茎癌、睾丸肿瘤、儿童实体瘤(癌)(例如,威尔姆氏肿瘤、儿童肾脏肿瘤等)、卡波西肉瘤、因AIDS引起的卡波西肉瘤、上颌窦肿瘤、纤维组织细胞瘤、平滑肌肉瘤、横纹肌肉瘤、慢性骨髄增殖性疾病、白血病(例如,急性骨髄性白血病、急性淋巴母细胞白血病等)等,不限于这些。
细胞株是指在生物体外通过人为操作获得了无限增殖能力的细胞。作为细胞株,例如可以列举:HCT116、Huh7、HEK293(人胎肾细胞)、HeLa (人子宫颈癌细胞株)、HepG2 (人肝癌细胞株)、UT7/TPO (人白血病细胞株)、CHO (中国仓鼠卵巢细胞株)、MDCK、MDBK、BHK、C-33A、HT-29、AE-1、3D9、Ns0/1、Jurkat、NIH3T3、PC12、S2、Sf9、Sf21、High Five、Vero等,不限于这些。
作为向细胞内导入CRISPR-Cas载体系统的方法,可以通过适合于使用的活细胞的方法来进行,可以列举:电穿孔法、热休克法、磷酸钙法、脂质转染法、DEAE葡聚糖法、微注射法、粒子枪法、使用了病毒的方法、或者使用了FuGENE (注册商标)6转染试剂(Transfection Reagent)(Roche公司制造)、转染胺(Lipofectamine)2000试剂(Invitrogen公司制造)、转染胺LTX试剂(Invitrogen公司制造)、转染胺3000试剂(Invitrogen公司制造)等市售的转染试剂的方法等。
然后,关于修饰步骤,与上述的<用于位点特异性地修饰靶双链核苷酸的方法>的[第1实施方式]中所示的方法同样。
通过本实施方式中的靶双链多核苷酸的修饰,可以获得向靶双链多核苷酸中导入了突变、或者靶双链多核苷酸的功能已被破坏、改变的细胞。
作为本发明的突变型Cas9蛋白,在采用不具有核酸内切酶活性的方案(例如,方案5)的情况下,该蛋白虽然可以在位于PAM序列上游的结合位点与上述靶双链多核苷酸结合,但停留在此而无法进行切割。因此,例如若使该蛋白与荧光蛋白(例如GFP)等标记蛋白融合,则可经由向导RNA-突变型Cas9蛋白使标记蛋白与靶双链多核苷酸结合。通过适当选择与突变型Cas9蛋白结合的物质,可以对靶双链多核苷酸赋予各种各样的功能。
可以进一步在由突变型Cas9蛋白或突变型Cas9缺失了一部分或全部的切割酶活性的蛋白的N末端或C末端连接转录调控因子蛋白或结构域。作为转录调控因子或其结构域,可以列举:转录激活因子或其结构域(例如,VP64、NF-κB p65)和转录沉默子或其结构域(例如,异染色质蛋白1(HP1))或转录抑制因子或其结构域(例如,Kruppel相关盒(KRAB)、ERF阻遏域(ERD)、mSin3A相互作用结构域(SID))。
还可以连接修饰DNA的甲基化状态的酶(例如,DNA甲基转移酶(DNMT)、TET)或修饰组蛋白亚单位的酶(例如,组蛋白乙酰转移酶(HAT)、组蛋白脱乙酰酶(HDAC)、组蛋白甲基转移酶、组蛋白去甲基化酶)。
<基因治疗>
在一实施方式中,本发明提供用于实行基因组编辑、进行基因治疗的方法和组合物。与以前已知的靶向化的基因重组方法相比,本实施方式的方法的实行有效且廉价,而且可适合于任意的细胞或生物。细胞或生物的双链核酸的任意片段可以通过本实施方式的基因治疗方法进行修饰。本实施方式的基因治疗方法采用在所有细胞中均为内在的同源重组工艺和非同源重组工艺两者。
本说明书中,“基因组编辑”是指,通过利用CRISPR/Cas9系统或转录激活因子样效应物核酸酶(Transcription Activator-Like Effector Nucleases,TALEN)等技术实行已靶向化的基因重组或已靶向化的突变,进行特异性的基因破坏或报道基因的敲入等新的基因修饰技术。
另外,在一实施方式中,本发明提供:进行已靶向化的DNA插入或已靶向化的DNA缺失的基因治疗方法。该基因治疗方法包括:使用包含供体DNA的核酸构建物来转化细胞的步骤。关于与靶基因切割后的DNA插入和DNA缺失相关的图解,本领域技术人员可以按照已知的方法确定。
另外,在一实施方式中,本发明提供基因治疗方法:该方法在体细胞和生殖细胞中均被利用,于特定的基因座进行基因操作。
另外,在一实施方式中,本发明提供:用于在体细胞内破坏基因的基因治疗方法。这里,基因过度表达对细胞或生物有害的产物,并表达对细胞或生物有害的产物。这样的基因能够在疾病中所产生的1个以上的细胞型中过度表达。通过本实施方式的基因治疗方法进行的上述过度表达的基因的破坏,可以对患有由上述过度表达的基因引起的疾病的个体带来更好的健康。即,细胞的微小比例的基因的破坏起作用,表达水平降低,产生治疗效果。
另外,在一实施方式中,本发明提供:用于在生殖细胞内破坏基因的基因治疗方法。特定的基因被破坏的细胞可有效用于制作不具有特定的基因的功能的生物。在上述基因被破坏的细胞中,基因可以完全敲除。该特定细胞中的功能的缺失能够具有治疗效果。
另外,在一实施方式中,本发明提供:插入编码基因产物的供体DNA的基因治疗方法。该基因产物在构成性地表达的情况下具有治疗效果。例如可以列举下述方法:为了在胰细胞的个体组中进行活性启动子和编码胰岛素基因的供体DNA的插入,而在患有糖尿病的个体(患者)中插入上述供体DNA。然后,包含上述供体DNA的胰细胞的上述个体组可以生成胰岛素,对糖尿病患者进行治疗。而且,将上述供体DNA插入植物中,可以生成药剂相关基因产物。蛋白产物的基因(例如,胰岛素、脂肪酶或血红蛋白)可以和调控元件(组成型活性启动子、或诱导型启动子)一起插入到植物中,在植物中生成大量的药物。然后,可从植物中分离这样的蛋白产物。
转基因植物或转基因动物可以通过采用核酸移入技术(McCreath, K. J.等人(2000) Nature 405: 1066-1069;Polejaeva, I. A.等人, (2000) Nature 407:86-90)的方法进行制作。组织型特异性载体或细胞型特异性载体可以为了仅在所选择的细胞内提供基因表达而利用。
另外,在将上述方法用于生殖细胞的情况下,可以在靶基因中插入供体DNA,通过之后的所有的细胞分裂,生成具有所设计的遗传变更的细胞。
作为本实施方式的基因治疗方法的应用对象,例如可以列举:任意的生物、培养细胞、培养组织、培养核(在完整的培养细胞、培养组织或培养核中包含可用于再生生物的细胞、组织或核)、配子(例如,发育的各种阶段的卵或精子)等,并不限于这些。
作为本实施方式的基因治疗方法的应用对象的细胞的来源,可以列举任意的生物(可以列举昆虫、真菌、啮齿类、牛、绵羊、山羊、鸡和其他的农业上重要的动物、以及其他哺乳动物(例如可以列举狗、猫和人,但并不限于这些),并不限于这些)等,并不限于这些。
本实施方式的基因治疗方法可以进一步在植物中使用。对作为本实施方式的基因治疗方法的应用对象的植物没有特别限定,可以在任意的各种植物种(例如,单子叶植物或双子叶植物等)中应用。
以下,给出实施例,以更详细地说明本发明,但这些实施例并不限定本发明的范围。
实施例
实施例1
1. 野生型和突变型SpCas9的调制
(1) 结构体(Construct)的设计
将通过基因合成使密码子最优化的野生型或突变型SpCas9基因整合到pET载体(Novagen)中。进一步在His标记物与SpCas9基因之间添加TEV识别序列。在由完成的结构体表达的Cas9的N末端连接6残基的组氨酸(His标记物),形成添加有TEV蛋白酶识别位点的设计。
所使用的SpCas9基因的核苷酸序列如下。
WT:野生型SpCas9的核苷酸序列:SEQ ID NO: 2;
m0:突变型SpCas9基因(R1335A)的核苷酸序列:SEQ ID NO: 3;
m4:突变型SpCas9基因(R1335A/G1218R)的核苷酸序列:SEQ ID NO: 4;
m18:突变型SpCas9基因(R1335A/G1218R/T1337R)的核苷酸序列:SEQ ID NO: 5;
m19:突变型SpCas9基因(R1335A/G1218R/T1337R/L1111R)的核苷酸序列:SEQ ID NO:6;
m20:突变型SpCas9基因(R1335A/G1218R/T1337R/L1111R/D1332R)的核苷酸序列:SEQID NO: 7;
m21:突变型SpCas9基因(R1335A/G1218R/T1337R/L1111R/D1332R/ A1322R)的核苷酸序列:SEQ ID NO: 8;
m22:突变型SpCas9基因(R1335A/G1218R/T1337R/L1111R/D1332R/ A1322R/D1284R/A1285R)的核苷酸序列:SEQ ID NO: 9;
m23:突变型SpCas9基因(R1335A/G1218R/L1111R/D1332R/A1322R)的核苷酸序列:SEQID NO: 10;
m24:突变型SpCas9基因(R1335A/G1218R/L1111R/D1332R/A1322R/ D1284R/A1285R)的核苷酸序列:SEQ ID NO: 11;
m25:突变型SpCas9基因(R1335A/G1218R/T1337R/L1111R/A1322R)的核苷酸序列:SEQID NO: 12;
m26:突变型SpCas9基因(R1335A/G1218R/L1111R/A1322R)的核苷酸序列:SEQ ID NO:13;
m29:突变型SpCas9基因(R1335A/G1218R/L1111R)的核苷酸序列:SEQ ID NO: 14;
m32:突变型SpCas9基因(R1335A/G1218R/T1337R/L1111R/A1322R/ E1219M)的核苷酸序列:SEQ ID NO: 15;
m33:突变型SpCas9基因(R1335A/G1218R/T1337R/L1111R/A1322R/ E1219F)的核苷酸序列:SEQ ID NO: 16;
m34:突变型SpCas9基因(R1335A/G1218R/T1337R/L1111R/A1322R/ E1219W)的核苷酸序列:SEQ ID NO: 17;
m43:突变型SpCas9基因(R1335A/G1218R/T1337R/L1111R/A1322R/ E1219F/D1135V)的核苷酸序列:SEQ ID NO: 18;
m61:突变型SpCas9基因(R1335I/G1218R/T1337R/L1111R/A1322R/ E1219F/D1135V)的核苷酸序列:相对于m43的核苷酸序列(SEQ ID NO: 18)将4003-4005位的gcc变换成atc而得到的核苷酸序列。
m62:突变型SpCas9基因(R1335L/G1218R/T1337R/L1111R/A1322R/ E1219F/D1135V)的核苷酸序列:相对于m43的核苷酸序列(SEQ ID NO: 18)将4003-4005位的gcc变换成ctg而得到的核苷酸序列。
m63:突变型SpCas9基因(R1335M/G1218R/T1337R/L1111R/A1322R/ E1219F/D1135V)的核苷酸序列:相对于m43的核苷酸序列(SEQ ID NO: 18)将4003-4005位的gcc变换成atg而得到的核苷酸序列。
m64:突变型SpCas9基因(R1335F/G1218R/T1337R/L1111R/A1322R/ E1219F/D1135V)的核苷酸序列:相对于m43的核苷酸序列(SEQ ID NO: 18)将4003-4005位的gcc变换成ttt而得到的核苷酸序列。
m65:突变型SpCas9基因(R1335T/G1218R/T1337R/L1111R/A1322R/ E1219F/D1135V)的核苷酸序列:相对于m43的核苷酸序列(SEQ ID NO: 18)将4003-4005位的gcc变换成acc而得到的核苷酸序列。
m66:突变型SpCas9基因(R1335V/G1218R/T1337R/L1111R/A1322R/ E1219F/D1135V)的核苷酸序列:相对于m43的核苷酸序列(SEQ ID NO: 18)将4003-4005位的gcc变换成gtg而得到的核苷酸序列。
(2) 在大肠杆菌中的表达
将所制作的载体转化到大肠杆菌Escherichia coli rosetta2 (DE3)株中。之后,使用含有20μg/ml的卡那霉素和20μg/ml的氯霉素的LB培养基进行培养。在培养直至OD=0.8的时间点添加作为表达诱导剂的异丙基-β-硫代半乳糖吡喃糖苷(Isopropyl β-D-1-thiogalactopyranoside:IPTG)(终浓度为1mM),在37℃下培养4小时。培养后,通过离心(5,000g、10分钟)回收大肠杆菌。
(3) 野生型和突变型SpCas9的纯化
将(2)中回收的菌体用缓冲液A悬浮,进行超声波破碎。通过离心(25,000g、30分钟)回收上清,与经缓冲液A调平衡的Ni-NTA Superflow树脂 (QIAGEN)混合,平稳地颠倒混合1小时。回收流过的组分,之后用4柱容量的缓冲液A、进一步用2柱容量的高盐浓度缓冲液B进行洗涤。
然后,再次用2柱容量的缓冲液A洗涤,之后用5柱容量的高咪唑浓度缓冲液C洗脱目标蛋白。
然后,将粗制的样品加载到HiTrapSP (GE Healthcare)上。然后,用5柱容量份(容量分)的92.5%的缓冲液D (0M的NaCl)和7.5%的缓冲液F (2M的NaCl)的混合溶液进行洗涤,之后使用缓冲液E形成从10%到50% (NaCl浓度从200mM到1M)的直线梯度,洗脱目标蛋白。
缓冲液A~E的组成如下所示。
缓冲液A:20mM的Tris-HCl、pH8.0,300mM的NaCl、20mM的咪唑;
缓冲液B:20mM的Tris-HCl、pH8.0,1000mM的NaCl、20mM的咪唑;
缓冲液C:20mM的Tris-HCl、pH8.0,300mM的NaCl、300mM的咪唑;
缓冲液D:20mM的Tris-HCl、pH8.0;
缓冲液E:20mM的Tris-HCl、pH8.0,2000mM的NaCl。
2. 向导RNA的调制
进行了插入有目标向导RNA序列(ggaaauuaggugcgcuuggcguuuuaga gcuagaaauagcaaguuaaaauaaggcuaguccguuaucaacuugaaaaagug; SEQ ID NO: 19)的载体的制作。下划线显示20个碱基的向导序列,余部相当于支架部分(颈-环2,stem-loop 2)。在向导RNA序列的上游添加T7启动子序列,整合到已形成线状的pUC119载体(TaKaRa)中。根据所制作的载体,利用PCR制作了体外转录反应的模板DNA。使用该模板DNA,在37℃下进行了4小时的T7 RNA聚合酶的体外转录反应。在包含转录产物的反应液中加入等量的苯酚氯仿进行混合,之后在20℃下离心(10,000g、2分钟),回收上清。在上清中添加1/10量的3M的乙酸钠和2.5倍量的100%乙醇,在4℃下离心(10,000g、3分钟),使转录产物沉淀。废弃上清,添加70%乙醇,在4℃下离心(10,000g、3分钟),再次废弃上清。将沉淀风干后,再次悬浮于TBE缓冲液中,通过经7M脲改性的10% PAGE进行纯化。切出位于目标RNA的分子量的谱带,利用Elutrap电洗脱系统(GE Healthcare)提取RNA。之后,使所提取的RNA通过PD-10柱(GE Healthcare),将缓冲液换成缓冲液H (10mM的Tris-HCl (pH8.0)、150mM的NaCl)。
3. 质粒DNA切割活性测定试验
进行了插入有靶DNA序列和PAM序列的载体的制作,以便在DNA切割活性测定试验中使用。在靶DNA序列中分别添加PAM序列1~4,整合到已形成线状的pUC119载体中。靶序列和PAM序列1~4见表1。
[表1]
核苷酸序列 SEQ ID NO
靶DNA 5’-GGAAATTAGGTGCGCTTGGC-3’ SEQ ID NO: 20
PAM序列1 5’-TGT-3’
PAM序列2 5’-TGG-3’
PAM序列3 5’-TGNA-3’
PAM序列4 5’-TGN-3’
使用所制作的载体转化大肠杆菌Mach1株(Life Technologies),使用含有20μg/mL的氨苄青霉素的LB培养基,在37℃下进行培养。
培养后,通过离心(8,000g、1分钟)回收菌体,使用QIAprep Spin Miniprep试剂盒(QIAGEN)纯化质粒DNA。
使用添加有已纯化的PAM序列的靶质粒DNA,进行了切割实验。质粒DNA通过限制酶形成1根线状。若野生型或突变型的SpCas9切割该线状化DNA中的靶DNA序列,则可产生约1,000bp和约2,000bp的切割产物。作为切割时的缓冲液,使用下述组成的裂解缓冲液B。
B (×10)的组成
200mM的HEPES 7.5;
1000mM的KCl;
50%的甘油;
10mM的DTT;
5mM的EDTA;
20mM的MgCl2
使用1%浓度的琼脂糖凝胶对反应后的样品进行电泳,确认了切割产物的谱带。结果见图1A~D。图中,“Substrate”显示底物,“Product”显示切割产物。PAM序列和反应条件如图中所示。
在野生型SpCas9中,仅识别PAM序列的第3位的碱基为G的情形,靶质粒DNA被切割,相对于此,在突变型SpCas9中,还识别第3位的碱基为G以外的PAM序列,可以切割靶质粒DNA。
因此,确认到了:在野生型的SpCas9中识别PAM序列“NGG”,相对于此,在突变型的SpCas9中识别PAM序列“NG”。
由以上结果明确了:在突变型的SpCas9中,PAM序列变得广泛,可以简便且快速地针对靶序列进行位点特异性的靶双链多核苷酸的切割。
实施例2
使用实施例1中调制的突变型SpCas9(m43),进行与实施例1同样的操作,进行了质粒DNA切割活性测定试验。结果见图2。
在野生型SpCas9中,仅识别PAM序列的第3位的碱基为G的情形,靶质粒DNA被切割,相对于此,在突变型SpCas9中,还识别第3位的碱基为G以外的PAM序列,可以切割靶质粒DNA。
因此,确认了:在野生型的SpCas9中识别PAM序列“NGG”,相对于此,在突变型的SpCas9中识别PAM序列“NG”。
实施例3
使用实施例1中调制的突变型SpCas9(m43、m61~m66),进行与实施例1同样的操作,进行了质粒DNA切割活性测定试验。尚需说明的是,在切割产物的检测中使用了MultiNA毛细管电泳装置(岛津制作所)。作为PAM序列,使用了作为PAM序列4的5’-TGC-3’。切割实验进行了0.5分钟(0.5m)和2分钟(2m)。结果见图3。在m61、m63、m65和m66中确认到了优异的DNA切割活性。
实施例4
使用实施例1中调制的突变型SpCas9(m43、m61、m63和m66),进行与实施例1同样的操作,进行了质粒DNA切割活性测定试验。切割实验进行了0.5分钟(0.5m)和2分钟(2m)。结果见图4。
在野生型SpCas9中,仅识别PAM序列的第3位的碱基为G的情形,靶质粒DNA被切割,相对于此,在突变型SpCas9中,还识别第3位的碱基为G以外的PAM序列,可以切割靶质粒DNA。确认到:m61、m63和m66、特别是m63和m66,即使在使用在m43中效率低的TGA和TGC的PAM序列的情况下,也可以高效率地切割DNA。
实施例5
使用实施例1中调制的野生型SpCas9和突变型SpCas9(WT、m43)和进行与实施例1同样的操作而调制的下述的突变型SpCas9,进行与实施例1同样的操作,进行了质粒DNA切割活性测定试验。切割实验随时间(0、0.5、1、2、5分钟)而进行。结果见图5。在m43中确认到了与WT相媲美的切割活性的提高。
突变型SpCas9基因(R1335A/G1218R/T1337R/L1111R/A1322R/D1135V)的核苷酸序列:相对于m25的核苷酸序列(SEQ ID NO: 12),将3403-3405位的gac变换成gtt而得到的核苷酸序列。
产业实用性
根据本发明,可以获得在保持与靶双链多核苷酸的结合力、进一步保持核酸内切酶活性的同时PAM序列的识别变得广泛的Cas9蛋白。另外,可以提供利用了上述Cas9蛋白的简便且快速、并且针对靶序列进行位点特异性的基因组编辑的技术。
本申请以在日本申请的特愿2017-108556(申请日:2017年5月31日)为基础,且其内容全部包含在本说明书中。
<110> 东京大学
<120> 经修饰的Cas9蛋白及其用途
<130> 092761
<150> JP2017-108556
<151> 2017-05-31
<160> 20
<170> PatentIn version 3.5
<210> 1
<211> 1368
<212> PRT
<213> 酿脓链球菌
<400> 1
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro Lys
1100 1105 1110
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Gly
1205 1210 1215
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala
1310 1315 1320
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr Ser
1325 1330 1335
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
<210> 2
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 2
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc ggc aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc ctg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gac agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc ggc 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Gly
1205 1210 1215
gaa ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc gac gct aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct gcc gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc gac cgg aag agg tac acc agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 3
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 3
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc ggc aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc ctg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gac agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc ggc 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Gly
1205 1210 1215
gaa ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc gac gct aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct gcc gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc gac cgg aag gcc tac acc agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Ala Tyr Thr Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 4
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 4
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc ggc aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc ctg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gac agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc cgg 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Arg
1205 1210 1215
gaa ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc gac gct aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct gcc gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc gac cgg aag gcc tac acc agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Ala Tyr Thr Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 5
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 5
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc ggc aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc ctg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gac agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc cgg 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Arg
1205 1210 1215
gaa ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc gac gct aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct gcc gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc gac cgg aag gcc tac cgg agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Ala Tyr Arg Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 6
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 6
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc ggc aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc cgg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Arg Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gac agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc cgg 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Arg
1205 1210 1215
gaa ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc gac gct aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct gcc gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc gac cgg aag gcc tac cgg agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Ala Tyr Arg Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 7
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 7
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc ggc aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc cgg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Arg Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gac agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc cgg 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Arg
1205 1210 1215
gaa ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc gac gct aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct gcc gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc cgg cgg aag gcc tac cgg agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Arg Arg Lys Ala Tyr Arg Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 8
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 8
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc ggc aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc cgg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Arg Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gac agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc cgg 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Arg
1205 1210 1215
gaa ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc gac gct aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct cgg gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Arg Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc cgg cgg aag gcc tac cgg agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Arg Arg Lys Ala Tyr Arg Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 9
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 9
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc ggc aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc cgg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Arg Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gac agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc cgg 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Arg
1205 1210 1215
gaa ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc cgg cgg aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Arg Arg Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct cgg gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Arg Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc cgg cgg aag gcc tac cgg agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Arg Arg Lys Ala Tyr Arg Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 10
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 10
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc ggc aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc cgg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Arg Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gac agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc cgg 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Arg
1205 1210 1215
gaa ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc gac gct aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct cgg gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Arg Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc cgg cgg aag gcc tac acc agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Arg Arg Lys Ala Tyr Thr Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 11
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 11
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc ggc aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc cgg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Arg Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gac agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc cgg 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Arg
1205 1210 1215
gaa ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc cgg cgg aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Arg Arg Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct cgg gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Arg Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc cgg cgg aag gcc tac acc agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Arg Arg Lys Ala Tyr Thr Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 12
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 12
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc ggc aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc cgg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Arg Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gac agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc cgg 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Arg
1205 1210 1215
gaa ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc gac gct aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct cgg gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Arg Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc gac cgg aag gcc tac cgg agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Ala Tyr Arg Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 13
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 13
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc ggc aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc cgg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Arg Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gac agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc cgg 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Arg
1205 1210 1215
gaa ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc gac gct aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct cgg gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Arg Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc gac cgg aag gcc tac acc agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Ala Tyr Thr Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 14
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 14
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc ggc aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc cgg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Arg Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gac agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc cgg 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Arg
1205 1210 1215
gaa ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc gac gct aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct gcc gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc gac cgg aag gcc tac acc agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Ala Tyr Thr Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 15
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 15
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc gga aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc cgg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Arg Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gtt agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Val Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc cgg 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Arg
1205 1210 1215
atg ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Met Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc gac gct aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct cgg gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Arg Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc gac cgg aag gcc tac cgg agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Ala Tyr Arg Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 16
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 16
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc gga aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc cgg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Arg Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gtt agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Val Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc cgg 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Arg
1205 1210 1215
ttt ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Phe Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc gac gct aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct cgg gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Arg Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc gac cgg aag gcc tac cgg agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Ala Tyr Arg Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 17
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 17
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc ggc aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc cgg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Arg Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gac agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc cgg 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Arg
1205 1210 1215
tgg ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Trp Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc gac gct aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct cgg gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Arg Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc gac cgg aag gcc tac cgg agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Ala Tyr Arg Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 18
<211> 4107
<212> DNA
<213> 酿脓链球菌
<220>
<221> CDS
<222> (1)..(4107)
<400> 18
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg 48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc 96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc 144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg 192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc 240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc 288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag 336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac 384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac 432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac 480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc 528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac 576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc 624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat 672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc gga aac 720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc 768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac 816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac 864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac 912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct 960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa 1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc 1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc 1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac 1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg 1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg 1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc 1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc 1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg 1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa 1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc 1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc 1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa 1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag 1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc 1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac 1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc 1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac 1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca 1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc 1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac 1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac 2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc 2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt 2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg 2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc 2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc 2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag 2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc 2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc 2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg 2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg 2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag 2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg 2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag 2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag 2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat 2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca 2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac 2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc 2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc 2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc 2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
gtg gga acc gcc ctg atc aaa aag tac cct aag ctg gaa agc gag ttc 3024
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
gtg tac ggc gac tac aag gtg tac gac gtg cgg aag atg atc gcc 3069
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
aag agc gag cag gaa atc ggc aag gct acc gcc aag tac ttc ttc 3114
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
tac agc aac atc atg aac ttt ttc aag acc gag att acc ctg gcc 3159
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
aac ggc gag atc cgg aag cgg cct ctg atc gag aca aac ggc gaa 3204
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
acc ggg gag atc gtg tgg gat aag ggc cgg gat ttt gcc acc gtg 3249
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
cgg aaa gtg ctg agc atg ccc caa gtg aat atc gtg aaa aag acc 3294
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
gag gtg cag aca ggc ggc ttc agc aaa gag tct atc cgg ccc aag 3339
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Arg Pro Lys
1100 1105 1110
agg aac agc gat aag ctg atc gcc aga aag aag gac tgg gac cct 3384
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
aag aag tac ggc ggc ttc gtt agc ccc acc gtg gcc tat tct gtg 3429
Lys Lys Tyr Gly Gly Phe Val Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
ctg gtg gtg gcc aaa gtg gaa aag ggc aag tcc aag aaa ctg aag 3474
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
agt gtg aaa gag ctg ctg ggg atc acc atc atg gaa aga agc agc 3519
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
ttc gag aag aat ccc atc gac ttt ctg gaa gcc aag ggc tac aaa 3564
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
gaa gtg aaa aag gac ctg atc atc aag ctg cct aag tac tcc ctg 3609
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
ttc gag ctg gaa aac ggc cgg aag aga atg ctg gcc tct gcc cgg 3654
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Arg
1205 1210 1215
ttc ctg cag aag gga aac gaa ctg gcc ctg ccc tcc aaa tat gtg 3699
Phe Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
aac ttc ctg tac ctg gcc agc cac tat gag aag ctg aag ggc tcc 3744
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
ccc gag gat aat gag cag aaa cag ctg ttt gtg gaa cag cac aag 3789
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
cac tac ctg gac gag atc atc gag cag atc agc gag ttc tcc aag 3834
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
aga gtg atc ctg gcc gac gct aat ctg gac aaa gtg ctg tcc gcc 3879
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
tac aac aag cac cgg gat aag ccc atc aga gag cag gcc gag aat 3924
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
atc atc cac ctg ttt acc ctg acc aat ctg gga gcc cct cgg gcc 3969
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Arg Ala
1310 1315 1320
ttc aag tac ttt gac acc acc atc gac cgg aag gcc tac cgg agc 4014
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Ala Tyr Arg Ser
1325 1330 1335
acc aaa gag gtg ctg gac gcc acc ctg atc cac cag agc atc acc 4059
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
ggc ctg tac gag aca cgg atc gac ctg tct cag ctg gga ggc gac 4104
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
taa 4107
<210> 19
<211> 81
<212> RNA
<213> 人工序列
<220>
<223> 指导RNA
<400> 19
ggaaauuagg ugcgcuuggc guuuuagagc uagaaauagc aaguuaaaau aaggcuaguc 60
cguuaucaac uugaaaaagu g 81
<210> 20
<211> 23
<212> DNA
<213> 人工序列
<220>
<223> 靶DNA
<400> 20
ggaaattagg tgcgcttggc tgg 23

Claims (33)

1. 蛋白,该蛋白由包含在SEQ ID NO: 1所表示的氨基酸序列中1335位的精氨酸被选自丙氨酸、甘氨酸、半胱氨酸、异亮氨酸、亮氨酸、蛋氨酸、苯丙氨酸、脯氨酸、缬氨酸、苏氨酸、天冬酰胺和天冬氨酸的1个氨基酸取代而得到的氨基酸序列的序列构成,并且具有与向导RNA的结合能力。
2. 权利要求1所述的蛋白,其中,在SEQ ID NO: 1所表示的氨基酸序列中,进一步在1219位具有突变。
3. 权利要求1或2所述的蛋白,其中,在SEQ ID NO: 1所表示的氨基酸序列中,进一步在1322位具有突变。
4. 蛋白,该蛋白由包含在SEQ ID NO: 1所表示的氨基酸序列中1335位的精氨酸被选自丙氨酸、甘氨酸、半胱氨酸、异亮氨酸、亮氨酸、蛋氨酸、苯丙氨酸、脯氨酸、缬氨酸、苏氨酸、天冬酰胺和天冬氨酸的1个氨基酸取代、并进一步在1219位具有突变的氨基酸序列的序列构成,并且具有与向导RNA的结合能力。
5. 蛋白,该蛋白由包含在SEQ ID NO: 1所表示的氨基酸序列中1335位的精氨酸被选自丙氨酸、甘氨酸、半胱氨酸、异亮氨酸、亮氨酸、蛋氨酸、苯丙氨酸、脯氨酸、缬氨酸、苏氨酸、天冬酰胺和天冬氨酸的1个氨基酸取代、并进一步在1322位具有突变的氨基酸序列的序列构成,并且具有与向导RNA的结合能力。
6.权利要求1~5中任一项所述的蛋白,其中,1335位的精氨酸的取代是取代成丙氨酸。
7.权利要求1~5中任一项所述的蛋白,其中,1335位的精氨酸的取代是取代成异亮氨酸、蛋氨酸、苏氨酸或缬氨酸。
8.权利要求2或4所述的蛋白,其中,1219位的突变是谷氨酸被取代成苯丙氨酸。
9.权利要求3或5所述的蛋白,其中,1322位的突变是丙氨酸被取代成精氨酸、组氨酸或赖氨酸。
10.权利要求9所述的蛋白,其中,1322位的突变是丙氨酸被取代成精氨酸。
11. 权利要求1~10中任一项所述的蛋白,其中,在SEQ ID NO: 1所表示的氨基酸序列中,进一步在选自1111位、1135位、1218位和1337位的至少一个位置具有突变。
12. 权利要求11所述的蛋白,其中,在SEQ ID NO: 1所表示的氨基酸序列中,进一步在选自1111位、1135位、1218位和1337位的至少2个位置具有突变。
13. 权利要求11所述的蛋白,其中,在SEQ ID NO: 1所表示的氨基酸序列中,进一步在选自1111位、1135位、1218位和1337位的至少3个位置具有突变。
14. 权利要求11所述的蛋白,其中,在SEQ ID NO: 1所表示的氨基酸序列中,进一步在1111位、1135位、1218位和1337位具有突变。
15.权利要求11~14中任一项所述的蛋白,其中,
1111位的突变是亮氨酸被取代成精氨酸、组氨酸或赖氨酸;
1135位的突变是天冬氨酸被取代成缬氨酸;
1218位的突变是甘氨酸被取代成精氨酸、组氨酸或赖氨酸;
1337位的突变是苏氨酸被取代成精氨酸、组氨酸或赖氨酸。
16. 权利要求1~15中任一项所述的蛋白,其中,在SEQ ID NO: 1的施行了突变的位置以外的位点具有80%以上的同源性。
17. 权利要求1~15中任一项所述的蛋白,其中,在SEQ ID NO: 1的施行了突变的位置以外的位点取代、缺失、插入和/或添加了1个~多个氨基酸。
18.权利要求1~17中任一项所述的蛋白,该蛋白具有RNA诱导性DNA核酸内切酶活性。
19. 权利要求1~16中任一项所述的蛋白,其中,在SEQ ID NO: 1所表示的氨基酸序列中进一步具有缺失一部分或全部的核酸酶活性的突变。
20. 权利要求19所述的蛋白,其中,缺失一部分或全部的核酸酶活性的突变是SEQ IDNO: 1所表示的氨基酸序列中的、(i)选自10位、762位、839位、983位和986位的至少1个位置或相当于此的位置和/或(ii)选自840位和863位的位置或相当于此的位置的突变。
21.权利要求20所述的蛋白,其中,10位的天冬氨酸被取代成丙氨酸或天冬酰胺;或者
840位的组氨酸被取代成丙氨酸、天冬酰胺或酪氨酸。
22.权利要求19~21中任一项所述的蛋白,该蛋白连接有转录调控因子蛋白或结构域。
23.权利要求22所述的蛋白,其中,转录调控因子为转录激活因子。
24.权利要求22所述的蛋白,其中,转录调控因子为转录沉默子或转录抑制因子。
25.核酸,该核酸编码权利要求1~24中任一项所述的蛋白。
26.蛋白-RNA复合物,该复合物具备权利要求1~24中任一项所述的蛋白和向导RNA,该向导RNA包含多核苷酸,所述多核苷酸由与靶双链多核苷酸中的PAM序列的从1个碱基上游到20个碱基以上且24个碱基以下上游的核苷酸序列互补的核苷酸序列构成,所述PAM为前间隔序列邻近基序。
27. 用于位点特异性地修饰靶双链多核苷酸的方法,该方法具备以下步骤:
将靶双链多核苷酸、蛋白和向导RNA混合并进行培养的步骤;以及
上述蛋白在位于PAM序列上游的结合位点修饰上述靶双链多核苷酸的步骤,
上述靶双链多核苷酸具有由NG构成的PAM序列,其中,N是指任意碱基,G是指鸟嘌呤,
上述蛋白为权利要求1~24中任一项所述的蛋白,
上述向导RNA包含多核苷酸,该多核苷酸由与上述靶双链多核苷酸中的上述PAM序列的从1个碱基上游到20个碱基以上且24个碱基以下上游的核苷酸序列互补的核苷酸序列构成。
28.权利要求27所述的方法,其中,修饰是指靶双链多核苷酸的位点特异性切割。
29.权利要求27所述的方法,其中,修饰是指靶双链多核苷酸中的位点特异性的1个以上的核苷酸的取代、缺失和/或添加。
30.增加细胞的靶基因表达的方法,该方法包括:使权利要求23所述的蛋白和针对上述靶基因的1个或多个向导RNA在上述细胞内表达。
31.减少细胞的靶基因表达的方法,该方法包括:使权利要求24所述的蛋白和针对上述靶基因的1个或多个向导RNA在上述细胞内表达。
32.权利要求30或31所述的方法,其中,细胞为真核细胞。
33.权利要求30或31所述的方法,其中,细胞为酵母细胞、植物细胞或动物细胞。
CN201880050453.1A 2017-05-31 2018-05-31 经修饰的Cas9蛋白及其用途 Active CN110914423B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP2017-108556 2017-05-31
JP2017108556 2017-05-31
PCT/JP2018/021068 WO2018221685A1 (ja) 2017-05-31 2018-05-31 改変されたCas9タンパク質及びその用途

Publications (2)

Publication Number Publication Date
CN110914423A true CN110914423A (zh) 2020-03-24
CN110914423B CN110914423B (zh) 2024-02-06

Family

ID=64455845

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201880050453.1A Active CN110914423B (zh) 2017-05-31 2018-05-31 经修饰的Cas9蛋白及其用途

Country Status (5)

Country Link
US (2) US11371030B2 (zh)
EP (1) EP3633034A4 (zh)
JP (2) JP6628385B6 (zh)
CN (1) CN110914423B (zh)
WO (1) WO2018221685A1 (zh)

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2015330699B2 (en) 2014-10-10 2021-12-02 Editas Medicine, Inc. Compositions and methods for promoting homology directed repair
EP3265559B1 (en) 2015-03-03 2021-01-06 The General Hospital Corporation Engineered crispr-cas9 nucleases with altered pam specificity
EP3555297A1 (en) 2016-12-19 2019-10-23 Editas Medicine, Inc. Assessing nuclease cleavage
US12110545B2 (en) 2017-01-06 2024-10-08 Editas Medicine, Inc. Methods of assessing nuclease cleavage
US11499151B2 (en) 2017-04-28 2022-11-15 Editas Medicine, Inc. Methods and systems for analyzing guide RNA molecules
US11371030B2 (en) * 2017-05-31 2022-06-28 The University Of Tokyo Modified Cas9 protein and use thereof
US10428319B2 (en) 2017-06-09 2019-10-01 Editas Medicine, Inc. Engineered Cas9 nucleases
WO2019014564A1 (en) 2017-07-14 2019-01-17 Editas Medicine, Inc. SYSTEMS AND METHODS OF TARGETED INTEGRATION AND GENOME EDITING AND DETECTION THEREOF WITH INTEGRATED PRIMING SITES
WO2019040650A1 (en) * 2017-08-23 2019-02-28 The General Hospital Corporation GENETICALLY MODIFIED CRISPR-CAS9 NUCLEASES HAVING MODIFIED PAM SPECIFICITY
WO2019194320A1 (ja) * 2018-04-06 2019-10-10 国立大学法人東京大学 エンジニアリングされたBlCas9ヌクレアーゼ
CA3104856A1 (en) 2018-06-29 2020-01-02 Editas Medicine, Inc. Synthetic guide molecules, compositions and methods relating thereto
BR112021019657A2 (pt) 2019-04-05 2021-11-30 Univ Osaka Método para produzir célula knock-in
US20210301269A1 (en) * 2020-01-22 2021-09-30 New York Genome Center, Inc. Recombinant crispr-cas9 nucleases with altered pam specificity
EP4093863A4 (en) * 2020-01-24 2024-04-10 The General Hospital Corporation CRISPR-CAS ENZYMES WITH ENHANCED ON-TARGET ACTIVITY
EP4093864A4 (en) 2020-01-24 2024-04-10 The General Hospital Corporation Unconstrained genome targeting with near-pamless engineered crispr-cas9 variants
US20230416709A1 (en) 2020-11-06 2023-12-28 Editforce, Inc. Foki nuclease domain mutant
GB202117583D0 (en) * 2021-12-06 2022-01-19 Cambridge Entpr Ltd Protein expression
US20250001010A1 (en) 2023-06-30 2025-01-02 Christiana Care Gene Editing Institute, Inc. Nras gene knockout for treatment of cancer

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160177278A1 (en) * 2014-12-22 2016-06-23 University Of Massachusetts Cas9-DNA Targeting Unit Chimeras
WO2016141224A1 (en) * 2015-03-03 2016-09-09 The General Hospital Corporation Engineered crispr-cas9 nucleases with altered pam specificity

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9637739B2 (en) 2012-03-20 2017-05-02 Vilnius University RNA-directed DNA cleavage by the Cas9-crRNA complex
US8697359B1 (en) 2012-12-12 2014-04-15 The Broad Institute, Inc. CRISPR-Cas systems and methods for altering expression of gene products
JPWO2017010543A1 (ja) 2015-07-14 2018-06-14 国立大学法人 東京大学 改変されたFnCas9タンパク質及びその使用
JP2017108556A (ja) 2015-12-10 2017-06-15 株式会社豊田自動織機 蓄電型充電装置及び充電制御方法
US11371030B2 (en) 2017-05-31 2022-06-28 The University Of Tokyo Modified Cas9 protein and use thereof
WO2019217943A1 (en) 2018-05-11 2019-11-14 Beam Therapeutics Inc. Methods of editing single nucleotide polymorphism using programmable base editor systems

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160177278A1 (en) * 2014-12-22 2016-06-23 University Of Massachusetts Cas9-DNA Targeting Unit Chimeras
WO2016141224A1 (en) * 2015-03-03 2016-09-09 The General Hospital Corporation Engineered crispr-cas9 nucleases with altered pam specificity

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
HIRANO.S.: "Structural Basis for the Altered PAM Specificities of Engineered CRISPR-Cas9" *

Also Published As

Publication number Publication date
JP2020043868A (ja) 2020-03-26
JP7213548B2 (ja) 2023-01-30
CN110914423B (zh) 2024-02-06
WO2018221685A1 (ja) 2018-12-06
US20200277586A1 (en) 2020-09-03
US11702645B2 (en) 2023-07-18
JP6628385B6 (ja) 2020-03-04
EP3633034A4 (en) 2021-03-24
US11371030B2 (en) 2022-06-28
US20220333090A1 (en) 2022-10-20
JP6628385B2 (ja) 2020-01-08
EP3633034A1 (en) 2020-04-08
JPWO2018221685A1 (ja) 2019-12-19

Similar Documents

Publication Publication Date Title
CN110914423B (zh) 经修饰的Cas9蛋白及其用途
US12152259B2 (en) Modified CAS9 protein, and use thereof
JPWO2018221685A6 (ja) 改変されたCas9タンパク質及びその用途
KR102691636B1 (ko) 상동 재조합에 의한 crispr/cas-기반 게놈 편집을 위한 화합물 및 방법
JP7138712B2 (ja) ゲノム編集のためのシステム及び方法
WO2020085441A1 (ja) 改変されたCas9タンパク質及びその用途
CN107794272A (zh) 一种高特异性的crispr基因组编辑体系
US20180201912A1 (en) Modified fncas9 protein and use thereof
JP7412001B2 (ja) 改変されたCas9タンパク質及びその用途
WO2018172798A1 (en) Argonaute system
HK40027079A (zh) 经修饰的cas9蛋白及其用途
WO2019026976A1 (ja) 改変されたCas9タンパク質及びその用途
JP2024501892A (ja) 新規の核酸誘導型ヌクレアーゼ

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 40027079

Country of ref document: HK

GR01 Patent grant
GR01 Patent grant
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载