DERIVADOS DE COLISMICINA.

Derivados de colismicina. La presente invención describe el aislamiento,

clonación y secuenciación de la agrupación de genes implicados en la biosíntesis de colismicina por Streptomyces spp. CS40, y el uso de dichos genes para incrementar la producción de colismicina y/o compuestos análogos o derivados relacionados por medio de cepas productoras. Dichos compuestos producidos en la presente invención son aplicables en el tratamiento de diversas enfermedades como, por ejemplo, el cáncer, enfermedades neurodegenerativas y enfermedades infecciosas.

Tipo: Patente de Invención. Resumen de patente/invención. Número de Solicitud: P201130010.

Solicitante: UNIVERSIDAD DE OVIEDO.

Nacionalidad solicitante: España.

Inventor/es: MENDEZ FERNANDEZ,CARMEN, SALAS FERNANDEZ,JOSE ANTONIO, FERNANDEZ BRAÑA,ALFREDO, MORIS VARAS, FRANCISCO, GONZALEZ SABIN,JAVIER, GARCÍA LLORENTE,Ignacio, MIGUEL VIOR,Natalia, SIALER GUERRERO,Carlos Alberto.

Fecha de Publicación: .

Clasificación Internacional de Patentes:

  • A61K31/44 NECESIDADES CORRIENTES DE LA VIDA.A61 CIENCIAS MEDICAS O VETERINARIAS; HIGIENE.A61K PREPARACIONES DE USO MEDICO, DENTAL O PARA EL ASEO (dispositivos o métodos especialmente concebidos para conferir a los productos farmacéuticos una forma física o de administración particular A61J 3/00; aspectos químicos o utilización de substancias químicas para, la desodorización del aire, la desinfección o la esterilización, vendas, apósitos, almohadillas absorbentes o de los artículos para su realización A61L; composiciones a base de jabón C11D). › A61K 31/00 Preparaciones medicinales que contienen ingredientes orgánicos activos. › Piridinas no condensadas; Sus derivados hidrogenados.
  • A61P31/00 A61 […] › A61P ACTIVIDAD TERAPEUTICA ESPECIFICA DE COMPUESTOS QUIMICOS O DE PREPARACIONES MEDICINALES.Antiinfecciosos, es decir antibióticos, antisépticos, quimioterápicos.
  • A61P35/00 A61P […] › Agentes antineoplásicos.
  • C12N15/00 QUIMICA; METALURGIA.C12 BIOQUIMICA; CERVEZA; BEBIDAS ALCOHOLICAS; VINO; VINAGRE; MICROBIOLOGIA; ENZIMOLOGIA; TECNICAS DE MUTACION O DE GENETICA.C12N MICROORGANISMOS O ENZIMAS; COMPOSICIONES QUE LOS CONTIENEN; PROPAGACION, CULTIVO O CONSERVACION DE MICROORGANISMOS; TECNICAS DE MUTACION O DE INGENIERIA GENETICA; MEDIOS DE CULTIVO (medios para ensayos microbiológicos C12Q 1/00). › Técnicas de mutación o de ingeniería genética; ADN o ARN relacionado con la ingeniería genética, vectores, p. ej. plásmidos, o su aislamiento, su preparación o su purificación; Utilización de huéspedes para ello (mutantes o microorganismos modificados por ingeniería genética C12N 1/00, C12N 5/00, C12N 7/00; nuevas plantas en sí A01H; reproducción de plantas por técnicas de cultivo de tejidos A01H 4/00; nuevas razas animales en sí A01K 67/00; utilización de preparaciones medicinales que contienen material genético que es introducido en células del cuerpo humano para tratar enfermedades genéticas, terapia génica A61K 48/00; péptidos en general C07K).

PDF original: ES-2397885_A1.pdf

 


Fragmento de la descripción:

Derivados de colismicina.

CAMPO DE LA INVENCIÓN

La presente invención está comprendida dentro del campo de la biología, la farmacia y la medicina.

ESTADO DE LA TÉCNICA

Dentro de la naturaleza las moléculas que presentan en su estructura un grupo 2, 2’-bipiridil son compuestos con un gran número de actividades biológicas descritas (Cristalli et al., 1986) ; (Gomi et al., 1994) ; (Tsuge et al. 1999) . Una de estas moléculas es la colismicina producida por Streptomyces spp. CS40 (FIG. 1A) , que presenta actividad antibiótica frente a bacterias Gram-positivas y Gram-negativas, antifúngica frente a un amplio espectro de hongos y citotóxica frente a células de leucemia P388 (Gomi et al. 1994) ; (Tsuge et al., 1999) . Adicionalmente la colismicina está descrita como un inhibidor de la unión entre la dexametasona y los receptores de glucocorticoides (Shindo et al., 1994) y en la patente WO/2007/017146 se describe la capacidad de la colismicina para inhibir el estrés oxidativo en células.

El desarrollo de la tecnología de ADN recombinante se está convirtiendo en una poderosa herramienta a la hora de incrementar nuestro conocimiento sobre los genes que participan en la biosíntesis de compuestos bioactivos. Esta tecnología puede hoy en día ser aplicada a la mejora en los niveles de producción de distintos compuestos bioactivos y a la obtención de nuevas moléculas derivadas con mejores propiedades clínicas a través de la combinación de genes de distintas rutas de biosíntesis de moléculas bioactivas y su expresión en microorganismos productores de estos compuestos, en lo que se ha denominado biosíntesis combinatoria.

La tecnología de ADN recombinante ha hecho posible el aislamiento de agrupaciones de genes completas para la biosíntesis de distintos compuestos bioactivos utilizando, entre otras estrategias, la clonación, la selección o el análisis de genotecas de los microorganismos productores de moléculas con interés farmacológico mediante sondas de ADN. Esta estrategia se basa en la existencia de información genética previa sobre la ruta de biosíntesis o rutas relacionadas biosintéticamente, lo que permite utilizar o diseñar sondas genéticas a partir de la secuencia total o parcial de un enzima de biosíntesis.

En la literatura científica no existe apenas información acerca de la maquinaria celular encargada de la biosíntesis de compuestos del tipo 2, 2’- bipiridil. Tan sólo se han realizado estudios sobre la biosíntesis de una molécula de la familia 2-2’-bipiridil, la caerulomicina. La caerulomicina es una molécula con una estructura similar a la de la colismicina (FIG. 1B) producida por Streptomyces caeruleus (Funk y Divekar, 1959) .

A través de experimentos de incorporación de precursores marcados radiactivamente se identificaron algunas de las moléculas intermediarias durante la biosíntesis de caerulomicina, una de estas moléculas es el ácido picolínico (Vining et al., 1988) . Este compuesto ha sido descrito, además, como un metabolito intermediario en la biosíntesis de otras moléculas bioactivas por parte de algunas especies de Streptomyces como es el caso de la nikomicina D (Bruntner y Bormann, 1998) . En todos los casos descritos en la literatura el precursor para la biosíntesis del ácido picolínico ó 3hidroxipicolínico (en el caso de la biosíntesis de virginiamicina) es el aminoácido lisina y las enzimas encargadas de la generación de este compuesto han sido identificadas, siendo la primera de ellas un enzima con actividad lisina 2aminotransferasa (Bruntner y Bormann, . 1998; Namwat et al., 2002) .

La presente invención describe la clonación y secuenciación de la agrupación de genes de biosíntesis de colismicina en el microorganismo productor Streptomyces sp. CS40. La información genética disponible sobre enzimas del tipo lisina-2aminotransferasa, ha sido usada para diseñar oligonucleótidos y construir una sonda genética que ha permitido el aislamiento y clonación de la agrupación de genes de biosíntesis de colismicina. Desde el punto de vista genético, no hay descripciones previas en la literatura relativas al aislamiento y secuenciación de la agrupación de genes de biosíntesis de colismicina u otra molécula de la familia 2, 2’-bipiridil producida por especies de Streptomyces.

La invención proporciona una importante herramienta para la manipulación genética de esta agrupación de genes en el sentido de aumentar la producción de colismicina y/o obtener nuevos derivados de esta molécula o moléculas estructuralmente relacionadas con propiedades mejoradas.

DESCRIPCIÓN DE LA INVENCIÓN

La presente invención se relaciona con el aislamiento e identificación de una agrupación de genes que participan en la biosíntesis de colismicina por Streptomyces spp. CS40, y proporciona una herramienta para la manipulación genética de esta agrupación génica para aumentar la producción de colismicina y obtener nuevos derivados con propiedades mejoradas para su aislamiento y utilización, entre otros, en el sector farmacéutico o químico.

Esta invención describe una secuencia de ADN que contiene 24 genes implicados en la biosíntesis de colismicina y sus precursores, incluyendo aquellos implicados en la regulación del agrupamiento génico. El agrupamiento génico incluye genes que codifican para un sistema híbrido policétido sintasa-péptido sintetasa no ribosomal (PCS-NRPS) encargado de sintetizar la estructura central de la colismicina, genes que codifican enzimas implicados en la biosíntesis del precursor ácido picolínico de la colismicina, genes que codifican enzimas implicados en modificaciones de la molécula tales como deshidrogenasas, aminotransferasas y metiltransferasas. La invención por tanto se refiere a nuevos genes y moléculas de ácidos nucleicos que codifican proteínas/polipéptidos que muestran actividades funcionales implicadas en la biosíntesis de colismicina, y su potencial aplicación en el incremento de los niveles de producción de colismicina en Streptomyces spp. CS40 y en la producción de nuevos derivados de colismicina.

Los procedimientos experimentales aplicados a la presente invención incluyen métodos de biología molecular convencionales. Una descripción detallada de los métodos utilizados y no detallados aquí se puede obtener de Hopwood et al. (1985) ; Sambrook et al. (1989) y Kieser et al., (2000) .

Con el fin de clonar la agrupación de genes de biosíntesis de colismicina, se construyó una genoteca de ADN cromosómico de Streptomyces spp. CS40 en Escherichia coli, utilizando el cósmido pWE15 (ver ejemplo 2) .

Para el aislamiento de la agrupación de genes de biosíntesis de colismicina se utilizó una sonda genética consistente en un fragmento de PCR de 412 bp procedente de la amplificación parcial del gen que codifica para una lisina 2aminotransferasa implicada en la biosíntesis de colismicina. Para su amplificación se utilizó como molde el ADN cromosómico de la cepa CS40 y dos oligonucleótidos: L2ATson1 (SEQ ID NO: 31) y L2ATson2 (SEQ. ID NO: 32) diseñados en base a la secuencia nucleotídica de una lisina 2-aminotransferasa cuya secuencia, de aproximadamente 500 bp, se identificó previamente usando los oligonucleótidos degenerados L2ATFW2 (SEQ. ID NO: 29) y L2ATRV2 (SEQ ID NO: 30) .

La utilización de esta sonda en la hibridación de la genoteca de ADN cromosómico de Streptomyces spp. CS40 permitió aislar un cósmido (cos1C3) que define una región de aproximadamente 41 kb en el cromosoma. La participación del ADN clonado en el cósmido cos1C3 en la biosíntesis de colismicina se determinó utilizando un fragmento BamHI de 3128 bp, que contenía un fragmento interno a un gen que codifica para una NRPS. Este fragmento fue clonado en el plásmido pOJ260 (Bierman et al., 1992) . La construcción resultante se usó para la disrupción génica en Streptomyces spp. CS40 generando un mutante no productor de colismicina (ver ejemplo 4) como prueba de la implicación del ADN clonado en la biosíntesis de colismicina.

No obstante, el análisis de la secuencia del cósmido cos1c3 mostró la necesidad de extender la región hacia la izquierda para buscar otros genes no presentes en el cósmido cos1C3, posiblemente implicados en la biosíntesis de colismicina. Un fragmento amplificado por PCR usando los oligonucleótidos 1c3-5FW (SEQ ID NO: 33) y 1c3-5RV (SEQ ID NO: 42) de 1, 9 kb del extremo del cósmido cos1C3 se usó como sonda para analizar de nuevo la genoteca de Streptomyces spp. CS40 aislándose un nuevo cósmido solapante con el cósmido cos1C3, cos3B11, usado para completar la secuencia de 46672 bp de la región que contiene el agrupamiento génico implicado en la biosíntesis de colismicina (FIG. 2)... [Seguir leyendo]

 


Reivindicaciones:

1. Molécula de ácido nucleico que consiste en al menos un fragmento de la SEQ ID NO: 1 capaz de codificar para al menos una de las secuencias SEQ ID NO: 2 a SEQ ID NO: 28.

2. Molécula de ácido nucleico, según la reivindicación 1, que consiste en la SEQ ID NO: 1.

3. Vector de expresión que comprende la molécula de ácido nucleico de las reivindicaciones 1 ó 2.

4. Vector de expresión, según la reivindicación 3, caracterizado por ser el cósmido cos1c3 o cos3b11.

5. Célula u organismo transgénico no humanos que comprende el vector de expresión de las reivindicaciones 3 ó

4.

6. Cepa de Streptomyces spp. con número de identificación CECT 7754, CECT 7755, CECT 7756, CECT 7757 o CECT 7861.

7. Compuesto de Fórmula (I) :

donde R1, R2, R3 y R4 son, cada uno e independientemente, hidrógeno o un grupo protector; donde el grupo protector comprende un grupo alquilo, un grupo cicloalquilo, un grupo cicloalquilo heterocíclico, un grupo hidroxialquílico, un grupo alquilo halogenado, un grupo alcoxialquilo, un grupo alquenilo, un grupo alquinilo, un grupo arilo, un grupo arilo heterocíclico, un grupo alquilarilo, un grupo éster, un grupo cetona, un grupo carbonato, un grupo ácido carboxílico, un grupo aldehído, un grupo cetona, un grupo oxima, un grupo nitrilo, un grupo uretano, un grupo sililo, un grupo sulfoxi o una combinación de ellos; exceptuando los compuestos con las siguientes fórmulas:

5

8. Compuesto, según la reivindicación 7, seleccionado entre los siguientes compuestos de Fórmula II a XVI:

9. Uso de un compuesto de Fórmula (I) :

donde R1, R2, R3 y R4 son, cada uno e independientemente, hidrógeno o un grupo protector; donde el grupo protector comprende un grupo alquilo, un grupo cicloalquilo, un grupo cicloalquilo heterocíclico, un grupo hidroxialquílico, un grupo alquilo halogenado, un grupo alcoxialquilo, un grupo alquenilo, un grupo alquinilo, un grupo arilo, un grupo arilo heterocíclico, un grupo alquilarilo, un grupo éster, un grupo cetona, un grupo carbonato, un grupo ácido carboxílico, un grupo aldehído, un grupo cetona, un grupo oxima, un grupo nitrilo, un grupo uretano, un grupo sililo, un grupo sulfoxi o una combinación de ellos; para la elaboración de una composición farmacéutica destinada al tratamiento del cáncer.

10. Uso, según la reivindicación 9, donde el tipo de cáncer se selecciona entre: cáncer de mama, cáncer de pulmón y cáncer de colon.

11. Uso de un compuesto según las reivindicación 7 u 8 para la elaboración de una composición farmacéutica destinada al tratamiento enfermedades neurodegenerativas.

12. Uso de un compuesto de Fórmula (I) :

donde R1, R2, R3 y R4 son, cada uno e independientemente, hidrógeno o un grupo protector; donde el grupo protector

comprende un grupo alquilo, un grupo cicloalquilo, un grupo cicloalquilo heterocíclico, un grupo hidroxialquílico, un grupo alquilo halogenado, un grupo alcoxialquilo, un grupo alquenilo, un grupo alquinilo, un grupo arilo, un grupo arilo heterocíclico, un grupo alquilarilo, un grupo éster, un grupo cetona, un grupo carbonato, un grupo ácido carboxílico, un grupo aldehído, un grupo cetona, un grupo oxima, un grupo nitrilo, un grupo uretano, un grupo sililo, un grupo sulfoxi o una combinación de ellos; para la elaboración de una composición farmacéutica destinada al tratamiento de enfermedades infecciosas.

13. Composición farmacéutica que comprende al menos uno de los compuestos de las reivindicaciones 7 u 8 y al menos un excipiente farmacéuticamente aceptable.

14. Compuesto de Fórmula (I) :

donde R1, R2, R3 y R4 son, cada uno e independientemente, hidrógeno o un grupo protector; donde el grupo protector comprende un grupo alquilo, un grupo cicloalquilo, un grupo cicloalquilo heterocíclico, un grupo hidroxialquílico, un grupo alquilo halogenado, un grupo alcoxialquilo, un grupo alquenilo, un grupo alquinilo, un grupo arilo, un grupo arilo heterocíclico, un grupo alquilarilo, un grupo éster, un grupo cetona, un grupo carbonato, un grupo ácido carboxílico, un grupo aldehído, un grupo cetona, un grupo oxima, un grupo nitrilo, un grupo uretano, un grupo sililo, un grupo sulfoxi o una combinación de ellos; para ser usado en el tratamiento del cáncer.

15. Compuesto, según la reivindicación 14, para ser usado en el tratamiento de un tipo de cáncer seleccionado entre: cáncer de mama, cáncer de pulmón y cáncer de colon.

16. Compuesto según las reivindicación 7 u 8 para ser usado en el tratamiento de enfermedades 10 neurodegenerativas.

17. Compuesto de Fórmula (I) :

donde R1, R2, R3 y R4 son, cada uno e independientemente, hidrógeno o un grupo protector; donde el grupo protector puede consistir en un grupo alquilo, un grupo cicloalquilo, un grupo cicloalquilo heterocíclico, un grupo hidroxialquílico, un grupo alquilo halogenado, un grupo alcoxialquilo, un grupo alquenilo, un grupo alquinilo, un grupo arilo, un grupo arilo heterocíclico, un grupo alquilarilo, un grupo éster, un grupo cetona, un grupo carbonato, un grupo ácido carboxílico, un grupo aldehído, un grupo cetona, un grupo oxima, un grupo nitrilo, un grupo uretano, un grupo sililo, un grupo sulfoxi o una combinación de ellos; para ser usado en el tratamiento de enfermedades infecciosas.

18. Procedimiento para la obtención de derivados acilados de colismicina A y/o colismicina C, que comprende reaccionar colismicina A y/o colismicina C con un agente acilante en presencia de una enzima hidrolasa.

FIG. 1 FIG. 2

17 3 8 1215

22

456 7

14

B

1920 21

kb

cos1c3

cos3b11

FIG. 3

Min.

FIG.4

FIG. 5

1.40

1.40

1.20

1.20

1.00

1.00

0.80

0.80

A

A

U

U

0.60

0.60

0.40

0.40

0.20

0.20

0.00

0.00

0.00 1.00 2.00 3.00 4.00 5.00 6.00 7.00 8.00 0.00 1.00 2.00 3.00 4.00 5.00 6.00 7.00 8.00 Min. Min.

FIG. 6

OH

N

orf19

H2O orf20

N

N

H2N H2N COOH H2N O COOH

N COOH N COOH

SCH3

L-Lys OCH3

FIG. 7

A

kb

aac (3) IV

23 4 5 678910 1112131415161718 20 21 22 23242526 27

0.00 1.00 2.00 3.00 4.00 5.00 6.00 7.00 8.00 0.00 1.00 2.00 3.00 4.00 5.00 6.00 7.00 8.00 Min. Min.

FIG. 8

A

kb

aac (3) IV

23 4 5 678910 121314151617181920 21 22 23242526 27

FIG. 9

A

kb

aac (3) IV

23 4 5 678 10 11121314151617181920 21 22 23242526 27

B

E

F

0.80

0.60

A U 0.40

0.20

3.048 (III) 4.103 (IV)

0.00

0.00 1.00 2.00 3.00 4.00 5.00 6.00 7.00 8.00 Min A

FIG. 10

20 30 40 kb

aac (3) IV

Min 2.00 4.00 6.00 8.00 10.00 12.00 14.00 16.00 18.00 20.00 22.00 Min FIG. 11

DA

A

A U

U

(VII)

FIG. 12

Min.

FIG. 13

B

A

3.155 (IX) 3.511 (X)

0.15 A U

0.10

0.00 1.00 2.00 3.00 4.00 5.00 6.00 7.00 8.00 Min.

FIG. 14

G

% Apoptosis

LISTADO DE SECUENCIAS

<110> UNIVERSIDAD DE OVIEDO

<110> ENTRECHEM S.L

<120> DERIVADOS DE COLISMICINA

<130> P-03987

<160> 56

<170> PatentIn version 3.5

<210> 1

<211> 46672

<212> DNA

<213> Streptomyces ssp.

<220>

<223> Cepa CS40

<400> 1 gatatcgagc gcgtccgcgg cgtcgacggc ggcgagggtg ttctccggcg cgtagcccga 60 cgccccccgg tgggccacga cgaccgggtt cgcccgctgc gcggagacct ggtccggggc 120 ctcgggccgg tcggcgggca gcagcagggc gccggcgccg agcagggcgg cggccgtggt 180 ggaggcggtt gcggtgcgga cgaacacggg gactcctcac gtcgggagtg tgcggacacg 240 cagagagtga cagcggcccg ccaacgcccg gcggggcggc gatggccgcg aaatgaacgg 300 aaccttcggt ccagaggacc accaccgcgg cacggcctgc gaccgtgccg ataaaatgcg 360 ccttcttttg tctgtctgcc gggactcgcc ccttggggac cctcctcgac cccgggcatt 420 ctcgaagggc gcacgcgcat gcaaggaaca gtcgacggat tcacgtacgg cctggtgact 480 ccggtcgcgg cgttcttcat ggccgcgctg ggagccgccc tcgggctccg ctgcaccacg 540 cggtcccttc acacgacacg ctcctgtcgg atcaggctcg accggtcgtc cggccgggtc 600 agcatgagct ccagcatcac gtagttcgtc agcaggatgt ccggagggtt cttgcgcagc 660 tccttgcgcc gctcctcgga ctcctggccg gtgtaacggg cgaaggtgac cggctcctgc 720 cccgcgccga aaccgtggcg caggtacttc tccagctcca tcagctgcga gttggccagg 780 gcgttcatcg ggtagacgac gatcgcgcgg acccggcccc cctcggtacg ccctcccgtg 840 ccgttccctc cggcgcgcag gctcgccgcc tgacgctcct tcaggacgtg gttcacgatc 900 ggcacgatgt acgccaggga cttgccggaa cccgtgccgg tggtcagcac gtaggaatcc 960 ccggcctgag cggcttcgat cgcctgccgc tgatggagat ggaaggtcag cggtcgcccg 1020 tcggggcgtc gcgaactctc cttcttgccc gcctggaaga tctccgcgca cttcgggtgc 1080 agcagtcctt cctggacgag gtcggtgacc ttgcctccgt cggcgaagaa ggggttcagg 1140 gagagccagg ggtcgggcca ctgcgacttc gccgccaggt cgtccttgac gaagccctcg 1200 atccgggcgt cgcggaacac tgtcgcgctc ttggtgaagc tctcgtactc gttgatcagg 1260 tcggcatgga taccgaagac gtccatggtc tcgggcaggc gcggttccct gcgcggcacc 1320 ggcacggggg cggccggggt gcgctccggg agcagggtcc gcagccgcgc caccgggtcc 1380 atcgccaccc agtgcggagc cagcagggcg gccgtgtcgt ccgcggcctg cgcgagcggt 1440 accagcacgt catggagcag ggccgcgttc tcgggccgaa gctcggcctt cttctccagc 1500 agacggtcca ccaggcggac caactcgacg gggaggtccg gacgtacgag cgtcaggagg 1560 ggaggctggg agtgctggtg ctgttcggag agttcgtacg gggtcctggc ggagaacggc 1620 ggctgtccga ccagcagttc gtacagaatg cagccgagcg catagaggtc cgcgaggccg 1680 gagacgtgtt cggcacggaa ctgctcgggt gccatgtagc gagccgtgcc gacgctgacg 1740 cccgtgctcg tcagccgcgt ctgatcgggg tcgtcgacga tcgagcccat gccgaagtcg 1800 aggatcttga cggtgccgtt gctggtcagc atgacattgg cgggtttcag gtcacggtgg 1860 acgaccccgg ccgtgtgcgc ggctgtcaga ccggcggcga tctgcgcgcc gatggcggcg 1920 acccaggaga cggggagctg gggttcctcg tcgatgaggt ccgcgagagg gtgaccgtcc 1980 agcagctcca tggccagata gggcagaccg ctgccgcctg gcgtcccgtc gacgcccccg 2040 tcgatcagcc gggtgaggtt cgggtggcct tgggagagca tgcgcatgat ccgcacctca 2100 cggcggaagc ggtcgaactc cctggtggat cccgaggtgt cgacggcgat cccggtccgg 2160 ccgcgcagca ccgtcttgac cgcgacctcc cggtgcgggg agtcgggtgc tgcccgaagg 2220 tcctccgctc tgtgcacctc gcccatgttc ccgcgcccga cgatccccgt gatccggaac 2280 cgtgcgtcga ccgtctccag gctcactgcg cctccccgtc ttctccacct gttgcacaca 2340 aggcagacac atcctcgggg agcgtatacc ttgagcagat caccaggatg acatggttct 2400 gttgacatcg tcaacaagtg cttcgggtcg tctgcgccga gcgggcggag cgggttcggg 2460 tatgcgccga tgggagcccc gtaatcccca cgagtgggtg gcgtccatga cccttccgga 2520 gttccgtagc ctcgggtccg ccctctgggg cggcgcgaag tcgtcgttcg agacgatgac 2580 tgaactcgct gctcacgtct gtgcccgtgc cgcacttccg cgcaccgttc tcgtccgact 2640 catggaccac ggccggctcc tgaagtcctt cgcctgctcg tccctgcgcg atctggacgg 2700 gaactccgca tgtgcggcgt gacattgccc aggcgggagg gccaccccat cgccgggccg 2760 ccccgcccgg cacgtagact ccgtgaaatg agtgtgcgcc gaatcacccc tggccgcccg 2820 cgaaccaaac ccgccgagga gcggcgcgcc gacctcctgg acgcggccga gggagtgttc 2880 gccgagcggg gaatcgatgc tgctcggatc gacgagatca cggagcgcgc gggtgtcgcg 2940 atcggcacct tctatctgca cttcagctcc aagcgcgaca tcatggccgc cgtacaggcc 3000 cgtttcgtcg accggctggt cgagcggcag cacgcggcgg cgcagaacct gcccgccgac 3060 gactggatcg gacgcgtgga cgcctggctc agcgacgcgg tccgcatcta tgtggaacat 3120 gcccaactgc acgacgtgct ctacggccac acgccgatca acgccacgac ggagatccag 3180 gtcagccccg agaacgcgca tgtcgaggcg ttccggcggc tcatcgccga gcgtcccgac 3240 ccgccggcgg acgccccgaa ccccgccatg gcggctcttc tgatctacag cgcctggtac 3300 ggcggcaccc atgcgctgct gcaccacgag gccgacgaca tcgaccggct ggccgacgag 3360 ctcatcgccg acatcaccga cctcgcccac cgctacctgc gctccgaacg ctgaaccggg 3420 ctgccgacgc cccgagttcg ggccgtcggc agcggaaagg gttcggtgac gcgcaccgcc 3480 gtgaggcgca cccctgggac cggttccccc gggtcagtga ctgccgctgg cgatgcgcca 3540 gtgccgcgcc cgttcgtact gggaactcag ccgcgcgaac gtcccacccg tgcgcagcag 3600 ttcctcgcgc gtgccggact ccacgacccg gccgccgtcg agcgagacca ccaggtccgc 3660 ggcctccagg gtggcggggc ggtgggcgat cacgatcacc gtacggtcgg ggtcggcgcc 3720 caggttcgcg atggcccggg tgatcgccgc ttcgttctcc gggtcgagag cggagctggc 3780 ctcgtcgacg aggacgatgc gcgcccgctt gagcatcgcc cgggcgatcg cgacgcgctg 3840 gcgttcgccg cccgacagct gtgcgccgcc ctcgcccacc cgggtctgcc agccggcggg 3900 cagccgctcg atcacctcgt cgagccgtgc cgcggtggcg gcctcccgca actcgtccca 3960 ggtggcctcc ggacgggcca gccgcagatt gtcctcgatg gtgtcgtcga acagatagac 4020 gtcctggaag acgatggcga tctcgtcgag cagggtggtc gggtcgagct tgcgcacatc 4080 gacgccgccg atgcgcagct cccccgagtc gatgtcgaag aagcgggcga tgagccgggt 4140 cgccgtcgtc ttcccggacc ccgacggtcc gaccagagcg gtggtgctgc ccgccggaca 4200 gcgcagggag acgttccgca gcgcgggggt gtcaccgccc ggataggtga acgtgacgtc 4260 cgcgaactcg acctccgcct cgtcgatccg ccgcaccggc tcggccggaa ccggcagtgc 4320 ctcagtggcg agcagctcct cgatgcgcgc gatctggttc cccaacgcgc gcagggcgcc 4380 gatgagttcg atgagattgc cgagcggctc caggaaacgg gcggccagca cgagcagcac 4440 gatcgccgtg gcggcgtcga tgcgtccgct gagcaccagc tcggcggtca gggcgagcac 4500 cgcgacgaaa ccggccatga tcacacccgt gtagacgaag aagggctgcc gcgcgcgccg 4560 catgccctcc cggtacgtct cgcggtggtc ctccagcgcg gcgcgcatcc ggggagtgcc 4620 ctcggcgtgc ccggcggccc gcaggaccgg ctgggcctgg cccagctcga tcgcccggct 4680 ggcgaccccg gccgccgccc gctccagttc gatctcggcg atctcggaga tccgggcggc 4740 acggcgcagc gccagatagg cgaccagggc gatggcgcac agcagcagcg ccatccgcca 4800 gtcgaccgtg aacgtcacga cgacgaccgt cgcgggcagc agcgtcgagg tgatcacggg 4860 cccgccgatc gtcacggcca ggtgcgcggc gctgccgacg tccgcggtca ccgcccgcgc 4920 gagccgggcc ttgtgctcgg cggtgaacca gcccagcgtc agcgtgctca catggtgcat 4980 cagccgccgg cgcagctgcg ccgcgagcgc gcccgccgcg gcgaagccca ccggggtggc 5040 gacgatgctc agtaccgcgt acagcgccgc gcccaccgcg ccgagcacca gccaggtcgt 5100 ggccgcgccg atgtccggtt cggggcgcac gacggcgtcc aggatcggga tgagcagggc 5160 gagcaacaac ccctggagca ccgcctgcgc cgcggtgagc agccacagcc gggccagcag 5220 tttcggattc ggccagagtc ggtacaggcg gcggatcacg cgagcacccc ttcctgtgcg 5280 gcccacatgc gggcgtacag cccgtcgcgc gcgaggagtt cttcgtggcg gccctgttcg 5340 accagccgcc cgttttccag tacgaggatc tgggcggcgc ccgcgatggt gtgcagccgg 5400 tgggcgatca cgatcaccgt cttgcccacc gccaggttgg ccagggcttc ctggaccgcc 5460 gattcgctgt cggcgtcgag ggaggccgtc gcctcgtcca ggacgacgat cggcgctccg 5520 gacaggatcg cccgcgcgat cgtcagccgc tggcgctccc cgccggacag actgccgccg 5580 cccgcgtcga gcagggtgtc gtacccgtcg ggcatgctct cgatcacgtc gtggatgtgt 5640 gcggcggccg ccgcctcgcg gacctcggcg tcgctcgcgc cgggccgccc gatgcggatg 5700 ttctccgcca cgctgtcgcg cagcagggcg acgtcctgga agaccaggga catcgacgcc 5760 agcagcttcg ccgacgggat cgagcgcacg tccaccccgc cgatcctgat ggatccccgg 5820 gacacttcgt agaagcgcgg cagcaggccg gccagggtcg tcttgccggc gccggacgga 5880 ccgacgatcg cggtcacccg gcccggtgga cagacggcgc tgatgtcctc gaccgcgttg 5940 gtcaccccgt cgtaggagaa ggagacccgg tcgaactcga cgccgtggcc gacgggctcc 6000 tgcggttccc gcggttcggg cagcggcgta cgggcgagca gctgctcgat gttggccgcg 6060 cccatccggc ccttgcgcac tccctgggcg gcggtcacgg cggggaggat cgagttgggc 6120 agcccgatgg cgacgaccag gaacgccacc aggtcggcga cgccgagcga gccgcggtcg 6180 acgaggacca ggccgacggc cgccacggcg gtgaggacgg ccatctccga gccgaacagc 6240 cgggaggccg cggagctgtg gcggacctcg gcgacccagg ccgcgaacgc ctgggtgtgc 6300 tcgtccacgg cgtcgtcgaa gcggcgcagg acacggccgc ccgtgccgaa cgtcttgacc 6360 acctggatgc cgtcggcgta ctcgacgctg gcggcgctga tccgggcgtc ggcgaggacc 6420 agccggttca tatgggtcgt catggaacgc atcgagacgc ggaagaagat gcccatcagg 6480 agcaggacgc ccagtaccac gaaggccatg cggacatcga cgaggagcag atagcccgtc 6540 gcgaccgtca tgacggtgac ggccccgacc aactggccca gggcgtgcgc gatgacctcg 6600 tgcatctcct ccagatcgcc cgtcatcgcc cgcttcacct tgcccgagcc cacggcccgg 6660 aaccagccga gcggcagctt gccgaggtgg gtcacgatcc gcacgcgcac gtcgtgcagg 6720 atcgccgcgt cggcgtagtg ccccacgcgg gaggactgca cgaacaggac gagccagacg 6780 cccgcgccgg cggccccgac acccacccag gtccagatgg ctgtgtgtga gcccgcgggg 6840 tcgtcgagca tgagccgcgc gatctcggcg acggcgatgt agggcacgat gcccgccgcc 6900 gcgctcaggc acgaaaggac cgcacagatg acgagatggg tccggatcgg cgcgagcagg 6960 cgcattaacg cgccgggccc tgccagcccc ggagtctgcg atgttgatct attcattagg 7020 gtaccctaac atggcgactg aagctcattc aataacgtgt gagtcgcccg gagatgcctc 7080 cgcgcggcga tgttccgatg ccctgagagc cttcaccgaa gggaactcct cttcatgttc 7140 cagccgcacc acgaccgccc gttaacaagc tgtcgtcgac cacgcgccgt tccgaagcgg 7200 ccccgggccg ccctgctcag tgccctgatc gcgctcgccg ccgtcggggc gaccgggtgc 7260 ggaggcggcg acgccgacgc acgccagaag ggcgagaaga agacgatcgc cttcgaatcc 7320 tgtagtcgca cggtcgagct cgaccggatc ccgaagcgcg tggcgatcac cacggacgcc 7380 atcgccgaca ctttgtttga gctcggcgtc ggcgaccgga tcgtcgccaa gacccgcggc 7440 gagtccgccc ccgcacccga gctgaaggag cgcctcgcgg ccctgcccag cctcgggaca 7500 cgcaatccga gcgtcgaggc gctggtcggc gccaagcccg atctcctgat caccgatcag 7560 gtcgagaagg tctccggcaa gctgggcagc ccgagcatcg ccgagctgga gcggctgggc 7620 atcgccacct atgtggtagg cggcggctgt gcggcggacc tgagcgagga cacctccggc 7680 ctcgaagcgc tcgacggaga catccggcag ctcggcacgg tcttcggcgt ggaggcccgc 7740 gcccggaagc ttgccgacaa gctgaacggc agcctcgacg acgtacgacg gcagaccgcc 7800 caggagcccc ggaccaaggt cgccaagctc tcgcaggtcg ccgggcagct ctacgtgacc 7860 tccggcggcc tctccgacga cgtgatcgag cgggcgggcg ggacgaacgt cttcgccgac 7920 ctgccgggcc agttcgcccc ggtcagcccc gaacagatcg tcgcgcgcga tccgcagtcc 7980 atcatcgtcg acaacttcac ggcgacggcg gccggtgaga acgaggcgat ggcgtacctc 8040 aagcgcacgt tccccaccgt cgaggcggtc aagaagcaac gggtcctcgt catcgacgcg 8100 gccaaatccg gggcgcgcgg cagtacgagg ccggtggagg gcgtcgtcga gatagcgcgc 8160 ttcctgcacc cgagcacgtc ccgctcccag tagggctcgc gatgccgaag caggaaacgc 8220 ctgctaccgc caccgccgac gaggaacgcc cgcgaggcgc ccgccggccg actcccgtac 8280 gcggaagccg acatgccgcc ctcccggtcg tcttgccgct gctgttcgtc gtcctcctcg 8340 gcgcggtctg cctcgccgtg tcggtcggat ccgtcggcgt gcggccggcg accgtcgtca 8400 aggtcatcgc cgatcacctg atcggcgtcg ggacgcccgc cgccgtcatg gacgatcaga 8460 tcgtctggaa tctgcggctg ccgcgtgtgc tgctggccgc cgccgtcggc ggggggctcg 8520 ccgtcgtcgg cgtcaccctg caggcgaccg tccgcaaccc gctcgccgac ccctacgtgc 8580 tcggtgtgtc cgccggcgcc gggctgatgg cctccatcgt gatcaccctc ggatcggtcg 8640 cggtggccgg cctgtcgaca tcggccgcgg ccttcgtcgg ggcgctcgtg gcgatggtgg 8700 cggtcctcgg cctctcgcgc cgggcgggcc gtgtcatccc gagccggctg ctgctcgccg 8760 gggtgacgct ctcctatctg ttcagcgggg ccaccagctt cgtcatcttc cgcagcggca 8820 acgcggacgc ggcgcactcg gtgctgttct ggctgctggg ctccctgtcg gaggcctcgt 8880 ggtcgaacct gtccctgccc gcggcggccg tgctggtcgt cggcgtgtac ctgatgctcc 8940 aggcccggac cctcaacgcg ctcgcggccg gtgacgacgc ggcgctctcg ctcggcgtgg 9000 ccgtccaccg cgtgcgcatc cgcctcctcg tcgtcgcctc cctcctgacc ggcgtgctcg 9060 tcgccgtgtc cggcgggatc ggcttcgtgg gactcatcgt tccccatctc gtccggctgg 9120 tcatgggccc ggaccaccgt cggctgctcc ccgtagccat gctcgtcggc gcggtctacc 9180 ttgtcgtcgt cgatctgctc tgccgtgttc tcgtccgtcc cgaggaactc cccatcggga 9240 tcgtcaccgc cgtgctcggc gcgccggtct tcctgtggct gctgcgacgc tcggaggcgt 9300 catgagactc gccctgtccg gcgtcagtgt ccggatcgat tccagcccga tcgtcgcgga 9360 gtgcgatctc accgtcgagg acggcgaacg cgtcggcctc gtcggcccca acggcagtgg 9420 caagacctcg ctgttgcgta cggtctaccg cgcgctcgac cccttcgcgg gcagtgtcgc 9480 gctgtccggc gacgacgtga cgacgctcag ccagcgggag gtcgcacgga gggccgcgct 9540 cgtcgcccag gacagcaccc cggacttcga cttcaccgtg gaggaggtcg tggccatggg 9600 gcgcgggccg tggctccggg ccttcgagtc gaccaccggc ggcggggacg cgacgatcac 9660 cgccgcactg gagcgggtcc gtctggcgga ccaccgcgcc cgcaggctct ccaccctgtc 9720 cggcggtgag cggcagcggg ccctggtggc ccgcgcgctc gcccaggaga gccctctgct 9780 cctcctcgac gagccgacca accacctcga tgtccacgcg gcgctggaac tcctggagct 9840 cgtgaaggac ttggagcggg cgacgctgtg tgtgctgcac gacctcaatc tggcggccgc 9900 ttattgcgac cgtatctacg tcctgcacgg cgggcggatc gtcgcggacg gcccgcccgc 9960 cgaggtgctc gaccccgaac tcgtccggac cgtcttcgga gtcacctgta cgcatctgac 10020 ccatccggtc accggcggac tgcttctcgc gttctcgccc cagcagcccg tccggtcacc 10080 gcactcggtg aaggggtgag aacttcccga cgtcggtatc gcgtcaaata aattcccttc 10140 gcacgctgtc tgcgtccctc atgaatgcct ggtgcgtcat aagggcggcg gggcggccgg 10200 gtatcgggcc gtcttggttg ccccggtcga ccgtagtcgc aaaagtgttt tgcttgttga 10260 cggcggcaga atcatgtggc gcccttggca ccgtgaagtg aatctctcga agtccgaagt 10320 gctcgttcat atgcgtcatc caaacgaaag gcgagtcacc cggtgatcga tcgtgtgaat 10380 gtcataccgt taacggagcg ccagaatgaa gacgaagccc aggcggccat catccagatc 10440 gacgaagtcc tgacgctcct ccaggagttg ggcgacgtcg cgcgccgcgc acggatgaac 10500 ctggtcgccc cccggcaccg gggtactcgt gtcaccgcgg ctcaggtacc ctcctccgga 10560 acctcccgca ccccggcacg gcccgagggc gaacccgccc agcagcccgc tgccccgggc 10620 ggcgggcccg gagccgacca gaagcccctg ccgctcaccc ggcgccagaa ggaagtgctc 10680 aatctgctgg cccagggcct gtccaaccgc cgcatcggcc gcgccctgca catcaccgag 10740 cagacggtga aggcccatct ccacatggtg taccacaagc tcggcgtcgc ggaccggaca 10800 gaggccgtgg tcatcgccct gcgccagggg ctggttcagc atcagcgcag ggaggacccg 10860 ccctcctgac cggcccgcgc gccggcagga ggcggcagtg gcaggagccc gcgccaccgc 10920 cggaaacgga tccgcctcgt gcggcgtgaa cgtgcaggcc cggccctggg acggctcggc 10980 cggggcttgc gcgcgatcgg ccgcaccgtc atgcgaacgg cccgggctcg cccgcgaccg 11040 gtcaggccgc gatatcgggc acgacctcga tgagcgacgc gttgagctcc tgggccagcg 11100 gcacgacctc ggtgacgcgc agcccggccc gcgcgcagat cgcggtgtac tccgcccggg 11160 tgcgctggcg gccgccgatg gtgacgagca tatgaagatc ggtcagatag gggttctcca 11220 gcgccgcgtg gttgagggag tccgcatcgg cggtgtcggg cagcaccggc tcggcgatga 11280 gcagcttccc gtggtcgggc atggcgtccc ggacccggcg cagaattgtc caggcgcgct 11340 cgtcgtccca gttgagcacc acgttcttga tgagatagag gtccgcgccc ttgggcactt 11400 cggcgaagaa gtctcccgcg accgcgcggc atctgtccgt cagccccgac ccggcgatgg 11460 tctcccgcgc ccgtgccgcc ccggcctcgg tctccagcac cgtgcccgtc aggtgcgggt 11520 accggtcgag gaacgcggcg agcagcgtgc cgtcgccacc gcccacatcg acgaccgtgc 11580 tgaaacgccc caggtcgtag acggcggaga tggccgcggc catctccagc gggcgatgac 11640 gcctgctcat ggccgcgtgg aacagggcgg ccagctccgg ccggcccgac agatagtcgt 11700 acaccggcag tccgaacgcc gtgtcgaagg cgggcagtcc ggtgtccagc gactcctgga 11760 gattcagcca gttgcgctgg aagacgtcat gggtgaggaa cgccgcgaac gccagcagcg 11820 acgccgggtc gtcccggcgc agcagcgccc ccgcctcggt gagcgcgaac cggccgggtt 11880 cggactccac gcacaggccg agcgatgcca gggcgcgcag cagtctgccc agctgttcgg 11940 cgggcacacc gcaggcgagg gccagttcgt cggtatcggc cccactctcg ccgatccggt 12000 ccgccagctc cagccgcgct gccgcgccga cggcctgggt ggccagggtt ccgaagatcg 12060 ctgatatcac ggtgcgccgg gcatcggcgt cggtcgtcat gaactcacca ttctctctgt 12120 gccggtgtac ggggtccgga ttcgggtcgt ccggcttcgg gtgccgatat cgacaaggcg 12180 tggtggaaaa tgtccaacgg gtcccactcc gccttgacct cttgcagcct gggatagttg 12240 tcgccgtaat acagcgtgtg ccagggaacg ccggaggtgt tccattccgg gtcggccaga 12300 tcgacatccg cgtaattgat gtaggcgccg gtgttgacct cgtccggcac cggaaccccg 12360 cccgtagccc tgtgcatgtc ccggtacagc tcgcggatcc accgcaggtg ccggtcgtcc 12420 tcggcggggt ccttccaggt gaccatgtag aacgtcttca gcagggcgcc ccgcggaatc 12480 gccgtggcct cgggcgggac ggcgttcact ctgccgccgt acgcgatgta ctccaccgcc 12540 gcgtaggtgc cccggtagtc gtcccgggtg agatggcggt ggagggcggc gagctgatcg 12600 gggcggtgcg gggcgcgcag atcggccgat ttgaccttgc accgcagtcc gatggccgcg 12660 gggccggtct cgggtacggc cagatactgg ctcgcggcca gccacggcag ggaggtgcgc 12720 cggaccgtgg gggcgatgcc cacgccggcg cccagcgcgg ccaggaacgc gtccatgcgg 12780 gcgtgggccc gcggcgcctc ctcagggagg tgggcgtgca gggtgatgtc gcccgccgac 12840 cggtgcgggc agtcgaggag tgcgtacaga tcggcgtcgg ccgagtccac cgcgctgttc 12900 gcgatctgcc agtccatgaa gttgccgacc aggcggacga agtccgcccc ggtgagccgg 12960 tcccacggcc agctcacctc ggcgatgtgc agggcgcccg gcgggcgcgg cagcaggccc 13020 gccggttcgg cgccggacgc ctccggggaa cgcatccagt accgggtgat cacgccgaag 13080 ttccctccgc cgccgcccgt atgggcccac cagaggtctc tgttcgggtc ggttttctcc 13140 cgtgtggcca ccaccgtcga cacgttccgg tcctggtcca cgaccaccac ctcgaccgcg 13200 tacaggtggt cgacgctgag ccccagcagc cgggacagcg gtccgtagcc gccgccgctg 13260 atgtgtccac cggcgcccac gcccatacag aacccggcgg ggacggtgac gccccagttg 13320 tggaagaggg tgtgcagcat ctttcccacc ggagcgcccg cctcgatgac gaaggcgttg 13380 cgctcccggt cgtaatcgac gcgggacatc ggggagaggt ccagaatcac ttccacatcg 13440 ggtgcggcga cgaagtcctc accgcagtgc ccgcgcgatc gcacggtgat ccgctttccc 13500 gctgtcaccg cttcccggac cgcctcttcg acctgccgcg gcgaatgcac gagacggaaa 13560 tagtccggct tcgcgatgaa ccggtagttg taggaacggc gcaggttctc gtagcggatg 13620 tcgtcgggcg tgacgcggga ggaaggcggc gtcggactgc ccggcccgct ctcctcttcg 13680 gcgtcctggt cgtcacgttt catgggggca agctacccgg gaaataaggc aggccacgcg 13740 gaacataaac gaaggttcga gccggttaaa tccggagatt tagacccgtt gccggtggtg 13800 gacgcgcggc tgtaaggtcc ggaatcacaa aagtggacag acggatcgtg cacctcgcaa 13860 ctgcattcat cgacggctgt acgaaactcg acggctatac gaaagaggca ccgaatgagc 13920 agtccggccg gcggacccgt ctttgcggac cgcgtgctca atgattggct gagcagtgtg 13980 gggctcggca tcgagtattc ccgagcggaa gccaacacgc tgtattacgt ggacgaggac 14040 ggtcaggagg ttccggtact cgaccatgtc tgcggattcg gctcgctgat cttcggtcat 14100 aacaacccgg acatcgtcgc gcatgcgaag acggtgctgg accggcaggt tccggtgtac 14160 gtccagctgt cccatcagtc gcacgccaac gatatcgccg ccgtactcaa tgccatattg 14220 cggcgggaga tcccgggcgg cgacggtgac tacgccgcga tattcgccaa cagtggagcc 14280 gaggccgtcg agatatgtgt gaagcacgcg gagctggagc gccgcgcacg cgtcgcgaaa 14340 ctcacggacg agaccagccg gaacgcggag gaggcgcgcg aggcggtcgg cgcggggcgg 14400 gccacggtcg ccgagaaccc gtacgtccgg ttcgacgggg cgagcggtga gtccgacagg 14460 ctggagcagc tgctcgccga ggccgggcgc aggaacgcgg agctgatcgc gcgggggccg 14520 gtccatctgg ccctggagaa cgcgttccac ggcaagctgg tcgccagcat ccagctcacc 14580 cagaacccgc actggcggct gccgttcacc tcgctggcct cctcgacccg cttcctccgc 14640 gccgaccggc cggacgagat gaaggcggcc gtcgaggaac tccggacgag cctgctggac 14700 gtcctggtgg acgacggcgt ggtgacggtg gtggagcggg acttccccct cgtcggcgcc 14760 ttcttcgtgg aacccgtcca gggcgcgaac ggcatgcgtc ccctgaccga ggcggcggcc 14820 cgcgagatcc gtgcggtgtg cgacgcggtc ggctgcccgt tgatcgtcga cgagatccac 14880 agcggtatgg gacggacggg cgccttcctc gccagttcgc acatgggcct gcggggcgac 14940 tactacaccc tggccaagag catcggcgga gggatcgcca agaacgcggt ggccctcttc 15000 caccgggacc ggttccgccc ggagttcgag gtcatgcaca gctcgacctt cgcgaaggac 15060 gggttctccg ccgccatcgc cctcaaggtc ctggagatgc tggaggccga cgacgggcgg 15120 gcgtacagga tcgcggccga gcggggggac cggctgaagg gggcgctcac cgcggtcgcc 15180 gccgacttcc cggacgtcgt ggacgcggtc cacggtatcg gcctcatgct cgcggtggag 15240 ttcaaggacc agaagggggc gtcgtccgag ccgctgcggg agaaggcggc gtcgggcatg 15300 ctcgggtact tcatcgcggg cctgatcctg cgcgagcacc gcatacgggt gctcccggtc 15360 ggacctgccg gaaactccgt ccggttcgaa ccctcgatct acctcaccga cgccgacatc 15420 gcccgtacgg agaacgcgct gcgcgatgtc tgcacgatcc tccgcgacca ggacggagac 15480 cgtctcaccc cctgacagcc ttcacgaggg agaggagcag cgcgctgaac gagcagacga 15540 ggcagatgag ttggagcggc agaagggcag ttcgaccatg agcggcgcga tacctaccga 15600 gtccctcgtg tcgggactga tggagcatgc cctcagcgcg gcctgcgcgg cctcggtgcg 15660 cgcggcggtg accctgggcc tgcccgaggc cctcggggaa gcgccggcca cggccgacga 15720 gctggcgaca gcggtcggcg ccgatccggg ggccctgcgg cggctcctgc ggtcgctgac 15780 ctcctacggc gtgttcgcgg aggaggggac tggttcgttc gtccacaccg agaagtcccg 15840 tgctctgcgg gaggactccc cggacagcat caagtacctg gtgctgtggt gcaccgaacc 15900 atggctctgg tcgctgtggg gagacctcga cgaatcggtg cggacgggcg gcgagatctt 15960 caccaggacg cacgggcgga ggttctacga gcacctgcac acccgatggc ccgagtccgc 16020 gcggatcttc aaccgggcca tgacccagca gacccggttg tccgcgaccg tcatcgcgga 16080 catgctcccg atgtcgggcg ccggcaccgt cgccgacgtc ggcggaggac agggtctcgt 16140 tctcggcacg ttgctggaac gccacccgca cgtacgggga ttcctcctgg acctccccga 16200 ggtcgtggcc aacgtggacg cgcgactgca cccgggcggt gaactggccg accgggtgcg 16260 gctggtgccc ggtgactgtc tcgaggggat ctccgtcgag gccgacgtct acctcttcaa 16320 gaacatcctg ggggccgacg acgacacatc ggtccggatc ctgcgcaacg ccatgaaggc 16380 ggcccggccg ggtgcccgga tggtcatagt ggagaacttc gtcgacgacg gccccggcga 16440 gaggctcgcc tccgcgctgg atctgcggat gctgctcgtc atcggcgggc agaagcacac 16500 ccgcgccggg ctgctgggga tcgcggagcg ggccggcctg accgtgcggg acgtacgccc 16560 ggtggactcc tcgctgcaca tgatcgagac ggttgtaccg gggtagccgt cgcaccacga 16620 acagcaccgg tcacctgtgg ggcccccggc atccgatgcc ggggccccac aggtgccggg 16680 ggagacgtca gccgacgacg accgccttgg accgcatgag gaaagagccg agatatccct 16740 cgtgcggaag gccggggagg gtccacgtgg tcggacggtc cccgatgtgg atcttgtcga 16800 tggacggctc ggcgaccagc gattcgatga gcccgtcgtc gtcggtgaag gcggtgagcg 16860 acaaggtgtc ccgcaggggc gccagccagt cgccctcccg ccgccacggc agcacccaca 16920 cgcaggggaa gcccatctcg atacgggcct gcggggcgtc cggtctgtcg agcaggaaca 16980 ccgcgggacg caacaccgcc gagccgtccc cgagttcggc gacgaccgtg tccgcaccca 17040 gcaacggccg cgcgtcgccc gccccggccc gcagatacgc gtccatctcg tgcgcgacgg 17100 cggcccggcg gaccggcagc cgggcccgct cgtcacccgg cggcagcgag ggaagctcgc 17160 ccagccgctg cgcgatggcc tcggccacgg gctccggatc gccctccacg aacaccgccg 17220 tcgcgttcac acacccggtg ccggcgtgac cggcgatcga ctccgtgatg gtggccagat 17280 gccgcccgac atccgtgccg gaggtgagga ggatcttcga ccggccgggc ccgaacggca 17340 tcaccagggt gcttgacccg tacttccggg ccacctcgtc gccgccgaac gcgatggcga 17400 cgtccgcctc ggcgaccagc cggtcggccg tcgcgtggtc cgtcggcaac aggacgacct 17460 ggtcatggtc gtaacccacc tggcgcaggg ccgagaccag ccggtgggcg gtgaacggct 17520 cccgctgcga cggccggacc gccacgcgat accccagagc cagagcgtcc agccacgagg 17580 tgtgcacggc aggggagttg gcgggagcgt gcaccatcag cacctcgcca cgacgggtcc 17640 acaccccgga gccgtgccgg gcccgctgat cgcgccagct gtcggccgcg cccaccggcc 17700 gggcctgacg cacattctcg tacgcggtcg cggcgtagtc gccgacctgc tgggccgact 17760 gccgtacggt ggcgatcggg atgccggaca cctcggcgac ggtccggtgg taggccgcgg 17820 cggtcacgcc gtcgaccgag ccgctcgcga agagccagcc cgcccgcgtc aggagtttct 17880 tgcgccgctc cggatccggt ggtgcggccc ggcgcagcgc ccggatgctg cgctgggcga 17940 acagccaggg aacgaggctc agttcggcca ccggttcgcc ccggacatcg tgcacgactt 18000 cgaggttccg gctccggtac ggacccccgg tgccgagcgc gtcgagggcg ggcggcgcgg 18060 aggacgatgc ccgcacgtca gtagacccct tcgatcacgg gggccgcacc gaactcccgt 18120 accggcttga actcggacac ggcgtcaccc gggagcccca ggcggtgcgg gtggcggatg 18180 ccggtgtccc ggtccaggtt gttgggcagg aacaggttcc gggtgaggtg atgggtgagg 18240 acctggccgc gctccccgta cggcacgttc ttcccggtgt cgggatcgat caccgagaac 18300 atggagaacg gcgacggcgg gtcgaagacc ggggactcgt cggccggtga gtccggccgc 18360 tccggcatcg cgcagaagat catcgtgctg ccgaagatgc tgacgaagtt gatctgcggg 18420 aagagctcgg tccggaacag gtggcgggtg tcctcgtcca tcgacgtacc gccgtagatg 18480 accgtgttga ccttctcgtt gatgagatcc accagatgat cgcgccggca gaccgcctcc 18540 aggagcggcg gggtgatgac catgacccgg atgtcctggc tccgcaggat ccattcgatc 18600 tggttcacca ggtgggtgat gtaggcgttc gcctcgtcga cggcgccgcg gccgatgacg 18660 agtttcgccc accgcgggtc gaagtccacg gtgaagaaga ttccgccgaa gcgctgggcg 18720 gtgtcctggg ccaggatgcc ggccatgtgc ggtcctgcgg gcatgatccc gagccagttg 18780 tcgcccgccg cgatgccgtg ctccaccagc ctgttgtagt accaggccca ggactgctcc 18840 cggacgtcgg gcagcacgaa cacccgcttc ggcgccccgg tcgtcccgcc gctctcgccc 18900 accaccggcg gcctggacag cctgtccggg ccgccgtagc cccgggggac caggtcctcg 18960 atgcgcgcgt cgcgcagttc gtccacgacg ttgggaaaca gcgtcagatc ctcgacggtc 19020 cgcacgtcct tgcgcggatc gaattccagc tttctgctct gctcctgcca gaacggcgat 19080 ccggtcttcg ggtcgaaatg ccatgccatg gcttcccgga cgagatcctg cgcctccggc 19140 gggctgtccc acggaacgtc gagtgcggat gcatcgcgtg aggtcattgt cttgctcctc 19200 ggcattctct ttcgacgggc agctgtattt tgaaggtaca aggattcctg gtccttcggc 19260 catacccgct gtctcgtccg atgggtcaac tggttttaga accgtggatc tagactcatt 19320 gccgcccgcg cggagttgtg cgtagcgtgc tgcgtgctgt tcctgttttc gtagccatcg 19380 catacgtgcg ggtgggagtg gaagatgaga gagagccgct acgacgtgat cgtcgtgggt 19440 gcgcggtgtg ccggatcgcc gaccgcgatg ctgttggcga ggaaggggta ccgggtcctg 19500 gtcgtggacc gggcggtctt tcccagcgac accctgtcga cccatctggt ccacccgccg 19560 ggcgtcgcgg cgctgcgcgg ctgggggctg ctcgaccggc tggtggccac gggctgtccg 19620 cccatccaca cctacgagtt cgacttcggc tccctcgtcc tgccgggcgc cccggggacg 19680 gaggccgaac cgtacgcgta cgcgccccgt cgcactgttc tggacaagct gctggtggac 19740 gcggcgaggg aggcgggggc cgaggtccgc gaggggttca cggtcaccgg gctcgtcctc 19800 ggcgacgccg gcgaggtggt gggcgtccgg gggcgcgggc cccgcggccc ggaggtgacc 19860 gagcgggcgc gcgtggtgct cggtgcggac ggtctccatt cgctcgtcgc gcgtgccgtg 19920 gacgcgccgc ggtacaacga gcatccgaaa ctgatggtcg gttactacag ctatttcagc 19980 gggctggaca tggacggtgt tttcaaggcc cattcccgcc cgtaccgcag tttcggtgcc 20040 tggcccacgc atgacggact gacgctggtc ggtggttgct ggccgttcgc ggagttcaac 20100 gacatacgga aagacatcga ggggaattac ctaaagaact tcgcgctggc gccggcctgg 20160 gaagaacgca tacgtgatgc ccggcgcgag gaccggatcg tcggagcggc cctgccgaac 20220 ttcttccgca aacctttcgg tcccggctgg gcgctggtcg gtgacgccgg ttattgcaag 20280 gacttcttca ccgcccaggg aatcagcgac gcgttcatct ccgccgagat gtgcgccgga 20340 tccctggacg acgccctgtc ggggcgcgcg ccgttcgaca cggcgatggc cgcctatcag 20400 gcggcccgcg accggcatgc gcgacccgtc tacgacttca cgctccaggt ctccacgctg 20460 gaacccctgt ccccggagtt cgagaaggtc ctggagggca tcgacgggaa tcagcagggc 20520 atggacgcct tcgcccaggt gaacgcgggt gtgacgtcga tggagcggtt ctccgccgac 20580 tggggtggcg ccgtccgacc cgtaccccgg tgacgtgacg aaaccgtccg cggctcggcg 20640 gacggtttcg ccgcgtcacc ggggcggagg ccggacggct cacccggagg tgacgggctc 20700 cccggacttc cacacccgtg tgatgcggtc cgggtcggcc agcacggtga tgtcggccag 20760 cggattccct tccaccgcca ggagatcggc gtcgtagccg gcttccagcc gcccggaccg 20820 gggcgcccgg ggcccgagcg tgccggggcc gtgggccgtg gccgccgtga tggcctccag 20880 cgggctcagc cccgcggaga ccagatgggc gaactcgctc gcgtgcccgc cccaggacag 20940 cgggccgcca cggtcgctgg tgcccagatc ggttccgagc gcgatggtga ccccggcccg 21000 gtgggcgatg ccgatcgcag tgaggtgccg ctcggccatc acctcgaagc ggtcccgcca 21060 ggccggcggc agcgcggcca cgttgtgccg gaacgcctcg tagatcgtcc gcgtgggcac 21120 cagcgtcatg ccccgctcgg ccatcaggtc tgcggtctcc tcgtcgatct ccgtgccgtg 21180 ctcgaccgtg cggcatcccg cgttgatcgc cgccaggatg ccggcccgcc cgtggcagtg 21240 ggccgccacg atccggtcgg cccgcgcggc ctcggtgacg atcgcgttca actcctcgct 21300 gcggtactgc tggtggaccg ggttgtccac ctcgctcagc acaccgcccg aggtgcagat 21360 cttgatcagt tcggcccccg cccgcagttg gaggcggacc gcccgcacgc actcgtccac 21420 cccgtcggcg atgcgcaggg tgccgccgct ccggcaggga tcggtcaccc agcggtaggg 21480 caggcggtgc gcgtcgctgt ggccgcccgt ctgcccgatg acctggttcg cactgtagat 21540 gttcggcccg gtgaaggtcc cttcccggat ggcttcagcg agtacgcacc cgtggccgcc 21600 catgtcgcgc acgctggtga agccggcccg cagcgccgtc tccgcgtcct tcacgctgcg 21660 ggcgacggcc agcgtctcgg gggtgagcat cagttcctcg gtgctgaccc ggccgcgtat 21720 cccggcgaag tgcacatggc agtcccacag gccgggcagc agagtggcga cccgtgtcgg 21780 ctccaacgcc cggtgctcgc ccgacagttc ggcggtcggt cccgcgaagc ggatacggcc 21840 gtccgcatgc acgacgctgc cgttcacggt gggctcgccc gcgccgggga tcagcaggtc 21900 cgcgtcgatg cgatgttcca tggtgctacg cctccacggc tgtccgccgc gcgatgtcgg 21960 cgcggagcac tttcttgtcg atcttgccga tggcggtcgt cggcaactgg gccaccgtct 22020 ccatcagttc gggcagcttc cagacggcga gaccgagtcc gcgcaggaac cggcgcagct 22080 ccagcagggt gggctcccgg cccgcctccg gcacgacgta caggcagacg acctccccgt 22140 acagctcgtg cggggcgccg acggcggcgg acgcgcgcac gtccggatgc cgtgccacca 22200 gcaactccag gtcctcggcc gggatcttct ccccgccgcg gttgatgatg tcgcgaaccc 22260 ggccgttgac gacgagatcg cccccggacg tccgggtcgc caggtcgccc gtacggtaga 22320 agccgtccgg ggtgaacgcc ttcgcggtcg cggccgggtc ccggtagtac cccgcgacgg 22380 tgtagggccc ccgggtgagg agctcccccg tcgtgccgtc cgccaccgga cggccctgct 22440 cgtccacgat gagcacctcg tccatcgggg aggcggggcg gccctgggta ccgacgatca 22500 ccgactcggg gtcgtcgagc cccgtgtagc tgagcagccc ctcgctcatg ccgtagcact 22560 gctggagcgt cgcccccagc tcggggcgga tccgggccgc cagctccgga gcggggcggg 22620 cgccaccggt ctggagcacc ctcagactgc tgagatcggc ggtcgtcccc tcggcggccg 22680 ccagccactg catggccacc gtcgggacca gcgcgcagtg tgtcacgcgc tcgcgctcga 22740 tcagcccgag cgcctgccgg gggtcggtgg tgtcgccgag cacgaccgtg ccgccctggg 22800 ccaacgtgcc gagaacgccg gggcagttca ggacgaatcc gtgggcggcc ggcatcaccg 22860 ccaggtacac cgcgtcctgc gagacgtccg cgattgcgca ggccgtgcgg atcatgtacg 22920 cgtatcccgc gttgccccgg gggatcacct tcggcgggcc ggtggtgccc cccgacagca 22980 ggaacacggc ggcgtcggtg agggcggagc cgcgccccgg gcgcgggtgc gcggaggagg 23040 cggggcccac cggcgcgtcc cacggcgcac cggggtcgca cagtgccgcc aggtcgacgt 23100 caccggccgc gacatcgccg agcgcgagga catggtcaag ggtttcgtgg tcctcgcgca 23160 gtcgccgggc cgattccacg gggtcggacg acctgctccc ggcgctcacc gcgagggcga 23220 ccggccgggc ggaccgcacc acatggccca gttcgtagtc gccgaacccg ggcagcacca 23280 ggaccggtgg ggcgccgatg tcgagcagcg ccaggaccag gaccacgaac tcggcacagt 23340 tgggcagttg gaccaccacc cggtccccgg cacggacccc gagcccctcc agccgccgcc 23400 cggccacggt gacggccgtg tgcagctcgc cgtgcgtcag ccgcgtcgta ccgctcacca 23460 cggcgaggcg gtcggggtgg cgtagccgcg cggcccgcag cagctcggcg acgccctcct 23520 cgcgccacag cccggccgcc agataggcct cggcccggcg ccggtcgatc accggtacgt 23580 ccggatcgag cggccggctc tgatacgggc tcatcggggc atctcccgta tccctgccgc 23640 caactgctcc atggtgtagt ccgccttgtc ccacagattg atccggacgc cgaatcgttc 23700 gcgcaactga atctccaggt cggtggtttc gaccgaactg agaccgagat cggcacggag 23760 ccgggtaccg ggttcgatct cggtcagcgt ccgctccgcg ataccgatat cgagcaacaa 23820 ctgcttgagt acatccacgg acacggcggc tccttgattt cccgacggca aggcaggccg 23880 aggctattcg acgtaccccg gcctgacaat ggactagtac ctcggtacta gccggcaccc 23940 cggaaacccg gtgcggtggg gcgaggtcgt agaacggcac tagacccacc ggtgtatttc 24000 cgatgggagc gggatttgag agtgttgcgc cgatgccggg cgaatatcgg acgtcagcgg 24060 tgcggtggtg accgcggaat actcgccccg ttcttcggat cagcgtagtg atcggcgcag 24120 tgctcagcgc agcctcggac cagcacatat cgatgcccga actgtcggat gacacgtgta 24180 aaggcggtcg ggagccttgg gggtggacgg tggaacggtc gtacgagatt ccctacccgg 24240 aactcgggtc cgtgctcgga gagcaggaaa tcggggtgct cacccggttg gtcacatccg 24300 gtgagaatct ttcgggaggg cgatgtcggg aggagttcga gcgctgtttc cgggaatacc 24360 tcgacgtgcc gcacgccctt tcggtgacca gtggaaccgt ggcgctggag atcgcgatac 24420 ggctgctcga cctcaatgag ggcgacgagg tcatagcgac cccgcagacc tacaaggcca 24480 gtgtgcaacc gctgttgaac tacccggtga aggtgcgctt ctgcgatgtc gggccgaaca 24540 cgctcaacat cgaccccgga cacttcgaat cgctgatcac cagccgcacc aaggccgtga 24600 tcctcgtgca ctacggcggt ctgccctgcg acatggacgc catcatggcg atcgcccgcc 24660 ggcacggaat caccgtcatc gaggactgcg cccacgcgct cggcgcggag taccggggac 24720 gcaagccggg cgcgctggcc gacatcggct gcttcagctt ccactccagc aagaacatca 24780 ccaccctcgg cgagggcggg atgatcacgc tgttcgaccc ggcgctggcc gaacgggcgg 24840 accgcatccg ctccaacgac gccgacgcgg tctaccgcgc acaggccagg gccatcggga 24900 acaccaccag cgcgcacccg tggatgctgc accccggcgc cgccttcacc cacgactgct 24960 cgacgatccg ttacggcggc acgaacgcca ccctcgccga gccgaacgcc gccgtgggca 25020 ccgtacaact ggcgaagctg gaccggctgg tgcgccgacg cgcggagatc gcggcggcct 25080 acacggacgt cctcaaacag caccccggcg tgcggatgca cgagggaccc gatccggtcc 25140 ggcacgccca ccacctgttc acgttcttcg ccgacccggc cgacggcatc ctgcgggacc 25200 ggctggtctc acggctcgac gcgctgggcg tgcagatgca gttgcgctac ttccccatgc 25260 acctcctcgc cgaatggcgg gccagaggcc acaccgccgg tgagtgcccg gtcgccgaac 25320 gcctgtggtt cgagcaacag gtcaacctgc cctgccaccc ggccatgacc gaccggcagg 25380 tcggccaggt cgtgtcccgg ctggacaccg tcctcggcca ggaggcccgg gaccaaccga 25440 tcgcgtccgt gacgacgccc acagcggagg gctggaaccg atgagcgaaa ctgagcacta 25500 cgacatcgcc gtgatcggcg gcggccccgt cggactcgcc tccgcctggc acgccgcccg 25560 ccgcggcgaa cgggtcgcgg tcttcgagca gttctccttc ggcaacgaac agtgcggcac 25620 gagcggggct gagcggcact ggcggctgca gtacaccgaa ccggatctgt gccggctcac 25680 cggggaagcc ctgccgctct ggcgggaact cgaacgggcc accgggcacc agctcctcca 25740 cgcgttcggc agcctctggt tcggcgacat cgatgtcgcc accaacgagg gcaggatctc 25800 ggcgaccgcc cggaccatgg acgacctgtc catcccgtac gagtggctca ccgcggccga 25860 catcgagcgc cgctacggat tcaccggcct gcccggccac ttcgaagggt tcgtccagcc 25920 cgacggcgga gccgtcgacg tacgcgccac cgtggaaggc ctgctgcggc tgaccgagga 25980 agcgggctgt gccctgcggg cgcacgagcc ggtgctggaa ctgatccccg acggcggcgg 26040 cgtgaccctg cgcaccgcac gcggccgctg ccgggcgggc aaggtcgtcg tggccaacgg 26100 cgcctacgcc aacaaactgc tcgaaccgct gggctcccgg ctcgacctcc acgtgttcga 26160 gatggcgctg gtgaccctgc gccagcgcga ccccaaggtc cgctacccgt tctggttcgt 26220 cttccaggag ccgaccgagg aggacaccaa tctgttctac gggttcccgc ccaacgcgtg 26280 gcaggacacc gacacggtcc gggtcggccc ggtgttcgag gtcaacgcgc tggccgaccc 26340 cgcgcgggcg accggcaccc ccgatccgcg ccatgtggcc cggatgtgcg agtgggtcga 26400 acggcatctg cccgtggtgg acccacggcc gctggcccgc gacacctgcc tcgccgtcct 26460 gcccgccgac cctgagcggc agttcttcct cggcacggcg aagggccggt tcgacggcgg 26520 cgagaacgtg gtgatcgcca ccggcgggtg ggggttcaag ttcgtcccgc tgctgggcaa 26580 ggtctgcgcc gatctgtgcg tggacggcgc gaccggctac cacgtggacc gcctgatgct 26640 gcccgacgcg gcccacccgt gagccgttcc gcgccgacgc gtggccgacc cttccccagc 26700 aaaggagatt ccccggtggt ttccgaagcc cccgccacgg gtgcccacca cgacggtacg 26760 tccgtcgaca tcgcagtcgt cgggatggcg ggccgcttcc ccggcgcccc cgacctcgac 26820 gcctactggc acaacctgcg ctccggcgtc gagtcgatcg agcggctcac cgaggacgac 26880 ctgctggccg agggcgtcga cccggagctg atcggcgcac cgggctatgt gcccgtcgcg 26940 ccggtgctgg agggcatcga cctcttcgac gcccggttct tcggcttcac cgcccgggag 27000 gccgcgctgc tcgacccaca gcagcggctc ttcctggaga gcgcgtggca cgccatggag 27060 cacgccggga tcgaccccgc ccggtgcggg accgccgcgg tgttcgcggg cggcaacatg 27120 cccgcctatc tgatgtccaa cctgctcggc ggggcccggg tcgtcctcga ctcggccatg 27180 ttcgagctcc agatacacaa cgacaaggac ttcctggcca gccgcaccgc ctacaagctc 27240 ggactgaccg gacccgcggt gaacgtccag acggcgtgct ccacctcgct ggtggccgtg 27300 caccaggccg ccgcggccct gcgctccggc gactgcgaga tcgccctggc cggcggggtg 27360 tgcgtccggg tgccgcaccg ggtcggctac cgctacgaac agggcctgat ctacgcgccg 27420 gacgggcgct gccgcccgtt cgacgccgac ggcgcgggca cggtcttcgg caacggagcc 27480 ggtgcggtcg tcctcaagcg gctggccgac gcccggcgcg acggcgaccg gatcctcgcc 27540 gtcctcaagg gctccgcggt caacaacgac ggggcggaga aggtcggtta cacctccccg 27600 agcgtgagcg gacaggaagc cgtggtcgcc gccgcgatcg ccgacagcgg ggtgcccgcc 27660 cggtcgatca ccgcgatcga ggcgcacggc acgggcaccc acgtcggtga ccccatcgag 27720 atcacggcgc tcagcagggc gttcggacgg cacacgacgg acacggggtt ctgcgccgtc 27780 ggctccgtca agtcgaacat cgggcatctc gaatcggccg cgggcatcgc cagtttcatc 27840 aaggccgtgc tgcaactgca ccaccgcacc ctggtgccca gcctgcactt cgagcgcccc 27900 aacccgcgta tcgacttcga cgccaccccc ttcttcgtca acacggaact gcgtgcctgg 27960 cccgaggggg agcacccacg gcggataggg gtcagctcct tcggcatcgg cggcaccaac 28020 gcccatgtgg tgctggagca ggccccggat ccggtccccg ccgagccctc cggccggccg 28080 gagctggtgg tggtgtcggc caagtccccg gcggccctgg acgcggccac cgaggcgctc 28140 gcggagaagc tcgccgcccc cgacgcgcag ccgctcgccg acatcgccca cacgctccag 28200 accggccgcg gggccatgcg ctaccgcagg gccgtcgtgg ccgccggcac cgccgaggcc 28260 gcggcgctgc tctccggggc cgacccgggc cgggtgcgca gcgcggacgc gggcactgcc 28320 ccggcgaagg tggtgttcct cttccccggc cagggcgcgc agtaccccgg gatgagccgc 28380 ggcctgtacg cgagcgaacc ggtgttcgcc gaggccctgg acgcgtgcgc ggacctgctg 28440 gccgaggagc tcggcatcga cctgcggacc gtgctcttcc ccgacgcacc cgccgaggac 28500 gggctcaccc acacgacgct cgcacagccc gccctgttcg ccaccgagta cgcgatggcc 28560 acgctgctgc ggtcctgggg cgtcgaaccc gacgtcatgg tcgggcacag catcggcgag 28620 ttcaccgcgg cggtcctgtc cggggtgctg tccctgaagg acgcggcgcg cctggtcgcc 28680 ctgcgcggca ggctcatgca ggaccggccg accggggcca tggtctccat cgccgcaccg 28740 gccgccgaca tcgaacccct cctgccggcc ggggtctcca tcgccgccat caacgccccc 28800 gtgctctgcg tcgcctcagg gccgcacgag gcggtggcgg aactcggcga gatcctggcg 28860 gccaaggaga tcaccgtccg cccgctccac acctcccacg ccttccactc cgcgatgatg 28920 gacccggtcg tggagccgtt caccgaggcc gtcgccggaa ccccgctcgc cgcccccggc 28980 ctgccgttcg tctcctgcgt caccggcctg cccatcaccg ccgagctggc cacggacccg 29040 cagtactggg gcacgcacct gcgccgcccc gtccgcttcg ccgacgccgt gcgcaccgcg 29100 atcggcgacg gccccgccgt cctcgtcgag gtgggccccg gcaacaccct gagcaccctg 29160 gcccgcgccg gggccggcac cggcgggccc cgctgcgcgg ccgtcaccac cctgcgcagg 29220 cccgacgagg ccgccgacga cggccaggtg ctccgcaccg ccgtcgggga catctggctc 29280 ttcggcggcg cggtggactg gcccgccctg caccagggcc gccgcaaccg ggtcgaactc 29340 cccggctacc ccttccagcg cgaccgctac tggatcgaac cgcggggctc cgcgaccggc 29400 acgcccctgg tcgccgactt cgccgagcac gaggaggccg agacggagcc ggccggcagg 29460 gcgacccggc cgagcaccct ggtcaccgcc tacgtggcac cggcggacga gctggagacc 29520 acgatcgccg ggatctggga ggagatgttc ggcatcgcgc cgatcggcac gcgggacgac 29580 ttcttcgagc tcggcggcca ctcgctcctc gccatccagg tcctcaaccg cctccaggcc 29640 acctccggcg tcaccgtgga gctgggccgg ctgctggcca cccccaccat cgggggcatg 29700 gccgaggagc tgcgcgcggc cggtgccgcc gggaccgacg accggctccc caccgtcgta 29760 ccgcgcccgg acctgcggta cgagccgttc ccgctcaccg agatgcagca ggcccagtgg 29820 atcggccggc tgtccagctt cgacatgggc ggagtcgccc ctcatctgta cttcgagttc 29880 gacagccgca ccatcgagac cgcccggctg gagcgggcct ggcagcgagt ggtgcagcgc 29940 cacgacatgc tgcggatggt ggtcctcccc gacggccggc agcagatcct cgacaacacc 30000 gagccgtacc gcttcgaggt gctcgacctg cgcaccaccg acccggagga ggcagagcgc 30060 caactggccc agatccgcga ccggatggcc accgaggtgc gccccgccga cgtctggccg 30120 ctgtgggagg tacgggtcgg cctgctgccc gaccaccggg tccgggtgca catcagcttc 30180 gatctgctgg tcgccgacgt ctcctcgttc ttctaccaac tgctgccgca atggcgggag 30240 ttctaccacc atcccgaaca cgaccccgag cccctcgccc tctccttccg ggactacgtg 30300 ctcgccgagg aggagttgcg gcggacgccc cggtacgaac ggtcgctgga gtactggcgc 30360 aagcgggtcc gcgagctgcc cgccgcaccc gaactgccca cggtgcaggg tgccgggggc 30420 ggcgaacggc tggggttcgt ccgcaggcac gcccggctgg acgccgagct gtggggccgg 30480 atcaaggcga aggcggggga gttcggcgtc accccgtcca gcgccatgct cgccgcgttc 30540 gccgtcacca tcggcacctg gagcaagtcc cagcggttca ccctcaactt cacggccgtc 30600 aaccggctcc ccgtccatga ggaagtcgac gatgtggtgg gcgagttcgc ctccttcgac 30660 ctactggagg tggacgcggt ctcggcaccc gacttcgccg gtctcgtgcg ggaactgcaa 30720 cggcagagct gggcggactt cgaccaccgt tacgtcagcg gcgtacgcat cctgcgcgag 30780 cgggcccggg cacgtggcgg cgcgggcgat gtcatgccgg tcgtcttcac cagcgcgctc 30840 ggctcggacg tggacggcaa gcccgcgccg tcgccggtgg actggctcgg cgagcagagc 30900 tacttcatct cgcagacccc gcaggtgacc atcgaccact tcctgctgga gttcgggggc 30960 aacctggagc tggcctggca cgccgtcgac gggctcttcc ccgacgggct gatggaggag 31020 atgttccagg cctaccagga cttcgtggtg ggcctggcgg agaccgacgg ctggcaccgc 31080 cccccggtgc tcgacctgcc cgccggtcag ctcgcacccc gcgccgccgc caacgacacg 31140 gcgggcgagc tgcccgacgg cgtgctgccc gcccggatcc tggcccgggc cggctccgcc 31200 gaaccggcgg tgatcaccga ggaccgtacg ctcgactacg ccgaactcac cggccgggcc 31260 gtggcgttgg cccgcgaact gaccgaggcg ggctacggcc gaggggctgt cgtcggtatc 31320 ggactcgcca agggctggcg ccagaccgtg gcggcgctcg cggcctccgc ggccggctgc 31380 acgtacgtac cactggaccc cggccttccc gaggcccggc gccgctggct ggtcgaacag 31440 gccggtatcg gctgcgtact ggccgaaccg gacacggcgg cactctggcc gaacgcgccc 31500 cgggtgctgc cggtcgccga ggacgcgcgg tgggaccccg cggacacggc ggcctggagc 31560 tgtcccgccc ggccggagga taccgcctac gtcatctaca cctcgggctc caccggaacc 31620 cccaagggcg tggcggtcag ccaccgggcc gcgctcaaca ccctggtgga catcgaggag 31680 cggttcggca tccggccggg agaccgggtg ctgggtctct ccgccctcaa cttcgacctc 31740 tccgtcttcg acgtcttcgg gatgctcgcg gcgggcggtg ccgtcgtgct gccggaggcg 31800 gcggaccggc gcaacccgga ccgctggacc gagctgtgcc ggcgccacgg ggtcacggtg 31860 tggaactcgg tgcccgcact gatgcagatg ctggtcgaac acctggagag ccggggcccg 31920 gccgacgacg cgggccacct ccccggcctg cggctcgccc tgctcagcgg ggactggata 31980 ccgctgtccc tgccggaccg gatacgggcg gtggcccccg cgaccgacgt gatcagcctc 32040 ggcggcgcga ccgaggccgc cgtctggtcc atcgcccacc cgatcggcga ggtcgatccc 32100 gactggccgt ccgtcccgta cggacgcccc ctgcgcaacc agcgcttcca cgtcctcaac 32160 gaccggctgc ggcacgcgcc ggtgtgggtg ccgggccagc tgcacatcgc cggggccgga 32220 ctcgccgagg ggtactggcg ggacgagcgg cgcaccgccg agtcgttcat cacgcacccc 32280 gagacggggg agcggctgta ccgcaccggc gacctcggac gctatctgcc ggacggcacc 32340 atcgagttcc tcggccgcga cgacttccag gtgaagatcg gcggccaccg gatcgagctg 32400 ggcgagatcg agcacgcgct cggctcccac cccgagctgc tcaacgccgt ggtgtccgcc 32460 cccggggaac gcaaccgcca acgcctggtc gcccatgtcg tcccggccga ccccgggacg 32520 cggaacgatg cggacttcgc cgaccggctg cgggaccacc tcaccaccac cctgccctcc 32580 tacatgatcc cgtccgacat cgtgctcatc gacgcgatgc cgctgagcgc caacggcaag 32640 gtcgaccgct ccgcactgcc cgaccctcag cggacgggcg acgccgaggc cgcggcgagc 32700 gcggccgagg acgacgggga ggaggccacc ggggcgctgc gcaccctgct ggtcctggca 32760 gccgacctcc tcggcgtgaa cgggccccgg ccgcgggaca acttcttcga gctcggcggc 32820 gactccatca tgggggtgca gctcgtcggc agggccaacg ccgaaggcat ccccatcacc 32880 ccgcagaacc tgttcgagag caccaccttc ctcgaactgg ccgcggccgt acccgtcgag 32940 ccgggaaccg acgacacggg cgaggcggtg gcactcaccc cgcaccagac cctcgcccac 33000 gcccaggtgg gctcggtact cctcgacgta ccggacgcct tcgacccggc gtccgccgcc 33060 cgggcactga acgccctggc cgaccgccac cccgccctgc gcacccgcgt gcgtaccgag 33120 gacggtcagc ggttcgccgt acgccccggc cccggcgagg acttcgacgt accggagatc 33180 gacctcgccg ccctgcccga cgacgtacgg gccgaagcgg tggcggagat gatcggcgag 33240 atggccgggg aggtggacat cgagaccggt ccggcggtca agttcgcggt gttccggctg 33300 ggcgaacgcg gcagcgtact ggcctgcacg gcggcccagg gcctgatgga cgacgcctcc 33360 gtcctcctgc tctgccgcga actgatccag gcgtacgacc ggttggccgc cggccggccg 33420 gtggtgtgga gcgacggcgc gggctcaccg caggcatgga accggggcct gcgccggaag 33480 cccgcgcacc cggccgggct cgccgaaaat ccgggaacag caggggagtt gccccggcag 33540 cggaacatgg agctggacgc cgcgcgtacg gcaggcctgt tcaccgcagc cgcgggcagc 33600 caccacctgg accccaccga ggtgctcgtg gccgcggcgt ccgccgccct cggcagggcg 33660 ctgcccgaac cgccgcaact cctcgtggaa cgttcgctcc gcgacgacct ggccgccggc 33720 gacgaacccg ccggacgcct ggtgggccgg acgaccgagc tgcgcacggt gcagccggtg 33780 gcggcgggta cccccctcga cacggccttg acctcggtga agggccggct ccggacggcg 33840 gacccggacc cggtccgggg gacgacggtc gcggtcaggg aagtcgtgac ctgggaccgc 33900 gtcgagggcg cggtcgaggt gcccgcggac ttcgccggag tgaccggcct cgcgggctgg 33960 cacgaggaga cggtggggca gctgagcgcc gccgtggtgg acggcgccct gcggatccgc 34020 tggcaactgg ccgcatccgt gccggaggac gcggcgaccc gcctggccga cgcgttcggc 34080 acggtgctcg gggagatcgc cgagcactgc cgccgggtgg cggagggctc ctacgaaccg 34140 tccgacttcc ccctggcgga cctctccggg gacgaactgg ccgagttcct cgatgaactg 34200 cggtgacggc gaccggtgac cgacgaatgg cgacaacggc ggacgggact gtgatgactg 34260 acaacggtgc gacgaagcga ccggtccgcg acatcgcgga catctacgaa ctctcgccga 34320 tccaacaggg cctgctctac gaacagctgg cgcagccggg tctcggcatc tacgtggagc 34380 agctggggct ggagttctcc gggacgatgc accccgagca cttcgagcgg gcgtggcagc 34440 tggtcgtcga ccggcacccg atcctgcgca cctccttcca ctggcgcaag gacggaagcg 34500 ccgtgcaggt cgtgcacggc tccgcccggc tgccgctgga gacactggac tggcgcgacc 34560 tcgacgaacg tacccaggag gagcggctgc gggcgtccct ggacgccgag agggccgagg 34620 ggttcgatct caccgatgtg ccattgatgc gcagcacgct gatccggcgc ggcgacgagc 34680 ggtggacctt ctcctggcgg ttctcgcacc tgctgatgga cggctggtcc ttcaccctgg 34740 ccatccagga cttcatcgac cactaccggg tgctgtgccg gggcgggcgg ccgacgctct 34800 caccggggcg ctcctaccgc gactacctct cctggtggcg cgaccgggac cccgaggagg 34860 cacgggagtt ctggcgcgag gagctggcgg actaccggcc ggtggagcag gtgcacctgg 34920 gcgggaccgg gattcccgaa ggggagccca cccacgccca cttcgagcgg atcctgggcg 34980 atctggcccc ccggctgacc gccctggccc gcgcggaaca gctgacactc gccacgctgg 35040 cccaaggcgc ctggttcatc gtgctgggcc gcttcctcgg ccgtacggat ctcgcctgcg 35100 ggatcaccat ggcgcaccgg ccccccgacc tcgtcggctc ccaggacatc ctcgggccga 35160 tgatcgccac cctgccgctg cgcaggaggc tggaccccgc gatgcatctg cggtcgtggc 35220 tgcgggagtt cggcaagcac ggcatcgagg cgagcgggca cagcgccgtg cccctgaccg 35280 agatgcaggc cctgctcgga acggactccg cgattccgat cctgcagagc agcgtgtcct 35340 acgagaacgt gccgatgccc gacttcgacc tggcggacgt cggcgcggag atgaccgaac 35400 tggtctatga cggcaggccc cacttcccga tcaccatggt gatcatgccg ggcgccgata 35460 tgccgctgcg ggtcgtccac gaccggcgga aggtctccga cgaggtggcc gagcggttcg 35520 ccggggaagt cgtctcggtg ctgacgcaga tgatcgaacg cccggacgtg acgctgggag 35580 agctgacctt cctcagcacc ccgcagcccg ggaacacccc cctgaccgag ccggacgccg 35640 aatccctgca cgagacgttc cggcggcacg cgcggctgcg cccggatgcg accgccgtgc 35700 gctgcggagg ccgcgccctc acctaccggg agctggacgc gtactccgac cggatcgccg 35760 ccaccctgcg cgcgcggtgt ccgggcgtga cccgcgtggg gctgtgcctg ccccgctcga 35820 tcgagctggt ggccgccatg atcggggtgt tcaaggccgg cgcggcctat gtccccctgg 35880 acccggagta cccggcggac cggctggccg acatgctcgc cgactcggcc gcggagctgg 35940 tgctcaccga cggcgcaccg gccgacgcgc tcaccgcggg caaggccggg ctggtcaccc 36000 tccccgagat ggacggagag ccggaccaca acgctccacc gccggtgccg gccgaccccg 36060 acgccccggc ctacctcctc tacacctcgg gctcgaccgg gcggcccaag ggcgtcccga 36120 tcacccaccg caacgtccag agcctgctgg ccgccggccg cgaggtcttc ggcttcacgg 36180 ccgaggacgt gtggaccttc gcgcactcct tcgccttcga ctactcggtg tgggagatct 36240 ggggcgccct gggcaacggc gcgtcgctgg tcgtggtgga ccacgagacc ggacgcgatc 36300 cgcgcgccct ggcccggctg atcgccgagg agcgggtgac ggtgctcagc gagacccccg 36360 cgctcttcga gcacctggtg cccgaactgg ccgacgacac gtcactgcgc agggtgttcc 36420 tgggcggcga ccggctggac ccggcgatcc tgcgcccctg gttcgcccgc ttcggtgacc 36480 gggacggagc cggccgggag accaccggcc gggcccccgc ccccggcatc gagctgtaca 36540 acctctacgg agtcaccgag gccaccgtgg tctccacgta ccaccgggtg cgcgaggagg 36600 acgtaagggc cggccggccg gtcccgatcg ggcgcgcgct gcccaaccag cgggtgtacc 36660 tgctcggaga ggacgaccgc ccggtcccgg tgggcgccac cggccagttg tgcgtggccg 36720 ggcacgcggt agcctccgga taccacgacc gggacggact caccgcggaa cggttcggct 36780 ccgacccgtc ggcgggtcct tcctcggcgg ccttccccct gtaccgcacc ggcgacctgg 36840 ccacggccac cccggacggg gaggtgcact tcctcggccg cgccgacacc caggtcaagg 36900 tgcgcggctt ccgcgtcgaa ccgggcgaga tcgaggccgc gctgcgcgag acgccggggg 36960 tgcggtcggc gacggtgacc gtccacggct ccggaacggc gcggcgcctg gtcggctatg 37020 ccgtgcccga ggacccggac gcggtcctca cagcggggcc ggttcccacc gaaccgctcc 37080 gggaacacct gcgcacccgg ctgcccgagc acatggtgcc ggccgcggtg tactggatcg 37140 accggatccc gaccacgccg ggcggcaagg tggacgtcgc cgccctcccc gtaccggatg 37200 ccggagggac cgaccggaac accgcgccga tgaccgaggc cgagcgcctg ctcgccgggc 37260 tgctgaccga ggtcctccag gtccccgacg tgggcgcgga cgacacgctg ggcgcgctgg 37320 gcctcgactc gctcggcgcg atgcggctgg cggcccggct gcgcggcgcg tacgccctcg 37380 atctggcggt gagcgatctg ccggccacgc gtacggtggc cgagctggca cgcgcggtgg 37440 aagccgcccg cccggtcgcg ggggaggggg cacggtgaac gcgaaccccg gacagacccc 37500 gacggacgcc acggatcacc ccgacagccc ggagaaccag cgctggcggg agcggatccg 37560 gcacttcgcc gagaaggaga tcgccccgct ctccaccacg atggacagga cggcgactct 37620 ggacgccggc ctgcgggagc gcctcttcgc cgaggggctg atgtcggtgg agatcccgcg 37680 cggctacggc ggcacgggcg gcaccctctg ccaactgatc ctcaccatcg aggaggtggc 37740 gcgggtggat cccggcgtcg ccgtcggggt gcacgtgcac aacgtcctcg tcgcgggcac 37800 gctgctgcgg cacgcgtcgg gcgatcagcg ccggcagtac cttccgcagc tggccaccgg 37860 gaagatcggc gcgttcgccc tctccgagga acaggcgggc agcgacgcgt tcgcgctgac 37920 caccgtcgcc cggcaggacg aggcgggcta tctgctgacc ggccgcaagc ggtggaccag 37980 caacgcgcgc aacgccgatc tgctgctcgt cttcgccctg gccgacgcgg gcgggcccac 38040 ggcgttcgtg gtccccgccg acgcgcccgg ggtgtccctg gacgaccggg tgcagcagat 38100 gggggtgcgg gcggcggcca cctcggacgt gatcttcgac gggaccccgg tccgtaccgc 38160 ccagcgcgtc ggcccgcccg gcggcggcca gacggtggcg ctgtccgggc tcggcctcgg 38220 gcggctcggc atcgccgcgc agatgaccgg cctcgcccag ggagccctgg acgcggcggt 38280 cggctacagc agggtccggg agcagttcgg cggccggata gccgaccacc agggggtcgc 38340 gttcccgctg gccgatgtgg cgagccggct cgcggcggcc cgggcactgc tgtaccgggc 38400 ggtcgacctc cacggccggg gcacggaccc ggtcgagctg atgaggctga ccgccatggc 38460 gaagtacgtc gcgtccgagg tggccgaacg ggcggcctcg gtcgccgtgg aaaccctcgg 38520 cgggaacggg tacaccgacg cctaccccgt ggagcggttc taccgggacg ccaaggccgg 38580 gaagatctac gagggcacgt ccaacgtcct gctccgcacg atcgcctcga tcctgatcgg 38640 aggaagcccc ggtgactgag cccctgcgcc tgctctgttt cccgtacgcg ggcggaaacg 38700 cccagaccta cgtccggtgg cgccggcacc tggcaccgga catcgaggtc tgccccatgc 38760 aactgcccgg ccacggcgag cggatcggcg agccgccccg ccaccgctgg gacgacctgc 38820 tggccgacat ccgcaccaga ctgaccgacc tgacgactcc tgaggaccgc ccgatcgccc 38880 tgttcggcca cagcctgggc gccctgctcg ccttcgagtg cgcccggatc ctggtgtccg 38940 agcacggcat ccggccggcc cgtctgctgg tctcgggcca ccgggcgccc cacctccccc 39000 tgcgggagga gacactccac cacctgccgg acacggagtt cctcacgagg ctcagtgagc 39060 ggtcgcgcac gctgcgcgcg ttgaccgacc ccgagttccg caaactcctc cttcccatgc 39120 tgcgggccga cttcaccgcg tcggagacct acacgttccg ggagggaccg acgctgacct 39180 gcccgatcac ggccctgggc ggcgagaggg acgaggacgc gaccctgggc gaggtcgccg 39240 cctggcaacg gcacactacg ggccggttcg agctgacggc cttcccgggc gatcacttct 39300 tcatcgacga cgcctgggag gcggtggtga ccgcggtcgg cgaccggctg cggtcacgag 39360 aggggagcac accgggctga ggggcgcgaa ggcccgcacg gaacccgcag gtggagagac 39420 ggctccggct ccgtggacgg gctacgccag gaccggaagg gacacgagtc cgcggacgag 39480 acgggttcgt cgccactcca gcagctctgc cggtacggca aggctgattc ctggaaagcg 39540 ggccaggaca gcccgaagag cgatctcgag ctcggccttg gccaggggtg caccaacgca 39600 gcggtggatg ccgtggccga aggcaaggtg agcagtcgcg ttgcggttca gatcgagctg 39660 atccggcgcc gggaaccgcg tggggtcccg gttggctgcc ccgagggcga ccaggaccgg 39720 ggtgtcggcc gggacatcgg tgccgccgag ggtgacggct tccgcggtga accggaaggt 39780 ggctgtactc accgcggcgt cgaagcggag cagttcgtcc agcgcggcgg ggatctcgtc 39840 ggggcttcca cgaaggcgat ccagctcggc cgggtgccgg agcagcgcga ggagggcatt 39900 gccaagggcg ttggtggtgg tctcgtgtcc ggctacgagc agcagcacgg ccatggagat 39960 cagttcctcc tcggtcaggt ggccgtctcc gtcgcgcgcc gagatgagcc ggtcgaggaa 40020 ggacctgccg gggtttttcc gtttgtcggc gaccaggtct gtcatgtagt cggccatcgc 40080 gtgcgaggca gcgtcgatga tgccgggctc tcccgccgcg aacagtccgg cggaccagcg 40140 ctggacatcg ggccggtcgg ccggcggcac tccgagcagt tcgcagatca cgatgaccgg 40200 cagtggcacc gccaggccgg ccacgacgtc gaacggcccg tgggcgggcc actggtccag 40260 aaggttgtca gtgatccggg cgatgaacgg gcgcagctgt gcgacggccc ctctggtgaa 40320 cgccttggtc accagtttgc gcagccgggt gtgccgcggc ggatcgctgg ccagcatggt 40380 tcgtgcgacc gcagggtgca gacggcgtcg cgatttcctg cctgcgaaga agacggcggt 40440 gtccttcgag agccgggcgt cgccgagggc ttcccgggct tccgcgtagc cggtgaccag 40500 gtagccgggg cgttcgccgg agccggcctg cagcggctgt accgggcatg cggagcgcag 40560 ggcgtcatac gtcggatagg ggttccggag gaagtggggg tccttcgtcg agtcagtcat 40620 ggtcccaggc tccaacgggt tccggcgggg catcgtcggc ggcacgcctc gcgtggcggg 40680 cgtaggcgga ggccaccagg aggagccact ccgtgtcccc cttcaggtct ttacggtagg 40740 cggcctccgc cgatcgcgga gcaacgttgt cggagaactc ggcccaccgc tcgccggttg 40800 ccgagtaacc ggttcgcccc gacatgcgca tcaccaggga gcggggtcgg gcccttactt 40860 ctgcgggcct ttcagcttct ggacgtgcgc ccagccccaa gagcccgtcg tcgcctccgt 40920 acccccgtcg catggtttcc atcgctgcgc tcatcgttca taccgtgccc ggcacaagct 40980 cagaccaagg gtccgcaccg gcagcgcacc ctgcgcgatg gccacctggc gcaaccttgc 41040 catcagagcg ctccgctcgg ccggagtgag gaacatcgcg gccagtctgc ggcacaaggc 41100 ccgtgacccg cgacggctcc tcgccgtccc cggcctcaac tgatcacaaa ccaggtgtca 41160 cgcgctacgc cgaggccctg acaactacgc tgagtttcat gactgacgtt tctctcggcg 41220 acgaactgac tcgccgcgtt gaagaccttg aagaacgagt ctctctgctc ttccaggaac 41280 acccagccgc tgagacgaac aacggggtcg gaacagctga tcatgtagct gaagccgagg 41340 acccgcgtca gctcgccacc aggctcctcg cacgttcgga tccggacgtg ttggcggctc 41400 gcctgaacgc gctcgggcac ccggtgcgtc tgcgtatcct gctcgcctgt cttgacggcc 41460 cgcgcagggc tgcggaactc gcagcgcaga ccgatatggg cagtacaggg cagatctacc 41520 atcacttgcg ccaactggtg aaccagggat ggctttcagc ctcacgcaga ggccactatg 41580 aagtaccccg cgaagccctt gaggttgtgg ctgcggttct tgcggcgacg ttctgggacg 41640 gcaacgccga ccccacatcg tccgaagctg cttcaggggc tcgcacgtcc gctgcatcct 41700 cgacgtgaac cccatgagtt tcggaccgat acgcggtcgt ggaggcccat cgggagacac 41760 atggtgaggc catacatgtg ccgccccgcc ctgcgggagc ggctgtccgg tccgggcgac 41820 cgcgaacttc tacgcacctg cccgggcccg gcagggcgac cacactctca gtgccgatcg 41880 ggcatgcccg atcgggtata ggcgttcggc catgccctta cgggcgaccg ggtggggtgc 41940 ccttctgcgc gcgctaagga gcgtctgagt agggcttttc ggtcttgatc ggacctggtc 42000 atgatgtgcc cgcaaggtga gctgttgctg tcgaagtcgc gccgtgagcc gcgaccgaac 42060 cgtctccgcg gctcccggcg tccgagcgca cttcagccgc gccatcgata ggggtaggaa 42120 attgtcgcag ggatctgcgc cttccacggt ccggacagcg gaccggctgc ccgcgacgat 42180 tcaccggctg gtggagtccc aggccgaacg tgcgccggac gccgtcgcgg tggagaccga 42240 ggacgggaag ctcacctacc gcgaactgga cgcgcgcgcc aaccagttcg cccgccacct 42300 ccggtcggcc ggcgtgcggg gggagtccct cgtcgccgtc cacatggagc ggggcctcct 42360 gacaccggtc gtcctgctcg gcatcctcaa aacgggcgcg gcctacctgc cgctcgacac 42420 cgaatcgccg gcggagcgcc tcgcggcggt gctcgccgac gcggcgcccg ccgcggtggt 42480 cacggccggc cccctgccgc cggtcgccgt cccgctgatc gacctcgaca ccgacctgcc 42540 cgcgatcacc gcactaccgg ccgagcccct cacggacgtc gaggagccgg gcccggaccg 42600 cctggcctac gtcatgttca cctccggctc caccggcgtc cccaagggcg tactggtgga 42660 gcaccgcgcc gtcatccggc tgatccggga gcagagctac gcgaggctcg gccccgatgc 42720 cacccatctg ctgctcgcgc cgctcgcctt cgacgcctcg accctggaga tctggggcgc 42780 cctggcgcac ggcggccggc tcgtcgtcgc ggccccggga gcgcgcaccg tcgaccagct 42840 cggccgcacc ctcgccgacc ggcgggtgac caccctgtgg ctgaccgcct cgctcttcaa 42900 cctcgtggtc gacgaggacc cgtccatcct cgccggggtc ggcgaccttc tcatcggcgg 42960 cgaggcgctg tcggtcaacc atgtccggac ggcacggaag gcgctgccgg acacggtcgt 43020 caccaacggc tacggcccga ccgagacgac cacgttcgcc tgcacccacg ccatccggcc 43080 gcaggacctc gacggcgcgt cgatcccgat cggcgggccc atcgcccaca ccgaagtcca 43140 tgtcctcgac gaggacttcg acccggtcgc gcccggcgag gcgggtgaac tgttcatcgg 43200 cggcccccgc ctggcgcgcg gctacctcaa ccggcccggc ctgaccgcgg agcgcttcgt 43260 cgcgcacccg gccgccaccg agcccgggtc ccggctctac cgcaccggcg accgggtcag 43320 ggtgcggccc gacggcaccc tggagtacct cgggcgcctg gacgaccagg tcaaactgcg 43380 cggcttccgg atcgaacccg gcgaggtccg cgccggactg accgggctgc cgcaggtcag 43440 ggacgcggtc gtcgtcgccc ggggcgggcc ctcggaccgg cgcctggtcg cctacgtggt 43500 gcccgaggcc gacgccacgg ccggcatgga caacgagcgg gagcaggtcg ccgactggga 43560 ggcggtgttc gacgagacct accgcgacgg tgtgggagcc gccgaaggac gctgggagct 43620 gagcgggtgg gtcggcagcg gtgacggcct gcccgtcccc gccgaccaga tgcgggaatg 43680 gacggacgcc acggtcgaac gcatccgcgc cctcggcgcc cggcgcgtgc tcgagatcgg 43740 ctgcggcacc ggcctgctcg cgatgcggct cgccccggac gcggaacgct atgtcggcag 43800 cgacctgtcc gccgtggcga tccgcaggct gcgcgcacag atggacgccg cgggactcga 43860 ccacaccgag ctggtgcacg cgccggccga cgacctcgac gcggttcccg gcggcacctt 43920 cgacgtcgtc gtactgaact ccatcgtgca gtacctgccg tcggcgcagt atctgcgcga 43980 ggtcatcgag cgcgcggccg cacggctggc acccggcggg cacctgttcg tcggagacgt 44040 gcgcagcctc ccgctcctcg acgccttcca cctcagcgcc gaactgaagc gcggccacga 44100 ggacgccgta cccctggcga cgctggccga ggcggtacgc gaacgggcgg cggccgagaa 44160 ggaactcgtc gtcgctccct ccttcttcac cgacctgagc ggccgggcag gcatcgacca 44220 cgtacgggtg accccgcggc gcggcaggca ccgcaacgag atgacgcagt tccgctacga 44280 cgcggtgctc cgggtgcgcg gagccgaacc ggcgcgcgtc cccgaccgct ggctcgactg 44340 gcgggacgaa ggactcactc tggaggacgt cgcacggatc ctccacgacc agcgcccgca 44400 gcacctggcg ctgcgcggtg tcaccgacgc gcgcgtcgcc gacgaggtgg cccgcctcgt 44460 ccggctgcgc gaggacgccg aagggaccgt ggccgcgctc cgggagaccg ggcacgacgg 44520 cccggccgtc gagatcgacg acgtgtacga cctggcggcc cgcgcctcct acaccgtgga 44580 cgtcagcgtc gcgggcagtg cggcgggcga tgccttcgac gtcatcctgt ggaccgacgc 44640 cgaaccggga ccggtcgcgt tcgcccccgg cccggcggaa gcacggggcc cccgcaccag 44700 catgccgctg gcgaccgcga cgtcgcggca cctcacgacg ctggtgcgcg attccctgcg 44760 cgagctcctg cccgcctaca tgatcccggc ggtgttcgtc ttcatggacg cgctgccgct 44820 cacctccacc ggcaagatcg accgctcggc cctgcccgag ccgccgcgcc ggacctcggc 44880 cggcggcgcg ggacggcggg ccgccaccgc gaccgaacgc gcactcgagc ccctgtggcg 44940 cgatctgctc gccctggaaa ccgtccacgt cgacgacgac ttcttcgcac tcggcggcca 45000 ctccctgctc ggcacccggc tgctgtcccg cgtccggggc ctgtggggtg tcgaactctc 45060 gctcgccgcc ctgttctccg ctcccaccct cggcgccctc gccgcccgca tcgactccgc 45120 gcgccaggac acgccggccc tccccggcac cctcgcggac aaggcggacc ccggatcggc 45180 gccaccgctg tccccggcac agcaccggct ctggctcgtc gaacagctca ccccgggcaa 45240 cccccgctac accgtgcccg tcgcctacag gatgcgcgga ccgatcgaca cggcggccct 45300 ccaggccgcc ctcgacaccc tcgtcgcccg acacgaggtg ctgcgcacca ccttcccctc 45360 ccacgacggc accccccgcc aggtcgtcgc cccctccgga cgcatcccca tcgaacgggc 45420 cgacgtcggc ggcgagggtg ccgacgcccc cgccgccgcc cacaacatcc tgacccggca 45480 ggcgagccgc tggctcgacg tgcagtccgg ccccctggcc gccgccaccc tcgtccggct 45540 cgccgaggac gaccacgtcc tgtgcctgac tctccatcac atgatctgcg acggctggtc 45600 cctggacctc ctcgcggccg aactgagcga gggctacaac gcccgcgtcg cccgccggac 45660 accgcagctg ccggagatcc accaccacac caggacccga taccggaccg gacaccccct 45720 ccaacagcac cacctcacgg aaccggccct catacgcagc ctgagtcacc accagcccca 45780 caccggcatc cgccaacatg aactcgacac ggtccgccgg atagccagga tcttgccgag 45840 ccccagcagc accacgtgcc ccccgaggcc tcggggcggg cggcgcagcg aggaggcggt 45900 gcgcagggtg ccgaacgcct ccagaacggc ggcgatcagc agcggcagca ggagcagacc 45960 ggcgatgccg gagaggatcc gcgtccacca gctcgtcccg gtgcaggcct cgctcgaagc 46020 ggcgttcatg gaactcacca ggacgagcgt cgaataccag gcgcgcacct ccaccggggc 46080 accggacgcg gaggtccgct catgaagccc gctgcttccg ccgccggcgc tcccggagca 46140 cgcgccgcgc ccctgacctt cggcggcatg ctccgctcgg agtggacgaa gatctacgag 46200 ggcgaggtcc gctacgcccg cgacgccttc gagggcctgc gcctcatgga cgcgctcatc 46260 ggcgtcaagc gcggcgtgcc gggcgcggcc ctgcccgagc tgaagcagcg ccgggtcccg 46320 aagaaggatg tggccgtcct ggaggtcgag gagcccgagg gctcggtgcg ctcggacgtc 46380 tccatcacca acccgatcgc cgcgctcggg ctgggcgaca tccggggcgc cgcccagcag 46440 tcgaaccgga tgctggaccg taccgtccgc tcgctggcct ccggcgaggg cgacgccccg 46500 gcccgggtgg gctcctcgct ggtcgagctg cgccgcacgg tggaggacct cgacccgcgc 46560 gacaccccgg ggcgcggggc gcggcggctg ctgtcgcggc tgccgggcgg gcggcggctg 46620 cgcgaccatg tcgcgaagta cgcctccgcc aagggcaccc tcgaccggat cc

<210> 2

<211> 562

<212> Proteína <213> Artificial <220>

<223> Orf1

<400> 2

Val His Arg Ala Glu Asp Leu Arg Ala Ala Pro Asp Ser Pro His Arg

5 10 15

Glu Val Ala Val Lys Thr Val Leu Arg Gly Arg Thr Gly Ile Ala Val 20 25 30 Asp Thr Ser Gly Ser Thr Arg Glu Phe Asp Arg Phe Arg Arg Glu Val 35 40 45 Arg Ile Met Arg Met Leu Ser Gln Gly His Pro Asn Leu Thr Arg Leu 50 55 60

Ile Asp Gly Gly Val Asp Gly Thr Pro Gly Gly Ser Gly Leu Pro Tyr

70 75 80

Leu Ala Met Glu Leu Leu Asp Gly His Pro Leu Ala Asp Leu Ile Asp

85 90 95 Glu Glu Pro Gln Leu Pro Val Ser Trp Val Ala Ala Ile Gly Ala Gln 100 105 110 Ile Ala Ala Gly Leu Thr Ala Ala His Thr Ala Gly Val Val His Arg 115 120 125 Asp Leu Lys Pro Ala Asn Val Met Leu Thr Ser Asn Gly Thr Val Lys 130 135 140

Ile Leu Asp Phe Gly Met Gly Ser Ile Val Asp Asp Pro Asp Gln Thr

145 150 155 160

Arg Leu Thr Ser Thr Gly Val Ser Val Gly Thr Ala Arg Tyr Met Ala 165 170 175 Pro Glu Gln Phe Arg Ala Glu His Val Ser Gly Leu Ala Asp Leu Tyr 180 185 190 Ala Leu Gly Cys Ile Leu Tyr Glu Leu Leu Val Gly Gln Pro Pro Phe 195 200 205 Ser Ala Arg Thr Pro Tyr Glu Leu Ser Glu Gln His Gln His Ser Gln

210 215 220

Pro Pro Leu Leu Thr Leu Val Arg Pro Asp Leu Pro Val Glu Leu Val

225 230 235 240

Arg Leu Val Asp Arg Leu Leu Glu Lys Lys Ala Glu Leu Arg Pro Glu

245 250 255 Asn Ala Ala Leu Leu His Asp Val Leu Val Pro Leu Ala Gln Ala Ala 260 265 270 Asp Asp Thr Ala Ala Leu Leu Ala Pro His Trp Val Ala Met Asp Pro 275 280 285 Val Ala Arg Leu Arg Thr Leu Leu Pro Glu Arg Thr Pro Ala Ala Pro 290 295 300

Val Pro Val Pro Arg Arg Glu Pro Arg Leu Pro Glu Thr Met Asp Val

305 310 315 320

Phe Gly Ile His Ala Asp Leu Ile Asn Glu Tyr Glu Ser Phe Thr Lys

325 330 335 Ser Ala Thr Val Phe Arg Asp Ala Arg Ile Glu Gly Phe Val Lys Asp 340 345 350 Asp Leu Ala Ala Lys Ser Gln Trp Pro Asp Pro Trp Leu Ser Leu Asn 355 360 365 Pro Phe Phe Ala Asp Gly Gly Lys Val Thr Asp Leu Val Gln Glu Gly

370 375 380

Leu Leu His Pro Lys Cys Ala Glu Ile Phe Gln Ala Gly Lys Lys Glu 385 390 395 400

Ser Ser Arg Arg Pro Asp Gly Arg Pro Leu Thr Phe His Leu His Gln 405 410 415 Arg Gln Ala Ile Glu Ala Ala Gln Ala Gly Asp Ser Tyr Val Leu Thr 420 425 430 Thr Gly Thr Gly Ser Gly Lys Ser Leu Ala Tyr Ile Val Pro Ile Val 435 440 445 Asn His Val Leu Lys Glu Arg Gln Ala Ala Ser Leu Arg Ala Gly Gly

450 455 460

Asn Gly Thr Gly Gly Arg Thr Glu Gly Gly Arg Val Arg Ala Ile Val

465 470 475 480

Val Tyr Pro Met Asn Ala Leu Ala Asn Ser Gln Leu Met Glu Leu Glu 485 490 495 Lys Tyr Leu Arg His Gly Phe Gly Ala Gly Gln Glu Pro Val Thr Phe 500 505 510 Ala Arg Tyr Thr Gly Gln Glu Ser Glu Glu Arg Arg Lys Glu Leu Arg 515 520 525 Lys Asn Pro Pro Asp Ile Leu Leu Thr Asn Tyr Val Met Leu Glu Leu 530 535 540

Met Leu Thr Arg Pro Asp Asp Arg Ser Ser Leu Ile Arg Gln Glu Arg

545 550 555 560

Val Val

<210> 3

<211> 185

<212> Proteína <213> Artificial

<220>

<223> Orf2

<400> 3 Leu Asp Ala Ala Glu Gly Val Phe Ala Glu Arg Gly Ile Asp Ala Ala 1 5 10 15 Arg Ile Asp Glu Ile Thr Glu Arg Ala Gly Val Ala Ile Gly Thr Phe 20 25 30 Tyr Leu His Phe Ser Ser Lys Arg Asp Ile Met Ala Ala Val Gln Ala 35 40 45 Arg Phe Val Asp Arg Leu Val Glu Arg Gln His Ala Ala Ala Gln Asn 50 55 60

Leu Pro Ala Asp Asp Trp Ile Gly Arg Val Asp Ala Trp Leu Ser Asp

70 75 80

Ala Val Arg Ile Tyr Val Glu His Ala Gln Leu His Asp Val Leu Tyr

90 95 Gly His Thr Pro Ile Asn Ala Thr Thr Glu Ile Gln Val Ser Pro Glu 100 105 110 Asn Ala His Val Glu Ala Phe Arg Arg Leu Ile Ala Glu Arg Pro Asp 115 120 125 Pro Pro Ala Asp Ala Pro Asn Pro Ala Met Ala Ala Leu Leu Ile Tyr

130 135 140

Ser Ala Trp Tyr Gly Gly Thr His Ala Leu Leu His His Glu Ala Asp

145 150 155 160

Asp Ile Asp Arg Leu Ala Asp Glu Leu Ile Ala Asp Ile Thr Asp Leu 165 170 175 Ala His Arg Tyr Leu Arg Ser Glu Arg 180 185

<210> 4

<211> 577

<212> Proteína <213> Artificial

<220>

<223> Orf3

<400> 4

Leu Tyr Arg Leu Trp Pro Asn Pro Lys Leu Leu Ala Arg Leu Trp Leu 1 5 10 15

Leu Thr Ala Ala Gln Ala Val Leu Gln Gly Leu Leu Leu Ala Leu Leu 20 25 30 Ile Pro Ile Leu Asp Ala Val Val Arg Pro Glu Pro Asp Ile Gly Ala 35 40 45 Ala Thr Thr Trp Leu Val Leu Gly Ala Val Gly Ala Ala Leu Tyr Ala 50 55 60 Val Leu Ser Ile Val Ala Thr Pro Val Gly Phe Ala Ala Ala Gly Ala 65 70 75 80 Leu Ala Ala Gln Leu Arg Arg Arg Leu Met His His Val Ser Thr Leu 85 90 95 Thr Leu Gly Trp Phe Thr Ala Glu His Lys Ala Arg Leu Ala Arg Ala 100 105 110 Val Thr Ala Asp Val Gly Ser Ala Ala His Leu Ala Val Thr Ile Gly 115 120 125 Gly Pro Val Ile Thr Ser Thr Leu Leu Pro Ala Thr Val Val Val Val

130 135 140

Thr Phe Thr Val Asp Trp Arg Met Ala Leu Leu Leu Cys Ala Ile Ala 145 150 155 160

Leu Val Ala Tyr Leu Ala Leu Arg Arg Ala Ala Arg Ile Ser Glu Ile 165 170 175 Ala Glu Ile Glu Leu Glu Arg Ala Ala Ala Gly Val Ala Ser Arg Ala 180 185 190 Ile Glu Leu Gly Gln Ala Gln Pro Val Leu Arg Ala Ala Gly His Ala 195 200 205 Glu Gly Thr Pro Arg Met Arg Ala Ala Leu Glu Asp His Arg Glu Thr

210 215 220

Tyr Arg Glu Gly Met Arg Arg Ala Arg Gln Pro Phe Phe Val Tyr Thr

225 230 235 240

Gly Val Ile Met Ala Gly Phe Val Ala Val Leu Ala Leu Thr Ala Glu 245 250 255 Leu Val Leu Ser Gly Arg Ile Asp Ala Ala Thr Ala Ile Val Leu Leu 260 265 270 Val Leu Ala Ala Arg Phe Leu Glu Pro Leu Gly Asn Leu Ile Glu Leu 275 280 285 Ile Gly Ala Leu Arg Ala Leu Gly Asn Gln Ile Ala Arg Ile Glu Glu 290 295 300

Leu Leu Ala Thr Glu Ala Leu Pro Val Pro Ala Glu Pro Val Arg Arg

305 310 315 320

Ile Asp Glu Ala Glu Val Glu Phe Ala Asp Val Thr Phe Thr Tyr Pro 325 330 335 Gly Gly Asp Thr Pro Ala Leu Arg Asn Val Ser Leu Arg Cys Pro Ala 340 345 350 Gly Ser Thr Thr Ala Leu Val Gly Pro Ser Gly Ser Gly Lys Thr Thr 355 360 365 Ala Thr Arg Leu Ile Ala Arg Phe Phe Asp Ile Asp Ser Gly Glu Leu 370 375 380

Arg Ile Gly Gly Val Asp Val Arg Lys Leu Asp Pro Thr Thr Leu Leu 385 390 395 400

Asp Glu Ile Ala Ile Val Phe Gln Asp Val Tyr Leu Phe Asp Asp Thr

405 410 415 Ile Glu Asp Asn Leu Arg Leu Ala Arg Pro Glu Ala Thr Trp Asp Glu 420 425 430 Leu Arg Glu Ala Ala Thr Ala Ala Arg Leu Asp Glu Val Ile Glu Arg 435 440 445 Leu Pro Ala Gly Trp Gln Thr Arg Val Gly Glu Gly Gly Ala Gln Leu 450 455 460

Ser Gly Gly Glu Arg Gln Arg Val Ala Ile Ala Arg Ala Met Leu Lys 465 470 475 480

Arg Ala Arg Ile Val Leu Val Asp Glu Ala Ser Ser Ala Leu Asp Pro 485 490 495 Glu Asn Glu Ala Ala Ile Thr Arg Ala Ile Ala Asn Leu Gly Ala Asp 500 505 510 Pro Asp Arg Thr Val Ile Val Ile Ala His Arg Pro Ala Thr Leu Glu 515 520 525 Ala Ala Asp Leu Val Val Ser Leu Asp Gly Gly Arg Val Val Glu Ser

530 535 540

Gly Thr Arg Glu Glu Leu Leu Arg Thr Gly Gly Thr Phe Ala Arg Leu

545 550 555 560

Ser Ser Gln Tyr Glu Arg Ala Arg His Trp Arg Ile Ala Ser Gly Ser 565 570 575

His <210> 5

<211> 586

<212> Proteína <213> Artificial

<220>

<223> Orf4

<400> 5

Met Asn Arg Ser Thr Ser Gln Thr Pro Gly Leu Ala Gly Pro Gly Ala 1 5 10 15

Leu Met Arg Leu Leu Ala Pro Ile Arg Thr His Leu Val Ile Cys Ala 20 25 30 Val Leu Ser Cys Leu Ser Ala Ala Ala Gly Ile Val Pro Tyr Ile Ala 35 40 45 Val Ala Glu Ile Ala Arg Leu Met Leu Asp Asp Pro Ala Gly Ser His 50 55 60

Thr Ala Ile Trp Thr Trp Val Gly Val Gly Ala Ala Gly Ala Gly Val

70 75 80

Trp Leu Val Leu Phe Val Gln Ser Ser Arg Val Gly His Tyr Ala Asp

90 95 Ala Ala Ile Leu His Asp Val Arg Val Arg Ile Val Thr His Leu Gly 100 105 110 Lys Leu Pro Leu Gly Trp Phe Arg Ala Val Gly Ser Gly Lys Val Lys 115 120 125 Arg Ala Met Thr Gly Asp Leu Glu Glu Met His Glu Val Ile Ala His 130 135 140

Ala Leu Gly Gln Leu Val Gly Ala Val Thr Val Met Thr Val Ala Thr

145 150 155 160

Gly Tyr Leu Leu Leu Val Asp Val Arg Met Ala Phe Val Val Leu Gly

165 170 175 Val Leu Leu Leu Met Gly Ile Phe Phe Arg Val Ser Met Arg Ser Met 180 185 190 Thr Thr His Met Asn Arg Leu Val Leu Ala Asp Ala Arg Ile Ser Ala 195 200 205 Ala Ser Val Glu Tyr Ala Asp Gly Ile Gln Val Val Lys Thr Phe Gly

210 215 220

Thr Gly Gly Arg Val Leu Arg Arg Phe Asp Asp Ala Val Asp Glu His 225 230 235 240

Thr Gln Ala Phe Ala Ala Trp Val Ala Glu Val Arg His Ser Ser Ala 245 250 255 Ala Ser Arg Leu Phe Gly Ser Glu Met Ala Val Leu Thr Ala Val Ala 260 265 270 Ala Val Gly Leu Val Leu Val Asp Arg Gly Ser Leu Gly Val Ala Asp 275 280 285 Leu Val Ala Phe Leu Val Val Ala Ile Gly Leu Pro Asn Ser Ile Leu 290 295 300

Pro Ala Val Thr Ala Ala Gln Gly Val Arg Lys Gly Arg Met Gly Ala

305 310 315 320

Ala Asn Ile Glu Gln Leu Leu Ala Arg Thr Pro Leu Pro Glu Pro Arg

325 330 335 Glu Pro Gln Glu Pro Val Gly His Gly Val Glu Phe Asp Arg Val Ser 340 345 350 Phe Ser Tyr Asp Gly Val Thr Asn Ala Val Glu Asp Ile Ser Ala Val 355 360 365 Cys Pro Pro Gly Arg Val Thr Ala Ile Val Gly Pro Ser Gly Ala Gly

370 375 380

Lys Thr Thr Leu Ala Gly Leu Leu Pro Arg Phe Tyr Glu Val Ser Arg

385 390 395 400

Gly Ser Ile Arg Ile Gly Gly Val Asp Val Arg Ser Ile Pro Ser Ala 405 410 415 Lys Leu Leu Ala Ser Met Ser Leu Val Phe Gln Asp Val Ala Leu Leu 420 425 430 Arg Asp Ser Val Ala Glu Asn Ile Arg Ile Gly Arg Pro Gly Ala Ser 435 440 445

Asp Ala Glu Val Arg Glu Ala Ala Ala Ala Ala His Ile His Asp Val 450 455 460

Ile Glu Ser Met Pro Asp Gly Tyr Asp Thr Leu Leu Asp Ala Gly Gly

465 470 475 480

Gly Ser Leu Ser Gly Gly Glu Arg Gln Arg Leu Thr Ile Ala Arg Ala 485 490 495 Ile Leu Ser Gly Ala Pro Ile Val Val Leu Asp Glu Ala Thr Ala Ser 500 505 510 Leu Asp Ala Asp Ser Glu Ser Ala Val Gln Glu Ala Leu Ala Asn Leu 515 520 525 Ala Val Gly Lys Thr Val Ile Val Ile Ala His Arg Leu His Thr Ile 530 535 540

Ala Gly Ala Ala Gln Ile Leu Val Leu Glu Asn Gly Arg Leu Val Glu 545 550 555 560

Gln Gly Arg His Glu Glu Leu Leu Ala Arg Asp Gly Leu Tyr Ala Arg

565 570 575 Met Trp Ala Ala Gln Glu Gly Val Leu Ala 580 585

<210> 6

<211> 322

<212> Proteína <213> Artificial

<220>

<223> Orf5

<400> 6

Leu Ile Ala Leu Ala Ala Val Gly Ala Thr Gly Cys Gly Gly Gly Asp

5 10 15

Ala Asp Ala Arg Gln Lys Gly Glu Lys Lys Thr Ile Ala Phe Glu Ser 20 25 30 Cys Ser Arg Thr Val Glu Leu Asp Arg Ile Pro Lys Arg Val Ala Ile 35 40 45 Thr Thr Asp Ala Ile Ala Asp Thr Leu Phe Glu Leu Gly Val Gly Asp

55 60

Arg Ile Val Ala Lys Thr Arg Gly Glu Ser Ala Pro Ala Pro Glu Leu 65 70 75 80

Lys Glu Arg Leu Ala Ala Leu Pro Ser Leu Gly Thr Arg Asn Pro Ser

90 95 Val Glu Ala Leu Val Gly Ala Lys Pro Asp Leu Leu Ile Thr Asp Gln 100 105 110 Val Glu Lys Val Ser Gly Lys Leu Gly Ser Pro Ser Ile Ala Glu Leu 115 120 125 Glu Arg Leu Gly Ile Ala Thr Tyr Val Val Gly Gly Gly Cys Ala Ala 130 135 140

Asp Leu Ser Glu Asp Thr Ser Gly Leu Glu Ala Leu Asp Gly Asp Ile 145 150 155 160

Arg Gln Leu Gly Thr Val Phe Gly Val Glu Ala Arg Ala Arg Lys Leu 165 170 175 Ala Asp Lys Leu Asn Gly Ser Leu Asp Asp Val Arg Arg Gln Thr Ala 180 185 190 Gln Glu Pro Arg Thr Lys Val Ala Lys Leu Ser Gln Val Ala Gly Gln 195 200 205 Leu Tyr Val Thr Ser Gly Gly Leu Ser Asp Asp Val Ile Glu Arg Ala 210 215 220

Gly Gly Thr Asn Val Phe Ala Asp Leu Pro Gly Gln Phe Ala Pro Val

225 230 235 240

Ser Pro Glu Gln Ile Val Ala Arg Asp Pro Gln Ser Ile Ile Val Asp

245 250 255 Asn Phe Thr Ala Thr Ala Ala Gly Glu Asn Glu Ala Met Ala Tyr Leu 260 265 270 Lys Arg Thr Phe Pro Thr Val Glu Ala Val Lys Lys Gln Arg Val Leu 275 280 285

Val Ile Asp Ala Ala Lys Ser Gly Ala Arg Gly Ser Thr Arg Pro Val 290 295 300

Glu Gly Val Val Glu Ile Ala Arg Phe Leu His Pro Ser Thr Ser Arg

305 310 315 320

Ser Gln <210> 7

<211> 327

<212> Proteína <213> Artificial

<220>

<223> Orf6

<400> 7 Leu Phe Val Val Leu Leu Gly Ala Val Cys Leu Ala Val Ser Val Gly 1 5 10 15 Ser Val Gly Val Arg Pro Ala Thr Val Val Lys Val Ile Ala Asp His

20 25 30 Leu Ile Gly Val Gly Thr Pro Ala Ala Val Met Asp Asp Gln Ile Val 35 40 45 Trp Asn Leu Arg Leu Pro Arg Val Leu Leu Ala Ala Ala Val Gly Gly

55 60

Gly Leu Ala Val Val Gly Val Thr Leu Gln Ala Thr Val Arg Asn Pro 65 70 75 80

Leu Ala Asp Pro Tyr Val Leu Gly Val Ser Ala Gly Ala Gly Leu Met

90 95 Ala Ser Ile Val Ile Thr Leu Gly Ser Val Ala Val Ala Gly Leu Ser 100 105 110 Thr Ser Ala Ala Ala Phe Val Gly Ala Leu Val Ala Met Val Ala Val 115 120 125 Leu Gly Leu Ser Arg Arg Ala Gly Arg Val Ile Pro Ser Arg Leu Leu 130 135 140

Leu Ala Gly Val Thr Leu Ser Tyr Leu Phe Ser Gly Ala Thr Ser Phe 145 150 155 160

Val Ile Phe Arg Ser Gly Asn Ala Asp Ala Ala His Ser Val Leu Phe 165 170 175 Trp Leu Leu Gly Ser Leu Ser Glu Ala Ser Trp Ser Asn Leu Ser Leu 180 185 190 Pro Ala Ala Ala Val Leu Val Val Gly Val Tyr Leu Met Leu Gln Ala 195 200 205 Arg Thr Leu Asn Ala Leu Ala Ala Gly Asp Asp Ala Ala Leu Ser Leu 210 215 220

Gly Val Ala Val His Arg Val Arg Ile Arg Leu Leu Val Val Ala Ser

225 230 235 240

Leu Leu Thr Gly Val Leu Val Ala Val Ser Gly Gly Ile Gly Phe Val

245 250 255 Gly Leu Ile Val Pro His Leu Val Arg Leu Val Met Gly Pro Asp His 260 265 270 Arg Arg Leu Leu Pro Val Ala Met Leu Val Gly Ala Val Tyr Leu Val 275 280 285 Val Val Asp Leu Leu Cys Arg Val Leu Val Arg Pro Glu Glu Leu Pro 290 295 300

Ile Gly Ile Val Thr Ala Val Leu Gly Ala Pro Val Phe Leu Trp Leu 305 310 315 320

Leu Arg Arg Ser Glu Ala Ser

<210> 8

<211> 261

<212> Proteína <213> Artificial

<220>

<223> Orf7

<400> 8 Leu Ser Gly Val Ser Val Arg Ile Asp Ser Ser Pro Ile Val Ala Glu 1 5 10 15 Cys Asp Leu Thr Val Glu Asp Gly Glu Arg Val Gly Leu Val Gly Pro 20 25 30 Asn Gly Ser Gly Lys Thr Ser Leu Leu Arg Thr Val Tyr Arg Ala Leu 35 40 45 Asp Pro Phe Ala Gly Ser Val Ala Leu Ser Gly Asp Asp Val Thr Thr

55 60

Leu Ser Gln Arg Glu Val Ala Arg Arg Ala Ala Leu Val Ala Gln Asp

70 75 80

Ser Thr Pro Asp Phe Asp Phe Thr Val Glu Glu Val Val Ala Met Gly

85 90 95 Arg Gly Pro Trp Leu Arg Ala Phe Glu Ser Thr Thr Gly Gly Gly Asp 100 105 110 Ala Thr Ile Thr Ala Ala Leu Glu Arg Val Arg Leu Ala Asp His Arg 115 120 125 Ala Arg Arg Leu Ser Thr Leu Ser Gly Gly Glu Arg Gln Arg Ala Leu 130 135 140

Val Ala Arg Ala Leu Ala Gln Glu Ser Pro Leu Leu Leu Leu Asp Glu

145 150 155 160

Pro Thr Asn His Leu Asp Val His Ala Ala Leu Glu Leu Leu Glu Leu 165 170 175 Val Lys Asp Leu Glu Arg Ala Thr Leu Cys Val Leu His Asp Leu Asn 180 185 190 Leu Ala Ala Ala Tyr Cys Asp Arg Ile Tyr Val Leu His Gly Gly Arg 195 200 205 Ile Val Ala Asp Gly Pro Pro Ala Glu Val Leu Asp Pro Glu Leu Val

210 215 220

Arg Thr Val Phe Gly Val Thr Cys Thr His Leu Thr His Pro Val Thr

225 230 235 240

Gly Gly Leu Leu Leu Ala Phe Ser Pro Gln Gln Pro Val Arg Ser Pro

245 250 255 His Ser Val Lys Gly 260

<210> 9

<211> 168

<212> Proteína <213> Artificial

<220>

<223> Orf8

<400> 9

Val Ile Asp Arg Val Asn Val Ile Pro Leu Thr Glu Arg Gln Asn Glu 1 5 10 15

Asp Glu Ala Gln Ala Ala Ile Ile Gln Ile Asp Glu Val Leu Thr Leu 20 25 30 Leu Gln Glu Leu Gly Asp Val Ala Arg Arg Ala Arg Met Asn Leu Val 35 40 45 Ala Pro Arg His Arg Gly Thr Arg Val Thr Ala Ala Gln Val Pro Ser

55 60

Ser Gly Thr Ser Arg Thr Pro Ala Arg Pro Glu Gly Glu Pro Ala Gln 65 70 75 80

Gln Pro Ala Ala Pro Gly Gly Gly Pro Gly Ala Asp Gln Lys Pro Leu 85 90 95 Pro Leu Thr Arg Arg Gln Lys Glu Val Leu Asn Leu Leu Ala Gln Gly 100 105 110 Leu Ser Asn Arg Arg Ile Gly Arg Ala Leu His Ile Thr Glu Gln Thr 115 120 125

Val Lys Ala His Leu His Met Val Tyr His Lys Leu Gly Val Ala Asp 130 135 140

Arg Thr Glu Ala Val Val Ile Ala Leu Arg Gln Gly Leu Val Gln His 145 150 155 160

Gln Arg Arg Glu Asp Pro Pro Ser 165

<210> 10

<211> 342

<212> Proteína <213> Artificial

<220>

<223> Orf9

<400> 10 Val Ile Ser Ala Ile Phe Gly Thr Leu Ala Thr Gln Ala Val Gly Ala 1 5 10 15 Ala Ala Arg Leu Glu Leu Ala Asp Arg Ile Gly Glu Ser Gly Ala Asp

20 25 30 Thr Asp Glu Leu Ala Leu Ala Cys Gly Val Pro Ala Glu Gln Leu Gly 35 40 45 Arg Leu Leu Arg Ala Leu Ala Ser Leu Gly Leu Cys Val Glu Ser Glu 50 55 60

Pro Gly Arg Phe Ala Leu Thr Glu Ala Gly Ala Leu Leu Arg Arg Asp

70 75 80

Asp Pro Ala Ser Leu Leu Ala Phe Ala Ala Phe Leu Thr His Asp Val

90 95 Phe Gln Arg Asn Trp Leu Asn Leu Gln Glu Ser Leu Asp Thr Gly Leu 100 105 110 Pro Ala Phe Asp Thr Ala Phe Gly Leu Pro Val Tyr Asp Tyr Leu Ser 115 120 125 Gly Arg Pro Glu Leu Ala Ala Leu Phe His Ala Ala Met Ser Arg Arg

130 135 140

His Arg Pro Leu Glu Met Ala Ala Ala Ile Ser Ala Val Tyr Asp Leu 145 150 155 160

Gly Arg Phe Ser Thr Val Val Asp Val Gly Gly Gly Asp Gly Thr Leu 165 170 175 Leu Ala Ala Phe Leu Asp Arg Tyr Pro His Leu Thr Gly Thr Val Leu 180 185 190 Glu Thr Glu Ala Gly Ala Ala Arg Ala Arg Glu Thr Ile Ala Gly Ser 195 200 205 Gly Leu Thr Asp Arg Cys Arg Ala Val Ala Gly Asp Phe Phe Ala Glu 210 215 220

Val Pro Lys Gly Ala Asp Leu Tyr Leu Ile Lys Asn Val Val Leu Asn 225 230 235 240

Trp Asp Asp Glu Arg Ala Trp Thr Ile Leu Arg Arg Val Arg Asp Ala 245 250 255 Met Pro Asp His Gly Lys Leu Leu Ile Ala Glu Pro Val Leu Pro Asp 260 265 270 Thr Ala Asp Ala Asp Ser Leu Asn His Ala Ala Leu Glu Asn Pro Tyr 275 280 285 Leu Thr Asp Leu His Met Leu Val Thr Ile Gly Gly Arg Gln Arg Thr

290 295 300

Arg Ala Glu Tyr Thr Ala Ile Cys Ala Arg Ala Gly Leu Arg Val Thr

305 310 315 320

Glu Val Val Pro Leu Ala Gln Glu Leu Asn Ala Ser Leu Ile Glu Val

325 330 335 Val Pro Asp Ile Ala Ala

<210> 11

<211> 532

<212> Proteína <213> Artificial

<220>

<223> Orf10

<400> 11

Met Lys Arg Asp Asp Gln Asp Ala Glu Glu Glu Ser Gly Pro Gly Ser

5 10 15

Pro Thr Pro Pro Ser Ser Arg Val Thr Pro Asp Asp Ile Arg Tyr Glu 20 25 30 Asn Leu Arg Arg Ser Tyr Asn Tyr Arg Phe Ile Ala Lys Pro Asp Tyr 35 40 45 Phe Arg Leu Val His Ser Pro Arg Gln Val Glu Glu Ala Val Arg Glu 50 55 60

Ala Val Thr Ala Gly Lys Arg Ile Thr Val Arg Ser Arg Gly His Cys 65 70 75 80

Gly Glu Asp Phe Val Ala Ala Pro Asp Val Glu Val Ile Leu Asp Leu 85 90 95 Ser Pro Met Ser Arg Val Asp Tyr Asp Arg Glu Arg Asn Ala Phe Val 100 105 110 Ile Glu Ala Gly Ala Pro Val Gly Lys Met Leu His Thr Leu Phe His 115 120 125 Asn Trp Gly Val Thr Val Pro Ala Gly Phe Cys Met Gly Val Gly Ala 130 135 140

Gly Gly His Ile Ser Gly Gly Gly Tyr Gly Pro Leu Ser Arg Leu Leu 145 150 155 160

Gly Leu Ser Val Asp His Leu Tyr Ala Val Glu Val Val Val Val Asp

165 170 175 Gln Asp Arg Asn Val Ser Thr Val Val Ala Thr Arg Glu Lys Thr Asp 180 185 190 Pro Asn Arg Asp Leu Trp Trp Ala His Thr Gly Gly Gly Gly Gly Asn 195 200 205 Phe Gly Val Ile Thr Arg Tyr Trp Met Arg Ser Pro Glu Ala Ser Gly

210 215 220

Ala Glu Pro Ala Gly Leu Leu Pro Arg Pro Pro Gly Ala Leu His Ile 225 230 235 240

Ala Glu Val Ser Trp Pro Trp Asp Arg Leu Thr Gly Ala Asp Phe Val

245 250 255 Arg Leu Val Gly Asn Phe Met Asp Trp Gln Ile Ala Asn Ser Ala Val 260 265 270 Asp Ser Ala Asp Ala Asp Leu Tyr Ala Leu Leu Asp Cys Pro His Arg 275 280 285 Ser Ala Gly Asp Ile Thr Leu His Ala His Leu Pro Glu Glu Ala Pro 290 295 300

Arg Ala His Ala Arg Met Asp Ala Phe Leu Ala Ala Leu Gly Ala Gly

305 310 315 320

Val Gly Ile Ala Pro Thr Val Arg Arg Thr Ser Leu Pro Trp Leu Ala

325 330 335 Ala Ser Gln Tyr Leu Ala Val Pro Glu Thr Gly Pro Ala Ala Ile Gly 340 345 350 Leu Arg Cys Lys Val Lys Ser Ala Asp Leu Arg Ala Pro His Arg Pro 355 360 365 Asp Gln Leu Ala Ala Leu His Arg His Leu Thr Arg Asp Asp Tyr Arg

370 375 380

Gly Thr Tyr Ala Ala Val Glu Tyr Ile Ala Tyr Gly Gly Arg Val Asn

385 390 395 400

Ala Val Pro Pro Glu Ala Thr Ala Ile Pro Arg Gly Ala Leu Leu Lys

405 410 415 Thr Phe Tyr Met Val Thr Trp Lys Asp Pro Ala Glu Asp Asp Arg His 420 425 430 Leu Arg Trp Ile Arg Glu Leu Tyr Arg Asp Met His Arg Ala Thr Gly 435 440 445 Gly Val Pro Val Pro Asp Glu Val Asn Thr Gly Ala Tyr Ile Asn Tyr 450 455 460

Ala Asp Val Asp Leu Ala Asp Pro Glu Trp Asn Thr Ser Gly Val Pro 465 470 475 480

Trp His Thr Leu Tyr Tyr Gly Asp Asn Tyr Pro Arg Leu Gln Glu Val 485 490 495 Lys Ala Glu Trp Asp Pro Leu Asp Ile Phe His His Ala Leu Ser Ile 500 505 510 Ser Ala Pro Glu Ala Gly Arg Pro Glu Ser Gly Pro Arg Thr Pro Ala 515 520 525 Gln Arg Glu Trp 530

<210> 12

<211> 544

<212> Proteína <213> Artificial

<220>

<223> Orf11

<400> 12 Leu His Ser Ser Thr Ala Val Arg Asn Ser Thr Ala Ile Arg Lys Arg 1 5 10 15 His Arg Met Ser Ser Pro Ala Gly Gly Pro Val Phe Ala Asp Arg Val

20 25 30 Leu Asn Asp Trp Leu Ser Ser Val Gly Leu Gly Ile Glu Tyr Ser Arg 35 40 45 Ala Glu Ala Asn Thr Leu Tyr Tyr Val Asp Glu Asp Gly Gln Glu Val

55 60

Pro Val Leu Asp His Val Cys Gly Phe Gly Ser Leu Ile Phe Gly His 65 70 75 80

Asn Asn Pro Asp Ile Val Ala His Ala Lys Thr Val Leu Asp Arg Gln 85 90 95 Val Pro Val Tyr Val Gln Leu Ser His Gln Ser His Ala Asn Asp Ile 100 105 110 Ala Ala Val Leu Asn Ala Ile Leu Arg Arg Glu Ile Pro Gly Gly Asp 115 120 125 Gly Asp Tyr Ala Ala Ile Phe Ala Asn Ser Gly Ala Glu Ala Val Glu 130 135 140

Ile Cys Val Lys His Ala Glu Leu Glu Arg Arg Ala Arg Val Ala Lys 145 150 155 160

Leu Thr Asp Glu Thr Ser Arg Asn Ala Glu Glu Ala Arg Glu Ala Val

165 170 175 Gly Ala Gly Arg Ala Thr Val Ala Glu Asn Pro Tyr Val Arg Phe Asp 180 185 190 Gly Ala Ser Gly Glu Ser Asp Arg Leu Glu Gln Leu Leu Ala Glu Ala 195 200 205 Gly Arg Arg Asn Ala Glu Leu Ile Ala Arg Gly Pro Val His Leu Ala 210 215 220

Leu Glu Asn Ala Phe His Gly Lys Leu Val Ala Ser Ile Gln Leu Thr

225 230 235 240

Gln Asn Pro His Trp Arg Leu Pro Phe Thr Ser Leu Ala Ser Ser Thr

245 250 255 Arg Phe Leu Arg Ala Asp Arg Pro Asp Glu Met Lys Ala Ala Val Glu 260 265 270 Glu Leu Arg Thr Ser Leu Leu Asp Val Leu Val Asp Asp Gly Val Val 275 280 285 Thr Val Val Glu Arg Asp Phe Pro Leu Val Gly Ala Phe Phe Val Glu 290 295 300

Pro Val Gln Gly Ala Asn Gly Met Arg Pro Leu Thr Glu Ala Ala Ala 305 310 315 320

Arg Glu Ile Arg Ala Val Cys Asp Ala Val Gly Cys Pro Leu Ile Val

325 330 335 Asp Glu Ile His Ser Gly Met Gly Arg Thr Gly Ala Phe Leu Ala Ser 340 345 350 Ser His Met Gly Leu Arg Gly Asp Tyr Tyr Thr Leu Ala Lys Ser Ile 355 360 365

Gly Gly Gly Ile Ala Lys Asn Ala Val Ala Leu Phe His Arg Asp Arg 370 375 380

Phe Arg Pro Glu Phe Glu Val Met His Ser Ser Thr Phe Ala Lys Asp

385 390 395 400

Gly Phe Ser Ala Ala Ile Ala Leu Lys Val Leu Glu Met Leu Glu Ala 405 410 415 Asp Asp Gly Arg Ala Tyr Arg Ile Ala Ala Glu Arg Gly Asp Arg Leu 420 425 430 Lys Gly Ala Leu Thr Ala Val Ala Ala Asp Phe Pro Asp Val Val Asp 435 440 445 Ala Val His Gly Ile Gly Leu Met Leu Ala Val Glu Phe Lys Asp Gln 450 455 460

Lys Gly Ala Ser Ser Glu Pro Leu Arg Glu Lys Ala Ala Ser Gly Met

465 470 475 480

Leu Gly Tyr Phe Ile Ala Gly Leu Ile Leu Arg Glu His Arg Ile Arg

485 490 495 Val Leu Pro Val Gly Pro Ala Gly Asn Ser Val Arg Phe Glu Pro Ser 500 505 510 Ile Tyr Leu Thr Asp Ala Asp Ile Ala Arg Thr Glu Asn Ala Leu Arg 515 520 525 Asp Val Cys Thr Ile Leu Arg Asp Gln Asp Gly Asp Arg Leu Thr Pro 530 535 540

<210> 13

<211> 328

<212> Proteína <213> Artificial

<220>

<223> Orf12

<400> 13

Met Glu His Ala Leu Ser Ala Ala Cys Ala Ala Ser Val Arg Ala Ala 1 5 10 15

Val Thr Leu Gly Leu Pro Glu Ala Leu Gly Glu Ala Pro Ala Thr Ala 20 25 30 Asp Glu Leu Ala Thr Ala Val Gly Ala Asp Pro Gly Ala Leu Arg Arg 35 40 45 Leu Leu Arg Ser Leu Thr Ser Tyr Gly Val Phe Ala Glu Glu Gly Thr

55 60

Gly Ser Phe Val His Thr Glu Lys Ser Arg Ala Leu Arg Glu Asp Ser

70 75 80

Pro Asp Ser Ile Lys Tyr Leu Val Leu Trp Cys Thr Glu Pro Trp Leu 85 90 95 Trp Ser Leu Trp Gly Asp Leu Asp Glu Ser Val Arg Thr Gly Gly Glu 100 105 110 Ile Phe Thr Arg Thr His Gly Arg Arg Phe Tyr Glu His Leu His Thr 115 120 125 Arg Trp Pro Glu Ser Ala Arg Ile Phe Asn Arg Ala Met Thr Gln Gln 130 135 140

Thr Arg Leu Ser Ala Thr Val Ile Ala Asp Met Leu Pro Met Ser Gly

145 150 155 160

Ala Gly Thr Val Ala Asp Val Gly Gly Gly Gln Gly Leu Val Leu Gly

165 170 175 Thr Leu Leu Glu Arg His Pro His Val Arg Gly Phe Leu Leu Asp Leu 180 185 190 Pro Glu Val Val Ala Asn Val Asp Ala Arg Leu His Pro Gly Gly Glu 195 200 205 Leu Ala Asp Arg Val Arg Leu Val Pro Gly Asp Cys Leu Glu Gly Ile 210 215 220

Ser Val Glu Ala Asp Val Tyr Leu Phe Lys Asn Ile Leu Gly Ala Asp

225 230 235 240

Asp Asp Thr Ser Val Arg Ile Leu Arg Asn Ala Met Lys Ala Ala Arg 245 250 255 Pro Gly Ala Arg Met Val Ile Val Glu Asn Phe Val Asp Asp Gly Pro 260 265 270 Gly Glu Arg Leu Ala Ser Ala Leu Asp Leu Arg Met Leu Leu Val Ile 275 280 285 Gly Gly Gln Lys His Thr Arg Ala Gly Leu Leu Gly Ile Ala Glu Arg

290 295 300

Ala Gly Leu Thr Val Arg Asp Val Arg Pro Val Asp Ser Ser Leu His 305 310 315 320

Met Ile Glu Thr Val Val Pro Gly 325

<210> 14

<211> 462

<212> Proteína <213> Artificial

<220>

<223> Orf13

<400> 14 Val Arg Ala Ser Ser Ser Ala Pro Pro Ala Leu Asp Ala Leu Gly Thr 1 5 10 15 Gly Gly Pro Tyr Arg Ser Arg Asn Leu Glu Val Val His Asp Val Arg

20 25 30 Gly Glu Pro Val Ala Glu Leu Ser Leu Val Pro Trp Leu Phe Ala Gln 35 40 45 Arg Ser Ile Arg Ala Leu Arg Arg Ala Ala Pro Pro Asp Pro Glu Arg

55 60

Arg Lys Lys Leu Leu Thr Arg Ala Gly Trp Leu Phe Ala Ser Gly Ser

70 75 80

Val Asp Gly Val Thr Ala Ala Ala Tyr His Arg Thr Val Ala Glu Val

85 90 95 Ser Gly Ile Pro Ile Ala Thr Val Arg Gln Ser Ala Gln Gln Val Gly 100 105 110 Asp Tyr Ala Ala Thr Ala Tyr Glu Asn Val Arg Gln Ala Arg Pro Val 115 120 125 Gly Ala Ala Asp Ser Trp Arg Asp Gln Arg Ala Arg His Gly Ser Gly

130 135 140

Val Trp Thr Arg Arg Gly Glu Val Leu Met Val His Ala Pro Ala Asn 145 150 155 160

Ser Pro Ala Val His Thr Ser Trp Leu Asp Ala Leu Ala Leu Gly Tyr

165 170 175 Arg Val Ala Val Arg Pro Ser Gln Arg Glu Pro Phe Thr Ala His Arg 180 185 190 Leu Val Ser Ala Leu Arg Gln Val Gly Tyr Asp His Asp Gln Val Val 195 200 205 Leu Leu Pro Thr Asp His Ala Thr Ala Asp Arg Leu Val Ala Glu Ala

210 215 220

Asp Val Ala Ile Ala Phe Gly Gly Asp Glu Val Ala Arg Lys Tyr Gly

225 230 235 240

Ser Ser Thr Leu Val Met Pro Phe Gly Pro Gly Arg Ser Lys Ile Leu

245 250 255 Leu Thr Ser Gly Thr Asp Val Gly Arg His Leu Ala Thr Ile Thr Glu 260 265 270 Ser Ile Ala Gly His Ala Gly Thr Gly Cys Val Asn Ala Thr Ala Val 275 280 285 Phe Val Glu Gly Asp Pro Glu Pro Val Ala Glu Ala Ile Ala Gln Arg

290 295 300

Leu Gly Glu Leu Pro Ser Leu Pro Pro Gly Asp Glu Arg Ala Arg Leu

305 310 315 320

Pro Val Arg Arg Ala Ala Val Ala His Glu Met Asp Ala Tyr Leu Arg

325 330 335 Ala Gly Ala Gly Asp Ala Arg Pro Leu Leu Gly Ala Asp Thr Val Val

340 345 350 Ala Glu Leu Gly Asp Gly Ser Ala Val Leu Arg Pro Ala Val Phe Leu 355 360 365 Leu Asp Arg Pro Asp Ala Pro Gln Ala Arg Ile Glu Met Gly Phe Pro 370 375 380

Cys Val Trp Val Leu Pro Trp Arg Arg Glu Gly Asp Trp Leu Ala Pro 385 390 395 400

Leu Arg Asp Thr Leu Ser Leu Thr Ala Phe Thr Asp Asp Asp Gly Leu 405 410 415 Ile Glu Ser Leu Val Ala Glu Pro Ser Ile Asp Lys Ile His Ile Gly 420 425 430 Asp Arg Pro Thr Thr Trp Thr Leu Pro Gly Leu Pro His Glu Gly Tyr 435 440 445 Leu Gly Ser Phe Leu Met Arg Ser Lys Ala Val Val Val Gly 450 455 460

<210> 15

<211> 343

<212> Proteína <213> Artificial

<220>

<223> Orf14

<400> 15 Met Ala Trp His Phe Asp Pro Lys Thr Gly Ser Pro Phe Trp Gln Glu 1 5 10 15 Gln Ser Arg Lys Leu Glu Phe Asp Pro Arg Lys Asp Val Arg Thr Val

20 25 30 Glu Asp Leu Thr Leu Phe Pro Asn Val Val Asp Glu Leu Arg Asp Ala 35 40 45 Arg Ile Glu Asp Leu Val Pro Arg Gly Tyr Gly Gly Pro Asp Arg Leu 50 55 60

Ser Arg Pro Pro Val Val Gly Glu Ser Gly Gly Thr Thr Gly Ala Pro 65 70 75 80

Lys Arg Val Phe Val Leu Pro Asp Val Arg Glu Gln Ser Trp Ala Trp

90 95 Tyr Tyr Asn Arg Leu Val Glu His Gly Ile Ala Ala Gly Asp Asn Trp 100 105 110 Leu Gly Ile Met Pro Ala Gly Pro His Met Ala Gly Ile Leu Ala Gln 115 120 125 Asp Thr Ala Gln Arg Phe Gly Gly Ile Phe Phe Thr Val Asp Phe Asp

130 135 140

Pro Arg Trp Ala Lys Leu Val Ile Gly Arg Gly Ala Val Asp Glu Ala 145 150 155 160

Asn Ala Tyr Ile Thr His Leu Val Asn Gln Ile Glu Trp Ile Leu Arg

165 170 175 Ser Gln Asp Ile Arg Val Met Val Ile Thr Pro Pro Leu Leu Glu Ala 180 185 190 Val Cys Arg Arg Asp His Leu Val Asp Leu Ile Asn Glu Lys Val Asn 195 200 205 Thr Val Ile Tyr Gly Gly Thr Ser Met Asp Glu Asp Thr Arg His Leu 210 215 220

Phe Arg Thr Glu Leu Phe Pro Gln Ile Asn Phe Val Ser Ile Phe Gly

225 230 235 240

Ser Thr Met Ile Phe Cys Ala Met Pro Glu Arg Pro Asp Ser Pro Ala

245 250 255 Asp Glu Ser Pro Val Phe Asp Pro Pro Ser Pro Phe Ser Met Phe Ser 260 265 270 Val Ile Asp Pro Asp Thr Gly Lys Asn Val Pro Tyr Gly Glu Arg Gly 275 280 285 Gln Val Leu Thr His His Leu Thr Arg Asn Leu Phe Leu Pro Asn Asn 290 295 300 Leu Asp Arg Asp Thr Gly Ile Arg His Pro His Arg Leu Gly Leu Pro 305 310 315 320

Gly Asp Ala Val Ser Glu Phe Lys Pro Val Arg Glu Phe Gly Ala Ala 325 330 335 Pro Val Ile Glu Gly Val Tyr 340

<210> 16

<211> 402

<212> Proteína <213> Artificial

<220>

<223> Orf15

<400> 16

Met Arg Glu Ser Arg Tyr Asp Val Ile Val Val Gly Ala Arg Cys Ala 1 5 10 15

Gly Ser Pro Thr Ala Met Leu Leu Ala Arg Lys Gly Tyr Arg Val Leu 20 25 30 Val Val Asp Arg Ala Val Phe Pro Ser Asp Thr Leu Ser Thr His Leu 35 40 45 Val His Pro Pro Gly Val Ala Ala Leu Arg Gly Trp Gly Leu Leu Asp

55 60

Arg Leu Val Ala Thr Gly Cys Pro Pro Ile His Thr Tyr Glu Phe Asp

70 75 80

Phe Gly Ser Leu Val Leu Pro Gly Ala Pro Gly Thr Glu Ala Glu Pro 85 90 95 Tyr Ala Tyr Ala Pro Arg Arg Thr Val Leu Asp Lys Leu Leu Val Asp 100 105 110 Ala Ala Arg Glu Ala Gly Ala Glu Val Arg Glu Gly Phe Thr Val Thr 115 120 125 Gly Leu Val Leu Gly Asp Ala Gly Glu Val Val Gly Val Arg Gly Arg

130 135 140

Gly Pro Arg Gly Pro Glu Val Thr Glu Arg Ala Arg Val Val Leu Gly

145 150 155 160

Ala Asp Gly Leu His Ser Leu Val Ala Arg Ala Val Asp Ala Pro Arg

165 170 175 Tyr Asn Glu His Pro Lys Leu Met Val Gly Tyr Tyr Ser Tyr Phe Ser 180 185 190 Gly Leu Asp Met Asp Gly Val Phe Lys Ala His Ser Arg Pro Tyr Arg 195 200 205 Ser Phe Gly Ala Trp Pro Thr His Asp Gly Leu Thr Leu Val Gly Gly

210 215 220

Cys Trp Pro Phe Ala Glu Phe Asn Asp Ile Arg Lys Asp Ile Glu Gly

225 230 235 240

Asn Tyr Leu Lys Asn Phe Ala Leu Ala Pro Ala Trp Glu Glu Arg Ile 245 250 255 Arg Asp Ala Arg Arg Glu Asp Arg Ile Val Gly Ala Ala Leu Pro Asn 260 265 270 Phe Phe Arg Lys Pro Phe Gly Pro Gly Trp Ala Leu Val Gly Asp Ala 275 280 285 Gly Tyr Cys Lys Asp Phe Phe Thr Ala Gln Gly Ile Ser Asp Ala Phe 290 295 300

Ile Ser Ala Glu Met Cys Ala Gly Ser Leu Asp Asp Ala Leu Ser Gly

305 310 315 320

Arg Ala Pro Phe Asp Thr Ala Met Ala Ala Tyr Gln Ala Ala Arg Asp

325 330 335 Arg His Ala Arg Pro Val Tyr Asp Phe Thr Leu Gln Val Ser Thr Leu 340 345 350 Glu Pro Leu Ser Pro Glu Phe Glu Lys Val Leu Glu Gly Ile Asp Gly 355 360 365 Asn Gln Gln Gly Met Asp Ala Phe Ala Gln Val Asn Ala Gly Val Thr

370 375 380

Ser Met Glu Arg Phe Ser Ala Asp Trp Gly Gly Ala Val Arg Pro Val

385 390 395 400

Pro Arg <210> 17

<211> 413

<212> Proteína <213> Artificial

<220>

<223> Orf16

<400> 17

Met Glu His Arg Ile Asp Ala Asp Leu Leu Ile Pro Gly Ala Gly Glu 1 5 10 15

Pro Thr Val Asn Gly Ser Val Val His Ala Asp Gly Arg Ile Arg Phe 20 25 30 Ala Gly Pro Thr Ala Glu Leu Ser Gly Glu His Arg Ala Leu Glu Pro 35 40 45 Thr Arg Val Ala Thr Leu Leu Pro Gly Leu Trp Asp Cys His Val His 50 55 60

Phe Ala Gly Ile Arg Gly Arg Val Ser Thr Glu Glu Leu Met Leu Thr

70 75 80

Pro Glu Thr Leu Ala Val Ala Arg Ser Val Lys Asp Ala Glu Thr Ala 85 90 95 Leu Arg Ala Gly Phe Thr Ser Val Arg Asp Met Gly Gly His Gly Cys 100 105 110 Val Leu Ala Glu Ala Ile Arg Glu Gly Thr Phe Thr Gly Pro Asn Ile 115 120 125 Tyr Ser Ala Asn Gln Val Ile Gly Gln Thr Gly Gly His Ser Asp Ala 130 135 140

His Arg Leu Pro Tyr Arg Trp Val Thr Asp Pro Cys Arg Ser Gly Gly

145 150 155 160

Thr Leu Arg Ile Ala Asp Gly Val Asp Glu Cys Val Arg Ala Val Arg

165 170 175 Leu Gln Leu Arg Ala Gly Ala Glu Leu Ile Lys Ile Cys Thr Ser Gly 180 185 190 Gly Val Leu Ser Glu Val Asp Asn Pro Val His Gln Gln Tyr Arg Ser 195 200 205 Glu Glu Leu Asn Ala Ile Val Thr Glu Ala Ala Arg Ala Asp Arg Ile 210 215 220

Val Ala Ala His Cys His Gly Arg Ala Gly Ile Leu Ala Ala Ile Asn 225 230 235 240

Ala Gly Cys Arg Thr Val Glu His Gly Thr Glu Ile Asp Glu Glu Thr

245 250 255 Ala Asp Leu Met Ala Glu Arg Gly Met Thr Leu Val Pro Thr Arg Thr 260 265 270 Ile Tyr Glu Ala Phe Arg His Asn Val Ala Ala Leu Pro Pro Ala Trp 275 280 285 Arg Asp Arg Phe Glu Val Met Ala Glu Arg His Leu Thr Ala Ile Gly

290 295 300

Ile Ala His Arg Ala Gly Val Thr Ile Ala Leu Gly Thr Asp Leu Gly

305 310 315 320

Thr Ser Asp Arg Gly Gly Pro Leu Ser Trp Gly Gly His Ala Ser Glu 325 330 335 Phe Ala His Leu Val Ser Ala Gly Leu Ser Pro Leu Glu Ala Ile Thr 340 345 350 Ala Ala Thr Ala His Gly Pro Gly Thr Leu Gly Pro Arg Ala Pro Arg 355 360 365 Ser Gly Arg Leu Glu Ala Gly Tyr Asp Ala Asp Leu Leu Ala Val Glu 370 375 380

Gly Asn Pro Leu Ala Asp Ile Thr Val Leu Ala Asp Pro Asp Arg Ile 385 390 395 400

Thr Arg Val Trp Lys Ser Gly Glu Pro Val Thr Ser Gly 405 410

<210> 18

<211> 562

<212> Proteína <213> Artificial

<220>

<223> Orf17

<400> 18

Met Ser Pro Tyr Gln Ser Arg Pro Leu Asp Pro Asp Val Pro Val Ile 1 5 10 15

Asp Arg Arg Arg Ala Glu Ala Tyr Leu Ala Ala Gly Leu Trp Arg Glu 20 25 30 Glu Gly Val Ala Glu Leu Leu Arg Ala Ala Arg Leu Arg His Pro Asp 35 40 45 Arg Leu Ala Val Val Ser Gly Thr Thr Arg Leu Thr His Gly Glu Leu 50 55 60

His Thr Ala Val Thr Val Ala Gly Arg Arg Leu Glu Gly Leu Gly Val

70 75 80

Arg Ala Gly Asp Arg Val Val Val Gln Leu Pro Asn Cys Ala Glu Phe 85 90 95 Val Val Leu Val Leu Ala Leu Leu Asp Ile Gly Ala Pro Pro Val Leu 100 105 110 Val Leu Pro Gly Phe Gly Asp Tyr Glu Leu Gly His Val Val Arg Ser 115 120 125 Ala Arg Pro Val Ala Leu Ala Val Ser Ala Gly Ser Arg Ser Ser Asp

130 135 140

Pro Val Glu Ser Ala Arg Arg Leu Arg Glu Asp His Glu Thr Leu Asp

145 150 155 160

His Val Leu Ala Leu Gly Asp Val Ala Ala Gly Asp Val Asp Leu Ala 165 170 175 Ala Leu Cys Asp Pro Gly Ala Pro Trp Asp Ala Pro Val Gly Pro Ala 180 185 190 Ser Ser Ala His Pro Arg Pro Gly Arg Gly Ser Ala Leu Thr Asp Ala 195 200 205 Ala Val Phe Leu Leu Ser Gly Gly Thr Thr Gly Pro Pro Lys Val Ile 210 215 220

Pro Arg Gly Asn Ala Gly Tyr Ala Tyr Met Ile Arg Thr Ala Cys Ala 225 230 235 240

Ile Ala Asp Val Ser Gln Asp Ala Val Tyr Leu Ala Val Met Pro Ala 245 250 255 Ala His Gly Phe Val Leu Asn Cys Pro Gly Val Leu Gly Thr Leu Ala 260 265 270 Gln Gly Gly Thr Val Val Leu Gly Asp Thr Thr Asp Pro Arg Gln Ala 275 280 285 Leu Gly Leu Ile Glu Arg Glu Arg Val Thr His Cys Ala Leu Val Pro 290 295 300

Thr Val Ala Met Gln Trp Leu Ala Ala Ala Glu Gly Thr Thr Ala Asp

305 310 315 320

Leu Ser Ser Leu Arg Val Leu Gln Thr Gly Gly Ala Arg Pro Ala Pro 325 330 335 Glu Leu Ala Ala Arg Ile Arg Pro Glu Leu Gly Ala Thr Leu Gln Gln 340 345 350 Cys Tyr Gly Met Ser Glu Gly Leu Leu Ser Tyr Thr Gly Leu Asp Asp 355 360 365 Pro Glu Ser Val Ile Val Gly Thr Gln Gly Arg Pro Ala Ser Pro Met

370 375 380

Asp Glu Val Leu Ile Val Asp Glu Gln Gly Arg Pro Val Ala Asp Gly

385 390 395 400

Thr Thr Gly Glu Leu Leu Thr Arg Gly Pro Tyr Thr Val Ala Gly Tyr

405 410 415 Tyr Arg Asp Pro Ala Ala Thr Ala Lys Ala Phe Thr Pro Asp Gly Phe 420 425 430 Tyr Arg Thr Gly Asp Leu Ala Thr Arg Thr Ser Gly Gly Asp Leu Val 435 440 445 Val Asn Gly Arg Val Arg Asp Ile Ile Asn Arg Gly Gly Glu Lys Ile 450 455 460

Pro Ala Glu Asp Leu Glu Leu Leu Val Ala Arg His Pro Asp Val Arg

465 470 475 480

Ala Ser Ala Ala Val Gly Ala Pro His Glu Leu Tyr Gly Glu Val Val 485 490 495

Cys Leu Tyr Val Val Pro Glu Ala Gly Arg Glu Pro Thr Leu Leu Glu 500 505 510 Leu Arg Arg Phe Leu Arg Gly Leu Gly Leu Ala Val Trp Lys Leu Pro 515 520 525 Glu Leu Met Glu Thr Val Ala Gln Leu Pro Thr Thr Ala Ile Gly Lys 530 535 540

Ile Asp Lys Lys Val Leu Arg Ala Asp Ile Ala Arg Arg Thr Ala Val

545 550 555 560

Glu Ala

<210> 19

<211> 86

<212> Proteína <213> Artificial

<220>

<223> Orf18

<400> 19 Leu Pro Ser Gly Asn Gln Gly Ala Ala Val Ser Val Asp Val Leu Lys 1 5 10 15 Gln Leu Leu Leu Asp Ile Gly Ile Ala Glu Arg Thr Leu Thr Glu Ile 20 25 30 Glu Pro Gly Thr Arg Leu Arg Ala Asp Leu Gly Leu Ser Ser Val Glu 35 40 45 Thr Thr Asp Leu Glu Ile Gln Leu Arg Glu Arg Phe Gly Val Arg Ile 50 55 60

Asn Leu Trp Asp Lys Ala Asp Tyr Thr Met Glu Gln Leu Ala Ala Gly

70 75 80

Ile Arg Glu Met Pro Arg

<210> 20

<211> 458

<212> Proteína <213> Artificial

<220>

<223> Orf19

<400> 20 Val Ile Gly Ala Val Leu Ser Ala Ala Ser Asp Gln His Ile Ser Met 1 5 10 15 Pro Glu Leu Ser Asp Asp Thr Cys Lys Gly Gly Arg Glu Pro Trp Gly

20 25 30 Trp Thr Val Glu Arg Ser Tyr Glu Ile Pro Tyr Pro Glu Leu Gly Ser 35 40 45 Val Leu Gly Glu Gln Glu Ile Gly Val Leu Thr Arg Leu Val Thr Ser

55 60

Gly Glu Asn Leu Ser Gly Gly Arg Cys Arg Glu Glu Phe Glu Arg Cys 65 70 75 80

Phe Arg Glu Tyr Leu Asp Val Pro His Ala Leu Ser Val Thr Ser Gly

90 95 Thr Val Ala Leu Glu Ile Ala Ile Arg Leu Leu Asp Leu Asn Glu Gly 100 105 110 Asp Glu Val Ile Ala Thr Pro Gln Thr Tyr Lys Ala Ser Val Gln Pro 115 120 125 Leu Leu Asn Tyr Pro Val Lys Val Arg Phe Cys Asp Val Gly Pro Asn 130 135 140

Thr Leu Asn Ile Asp Pro Gly His Phe Glu Ser Leu Ile Thr Ser Arg

145 150 155 160

Thr Lys Ala Val Ile Leu Val His Tyr Gly Gly Leu Pro Cys Asp Met

165 170 175 Asp Ala Ile Met Ala Ile Ala Arg Arg His Gly Ile Thr Val Ile Glu 180 185 190 Asp Cys Ala His Ala Leu Gly Ala Glu Tyr Arg Gly Arg Lys Pro Gly 195 200 205 Ala Leu Ala Asp Ile Gly Cys Phe Ser Phe His Ser Ser Lys Asn Ile 210 215 220

Thr Thr Leu Gly Glu Gly Gly Met Ile Thr Leu Phe Asp Pro Ala Leu 225 230 235 240

Ala Glu Arg Ala Asp Arg Ile Arg Ser Asn Asp Ala Asp Ala Val Tyr

245 250 255 Arg Ala Gln Ala Arg Ala Ile Gly Asn Thr Thr Ser Ala His Pro Trp 260 265 270 Met Leu His Pro Gly Ala Ala Phe Thr His Asp Cys Ser Thr Ile Arg 275 280 285 Tyr Gly Gly Thr Asn Ala Thr Leu Ala Glu Pro Asn Ala Ala Val Gly

290 295 300

Thr Val Gln Leu Ala Lys Leu Asp Arg Leu Val Arg Arg Arg Ala Glu 305 310 315 320

Ile Ala Ala Ala Tyr Thr Asp Val Leu Lys Gln His Pro Gly Val Arg

325 330 335 Met His Glu Gly Pro Asp Pro Val Arg His Ala His His Leu Phe Thr 340 345 350 Phe Phe Ala Asp Pro Ala Asp Gly Ile Leu Arg Asp Arg Leu Val Ser 355 360 365 Arg Leu Asp Ala Leu Gly Val Gln Met Gln Leu Arg Tyr Phe Pro Met

370 375 380

His Leu Leu Ala Glu Trp Arg Ala Arg Gly His Thr Ala Gly Glu Cys 385 390 395 400

Pro Val Ala Glu Arg Leu Trp Phe Glu Gln Gln Val Asn Leu Pro Cys

405 410 415 His Pro Ala Met Thr Asp Arg Gln Val Gly Gln Val Val Ser Arg Leu 420 425 430 Asp Thr Val Leu Gly Gln Glu Ala Arg Asp Gln Pro Ile Ala Ser Val 435 440 445 Thr Thr Pro Thr Ala Glu Gly Trp Asn Arg 450 455

<210> 21

<211> 393

<212> Proteína <213> Artificial

<220>

<223> Orf20

<400> 21 Met Ser Glu Thr Glu His Tyr Asp Ile Ala Val Ile Gly Gly Gly Pro 1 5 10 15 Val Gly Leu Ala Ser Ala Trp His Ala Ala Arg Arg Gly Glu Arg Val

20 25 30 Ala Val Phe Glu Gln Phe Ser Phe Gly Asn Glu Gln Cys Gly Thr Ser 35 40 45 Gly Ala Glu Arg His Trp Arg Leu Gln Tyr Thr Glu Pro Asp Leu Cys 50 55 60

Arg Leu Thr Gly Glu Ala Leu Pro Leu Trp Arg Glu Leu Glu Arg Ala 65 70 75 80

Thr Gly His Gln Leu Leu His Ala Phe Gly Ser Leu Trp Phe Gly Asp

90 95 Ile Asp Val Ala Thr Asn Glu Gly Arg Ile Ser Ala Thr Ala Arg Thr 100 105 110

Met Asp Asp Leu Ser Ile Pro Tyr Glu Trp Leu Thr Ala Ala Asp Ile 115 120 125

Glu Arg Arg Tyr Gly Phe Thr Gly Leu Pro Gly His Phe Glu Gly Phe 130 135 140

Val Gln Pro Asp Gly Gly Ala Val Asp Val Arg Ala Thr Val Glu Gly

145 150 155 160

Leu Leu Arg Leu Thr Glu Glu Ala Gly Cys Ala Leu Arg Ala His Glu 165 170 175 Pro Val Leu Glu Leu Ile Pro Asp Gly Gly Gly Val Thr Leu Arg Thr 180 185 190 Ala Arg Gly Arg Cys Arg Ala Gly Lys Val Val Val Ala Asn Gly Ala 195 200 205 Tyr Ala Asn Lys Leu Leu Glu Pro Leu Gly Ser Arg Leu Asp Leu His 210 215 220

Val Phe Glu Met Ala Leu Val Thr Leu Arg Gln Arg Asp Pro Lys Val

225 230 235 240

Arg Tyr Pro Phe Trp Phe Val Phe Gln Glu Pro Thr Glu Glu Asp Thr

245 250 255 Asn Leu Phe Tyr Gly Phe Pro Pro Asn Ala Trp Gln Asp Thr Asp Thr 260 265 270 Val Arg Val Gly Pro Val Phe Glu Val Asn Ala Leu Ala Asp Pro Ala 275 280 285 Arg Ala Thr Gly Thr Pro Asp Pro Arg His Val Ala Arg Met Cys Glu 290 295 300

Trp Val Glu Arg His Leu Pro Val Val Asp Pro Arg Pro Leu Ala Arg

305 310 315 320

Asp Thr Cys Leu Ala Val Leu Pro Ala Asp Pro Glu Arg Gln Phe Phe 325 330 335 Leu Gly Thr Ala Lys Gly Arg Phe Asp Gly Gly Glu Asn Val Val Ile 340 345 350 Ala Thr Gly Gly Trp Gly Phe Lys Phe Val Pro Leu Leu Gly Lys Val 355 360 365 Cys Ala Asp Leu Cys Val Asp Gly Ala Thr Gly Tyr His Val Asp Arg

370 375 380

Leu Met Leu Pro Asp Ala Ala His Pro 385 390

<210> 22

<211> 2552

<212> Proteína <213> Artificial

<220>

<223> Orf21

<400> 22 Val Gly Val Gln Val Arg Pro Ala Ala Gly Gln Gly Leu Arg Arg Ser 1 5 10 15 Val Arg Gly Arg Arg Asp Arg Leu Pro Arg Gly Pro Pro Asp Ala Ala 20 25 30 Arg Arg Gly Pro Pro Val Ser Arg Ser Ala Pro Thr Arg Gly Arg Pro 35 40 45 Phe Pro Ser Lys Gly Asp Ser Pro Val Val Ser Glu Ala Pro Ala Thr

55 60

Gly Ala His His Asp Gly Thr Ser Val Asp Ile Ala Val Val Gly Met

70 75 80

Ala Gly Arg Phe Pro Gly Ala Pro Asp Leu Asp Ala Tyr Trp His Asn

85 90 95 Leu Arg Ser Gly Val Glu Ser Ile Glu Arg Leu Thr Glu Asp Asp Leu 100 105 110 Leu Ala Glu Gly Val Asp Pro Glu Leu Ile Gly Ala Pro Gly Tyr Val 115 120 125 Pro Val Ala Pro Val Leu Glu Gly Ile Asp Leu Phe Asp Ala Arg Phe 130 135 140 Phe Gly Phe Thr Ala Arg Glu Ala Ala Leu Leu Asp Pro Gln Gln Arg

145 150 155 160 Leu Phe Leu Glu Ser Ala Trp His Ala Met Glu His Ala Gly Ile Asp 165 170 175 Pro Ala Arg Cys Gly Thr Ala Ala Val Phe Ala Gly Gly Asn Met Pro 180 185 190 Ala Tyr Leu Met Ser Asn Leu Leu Gly Gly Ala Arg Val Val Leu Asp 195 200 205 Ser Ala Met Phe Glu Leu Gln Ile His Asn Asp Lys Asp Phe Leu Ala 210 215 220

Ser Arg Thr Ala Tyr Lys Leu Gly Leu Thr Gly Pro Ala Val Asn Val

225 230 235 240

Gln Thr Ala Cys Ser Thr Ser Leu Val Ala Val His Gln Ala Ala Ala 245 250 255 Ala Leu Arg Ser Gly Asp Cys Glu Ile Ala Leu Ala Gly Gly Val Cys 260 265 270 Val Arg Val Pro His Arg Val Gly Tyr Arg Tyr Glu Gln Gly Leu Ile 275 280 285 Tyr Ala Pro Asp Gly Arg Cys Arg Pro Phe Asp Ala Asp Gly Ala Gly

290 295 300

Thr Val Phe Gly Asn Gly Ala Gly Ala Val Val Leu Lys Arg Leu Ala 305 310 315 320

Asp Ala Arg Arg Asp Gly Asp Arg Ile Leu Ala Val Leu Lys Gly Ser

325 330 335 Ala Val Asn Asn Asp Gly Ala Glu Lys Val Gly Tyr Thr Ser Pro Ser 340 345 350 Val Ser Gly Gln Glu Ala Val Val Ala Ala Ala Ile Ala Asp Ser Gly 355 360 365 Val Pro Ala Arg Ser Ile Thr Ala Ile Glu Ala His Gly Thr Gly Thr

370 375 380

His Val Gly Asp Pro Ile Glu Ile Thr Ala Leu Ser Arg Ala Phe Gly

385 390 395 400

Arg His Thr Thr Asp Thr Gly Phe Cys Ala Val Gly Ser Val Lys Ser

405 410 415 Asn Ile Gly His Leu Glu Ser Ala Ala Gly Ile Ala Ser Phe Ile Lys 420 425 430 Ala Val Leu Gln Leu His His Arg Thr Leu Val Pro Ser Leu His Phe 435 440 445 Glu Arg Pro Asn Pro Arg Ile Asp Phe Asp Ala Thr Pro Phe Phe Val

450 455 460

Asn Thr Glu Leu Arg Ala Trp Pro Glu Gly Glu His Pro Arg Arg Ile 465 470 475 480

Gly Val Ser Ser Phe Gly Ile Gly Gly Thr Asn Ala His Val Val Leu 485 490 495 Glu Gln Ala Pro Asp Pro Val Pro Ala Glu Pro Ser Gly Arg Pro Glu 500 505 510 Leu Val Val Val Ser Ala Lys Ser Pro Ala Ala Leu Asp Ala Ala Thr 515 520 525 Glu Ala Leu Ala Glu Lys Leu Ala Ala Pro Asp Ala Gln Pro Leu Ala 530 535 540

Asp Ile Ala His Thr Leu Gln Thr Gly Arg Gly Ala Met Arg Tyr Arg

545 550 555 560

Arg Ala Val Val Ala Ala Gly Thr Ala Glu Ala Ala Ala Leu Leu Ser 565 570 575 Gly Ala Asp Pro Gly Arg Val Arg Ser Ala Asp Ala Gly Thr Ala Pro 580 585 590 Ala Lys Val Val Phe Leu Phe Pro Gly Gln Gly Ala Gln Tyr Pro Gly 595 600 605 Met Ser Arg Gly Leu Tyr Ala Ser Glu Pro Val Phe Ala Glu Ala Leu 610 615 620

Asp Ala Cys Ala Asp Leu Leu Ala Glu Glu Leu Gly Ile Asp Leu Arg

625 630 635 640

Thr Val Leu Phe Pro Asp Ala Pro Ala Glu Asp Gly Leu Thr His Thr

645 650 655 Thr Leu Ala Gln Pro Ala Leu Phe Ala Thr Glu Tyr Ala Met Ala Thr 660 665 670

Leu Leu Arg Ser Trp Gly Val Glu Pro Asp Val Met Val Gly His Ser 675 680 685

Ile Gly Glu Phe Thr Ala Ala Val Leu Ser Gly Val Leu Ser Leu Lys 690 695 700

Asp Ala Ala Arg Leu Val Ala Leu Arg Gly Arg Leu Met Gln Asp Arg

705 710 715 720

Pro Thr Gly Ala Met Val Ser Ile Ala Ala Pro Ala Ala Asp Ile Glu 725 730 735 Pro Leu Leu Pro Ala Gly Val Ser Ile Ala Ala Ile Asn Ala Pro Val 740 745 750 Leu Cys Val Ala Ser Gly Pro His Glu Ala Val Ala Glu Leu Gly Glu 755 760 765 Ile Leu Ala Ala Lys Glu Ile Thr Val Arg Pro Leu His Thr Ser His 770 775 780

Ala Phe His Ser Ala Met Met Asp Pro Val Val Glu Pro Phe Thr Glu 785 790 795 800

Ala Val Ala Gly Thr Pro Leu Ala Ala Pro Gly Leu Pro Phe Val Ser

805 810 815 Cys Val Thr Gly Leu Pro Ile Thr Ala Glu Leu Ala Thr Asp Pro Gln 820 825 830 Tyr Trp Gly Thr His Leu Arg Arg Pro Val Arg Phe Ala Asp Ala Val 835 840 845 Arg Thr Ala Ile Gly Asp Gly Pro Ala Val Leu Val Glu Val Gly Pro 850 855 860

Gly Asn Thr Leu Ser Thr Leu Ala Arg Ala Gly Ala Gly Thr Gly Gly

865 870 875 880

Pro Arg Cys Ala Ala Val Thr Thr Leu Arg Arg Pro Asp Glu Ala Ala 885 890 895 Asp Asp Gly Gln Val Leu Arg Thr Ala Val Gly Asp Ile Trp Leu Phe 900 905 910 Gly Gly Ala Val Asp Trp Pro Ala Leu His Gln Gly Arg Arg Asn Arg 915 920 925 Val Glu Leu Pro Gly Tyr Pro Phe Gln Arg Asp Arg Tyr Trp Ile Glu 930 935 940

Pro Arg Gly Ser Ala Thr Gly Thr Pro Leu Val Ala Asp Phe Ala Glu 945 950 955 960

His Glu Glu Ala Glu Thr Glu Pro Ala Gly Arg Ala Thr Arg Pro Ser

965 970 975 Thr Leu Val Thr Ala Tyr Val Ala Pro Ala Asp Glu Leu Glu Thr Thr 980 985 990 Ile Ala Gly Ile Trp Glu Glu Met Phe Gly Ile Ala Pro Ile Gly Thr 995 1000 1005 Arg Asp Asp Phe Phe Glu Leu Gly Gly His Ser Leu Leu Ala Ile 1010 1015 1020

Gln Val Leu Asn Arg Leu Gln Ala Thr Ser Gly Val Thr Val Glu 1025 1030 1035

Leu Gly Arg Leu Leu Ala Thr Pro Thr Ile Gly Gly Met Ala Glu 1040 1045 1050

Glu Leu Arg Ala Ala Gly Ala Ala Gly Thr Asp Asp Arg Leu Pro 1055 1060 1065

Thr Val Val Pro Arg Pro Asp Leu Arg Tyr Glu Pro Phe Pro Leu 1070 1075 1080

Thr Glu Met Gln Gln Ala Gln Trp Ile Gly Arg Leu Ser Ser Phe 1085 1090 1095

Asp Met Gly Gly Val Ala Pro His Leu Tyr Phe Glu Phe Asp Ser 1100 1105 1110

Arg Thr Ile Glu Thr Ala Arg Leu Glu Arg Ala Trp Gln Arg Val 1115 1120 1125

Val Gln Arg His Asp Met Leu Arg Met Val Val Leu Pro Asp Gly 1130 1135 1140

Arg Gln Gln Ile Leu Asp Asn Thr Glu Pro Tyr Arg Phe Glu Val 1145 1150 1155

Leu Asp Leu Arg Thr Thr Asp Pro Glu Glu Ala Glu Arg Gln Leu 1160 1165 1170

Ala Gln Ile Arg Asp Arg Met Ala Thr Glu Val Arg Pro Ala Asp 1175 1180 1185

Val Trp Pro Leu Trp Glu Val Arg Val Gly Leu Leu Pro Asp His 1190 1195 1200

Arg Val Arg Val His Ile Ser Phe Asp Leu Leu Val Ala Asp Val 1205 1210 1215

Ser Ser Phe Phe Tyr Gln Leu Leu Pro Gln Trp Arg Glu Phe Tyr 1220 1225 1230

His His Pro Glu His Asp Pro Glu Pro Leu Ala Leu Ser Phe Arg 1235 1240 1245

Asp Tyr Val Leu Ala Glu Glu Glu Leu Arg Arg Thr Pro Arg Tyr 1250 1255 1260

Glu Arg Ser Leu Glu Tyr Trp Arg Lys Arg Val Arg Glu Leu Pro 1265 1270 1275

Ala Ala Pro Glu Leu Pro Thr Val Gln Gly Ala Gly Gly Gly Glu 1280 1285 1290

Arg Leu Gly Phe Val Arg Arg His Ala Arg Leu Asp Ala Glu Leu 1295 1300 1305

Trp Gly Arg Ile Lys Ala Lys Ala Gly Glu Phe Gly Val Thr Pro 1310 1315 1320

Ser Ser Ala Met Leu Ala Ala Phe Ala Val Thr Ile Gly Thr Trp 1325 1330 1335

Ser Lys Ser Gln Arg Phe Thr Leu Asn Phe Thr Ala Val Asn Arg 1340 1345 1350

Leu Pro Val His Glu Glu Val Asp Asp Val Val Gly Glu Phe Ala 1355 1360 1365

Ser Phe Asp Leu Leu Glu Val Asp Ala Val Ser Ala Pro Asp Phe 1370 1375 1380

Ala Gly Leu Val Arg Glu Leu Gln Arg Gln Ser Trp Ala Asp Phe 1385 1390 1395

Asp His Arg Tyr Val Ser Gly Val Arg Ile Leu Arg Glu Arg Ala 1400 1405 1410

Arg Ala Arg Gly Gly Ala Gly Asp Val Met Pro Val Val Phe Thr 1415 1420 1425

Ser Ala Leu Gly Ser Asp Val Asp Gly Lys Pro Ala Pro Ser Pro 1430 1435 1440

Val Asp Trp Leu Gly Glu Gln Ser Tyr Phe Ile Ser Gln Thr Pro 1445 1450 1455

Gln Val Thr Ile Asp His Phe Leu Leu Glu Phe Gly Gly Asn Leu 1460 1465 1470

Glu Leu Ala Trp His Ala Val Asp Gly Leu Phe Pro Asp Gly Leu 1475 1480 1485

Met Glu Glu Met Phe Gln Ala Tyr Gln Asp Phe Val Val Gly Leu 1490 1495 1500

Ala Glu Thr Asp Gly Trp His Arg Pro Pro Val Leu Asp Leu Pro 1505 1510 1515

Ala Gly Gln Leu Ala Pro Arg Ala Ala Ala Asn Asp Thr Ala Gly 1520 1525 1530

Glu Leu Pro Asp Gly Val Leu Pro Ala Arg Ile Leu Ala Arg Ala 1535 1540 1545

Gly Ser Ala Glu Pro Ala Val Ile Thr Glu Asp Arg Thr Leu Asp 1550 1555 1560

Tyr Ala Glu Leu Thr Gly Arg Ala Val Ala Leu Ala Arg Glu Leu 1565 1570 1575

Thr Glu Ala Gly Tyr Gly Arg Gly Ala Val Val Gly Ile Gly Leu 1580 1585 1590

Ala Lys Gly Trp Arg Gln Thr Val Ala Ala Leu Ala Ala Ser Ala 1595 1600 1605

Ala Gly Cys Thr Tyr Val Pro Leu Asp Pro Gly Leu Pro Glu Ala 1610 1615 1620

Arg Arg Arg Trp Leu Val Glu Gln Ala Gly Ile Gly Cys Val Leu 1625 1630 1635

Ala Glu Pro Asp Thr Ala Ala Leu Trp Pro Asn Ala Pro Arg Val 1640 1645 1650

Leu Pro Val Ala Glu Asp Ala Arg Trp Asp Pro Ala Asp Thr Ala 1655 1660 1665

Ala Trp Ser Cys Pro Ala Arg Pro Glu Asp Thr Ala Tyr Val Ile 1670 1675 1680

Tyr Thr Ser Gly Ser Thr Gly Thr Pro Lys Gly Val Ala Val Ser 1685 1690 1695

His Arg Ala Ala Leu Asn Thr Leu Val Asp Ile Glu Glu Arg Phe 1700 1705 1710

Gly Ile Arg Pro Gly Asp Arg Val Leu Gly Leu Ser Ala Leu Asn 1715 1720 1725

Phe Asp Leu Ser Val Phe Asp Val Phe Gly Met Leu Ala Ala Gly 1730 1735 1740

Gly Ala Val Val Leu Pro Glu Ala Ala Asp Arg Arg Asn Pro Asp 1745 1750 1755

Arg Trp Thr Glu Leu Cys Arg Arg His Gly Val Thr Val Trp Asn 1760 1765 1770

Ser Val Pro Ala Leu Met Gln Met Leu Val Glu His Leu Glu Ser 1775 1780 1785

Arg Gly Pro Ala Asp Asp Ala Gly His Leu Pro Gly Leu Arg Leu 1790 1795 1800

Ala Leu Leu Ser Gly Asp Trp Ile Pro Leu Ser Leu Pro Asp Arg 1805 1810 1815

Ile Arg Ala Val Ala Pro Ala Thr Asp Val Ile Ser Leu Gly Gly 1820 1825 1830

Ala Thr Glu Ala Ala Val Trp Ser Ile Ala His Pro Ile Gly Glu 1835 1840 1845

Val Asp Pro Asp Trp Pro Ser Val Pro Tyr Gly Arg Pro Leu Arg 1850 1855 1860

Asn Gln Arg Phe His Val Leu Asn Asp Arg Leu Arg His Ala Pro 1865 1870 1875

Val Trp Val Pro Gly Gln Leu His Ile Ala Gly Ala Gly Leu Ala 1880 1885 1890

Glu Gly Tyr Trp Arg Asp Glu Arg Arg Thr Ala Glu Ser Phe Ile 1895 1900 1905

Thr His Pro Glu Thr Gly Glu Arg Leu Tyr Arg Thr Gly Asp Leu 1910 1915 1920

Gly Arg Tyr Leu Pro Asp Gly Thr Ile Glu Phe Leu Gly Arg Asp 1925 1930 1935

Asp Phe Gln Val Lys Ile Gly Gly His Arg Ile Glu Leu Gly Glu 1940 1945 1950

Ile Glu His Ala Leu Gly Ser His Pro Glu Leu Leu Asn Ala Val 1955 1960 1965

Val Ser Ala Pro Gly Glu Arg Asn Arg Gln Arg Leu Val Ala His 1970 1975 1980

Val Val Pro Ala Asp Pro Gly Thr Arg Asn Asp Ala Asp Phe Ala 1985 1990 1995

Asp Arg Leu Arg Asp His Leu Thr Thr Thr Leu Pro Ser Tyr Met 2000 2005 2010

Ile Pro Ser Asp Ile Val Leu Ile Asp Ala Met Pro Leu Ser Ala 2015 2020 2025

Asn Gly Lys Val Asp Arg Ser Ala Leu Pro Asp Pro Gln Arg Thr 2030 2035 2040

Gly Asp Ala Glu Ala Ala Ala Ser Ala Ala Glu Asp Asp Gly Glu 2045 2050 2055

Glu Ala Thr Gly Ala Leu Arg Thr Leu Leu Val Leu Ala Ala Asp 2060 2065 2070

Leu Leu Gly Val Asn Gly Pro Arg Pro Arg Asp Asn Phe Phe Glu 2075 2080 2085

Leu Gly Gly Asp Ser Ile Met Gly Val Gln Leu Val Gly Arg Ala 2090 2095 2100

Asn Ala Glu Gly Ile Pro Ile Thr Pro Gln Asn Leu Phe Glu Ser 2105 2110 2115

Thr Thr Phe Leu Glu Leu Ala Ala Ala Val Pro Val Glu Pro Gly 2120 2125 2130

Thr Asp Asp Thr Gly Glu Ala Val Ala Leu Thr Pro His Gln Thr 2135 2140 2145

Leu Ala His Ala Gln Val Gly Ser Val Leu Leu Asp Val Pro Asp 2150 2155 2160

Ala Phe Asp Pro Ala Ser Ala Ala Arg Ala Leu Asn Ala Leu Ala 2165 2170 2175

Asp Arg His Pro Ala Leu Arg Thr Arg Val Arg Thr Glu Asp Gly 2180 2185 2190

Gln Arg Phe Ala Val Arg Pro Gly Pro Gly Glu Asp Phe Asp Val 2195 2200 2205

Pro Glu Ile Asp Leu Ala Ala Leu Pro Asp Asp Val Arg Ala Glu 2210 2215 2220

Ala Val Ala Glu Met Ile Gly Glu Met Ala Gly Glu Val Asp Ile 2225 2230 2235

Glu Thr Gly Pro Ala Val Lys Phe Ala Val Phe Arg Leu Gly Glu 2240 2245 2250

Arg Gly Ser Val Leu Ala Cys Thr Ala Ala Gln Gly Leu Met Asp 2255 2260 2265

Asp Ala Ser Val Leu Leu Leu Cys Arg Glu Leu Ile Gln Ala Tyr 2270 2275 2280

Asp Arg Leu Ala Ala Gly Arg Pro Val Val Trp Ser Asp Gly Ala 2285 2290 2295

Gly Ser Pro Gln Ala Trp Asn Arg Gly Leu Arg Arg Lys Pro Ala 2300 2305 2310

His Pro Ala Gly Leu Ala Glu Asn Pro Gly Thr Ala Gly Glu Leu 2315 2320 2325

Pro Arg Gln Arg Asn Met Glu Leu Asp Ala Ala Arg Thr Ala Gly 2330 2335 2340

Leu Phe Thr Ala Ala Ala Gly Ser His His Leu Asp Pro Thr Glu 2345 2350 2355

Val Leu Val Ala Ala Ala Ser Ala Ala Leu Gly Arg Ala Leu Pro 2360 2365 2370

Glu Pro Pro Gln Leu Leu Val Glu Arg Ser Leu Arg Asp Asp Leu 2375 2380 2385

Ala Ala Gly Asp Glu Pro Ala Gly Arg Leu Val Gly Arg Thr Thr 2390 2395 2400

Glu Leu Arg Thr Val Gln Pro Val Ala Ala Gly Thr Pro Leu Asp 2405 2410 2415

Thr Ala Leu Thr Ser Val Lys Gly Arg Leu Arg Thr Ala Asp Pro 2420 2425 2430

Asp Pro Val Arg Gly Thr Thr Val Ala Val Arg Glu Val Val Thr 2435 2440 2445

Trp Asp Arg Val Glu Gly Ala Val Glu Val Pro Ala Asp Phe Ala 2450 2455 2460

Gly Val Thr Gly Leu Ala Gly Trp His Glu Glu Thr Val Gly Gln 2465 2470 2475

Leu Ser Ala Ala Val Val Asp Gly Ala Leu Arg Ile Arg Trp Gln 2480 2485 2490

Leu Ala Ala Ser Val Pro Glu Asp Ala Ala Thr Arg Leu Ala Asp 2495 2500 2505

Ala Phe Gly Thr Val Leu Gly Glu Ile Ala Glu His Cys Arg Arg 2510 2515 2520

Val Ala Glu Gly Ser Tyr Glu Pro Ser Asp Phe Pro Leu Ala Asp 2525 2530 2535

Leu Ser Gly Asp Glu Leu Ala Glu Phe Leu Asp Glu Leu Arg 2540 2545 2550

<210> 23

<211> 1091

<212> Proteína <213> Artificial

<220>

<223> Orf22 <400> 23 Val Thr Ala Thr Gly Asp Arg Arg Met Ala Thr Thr Ala Asp Gly Thr 1 5 10 15 Val Met Thr Asp Asn Gly Ala Thr Lys Arg Pro Val Arg Asp Ile Ala 20 25 30 Asp Ile Tyr Glu Leu Ser Pro Ile Gln Gln Gly Leu Leu Tyr Glu Gln 35 40 45 Leu Ala Gln Pro Gly Leu Gly Ile Tyr Val Glu Gln Leu Gly Leu Glu 50 55 60

Phe Ser Gly Thr Met His Pro Glu His Phe Glu Arg Ala Trp Gln Leu 65 70 75 80

Val Val Asp Arg His Pro Ile Leu Arg Thr Ser Phe His Trp Arg Lys

85 90 95 Asp Gly Ser Ala Val Gln Val Val His Gly Ser Ala Arg Leu Pro Leu 100 105 110 Glu Thr Leu Asp Trp Arg Asp Leu Asp Glu Arg Thr Gln Glu Glu Arg 115 120 125 Leu Arg Ala Ser Leu Asp Ala Glu Arg Ala Glu Gly Phe Asp Leu Thr

130 135 140

Asp Val Pro Leu Met Arg Ser Thr Leu Ile Arg Arg Gly Asp Glu Arg

145 150 155 160

Trp Thr Phe Ser Trp Arg Phe Ser His Leu Leu Met Asp Gly Trp Ser

165 170 175 Phe Thr Leu Ala Ile Gln Asp Phe Ile Asp His Tyr Arg Val Leu Cys 180 185 190 Arg Gly Gly Arg Pro Thr Leu Ser Pro Gly Arg Ser Tyr Arg Asp Tyr 195 200 205 Leu Ser Trp Trp Arg Asp Arg Asp Pro Glu Glu Ala Arg Glu Phe Trp

210 215 220

Arg Glu Glu Leu Ala Asp Tyr Arg Pro Val Glu Gln Val His Leu Gly

225 230 235 240

Gly Thr Gly Ile Pro Glu Gly Glu Pro Thr His Ala His Phe Glu Arg

245 250 255 Ile Leu Gly Asp Leu Ala Pro Arg Leu Thr Ala Leu Ala Arg Ala Glu 260 265 270 Gln Leu Thr Leu Ala Thr Leu Ala Gln Gly Ala Trp Phe Ile Val Leu 275 280 285 Gly Arg Phe Leu Gly Arg Thr Asp Leu Ala Cys Gly Ile Thr Met Ala 290 295 300

His Arg Pro Pro Asp Leu Val Gly Ser Gln Asp Ile Leu Gly Pro Met

305 310 315 320

Ile Ala Thr Leu Pro Leu Arg Arg Arg Leu Asp Pro Ala Met His Leu 325 330 335 Arg Ser Trp Leu Arg Glu Phe Gly Lys His Gly Ile Glu Ala Ser Gly 340 345 350 His Ser Ala Val Pro Leu Thr Glu Met Gln Ala Leu Leu Gly Thr Asp 355 360 365 Ser Ala Ile Pro Ile Leu Gln Ser Ser Val Ser Tyr Glu Asn Val Pro 370 375 380

Met Pro Asp Phe Asp Leu Ala Asp Val Gly Ala Glu Met Thr Glu Leu 385 390 395 400

Val Tyr Asp Gly Arg Pro His Phe Pro Ile Thr Met Val Ile Met Pro 405 410 415 Gly Ala Asp Met Pro Leu Arg Val Val His Asp Arg Arg Lys Val Ser 420 425 430 Asp Glu Val Ala Glu Arg Phe Ala Gly Glu Val Val Ser Val Leu Thr 435 440 445 Gln Met Ile Glu Arg Pro Asp Val Thr Leu Gly Glu Leu Thr Phe Leu

450 455 460

Ser Thr Pro Gln Pro Gly Asn Thr Pro Leu Thr Glu Pro Asp Ala Glu 465 470 475 480

Ser Leu His Glu Thr Phe Arg Arg His Ala Arg Leu Arg Pro Asp Ala

485 490 495 Thr Ala Val Arg Cys Gly Gly Arg Ala Leu Thr Tyr Arg Glu Leu Asp 500 505 510 Ala Tyr Ser Asp Arg Ile Ala Ala Thr Leu Arg Ala Arg Cys Pro Gly

515 520 525

Val Thr Arg Val Gly Leu Cys Leu Pro Arg Ser Ile Glu Leu Val Ala 530 535 540

Ala Met Ile Gly Val Phe Lys Ala Gly Ala Ala Tyr Val Pro Leu Asp

545 550 555 560

Pro Glu Tyr Pro Ala Asp Arg Leu Ala Asp Met Leu Ala Asp Ser Ala 565 570 575 Ala Glu Leu Val Leu Thr Asp Gly Ala Pro Ala Asp Ala Leu Thr Ala 580 585 590 Gly Lys Ala Gly Leu Val Thr Leu Pro Glu Met Asp Gly Glu Pro Asp 595 600 605 His Asn Ala Pro Pro Pro Val Pro Ala Asp Pro Asp Ala Pro Ala Tyr

610 615 620

Leu Leu Tyr Thr Ser Gly Ser Thr Gly Arg Pro Lys Gly Val Pro Ile 625 630 635 640

Thr His Arg Asn Val Gln Ser Leu Leu Ala Ala Gly Arg Glu Val Phe

645 650 655 Gly Phe Thr Ala Glu Asp Val Trp Thr Phe Ala His Ser Phe Ala Phe 660 665 670 Asp Tyr Ser Val Trp Glu Ile Trp Gly Ala Leu Gly Asn Gly Ala Ser 675 680 685

Leu Val Val Val Asp His Glu Thr Gly Arg Asp Pro Arg Ala Leu Ala 690 695 700

Arg Leu Ile Ala Glu Glu Arg Val Thr Val Leu Ser Glu Thr Pro Ala

705 710 715 720

Leu Phe Glu His Leu Val Pro Glu Leu Ala Asp Asp Thr Ser Leu Arg

725 730 735 Arg Val Phe Leu Gly Gly Asp Arg Leu Asp Pro Ala Ile Leu Arg Pro 740 745 750 Trp Phe Ala Arg Phe Gly Asp Arg Asp Gly Ala Gly Arg Glu Thr Thr 755 760 765 Gly Arg Ala Pro Ala Pro Gly Ile Glu Leu Tyr Asn Leu Tyr Gly Val

770 775 780

Thr Glu Ala Thr Val Val Ser Thr Tyr His Arg Val Arg Glu Glu Asp

785 790 795 800

Val Arg Ala Gly Arg Pro Val Pro Ile Gly Arg Ala Leu Pro Asn Gln 805 810 815 Arg Val Tyr Leu Leu Gly Glu Asp Asp Arg Pro Val Pro Val Gly Ala 820 825 830 Thr Gly Gln Leu Cys Val Ala Gly His Ala Val Ala Ser Gly Tyr His 835 840 845 Asp Arg Asp Gly Leu Thr Ala Glu Arg Phe Gly Ser Asp Pro Ser Ala 850 855 860

Gly Pro Ser Ser Ala Ala Phe Pro Leu Tyr Arg Thr Gly Asp Leu Ala 865 870 875 880

Thr Ala Thr Pro Asp Gly Glu Val His Phe Leu Gly Arg Ala Asp Thr

885 890 895 Gln Val Lys Val Arg Gly Phe Arg Val Glu Pro Gly Glu Ile Glu Ala 900 905 910 Ala Leu Arg Glu Thr Pro Gly Val Arg Ser Ala Thr Val Thr Val His 915 920 925 Gly Ser Gly Thr Ala Arg Arg Leu Val Gly Tyr Ala Val Pro Glu Asp

930 935 940

Pro Asp Ala Val Leu Thr Ala Gly Pro Val Pro Thr Glu Pro Leu Arg

945 950 955 960

Glu His Leu Arg Thr Arg Leu Pro Glu His Met Val Pro Ala Ala Val

965 970 975 Tyr Trp Ile Asp Arg Ile Pro Thr Thr Pro Gly Gly Lys Val Asp Val 980 985 990 Ala Ala Leu Pro Val Pro Asp Ala Gly Gly Thr Asp Arg Asn Thr Ala 995 1000 1005 Pro Met Thr Glu Ala Glu Arg Leu Leu Ala Gly Leu Leu Thr Glu 1010 1015 1020

Val Leu Gln Val Pro Asp Val Gly Ala Asp Asp Thr Leu Gly Ala 1025 1030 1035

Leu Gly Leu Asp Ser Leu Gly Ala Met Arg Leu Ala Ala Arg Leu 1040 1045 1050

Arg Gly Ala Tyr Ala Leu Asp Leu Ala Val Ser Asp Leu Pro Ala 1055 1060 1065

Thr Arg Thr Val Ala Glu Leu Ala Arg Ala Val Glu Ala Ala Arg 1070 1075 1080

Pro Val Ala Gly Glu Gly Ala Arg 1085 1090

<210> 24

<211> 394

<212> Proteína <213> Artificial

<220>

<223> clX5

<400> 24 Val Asn Ala Asn Pro Gly Gln Thr Pro Thr Asp Ala Thr Asp His Pro 1 5 10 15 Asp Ser Pro Glu Asn Gln Arg Trp Arg Glu Arg Ile Arg His Phe Ala 20 25 30 Glu Lys Glu Ile Ala Pro Leu Ser Thr Thr Met Asp Arg Thr Ala Thr 35 40 45 Leu Asp Ala Gly Leu Arg Glu Arg Leu Phe Ala Glu Gly Leu Met Ser

55 60

Val Glu Ile Pro Arg Gly Tyr Gly Gly Thr Gly Gly Thr Leu Cys Gln 65 70 75 80

Leu Ile Leu Thr Ile Glu Glu Val Ala Arg Val Asp Pro Gly Val Ala 85 90 95 Val Gly Val His Val His Asn Val Leu Val Ala Gly Thr Leu Leu Arg 100 105 110 His Ala Ser Gly Asp Gln Arg Arg Gln Tyr Leu Pro Gln Leu Ala Thr 115 120 125 Gly Lys Ile Gly Ala Phe Ala Leu Ser Glu Glu Gln Ala Gly Ser Asp

130 135 140

Ala Phe Ala Leu Thr Thr Val Ala Arg Gln Asp Glu Ala Gly Tyr Leu 145 150 155 160

Leu Thr Gly Arg Lys Arg Trp Thr Ser Asn Ala Arg Asn Ala Asp Leu 165 170 175 Leu Leu Val Phe Ala Leu Ala Asp Ala Gly Gly Pro Thr Ala Phe Val 180 185 190 Val Pro Ala Asp Ala Pro Gly Val Ser Leu Asp Asp Arg Val Gln Gln 195 200 205 Met Gly Val Arg Ala Ala Ala Thr Ser Asp Val Ile Phe Asp Gly Thr

210 215 220

Pro Val Arg Thr Ala Gln Arg Val Gly Pro Pro Gly Gly Gly Gln Thr

225 230 235 240

Val Ala Leu Ser Gly Leu Gly Leu Gly Arg Leu Gly Ile Ala Ala Gln 245 250 255 Met Thr Gly Leu Ala Gln Gly Ala Leu Asp Ala Ala Val Gly Tyr Ser 260 265 270 Arg Val Arg Glu Gln Phe Gly Gly Arg Ile Ala Asp His Gln Gly Val 275 280 285 Ala Phe Pro Leu Ala Asp Val Ala Ser Arg Leu Ala Ala Ala Arg Ala 290 295 300

Leu Leu Tyr Arg Ala Val Asp Leu His Gly Arg Gly Thr Asp Pro Val

305 310 315 320

Glu Leu Met Arg Leu Thr Ala Met Ala Lys Tyr Val Ala Ser Glu Val

325 330 335 Ala Glu Arg Ala Ala Ser Val Ala Val Glu Thr Leu Gly Gly Asn Gly 340 345 350 Tyr Thr Asp Ala Tyr Pro Val Glu Arg Phe Tyr Arg Asp Ala Lys Ala 355 360 365

Gly Lys Ile Tyr Glu Gly Thr Ser Asn Val Leu Leu Arg Thr Ile Ala 370 375 380

Ser Ile Leu Ile Gly Gly Ser Pro Gly Asp

385 390

<210> 25

<211> 242

<212> Proteína <213> Artificial

<220>

<223> orf23

<400> 25 Val Thr Glu Pro Leu Arg Leu Leu Cys Phe Pro Tyr Ala Gly Gly Asn 1 5 10 15 Ala Gln Thr Tyr Val Arg Trp Arg Arg His Leu Ala Pro Asp Ile Glu 20 25 30 Val Cys Pro Met Gln Leu Pro Gly His Gly Glu Arg Ile Gly Glu Pro 35 40 45 Pro Arg His Arg Trp Asp Asp Leu Leu Ala Asp Ile Arg Thr Arg Leu 50 55 60

Thr Asp Leu Thr Thr Pro Glu Asp Arg Pro Ile Ala Leu Phe Gly His 65 70 75 80

Ser Leu Gly Ala Leu Leu Ala Phe Glu Cys Ala Arg Ile Leu Val Ser

90 95 Glu His Gly Ile Arg Pro Ala Arg Leu Leu Val Ser Gly His Arg Ala 100 105 110 Pro His Leu Pro Leu Arg Glu Glu Thr Leu His His Leu Pro Asp Thr 115 120 125 Glu Phe Leu Thr Arg Leu Ser Glu Arg Ser Arg Thr Leu Arg Ala Leu 130 135 140

Thr Asp Pro Glu Phe Arg Lys Leu Leu Leu Pro Met Leu Arg Ala Asp

145 150 155 160

Phe Thr Ala Ser Glu Thr Tyr Thr Phe Arg Glu Gly Pro Thr Leu Thr

165 170 175 Cys Pro Ile Thr Ala Leu Gly Gly Glu Arg Asp Glu Asp Ala Thr Leu 180 185 190 Gly Glu Val Ala Ala Trp Gln Arg His Thr Thr Gly Arg Phe Glu Leu 195 200 205 Thr Ala Phe Pro Gly Asp His Phe Phe Ile Asp Asp Ala Trp Glu Ala 210 215 220 Val Val Thr Ala Val Gly Asp Arg Leu Arg Ser Arg Glu Gly Ser Thr 225 230 235 240 Pro Gly

<210> 26

<211> 407

<212> Proteína <213> Artificial

<220>

<223> orf25

<400> 26 Val Pro Pro Thr Met Pro Arg Arg Asn Pro Leu Glu Pro Gly Thr Met 1 5 10 15 Thr Asp Ser Thr Lys Asp Pro His Phe Leu Arg Asn Pro Tyr Pro Thr

20 25 30 Tyr Asp Ala Leu Arg Ser Ala Cys Pro Val Gln Pro Leu Gln Ala Gly 35 40 45 Ser Gly Glu Arg Pro Gly Tyr Leu Val Thr Gly Tyr Ala Glu Ala Arg 50 55 60 Glu Ala Leu Gly Asp Ala Arg Leu Ser Lys Asp Thr Ala Val Phe Phe 65 70 75 80 Ala Gly Arg Lys Ser Arg Arg Arg Leu His Pro Ala Val Ala Arg Thr 85 90 95 Met Leu Ala Ser Asp Pro Pro Arg His Thr Arg Leu Arg Lys Leu Val 100 105 110 Thr Lys Ala Phe Thr Arg Gly Ala Val Ala Gln Leu Arg Pro Phe Ile 115 120 125 Ala Arg Ile Thr Asp Asn Leu Leu Asp Gln Trp Pro Ala His Gly Pro 130 135 140

Phe Asp Val Val Ala Gly Leu Ala Val Pro Leu Pro Val Ile Val Ile 145 150 155 160

Cys Glu Leu Leu Gly Val Pro Pro Ala Asp Arg Pro Asp Val Gln Arg

165 170 175 Trp Ser Ala Gly Leu Phe Ala Ala Gly Glu Pro Gly Ile Ile Asp Ala 180 185 190 Ala Ser His Ala Met Ala Asp Tyr Met Thr Asp Leu Val Ala Asp Lys 195 200 205 Arg Lys Asn Pro Gly Arg Ser Phe Leu Asp Arg Leu Ile Ser Ala Arg

210 215 220

Asp Gly Asp Gly His Leu Thr Glu Glu Glu Leu Ile Ser Met Ala Val

225 230 235 240

Leu Leu Leu Val Ala Gly His Glu Thr Thr Thr Asn Ala Leu Gly Asn 245 250 255 Ala Leu Leu Ala Leu Leu Arg His Pro Ala Glu Leu Asp Arg Leu Arg 260 265 270 Gly Ser Pro Asp Glu Ile Pro Ala Ala Leu Asp Glu Leu Leu Arg Phe 275 280 285 Asp Ala Ala Val Ser Thr Ala Thr Phe Arg Phe Thr Ala Glu Ala Val

290 295 300

Thr Leu Gly Gly Thr Asp Val Pro Ala Asp Thr Pro Val Leu Val Ala 305 310 315 320

Leu Gly Ala Ala Asn Arg Asp Pro Thr Arg Phe Pro Ala Pro Asp Gln 325 330 335 Leu Asp Leu Asn Arg Asn Ala Thr Ala His Leu Ala Phe Gly His Gly 340 345 350 Ile His Arg Cys Val Gly Ala Pro Leu Ala Lys Ala Glu Leu Glu Ile 355 360 365 Ala Leu Arg Ala Val Leu Ala Arg Phe Pro Gly Ile Ser Leu Ala Val

370 375 380

Pro Ala Glu Leu Leu Glu Trp Arg Arg Thr Arg Leu Val Arg Gly Leu 385 390 395 400

Val Ser Leu Pro Val Leu Ala 405

<210> 27

<211> 201

<212> Proteína <213> Artificial

<220>

<223> orf26

<400> 27

Val Thr Arg Asp Gly Ser Ser Pro Ser Pro Ala Ser Thr Asp His Lys 1 5 10 15

Pro Gly Val Thr Arg Tyr Ala Glu Ala Leu Thr Thr Thr Leu Ser Phe 20 25 30 Met Thr Asp Val Ser Leu Gly Asp Glu Leu Thr Arg Arg Val Glu Asp 35 40 45 Leu Glu Glu Arg Val Ser Leu Leu Phe Gln Glu His Pro Ala Ala Glu 50 55 60

Thr Asn Asn Gly Val Gly Thr Ala Asp His Val Ala Glu Ala Glu Asp

70 75 80

Pro Arg Gln Leu Ala Thr Arg Leu Leu Ala Arg Ser Asp Pro Asp Val 85 90 95 Leu Ala Ala Arg Leu Asn Ala Leu Gly His Pro Val Arg Leu Arg Ile 100 105 110 Leu Leu Ala Cys Leu Asp Gly Pro Arg Arg Ala Ala Glu Leu Ala Ala 115 120 125 Gln Thr Asp Met Gly Ser Thr Gly Gln Ile Tyr His His Leu Arg Gln 130 135 140

Leu Val Asn Gln Gly Trp Leu Ser Ala Ser Arg Arg Gly His Tyr Glu 145 150 155 160

Val Pro Arg Glu Ala Leu Glu Val Val Ala Ala Val Leu Ala Ala Thr

165 170 175 Phe Trp Asp Gly Asn Ala Asp Pro Thr Ser Ser Glu Ala Ala Ser Gly 180 185 190 Ala Arg Thr Ser Ala Ala Ser Ser Thr 195 200

<210> 28

<211> 1362

<212> Proteína <213> Artificial

<220>

<223> orf27

<400> 28 Val Ser Cys Cys Cys Arg Ser Arg Ala Val Ser Arg Asp Arg Thr Val 1 5 10 15 Ser Ala Ala Pro Gly Val Arg Ala His Phe Ser Arg Ala Ile Asp Arg

20 25 30 Gly Arg Lys Leu Ser Gln Gly Ser Ala Pro Ser Thr Val Arg Thr Ala 35 40 45 Asp Arg Leu Pro Ala Thr Ile His Arg Leu Val Glu Ser Gln Ala Glu 50 55 60

Arg Ala Pro Asp Ala Val Ala Val Glu Thr Glu Asp Gly Lys Leu Thr

70 75 80

Tyr Arg Glu Leu Asp Ala Arg Ala Asn Gln Phe Ala Arg His Leu Arg

90 95 Ser Ala Gly Val Arg Gly Glu Ser Leu Val Ala Val His Met Glu Arg 100 105 110 Gly Leu Leu Thr Pro Val Val Leu Leu Gly Ile Leu Lys Thr Gly Ala 115 120 125 Ala Tyr Leu Pro Leu Asp Thr Glu Ser Pro Ala Glu Arg Leu Ala Ala 130 135 140

Val Leu Ala Asp Ala Ala Pro Ala Ala Val Val Thr Ala Gly Pro Leu 145 150 155 160

Pro Pro Val Ala Val Pro Leu Ile Asp Leu Asp Thr Asp Leu Pro Ala 165 170 175 Ile Thr Ala Leu Pro Ala Glu Pro Leu Thr Asp Val Glu Glu Pro Gly 180 185 190 Pro Asp Arg Leu Ala Tyr Val Met Phe Thr Ser Gly Ser Thr Gly Val 195 200 205 Pro Lys Gly Val Leu Val Glu His Arg Ala Val Ile Arg Leu Ile Arg

210 215 220

Glu Gln Ser Tyr Ala Arg Leu Gly Pro Asp Ala Thr His Leu Leu Leu 225 230 235 240

Ala Pro Leu Ala Phe Asp Ala Ser Thr Leu Glu Ile Trp Gly Ala Leu 245 250 255 Ala His Gly Gly Arg Leu Val Val Ala Ala Pro Gly Ala Arg Thr Val 260 265 270 Asp Gln Leu Gly Arg Thr Leu Ala Asp Arg Arg Val Thr Thr Leu Trp 275 280 285 Leu Thr Ala Ser Leu Phe Asn Leu Val Val Asp Glu Asp Pro Ser Ile 290 295 300 Leu Ala Gly Val Gly Asp Leu Leu Ile Gly Gly Glu Ala Leu Ser Val

305 310 315 320 Asn His Val Arg Thr Ala Arg Lys Ala Leu Pro Asp Thr Val Val Thr 325 330 335 Asn Gly Tyr Gly Pro Thr Glu Thr Thr Thr Phe Ala Cys Thr His Ala 340 345 350 Ile Arg Pro Gln Asp Leu Asp Gly Ala Ser Ile Pro Ile Gly Gly Pro 355 360 365 Ile Ala His Thr Glu Val His Val Leu Asp Glu Asp Phe Asp Pro Val

370 375 380

Ala Pro Gly Glu Ala Gly Glu Leu Phe Ile Gly Gly Pro Arg Leu Ala 385 390 395 400

Arg Gly Tyr Leu Asn Arg Pro Gly Leu Thr Ala Glu Arg Phe Val Ala 405 410 415 His Pro Ala Ala Thr Glu Pro Gly Ser Arg Leu Tyr Arg Thr Gly Asp 420 425 430 Arg Val Arg Val Arg Pro Asp Gly Thr Leu Glu Tyr Leu Gly Arg Leu 435 440 445 Asp Asp Gln Val Lys Leu Arg Gly Phe Arg Ile Glu Pro Gly Glu Val

450 455 460

Arg Ala Gly Leu Thr Gly Leu Pro Gln Val Arg Asp Ala Val Val Val

465 470 475 480

Ala Arg Gly Gly Pro Ser Asp Arg Arg Leu Val Ala Tyr Val Val Pro 485 490 495 Glu Ala Asp Ala Thr Ala Gly Met Asp Asn Glu Arg Glu Gln Val Ala 500 505 510 Asp Trp Glu Ala Val Phe Asp Glu Thr Tyr Arg Asp Gly Val Gly Ala 515 520 525 Ala Glu Gly Arg Trp Glu Leu Ser Gly Trp Val Gly Ser Gly Asp Gly

530 535 540

Leu Pro Val Pro Ala Asp Gln Met Arg Glu Trp Thr Asp Ala Thr Val

545 550 555 560

Glu Arg Ile Arg Ala Leu Gly Ala Arg Arg Val Leu Glu Ile Gly Cys

565 570 575 Gly Thr Gly Leu Leu Ala Met Arg Leu Ala Pro Asp Ala Glu Arg Tyr 580 585 590 Val Gly Ser Asp Leu Ser Ala Val Ala Ile Arg Arg Leu Arg Ala Gln 595 600 605 Met Asp Ala Ala Gly Leu Asp His Thr Glu Leu Val His Ala Pro Ala 610 615 620

Asp Asp Leu Asp Ala Val Pro Gly Gly Thr Phe Asp Val Val Val Leu 625 630 635 640

Asn Ser Ile Val Gln Tyr Leu Pro Ser Ala Gln Tyr Leu Arg Glu Val

645 650 655 Ile Glu Arg Ala Ala Ala Arg Leu Ala Pro Gly Gly His Leu Phe Val 660 665 670 Gly Asp Val Arg Ser Leu Pro Leu Leu Asp Ala Phe His Leu Ser Ala 675 680 685 Glu Leu Lys Arg Gly His Glu Asp Ala Val Pro Leu Ala Thr Leu Ala 690 695 700

Glu Ala Val Arg Glu Arg Ala Ala Ala Glu Lys Glu Leu Val Val Ala 705 710 715 720

Pro Ser Phe Phe Thr Asp Leu Ser Gly Arg Ala Gly Ile Asp His Val

725 730 735 Arg Val Thr Pro Arg Arg Gly Arg His Arg Asn Glu Met Thr Gln Phe 740 745 750 Arg Tyr Asp Ala Val Leu Arg Val Arg Gly Ala Glu Pro Ala Arg Val 755 760 765

Pro Asp Arg Trp Leu Asp Trp Arg Asp Glu Gly Leu Thr Leu Glu Asp 770 775 780

Val Ala Arg Ile Leu His Asp Gln Arg Pro Gln His Leu Ala Leu Arg

785 790 795 800

Gly Val Thr Asp Ala Arg Val Ala Asp Glu Val Ala Arg Leu Val Arg

805 810 815 Leu Arg Glu Asp Ala Glu Gly Thr Val Ala Ala Leu Arg Glu Thr Gly 820 825 830

His Asp Gly Pro Ala Val Glu Ile Asp Asp Val Tyr Asp Leu Ala Ala 835 840 845

Arg Ala Ser Tyr Thr Val Asp Val Ser Val Ala Gly Ser Ala Ala Gly 850 855 860

Asp Ala Phe Asp Val Ile Leu Trp Thr Asp Ala Glu Pro Gly Pro Val

865 870 875 880

Ala Phe Ala Pro Gly Pro Ala Glu Ala Arg Gly Pro Arg Thr Ser Met

885 890 895 Pro Leu Ala Thr Ala Thr Ser Arg His Leu Thr Thr Leu Val Arg Asp 900 905 910 Ser Leu Arg Glu Leu Leu Pro Ala Tyr Met Ile Pro Ala Val Phe Val 915 920 925 Phe Met Asp Ala Leu Pro Leu Thr Ser Thr Gly Lys Ile Asp Arg Ser

930 935 940

Ala Leu Pro Glu Pro Pro Arg Arg Thr Ser Ala Gly Gly Ala Gly Arg

945 950 955 960

Arg Ala Ala Thr Ala Thr Glu Arg Ala Leu Glu Pro Leu Trp Arg Asp

965 970 975 Leu Leu Ala Leu Glu Thr Val His Val Asp Asp Asp Phe Phe Ala Leu 980 985 990 Gly Gly His Ser Leu Leu Gly Thr Arg Leu Leu Ser Arg Val Arg Gly 995 1000 1005 Leu Trp Gly Val Glu Leu Ser Leu Ala Ala Leu Phe Ser Ala Pro 1010 1015 1020 Thr Leu Gly Ala Leu Ala Ala Arg Ile Asp Ser Ala Arg Gln Asp 1025 1030 1035 Thr Pro Ala Leu Pro Gly Thr Leu Ala Asp Lys Ala Asp Pro Gly 1040 1045 1050 Ser Ala Pro Pro Leu Ser Pro Ala Gln His Arg Leu Trp Leu Val 1055 1060 1065 Glu Gln Leu Thr Pro Gly Asn Pro Arg Tyr Thr Val Pro Val Ala 1070 1075 1080 Tyr Arg Met Arg Gly Pro Ile Asp Thr Ala Ala Leu Gln Ala Ala 1085 1090 1095 Leu Asp Thr Leu Val Ala Arg His Glu Val Leu Arg Thr Thr Phe 1100 1105 1110 Pro Ser His Asp Gly Thr Pro Arg Gln Val Val Ala Pro Ser Gly 1115 1120 1125 Arg Ile Pro Ile Glu Arg Ala Asp Val Gly Gly Glu Gly Ala Asp 1130 1135 1140 Ala Pro Ala Ala Ala His Asn Ile Leu Thr Arg Gln Ala Ser Arg 1145 1150 1155 Trp Leu Asp Val Gln Ser Gly Pro Leu Ala Ala Ala Thr Leu Val 1160 1165 1170 Arg Leu Ala Glu Asp Asp His Val Leu Cys Leu Thr Leu His His 1175 1180 1185 Met Ile Cys Asp Gly Trp Ser Leu Asp Leu Leu Ala Ala Glu Leu 1190 1195 1200 Ser Glu Gly Tyr Asn Ala Arg Val Ala Arg Arg Thr Pro Gln Leu 1205 1210 1215 Pro Glu Ile His His His Thr Arg Thr Arg Tyr Arg Thr Gly His 1220 1225 1230

Pro Leu Gln Gln His His Leu Thr Glu Pro Ala Leu Ile Arg Ser 1235 1240 1245

Leu Ser His His Gln Pro His Thr Gly Ile Arg Gln His Glu Leu 1250 1255 1260

Asp Thr Val Arg Arg Ile Ala Arg Ile Leu Pro Ser Pro Ser Ser 1265 1270 1275

Thr Thr Cys Pro Pro Arg Pro Arg Gly Gly Arg Arg Ser Glu Glu 1280 1285 1290

Ala Val Arg Arg Val Pro Asn Ala Ser Arg Thr Ala Ala Ile Ser 1295 1300 1305

Ser Gly Ser Arg Ser Arg Pro Ala Met Pro Glu Arg Ile Arg Val 1310 1315 1320

His Gln Leu Val Pro Val Gln Ala Ser Leu Glu Ala Ala Phe Met

1325 1330 1335

Glu Leu Thr Arg Thr Ser Val Glu Tyr Gln Ala Arg Thr Ser Thr 1340 1345 1350

Gly Ala Pro Asp Ala Glu Val Arg Ser 1355 1360

<210> 29

<211> 17

<212> ADN

<213> Artificial

<220>

<223> L2ATFW2

<400> 29 5’-CTCGGCGACATCGGGTG-3’

<210> 30

<211> 18

<212> ADN

<213> Artificial

<220>

<223> L2ATFV2

<400> 30 5’-CAGGTGCARNGGGAAGTA-3’

<210> 31

<211> 17

<212> ADN

<213> Artificial

<220>

<223> L2ATson1

<400> 3.

5. ATCACCACCCTCGGCGA-3'

<210> 32

<211> 16

<212> ADN

<213> Artificial

<220>

<223> L2ATson2

<400> 3.

5. ATGCCGTCGGCCGGGT-3'

<210> 33

<211> 18

<212> ADN

<213> Artificial

<220>

<223> 1c35FW

<400> 3.

5. TCCCCGCCGGACAGACTG-3'

<210> 34

<211> 20

<212> ADN

<213> Artificial

<220>

<223> Heldisr1

<400> 3.

5. GGTCAGCACGTAGGAATCCC-3'

<210> 35

<211> 21

<212> ADN

<213> Artificial

<220>

<223> Heldisr2

<400> 3.

5. ACCTGAAACCCGCCAATGTCA-3'

<210> 36

<211> 22

<212> ADN

<213> Artificial

<220>

<223> orf22del1

<400> 3.

5. GAATTCCGGTGAATCGTCGCGG-3'

<210> 37

<211> 21

<212> ADN

<213> Artificial

<220>

<223> orf22del2

<400> 3.

5. AAGCTTCGGCACGCCTCGCGT-3'

<210> 38

<211> 23

<212> ADN

<213> Artificial

<220>

<223> orf22del3

<400> 3.

5. GATATCGCGTAGCCCGTCCACGG-3'

<210> 39

<211> 24

<212> ADN

<213> Artificial <220>

<223> orf22del4

<400> 3.

5. GATATCGCTATCTGCTGACCGGCC-3'

<210> 40

<211> 26

<212> ADN

<213> Artificial

<220>

<223> orf16del1

<400> 4.

5. GAATTCGCCCCTCGCTCATGCCGTAG-3'

<210> 41

<211> 28

<212> ADN

<213> Artificial

<220>

<223> orf16del2

<400> 4.

5. AAGCTTTGAGCACTGCGCCGATCACTAC-3'

<210> 42

<211> 18

<212> ADN

<213> Artificial

<220>

<223> 1c35RV

<400> 4.

5. AGGCGCTCCTTCAGCTCG-3'

<210> 43

<211> 29

<212> ADN

<213> Artificial

<220>

<223> orf16del3

<400> 4.

5. GGATCCACCGATGAGCGAAACTGAGCACT-3'

<210> 44

<211> 27

<212> ADN

<213> Artificial

<220>

<223> orf16del4

<400> 4.

5. GATATCCAGGAAGAGCCGCTGCTGTGG-3'

<210> 45

<211> 24

<212> ADN

<213> Artificial

<220>

<223> orf8del3

<400> 4.

5. GGATCCCACCCCCTGACAGCCTTC-3'

<210> 46

<211> 24

<212> ADN

<213> Artificial

<220>

<223> orf8del4

<400> 4.

5. GATATCCGCGGTGTTCCTGCTCGA-3'

<210> 47

<211> 24

<212> ADN

<213> Artificial

<220>

<223> orf8del1

<400> 4.

5. GAATTCGTACTCCACCGCCGCGTA-3'

<210> 48

<211> 24

<212> ADN

<213> Artificial

<220>

<223> orf8del2

<400> 4.

5. AAGCTTGACCGTCCTCGTCCACGT-3'

<210> 49

<211> 24

<212> ADN

<213> Artificial

<220>

<223> orf6del1

<400> 4.

5. GAATTCTCTCGTGCATTCGCCGCG-3'

<210> 50

<211> 30

<212> ADN

<213> Artificial

<220>

<223> orf6del2b

<400> 5.

5. AAGCTTTCCGAAGATCGCTGATATCACGGT-3'

<210> 51

<211> 18

<212> ADN

<213> Artificial

<220>

<223> orf6del3

<400> 5.

5. GCCCGATATCGCGGCCTG-3'

<210> 52

<211> 25

<212> ADN

<213> Artificial

<220>

<223> orf6del4

<400> 5.

5. GATATCCGCTCGTCGCCCAGGACAG-3'

<210> 53

<211> 24

<212> ADN

<213> Artificial

<220>

<223> orf101del1

<400> 5.

5. GAATTCAAGGGAATCCGCTGGCCG-3'

<210> 54

<211> 25

<212> ADN

<213> Artificial

<220>

<223> orf101del2

<400> 5.

5. AAGCTTCGTCCGATGGGTCAACTGG-3'

<210> 55

<211> 24

<212> ADN

<213> Artificial

<220>

<223> orf101del3

<400> 5.

5. GATATCAAGGCGGTCGTCGTCGGC-3'

<210> 56

<211> 25 5 <212> ADN

<213> Artificial

<220>

<223> orf101del4

<400> 5.

5. GATATCCTTCCCGGACGTCGTGGAC-3'


 

Patentes similares o relacionadas:

Compuestos y procedimientos de uso, del 29 de Julio de 2020, de Medivation Technologies LLC: Un compuesto de fórmula (Aa-1): **(Ver fórmula)** o una sal farmacéuticamente aceptable del mismo, en la que: A representa H, halógeno, amino, […]

Complejos de agentes terapéuticos de base vírica y poli(beta-amino-ésteres) modificados, del 29 de Julio de 2020, de Sagetis Biotech, SL: Un complejo de un agente terapéutico de base vírica con un polímero de fórmula I: **(Ver fórmula)** donde cada L1 y L2 están seleccionados […]

Compuestos de alquinilbenceno heterocíclicos, y composiciones médicas y usos de los mismos, del 29 de Julio de 2020, de Guangzhou Healthquest Pharma Co., Ltd: Un compuesto de alquinilbenceno heterocíclico que tiene la fórmula (I) y una sal farmacéuticamente aceptable, o estereoisómero del mismo, **(Ver […]

Imagen de 'Composiciones y métodos para el tratamiento de trastornos de…'Composiciones y métodos para el tratamiento de trastornos de células plasmáticas y trastornos prolinfocíticos de células b, del 29 de Julio de 2020, de Knopp Biosciences LLC: Una composición que comprende una cantidad terapéuticamente eficaz de dexpramipexol para su uso en el tratamiento de un trastorno de células B caracterizado por niveles elevados […]

Derivados de piperidina 1,4 sustituidos, del 29 de Julio de 2020, de 89Bio Ltd: Un compuesto de acuerdo con la Fórmula I: **(Ver fórmula)** o una sal farmacéuticamente aceptable del mismo, en donde: A se selecciona de […]

Profármacos de gemcitabina, del 29 de Julio de 2020, de NuCana plc: Una formulación farmacéutica que comprende gemcitabina-[fenil-benzoxi-L-alaninil)]-(S)-fosfato 3: **(Ver fórmula)** o una sal o solvato […]

Formas cristalinas de 6-((6,7-dimetoxiquinazolin-4-il)oxi)-N,2-dimetilbenzofuran-3-carboxamida, del 29 de Julio de 2020, de Hutchison Medipharma Limited: Forma I de 6-((6,7-dimetoxiquinazolin-4-il)oxi)-N,2-dimetilbenzofuran-3-carboxamida, en donde el difractograma de rayos X de polvo de la Forma […]

Derivado heteroarilo o sal farmacéuticamente aceptable del mismo, método de preparación del mismo y composición farmacéutica para prevenir o tratar enfermedades asociadas con PI3 quinasas, que contiene el mismo como principio activo, del 22 de Julio de 2020, de KOREA RESEARCH INSTITUTE OF CHEMICAL TECHNOLOGY: Un compuesto representado por la fórmula 1, un isómero óptico del mismo o una sal farmacéuticamente aceptable del mismo: **(Ver fórmula)** en la fórmula […]

Utilizamos cookies para mejorar nuestros servicios y mostrarle publicidad relevante. Si continua navegando, consideramos que acepta su uso. Puede obtener más información aquí. .