NUEVA HIDROXIFENILACETALDEHÍDO DESHIDROGENASA, ÁCIDO NUCLEICO QUE LA CODIFICA Y VECTORES Y MICROORGANISMOS RECOMBINANTES QUE LA EXPRESAN.

Nueva hidroxifenilacetaldehído deshidrogenasa, ácido nucleico que la codifica y vectores y microorganismos recombinantes que la expresan.

La nueva hidroxifenilacetaldehído deshidrogenasa forma parte de una vía bacteriana de degradación de tiramina y/o dopamina hasta ácido pirúvico y ácido succínico desconocida hasta ahora. Actúa tras la tiramina oxidasa que transforma tiramina o dopamina en 4-hidroxifenilacetaldehído y 3,4-dihidroxifenilacetaldehído, generando a partir de ellos ácido 4-hidroxifenilacético y 3,4-hidroxifenilacético, respectivamente. Así, puede usarse en procedimientos dirigidos a disminuir la cantidad de tiramina y/o dopamina, por ejemplo en alimentos. La invención se refiere también a las moléculas de ácido nucleico que codifican la enzima, los vectores que permitan su expresión y a microorganismos recombinantes transformados con dichos vectores.

Tipo: Patente de Invención. Resumen de patente/invención. Número de Solicitud: P201130649.

Solicitante: BIOGES STARTERS S.A..

Nacionalidad solicitante: España.

Inventor/es: ARCOS RODRIGUEZ,MARIO, RODRIGUEZ OLIVERA,ELIAS, NAHARRO CARRASCO,GERMAN, LUENGO RODRIGUEZ,JOSE MARIA.

Fecha de Publicación: .

Clasificación Internacional de Patentes:

  • A23B4/22 NECESIDADES CORRIENTES DE LA VIDA.A23 ALIMENTOS O PRODUCTOS ALIMENTICIOS; SU TRATAMIENTO, NO CUBIERTO POR OTRAS CLASES.A23B CONSERVACION, P.EJ. MEDIANTE ENLATADO, DE CARNE, PESCADO, HUEVOS, FRUTAS, VERDURAS, SEMILLAS COMESTIBLES; MADURACION QUIMICA DE FRUTAS Y VERDURAS; PRODUCTOS CONSERVADOS, MADURADOS O ENLATADOS.A23B 4/00 Métodos generales de conservación para carne, embutidos, pescado o productos a base de pescado. › Microorganismos; Enzimas.
  • A23C19/097 A23 […] › A23C PRODUCTOS LACTEOS, p. ej. LECHE, MANTEQUILLA, QUESO; SUCEDANEOS DE LA LECHE O DEL QUESO; SU FABRICACION (obtención de composiciones a base de proteínas para la alimentación A23J 1/00; preparación de péptidos, p. ej. de proteinas, en general C07K 1/00). › A23C 19/00 Queso; Preparados a base de queso; Fabricación de estos productos (sucedáneos del queso A23C 20/00; caseína A23J 1/20). › Conservación.
  • A23L1/015
  • C12N9/02 QUIMICA; METALURGIA.C12 BIOQUIMICA; CERVEZA; BEBIDAS ALCOHOLICAS; VINO; VINAGRE; MICROBIOLOGIA; ENZIMOLOGIA; TECNICAS DE MUTACION O DE GENETICA.C12N MICROORGANISMOS O ENZIMAS; COMPOSICIONES QUE LOS CONTIENEN; PROPAGACION, CULTIVO O CONSERVACION DE MICROORGANISMOS; TECNICAS DE MUTACION O DE INGENIERIA GENETICA; MEDIOS DE CULTIVO (medios para ensayos microbiológicos C12Q 1/00). › C12N 9/00 Enzimas, p. ej. ligasas (6.); Proenzimas; Composiciones que las contienen (preparaciones para la limpieza de los dientes que contienen enzimas A61K 8/66, A61Q 11/00; preparaciones de uso médico que contienen enzimas A61K 38/43; composiciones detergentes que contienen enzimas C11D ); Procesos para preparar, activar, inhibir, separar o purificar enzimas. › Oxidorreductasas (1.), p. ej. luciferasa.

PDF original: ES-2387150_A1.pdf

 


Fragmento de la descripción:

Nueva hidroxifenilacetaldehído deshidrogenasa, ácido nucleico que la codifica y vectores y microorganismos recombinantes que la expresan

CAMPO TÉCNICO DE LA INVENCIÓN.

El campo técnico de la invención pertenece a la Biotecnología. La invención se refiere a una nueva hidroxifenilacetaldehído deshidrogenasa. Forma parte de una via de degradación bacteriana de degradación de tiramina y dopamina, desconocida hasta ahora, y lleva a cabo la transformación de los compuestos generados por dicha vía a partir de tiramina y dopamina en los ácidos 4-hidroxifenilacético y 3, 4-dihidroxifenilacético respectivamente, compuestos que, al ser degradados por las enzimas de otro cluster complementario, son finalmente degradados en ácido pirúvico y ácido succínico. Por tanto, la enzima puede usarse en procedimientos para disminuir el contenido de tiramina y/o dopamina en muestras que las contengan, preferentemente alimentos y bebidas.

ESTADO DE LA TÉCNICA

Aminas biogénicas. Aspectos generales

Las aminas son compuestos químicos derivados del amoníaco que resultan de la sustitución de los hidrógenos de esa molécula por radicales alquilo. Según se sustituyan uno, dos o tres hidrógenos, las aminas serán primarias, secundarias

o terciarias. Cuando son originadas como consecuencia de la actividad de organismos vivos y poseen actividad biológica (cumplen importantes funciones en las células) reciben el nombre de aminas biogénicas o biogénicas. En función del número de grupos amino presentes en la molécula podemos diferenciar, monoaminas, diaminas y poliaminas. Las monoaminas alifáticas están muy extendidas en la naturaleza donde también es abundante la diamina putrescina, mientras que las poliamidas espermidina y espermina son producidas por animales, por plantas y por la mayoría de las bacterias (1) .

Las aminas aromáticas, originadas por descarboxilación de aminoácidos, son las aminas más comunes en los alimentos (histamina, 2-feniletilamina, tiramina, etc.) y también tienen gran importancia como transmisores dentro del sistema nervioso central (dopamina, noradrenalina, epinefrina, serotonina, etc.) .

Podemos hacer una distinción entre aminas biogénicas endógenas, que son aquellas que son sintetizadas en diferentes tejidos de los organismos superiores (como por ejemplo la adrenalina producida en la médula adrenal o la histamina en los mastocitos) y aminas biogénicas exógenas, que son las ingeridas en la dieta. Estas aminas biogénicas exógenas pueden estar presentes en los alimentos de origen vegetal (frutas y hortalizas) , o bien pueden aparecer en los alimentos como consecuencia de la actividad microbiana durante el procesado (cura de carnes y quesos) o durante el almacenaje de los mismos. Debido a que pueden provocar efectos nocivos tanto en el hombre como en los animales, son consideradas sustancias tóxicas.

Las aminas biogénicas más importantes que pueden encontrarse en los alimentos son la histamina, la putrescina, la cadaverina, la tiramina, la triptamina, la feniletilamina, la espermina y la espermidina; y los alimentos que las contienen pueden ser muy variados (pescado, carne, huevos, quesos, bebidas fermentadas, etc.) (2 ) .

Afortunadamente, los organismos cuentan con diferentes sistemas naturales de destoxificación (monoaminooxidasa -MAO- o la diaminooxidasa –DAO-) que les permiten eliminar las aminas biogénicas, evitando los efectos perjudiciales causados por estos compuestos. Sin embargo, puede haber casos en que estos sistemas no funcionan correctamente,

o se encuentran inhibidos por la acción de determinados fármacos, por lo que la presencia de aminas biogénicas en los alimentos puede suponer un grave problema para la salud.

Por todas estas razones es muy interesante seleccionar microorganismos que al ser utilizados en los procesos de elaboración de alimentos (curados, fermentaciones, etc.) , no acumulen aminas biogénicas, o que lo hagan en concentraciones que no sean peligrosas para la salud. La Ingeniería Genética y la Ingeniería Metabólica podrían contribuir a obtener este tipo de cepas asegurando, además, que se conserven otra serie de propiedades y características que son necesarias para mantener los estándares de identidad y calidad de los alimentos.

Las aminas biogénicas como neurotransmisores

Desde hace décadas se tiene constancia de que la transmisión catecolaminérgica está mediada por aminas biogénicas entre las que se incluyen las catecolaminas (dopamina, noradrenalina y adrenalina) derivadas del aminoácido tirosina; la indolamina serotonina, sintetizada a partir del triptófano; y la histamina, producida a partir del aminoácido histidina.

Catecolaminas

Bajo el término catecolaminas se engloban todas aquellas aminas biogénicas derivadas de la tirosina que contienen un grupo catecol y un grupo amino en su molécula. El primer paso en la síntesis de catecolaminas está catalizado por la enzima tirosinahidroxilasa mediante una reacción que requiere oxigeno como substrato y tetrahidrobiopterina como cofactor, y permite obtener como producto final dihidroxifenilalanina (DOPA) (Figura 1) . Por lo tanto, la tasa de tirosinahidroxilasa va a ser el factor limitante para la síntesis de las tres aminas neurotransmisoras catecolaminérgicas (dopamina, noradrenalina y adrenalina) .

La dopamina se produce por la descarboxilación de L-DOPA. Esta reacción se lleva a cabo por la enzima DOPA descarboxilasa. El área del cerebro donde se encuentra en mayor abundancia es en el corpus estriatum, jugando un papel esencial en la coordinación de los movimientos corporales (3) . En pacientes que padecen la enfermedad de Parkinson, por ejemplo, se ha observado degeneración de las neuronas dopaminérgicas, lo que va a dar lugar a la característica disfunción motora asociada a esta enfermedad (4) .

La noradrenalina, también llamada norepinefrina, requiere para su síntesis, a partir de dopamina, la acción de la dopamina-β-hidroxilasa. Esta catecolamina se produce mayoritariamente en las neuronas de los ganglios simpáticos y su acción está relacionada con el sueño, la vigilia, la atención y la conducta.

La adrenalina, también llamada epinefrina, está presente en el cerebro en niveles más bajos que las otras dos catecolaminas. La enzima que sintetiza la adrenalina, la feniletanolamina-N-metiltransferasa, se localiza solo en las neuronas secretoras de esta catecolamina.

Las enzimas más importantes en el catabolismo de catecolaminas son la monoaminooxidasa (MAO) y la catecol Ometiltransferasa (COMT) (5) . Estas enzimas se encuentran respectivamente en las mitocondrias y en el citoplasma tanto de las células neuronales como de las gliales. Los inhibidores de estas enzimas se utilizan en clínica como antidepresivos (6) .

Histamina

Esta amina biogénica neurotransmisora se produce por descarboxilación de la histidina debido a la acción de la histidinadescarboxilasa (Figura 2ª) . En su metabolismo intervienen tanto la histidinametiltransferasa como la MAO. La mayor concentración de este neurotransmisor se encuentra en las neuronas del hipotálamo y su acción está relacionada con los procesos de alerta y atención. La histamina también es liberada por los macrófagos en respuesta a reacciones alérgicas o a daños en los tejidos.

Serotonina

Esta indolamina, también llamada 5-hidroxitriptamina, se sintetiza en las neuronas a partir del triptófano ingerido con los alimentos tras ser hidroxilado a 5-hidroxitriptófano mediante una reacción catalizada por la enzima triptófano-5hidroxilasa. Posteriormente, el 5-hidroxitriptófano se descarboxila por medio de la acción de una 5-hidroxitriptofano descarboxilasa para dar lugar a la serotonina (Figura 2B) . La principal enzima encargada de su degradación es la MAO, al igual que sucede en las demás aminas biogénicas. La serotonina está implicada en la regulación del sueño y de la vigilia.

Además de las monoaminas neurotransmisoras, existen otras aminas biogénicas que poseen una estructura molecular parecida y que actúan como neuromoduladores o “falsos neurotransmisores”. Estas aminas endógenas, también denominadas aminas “traza” o microaminas, se encuentran en pequeñas cantidades en el sistema nervioso central y su

estudio está adquiriendo una importante relevancia en los últimos años.

Aminas... [Seguir leyendo]

 


Reivindicaciones:

1. Una molécula de ácido nucleico aislada que comprende una secuencia que codifica una proteína capaz de actuar como 4-hidroxifenilacetaldehído deshidrogenasa, seleccionada del grupo que consiste en:

a) una secuencia de ácido nucleico que es idéntica; al menos en un 60%, a la secuencia representada por SEQ ID NO:7;

b) una secuencia de ácido nucleico que comprende una secuencia que hibrida en condiciones estrictas con las secuencias de a) ;

c) una secuencia de ácido nucleico que comprende una secuencia que codifica un polipéptido cuya secuencia de aminoácidos es idéntica, al menos en un 60%, a la secuencia representada por SEQ ID NO:8;

d) una secuencia de ácido nucleico que comprende una secuencia que codifica una variante alélica natural de un polipéptido que comprende la secuencia de aminoácidos representada por SEQ ID NO:8, donde la molécula de ácido nucleico hibrida, bajo condiciones estrictas, con una secuencia de DNA que comprende la secuencia mencionada en a) ,

o una secuencia complementaria a la misma.

2. Molécula de ácido nucleico aislada según la reivindicación 1, seleccionada del grupo que consiste en:

a) una molécula de ácido nucleico que comprende una secuencia que es idéntica a la secuencia representada por SEQ ID NO:7;

b) una molécula de ácido nucleico que comprende una secuencia que codifica un polipéptido cuya secuencia de aminoácidos es idéntica a la secuencia representada por SEQ ID NO:8.

3. Un polipéptido purificado con actividad de hidroxifenilacetaldehído deshidrogenasa cuya secuencia de aminoácidos es idéntica al menos en un 60% a la secuencia polipeptídica representada por SEQ ID NO:8.

4. Polipéptido purificado con actividad hidroxifenilacetaldehído deshidrogenasa según la reivindicación 3, cuya secuencia es idéntica a la secuencia polipeptídica representada por SEQ ID NO:8.

5. Un vector de expresión que comprende una molécula de ácido nucleico según una cualquiera de las reivindicaciones 1 ó 2.

6. Vector de expresión según la reivindicación 5, que es un plásmido.

7. Vector de expresión según la reivindicación 6, en el que el que la molécula de ácido nucleico de una cualquiera de las reivindicaciones 1 ó 2 está insertada en el plásmido pK18::mob.

8. Un organismo hospedador transformado con un vector de expresión según una cualquiera de las reivindicaciones 5 a

7.

9. Organismo hospedador según la reivindicación 8, que es una bacteria.

10. Organismo hospedador según la reivindicación 9, que es una bacteria capaz de transformar un azúcar en ácido láctico.

11. Organismo hospedador según una cualquiera de las reivindicaciones 8 a 10, en el que el vector de expresión de una cualquiera de las reivindicaciones 5 a 7 está insertado en el genoma del hospedador.

12. Organismo hospedador según una cualquiera de las reivindicaciones 8 a 10, en el que el vector de expresión de una cualquiera de las reivindicaciones 5 a 7 permanece como forma replicativa autónoma.

13. Una composición que comprende el polipéptido o complejo proteico de la reivindicación 3 ó 4.

37

Figura 1 Figura 2

39

Figura 3

Figura 4 Figura 5

Figura 6 Figura 7

Figura 8

Figura 9

1 TCAGGCGAAACGCTCGAAGCGGTACGGTGACGGGTCGATCAGCGGGGTGGCCTGGGCCACCAGGTCTGCCGCCAG AGTCCGCTTTGCGAGCTTCGCCATGCCACTGCCCAGCTAGTCGCCCCACCGGACCCGGTGGTCCAGACGGCGGTC -2 & A F R E F R Y P S P D I L P T A Q A V L D A A L

76 CTGGCCAGCAGCAGGCGAGGTGCCGAAGCCATGCCCGGAAAAGCCGGTGGCCAGGGTCAGGCCCGGAATACTGGC GACCGGTCGTCGTCCGCTCCACGGCTTCGGTACGGGCCTTTTCGGCCACCGGTCCCAGTCCGGGCCTTATGACCG -2 Q G A A P S T G F G H G S F G T A L T L G P I S A

151 CACCGGGCCGATGACCGGGTTGGAGTCGGGGGTGACGTCAATCGTGCCGGCCCAGGCGCTGGCGATACGGGCCTG GTGGCCCGGCTACTGGCCCAACCTCAGCCCCCACTGCAGTTAGCACGGCCGGGTCCGCGACCGCTATGCCCGGAC -2 V P G I V P N S D P T V D I T G A W A S A I R A Q

226 TTCGAACACCGGCCAGGCCGCTTTCAGGTTGCGCATGGCCTCGTCGTTGAGGGCCGGGTTGGCGTGCGGGTCTTG AAGCTTGTGGCCGGTCCGGCGAAAGTCCAACGCGTACCGGAGCAGCAACTCCCGGCCCAACCGCACGCCCAGAAC -2 E F V P W A A K L N R M A E D N L A P N A H P D Q

301 TACCCGTACACGCTCGAAGGGGGTTACATCCGTTGCCTTCCAGCGCCGGGCCAGGGCCAGGTCCTTGAAGAAGTA ATGGGCATGTGCGAGCTTCCCCCAATGTAGGCAACGGAAGGTCGCGGCCCGGTCCCGGTCCAGGAACTTCTTCAT -2 V R V R E F P T V D T A K W R R A L A L D K F F Y

376 CTTGCCAAAGCTGATGCGCAAAAAGTCCCGCTGGGCACGCAGCTGGGGCAGGTAACGCTTGCCCAGCAGCAGGTG GAACGGTTTCGACTACGCGTTTTTCAGGGCGACCCGTGCGTCGACCCCGTCCATTGCGAACGGGTCGTCGTCCAC -2 K G F S I R L F D R Q A R L Q P L Y R K G L L L H

451 ATCGAGGGTGAGGAAGGCGTCCAGCGCGCCGCGCTGGGTGATGATGTAGCCGCCGTCCTTGTGCTTGCGGAAGGA TAGCTCCCACTCCTTCCGCAGGTCGCGCGGCGCGACCCACTACTACATCGGCGGCAGGAACACGAACGCCTTCCT -2 D L T L F A D L A G R Q T I I Y G G D K H K R F S

526 AAAATCTGGTGCGCCCACGGCGATGTCGGTTGGCCCGTCCATGGGCTCTGTGCGCAGCACGGAACAGGTCAGCGG TTTTAGACCACGCGGGTGCCGCTACAGCCAACCGGGCAGGTACCCGAGACACGCGTCGTGCCTTGTCCAGTCGCC -2 F D P A G V A I D T P G D M P E T R L V S C T L P

601 CAAGGTCGGCAGGTTGATGCCCAGGTTGCCGAGGAACTTGCGCGACCACAGGCCACCGGCCAGCAACACCTGGTC GTTCCAGCCGTCCAACTACGGGTCCAACGGCTCCTTGAACGCGCTGGTGTCCGGTGGCCGGTCGTTGTGGACCAG -2 L T P L N I G L N G L F K R S W L G G A L L V Q D

676 GCAGCGGATTTCACCTTGCTCGGTGACCACCCCGCTGACACGGCCGGCTGCGGTGACCAGCGTGCGCACCGCGCA CGTCGCCTAAAGTGGAACGAGCCACTGGTGGGGCGACTGTGCCGGCCGACGCCACTGGTCGCACGCGTGGCGCGT -2 C R I E G Q E T V V G S V R G A A T V L T R V A C

751 GTTCTCCACTACCACTGCACCTTTGGCGATCGCCGCCCGGGCGATGGCGCTGGCGGCCAGGGTCGGTTCGGCGCG CAAGAGGTGATGGTGACGTGGAAACCGCTAGCGGCGGGCCCGCTACCGCGACCGCCGGTCCCAGCCAAGCCGCGC -2 N E V V V A G K A I A A R A I A S A A L T P E A R

826 GGCGTCGGAGGGGGTGAAGATGCCACCTGCCCAATCCGCCCGACCACCCGGCACCATCCGGGTGATTTCCCGCGT CCGCAGCCTCCCCCACTTCTACGGTGGACGGGTTAGGCGGGCTGGTGGGCCGTGGTAGGCCCACTAAAGGGCGCA -2 A D S P T F I G G A W D A R G G P V M R T I E R T

901 GCTCAGCAGGCGCGAATCCAGGCCCAGCGCCTCGACGCTTTTCAGCCAGCCTTCATGCATGCCCATCTGCGTGTC CGAGTCGTCCGCGCTTAGGTCCGGGTCGCGGAGCTGCGAAAAGTCGGTCGGAAGTACGTACGGGTAGACGCACAG -2 S L L R S D L G L A E V S K L W G E H M G M Q T D

976 GTTACGGCCGATGAACATGATGCCGGCTTGCCGATAGCCAACGTCGCTGCCAACCCGTGCGGGCATCTCGGCCCA CAATGCCGGCTACTTGTACTACGGCCGAACGGCTATCGGTTGCAGCGACGGTTGGGCACGCCCGTAGAGCCGGGT -2 N R G I F M I G A Q R Y G V D S G V R A P M E A W

1051 CAGCCGATCAGCCGCCAGTGCCAGGGGAATGTCATGGGCGTGGCGGTTGGTCTTGCGCACCCAGCCCAGGTTGCG GTCGGCTAGTCGGCGGTCACGGTCCCCTTACAGTACCCGCACCGCCAACCAGAACGCGTGGGTCGGGTCCAACGC -2 L R D A A L A L P I D H A H R N T K R V W G L N R

1126 CGACGACTGCTCCCCAGCGATGCGCCCCTTCTCCAGCACCACCACCGGTATGTTGCGTTCGGCGAGGCTCAGTGC GCTGCTGACGAGGGGTCGCTACGCGGGGAAGAGGTCGTGGTGGTGGCCATACAACGCAAGCCGCTCCGAGTCACG -2 S S Q E G A I R G K E L V V V P I N R E A L S L A

1201 GGCGGTGAGGCCGATAATGCCGCCACCGATGATCACCACGGTAGTGGCGTCGGGGTGGCGGGTGCTGGTTTGCAC CCGCCACTCCGGCTATTACGGCGGTGGCTACTAGTGGTGCCATCACCGCAGCCCCACCGCCCACGACCAAACGTG -2 A T L G I I G G G I I V V T T A D P H R T S T Q V

1276 AGGGGCGATCGTGGGAGACATGGCTTTACTCTTTGTTGTGCGTGCAGGGGGAGTGTTCAGCGCCAGCCAGCAGCC TCCCCGCTAGCACCCTCTGTACCGAAATGAGAAACAACACGCACGTCCCCCTCACAAGTCGCGGTCGGTCGTCGG -2 P A I T P S M

tynA

1351 TCACTGGCCAAGGCGGATCAGGGTCACTTGCGCTTGCCCCGCACCGCGGTAGGCGGTGACCTCCAGCTCGACCTT AGTGACCGGTTCCGCCTAGTCCCAGTGAACGCGAACGGGGCGTGGCGCCATCCGCCACTGGAGGTCGAGCTGGAA

1426 GTAAACGGTGGAGCCCAGCGGCGGGCAGGTGACCGTGGTGGCCGGGTCGATGCCGCGGAACTTCTCGCCGATCAC CATTTGCCACCTCGGGTCGCCGCCCGTCCACTGGCACCACCGGCCCAGCTACGGCGCCTTGAAGAGCGGCTAGTG

1501 GTCCATGACCCGTGGTACATCGGCAGGGTCCTGGATGAACACGCGCGAGTTGATGACATCGGCCAGGCTGGCATC CAGGTACTGGGCACCATGTAGCCGTCCCAGGACCTACTTGTGCGCGCTCAACTACTGTAGCCGGTCCGACCGTAG

1576 GACTGCGGCCAGCGCGGTTTCGATGTTGGCGAACACCTGGTGGGTCTGTTCGATGACGTCCTCTGGAATGACCTG CTGACGCCGGTCGCGCCAAAGCTACAACCGCTTGTGGACCACCCAGACAAGCTACTGCAGGAGACCTTACTGGAC

1651 GGTCTGCGGGTTGCGTCCGGCGGTGTTGGAGACGTGAATCCAGTTGTCCACCGCCACCAGGCGGGAGTAGCTGGC CCAGACGCCCAACGCAGGCCGCCACAACCTCTGCACTTAGGTCAACAGGTGGCGGTGGTCCGCCCTCATCGACCG

1726 CATGGCTTCGAACTTGGAGCCGGTTTTCAGTTTGATGATCTGTGTCATGGGCTTTGCCTTGTTATCCGGTTGCGG GTACCGAAGCTTGAACCTCGGCCAAAAGTCAAACTACTAGACACAGTACCCGAAACGGAACAATAGGCCAACGCC

1801 GGATCAGCTGAGAACGGGGGTTTCCCAGAGGTTGAGCTTTACGCCGATGCCTTGCTCGAGCGCCTTGCGGTACAC CCTAGTCGACTCTTGCCCCCAAAGGGTCTCCAACTCGAAATGCGGCTACGGAACGAGCTCGCGGAACGCCATGTG -2 & S L V P T E W L N L K V G I G Q E L A K R Y V

1876 CACGGTGCCCCAGGCCACGTCTTCGACGGGCATGCCGCCCACCGACATCAGGATGATTTCGTCGTCATGCAGGCG GTGCCACGGGGTCCGGTGCAGAAGCTGCCCGTACGGCGGGTGGCTGTAGTCCTACTAAAGCAGCAGTACGTCCGC -2 V T G W A V D E V P M G G V S M L I I E D D H L R

1951 GCCCGGTGCGTCGCCGCTGATGATCTTGCCGATGTCTTCCACCTGCTCGGCGGCCAGCGTGCCTTCGGCAATCAT CGGGCCACGCAGCGGCGACTACTAGAACGGCTACAGAAGGTGGACGAGCCGCCGGTCGCACGGAAGCCGTTAGTA -2 G P A D G S I I K G I D E V Q E A A L T G E A I M

2026 GTCCATGAAGCGCACACCTACCAGCGGTACGTGGTTGTGCGCAGGCTTGGGCAGCTCTTCGAACCAGGCCTCGTA CAGGTACTTCGCGTGTGGATGGTCGCCATGCACCAACACGCGTCCGAACCCGTCGAGAAGCTTGGTCCGGAGCAT -2 D M F R V G V L P V H N H A P K P L E E F W A E Y 2101 GAGGCCGGTGTTGTCCACCACCTTGCGCACGTCGTCCTGCTCCATGCCGGCGTCGATACTGCACGGGGCTGGCAT CTCCGGCCACAACAGGTGGTGGAACGCGTGCAGCAGGACGAGGTACGGCCGCAGCTATGACGTGCCCCGACCGTA -2 L G T N D V V K R V D D Q E M G A D I S C P A P M

2176 GGCCAGGAACGCGCCAGGCTTGACCCACTCGCGGCGCACCAGCGGGTACTGGCTGGGGTCGCCGACTTCGCCCGA CCGGTCCTTGCGCGGTCCGAACTGGGTGAGCGCCGCGTGGTCGCCCATGACCGACCCCAGCGGCTGAAGCGGGCT -2 A L F A G P K V W E R R V L P Y Q S P D G V E G S

2251 GCTGCAGTAGCTGACCAGGTCGGAACCGCGTACCACTTCTTCCAGGGTTTCCACCACCTGGACATGAGTGATTTG CGACGTCATCGACTGGTCCAGCCTTGGCGCATGGTGAAGAAGGTCCCAAAGGTGGTGGACCTGTACTCACTAAAC -2 S C Y S V L D S G R V V E E L T E V V Q V H T I Q

2326 CGGGAAGCTGGTTTTCACCCAGGCGACGAAGGCATCCAGGTTCTTCTGGCCACGGCCCTTGACCTTGAGGGTGTC GCCCTTCGACCAAAAGTGGGTCCGCTGCTTCCGTAGGTCCAAGAAGACCGGTGCCGGGAACTGGAACTCCCACAG -2 P F S T K V W A V F A D L N K Q G R G K V K L T D

2401 GATCAGCGGGCAGACGGCCATGAACGCAGCGACCGTGGTCTTGCCCATCACCCCCGGGCCGGCCAGGCCGATCAC CTAGTCGCCCGTCTGCCGGTACTTGCGTCGCTGGCACCAGAACGGGTAGTGGGGGCCCGGCCGGTCCGGCTAGTG -2 I L P C V A M F A A V T T K G M V G P G A L G I V

2476 CTTGGCGTCCTTGCGCGCCAGGTGGCGGGCGCCGACGCCCGGGATGGCGCCGGTGCGGTAGGCCGACAGCAGGTT GAACCGCAGGAACGCGCGGTCCACCGCCCGCGGCTGCGGGCCCTACCGCGGCCACGCCATCCGGCTGTCGTCCAA -2 K A D K R A L H R A G V G P I A G T R Y A S L L N

2551 GGCCGACATGTGTGCCAGTGGCGCGCCGGTGTCGGCATCGTTGAGGGTGAACATCAGGATCGAGCGGGGCAGGCC CCGGCTGTACACACGGTCACCGCGCGGCCACAGCCGTAGCAACTCCCACTTGTAGTCCTAGCTCGCCCCGTCCGG -2 A S M H A L P A G T D A D N L T F M L I S R P L G

2626 TTTCTCACGGTTGGCGATGTTCGAGCCGTACCACTTGGCGCCTGCGGTCTGGAAGTTGCCGCCGAGGTACGCCGG AAAGAGTGCCAACCGCTACAAGCTCGGCATGGTGAACCGCGGACGCCAGACCTTCAACGGCGGCTCCATGCGGCC -2 K E R N A I N S G Y W K A G A T Q F N G G L Y A P

2701 CATCGCCATCATGCGCCGGTCGGCGGTGGGCTTGGGCATGTTGGGGAATGGCGAGTGCTCGGGGAAGGTAATCAT GTAGCGGTAGTACGCGGCCAGCCGCCACCCGAACCCGTACAACCCCTTACCGCTCACGAGCCCCTTCCATTAGTA -2 M A M M R R D A T P K P M N P F P S H E P F T I M

2776 CGCGCCGTGCGAGTCGCTGTTCGGGCCGGCCATGCGGTAGTCACCCTGGTACAGCAGGCCGAACATTTCTTCCAT GCGCGGCACGCTCAGCGACAAGCCCGGCCGGTACGCCATCAGTGGGACCATGTCGTCCGGCTTGTAAAGAAGGTA -2 A G H S D S N P G A M R Y D G Q Y L L G F M E E M

2851 GGTGTCGACACAGGCCGGCATGTCGGTGACGCCGGCACGGATCATGTCCTGCTCGGACAGGTAGATGAAGTCAAT CCACAGCTGTGTCCGGCCGTACAGCCACTGCGGCCGTGCCTAGTACAGGACGAGCCTGTCCATCTACTTCAGTTA -2 T D V C A P M D T V G A R I M D Q E S L Y I F D I

2926 TCTGGTATCGAGGGTCATGGCGGGTCTCGCAGGGCTGGCTGCCGTCGGATTTGTTGTTGGTTTCGAGGCAACCAG AGACCATAGCTCCCAGTACCGCCCAGAGCGTCCCGACCGACGGCAGCCTAAACAACAACCAAAGCTCCGTTGGTC -2 R T D L T M

tynB

3001 TTTCGCTAACGACTGGTAGGTCGTCTTGTGTCTGCCTGCCAGCCGAGTTGACCGTCAGTGCCAGGGCTTCAATGG AAAGCGATTGCTGACCATCCAGCAGAACACAGACGGACGGTCGGCTCAACTGGCAGTCACGGTCCCGAAGTTACC -3 & H

3076 CCCGCGAGCGAGAAGCTGGCCGGGGTGTGGCGCAGGCTGAGGGCGGTCAGCAGGCACACCACCAGGGTGCACAGG GGGCGCTCGCTCTTCGACCGGCCCCACACCGCGTCCGACTCCCGCCAGTCGTCCGTGTGGTGGTCCCACGTGTCC -3 G A L S F S A P T H R L S L A T L L C V V L T C L

3151 GCCAGCAGCGCGGCCCATGCGGTCGGGCCGTGGTTGAGTACCACTGCGGCCAGCGGGGCGGCGCCGGCAGACGCC CGGTCGTCGCGCCGGGTACGCCAGCCCGGCACCAACTCATGGTGACGCCGGTCGCCCCGCCGCGGCCGTCTGCGG -3 A L L A A W A T P G H N L V V A A L P A A G A S A

3226 GACAGCTGGATGGCGCCCAGCAGCGCTGCGGTGGAACCCAGTGCCTTTTCTTGCGAGGCCATCACCAGCGACATC CTGTCGACCTACCGCGGGTCGTCGCGACGCCACCTTGGGTCACGGAAAAGAACGCTCCGGTAGTGGTCGCTGTAG -3 S L Q I A G L L A A T S G L A K E Q S A M V L S M

3301 AGCGTCGACTCGGCTATCCCCAGGCCGAACAGGGCTATCACCATGCCGCCGGCCACACCTGGCAGCCCCAGGCCG TCGCAGCTGAGCCGATAGGGGTCCGGCTTGTCCCGATAGTGGTACGGCGGCCGGTGTGGACCGTCGGGGTCCGGC -3 L T S E A I G L G F L A I V M G G A V G P L G L G

3376 GTCAGTGCACCGAGCAGGCTGATGCAGGCACCGCCGGCCATGCACAGCACGCCCACCCGAGTCAAGGTATTGAGG CAGTCACGTGGCTCGTCCGACTACGTCCGTGGCGGCCGGTACGTGTCGTGCGGGTGGGCTCAGTTCCATAACTCC -3 T L A G L L S I C A G G A M C L V G V R T L T N L

3451 CCCAGCCGGCTGATCAGGTGGCTGGCCGTCATGGCGCCGAGCAGGATCGACACCCCGGTGGCGCCAAACAGCAGG GGGTCGGCCGACTAGTCCACCGACCGGCAGTACCGCGGCTCGTCCTAGCTGTGGGGCCACCGCGGTTTGTCGTCC -3 G L R S I L H S A T M A G L L I S V G T A G F L L

3526 CCGAAGGCCTGGGCGCTCAGGCCGTAGTGGGCCTGGTACACCAGGGTGGCACCGCCGATGTAGGCGAACAGGAAG GGCTTCCGGACCCGCGAGTCCGGCATCACCCGGACCATGTGGTCCCACCGTGGCGGCTACATCCGCTTGTCCTTC -3 G F A Q A S L G Y H A Q Y V L T A G G I Y A F L F

3601 AAGAATACCGCAGCAACCGCCAGGGTCGGGCGCAGGAAGCGGCGGTCGGCGAGGATGGCCAGGTAGGTGCTGCAG TTCTTATGGCGTCGTTGGCGGTCCCAGCCCGCGTCCTTCGCCGCCAGCCGCTCCTACCGGTCCATCCACGACGTC -3 F F V A A V A L T P R L F R R D A L I A L Y T S C

3676 GCGTGGCCCAGGCGCAGGGGTTCGCGTTTGCTGGGCGGCAGGGTTTCGGGCAGGTTCAGCAGGCTGTTGACCAGC CGCACCGGGTCCGCGTCCCCAAGCGCAAACGACCCGCCGTCCCAAAGCCCGTCCAAGTCGTCCGACAACTGGTCG -3 A H G L R L P E R K S P P L T E P L N L L S N V L

3751 ACCGTCACGCCCATGCCGGCGAGTACCAGCATTACTGCACGCCAGCCGAAATGTGCGTCGATCACGCCGCCCAGG TGGCAGTGCGGGTACGGCCGCTCATGGTCGTAATGACGTGCGGTCGGCTTTACACGCAGCTAGTGCGGCGGGTCC -3 V T V G M G A L V L M V A R W G F H A D I V G G L

3826 GCAGGTGCCAGGATCGGTGCGACGCCTTCGATGGTCATCAGCAGGGCGAACAGTTTGGTCGCGGCCACGCCCTGG CGTCCACGGTCCTAGCCACGCTGCGGAAGCTACCAGTAGTCGTCCCGCTTGTCAAACCAGCGCCGGTGCGGGACC -3 A P A L I P A V G E I T M L L A F L K T A A V G Q

3901 CTCACATCACGCACCATGCTCATGATCACCACCAGGGTCAGCGCACTGCCCAGGCCCTGGAAAAAGCGCAGCATG GAGTGTAGTGCGTGGTACGAGTACTAGTGGTGGTCCCAGTCGCGTGACGGGTCCGGGACCTTTTTCGCGTCGTAC -3 S V D R V M S M I V V L T L A S G L G Q F F R L M

3976 ATCAGGGTGTCGAGGCTGGGGGCTGCGGCTGCGCCCAGCGAGCACAGGATGAACAGCAGCAGGCCGGCCAGCAGC TAGTCCCACAGCTCCGACCCCCGACGCCGACGCGGGTCGCTCGTGTCCTACTTGTCGTCGTCCGGCCGGTCGTCG -3 I L T D L S P A A A A G L S C L I F L L L G A L L

4051 GGCTTGCGCCGGCCATAAGCGTCGACGATGGGGCCGAAGATCAGCTGGCCGGCGCCCATGGCCAGCAGGAAGAAG CCGAACGCGGCCGGTATTCGCAGCTGCTACCCCGGCTTCTAGTCGACCGGCCGCGGGTACCGGTCGTCCTTCTTC -3 P K R R G Y A D V I P G F I L Q G A G M A L L F F

4126 GTCAGTGTCAGCTGTACGCGGGTGAAGCTAGCCTGATAGTGGCTGGCGATTTCCGGCAGGCTCGACAGGTACATG CAGTCACAGTCGACATGCGCCCACTTCGATCGGACTATCACCGACCGCTAAAGGCCGTCCGAGCTGTCCATGTAC -3 T L T L Q V R T F S A Q Y H S A I E P L S S L Y M

4201 TCGACGGCGGAAGGGCCGAGGGCGCCGATCAGGCCTAGGCCCAGGGCGAAGCTGAAGGGTATGGGAGGGGAGGGA AGCTGCCGCCTTCCCGGCTCCCGCGGCTAGTCCGGATCCGGGTCCCGCTTCGACTTCCCATACCCTCCCCTCCCT -3 D V A S P G L A G I L G L G L A F S F P I P P S P

4276 TTGGCTTGCATGGTTTTCTCTGGCTGATTTTTCGCCTACCGACCGGTAGGTTTGCGAATATTATTCGCCGAGTCG AACCGAACGTACCAAAAGAGACCGACTAAAAAGCGGATGGCTGGCCATCCAAACGCTTATAATAAGCGGCTCAGC -3 N A Q M

tynF

4351 GCCAAGGTCAAACCCTTCCGCAAGGCCACTGATTCCTGTGGGGAGCGGGCATGCCCGCGAACACCGGCAAAGCCG CGGTTCCAGTTTGGGAAGGCGTTCCGGTGACTAAGGACACCCCTCGCCCGTACGGGCGCTTGTGGCCGTTTCGGC

4426 GTGCCACCGAGTCGCCTTCTTCGCGGGCATGCCCGCTCCCACATTGACCGCAGAGGTTGGTTACCGTGGTTGCGT CACGGTGGCTCAGCGGAAGAAGCGCCCGTACGGGCGAGGGTGTAACTGGCGTCTCCAACCAATGGCACCAACGCA

4501 CAGAACGGCACAGCCACGGTCAGCTGGCTATACACATTGGTACCATTCCCGCCCACCTGGTTGCCGCCGTTGCTC GTCTTGCCGTGTCGGTGCCAGTCGACCGATATGTGTAACCATGGTAAGGGCGGGTGGACCAACGGCGGCAACGAG -3 & F P V A V T L Q S Y V N T G N G G V Q N G G N S

4576 TCGTCCTTGCGCGGCTGGTAAAGGCCCACCAGCGGGCTGATTATCAGGTGCTCGTTGACTGCCCATTCCACATAC AGCAGGAACGCGCCGACCATTTCCGGGTGGTCGCCCGACTAATAGTCCACGAGCAACTGACGGGTAAGGTGTATG -3 E D K R P Q Y L G V L P S I I L H E N V A W E V Y

4651 AGGTCCAGCTCCCGCGCATCGAGGTTGAGGCTTTCGCGGGTGCGTACGGTGTCGAAGTCGAAGTACAGCGCCCCG TCCAGGTCGAGGGCGCGTAGCTCCAACTCCGAAAGCGCCCACGCATGCCACAGCTTCAGCTTCATGTCGCGGGGC -3 L D L E R A D L N L S E R T R V T D F D F Y L A G

4726 ACTGTGAGATTTTCCAGCGGTGTCGCCTTCACGCCCACATGGTGGATACCCGTGTTGCTGTTGAAGGGGCCGGCG TGACACTCTAAAAGGTCGCCACAGCGGAAGTGCGGGTGTACCACCTATGGGCACAACGACAACTTCCCCGGCCGC -3 V T L N E L P T A K V G V H H I G T N S N F P G A

4801 TAGTTGGCAGCGACTTCACCCTGGAACCAGGTGCCGTAACCGCTGGACAGGCCGCTGAACAGCGCGTCCCAGCCT ATCAACCGTCGCTGAAGTGGGACCTTGGTCCACGGCATTGGCGACCTGTCCGGCGACTTGTCGCGCAGGGTCGGA -3 Y N A A V E G Q F W T G Y G S S L G S F L A D W G

4876 GCCGAGTAGCGGGTGTAGCGGTAGGTAACCTGCGGTGCCCACGGCAGGTCGGCGAAGGTGTAGCCGGCCTGCAGG CGGCTCATCGCCCACATCGCCATCCATTGGACGCCACGGGTGCCGTCCAGCCGCTTCCACATCGGCCGGACGTCC -3 A S Y R T Y R Y T V Q P A W P L D A F T Y G A Q L

4951 TACCAGGCTTGCTCGGGGCCGTCGGTCTTGTCCTGCCAGGCGTATTCGAAGGCGAAACTGGCATTGTCGATGCCA ATGGTCCGAACGAGCCCCGGCAGCCAGAACAGGACGGTCCGCATAAGCTTCCGCTTTGACCGTAACAGCTACGGT -3 Y W A Q E P G D T K D Q W A Y E F A F S A N D I G

5026 GCGTTGCCTTCGCCGCGCACGCTATACACGTCCATGCCTTCGCGGGCTTTCTGAAAGTCGCTGGCCCATTGGTCG CGCAACGGAAGCGGCGCGTGCGATATGTGCAGGTACGGAAGCGCCCGAAAGACTTTCAGCGACCGGGTAACCAGC -3 A N G E G R V S Y V D M G E R A K Q F D S A W Q D

5101 GTGACGTCGATGCCGTGAATCCAGGTCAGCCCGAGGGTGCCCAAGGCTTGGGTGTAGTCCAGCGTGCCGGCGGCC CACTGCAGCTACGGCACTTAGGTCCAGTCGGGCTCCCACGGGTTCCGAACCCACATCAGGTCGCACGGCCGCCGG -3 T V D I G H I W T L G L T G L A Q T Y D L T G A A

5176 AGTTCGGTTTCGGCCTGGGCGCGGTTGTCGGATTTCAGCCACAGCAGGCTGCCATGCAGGCCATCGCTGCCCCCC TCAAGCCAAAGCCGGACCCGCGCCAACAGCCTAAAGTCGGTGTCGTCCGACGGTACGTCCGGTAGCGACGGGGGG -3 L E T E A Q A R N D S K L W L L S G H L G D S G G

5251 AGGCGCAGCATTGCGGTGCGGTCGAAGGCGTGGCGGGCGGCCAGGTAGTAGGCCCCGCCGCGGTCCAGCGCACCG TCCGCGTCGTAACGCCACGCCAGCTTCCGCACCGCCCGCCGGTCCATCATCCGGGGCGGCGCCAGGTCGCGTGGC -3 L R L M A T R D F A H R A A L Y Y A G G R D L A G

5326 TCGGCGACGCCGTTGCCCAGGTTCGGGCCGTCGTCGTTGATCAAAAAACCACTGCCCAGGCGAATGGTCTGGCGG AGCCGCTGCGGCAACGGGTCCAAGCCCGGCAGCAGCAACTAGTTTTTTGGTGACGGGTCCGCTTACCAGACCGCC -3 D A V G N G L N P G D D N I L F G S G L R I T Q R

5401 CCGGCGGAAACGTCCACTCCATCCTTGCCCAGCACCGGGAACAGGTCGGCCGAGCGCCAGCCGAGGAAGGCGTCT GGCCGCCTTTGCAGGTGAGGTAGGAACGGGTCGTGGCCCTTGTCCAGCCGGCTCGCGGTCGGCTCCTTCCGCAGA -3 G A S V D V G D K G L V P F L D A S R W G L F A D

5476 TCGATCTTGGTGGTGCGTTCGGAGCCATCGGTGTTGCCGGCCGCATCGCCATCGCCCCAGGTGGCCGAGCTCACC

AGCTAGAACCACCACGCAAGCCTCGGTAGCCACAACGGCCGGCGTAGCGGTAGCGGGGTCCACCGGCTCGAGTGG -3 E I K T T R E S G D T N G A A D G D G W T A S S V

5551 CAGTTCAGGCTGCCGTACAGCGTGCCGTTGCCGGCCAGGCCCTGGTCACCGCTGAGGCCATACTTGATAAAGCCT GTCAAGTCCGACGGCATGTCGCACGGCAACGGCCGGTCCGGGACCAGTGGCGACTCCGGTATGAACTATTTCGGA -3 W N L S G Y L T G N G A L G Q D G S L G Y K I F G

5626 TCACGCCAGGTCGAACCCCCTGTGGTGCCGTCGTAGTTCTTGCGGCTGTTGAACATGCCCCATACCGCCAGCATG AGTGCGGTCCAGCTTGGGGGACACCACGGCAGCATCAAGAACGCCGACAACTTGTACGGGGTATGGCGGTCGTAC -3 E R W T S G G T T G D Y N K R S N F M G W V A L M

5701 TCGGCGTTCAGGTGGCTGTCATCGTCGGCGTACAGCTCAACGGCCGGCGCGGCCTGGCTGGCCAGCAAGGTTGCC AGCCGCAAGTCCACCGACAGTAGCAGCCGCATGTCGAGTTGCCGGCCGCGCCGGACCGACCGGTCGTTCCAACGG -3 D A N L H S D D D A Y L E V A P A A Q S A L L T A

5776 AGGGCCAGGCTGGACAGCGTCTGTGGTTTGACCATTTGCACATCCCTCGTTTGTTCTCGGCCACCTTCACAGGGG TCCCGGTCCGACCTGTCGCAGACACCAAACTGGTAAACGTGTAGGGAGCAAACAAGAGCCGGTGGAAGTGTCCCC -3 L A L S S L T Q P K V M

tynE

5851 CCTTTGTTGTTCGGGGGCACCCTCGGTTCTGGCGAGGGGCCATCGCGGTTGGCGGCGATGGCCTATTAGGGCGTG GGAAACAACAAGCCCCCGTGGGAGCCAAGACCGCTCCCCGGTAGCGCCAACCGCCGCTACCGGATAATCCCGCAC

5926 TGCGGTGGGGCGGGGTCTTGTTCGTGGCTGCCAAGGCGCTTGCACGCCTTGGCCACAGGCGCGGTCAGTAGCGGA ACGCCACCCCGCCCCAGAACAAGCACCGACGGTTCCGCGAACGTGCGGAACCGGTGTCCGCGCCAGTCATCGCCT -1 & Y R I

6001 TCATCACCGACTTGAGCTCGGTGAAGTCATCGATGAAGGCCGAGCCGAACTCGCGGCCAATGCCGGAAGCCTTGA AGTAGTGGCTGAACTCGAGCCACTTCAGTAGCTACTTCCGGCTCGGCTTGAGCGCCGGTTACGGCCTTCGGAACT -1 M V S K L E T F D D I F A S G F E R G I G S A K I

6076 TGCCCCCAAACGGTACAGCCGGGTCGAGCAGGGTGTGCATGTTGACCCACAGGGTACCGGCCTGGATTTGCGGGA ACGGGGGTTTGCCATGTCGGCCCAGCTCGTCCCACACGTACAACTGGGTGTCCCATGGCCGGACCTAAACGCCCT -1 G G F P V A P D L L T H M N V W L T G A Q I Q P I

6151 TCATGCGCATGGCCTTGCCCAGGTCGTTGGTCCACAGGCTGGCGCTGAGGCCGTAGGGCGAGGCGTTCATCAGGT AGTACGCGTACCGGAACGGGTCCAGCAACCAGGTGTCCGACCGCGACTCCGGCATCCCGCTCCGCAAGTAGTCCA -1 M R M A K G L D N T W L S A S L G Y P S A N M L H

6226 GCAGCAGTTCGTCTTCGTCGTCATAAGGCAGGAAGGTCGCCACAGGGCCGAAGGTTTCCTGGGTGAGCAGGGTGT CGTCGTCAAGCAGAAGCAGCAGTATTCCGTCCTTCCAGCGGTGTCCCGGCTTCCAAAGGACCCACTCGTCCCACA -1 L L E D E D D Y P L F T A V P G F T E Q T L L T D

6301 CGCAGGCTGACCGGGCGAGGATTACCGTGGGTTCGACGAAACAGCCGGGGCCGTCGCCCAGGGTGCCGCCGTGAA GCGTCCGACTGGCCCGCTCCTAATGGCACCCAAGCTGCTTTGTCGGCCCCGGCAGCGGGTCCCACGGCGGCACTT -1 C A S R A L I V T P E V F C G P G D G L T G G H I

6376 TGATCTGGCTGCCTTCGGCGCGGGCGATGGCGAACAGTTCGGCCAGCTTCTGCTGGTGCGGCTTGTTGGCCACGG ACTAGACCGACGGAAGCCGCGCCCGCTACCGCTTGTCAAGCCGGTCGAAGACGACCACGCCGAACAACCGGTGCC -1 I Q S G E A R A I A F L E A L K Q Q H P K N A V P

6451 GGCCGAACTGGGTGGCCTCGTCCAGTGGCGAGCCGATTTTCAGTTGGCCCAGGCGCTGGGACAGGGCGTCCAGCA CCGGCTTGACCCACCGGAGCAGGTCACCGCTCGGCTAAAAGTCAACCGGGTCCGCGACCCTGTCCCGCAGGTCGT -1 G F Q T A E D L P S G I K L Q G L R Q S L A D L L

6526 GCGGGTCGATGCGCGAGCGGTGCACATAGAAGCGCTCGCCCGCGGCGCAGATTTGCCCCGAGTGCAGGAAGCCGG CGCCCAGCTACGCGCTCGCCACGTGTATCTTCGCGAGCGGGCGCCGCGTCTAAACGGGGCTCACGTCCTTCGGCC -1 P D I R S R H V Y F R E G A A C I Q G S H L F G A

6601 CCTCGATGATGCCGTCCACAGCCTTGTCGGTTGCCACGTCGGGCAGGAAGGCCACCGCGTTCTTGCCGCCCAGTT GGAGCTACTACGGCAGGTGTCGGAACAGCCAACGGTGCAGCCCGTCCTTCCGGTGGCGCAAGAACGGCGGGTCAA -1 E I I G D V A K D T A V D P L F A V A N K G G L E

6676 CCAGTGTCGCACGGGTCAGCTTGGCGCCCATGGCAGCCTGGCCTACGGCGATGCCAGTGGGCACGGAGCCGGTGA GGTCACAGCGTGCCCAGTCGAACCGCGGGTACCGTCGGACCGGATGCCGCTACGGTCACCCGTGCCTCGGCCACT -1 L T A R T L K A G M A A Q G V A I G T P V S G T F

6751 ACGAGACCTTGTCGGTACCTGCGTGCTCGATCAGTGCCTTGCCCACCAGGCCACCACCGGTCAGCACGTTCAGTG TGCTCTGGAACAGCCATGGACGCACGAGCTAGTCACGGAACGGGTGGTCCGGTGGTGGCCAGTCGTGCAAGTCAC -1 S V K D T G A H E I L A K G V L G G G T L V N L A

6826 CACCGGCCGGCAGGCCTGCTTCGGTGGCCAGTTCGGCAATGCGCAGCAGCGTCAGCGGGGTGAATTCGCTGGGCT GTGGCCGGCCGTCCGGACGAAGCCACCGGTCAAGCCGTTACGCGTCGTCGCAGTCGCCCCACTTAAGCGACCCGA -1 G A P L G A E T A L E A I R L L T L P T F E S P K 6901 TGAGGATAATGCTGCAGCCGGTTGTCAGGGCCGAGGCCAGCTTCCAGATGGCGATCATGCTGGCGAAGTTCCACG ACTCCTATTACGACGTCGGCCAACAGTCCCGGCTCCGGTCGAAGGTCTACCGCTAGTACGACCGCTTCAAGGTGC -1 L I I S C G T T L A S A L K W I A I M S A F N W P

6976 GCACGATGCCCACCACCACGCCAATCGGCTCGCGCAGGGTGAAGGCGCTGTAGCGCTCACCGGCGAACGAGGGCA CGTGCTACGGGTGGTGGTGCGGTTAGCCGAGCGCGTCCCACTTCCGCGACATCGCGAGTGGCCGCTTGCTCCCGT -1 V I G V V V G I P E R L T F A S Y R E G A F S P L

7051 GCGACGGGGTGATGGTCTGGCCGGTGATCTTGGTCGCCCAGCCGGCGTAGTAGCGCAGGAAGTGCGCGGCCTGCT CGCTGCCCCACTACCAGACCGGCCACTAGAACCAGCGGGTCGGCCGCATCATCGCGTCCTTCACGCGCCGGACGA -1 S P T I T Q G T I K T A W G A Y Y R L F H A A Q Q

7126 GTACTTCGAACGCACGGGAAATGCCGATGAGCTTGCCGGATTGCAAGGTTTCCAGCTGCGCCAGTTCTTCGCGGT CATGAAGCTTGCGTGCCCTTTACGGCTACTCGAACGGCCTAACGTTCCAAAGGTCGACGCGGTCAAGAAGCGCCA -1 V E F A R S I G I L K G S Q L T E L Q A L E E R N

7201 TGGCTTCCAGCAGGTCGGCCAGCTTGAACAGCACTGCGGCGCGGGCGGCGGGGCTGGTGTGCGACCAGGCGGTAA ACCGAAGGTCGTCCAGCCGGTCGAACTTGTCGTGACGCCGCGCCCGCCGCCCCGACCACACGCTGGTCCGCCATT -1 A E L L D A L K F L V A A R A A P S T H S W A T F

7276 AGCCTTGGCGCGAGGAGCTGACGGCATGGTCGACATCGGCCTGGTTGGCGTCGGCGATGTGGGCGATGGTCTGGC TCGGAACCGCGCTCCTCGACTGCCGTACCAGCTGTAGCCGGACCAACCGCAGCCGCTACACCCGCTACCAGACCG -1 G Q R S S S V A H D V D A Q N A D A I H A I T Q G

7351 CGTTGGCCGGGTTGACCACGGCAATGTTCGACGACGACTGGCTGGCGAGGTGCTGGCCGTGGATGAACACGCCAT GCAACCGGCCCAACTGGTGCCGTTACAAGCTGCTGCTGACCGACCGCTCCACGACCGGCACCTACTTGTGCGGTA -1 N A P N V V A I N S S S Q S A L H Q G H I F V G H

7426 GCTCGCGGGCCAGGAAGGCCGTGACGGCAGGTAGGAGGGTGATGTCGCTCATGCAGACTCCGGGGCAGTTGGCCA CGAGCGCCCGGTCCTTCCGGCACTGCCGTCCATCCTCCCACTACAGCGAGTACGTCTGAGGCCCCGTCAACCGGT -1 E R A L F A T V A P L L T I D S M

tynC

7501 AAGTTTGCAGCTTAATAAGCGGGGCAGTGCGGTGCTTGTGCCTGCGTGACAGGTGCATGACTGTGGCTGCCAACC TTCAAACGTCGAATTATTCGCCCCGTCACGCCACGAACACGGACGCACTGTCCACGTACTGACACCGACGGTTGG

7576 GCACTGGGTAAGCCTTGTGGGAGCGGCCTTGTGTCGCGATAGGGCCGCAGAGCGGCCCCGGCGATGTTGGCGGCG CGTGACCCATTCGGAACACCCTCGCCGGAACACAGCGCTATCCCGGCGTCTCGCCGGGGCCGCTACAACCGCCGC

7651 AAGCTGAAAATGCTGGGGCCGCTTCGCGCCCCTATCGCGACGCAAGGCCGCTCCCACAAAAAAAGCGAGCGTAGG TTCGACTTTTACGACCCCGGCGAAGCGCGGGGATAGCGCTGCGTTCCGGCGAGGGTGTTTTTTTCGCTCGCATCC

7726 CCGGGCTGATTGCTGGCAGGCAGCAACAAGCCCGGCGGCAGCCATCGGCAAGACGCCATGCCACCGGCAGCGCAC GGCCCGACTAACGACCGTCCGTCGTTGTTCGGGCCGCCGTCGGTAGCCGTTCTGCGGTACGGTGGCCGTCGCGTG

tynG

+3 T M S L N N K L T E H 7801 AGTAATCACTCGTTCAACGCCACAAAAACAAGCCGGGGCATACGATGTCACTCAATAACAAGCTCACCGAGCACC TCATTAGTGAGCAAGTTGCGGTGTTTTTGTTCGGCCCCGTATGCTACAGTGAGTTATTGTTCGAGTGGCTCGTGG

+3 L N R G T V G F P T A L A S T V G L I M A S P V I 7876 TCAACCGCGGCACTGTCGGTTTCCCCACCGCACTGGCCAGCACTGTCGGGCTGATCATGGCCAGCCCGGTGATCC AGTTGGCGCCGTGACAGCCAAAGGGGTGGCGTGACCGGTCGTGACAGCCCGACTAGTACCGGTCGGGCCACTAGG

+3 L T A T M G F G I G G S A F A V A M V I A A L M M 7951 TCACCGCGACCATGGGCTTTGGCATCGGCGGCAGCGCCTTCGCCGTGGCCATGGTCATCGCCGCACTGATGATGC AGTGGCGCTGGTACCCGAAACCGTAGCCGCCGTCGCGGAAGCGGCACCGGTACCAGTAGCGGCGTGACTACTACG +3 L A Q S T T F A E A A S I L P T T G S V Y D Y I N8026 TGGCGCAGTCCACCACCTTTGCCGAGGCTGCGTCGATCCTGCCGACCACGGGCTCGGTATACGACTACATCAACT ACCGCGTCAGGTGGTGGAAACGGCTCCGACGCAGCTAGGACGGCTGGTGCCCGAGCCATATGCTGATGTAGTTGA

+3 C G M G R F F A I T G T L S A Y L I V H V F A G T 8101 GTGGCATGGGCCGTTTCTTCGCCATTACCGGCACGCTGTCGGCCTACCTGATCGTGCATGTGTTCGCCGGTACCG CACCGTACCCGGCAAAGAAGCGGTAATGGCCGTGCGACAGCCGGATGGACTAGCACGTACACAAGCGGCCATGGC

+3 A E T I L S G V M A L V N F E H L N T L A E S A G 8176 CCGAAACCATCCTGTCGGGGGTGATGGCGCTGGTGAACTTCGAGCACCTCAATACCCTGGCGGAATCCGCCGGCG GGCTTTGGTAGGACAGCCCCCACTACCGCGACCACTTGAAGCTCGTGGAGTTATGGGACCGCCTTAGGCGGCCGC

+3 G S W L L G V C F V V A F A V L N A F G V S A F S 8251 GTTCGTGGCTGCTGGGGGTGTGCTTCGTGGTGGCGTTTGCGGTGCTCAATGCCTTTGGCGTCAGCGCCTTCAGCC CAAGCACCGACGACCCCCACACGAAGCACCACCGCAAACGCCACGAGTTACGGAAACCGCAGTCGCGGAAGTCGG

+3 R A E V V L T F G M W T T L M V F G V L G L I A A 8326 GCGCGGAAGTGGTCCTCACCTTCGGCATGTGGACCACCTTGATGGTGTTCGGCGTGCTTGGCCTGATCGCCGCAC CGCGCCTTCACCAGGAGTGGAAGCCGTACACCTGGTGGAACTACCACAAGCCGCACGAACCGGACTAGCGGCGTG

+3 P A V E L D G P F G V S L V G T D L M T I L S L V 8401 CCGCAGTGGAACTGGACGGCCCGTTCGGCGTGTCGCTGGTGGGCACCGACCTGATGACCATCCTCTCGCTGGTCG GGCGTCACCTTGACCTGCCGGGCAAGCCGCACAGCGACCACCCGTGGCTGGACTACTGGTAGGAGAGCGACCAGC

+3 G M A M F M F V G C E F V T P L A P E L R R S A W 8476 GCATGGCCATGTTCATGTTCGTTGGCTGCGAGTTCGTCACGCCGCTTGCCCCCGAACTGCGTCGCTCGGCCTGGG CGTACCGGTACAAGTACAAGCAACCGACGCTCAAGCAGTGCGGCGAACGGGGGCTTGACGCAGCGAGCCGGACCC

+3 V L P R A M A L G L F G V A S C M F I Y G A A M K 8551 TGCTGCCGCGGGCCATGGCGCTGGGCCTGTTTGGCGTGGCCAGCTGCATGTTCATCTACGGAGCGGCGATGAAGC ACGACGGCGCCCGGTACCGCGACCCGGACAAACCGCACCGGTCGACGTACAAGTAGATGCCTCGCCGCTACTTCG

+3 R Q V E N V V L D A A S G V H L L D T P M A I P R8626 GCCAGGTGGAAAACGTGGTGCTGGATGCCGCCAGTGGCGTGCACCTGCTGGACACGCCCATGGCCATCCCGCGCT CGGTCCACCTTTTGCACCACGACCTACGGCGGTCACCGCACGTGGACGACCTGTGCGGGTACCGGTAGGGCGCGA

+3 F A E Q V M G D I G P V W L G I G F L F A G A A T8701 TCGCCGAGCAGGTGATGGGTGATATTGGCCCAGTGTGGCTGGGTATCGGCTTCCTGTTCGCCGGCGCGGCCACCA AGCGGCTCGTCCACTACCCACTATAACCGGGTCACACCGACCCATAGCCGAAGGACAAGCGGCCGCGCCGGTGGT

+3 I N T L M A G V P R I L Y G M A V D G A L P K V F 8776 TCAACACGCTGATGGCCGGTGTGCCACGCATTCTTTACGGCATGGCGGTGGACGGCGCGTTGCCCAAGGTGTTCA AGTTGTGCGACTACCGGCCACACGGTGCGTAAGAAATGCCGTACCGCCACCTGCCGCGCAACGGGTTCCACAAGT

+3 T Y L H P R F K T P L L C I L V V A L I P C L H A 8851 CCTACCTGCACCCGCGCTTCAAGACGCCGCTGCTGTGCATCCTGGTGGTGGCGTTGATCCCTTGCCTGCATGCCT GGATGGACGTGGGCGCGAAGTTCTGCGGCGACGACACGTAGGACCACCACCGCAACTAGGGAACGGACGTACGGA

+3 W Y L G G N P D N I L H L V L A A V C A W S T A Y 8926 GGTACCTGGGCGGCAACCCGGACAACATCCTGCACCTGGTGCTGGCCGCCGTGTGCGCCTGGAGCACCGCCTACC CCATGGACCCGCCGTTGGGCCTGTTGTAGGACGTGGACCACGACCGGCGGCACACGCGGACCTCGTGGCGGATGG

+3 L L V T L S V V I L R I R R P D L P R A Y R S P L 9001 TGCTGGTGACCCTGTCGGTGGTGATATTGCGCATCCGCCGCCCAGACCTGCCGCGTGCCTACCGCTCGCCGCTGT ACGACCACTGGGACAGCCACCACTATAACGCGTAGGCGGCGGGTCTGGACGGCGCACGGATGGCGAGCGGCGACA

+3 F P L P Q I F S S S G I L I G M A F I T P P G M N 9076 TCCCGTTGCCGCAGATATTCTCCAGTAGCGGTATCCTCATCGGCATGGCGTTCATCACACCGCCGGGCATGAACC AGGGCAACGGCGTCTATAAGAGGTCATCGCCATAGGAGTAGCCGTACCGCAAGTAGTGTGGCGGCCCGTACTTGG

+3 P A D V Y V P F A I M L G A T A A Y A L F W T L W 9151 CTGCCGATGTCTACGTGCCGTTCGCCATCATGCTTGGCGCCACTGCGGCCTATGCATTGTTCTGGACGCTGTGGG GACGGCTACAGATGCACGGCAAGCGGTAGTACGAACCGCGGTGACGCCGGATACGTAACAAGACCTGCGACACCC

+3 V Q K V N P F K P A R V E D V L E K E F A A E P G 9226 TGCAGAAGGTCAACCCGTTCAAGCCGGCGCGGGTCGAGGATGTGCTCGAGAAAGAGTTTGCTGCCGAGCCTGGCC ACGTCTTCCAGTTGGGCAAGTTCGGCCGCGCCCAGCTCCTACACGAGCTCTTTCTCAAACGACGGCTCGGACCGG

+3 H A V E H V L H D Q K F A & 9301 ACGCCGTGGAGCACGTGCTGCATGATCAGAAATTTGCGTGAACGCTTGCTGGCGCCCCGAGCGCCTTCAGGCTAT TGCGGCACCTCGTGCACGACGTACTAGTCTTTAAACGCACTTGCGAACGACCGCGGGGCTCGCGGAAGTCCGATA

9376 CGCCCAGGCGCCACGCTGGCATGCCTGGCGCGCAACCTGGGGCAGCAGAACCTGGTGGCGGCCGGGGTGATCCAC GCGGGTCCGCGGTGCGACCGTACGGACCGCGCGTTGGACCCCGTCGTCTTGGACCACCGCCGGCCCCACTAGGTG

9451 GACCCGGCCCAGGGTTGGCAGGCCACGGTGCACGAACGCGTCGAGGCCCACCTGCTGATGCACATCGTCACCTGT CTGGGCCGGGTCCCAACCGTCCGGTGCCACGTGCTTGCGCAGCTCCGGGTGGACGACTACGTGTAGCAGTGGACA

9526 GAGTTCCAGCTGCAGTTGCCTGCTCCGCAAGGGGGCGAGGTCAGCCTGGAGCTGCGCCATACCGGTGCGCTTCGC CTCAAGGTCGACGTCAACGGACGAGGCGTTCCCCCGCTCCAGTCGGACCTCGACGCGGTATGGCCACGCGAAGCG

9601 CGTGCCGGCCTGGCCTGTGTGTACCGCAAGGGCGACCGGGCGCGCTTCGCCCGACTGCGCGACCGGTTGCTGCAG GCACGGCCGGACCGGACACACATGGCGTTCCCGCTGGCCCGCGCGAAGCGGGCTGACGCGCTGGCCAACGACGTC

9676 CAGGCCGCACTGGTGGCGGCGCTGATGCCGCTGGATTTCAAGCGCCTGACCTTGGCCTGGCGCGACGGCCAATGG GTCCGGCGTGACCACCGCCGCGACTACGGCGACCTAAAGTTCGCGGACTGGAACCGGACCGCGCTGCCGGTTACC 9751 TTGCTGACCCTGGAGCACATGGGCGGTAGCGAAGTGGTCAACCGCATGCCAGCGTTTCGCCGCTACATCCCCATC AACGACTGGGACCTCGTGTACCCGCCATCGCTTCACCAGTTGGCGTACGGTCGCAAAGCGGCGATGTAGGGGTAG

9826 AGCCCGCAACAGCGGGCGCACCTGATGGCCAGCCTGGCCCAGTTCAACACTTTGCTACCTAACCTTTGACGCAAA TCGGGCGTTGTCGCCCGCGTGGACTACCGGTCGGACCGGGTCAAGTTGTGAAACGATGGATTGGAAACTGCGTTT

9901 CTGGCATACGCCTTGCTGTATCAAGCGACGAATGATGACAGTTGTGCGCACATAGATAACATGTTAACAATGTGC GACCGTATGCGGAACGACATAGTTCGCTGCTTACTACTGTCAACACGCGTGTATCTATTGTACAATTGTTACACG

tynR

+3 M H T Q Q S N R Q G L E R W9976 GCATAACAACAAATCCTGCGTCGAGGGCAGCCATGCATACTCAACAATCCAACCGTCAGGGGCTGGAACGCTGGA

+3 T T A M Q Q I C G R F E T E L A S N H S L F I G E10051 CCACGGCCATGCAACAGATCTGTGGCCGTTTCGAGACGGAACTTGCGTCCAATCACTCGCTGTTCATCGGCGAGG GGTGCCGGTACGTTGTCTAGACACCGGCAAAGCTCTGCCTTGAACGCAGGTTAGTGAGCGACAAGTAGCCGCTCC

+3 V S T F S R A G L P L A N L R T N A G N I R R L G 10126 TTTCTACCTTTTCCCGTGCCGGCTTGCCGCTGGCCAACCTGCGCACCAATGCCGGCAACATCCGCCGGCTGGGCG AAAGATGGAAAAGGGCACGGCCGAACGGCGACCGGTTGGACGCGTGGTTACGGCCGTTGTAGGCGGCCGACCCGC

+3 E N P T L D D D Q H C F L V S Q R A G H S T V S Q10201 AAAACCCGACCCTTGACGATGACCAGCATTGTTTCCTGGTCAGCCAGCGTGCGGGGCATTCCACCGTGTCCCAGG TTTTGGGCTGGGAACTGCTACTGGTCGTAACAAAGGACCAGTCGGTCGCACGCCCCGTAAGGTGGCACAGGGTCC

+3 G G M Q V S L A P G E L L L M D S V G R C E I T P10276 GGGGCATGCAGGTCAGCCTGGCGCCGGGTGAGCTGCTGCTGATGGATTCGGTCGGGCGCTGCGAAATCACCCCCA CCCCGTACGTCCAGTCGGACCGCGGCCCACTCGACGACGACTACCTAAGCCAGCCCGCGACGCTTTAGTGGGGGT

+3 S G L I E H V S L A L S R E Q V R K Y V Q G S G P10351 GTGGGTTGATCGAACATGTCTCGCTGGCCCTGTCGCGTGAGCAGGTACGCAAGTATGTGCAAGGCAGCGGCCCGA CACCCAACTAGCTTGTACAGAGCGACCGGGACAGCGCACTCGTCCATGCGTTCATACACGTTCCGTCGCCGGGCT

+3 M F G K I S S S N A C G R M L H V L M D Q L C K D10426 TGTTTGGCAAGATCTCCTCGAGCAACGCCTGCGGGCGCATGCTGCATGTGCTGATGGACCAACTGTGCAAGGACG ACAAACCGTTCTAGAGGAGCTCGTTGCGGACGCCCGCGTACGACGTACACGACTACCTGGTTGACACGTTCCTGC

+3 G N V S G D G A Q G D A L Q T A F I A L L E P G F10501 GCAATGTAAGCGGTGATGGGGCCCAGGGCGACGCGCTGCAGACCGCCTTCATTGCCCTGCTGGAGCCAGGCTTCG CGTTACATTCGCCACTACCCCGGGTCCCGCTGCGCGACGTCTGGCGGAAGTAACGGGACGACCTCGGTCCGAAGC

+3 E R H G E A L G N L G A L N G A N L R G Y V Q Q V10576 AGCGCCATGGCGAAGCGCTGGGCAACCTTGGGGCCTTGAACGGGGCCAACCTGCGGGGCTACGTGCAGCAGGTGA TCGCGGTACCGCTTCGCGACCCGTTGGAACCCCGGAACTTGCCCCGGTTGGACGCCCCGATGCACGTCGTCCACT

+3 I D E S L S Q P G L T P S N L A G R L N I S V R H10651 TCGACGAGTCCCTGTCACAGCCCGGGCTGACCCCGTCCAACCTGGCCGGTCGCCTGAACATCTCGGTGCGTCACC AGCTGCTCAGGGACAGTGTCGGGCCCGACTGGGGCAGGTTGGACCGGCCAGCGGACTTGTAGAGCCACGCAGTGG

+3 L Y R L F E E E G D S V C R Y I Q R A R L K R S A10726 TGTACCGGCTGTTCGAGGAGGAGGGCGATAGTGTGTGCCGCTACATTCAGCGGGCGCGCCTGAAGCGCAGTGCGG ACATGGCCGACAAGCTCCTCCTCCCGCTATCACACACGGCGATGTAAGTCGCCCGCGCGGACTTCGCGTCACGCC

+3 D D L A N P F F R S E S I T S I A Y K W G F T D S 10801 ATGACCTGGCCAACCCGTTCTTCAGGAGCGAGTCGATTACCTCGATTGCCTACAAGTGGGGGTTTACCGACTCGG TACTGGACCGGTTGGGCAAGAAGTCCTCGCTCAGCTAATGGAGCTAACGGATGTTCACCCCCAAATGGCTGAGCC

+3 A H F S R S F K K Q F E R S P K D Y R A Q A M V & 10876 CGCATTTCAGCCGCTCGTTCAAGAAACAGTTCGAACGCTCGCCCAAGGACTACCGGGCGCAGGCGATGGTTTGAG GCGTAAAGTCGGCGAGCAAGTTCTTTGTCAAGCTTGCGAGCGGGTTCCTGATGGCCCGCGTCCGCTACCAAACTC

10951 TGTGATGGTGCTGCTTGTGCGGGCCTCATCGCCGGCAAGTCACTTGGCGGCGGTTCAGCGACGGCCGTTGAAGTA ACACTACCACGACGAACACGCCCGGAGTAGCGGCCGTTCAGTGAACCGCCGCCAAGTCGCTGCCGGCAACTTCAT & R R G N F Y

11026 GCCCGACAGCTGGTGCACGGTCTTGCCGGCAGTGAGCAGCAGCGGGCGGAAATGGTCCTTGCCGAGGATGCGCGC CGGGCTGTCGACCACGTGCCAGAACGGCCGTCACTCGTCGTCGCCCGCCTTTACCAGGAACGGCTCCTACGCGCG -2 G S L Q H V T K G A T L L L P R F H D K G L I R A

11101 ATGCTTGACCGAGCTGACCAGGTCATAGCGCTTCGATCCCTCCTGCATACCCTCGGCGAGTATCTTGCAAATGAT TACGAACTGGCTCGACTGGTCCAGTATCGCGAAGCTAGGGAGGACGTATGGGAGCCGCTCATAGAACGTTTACTA -2 H K V S S V L D Y R K S G E Q M G E A L I K C I I

11176 GTGGCTGGGCGTGACGCCAAAGCCGGAGTAGCCCTGCACATAGAAAGCGTTGGGGCGGTTGTCGAGGGTGCCTAT CACCGACCCGCACTGCGGTTTCGGCCTCATCGGGACGTGTATCTTTCGCAACCCCGCCAACAGCTCCCACGGATA -2 H S P T V G F G S Y G Q V Y F A N P R N D L T G I 11251 CTGCGGAAACAGGTTGGCACTGGTGGCCATCGGGCCGCCCCAGGCCAGGTCGATGCGCACGTCTTTCAGGTAGGG

GACGCCTTTGTCCAACCGTGACCACCGGTAGCCCGGCGGGGTCCGGTCCAGCTACGCGTGCAGAAAGTCCATCCC -2 Q P F L N A S T A M P G G W A L D I R V D K L Y P

11326 GAAAATCTTCAGCATCAGCGCGCGGTTCCACGCCTTCAGGTCCAGCGGGAAGTGCTCGACGAAGGGCGTGGCGGC CTTTTAGAAGTCGTAGTCGCGCGCCAAGGTGCGGAAGTCCAGGTCGCCCTTCACGAGCTGCTTCCCGCACCGCCG -2 F I K L M L A R N W A K L D L P F H E V F P T A A

11401 GCCAAACAGCAGGCGGTTCTCGCGGGTGACCCGGTAGTAGTCGATCACCGGGCGGATGTCGCTGTAGGCCCCGCG CGGTTTGTCGTCCGCCAAGAGCGCCCACTGGGCCATCATCAGCTAGTGGCCCGCCTACAGCGACATCCGGGGCGC -2 G F L L R N E R T V R Y Y D I V P R I D S Y A G R

11476 TATCGGGCTGATGCGCTCGATCAGCTCATCCGGCAATGGCTCGGTCATCATCTGGAAGGCATAGGTGTTTATAGT ATAGCCCGACTACGCGAGCTAGTCGAGTAGGCCGTTACCGAGCCAGTAGTAGACCTTCCGTATCCACAAATATCA -2 I P S I R E I L E D P L P E T M M Q F A Y T N I T

11551 GCGTGCGTGCAGCTGCGGCTCCAGCTTGTTGAGGAAGCTGTCGCACGCCCACAGCAGCTTGCTGGCGCGTACCGA CGCACGCACGTCGACGCCGAGGTCGAACAACTCCTTCGACAGCGTGCGGGTGTCGTCGAACGACCGCGCATGGCT -2 R A H L Q P E L K N L F S D C A W L L K S A R V S

11626 GCCACGGCCGGTGCGTACCGTGATGCGCTCGCCGTAGGTCACTTCCAGGGCCGGGCTGTGTTCGAAGATGCGCGC CGGTGCCGGCCACGCATGGCACTACGCGAGCGGCATCCAGTGAAGGTCCCGGCCCGACACAAGCTTCTACGCGCG -2 G R G T R V T I R E G Y T V E L A P S H E F I R A 11701 ACCATGGCCCACCAGTGCCTGCGCTTCGCCCAGCAGCAGGTTCAGGGAATGCACATGGCCACCGCCCATGTGCAT TGGTACCGGGTGGTCACGGACGCGAAGCGGGTCGTCGTCCAAGTCCCTTACGTGTACCGGTGGCGGGTACACGTA -2 G H G V L A Q A E G L L L N L S H V H G G G M H M

11776 CAGGGCGCTGCTGTAGGCGTTGCTGCCGATGATCTGGCGCACTTCGCTGCCACCGAGAAAACGGATCTCGTCGCG GTCCCGCGACGACATCCGCAACGACGGCTACTAGACCGCGTGAAGCGACGGTGGCTCTTTTGCCTAGAGCAGCGC -2 L A S S Y A N S G I I Q R V E S G G L F R I E D R

11851 GGTATTGATCGCCTTGAACGCCTTCTCCCATTTGCGCAGGGTCTGTTCCTGGCGGCGGTTGAAGCCCATGTAGCC CCATAACTAGCGGAACTTGCGGAAGAGGGTAAACGCGTCCCAGACAAGGACCGCCGCCAACTTCGGGTACATCGG -2 T N I A K F A K E W K R L T Q E Q R R N F G M Y G

11926 ATAGCCGTGGCAGAAGTCGGCGTCGATGGCGTAGCGGGCGATGCGGTCCTTGATGATGCCGGCGCCCAGTTCGCT TATCGGCACCGTCTTCAGCCGCAGCTACCGCATCGCCCGCTACGCCAGGAACTACTACGGCCGCGGGTCAAGCGA -2 Y G H C F D A D I A Y R A I R D K I I G A G L E S

12001 GATTTCGAAAATATCCCTCACGCCCTGATCACCGACGCTGCTGCGGATCTTCTCCAGGTCGTGGCCGATGCCCGC CTAAAGCTTTTATAGGGAGTGCGGGACTAGTGGCTGCGACGACGCCTAGAAGAGGTCCAGCACCGGCTACGGGCG -2 I E F I D R V G Q D G V S S R I K E L D H G I G A

12076 CATGATCTGCCCGCCGTTGCGCCCGCTACCGCCGTAGCCCAGATAACGGCCCTCGAGCACGACGATATTGGTCAC GTACTAGACGGGCGGCAACGCGGGCGATGGCGGCATCGGGTCTATTGCCGGGAGCTCGTGCTGCTATAACCAGTG -2 M I Q G G N R G S G G Y G L Y R G E L V V I N T V

12151 GCCTTGTTCCGCCAGCTCCAGGGCGGTGTTAATGCCGGAGAAACCGCCACCGATCACCACGACATCGGCCTCGAT CGGAACAAGGCGGTCGAGGTCCCGCCACAATTACGGCCTCTTTGGCGGTGGCTAGTGGTGCTGTAGCCGGAGCTA -2 G Q E A L E L A T N I G S F G G G I V V V D A E I

12226 GTCGCGTTCCAGGGTTGGGAAGCTCAGGTTGTACTTCTTGGTCGCCGAGTAGTAGGTGGGGCTCTCGAGGGTGAT CAGCGCAAGGTCCCAACCCTTCGAGTCCAACATGAAGAACCAGCGGCTCATCATCCACCCCGAGAGCTCCCACTA -2 D R E L T P F S L N Y K K T A S Y Y T P S E L T I

12301 CATGACGCCGCCTGCTGACTGGAAATGGGTAGAAATCATTCTATTAATGTATTAATGATTGTGCACTGGCATACT GTACTGCGGCGGACGACTGACCTTTACCCATCTTTAGTAAGATAATTACATAATTACTAACACGTGACCGTATGA -2 M V G G A S Q F H T S I M tynDhpaR

+3 M T T P R P S L T L T L L 12376 CGCCGGTTTGCTATTTCCAGCCTCCTTGAGCCCGCATGACCACACCGAGACCCTCCCTGACCCTGACCTTGCTGC GCGGCCAAACGATAAAGGTCGGAGGAACTCGGGCGTACTGGTGTGGCTCTGGGAGGGACTGGGACTGGAACGACG

+3 Q A R E A T M A F F R P A L N A H D L T E Q Q W R12451 AGGCGCGCGAAGCCACCATGGCGTTCTTCCGCCCGGCGCTGAATGCCCATGACCTGACCGAGCAGCAATGGCGGG TCCGCGCGCTTCGGTGGTACCGCAAGAAGGCGGGCCGCGACTTACGGGTACTGGACTGGCTCGTCGTTACCGCCC

+3 V I R I L R Q Q G E L E S H Q L A E L A C I L K P12526 TAATCCGTATCCTGCGCCAGCAAGGCGAGCTGGAAAGCCATCAGTTGGCGGAGCTGGCCTGTATCCTCAAACCCA ATTAGGCATAGGACGCGGTCGTTCCGCTCGACCTTTCGGTAGTCAACCGCCTCGACCGGACATAGGAGTTTGGGT

+3 S M S G V L K R L E R D G I V A R R K S P E D Q R12601 GTATGAGCGGGGTGCTCAAGCGCCTGGAGCGTGACGGCATCGTAGCGCGGCGCAAGTCGCCGGAGGACCAGCGCC

CATACTCGCCCCACGAGTTCGCGGACCTCGCACTGCCGTAGCATCGCGCCGCGTTCAGCGGCCTCCTGGTCGCGG

+3 R V F I S L T E A G Q Q A F L A M S E E M T R N Y12676 GGGTGTTCATCAGCCTGACCGAGGCCGGCCAGCAAGCGTTTCTGGCGATGAGCGAGGAGATGACCCGCAACTACG CCCACAAGTAGTCGGACTGGCTCCGGCCGGTCGTTCGCAAAGACCGCTACTCGCTCCTCTACTGGGCGTTGATGC

+3 D K I L A Q F G D D K L Q Q L M Q L L G E M K K I 12751 ACAAGATCCTCGCCCAGTTTGGCGATGACAAGCTGCAGCAGCTGATGCAGCTGCTGGGTGAAATGAAGAAGATCA TGTTCTAGGAGCGGGTCAAACCGCTACTGTTCGACGTCGTCGACTACGTCGACGACCCACTTTACTTCTTCTAGT

+3 K P &

12826 AACCCTGACGCGCCAGGCGTCAGCGGTTGAGTGACAGCGAGTCTTCCAGCACTTTCAGCAGTGCTGCCGCGCGCC TTGGGACTGCGCGGTCCGCAGTCGCCAACTCACTGTCGCTCAGAAGGTCGTGAAAGTCGTCACGACGGCGCGCGG -1 & R N L S L S D E L V K L L A A A R R

12901 GCTCATAGGCGTCGGGGCCTGCGTACATCAGCTCTACATACAGGCTGTCGATGATGCCCAGGTAGGCATCGGCAT CGAGTATCCGCAGCCCCGGACGCATGTAGTCGAGATGTATGTCCGACAGCTACTACGGGTCCATCCGTAGCCGTA -1 E Y A D P G A Y M L E V Y L S D I I G L Y A D A Y

12976 ACAGCGCCAGGCGGCTGTGCTGCTCATGCGCCCAGCCGTGGCGAGCTTGCAGGGCCACGCTGAACCCTTCGCGTA TGTCGCGGTCCGCCGACACGACGAGTACGCGGGTCGGCACCGCTCGAACGTCCCGGTGCGACTTGGGAAGCGCAT -1 L A L R S H Q E H A W G H R A Q L A V S F G E R I

13051 TGCCGTCCAGGTACTGTTCAAAGCCCGAAGTGACAATCGGCTTGATGCCCGCCGGGGGCAGGAACGCCGTGCGCA ACGGCAGGTCCATGACAAGTTTCGGGCTTCACTGTTAGCCGAACTACGGGCGGCCCCCGTCCTTGCGGCACGCGT -1 G D L Y Q E F G S T V I P K I G A P P L F A T R L

13126 ACACGAAGCGCAGTTGGGCCGAGTCGCGATAACGTTCGGCCAGGTGCAGGGCCAGCCAGTGCCCCGCCGCCAGGC TGTGCTTCGCGTCAACCCGGCTCAGCGCTATTGCAAGCCGGTCCACGTCCCGGTCGGTCACGGGGCGGCGGTCCG -1 V F R L Q A S D R Y R E A L H L A L W H G A A L G

13201 CGTCGCGGGCTTCCTGCGCAAAGCCGTGCTCGACAAAGGCCGTTTCCTGCACAAGCGCACGCTGGAACACCTCCA GCAGCGCCCGAAGGACGCGTTTCGGCACGAGCTGTTTCCGGCAAAGGACGTGTTCGCGTGCGACCTTGTGGAGGT -1 D R A E Q A F G H E V F A T E Q V L A R Q F V E V

13276 CGAACAAGGCGTCCTTGTTGGCGAAATGCGCATACAGCGATGCCTTGCGCATGCCCGCCAACTGGGCGATTTCGT GCTTGTTCCGCAGGAACAACCGCTTTACGCGTATGTCGCTACGGAACGCGTACGGGCGGTTGACCCGCTAAAGCA -1 F L A D K N A F H A Y L S A K R M G A L Q A I E N

13351 TCAGCGAAGAGGCGTCATAACCGTACTCGGCGAAGTGGCCGACGGCGGCATCGCACACACGCACCGCAGAAGGGG AGTCGCTTCTCCGCAGTATTGGCATGAGCCGCTTCACCGGCTGCCGCCGTAGCGTGTGTGCGTGGCGTCTTCCCC -1 L S S A D Y G Y E A F H G V A A D C V R V A S P S

13426 AAAGGTCTTTCAACAGCATCACTCCGTCAGGGGCGCGGCGGGCCGCGCGCGTCTTGAGGGTGGGATTGTGGTGAT TTTCCAGAAAGTTGTCGTAGTGAGGCAGTCCCCGCGCCGCCCGGCGCGCGCAGAACTCCCACCCTAACACCACTA -1 L D K L L M

tetR

13501 CGAAAATGCACGGGTCAATGCTTGTCGCAAGGCAATTTCCGGGCGCCATGGAAAGTGCAATGTTCCCCTCGTAAC GCTTTTACGTGCCCAGTTACGAACAGCGTTCCGTTAAAGGCCCGCGGTACCTTTCACGTTACAAGGGGAGCATTG

hpaB

+3 M 13576 GTGCATTCCTCCACCCAATCGCCGCTCACATACTGATCGCGTCTTCGAATCCAATAAGAAAGAGACCGCTCATGA CACGTAAGGAGGTGGGTTAGCGGCGAGTGTATGACTAGCGCAGAAGCTTAGGTTATTCTTTCTCTGGCGAGTACT

+3 K K P N P L L E D L K S V L P T I A A N A M R A E 13651 AAAAGCCAAACCCCCTGCTGGAAGACCTGAAGTCCGTCCTGCCGACCATTGCCGCCAATGCCATGCGTGCAGAGC TTTTCGGTTTGGGGGACGACCTTCTGGACTTCAGGCAGGACGGCTGGTAACGGCGGTTACGGTACGCACGTCTCG

+3 Q D R S V P A E N I A L L K S I G M H R A F L P K13726 AGGACCGCAGTGTGCCGGCAGAGAATATCGCCTTGCTGAAAAGCATCGGCATGCACCGCGCTTTCTTGCCCAAAC TCCTGGCGTCACACGGCCGTCTCTTATAGCGGAACGACTTTTCGTAGCCGTACGTGGCGCGAAAGAACGGGTTTG

+3 H F G G M E I T L P E F A Q C I A L L A G A C A S13801 ACTTCGGCGGCATGGAAATCACCCTGCCGGAGTTCGCCCAGTGCATCGCCTTGCTGGCGGGGGCCTGCGCCAGCA TGAAGCCGCCGTACCTTTAGTGGGACGGCCTCAAGCGGGTCACGTAGCGGAACGACCGCCCCCGGACGCGGTCGT

+3 T A W A M S L L C T H S H Q M A M F S P K L Q Q E13876 CAGCCTGGGCCATGAGCCTGCTGTGCACCCACAGCCACCAGATGGCAATGTTCTCGCCCAAGCTACAACAGGAGG GTCGGACCCGGTACTCGGACGACACGTGGGTGTCGGTGGTCTACCGTTACAAGAGCGGGTTCGATGTTGTCCTCC

+3 V W G S D P D A T A S S S I A P F G R T E E V E G

13951 TGTGGGGTAGCGACCCGGATGCTACCGCCAGCAGCAGTATCGCGCCGTTCGGCCGCACTGAAGAGGTTGAGGGTG ACACCCCATCGCTGGGCCTACGATGGCGGTCGTCGTCATAGCGCGGCAAGCCGGCGTGACTTCTCCAACTCCCAC

+3 G V S F S G E M G W S S G C D H A E W A I L G F R 14026 GCGTGTCGTTCAGCGGCGAAATGGGCTGGAGTTCCGGTTGCGACCACGCCGAATGGGCGATTCTCGGTTTCCGCC CGCACAGCAAGTCGCCGCTTTACCCGACCTCAAGGCCAACGCTGGTGCGGCTTACCCGCTAAGAGCCAAAGGCGG

+3 R K N A E G A Q D Y C F A I L P R S D Y E I R D D14101 GCAAGAATGCCGAAGGCGCTCAGGATTACTGCTTCGCCATCCTGCCTCGCAGTGACTATGAAATCCGTGATGACT CGTTCTTACGGCTTCCGCGAGTCCTAATGACGAAGCGGTAGGACGGAGCGTCACTGATACTTTAGGCACTACTGA

+3 W Y A V G M R G S G S K T L I V R D A F V P E H R 14176 GGTATGCCGTGGGCATGCGCGGCAGCGGCAGCAAGACCCTGATCGTGCGTGATGCCTTCGTGCCCGAGCACCGCA CCATACGGCACCCGTACGCGCCGTCGCCGTCGTTCTGGGACTAGCACGCACTACGGAAGCACGGGCTCGTGGCGT

+3 I Q K A K D M M E G K S A G F G L Y P D S K I F F14251 TCCAGAAGGCCAAGGACATGATGGAGGGCAAGTCGGCGGGCTTTGGTTTGTACCCCGACAGCAAGATTTTCTTCG AGGTCTTCCGGTTCCTGTACTACCTCCCGTTCAGCCGCCCGAAACCAAACATGGGGCTGTCGTTCTAAAAGAAGC

+3 A P Y R P Y F A S G F S T V S L G V A E R M L E V 14326 CCCCGTATCGCCCGTATTTTGCCAGCGGCTTCTCCACGGTCAGCTTGGGCGTTGCCGAGCGCATGCTGGAGGTGT GGGGCATAGCGGGCATAAAACGGTCGCCGAAGAGGTGCCAGTCGAACCCGCAACGGCTCGCGTACGACCTCCACA

+3 F R E K T R N R V R A Y T G A A V G A A T P A L M 14401 TCCGCGAGAAAACCCGCAACCGCGTGCGTGCCTACACCGGTGCTGCCGTGGGCGCCGCCACCCCGGCGCTGATGC AGGCGCTCTTTTGGGCGTTGGCGCACGCACGGATGTGGCCACGACGGCACCCGCGGCGGTGGGGCCGCGACTACG

+3 R L A E S T H Q V A A A R A L L E K S W D E I A E14476 GCCTGGCCGAGTCGACCCATCAGGTGGCCGCTGCCCGGGCATTGCTGGAAAAGAGCTGGGACGAGATTGCCGAGC CGGACCGGCTCAGCTGGGTAGTCCACCGGCGACGGGCCCGTAACGACCTTTTCTCGACCCTGCTCTAACGGCTCG

+3 H S A R H E Y P S R G T L A F W R T N Q G Y A V K14551 ACAGTGCCCGTCACGAATACCCGTCGCGTGGCACGCTGGCGTTCTGGCGTACCAACCAGGGCTACGCCGTGAAGA TGTCACGGGCAGTGCTTATGGGCAGCGCACCGTGCGACCGCAAGACCGCATGGTTGGTCCCGATGCGGCACTTCT

+3 M C I Q A V D R L M E A A G G G A W F E S N E L Q14626 TGTGCATCCAGGCCGTCGACCGCCTGATGGAAGCGGCCGGTGGTGGCGCCTGGTTCGAGAGCAACGAACTGCAGC ACACGTAGGTCCGGCAGCTGGCGGACTACCTTCGCCGGCCACCACCGCGGACCAAGCTCTCGTTGCTTGACGTCG

+3 R L F R D S H M T G A H A Y T D Y D V C A Q I L G14701 GGCTGTTCCGCGATTCGCACATGACCGGTGCCCATGCCTACACCGATTACGACGTGTGTGCGCAAATCCTCGGCC CCGACAAGGCGCTAAGCGTGTACTGGCCACGGGTACGGATGTGGCTAATGCTGCACACACGCGTTTAGGAGCCGG

+3 R E L M G L E P D P A M V & 14776 GCGAGCTGATGGGCCTGGAGCCTGACCCGGCGATGGTCTGAGCCGCCACTTGTTTTCACCCATCCCCTACAAGCA CGCTCGACTACCCGGACCTCGGACTGGGCCGCTACCAGACTCGGCGGTGAACAAAAGTGGGTAGGGGATGTTCGT

hpaC

+1 M S K E T F D S R A 14851 CAACAACAAACAGGGCAGGCTGCCAGGCCTGCCCGGGAGTCTTGCATGTCCAAAGAAACCTTCGATTCACGTGCC GTTGTTGTTTGTCCCGTCCGACGGTCCGGACGGGCCCTCAGAACGTACAGGTTTCTTTGGAAGCTAAGTGCACGG

+1 F R R A L G N F A T G V T V V T A A G P S G R K V 14926 TTCCGCCGCGCCCTGGGCAACTTCGCCACCGGCGTGACCGTGGTGACTGCCGCCGGCCCCAGTGGCCGCAAGGTC AAGGCGGCGCGGGACCCGTTGAAGCGGTGGCCGCACTGGCACCACTGACGGCGGCCGGGGTCACCGGCGTTCCAG

+1 G V T A N S F N S V S L D P A L I L W S I D K R S 15001 GGCGTTACCGCCAACAGCTTCAACTCGGTGTCGCTGGACCCGGCGCTGATCCTGTGGAGCATCGACAAGCGCTCC CCGCAATGGCGGTTGTCGAAGTTGAGCCACAGCGACCTGGGCCGCGACTAGGACACCTCGTAGCTGTTCGCGAGG

+1 T S H E V F E E A S H F A V N I L A A D Q I D L S15076 ACCAGCCATGAAGTGTTCGAAGAGGCCTCGCACTTTGCCGTGAACATTCTGGCTGCGGACCAGATCGACCTGTCC TGGTCGGTACTTCACAAGCTTCTCCGGAGCGTGAAACGGCACTTGTAAGACCGACGCCTGGTCTAGCTGGACAGG

+1 N N F A R P K E D R F A G I D Y E T G T G G A P L 15151 AACAACTTTGCCCGCCCGAAGGAAGATCGCTTTGCCGGTATCGACTACGAGACCGGCACTGGCGGCGCGCCGTTG TTGTTGAAACGGGCGGGCTTCCTTCTAGCGAAACGGCCATAGCTGATGCTCTGGCCGTGACCGCCGCGCGGCAAC

+1 F A D C A A R F E C E K Y Q Q L D G G D H W I L V15226 TTCGCCGATTGCGCGGCGCGCTTTGAGTGTGAAAAGTACCAGCAGCTGGACGGTGGCGATCACTGGATCCTGGTG AAGCGGCTAACGCGCCGCGCGAAACTCACACTTTTCATGGTCGTCGACCTGCCACCGCTAGTGACCTAGGACCAC

+1 G K V V A F D D F G R S P L L Y H Q G A Y S M V L15301 GGCAAGGTAGTGGCCTTTGATGACTTTGGCCGCTCGCCGCTGCTGTATCACCAGGGCGCCTATTCAATGGTGCTG CCGTTCCATCACCGGAAACTACTGAAACCGGCGAGCGGCGACGACATAGTGGTCCCGCGGATAAGTTACCACGAC

+1 P H T R M T Q G A E G Q A P S S H F Q G R L Q H N15376 CCGCATACCCGCATGACCCAAGGCGCAGAGGGGCAGGCACCGAGCAGCCACTTCCAGGGCCGCCTGCAGCACAAC GGCGTATGGGCGTACTGGGTTCCGCGTCTCCCCGTCCGTGGCTCGTCGGTGAAGGTCCCGGCGGACGTCGTGTTG

+1 L Y Y L M T Q A L R A Y Q A D Y Q P R Q L C T G L15451 CTGTACTACCTGATGACCCAGGCGCTGCGTGCCTACCAGGCTGACTACCAGCCACGCCAGCTGTGTACCGGCCTG GACATGATGGACTACTGGGTCCGCGACGCACGGATGGTCCGACTGATGGTCGGTGCGGTCGACACATGGCCGGAC

+1 R T S E A R M L M V L E N D A G L S L N D L Q R E15526 CGCACCAGCGAGGCACGCATGCTGATGGTGCTGGAGAACGATGCGGGCCTGAGCCTGAACGACCTGCAACGCGAA GCGTGGTCGCTCCGTGCGTACGACTACCACGACCTCTTGCTACGCCCGGACTCGGACTTGCTGGACGTTGCGCTT

+1 V A M P A R E I E E A V A N L K R K G L I A D D E 15601 GTGGCGATGCCGGCGCGGGAGATCGAGGAAGCGGTTGCCAACCTCAAGCGCAAAGGGCTGATTGCCGATGACGAA CACCGCTACGGCCGCGCCCTCTAGCTCCTTCGCCAACGGTTGGAGTTCGCGTTTCCCGACTAACGGCTACTGCTT

+1 G R V R L S V K G V D E T E A L W T I A R Q Q Q D15676 GGGCGAGTGCGGCTATCGGTGAAGGGCGTGGACGAGACCGAGGCGTTGTGGACCATTGCCCGGCAACAGCAGGAC CCCGCTCACGCCGATAGCCACTTCCCGCACCTGCTCTGGCTCCGCAACACCTGGTAACGGGCCGTTGTCGTCCTG

+1 K V F G Q F S E Q Q L E T F K T V L K A L I N I & 15751 AAGGTGTTCGGGCAGTTCAGTGAACAGCAGCTGGAGACTTTCAAGACCGTGCTCAAGGCCCTTATCAACATCTGA TTCCACAAGCCCGTCAAGTCACTTGTCGTCGACCTCTGAAAGTTCTGGCACGAGTTCCGGGAATAGTTGTAGACT

15826 ACACGCTTTGGGATGGCACCGGCTGTTTTGGATGGCACCGGCTGTGCCGGTGTTCGCGGATGAACCCGCTCCCAC TGTGCGAAACCCTACCGTGGCCGACAAAACCTACCGTGGCCGACACGGCCACAAGCGCCTACTTGGGCGAGGGTG

15901 AGGTCCAGCGCCAGTAGCAACTTCGGCGCGGTACCTGTGGGAGCGGCTTTAGCCGCGAACACCGGCAAAGCCGGT TCCAGGTCGCGGTCATCGTTGAAGCCGCGCCATGGACACCCTCGCCGAAATCGGCGCTTGTGGCCGTTTCGGCCA

15976 GCCATCCAACCAGAAGCCTCAGTAGGCACCACCCCCGGCACTGGGGACTACCACTGTATCCTTGAACTTCCCCGC CGGTAGGTTGGTCTTCGGAGTCATCCGTGGTGGGGGCCGTGACCCCTGATGGTGACATAGGAACTTGAAGGGGCG -2 G D L W F G & Y A G G G A S P V V V T D K F K G A

16051 CAGCTCGCGCAGCCCGCGCATCAGCACCGTGGTATCCACACCCACCGCCACAAACGCCGCACCCAGCTCGATGTA GTCGAGCGCGTCGGGCGCGTAGTCGTGGCACCATAGGTGTGGGTGGCGGTGTTTGCGGCGTGGGTCGAGCTACAT -2 L E R L G R M L V T T D V G V A V F A A G L E I Y

16126 GCGTCGCGCCAGTTTCTCGTCCGCGCTGAGAATGCCGGCGGCTTTGCCCGCCTTGCCAATGCGCACGATTGCGTC CGCAGCGCGGTCAAAGAGCAGGCGCGACTCTTACGGCCGCCGAAACGGGCGGAACGGTTACGCGTGCTAACGCAG -2 R R A L K E D A S L I G A A K G A K G I R V I A D

16201 TTCAATCGCCGCCTGCACCTCCGGGTGCCCGGGGTTGCCGCGATGCCCCATGGCCGCACTCAGGTCTGCAGGCCC AAGTTAGCGGCGGACGTGGAGGCCCACGGGCCCCAACGGCGCTACGGGGTACCGGCGTGAGTCCAGACGTCCGGG -2 E I A A Q V E P H G P N G R H G M A A S L D A P G

16276 GATGAACACGCCATCCACACCTTCCACTGCAACGATCTCGTCCAGGTTGGCCAGGCCTTCCTTGTTCTCGATCTG CTACTTGTGCGGTAGGTGTGGAAGGTGACGTTGCTAGAGCAGGTCCAACCGGTCCGGAAGGAACAAGAGCTAGAC -2 I F V G D V G E V A V I E D L N A L G E K N E I Q

16351 CACCAGCAGGCACATTTGCTCATCGGCGTGGTCCAGGTAACCGGGGAGGGTGTTCCAGCGCGAAGCCCGCGCCAG GTGGTCGTCCGTGTAAACGAGTAGCCGCACCAGGTCCATTGGCCCCTCCCACAAGGTCGCGCTTCGGGCGCGGTC -2 V L L C M Q E D A H D L Y G P L T N W R S A R A L

16426 CGCGCTGCCCACCCCGCGAATGCCCTTGGGCGGGTAATGCATGGCCTTGACCAGTTGCCGCGCCTGTTCGGCAGT GCGCGACGGGTGGGGCGCTTACGGGAACCCGCCCATTACGTACCGGAACTGGTCAACGGCGCGGACAAGCCGTCA -2 A S G V G R I G K P P Y H M A K V L Q R A Q E A T

16501 TTCCACCATCGGCACCAGCAAGGTTTGTGCGCCGATATCCAGCACCTGCTTGATCAGCGCGGTATCGCCGATCAC AAGGTGGTAGCCGTGGTCGTTCCAAACACGCGGCTATAGGTCGTGGACGAACTAGTCGCGCCATAGCGGCTAGTG -2 E V M P V L L T Q A G I D L V Q K I L A T D G I V

16576 CGGGCGGATCACTGCCTGGCTGGGGTAGGGTGCCACCGCCTGCAACTGGGCGAGCATGCCGCGCAGGTCGTTGGG GCCCGCCTAGTGACGGACCGACCCCATCCCACGGTGGCGGACGTTGACCCGCTCGTACGGCGCGTCCAGCAACCC -2 P R I V A Q S P Y P A V A Q L Q A L M G R L D N P

16651 CGCGTGTTCGCCGTCGATCAGCAGCCAGTCGAAACCGGCATTGGCCGCCAGCTCGGCGCAGTAGGCATCGGCCAG

GCGCACAAGCGGCAGCTAGTCGTCGGTCAGCTTTGGCCGTAACCGGCGGTCGAGCCGCGTCATCCGTAGCCGGTC -2 A H E G D I L L W D F G A N A A L E A C Y A D A L

16726 GCCGAGCCACAGGCCGATTTGCGGTTCACCGCTGTGCAGGCGTCGCTTGAAGTGGTTGATGGGCATGTCCATGAG CGGCTCGGTGTCCGGCTAAACGCCAAGTGGCGACACGTCCGCAGCGAACTTCACCAACTACCCGTACAGGTACTC -2 G L W L G I Q P E G S H L R R K F H N I P M D M

hpaI

16801 CAGGTCCTTAAACGAAGCGGCAGGCGATGGAGCCGAGCATGTCGTAGTCGACGTGGAAGGTGTCACCTGGGCGAG GTCCAGGAATTTGCTTCGCCGTCCGCTACCTCGGCTCGTACAGCATCAGCTGCACCTTCCACAGTGGACCCGCTC -1 # V F R C A I S G L M D Y D V H F T D G P R A

16876 CGGCGACCGGGCGGGTGAACGAACCCCCAAGGATGATCTGGCCGGGCTGCAAGGTGACGTCGTACGGCGCCAGTT GCCGCTGGCCCGCCCACTTGCTTGGGGGTTCCTACTAGACCGGCCCGACGTTCCACTGCAGCATGCCGCGGTCAA -1 A V P R T F S G G L I I Q G P Q L T V D Y P A L K

16951 TGTTGGCCAGCCAGGCAACGCCTTTGGCCGGGTGGTTGAGCACGGCAGCGCTGACCCCGGATTCCTCGATCACGC ACAACCGGTCGGTCCGTTGCGGAAACCGGCCCACCAACTCGTGCCGTCGCGACTGGGGCCTAAGGAGCTAGTGCG -1 N A L W A V G K A P H N L V A A S V G S E E I V G

17026 CATTGCGGTAGAGCACCGCCGGCACTTTGCGCAGGTCGATTTCGGTGGGGCGCACGGCCCGCCCGCCCATCACCA GTAACGCCATCTCGTGGCGGCCGTGAAACGCGTCCAGCTAAAGCCACCCCGCGTGCCGGGCGGGCGGGTAGTGGT -1 N R Y L V A P V K R L D I E T P R V A R G G M V V

17101 CGCCGGCATTGGCGGCGTTGTCGGAGATGGTGTCGAACACCTTGCGGGTGGCCTGGGTTTGCGGGTCCACCTGCT GCGGCCGTAACCGCCGCAACAGCCTCTACCACAGCTTGTGGAACGCCCACCGGACCCAAACGCCCAGGTGGACGA -1 G A N A A N D S I T D F V K R T A Q T Q P D V Q Q

17176 GGATGCGCGCGTCAATGATTTCCAGCGCCGGGATCACCCACTCGGTGGCGTCCAGCACATCAAACACGGTGATGT CCTACGCGCGCAGTTACTAAAGGTCGCGGCCCTAGTGGGTGAGCCACCGCAGGTCGTGTAGTTTGTGCCACTACA -1 I R A D I I E L A P I V W E T A D L V D F V T I N

17251 TCGGGCCCTTCAGCGGCTTGCCGAGGATGAACGCCAACTCCACTTCAACCCGCGGCACGATGAAGCGCTCGAAGG AGCCCGGGAAGTCGCCGAACGGCTCCTACTTGCGGTTGAGGTGAAGTTGGGCGCCGTGCTACTTCGCGAGCTTCC -1 P G K L P K G L I F A L E V E V R P V I F R E F P

17326 GGATGTCGCTGCCTTCGTCGAACAGCATGTCGTCGAGCAAGGCGCCGTAGTCGGGCTCGGTGATGTTCGACGATA CCTACAGCGACGGAAGCAGCTTGTCGTACAGCAGCTCGTTCCGCGGCATCAGCCCGAGCCACTACAAGCTGCTAT -1 I D S G E D F L M D D L L A G Y D P E T I N S S V

17401 CCTGCATGGCGCGCGAGGTCAGGCCGATCTTGTGGCCCACCAGCTTGCGCCCGGCGGCGATCTTTTTTGCCACCC GGACGTACCGCGCGCTCCAGTCCGGCTAGAACACCGGGTGGTCGAACGCGGGCCGCCGCTAGAAAAAACGGTGGG -1 Q M A R S T L G I K H G V L K R G A A I K K A V W

17476 AGGCGCGCTGGATGGCGTAGGCGTCTTCGATGGTGATTGCCGGTTGCTCCAGCGAGAACTGGCGCACTTGCTCGC TCCGCGCGACCTACCGCATCCGCAGAAGCTACCACTAACGGCCAACGAGGTCGCTCTTGACCGCGTGAACGAGCG -1 A R Q I A Y A D E I T I A P Q E L S F Q R V Q E R

17551 GGGAGCGTTCGGCCTGGTCGAGGCGGTCGGCGGCGTGCTGGATGAAAGCGTTGTCTAGCATGGGGGCGGTCTCTT CCCTCGCAAGCCGGACCAGCTCCGCCAGCCGCCGCACGACCTACTTTCGCAACAGATCGTACCCCCGCCAGAGAA -1 S R E A Q D L R D A A H Q I F A N D L M

hpaH

17626 GATTCAAGGGTTGACGATGGCAGCCTGGGTGCGCAACACCAGCAGGCCGCCCAGGGCGATGAAGACGGCGAGTAC CTAAGTTCCCAACTGCTACCGTCGGACCCACGCGTTGTGGTCGTCCGGCGGGTCCCGCTACTTCTGCCGCTCATG -2 & P N V I A A Q T R L V L L G G L A I F V A L V

17701 GTACAGAGCAAGGCTGGCGCTGTGGGTGGTGTCGCGCACCCAGCCGATGAAGTAGGGCGTGAAGAACGAGGCGAT CATGTCTCGTTCCGACCGCGACACCCACCACAGCGCGTGGGTCGGCTACTTCATCCCGCACTTCTTGCTCCGCTA -2 Y L A L S A S H T T D R V W G I F Y P T F F S A I

17776 GCTGCCCAGCGAGCTGATCAGGGCAATGCCGGCGGCCTGGGTACGGGCGTTGAGGAACGCCGGCGGCAGTTGCCA CGACGGGTCGCTCGACTAGTCCCGTTACGGCCGCCGGACCCATGCCCGCAACTCCTTGCGGCCGCCGTCAACGGT -2 S G L S S I L A I G A A Q T R A N L F A P P L Q W

17851 GAACATCGGCAGCGCAGCGCTGGCGCCCATGCCGGCCAGCACCAGGCCGGCCATTACCGGCAGCGCCTGCTCGGG CTTGTAGCCGTCGCGTCGCGACCGCGGGTACGGCCGGTCGTGGTCCGGCCGGTAATGGCCGTCGCGGACGAGCCC -2 F M P L A A S A G M G A L V L G A M V P L A Q E P

17926 GGCAATGGCCGCAATAGCGATGCCGATGGCAGCCATCAGCAGCGGTACGCACAGGTGCCAGCGGCGTTCGCGTTG CCGTTACCGGCGTTATCGCTACGGCTACCGTCGGTAGTCGTCGCCATGCGTGTCCACGGTCGCCGCAAGCGCAAC

- 2 A I A A I A I G I A A M L L P V C L H W R R E R Q

18001 GCGGTCGCTGGAGCGGCCGCACGCCAGCATGAACACGCAGCCGGCCACGTACGGCACAGCGCTGAGCAGGCCGAC CGCCAGCGACCTCGCCGGCGTGCGGTCGTACTTGTGCGTCGGCCGGTGCATGCCGTGTCGCGACTCGTCCGGCTG -2 R D S S R G C A L M F V C G A V Y P V A S L L G V

18076 ACTGGCGTCGCTGGCCACACCGGCACTGTGAATCAGGCTGGGCATCCAGAACGCAAGGGTATTCACCGCCAGCAT TGACCGCAGCGACCGGTGTGGCCGTGACACTTAGTCCGACCCGTAGGTCTTGCGTTCCCATAAGTGGCGGTCGTA -2 S A D S A V G A S H I L S P M W F A L T N V A L M

18151 CACCGCGCAATACACGGCCACCAACAGCCACAGCGCACGGCTTGCGAAAATGGCGCCGAACGAGGTTACGGGCTT GTGGCGCGTTATGTGCCGGTGGTTGTCGGTGTCGCGTGCCGAACGCTTTTACCGCGGCTTGCTCCAATGCCCGAA -2 V A C Y V A V L L W L A R S A F I A G F S T V P K

18226 GCGCTGTTCTTCCTCACCGAATTGCGCGCGCAGCGTGGCTTTCTGCTGCTCATCCAGCCAGCTCACCCGCTCGAA CGCGACAAGAAGGAGTGGCTTAACGCGCGCGTCGCACCGAAAGACGACGAGTAGGTCGGTCGAGTGGGCGAGCTT -2 R Q E E E G F Q A R L T A K Q Q E D L W S V R E F

18301 GTGCTCCGGCAAAACGGCCAGTACCACCAGGCCCAGCAACACCACCGGCGCCCCTTCGAGCAGGAACATCCACTG CACGAGGCCGTTTTGCCGGTCATGGTGGTCCGGGTCGTTGTGGTGGCCGCGGGGAAGCTCGTCCTTGTAGGTGAC -2 H E P L V A L V V L G L L V V P A G E L L F M W Q

18376 CCAGCCACGCAGCCCGCCCGTGTCGTGCATGAAGGCCAGTATGGCCCCGGACACTGGCCCGCCGACCACTCCGGC GGTCGGTGCGTCGGGCGGGCACAGCACGTACTTCCGGTCATACCGGGGCCTGTGACCGGGCGGCTGGTGAGGCCG -2 W G R L G G T D H M F A L I A G S V P G G V V G A

18451 CAACGGCACGGCAATGGCGAACAGCGCGGTGACCTGGGCGCGGCGCCCGGCCGGGTACCAGCGGTTGAGGTAAAC GTTGCCGTGCCGTTACCGCTTGTCGCGCCACTGGACCCGCGCCGCGGGCCGGCCCATGGTCGCCAACTCCATTTG -2 L P V A I A F L A T V Q A R R G A P Y W R N L Y V

18526 CAGAATGCCCGGGAAGAACCCGGCCTCGGCCGCGCCCAGGGCAAAGCGCAACAGGTAGAACGCGCTGCTGCTTTC GTCTTACGGGCCCTTCTTGGGCCGGAGCCGGCGCGGGTCCCGTTTCGCGTTGTCCATCTTGCGCGACGACGAAAG -2 L I G P F F G A E A A G L A F R L L Y F A S S S E

18601 GATCAGCAGCATGCTGGTCGACAACAGCCCCCACACCACCATCAGGCAGGCGATCCAGCGGCGTGGGCCAACGCG CTAGTCGTCGTACGACCAGCTGTTGTCGGGGGTGTGGTGGTAGTCCGTCCGCTAGGTCGCCGCACCCGGTTGCGC -2 I L L M S T S L L G W V V M L C A I W R R P G V R

18676 GTCGAGCATCAGGTTGCTGGGGACGCCGAACAGCGCATAGGCAATGAAGAACAGCCCGGCACCCAGGCCATAGAC CAGCTCGTAGTCCAACGACCCCTGCGGCTTGTCGCGTATCCGTTACTTCTTGTCGGGCCGTGGGTCCGGTATCTG -2 D L M L N S P V G F L A Y A I F F L G A G L G Y V

18751 CGTGTCGGACAAATGCAGGTCCTGGCTCATCTGCATCTTGGCGAAGCCAATGTTGATGCGGTCCAGGTGGGCGAA GCACAGCCTGTTTACGTCCAGGACCGAGTAGACGTAGAACCGCTTCGGTTACAACTACGCCAGGTCCACCCGCTT -2 T D S L H L D Q S M Q M K A F G I N I R D L H A F

18826 CAGGTAGCACACCAGCAGCAGCGGCATCAGCCGCCAGGTGACTGCCCGATGGGTACTGTCGGCCCGTTCAACGTG GTCCATCGTGTGGTCGTCGTCGCCGTAGTCGGCGGTCCACTGACGGGCTACCCATGACAGCCGGGCAAGTTGCAC -2 L Y C V L L L P M L R W T V A R H T S D A R E V H

18901 TGCCTCGCGCGGCGAGGCTTGTTCGAGTGTGCTCATGTTTTTGTACTTATTCTGTAATGAGTCGGGGAGGGCGTG ACGGAGCGCGCCGCTCCGAACAAGCTCACACGAGTACAAAAACATGAATAAGACATTACTCAGCCCCTCCCGCAC -2 A E R P S A Q E L T S M

hpaX

18976 GTTTGAGCCGGCGCGCTAGCGGTTGAACAGTGGGTGCAAGGTGCTGTGCTTGGCGTCGTAGACCTGGGCGGTGCT CAAACTCGGCCGCGCGATCGCCAACTTGTCACCCACGTTCCACGACACGAACCGCAGCATCTGGACCCGCCACGA @ R N F L P H L T S H K A D Y V Q A T S

19051 GTGGTCGATCTGCACGGTGATGCCGATCGGGCGCTGTTGCAGCAGTGGGTCCAGGCGCGCTTTCAACACTGCCAG CACCAGCTAGACGTGCCACTACGGCTAGCCCGCGACAACGTCGTCACCCAGGTCCGCGCGAAAGTTGTGACGGTC -2 H D I Q V T I G I P R Q Q L L P D L R A K L V A L

19126 CAAGCTGTCGCCCACTGTTTTGTGCACCTCGGCGCTACGGCCGGTAGCCATGCGCAGGTTGGCGTACAGAAAGCC GTTCGACAGCGGGTGACAAAACACGTGGAGCCGCGATGCCGGCCATCGGTACGCGTCCAACCGCATGTCTTTCGG -2 L S D G V T K H V E A S R G T A M R L N A Y L F G

19201 GTATTCGCCTTTGCCGTCGGCCACCGCGCAATGGGCGGCGGGGTAGGCCAGCACGCGTGTACCGCCAGTGGGGAA CATAAGCGGAAACGGCAGCCGGTGGCGCGTTACCCGCCGCCCCATCCGGTCGTGCGCACATGGCGGTCACCCCTT -2 Y E G K G D A V A C H A A P Y A L V R T G G T P F

19276 CACGGCTTTGCCTTCGGCATCGCGCTGTTCGAGCATGGTGTCGGCCAGGGCGCGGCACAGGCCGGGGATGTCGGC

GTGCCGAAACGGAAGCCGTAGCGCGACAAGCTCGTACCACAGCCGGTCCCGCGCCGTGTCCGGCCCCTACAGCCG -2 V A K G E A D R Q E L M T D A L A R C L G P I D A

19351 GTCGGTTTCCAGGTCGGGGGTATAGAGCAGAACCAGGTGTGGCATGGGGGCCTCCTCGGTGAGGGGCGGCTGGCC CAGCCAAAGGTCCAGCCCCCATATCTCGTCTTGGTCCACACCGTACCCCCGGAGGAGCCACTCCCCGCCGACCGG -2 D T E L D P T Y L L V L H P M

hpaF

19426 ACCCGCCAGGGCGACCAGCCGCGAACGGGTGGGTTACAGGCGGCTGGTGGGCACCACGGCGGCCGGGTTGGCGGC TGGGCGGTCCCGCTGGTCGGCGCTTGCCCACCCAATGTCCGCCGACCACCCGTGGTGCCGCCGGCCCAACCGCCG # L R S T P V V A A P N A A

19501 CTGGGCAGCGGGGATGGCACCACCGTCCTGCGGGGTGACCGGGAAGATCGCGTTGATCTGGCCGGTGCCGGAAGA GACCCGTCGCCCCTACCGTGGTGGCAGGACGCCCCACTGGCCCTTCTAGCGCAACTAGACCGGCCACGGCCTTCT -2 Q A A P I A G G D Q P T V P F I A N I Q G T G S S

19576 GCCGAAGTAGGGCGTGACCACTTCGGCCTTGCCGTCGTAATCGGACCAGCCCAGCGCACCCAGCAGCATTGCCGT CGGCTTCATCCCGCACTGGTGAAGCCGGAACGGCAGCATTAGCCTGGTCGGGTCGCGTGGGTCGTCGTAACGGCA -2 G F Y P T V V E A K G D Y D S W G L A G L L M A T

19651 GTCGTGCATGAAGCCTTCACCGTGGCCTTTGGCGGCGTACTCCGGCAGCATCCCGCAGAACGCTTCCCACTCGCC CAGCACGTACTTCGGAAGTGGCACCGGAAACCGCCGCATGAGGCCGTCGTAGGGCGTCTTGCGAAGGGTGAGCGG -2 D H M F G E G H G K A A Y E P L M G C F A E W E G

19726 GTCCTGCCACATTTGCACCACACGGTGGTCGAGGGTTTCGAGGAACGGGCTCCACACCTTGGTGGCAAAGTCCGG CAGGACGGTGTAAACGTGGTGTGCCACCAGCTCCCAAAGCTCCTTGCCCGAGGTGTGGAACCACCGTTTCAGGCC -2 D Q W M Q V V R H D L T E L F P S W V K T A F D P

19801 CGCCTGGCCGTTCTGCGCGAAGCGGTGCGACAGCGAGCCGCTGGCCAGGAACGCCACGGTGCCGTCGTAGTGGTC GCGGACCGGCAAGACGCGCTTCGCCACGCTGTCGCTCGGCGACCGGTCCTTGCGGTGCCACGGCAGCATCACCAG -2 A Q G N Q A F R H S L S G S A L F A V T G D Y H D

19876 TTCTACTGCCTTGCGCATGGCCCAGCCCAGGCGGGCACTGTCGGCCAGGTAGTGCGAGGTGCACAGGGCCGAGAC AAGATGACGGAACGCGTACCGGGTCGGGTCCGCCCGTGACAGCCGGTCCATCACGCTCCACGTGTCCCGGCTCTG -2 E V A K R M A W G L R A S D A L Y H S T C L A S V

19951 CGAGACCACTTTGAAGTGCTGGTCCTGGTTCATGTAGCGCATGGGCACCAGGGTGCCGTATTCCGGGGCGAGGGT GCTCTGGTGAAACTTCACGACCAGGACCAAGTACATCGCGTACCCGTGGTCCCACGGCATAAGGCCCCGCTCCCA -2 S V V K F H Q D Q N M Y R M P V L T G Y E P A L T

20026 GGTGGCGTGGTGGGCCATGGTTTCGACGTTGAAGCGGTTGCACTCCTCGGCCAGCAGCTTGCCCAGCTCGGGATT CCACCGCACCACCCGGTACCAAAGCTGCAACTTCGCCAACGTGAGGAGCCGGTCGTCGAACGGGTCGAGCCCTAA -2 T A H H A M T E V N F R N C E E A L L K G L E P N

20101 GCCGGGGAATGCGTAGGGCATGTTGCTGATGAAGTGCGGCAGTTCGTTGCTGGTGTACACGCCCTCGAAATGCGG CGGCCCCTTACGCATCCCGTACAACGACTACTTCACGCCGTCAAGCAACGACCACATGTGCGGGAGCTTTACGCC -2 G P F A Y P M N S I F H P L E N S T Y V G E F H P

20176 CCCGCACAGCACGTGGTAGTTGGCGTTGACCAGCCAGTGCGTGTCGAACACGACGATGGTGTCCACGCCCAGCTC GGGCGTGTCGTGCACCATCAACCGCAACTGGTCGGTCACGCACAGCTTGTGCTGCTACCACAGGTGCGGGTCGAG -2 G C L V H Y N A N V L W H T D F V V I T D V G L E

20251 ACGGCAACGGCGGCTGATTTCGTGATGCCCGTCGATGGCCGCCTGGCGAAAGCCTTGGCGCGGGCCTGGCAGTTC TGCCGTTGCCGCCGACTAAAGCACTACGGGCAGCTACCGGCGGACCGCTTTCGGAACCGCGCCCGGACCGTCAAG -2 R C R R S I E H H G D I A A Q R F G Q R P G P L E

20326 GGACATGTACATGGACGGTACATGGGTAATCTTGGCAGTGAGAGCGAGTTTGCCCATGGGGGTCTCCGATAAGAC CCTGTACATGTACCTGCCATGTACCCATTAGAACCGTCACTCTCGCTCAAACGGGTACCCCCAGAGGCTATTCTG -2 S M Y M S P V H T I K A T L A L K G M

hpaD

20401 GCTGTTGTTGTTTTGGGGCTGACCCGGTCCCTTGTAGGAGCGGCCTTGTTCCGGGATGGGGCGCACAGCGGCCCC CGACAACAACAAAACCCCGACTGGGCCAGGGAACATCCTCGCCGGAACAAGGCCCTACCCCGCGTGTCGCCGGGG

20476 GGCGATATCTGCGGCGAGGCTGAAATCCAGGGGCCGCTGCGCGCCCCATCGCGGGCACAAGGCCGCTCCTACACC CCGCTATAGACGCCGCTCCGACTTTAGGTCCCCGGCGACGCGCGGGGTAGCGCCCGTGTTCCGGCGAGGATGTGG

20551 CGGGCGGTGTAAACCGCACAGAGGGTTAGATGCCCCAGCGAGGAATGTGGTGATTACCCATGGAAATACACACGT GCCCGCCACATTTGGCGTGTCTCCCAATCTACGGGGTCGCTCCTTACACCACTAATGGGTACCTTTATGTGTGCA # I G W R P I H H N G M S I C V N

20626 TCTTGATCTCTGCAAAGACCTCGAAGCTGTACTGCCCGCCCTCACGCCCGGTACCGGAACCTTTCACGCCGCCGA AGAACTAGAGACGTTTCTGGAGCTTCGACATGACGGGCGGGAGTGCGGGCCATGGCCTTGGAAAGTGCGGCGGCT

- 1 K I E A F V E F S Y Q G G E R G T G S G K V G G F

20701 ACGGCTGGCGCAGGTCGCGTACGTTCTGGCTGTTGATGAACACCATGCCGGCCTCGATGCCACGGGCCAGGCGAT TGCCGACCGCGTCCAGCGCATGCAAGACCGACAACTACTTGTGGTACGGCCGGAGCTACGGTGCCCGGTCCGCTA -1 P Q R L D R V N Q S N I F V M G A E I G R A L R H

20776 GGGCTTTGCCGATGTCCTGGGTCCAGATGTACGAGGCCAGGCCATACTCGGTGTCGTTGGCCAGTTGCAGCGCCT CCCGAAACGGCTACAGGACCCAGGTCTACATGCTCCGGTCCGGTATGAGCCACAGCAACCGGTCAACGTCGCGGA -1 A K G I D Q T W I Y S A L G Y E T D N A L Q L A E

20851 CGGCTTCGTCCTTGAACGGGATCAGGCACACCACCGGGCCAAAGATTTCTTCCTGGGCAATGCGCATCTTGTTGT GCCGAAGCAGGAACTTGCCCTAGTCCGTGTGGTGGCCCGGTTTCTAAAGAAGGACCCGTTACGCGTAGAACAACA -1 A E D K F P I L C V V P G F I E E Q A I R M K N N

20926 TCACGTCGGCGAATACGGTGGGCTGGATGAACTGCCCCTTGGCCAGGTGCGCAGGCAGGTTGGCCGGGCGCTCCA AGTGCAGCCGCTTATGCCACCCGACCTACTTGACGGGGAACCGGTCCACGCGTCCGTCCAACCGGCCCGCGAGGT -1 V D A F V T P Q I F Q G K A L H A P L N A P R E L

21001 GGCCCCCGGCGACCAGGCGTGCACCTTCTTCGATGCCAATGCGGATGTACCCGGTGACCTTGTCATAGTGCTGCT CCGGGGGCCGCTGGTCCGCACGTGGAAGAAGCTACGGTTACGCCTACATGGGCCACTGGAACAGTATCACGACGA -1 G G A V L R A G E E I G I R I Y G T V K D Y H Q Q

21076 GGGTGATCATCGAACCGACCTGGGTTTTCGGGTCGGTCGGGTCACCTACGATCAGGCGCTTGGCGCGCGCCGCAA CCCACTAGTAGCTTGGCTGGACCCAAAAGCCCAGCCAGCCCAGTGGATGCTAGTCCGCGAACCGCGCGCGGCGTT -1 T I M S G V Q T K P D T P D G V I L R K A R A A F

21151 ACTCTGCGACAAACTGCGGGTACACGCTTTCCTGGATGAAGATGCGGCTGCCGGCGGTGCAGCGCTCGCCGTTCA TGAGACGCTGTTTGACGCCCATGTGCGAAAGGACCTACTTCTACGCCGACGGCCGCCACGTCGCGAGCGGCAAGT -1 E A V F Q P Y V S E Q I F I R S G A T C R E G N L

21226 GCGAGAAGATGGTGAACAGCGCGGCGTCCAGCGCACGCTCAAGGTCTGCGTCTTCGAAGATCAGCACGGGCGACT CGCTCTTCTACCACTTGTCGCGCCGCAGGTCGCGTGCGAGTTCCAGACGCAGAAGCTTCTAGTCGTGCCCGCTGA -1 S F I T F L A A D L A R E L D A D E F I L V P S K

21301 TGCCGCCCAGTTCCATCGAGTACTTTTTAAGGCCTGCGGTCTGCATGATCTTCTTGCCGGTGGCGGTACCGCCGG ACGGCGGGTCAAGGTAGCTCATGAAAAATTCCGGACGCCAGACGTACTAGAAGAACGGCCACCGCCATGGCGGCC -1 G G L E M S Y K K L G A T Q M I K K G T A T G G T

21376 TGAAGGAAATGGCGCGCACATCGGGGTGGCGGACCAGGGCATCGCCGGCGGTAGCGCCGTAACCCTGGATCACGT ACTTCCTTTACCGCGCGTGTAGCCCCACCGCCTGGTCCCGTAGCGGCCGCCATCGCGGCATTGGGACCTAGTGCA -1 F S I A R V D P H R V L A D G A T A G Y G Q I V N

21451 TCAGCACCCCGTTGGGGATGCCGGCTTCTACCGCCAGGCGGCCCAGTTCGTTGGCGGTCAGAGGCGACAGCTCGC AGTCGTGGGGCAACCCCTACGGCCGAAGATGGCGGTCCGCCGGGTCAAGCAACCGCCAGTCTCCGCTGTCGAGCG -1 L V G N P I G A E V A L R G L E N A T L P S L E S

21526 TCATCTTCAGCACGGCGGTGTTGCCCAGCGCCAGGCACGGCGCAGTCTTCCAGGTAGCCGTCATGAACGGCACGT AGTAGAAGTCGTGCCGCCACAACGGGTCGCGGTCCGTGCCGCGTCAGAAGGTCCATCGGCAGTACTTGCCGTGCA -1 M K L V A T N G L A L C P A T K W T A T M F P V N

21601 TCCATGGGCTTACCAGGCCGCACACACCCACCGGCTGGTACAGGGTGTAGTTGAGCATCTGGTCGTCGACCGGGT AGGTACCCGAATGGTCCGGCGTGTGTGGGTGGCCGACCATGTCCCACATCAACTCGTAGACCAGCAGCTGGCCCA -1 W P S V L G C V G V P Q Y L T Y N L M Q D D V P Y

21676 AGGTATGGCCGTCCATGCGCGTGCACACTTCGGCGAAGAAGTCGAAGTTGTGCGAGGCACGCGGGATCAGCACGT TCCATACCGGCAGGTACGCGCACGTGTGAAGCCGCTTCTTCAGCTTCAACACGCTCCGTGCGCCCTAGTCGTGCA -1 T H G D M R T C V E A F F D F N H S A R P I L V N

21751 TCTTGGTCTGGTGGATCGGCAGGCCGGTGTCGAGGGTTTCCAGCTCGGCGAGTTTCGGCACGTTCTGCTCAATCA AGAACCAGACCACCTAGCCGTCCGGCCACAGCTCCCAAAGGTCGAGCCGCTCAAAGCCGTGCAAGACGAGTTAGT -1 K T Q H I P L G T D L T E L E A L K P V N Q E I L

21826 GCTCACCCAGCTTGCGCATCAGCCGGGCACGTTCCTTGGCCGGGGTGTTGGCCCACTTGGGGAAGGCTTCCTTGG CGAGTGGGTCGAACGCGTAGTCGGCCCGTGCAAGGAACCGGCCCCACAACCGGGTGAACCCCTTCCGAAGGAACC -1 E G L K R M L R A R E K A P T N A W K P F A E K A

21901 CCGCAGCCACAGCCTGGGCCACTTCCTCGGCGCCGCCGCTGGCGACTTCGCAGATGGCGTCGCCGGTGGCCGGGT GGCGTCGGTGTCGGACCCGGTGAAGGAGCCGCGGCGGCGACCGCTGAAGCGTCTACCGCAGCGGCCACCGGCCCA -1 A A V A Q A V E E A G G S A V E C I A D G T A P N

21976 TGTAGTTGACGAAGGTGTCTTTGCTCTCGACCTCACGGCCGTTGATCCAGTGCTTGATCATGCTGCTCATGCCTT ACATCAACTGCTTCCACAGAAACGAGAGCTGGAGTGCCGGCAACTAGGTCACGAACTAGTACGACGAGTACGGAA -1 Y N V F T D K S E V E R G N I W H K I M

hpaE

- 2 & A K 22051 GTTGTTCTTGAAGAAGTCAGCTTCGCTGACGATACGGTTGACCAGGCGACCGACGCCTTCCACTTCCACCACCAC CAACAAGAACTTCTTCAGTCGAAGCGACTGCTATGCCAACTGGTCCGCTGGCTGCGGAAGGTGAAGGTGGTGGTG -2 N N K F F D A E S V I R N V L R G V G E V E V V V

22126 TTCGTCACCCGGCACCACATCGGCCAGGCCTTCTGGCGTGCCGGTGGCGATCATGTCGCCCGGTTGCAGGGTCAT AAGCAGTGGGCCGTGGTGTAGCCGGTCCGGAAGACCGCACGGCCACCGCTAGTACAGCGGGCCAACGTCCCAGTA -2 E D G P V V D A L G E P T G T A I M D G P Q L T M

22201 GAAGCTGGAGAAGTATTCGATGAGGTGCGGGATGTCGAAGATCATGTCCGCGGTGGTGCCTTCCTGCTTCAGCTC CTTCGACCTCTTCATAAGCTACTCCACGCCCTACAGCTTCTAGTACAGGCGCCACCACGGAAGGACGAAGTCGAG -2 F S S F Y E I L H P I D F I M D A T T G E Q K L E

22276 ACCGTTGATCCAGGTGCGCAGCTTCAGGTTGCTGACGTCTGGCACATCGGCCGCATCGACGATCCACGGGCCGAC TGGCAACTAGGTCCACGCGTCGAAGTCCAACGACTGCAGACCGTGTAGCCGGCGTAGCTGCTAGGTGCCCGGCTG -2 G N I W T R L K L N S V D P V D A A D V I W P G V

22351 CGGGGTGGTGGCATCGCGGTTTTTCACCCGCAGGTTGGGGCGGTAGTAGTTTTCCAGGTAGTCGCGGATGGCGTA GCCCCACCACCGTAGCGCCAAAAAGTGGGCGTCCAACCCCGCCATCATCAAAAGGTCCATCAGCGCCTACCGCAT -2 P T T A D R N K V R L N P R Y Y N E L Y D R I A Y

22426 GTCGTTGCACACGGTGTAGCCGGCAACGTAGGCCAGGGCGTCCTCACGCTTGACGTTCTTCGCCGCTTTGCCGAT CAGCAACGTGTGCCACATCGGCCGTTGCATCCGGTCCCGCAGGAGTGCGAACTGCAAGAAGCGGCGAAACGGCTA -2 D N C V T Y G A V Y A L A D E R K V N K A A K G I

22501 CACCGCCACCAGCTCGCACTCGTAGTGCATGTATTCGACGTTGTCCGGGCGCCAGGTGACCTGGATGTGGCCGGT GTGGCGGTGGTCGAGCGTGAGCATCACGTACATAAGCTGCAACAGGCCCGCGGTCCACTGGACCTACACCGGCCA -2 V A V L E C E Y H M Y E V N D P R W T V Q I H G T

22576 GTAGGTGCCTGGCGACTTGATGAAAGCCAACGGTTCGGTGGGCGGCGCGAAGGCCAGCTCCCTGGCGTGGTCGGC CATCCACGGACCGCTGAACTACTTTCGGTTGCCAAGCCACCCGCCGCGCTTCCGGTCGAGGGACCGCACCAGCCG -2 Y T G P S K I F A L P E T P P A F A L E R A H D A

22651 GTAGTTCAGGCCCAGGGCGAACATGCTGCCGGTGGCGGGTGGCAGCCAGGTGACCTGGTCCTGATGGACCAGGCG CATCAAGTCCGGGTCCCGCTTGTACGACGGCCACCGCCCACCGTCGGTCCACTGGACCAGGACTACCTGGTCCGC -2 Y N L G L A F M S G T A P P L W T V Q D Q H V L R

22726 GCCGTCGGCAAGGCGCAGGTGATCGTCTTCGACCGTGACATCGTGGGCCTGGCCGTCGAACTGGATACGGGCGTG CGGCAGCCGTTCCGCGTCCACTAGCAGAAGCTGGCACTGTAGCACCCGGACCGGCAGCTTGACCTATGCCCGCAC -2 G D A L R L H D D E V T V D H A Q G D F Q I R A H

22801 TTTCACAGGTAATTCCTCACTCGGCGACGATGTGGTTGGTCAGCTTGCCCAGGCCGTCGATCTCGATGTCGACGC AAAGTGTCCATTAAGGAGTGAGCCGCTGCTACACCAACCAGTCGAACGGGTCCGGCAGCTAGAGCTACAGCTGCG -1 & E A V I H N T L K G L G D I E I D V R -2 K V

hpaG2

22876 GGTCACCTGGCTGTACATCGACGCGGCCCTCGGGGGTTCCGGTGATCAGGATGTCGCCGGCGTGCAGGGTCATGA CCAGTGGACCGACATGTAGCTGCGCCGGGAGCCCCCAAGGCCACTAGTCCTACAGCGGCCGCACGTCCCAGTACT -1 D G P Q V D V R G E P T G T I L I D G A H L T M F

22951 ACTCGCTGATTTCGGCAATCAGCTGCGCCACCGTGCGTACGCAGTTGGCGGTGTTGTTGTGCTGGCGCAGTTCGC TGAGCGACTAAAGCCGTTAGTCGACGCGGTGGCACGCATGCGTCAACCGCCACAACAACACGACCGCGTCAAGCG -1 E S I E A I L Q A V T R V C N A T N N H Q R L E G

23026 CGTTCACATACAGGCGCAGGCCCAGGGCATCGGGGTTGGCCACTTGGCTGGCGGGCACCAGTTCAGGGCCGACCG GCAAGTGTATGTCCGCGTCCGGGTCCCGTAGCCCCAACCGGTGAACCGACCGCCCGTGGTCAAGTCCCGGCTGGC -1 N V Y L R L G L A D P N A V Q S A P V L E P G V P

23101 GGCAAAAACCATCACGGCACTTGGCCTTGACTGCAGGGCGGTAGTAGCTGGCTTCGGGCAGGCTCACTTCGTTGA CCGTTTTTGGTAGTGCCGTGAACCGGAACTGACGTCCCGCCATCATCGACCGAAGCCCGTCCGAGTGAAGCAACT -1 C F G D R C K A K V A P R Y Y S A E P L S V E N V

23176 CGATGGTGTAGCCCGCCACATGCTCCAGGGCATCGGCCACGCTGACGCGGCTGGCGTCCTTGCCAATCACCACTC GCTACCACATCGGGCGGTGTACGAGGTCCCGTAGCCGGTGCGACTGCGCCGACCGCAGGAACGGTTAGTGGTGAG -1 I T Y G A V H E L A D A V S V R S A D K G I V V G

23251 CCAGCGCCGGGCCGGGTTGCACGCGCTGCACGCCGGCCGGGAATACCACCTGGCCTTCATGCTGGTTGCGGGTGT GGTCGCGGCCCGGCCCAACGTGCGCGACGTGCGGCCGGCCCTTATGGTGGACCGGAAGTACGACCAACGCCCACA -1 L A P G P Q V R Q V G A P F V V Q G E H Q N R T N

23326 TCGGGGTCTTGACGAACAACACCGGCTTGACCGGCAGTTGCTTGTACGGTGCTTCCACGAACGCCGCTTGGTGCT AGCCCCAGAACTGCTTGTTGTGGCCGAACTGGCCGTCAACGAACATGCCACGAAGGTGCTTGCGGCGAACCACGA

- 1 P T K V F L V P K V P L Q K Y P A E V F A A Q H Q

23401 GCTGCAGCAAACCCTGGTAGTTCAGCGCGACGCCGAACAGGGTGCCGCTGGCAACGTCAAGCAGGGCATGGCTCA CGACGTCGTTTGGGACCATCAAGTCGCGCTGCGGCTTGTCCCACGGCGACCGTTGCAGTTCGTCCCGTACCGAGT -1 Q L L G Q Y N L A V G F L T G S A V D L L A H S M

hpaG1

23476 TGCTCTTCTCCTGGCAGTGCAGGGCGGTGGCCGTCCTGCGGATTTCGTTAATGTGTTAATGTTATAGTTAATATG ACGAGAAGAGGACCGTCACGTCCCGCCACCGGCAGGACGCCTAAAGCAATTACACAATTACAATATCAATTATAC

23551 TTAACGATGGTCAAGGGGTGGCCAGTGGCGCCTGCCGGCAAGGCAAGGCACCATGGGCCATCGTCAACAGGGTCA AATTGCTACCAGTTCCCCACCGGTCACCGCGGACGGCCGTTCCGTTCCGTGGTACCCGGTAGCAGTTGTCCCAGT

hpaA

+2 M S D R H P I P N I N I G Q V Y D Q 23626 AGCGATTTGCGAGCAAGCAGCCATGAGCGACCGGCATCCGATACCGAACATCAACATTGGCCAGGTTTACGACCA TCGCTAAACGCTCGTTCGTCGGTACTCGCTGGCCGTAGGCTATGGCTTGTAGTTGTAACCGGTCCAAATGCTGGT

+2 R Y S D S E V H Y D R L G N L A G F F G R N M P V 23701 GCGCTACAGCGACAGCGAGGTGCATTACGACCGGCTGGGCAACCTGGCGGGCTTTTTCGGGCGCAACATGCCGGT CGCGATGTCGCTGTCGCTCCACGTAATGCTGGCCGACCCGTTGGACCGCCCGAAAAAGCCCGCGTTGTACGGCCA

+2 H R H D R F F Q V H Y V K S G T V R V Y L D D Q Q23776 GCACCGGCATGACCGGTTTTTCCAGGTGCATTACGTGAAGTCGGGCACAGTACGGGTGTATCTGGATGACCAGCA

+2 Y I E A G P M F F L T P P T V A H A F V T E A D S 23851 GTACATCGAGGCCGGGCCGATGTTCTTCCTCACGCCACCCACGGTGGCGCACGCGTTCGTCACCGAAGCTGACAG CATGTAGCTCCGGCCCGGCTACAAGAAGGAGTGCGGTGGGTGCCACCGCGTGCGCAAGCAGTGGCTTCGACTGTC

+2 D G H V L T V R Q Q L V W Q L I E A D A S L L P A 23926 CGACGGGCATGTGCTGACGGTGCGCCAGCAACTGGTGTGGCAATTGATCGAAGCCGACGCCAGCCTGCTGCCGGC GCTGCCCGTACACGACTGCCACGCGGTCGTTGACCACACCGTTAACTAGCTTCGGCTGCGGTCGGACGACGGCCG

+2 G M Q V Q P A C V A L G N L P A E Y K A E A Q R L 24001 GGGCATGCAGGTGCAGCCAGCCTGTGTGGCGCTGGGCAACCTGCCGGCCGAATACAAGGCCGAGGCGCAGCGCCT CCCGTACGTCCACGTCGGTCGGACACACCGCGACCCGTTGGACGGCCGGCTTATGTTCCGGCTCCGCGTCGCGGA

+2 Q G W L D A L S D E F A T Q Q P G R E A A L Q S L 24076 GCAAGGCTGGCTGGACGCGTTGAGTGACGAGTTTGCCACGCAGCAACCGGGTCGCGAGGCGGCGTTGCAGTCGCT CGTTCCGACCGACCTGCGCAACTCACTGCTCAAACGGTGCGTCGTTGGCCCAGCGCTCCGCCGCAACGTCAGCGA

+2 T R L I M I S L L R L C P N S L E S T P A R H E D 24151 GACCCGCCTGATCATGATCAGCCTGCTGCGGCTGTGCCCCAACTCGCTGGAATCGACCCCGGCGCGGCATGAAGA CTGGGCGGACTAGTACTAGTCGGACGACGCCGACACGGGGTTGAGCGACCTTAGCTGGGGCCGCGCCGTACTTCT

+2 L K I F H R F N A L I E A H Y L E H W P L A R Y A 24226 CCTGAAGATCTTCCACCGTTTCAATGCCCTGATCGAAGCGCATTACCTTGAGCATTGGCCGCTGGCCCGCTACGC GGACTTCTAGAAGGTGGCAAAGTTACGGGACTAGCTTCGCGTAATGGAACTCGTAACCGGCGACCGGGCGATGCG

+2 Q Q I G V T E A R L N D V C R R I A D L P S K R L 24301 GCAGCAGATTGGCGTGACCGAGGCACGGCTGAACGATGTGTGCCGGCGCATCGCCGACTTGCCATCCAAGCGCCT CGTCGTCTAACCGCACTGGCTCCGTGCCGACTTGCTACACACGGCCGCGTAGCGGCTGAACGGTAGGTTCGCGGA

+2 V L E R L M Q E A K R L L L F S G S T A N E I C Y 24376 GGTGCTGGAACGGCTGATGCAGGAGGCCAAGCGTTTGCTGTTGTTTTCCGGCAGCACGGCCAACGAAATCTGTTA CCACGACCTTGCCGACTACGTCCTCCGGTTCGCAAACGACAACAAAAGGCCGTCGTGCCGGTTGCTTTAGACAAT

+2 Q L G F K D P A Y F S R F F N R Y A K L T P G E Y24451 CCAGCTCGGCTTCAAGGATCCGGCCTATTTCAGCCGCTTCTTCAACCGCTACGCCAAGCTCACACCCGGGGAGTA GGTCGAGCCGAAGTTCCTAGGCCGGATAAAGTCGGCGAAGAAGTTGGCGATGCGGTTCGAGTGTGGGCCCCTCAT

+2 R Q R Q A E L Q &24526 CCGCCAGCGGCAGGCAGAATTGCAGTGAAATGGCCATGGCGGCTCACCCGGGTGCTGTTGTTGTTTACAGCGGAT GGCGGTCGCCGTCCGTCTTAACGTCACTTTACCGGTACCGCCGAGTGGGCCCACGACAACAACAAATGTCGCCTA

24601 GGTCGCAGCCCGCGCGCCGGGCTTGAATGGGTTTTCCGTGGAACAGATTGCACTTTCCATCGTGCATGCCCTTAA CCAGCGTCGGGCGCGCGGCCCGAACTTACCCAAAAGGCACCTTGTCTAACGTGAAAGGTAGCACGTACGGGAATT

hpaR

+2 M T K T Q P S L T L S L L Q24676 ATTCGTGAATTGAGAAAAAGCCACAGGTTTGACCATGACCAAGACGCAACCTTCGCTCACGCTAAGCCTGTTGCA TAAGCACTTAACTCTTTTTCGGTGTCCAAACTGGTACTGGTTCTGCGTTGGAAGCGAGTGCGATTCGGACAACGT +2 A R E A A M A F F R P L L N Q H D L T E Q Q W R V24751 GGCCCGAGAAGCCGCGATGGCATTTTTCAGGCCGCTGTTGAACCAGCACGACCTGACCGAGCAGCAATGGCGGGT CCGGGCTCTTCGGCGCTACCGTAAAAAGTCCGGCGACAACTTGGTCGTGCTGGACTGGCTCGTCGTTACCGCCCA

+2 I R I L K Q H G E L E N Y Q L A E L A C I L K P S24826 AATCCGCATCCTCAAGCAGCACGGCGAGCTGGAGAATTATCAGTTGGCGGAACTGGCCTGCATCCTCAAGCCGAG TTAGGCGTAGGAGTTCGTCGTGCCGCTCGACCTCTTAATAGTCAACCGCCTTGACCGGACGTAGGAGTTCGGCTC

+2 M T G V L G R L E R D G L V R R Q K A A Q D Q R R24901 CATGACCGGGGTACTGGGGCGCCTGGAGCGAGACGGGCTGGTGCGGCGGCAGAAGGCCGCGCAGGACCAGCGACG GTACTGGCCCCATGACCCCGCGGACCTCGCTCTGCCCGACCACGCCGCCGTCTTCCGGCGCGTCCTGGTCGCTGC

+2 V F V S L T E R G E A C F A S M K E G M E A N Y Q24976 GGTGTTCGTCAGCCTGACCGAAAGAGGGGAGGCGTGCTTTGCCTCGATGAAGGAAGGCATGGAGGCCAACTACCA CCACAAGCAGTCGGACTGGCTTTCTCCCCTCCGCACGAAACGGAGCTACTTCCTTCCGTACCTCCGGTTGATGGT

+2 K I Q A Q F G E E K L Q Q L M G L L N D L K R I A25051 GAAGATTCAGGCGCAGTTTGGTGAAGAGAAGCTGCAGCAGCTGATGGGGTTGTTGAATGACCTGAAGCGCATCGC CTTCTAAGTCCGCGTCAAACCACTTCTCTTCGACGTCGTCGACTACCCCAACAACTTACTGGACTTCGCGTAGCG

+2 P # 25126 GCCATAA CGGTATT

P. putida silvestre Mutante A2

mM mM

5

Mutante A0

mM

4 3 2 1

mM Mutante A7 mM tynR::pK18mob

5

Figura 10

Figura 11

Figura 12

Figura 13

Figura 14 Figura 15

Figura 16

LISTADO DE SECUENCIAS

<110> BIOGES STARTERS S.A.

<120> NUEVA HIDROXIFENILACETALDEHÍDO DESHIDROGENASA, ÁCIDO NUCLEICO QUE LA CODIFICA Y VECTORES Y MICROORGANISMOS RECOMBINANTES QUELA EXPRESAN

<130> P-101091

<160> 45

<170> PatentIn version 3.3

<210> 1

<211> 25132

<212> DNA

<213> Pseudomonas putida U

<220>

<221> misc_feature

<222> (1) .. (25132)

<223> Secuencia que contiene los cluster tyn y hpa

<400> 1 tcaggcgaaa cgctcgaagc ggtacggtga cgggtcgatc agcggggtgg cctgggccac 60 caggtctgcc gccagctggc cagcagcagg cgaggtgccg aagccatgcc cggaaaagcc 120 ggtggccagg gtcaggcccg gaatactggc caccgggccg atgaccgggt tggagtcggg 180 ggtgacgtca atcgtgccgg cccaggcgct ggcgatacgg gcctgttcga acaccggcca 240 ggccgctttc aggttgcgca tggcctcgtc gttgagggcc gggttggcgt gcgggtcttg 300 tacccgtaca cgctcgaagg gggttacatc cgttgccttc cagcgccggg ccagggccag 360 gtccttgaag aagtacttgc caaagctgat gcgcaaaaag tcccgctggg cacgcagctg 420 gggcaggtaa cgcttgccca gcagcaggtg atcgagggtg aggaaggcgt ccagcgcgcc 480 gcgctgggtg atgatgtagc cgccgtcctt gtgcttgcgg aaggaaaaat ctggtgcgcc 540 cacggcgatg tcggttggcc cgtccatggg ctctgtgcgc agcacggaac aggtcagcgg 600 caaggtcggc aggttgatgc ccaggttgcc gaggaacttg cgcgaccaca ggccaccggc 660 cagcaacacc tggtcgcagc ggatttcacc ttgctcggtg accaccccgc tgacacggcc 720 ggctgcggtg accagcgtgc gcaccgcgca gttctccact accactgcac ctttggcgat 780 cgccgcccgg gcgatggcgc tggcggccag ggtcggttcg gcgcgggcgt cggagggggt 840 gaagatgcca cctgcccaat ccgcccgacc acccggcacc atccgggtga tttcccgcgt 900 gctcagcagg cgcgaatcca ggcccagcgc ctcgacgctt ttcagccagc cttcatgcat 960 gcccatctgc gtgtcgttac ggccgatgaa catgatgccg gcttgccgat agccaacgtc 1020 gctgccaacc cgtgcgggca tctcggccca cagccgatca gccgccagtg ccaggggaat 1080 gtcatgggcg tggcggttgg tcttgcgcac ccagcccagg ttgcgcgacg actgctcccc 1140 agcgatgcgc cccttctcca gcaccaccac cggtatgttg cgttcggcga ggctcagtgc 1200 ggcggtgagg ccgataatgc cgccaccgat gatcaccacg gtagtggcgt cggggtggcg 1260 ggtgctggtt tgcacagggg cgatcgtggg agacatggct ttactctttg ttgtgcgtgc 1320 agggggagtg ttcagcgcca gccagcagcc tcactggcca aggcggatca gggtcacttg 1380 cgcttgcccc gcaccgcggt aggcggtgac ctccagctcg accttgtaaa cggtggagcc 1440 cagcggcggg caggtgaccg tggtggccgg gtcgatgccg cggaacttct cgccgatcac 1500 gtccatgacc cgtggtacat cggcagggtc ctggatgaac acgcgcgagt tgatgacatc 1560 ggccaggctg gcatcgactg cggccagcgc ggtttcgatg ttggcgaaca cctggtgggt 1620 ctgttcgatg acgtcctctg gaatgacctg ggtctgcggg ttgcgtccgg cggtgttgga 1680 gacgtgaatc cagttgtcca ccgccaccag gcgggagtag ctggccatgg cttcgaactt 1740 ggagccggtt ttcagtttga tgatctgtgt catgggcttt gccttgttat ccggttgcgg 1800 ggatcagctg agaacggggg tttcccagag gttgagcttt acgccgatgc cttgctcgag 1860 cgccttgcgg tacaccacgg tgccccaggc cacgtcttcg acgggcatgc cgcccaccga 1920 catcaggatg atttcgtcgt catgcaggcg gcccggtgcg tcgccgctga tgatcttgcc 1980 gatgtcttcc acctgctcgg cggccagcgt gccttcggca atcatgtcca tgaagcgcac 2040 acctaccagc ggtacgtggt tgtgcgcagg cttgggcagc tcttcgaacc aggcctcgta 2100 gaggccggtg ttgtccacca ccttgcgcac gtcgtcctgc tccatgccgg cgtcgatact 2160 gcacggggct ggcatggcca ggaacgcgcc aggcttgacc cactcgcggc gcaccagcgg 2220 gtactggctg gggtcgccga cttcgcccga gctgcagtag ctgaccaggt cggaaccgcg 2280 taccacttct tccagggttt ccaccacctg gacatgagtg atttgcggga agctggtttt 2340 cacccaggcg acgaaggcat ccaggttctt ctggccacgg cccttgacct tgagggtgtc 2400 gatcagcggg cagacggcca tgaacgcagc gaccgtggtc ttgcccatca cccccgggcc 2460 ggccaggccg atcaccttgg cgtccttgcg cgccaggtgg cgggcgccga cgcccgggat 2520 ggcgccggtg cggtaggccg acagcaggtt ggccgacatg tgtgccagtg gcgcgccggt 2580 gtcggcatcg ttgagggtga acatcaggat cgagcggggc aggcctttct cacggttggc 2640 gatgttcgag ccgtaccact tggcgcctgc ggtctggaag ttgccgccga ggtacgccgg 2700 catcgccatc atgcgccggt cggcggtggg cttgggcatg ttggggaatg gcgagtgctc 2760 ggggaaggta atcatcgcgc cgtgcgagtc gctgttcggg ccggccatgc ggtagtcacc 2820 ctggtacagc aggccgaaca tttcttccat ggtgtcgaca caggccggca tgtcggtgac 2880 gccggcacgg atcatgtcct gctcggacag gtagatgaag tcaattctgg tatcgagggt 2940 catggcgggt ctcgcagggc tggctgccgt cggatttgtt gttggtttcg aggcaaccag 3000 tttcgctaac gactggtagg tcgtcttgtg tctgcctgcc agccgagttg accgtcagtg 3060 ccagggcttc aatggcccgc gagcgagaag ctggccgggg tgtggcgcag gctgagggcg 3120 gtcagcaggc acaccaccag ggtgcacagg gccagcagcg cggcccatgc ggtcgggccg 3180 tggttgagta ccactgcggc cagcggggcg gcgccggcag acgccgacag ctggatggcg 3240 cccagcagcg ctgcggtgga acccagtgcc ttttcttgcg aggccatcac cagcgacatc 3300 agcgtcgact cggctatccc caggccgaac agggctatca ccatgccgcc ggccacacct 3360 ggcagcccca ggccggtcag tgcaccgagc aggctgatgc aggcaccgcc ggccatgcac 3420 agcacgccca cccgagtcaa ggtattgagg cccagccggc tgatcaggtg gctggccgtc 3480 atggcgccga gcaggatcga caccccggtg gcgccaaaca gcaggccgaa ggcctgggcg 3540 ctcaggccgt agtgggcctg gtacaccagg gtggcaccgc cgatgtaggc gaacaggaag 3600 aagaataccg cagcaaccgc cagggtcggg cgcaggaagc ggcggtcggc gaggatggcc 3660 aggtaggtgc tgcaggcgtg gcccaggcgc aggggttcgc gtttgctggg cggcagggtt 3720 tcgggcaggt tcagcaggct gttgaccagc accgtcacgc ccatgccggc gagtaccagc 3780 attactgcac gccagccgaa atgtgcgtcg atcacgccgc ccagggcagg tgccaggatc 3840 ggtgcgacgc cttcgatggt catcagcagg gcgaacagtt tggtcgcggc cacgccctgg 3900 ctcacatcac gcaccatgct catgatcacc accagggtca gcgcactgcc caggccctgg 3960 aaaaagcgca gcatgatcag ggtgtcgagg ctgggggctg cggctgcgcc cagcgagcac 4020 aggatgaaca gcagcaggcc ggccagcagc ggcttgcgcc ggccataagc gtcgacgatg 4080 gggccgaaga tcagctggcc ggcgcccatg gccagcagga agaaggtcag tgtcagctgt 4140 acgcgggtga agctagcctg atagtggctg gcgatttccg gcaggctcga caggtacatg 4200 tcgacggcgg aagggccgag ggcgccgatc aggcctaggc ccagggcgaa gctgaagggt 4260 atgggagggg agggattggc ttgcatggtt ttctctggct gatttttcgc ctaccgaccg 4320 gtaggtttgc gaatattatt cgccgagtcg gccaaggtca aacccttccg caaggccact 4380 gattcctgtg gggagcgggc atgcccgcga acaccggcaa agccggtgcc accgagtcgc 4440 cttcttcgcg ggcatgcccg ctcccacatt gaccgcagag gttggttacc gtggttgcgt 4500 cagaacggca cagccacggt cagctggcta tacacattgg taccattccc gcccacctgg 4560 ttgccgccgt tgctctcgtc cttgcgcggc tggtaaaggc ccaccagcgg gctgattatc 4620 aggtgctcgt tgactgccca ttccacatac aggtccagct cccgcgcatc gaggttgagg 4680 ctttcgcggg tgcgtacggt gtcgaagtcg aagtacagcg ccccgactgt gagattttcc 4740 agcggtgtcg ccttcacgcc cacatggtgg atacccgtgt tgctgttgaa ggggccggcg 4800 tagttggcag cgacttcacc ctggaaccag gtgccgtaac cgctggacag gccgctgaac 4860 agcgcgtccc agcctgccga gtagcgggtg tagcggtagg taacctgcgg tgcccacggc 4920 aggtcggcga aggtgtagcc ggcctgcagg taccaggctt gctcggggcc gtcggtcttg 4980 tcctgccagg cgtattcgaa ggcgaaactg gcattgtcga tgccagcgtt gccttcgccg 5040 cgcacgctat acacgtccat gccttcgcgg gctttctgaa agtcgctggc ccattggtcg 5100 gtgacgtcga tgccgtgaat ccaggtcagc ccgagggtgc ccaaggcttg ggtgtagtcc 5160 agcgtgccgg cggccagttc ggtttcggcc tgggcgcggt tgtcggattt cagccacagc 5220 aggctgccat gcaggccatc gctgcccccc aggcgcagca ttgcggtgcg gtcgaaggcg 5280 tggcgggcgg ccaggtagta ggccccgccg cggtccagcg caccgtcggc gacgccgttg 5340 cccaggttcg ggccgtcgtc gttgatcaaa aaaccactgc ccaggcgaat ggtctggcgg 5400 ccggcggaaa cgtccactcc atccttgccc agcaccggga acaggtcggc cgagcgccag 5460 ccgaggaagg cgtcttcgat cttggtggtg cgttcggagc catcggtgtt gccggccgca 5520 tcgccatcgc cccaggtggc cgagctcacc cagttcaggc tgccgtacag cgtgccgttg 5580 ccggccaggc cctggtcacc gctgaggcca tacttgataa agccttcacg ccaggtcgaa 5640 ccccctgtgg tgccgtcgta gttcttgcgg ctgttgaaca tgccccatac cgccagcatg 5700 tcggcgttca ggtggctgtc atcgtcggcg tacagctcaa cggccggcgc ggcctggctg 5760 gccagcaagg ttgccagggc caggctggac agcgtctgtg gtttgaccat ttgcacatcc 5820 ctcgtttgtt ctcggccacc ttcacagggg cctttgttgt tcgggggcac cctcggttct 5880 ggcgaggggc catcgcggtt ggcggcgatg gcctattagg gcgtgtgcgg tggggcgggg 5940 tcttgttcgt ggctgccaag gcgcttgcac gccttggcca caggcgcggt cagtagcgga 6000 tcatcaccga cttgagctcg gtgaagtcat cgatgaaggc cgagccgaac tcgcggccaa 6060 tgccggaagc cttgatgccc ccaaacggta cagccgggtc gagcagggtg tgcatgttga 6120 cccacagggt accggcctgg atttgcggga tcatgcgcat ggccttgccc aggtcgttgg 6180 tccacaggct ggcgctgagg ccgtagggcg aggcgttcat caggtgcagc agttcgtctt 6240 cgtcgtcata aggcaggaag gtcgccacag ggccgaaggt ttcctgggtg agcagggtgt 6300 cgcaggctga ccgggcgagg attaccgtgg gttcgacgaa acagccgggg ccgtcgccca 6360 gggtgccgcc gtgaatgatc tggctgcctt cggcgcgggc gatggcgaac agttcggcca 6420 gcttctgctg gtgcggcttg ttggccacgg ggccgaactg ggtggcctcg tccagtggcg 6480 agccgatttt cagttggccc aggcgctggg acagggcgtc cagcagcggg tcgatgcgcg 6540 agcggtgcac atagaagcgc tcgcccgcgg cgcagatttg ccccgagtgc aggaagccgg 6600 cctcgatgat gccgtccaca gccttgtcgg ttgccacgtc gggcaggaag gccaccgcgt 6660 tcttgccgcc cagttccagt gtcgcacggg tcagcttggc gcccatggca gcctggccta 6720 cggcgatgcc agtgggcacg gagccggtga acgagacctt gtcggtacct gcgtgctcga 6780 tcagtgcctt gcccaccagg ccaccaccgg tcagcacgtt cagtgcaccg gccggcaggc 6840 ctgcttcggt ggccagttcg gcaatgcgca gcagcgtcag cggggtgaat tcgctgggct 6900 tgaggataat gctgcagccg gttgtcaggg ccgaggccag cttccagatg gcgatcatgc 6960 tggcgaagtt ccacggcacg atgcccacca ccacgccaat cggctcgcgc agggtgaagg 7020 cgctgtagcg ctcaccggcg aacgagggca gcgacggggt gatggtctgg ccggtgatct 7080 tggtcgccca gccggcgtag tagcgcagga agtgcgcggc ctgctgtact tcgaacgcac 7140 gggaaatgcc gatgagcttg ccggattgca aggtttccag ctgcgccagt tcttcgcggt 7200 tggcttccag caggtcggcc agcttgaaca gcactgcggc gcgggcggcg gggctggtgt 7260 gcgaccaggc ggtaaagcct tggcgcgagg agctgacggc atggtcgaca tcggcctggt 7320 tggcgtcggc gatgtgggcg atggtctggc cgttggccgg gttgaccacg gcaatgttcg 7380 acgacgactg gctggcgagg tgctggccgt ggatgaacac gccatgctcg cgggccagga 7440 aggccgtgac ggcaggtagg agggtgatgt cgctcatgca gactccgggg cagttggcca 7500 aagtttgcag cttaataagc ggggcagtgc ggtgcttgtg cctgcgtgac aggtgcatga 7560 ctgtggctgc caaccgcact gggtaagcct tgtgggagcg gccttgtgtc gcgatagggc 7620 cgcagagcgg ccccggcgat gttggcggcg aagctgaaaa tgctggggcc gcttcgcgcc 7680 cctatcgcga cgcaaggccg ctcccacaaa aaaagcgagc gtaggccggg ctgattgctg 7740 gcaggcagca acaagcccgg cggcagccat cggcaagacg ccatgccacc ggcagcgcac 7800 agtaatcact cgttcaacgc cacaaaaaca agccggggca tacgatgtca ctcaataaca 7860 agctcaccga gcacctcaac cgcggcactg tcggtttccc caccgcactg gccagcactg 7920 tcgggctgat catggccagc ccggtgatcc tcaccgcgac catgggcttt ggcatcggcg 7980 gcagcgcctt cgccgtggcc atggtcatcg ccgcactgat gatgctggcg cagtccacca 8040 cctttgccga ggctgcgtcg atcctgccga ccacgggctc ggtatacgac tacatcaact 8100 gtggcatggg ccgtttcttc gccattaccg gcacgctgtc ggcctacctg atcgtgcatg 8160 tgttcgccgg taccgccgaa accatcctgt cgggggtgat ggcgctggtg aacttcgagc 8220 acctcaatac cctggcggaa tccgccggcg gttcgtggct gctgggggtg tgcttcgtgg 8280 tggcgtttgc ggtgctcaat gcctttggcg tcagcgcctt cagccgcgcg gaagtggtcc 8340 tcaccttcgg catgtggacc accttgatgg tgttcggcgt gcttggcctg atcgccgcac 8400 ccgcagtgga actggacggc ccgttcggcg tgtcgctggt gggcaccgac ctgatgacca 8460 tcctctcgct ggtcggcatg gccatgttca tgttcgttgg ctgcgagttc gtcacgccgc 8520 ttgcccccga actgcgtcgc tcggcctggg tgctgccgcg ggccatggcg ctgggcctgt 8580 ttggcgtggc cagctgcatg ttcatctacg gagcggcgat gaagcgccag gtggaaaacg 8640 tggtgctgga tgccgccagt ggcgtgcacc tgctggacac gcccatggcc atcccgcgct 8700 tcgccgagca ggtgatgggt gatattggcc cagtgtggct gggtatcggc ttcctgttcg 8760 ccggcgcggc caccatcaac acgctgatgg ccggtgtgcc acgcattctt tacggcatgg 8820 cggtggacgg cgcgttgccc aaggtgttca cctacctgca cccgcgcttc aagacgccgc 8880 tgctgtgcat cctggtggtg gcgttgatcc cttgcctgca tgcctggtac ctgggcggca 8940 acccggacaa catcctgcac ctggtgctgg ccgccgtgtg cgcctggagc accgcctacc 9000 tgctggtgac cctgtcggtg gtgatattgc gcatccgccg cccagacctg ccgcgtgcct 9060 accgctcgcc gctgttcccg ttgccgcaga tattctccag tagcggtatc ctcatcggca 9120 tggcgttcat cacaccgccg ggcatgaacc ctgccgatgt ctacgtgccg ttcgccatca 9180 tgcttggcgc cactgcggcc tatgcattgt tctggacgct gtgggtgcag aaggtcaacc 9240 cgttcaagcc ggcgcgggtc gaggatgtgc tcgagaaaga gtttgctgcc gagcctggcc 9300 acgccgtgga gcacgtgctg catgatcaga aatttgcgtg aacgcttgct ggcgccccga 9360 gcgccttcag gctatcgccc aggcgccacg ctggcatgcc tggcgcgcaa cctggggcag 9420 cagaacctgg tggcggccgg ggtgatccac gacccggccc agggttggca ggccacggtg 9480 cacgaacgcg tcgaggccca cctgctgatg cacatcgtca cctgtgagtt ccagctgcag 9540 ttgcctgctc cgcaaggggg cgaggtcagc ctggagctgc gccataccgg tgcgcttcgc 9600 cgtgccggcc tggcctgtgt gtaccgcaag ggcgaccggg cgcgcttcgc ccgactgcgc 9660 gaccggttgc tgcagcaggc cgcactggtg gcggcgctga tgccgctgga tttcaagcgc 9720 ctgaccttgg cctggcgcga cggccaatgg ttgctgaccc tggagcacat gggcggtagc 9780 gaagtggtca accgcatgcc agcgtttcgc cgctacatcc ccatcagccc gcaacagcgg 9840 gcgcacctga tggccagcct ggcccagttc aacactttgc tacctaacct ttgacgcaaa 9900 ctggcatacg ccttgctgta tcaagcgacg aatgatgaca gttgtgcgca catagataac 9960 atgttaacaa tgtgcgcata acaacaaatc ctgcgtcgag ggcagccatg catactcaac 10020 aatccaaccg tcaggggctg gaacgctgga ccacggccat gcaacagatc tgtggccgtt 10080 tcgagacgga acttgcgtcc aatcactcgc tgttcatcgg cgaggtttct accttttccc 10140 gtgccggctt gccgctggcc aacctgcgca ccaatgccgg caacatccgc cggctgggcg 10200 aaaacccgac ccttgacgat gaccagcatt gtttcctggt cagccagcgt gcggggcatt 10260 ccaccgtgtc ccaggggggc atgcaggtca gcctggcgcc gggtgagctg ctgctgatgg 10320 attcggtcgg gcgctgcgaa atcaccccca gtgggttgat cgaacatgtc tcgctggccc 10380 tgtcgcgtga gcaggtacgc aagtatgtgc aaggcagcgg cccgatgttt ggcaagatct 10440 cctcgagcaa cgcctgcggg cgcatgctgc atgtgctgat ggaccaactg tgcaaggacg 10500 gcaatgtaag cggtgatggg gcccagggcg acgcgctgca gaccgccttc attgccctgc 10560 tggagccagg cttcgagcgc catggcgaag cgctgggcaa ccttggggcc ttgaacgggg 10620 ccaacctgcg gggctacgtg cagcaggtga tcgacgagtc cctgtcacag cccgggctga 10680 ccccgtccaa cctggccggt cgcctgaaca tctcggtgcg tcacctgtac cggctgttcg 10740 aggaggaggg cgatagtgtg tgccgctaca ttcagcgggc gcgcctgaag cgcagtgcgg 10800 atgacctggc caacccgttc ttcaggagcg agtcgattac ctcgattgcc tacaagtggg 10860 ggtttaccga ctcggcgcat ttcagccgct cgttcaagaa acagttcgaa cgctcgccca 10920 aggactaccg ggcgcaggcg atggtttgag tgtgatggtg ctgcttgtgc gggcctcatc 10980 gccggcaagt cacttggcgg cggttcagcg acggccgttg aagtagcccg acagctggtg 11040 cacggtcttg ccggcagtga gcagcagcgg gcggaaatgg tccttgccga ggatgcgcgc 11100 atgcttgacc gagctgacca ggtcatagcg cttcgatccc tcctgcatac cctcggcgag 11160 tatcttgcaa atgatgtggc tgggcgtgac gccaaagccg gagtagccct gcacatagaa 11220 agcgttgggg cggttgtcga gggtgcctat ctgcggaaac aggttggcac tggtggccat 11280 cgggccgccc caggccaggt cgatgcgcac gtctttcagg taggggaaaa tcttcagcat 11340 cagcgcgcgg ttccacgcct tcaggtccag cgggaagtgc tcgacgaagg gcgtggcggc 11400 gccaaacagc aggcggttct cgcgggtgac ccggtagtag tcgatcaccg ggcggatgtc 11460 gctgtaggcc ccgcgtatcg ggctgatgcg ctcgatcagc tcatccggca atggctcggt 11520 catcatctgg aaggcatagg tgtttatagt gcgtgcgtgc agctgcggct ccagcttgtt 11580 gaggaagctg tcgcacgccc acagcagctt gctggcgcgt accgagccac ggccggtgcg 11640 taccgtgatg cgctcgccgt aggtcacttc cagggccggg ctgtgttcga agatgcgcgc 11700 accatggccc accagtgcct gcgcttcgcc cagcagcagg ttcagggaat gcacatggcc 11760 accgcccatg tgcatcaggg cgctgctgta ggcgttgctg ccgatgatct ggcgcacttc 11820 gctgccaccg agaaaacgga tctcgtcgcg ggtattgatc gccttgaacg ccttctccca 11880 tttgcgcagg gtctgttcct ggcggcggtt gaagcccatg tagccatagc cgtggcagaa 11940 gtcggcgtcg atggcgtagc gggcgatgcg gtccttgatg atgccggcgc ccagttcgct 12000 gatttcgaaa atatccctca cgccctgatc accgacgctg ctgcggatct tctccaggtc 12060 gtggccgatg cccgccatga tctgcccgcc gttgcgcccg ctaccgccgt agcccagata 12120 acggccctcg agcacgacga tattggtcac gccttgttcc gccagctcca gggcggtgtt 12180 aatgccggag aaaccgccac cgatcaccac gacatcggcc tcgatgtcgc gttccagggt 12240 tgggaagctc aggttgtact tcttggtcgc cgagtagtag gtggggctct cgagggtgat 12300 catgacgccg cctgctgact ggaaatgggt agaaatcatt ctattaatgt attaatgatt 12360 gtgcactggc atactcgccg gtttgctatt tccagcctcc ttgagcccgc atgaccacac 12420 cgagaccctc cctgaccctg accttgctgc aggcgcgcga agccaccatg gcgttcttcc 12480 gcccggcgct gaatgcccat gacctgaccg agcagcaatg gcgggtaatc cgtatcctgc 12540 gccagcaagg cgagctggaa agccatcagt tggcggagct ggcctgtatc ctcaaaccca 12600 gtatgagcgg ggtgctcaag cgcctggagc gtgacggcat cgtagcgcgg cgcaagtcgc 12660 cggaggacca gcgccgggtg ttcatcagcc tgaccgaggc cggccagcaa gcgtttctgg 12720 cgatgagcga ggagatgacc cgcaactacg acaagatcct cgcccagttt ggcgatgaca 12780 agctgcagca gctgatgcag ctgctgggtg aaatgaagaa gatcaaaccc tgacgcgcca 12840 ggcgtcagcg gttgagtgac agcgagtctt ccagcacttt cagcagtgct gccgcgcgcc 12900 gctcataggc gtcggggcct gcgtacatca gctctacata caggctgtcg atgatgccca 12960 ggtaggcatc ggcatacagc gccaggcggc tgtgctgctc atgcgcccag ccgtggcgag 13020 cttgcagggc cacgctgaac ccttcgcgta tgccgtccag gtactgttca aagcccgaag 13080 tgacaatcgg cttgatgccc gccgggggca ggaacgccgt gcgcaacacg aagcgcagtt 13140 gggccgagtc gcgataacgt tcggccaggt gcagggccag ccagtgcccc gccgccaggc 13200 cgtcgcgggc ttcctgcgca aagccgtgct cgacaaaggc cgtttcctgc acaagcgcac 13260 gctggaacac ctccacgaac aaggcgtcct tgttggcgaa atgcgcatac agcgatgcct 13320 tgcgcatgcc cgccaactgg gcgatttcgt tcagcgaaga ggcgtcataa ccgtactcgg 13380 cgaagtggcc gacggcggca tcgcacacac gcaccgcaga aggggaaagg tctttcaaca 13440 gcatcactcc gtcaggggcg cggcgggccg cgcgcgtctt gagggtggga ttgtggtgat 13500 cgaaaatgca cgggtcaatg cttgtcgcaa ggcaatttcc gggcgccatg gaaagtgcaa 13560 tgttcccctc gtaacgtgca ttcctccacc caatcgccgc tcacatactg atcgcgtctt 13620 cgaatccaat aagaaagaga ccgctcatga aaaagccaaa ccccctgctg gaagacctga 13680 agtccgtcct gccgaccatt gccgccaatg ccatgcgtgc agagcaggac cgcagtgtgc 13740 cggcagagaa tatcgccttg ctgaaaagca tcggcatgca ccgcgctttc ttgcccaaac 13800 acttcggcgg catggaaatc accctgccgg agttcgccca gtgcatcgcc ttgctggcgg 13860 gggcctgcgc cagcacagcc tgggccatga gcctgctgtg cacccacagc caccagatgg 13920 caatgttctc gcccaagcta caacaggagg tgtggggtag cgacccggat gctaccgcca 13980 gcagcagtat cgcgccgttc ggccgcactg aagaggttga gggtggcgtg tcgttcagcg 14040 gcgaaatggg ctggagttcc ggttgcgacc acgccgaatg ggcgattctc ggtttccgcc 14100 gcaagaatgc cgaaggcgct caggattact gcttcgccat cctgcctcgc agtgactatg 14160 aaatccgtga tgactggtat gccgtgggca tgcgcggcag cggcagcaag accctgatcg 14220 tgcgtgatgc cttcgtgccc gagcaccgca tccagaaggc caaggacatg atggagggca 14280 agtcggcggg ctttggtttg taccccgaca gcaagatttt cttcgccccg tatcgcccgt 14340 attttgccag cggcttctcc acggtcagct tgggcgttgc cgagcgcatg ctggaggtgt 14400 tccgcgagaa aacccgcaac cgcgtgcgtg cctacaccgg tgctgccgtg ggcgccgcca 14460 ccccggcgct gatgcgcctg gccgagtcga cccatcaggt ggccgctgcc cgggcattgc 14520 tggaaaagag ctgggacgag attgccgagc acagtgcccg tcacgaatac ccgtcgcgtg 14580 gcacgctggc gttctggcgt accaaccagg gctacgccgt gaagatgtgc atccaggccg 14640 tcgaccgcct gatggaagcg gccggtggtg gcgcctggtt cgagagcaac gaactgcagc 14700 ggctgttccg cgattcgcac atgaccggtg cccatgccta caccgattac gacgtgtgtg 14760 cgcaaatcct cggccgcgag ctgatgggcc tggagcctga cccggcgatg gtctgagccg 14820 ccacttgttt tcacccatcc cctacaagca caacaacaaa cagggcaggc tgccaggcct 14880 gcccgggagt cttgcatgtc caaagaaacc ttcgattcac gtgccttccg ccgcgccctg 14940 ggcaacttcg ccaccggcgt gaccgtggtg actgccgccg gccccagtgg ccgcaaggtc 15000 ggcgttaccg ccaacagctt caactcggtg tcgctggacc cggcgctgat cctgtggagc 15060 atcgacaagc gctccaccag ccatgaagtg ttcgaagagg cctcgcactt tgccgtgaac 15120 attctggctg cggaccagat cgacctgtcc aacaactttg cccgcccgaa ggaagatcgc 15180 tttgccggta tcgactacga gaccggcact ggcggcgcgc cgttgttcgc cgattgcgcg 15240 gcgcgctttg agtgtgaaaa gtaccagcag ctggacggtg gcgatcactg gatcctggtg 15300 ggcaaggtag tggcctttga tgactttggc cgctcgccgc tgctgtatca ccagggcgcc 15360 tattcaatgg tgctgccgca tacccgcatg acccaaggcg cagaggggca ggcaccgagc 15420 agccacttcc agggccgcct gcagcacaac ctgtactacc tgatgaccca ggcgctgcgt 15480 gcctaccagg ctgactacca gccacgccag ctgtgtaccg gcctgcgcac cagcgaggca 15540 cgcatgctga tggtgctgga gaacgatgcg ggcctgagcc tgaacgacct gcaacgcgaa 15600 gtggcgatgc cggcgcggga gatcgaggaa gcggttgcca acctcaagcg caaagggctg 15660 attgccgatg acgaagggcg agtgcggcta tcggtgaagg gcgtggacga gaccgaggcg 15720 ttgtggacca ttgcccggca acagcaggac aaggtgttcg ggcagttcag tgaacagcag 15780 ctggagactt tcaagaccgt gctcaaggcc cttatcaaca tctgaacacg ctttgggatg 15840 gcaccggctg ttttggatgg caccggctgt gccggtgttc gcggatgaac ccgctcccac 15900 aggtccagcg ccagtagcaa cttcggcgcg gtacctgtgg gagcggcttt agccgcgaac 15960 accggcaaag ccggtgccat ccaaccagaa gcctcagtag gcaccacccc cggcactggg 16020 gactaccact gtatccttga acttccccgc cagctcgcgc agcccgcgca tcagcaccgt 16080 ggtatccaca cccaccgcca caaacgccgc acccagctcg atgtagcgtc gcgccagttt 16140 ctcgtccgcg ctgagaatgc cggcggcttt gcccgccttg ccaatgcgca cgattgcgtc 16200 ttcaatcgcc gcctgcacct ccgggtgccc ggggttgccg cgatgcccca tggccgcact 16260 caggtctgca ggcccgatga acacgccatc cacaccttcc actgcaacga tctcgtccag 16320 gttggccagg ccttccttgt tctcgatctg caccagcagg cacatttgct catcggcgtg 16380 gtccaggtaa ccggggaggg tgttccagcg cgaagcccgc gccagcgcgc tgcccacccc 16440 gcgaatgccc ttgggcgggt aatgcatggc cttgaccagt tgccgcgcct gttcggcagt 16500 ttccaccatc ggcaccagca aggtttgtgc gccgatatcc agcacctgct tgatcagcgc 16560 ggtatcgccg atcaccgggc ggatcactgc ctggctgggg tagggtgcca ccgcctgcaa 16620 ctgggcgagc atgccgcgca ggtcgttggg cgcgtgttcg ccgtcgatca gcagccagtc 16680 gaaaccggca ttggccgcca gctcggcgca gtaggcatcg gccaggccga gccacaggcc 16740 gatttgcggt tcaccgctgt gcaggcgtcg cttgaagtgg ttgatgggca tgtccatgag 16800 caggtcctta aacgaagcgg caggcgatgg agccgagcat gtcgtagtcg acgtggaagg 16860 tgtcacctgg gcgagcggcg accgggcggg tgaacgaacc cccaaggatg atctggccgg 16920 gctgcaaggt gacgtcgtac ggcgccagtt tgttggccag ccaggcaacg cctttggccg 16980 ggtggttgag cacggcagcg ctgaccccgg attcctcgat cacgccattg cggtagagca 17040 ccgccggcac tttgcgcagg tcgatttcgg tggggcgcac ggcccgcccg cccatcacca 17100 cgccggcatt ggcggcgttg tcggagatgg tgtcgaacac cttgcgggtg gcctgggttt 17160 gcgggtccac ctgctggatg cgcgcgtcaa tgatttccag cgccgggatc acccactcgg 17220 tggcgtccag cacatcaaac acggtgatgt tcgggccctt cagcggcttg ccgaggatga 17280 acgccaactc cacttcaacc cgcggcacga tgaagcgctc gaaggggatg tcgctgcctt 17340 cgtcgaacag catgtcgtcg agcaaggcgc cgtagtcggg ctcggtgatg ttcgacgata 17400 cctgcatggc gcgcgaggtc aggccgatct tgtggcccac cagcttgcgc ccggcggcga 17460 tcttttttgc cacccaggcg cgctggatgg cgtaggcgtc ttcgatggtg attgccggtt 17520 gctccagcga gaactggcgc acttgctcgc gggagcgttc ggcctggtcg aggcggtcgg 17580 cggcgtgctg gatgaaagcg ttgtctagca tgggggcggt ctcttgattc aagggttgac 17640 gatggcagcc tgggtgcgca acaccagcag gccgcccagg gcgatgaaga cggcgagtac 17700 gtacagagca aggctggcgc tgtgggtggt gtcgcgcacc cagccgatga agtagggcgt 17760 gaagaacgag gcgatgctgc ccagcgagct gatcagggca atgccggcgg cctgggtacg 17820 ggcgttgagg aacgccggcg gcagttgcca gaacatcggc agcgcagcgc tggcgcccat 17880 gccggccagc accaggccgg ccattaccgg cagcgcctgc tcgggggcaa tggccgcaat 17940 agcgatgccg atggcagcca tcagcagcgg tacgcacagg tgccagcggc gttcgcgttg 18000 gcggtcgctg gagcggccgc acgccagcat gaacacgcag ccggccacgt acggcacagc 18060 gctgagcagg ccgacactgg cgtcgctggc cacaccggca ctgtgaatca ggctgggcat 18120 ccagaacgca agggtattca ccgccagcat caccgcgcaa tacacggcca ccaacagcca 18180 cagcgcacgg cttgcgaaaa tggcgccgaa cgaggttacg ggcttgcgct gttcttcctc 18240 accgaattgc gcgcgcagcg tggctttctg ctgctcatcc agccagctca cccgctcgaa 18300 gtgctccggc aaaacggcca gtaccaccag gcccagcaac accaccggcg ccccttcgag 18360 caggaacatc cactgccagc cacgcagccc gcccgtgtcg tgcatgaagg ccagtatggc 18420 cccggacact ggcccgccga ccactccggc caacggcacg gcaatggcga acagcgcggt 18480 gacctgggcg cggcgcccgg ccgggtacca gcggttgagg taaaccagaa tgcccgggaa 18540 gaacccggcc tcggccgcgc ccagggcaaa gcgcaacagg tagaacgcgc tgctgctttc 18600 gatcagcagc atgctggtcg acaacagccc ccacaccacc atcaggcagg cgatccagcg 18660 gcgtgggcca acgcggtcga gcatcaggtt gctggggacg ccgaacagcg cataggcaat 18720 gaagaacagc ccggcaccca ggccatagac cgtgtcggac aaatgcaggt cctggctcat 18780 ctgcatcttg gcgaagccaa tgttgatgcg gtccaggtgg gcgaacaggt agcacaccag 18840 cagcagcggc atcagccgcc aggtgactgc ccgatgggta ctgtcggccc gttcaacgtg 18900 tgcctcgcgc ggcgaggctt gttcgagtgt gctcatgttt ttgtacttat tctgtaatga 18960 gtcggggagg gcgtggtttg agccggcgcg ctagcggttg aacagtgggt gcaaggtgct 19020 gtgcttggcg tcgtagacct gggcggtgct gtggtcgatc tgcacggtga tgccgatcgg 19080 gcgctgttgc agcagtgggt ccaggcgcgc tttcaacact gccagcaagc tgtcgcccac 19140 tgttttgtgc acctcggcgc tacggccggt agccatgcgc aggttggcgt acagaaagcc 19200 gtattcgcct ttgccgtcgg ccaccgcgca atgggcggcg gggtaggcca gcacgcgtgt 19260 accgccagtg gggaacacgg ctttgccttc ggcatcgcgc tgttcgagca tggtgtcggc 19320 cagggcgcgg cacaggccgg ggatgtcggc gtcggtttcc aggtcggggg tatagagcag 19380 aaccaggtgt ggcatggggg cctcctcggt gaggggcggc tggccacccg ccagggcgac 19440 cagccgcgaa cgggtgggtt acaggcggct ggtgggcacc acggcggccg ggttggcggc 19500 ctgggcagcg gggatggcac caccgtcctg cggggtgacc gggaagatcg cgttgatctg 19560 gccggtgccg gaagagccga agtagggcgt gaccacttcg gccttgccgt cgtaatcgga 19620 ccagcccagc gcacccagca gcattgccgt gtcgtgcatg aagccttcac cgtggccttt 19680 ggcggcgtac tccggcagca tcccgcagaa cgcttcccac tcgccgtcct gccacatttg 19740 caccacacgg tggtcgaggg tttcgaggaa cgggctccac accttggtgg caaagtccgg 19800 cgcctggccg ttctgcgcga agcggtgcga cagcgagccg ctggccagga acgccacggt 19860 gccgtcgtag tggtcttcta ctgccttgcg catggcccag cccaggcggg cactgtcggc 19920 caggtagtgc gaggtgcaca gggccgagac cgagaccact ttgaagtgct ggtcctggtt 19980 catgtagcgc atgggcacca gggtgccgta ttccggggcg agggtggtgg cgtggtgggc 20040 catggtttcg acgttgaagc ggttgcactc ctcggccagc agcttgccca gctcgggatt 20100 gccggggaat gcgtagggca tgttgctgat gaagtgcggc agttcgttgc tggtgtacac 20160 gccctcgaaa tgcggcccgc acagcacgtg gtagttggcg ttgaccagcc agtgcgtgtc 20220 gaacacgacg atggtgtcca cgcccagctc acggcaacgg cggctgattt cgtgatgccc 20280 gtcgatggcc gcctggcgaa agccttggcg cgggcctggc agttcggaca tgtacatgga 20340 cggtacatgg gtaatcttgg cagtgagagc gagtttgccc atgggggtct ccgataagac 20400 gctgttgttg ttttggggct gacccggtcc cttgtaggag cggccttgtt ccgggatggg 20460 gcgcacagcg gccccggcga tatctgcggc gaggctgaaa tccaggggcc gctgcgcgcc 20520 ccatcgcggg cacaaggccg ctcctacacc cgggcggtgt aaaccgcaca gagggttaga 20580 tgccccagcg aggaatgtgg tgattaccca tggaaataca cacgttcttg atctctgcaa 20640 agacctcgaa gctgtactgc ccgccctcac gcccggtacc ggaacctttc acgccgccga 20700 acggctggcg caggtcgcgt acgttctggc tgttgatgaa caccatgccg gcctcgatgc 20760 cacgggccag gcgatgggct ttgccgatgt cctgggtcca gatgtacgag gccaggccat 20820 actcggtgtc gttggccagt tgcagcgcct cggcttcgtc cttgaacggg atcaggcaca 20880 ccaccgggcc aaagatttct tcctgggcaa tgcgcatctt gttgttcacg tcggcgaata 20940 cggtgggctg gatgaactgc cccttggcca ggtgcgcagg caggttggcc gggcgctcca 21000 ggcccccggc gaccaggcgt gcaccttctt cgatgccaat gcggatgtac ccggtgacct 21060 tgtcatagtg ctgctgggtg atcatcgaac cgacctgggt tttcgggtcg gtcgggtcac 21120 ctacgatcag gcgcttggcg cgcgccgcaa actctgcgac aaactgcggg tacacgcttt 21180 cctggatgaa gatgcggctg ccggcggtgc agcgctcgcc gttcagcgag aagatggtga 21240 acagcgcggc gtccagcgca cgctcaaggt ctgcgtcttc gaagatcagc acgggcgact 21300 tgccgcccag ttccatcgag tactttttaa ggcctgcggt ctgcatgatc ttcttgccgg 21360 tggcggtacc gccggtgaag gaaatggcgc gcacatcggg gtggcggacc agggcatcgc 21420 cggcggtagc gccgtaaccc tggatcacgt tcagcacccc gttggggatg ccggcttcta 21480 ccgccaggcg gcccagttcg ttggcggtca gaggcgacag ctcgctcatc ttcagcacgg 21540 cggtgttgcc cagcgccagg cacggcgcag tcttccaggt agccgtcatg aacggcacgt 21600 tccatgggct taccaggccg cacacaccca ccggctggta cagggtgtag ttgagcatct 21660 ggtcgtcgac cgggtaggta tggccgtcca tgcgcgtgca cacttcggcg aagaagtcga 21720 agttgtgcga ggcacgcggg atcagcacgt tcttggtctg gtggatcggc aggccggtgt 21780 cgagggtttc cagctcggcg agtttcggca cgttctgctc aatcagctca cccagcttgc 21840 gcatcagccg ggcacgttcc ttggccgggg tgttggccca cttggggaag gcttccttgg 21900 ccgcagccac agcctgggcc acttcctcgg cgccgccgct ggcgacttcg cagatggcgt 21960 cgccggtggc cgggttgtag ttgacgaagg tgtctttgct ctcgacctca cggccgttga 22020 tccagtgctt gatcatgctg ctcatgcctt gttgttcttg aagaagtcag cttcgctgac 22080 gatacggttg accaggcgac cgacgccttc cacttccacc accacttcgt cacccggcac 22140 cacatcggcc aggccttctg gcgtgccggt ggcgatcatg tcgcccggtt gcagggtcat 22200 gaagctggag aagtattcga tgaggtgcgg gatgtcgaag atcatgtccg cggtggtgcc 22260 ttcctgcttc agctcaccgt tgatccaggt gcgcagcttc aggttgctga cgtctggcac 22320 atcggccgca tcgacgatcc acgggccgac cggggtggtg gcatcgcggt ttttcacccg 22380 caggttgggg cggtagtagt tttccaggta gtcgcggatg gcgtagtcgt tgcacacggt 22440 gtagccggca acgtaggcca gggcgtcctc acgcttgacg ttcttcgccg ctttgccgat 22500 caccgccacc agctcgcact cgtagtgcat gtattcgacg ttgtccgggc gccaggtgac 22560 ctggatgtgg ccggtgtagg tgcctggcga cttgatgaaa gccaacggtt cggtgggcgg 22620 cgcgaaggcc agctccctgg cgtggtcggc gtagttcagg cccagggcga acatgctgcc 22680 ggtggcgggt ggcagccagg tgacctggtc ctgatggacc aggcggccgt cggcaaggcg 22740 caggtgatcg tcttcgaccg tgacatcgtg ggcctggccg tcgaactgga tacgggcgtg 22800 tttcacaggt aattcctcac tcggcgacga tgtggttggt cagcttgccc aggccgtcga 22860 tctcgatgtc gacgcggtca cctggctgta catcgacgcg gccctcgggg gttccggtga 22920 tcaggatgtc gccggcgtgc agggtcatga actcgctgat ttcggcaatc agctgcgcca 22980 ccgtgcgtac gcagttggcg gtgttgttgt gctggcgcag ttcgccgttc acatacaggc 23040 gcaggcccag ggcatcgggg ttggccactt ggctggcggg caccagttca gggccgaccg 23100 ggcaaaaacc atcacggcac ttggccttga ctgcagggcg gtagtagctg gcttcgggca 23160 ggctcacttc gttgacgatg gtgtagcccg ccacatgctc cagggcatcg gccacgctga 23220 cgcggctggc gtccttgcca atcaccactc ccagcgccgg gccgggttgc acgcgctgca 23280 cgccggccgg gaataccacc tggccttcat gctggttgcg ggtgttcggg gtcttgacga 23340 acaacaccgg cttgaccggc agttgcttgt acggtgcttc cacgaacgcc gcttggtgct 23400 gctgcagcaa accctggtag ttcagcgcga cgccgaacag ggtgccgctg gcaacgtcaa 23460 gcagggcatg gctcatgctc ttctcctggc agtgcagggc ggtggccgtc ctgcggattt 23520 cgttaatgtg ttaatgttat agttaatatg ttaacgatgg tcaaggggtg gccagtggcg 23580 cctgccggca aggcaaggca ccatgggcca tcgtcaacag ggtcaagcga tttgcgagca 23640 agcagccatg agcgaccggc atccgatacc gaacatcaac attggccagg tttacgacca 23700 gcgctacagc gacagcgagg tgcattacga ccggctgggc aacctggcgg gctttttcgg 23760 gcgcaacatg ccggtgcacc ggcatgaccg gtttttccag gtgcattacg tgaagtcggg 23820 cacagtacgg gtgtatctgg atgaccagca gtacatcgag gccgggccga tgttcttcct 23880 cacgccaccc acggtggcgc acgcgttcgt caccgaagct gacagcgacg ggcatgtgct 23940 gacggtgcgc cagcaactgg tgtggcaatt gatcgaagcc gacgccagcc tgctgccggc 24000 gggcatgcag gtgcagccag cctgtgtggc gctgggcaac ctgccggccg aatacaaggc 24060 cgaggcgcag cgcctgcaag gctggctgga cgcgttgagt gacgagtttg ccacgcagca 24120 accgggtcgc gaggcggcgt tgcagtcgct gacccgcctg atcatgatca gcctgctgcg 24180 gctgtgcccc aactcgctgg aatcgacccc ggcgcggcat gaagacctga agatcttcca 24240 ccgtttcaat gccctgatcg aagcgcatta ccttgagcat tggccgctgg cccgctacgc 24300 gcagcagatt ggcgtgaccg aggcacggct gaacgatgtg tgccggcgca tcgccgactt 24360 gccatccaag cgcctggtgc tggaacggct gatgcaggag gccaagcgtt tgctgttgtt 24420 ttccggcagc acggccaacg aaatctgtta ccagctcggc ttcaaggatc cggcctattt 24480 cagccgcttc ttcaaccgct acgccaagct cacacccggg gagtaccgcc agcggcaggc 24540 agaattgcag tgaaatggcc atggcggctc acccgggtgc tgttgttgtt tacagcggat 24600 ggtcgcagcc cgcgcgccgg gcttgaatgg gttttccgtg gaacagattg cactttccat 24660 cgtgcatgcc cttaaattcg tgaattgaga aaaagccaca ggtttgacca tgaccaagac 24720 gcaaccttcg ctcacgctaa gcctgttgca ggcccgagaa gccgcgatgg catttttcag 24780 gccgctgttg aaccagcacg acctgaccga gcagcaatgg cgggtaatcc gcatcctcaa 24840 gcagcacggc gagctggaga attatcagtt ggcggaactg gcctgcatcc tcaagccgag 24900 catgaccggg gtactggggc gcctggagcg agacgggctg gtgcggcggc agaaggccgc 24960 gcaggaccag cgacgggtgt tcgtcagcct gaccgaaaga ggggaggcgt gctttgcctc 25020 gatgaaggaa ggcatggagg ccaactacca gaagattcag gcgcagtttg gtgaagagaa 25080 gctgcagcag ctgatggggt tgttgaatga cctgaagcgc atcgcgccat aa 25132

<210> 2

<211> 12339

<212> DNA

<213> Pseudomonas putida U

<220>

<221> misc_feature

<222> (1) .. (12339)

<223> Cluster tyn

<400> 2 tcaggcgaaa cgctcgaagc ggtacggtga cgggtcgatc agcggggtgg cctgggccac 60 caggtctgcc gccagctggc cagcagcagg cgaggtgccg aagccatgcc cggaaaagcc 120 ggtggccagg gtcaggcccg gaatactggc caccgggccg atgaccgggt tggagtcggg 180 ggtgacgtca atcgtgccgg cccaggcgct ggcgatacgg gcctgttcga acaccggcca 240 ggccgctttc aggttgcgca tggcctcgtc gttgagggcc gggttggcgt gcgggtcttg 300 tacccgtaca cgctcgaagg gggttacatc cgttgccttc cagcgccggg ccagggccag 360 gtccttgaag aagtacttgc caaagctgat gcgcaaaaag tcccgctggg cacgcagctg 420 gggcaggtaa cgcttgccca gcagcaggtg atcgagggtg aggaaggcgt ccagcgcgcc 480 gcgctgggtg atgatgtagc cgccgtcctt gtgcttgcgg aaggaaaaat ctggtgcgcc 540 cacggcgatg tcggttggcc cgtccatggg ctctgtgcgc agcacggaac aggtcagcgg 600 caaggtcggc aggttgatgc ccaggttgcc gaggaacttg cgcgaccaca ggccaccggc 660 cagcaacacc tggtcgcagc ggatttcacc ttgctcggtg accaccccgc tgacacggcc 720 ggctgcggtg accagcgtgc gcaccgcgca gttctccact accactgcac ctttggcgat 780 cgccgcccgg gcgatggcgc tggcggccag ggtcggttcg gcgcgggcgt cggagggggt 840 gaagatgcca cctgcccaat ccgcccgacc acccggcacc atccgggtga tttcccgcgt 900 gctcagcagg cgcgaatcca ggcccagcgc ctcgacgctt ttcagccagc cttcatgcat 960 gcccatctgc gtgtcgttac ggccgatgaa catgatgccg gcttgccgat agccaacgtc 1020 gctgccaacc cgtgcgggca tctcggccca cagccgatca gccgccagtg ccaggggaat 1080 gtcatgggcg tggcggttgg tcttgcgcac ccagcccagg ttgcgcgacg actgctcccc 1140 agcgatgcgc cccttctcca gcaccaccac cggtatgttg cgttcggcga ggctcagtgc 1200 ggcggtgagg ccgataatgc cgccaccgat gatcaccacg gtagtggcgt cggggtggcg 1260 ggtgctggtt tgcacagggg cgatcgtggg agacatggct ttactctttg ttgtgcgtgc 1320 agggggagtg ttcagcgcca gccagcagcc tcactggcca aggcggatca gggtcacttg 1380 cgcttgcccc gcaccgcggt aggcggtgac ctccagctcg accttgtaaa cggtggagcc 1440 cagcggcggg caggtgaccg tggtggccgg gtcgatgccg cggaacttct cgccgatcac 1500 gtccatgacc cgtggtacat cggcagggtc ctggatgaac acgcgcgagt tgatgacatc 1560 ggccaggctg gcatcgactg cggccagcgc ggtttcgatg ttggcgaaca cctggtgggt 1620 ctgttcgatg acgtcctctg gaatgacctg ggtctgcggg ttgcgtccgg cggtgttgga 1680 gacgtgaatc cagttgtcca ccgccaccag gcgggagtag ctggccatgg cttcgaactt 1740 ggagccggtt ttcagtttga tgatctgtgt catgggcttt gccttgttat ccggttgcgg 1800 ggatcagctg agaacggggg tttcccagag gttgagcttt acgccgatgc cttgctcgag 1860 cgccttgcgg tacaccacgg tgccccaggc cacgtcttcg acgggcatgc cgcccaccga 1920 catcaggatg atttcgtcgt catgcaggcg gcccggtgcg tcgccgctga tgatcttgcc 1980 gatgtcttcc acctgctcgg cggccagcgt gccttcggca atcatgtcca tgaagcgcac 2040 acctaccagc ggtacgtggt tgtgcgcagg cttgggcagc tcttcgaacc aggcctcgta 2100 gaggccggtg ttgtccacca ccttgcgcac gtcgtcctgc tccatgccgg cgtcgatact 2160 gcacggggct ggcatggcca ggaacgcgcc aggcttgacc cactcgcggc gcaccagcgg 2220 gtactggctg gggtcgccga cttcgcccga gctgcagtag ctgaccaggt cggaaccgcg 2280 taccacttct tccagggttt ccaccacctg gacatgagtg atttgcggga agctggtttt 2340 cacccaggcg acgaaggcat ccaggttctt ctggccacgg cccttgacct tgagggtgtc 2400 gatcagcggg cagacggcca tgaacgcagc gaccgtggtc ttgcccatca cccccgggcc 2460 ggccaggccg atcaccttgg cgtccttgcg cgccaggtgg cgggcgccga cgcccgggat 2520 ggcgccggtg cggtaggccg acagcaggtt ggccgacatg tgtgccagtg gcgcgccggt 2580 gtcggcatcg ttgagggtga acatcaggat cgagcggggc aggcctttct cacggttggc 2640 gatgttcgag ccgtaccact tggcgcctgc ggtctggaag ttgccgccga ggtacgccgg 2700 catcgccatc atgcgccggt cggcggtggg cttgggcatg ttggggaatg gcgagtgctc 2760 ggggaaggta atcatcgcgc cgtgcgagtc gctgttcggg ccggccatgc ggtagtcacc 2820 ctggtacagc aggccgaaca tttcttccat ggtgtcgaca caggccggca tgtcggtgac 2880 gccggcacgg atcatgtcct gctcggacag gtagatgaag tcaattctgg tatcgagggt 2940 catggcgggt ctcgcagggc tggctgccgt cggatttgtt gttggtttcg aggcaaccag 3000 tttcgctaac gactggtagg tcgtcttgtg tctgcctgcc agccgagttg accgtcagtg 3060 ccagggcttc aatggcccgc gagcgagaag ctggccgggg tgtggcgcag gctgagggcg 3120 gtcagcaggc acaccaccag ggtgcacagg gccagcagcg cggcccatgc ggtcgggccg 3180 tggttgagta ccactgcggc cagcggggcg gcgccggcag acgccgacag ctggatggcg 3240 cccagcagcg ctgcggtgga acccagtgcc ttttcttgcg aggccatcac cagcgacatc 3300 agcgtcgact cggctatccc caggccgaac agggctatca ccatgccgcc ggccacacct 3360 ggcagcccca ggccggtcag tgcaccgagc aggctgatgc aggcaccgcc ggccatgcac 3420 agcacgccca cccgagtcaa ggtattgagg cccagccggc tgatcaggtg gctggccgtc 3480 atggcgccga gcaggatcga caccccggtg gcgccaaaca gcaggccgaa ggcctgggcg 3540 ctcaggccgt agtgggcctg gtacaccagg gtggcaccgc cgatgtaggc gaacaggaag 3600 aagaataccg cagcaaccgc cagggtcggg cgcaggaagc ggcggtcggc gaggatggcc 3660 aggtaggtgc tgcaggcgtg gcccaggcgc aggggttcgc gtttgctggg cggcagggtt 3720 tcgggcaggt tcagcaggct gttgaccagc accgtcacgc ccatgccggc gagtaccagc 3780 attactgcac gccagccgaa atgtgcgtcg atcacgccgc ccagggcagg tgccaggatc 3840 ggtgcgacgc cttcgatggt catcagcagg gcgaacagtt tggtcgcggc cacgccctgg 3900 ctcacatcac gcaccatgct catgatcacc accagggtca gcgcactgcc caggccctgg 3960 aaaaagcgca gcatgatcag ggtgtcgagg ctgggggctg cggctgcgcc cagcgagcac 4020 aggatgaaca gcagcaggcc ggccagcagc ggcttgcgcc ggccataagc gtcgacgatg 4080 gggccgaaga tcagctggcc ggcgcccatg gccagcagga agaaggtcag tgtcagctgt 4140 acgcgggtga agctagcctg atagtggctg gcgatttccg gcaggctcga caggtacatg 4200 tcgacggcgg aagggccgag ggcgccgatc aggcctaggc ccagggcgaa gctgaagggt 4260 atgggagggg agggattggc ttgcatggtt ttctctggct gatttttcgc ctaccgaccg 4320 gtaggtttgc gaatattatt cgccgagtcg gccaaggtca aacccttccg caaggccact 4380 gattcctgtg gggagcgggc atgcccgcga acaccggcaa agccggtgcc accgagtcgc 4440 cttcttcgcg ggcatgcccg ctcccacatt gaccgcagag gttggttacc gtggttgcgt 4500 cagaacggca cagccacggt cagctggcta tacacattgg taccattccc gcccacctgg 4560 ttgccgccgt tgctctcgtc cttgcgcggc tggtaaaggc ccaccagcgg gctgattatc 4620 aggtgctcgt tgactgccca ttccacatac aggtccagct cccgcgcatc gaggttgagg 4680 ctttcgcggg tgcgtacggt gtcgaagtcg aagtacagcg ccccgactgt gagattttcc 4740 agcggtgtcg ccttcacgcc cacatggtgg atacccgtgt tgctgttgaa ggggccggcg 4800 tagttggcag cgacttcacc ctggaaccag gtgccgtaac cgctggacag gccgctgaac 4860 agcgcgtccc agcctgccga gtagcgggtg tagcggtagg taacctgcgg tgcccacggc 4920 aggtcggcga aggtgtagcc ggcctgcagg taccaggctt gctcggggcc gtcggtcttg 4980 tcctgccagg cgtattcgaa ggcgaaactg gcattgtcga tgccagcgtt gccttcgccg 5040 cgcacgctat acacgtccat gccttcgcgg gctttctgaa agtcgctggc ccattggtcg 5100 gtgacgtcga tgccgtgaat ccaggtcagc ccgagggtgc ccaaggcttg ggtgtagtcc 5160 agcgtgccgg cggccagttc ggtttcggcc tgggcgcggt tgtcggattt cagccacagc 5220 aggctgccat gcaggccatc gctgcccccc aggcgcagca ttgcggtgcg gtcgaaggcg 5280 tggcgggcgg ccaggtagta ggccccgccg cggtccagcg caccgtcggc gacgccgttg 5340 cccaggttcg ggccgtcgtc gttgatcaaa aaaccactgc ccaggcgaat ggtctggcgg 5400 ccggcggaaa cgtccactcc atccttgccc agcaccggga acaggtcggc cgagcgccag 5460 ccgaggaagg cgtcttcgat cttggtggtg cgttcggagc catcggtgtt gccggccgca 5520 tcgccatcgc cccaggtggc cgagctcacc cagttcaggc tgccgtacag cgtgccgttg 5580 ccggccaggc cctggtcacc gctgaggcca tacttgataa agccttcacg ccaggtcgaa 5640 ccccctgtgg tgccgtcgta gttcttgcgg ctgttgaaca tgccccatac cgccagcatg 5700 tcggcgttca ggtggctgtc atcgtcggcg tacagctcaa cggccggcgc ggcctggctg 5760 gccagcaagg ttgccagggc caggctggac agcgtctgtg gtttgaccat ttgcacatcc 5820 ctcgtttgtt ctcggccacc ttcacagggg cctttgttgt tcgggggcac cctcggttct 5880 ggcgaggggc catcgcggtt ggcggcgatg gcctattagg gcgtgtgcgg tggggcgggg 5940 tcttgttcgt ggctgccaag gcgcttgcac gccttggcca caggcgcggt cagtagcgga 6000 tcatcaccga cttgagctcg gtgaagtcat cgatgaaggc cgagccgaac tcgcggccaa 6060 tgccggaagc cttgatgccc ccaaacggta cagccgggtc gagcagggtg tgcatgttga 6120 cccacagggt accggcctgg atttgcggga tcatgcgcat ggccttgccc aggtcgttgg 6180 tccacaggct ggcgctgagg ccgtagggcg aggcgttcat caggtgcagc agttcgtctt 6240 cgtcgtcata aggcaggaag gtcgccacag ggccgaaggt ttcctgggtg agcagggtgt 6300 cgcaggctga ccgggcgagg attaccgtgg gttcgacgaa acagccgggg ccgtcgccca 6360 gggtgccgcc gtgaatgatc tggctgcctt cggcgcgggc gatggcgaac agttcggcca 6420 gcttctgctg gtgcggcttg ttggccacgg ggccgaactg ggtggcctcg tccagtggcg 6480 agccgatttt cagttggccc aggcgctggg acagggcgtc cagcagcggg tcgatgcgcg 6540 agcggtgcac atagaagcgc tcgcccgcgg cgcagatttg ccccgagtgc aggaagccgg 6600 cctcgatgat gccgtccaca gccttgtcgg ttgccacgtc gggcaggaag gccaccgcgt 6660 tcttgccgcc cagttccagt gtcgcacggg tcagcttggc gcccatggca gcctggccta 6720 cggcgatgcc agtgggcacg gagccggtga acgagacctt gtcggtacct gcgtgctcga 6780 tcagtgcctt gcccaccagg ccaccaccgg tcagcacgtt cagtgcaccg gccggcaggc 6840 ctgcttcggt ggccagttcg gcaatgcgca gcagcgtcag cggggtgaat tcgctgggct 6900 tgaggataat gctgcagccg gttgtcaggg ccgaggccag cttccagatg gcgatcatgc 6960 tggcgaagtt ccacggcacg atgcccacca ccacgccaat cggctcgcgc agggtgaagg 7020 cgctgtagcg ctcaccggcg aacgagggca gcgacggggt gatggtctgg ccggtgatct 7080 tggtcgccca gccggcgtag tagcgcagga agtgcgcggc ctgctgtact tcgaacgcac 7140 gggaaatgcc gatgagcttg ccggattgca aggtttccag ctgcgccagt tcttcgcggt 7200 tggcttccag caggtcggcc agcttgaaca gcactgcggc gcgggcggcg gggctggtgt 7260 gcgaccaggc ggtaaagcct tggcgcgagg agctgacggc atggtcgaca tcggcctggt 7320 tggcgtcggc gatgtgggcg atggtctggc cgttggccgg gttgaccacg gcaatgttcg 7380 acgacgactg gctggcgagg tgctggccgt ggatgaacac gccatgctcg cgggccagga 7440 aggccgtgac ggcaggtagg agggtgatgt cgctcatgca gactccgggg cagttggcca 7500 aagtttgcag cttaataagc ggggcagtgc ggtgcttgtg cctgcgtgac aggtgcatga 7560 ctgtggctgc caaccgcact gggtaagcct tgtgggagcg gccttgtgtc gcgatagggc 7620 cgcagagcgg ccccggcgat gttggcggcg aagctgaaaa tgctggggcc gcttcgcgcc 7680 cctatcgcga cgcaaggccg ctcccacaaa aaaagcgagc gtaggccggg ctgattgctg 7740 gcaggcagca acaagcccgg cggcagccat cggcaagacg ccatgccacc ggcagcgcac 7800 agtaatcact cgttcaacgc cacaaaaaca agccggggca tacgatgtca ctcaataaca 7860 agctcaccga gcacctcaac cgcggcactg tcggtttccc caccgcactg gccagcactg 7920 tcgggctgat catggccagc ccggtgatcc tcaccgcgac catgggcttt ggcatcggcg 7980 gcagcgcctt cgccgtggcc atggtcatcg ccgcactgat gatgctggcg cagtccacca 8040 cctttgccga ggctgcgtcg atcctgccga ccacgggctc ggtatacgac tacatcaact 8100 gtggcatggg ccgtttcttc gccattaccg gcacgctgtc ggcctacctg atcgtgcatg 8160 tgttcgccgg taccgccgaa accatcctgt cgggggtgat ggcgctggtg aacttcgagc 8220 acctcaatac cctggcggaa tccgccggcg gttcgtggct gctgggggtg tgcttcgtgg 8280 tggcgtttgc ggtgctcaat gcctttggcg tcagcgcctt cagccgcgcg gaagtggtcc 8340 tcaccttcgg catgtggacc accttgatgg tgttcggcgt gcttggcctg atcgccgcac 8400 ccgcagtgga actggacggc ccgttcggcg tgtcgctggt gggcaccgac ctgatgacca 8460 tcctctcgct ggtcggcatg gccatgttca tgttcgttgg ctgcgagttc gtcacgccgc 8520 ttgcccccga actgcgtcgc tcggcctggg tgctgccgcg ggccatggcg ctgggcctgt 8580 ttggcgtggc cagctgcatg ttcatctacg gagcggcgat gaagcgccag gtggaaaacg 8640 tggtgctgga tgccgccagt ggcgtgcacc tgctggacac gcccatggcc atcccgcgct 8700 tcgccgagca ggtgatgggt gatattggcc cagtgtggct gggtatcggc ttcctgttcg 8760 ccggcgcggc caccatcaac acgctgatgg ccggtgtgcc acgcattctt tacggcatgg 8820 cggtggacgg cgcgttgccc aaggtgttca cctacctgca cccgcgcttc aagacgccgc 8880 tgctgtgcat cctggtggtg gcgttgatcc cttgcctgca tgcctggtac ctgggcggca 8940 acccggacaa catcctgcac ctggtgctgg ccgccgtgtg cgcctggagc accgcctacc 9000 tgctggtgac cctgtcggtg gtgatattgc gcatccgccg cccagacctg ccgcgtgcct 9060 accgctcgcc gctgttcccg ttgccgcaga tattctccag tagcggtatc ctcatcggca 9120 tggcgttcat cacaccgccg ggcatgaacc ctgccgatgt ctacgtgccg ttcgccatca 9180 tgcttggcgc cactgcggcc tatgcattgt tctggacgct gtgggtgcag aaggtcaacc 9240 cgttcaagcc ggcgcgggtc gaggatgtgc tcgagaaaga gtttgctgcc gagcctggcc 9300 acgccgtgga gcacgtgctg catgatcaga aatttgcgtg aacgcttgct ggcgccccga 9360 gcgccttcag gctatcgccc aggcgccacg ctggcatgcc tggcgcgcaa cctggggcag 9420 cagaacctgg tggcggccgg ggtgatccac gacccggccc agggttggca ggccacggtg 9480 cacgaacgcg tcgaggccca cctgctgatg cacatcgtca cctgtgagtt ccagctgcag 9540 ttgcctgctc cgcaaggggg cgaggtcagc ctggagctgc gccataccgg tgcgcttcgc 9600 cgtgccggcc tggcctgtgt gtaccgcaag ggcgaccggg cgcgcttcgc ccgactgcgc 9660 gaccggttgc tgcagcaggc cgcactggtg gcggcgctga tgccgctgga tttcaagcgc 9720 ctgaccttgg cctggcgcga cggccaatgg ttgctgaccc tggagcacat gggcggtagc 9780 gaagtggtca accgcatgcc agcgtttcgc cgctacatcc ccatcagccc gcaacagcgg 9840 gcgcacctga tggccagcct ggcccagttc aacactttgc tacctaacct ttgacgcaaa 9900 ctggcatacg ccttgctgta tcaagcgacg aatgatgaca gttgtgcgca catagataac 9960 atgttaacaa tgtgcgcata acaacaaatc ctgcgtcgag ggcagccatg catactcaac 10020 aatccaaccg tcaggggctg gaacgctgga ccacggccat gcaacagatc tgtggccgtt 10080 tcgagacgga acttgcgtcc aatcactcgc tgttcatcgg cgaggtttct accttttccc 10140 gtgccggctt gccgctggcc aacctgcgca ccaatgccgg caacatccgc cggctgggcg 10200 aaaacccgac ccttgacgat gaccagcatt gtttcctggt cagccagcgt gcggggcatt 10260 ccaccgtgtc ccaggggggc atgcaggtca gcctggcgcc gggtgagctg ctgctgatgg 10320 attcggtcgg gcgctgcgaa atcaccccca gtgggttgat cgaacatgtc tcgctggccc 10380 tgtcgcgtga gcaggtacgc aagtatgtgc aaggcagcgg cccgatgttt ggcaagatct 10440 cctcgagcaa cgcctgcggg cgcatgctgc atgtgctgat ggaccaactg tgcaaggacg 10500 gcaatgtaag cggtgatggg gcccagggcg acgcgctgca gaccgccttc attgccctgc 10560 tggagccagg cttcgagcgc catggcgaag cgctgggcaa ccttggggcc ttgaacgggg 10620 ccaacctgcg gggctacgtg cagcaggtga tcgacgagtc cctgtcacag cccgggctga 10680 ccccgtccaa cctggccggt cgcctgaaca tctcggtgcg tcacctgtac cggctgttcg 10740 aggaggaggg cgatagtgtg tgccgctaca ttcagcgggc gcgcctgaag cgcagtgcgg 10800 atgacctggc caacccgttc ttcaggagcg agtcgattac ctcgattgcc tacaagtggg 10860 ggtttaccga ctcggcgcat ttcagccgct cgttcaagaa acagttcgaa cgctcgccca 10920 aggactaccg ggcgcaggcg atggtttgag tgtgatggtg ctgcttgtgc gggcctcatc 10980 gccggcaagt cacttggcgg cggttcagcg acggccgttg aagtagcccg acagctggtg 11040 cacggtcttg ccggcagtga gcagcagcgg gcggaaatgg tccttgccga ggatgcgcgc 11100 atgcttgacc gagctgacca ggtcatagcg cttcgatccc tcctgcatac cctcggcgag 11160 tatcttgcaa atgatgtggc tgggcgtgac gccaaagccg gagtagccct gcacatagaa 11220 agcgttgggg cggttgtcga gggtgcctat ctgcggaaac aggttggcac tggtggccat 11280 cgggccgccc caggccaggt cgatgcgcac gtctttcagg taggggaaaa tcttcagcat 11340 cagcgcgcgg ttccacgcct tcaggtccag cgggaagtgc tcgacgaagg gcgtggcggc 11400 gccaaacagc aggcggttct cgcgggtgac ccggtagtag tcgatcaccg ggcggatgtc 11460 gctgtaggcc ccgcgtatcg ggctgatgcg ctcgatcagc tcatccggca atggctcggt 11520 catcatctgg aaggcatagg tgtttatagt gcgtgcgtgc agctgcggct ccagcttgtt 11580 gaggaagctg tcgcacgccc acagcagctt gctggcgcgt accgagccac ggccggtgcg 11640 taccgtgatg cgctcgccgt aggtcacttc cagggccggg ctgtgttcga agatgcgcgc 11700 accatggccc accagtgcct gcgcttcgcc cagcagcagg ttcagggaat gcacatggcc 11760 accgcccatg tgcatcaggg cgctgctgta ggcgttgctg ccgatgatct ggcgcacttc 11820 gctgccaccg agaaaacgga tctcgtcgcg ggtattgatc gccttgaacg ccttctccca 11880 tttgcgcagg gtctgttcct ggcggcggtt gaagcccatg tagccatagc cgtggcagaa 11940 gtcggcgtcg atggcgtagc gggcgatgcg gtccttgatg atgccggcgc ccagttcgct 12000 gatttcgaaa atatccctca cgccctgatc accgacgctg ctgcggatct tctccaggtc 12060 gtggccgatg cccgccatga tctgcccgcc gttgcgcccg ctaccgccgt agcccagata 12120 acggccctcg agcacgacga tattggtcac gccttgttcc gccagctcca gggcggtgtt 12180 aatgccggag aaaccgccac cgatcaccac gacatcggcc tcgatgtcgc gttccagggt 12240 tgggaagctc aggttgtact tcttggtcgc cgagtagtag gtggggctct cgagggtgat 12300 catgacgccg cctgctgact ggaaatgggt agaaatcat 12339

<210> 3

<211> 1296

<212> DNA

<213> Pseudomonas putida U

<220>

<221> CDS

<222> (1) .. (1296)

<223> Secuencia codificante de TynA

<400> 3 atg tct ccc acg atc gcc cct gtg caa acc agc acc cgc cac ccc gac 48 Met Ser Pro Thr Ile Ala Pro Val Gln Thr Ser Thr Arg His Pro Asp1 5 10 15

gcc act acc gtg gtg atc atc ggt ggc ggc att atc ggc ctc acc gcc 96 Ala Thr Thr Val Val Ile Ile Gly Gly Gly Ile Ile Gly Leu Thr Ala

gca ctg agc ctc gcc gaa cgc aac ata ccg gtg gtg gtg ctg gag aag 144 Ala Leu Ser Leu Ala Glu Arg Asn Ile Pro Val Val Val Leu Glu Lys

ggg cgc atc gct ggg gag cag tcg tcg cgc aac ctg ggc tgg gtg cgc 192 Gly Arg Ile Ala Gly Glu Gln Ser Ser Arg Asn Leu Gly Trp Val Arg

50 55 60

aag acc aac cgc cac gcc cat gac att ccc ctg gca ctg gcg gct gat 240 Lys Thr Asn Arg His Ala His Asp Ile Pro Leu Ala Leu Ala Ala Asp65 70 75 80

cgg ctg tgg gcc gag atg ccc gca cgg gtt ggc agc gac gtt ggc tat 288 Arg Leu Trp Ala Glu Met Pro Ala Arg Val Gly Ser Asp Val Gly Tyr

85 90 95

cgg caa gcc ggc atc atg ttc atc ggc cgt aac gac acg cag atg ggc 336 Arg Gln Ala Gly Ile Met Phe Ile Gly Arg Asn Asp Thr Gln Met Gly

100 105 110

atg cat gaa ggc tgg ctg aaa agc gtc gag gcg ctg ggc ctg gat tcg 384 Met His Glu Gly Trp Leu Lys Ser Val Glu Ala Leu Gly Leu Asp Ser

115 120 125 cgc ctg ctg agc acg cgg gaa atc acc cgg atg gtg ccg ggt ggt cgg 432 Arg Leu Leu Ser Thr Arg Glu Ile Thr Arg Met Val Pro Gly Gly Arg

130 135 140

gcg gat tgg gca ggt ggc atc ttc acc ccc tcc gac gcc cgc gcc gaa 480 Ala Asp Trp Ala Gly Gly Ile Phe Thr Pro Ser Asp Ala Arg Ala Glu145 150 155 160

ccg acc ctg gcc gcc agc gcc atc gcc cgg gcg gcg atc gcc aaa ggt 528 Pro Thr Leu Ala Ala Ser Ala Ile Ala Arg Ala Ala Ile Ala Lys Gly

165 170 175

gca gtg gta gtg gag aac tgc gcg gtg cgc acg ctg gtc acc gca gcc 576 Ala Val Val Val Glu Asn Cys Ala Val Arg Thr Leu Val Thr Ala Ala

180 185 190

ggc cgt gtc agc ggg gtg gtc acc gag caa ggt gaa atc cgc tgc gac 624 Gly Arg Val Ser Gly Val Val Thr Glu Gln Gly Glu Ile Arg Cys Asp

195 200 205

cag gtg ttg ctg gcc ggt ggc ctg tgg tcg cgc aag ttc ctc ggc aac 672 Gln Val Leu Leu Ala Gly Gly Leu Trp Ser Arg Lys Phe Leu Gly Asn

210 215 220

ctg ggc atc aac ctg ccg acc ttg ccg ctg acc tgt tcc gtg ctg cgc 720 Leu Gly Ile Asn Leu Pro Thr Leu Pro Leu Thr Cys Ser Val Leu Arg225 230 235 240

aca gag ccc atg gac ggg cca acc gac atc gcc gtg ggc gca cca gat 768 Thr Glu Pro Met Asp Gly Pro Thr Asp Ile Ala Val Gly Ala Pro Asp

245 250 255

ttt tcc ttc cgc aag cac aag gac ggc ggc tac atc atc acc cag cgc 816 Phe Ser Phe Arg Lys His Lys Asp Gly Gly Tyr Ile Ile Thr Gln Arg

260 265 270

ggc gcg ctg gac gcc ttc ctc acc ctc gat cac ctg ctg ctg ggc aag 864 Gly Ala Leu Asp Ala Phe Leu Thr Leu Asp His Leu Leu Leu Gly Lys

275 280 285

cgt tac ctg ccc cag ctg cgt gcc cag cgg gac ttt ttg cgc atc agc 912 Arg Tyr Leu Pro Gln Leu Arg Ala Gln Arg Asp Phe Leu Arg Ile Ser

290 295 300

ttt ggc aag tac ttc ttc aag gac ctg gcc ctg gcc cgg cgc tgg aag 960 Phe Gly Lys Tyr Phe Phe Lys Asp Leu Ala Leu Ala Arg Arg Trp Lys305 310 315 320

gca acg gat gta acc ccc ttc gag cgt gta cgg gta caa gac ccg cac 1008 Ala Thr Asp Val Thr Pro Phe Glu Arg Val Arg Val Gln Asp Pro His

325 330 335

gcc aac ccg gcc ctc aac gac gag gcc atg cgc aac ctg aaa gcg gcc 1056 Ala Asn Pro Ala Leu Asn Asp Glu Ala Met Arg Asn Leu Lys Ala Ala

340 345 350

tgg ccg gtg ttc gaa cag gcc cgt atc gcc agc gcc tgg gcc ggc acg 1104 Trp Pro Val Phe Glu Gln Ala Arg Ile Ala Ser Ala Trp Ala Gly Thr

355 360 365 att gac gtc acc ccc gac tcc aac ccg gtc atc ggc ccg gtg gcc agtIle Asp Val Thr Pro Asp Ser Asn Pro Val Ile Gly Pro Val Ala Ser370 375 380 1152 att ccg ggc ctg acc ctg gcc acc ggc ttt tcc ggg cat ggc ttc ggcIle Pro Gly Leu Thr Leu Ala Thr Gly Phe Ser Gly His Gly Phe Gly385 390 395 400 1200 acc tcg cct gct gct ggc cag ctg gcg gca gac ctg gtg gcc cag gccThr Ser Pro Ala Ala Gly Gln Leu Ala Ala Asp Leu Val Ala Gln Ala405 410 415 1248 acc ccg ctg atc gac ccg tca ccg tac cgc ttc gag cgt ttc gcc tgaThr Pro Leu Ile Asp Pro Ser Pro Tyr Arg Phe Glu Arg Phe Ala420 425 430 1296 <210> 4 <211> 431 <212> PRT <213> Pseudomonas putida U <400> 4 Met Ser Pro Thr Ile Ala Pro Val Gln Thr Ser Thr Arg His Pro Asp1 5 10 15 Ala Thr Thr Val Val Ile Ile Gly Gly Gly Ile Ile Gly Leu Thr Ala20 25 30 Ala Leu Ser Leu Ala Glu Arg Asn Ile Pro Val Val Val Leu Glu Lys35 40 45 Gly Arg Ile Ala Gly Glu Gln Ser Ser Arg Asn Leu Gly Trp Val Arg50 55 60 Lys Thr Asn Arg His Ala His Asp Ile Pro Leu Ala Leu Ala Ala Asp65 70 75 80 Arg Leu Trp Ala Glu Met Pro Ala Arg Val Gly Ser Asp Val Gly Tyr85 90 95 Arg Gln Ala Gly Ile Met Phe Ile Gly Arg Asn Asp Thr Gln Met Gly100 105 110 Met His Glu Gly Trp Leu Lys Ser Val Glu Ala Leu Gly Leu Asp Ser115 120 125 Arg Leu Leu Ser Thr Arg Glu Ile Thr Arg Met Val Pro Gly Gly Arg 130 135 140 Ala Asp Trp Ala Gly Gly Ile Phe Thr Pro Ser Asp Ala Arg Ala Glu145 150 155 160 Pro Thr Leu Ala Ala Ser Ala Ile Ala Arg Ala Ala Ile Ala Lys Gly165 170 175

Ala Val Val Val Glu Asn Cys Ala Val Arg Thr Leu Val Thr Ala Ala180 185 190

Gly Arg Val Ser Gly Val Val Thr Glu Gln Gly Glu Ile Arg Cys Asp195 200 205

Gln Val Leu Leu Ala Gly Gly Leu Trp Ser Arg Lys Phe Leu Gly Asn210 215 220

Leu Gly Ile Asn Leu Pro Thr Leu Pro Leu Thr Cys Ser Val Leu Arg225 230 235 240

Thr Glu Pro Met Asp Gly Pro Thr Asp Ile Ala Val Gly Ala Pro Asp245 250 255

Phe Ser Phe Arg Lys His Lys Asp Gly Gly Tyr Ile Ile Thr Gln Arg260 265 270

Gly Ala Leu Asp Ala Phe Leu Thr Leu Asp His Leu Leu Leu Gly Lys275 280 285

Arg Tyr Leu Pro Gln Leu Arg Ala Gln Arg Asp Phe Leu Arg Ile Ser290 295 300

Phe Gly Lys Tyr Phe Phe Lys Asp Leu Ala Leu Ala Arg Arg Trp Lys305 310 315 320

Ala Thr Asp Val Thr Pro Phe Glu Arg Val Arg Val Gln Asp Pro His325 330 335

Ala Asn Pro Ala Leu Asn Asp Glu Ala Met Arg Asn Leu Lys Ala Ala340 345 350

Trp Pro Val Phe Glu Gln Ala Arg Ile Ala Ser Ala Trp Ala Gly Thr355 360 365

Ile Asp Val Thr Pro Asp Ser Asn Pro Val Ile Gly Pro Val Ala Ser370 375 380

Ile Pro Gly Leu Thr Leu Ala Thr Gly Phe Ser Gly His Gly Phe Gly385 390 395 400

Thr Ser Pro Ala Ala Gly Gln Leu Ala Ala Asp Leu Val Ala Gln Ala405 410 415

Thr Pro Leu Ile Asp Pro Ser Pro Tyr Arg Phe Glu Arg Phe Ala420 425 430

<210> 5

<211> 1140

<212> DNA

<213> Pseudomonas putida U

<220>

<221> CDS

<222> (1) .. (1140)

<223> TynB <400> 5 atg acc ctc gat acc aga att gac ttc atc tac ctg tcc gag cag gac 48 Met Thr Leu Asp Thr Arg Ile Asp Phe Ile Tyr Leu Ser Glu Gln Asp1 5 10 15

atg atc cgt gcc ggc gtc acc gac atg ccg gcc tgt gtc gac acc atg 96 Met Ile Arg Ala Gly Val Thr Asp Met Pro Ala Cys Val Asp Thr Met

gaa gaa atg ttc ggc ctg ctg tac cag ggt gac tac cgc atg gcc ggc 144 Glu Glu Met Phe Gly Leu Leu Tyr Gln Gly Asp Tyr Arg Met Ala Gly

ccg aac agc gac tcg cac ggc gcg atg att acc ttc ccc gag cac tcg 192 Pro Asn Ser Asp Ser His Gly Ala Met Ile Thr Phe Pro Glu His Ser

50 55 60

cca ttc ccc aac atg ccc aag ccc acc gcc gac cgg cgc atg atg gcg 240 Pro Phe Pro Asn Met Pro Lys Pro Thr Ala Asp Arg Arg Met Met Ala65 70 75 80

atg ccg gcg tac ctc ggc ggc aac ttc cag acc gca ggc gcc aag tgg 288 Met Pro Ala Tyr Leu Gly Gly Asn Phe Gln Thr Ala Gly Ala Lys Trp

85 90 95

tac ggc tcg aac atc gcc aac cgt gag aaa ggc ctg ccc cgc tcg atc 336 Tyr Gly Ser Asn Ile Ala Asn Arg Glu Lys Gly Leu Pro Arg Ser Ile

100 105 110

ctg atg ttc acc ctc aac gat gcc gac acc ggc gcg cca ctg gca cac 384 Leu Met Phe Thr Leu Asn Asp Ala Asp Thr Gly Ala Pro Leu Ala His

115 120 125

atg tcg gcc aac ctg ctg tcg gcc tac cgc acc ggc gcc atc ccg ggc 432 Met Ser Ala Asn Leu Leu Ser Ala Tyr Arg Thr Gly Ala Ile Pro Gly

130 135 140

gtc ggc gcc cgc cac ctg gcg cgc aag gac gcc aag gtg atc ggc ctg 480 Val Gly Ala Arg His Leu Ala Arg Lys Asp Ala Lys Val Ile Gly Leu145 150 155 160

gcc ggc ccg ggg gtg atg ggc aag acc acg gtc gct gcg ttc atg gcc 528 Ala Gly Pro Gly Val Met Gly Lys Thr Thr Val Ala Ala Phe Met Ala

165 170 175

gtc tgc ccg ctg atc gac acc ctc aag gtc aag ggc cgt ggc cag aag 576 Val Cys Pro Leu Ile Asp Thr Leu Lys Val Lys Gly Arg Gly Gln Lys

180 185 190

aac ctg gat gcc ttc gtc gcc tgg gtg aaa acc agc ttc ccg caa atc 624 Asn Leu Asp Ala Phe Val Ala Trp Val Lys Thr Ser Phe Pro Gln Ile

195 200 205

act cat gtc cag gtg gtg gaa acc ctg gaa gaa gtg gta cgc ggt tcc 672 Thr His Val Gln Val Val Glu Thr Leu Glu Glu Val Val Arg Gly Ser

210 215 220

gac ctg gtc agc tac tgc agc tcg ggc gaa gtc ggc gac ccc agc cag 720 Asp Leu Val Ser Tyr Cys Ser Ser Gly Glu Val Gly Asp Pro Ser Gln 225 230 235 240

tac ccg ctg gtg cgc cgc gag tgg gtc aag cct ggc gcg ttc ctg gcc 768 Tyr Pro Leu Val Arg Arg Glu Trp Val Lys Pro Gly Ala Phe Leu Ala

245 250 255

atg cca gcc ccg tgc agt atc gac gcc ggc atg gag cag gac gac gtg 816 Met Pro Ala Pro Cys Ser Ile Asp Ala Gly Met Glu Gln Asp Asp Val

260 265 270

cgc aag gtg gtg gac aac acc ggc ctc tac gag gcc tgg ttc gaa gag 864 Arg Lys Val Val Asp Asn Thr Gly Leu Tyr Glu Ala Trp Phe Glu Glu

275 280 285

ctg ccc aag cct gcg cac aac cac gta ccg ctg gta ggt gtg cgc ttc 912 Leu Pro Lys Pro Ala His Asn His Val Pro Leu Val Gly Val Arg Phe

290 295 300

atg gac atg att gcc gaa ggc acg ctg gcc gcc gag cag gtg gaa gac 960 Met Asp Met Ile Ala Glu Gly Thr Leu Ala Ala Glu Gln Val Glu Asp305 310 315 320

atc ggc aag atc atc agc ggc gac gca ccg ggc cgc ctg cat gac gac 1008 Ile Gly Lys Ile Ile Ser Gly Asp Ala Pro Gly Arg Leu His Asp Asp

325 330 335

gaa atc atc ctg atg tcg gtg ggc ggc atg ccc gtc gaa gac gtg gcc 1056 Glu Ile Ile Leu Met Ser Val Gly Gly Met Pro Val Glu Asp Val Ala

340 345 350

tgg ggc acc gtg gtg tac cgc aag gcg ctc gag caa ggc atc ggc gta 1104 Trp Gly Thr Val Val Tyr Arg Lys Ala Leu Glu Gln Gly Ile Gly Val

355 360 365

aag ctc aac ctc tgg gaa acc ccc gtt ctc agc tga 1140 Lys Leu Asn Leu Trp Glu Thr Pro Val Leu Ser

370 375

<210> 6

<211> 379

<212> PRT

<213> Pseudomonas putida U

<400> 6

Met Thr Leu Asp Thr Arg Ile Asp Phe Ile Tyr Leu Ser Glu Gln Asp1 5 10 15

Met Ile Arg Ala Gly Val Thr Asp Met Pro Ala Cys Val Asp Thr Met20 25 30

Glu Glu Met Phe Gly Leu Leu Tyr Gln Gly Asp Tyr Arg Met Ala Gly35 40 45

Pro Asn Ser Asp Ser His Gly Ala Met Ile Thr Phe Pro Glu His Ser50 55 60

Pro Phe Pro Asn Met Pro Lys Pro Thr Ala Asp Arg Arg Met Met Ala 65 70 75 80

Met Pro Ala Tyr Leu Gly Gly Asn Phe Gln Thr Ala Gly Ala Lys Trp85 90 95

Tyr Gly Ser Asn Ile Ala Asn Arg Glu Lys Gly Leu Pro Arg Ser Ile100 105 110

Leu Met Phe Thr Leu Asn Asp Ala Asp Thr Gly Ala Pro Leu Ala His115 120 125

Met Ser Ala Asn Leu Leu Ser Ala Tyr Arg Thr Gly Ala Ile Pro Gly130 135 140

Val Gly Ala Arg His Leu Ala Arg Lys Asp Ala Lys Val Ile Gly Leu145 150 155 160

Ala Gly Pro Gly Val Met Gly Lys Thr Thr Val Ala Ala Phe Met Ala165 170 175

Val Cys Pro Leu Ile Asp Thr Leu Lys Val Lys Gly Arg Gly Gln Lys180 185 190

Asn Leu Asp Ala Phe Val Ala Trp Val Lys Thr Ser Phe Pro Gln Ile195 200 205

Thr His Val Gln Val Val Glu Thr Leu Glu Glu Val Val Arg Gly Ser210 215 220

Asp Leu Val Ser Tyr Cys Ser Ser Gly Glu Val Gly Asp Pro Ser Gln225 230 235 240

Tyr Pro Leu Val Arg Arg Glu Trp Val Lys Pro Gly Ala Phe Leu Ala245 250 255

Met Pro Ala Pro Cys Ser Ile Asp Ala Gly Met Glu Gln Asp Asp Val260 265 270

Arg Lys Val Val Asp Asn Thr Gly Leu Tyr Glu Ala Trp Phe Glu Glu275 280 285

Leu Pro Lys Pro Ala His Asn His Val Pro Leu Val Gly Val Arg Phe290 295 300

Met Asp Met Ile Ala Glu Gly Thr Leu Ala Ala Glu Gln Val Glu Asp305 310 315 320

Ile Gly Lys Ile Ile Ser Gly Asp Ala Pro Gly Arg Leu His Asp Asp325 330 335

Glu Ile Ile Leu Met Ser Val Gly Gly Met Pro Val Glu Asp Val Ala340 345 350

Trp Gly Thr Val Val Tyr Arg Lys Ala Leu Glu Gln Gly Ile Gly Val355 360 365

Lys Leu Asn Leu Trp Glu Thr Pro Val Leu Ser370 375

<210> 7

<211> 1488

<212> DNA

<213> Pseudomonas putida U

<220>

<221> CDS

<222> (1) .. (1488)

<223> TynC

<400> 7 atg agc gac atc acc ctc cta cct gcc gtc acg gcc ttc ctg gcc cgc 48 Met Ser Asp Ile Thr Leu Leu Pro Ala Val Thr Ala Phe Leu Ala Arg1 5 10 15

gag cat ggc gtg ttc atc cac ggc cag cac ctc gcc agc cag tcg tcg 96 Glu His Gly Val Phe Ile His Gly Gln His Leu Ala Ser Gln Ser Ser

tcg aac att gcc gtg gtc aac ccg gcc aac ggc cag acc atc gcc cac 144 Ser Asn Ile Ala Val Val Asn Pro Ala Asn Gly Gln Thr Ile Ala His

atc gcc gac gcc aac cag gcc gat gtc gac cat gcc gtc agc tcc tcg 192 Ile Ala Asp Ala Asn Gln Ala Asp Val Asp His Ala Val Ser Ser Ser

50 55 60

cgc caa ggc ttt acc gcc tgg tcg cac acc agc ccc gcc gcc cgc gcc 240 Arg Gln Gly Phe Thr Ala Trp Ser His Thr Ser Pro Ala Ala Arg Ala65 70 75 80

gca gtg ctg ttc aag ctg gcc gac ctg ctg gaa gcc aac cgc gaa gaa 288 Ala Val Leu Phe Lys Leu Ala Asp Leu Leu Glu Ala Asn Arg Glu Glu

85 90 95

ctg gcg cag ctg gaa acc ttg caa tcc ggc aag ctc atc ggc att tcc 336 Leu Ala Gln Leu Glu Thr Leu Gln Ser Gly Lys Leu Ile Gly Ile Ser

100 105 110

cgt gcg ttc gaa gta cag cag gcc gcg cac ttc ctg cgc tac tac gcc 384 Arg Ala Phe Glu Val Gln Gln Ala Ala His Phe Leu Arg Tyr Tyr Ala

115 120 125

ggc tgg gcg acc aag atc acc ggc cag acc atc acc ccg tcg ctg ccc 432 Gly Trp Ala Thr Lys Ile Thr Gly Gln Thr Ile Thr Pro Ser Leu Pro

130 135 140

tcg ttc gcc ggt gag cgc tac agc gcc ttc acc ctg cgc gag ccg att 480 Ser Phe Ala Gly Glu Arg Tyr Ser Ala Phe Thr Leu Arg Glu Pro Ile145 150 155 160

ggc gtg gtg gtg ggc atc gtg ccg tgg aac ttc gcc agc atg atc gcc 528 Gly Val Val Val Gly Ile Val Pro Trp Asn Phe Ala Ser Met Ile Ala

165 170 175

atc tgg aag ctg gcc tcg gcc ctg aca acc ggc tgc agc att atc ctc 576 Ile Trp Lys Leu Ala Ser Ala Leu Thr Thr Gly Cys Ser Ile Ile Leu

180 185 190 aag ccc agc gaa ttc acc ccg ctg acg ctg ctg cgc att gcc gaa ctg 624 Lys Pro Ser Glu Phe Thr Pro Leu Thr Leu Leu Arg Ile Ala Glu Leu

195 200 205

gcc acc gaa gca ggc ctg ccg gcc ggt gca ctg aac gtg ctg acc ggt 672 Ala Thr Glu Ala Gly Leu Pro Ala Gly Ala Leu Asn Val Leu Thr Gly

210 215 220

ggt ggc ctg gtg ggc aag gca ctg atc gag cac gca ggt acc gac aag 720 Gly Gly Leu Val Gly Lys Ala Leu Ile Glu His Ala Gly Thr Asp Lys225 230 235 240

gtc tcg ttc acc ggc tcc gtg ccc act ggc atc gcc gta ggc cag gct 768 Val Ser Phe Thr Gly Ser Val Pro Thr Gly Ile Ala Val Gly Gln Ala

245 250 255

gcc atg ggc gcc aag ctg acc cgt gcg aca ctg gaa ctg ggc ggc aag 816 Ala Met Gly Ala Lys Leu Thr Arg Ala Thr Leu Glu Leu Gly Gly Lys

260 265 270

aac gcg gtg gcc ttc ctg ccc gac gtg gca acc gac aag gct gtg gac 864 Asn Ala Val Ala Phe Leu Pro Asp Val Ala Thr Asp Lys Ala Val Asp

275 280 285

ggc atc atc gag gcc ggc ttc ctg cac tcg ggg caa atc tgc gcc gcg 912 Gly Ile Ile Glu Ala Gly Phe Leu His Ser Gly Gln Ile Cys Ala Ala

290 295 300

ggc gag cgc ttc tat gtg cac cgc tcg cgc atc gac ccg ctg ctg gac 960 Gly Glu Arg Phe Tyr Val His Arg Ser Arg Ile Asp Pro Leu Leu Asp305 310 315 320

gcc ctg tcc cag cgc ctg ggc caa ctg aaa atc ggc tcg cca ctg gac 1008 Ala Leu Ser Gln Arg Leu Gly Gln Leu Lys Ile Gly Ser Pro Leu Asp

325 330 335

gag gcc acc cag ttc ggc ccc gtg gcc aac aag ccg cac cag cag aag 1056 Glu Ala Thr Gln Phe Gly Pro Val Ala Asn Lys Pro His Gln Gln Lys

340 345 350

ctg gcc gaa ctg ttc gcc atc gcc cgc gcc gaa ggc agc cag atc att 1104 Leu Ala Glu Leu Phe Ala Ile Ala Arg Ala Glu Gly Ser Gln Ile Ile

355 360 365

cac ggc ggc acc ctg ggc gac ggc ccc ggc tgt ttc gtc gaa ccc acg 1152 His Gly Gly Thr Leu Gly Asp Gly Pro Gly Cys Phe Val Glu Pro Thr

370 375 380

gta atc ctc gcc cgg tca gcc tgc gac acc ctg ctc acc cag gaa acc 1200 Val Ile Leu Ala Arg Ser Ala Cys Asp Thr Leu Leu Thr Gln Glu Thr385 390 395 400

ttc ggc cct gtg gcg acc ttc ctg cct tat gac gac gaa gac gaa ctg 1248 Phe Gly Pro Val Ala Thr Phe Leu Pro Tyr Asp Asp Glu Asp Glu Leu

405 410 415

ctg cac ctg atg aac gcc tcg ccc tac ggc ctc agc gcc agc ctg tgg 1296 Leu His Leu Met Asn Ala Ser Pro Tyr Gly Leu Ser Ala Ser Leu Trp

420 425 430 acc aac gac ctg ggc aag gcc atg cgc atg atc ccg caa atc cag gccThr Asn Asp Leu Gly Lys Ala Met Arg Met Ile Pro Gln Ile Gln Ala435 440 445 1344 ggt acc ctg tgg gtc aac atg cac acc ctg ctc gac ccg gct gta ccgGly Thr Leu Trp Val Asn Met His Thr Leu Leu Asp Pro Ala Val Pro 450 455 460 1392 ttt ggg ggc atc aag gct tcc ggc att ggc cgc gag ttc ggc tcg gccPhe Gly Gly Ile Lys Ala Ser Gly Ile Gly Arg Glu Phe Gly Ser Ala465 470 475 480 1440 ttc atc gat gac ttc acc gag ctc aag tcg gtg atg atc cgc tac tgaPhe Ile Asp Asp Phe Thr Glu Leu Lys Ser Val Met Ile Arg Tyr485 490 495 1488 <210> 8 <211> 495 <212> PRT <213> Pseudomonas putida U <400> 8 Met Ser Asp Ile Thr Leu Leu Pro Ala Val Thr Ala Phe Leu Ala Arg1 5 10 15 Glu His Gly Val Phe Ile His Gly Gln His Leu Ala Ser Gln Ser Ser20 25 30 Ser Asn Ile Ala Val Val Asn Pro Ala Asn Gly Gln Thr Ile Ala His35 40 45 Ile Ala Asp Ala Asn Gln Ala Asp Val Asp His Ala Val Ser Ser Ser50 55 60 Arg Gln Gly Phe Thr Ala Trp Ser His Thr Ser Pro Ala Ala Arg Ala65 70 75 80 Ala Val Leu Phe Lys Leu Ala Asp Leu Leu Glu Ala Asn Arg Glu Glu85 90 95 Leu Ala Gln Leu Glu Thr Leu Gln Ser Gly Lys Leu Ile Gly Ile Ser100 105 110 Arg Ala Phe Glu Val Gln Gln Ala Ala His Phe Leu Arg Tyr Tyr Ala115 120 125 Gly Trp Ala Thr Lys Ile Thr Gly Gln Thr Ile Thr Pro Ser Leu Pro130 135 140 Ser Phe Ala Gly Glu Arg Tyr Ser Ala Phe Thr Leu Arg Glu Pro Ile145 150 155 160 Gly Val Val Val Gly Ile Val Pro Trp Asn Phe Ala Ser Met Ile Ala165 170 175

Ile Trp Lys Leu Ala Ser Ala Leu Thr Thr Gly Cys Ser Ile Ile Leu180 185 190

Lys Pro Ser Glu Phe Thr Pro Leu Thr Leu Leu Arg Ile Ala Glu Leu195 200 205

Ala Thr Glu Ala Gly Leu Pro Ala Gly Ala Leu Asn Val Leu Thr Gly210 215 220

Gly Gly Leu Val Gly Lys Ala Leu Ile Glu His Ala Gly Thr Asp Lys225 230 235 240

Val Ser Phe Thr Gly Ser Val Pro Thr Gly Ile Ala Val Gly Gln Ala245 250 255

Ala Met Gly Ala Lys Leu Thr Arg Ala Thr Leu Glu Leu Gly Gly Lys260 265 270

Asn Ala Val Ala Phe Leu Pro Asp Val Ala Thr Asp Lys Ala Val Asp275 280 285

Gly Ile Ile Glu Ala Gly Phe Leu His Ser Gly Gln Ile Cys Ala Ala290 295 300

Gly Glu Arg Phe Tyr Val His Arg Ser Arg Ile Asp Pro Leu Leu Asp305 310 315 320

Ala Leu Ser Gln Arg Leu Gly Gln Leu Lys Ile Gly Ser Pro Leu Asp325 330 335

Glu Ala Thr Gln Phe Gly Pro Val Ala Asn Lys Pro His Gln Gln Lys340 345 350

Leu Ala Glu Leu Phe Ala Ile Ala Arg Ala Glu Gly Ser Gln Ile Ile355 360 365

His Gly Gly Thr Leu Gly Asp Gly Pro Gly Cys Phe Val Glu Pro Thr370 375 380

Val Ile Leu Ala Arg Ser Ala Cys Asp Thr Leu Leu Thr Gln Glu Thr385 390 395 400

Phe Gly Pro Val Ala Thr Phe Leu Pro Tyr Asp Asp Glu Asp Glu Leu405 410 415

Leu His Leu Met Asn Ala Ser Pro Tyr Gly Leu Ser Ala Ser Leu Trp420 425 430

Thr Asn Asp Leu Gly Lys Ala Met Arg Met Ile Pro Gln Ile Gln Ala435 440 445

Gly Thr Leu Trp Val Asn Met His Thr Leu Leu Asp Pro Ala Val Pro450 455 460

Phe Gly Gly Ile Lys Ala Ser Gly Ile Gly Arg Glu Phe Gly Ser Ala465 470 475 480

Phe Ile Asp Asp Phe Thr Glu Leu Lys Ser Val Met Ile Arg Tyr

485 490 495

<210> 9

<211> 942

<212> DNA

<213> Pseudomonas putida U

<220>

<221> CDS

<222> (1) .. (942)

<223> TynR

<400> 9 atg cat act caa caa tcc aac cgt cag ggg ctg gaa cgc tgg acc acg 48 Met His Thr Gln Gln Ser Asn Arg Gln Gly Leu Glu Arg Trp Thr Thr1 5 10 15

gcc atg caa cag atc tgt ggc cgt ttc gag acg gaa ctt gcg tcc aat 96 Ala Met Gln Gln Ile Cys Gly Arg Phe Glu Thr Glu Leu Ala Ser Asn

cac tcg ctg ttc atc ggc gag gtt tct acc ttt tcc cgt gcc ggc ttg 144 His Ser Leu Phe Ile Gly Glu Val Ser Thr Phe Ser Arg Ala Gly Leu

ccg ctg gcc aac ctg cgc acc aat gcc ggc aac atc cgc cgg ctg ggc 192 Pro Leu Ala Asn Leu Arg Thr Asn Ala Gly Asn Ile Arg Arg Leu Gly

50 55 60

gaa aac ccg acc ctt gac gat gac cag cat tgt ttc ctg gtc agc cag 240 Glu Asn Pro Thr Leu Asp Asp Asp Gln His Cys Phe Leu Val Ser Gln65 70 75 80

cgt gcg ggg cat tcc acc gtg tcc cag ggg ggc atg cag gtc agc ctg 288 Arg Ala Gly His Ser Thr Val Ser Gln Gly Gly Met Gln Val Ser Leu

85 90 95

gcg ccg ggt gag ctg ctg ctg atg gat tcg gtc ggg cgc tgc gaa atc 336 Ala Pro Gly Glu Leu Leu Leu Met Asp Ser Val Gly Arg Cys Glu Ile

100 105 110

acc ccc agt ggg ttg atc gaa cat gtc tcg ctg gcc ctg tcg cgt gag 384 Thr Pro Ser Gly Leu Ile Glu His Val Ser Leu Ala Leu Ser Arg Glu

115 120 125

cag gta cgc aag tat gtg caa ggc agc ggc ccg atg ttt ggc aag atc 432 Gln Val Arg Lys Tyr Val Gln Gly Ser Gly Pro Met Phe Gly Lys Ile

130 135 140

tcc tcg agc aac gcc tgc ggg cgc atg ctg cat gtg ctg atg gac caa 480 Ser Ser Ser Asn Ala Cys Gly Arg Met Leu His Val Leu Met Asp Gln145 150 155 160

ctg tgc aag gac ggc aat gta agc ggt gat ggg gcc cag ggc gac gcg 528 Leu Cys Lys Asp Gly Asn Val Ser Gly Asp Gly Ala Gln Gly Asp Ala

165 170 175

ctg cag acc gcc ttc att gcc ctg ctg gag cca ggc ttc gag cgc cat 576

Leu Gln Thr Ala Phe Ile Ala Leu Leu Glu Pro Gly Phe Glu Arg His180 185 190

ggc gaa gcg ctg ggc aac ctt ggg gcc ttg aac ggg gcc aac ctg cgg 624 Gly Glu Ala Leu Gly Asn Leu Gly Ala Leu Asn Gly Ala Asn Leu Arg

195 200 205

ggc tac gtg cag cag gtg atc gac gag tcc ctg tca cag ccc ggg ctg 672 Gly Tyr Val Gln Gln Val Ile Asp Glu Ser Leu Ser Gln Pro Gly Leu

210 215 220

acc ccg tcc aac ctg gcc ggt cgc ctg aac atc tcg gtg cgt cac ctg 720 Thr Pro Ser Asn Leu Ala Gly Arg Leu Asn Ile Ser Val Arg His Leu225 230 235 240

tac cgg ctg ttc gag gag gag ggc gat agt gtg tgc cgc tac att cag 768 Tyr Arg Leu Phe Glu Glu Glu Gly Asp Ser Val Cys Arg Tyr Ile Gln

245 250 255

cgg gcg cgc ctg aag cgc agt gcg gat gac ctg gcc aac ccg ttc ttc 816 Arg Ala Arg Leu Lys Arg Ser Ala Asp Asp Leu Ala Asn Pro Phe Phe

260 265 270

agg agc gag tcg att acc tcg att gcc tac aag tgg ggg ttt acc gac 864 Arg Ser Glu Ser Ile Thr Ser Ile Ala Tyr Lys Trp Gly Phe Thr Asp

275 280 285

tcg gcg cat ttc agc cgc tcg ttc aag aaa cag ttc gaa cgc tcg ccc 912 Ser Ala His Phe Ser Arg Ser Phe Lys Lys Gln Phe Glu Arg Ser Pro

290 295 300

aag gac tac cgg gcg cag gcg atg gtt tga 942 Lys Asp Tyr Arg Ala Gln Ala Met Val305 310

<210> 10

<211> 313

<212> PRT

<213> Pseudomonas putida U

<400> 10

Met His Thr Gln Gln Ser Asn Arg Gln Gly Leu Glu Arg Trp Thr Thr1 5 10 15

Ala Met Gln Gln Ile Cys Gly Arg Phe Glu Thr Glu Leu Ala Ser Asn20 25 30

His Ser Leu Phe Ile Gly Glu Val Ser Thr Phe Ser Arg Ala Gly Leu35 40 45

Pro Leu Ala Asn Leu Arg Thr Asn Ala Gly Asn Ile Arg Arg Leu Gly50 55 60

Glu Asn Pro Thr Leu Asp Asp Asp Gln His Cys Phe Leu Val Ser Gln65 70 75 80

Arg Ala Gly His Ser Thr Val Ser Gln Gly Gly Met Gln Val Ser Leu

85 90 95

Ala Pro Gly Glu Leu Leu Leu Met Asp Ser Val Gly Arg Cys Glu Ile100 105 110

Thr Pro Ser Gly Leu Ile Glu His Val Ser Leu Ala Leu Ser Arg Glu115 120 125

Gln Val Arg Lys Tyr Val Gln Gly Ser Gly Pro Met Phe Gly Lys Ile130 135 140

Ser Ser Ser Asn Ala Cys Gly Arg Met Leu His Val Leu Met Asp Gln145 150 155 160

Leu Cys Lys Asp Gly Asn Val Ser Gly Asp Gly Ala Gln Gly Asp Ala165 170 175

Leu Gln Thr Ala Phe Ile Ala Leu Leu Glu Pro Gly Phe Glu Arg His180 185 190

Gly Glu Ala Leu Gly Asn Leu Gly Ala Leu Asn Gly Ala Asn Leu Arg195 200 205

Gly Tyr Val Gln Gln Val Ile Asp Glu Ser Leu Ser Gln Pro Gly Leu 210 215 220

Thr Pro Ser Asn Leu Ala Gly Arg Leu Asn Ile Ser Val Arg His Leu225 230 235 240

Tyr Arg Leu Phe Glu Glu Glu Gly Asp Ser Val Cys Arg Tyr Ile Gln245 250 255

Arg Ala Arg Leu Lys Arg Ser Ala Asp Asp Leu Ala Asn Pro Phe Phe260 265 270

Arg Ser Glu Ser Ile Thr Ser Ile Ala Tyr Lys Trp Gly Phe Thr Asp275 280 285

Ser Ala His Phe Ser Arg Ser Phe Lys Lys Gln Phe Glu Arg Ser Pro290 295 300

Lys Asp Tyr Arg Ala Gln Ala Met Val305 310

<210> 11

<211> 1335

<212> DNA

<213> Pseudomonas putida U

<220>

<221> CDS

<222> (1) .. (1335)

<223> TynD

<400> 11 atg att tct acc cat ttc cag tca gca ggc ggc gtc atg atc acc ctc 48 Met Ile Ser Thr His Phe Gln Ser Ala Gly Gly Val Met Ile Thr Leu1 5 10 15 gag agc ccc acc tac tac tcg gcg acc aag aag tac aac ctg agc ttc 96 Glu Ser Pro Thr Tyr Tyr Ser Ala Thr Lys Lys Tyr Asn Leu Ser Phe

cca acc ctg gaa cgc gac atc gag gcc gat gtc gtg gtg atc ggt ggc 144 Pro Thr Leu Glu Arg Asp Ile Glu Ala Asp Val Val Val Ile Gly Gly

ggt ttc tcc ggc att aac acc gcc ctg gag ctg gcg gaa caa ggc gtg 192 Gly Phe Ser Gly Ile Asn Thr Ala Leu Glu Leu Ala Glu Gln Gly Val

50 55 60

acc aat atc gtc gtg ctc gag ggc cgt tat ctg ggc tac ggc ggt agc 240 Thr Asn Ile Val Val Leu Glu Gly Arg Tyr Leu Gly Tyr Gly Gly Ser65 70 75 80

ggg cgc aac ggc ggg cag atc atg gcg ggc atc ggc cac gac ctg gag 288 Gly Arg Asn Gly Gly Gln Ile Met Ala Gly Ile Gly His Asp Leu Glu

85 90 95

aag atc cgc agc agc gtc ggt gat cag ggc gtg agg gat att ttc gaa 336 Lys Ile Arg Ser Ser Val Gly Asp Gln Gly Val Arg Asp Ile Phe Glu

100 105 110

atc agc gaa ctg ggc gcc ggc atc atc aag gac cgc atc gcc cgc tac 384 Ile Ser Glu Leu Gly Ala Gly Ile Ile Lys Asp Arg Ile Ala Arg Tyr

115 120 125

gcc atc gac gcc gac ttc tgc cac ggc tat ggc tac atg ggc ttc aac 432 Ala Ile Asp Ala Asp Phe Cys His Gly Tyr Gly Tyr Met Gly Phe Asn

130 135 140

cgc cgc cag gaa cag acc ctg cgc aaa tgg gag aag gcg ttc aag gcg 480 Arg Arg Gln Glu Gln Thr Leu Arg Lys Trp Glu Lys Ala Phe Lys Ala145 150 155 160

atc aat acc cgc gac gag atc cgt ttt ctc ggt ggc agc gaa gtg cgc 528 Ile Asn Thr Arg Asp Glu Ile Arg Phe Leu Gly Gly Ser Glu Val Arg

165 170 175

cag atc atc ggc agc aac gcc tac agc agc gcc ctg atg cac atg ggc 576 Gln Ile Ile Gly Ser Asn Ala Tyr Ser Ser Ala Leu Met His Met Gly

180 185 190

ggt ggc cat gtg cat tcc ctg aac ctg ctg ctg ggc gaa gcg cag gca 624 Gly Gly His Val His Ser Leu Asn Leu Leu Leu Gly Glu Ala Gln Ala

195 200 205

ctg gtg ggc cat ggt gcg cgc atc ttc gaa cac agc ccg gcc ctg gaa 672 Leu Val Gly His Gly Ala Arg Ile Phe Glu His Ser Pro Ala Leu Glu

210 215 220

gtg acc tac ggc gag cgc atc acg gta cgc acc ggc cgt ggc tcg gta 720 Val Thr Tyr Gly Glu Arg Ile Thr Val Arg Thr Gly Arg Gly Ser Val225 230 235 240

cgc gcc agc aag ctg ctg tgg gcg tgc gac agc ttc ctc aac aag ctg 768 Arg Ala Ser Lys Leu Leu Trp Ala Cys Asp Ser Phe Leu Asn Lys Leu

245 250 255

gag ccg cag ctg cac gca cgc act ata aac acc tat gcc ttc cag atg 816 Glu Pro Gln Leu His Ala Arg Thr Ile Asn Thr Tyr Ala Phe Gln Met

260 265 270

atg acc gag cca ttg ccg gat gag ctg atc gag cgc atc agc ccg ata 864 Met Thr Glu Pro Leu Pro Asp Glu Leu Ile Glu Arg Ile Ser Pro Ile

275 280 285

cgc ggg gcc tac agc gac atc cgc ccg gtg atc gac tac tac cgg gtc 912 Arg Gly Ala Tyr Ser Asp Ile Arg Pro Val Ile Asp Tyr Tyr Arg Val

290 295 300

acc cgc gag aac cgc ctg ctg ttt ggc gcc gcc acg ccc ttc gtc gag 960 Thr Arg Glu Asn Arg Leu Leu Phe Gly Ala Ala Thr Pro Phe Val Glu305 310 315 320

cac ttc ccg ctg gac ctg aag gcg tgg aac cgc gcg ctg atg ctg aag 1008 His Phe Pro Leu Asp Leu Lys Ala Trp Asn Arg Ala Leu Met Leu Lys

325 330 335

att ttc ccc tac ctg aaa gac gtg cgc atc gac ctg gcc tgg ggc ggc 1056 Ile Phe Pro Tyr Leu Lys Asp Val Arg Ile Asp Leu Ala Trp Gly Gly

340 345 350

ccg atg gcc acc agt gcc aac ctg ttt ccg cag ata ggc acc ctc gac 1104 Pro Met Ala Thr Ser Ala Asn Leu Phe Pro Gln Ile Gly Thr Leu Asp

355 360 365

aac cgc ccc aac gct ttc tat gtg cag ggc tac tcc ggc ttt ggc gtc 1152 Asn Arg Pro Asn Ala Phe Tyr Val Gln Gly Tyr Ser Gly Phe Gly Val

370 375 380

acg ccc agc cac atc att tgc aag ata ctc gcc gag ggt atg cag gag 1200 Thr Pro Ser His Ile Ile Cys Lys Ile Leu Ala Glu Gly Met Gln Glu385 390 395 400

gga tcg aag cgc tat gac ctg gtc agc tcg gtc aag cat gcg cgc atc 1248 Gly Ser Lys Arg Tyr Asp Leu Val Ser Ser Val Lys His Ala Arg Ile

405 410 415

ctc ggc aag gac cat ttc cgc ccg ctg ctg ctc act gcc ggc aag acc 1296 Leu Gly Lys Asp His Phe Arg Pro Leu Leu Leu Thr Ala Gly Lys Thr

420 425 430

gtg cac cag ctg tcg ggc tac ttc aac ggc cgt cgc tga 1335 Val His Gln Leu Ser Gly Tyr Phe Asn Gly Arg Arg

435 440

<210> 12

<211> 444

<212> PRT

<213> Pseudomonas putida U

<400> 12

Met Ile Ser Thr His Phe Gln Ser Ala Gly Gly Val Met Ile Thr Leu 1 5 10 15

Glu Ser Pro Thr Tyr Tyr Ser Ala Thr Lys Lys Tyr Asn Leu Ser Phe20 25 30

Pro Thr Leu Glu Arg Asp Ile Glu Ala Asp Val Val Val Ile Gly Gly35 40 45

Gly Phe Ser Gly Ile Asn Thr Ala Leu Glu Leu Ala Glu Gln Gly Val50 55 60

Thr Asn Ile Val Val Leu Glu Gly Arg Tyr Leu Gly Tyr Gly Gly Ser65 70 75 80

Gly Arg Asn Gly Gly Gln Ile Met Ala Gly Ile Gly His Asp Leu Glu85 90 95

Lys Ile Arg Ser Ser Val Gly Asp Gln Gly Val Arg Asp Ile Phe Glu100 105 110

Ile Ser Glu Leu Gly Ala Gly Ile Ile Lys Asp Arg Ile Ala Arg Tyr115 120 125

Ala Ile Asp Ala Asp Phe Cys His Gly Tyr Gly Tyr Met Gly Phe Asn130 135 140

Arg Arg Gln Glu Gln Thr Leu Arg Lys Trp Glu Lys Ala Phe Lys Ala145 150 155 160

Ile Asn Thr Arg Asp Glu Ile Arg Phe Leu Gly Gly Ser Glu Val Arg165 170 175

Gln Ile Ile Gly Ser Asn Ala Tyr Ser Ser Ala Leu Met His Met Gly180 185 190

Gly Gly His Val His Ser Leu Asn Leu Leu Leu Gly Glu Ala Gln Ala195 200 205

Leu Val Gly His Gly Ala Arg Ile Phe Glu His Ser Pro Ala Leu Glu210 215 220

Val Thr Tyr Gly Glu Arg Ile Thr Val Arg Thr Gly Arg Gly Ser Val225 230 235 240

Arg Ala Ser Lys Leu Leu Trp Ala Cys Asp Ser Phe Leu Asn Lys Leu245 250 255

Glu Pro Gln Leu His Ala Arg Thr Ile Asn Thr Tyr Ala Phe Gln Met260 265 270

Met Thr Glu Pro Leu Pro Asp Glu Leu Ile Glu Arg Ile Ser Pro Ile275 280 285

Arg Gly Ala Tyr Ser Asp Ile Arg Pro Val Ile Asp Tyr Tyr Arg Val290 295 300

Thr Arg Glu Asn Arg Leu Leu Phe Gly Ala Ala Thr Pro Phe Val Glu305 310 315 320

His Phe Pro Leu Asp Leu Lys Ala Trp Asn Arg Ala Leu Met Leu Lys325 330 335

Ile Phe Pro Tyr Leu Lys Asp Val Arg Ile Asp Leu Ala Trp Gly Gly340 345 350

Pro Met Ala Thr Ser Ala Asn Leu Phe Pro Gln Ile Gly Thr Leu Asp355 360 365

Asn Arg Pro Asn Ala Phe Tyr Val Gln Gly Tyr Ser Gly Phe Gly Val370 375 380

Thr Pro Ser His Ile Ile Cys Lys Ile Leu Ala Glu Gly Met Gln Glu385 390 395 400

Gly Ser Lys Arg Tyr Asp Leu Val Ser Ser Val Lys His Ala Arg Ile405 410 415

Leu Gly Lys Asp His Phe Arg Pro Leu Leu Leu Thr Ala Gly Lys Thr420 425 430

Val His Gln Leu Ser Gly Tyr Phe Asn Gly Arg Arg435 440

<210> 13

<211> 1218

<212> DNA

<213> Pseudomonas putida U

<220> <221> CDS

<222> (1) .. (1218)

<223> TynF

<400> 13 atg caa gcc aat ccc tcc cct ccc ata ccc ttc agc ttc gcc ctg ggc 48 Met Gln Ala Asn Pro Ser Pro Pro Ile Pro Phe Ser Phe Ala Leu Gly1 5 10 15

cta ggc ctg atc ggc gcc ctc ggc cct tcc gcc gtc gac atg tac ctg 96 Leu Gly Leu Ile Gly Ala Leu Gly Pro Ser Ala Val Asp Met Tyr Leu

tcg agc ctg ccg gaa atc gcc agc cac tat cag gct agc ttc acc cgc 144 Ser Ser Leu Pro Glu Ile Ala Ser His Tyr Gln Ala Ser Phe Thr Arg

gta cag ctg aca ctg acc ttc ttc ctg ctg gcc atg ggc gcc ggc cag 192 Val Gln Leu Thr Leu Thr Phe Phe Leu Leu Ala Met Gly Ala Gly Gln

50 55 60

ctg atc ttc ggc ccc atc gtc gac gct tat ggc cgg cgc aag ccg ctg 240 Leu Ile Phe Gly Pro Ile Val Asp Ala Tyr Gly Arg Arg Lys Pro Leu65 70 75 80

ctg gcc ggc ctg ctg ctg ttc atc ctg tgc tcg ctg ggc gca gcc gca 288 Leu Ala Gly Leu Leu Leu Phe Ile Leu Cys Ser Leu Gly Ala Ala Ala

85 90 95

gcc ccc agc ctc gac acc ctg atc atg ctg cgc ttt ttc cag ggc ctg 336 Ala Pro Ser Leu Asp Thr Leu Ile Met Leu Arg Phe Phe Gln Gly Leu

100 105 110

ggc agt gcg ctg acc ctg gtg gtg atc atg agc atg gtg cgt gat gtg 384 Gly Ser Ala Leu Thr Leu Val Val Ile Met Ser Met Val Arg Asp Val

115 120 125

agc cag ggc gtg gcc gcg acc aaa ctg ttc gcc ctg ctg atg acc atc 432 Ser Gln Gly Val Ala Ala Thr Lys Leu Phe Ala Leu Leu Met Thr Ile

130 135 140

gaa ggc gtc gca ccg atc ctg gca cct gcc ctg ggc ggc gtg atc gac 480 Glu Gly Val Ala Pro Ile Leu Ala Pro Ala Leu Gly Gly Val Ile Asp145 150 155 160

gca cat ttc ggc tgg cgt gca gta atg ctg gta ctc gcc ggc atg ggc 528 Ala His Phe Gly Trp Arg Ala Val Met Leu Val Leu Ala Gly Met Gly

165 170 175

gtg acg gtg ctg gtc aac agc ctg ctg aac ctg ccc gaa acc ctg ccg 576 Val Thr Val Leu Val Asn Ser Leu Leu Asn Leu Pro Glu Thr Leu Pro

180 185 190

ccc agc aaa cgc gaa ccc ctg cgc ctg ggc cac gcc tgc agc acc tac 624 Pro Ser Lys Arg Glu Pro Leu Arg Leu Gly His Ala Cys Ser Thr Tyr

195 200 205

ctg gcc atc ctc gcc gac cgc cgc ttc ctg cgc ccg acc ctg gcg gtt 672 Leu Ala Ile Leu Ala Asp Arg Arg Phe Leu Arg Pro Thr Leu Ala Val 210 215 220

gct gcg gta ttc ttc ttc ctg ttc gcc tac atc ggc ggt gcc acc ctg 720 Ala Ala Val Phe Phe Phe Leu Phe Ala Tyr Ile Gly Gly Ala Thr Leu225 230 235 240

gtg tac cag gcc cac tac ggc ctg agc gcc cag gcc ttc ggc ctg ctg 768 Val Tyr Gln Ala His Tyr Gly Leu Ser Ala Gln Ala Phe Gly Leu Leu

245 250 255

ttt ggc gcc acc ggg gtg tcg atc ctg ctc ggc gcc atg acg gcc agc 816 Phe Gly Ala Thr Gly Val Ser Ile Leu Leu Gly Ala Met Thr Ala Ser

260 265 270

cac ctg atc agc cgg ctg ggc ctc aat acc ttg act cgg gtg ggc gtg 864 His Leu Ile Ser Arg Leu Gly Leu Asn Thr Leu Thr Arg Val Gly Val

275 280 285

ctg tgc atg gcc ggc ggt gcc tgc atc agc ctg ctc ggt gca ctg acc 912 Leu Cys Met Ala Gly Gly Ala Cys Ile Ser Leu Leu Gly Ala Leu Thr

290 295 300

ggc ctg ggg ctg cca ggt gtg gcc ggc ggc atg gtg ata gcc ctg ttc 960 Gly Leu Gly Leu Pro Gly Val Ala Gly Gly Met Val Ile Ala Leu Phe305 310 315 320

ggc ctg ggg ata gcc gag tcg acg ctg atg tcg ctg gtg atg gcc tcg 1008 Gly Leu Gly Ile Ala Glu Ser Thr Leu Met Ser Leu Val Met Ala Ser

325 330 335

caa gaa aag gca ctg ggt tcc acc gca gcg ctg ctg ggc gcc atc cag 1056 Gln Glu Lys Ala Leu Gly Ser Thr Ala Ala Leu Leu Gly Ala Ile Gln

340 345 350

ctg tcg gcg tct gcc ggc gcc gcc ccg ctg gcc gca gtg gta ctc aac 1104 Leu Ser Ala Ser Ala Gly Ala Ala Pro Leu Ala Ala Val Val Leu Asn

355 360 365

cac ggc ccg acc gca tgg gcc gcg ctg ctg gcc ctg tgc acc ctg gtg 1152 His Gly Pro Thr Ala Trp Ala Ala Leu Leu Ala Leu Cys Thr Leu Val

370 375 380

gtg tgc ctg ctg acc gcc ctc agc ctg cgc cac acc ccg gcc agc ttc 1200 Val Cys Leu Leu Thr Ala Leu Ser Leu Arg His Thr Pro Ala Ser Phe385 390 395 400

tcg ctc gcg ggc cat tga 1218 Ser Leu Ala Gly His

<210> 14

<211> 405

<212> PRT

<213> Pseudomonas putida U

<400> 14

Met Gln Ala Asn Pro Ser Pro Pro Ile Pro Phe Ser Phe Ala Leu Gly 1 5 10 15

Leu Gly Leu Ile Gly Ala Leu Gly Pro Ser Ala Val Asp Met Tyr Leu20 25 30

Ser Ser Leu Pro Glu Ile Ala Ser His Tyr Gln Ala Ser Phe Thr Arg35 40 45

Val Gln Leu Thr Leu Thr Phe Phe Leu Leu Ala Met Gly Ala Gly Gln50 55 60

Leu Ile Phe Gly Pro Ile Val Asp Ala Tyr Gly Arg Arg Lys Pro Leu65 70 75 80

Leu Ala Gly Leu Leu Leu Phe Ile Leu Cys Ser Leu Gly Ala Ala Ala85 90 95

Ala Pro Ser Leu Asp Thr Leu Ile Met Leu Arg Phe Phe Gln Gly Leu100 105 110

Gly Ser Ala Leu Thr Leu Val Val Ile Met Ser Met Val Arg Asp Val115 120 125

Ser Gln Gly Val Ala Ala Thr Lys Leu Phe Ala Leu Leu Met Thr Ile130 135 140

Glu Gly Val Ala Pro Ile Leu Ala Pro Ala Leu Gly Gly Val Ile Asp145 150 155 160

Ala His Phe Gly Trp Arg Ala Val Met Leu Val Leu Ala Gly Met Gly165 170 175

Val Thr Val Leu Val Asn Ser Leu Leu Asn Leu Pro Glu Thr Leu Pro 180 185 190

Pro Ser Lys Arg Glu Pro Leu Arg Leu Gly His Ala Cys Ser Thr Tyr195 200 205

Leu Ala Ile Leu Ala Asp Arg Arg Phe Leu Arg Pro Thr Leu Ala Val210 215 220

Ala Ala Val Phe Phe Phe Leu Phe Ala Tyr Ile Gly Gly Ala Thr Leu225 230 235 240

Val Tyr Gln Ala His Tyr Gly Leu Ser Ala Gln Ala Phe Gly Leu Leu245 250 255

Phe Gly Ala Thr Gly Val Ser Ile Leu Leu Gly Ala Met Thr Ala Ser260 265 270

His Leu Ile Ser Arg Leu Gly Leu Asn Thr Leu Thr Arg Val Gly Val275 280 285

Leu Cys Met Ala Gly Gly Ala Cys Ile Ser Leu Leu Gly Ala Leu Thr290 295 300

Gly Leu Gly Leu Pro Gly Val Ala Gly Gly Met Val Ile Ala Leu Phe305 310 315 320

Gly Leu Gly Ile Ala Glu Ser Thr Leu Met Ser Leu Val Met Ala Ser325 330 335

Gln Glu Lys Ala Leu Gly Ser Thr Ala Ala Leu Leu Gly Ala Ile Gln340 345 350

Leu Ser Ala Ser Ala Gly Ala Ala Pro Leu Ala Ala Val Val Leu Asn355 360 365

His Gly Pro Thr Ala Trp Ala Ala Leu Leu Ala Leu Cys Thr Leu Val370 375 380

Val Cys Leu Leu Thr Ala Leu Ser Leu Arg His Thr Pro Ala Ser Phe385 390 395 400

Ser Leu Ala Gly His405

<210> 15

<211> 1311

<212> DNA

<213> Pseudomonas putida U

<220>

<221> CDS

<222> (1) .. (1311)

<223> TynE

<400> 15 atg gtc aaa cca cag acg ctg tcc agc ctg gcc ctg gca acc ttg ctg 48 Met Val Lys Pro Gln Thr Leu Ser Ser Leu Ala Leu Ala Thr Leu Leu1 5 10 15

gcc agc cag gcc gcg ccg gcc gtt gag ctg tac gcc gac gat gac agc 96 Ala Ser Gln Ala Ala Pro Ala Val Glu Leu Tyr Ala Asp Asp Asp Ser

cac ctg aac gcc gac atg ctg gcg gta tgg ggc atg ttc aac agc cgc 144 His Leu Asn Ala Asp Met Leu Ala Val Trp Gly Met Phe Asn Ser Arg

aag aac tac gac ggc acc aca ggg ggt tcg acc tgg cgt gaa ggc ttt 192 Lys Asn Tyr Asp Gly Thr Thr Gly Gly Ser Thr Trp Arg Glu Gly Phe

50 55 60

atc aag tat ggc ctc agc ggt gac cag ggc ctg gcc ggc aac ggc acg 240 Ile Lys Tyr Gly Leu Ser Gly Asp Gln Gly Leu Ala Gly Asn Gly Thr65 70 75 80

ctg tac ggc agc ctg aac tgg gtg agc tcg gcc acc tgg ggc gat ggc 288 Leu Tyr Gly Ser Leu Asn Trp Val Ser Ser Ala Thr Trp Gly Asp Gly

85 90 95

gat gcg gcc ggc aac acc gat ggc tcc gaa cgc acc acc aag atc gaa 336 Asp Ala Ala Gly Asn Thr Asp Gly Ser Glu Arg Thr Thr Lys Ile Glu

100 105 110

gac gcc ttc ctc ggc tgg cgc tcg gcc gac ctg ttc ccg gtg ctg ggc 384

Asp Ala Phe Leu Gly Trp Arg Ser Ala Asp Leu Phe Pro Val Leu Gly115 120 125

aag gat gga gtg gac gtt tcc gcc ggc cgc cag acc att cgc ctg ggc 432 Lys Asp Gly Val Asp Val Ser Ala Gly Arg Gln Thr Ile Arg Leu Gly

130 135 140

agt ggt ttt ttg atc aac gac gac ggc ccg aac ctg ggc aac ggc gtc 480 Ser Gly Phe Leu Ile Asn Asp Asp Gly Pro Asn Leu Gly Asn Gly Val145 150 155 160

gcc gac ggt gcg ctg gac cgc ggc ggg gcc tac tac ctg gcc gcc cgc 528 Ala Asp Gly Ala Leu Asp Arg Gly Gly Ala Tyr Tyr Leu Ala Ala Arg

165 170 175

cac gcc ttc gac cgc acc gca atg ctg cgc ctg ggg ggc agc gat ggc 576 His Ala Phe Asp Arg Thr Ala Met Leu Arg Leu Gly Gly Ser Asp Gly

180 185 190

ctg cat ggc agc ctg ctg tgg ctg aaa tcc gac aac cgc gcc cag gcc 624 Leu His Gly Ser Leu Leu Trp Leu Lys Ser Asp Asn Arg Ala Gln Ala

195 200 205

gaa acc gaa ctg gcc gcc ggc acg ctg gac tac acc caa gcc ttg ggc 672 Glu Thr Glu Leu Ala Ala Gly Thr Leu Asp Tyr Thr Gln Ala Leu Gly

210 215 220

acc ctc ggg ctg acc tgg att cac ggc atc gac gtc acc gac caa tgg 720 Thr Leu Gly Leu Thr Trp Ile His Gly Ile Asp Val Thr Asp Gln Trp225 230 235 240

gcc agc gac ttt cag aaa gcc cgc gaa ggc atg gac gtg tat agc gtg 768 Ala Ser Asp Phe Gln Lys Ala Arg Glu Gly Met Asp Val Tyr Ser Val

245 250 255

cgc ggc gaa ggc aac gct ggc atc gac aat gcc agt ttc gcc ttc gaa 816 Arg Gly Glu Gly Asn Ala Gly Ile Asp Asn Ala Ser Phe Ala Phe Glu

260 265 270

tac gcc tgg cag gac aag acc gac ggc ccc gag caa gcc tgg tac ctg 864 Tyr Ala Trp Gln Asp Lys Thr Asp Gly Pro Glu Gln Ala Trp Tyr Leu

275 280 285

cag gcc ggc tac acc ttc gcc gac ctg ccg tgg gca ccg cag gtt acc 912 Gln Ala Gly Tyr Thr Phe Ala Asp Leu Pro Trp Ala Pro Gln Val Thr

290 295 300

tac cgc tac acc cgc tac tcg gca ggc tgg gac gcg ctg ttc agc ggc 960 Tyr Arg Tyr Thr Arg Tyr Ser Ala Gly Trp Asp Ala Leu Phe Ser Gly305 310 315 320

ctg tcc agc ggt tac ggc acc tgg ttc cag ggt gaa gtc gct gcc aac 1008 Leu Ser Ser Gly Tyr Gly Thr Trp Phe Gln Gly Glu Val Ala Ala Asn

325 330 335

tac gcc ggc ccc ttc aac agc aac acg ggt atc cac cat gtg ggc gtg 1056 Tyr Ala Gly Pro Phe Asn Ser Asn Thr Gly Ile His His Val Gly Val

340 345 350

aag gcg aca ccg ctg gaa aat ctc aca gtc ggg gcg ctg tac ttc gacLys Ala Thr Pro Leu Glu Asn Leu Thr Val Gly Ala Leu Tyr Phe Asp355 360 365 1104 ttc gac acc gta cgc acc cgc gaa agc ctc aac ctc gat gcg cgg gagPhe Asp Thr Val Arg Thr Arg Glu Ser Leu Asn Leu Asp Ala Arg Glu370 375 380 1152 ctg gac ctg tat gtg gaa tgg gca gtc aac gag cac ctg ata atc agcLeu Asp Leu Tyr Val Glu Trp Ala Val Asn Glu His Leu Ile Ile Ser385 390 395 400 1200 ccg ctg gtg ggc ctt tac cag ccg cgc aag gac gag agc aac ggc ggcPro Leu Val Gly Leu Tyr Gln Pro Arg Lys Asp Glu Ser Asn Gly Gly405 410 415 1248 aac cag gtg ggc ggg aat ggt acc aat gtg tat agc cag ctg acc gtgAsn Gln Val Gly Gly Asn Gly Thr Asn Val Tyr Ser Gln Leu Thr Val420 425 430 1296 gct gtg ccg ttc tgaAla Val Pro Phe 1311 435 <210> 16 <211> 436 <212> PRT <213> Pseudomonas putida U <400> 16 Met Val Lys Pro Gln Thr Leu Ser Ser Leu Ala Leu Ala Thr Leu Leu1 5 10 15 Ala Ser Gln Ala Ala Pro Ala Val Glu Leu Tyr Ala Asp Asp Asp Ser20 25 30 His Leu Asn Ala Asp Met Leu Ala Val Trp Gly Met Phe Asn Ser Arg35 40 45 Lys Asn Tyr Asp Gly Thr Thr Gly Gly Ser Thr Trp Arg Glu Gly Phe50 55 60 Ile Lys Tyr Gly Leu Ser Gly Asp Gln Gly Leu Ala Gly Asn Gly Thr65 70 75 80 Leu Tyr Gly Ser Leu Asn Trp Val Ser Ser Ala Thr Trp Gly Asp Gly85 90 95 Asp Ala Ala Gly Asn Thr Asp Gly Ser Glu Arg Thr Thr Lys Ile Glu100 105 110 Asp Ala Phe Leu Gly Trp Arg Ser Ala Asp Leu Phe Pro Val Leu Gly115 120 125 Lys Asp Gly Val Asp Val Ser Ala Gly Arg Gln Thr Ile Arg Leu Gly130 135 140

Ser Gly Phe Leu Ile Asn Asp Asp Gly Pro Asn Leu Gly Asn Gly Val145 150 155 160

Ala Asp Gly Ala Leu Asp Arg Gly Gly Ala Tyr Tyr Leu Ala Ala Arg165 170 175

His Ala Phe Asp Arg Thr Ala Met Leu Arg Leu Gly Gly Ser Asp Gly180 185 190

Leu His Gly Ser Leu Leu Trp Leu Lys Ser Asp Asn Arg Ala Gln Ala195 200 205

Glu Thr Glu Leu Ala Ala Gly Thr Leu Asp Tyr Thr Gln Ala Leu Gly210 215 220

Thr Leu Gly Leu Thr Trp Ile His Gly Ile Asp Val Thr Asp Gln Trp225 230 235 240

Ala Ser Asp Phe Gln Lys Ala Arg Glu Gly Met Asp Val Tyr Ser Val245 250 255

Arg Gly Glu Gly Asn Ala Gly Ile Asp Asn Ala Ser Phe Ala Phe Glu260 265 270

Tyr Ala Trp Gln Asp Lys Thr Asp Gly Pro Glu Gln Ala Trp Tyr Leu275 280 285

Gln Ala Gly Tyr Thr Phe Ala Asp Leu Pro Trp Ala Pro Gln Val Thr290 295 300

Tyr Arg Tyr Thr Arg Tyr Ser Ala Gly Trp Asp Ala Leu Phe Ser Gly305 310 315 320

Leu Ser Ser Gly Tyr Gly Thr Trp Phe Gln Gly Glu Val Ala Ala Asn325 330 335

Tyr Ala Gly Pro Phe Asn Ser Asn Thr Gly Ile His His Val Gly Val340 345 350

Lys Ala Thr Pro Leu Glu Asn Leu Thr Val Gly Ala Leu Tyr Phe Asp355 360 365

Phe Asp Thr Val Arg Thr Arg Glu Ser Leu Asn Leu Asp Ala Arg Glu370 375 380

Leu Asp Leu Tyr Val Glu Trp Ala Val Asn Glu His Leu Ile Ile Ser385 390 395 400

Pro Leu Val Gly Leu Tyr Gln Pro Arg Lys Asp Glu Ser Asn Gly Gly405 410 415

Asn Gln Val Gly Gly Asn Gly Thr Asn Val Tyr Ser Gln Leu Thr Val420 425 430

Ala Val Pro Phe 435

<210> 17

<211> 1497

<212> DNA

<213> Pseudomonas putida U

<220>

<221> CDS

<222> (1) .. (1497)

<223> TynG

<400> 17 atg tca ctc aat aac aag ctc acc gag cac ctc aac cgc ggc act gtc 48 Met Ser Leu Asn Asn Lys Leu Thr Glu His Leu Asn Arg Gly Thr Val1 5 10 15

ggt ttc ccc acc gca ctg gcc agc act gtc ggg ctg atc atg gcc agc 96 Gly Phe Pro Thr Ala Leu Ala Ser Thr Val Gly Leu Ile Met Ala Ser

ccg gtg atc ctc acc gcg acc atg ggc ttt ggc atc ggc ggc agc gcc 144 Pro Val Ile Leu Thr Ala Thr Met Gly Phe Gly Ile Gly Gly Ser Ala

ttc gcc gtg gcc atg gtc atc gcc gca ctg atg atg ctg gcg cag tcc 192 Phe Ala Val Ala Met Val Ile Ala Ala Leu Met Met Leu Ala Gln Ser

50 55 60

acc acc ttt gcc gag gct gcg tcg atc ctg ccg acc acg ggc tcg gta 240 Thr Thr Phe Ala Glu Ala Ala Ser Ile Leu Pro Thr Thr Gly Ser Val65 70 75 80

tac gac tac atc aac tgt ggc atg ggc cgt ttc ttc gcc att acc ggc 288 Tyr Asp Tyr Ile Asn Cys Gly Met Gly Arg Phe Phe Ala Ile Thr Gly

85 90 95

acg ctg tcg gcc tac ctg atc gtg cat gtg ttc gcc ggt acc gcc gaa 336 Thr Leu Ser Ala Tyr Leu Ile Val His Val Phe Ala Gly Thr Ala Glu

100 105 110

acc atc ctg tcg ggg gtg atg gcg ctg gtg aac ttc gag cac ctc aat 384 Thr Ile Leu Ser Gly Val Met Ala Leu Val Asn Phe Glu His Leu Asn

115 120 125

acc ctg gcg gaa tcc gcc ggc ggt tcg tgg ctg ctg ggg gtg tgc ttc 432 Thr Leu Ala Glu Ser Ala Gly Gly Ser Trp Leu Leu Gly Val Cys Phe

130 135 140

gtg gtg gcg ttt gcg gtg ctc aat gcc ttt ggc gtc agc gcc ttc agc 480 Val Val Ala Phe Ala Val Leu Asn Ala Phe Gly Val Ser Ala Phe Ser145 150 155 160

cgc gcg gaa gtg gtc ctc acc ttc ggc atg tgg acc acc ttg atg gtg 528 Arg Ala Glu Val Val Leu Thr Phe Gly Met Trp Thr Thr Leu Met Val

165 170 175

ttc ggc gtg ctt ggc ctg atc gcc gca ccc gca gtg gaa ctg gac ggc 576 Phe Gly Val Leu Gly Leu Ile Ala Ala Pro Ala Val Glu Leu Asp Gly

180 185 190

ccg ttc ggc gtg tcg ctg gtg ggc acc gac ctg atg acc atc ctc tcg 624

Pro Phe Gly Val Ser Leu Val Gly Thr Asp Leu Met Thr Ile Leu Ser195 200 205

ctg gtc ggc atg gcc atg ttc atg ttc gtt ggc tgc gag ttc gtc acg 672 Leu Val Gly Met Ala Met Phe Met Phe Val Gly Cys Glu Phe Val Thr

210 215 220

ccg ctt gcc ccc gaa ctg cgt cgc tcg gcc tgg gtg ctg ccg cgg gcc 720 Pro Leu Ala Pro Glu Leu Arg Arg Ser Ala Trp Val Leu Pro Arg Ala225 230 235 240

atg gcg ctg ggc ctg ttt ggc gtg gcc agc tgc atg ttc atc tac gga 768 Met Ala Leu Gly Leu Phe Gly Val Ala Ser Cys Met Phe Ile Tyr Gly

245 250 255

gcg gcg atg aag cgc cag gtg gaa aac gtg gtg ctg gat gcc gcc agt 816 Ala Ala Met Lys Arg Gln Val Glu Asn Val Val Leu Asp Ala Ala Ser

260 265 270

ggc gtg cac ctg ctg gac acg ccc atg gcc atc ccg cgc ttc gcc gag 864 Gly Val His Leu Leu Asp Thr Pro Met Ala Ile Pro Arg Phe Ala Glu

275 280 285

cag gtg atg ggt gat att ggc cca gtg tgg ctg ggt atc ggc ttc ctg 912 Gln Val Met Gly Asp Ile Gly Pro Val Trp Leu Gly Ile Gly Phe Leu

290 295 300

ttc gcc ggc gcg gcc acc atc aac acg ctg atg gcc ggt gtg cca cgc 960 Phe Ala Gly Ala Ala Thr Ile Asn Thr Leu Met Ala Gly Val Pro Arg305 310 315 320

att ctt tac ggc atg gcg gtg gac ggc gcg ttg ccc aag gtg ttc acc 1008 Ile Leu Tyr Gly Met Ala Val Asp Gly Ala Leu Pro Lys Val Phe Thr

325 330 335

tac ctg cac ccg cgc ttc aag acg ccg ctg ctg tgc atc ctg gtg gtg 1056 Tyr Leu His Pro Arg Phe Lys Thr Pro Leu Leu Cys Ile Leu Val Val

340 345 350

gcg ttg atc cct tgc ctg cat gcc tgg tac ctg ggc ggc aac ccg gac 1104 Ala Leu Ile Pro Cys Leu His Ala Trp Tyr Leu Gly Gly Asn Pro Asp

355 360 365

aac atc ctg cac ctg gtg ctg gcc gcc gtg tgc gcc tgg agc acc gcc 1152 Asn Ile Leu His Leu Val Leu Ala Ala Val Cys Ala Trp Ser Thr Ala

370 375 380

tac ctg ctg gtg acc ctg tcg gtg gtg ata ttg cgc atc cgc cgc cca 1200 Tyr Leu Leu Val Thr Leu Ser Val Val Ile Leu Arg Ile Arg Arg Pro385 390 395 400

gac ctg ccg cgt gcc tac cgc tcg ccg ctg ttc ccg ttg ccg cag ata 1248 Asp Leu Pro Arg Ala Tyr Arg Ser Pro Leu Phe Pro Leu Pro Gln Ile

405 410 415

ttc tcc agt agc ggt atc ctc atc ggc atg gcg ttc atc aca ccg ccg 1296 Phe Ser Ser Ser Gly Ile Leu Ile Gly Met Ala Phe Ile Thr Pro Pro

420 425 430 ggc atg aac cct gcc gat gtc tac gtg ccg ttc gcc atc atg ctt ggc 1344 Gly Met Asn Pro Ala Asp Val Tyr Val Pro Phe Ala Ile Met Leu Gly

435 440 445

gcc act gcg gcc tat gca ttg ttc tgg acg ctg tgg gtg cag aag gtc 1392 Ala Thr Ala Ala Tyr Ala Leu Phe Trp Thr Leu Trp Val Gln Lys Val

450 455 460

aac ccg ttc aag ccg gcg cgg gtc gag gat gtg ctc gag aaa gag ttt 1440 Asn Pro Phe Lys Pro Ala Arg Val Glu Asp Val Leu Glu Lys Glu Phe465 470 475 480

gct gcc gag cct ggc cac gcc gtg gag cac gtg ctg cat gat cag aaa 1488 Ala Ala Glu Pro Gly His Ala Val Glu His Val Leu His Asp Gln Lys

485 490 495

ttt gcg tga 1497 Phe Ala 165 170 175

<210> 18 <211> 498 <212> PRT <213> Pseudomonas putida U <400> 18 Met Ser Leu Asn Asn Lys Leu Thr Glu His Leu Asn Arg Gly Thr Val1 5 10 15 Gly Phe Pro Thr Ala Leu Ala Ser Thr Val Gly Leu Ile Met Ala Ser20 25 30 Pro Val Ile Leu Thr Ala Thr Met Gly Phe Gly Ile Gly Gly Ser Ala35 40 45 Phe Ala Val Ala Met Val Ile Ala Ala Leu Met Met Leu Ala Gln Ser 50 55 60 Thr Thr Phe Ala Glu Ala Ala Ser Ile Leu Pro Thr Thr Gly Ser Val65 70 75 80 Tyr Asp Tyr Ile Asn Cys Gly Met Gly Arg Phe Phe Ala Ile Thr Gly85 90 95 Thr Leu Ser Ala Tyr Leu Ile Val His Val Phe Ala Gly Thr Ala Glu100 105 110 Thr Ile Leu Ser Gly Val Met Ala Leu Val Asn Phe Glu His Leu Asn115 120 125 Thr Leu Ala Glu Ser Ala Gly Gly Ser Trp Leu Leu Gly Val Cys Phe130 135 140 Val Val Ala Phe Ala Val Leu Asn Ala Phe Gly Val Ser Ala Phe Ser145 150 155 160 Arg Ala Glu Val Val Leu Thr Phe Gly Met Trp Thr Thr Leu Met Val

Phe Gly Val Leu Gly Leu Ile Ala Ala Pro Ala Val Glu Leu Asp Gly180 185 190

Pro Phe Gly Val Ser Leu Val Gly Thr Asp Leu Met Thr Ile Leu Ser195 200 205

Leu Val Gly Met Ala Met Phe Met Phe Val Gly Cys Glu Phe Val Thr210 215 220

Pro Leu Ala Pro Glu Leu Arg Arg Ser Ala Trp Val Leu Pro Arg Ala225 230 235 240

Met Ala Leu Gly Leu Phe Gly Val Ala Ser Cys Met Phe Ile Tyr Gly245 250 255

Ala Ala Met Lys Arg Gln Val Glu Asn Val Val Leu Asp Ala Ala Ser260 265 270

Gly Val His Leu Leu Asp Thr Pro Met Ala Ile Pro Arg Phe Ala Glu275 280 285

Gln Val Met Gly Asp Ile Gly Pro Val Trp Leu Gly Ile Gly Phe Leu290 295 300

Phe Ala Gly Ala Ala Thr Ile Asn Thr Leu Met Ala Gly Val Pro Arg305 310 315 320

Ile Leu Tyr Gly Met Ala Val Asp Gly Ala Leu Pro Lys Val Phe Thr325 330 335

Tyr Leu His Pro Arg Phe Lys Thr Pro Leu Leu Cys Ile Leu Val Val340 345 350

Ala Leu Ile Pro Cys Leu His Ala Trp Tyr Leu Gly Gly Asn Pro Asp355 360 365

Asn Ile Leu His Leu Val Leu Ala Ala Val Cys Ala Trp Ser Thr Ala370 375 380

Tyr Leu Leu Val Thr Leu Ser Val Val Ile Leu Arg Ile Arg Arg Pro385 390 395 400

Asp Leu Pro Arg Ala Tyr Arg Ser Pro Leu Phe Pro Leu Pro Gln Ile405 410 415

Phe Ser Ser Ser Gly Ile Leu Ile Gly Met Ala Phe Ile Thr Pro Pro420 425 430

Gly Met Asn Pro Ala Asp Val Tyr Val Pro Phe Ala Ile Met Leu Gly435 440 445

Ala Thr Ala Ala Tyr Ala Leu Phe Trp Thr Leu Trp Val Gln Lys Val450 455 460

Asn Pro Phe Lys Pro Ala Arg Val Glu Asp Val Leu Glu Lys Glu Phe465 470 475 480

Ala Ala Glu Pro Gly His Ala Val Glu His Val Leu His Asp Gln Lys485 490 495

Phe Ala

<210> 19

<211> 1170

<212> DNA

<213> Pseudomonas putida U

<220>

<221> CDS

<222> (1) .. (1170)

<223> HpaB

<400> 19 atg aaa aag cca aac ccc ctg ctg gaa gac ctg aag tcc gtc ctg ccg 48 Met Lys Lys Pro Asn Pro Leu Leu Glu Asp Leu Lys Ser Val Leu Pro1 5 10 15

acc att gcc gcc aat gcc atg cgt gca gag cag gac cgc agt gtg ccg 96 Thr Ile Ala Ala Asn Ala Met Arg Ala Glu Gln Asp Arg Ser Val Pro

gca gag aat atc gcc ttg ctg aaa agc atc ggc atg cac cgc gct ttc 144 Ala Glu Asn Ile Ala Leu Leu Lys Ser Ile Gly Met His Arg Ala Phe

ttg ccc aaa cac ttc ggc ggc atg gaa atc acc ctg ccg gag ttc gcc 192 Leu Pro Lys His Phe Gly Gly Met Glu Ile Thr Leu Pro Glu Phe Ala

50 55 60

cag tgc atc gcc ttg ctg gcg ggg gcc tgc gcc agc aca gcc tgg gcc 240 Gln Cys Ile Ala Leu Leu Ala Gly Ala Cys Ala Ser Thr Ala Trp Ala65 70 75 80

atg agc ctg ctg tgc acc cac agc cac cag atg gca atg ttc tcg ccc 288 Met Ser Leu Leu Cys Thr His Ser His Gln Met Ala Met Phe Ser Pro

85 90 95

aag cta caa cag gag gtg tgg ggt agc gac ccg gat gct acc gcc agc 336 Lys Leu Gln Gln Glu Val Trp Gly Ser Asp Pro Asp Ala Thr Ala Ser

100 105 110

agc agt atc gcg ccg ttc ggc cgc act gaa gag gtt gag ggt ggc gtg 384 Ser Ser Ile Ala Pro Phe Gly Arg Thr Glu Glu Val Glu Gly Gly Val

115 120 125

tcg ttc agc ggc gaa atg ggc tgg agt tcc ggt tgc gac cac gcc gaa 432 Ser Phe Ser Gly Glu Met Gly Trp Ser Ser Gly Cys Asp His Ala Glu

130 135 140

tgg gcg att ctc ggt ttc cgc cgc aag aat gcc gaa ggc gct cag gat 480 Trp Ala Ile Leu Gly Phe Arg Arg Lys Asn Ala Glu Gly Ala Gln Asp145 150 155 160

tac tgc ttc gcc atc ctg cct cgc agt gac tat gaa atc cgt gat gac 528

Tyr Cys Phe Ala Ile Leu Pro Arg Ser Asp Tyr Glu Ile Arg Asp Asp165 170 175

tgg tat gcc gtg ggc atg cgc ggc agc ggc agc aag acc ctg atc gtg 576 Trp Tyr Ala Val Gly Met Arg Gly Ser Gly Ser Lys Thr Leu Ile Val

180 185 190

cgt gat gcc ttc gtg ccc gag cac cgc atc cag aag gcc aag gac atg 624 Arg Asp Ala Phe Val Pro Glu His Arg Ile Gln Lys Ala Lys Asp Met

195 200 205

atg gag ggc aag tcg gcg ggc ttt ggt ttg tac ccc gac agc aag att 672 Met Glu Gly Lys Ser Ala Gly Phe Gly Leu Tyr Pro Asp Ser Lys Ile

210 215 220

ttc ttc gcc ccg tat cgc ccg tat ttt gcc agc ggc ttc tcc acg gtc 720 Phe Phe Ala Pro Tyr Arg Pro Tyr Phe Ala Ser Gly Phe Ser Thr Val225 230 235 240

agc ttg ggc gtt gcc gag cgc atg ctg gag gtg ttc cgc gag aaa acc 768 Ser Leu Gly Val Ala Glu Arg Met Leu Glu Val Phe Arg Glu Lys Thr

245 250 255

cgc aac cgc gtg cgt gcc tac acc ggt gct gcc gtg ggc gcc gcc acc 816 Arg Asn Arg Val Arg Ala Tyr Thr Gly Ala Ala Val Gly Ala Ala Thr

260 265 270

ccg gcg ctg atg cgc ctg gcc gag tcg acc cat cag gtg gcc gct gcc 864 Pro Ala Leu Met Arg Leu Ala Glu Ser Thr His Gln Val Ala Ala Ala

275 280 285

cgg gca ttg ctg gaa aag agc tgg gac gag att gcc gag cac agt gcc 912 Arg Ala Leu Leu Glu Lys Ser Trp Asp Glu Ile Ala Glu His Ser Ala

290 295 300

cgt cac gaa tac ccg tcg cgt ggc acg ctg gcg ttc tgg cgt acc aac 960 Arg His Glu Tyr Pro Ser Arg Gly Thr Leu Ala Phe Trp Arg Thr Asn305 310 315 320

cag ggc tac gcc gtg aag atg tgc atc cag gcc gtc gac cgc ctg atg 1008 Gln Gly Tyr Ala Val Lys Met Cys Ile Gln Ala Val Asp Arg Leu Met

325 330 335

gaa gcg gcc ggt ggt ggc gcc tgg ttc gag agc aac gaa ctg cag cgg 1056 Glu Ala Ala Gly Gly Gly Ala Trp Phe Glu Ser Asn Glu Leu Gln Arg

340 345 350

ctg ttc cgc gat tcg cac atg acc ggt gcc cat gcc tac acc gat tac 1104 Leu Phe Arg Asp Ser His Met Thr Gly Ala His Ala Tyr Thr Asp Tyr

355 360 365

gac gtg tgt gcg caa atc ctc ggc cgc gag ctg atg ggc ctg gag cct 1152 Asp Val Cys Ala Gln Ile Leu Gly Arg Glu Leu Met Gly Leu Glu Pro

370 375 380

gac ccg gcg atg gtc tga 1170 Asp Pro Ala Met Val385

<210> 20

<211> 389

<212> PRT

<213> Pseudomonas putida U

<400> 20

Met Lys Lys Pro Asn Pro Leu Leu Glu Asp Leu Lys Ser Val Leu Pro1 5 10 15

Thr Ile Ala Ala Asn Ala Met Arg Ala Glu Gln Asp Arg Ser Val Pro20 25 30

Ala Glu Asn Ile Ala Leu Leu Lys Ser Ile Gly Met His Arg Ala Phe35 40 45

Leu Pro Lys His Phe Gly Gly Met Glu Ile Thr Leu Pro Glu Phe Ala50 55 60

Gln Cys Ile Ala Leu Leu Ala Gly Ala Cys Ala Ser Thr Ala Trp Ala65 70 75 80

Met Ser Leu Leu Cys Thr His Ser His Gln Met Ala Met Phe Ser Pro85 90 95

Lys Leu Gln Gln Glu Val Trp Gly Ser Asp Pro Asp Ala Thr Ala Ser100 105 110

Ser Ser Ile Ala Pro Phe Gly Arg Thr Glu Glu Val Glu Gly Gly Val115 120 125

Ser Phe Ser Gly Glu Met Gly Trp Ser Ser Gly Cys Asp His Ala Glu130 135 140

Trp Ala Ile Leu Gly Phe Arg Arg Lys Asn Ala Glu Gly Ala Gln Asp145 150 155 160

Tyr Cys Phe Ala Ile Leu Pro Arg Ser Asp Tyr Glu Ile Arg Asp Asp165 170 175

Trp Tyr Ala Val Gly Met Arg Gly Ser Gly Ser Lys Thr Leu Ile Val180 185 190

Arg Asp Ala Phe Val Pro Glu His Arg Ile Gln Lys Ala Lys Asp Met195 200 205

Met Glu Gly Lys Ser Ala Gly Phe Gly Leu Tyr Pro Asp Ser Lys Ile210 215 220

Phe Phe Ala Pro Tyr Arg Pro Tyr Phe Ala Ser Gly Phe Ser Thr Val225 230 235 240

Ser Leu Gly Val Ala Glu Arg Met Leu Glu Val Phe Arg Glu Lys Thr245 250 255

Arg Asn Arg Val Arg Ala Tyr Thr Gly Ala Ala Val Gly Ala Ala Thr260 265 270

Pro Ala Leu Met Arg Leu Ala Glu Ser Thr His Gln Val Ala Ala Ala275 280 285 Arg Ala Leu Leu Glu Lys Ser Trp Asp Glu Ile Ala Glu His Ser Ala 290 295 300 Arg His Glu Tyr Pro Ser Arg Gly Thr Leu Ala Phe Trp Arg Thr Asn305 310 315 320 Gln Gly Tyr Ala Val Lys Met Cys Ile Gln Ala Val Asp Arg Leu Met325 330 335 Glu Ala Ala Gly Gly Gly Ala Trp Phe Glu Ser Asn Glu Leu Gln Arg340 345 350 Leu Phe Arg Asp Ser His Met Thr Gly Ala His Ala Tyr Thr Asp Tyr355 360 365 Asp Val Cys Ala Gln Ile Leu Gly Arg Glu Leu Met Gly Leu Glu Pro370 375 380 Asp Pro Ala Met Val385 <210> 21 <211> 930 <212> DNA <213> Pseudmonas putida U <220> <221> CDS <222> (1) .. (930) <223> hpaC <400> 21 atg tcc aaa gaa acc ttc gat tca cgt gcc ttc cgc cgc gcc ctg ggcMet Ser Lys Glu Thr Phe Asp Ser Arg Ala Phe Arg Arg Ala Leu Gly1 5 10 15 48 aac ttc gcc acc ggc gtg acc gtg gtg act gcc gcc ggc ccc agt ggcAsn Phe Ala Thr Gly Val Thr Val Val Thr Ala Ala Gly Pro Ser Gly20 25 30 96 cgc aag gtc ggc gtt acc gcc aac agc ttc aac tcg gtg tcg ctg gacArg Lys Val Gly Val Thr Ala Asn Ser Phe Asn Ser Val Ser Leu Asp35 40 45 144 ccg gcg ctg atc ctg tgg agc atc gac aag cgc tcc acc agc cat gaaPro Ala Leu Ile Leu Trp Ser Ile Asp Lys Arg Ser Thr Ser His Glu50 55 60 192 gtg ttc gaa gag gcc tcg cac ttt gcc gtg aac att ctg gct gcg gacVal Phe Glu Glu Ala Ser His Phe Ala Val Asn Ile Leu Ala Ala Asp65 70 75 80 240 cag atc gac ctg tcc aac aac ttt gcc cgc ccg aag gaa gat cgc ttt Gln Ile Asp Leu Ser Asn Asn Phe Ala Arg Pro Lys Glu Asp Arg Phe85 90 95 288

gcc ggt atc gac tac gag acc ggc act ggc ggc gcg ccg ttg ttc gcc 336 Ala Gly Ile Asp Tyr Glu Thr Gly Thr Gly Gly Ala Pro Leu Phe Ala

100 105 110

gat tgc gcg gcg cgc ttt gag tgt gaa aag tac cag cag ctg gac ggt 384 Asp Cys Ala Ala Arg Phe Glu Cys Glu Lys Tyr Gln Gln Leu Asp Gly

115 120 125

ggc gat cac tgg atc ctg gtg ggc aag gta gtg gcc ttt gat gac ttt 432 Gly Asp His Trp Ile Leu Val Gly Lys Val Val Ala Phe Asp Asp Phe

130 135 140

ggc cgc tcg ccg ctg ctg tat cac cag ggc gcc tat tca atg gtg ctg 480 Gly Arg Ser Pro Leu Leu Tyr His Gln Gly Ala Tyr Ser Met Val Leu145 150 155 160

ccg cat acc cgc atg acc caa ggc gca gag ggg cag gca ccg agc agc 528 Pro His Thr Arg Met Thr Gln Gly Ala Glu Gly Gln Ala Pro Ser Ser

165 170 175

cac ttc cag ggc cgc ctg cag cac aac ctg tac tac ctg atg acc cag 576 His Phe Gln Gly Arg Leu Gln His Asn Leu Tyr Tyr Leu Met Thr Gln

180 185 190

gcg ctg cgt gcc tac cag gct gac tac cag cca cgc cag ctg tgt acc 624 Ala Leu Arg Ala Tyr Gln Ala Asp Tyr Gln Pro Arg Gln Leu Cys Thr

195 200 205

ggc ctg cgc acc agc gag gca cgc atg ctg atg gtg ctg gag aac gat 672 Gly Leu Arg Thr Ser Glu Ala Arg Met Leu Met Val Leu Glu Asn Asp

210 215 220

gcg ggc ctg agc ctg aac gac ctg caa cgc gaa gtg gcg atg ccg gcg 720 Ala Gly Leu Ser Leu Asn Asp Leu Gln Arg Glu Val Ala Met Pro Ala225 230 235 240

cgg gag atc gag gaa gcg gtt gcc aac ctc aag cgc aaa ggg ctg att 768 Arg Glu Ile Glu Glu Ala Val Ala Asn Leu Lys Arg Lys Gly Leu Ile

245 250 255

gcc gat gac gaa ggg cga gtg cgg cta tcg gtg aag ggc gtg gac gag 816 Ala Asp Asp Glu Gly Arg Val Arg Leu Ser Val Lys Gly Val Asp Glu

260 265 270

acc gag gcg ttg tgg acc att gcc cgg caa cag cag gac aag gtg ttc 864 Thr Glu Ala Leu Trp Thr Ile Ala Arg Gln Gln Gln Asp Lys Val Phe

275 280 285

ggg cag ttc agt gaa cag cag ctg gag act ttc aag acc gtg ctc aag 912 Gly Gln Phe Ser Glu Gln Gln Leu Glu Thr Phe Lys Thr Val Leu Lys

290 295 300

gcc ctt atc aac atc tga 930 Ala Leu Ile Asn Ile 305

<210> 22

<211> 309

<212> PRT

<213> Pseudmonas putida U

<400> 22 Met Ser Lys Glu Thr Phe Asp Ser Arg Ala Phe Arg Arg Ala Leu Gly1 5 10 15

Asn Phe Ala Thr Gly Val Thr Val Val Thr Ala Ala Gly Pro Ser Gly20 25 30

Arg Lys Val Gly Val Thr Ala Asn Ser Phe Asn Ser Val Ser Leu Asp35 40 45

Pro Ala Leu Ile Leu Trp Ser Ile Asp Lys Arg Ser Thr Ser His Glu50 55 60

Val Phe Glu Glu Ala Ser His Phe Ala Val Asn Ile Leu Ala Ala Asp65 70 75 80

Gln Ile Asp Leu Ser Asn Asn Phe Ala Arg Pro Lys Glu Asp Arg Phe85 90 95

Ala Gly Ile Asp Tyr Glu Thr Gly Thr Gly Gly Ala Pro Leu Phe Ala100 105 110

Asp Cys Ala Ala Arg Phe Glu Cys Glu Lys Tyr Gln Gln Leu Asp Gly115 120 125

Gly Asp His Trp Ile Leu Val Gly Lys Val Val Ala Phe Asp Asp Phe130 135 140

Gly Arg Ser Pro Leu Leu Tyr His Gln Gly Ala Tyr Ser Met Val Leu145 150 155 160

Pro His Thr Arg Met Thr Gln Gly Ala Glu Gly Gln Ala Pro Ser Ser165 170 175

His Phe Gln Gly Arg Leu Gln His Asn Leu Tyr Tyr Leu Met Thr Gln180 185 190

Ala Leu Arg Ala Tyr Gln Ala Asp Tyr Gln Pro Arg Gln Leu Cys Thr195 200 205

Gly Leu Arg Thr Ser Glu Ala Arg Met Leu Met Val Leu Glu Asn Asp210 215 220

Ala Gly Leu Ser Leu Asn Asp Leu Gln Arg Glu Val Ala Met Pro Ala225 230 235 240

Arg Glu Ile Glu Glu Ala Val Ala Asn Leu Lys Arg Lys Gly Leu Ile245 250 255

Ala Asp Asp Glu Gly Arg Val Arg Leu Ser Val Lys Gly Val Asp Glu260 265 270

Thr Glu Ala Leu Trp Thr Ile Ala Arg Gln Gln Gln Asp Lys Val Phe275 280 285

Gly Gln Phe Ser Glu Gln Gln Leu Glu Thr Phe Lys Thr Val Leu Lys290 295 300

Ala Leu Ile Asn Ile 305

<210> 23

<211> 924

<212> DNA

<213> Pseudomonas putida U

<220>

<221> CDS

<222> (1) .. (924)

<223> hpaD

<400> 23 atg ggc aaa ctc gct ctc act gcc aag att acc cat gta ccg tcc atg 48 Met Gly Lys Leu Ala Leu Thr Ala Lys Ile Thr His Val Pro Ser Met1 5 10 15

tac atg tcc gaa ctg cca ggc ccg cgc caa ggc ttt cgc cag gcg gcc 96 Tyr Met Ser Glu Leu Pro Gly Pro Arg Gln Gly Phe Arg Gln Ala Ala

atc gac ggg cat cac gaa atc agc cgc cgt tgc cgt gag ctg ggc gtg 144 Ile Asp Gly His His Glu Ile Ser Arg Arg Cys Arg Glu Leu Gly Val

gac acc atc gtc gtg ttc gac acg cac tgg ctg gtc aac gcc aac tac 192 Asp Thr Ile Val Val Phe Asp Thr His Trp Leu Val Asn Ala Asn Tyr

50 55 60

cac gtg ctg tgc ggg ccg cat ttc gag ggc gtg tac acc agc aac gaa 240 His Val Leu Cys Gly Pro His Phe Glu Gly Val Tyr Thr Ser Asn Glu65 70 75 80

ctg ccg cac ttc atc agc aac atg ccc tac gca ttc ccc ggc aat ccc 288 Leu Pro His Phe Ile Ser Asn Met Pro Tyr Ala Phe Pro Gly Asn Pro

85 90 95

gag ctg ggc aag ctg ctg gcc gag gag tgc aac cgc ttc aac gtc gaa 336 Glu Leu Gly Lys Leu Leu Ala Glu Glu Cys Asn Arg Phe Asn Val Glu

100 105 110

acc atg gcc cac cac gcc acc acc ctc gcc ccg gaa tac ggc acc ctg 384 Thr Met Ala His His Ala Thr Thr Leu Ala Pro Glu Tyr Gly Thr Leu

115 120 125

gtg ccc atg cgc tac atg aac cag gac cag cac ttc aaa gtg gtc tcg 432 Val Pro Met Arg Tyr Met Asn Gln Asp Gln His Phe Lys Val Val Ser

130 135 140

gtc tcg gcc ctg tgc acc tcg cac tac ctg gcc gac agt gcc cgc ctg 480 Val Ser Ala Leu Cys Thr Ser His Tyr Leu Ala Asp Ser Ala Arg Leu145 150 155 160 ggc tgg gcc atg cgc aag gca gta gaa gac cac tac gac ggc acc gtg 528 Gly Trp Ala Met Arg Lys Ala Val Glu Asp His Tyr Asp Gly Thr Val

165 170 175

gcg ttc ctg gcc agc ggc tcg ctg tcg cac cgc ttc gcg cag aac ggc 576 Ala Phe Leu Ala Ser Gly Ser Leu Ser His Arg Phe Ala Gln Asn Gly

180 185 190

cag gcg ccg gac ttt gcc acc aag gtg tgg agc ccg ttc ctc gaa acc 624 Gln Ala Pro Asp Phe Ala Thr Lys Val Trp Ser Pro Phe Leu Glu Thr

195 200 205

ctc gac cac cgt gtg gtg caa atg tgg cag gac ggc gag tgg gaa gcg 672 Leu Asp His Arg Val Val Gln Met Trp Gln Asp Gly Glu Trp Glu Ala

210 215 220

ttc tgc ggg atg ctg ccg gag tac gcc gcc aaa ggc cac ggt gaa ggc 720 Phe Cys Gly Met Leu Pro Glu Tyr Ala Ala Lys Gly His Gly Glu Gly225 230 235 240

ttc atg cac gac acg gca atg ctg ctg ggt gcg ctg ggc tgg tcc gat 768 Phe Met His Asp Thr Ala Met Leu Leu Gly Ala Leu Gly Trp Ser Asp

245 250 255

tac gac ggc aag gcc gaa gtg gtc acg ccc tac ttc ggc tct tcc ggc 816 Tyr Asp Gly Lys Ala Glu Val Val Thr Pro Tyr Phe Gly Ser Ser Gly

260 265 270

acc ggc cag atc aac gcg atc ttc ccg gtc acc ccg cag gac ggt ggt 864 Thr Gly Gln Ile Asn Ala Ile Phe Pro Val Thr Pro Gln Asp Gly Gly

275 280 285

gcc atc ccc gct gcc cag gcc gcc aac ccg gcc gcc gtg gtg ccc acc 912 Ala Ile Pro Ala Ala Gln Ala Ala Asn Pro Ala Ala Val Val Pro Thr

290 295 300

agc cgc ctg taa 924 Ser Arg Leu305

<210> 24

<211> 307

<212> PRT

<213> Pseudomonas putida U

<400> 24

Met Gly Lys Leu Ala Leu Thr Ala Lys Ile Thr His Val Pro Ser Met 1 5 10 15 Tyr Met Ser Glu Leu Pro Gly Pro Arg Gln Gly Phe Arg Gln Ala Ala

Ile Asp Gly His His Glu Ile Ser Arg Arg Cys Arg Glu Leu Gly Val35 40 45

Asp Thr Ile Val Val Phe Asp Thr His Trp Leu Val Asn Ala Asn Tyr50 55 60 His Val Leu Cys Gly Pro His Phe Glu Gly Val Tyr Thr Ser Asn Glu65 70 75 80

Leu Pro His Phe Ile Ser Asn Met Pro Tyr Ala Phe Pro Gly Asn Pro85 90 95

Glu Leu Gly Lys Leu Leu Ala Glu Glu Cys Asn Arg Phe Asn Val Glu100 105 110

Thr Met Ala His His Ala Thr Thr Leu Ala Pro Glu Tyr Gly Thr Leu115 120 125

Val Pro Met Arg Tyr Met Asn Gln Asp Gln His Phe Lys Val Val Ser130 135 140

Val Ser Ala Leu Cys Thr Ser His Tyr Leu Ala Asp Ser Ala Arg Leu145 150 155 160

Gly Trp Ala Met Arg Lys Ala Val Glu Asp His Tyr Asp Gly Thr Val165 170 175

Ala Phe Leu Ala Ser Gly Ser Leu Ser His Arg Phe Ala Gln Asn Gly180 185 190

Gln Ala Pro Asp Phe Ala Thr Lys Val Trp Ser Pro Phe Leu Glu Thr195 200 205

Leu Asp His Arg Val Val Gln Met Trp Gln Asp Gly Glu Trp Glu Ala210 215 220

Phe Cys Gly Met Leu Pro Glu Tyr Ala Ala Lys Gly His Gly Glu Gly225 230 235 240

Phe Met His Asp Thr Ala Met Leu Leu Gly Ala Leu Gly Trp Ser Asp245 250 255

Tyr Asp Gly Lys Ala Glu Val Val Thr Pro Tyr Phe Gly Ser Ser Gly260 265 270

Thr Gly Gln Ile Asn Ala Ile Phe Pro Val Thr Pro Gln Asp Gly Gly275 280 285

Ala Ile Pro Ala Ala Gln Ala Ala Asn Pro Ala Ala Val Val Pro Thr 290 295 300

Ser Arg Leu305

<210> 25

<211> 1461

<212> DNA

<213> Pseudomonas putida U

<220>

<221> CDS

<222> (1) .. (1461)

<223> hpaE <400> 25 atg atc aag cac tgg atc aac ggc cgt gag gtc gag agc aaa gac acc 48 Met Ile Lys His Trp Ile Asn Gly Arg Glu Val Glu Ser Lys Asp Thr1 5 10 15

ttc gtc aac tac aac ccg gcc acc ggc gac gcc atc tgc gaa gtc gcc 96 Phe Val Asn Tyr Asn Pro Ala Thr Gly Asp Ala Ile Cys Glu Val Ala

agc ggc ggc gcc gag gaa gtg gcc cag gct gtg gct gcg gcc aag gaa 144 Ser Gly Gly Ala Glu Glu Val Ala Gln Ala Val Ala Ala Ala Lys Glu

gcc ttc ccc aag tgg gcc aac acc ccg gcc aag gaa cgt gcc cgg ctg 192 Ala Phe Pro Lys Trp Ala Asn Thr Pro Ala Lys Glu Arg Ala Arg Leu

50 55 60

atg cgc aag ctg ggt gag ctg att gag cag aac gtg ccg aaa ctc gcc 240 Met Arg Lys Leu Gly Glu Leu Ile Glu Gln Asn Val Pro Lys Leu Ala65 70 75 80

gag ctg gaa acc ctc gac acc ggc ctg ccg atc cac cag acc aag aac 288 Glu Leu Glu Thr Leu Asp Thr Gly Leu Pro Ile His Gln Thr Lys Asn

85 90 95

gtg ctg atc ccg cgt gcc tcg cac aac ttc gac ttc ttc gcc gaa gtg 336 Val Leu Ile Pro Arg Ala Ser His Asn Phe Asp Phe Phe Ala Glu Val

100 105 110

tgc acg cgc atg gac ggc cat acc tac ccg gtc gac gac cag atg ctc 384 Cys Thr Arg Met Asp Gly His Thr Tyr Pro Val Asp Asp Gln Met Leu

115 120 125

aac tac acc ctg tac cag ccg gtg ggt gtg tgc ggc ctg gta agc cca 432 Asn Tyr Thr Leu Tyr Gln Pro Val Gly Val Cys Gly Leu Val Ser Pro

130 135 140

tgg aac gtg ccg ttc atg acg gct acc tgg aag act gcg ccg tgc ctg 480 Trp Asn Val Pro Phe Met Thr Ala Thr Trp Lys Thr Ala Pro Cys Leu145 150 155 160

gcg ctg ggc aac acc gcc gtg ctg aag atg agc gag ctg tcg cct ctg 528 Ala Leu Gly Asn Thr Ala Val Leu Lys Met Ser Glu Leu Ser Pro Leu

165 170 175

acc gcc aac gaa ctg ggc cgc ctg gcg gta gaa gcc ggc atc ccc aac 576 Thr Ala Asn Glu Leu Gly Arg Leu Ala Val Glu Ala Gly Ile Pro Asn

180 185 190

ggg gtg ctg aac gtg atc cag ggt tac ggc gct acc gcc ggc gat gcc 624 Gly Val Leu Asn Val Ile Gln Gly Tyr Gly Ala Thr Ala Gly Asp Ala

195 200 205

ctg gtc cgc cac ccc gat gtg cgc gcc att tcc ttc acc ggc ggt acc 672 Leu Val Arg His Pro Asp Val Arg Ala Ile Ser Phe Thr Gly Gly Thr

210 215 220

gcc acc ggc aag aag atc atg cag acc gca ggc ctt aaa aag tac tcg 720 Ala Thr Gly Lys Lys Ile Met Gln Thr Ala Gly Leu Lys Lys Tyr Ser225 230 235 240

atg gaa ctg ggc ggc aag tcg ccc gtg ctg atc ttc gaa gac gca gac 768 Met Glu Leu Gly Gly Lys Ser Pro Val Leu Ile Phe Glu Asp Ala Asp

245 250 255

ctt gag cgt gcg ctg gac gcc gcg ctg ttc acc atc ttc tcg ctg aac 816 Leu Glu Arg Ala Leu Asp Ala Ala Leu Phe Thr Ile Phe Ser Leu Asn

260 265 270

ggc gag cgc tgc acc gcc ggc agc cgc atc ttc atc cag gaa agc gtg 864 Gly Glu Arg Cys Thr Ala Gly Ser Arg Ile Phe Ile Gln Glu Ser Val

275 280 285

tac ccg cag ttt gtc gca gag ttt gcg gcg cgc gcc aag cgc ctg atc 912 Tyr Pro Gln Phe Val Ala Glu Phe Ala Ala Arg Ala Lys Arg Leu Ile

290 295 300

gta ggt gac ccg acc gac ccg aaa acc cag gtc ggt tcg atg atc acc 960 Val Gly Asp Pro Thr Asp Pro Lys Thr Gln Val Gly Ser Met Ile Thr305 310 315 320

cag cag cac tat gac aag gtc acc ggg tac atc cgc att ggc atc gaa 1008 Gln Gln His Tyr Asp Lys Val Thr Gly Tyr Ile Arg Ile Gly Ile Glu

325 330 335

gaa ggt gca cgc ctg gtc gcc ggg ggc ctg gag cgc ccg gcc aac ctg 1056 Glu Gly Ala Arg Leu Val Ala Gly Gly Leu Glu Arg Pro Ala Asn Leu

340 345 350

cct gcg cac ctg gcc aag ggg cag ttc atc cag ccc acc gta ttc gcc 1104 Pro Ala His Leu Ala Lys Gly Gln Phe Ile Gln Pro Thr Val Phe Ala

355 360 365

gac gtg aac aac aag atg cgc att gcc cag gaa gaa atc ttt ggc ccg 1152 Asp Val Asn Asn Lys Met Arg Ile Ala Gln Glu Glu Ile Phe Gly Pro

370 375 380

gtg gtg tgc ctg atc ccg ttc aag gac gaa gcc gag gcg ctg caa ctg 1200 Val Val Cys Leu Ile Pro Phe Lys Asp Glu Ala Glu Ala Leu Gln Leu385 390 395 400

gcc aac gac acc gag tat ggc ctg gcc tcg tac atc tgg acc cag gac 1248 Ala Asn Asp Thr Glu Tyr Gly Leu Ala Ser Tyr Ile Trp Thr Gln Asp

405 410 415

atc ggc aaa gcc cat cgc ctg gcc cgt ggc atc gag gcc ggc atg gtg 1296 Ile Gly Lys Ala His Arg Leu Ala Arg Gly Ile Glu Ala Gly Met Val

420 425 430

ttc atc aac agc cag aac gta cgc gac ctg cgc cag ccg ttc ggc ggc 1344 Phe Ile Asn Ser Gln Asn Val Arg Asp Leu Arg Gln Pro Phe Gly Gly

435 440 445

gtg aaa ggt tcc ggt acc ggg cgt gag ggc ggg cag tac agc ttc gag 1392 Val Lys Gly Ser Gly Thr Gly Arg Glu Gly Gly Gln Tyr Ser Phe Glu

450 455 460

gtc ttt gca gag atc aag aac gtg tgt att tcc atg ggt aat cac cacVal Phe Ala Glu Ile Lys Asn Val Cys Ile Ser Met Gly Asn His His465 470 475 480 1440 att cct cgc tgg ggc atc taaIle Pro Arg Trp Gly Ile485 1461 <210> 26 <211> 486 <212> PRT <213> Pseudomonas putida U <400> 26 Met Ile Lys His Trp Ile Asn Gly Arg Glu Val Glu Ser Lys Asp Thr1 5 10 15 Phe Val Asn Tyr Asn Pro Ala Thr Gly Asp Ala Ile Cys Glu Val Ala20 25 30 Ser Gly Gly Ala Glu Glu Val Ala Gln Ala Val Ala Ala Ala Lys Glu35 40 45 Ala Phe Pro Lys Trp Ala Asn Thr Pro Ala Lys Glu Arg Ala Arg Leu50 55 60 Met Arg Lys Leu Gly Glu Leu Ile Glu Gln Asn Val Pro Lys Leu Ala65 70 75 80 Glu Leu Glu Thr Leu Asp Thr Gly Leu Pro Ile His Gln Thr Lys Asn85 90 95 Val Leu Ile Pro Arg Ala Ser His Asn Phe Asp Phe Phe Ala Glu Val100 105 110 Cys Thr Arg Met Asp Gly His Thr Tyr Pro Val Asp Asp Gln Met Leu115 120 125 Asn Tyr Thr Leu Tyr Gln Pro Val Gly Val Cys Gly Leu Val Ser Pro130 135 140 Trp Asn Val Pro Phe Met Thr Ala Thr Trp Lys Thr Ala Pro Cys Leu145 150 155 160 Ala Leu Gly Asn Thr Ala Val Leu Lys Met Ser Glu Leu Ser Pro Leu165 170 175 Thr Ala Asn Glu Leu Gly Arg Leu Ala Val Glu Ala Gly Ile Pro Asn180 185 190 Gly Val Leu Asn Val Ile Gln Gly Tyr Gly Ala Thr Ala Gly Asp Ala195 200 205 Leu Val Arg His Pro Asp Val Arg Ala Ile Ser Phe Thr Gly Gly Thr210 215 220 Ala Thr Gly Lys Lys Ile Met Gln Thr Ala Gly Leu Lys Lys Tyr Ser

225 230 235 240

Met Glu Leu Gly Gly Lys Ser Pro Val Leu Ile Phe Glu Asp Ala Asp245 250 255

Leu Glu Arg Ala Leu Asp Ala Ala Leu Phe Thr Ile Phe Ser Leu Asn260 265 270

Gly Glu Arg Cys Thr Ala Gly Ser Arg Ile Phe Ile Gln Glu Ser Val275 280 285

Tyr Pro Gln Phe Val Ala Glu Phe Ala Ala Arg Ala Lys Arg Leu Ile290 295 300

Val Gly Asp Pro Thr Asp Pro Lys Thr Gln Val Gly Ser Met Ile Thr305 310 315 320

Gln Gln His Tyr Asp Lys Val Thr Gly Tyr Ile Arg Ile Gly Ile Glu325 330 335

Glu Gly Ala Arg Leu Val Ala Gly Gly Leu Glu Arg Pro Ala Asn Leu340 345 350

Pro Ala His Leu Ala Lys Gly Gln Phe Ile Gln Pro Thr Val Phe Ala355 360 365

Asp Val Asn Asn Lys Met Arg Ile Ala Gln Glu Glu Ile Phe Gly Pro 370 375 380

Val Val Cys Leu Ile Pro Phe Lys Asp Glu Ala Glu Ala Leu Gln Leu385 390 395 400

Ala Asn Asp Thr Glu Tyr Gly Leu Ala Ser Tyr Ile Trp Thr Gln Asp405 410 415

Ile Gly Lys Ala His Arg Leu Ala Arg Gly Ile Glu Ala Gly Met Val420 425 430

Phe Ile Asn Ser Gln Asn Val Arg Asp Leu Arg Gln Pro Phe Gly Gly435 440 445

Val Lys Gly Ser Gly Thr Gly Arg Glu Gly Gly Gln Tyr Ser Phe Glu450 455 460

Val Phe Ala Glu Ile Lys Asn Val Cys Ile Ser Met Gly Asn His His465 470 475 480

Ile Pro Arg Trp Gly Ile485

<210> 27

<211> 405

<212> DNA

<213> Pseudomonas putida U

<220>

<221> CDS

<222> (1) .. (405)

<223> hpaF

<400> 27 atg cca cac ctg gtt ctg ctc tat acc ccc gac ctg gaa acc gac gcc 48 Met Pro His Leu Val Leu Leu Tyr Thr Pro Asp Leu Glu Thr Asp Ala1 5 10 15

gac atc ccc ggc ctg tgc cgc gcc ctg gcc gac acc atg ctc gaa cag 96 Asp Ile Pro Gly Leu Cys Arg Ala Leu Ala Asp Thr Met Leu Glu Gln

cgc gat gcc gaa ggc aaa gcc gtg ttc ccc act ggc ggt aca cgc gtg 144 Arg Asp Ala Glu Gly Lys Ala Val Phe Pro Thr Gly Gly Thr Arg Val

ctg gcc tac ccc gcc gcc cat tgc gcg gtg gcc gac ggc aaa ggc gaa 192 Leu Ala Tyr Pro Ala Ala His Cys Ala Val Ala Asp Gly Lys Gly Glu

50 55 60

tac ggc ttt ctg tac gcc aac ctg cgc atg gct acc ggc cgt agc gcc 240 Tyr Gly Phe Leu Tyr Ala Asn Leu Arg Met Ala Thr Gly Arg Ser Ala65 70 75 80

gag gtg cac aaa aca gtg ggc gac agc ttg ctg gca gtg ttg aaa gcg 288 Glu Val His Lys Thr Val Gly Asp Ser Leu Leu Ala Val Leu Lys Ala

85 90 95

cgc ctg gac cca ctg ctg caa cag cgc ccg atc ggc atc acc gtg cag 336 Arg Leu Asp Pro Leu Leu Gln Gln Arg Pro Ile Gly Ile Thr Val Gln

100 105 110

atc gac cac agc acc gcc cag gtc tac gac gcc aag cac agc acc ttg 384 Ile Asp His Ser Thr Ala Gln Val Tyr Asp Ala Lys His Ser Thr Leu

115 120 125

cac cca ctg ttc aac cgc tag 405 His Pro Leu Phe Asn Arg

<210> 28

<211> 134

<212> PRT

<213> Pseudomonas putida U

<400> 28 Met Pro His Leu Val Leu Leu Tyr Thr Pro Asp Leu Glu Thr Asp Ala1 5 10 15

Asp Ile Pro Gly Leu Cys Arg Ala Leu Ala Asp Thr Met Leu Glu Gln20 25 30

Arg Asp Ala Glu Gly Lys Ala Val Phe Pro Thr Gly Gly Thr Arg Val35 40 45

Leu Ala Tyr Pro Ala Ala His Cys Ala Val Ala Asp Gly Lys Gly Glu50 55 60

Tyr Gly Phe Leu Tyr Ala Asn Leu Arg Met Ala Thr Gly Arg Ser Ala 65 70 75 80

Glu Val His Lys Thr Val Gly Asp Ser Leu Leu Ala Val Leu Lys Ala85 90 95

Arg Leu Asp Pro Leu Leu Gln Gln Arg Pro Ile Gly Ile Thr Val Gln100 105 110

Ile Asp His Ser Thr Ala Gln Val Tyr Asp Ala Lys His Ser Thr Leu115 120 125

His Pro Leu Phe Asn Arg130

<210> 29

<211> 660

<212> DNA

<213> Pseudomonas putida U

<220>

<221> CDS

<222> (1) .. (660)

<223> hpaG1

<400> 29 atg agc cat gcc ctg ctt gac gtt gcc agc ggc acc ctg ttc ggc gtc 48 Met Ser His Ala Leu Leu Asp Val Ala Ser Gly Thr Leu Phe Gly Val1 5 10 15

gcg ctg aac tac cag ggt ttg ctg cag cag cac caa gcg gcg ttc gtg 96 Ala Leu Asn Tyr Gln Gly Leu Leu Gln Gln His Gln Ala Ala Phe Val

gaa gca ccg tac aag caa ctg ccg gtc aag ccg gtg ttg ttc gtc aag 144 Glu Ala Pro Tyr Lys Gln Leu Pro Val Lys Pro Val Leu Phe Val Lys

acc ccg aac acc cgc aac cag cat gaa ggc cag gtg gta ttc ccg gcc 192 Thr Pro Asn Thr Arg Asn Gln His Glu Gly Gln Val Val Phe Pro Ala

50 55 60

ggc gtg cag cgc gtg caa ccc ggc ccg gcg ctg gga gtg gtg att ggc 240 Gly Val Gln Arg Val Gln Pro Gly Pro Ala Leu Gly Val Val Ile Gly65 70 75 80

aag gac gcc agc cgc gtc agc gtg gcc gat gcc ctg gag cat gtg gcg 288 Lys Asp Ala Ser Arg Val Ser Val Ala Asp Ala Leu Glu His Val Ala

85 90 95

ggc tac acc atc gtc aac gaa gtg agc ctg ccc gaa gcc agc tac tac 336 Gly Tyr Thr Ile Val Asn Glu Val Ser Leu Pro Glu Ala Ser Tyr Tyr

100 105 110

cgc cct gca gtc aag gcc aag tgc cgt gat ggt ttt tgc ccg gtc ggc 384 Arg Pro Ala Val Lys Ala Lys Cys Arg Asp Gly Phe Cys Pro Val Gly

115 120 125

cct gaa ctg gtg ccc gcc agc caa gtg gcc aac ccc gat gcc ctg ggcPro Glu Leu Val Pro Ala Ser Gln Val Ala Asn Pro Asp Ala Leu Gly130 135 140 432 ctg cgc ctg tat gtg aac ggc gaa ctg cgc cag cac aac aac acc gccLeu Arg Leu Tyr Val Asn Gly Glu Leu Arg Gln His Asn Asn Thr Ala145 150 155 160 480 aac tgc gta cgc acg gtg gcg cag ctg att gcc gaa atc agc gag ttcAsn Cys Val Arg Thr Val Ala Gln Leu Ile Ala Glu Ile Ser Glu Phe165 170 175 528 atg acc ctg cac gcc ggc gac atc ctg atc acc gga acc ccc gag ggcMet Thr Leu His Ala Gly Asp Ile Leu Ile Thr Gly Thr Pro Glu Gly180 185 190 576 cgc gtc gat gta cag cca ggt gac cgc gtc gac atc gag atc gac ggcArg Val Asp Val Gln Pro Gly Asp Arg Val Asp Ile Glu Ile Asp Gly195 200 205 624 ctg ggc aag ctg acc aac cac atc gtc gcc gag tgaLeu Gly Lys Leu Thr Asn His Ile Val Ala Glu210 215 660 <210> 30 <211> 219 <212> PRT <213> Pseudomonas putida U <400> 30 Met Ser His Ala Leu Leu Asp Val Ala Ser Gly Thr Leu Phe Gly Val1 5 10 15 Ala Leu Asn Tyr Gln Gly Leu Leu Gln Gln His Gln Ala Ala Phe Val20 25 30 Glu Ala Pro Tyr Lys Gln Leu Pro Val Lys Pro Val Leu Phe Val Lys35 40 45 Thr Pro Asn Thr Arg Asn Gln His Glu Gly Gln Val Val Phe Pro Ala50 55 60 Gly Val Gln Arg Val Gln Pro Gly Pro Ala Leu Gly Val Val Ile Gly65 70 75 80 Lys Asp Ala Ser Arg Val Ser Val Ala Asp Ala Leu Glu His Val Ala85 90 95 Gly Tyr Thr Ile Val Asn Glu Val Ser Leu Pro Glu Ala Ser Tyr Tyr100 105 110 Arg Pro Ala Val Lys Ala Lys Cys Arg Asp Gly Phe Cys Pro Val Gly115 120 125 Pro Glu Leu Val Pro Ala Ser Gln Val Ala Asn Pro Asp Ala Leu Gly130 135 140

Leu Arg Leu Tyr Val Asn Gly Glu Leu Arg Gln His Asn Asn Thr Ala145 150 155 160

Asn Cys Val Arg Thr Val Ala Gln Leu Ile Ala Glu Ile Ser Glu Phe165 170 175

Met Thr Leu His Ala Gly Asp Ile Leu Ile Thr Gly Thr Pro Glu Gly180 185 190

Arg Val Asp Val Gln Pro Gly Asp Arg Val Asp Ile Glu Ile Asp Gly195 200 205

Leu Gly Lys Leu Thr Asn His Ile Val Ala Glu210 215

<210> 31

<211> 765

<212> DNA

<213> Pseudomonas putida U

<220>

<221> CDS

<222> (1) .. (765)

<223> hpaG2

<400> 31 gtg aaa cac gcc cgt atc cag ttc gac ggc cag gcc cac gat gtc acg 48 Val Lys His Ala Arg Ile Gln Phe Asp Gly Gln Ala His Asp Val Thr1 5 10 15

gtc gaa gac gat cac ctg cgc ctt gcc gac ggc cgc ctg gtc cat cag 96 Val Glu Asp Asp His Leu Arg Leu Ala Asp Gly Arg Leu Val His Gln

gac cag gtc acc tgg ctg cca ccc gcc acc ggc agc atg ttc gcc ctg 144 Asp Gln Val Thr Trp Leu Pro Pro Ala Thr Gly Ser Met Phe Ala Leu

ggc ctg aac tac gcc gac cac gcc agg gag ctg gcc ttc gcg ccg ccc 192 Gly Leu Asn Tyr Ala Asp His Ala Arg Glu Leu Ala Phe Ala Pro Pro

50 55 60

acc gaa ccg ttg gct ttc atc aag tcg cca ggc acc tac acc ggc cac 240 Thr Glu Pro Leu Ala Phe Ile Lys Ser Pro Gly Thr Tyr Thr Gly His65 70 75 80

atc cag gtc acc tgg cgc ccg gac aac gtc gaa tac atg cac tac gag 288 Ile Gln Val Thr Trp Arg Pro Asp Asn Val Glu Tyr Met His Tyr Glu

85 90 95

tgc gag ctg gtg gcg gtg atc ggc aaa gcg gcg aag aac gtc aag cgt 336 Cys Glu Leu Val Ala Val Ile Gly Lys Ala Ala Lys Asn Val Lys Arg

100 105 110

gag gac gcc ctg gcc tac gtt gcc ggc tac acc gtg tgc aac gac tac 384 Glu Asp Ala Leu Ala Tyr Val Ala Gly Tyr Thr Val Cys Asn Asp Tyr

115 120 125 gcc atc cgc gac tac ctg gaa aac tac tac cgc ccc aac ctg cgg gtg 432 Ala Ile Arg Asp Tyr Leu Glu Asn Tyr Tyr Arg Pro Asn Leu Arg Val

130 135 140

aaa aac cgc gat gcc acc acc ccg gtc ggc ccg tgg atc gtc gat gcg 480 Lys Asn Arg Asp Ala Thr Thr Pro Val Gly Pro Trp Ile Val Asp Ala145 150 155 160

gcc gat gtg cca gac gtc agc aac ctg aag ctg cgc acc tgg atc aac 528 Ala Asp Val Pro Asp Val Ser Asn Leu Lys Leu Arg Thr Trp Ile Asn

165 170 175

ggt gag ctg aag cag gaa ggc acc acc gcg gac atg atc ttc gac atc 576 Gly Glu Leu Lys Gln Glu Gly Thr Thr Ala Asp Met Ile Phe Asp Ile

180 185 190

ccg cac ctc atc gaa tac ttc tcc agc ttc atg acc ctg caa ccg ggc 624 Pro His Leu Ile Glu Tyr Phe Ser Ser Phe Met Thr Leu Gln Pro Gly

195 200 205

gac atg atc gcc acc ggc acg cca gaa ggc ctg gcc gat gtg gtg ccg 672 Asp Met Ile Ala Thr Gly Thr Pro Glu Gly Leu Ala Asp Val Val Pro

210 215 220

ggt gac gaa gtg gtg gtg gaa gtg gaa ggc gtc ggt cgc ctg gtc aac 720 Gly Asp Glu Val Val Val Glu Val Glu Gly Val Gly Arg Leu Val Asn225 230 235 240

cgt atc gtc agc gaa gct gac ttc ttc aag aac aac aag gca tga 765 Arg Ile Val Ser Glu Ala Asp Phe Phe Lys Asn Asn Lys Ala

245 250

<210> 32

<211> 254

<212> PRT

<213> Pseudomonas putida U

<400> 32

Val Lys His Ala Arg Ile Gln Phe Asp Gly Gln Ala His Asp Val Thr1 5 10 15

Val Glu Asp Asp His Leu Arg Leu Ala Asp Gly Arg Leu Val His Gln20 25 30

Asp Gln Val Thr Trp Leu Pro Pro Ala Thr Gly Ser Met Phe Ala Leu35 40 45

Gly Leu Asn Tyr Ala Asp His Ala Arg Glu Leu Ala Phe Ala Pro Pro50 55 60

Thr Glu Pro Leu Ala Phe Ile Lys Ser Pro Gly Thr Tyr Thr Gly His65 70 75 80

Ile Gln Val Thr Trp Arg Pro Asp Asn Val Glu Tyr Met His Tyr Glu85 90 95

Cys Glu Leu Val Ala Val Ile Gly Lys Ala Ala Lys Asn Val Lys Arg

100 105 110 Glu Asp Ala Leu Ala Tyr Val Ala Gly Tyr Thr Val Cys Asn Asp Tyr115 120 125 Ala Ile Arg Asp Tyr Leu Glu Asn Tyr Tyr Arg Pro Asn Leu Arg Val130 135 140 Lys Asn Arg Asp Ala Thr Thr Pro Val Gly Pro Trp Ile Val Asp Ala145 150 155 160 Ala Asp Val Pro Asp Val Ser Asn Leu Lys Leu Arg Thr Trp Ile Asn165 170 175 Gly Glu Leu Lys Gln Glu Gly Thr Thr Ala Asp Met Ile Phe Asp Ile180 185 190 Pro His Leu Ile Glu Tyr Phe Ser Ser Phe Met Thr Leu Gln Pro Gly 195 200 205 Asp Met Ile Ala Thr Gly Thr Pro Glu Gly Leu Ala Asp Val Val Pro210 215 220 Gly Asp Glu Val Val Val Glu Val Glu Gly Val Gly Arg Leu Val Asn225 230 235 240 Arg Ile Val Ser Glu Ala Asp Phe Phe Lys Asn Asn Lys Ala245 250 <210> 33 <211> 804 <212> DNA <213> Pseudomonas putida U <220> <221> CDS <222> (1) .. (804) <223> hpaH <400> 33 atg cta gac aac gct ttc atc cag cac gcc gcc gac cgc ctc gac cagMet Leu Asp Asn Ala Phe Ile Gln His Ala Ala Asp Arg Leu Asp Gln1 5 10 15 48 gcc gaa cgc tcc cgc gag caa gtg cgc cag ttc tcg ctg gag caa ccgAla Glu Arg Ser Arg Glu Gln Val Arg Gln Phe Ser Leu Glu Gln Pro20 25 30 96 gca atc acc atc gaa gac gcc tac gcc atc cag cgc gcc tgg gtg gcaAla Ile Thr Ile Glu Asp Ala Tyr Ala Ile Gln Arg Ala Trp Val Ala35 40 45 144 aaa aag atc gcc gcc ggg cgc aag ctg gtg ggc cac aag atc ggc ctgLys Lys Ile Ala Ala Gly Arg Lys Leu Val Gly His Lys Ile Gly Leu50 55 60 192 acc tcg cgc gcc atg cag gta tcg tcg aac atc acc gag ccc gac tacThr Ser Arg Ala Met Gln Val Ser Ser Asn Ile Thr Glu Pro Asp Tyr 240

65 70 75 80

ggc gcc ttg ctc gac gac atg ctg ttc gac gaa ggc agc gac atc ccc 288 Gly Ala Leu Leu Asp Asp Met Leu Phe Asp Glu Gly Ser Asp Ile Pro

85 90 95

ttc gag cgc ttc atc gtg ccg cgg gtt gaa gtg gag ttg gcg ttc atc 336 Phe Glu Arg Phe Ile Val Pro Arg Val Glu Val Glu Leu Ala Phe Ile

100 105 110

ctc ggc aag ccg ctg aag ggc ccg aac atc acc gtg ttt gat gtg ctg 384 Leu Gly Lys Pro Leu Lys Gly Pro Asn Ile Thr Val Phe Asp Val Leu

115 120 125

gac gcc acc gag tgg gtg atc ccg gcg ctg gaa atc att gac gcg cgc 432 Asp Ala Thr Glu Trp Val Ile Pro Ala Leu Glu Ile Ile Asp Ala Arg

130 135 140

atc cag cag gtg gac ccg caa acc cag gcc acc cgc aag gtg ttc gac 480 Ile Gln Gln Val Asp Pro Gln Thr Gln Ala Thr Arg Lys Val Phe Asp145 150 155 160

acc atc tcc gac aac gcc gcc aat gcc ggc gtg gtg atg ggc ggg cgg 528 Thr Ile Ser Asp Asn Ala Ala Asn Ala Gly Val Val Met Gly Gly Arg

165 170 175

gcc gtg cgc ccc acc gaa atc gac ctg cgc aaa gtg ccg gcg gtg ctc 576 Ala Val Arg Pro Thr Glu Ile Asp Leu Arg Lys Val Pro Ala Val Leu

180 185 190

tac cgc aat ggc gtg atc gag gaa tcc ggg gtc agc gct gcc gtg ctc 624 Tyr Arg Asn Gly Val Ile Glu Glu Ser Gly Val Ser Ala Ala Val Leu

195 200 205

aac cac ccg gcc aaa ggc gtt gcc tgg ctg gcc aac aaa ctg gcg ccg 672 Asn His Pro Ala Lys Gly Val Ala Trp Leu Ala Asn Lys Leu Ala Pro

210 215 220

tac gac gtc acc ttg cag ccc ggc cag atc atc ctt ggg ggt tcg ttc 720 Tyr Asp Val Thr Leu Gln Pro Gly Gln Ile Ile Leu Gly Gly Ser Phe225 230 235 240

acc cgc ccg gtc gcc gct cgc cca ggt gac acc ttc cac gtc gac tac 768 Thr Arg Pro Val Ala Ala Arg Pro Gly Asp Thr Phe His Val Asp Tyr

245 250 255

gac atg ctc ggc tcc atc gcc tgc cgc ttc gtt taa 804 Asp Met Leu Gly Ser Ile Ala Cys Arg Phe Val

260 265

<210> 34

<211> 267

<212> PRT

<213> Pseudomonas putida U

<400> 34 Met Leu Asp Asn Ala Phe Ile Gln His Ala Ala Asp Arg Leu Asp Gln1 5 10 15

Ala Glu Arg Ser Arg Glu Gln Val Arg Gln Phe Ser Leu Glu Gln Pro20 25 30

Ala Ile Thr Ile Glu Asp Ala Tyr Ala Ile Gln Arg Ala Trp Val Ala35 40 45

Lys Lys Ile Ala Ala Gly Arg Lys Leu Val Gly His Lys Ile Gly Leu50 55 60

Thr Ser Arg Ala Met Gln Val Ser Ser Asn Ile Thr Glu Pro Asp Tyr65 70 75 80

Gly Ala Leu Leu Asp Asp Met Leu Phe Asp Glu Gly Ser Asp Ile Pro85 90 95

Phe Glu Arg Phe Ile Val Pro Arg Val Glu Val Glu Leu Ala Phe Ile100 105 110

Leu Gly Lys Pro Leu Lys Gly Pro Asn Ile Thr Val Phe Asp Val Leu115 120 125

Asp Ala Thr Glu Trp Val Ile Pro Ala Leu Glu Ile Ile Asp Ala Arg130 135 140

Ile Gln Gln Val Asp Pro Gln Thr Gln Ala Thr Arg Lys Val Phe Asp145 150 155 160

Thr Ile Ser Asp Asn Ala Ala Asn Ala Gly Val Val Met Gly Gly Arg165 170 175

Ala Val Arg Pro Thr Glu Ile Asp Leu Arg Lys Val Pro Ala Val Leu180 185 190

Tyr Arg Asn Gly Val Ile Glu Glu Ser Gly Val Ser Ala Ala Val Leu195 200 205

Asn His Pro Ala Lys Gly Val Ala Trp Leu Ala Asn Lys Leu Ala Pro 210 215 220

Tyr Asp Val Thr Leu Gln Pro Gly Gln Ile Ile Leu Gly Gly Ser Phe225 230 235 240

Thr Arg Pro Val Ala Ala Arg Pro Gly Asp Thr Phe His Val Asp Tyr245 250 255

Asp Met Leu Gly Ser Ile Ala Cys Arg Phe Val260 265

<210> 35

<211> 804

<212> DNA

<213> Pseudomonas putida U

<220>

<221> CDS

<222> (1) .. (804)

<223> hpaI <400> 35 atg gac atg ccc atc aac cac ttc aag cga cgc ctg cac agc ggt gaa 48 Met Asp Met Pro Ile Asn His Phe Lys Arg Arg Leu His Ser Gly Glu1 5 10 15

ccg caa atc ggc ctg tgg ctc ggc ctg gcc gat gcc tac tgc gcc gag 96 Pro Gln Ile Gly Leu Trp Leu Gly Leu Ala Asp Ala Tyr Cys Ala Glu

ctg gcg gcc aat gcc ggt ttc gac tgg ctg ctg atc gac ggc gaa cac 144 Leu Ala Ala Asn Ala Gly Phe Asp Trp Leu Leu Ile Asp Gly Glu His

gcg ccc aac gac ctg cgc ggc atg ctc gcc cag ttg cag gcg gtg gca 192 Ala Pro Asn Asp Leu Arg Gly Met Leu Ala Gln Leu Gln Ala Val Ala

50 55 60

ccc tac ccc agc cag gca gtg atc cgc ccg gtg atc ggc gat acc gcg 240 Pro Tyr Pro Ser Gln Ala Val Ile Arg Pro Val Ile Gly Asp Thr Ala65 70 75 80

ctg atc aag cag gtg ctg gat atc ggc gca caa acc ttg ctg gtg ccg 288 Leu Ile Lys Gln Val Leu Asp Ile Gly Ala Gln Thr Leu Leu Val Pro

85 90 95

atg gtg gaa act gcc gaa cag gcg cgg caa ctg gtc aag gcc atg cat 336 Met Val Glu Thr Ala Glu Gln Ala Arg Gln Leu Val Lys Ala Met His

100 105 110

tac ccg ccc aag ggc att cgc ggg gtg ggc agc gcg ctg gcg cgg gct 384 Tyr Pro Pro Lys Gly Ile Arg Gly Val Gly Ser Ala Leu Ala Arg Ala

115 120 125

tcg cgc tgg aac acc ctc ccc ggt tac ctg gac cac gcc gat gag caa 432 Ser Arg Trp Asn Thr Leu Pro Gly Tyr Leu Asp His Ala Asp Glu Gln

130 135 140

atg tgc ctg ctg gtg cag atc gag aac aag gaa ggc ctg gcc aac ctg 480 Met Cys Leu Leu Val Gln Ile Glu Asn Lys Glu Gly Leu Ala Asn Leu145 150 155 160

gac gag atc gtt gca gtg gaa ggt gtg gat ggc gtg ttc atc ggg cct 528 Asp Glu Ile Val Ala Val Glu Gly Val Asp Gly Val Phe Ile Gly Pro

165 170 175

gca gac ctg agt gcg gcc atg ggg cat cgc ggc aac ccc ggg cac ccg 576 Ala Asp Leu Ser Ala Ala Met Gly His Arg Gly Asn Pro Gly His Pro

180 185 190

gag gtg cag gcg gcg att gaa gac gca atc gtg cgc att ggc aag gcg 624 Glu Val Gln Ala Ala Ile Glu Asp Ala Ile Val Arg Ile Gly Lys Ala

195 200 205

ggc aaa gcc gcc ggc att ctc agc gcg gac gag aaa ctg gcg cga cgc 672 Gly Lys Ala Ala Gly Ile Leu Ser Ala Asp Glu Lys Leu Ala Arg Arg

210 215 220

tac atc gag ctg ggt gcg gcg ttt gtg gcg gtg ggt gtg gat acc acg 720

Tyr Ile Glu Leu Gly Ala Ala Phe Val Ala Val Gly Val Asp Thr Thr225 230 235 240 gtg ctg atg cgc ggg ctg cgc gag ctg gcg ggg aag ttc aag gat acaVal Leu Met Arg Gly Leu Arg Glu Leu Ala Gly Lys Phe Lys Asp Thr245 250 255 768 gtg gta gtc ccc agt gcc ggg ggt ggt gcc tac tgaVal Val Val Pro Ser Ala Gly Gly Gly Ala Tyr260 265 804 <210> 36 <211> 267 <212> PRT <213> Pseudomonas putida U <400> 36 Met Asp Met Pro Ile Asn His Phe Lys Arg Arg Leu His Ser Gly Glu1 5 10 15 Pro Gln Ile Gly Leu Trp Leu Gly Leu Ala Asp Ala Tyr Cys Ala Glu20 25 30 Leu Ala Ala Asn Ala Gly Phe Asp Trp Leu Leu Ile Asp Gly Glu His35 40 45 Ala Pro Asn Asp Leu Arg Gly Met Leu Ala Gln Leu Gln Ala Val Ala50 55 60 Pro Tyr Pro Ser Gln Ala Val Ile Arg Pro Val Ile Gly Asp Thr Ala65 70 75 80 Leu Ile Lys Gln Val Leu Asp Ile Gly Ala Gln Thr Leu Leu Val Pro85 90 95 Met Val Glu Thr Ala Glu Gln Ala Arg Gln Leu Val Lys Ala Met His100 105 110 Tyr Pro Pro Lys Gly Ile Arg Gly Val Gly Ser Ala Leu Ala Arg Ala115 120 125 Ser Arg Trp Asn Thr Leu Pro Gly Tyr Leu Asp His Ala Asp Glu Gln130 135 140 Met Cys Leu Leu Val Gln Ile Glu Asn Lys Glu Gly Leu Ala Asn Leu145 150 155 160 Asp Glu Ile Val Ala Val Glu Gly Val Asp Gly Val Phe Ile Gly Pro165 170 175 Ala Asp Leu Ser Ala Ala Met Gly His Arg Gly Asn Pro Gly His Pro180 185 190 Glu Val Gln Ala Ala Ile Glu Asp Ala Ile Val Arg Ile Gly Lys Ala195 200 205 Gly Lys Ala Ala Gly Ile Leu Ser Ala Asp Glu Lys Leu Ala Arg Arg210 215 220

Tyr Ile Glu Leu Gly Ala Ala Phe Val Ala Val Gly Val Asp Thr Thr225 230 235 240

Val Leu Met Arg Gly Leu Arg Glu Leu Ala Gly Lys Phe Lys Asp Thr245 250 255

Val Val Val Pro Ser Ala Gly Gly Gly Ala Tyr260 265

<210> 37

<211> 906

<212> DNA

<213> Pseudomonas putida U

<220>

<221> CDS

<222> (1) .. (906)

<223> hpaA

<400> 37 atg agc gac cgg cat ccg ata ccg aac atc aac att ggc cag gtt tac 48 Met Ser Asp Arg His Pro Ile Pro Asn Ile Asn Ile Gly Gln Val Tyr1 5 10 15

gac cag cgc tac agc gac agc gag gtg cat tac gac cgg ctg ggc aac 96 Asp Gln Arg Tyr Ser Asp Ser Glu Val His Tyr Asp Arg Leu Gly Asn

ctg gcg ggc ttt ttc ggg cgc aac atg ccg gtg cac cgg cat gac cgg 144 Leu Ala Gly Phe Phe Gly Arg Asn Met Pro Val His Arg His Asp Arg

ttt ttc cag gtg cat tac gtg aag tcg ggc aca gta cgg gtg tat ctg 192 Phe Phe Gln Val His Tyr Val Lys Ser Gly Thr Val Arg Val Tyr Leu

50 55 60

gat gac cag cag tac atc gag gcc ggg ccg atg ttc ttc ctc acg cca 240 Asp Asp Gln Gln Tyr Ile Glu Ala Gly Pro Met Phe Phe Leu Thr Pro65 70 75 80

ccc acg gtg gcg cac gcg ttc gtc acc gaa gct gac agc gac ggg cat 288 Pro Thr Val Ala His Ala Phe Val Thr Glu Ala Asp Ser Asp Gly His

85 90 95

gtg ctg acg gtg cgc cag caa ctg gtg tgg caa ttg atc gaa gcc gac 336 Val Leu Thr Val Arg Gln Gln Leu Val Trp Gln Leu Ile Glu Ala Asp

100 105 110

gcc agc ctg ctg ccg gcg ggc atg cag gtg cag cca gcc tgt gtg gcg 384 Ala Ser Leu Leu Pro Ala Gly Met Gln Val Gln Pro Ala Cys Val Ala

115 120 125

ctg ggc aac ctg ccg gcc gaa tac aag gcc gag gcg cag cgc ctg caa 432 Leu Gly Asn Leu Pro Ala Glu Tyr Lys Ala Glu Ala Gln Arg Leu Gln

130 135 140

ggc tgg ctg gac gcg ttg agt gac gag ttt gcc acg cag caa ccg ggt 480 Gly Trp Leu Asp Ala Leu Ser Asp Glu Phe Ala Thr Gln Gln Pro Gly145 150 155 160

cgc gag gcg gcg ttg cag tcg ctg acc cgc ctg atc atg atc agc ctg 528 Arg Glu Ala Ala Leu Gln Ser Leu Thr Arg Leu Ile Met Ile Ser Leu

165 170 175

ctg cgg ctg tgc ccc aac tcg ctg gaa tcg acc ccg gcg cgg cat gaa 576 Leu Arg Leu Cys Pro Asn Ser Leu Glu Ser Thr Pro Ala Arg His Glu

180 185 190

gac ctg aag atc ttc cac cgt ttc aat gcc ctg atc gaa gcg cat tac 624 Asp Leu Lys Ile Phe His Arg Phe Asn Ala Leu Ile Glu Ala His Tyr

195 200 205

ctt gag cat tgg ccg ctg gcc cgc tac gcg cag cag att ggc gtg acc 672 Leu Glu His Trp Pro Leu Ala Arg Tyr Ala Gln Gln Ile Gly Val Thr

210 215 220

gag gca cgg ctg aac gat gtg tgc cgg cgc atc gcc gac ttg cca tcc 720 Glu Ala Arg Leu Asn Asp Val Cys Arg Arg Ile Ala Asp Leu Pro Ser225 230 235 240

aag cgc ctg gtg ctg gaa cgg ctg atg cag gag gcc aag cgt ttg ctg 768 Lys Arg Leu Val Leu Glu Arg Leu Met Gln Glu Ala Lys Arg Leu Leu

245 250 255

ttg ttt tcc ggc agc acg gcc aac gaa atc tgt tac cag ctc ggc ttc 816 Leu Phe Ser Gly Ser Thr Ala Asn Glu Ile Cys Tyr Gln Leu Gly Phe

260 265 270

aag gat ccg gcc tat ttc agc cgc ttc ttc aac cgc tac gcc aag ctc 864 Lys Asp Pro Ala Tyr Phe Ser Arg Phe Phe Asn Arg Tyr Ala Lys Leu

275 280 285

aca ccc ggg gag tac cgc cag cgg cag gca gaa ttg cag tga 906 Thr Pro Gly Glu Tyr Arg Gln Arg Gln Ala Glu Leu Gln

290 295 300

<210> 38

<211> 301

<212> PRT

<213> Pseudomonas putida U

<400> 38

Met Ser Asp Arg His Pro Ile Pro Asn Ile Asn Ile Gly Gln Val Tyr1 5 10 15

Asp Gln Arg Tyr Ser Asp Ser Glu Val His Tyr Asp Arg Leu Gly Asn20 25 30

Leu Ala Gly Phe Phe Gly Arg Asn Met Pro Val His Arg His Asp Arg35 40 45

Phe Phe Gln Val His Tyr Val Lys Ser Gly Thr Val Arg Val Tyr Leu50 55 60

Asp Asp Gln Gln Tyr Ile Glu Ala Gly Pro Met Phe Phe Leu Thr Pro65 70 75 80

Pro Thr Val Ala His Ala Phe Val Thr Glu Ala Asp Ser Asp Gly His85 90 95

Val Leu Thr Val Arg Gln Gln Leu Val Trp Gln Leu Ile Glu Ala Asp100 105 110

Ala Ser Leu Leu Pro Ala Gly Met Gln Val Gln Pro Ala Cys Val Ala115 120 125

Leu Gly Asn Leu Pro Ala Glu Tyr Lys Ala Glu Ala Gln Arg Leu Gln130 135 140

Gly Trp Leu Asp Ala Leu Ser Asp Glu Phe Ala Thr Gln Gln Pro Gly145 150 155 160

Arg Glu Ala Ala Leu Gln Ser Leu Thr Arg Leu Ile Met Ile Ser Leu 165 170 175

Leu Arg Leu Cys Pro Asn Ser Leu Glu Ser Thr Pro Ala Arg His Glu180 185 190

Asp Leu Lys Ile Phe His Arg Phe Asn Ala Leu Ile Glu Ala His Tyr195 200 205

Leu Glu His Trp Pro Leu Ala Arg Tyr Ala Gln Gln Ile Gly Val Thr210 215 220

Glu Ala Arg Leu Asn Asp Val Cys Arg Arg Ile Ala Asp Leu Pro Ser225 230 235 240

Lys Arg Leu Val Leu Glu Arg Leu Met Gln Glu Ala Lys Arg Leu Leu245 250 255

Leu Phe Ser Gly Ser Thr Ala Asn Glu Ile Cys Tyr Gln Leu Gly Phe260 265 270

Lys Asp Pro Ala Tyr Phe Ser Arg Phe Phe Asn Arg Tyr Ala Lys Leu275 280 285

Thr Pro Gly Glu Tyr Arg Gln Arg Gln Ala Glu Leu Gln290 295 300

<210> 39

<211> 1308

<212> DNA

<213> Pseudomonas putida U

<220>

<221> CDS

<222> (1) .. (1308)

<223> hpaX

<400> 39 atg agc aca ctc gaa caa gcc tcg ccg cgc gag gca cac gtt gaa cgg 48 Met Ser Thr Leu Glu Gln Ala Ser Pro Arg Glu Ala His Val Glu Arg 1 5 10 15

gcc gac agt acc cat cgg gca gtc acc tgg cgg ctg atg ccg ctg ctg 96 Ala Asp Ser Thr His Arg Ala Val Thr Trp Arg Leu Met Pro Leu Leu

ctg gtg tgc tac ctg ttc gcc cac ctg gac cgc atc aac att ggc ttc 144 Leu Val Cys Tyr Leu Phe Ala His Leu Asp Arg Ile Asn Ile Gly Phe

gcc aag atg cag atg agc cag gac ctg cat ttg tcc gac acg gtc tat 192 Ala Lys Met Gln Met Ser Gln Asp Leu His Leu Ser Asp Thr Val Tyr

50 55 60

ggc ctg ggt gcc ggg ctg ttc ttc att gcc tat gcg ctg ttc ggc gtc 240 Gly Leu Gly Ala Gly Leu Phe Phe Ile Ala Tyr Ala Leu Phe Gly Val65 70 75 80

ccc agc aac ctg atg ctc gac cgc gtt ggc cca cgc cgc tgg atc gcc 288 Pro Ser Asn Leu Met Leu Asp Arg Val Gly Pro Arg Arg Trp Ile Ala

85 90 95

tgc ctg atg gtg gtg tgg ggg ctg ttg tcg acc agc atg ctg ctg atc 336 Cys Leu Met Val Val Trp Gly Leu Leu Ser Thr Ser Met Leu Leu Ile

100 105 110

gaa agc agc agc gcg ttc tac ctg ttg cgc ttt gcc ctg ggc gcg gcc 384 Glu Ser Ser Ser Ala Phe Tyr Leu Leu Arg Phe Ala Leu Gly Ala Ala

115 120 125

gag gcc ggg ttc ttc ccg ggc att ctg gtt tac ctc aac cgc tgg tac 432 Glu Ala Gly Phe Phe Pro Gly Ile Leu Val Tyr Leu Asn Arg Trp Tyr

130 135 140

ccg gcc ggg cgc cgc gcc cag gtc acc gcg ctg ttc gcc att gcc gtg 480 Pro Ala Gly Arg Arg Ala Gln Val Thr Ala Leu Phe Ala Ile Ala Val 145 150 155 160

ccg ttg gcc gga gtg gtc ggc ggg cca gtg tcc ggg gcc ata ctg gcc 528 Pro Leu Ala Gly Val Val Gly Gly Pro Val Ser Gly Ala Ile Leu Ala

165 170 175

ttc atg cac gac acg ggc ggg ctg cgt ggc tgg cag tgg atg ttc ctg 576 Phe Met His Asp Thr Gly Gly Leu Arg Gly Trp Gln Trp Met Phe Leu

180 185 190

ctc gaa ggg gcg ccg gtg gtg ttg ctg ggc ctg gtg gta ctg gcc gtt 624 Leu Glu Gly Ala Pro Val Val Leu Leu Gly Leu Val Val Leu Ala Val

195 200 205

ttg ccg gag cac ttc gag cgg gtg agc tgg ctg gat gag cag cag aaa 672 Leu Pro Glu His Phe Glu Arg Val Ser Trp Leu Asp Glu Gln Gln Lys

210 215 220

gcc acg ctg cgc gcg caa ttc ggt gag gaa gaa cag cgc aag ccc gta 720 Ala Thr Leu Arg Ala Gln Phe Gly Glu Glu Glu Gln Arg Lys Pro Val225 230 235 240

acc tcg ttc ggc gcc att ttc gca agc cgt gcg ctg tgg ctg ttg gtg 768

Thr Ser Phe Gly Ala Ile Phe Ala Ser Arg Ala Leu Trp Leu Leu Val245 250 255

gcc gtg tat tgc gcg gtg atg ctg gcg gtg aat acc ctt gcg ttc tgg 816 Ala Val Tyr Cys Ala Val Met Leu Ala Val Asn Thr Leu Ala Phe Trp

260 265 270

atg ccc agc ctg att cac agt gcc ggt gtg gcc agc gac gcc agt gtc 864 Met Pro Ser Leu Ile His Ser Ala Gly Val Ala Ser Asp Ala Ser Val

275 280 285

ggc ctg ctc agc gct gtg ccg tac gtg gcc ggc tgc gtg ttc atg ctg 912 Gly Leu Leu Ser Ala Val Pro Tyr Val Ala Gly Cys Val Phe Met Leu

290 295 300

gcg tgc ggc cgc tcc agc gac cgc caa cgc gaa cgc cgc tgg cac ctg 960 Ala Cys Gly Arg Ser Ser Asp Arg Gln Arg Glu Arg Arg Trp His Leu305 310 315 320

tgc gta ccg ctg ctg atg gct gcc atc ggc atc gct att gcg gcc att 1008 Cys Val Pro Leu Leu Met Ala Ala Ile Gly Ile Ala Ile Ala Ala Ile

325 330 335

gcc ccc gag cag gcg ctg ccg gta atg gcc ggc ctg gtg ctg gcc ggc 1056 Ala Pro Glu Gln Ala Leu Pro Val Met Ala Gly Leu Val Leu Ala Gly

340 345 350

atg ggc gcc agc gct gcg ctg ccg atg ttc tgg caa ctg ccg ccg gcg 1104 Met Gly Ala Ser Ala Ala Leu Pro Met Phe Trp Gln Leu Pro Pro Ala

355 360 365

ttc ctc aac gcc cgt acc cag gcc gcc ggc att gcc ctg atc agc tcg 1152 Phe Leu Asn Ala Arg Thr Gln Ala Ala Gly Ile Ala Leu Ile Ser Ser

370 375 380

ctg ggc agc atc gcc tcg ttc ttc acg ccc tac ttc atc ggc tgg gtg 1200 Leu Gly Ser Ile Ala Ser Phe Phe Thr Pro Tyr Phe Ile Gly Trp Val385 390 395 400

cgc gac acc acc cac agc gcc agc ctt gct ctg tac gta ctc gcc gtc 1248 Arg Asp Thr Thr His Ser Ala Ser Leu Ala Leu Tyr Val Leu Ala Val

405 410 415

ttc atc gcc ctg ggc ggc ctg ctg gtg ttg cgc acc cag gct gcc atc 1296 Phe Ile Ala Leu Gly Gly Leu Leu Val Leu Arg Thr Gln Ala Ala Ile

420 425 430

gtc aac cct tga 1308 Val Asn Pro

<210> 40

<211> 435

<212> PRT

<213> Pseudomonas putida U

<400> 40 Met Ser Thr Leu Glu Gln Ala Ser Pro Arg Glu Ala His Val Glu Arg1 5 10 15

Ala Asp Ser Thr His Arg Ala Val Thr Trp Arg Leu Met Pro Leu Leu20 25 30

Leu Val Cys Tyr Leu Phe Ala His Leu Asp Arg Ile Asn Ile Gly Phe35 40 45

Ala Lys Met Gln Met Ser Gln Asp Leu His Leu Ser Asp Thr Val Tyr50 55 60

Gly Leu Gly Ala Gly Leu Phe Phe Ile Ala Tyr Ala Leu Phe Gly Val65 70 75 80

Pro Ser Asn Leu Met Leu Asp Arg Val Gly Pro Arg Arg Trp Ile Ala85 90 95

Cys Leu Met Val Val Trp Gly Leu Leu Ser Thr Ser Met Leu Leu Ile100 105 110

Glu Ser Ser Ser Ala Phe Tyr Leu Leu Arg Phe Ala Leu Gly Ala Ala115 120 125

Glu Ala Gly Phe Phe Pro Gly Ile Leu Val Tyr Leu Asn Arg Trp Tyr130 135 140

Pro Ala Gly Arg Arg Ala Gln Val Thr Ala Leu Phe Ala Ile Ala Val145 150 155 160

Pro Leu Ala Gly Val Val Gly Gly Pro Val Ser Gly Ala Ile Leu Ala165 170 175

Phe Met His Asp Thr Gly Gly Leu Arg Gly Trp Gln Trp Met Phe Leu180 185 190

Leu Glu Gly Ala Pro Val Val Leu Leu Gly Leu Val Val Leu Ala Val195 200 205

Leu Pro Glu His Phe Glu Arg Val Ser Trp Leu Asp Glu Gln Gln Lys210 215 220

Ala Thr Leu Arg Ala Gln Phe Gly Glu Glu Glu Gln Arg Lys Pro Val225 230 235 240

Thr Ser Phe Gly Ala Ile Phe Ala Ser Arg Ala Leu Trp Leu Leu Val 245 250 255

Ala Val Tyr Cys Ala Val Met Leu Ala Val Asn Thr Leu Ala Phe Trp260 265 270

Met Pro Ser Leu Ile His Ser Ala Gly Val Ala Ser Asp Ala Ser Val275 280 285

Gly Leu Leu Ser Ala Val Pro Tyr Val Ala Gly Cys Val Phe Met Leu290 295 300

Ala Cys Gly Arg Ser Ser Asp Arg Gln Arg Glu Arg Arg Trp His Leu305 310 315 320

Cys Val Pro Leu Leu Met Ala Ala Ile Gly Ile Ala Ile Ala Ala Ile325 330 335 Ala Pro Glu Gln Ala Leu Pro Val Met Ala Gly Leu Val Leu Ala Gly340 345 350 Met Gly Ala Ser Ala Ala Leu Pro Met Phe Trp Gln Leu Pro Pro Ala355 360 365 Phe Leu Asn Ala Arg Thr Gln Ala Ala Gly Ile Ala Leu Ile Ser Ser370 375 380 Leu Gly Ser Ile Ala Ser Phe Phe Thr Pro Tyr Phe Ile Gly Trp Val385 390 395 400 Arg Asp Thr Thr His Ser Ala Ser Leu Ala Leu Tyr Val Leu Ala Val405 410 415 Phe Ile Ala Leu Gly Gly Leu Leu Val Leu Arg Thr Gln Ala Ala Ile420 425 430 Val Asn Pro 435 <210> 41 <211> 423 <212> DNA <213> Pseudpmonas putida U <220> <221> CDS <222> (1) .. (423) <223> hpaR1 <400> 41 atg acc aca ccg aga ccc tcc ctg acc ctg acc ttg ctg cag gcg cgcMet Thr Thr Pro Arg Pro Ser Leu Thr Leu Thr Leu Leu Gln Ala Arg1 5 10 15 48 gaa gcc acc atg gcg ttc ttc cgc ccg gcg ctg aat gcc cat gac ctgGlu Ala Thr Met Ala Phe Phe Arg Pro Ala Leu Asn Ala His Asp Leu20 25 30 96 acc gag cag caa tgg cgg gta atc cgt atc ctg cgc cag caa ggc gagThr Glu Gln Gln Trp Arg Val Ile Arg Ile Leu Arg Gln Gln Gly Glu35 40 45 144 ctg gaa agc cat cag ttg gcg gag ctg gcc tgt atc ctc aaa ccc agtLeu Glu Ser His Gln Leu Ala Glu Leu Ala Cys Ile Leu Lys Pro Ser50 55 60 192 atg agc ggg gtg ctc aag cgc ctg gag cgt gac ggc atc gta gcg cggMet Ser Gly Val Leu Lys Arg Leu Glu Arg Asp Gly Ile Val Ala Arg65 70 75 80 240 cgc aag tcg ccg gag gac cag cgc cgg gtg ttc atc agc ctg acc gagArg Lys Ser Pro Glu Asp Gln Arg Arg Val Phe Ile Ser Leu Thr Glu 288

85 90 95

gcc ggc cag caa gcg ttt ctg gcg atg agc gag gag atg acc cgc aacAla Gly Gln Gln Ala Phe Leu Ala Met Ser Glu Glu Met Thr Arg Asn100 105 110 336 tac gac aag atc ctc gcc cag ttt ggc gat gac aag ctg cag cag ctgTyr Asp Lys Ile Leu Ala Gln Phe Gly Asp Asp Lys Leu Gln Gln Leu115 120 125 384 atg cag ctg ctg ggt gaa atg aag aag atc aaa ccc tgaMet Gln Leu Leu Gly Glu Met Lys Lys Ile Lys Pro130 135 140 423 <210> <211> <212> <213> <400> 42 140 PRT Pseudpmonas putida U 42 Met Thr Thr Pro Arg Pro Ser Leu Thr Leu Thr Leu Leu Gln Ala Arg1 5 10 15 Glu Ala Thr Met Ala Phe Phe Arg Pro Ala Leu Asn Ala His Asp Leu20 25 30 Thr Glu Gln Gln Trp Arg Val Ile Arg Ile Leu Arg Gln Gln Gly Glu35 40 45 Leu Glu Ser His Gln Leu Ala Glu Leu Ala Cys Ile Leu Lys Pro Ser50 55 60 Met Ser Gly Val Leu Lys Arg Leu Glu Arg Asp Gly Ile Val Ala Arg65 70 75 80 Arg Lys Ser Pro Glu Asp Gln Arg Arg Val Phe Ile Ser Leu Thr Glu85 90 95 Ala Gly Gln Gln Ala Phe Leu Ala Met Ser Glu Glu Met Thr Arg Asn100 105 110 Tyr Asp Lys Ile Leu Ala Gln Phe Gly Asp Asp Lys Leu Gln Gln Leu115 120 125 Met Gln Leu Leu Gly Glu Met Lys Lys Ile Lys Pro130 135 140 <210> <211> <212> <213> <220> <221> <222> <223> 43 423 DNA Pseudomonas putida U CDS (1) .. (423) hpaR2

<400> 43 atg acc aag acg caa cct tcg ctc acg cta agc ctg ttg cag gcc cga 48 Met Thr Lys Thr Gln Pro Ser Leu Thr Leu Ser Leu Leu Gln Ala Arg1 5 10 15

gaa gcc gcg atg gca ttt ttc agg ccg ctg ttg aac cag cac gac ctg 96 Glu Ala Ala Met Ala Phe Phe Arg Pro Leu Leu Asn Gln His Asp Leu

acc gag cag caa tgg cgg gta atc cgc atc ctc aag cag cac ggc gag 144 Thr Glu Gln Gln Trp Arg Val Ile Arg Ile Leu Lys Gln His Gly Glu

ctg gag aat tat cag ttg gcg gaa ctg gcc tgc atc ctc aag ccg agc 192 Leu Glu Asn Tyr Gln Leu Ala Glu Leu Ala Cys Ile Leu Lys Pro Ser

50 55 60

atg acc ggg gta ctg ggg cgc ctg gag cga gac ggg ctg gtg cgg cgg 240 Met Thr Gly Val Leu Gly Arg Leu Glu Arg Asp Gly Leu Val Arg Arg65 70 75 80

cag aag gcc gcg cag gac cag cga cgg gtg ttc gtc agc ctg acc gaa 288 Gln Lys Ala Ala Gln Asp Gln Arg Arg Val Phe Val Ser Leu Thr Glu

85 90 95

aga ggg gag gcg tgc ttt gcc tcg atg aag gaa ggc atg gag gcc aac 336 Arg Gly Glu Ala Cys Phe Ala Ser Met Lys Glu Gly Met Glu Ala Asn

100 105 110

tac cag aag att cag gcg cag ttt ggt gaa gag aag ctg cag cag ctg 384 Tyr Gln Lys Ile Gln Ala Gln Phe Gly Glu Glu Lys Leu Gln Gln Leu

115 120 125

atg ggg ttg ttg aat gac ctg aag cgc atc gcg cca taa 423 Met Gly Leu Leu Asn Asp Leu Lys Arg Ile Ala Pro

130 135 140

<210> 44

<211> 140

<212> PRT

<213> Pseudomonas putida U

<400> 44

Met Thr Lys Thr Gln Pro Ser Leu Thr Leu Ser Leu Leu Gln Ala Arg1 5 10 15

Glu Ala Ala Met Ala Phe Phe Arg Pro Leu Leu Asn Gln His Asp Leu20 25 30

Thr Glu Gln Gln Trp Arg Val Ile Arg Ile Leu Lys Gln His Gly Glu35 40 45

Leu Glu Asn Tyr Gln Leu Ala Glu Leu Ala Cys Ile Leu Lys Pro Ser50 55 60

Met Thr Gly Val Leu Gly Arg Leu Glu Arg Asp Gly Leu Val Arg Arg 65 70 75 80

Gln Lys Ala Ala Gln Asp Gln Arg Arg Val Phe Val Ser Leu Thr Glu85 90 95

Arg Gly Glu Ala Cys Phe Ala Ser Met Lys Glu Gly Met Glu Ala Asn100 105 110

Tyr Gln Lys Ile Gln Ala Gln Phe Gly Glu Glu Lys Leu Gln Gln Leu115 120 125

Met Gly Leu Leu Asn Asp Leu Lys Arg Ile Ala Pro130 135 140

<210> 45

<211> 12722

<212> DNA

<213> Pseudomonas putida U

<220>

<221> misc_feature

<222> (1) .. (12722)

<223> cluster hpa

<400> 45 atgaccacac cgagaccctc cctgaccctg accttgctgc aggcgcgcga agccaccatg 60 gcgttcttcc gcccggcgct gaatgcccat gacctgaccg agcagcaatg gcgggtaatc 120 cgtatcctgc gccagcaagg cgagctggaa agccatcagt tggcggagct ggcctgtatc 180 ctcaaaccca gtatgagcgg ggtgctcaag cgcctggagc gtgacggcat cgtagcgcgg 240 cgcaagtcgc cggaggacca gcgccgggtg ttcatcagcc tgaccgaggc cggccagcaa 300 gcgtttctgg cgatgagcga ggagatgacc cgcaactacg acaagatcct cgcccagttt 360 ggcgatgaca agctgcagca gctgatgcag ctgctgggtg aaatgaagaa gatcaaaccc 420 tgacgcgcca ggcgtcagcg gttgagtgac agcgagtctt ccagcacttt cagcagtgct 480 gccgcgcgcc gctcataggc gtcggggcct gcgtacatca gctctacata caggctgtcg 540 atgatgccca ggtaggcatc ggcatacagc gccaggcggc tgtgctgctc atgcgcccag 600 ccgtggcgag cttgcagggc cacgctgaac ccttcgcgta tgccgtccag gtactgttca 660 aagcccgaag tgacaatcgg cttgatgccc gccgggggca ggaacgccgt gcgcaacacg 720 aagcgcagtt gggccgagtc gcgataacgt tcggccaggt gcagggccag ccagtgcccc 780 gccgccaggc cgtcgcgggc ttcctgcgca aagccgtgct cgacaaaggc cgtttcctgc 840 acaagcgcac gctggaacac ctccacgaac aaggcgtcct tgttggcgaa atgcgcatac 900 agcgatgcct tgcgcatgcc cgccaactgg gcgatttcgt tcagcgaaga ggcgtcataa 960 ccgtactcgg cgaagtggcc gacggcggca tcgcacacac gcaccgcaga aggggaaagg 1020 tctttcaaca gcatcactcc gtcaggggcg cggcgggccg cgcgcgtctt gagggtggga 1080 ttgtggtgat cgaaaatgca cgggtcaatg cttgtcgcaa ggcaatttcc gggcgccatg 1140 gaaagtgcaa tgttcccctc gtaacgtgca ttcctccacc caatcgccgc tcacatactg 1200 atcgcgtctt cgaatccaat aagaaagaga ccgctcatga aaaagccaaa ccccctgctg 1260 gaagacctga agtccgtcct gccgaccatt gccgccaatg ccatgcgtgc agagcaggac 1320 cgcagtgtgc cggcagagaa tatcgccttg ctgaaaagca tcggcatgca ccgcgctttc 1380 ttgcccaaac acttcggcgg catggaaatc accctgccgg agttcgccca gtgcatcgcc 1440 ttgctggcgg gggcctgcgc cagcacagcc tgggccatga gcctgctgtg cacccacagc 1500 caccagatgg caatgttctc gcccaagcta caacaggagg tgtggggtag cgacccggat 1560 gctaccgcca gcagcagtat cgcgccgttc ggccgcactg aagaggttga gggtggcgtg 1620 tcgttcagcg gcgaaatggg ctggagttcc ggttgcgacc acgccgaatg ggcgattctc 1680 ggtttccgcc gcaagaatgc cgaaggcgct caggattact gcttcgccat cctgcctcgc 1740 agtgactatg aaatccgtga tgactggtat gccgtgggca tgcgcggcag cggcagcaag 1800 accctgatcg tgcgtgatgc cttcgtgccc gagcaccgca tccagaaggc caaggacatg 1860 atggagggca agtcggcggg ctttggtttg taccccgaca gcaagatttt cttcgccccg 1920 tatcgcccgt attttgccag cggcttctcc acggtcagct tgggcgttgc cgagcgcatg 1980 ctggaggtgt tccgcgagaa aacccgcaac cgcgtgcgtg cctacaccgg tgctgccgtg 2040 ggcgccgcca ccccggcgct gatgcgcctg gccgagtcga cccatcaggt ggccgctgcc 2100 cgggcattgc tggaaaagag ctgggacgag attgccgagc acagtgcccg tcacgaatac 2160 ccgtcgcgtg gcacgctggc gttctggcgt accaaccagg gctacgccgt gaagatgtgc 2220 atccaggccg tcgaccgcct gatggaagcg gccggtggtg gcgcctggtt cgagagcaac 2280 gaactgcagc ggctgttccg cgattcgcac atgaccggtg cccatgccta caccgattac 2340 gacgtgtgtg cgcaaatcct cggccgcgag ctgatgggcc tggagcctga cccggcgatg 2400 gtctgagccg ccacttgttt tcacccatcc cctacaagca caacaacaaa cagggcaggc 2460 tgccaggcct gcccgggagt cttgcatgtc caaagaaacc ttcgattcac gtgccttccg 2520 ccgcgccctg ggcaacttcg ccaccggcgt gaccgtggtg actgccgccg gccccagtgg 2580 ccgcaaggtc ggcgttaccg ccaacagctt caactcggtg tcgctggacc cggcgctgat 2640 cctgtggagc atcgacaagc gctccaccag ccatgaagtg ttcgaagagg cctcgcactt 2700 tgccgtgaac attctggctg cggaccagat cgacctgtcc aacaactttg cccgcccgaa 2760 ggaagatcgc tttgccggta tcgactacga gaccggcact ggcggcgcgc cgttgttcgc 2820 cgattgcgcg gcgcgctttg agtgtgaaaa gtaccagcag ctggacggtg gcgatcactg 2880 gatcctggtg ggcaaggtag tggcctttga tgactttggc cgctcgccgc tgctgtatca 2940 ccagggcgcc tattcaatgg tgctgccgca tacccgcatg acccaaggcg cagaggggca 3000 ggcaccgagc agccacttcc agggccgcct gcagcacaac ctgtactacc tgatgaccca 3060 ggcgctgcgt gcctaccagg ctgactacca gccacgccag ctgtgtaccg gcctgcgcac 3120 cagcgaggca cgcatgctga tggtgctgga gaacgatgcg ggcctgagcc tgaacgacct 3180 gcaacgcgaa gtggcgatgc cggcgcggga gatcgaggaa gcggttgcca acctcaagcg 3240 caaagggctg attgccgatg acgaagggcg agtgcggcta tcggtgaagg gcgtggacga 3300 gaccgaggcg ttgtggacca ttgcccggca acagcaggac aaggtgttcg ggcagttcag 3360 tgaacagcag ctggagactt tcaagaccgt gctcaaggcc cttatcaaca tctgaacacg 3420 ctttgggatg gcaccggctg ttttggatgg caccggctgt gccggtgttc gcggatgaac 3480 ccgctcccac aggtccagcg ccagtagcaa cttcggcgcg gtacctgtgg gagcggcttt 3540 agccgcgaac accggcaaag ccggtgccat ccaaccagaa gcctcagtag gcaccacccc 3600 cggcactggg gactaccact gtatccttga acttccccgc cagctcgcgc agcccgcgca 3660 tcagcaccgt ggtatccaca cccaccgcca caaacgccgc acccagctcg atgtagcgtc 3720 gcgccagttt ctcgtccgcg ctgagaatgc cggcggcttt gcccgccttg ccaatgcgca 3780 cgattgcgtc ttcaatcgcc gcctgcacct ccgggtgccc ggggttgccg cgatgcccca 3840 tggccgcact caggtctgca ggcccgatga acacgccatc cacaccttcc actgcaacga 3900 tctcgtccag gttggccagg ccttccttgt tctcgatctg caccagcagg cacatttgct 3960 catcggcgtg gtccaggtaa ccggggaggg tgttccagcg cgaagcccgc gccagcgcgc 4020 tgcccacccc gcgaatgccc ttgggcgggt aatgcatggc cttgaccagt tgccgcgcct 4080 gttcggcagt ttccaccatc ggcaccagca aggtttgtgc gccgatatcc agcacctgct 4140 tgatcagcgc ggtatcgccg atcaccgggc ggatcactgc ctggctgggg tagggtgcca 4200 ccgcctgcaa ctgggcgagc atgccgcgca ggtcgttggg cgcgtgttcg ccgtcgatca 4260 gcagccagtc gaaaccggca ttggccgcca gctcggcgca gtaggcatcg gccaggccga 4320 gccacaggcc gatttgcggt tcaccgctgt gcaggcgtcg cttgaagtgg ttgatgggca 4380 tgtccatgag caggtcctta aacgaagcgg caggcgatgg agccgagcat gtcgtagtcg 4440 acgtggaagg tgtcacctgg gcgagcggcg accgggcggg tgaacgaacc cccaaggatg 4500 atctggccgg gctgcaaggt gacgtcgtac ggcgccagtt tgttggccag ccaggcaacg 4560 cctttggccg ggtggttgag cacggcagcg ctgaccccgg attcctcgat cacgccattg 4620 cggtagagca ccgccggcac tttgcgcagg tcgatttcgg tggggcgcac ggcccgcccg 4680 cccatcacca cgccggcatt ggcggcgttg tcggagatgg tgtcgaacac cttgcgggtg 4740 gcctgggttt gcgggtccac ctgctggatg cgcgcgtcaa tgatttccag cgccgggatc 4800 acccactcgg tggcgtccag cacatcaaac acggtgatgt tcgggccctt cagcggcttg 4860 ccgaggatga acgccaactc cacttcaacc cgcggcacga tgaagcgctc gaaggggatg 4920 tcgctgcctt cgtcgaacag catgtcgtcg agcaaggcgc cgtagtcggg ctcggtgatg 4980 ttcgacgata cctgcatggc gcgcgaggtc aggccgatct tgtggcccac cagcttgcgc 5040 ccggcggcga tcttttttgc cacccaggcg cgctggatgg cgtaggcgtc ttcgatggtg 5100 attgccggtt gctccagcga gaactggcgc acttgctcgc gggagcgttc ggcctggtcg 5160 aggcggtcgg cggcgtgctg gatgaaagcg ttgtctagca tgggggcggt ctcttgattc 5220 aagggttgac gatggcagcc tgggtgcgca acaccagcag gccgcccagg gcgatgaaga 5280 cggcgagtac gtacagagca aggctggcgc tgtgggtggt gtcgcgcacc cagccgatga 5340 agtagggcgt gaagaacgag gcgatgctgc ccagcgagct gatcagggca atgccggcgg 5400 cctgggtacg ggcgttgagg aacgccggcg gcagttgcca gaacatcggc agcgcagcgc 5460 tggcgcccat gccggccagc accaggccgg ccattaccgg cagcgcctgc tcgggggcaa 5520 tggccgcaat agcgatgccg atggcagcca tcagcagcgg tacgcacagg tgccagcggc 5580 gttcgcgttg gcggtcgctg gagcggccgc acgccagcat gaacacgcag ccggccacgt 5640 acggcacagc gctgagcagg ccgacactgg cgtcgctggc cacaccggca ctgtgaatca 5700 ggctgggcat ccagaacgca agggtattca ccgccagcat caccgcgcaa tacacggcca 5760 ccaacagcca cagcgcacgg cttgcgaaaa tggcgccgaa cgaggttacg ggcttgcgct 5820 gttcttcctc accgaattgc gcgcgcagcg tggctttctg ctgctcatcc agccagctca 5880 cccgctcgaa gtgctccggc aaaacggcca gtaccaccag gcccagcaac accaccggcg 5940 ccccttcgag caggaacatc cactgccagc cacgcagccc gcccgtgtcg tgcatgaagg 6000 ccagtatggc cccggacact ggcccgccga ccactccggc caacggcacg gcaatggcga 6060 acagcgcggt gacctgggcg cggcgcccgg ccgggtacca gcggttgagg taaaccagaa 6120 tgcccgggaa gaacccggcc tcggccgcgc ccagggcaaa gcgcaacagg tagaacgcgc 6180 tgctgctttc gatcagcagc atgctggtcg acaacagccc ccacaccacc atcaggcagg 6240 cgatccagcg gcgtgggcca acgcggtcga gcatcaggtt gctggggacg ccgaacagcg 6300 cataggcaat gaagaacagc ccggcaccca ggccatagac cgtgtcggac aaatgcaggt 6360 cctggctcat ctgcatcttg gcgaagccaa tgttgatgcg gtccaggtgg gcgaacaggt 6420 agcacaccag cagcagcggc atcagccgcc aggtgactgc ccgatgggta ctgtcggccc 6480 gttcaacgtg tgcctcgcgc ggcgaggctt gttcgagtgt gctcatgttt ttgtacttat 6540 tctgtaatga gtcggggagg gcgtggtttg agccggcgcg ctagcggttg aacagtgggt 6600 gcaaggtgct gtgcttggcg tcgtagacct gggcggtgct gtggtcgatc tgcacggtga 6660 tgccgatcgg gcgctgttgc agcagtgggt ccaggcgcgc tttcaacact gccagcaagc 6720 tgtcgcccac tgttttgtgc acctcggcgc tacggccggt agccatgcgc aggttggcgt 6780 acagaaagcc gtattcgcct ttgccgtcgg ccaccgcgca atgggcggcg gggtaggcca 6840 gcacgcgtgt accgccagtg gggaacacgg ctttgccttc ggcatcgcgc tgttcgagca 6900 tggtgtcggc cagggcgcgg cacaggccgg ggatgtcggc gtcggtttcc aggtcggggg 6960 tatagagcag aaccaggtgt ggcatggggg cctcctcggt gaggggcggc tggccacccg 7020 ccagggcgac cagccgcgaa cgggtgggtt acaggcggct ggtgggcacc acggcggccg 7080 ggttggcggc ctgggcagcg gggatggcac caccgtcctg cggggtgacc gggaagatcg 7140 cgttgatctg gccggtgccg gaagagccga agtagggcgt gaccacttcg gccttgccgt 7200 cgtaatcgga ccagcccagc gcacccagca gcattgccgt gtcgtgcatg aagccttcac 7260 cgtggccttt ggcggcgtac tccggcagca tcccgcagaa cgcttcccac tcgccgtcct 7320 gccacatttg caccacacgg tggtcgaggg tttcgaggaa cgggctccac accttggtgg 7380 caaagtccgg cgcctggccg ttctgcgcga agcggtgcga cagcgagccg ctggccagga 7440 acgccacggt gccgtcgtag tggtcttcta ctgccttgcg catggcccag cccaggcggg 7500 cactgtcggc caggtagtgc gaggtgcaca gggccgagac cgagaccact ttgaagtgct 7560 ggtcctggtt catgtagcgc atgggcacca gggtgccgta ttccggggcg agggtggtgg 7620 cgtggtgggc catggtttcg acgttgaagc ggttgcactc ctcggccagc agcttgccca 7680 gctcgggatt gccggggaat gcgtagggca tgttgctgat gaagtgcggc agttcgttgc 7740 tggtgtacac gccctcgaaa tgcggcccgc acagcacgtg gtagttggcg ttgaccagcc 7800 agtgcgtgtc gaacacgacg atggtgtcca cgcccagctc acggcaacgg cggctgattt 7860 cgtgatgccc gtcgatggcc gcctggcgaa agccttggcg cgggcctggc agttcggaca 7920 tgtacatgga cggtacatgg gtaatcttgg cagtgagagc gagtttgccc atgggggtct 7980 ccgataagac gctgttgttg ttttggggct gacccggtcc cttgtaggag cggccttgtt 8040 ccgggatggg gcgcacagcg gccccggcga tatctgcggc gaggctgaaa tccaggggcc 8100 gctgcgcgcc ccatcgcggg cacaaggccg ctcctacacc cgggcggtgt aaaccgcaca 8160 gagggttaga tgccccagcg aggaatgtgg tgattaccca tggaaataca cacgttcttg 8220 atctctgcaa agacctcgaa gctgtactgc ccgccctcac gcccggtacc ggaacctttc 8280 acgccgccga acggctggcg caggtcgcgt acgttctggc tgttgatgaa caccatgccg 8340 gcctcgatgc cacgggccag gcgatgggct ttgccgatgt cctgggtcca gatgtacgag 8400 gccaggccat actcggtgtc gttggccagt tgcagcgcct cggcttcgtc cttgaacggg 8460 atcaggcaca ccaccgggcc aaagatttct tcctgggcaa tgcgcatctt gttgttcacg 8520 tcggcgaata cggtgggctg gatgaactgc cccttggcca ggtgcgcagg caggttggcc 8580 gggcgctcca ggcccccggc gaccaggcgt gcaccttctt cgatgccaat gcggatgtac 8640 ccggtgacct tgtcatagtg ctgctgggtg atcatcgaac cgacctgggt tttcgggtcg 8700 gtcgggtcac ctacgatcag gcgcttggcg cgcgccgcaa actctgcgac aaactgcggg 8760 tacacgcttt cctggatgaa gatgcggctg ccggcggtgc agcgctcgcc gttcagcgag 8820 aagatggtga acagcgcggc gtccagcgca cgctcaaggt ctgcgtcttc gaagatcagc 8880 acgggcgact tgccgcccag ttccatcgag tactttttaa ggcctgcggt ctgcatgatc 8940 ttcttgccgg tggcggtacc gccggtgaag gaaatggcgc gcacatcggg gtggcggacc 9000 agggcatcgc cggcggtagc gccgtaaccc tggatcacgt tcagcacccc gttggggatg 9060 ccggcttcta ccgccaggcg gcccagttcg ttggcggtca gaggcgacag ctcgctcatc 9120 ttcagcacgg cggtgttgcc cagcgccagg cacggcgcag tcttccaggt agccgtcatg 9180 aacggcacgt tccatgggct taccaggccg cacacaccca ccggctggta cagggtgtag 9240 ttgagcatct ggtcgtcgac cgggtaggta tggccgtcca tgcgcgtgca cacttcggcg 9300 aagaagtcga agttgtgcga ggcacgcggg atcagcacgt tcttggtctg gtggatcggc 9360 aggccggtgt cgagggtttc cagctcggcg agtttcggca cgttctgctc aatcagctca 9420 cccagcttgc gcatcagccg ggcacgttcc ttggccgggg tgttggccca cttggggaag 9480 gcttccttgg ccgcagccac agcctgggcc acttcctcgg cgccgccgct ggcgacttcg 9540 cagatggcgt cgccggtggc cgggttgtag ttgacgaagg tgtctttgct ctcgacctca 9600 cggccgttga tccagtgctt gatcatgctg ctcatgcctt gttgttcttg aagaagtcag 9660 cttcgctgac gatacggttg accaggcgac cgacgccttc cacttccacc accacttcgt 9720 cacccggcac cacatcggcc aggccttctg gcgtgccggt ggcgatcatg tcgcccggtt 9780 gcagggtcat gaagctggag aagtattcga tgaggtgcgg gatgtcgaag atcatgtccg 9840 cggtggtgcc ttcctgcttc agctcaccgt tgatccaggt gcgcagcttc aggttgctga 9900 cgtctggcac atcggccgca tcgacgatcc acgggccgac cggggtggtg gcatcgcggt 9960 ttttcacccg caggttgggg cggtagtagt tttccaggta gtcgcggatg gcgtagtcgt 10020 tgcacacggt gtagccggca acgtaggcca gggcgtcctc acgcttgacg ttcttcgccg 10080 ctttgccgat caccgccacc agctcgcact cgtagtgcat gtattcgacg ttgtccgggc 10140 gccaggtgac ctggatgtgg ccggtgtagg tgcctggcga cttgatgaaa gccaacggtt 10200 cggtgggcgg cgcgaaggcc agctccctgg cgtggtcggc gtagttcagg cccagggcga 10260 acatgctgcc ggtggcgggt ggcagccagg tgacctggtc ctgatggacc aggcggccgt 10320 cggcaaggcg caggtgatcg tcttcgaccg tgacatcgtg ggcctggccg tcgaactgga 10380 tacgggcgtg tttcacaggt aattcctcac tcggcgacga tgtggttggt cagcttgccc 10440 aggccgtcga tctcgatgtc gacgcggtca cctggctgta catcgacgcg gccctcgggg 10500 gttccggtga tcaggatgtc gccggcgtgc agggtcatga actcgctgat ttcggcaatc 10560 agctgcgcca ccgtgcgtac gcagttggcg gtgttgttgt gctggcgcag ttcgccgttc 10620 acatacaggc gcaggcccag ggcatcgggg ttggccactt ggctggcggg caccagttca 10680 gggccgaccg ggcaaaaacc atcacggcac ttggccttga ctgcagggcg gtagtagctg 10740 gcttcgggca ggctcacttc gttgacgatg gtgtagcccg ccacatgctc cagggcatcg 10800 gccacgctga cgcggctggc gtccttgcca atcaccactc ccagcgccgg gccgggttgc 10860 acgcgctgca cgccggccgg gaataccacc tggccttcat gctggttgcg ggtgttcggg 10920 gtcttgacga acaacaccgg cttgaccggc agttgcttgt acggtgcttc cacgaacgcc 10980 gcttggtgct gctgcagcaa accctggtag ttcagcgcga cgccgaacag ggtgccgctg 11040 gcaacgtcaa gcagggcatg gctcatgctc ttctcctggc agtgcagggc ggtggccgtc 11100 ctgcggattt cgttaatgtg ttaatgttat agttaatatg ttaacgatgg tcaaggggtg 11160 gccagtggcg cctgccggca aggcaaggca ccatgggcca tcgtcaacag ggtcaagcga 11220 tttgcgagca agcagccatg agcgaccggc atccgatacc gaacatcaac attggccagg 11280 tttacgacca gcgctacagc gacagcgagg tgcattacga ccggctgggc aacctggcgg 11340 gctttttcgg gcgcaacatg ccggtgcacc ggcatgaccg gtttttccag gtgcattacg 11400 tgaagtcggg cacagtacgg gtgtatctgg atgaccagca gtacatcgag gccgggccga 11460 tgttcttcct cacgccaccc acggtggcgc acgcgttcgt caccgaagct gacagcgacg 11520 ggcatgtgct gacggtgcgc cagcaactgg tgtggcaatt gatcgaagcc gacgccagcc 11580 tgctgccggc gggcatgcag gtgcagccag cctgtgtggc gctgggcaac ctgccggccg 11640 aatacaaggc cgaggcgcag cgcctgcaag gctggctgga cgcgttgagt gacgagtttg 11700 ccacgcagca accgggtcgc gaggcggcgt tgcagtcgct gacccgcctg atcatgatca 11760 gcctgctgcg gctgtgcccc aactcgctgg aatcgacccc ggcgcggcat gaagacctga 11820 agatcttcca ccgtttcaat gccctgatcg aagcgcatta ccttgagcat tggccgctgg 11880 cccgctacgc gcagcagatt ggcgtgaccg aggcacggct gaacgatgtg tgccggcgca 11940 tcgccgactt gccatccaag cgcctggtgc tggaacggct gatgcaggag gccaagcgtt 12000 tgctgttgtt ttccggcagc acggccaacg aaatctgtta ccagctcggc ttcaaggatc 12060 cggcctattt cagccgcttc ttcaaccgct acgccaagct cacacccggg gagtaccgcc 12120 agcggcaggc agaattgcag tgaaatggcc atggcggctc acccgggtgc tgttgttgtt 12180 tacagcggat ggtcgcagcc cgcgcgccgg gcttgaatgg gttttccgtg gaacagattg 12240 cactttccat cgtgcatgcc cttaaattcg tgaattgaga aaaagccaca ggtttgacca 12300 tgaccaagac gcaaccttcg ctcacgctaa gcctgttgca ggcccgagaa gccgcgatgg 12360 catttttcag gccgctgttg aaccagcacg acctgaccga gcagcaatgg cgggtaatcc 12420 gcatcctcaa gcagcacggc gagctggaga attatcagtt ggcggaactg gcctgcatcc 12480 tcaagccgag catgaccggg gtactggggc gcctggagcg agacgggctg gtgcggcggc 12540 agaaggccgc gcaggaccag cgacgggtgt tcgtcagcct gaccgaaaga ggggaggcgt 12600 gctttgcctc gatgaaggaa ggcatggagg ccaactacca gaagattcag gcgcagtttg 12660 gtgaagagaa gctgcagcag ctgatggggt tgttgaatga cctgaagcgc atcgcgccat 12720 aa 12722


 

Patentes similares o relacionadas:

Procedimiento para la producción de aceituna en polvo, del 17 de Agosto de 2016, de TUBITAK: Procedimiento para la obtención de la aceituna en polvo, caracterizado porque se muelen aceitunas enteras deshuesadas y secas en presencia de […]

Materiales de unión a fosfato y sus usos, del 13 de Julio de 2016, de MEDICAL RESEARCH COUNCIL: Una composicion de hierro ferrico para uso en un metodo de tratamiento de hiperfosfatemia, en donde la composicion de hierro ferrico es un […]

Alimento para bebés y lactantes que contiene aceitunas enteras, del 12 de Febrero de 2016, de TUBITAK: Alimento para bebés o lactantes que contiene aceitunas libres de sal y no químicamente tratadas que cubre las siguientes etapas de procesamiento: a. las aceitunas se ponen […]

Aparato y procedimiento de desaceitado para la fabricación de chips de patata con bajo contenido en aceite, del 1 de Julio de 2015, de Frito-Lay Trading Company GmbH (100.0%): Aparato para desaceitar rodajas de patata, comprendiendo dicho aparato un transportador longitudinal alargado provisto de un extremo aguas arriba y […]

Composiciones ingeribles que contienen un aceite odorífero, del 8 de Abril de 2015, de R.P. SCHERER TECHNOLOGIES, LLC: Una cápsula de gel blando para la administración de un aceite odorífero y que tiene un olor reducido tras la ingestión, conteniendo dicha cápsula […]

Imagen de 'Sistema mejorado de obtención de la anchoa para su consumo'Sistema mejorado de obtención de la anchoa para su consumo, del 29 de Enero de 2015, de MADRIGAL BALLESTER, Guillermo: El sistema mejorado de obtención de la anchoa para su consumo, se desarrolla en fases sucesivas hasta obtener una anchoa apta para su consumo, sin sal y baja en sal. Para […]

Método de preparación de un producto tratado con calor, del 14 de Enero de 2015, de NOVOZYMES A/S: Método para preparar un producto tratado con calor, que comprende las fases secuenciales de: a) proveer una materia prima que comprende hidrato […]

Imagen de 'PROCEDIMIENTO PARA REDUCIR EL AMARGOR Y PICOR DE UN ACEITE DE…'PROCEDIMIENTO PARA REDUCIR EL AMARGOR Y PICOR DE UN ACEITE DE OLIVA, del 29 de Septiembre de 2014, de ASOCIACIÓN EMPRESARIAL DE INVESTIGACIÓN CENTRO TECNOLÓGICO NACIONAL AGROALIMENTARIO "EXTREMADURA" (CTAEX): La invención es un procedimiento para reducir los atributos de "amargor" y "picante" en un aceite de oliva, que comprende introducir oxígeno en dicho […]

Utilizamos cookies para mejorar nuestros servicios y mostrarle publicidad relevante. Si continua navegando, consideramos que acepta su uso. Puede obtener más información aquí. .