NUEVA HIDROXIFENILACETALDEHÍDO DESHIDROGENASA, ÁCIDO NUCLEICO QUE LA CODIFICA Y VECTORES Y MICROORGANISMOS RECOMBINANTES QUE LA EXPRESAN.
Nueva hidroxifenilacetaldehído deshidrogenasa, ácido nucleico que la codifica y vectores y microorganismos recombinantes que la expresan.
La nueva hidroxifenilacetaldehído deshidrogenasa forma parte de una vía bacteriana de degradación de tiramina y/o dopamina hasta ácido pirúvico y ácido succínico desconocida hasta ahora. Actúa tras la tiramina oxidasa que transforma tiramina o dopamina en 4-hidroxifenilacetaldehído y 3,4-dihidroxifenilacetaldehído, generando a partir de ellos ácido 4-hidroxifenilacético y 3,4-hidroxifenilacético, respectivamente. Así, puede usarse en procedimientos dirigidos a disminuir la cantidad de tiramina y/o dopamina, por ejemplo en alimentos. La invención se refiere también a las moléculas de ácido nucleico que codifican la enzima, los vectores que permitan su expresión y a microorganismos recombinantes transformados con dichos vectores.
Tipo: Patente de Invención. Resumen de patente/invención. Número de Solicitud: P201130649.
Solicitante: BIOGES STARTERS S.A..
Nacionalidad solicitante: España.
Inventor/es: ARCOS RODRIGUEZ,MARIO, RODRIGUEZ OLIVERA,ELIAS, NAHARRO CARRASCO,GERMAN, LUENGO RODRIGUEZ,JOSE MARIA.
Fecha de Publicación: .
Clasificación Internacional de Patentes:
- A23B4/22 NECESIDADES CORRIENTES DE LA VIDA. › A23 ALIMENTOS O PRODUCTOS ALIMENTICIOS; SU TRATAMIENTO, NO CUBIERTO POR OTRAS CLASES. › A23B CONSERVACION, P.EJ. MEDIANTE ENLATADO, DE CARNE, PESCADO, HUEVOS, FRUTAS, VERDURAS, SEMILLAS COMESTIBLES; MADURACION QUIMICA DE FRUTAS Y VERDURAS; PRODUCTOS CONSERVADOS, MADURADOS O ENLATADOS. › A23B 4/00 Métodos generales de conservación para carne, embutidos, pescado o productos a base de pescado. › Microorganismos; Enzimas.
- A23C19/097 A23 […] › A23C PRODUCTOS LACTEOS, p. ej. LECHE, MANTEQUILLA, QUESO; SUCEDANEOS DE LA LECHE O DEL QUESO; SU FABRICACION (obtención de composiciones a base de proteínas para la alimentación A23J 1/00; preparación de péptidos, p. ej. de proteinas, en general C07K 1/00). › A23C 19/00 Queso; Preparados a base de queso; Fabricación de estos productos (sucedáneos del queso A23C 20/00; caseína A23J 1/20). › Conservación.
- A23L1/015
- C12N9/02 QUIMICA; METALURGIA. › C12 BIOQUIMICA; CERVEZA; BEBIDAS ALCOHOLICAS; VINO; VINAGRE; MICROBIOLOGIA; ENZIMOLOGIA; TECNICAS DE MUTACION O DE GENETICA. › C12N MICROORGANISMOS O ENZIMAS; COMPOSICIONES QUE LOS CONTIENEN; PROPAGACION, CULTIVO O CONSERVACION DE MICROORGANISMOS; TECNICAS DE MUTACION O DE INGENIERIA GENETICA; MEDIOS DE CULTIVO (medios para ensayos microbiológicos C12Q 1/00). › C12N 9/00 Enzimas, p. ej. ligasas (6.); Proenzimas; Composiciones que las contienen (preparaciones para la limpieza de los dientes que contienen enzimas A61K 8/66, A61Q 11/00; preparaciones de uso médico que contienen enzimas A61K 38/43; composiciones detergentes que contienen enzimas C11D ); Procesos para preparar, activar, inhibir, separar o purificar enzimas. › Oxidorreductasas (1.), p. ej. luciferasa.
PDF original: ES-2387150_A1.pdf
Fragmento de la descripción:
Nueva hidroxifenilacetaldehído deshidrogenasa, ácido nucleico que la codifica y vectores y microorganismos recombinantes que la expresan
CAMPO TÉCNICO DE LA INVENCIÓN.
El campo técnico de la invención pertenece a la Biotecnología. La invención se refiere a una nueva hidroxifenilacetaldehído deshidrogenasa. Forma parte de una via de degradación bacteriana de degradación de tiramina y dopamina, desconocida hasta ahora, y lleva a cabo la transformación de los compuestos generados por dicha vía a partir de tiramina y dopamina en los ácidos 4-hidroxifenilacético y 3, 4-dihidroxifenilacético respectivamente, compuestos que, al ser degradados por las enzimas de otro cluster complementario, son finalmente degradados en ácido pirúvico y ácido succínico. Por tanto, la enzima puede usarse en procedimientos para disminuir el contenido de tiramina y/o dopamina en muestras que las contengan, preferentemente alimentos y bebidas.
ESTADO DE LA TÉCNICA
Aminas biogénicas. Aspectos generales
Las aminas son compuestos químicos derivados del amoníaco que resultan de la sustitución de los hidrógenos de esa molécula por radicales alquilo. Según se sustituyan uno, dos o tres hidrógenos, las aminas serán primarias, secundarias
o terciarias. Cuando son originadas como consecuencia de la actividad de organismos vivos y poseen actividad biológica (cumplen importantes funciones en las células) reciben el nombre de aminas biogénicas o biogénicas. En función del número de grupos amino presentes en la molécula podemos diferenciar, monoaminas, diaminas y poliaminas. Las monoaminas alifáticas están muy extendidas en la naturaleza donde también es abundante la diamina putrescina, mientras que las poliamidas espermidina y espermina son producidas por animales, por plantas y por la mayoría de las bacterias (1) .
Las aminas aromáticas, originadas por descarboxilación de aminoácidos, son las aminas más comunes en los alimentos (histamina, 2-feniletilamina, tiramina, etc.) y también tienen gran importancia como transmisores dentro del sistema nervioso central (dopamina, noradrenalina, epinefrina, serotonina, etc.) .
Podemos hacer una distinción entre aminas biogénicas endógenas, que son aquellas que son sintetizadas en diferentes tejidos de los organismos superiores (como por ejemplo la adrenalina producida en la médula adrenal o la histamina en los mastocitos) y aminas biogénicas exógenas, que son las ingeridas en la dieta. Estas aminas biogénicas exógenas pueden estar presentes en los alimentos de origen vegetal (frutas y hortalizas) , o bien pueden aparecer en los alimentos como consecuencia de la actividad microbiana durante el procesado (cura de carnes y quesos) o durante el almacenaje de los mismos. Debido a que pueden provocar efectos nocivos tanto en el hombre como en los animales, son consideradas sustancias tóxicas.
Las aminas biogénicas más importantes que pueden encontrarse en los alimentos son la histamina, la putrescina, la cadaverina, la tiramina, la triptamina, la feniletilamina, la espermina y la espermidina; y los alimentos que las contienen pueden ser muy variados (pescado, carne, huevos, quesos, bebidas fermentadas, etc.) (2 ) .
Afortunadamente, los organismos cuentan con diferentes sistemas naturales de destoxificación (monoaminooxidasa -MAO- o la diaminooxidasa –DAO-) que les permiten eliminar las aminas biogénicas, evitando los efectos perjudiciales causados por estos compuestos. Sin embargo, puede haber casos en que estos sistemas no funcionan correctamente,
o se encuentran inhibidos por la acción de determinados fármacos, por lo que la presencia de aminas biogénicas en los alimentos puede suponer un grave problema para la salud.
Por todas estas razones es muy interesante seleccionar microorganismos que al ser utilizados en los procesos de elaboración de alimentos (curados, fermentaciones, etc.) , no acumulen aminas biogénicas, o que lo hagan en concentraciones que no sean peligrosas para la salud. La Ingeniería Genética y la Ingeniería Metabólica podrían contribuir a obtener este tipo de cepas asegurando, además, que se conserven otra serie de propiedades y características que son necesarias para mantener los estándares de identidad y calidad de los alimentos.
Las aminas biogénicas como neurotransmisores
Desde hace décadas se tiene constancia de que la transmisión catecolaminérgica está mediada por aminas biogénicas entre las que se incluyen las catecolaminas (dopamina, noradrenalina y adrenalina) derivadas del aminoácido tirosina; la indolamina serotonina, sintetizada a partir del triptófano; y la histamina, producida a partir del aminoácido histidina.
Catecolaminas
Bajo el término catecolaminas se engloban todas aquellas aminas biogénicas derivadas de la tirosina que contienen un grupo catecol y un grupo amino en su molécula. El primer paso en la síntesis de catecolaminas está catalizado por la enzima tirosinahidroxilasa mediante una reacción que requiere oxigeno como substrato y tetrahidrobiopterina como cofactor, y permite obtener como producto final dihidroxifenilalanina (DOPA) (Figura 1) . Por lo tanto, la tasa de tirosinahidroxilasa va a ser el factor limitante para la síntesis de las tres aminas neurotransmisoras catecolaminérgicas (dopamina, noradrenalina y adrenalina) .
La dopamina se produce por la descarboxilación de L-DOPA. Esta reacción se lleva a cabo por la enzima DOPA descarboxilasa. El área del cerebro donde se encuentra en mayor abundancia es en el corpus estriatum, jugando un papel esencial en la coordinación de los movimientos corporales (3) . En pacientes que padecen la enfermedad de Parkinson, por ejemplo, se ha observado degeneración de las neuronas dopaminérgicas, lo que va a dar lugar a la característica disfunción motora asociada a esta enfermedad (4) .
La noradrenalina, también llamada norepinefrina, requiere para su síntesis, a partir de dopamina, la acción de la dopamina-β-hidroxilasa. Esta catecolamina se produce mayoritariamente en las neuronas de los ganglios simpáticos y su acción está relacionada con el sueño, la vigilia, la atención y la conducta.
La adrenalina, también llamada epinefrina, está presente en el cerebro en niveles más bajos que las otras dos catecolaminas. La enzima que sintetiza la adrenalina, la feniletanolamina-N-metiltransferasa, se localiza solo en las neuronas secretoras de esta catecolamina.
Las enzimas más importantes en el catabolismo de catecolaminas son la monoaminooxidasa (MAO) y la catecol Ometiltransferasa (COMT) (5) . Estas enzimas se encuentran respectivamente en las mitocondrias y en el citoplasma tanto de las células neuronales como de las gliales. Los inhibidores de estas enzimas se utilizan en clínica como antidepresivos (6) .
Histamina
Esta amina biogénica neurotransmisora se produce por descarboxilación de la histidina debido a la acción de la histidinadescarboxilasa (Figura 2ª) . En su metabolismo intervienen tanto la histidinametiltransferasa como la MAO. La mayor concentración de este neurotransmisor se encuentra en las neuronas del hipotálamo y su acción está relacionada con los procesos de alerta y atención. La histamina también es liberada por los macrófagos en respuesta a reacciones alérgicas o a daños en los tejidos.
Serotonina
Esta indolamina, también llamada 5-hidroxitriptamina, se sintetiza en las neuronas a partir del triptófano ingerido con los alimentos tras ser hidroxilado a 5-hidroxitriptófano mediante una reacción catalizada por la enzima triptófano-5hidroxilasa. Posteriormente, el 5-hidroxitriptófano se descarboxila por medio de la acción de una 5-hidroxitriptofano descarboxilasa para dar lugar a la serotonina (Figura 2B) . La principal enzima encargada de su degradación es la MAO, al igual que sucede en las demás aminas biogénicas. La serotonina está implicada en la regulación del sueño y de la vigilia.
Además de las monoaminas neurotransmisoras, existen otras aminas biogénicas que poseen una estructura molecular parecida y que actúan como neuromoduladores o “falsos neurotransmisores”. Estas aminas endógenas, también denominadas aminas “traza” o microaminas, se encuentran en pequeñas cantidades en el sistema nervioso central y su
estudio está adquiriendo una importante relevancia en los últimos años.
Aminas... [Seguir leyendo]
Reivindicaciones:
1. Una molécula de ácido nucleico aislada que comprende una secuencia que codifica una proteína capaz de actuar como 4-hidroxifenilacetaldehído deshidrogenasa, seleccionada del grupo que consiste en:
a) una secuencia de ácido nucleico que es idéntica; al menos en un 60%, a la secuencia representada por SEQ ID NO:7;
b) una secuencia de ácido nucleico que comprende una secuencia que hibrida en condiciones estrictas con las secuencias de a) ;
c) una secuencia de ácido nucleico que comprende una secuencia que codifica un polipéptido cuya secuencia de aminoácidos es idéntica, al menos en un 60%, a la secuencia representada por SEQ ID NO:8;
d) una secuencia de ácido nucleico que comprende una secuencia que codifica una variante alélica natural de un polipéptido que comprende la secuencia de aminoácidos representada por SEQ ID NO:8, donde la molécula de ácido nucleico hibrida, bajo condiciones estrictas, con una secuencia de DNA que comprende la secuencia mencionada en a) ,
o una secuencia complementaria a la misma.
2. Molécula de ácido nucleico aislada según la reivindicación 1, seleccionada del grupo que consiste en:
a) una molécula de ácido nucleico que comprende una secuencia que es idéntica a la secuencia representada por SEQ ID NO:7;
b) una molécula de ácido nucleico que comprende una secuencia que codifica un polipéptido cuya secuencia de aminoácidos es idéntica a la secuencia representada por SEQ ID NO:8.
3. Un polipéptido purificado con actividad de hidroxifenilacetaldehído deshidrogenasa cuya secuencia de aminoácidos es idéntica al menos en un 60% a la secuencia polipeptídica representada por SEQ ID NO:8.
4. Polipéptido purificado con actividad hidroxifenilacetaldehído deshidrogenasa según la reivindicación 3, cuya secuencia es idéntica a la secuencia polipeptídica representada por SEQ ID NO:8.
5. Un vector de expresión que comprende una molécula de ácido nucleico según una cualquiera de las reivindicaciones 1 ó 2.
6. Vector de expresión según la reivindicación 5, que es un plásmido.
7. Vector de expresión según la reivindicación 6, en el que el que la molécula de ácido nucleico de una cualquiera de las reivindicaciones 1 ó 2 está insertada en el plásmido pK18::mob.
8. Un organismo hospedador transformado con un vector de expresión según una cualquiera de las reivindicaciones 5 a
7.
9. Organismo hospedador según la reivindicación 8, que es una bacteria.
10. Organismo hospedador según la reivindicación 9, que es una bacteria capaz de transformar un azúcar en ácido láctico.
11. Organismo hospedador según una cualquiera de las reivindicaciones 8 a 10, en el que el vector de expresión de una cualquiera de las reivindicaciones 5 a 7 está insertado en el genoma del hospedador.
12. Organismo hospedador según una cualquiera de las reivindicaciones 8 a 10, en el que el vector de expresión de una cualquiera de las reivindicaciones 5 a 7 permanece como forma replicativa autónoma.
13. Una composición que comprende el polipéptido o complejo proteico de la reivindicación 3 ó 4.
37
Figura 1 Figura 2
39
Figura 3
Figura 4 Figura 5
Figura 6 Figura 7
Figura 8
Figura 9
1 TCAGGCGAAACGCTCGAAGCGGTACGGTGACGGGTCGATCAGCGGGGTGGCCTGGGCCACCAGGTCTGCCGCCAG AGTCCGCTTTGCGAGCTTCGCCATGCCACTGCCCAGCTAGTCGCCCCACCGGACCCGGTGGTCCAGACGGCGGTC -2 & A F R E F R Y P S P D I L P T A Q A V L D A A L
76 CTGGCCAGCAGCAGGCGAGGTGCCGAAGCCATGCCCGGAAAAGCCGGTGGCCAGGGTCAGGCCCGGAATACTGGC GACCGGTCGTCGTCCGCTCCACGGCTTCGGTACGGGCCTTTTCGGCCACCGGTCCCAGTCCGGGCCTTATGACCG -2 Q G A A P S T G F G H G S F G T A L T L G P I S A
151 CACCGGGCCGATGACCGGGTTGGAGTCGGGGGTGACGTCAATCGTGCCGGCCCAGGCGCTGGCGATACGGGCCTG GTGGCCCGGCTACTGGCCCAACCTCAGCCCCCACTGCAGTTAGCACGGCCGGGTCCGCGACCGCTATGCCCGGAC -2 V P G I V P N S D P T V D I T G A W A S A I R A Q
226 TTCGAACACCGGCCAGGCCGCTTTCAGGTTGCGCATGGCCTCGTCGTTGAGGGCCGGGTTGGCGTGCGGGTCTTG AAGCTTGTGGCCGGTCCGGCGAAAGTCCAACGCGTACCGGAGCAGCAACTCCCGGCCCAACCGCACGCCCAGAAC -2 E F V P W A A K L N R M A E D N L A P N A H P D Q
301 TACCCGTACACGCTCGAAGGGGGTTACATCCGTTGCCTTCCAGCGCCGGGCCAGGGCCAGGTCCTTGAAGAAGTA ATGGGCATGTGCGAGCTTCCCCCAATGTAGGCAACGGAAGGTCGCGGCCCGGTCCCGGTCCAGGAACTTCTTCAT -2 V R V R E F P T V D T A K W R R A L A L D K F F Y
376 CTTGCCAAAGCTGATGCGCAAAAAGTCCCGCTGGGCACGCAGCTGGGGCAGGTAACGCTTGCCCAGCAGCAGGTG GAACGGTTTCGACTACGCGTTTTTCAGGGCGACCCGTGCGTCGACCCCGTCCATTGCGAACGGGTCGTCGTCCAC -2 K G F S I R L F D R Q A R L Q P L Y R K G L L L H
451 ATCGAGGGTGAGGAAGGCGTCCAGCGCGCCGCGCTGGGTGATGATGTAGCCGCCGTCCTTGTGCTTGCGGAAGGA TAGCTCCCACTCCTTCCGCAGGTCGCGCGGCGCGACCCACTACTACATCGGCGGCAGGAACACGAACGCCTTCCT -2 D L T L F A D L A G R Q T I I Y G G D K H K R F S
526 AAAATCTGGTGCGCCCACGGCGATGTCGGTTGGCCCGTCCATGGGCTCTGTGCGCAGCACGGAACAGGTCAGCGG TTTTAGACCACGCGGGTGCCGCTACAGCCAACCGGGCAGGTACCCGAGACACGCGTCGTGCCTTGTCCAGTCGCC -2 F D P A G V A I D T P G D M P E T R L V S C T L P
601 CAAGGTCGGCAGGTTGATGCCCAGGTTGCCGAGGAACTTGCGCGACCACAGGCCACCGGCCAGCAACACCTGGTC GTTCCAGCCGTCCAACTACGGGTCCAACGGCTCCTTGAACGCGCTGGTGTCCGGTGGCCGGTCGTTGTGGACCAG -2 L T P L N I G L N G L F K R S W L G G A L L V Q D
676 GCAGCGGATTTCACCTTGCTCGGTGACCACCCCGCTGACACGGCCGGCTGCGGTGACCAGCGTGCGCACCGCGCA CGTCGCCTAAAGTGGAACGAGCCACTGGTGGGGCGACTGTGCCGGCCGACGCCACTGGTCGCACGCGTGGCGCGT -2 C R I E G Q E T V V G S V R G A A T V L T R V A C
751 GTTCTCCACTACCACTGCACCTTTGGCGATCGCCGCCCGGGCGATGGCGCTGGCGGCCAGGGTCGGTTCGGCGCG CAAGAGGTGATGGTGACGTGGAAACCGCTAGCGGCGGGCCCGCTACCGCGACCGCCGGTCCCAGCCAAGCCGCGC -2 N E V V V A G K A I A A R A I A S A A L T P E A R
826 GGCGTCGGAGGGGGTGAAGATGCCACCTGCCCAATCCGCCCGACCACCCGGCACCATCCGGGTGATTTCCCGCGT CCGCAGCCTCCCCCACTTCTACGGTGGACGGGTTAGGCGGGCTGGTGGGCCGTGGTAGGCCCACTAAAGGGCGCA -2 A D S P T F I G G A W D A R G G P V M R T I E R T
901 GCTCAGCAGGCGCGAATCCAGGCCCAGCGCCTCGACGCTTTTCAGCCAGCCTTCATGCATGCCCATCTGCGTGTC CGAGTCGTCCGCGCTTAGGTCCGGGTCGCGGAGCTGCGAAAAGTCGGTCGGAAGTACGTACGGGTAGACGCACAG -2 S L L R S D L G L A E V S K L W G E H M G M Q T D
976 GTTACGGCCGATGAACATGATGCCGGCTTGCCGATAGCCAACGTCGCTGCCAACCCGTGCGGGCATCTCGGCCCA CAATGCCGGCTACTTGTACTACGGCCGAACGGCTATCGGTTGCAGCGACGGTTGGGCACGCCCGTAGAGCCGGGT -2 N R G I F M I G A Q R Y G V D S G V R A P M E A W
1051 CAGCCGATCAGCCGCCAGTGCCAGGGGAATGTCATGGGCGTGGCGGTTGGTCTTGCGCACCCAGCCCAGGTTGCG GTCGGCTAGTCGGCGGTCACGGTCCCCTTACAGTACCCGCACCGCCAACCAGAACGCGTGGGTCGGGTCCAACGC -2 L R D A A L A L P I D H A H R N T K R V W G L N R
1126 CGACGACTGCTCCCCAGCGATGCGCCCCTTCTCCAGCACCACCACCGGTATGTTGCGTTCGGCGAGGCTCAGTGC GCTGCTGACGAGGGGTCGCTACGCGGGGAAGAGGTCGTGGTGGTGGCCATACAACGCAAGCCGCTCCGAGTCACG -2 S S Q E G A I R G K E L V V V P I N R E A L S L A
1201 GGCGGTGAGGCCGATAATGCCGCCACCGATGATCACCACGGTAGTGGCGTCGGGGTGGCGGGTGCTGGTTTGCAC CCGCCACTCCGGCTATTACGGCGGTGGCTACTAGTGGTGCCATCACCGCAGCCCCACCGCCCACGACCAAACGTG -2 A T L G I I G G G I I V V T T A D P H R T S T Q V
1276 AGGGGCGATCGTGGGAGACATGGCTTTACTCTTTGTTGTGCGTGCAGGGGGAGTGTTCAGCGCCAGCCAGCAGCC TCCCCGCTAGCACCCTCTGTACCGAAATGAGAAACAACACGCACGTCCCCCTCACAAGTCGCGGTCGGTCGTCGG -2 P A I T P S M
tynA
1351 TCACTGGCCAAGGCGGATCAGGGTCACTTGCGCTTGCCCCGCACCGCGGTAGGCGGTGACCTCCAGCTCGACCTT AGTGACCGGTTCCGCCTAGTCCCAGTGAACGCGAACGGGGCGTGGCGCCATCCGCCACTGGAGGTCGAGCTGGAA
1426 GTAAACGGTGGAGCCCAGCGGCGGGCAGGTGACCGTGGTGGCCGGGTCGATGCCGCGGAACTTCTCGCCGATCAC CATTTGCCACCTCGGGTCGCCGCCCGTCCACTGGCACCACCGGCCCAGCTACGGCGCCTTGAAGAGCGGCTAGTG
1501 GTCCATGACCCGTGGTACATCGGCAGGGTCCTGGATGAACACGCGCGAGTTGATGACATCGGCCAGGCTGGCATC CAGGTACTGGGCACCATGTAGCCGTCCCAGGACCTACTTGTGCGCGCTCAACTACTGTAGCCGGTCCGACCGTAG
1576 GACTGCGGCCAGCGCGGTTTCGATGTTGGCGAACACCTGGTGGGTCTGTTCGATGACGTCCTCTGGAATGACCTG CTGACGCCGGTCGCGCCAAAGCTACAACCGCTTGTGGACCACCCAGACAAGCTACTGCAGGAGACCTTACTGGAC
1651 GGTCTGCGGGTTGCGTCCGGCGGTGTTGGAGACGTGAATCCAGTTGTCCACCGCCACCAGGCGGGAGTAGCTGGC CCAGACGCCCAACGCAGGCCGCCACAACCTCTGCACTTAGGTCAACAGGTGGCGGTGGTCCGCCCTCATCGACCG
1726 CATGGCTTCGAACTTGGAGCCGGTTTTCAGTTTGATGATCTGTGTCATGGGCTTTGCCTTGTTATCCGGTTGCGG GTACCGAAGCTTGAACCTCGGCCAAAAGTCAAACTACTAGACACAGTACCCGAAACGGAACAATAGGCCAACGCC
1801 GGATCAGCTGAGAACGGGGGTTTCCCAGAGGTTGAGCTTTACGCCGATGCCTTGCTCGAGCGCCTTGCGGTACAC CCTAGTCGACTCTTGCCCCCAAAGGGTCTCCAACTCGAAATGCGGCTACGGAACGAGCTCGCGGAACGCCATGTG -2 & S L V P T E W L N L K V G I G Q E L A K R Y V
1876 CACGGTGCCCCAGGCCACGTCTTCGACGGGCATGCCGCCCACCGACATCAGGATGATTTCGTCGTCATGCAGGCG GTGCCACGGGGTCCGGTGCAGAAGCTGCCCGTACGGCGGGTGGCTGTAGTCCTACTAAAGCAGCAGTACGTCCGC -2 V T G W A V D E V P M G G V S M L I I E D D H L R
1951 GCCCGGTGCGTCGCCGCTGATGATCTTGCCGATGTCTTCCACCTGCTCGGCGGCCAGCGTGCCTTCGGCAATCAT CGGGCCACGCAGCGGCGACTACTAGAACGGCTACAGAAGGTGGACGAGCCGCCGGTCGCACGGAAGCCGTTAGTA -2 G P A D G S I I K G I D E V Q E A A L T G E A I M
2026 GTCCATGAAGCGCACACCTACCAGCGGTACGTGGTTGTGCGCAGGCTTGGGCAGCTCTTCGAACCAGGCCTCGTA CAGGTACTTCGCGTGTGGATGGTCGCCATGCACCAACACGCGTCCGAACCCGTCGAGAAGCTTGGTCCGGAGCAT -2 D M F R V G V L P V H N H A P K P L E E F W A E Y 2101 GAGGCCGGTGTTGTCCACCACCTTGCGCACGTCGTCCTGCTCCATGCCGGCGTCGATACTGCACGGGGCTGGCAT CTCCGGCCACAACAGGTGGTGGAACGCGTGCAGCAGGACGAGGTACGGCCGCAGCTATGACGTGCCCCGACCGTA -2 L G T N D V V K R V D D Q E M G A D I S C P A P M
2176 GGCCAGGAACGCGCCAGGCTTGACCCACTCGCGGCGCACCAGCGGGTACTGGCTGGGGTCGCCGACTTCGCCCGA CCGGTCCTTGCGCGGTCCGAACTGGGTGAGCGCCGCGTGGTCGCCCATGACCGACCCCAGCGGCTGAAGCGGGCT -2 A L F A G P K V W E R R V L P Y Q S P D G V E G S
2251 GCTGCAGTAGCTGACCAGGTCGGAACCGCGTACCACTTCTTCCAGGGTTTCCACCACCTGGACATGAGTGATTTG CGACGTCATCGACTGGTCCAGCCTTGGCGCATGGTGAAGAAGGTCCCAAAGGTGGTGGACCTGTACTCACTAAAC -2 S C Y S V L D S G R V V E E L T E V V Q V H T I Q
2326 CGGGAAGCTGGTTTTCACCCAGGCGACGAAGGCATCCAGGTTCTTCTGGCCACGGCCCTTGACCTTGAGGGTGTC GCCCTTCGACCAAAAGTGGGTCCGCTGCTTCCGTAGGTCCAAGAAGACCGGTGCCGGGAACTGGAACTCCCACAG -2 P F S T K V W A V F A D L N K Q G R G K V K L T D
2401 GATCAGCGGGCAGACGGCCATGAACGCAGCGACCGTGGTCTTGCCCATCACCCCCGGGCCGGCCAGGCCGATCAC CTAGTCGCCCGTCTGCCGGTACTTGCGTCGCTGGCACCAGAACGGGTAGTGGGGGCCCGGCCGGTCCGGCTAGTG -2 I L P C V A M F A A V T T K G M V G P G A L G I V
2476 CTTGGCGTCCTTGCGCGCCAGGTGGCGGGCGCCGACGCCCGGGATGGCGCCGGTGCGGTAGGCCGACAGCAGGTT GAACCGCAGGAACGCGCGGTCCACCGCCCGCGGCTGCGGGCCCTACCGCGGCCACGCCATCCGGCTGTCGTCCAA -2 K A D K R A L H R A G V G P I A G T R Y A S L L N
2551 GGCCGACATGTGTGCCAGTGGCGCGCCGGTGTCGGCATCGTTGAGGGTGAACATCAGGATCGAGCGGGGCAGGCC CCGGCTGTACACACGGTCACCGCGCGGCCACAGCCGTAGCAACTCCCACTTGTAGTCCTAGCTCGCCCCGTCCGG -2 A S M H A L P A G T D A D N L T F M L I S R P L G
2626 TTTCTCACGGTTGGCGATGTTCGAGCCGTACCACTTGGCGCCTGCGGTCTGGAAGTTGCCGCCGAGGTACGCCGG AAAGAGTGCCAACCGCTACAAGCTCGGCATGGTGAACCGCGGACGCCAGACCTTCAACGGCGGCTCCATGCGGCC -2 K E R N A I N S G Y W K A G A T Q F N G G L Y A P
2701 CATCGCCATCATGCGCCGGTCGGCGGTGGGCTTGGGCATGTTGGGGAATGGCGAGTGCTCGGGGAAGGTAATCAT GTAGCGGTAGTACGCGGCCAGCCGCCACCCGAACCCGTACAACCCCTTACCGCTCACGAGCCCCTTCCATTAGTA -2 M A M M R R D A T P K P M N P F P S H E P F T I M
2776 CGCGCCGTGCGAGTCGCTGTTCGGGCCGGCCATGCGGTAGTCACCCTGGTACAGCAGGCCGAACATTTCTTCCAT GCGCGGCACGCTCAGCGACAAGCCCGGCCGGTACGCCATCAGTGGGACCATGTCGTCCGGCTTGTAAAGAAGGTA -2 A G H S D S N P G A M R Y D G Q Y L L G F M E E M
2851 GGTGTCGACACAGGCCGGCATGTCGGTGACGCCGGCACGGATCATGTCCTGCTCGGACAGGTAGATGAAGTCAAT CCACAGCTGTGTCCGGCCGTACAGCCACTGCGGCCGTGCCTAGTACAGGACGAGCCTGTCCATCTACTTCAGTTA -2 T D V C A P M D T V G A R I M D Q E S L Y I F D I
2926 TCTGGTATCGAGGGTCATGGCGGGTCTCGCAGGGCTGGCTGCCGTCGGATTTGTTGTTGGTTTCGAGGCAACCAG AGACCATAGCTCCCAGTACCGCCCAGAGCGTCCCGACCGACGGCAGCCTAAACAACAACCAAAGCTCCGTTGGTC -2 R T D L T M
tynB
3001 TTTCGCTAACGACTGGTAGGTCGTCTTGTGTCTGCCTGCCAGCCGAGTTGACCGTCAGTGCCAGGGCTTCAATGG AAAGCGATTGCTGACCATCCAGCAGAACACAGACGGACGGTCGGCTCAACTGGCAGTCACGGTCCCGAAGTTACC -3 & H
3076 CCCGCGAGCGAGAAGCTGGCCGGGGTGTGGCGCAGGCTGAGGGCGGTCAGCAGGCACACCACCAGGGTGCACAGG GGGCGCTCGCTCTTCGACCGGCCCCACACCGCGTCCGACTCCCGCCAGTCGTCCGTGTGGTGGTCCCACGTGTCC -3 G A L S F S A P T H R L S L A T L L C V V L T C L
3151 GCCAGCAGCGCGGCCCATGCGGTCGGGCCGTGGTTGAGTACCACTGCGGCCAGCGGGGCGGCGCCGGCAGACGCC CGGTCGTCGCGCCGGGTACGCCAGCCCGGCACCAACTCATGGTGACGCCGGTCGCCCCGCCGCGGCCGTCTGCGG -3 A L L A A W A T P G H N L V V A A L P A A G A S A
3226 GACAGCTGGATGGCGCCCAGCAGCGCTGCGGTGGAACCCAGTGCCTTTTCTTGCGAGGCCATCACCAGCGACATC CTGTCGACCTACCGCGGGTCGTCGCGACGCCACCTTGGGTCACGGAAAAGAACGCTCCGGTAGTGGTCGCTGTAG -3 S L Q I A G L L A A T S G L A K E Q S A M V L S M
3301 AGCGTCGACTCGGCTATCCCCAGGCCGAACAGGGCTATCACCATGCCGCCGGCCACACCTGGCAGCCCCAGGCCG TCGCAGCTGAGCCGATAGGGGTCCGGCTTGTCCCGATAGTGGTACGGCGGCCGGTGTGGACCGTCGGGGTCCGGC -3 L T S E A I G L G F L A I V M G G A V G P L G L G
3376 GTCAGTGCACCGAGCAGGCTGATGCAGGCACCGCCGGCCATGCACAGCACGCCCACCCGAGTCAAGGTATTGAGG CAGTCACGTGGCTCGTCCGACTACGTCCGTGGCGGCCGGTACGTGTCGTGCGGGTGGGCTCAGTTCCATAACTCC -3 T L A G L L S I C A G G A M C L V G V R T L T N L
3451 CCCAGCCGGCTGATCAGGTGGCTGGCCGTCATGGCGCCGAGCAGGATCGACACCCCGGTGGCGCCAAACAGCAGG GGGTCGGCCGACTAGTCCACCGACCGGCAGTACCGCGGCTCGTCCTAGCTGTGGGGCCACCGCGGTTTGTCGTCC -3 G L R S I L H S A T M A G L L I S V G T A G F L L
3526 CCGAAGGCCTGGGCGCTCAGGCCGTAGTGGGCCTGGTACACCAGGGTGGCACCGCCGATGTAGGCGAACAGGAAG GGCTTCCGGACCCGCGAGTCCGGCATCACCCGGACCATGTGGTCCCACCGTGGCGGCTACATCCGCTTGTCCTTC -3 G F A Q A S L G Y H A Q Y V L T A G G I Y A F L F
3601 AAGAATACCGCAGCAACCGCCAGGGTCGGGCGCAGGAAGCGGCGGTCGGCGAGGATGGCCAGGTAGGTGCTGCAG TTCTTATGGCGTCGTTGGCGGTCCCAGCCCGCGTCCTTCGCCGCCAGCCGCTCCTACCGGTCCATCCACGACGTC -3 F F V A A V A L T P R L F R R D A L I A L Y T S C
3676 GCGTGGCCCAGGCGCAGGGGTTCGCGTTTGCTGGGCGGCAGGGTTTCGGGCAGGTTCAGCAGGCTGTTGACCAGC CGCACCGGGTCCGCGTCCCCAAGCGCAAACGACCCGCCGTCCCAAAGCCCGTCCAAGTCGTCCGACAACTGGTCG -3 A H G L R L P E R K S P P L T E P L N L L S N V L
3751 ACCGTCACGCCCATGCCGGCGAGTACCAGCATTACTGCACGCCAGCCGAAATGTGCGTCGATCACGCCGCCCAGG TGGCAGTGCGGGTACGGCCGCTCATGGTCGTAATGACGTGCGGTCGGCTTTACACGCAGCTAGTGCGGCGGGTCC -3 V T V G M G A L V L M V A R W G F H A D I V G G L
3826 GCAGGTGCCAGGATCGGTGCGACGCCTTCGATGGTCATCAGCAGGGCGAACAGTTTGGTCGCGGCCACGCCCTGG CGTCCACGGTCCTAGCCACGCTGCGGAAGCTACCAGTAGTCGTCCCGCTTGTCAAACCAGCGCCGGTGCGGGACC -3 A P A L I P A V G E I T M L L A F L K T A A V G Q
3901 CTCACATCACGCACCATGCTCATGATCACCACCAGGGTCAGCGCACTGCCCAGGCCCTGGAAAAAGCGCAGCATG GAGTGTAGTGCGTGGTACGAGTACTAGTGGTGGTCCCAGTCGCGTGACGGGTCCGGGACCTTTTTCGCGTCGTAC -3 S V D R V M S M I V V L T L A S G L G Q F F R L M
3976 ATCAGGGTGTCGAGGCTGGGGGCTGCGGCTGCGCCCAGCGAGCACAGGATGAACAGCAGCAGGCCGGCCAGCAGC TAGTCCCACAGCTCCGACCCCCGACGCCGACGCGGGTCGCTCGTGTCCTACTTGTCGTCGTCCGGCCGGTCGTCG -3 I L T D L S P A A A A G L S C L I F L L L G A L L
4051 GGCTTGCGCCGGCCATAAGCGTCGACGATGGGGCCGAAGATCAGCTGGCCGGCGCCCATGGCCAGCAGGAAGAAG CCGAACGCGGCCGGTATTCGCAGCTGCTACCCCGGCTTCTAGTCGACCGGCCGCGGGTACCGGTCGTCCTTCTTC -3 P K R R G Y A D V I P G F I L Q G A G M A L L F F
4126 GTCAGTGTCAGCTGTACGCGGGTGAAGCTAGCCTGATAGTGGCTGGCGATTTCCGGCAGGCTCGACAGGTACATG CAGTCACAGTCGACATGCGCCCACTTCGATCGGACTATCACCGACCGCTAAAGGCCGTCCGAGCTGTCCATGTAC -3 T L T L Q V R T F S A Q Y H S A I E P L S S L Y M
4201 TCGACGGCGGAAGGGCCGAGGGCGCCGATCAGGCCTAGGCCCAGGGCGAAGCTGAAGGGTATGGGAGGGGAGGGA AGCTGCCGCCTTCCCGGCTCCCGCGGCTAGTCCGGATCCGGGTCCCGCTTCGACTTCCCATACCCTCCCCTCCCT -3 D V A S P G L A G I L G L G L A F S F P I P P S P
4276 TTGGCTTGCATGGTTTTCTCTGGCTGATTTTTCGCCTACCGACCGGTAGGTTTGCGAATATTATTCGCCGAGTCG AACCGAACGTACCAAAAGAGACCGACTAAAAAGCGGATGGCTGGCCATCCAAACGCTTATAATAAGCGGCTCAGC -3 N A Q M
tynF
4351 GCCAAGGTCAAACCCTTCCGCAAGGCCACTGATTCCTGTGGGGAGCGGGCATGCCCGCGAACACCGGCAAAGCCG CGGTTCCAGTTTGGGAAGGCGTTCCGGTGACTAAGGACACCCCTCGCCCGTACGGGCGCTTGTGGCCGTTTCGGC
4426 GTGCCACCGAGTCGCCTTCTTCGCGGGCATGCCCGCTCCCACATTGACCGCAGAGGTTGGTTACCGTGGTTGCGT CACGGTGGCTCAGCGGAAGAAGCGCCCGTACGGGCGAGGGTGTAACTGGCGTCTCCAACCAATGGCACCAACGCA
4501 CAGAACGGCACAGCCACGGTCAGCTGGCTATACACATTGGTACCATTCCCGCCCACCTGGTTGCCGCCGTTGCTC GTCTTGCCGTGTCGGTGCCAGTCGACCGATATGTGTAACCATGGTAAGGGCGGGTGGACCAACGGCGGCAACGAG -3 & F P V A V T L Q S Y V N T G N G G V Q N G G N S
4576 TCGTCCTTGCGCGGCTGGTAAAGGCCCACCAGCGGGCTGATTATCAGGTGCTCGTTGACTGCCCATTCCACATAC AGCAGGAACGCGCCGACCATTTCCGGGTGGTCGCCCGACTAATAGTCCACGAGCAACTGACGGGTAAGGTGTATG -3 E D K R P Q Y L G V L P S I I L H E N V A W E V Y
4651 AGGTCCAGCTCCCGCGCATCGAGGTTGAGGCTTTCGCGGGTGCGTACGGTGTCGAAGTCGAAGTACAGCGCCCCG TCCAGGTCGAGGGCGCGTAGCTCCAACTCCGAAAGCGCCCACGCATGCCACAGCTTCAGCTTCATGTCGCGGGGC -3 L D L E R A D L N L S E R T R V T D F D F Y L A G
4726 ACTGTGAGATTTTCCAGCGGTGTCGCCTTCACGCCCACATGGTGGATACCCGTGTTGCTGTTGAAGGGGCCGGCG TGACACTCTAAAAGGTCGCCACAGCGGAAGTGCGGGTGTACCACCTATGGGCACAACGACAACTTCCCCGGCCGC -3 V T L N E L P T A K V G V H H I G T N S N F P G A
4801 TAGTTGGCAGCGACTTCACCCTGGAACCAGGTGCCGTAACCGCTGGACAGGCCGCTGAACAGCGCGTCCCAGCCT ATCAACCGTCGCTGAAGTGGGACCTTGGTCCACGGCATTGGCGACCTGTCCGGCGACTTGTCGCGCAGGGTCGGA -3 Y N A A V E G Q F W T G Y G S S L G S F L A D W G
4876 GCCGAGTAGCGGGTGTAGCGGTAGGTAACCTGCGGTGCCCACGGCAGGTCGGCGAAGGTGTAGCCGGCCTGCAGG CGGCTCATCGCCCACATCGCCATCCATTGGACGCCACGGGTGCCGTCCAGCCGCTTCCACATCGGCCGGACGTCC -3 A S Y R T Y R Y T V Q P A W P L D A F T Y G A Q L
4951 TACCAGGCTTGCTCGGGGCCGTCGGTCTTGTCCTGCCAGGCGTATTCGAAGGCGAAACTGGCATTGTCGATGCCA ATGGTCCGAACGAGCCCCGGCAGCCAGAACAGGACGGTCCGCATAAGCTTCCGCTTTGACCGTAACAGCTACGGT -3 Y W A Q E P G D T K D Q W A Y E F A F S A N D I G
5026 GCGTTGCCTTCGCCGCGCACGCTATACACGTCCATGCCTTCGCGGGCTTTCTGAAAGTCGCTGGCCCATTGGTCG CGCAACGGAAGCGGCGCGTGCGATATGTGCAGGTACGGAAGCGCCCGAAAGACTTTCAGCGACCGGGTAACCAGC -3 A N G E G R V S Y V D M G E R A K Q F D S A W Q D
5101 GTGACGTCGATGCCGTGAATCCAGGTCAGCCCGAGGGTGCCCAAGGCTTGGGTGTAGTCCAGCGTGCCGGCGGCC CACTGCAGCTACGGCACTTAGGTCCAGTCGGGCTCCCACGGGTTCCGAACCCACATCAGGTCGCACGGCCGCCGG -3 T V D I G H I W T L G L T G L A Q T Y D L T G A A
5176 AGTTCGGTTTCGGCCTGGGCGCGGTTGTCGGATTTCAGCCACAGCAGGCTGCCATGCAGGCCATCGCTGCCCCCC TCAAGCCAAAGCCGGACCCGCGCCAACAGCCTAAAGTCGGTGTCGTCCGACGGTACGTCCGGTAGCGACGGGGGG -3 L E T E A Q A R N D S K L W L L S G H L G D S G G
5251 AGGCGCAGCATTGCGGTGCGGTCGAAGGCGTGGCGGGCGGCCAGGTAGTAGGCCCCGCCGCGGTCCAGCGCACCG TCCGCGTCGTAACGCCACGCCAGCTTCCGCACCGCCCGCCGGTCCATCATCCGGGGCGGCGCCAGGTCGCGTGGC -3 L R L M A T R D F A H R A A L Y Y A G G R D L A G
5326 TCGGCGACGCCGTTGCCCAGGTTCGGGCCGTCGTCGTTGATCAAAAAACCACTGCCCAGGCGAATGGTCTGGCGG AGCCGCTGCGGCAACGGGTCCAAGCCCGGCAGCAGCAACTAGTTTTTTGGTGACGGGTCCGCTTACCAGACCGCC -3 D A V G N G L N P G D D N I L F G S G L R I T Q R
5401 CCGGCGGAAACGTCCACTCCATCCTTGCCCAGCACCGGGAACAGGTCGGCCGAGCGCCAGCCGAGGAAGGCGTCT GGCCGCCTTTGCAGGTGAGGTAGGAACGGGTCGTGGCCCTTGTCCAGCCGGCTCGCGGTCGGCTCCTTCCGCAGA -3 G A S V D V G D K G L V P F L D A S R W G L F A D
5476 TCGATCTTGGTGGTGCGTTCGGAGCCATCGGTGTTGCCGGCCGCATCGCCATCGCCCCAGGTGGCCGAGCTCACC
AGCTAGAACCACCACGCAAGCCTCGGTAGCCACAACGGCCGGCGTAGCGGTAGCGGGGTCCACCGGCTCGAGTGG -3 E I K T T R E S G D T N G A A D G D G W T A S S V
5551 CAGTTCAGGCTGCCGTACAGCGTGCCGTTGCCGGCCAGGCCCTGGTCACCGCTGAGGCCATACTTGATAAAGCCT GTCAAGTCCGACGGCATGTCGCACGGCAACGGCCGGTCCGGGACCAGTGGCGACTCCGGTATGAACTATTTCGGA -3 W N L S G Y L T G N G A L G Q D G S L G Y K I F G
5626 TCACGCCAGGTCGAACCCCCTGTGGTGCCGTCGTAGTTCTTGCGGCTGTTGAACATGCCCCATACCGCCAGCATG AGTGCGGTCCAGCTTGGGGGACACCACGGCAGCATCAAGAACGCCGACAACTTGTACGGGGTATGGCGGTCGTAC -3 E R W T S G G T T G D Y N K R S N F M G W V A L M
5701 TCGGCGTTCAGGTGGCTGTCATCGTCGGCGTACAGCTCAACGGCCGGCGCGGCCTGGCTGGCCAGCAAGGTTGCC AGCCGCAAGTCCACCGACAGTAGCAGCCGCATGTCGAGTTGCCGGCCGCGCCGGACCGACCGGTCGTTCCAACGG -3 D A N L H S D D D A Y L E V A P A A Q S A L L T A
5776 AGGGCCAGGCTGGACAGCGTCTGTGGTTTGACCATTTGCACATCCCTCGTTTGTTCTCGGCCACCTTCACAGGGG TCCCGGTCCGACCTGTCGCAGACACCAAACTGGTAAACGTGTAGGGAGCAAACAAGAGCCGGTGGAAGTGTCCCC -3 L A L S S L T Q P K V M
tynE
5851 CCTTTGTTGTTCGGGGGCACCCTCGGTTCTGGCGAGGGGCCATCGCGGTTGGCGGCGATGGCCTATTAGGGCGTG GGAAACAACAAGCCCCCGTGGGAGCCAAGACCGCTCCCCGGTAGCGCCAACCGCCGCTACCGGATAATCCCGCAC
5926 TGCGGTGGGGCGGGGTCTTGTTCGTGGCTGCCAAGGCGCTTGCACGCCTTGGCCACAGGCGCGGTCAGTAGCGGA ACGCCACCCCGCCCCAGAACAAGCACCGACGGTTCCGCGAACGTGCGGAACCGGTGTCCGCGCCAGTCATCGCCT -1 & Y R I
6001 TCATCACCGACTTGAGCTCGGTGAAGTCATCGATGAAGGCCGAGCCGAACTCGCGGCCAATGCCGGAAGCCTTGA AGTAGTGGCTGAACTCGAGCCACTTCAGTAGCTACTTCCGGCTCGGCTTGAGCGCCGGTTACGGCCTTCGGAACT -1 M V S K L E T F D D I F A S G F E R G I G S A K I
6076 TGCCCCCAAACGGTACAGCCGGGTCGAGCAGGGTGTGCATGTTGACCCACAGGGTACCGGCCTGGATTTGCGGGA ACGGGGGTTTGCCATGTCGGCCCAGCTCGTCCCACACGTACAACTGGGTGTCCCATGGCCGGACCTAAACGCCCT -1 G G F P V A P D L L T H M N V W L T G A Q I Q P I
6151 TCATGCGCATGGCCTTGCCCAGGTCGTTGGTCCACAGGCTGGCGCTGAGGCCGTAGGGCGAGGCGTTCATCAGGT AGTACGCGTACCGGAACGGGTCCAGCAACCAGGTGTCCGACCGCGACTCCGGCATCCCGCTCCGCAAGTAGTCCA -1 M R M A K G L D N T W L S A S L G Y P S A N M L H
6226 GCAGCAGTTCGTCTTCGTCGTCATAAGGCAGGAAGGTCGCCACAGGGCCGAAGGTTTCCTGGGTGAGCAGGGTGT CGTCGTCAAGCAGAAGCAGCAGTATTCCGTCCTTCCAGCGGTGTCCCGGCTTCCAAAGGACCCACTCGTCCCACA -1 L L E D E D D Y P L F T A V P G F T E Q T L L T D
6301 CGCAGGCTGACCGGGCGAGGATTACCGTGGGTTCGACGAAACAGCCGGGGCCGTCGCCCAGGGTGCCGCCGTGAA GCGTCCGACTGGCCCGCTCCTAATGGCACCCAAGCTGCTTTGTCGGCCCCGGCAGCGGGTCCCACGGCGGCACTT -1 C A S R A L I V T P E V F C G P G D G L T G G H I
6376 TGATCTGGCTGCCTTCGGCGCGGGCGATGGCGAACAGTTCGGCCAGCTTCTGCTGGTGCGGCTTGTTGGCCACGG ACTAGACCGACGGAAGCCGCGCCCGCTACCGCTTGTCAAGCCGGTCGAAGACGACCACGCCGAACAACCGGTGCC -1 I Q S G E A R A I A F L E A L K Q Q H P K N A V P
6451 GGCCGAACTGGGTGGCCTCGTCCAGTGGCGAGCCGATTTTCAGTTGGCCCAGGCGCTGGGACAGGGCGTCCAGCA CCGGCTTGACCCACCGGAGCAGGTCACCGCTCGGCTAAAAGTCAACCGGGTCCGCGACCCTGTCCCGCAGGTCGT -1 G F Q T A E D L P S G I K L Q G L R Q S L A D L L
6526 GCGGGTCGATGCGCGAGCGGTGCACATAGAAGCGCTCGCCCGCGGCGCAGATTTGCCCCGAGTGCAGGAAGCCGG CGCCCAGCTACGCGCTCGCCACGTGTATCTTCGCGAGCGGGCGCCGCGTCTAAACGGGGCTCACGTCCTTCGGCC -1 P D I R S R H V Y F R E G A A C I Q G S H L F G A
6601 CCTCGATGATGCCGTCCACAGCCTTGTCGGTTGCCACGTCGGGCAGGAAGGCCACCGCGTTCTTGCCGCCCAGTT GGAGCTACTACGGCAGGTGTCGGAACAGCCAACGGTGCAGCCCGTCCTTCCGGTGGCGCAAGAACGGCGGGTCAA -1 E I I G D V A K D T A V D P L F A V A N K G G L E
6676 CCAGTGTCGCACGGGTCAGCTTGGCGCCCATGGCAGCCTGGCCTACGGCGATGCCAGTGGGCACGGAGCCGGTGA GGTCACAGCGTGCCCAGTCGAACCGCGGGTACCGTCGGACCGGATGCCGCTACGGTCACCCGTGCCTCGGCCACT -1 L T A R T L K A G M A A Q G V A I G T P V S G T F
6751 ACGAGACCTTGTCGGTACCTGCGTGCTCGATCAGTGCCTTGCCCACCAGGCCACCACCGGTCAGCACGTTCAGTG TGCTCTGGAACAGCCATGGACGCACGAGCTAGTCACGGAACGGGTGGTCCGGTGGTGGCCAGTCGTGCAAGTCAC -1 S V K D T G A H E I L A K G V L G G G T L V N L A
6826 CACCGGCCGGCAGGCCTGCTTCGGTGGCCAGTTCGGCAATGCGCAGCAGCGTCAGCGGGGTGAATTCGCTGGGCT GTGGCCGGCCGTCCGGACGAAGCCACCGGTCAAGCCGTTACGCGTCGTCGCAGTCGCCCCACTTAAGCGACCCGA -1 G A P L G A E T A L E A I R L L T L P T F E S P K 6901 TGAGGATAATGCTGCAGCCGGTTGTCAGGGCCGAGGCCAGCTTCCAGATGGCGATCATGCTGGCGAAGTTCCACG ACTCCTATTACGACGTCGGCCAACAGTCCCGGCTCCGGTCGAAGGTCTACCGCTAGTACGACCGCTTCAAGGTGC -1 L I I S C G T T L A S A L K W I A I M S A F N W P
6976 GCACGATGCCCACCACCACGCCAATCGGCTCGCGCAGGGTGAAGGCGCTGTAGCGCTCACCGGCGAACGAGGGCA CGTGCTACGGGTGGTGGTGCGGTTAGCCGAGCGCGTCCCACTTCCGCGACATCGCGAGTGGCCGCTTGCTCCCGT -1 V I G V V V G I P E R L T F A S Y R E G A F S P L
7051 GCGACGGGGTGATGGTCTGGCCGGTGATCTTGGTCGCCCAGCCGGCGTAGTAGCGCAGGAAGTGCGCGGCCTGCT CGCTGCCCCACTACCAGACCGGCCACTAGAACCAGCGGGTCGGCCGCATCATCGCGTCCTTCACGCGCCGGACGA -1 S P T I T Q G T I K T A W G A Y Y R L F H A A Q Q
7126 GTACTTCGAACGCACGGGAAATGCCGATGAGCTTGCCGGATTGCAAGGTTTCCAGCTGCGCCAGTTCTTCGCGGT CATGAAGCTTGCGTGCCCTTTACGGCTACTCGAACGGCCTAACGTTCCAAAGGTCGACGCGGTCAAGAAGCGCCA -1 V E F A R S I G I L K G S Q L T E L Q A L E E R N
7201 TGGCTTCCAGCAGGTCGGCCAGCTTGAACAGCACTGCGGCGCGGGCGGCGGGGCTGGTGTGCGACCAGGCGGTAA ACCGAAGGTCGTCCAGCCGGTCGAACTTGTCGTGACGCCGCGCCCGCCGCCCCGACCACACGCTGGTCCGCCATT -1 A E L L D A L K F L V A A R A A P S T H S W A T F
7276 AGCCTTGGCGCGAGGAGCTGACGGCATGGTCGACATCGGCCTGGTTGGCGTCGGCGATGTGGGCGATGGTCTGGC TCGGAACCGCGCTCCTCGACTGCCGTACCAGCTGTAGCCGGACCAACCGCAGCCGCTACACCCGCTACCAGACCG -1 G Q R S S S V A H D V D A Q N A D A I H A I T Q G
7351 CGTTGGCCGGGTTGACCACGGCAATGTTCGACGACGACTGGCTGGCGAGGTGCTGGCCGTGGATGAACACGCCAT GCAACCGGCCCAACTGGTGCCGTTACAAGCTGCTGCTGACCGACCGCTCCACGACCGGCACCTACTTGTGCGGTA -1 N A P N V V A I N S S S Q S A L H Q G H I F V G H
7426 GCTCGCGGGCCAGGAAGGCCGTGACGGCAGGTAGGAGGGTGATGTCGCTCATGCAGACTCCGGGGCAGTTGGCCA CGAGCGCCCGGTCCTTCCGGCACTGCCGTCCATCCTCCCACTACAGCGAGTACGTCTGAGGCCCCGTCAACCGGT -1 E R A L F A T V A P L L T I D S M
tynC
7501 AAGTTTGCAGCTTAATAAGCGGGGCAGTGCGGTGCTTGTGCCTGCGTGACAGGTGCATGACTGTGGCTGCCAACC TTCAAACGTCGAATTATTCGCCCCGTCACGCCACGAACACGGACGCACTGTCCACGTACTGACACCGACGGTTGG
7576 GCACTGGGTAAGCCTTGTGGGAGCGGCCTTGTGTCGCGATAGGGCCGCAGAGCGGCCCCGGCGATGTTGGCGGCG CGTGACCCATTCGGAACACCCTCGCCGGAACACAGCGCTATCCCGGCGTCTCGCCGGGGCCGCTACAACCGCCGC
7651 AAGCTGAAAATGCTGGGGCCGCTTCGCGCCCCTATCGCGACGCAAGGCCGCTCCCACAAAAAAAGCGAGCGTAGG TTCGACTTTTACGACCCCGGCGAAGCGCGGGGATAGCGCTGCGTTCCGGCGAGGGTGTTTTTTTCGCTCGCATCC
7726 CCGGGCTGATTGCTGGCAGGCAGCAACAAGCCCGGCGGCAGCCATCGGCAAGACGCCATGCCACCGGCAGCGCAC GGCCCGACTAACGACCGTCCGTCGTTGTTCGGGCCGCCGTCGGTAGCCGTTCTGCGGTACGGTGGCCGTCGCGTG
tynG
+3 T M S L N N K L T E H 7801 AGTAATCACTCGTTCAACGCCACAAAAACAAGCCGGGGCATACGATGTCACTCAATAACAAGCTCACCGAGCACC TCATTAGTGAGCAAGTTGCGGTGTTTTTGTTCGGCCCCGTATGCTACAGTGAGTTATTGTTCGAGTGGCTCGTGG
+3 L N R G T V G F P T A L A S T V G L I M A S P V I 7876 TCAACCGCGGCACTGTCGGTTTCCCCACCGCACTGGCCAGCACTGTCGGGCTGATCATGGCCAGCCCGGTGATCC AGTTGGCGCCGTGACAGCCAAAGGGGTGGCGTGACCGGTCGTGACAGCCCGACTAGTACCGGTCGGGCCACTAGG
+3 L T A T M G F G I G G S A F A V A M V I A A L M M 7951 TCACCGCGACCATGGGCTTTGGCATCGGCGGCAGCGCCTTCGCCGTGGCCATGGTCATCGCCGCACTGATGATGC AGTGGCGCTGGTACCCGAAACCGTAGCCGCCGTCGCGGAAGCGGCACCGGTACCAGTAGCGGCGTGACTACTACG +3 L A Q S T T F A E A A S I L P T T G S V Y D Y I N8026 TGGCGCAGTCCACCACCTTTGCCGAGGCTGCGTCGATCCTGCCGACCACGGGCTCGGTATACGACTACATCAACT ACCGCGTCAGGTGGTGGAAACGGCTCCGACGCAGCTAGGACGGCTGGTGCCCGAGCCATATGCTGATGTAGTTGA
+3 C G M G R F F A I T G T L S A Y L I V H V F A G T 8101 GTGGCATGGGCCGTTTCTTCGCCATTACCGGCACGCTGTCGGCCTACCTGATCGTGCATGTGTTCGCCGGTACCG CACCGTACCCGGCAAAGAAGCGGTAATGGCCGTGCGACAGCCGGATGGACTAGCACGTACACAAGCGGCCATGGC
+3 A E T I L S G V M A L V N F E H L N T L A E S A G 8176 CCGAAACCATCCTGTCGGGGGTGATGGCGCTGGTGAACTTCGAGCACCTCAATACCCTGGCGGAATCCGCCGGCG GGCTTTGGTAGGACAGCCCCCACTACCGCGACCACTTGAAGCTCGTGGAGTTATGGGACCGCCTTAGGCGGCCGC
+3 G S W L L G V C F V V A F A V L N A F G V S A F S 8251 GTTCGTGGCTGCTGGGGGTGTGCTTCGTGGTGGCGTTTGCGGTGCTCAATGCCTTTGGCGTCAGCGCCTTCAGCC CAAGCACCGACGACCCCCACACGAAGCACCACCGCAAACGCCACGAGTTACGGAAACCGCAGTCGCGGAAGTCGG
+3 R A E V V L T F G M W T T L M V F G V L G L I A A 8326 GCGCGGAAGTGGTCCTCACCTTCGGCATGTGGACCACCTTGATGGTGTTCGGCGTGCTTGGCCTGATCGCCGCAC CGCGCCTTCACCAGGAGTGGAAGCCGTACACCTGGTGGAACTACCACAAGCCGCACGAACCGGACTAGCGGCGTG
+3 P A V E L D G P F G V S L V G T D L M T I L S L V 8401 CCGCAGTGGAACTGGACGGCCCGTTCGGCGTGTCGCTGGTGGGCACCGACCTGATGACCATCCTCTCGCTGGTCG GGCGTCACCTTGACCTGCCGGGCAAGCCGCACAGCGACCACCCGTGGCTGGACTACTGGTAGGAGAGCGACCAGC
+3 G M A M F M F V G C E F V T P L A P E L R R S A W 8476 GCATGGCCATGTTCATGTTCGTTGGCTGCGAGTTCGTCACGCCGCTTGCCCCCGAACTGCGTCGCTCGGCCTGGG CGTACCGGTACAAGTACAAGCAACCGACGCTCAAGCAGTGCGGCGAACGGGGGCTTGACGCAGCGAGCCGGACCC
+3 V L P R A M A L G L F G V A S C M F I Y G A A M K 8551 TGCTGCCGCGGGCCATGGCGCTGGGCCTGTTTGGCGTGGCCAGCTGCATGTTCATCTACGGAGCGGCGATGAAGC ACGACGGCGCCCGGTACCGCGACCCGGACAAACCGCACCGGTCGACGTACAAGTAGATGCCTCGCCGCTACTTCG
+3 R Q V E N V V L D A A S G V H L L D T P M A I P R8626 GCCAGGTGGAAAACGTGGTGCTGGATGCCGCCAGTGGCGTGCACCTGCTGGACACGCCCATGGCCATCCCGCGCT CGGTCCACCTTTTGCACCACGACCTACGGCGGTCACCGCACGTGGACGACCTGTGCGGGTACCGGTAGGGCGCGA
+3 F A E Q V M G D I G P V W L G I G F L F A G A A T8701 TCGCCGAGCAGGTGATGGGTGATATTGGCCCAGTGTGGCTGGGTATCGGCTTCCTGTTCGCCGGCGCGGCCACCA AGCGGCTCGTCCACTACCCACTATAACCGGGTCACACCGACCCATAGCCGAAGGACAAGCGGCCGCGCCGGTGGT
+3 I N T L M A G V P R I L Y G M A V D G A L P K V F 8776 TCAACACGCTGATGGCCGGTGTGCCACGCATTCTTTACGGCATGGCGGTGGACGGCGCGTTGCCCAAGGTGTTCA AGTTGTGCGACTACCGGCCACACGGTGCGTAAGAAATGCCGTACCGCCACCTGCCGCGCAACGGGTTCCACAAGT
+3 T Y L H P R F K T P L L C I L V V A L I P C L H A 8851 CCTACCTGCACCCGCGCTTCAAGACGCCGCTGCTGTGCATCCTGGTGGTGGCGTTGATCCCTTGCCTGCATGCCT GGATGGACGTGGGCGCGAAGTTCTGCGGCGACGACACGTAGGACCACCACCGCAACTAGGGAACGGACGTACGGA
+3 W Y L G G N P D N I L H L V L A A V C A W S T A Y 8926 GGTACCTGGGCGGCAACCCGGACAACATCCTGCACCTGGTGCTGGCCGCCGTGTGCGCCTGGAGCACCGCCTACC CCATGGACCCGCCGTTGGGCCTGTTGTAGGACGTGGACCACGACCGGCGGCACACGCGGACCTCGTGGCGGATGG
+3 L L V T L S V V I L R I R R P D L P R A Y R S P L 9001 TGCTGGTGACCCTGTCGGTGGTGATATTGCGCATCCGCCGCCCAGACCTGCCGCGTGCCTACCGCTCGCCGCTGT ACGACCACTGGGACAGCCACCACTATAACGCGTAGGCGGCGGGTCTGGACGGCGCACGGATGGCGAGCGGCGACA
+3 F P L P Q I F S S S G I L I G M A F I T P P G M N 9076 TCCCGTTGCCGCAGATATTCTCCAGTAGCGGTATCCTCATCGGCATGGCGTTCATCACACCGCCGGGCATGAACC AGGGCAACGGCGTCTATAAGAGGTCATCGCCATAGGAGTAGCCGTACCGCAAGTAGTGTGGCGGCCCGTACTTGG
+3 P A D V Y V P F A I M L G A T A A Y A L F W T L W 9151 CTGCCGATGTCTACGTGCCGTTCGCCATCATGCTTGGCGCCACTGCGGCCTATGCATTGTTCTGGACGCTGTGGG GACGGCTACAGATGCACGGCAAGCGGTAGTACGAACCGCGGTGACGCCGGATACGTAACAAGACCTGCGACACCC
+3 V Q K V N P F K P A R V E D V L E K E F A A E P G 9226 TGCAGAAGGTCAACCCGTTCAAGCCGGCGCGGGTCGAGGATGTGCTCGAGAAAGAGTTTGCTGCCGAGCCTGGCC ACGTCTTCCAGTTGGGCAAGTTCGGCCGCGCCCAGCTCCTACACGAGCTCTTTCTCAAACGACGGCTCGGACCGG
+3 H A V E H V L H D Q K F A & 9301 ACGCCGTGGAGCACGTGCTGCATGATCAGAAATTTGCGTGAACGCTTGCTGGCGCCCCGAGCGCCTTCAGGCTAT TGCGGCACCTCGTGCACGACGTACTAGTCTTTAAACGCACTTGCGAACGACCGCGGGGCTCGCGGAAGTCCGATA
9376 CGCCCAGGCGCCACGCTGGCATGCCTGGCGCGCAACCTGGGGCAGCAGAACCTGGTGGCGGCCGGGGTGATCCAC GCGGGTCCGCGGTGCGACCGTACGGACCGCGCGTTGGACCCCGTCGTCTTGGACCACCGCCGGCCCCACTAGGTG
9451 GACCCGGCCCAGGGTTGGCAGGCCACGGTGCACGAACGCGTCGAGGCCCACCTGCTGATGCACATCGTCACCTGT CTGGGCCGGGTCCCAACCGTCCGGTGCCACGTGCTTGCGCAGCTCCGGGTGGACGACTACGTGTAGCAGTGGACA
9526 GAGTTCCAGCTGCAGTTGCCTGCTCCGCAAGGGGGCGAGGTCAGCCTGGAGCTGCGCCATACCGGTGCGCTTCGC CTCAAGGTCGACGTCAACGGACGAGGCGTTCCCCCGCTCCAGTCGGACCTCGACGCGGTATGGCCACGCGAAGCG
9601 CGTGCCGGCCTGGCCTGTGTGTACCGCAAGGGCGACCGGGCGCGCTTCGCCCGACTGCGCGACCGGTTGCTGCAG GCACGGCCGGACCGGACACACATGGCGTTCCCGCTGGCCCGCGCGAAGCGGGCTGACGCGCTGGCCAACGACGTC
9676 CAGGCCGCACTGGTGGCGGCGCTGATGCCGCTGGATTTCAAGCGCCTGACCTTGGCCTGGCGCGACGGCCAATGG GTCCGGCGTGACCACCGCCGCGACTACGGCGACCTAAAGTTCGCGGACTGGAACCGGACCGCGCTGCCGGTTACC 9751 TTGCTGACCCTGGAGCACATGGGCGGTAGCGAAGTGGTCAACCGCATGCCAGCGTTTCGCCGCTACATCCCCATC AACGACTGGGACCTCGTGTACCCGCCATCGCTTCACCAGTTGGCGTACGGTCGCAAAGCGGCGATGTAGGGGTAG
9826 AGCCCGCAACAGCGGGCGCACCTGATGGCCAGCCTGGCCCAGTTCAACACTTTGCTACCTAACCTTTGACGCAAA TCGGGCGTTGTCGCCCGCGTGGACTACCGGTCGGACCGGGTCAAGTTGTGAAACGATGGATTGGAAACTGCGTTT
9901 CTGGCATACGCCTTGCTGTATCAAGCGACGAATGATGACAGTTGTGCGCACATAGATAACATGTTAACAATGTGC GACCGTATGCGGAACGACATAGTTCGCTGCTTACTACTGTCAACACGCGTGTATCTATTGTACAATTGTTACACG
tynR
+3 M H T Q Q S N R Q G L E R W9976 GCATAACAACAAATCCTGCGTCGAGGGCAGCCATGCATACTCAACAATCCAACCGTCAGGGGCTGGAACGCTGGA
+3 T T A M Q Q I C G R F E T E L A S N H S L F I G E10051 CCACGGCCATGCAACAGATCTGTGGCCGTTTCGAGACGGAACTTGCGTCCAATCACTCGCTGTTCATCGGCGAGG GGTGCCGGTACGTTGTCTAGACACCGGCAAAGCTCTGCCTTGAACGCAGGTTAGTGAGCGACAAGTAGCCGCTCC
+3 V S T F S R A G L P L A N L R T N A G N I R R L G 10126 TTTCTACCTTTTCCCGTGCCGGCTTGCCGCTGGCCAACCTGCGCACCAATGCCGGCAACATCCGCCGGCTGGGCG AAAGATGGAAAAGGGCACGGCCGAACGGCGACCGGTTGGACGCGTGGTTACGGCCGTTGTAGGCGGCCGACCCGC
+3 E N P T L D D D Q H C F L V S Q R A G H S T V S Q10201 AAAACCCGACCCTTGACGATGACCAGCATTGTTTCCTGGTCAGCCAGCGTGCGGGGCATTCCACCGTGTCCCAGG TTTTGGGCTGGGAACTGCTACTGGTCGTAACAAAGGACCAGTCGGTCGCACGCCCCGTAAGGTGGCACAGGGTCC
+3 G G M Q V S L A P G E L L L M D S V G R C E I T P10276 GGGGCATGCAGGTCAGCCTGGCGCCGGGTGAGCTGCTGCTGATGGATTCGGTCGGGCGCTGCGAAATCACCCCCA CCCCGTACGTCCAGTCGGACCGCGGCCCACTCGACGACGACTACCTAAGCCAGCCCGCGACGCTTTAGTGGGGGT
+3 S G L I E H V S L A L S R E Q V R K Y V Q G S G P10351 GTGGGTTGATCGAACATGTCTCGCTGGCCCTGTCGCGTGAGCAGGTACGCAAGTATGTGCAAGGCAGCGGCCCGA CACCCAACTAGCTTGTACAGAGCGACCGGGACAGCGCACTCGTCCATGCGTTCATACACGTTCCGTCGCCGGGCT
+3 M F G K I S S S N A C G R M L H V L M D Q L C K D10426 TGTTTGGCAAGATCTCCTCGAGCAACGCCTGCGGGCGCATGCTGCATGTGCTGATGGACCAACTGTGCAAGGACG ACAAACCGTTCTAGAGGAGCTCGTTGCGGACGCCCGCGTACGACGTACACGACTACCTGGTTGACACGTTCCTGC
+3 G N V S G D G A Q G D A L Q T A F I A L L E P G F10501 GCAATGTAAGCGGTGATGGGGCCCAGGGCGACGCGCTGCAGACCGCCTTCATTGCCCTGCTGGAGCCAGGCTTCG CGTTACATTCGCCACTACCCCGGGTCCCGCTGCGCGACGTCTGGCGGAAGTAACGGGACGACCTCGGTCCGAAGC
+3 E R H G E A L G N L G A L N G A N L R G Y V Q Q V10576 AGCGCCATGGCGAAGCGCTGGGCAACCTTGGGGCCTTGAACGGGGCCAACCTGCGGGGCTACGTGCAGCAGGTGA TCGCGGTACCGCTTCGCGACCCGTTGGAACCCCGGAACTTGCCCCGGTTGGACGCCCCGATGCACGTCGTCCACT
+3 I D E S L S Q P G L T P S N L A G R L N I S V R H10651 TCGACGAGTCCCTGTCACAGCCCGGGCTGACCCCGTCCAACCTGGCCGGTCGCCTGAACATCTCGGTGCGTCACC AGCTGCTCAGGGACAGTGTCGGGCCCGACTGGGGCAGGTTGGACCGGCCAGCGGACTTGTAGAGCCACGCAGTGG
+3 L Y R L F E E E G D S V C R Y I Q R A R L K R S A10726 TGTACCGGCTGTTCGAGGAGGAGGGCGATAGTGTGTGCCGCTACATTCAGCGGGCGCGCCTGAAGCGCAGTGCGG ACATGGCCGACAAGCTCCTCCTCCCGCTATCACACACGGCGATGTAAGTCGCCCGCGCGGACTTCGCGTCACGCC
+3 D D L A N P F F R S E S I T S I A Y K W G F T D S 10801 ATGACCTGGCCAACCCGTTCTTCAGGAGCGAGTCGATTACCTCGATTGCCTACAAGTGGGGGTTTACCGACTCGG TACTGGACCGGTTGGGCAAGAAGTCCTCGCTCAGCTAATGGAGCTAACGGATGTTCACCCCCAAATGGCTGAGCC
+3 A H F S R S F K K Q F E R S P K D Y R A Q A M V & 10876 CGCATTTCAGCCGCTCGTTCAAGAAACAGTTCGAACGCTCGCCCAAGGACTACCGGGCGCAGGCGATGGTTTGAG GCGTAAAGTCGGCGAGCAAGTTCTTTGTCAAGCTTGCGAGCGGGTTCCTGATGGCCCGCGTCCGCTACCAAACTC
10951 TGTGATGGTGCTGCTTGTGCGGGCCTCATCGCCGGCAAGTCACTTGGCGGCGGTTCAGCGACGGCCGTTGAAGTA ACACTACCACGACGAACACGCCCGGAGTAGCGGCCGTTCAGTGAACCGCCGCCAAGTCGCTGCCGGCAACTTCAT & R R G N F Y
11026 GCCCGACAGCTGGTGCACGGTCTTGCCGGCAGTGAGCAGCAGCGGGCGGAAATGGTCCTTGCCGAGGATGCGCGC CGGGCTGTCGACCACGTGCCAGAACGGCCGTCACTCGTCGTCGCCCGCCTTTACCAGGAACGGCTCCTACGCGCG -2 G S L Q H V T K G A T L L L P R F H D K G L I R A
11101 ATGCTTGACCGAGCTGACCAGGTCATAGCGCTTCGATCCCTCCTGCATACCCTCGGCGAGTATCTTGCAAATGAT TACGAACTGGCTCGACTGGTCCAGTATCGCGAAGCTAGGGAGGACGTATGGGAGCCGCTCATAGAACGTTTACTA -2 H K V S S V L D Y R K S G E Q M G E A L I K C I I
11176 GTGGCTGGGCGTGACGCCAAAGCCGGAGTAGCCCTGCACATAGAAAGCGTTGGGGCGGTTGTCGAGGGTGCCTAT CACCGACCCGCACTGCGGTTTCGGCCTCATCGGGACGTGTATCTTTCGCAACCCCGCCAACAGCTCCCACGGATA -2 H S P T V G F G S Y G Q V Y F A N P R N D L T G I 11251 CTGCGGAAACAGGTTGGCACTGGTGGCCATCGGGCCGCCCCAGGCCAGGTCGATGCGCACGTCTTTCAGGTAGGG
GACGCCTTTGTCCAACCGTGACCACCGGTAGCCCGGCGGGGTCCGGTCCAGCTACGCGTGCAGAAAGTCCATCCC -2 Q P F L N A S T A M P G G W A L D I R V D K L Y P
11326 GAAAATCTTCAGCATCAGCGCGCGGTTCCACGCCTTCAGGTCCAGCGGGAAGTGCTCGACGAAGGGCGTGGCGGC CTTTTAGAAGTCGTAGTCGCGCGCCAAGGTGCGGAAGTCCAGGTCGCCCTTCACGAGCTGCTTCCCGCACCGCCG -2 F I K L M L A R N W A K L D L P F H E V F P T A A
11401 GCCAAACAGCAGGCGGTTCTCGCGGGTGACCCGGTAGTAGTCGATCACCGGGCGGATGTCGCTGTAGGCCCCGCG CGGTTTGTCGTCCGCCAAGAGCGCCCACTGGGCCATCATCAGCTAGTGGCCCGCCTACAGCGACATCCGGGGCGC -2 G F L L R N E R T V R Y Y D I V P R I D S Y A G R
11476 TATCGGGCTGATGCGCTCGATCAGCTCATCCGGCAATGGCTCGGTCATCATCTGGAAGGCATAGGTGTTTATAGT ATAGCCCGACTACGCGAGCTAGTCGAGTAGGCCGTTACCGAGCCAGTAGTAGACCTTCCGTATCCACAAATATCA -2 I P S I R E I L E D P L P E T M M Q F A Y T N I T
11551 GCGTGCGTGCAGCTGCGGCTCCAGCTTGTTGAGGAAGCTGTCGCACGCCCACAGCAGCTTGCTGGCGCGTACCGA CGCACGCACGTCGACGCCGAGGTCGAACAACTCCTTCGACAGCGTGCGGGTGTCGTCGAACGACCGCGCATGGCT -2 R A H L Q P E L K N L F S D C A W L L K S A R V S
11626 GCCACGGCCGGTGCGTACCGTGATGCGCTCGCCGTAGGTCACTTCCAGGGCCGGGCTGTGTTCGAAGATGCGCGC CGGTGCCGGCCACGCATGGCACTACGCGAGCGGCATCCAGTGAAGGTCCCGGCCCGACACAAGCTTCTACGCGCG -2 G R G T R V T I R E G Y T V E L A P S H E F I R A 11701 ACCATGGCCCACCAGTGCCTGCGCTTCGCCCAGCAGCAGGTTCAGGGAATGCACATGGCCACCGCCCATGTGCAT TGGTACCGGGTGGTCACGGACGCGAAGCGGGTCGTCGTCCAAGTCCCTTACGTGTACCGGTGGCGGGTACACGTA -2 G H G V L A Q A E G L L L N L S H V H G G G M H M
11776 CAGGGCGCTGCTGTAGGCGTTGCTGCCGATGATCTGGCGCACTTCGCTGCCACCGAGAAAACGGATCTCGTCGCG GTCCCGCGACGACATCCGCAACGACGGCTACTAGACCGCGTGAAGCGACGGTGGCTCTTTTGCCTAGAGCAGCGC -2 L A S S Y A N S G I I Q R V E S G G L F R I E D R
11851 GGTATTGATCGCCTTGAACGCCTTCTCCCATTTGCGCAGGGTCTGTTCCTGGCGGCGGTTGAAGCCCATGTAGCC CCATAACTAGCGGAACTTGCGGAAGAGGGTAAACGCGTCCCAGACAAGGACCGCCGCCAACTTCGGGTACATCGG -2 T N I A K F A K E W K R L T Q E Q R R N F G M Y G
11926 ATAGCCGTGGCAGAAGTCGGCGTCGATGGCGTAGCGGGCGATGCGGTCCTTGATGATGCCGGCGCCCAGTTCGCT TATCGGCACCGTCTTCAGCCGCAGCTACCGCATCGCCCGCTACGCCAGGAACTACTACGGCCGCGGGTCAAGCGA -2 Y G H C F D A D I A Y R A I R D K I I G A G L E S
12001 GATTTCGAAAATATCCCTCACGCCCTGATCACCGACGCTGCTGCGGATCTTCTCCAGGTCGTGGCCGATGCCCGC CTAAAGCTTTTATAGGGAGTGCGGGACTAGTGGCTGCGACGACGCCTAGAAGAGGTCCAGCACCGGCTACGGGCG -2 I E F I D R V G Q D G V S S R I K E L D H G I G A
12076 CATGATCTGCCCGCCGTTGCGCCCGCTACCGCCGTAGCCCAGATAACGGCCCTCGAGCACGACGATATTGGTCAC GTACTAGACGGGCGGCAACGCGGGCGATGGCGGCATCGGGTCTATTGCCGGGAGCTCGTGCTGCTATAACCAGTG -2 M I Q G G N R G S G G Y G L Y R G E L V V I N T V
12151 GCCTTGTTCCGCCAGCTCCAGGGCGGTGTTAATGCCGGAGAAACCGCCACCGATCACCACGACATCGGCCTCGAT CGGAACAAGGCGGTCGAGGTCCCGCCACAATTACGGCCTCTTTGGCGGTGGCTAGTGGTGCTGTAGCCGGAGCTA -2 G Q E A L E L A T N I G S F G G G I V V V D A E I
12226 GTCGCGTTCCAGGGTTGGGAAGCTCAGGTTGTACTTCTTGGTCGCCGAGTAGTAGGTGGGGCTCTCGAGGGTGAT CAGCGCAAGGTCCCAACCCTTCGAGTCCAACATGAAGAACCAGCGGCTCATCATCCACCCCGAGAGCTCCCACTA -2 D R E L T P F S L N Y K K T A S Y Y T P S E L T I
12301 CATGACGCCGCCTGCTGACTGGAAATGGGTAGAAATCATTCTATTAATGTATTAATGATTGTGCACTGGCATACT GTACTGCGGCGGACGACTGACCTTTACCCATCTTTAGTAAGATAATTACATAATTACTAACACGTGACCGTATGA -2 M V G G A S Q F H T S I M tynDhpaR
+3 M T T P R P S L T L T L L 12376 CGCCGGTTTGCTATTTCCAGCCTCCTTGAGCCCGCATGACCACACCGAGACCCTCCCTGACCCTGACCTTGCTGC GCGGCCAAACGATAAAGGTCGGAGGAACTCGGGCGTACTGGTGTGGCTCTGGGAGGGACTGGGACTGGAACGACG
+3 Q A R E A T M A F F R P A L N A H D L T E Q Q W R12451 AGGCGCGCGAAGCCACCATGGCGTTCTTCCGCCCGGCGCTGAATGCCCATGACCTGACCGAGCAGCAATGGCGGG TCCGCGCGCTTCGGTGGTACCGCAAGAAGGCGGGCCGCGACTTACGGGTACTGGACTGGCTCGTCGTTACCGCCC
+3 V I R I L R Q Q G E L E S H Q L A E L A C I L K P12526 TAATCCGTATCCTGCGCCAGCAAGGCGAGCTGGAAAGCCATCAGTTGGCGGAGCTGGCCTGTATCCTCAAACCCA ATTAGGCATAGGACGCGGTCGTTCCGCTCGACCTTTCGGTAGTCAACCGCCTCGACCGGACATAGGAGTTTGGGT
+3 S M S G V L K R L E R D G I V A R R K S P E D Q R12601 GTATGAGCGGGGTGCTCAAGCGCCTGGAGCGTGACGGCATCGTAGCGCGGCGCAAGTCGCCGGAGGACCAGCGCC
CATACTCGCCCCACGAGTTCGCGGACCTCGCACTGCCGTAGCATCGCGCCGCGTTCAGCGGCCTCCTGGTCGCGG
+3 R V F I S L T E A G Q Q A F L A M S E E M T R N Y12676 GGGTGTTCATCAGCCTGACCGAGGCCGGCCAGCAAGCGTTTCTGGCGATGAGCGAGGAGATGACCCGCAACTACG CCCACAAGTAGTCGGACTGGCTCCGGCCGGTCGTTCGCAAAGACCGCTACTCGCTCCTCTACTGGGCGTTGATGC
+3 D K I L A Q F G D D K L Q Q L M Q L L G E M K K I 12751 ACAAGATCCTCGCCCAGTTTGGCGATGACAAGCTGCAGCAGCTGATGCAGCTGCTGGGTGAAATGAAGAAGATCA TGTTCTAGGAGCGGGTCAAACCGCTACTGTTCGACGTCGTCGACTACGTCGACGACCCACTTTACTTCTTCTAGT
+3 K P &
12826 AACCCTGACGCGCCAGGCGTCAGCGGTTGAGTGACAGCGAGTCTTCCAGCACTTTCAGCAGTGCTGCCGCGCGCC TTGGGACTGCGCGGTCCGCAGTCGCCAACTCACTGTCGCTCAGAAGGTCGTGAAAGTCGTCACGACGGCGCGCGG -1 & R N L S L S D E L V K L L A A A R R
12901 GCTCATAGGCGTCGGGGCCTGCGTACATCAGCTCTACATACAGGCTGTCGATGATGCCCAGGTAGGCATCGGCAT CGAGTATCCGCAGCCCCGGACGCATGTAGTCGAGATGTATGTCCGACAGCTACTACGGGTCCATCCGTAGCCGTA -1 E Y A D P G A Y M L E V Y L S D I I G L Y A D A Y
12976 ACAGCGCCAGGCGGCTGTGCTGCTCATGCGCCCAGCCGTGGCGAGCTTGCAGGGCCACGCTGAACCCTTCGCGTA TGTCGCGGTCCGCCGACACGACGAGTACGCGGGTCGGCACCGCTCGAACGTCCCGGTGCGACTTGGGAAGCGCAT -1 L A L R S H Q E H A W G H R A Q L A V S F G E R I
13051 TGCCGTCCAGGTACTGTTCAAAGCCCGAAGTGACAATCGGCTTGATGCCCGCCGGGGGCAGGAACGCCGTGCGCA ACGGCAGGTCCATGACAAGTTTCGGGCTTCACTGTTAGCCGAACTACGGGCGGCCCCCGTCCTTGCGGCACGCGT -1 G D L Y Q E F G S T V I P K I G A P P L F A T R L
13126 ACACGAAGCGCAGTTGGGCCGAGTCGCGATAACGTTCGGCCAGGTGCAGGGCCAGCCAGTGCCCCGCCGCCAGGC TGTGCTTCGCGTCAACCCGGCTCAGCGCTATTGCAAGCCGGTCCACGTCCCGGTCGGTCACGGGGCGGCGGTCCG -1 V F R L Q A S D R Y R E A L H L A L W H G A A L G
13201 CGTCGCGGGCTTCCTGCGCAAAGCCGTGCTCGACAAAGGCCGTTTCCTGCACAAGCGCACGCTGGAACACCTCCA GCAGCGCCCGAAGGACGCGTTTCGGCACGAGCTGTTTCCGGCAAAGGACGTGTTCGCGTGCGACCTTGTGGAGGT -1 D R A E Q A F G H E V F A T E Q V L A R Q F V E V
13276 CGAACAAGGCGTCCTTGTTGGCGAAATGCGCATACAGCGATGCCTTGCGCATGCCCGCCAACTGGGCGATTTCGT GCTTGTTCCGCAGGAACAACCGCTTTACGCGTATGTCGCTACGGAACGCGTACGGGCGGTTGACCCGCTAAAGCA -1 F L A D K N A F H A Y L S A K R M G A L Q A I E N
13351 TCAGCGAAGAGGCGTCATAACCGTACTCGGCGAAGTGGCCGACGGCGGCATCGCACACACGCACCGCAGAAGGGG AGTCGCTTCTCCGCAGTATTGGCATGAGCCGCTTCACCGGCTGCCGCCGTAGCGTGTGTGCGTGGCGTCTTCCCC -1 L S S A D Y G Y E A F H G V A A D C V R V A S P S
13426 AAAGGTCTTTCAACAGCATCACTCCGTCAGGGGCGCGGCGGGCCGCGCGCGTCTTGAGGGTGGGATTGTGGTGAT TTTCCAGAAAGTTGTCGTAGTGAGGCAGTCCCCGCGCCGCCCGGCGCGCGCAGAACTCCCACCCTAACACCACTA -1 L D K L L M
tetR
13501 CGAAAATGCACGGGTCAATGCTTGTCGCAAGGCAATTTCCGGGCGCCATGGAAAGTGCAATGTTCCCCTCGTAAC GCTTTTACGTGCCCAGTTACGAACAGCGTTCCGTTAAAGGCCCGCGGTACCTTTCACGTTACAAGGGGAGCATTG
hpaB
+3 M 13576 GTGCATTCCTCCACCCAATCGCCGCTCACATACTGATCGCGTCTTCGAATCCAATAAGAAAGAGACCGCTCATGA CACGTAAGGAGGTGGGTTAGCGGCGAGTGTATGACTAGCGCAGAAGCTTAGGTTATTCTTTCTCTGGCGAGTACT
+3 K K P N P L L E D L K S V L P T I A A N A M R A E 13651 AAAAGCCAAACCCCCTGCTGGAAGACCTGAAGTCCGTCCTGCCGACCATTGCCGCCAATGCCATGCGTGCAGAGC TTTTCGGTTTGGGGGACGACCTTCTGGACTTCAGGCAGGACGGCTGGTAACGGCGGTTACGGTACGCACGTCTCG
+3 Q D R S V P A E N I A L L K S I G M H R A F L P K13726 AGGACCGCAGTGTGCCGGCAGAGAATATCGCCTTGCTGAAAAGCATCGGCATGCACCGCGCTTTCTTGCCCAAAC TCCTGGCGTCACACGGCCGTCTCTTATAGCGGAACGACTTTTCGTAGCCGTACGTGGCGCGAAAGAACGGGTTTG
+3 H F G G M E I T L P E F A Q C I A L L A G A C A S13801 ACTTCGGCGGCATGGAAATCACCCTGCCGGAGTTCGCCCAGTGCATCGCCTTGCTGGCGGGGGCCTGCGCCAGCA TGAAGCCGCCGTACCTTTAGTGGGACGGCCTCAAGCGGGTCACGTAGCGGAACGACCGCCCCCGGACGCGGTCGT
+3 T A W A M S L L C T H S H Q M A M F S P K L Q Q E13876 CAGCCTGGGCCATGAGCCTGCTGTGCACCCACAGCCACCAGATGGCAATGTTCTCGCCCAAGCTACAACAGGAGG GTCGGACCCGGTACTCGGACGACACGTGGGTGTCGGTGGTCTACCGTTACAAGAGCGGGTTCGATGTTGTCCTCC
+3 V W G S D P D A T A S S S I A P F G R T E E V E G
13951 TGTGGGGTAGCGACCCGGATGCTACCGCCAGCAGCAGTATCGCGCCGTTCGGCCGCACTGAAGAGGTTGAGGGTG ACACCCCATCGCTGGGCCTACGATGGCGGTCGTCGTCATAGCGCGGCAAGCCGGCGTGACTTCTCCAACTCCCAC
+3 G V S F S G E M G W S S G C D H A E W A I L G F R 14026 GCGTGTCGTTCAGCGGCGAAATGGGCTGGAGTTCCGGTTGCGACCACGCCGAATGGGCGATTCTCGGTTTCCGCC CGCACAGCAAGTCGCCGCTTTACCCGACCTCAAGGCCAACGCTGGTGCGGCTTACCCGCTAAGAGCCAAAGGCGG
+3 R K N A E G A Q D Y C F A I L P R S D Y E I R D D14101 GCAAGAATGCCGAAGGCGCTCAGGATTACTGCTTCGCCATCCTGCCTCGCAGTGACTATGAAATCCGTGATGACT CGTTCTTACGGCTTCCGCGAGTCCTAATGACGAAGCGGTAGGACGGAGCGTCACTGATACTTTAGGCACTACTGA
+3 W Y A V G M R G S G S K T L I V R D A F V P E H R 14176 GGTATGCCGTGGGCATGCGCGGCAGCGGCAGCAAGACCCTGATCGTGCGTGATGCCTTCGTGCCCGAGCACCGCA CCATACGGCACCCGTACGCGCCGTCGCCGTCGTTCTGGGACTAGCACGCACTACGGAAGCACGGGCTCGTGGCGT
+3 I Q K A K D M M E G K S A G F G L Y P D S K I F F14251 TCCAGAAGGCCAAGGACATGATGGAGGGCAAGTCGGCGGGCTTTGGTTTGTACCCCGACAGCAAGATTTTCTTCG AGGTCTTCCGGTTCCTGTACTACCTCCCGTTCAGCCGCCCGAAACCAAACATGGGGCTGTCGTTCTAAAAGAAGC
+3 A P Y R P Y F A S G F S T V S L G V A E R M L E V 14326 CCCCGTATCGCCCGTATTTTGCCAGCGGCTTCTCCACGGTCAGCTTGGGCGTTGCCGAGCGCATGCTGGAGGTGT GGGGCATAGCGGGCATAAAACGGTCGCCGAAGAGGTGCCAGTCGAACCCGCAACGGCTCGCGTACGACCTCCACA
+3 F R E K T R N R V R A Y T G A A V G A A T P A L M 14401 TCCGCGAGAAAACCCGCAACCGCGTGCGTGCCTACACCGGTGCTGCCGTGGGCGCCGCCACCCCGGCGCTGATGC AGGCGCTCTTTTGGGCGTTGGCGCACGCACGGATGTGGCCACGACGGCACCCGCGGCGGTGGGGCCGCGACTACG
+3 R L A E S T H Q V A A A R A L L E K S W D E I A E14476 GCCTGGCCGAGTCGACCCATCAGGTGGCCGCTGCCCGGGCATTGCTGGAAAAGAGCTGGGACGAGATTGCCGAGC CGGACCGGCTCAGCTGGGTAGTCCACCGGCGACGGGCCCGTAACGACCTTTTCTCGACCCTGCTCTAACGGCTCG
+3 H S A R H E Y P S R G T L A F W R T N Q G Y A V K14551 ACAGTGCCCGTCACGAATACCCGTCGCGTGGCACGCTGGCGTTCTGGCGTACCAACCAGGGCTACGCCGTGAAGA TGTCACGGGCAGTGCTTATGGGCAGCGCACCGTGCGACCGCAAGACCGCATGGTTGGTCCCGATGCGGCACTTCT
+3 M C I Q A V D R L M E A A G G G A W F E S N E L Q14626 TGTGCATCCAGGCCGTCGACCGCCTGATGGAAGCGGCCGGTGGTGGCGCCTGGTTCGAGAGCAACGAACTGCAGC ACACGTAGGTCCGGCAGCTGGCGGACTACCTTCGCCGGCCACCACCGCGGACCAAGCTCTCGTTGCTTGACGTCG
+3 R L F R D S H M T G A H A Y T D Y D V C A Q I L G14701 GGCTGTTCCGCGATTCGCACATGACCGGTGCCCATGCCTACACCGATTACGACGTGTGTGCGCAAATCCTCGGCC CCGACAAGGCGCTAAGCGTGTACTGGCCACGGGTACGGATGTGGCTAATGCTGCACACACGCGTTTAGGAGCCGG
+3 R E L M G L E P D P A M V & 14776 GCGAGCTGATGGGCCTGGAGCCTGACCCGGCGATGGTCTGAGCCGCCACTTGTTTTCACCCATCCCCTACAAGCA CGCTCGACTACCCGGACCTCGGACTGGGCCGCTACCAGACTCGGCGGTGAACAAAAGTGGGTAGGGGATGTTCGT
hpaC
+1 M S K E T F D S R A 14851 CAACAACAAACAGGGCAGGCTGCCAGGCCTGCCCGGGAGTCTTGCATGTCCAAAGAAACCTTCGATTCACGTGCC GTTGTTGTTTGTCCCGTCCGACGGTCCGGACGGGCCCTCAGAACGTACAGGTTTCTTTGGAAGCTAAGTGCACGG
+1 F R R A L G N F A T G V T V V T A A G P S G R K V 14926 TTCCGCCGCGCCCTGGGCAACTTCGCCACCGGCGTGACCGTGGTGACTGCCGCCGGCCCCAGTGGCCGCAAGGTC AAGGCGGCGCGGGACCCGTTGAAGCGGTGGCCGCACTGGCACCACTGACGGCGGCCGGGGTCACCGGCGTTCCAG
+1 G V T A N S F N S V S L D P A L I L W S I D K R S 15001 GGCGTTACCGCCAACAGCTTCAACTCGGTGTCGCTGGACCCGGCGCTGATCCTGTGGAGCATCGACAAGCGCTCC CCGCAATGGCGGTTGTCGAAGTTGAGCCACAGCGACCTGGGCCGCGACTAGGACACCTCGTAGCTGTTCGCGAGG
+1 T S H E V F E E A S H F A V N I L A A D Q I D L S15076 ACCAGCCATGAAGTGTTCGAAGAGGCCTCGCACTTTGCCGTGAACATTCTGGCTGCGGACCAGATCGACCTGTCC TGGTCGGTACTTCACAAGCTTCTCCGGAGCGTGAAACGGCACTTGTAAGACCGACGCCTGGTCTAGCTGGACAGG
+1 N N F A R P K E D R F A G I D Y E T G T G G A P L 15151 AACAACTTTGCCCGCCCGAAGGAAGATCGCTTTGCCGGTATCGACTACGAGACCGGCACTGGCGGCGCGCCGTTG TTGTTGAAACGGGCGGGCTTCCTTCTAGCGAAACGGCCATAGCTGATGCTCTGGCCGTGACCGCCGCGCGGCAAC
+1 F A D C A A R F E C E K Y Q Q L D G G D H W I L V15226 TTCGCCGATTGCGCGGCGCGCTTTGAGTGTGAAAAGTACCAGCAGCTGGACGGTGGCGATCACTGGATCCTGGTG AAGCGGCTAACGCGCCGCGCGAAACTCACACTTTTCATGGTCGTCGACCTGCCACCGCTAGTGACCTAGGACCAC
+1 G K V V A F D D F G R S P L L Y H Q G A Y S M V L15301 GGCAAGGTAGTGGCCTTTGATGACTTTGGCCGCTCGCCGCTGCTGTATCACCAGGGCGCCTATTCAATGGTGCTG CCGTTCCATCACCGGAAACTACTGAAACCGGCGAGCGGCGACGACATAGTGGTCCCGCGGATAAGTTACCACGAC
+1 P H T R M T Q G A E G Q A P S S H F Q G R L Q H N15376 CCGCATACCCGCATGACCCAAGGCGCAGAGGGGCAGGCACCGAGCAGCCACTTCCAGGGCCGCCTGCAGCACAAC GGCGTATGGGCGTACTGGGTTCCGCGTCTCCCCGTCCGTGGCTCGTCGGTGAAGGTCCCGGCGGACGTCGTGTTG
+1 L Y Y L M T Q A L R A Y Q A D Y Q P R Q L C T G L15451 CTGTACTACCTGATGACCCAGGCGCTGCGTGCCTACCAGGCTGACTACCAGCCACGCCAGCTGTGTACCGGCCTG GACATGATGGACTACTGGGTCCGCGACGCACGGATGGTCCGACTGATGGTCGGTGCGGTCGACACATGGCCGGAC
+1 R T S E A R M L M V L E N D A G L S L N D L Q R E15526 CGCACCAGCGAGGCACGCATGCTGATGGTGCTGGAGAACGATGCGGGCCTGAGCCTGAACGACCTGCAACGCGAA GCGTGGTCGCTCCGTGCGTACGACTACCACGACCTCTTGCTACGCCCGGACTCGGACTTGCTGGACGTTGCGCTT
+1 V A M P A R E I E E A V A N L K R K G L I A D D E 15601 GTGGCGATGCCGGCGCGGGAGATCGAGGAAGCGGTTGCCAACCTCAAGCGCAAAGGGCTGATTGCCGATGACGAA CACCGCTACGGCCGCGCCCTCTAGCTCCTTCGCCAACGGTTGGAGTTCGCGTTTCCCGACTAACGGCTACTGCTT
+1 G R V R L S V K G V D E T E A L W T I A R Q Q Q D15676 GGGCGAGTGCGGCTATCGGTGAAGGGCGTGGACGAGACCGAGGCGTTGTGGACCATTGCCCGGCAACAGCAGGAC CCCGCTCACGCCGATAGCCACTTCCCGCACCTGCTCTGGCTCCGCAACACCTGGTAACGGGCCGTTGTCGTCCTG
+1 K V F G Q F S E Q Q L E T F K T V L K A L I N I & 15751 AAGGTGTTCGGGCAGTTCAGTGAACAGCAGCTGGAGACTTTCAAGACCGTGCTCAAGGCCCTTATCAACATCTGA TTCCACAAGCCCGTCAAGTCACTTGTCGTCGACCTCTGAAAGTTCTGGCACGAGTTCCGGGAATAGTTGTAGACT
15826 ACACGCTTTGGGATGGCACCGGCTGTTTTGGATGGCACCGGCTGTGCCGGTGTTCGCGGATGAACCCGCTCCCAC TGTGCGAAACCCTACCGTGGCCGACAAAACCTACCGTGGCCGACACGGCCACAAGCGCCTACTTGGGCGAGGGTG
15901 AGGTCCAGCGCCAGTAGCAACTTCGGCGCGGTACCTGTGGGAGCGGCTTTAGCCGCGAACACCGGCAAAGCCGGT TCCAGGTCGCGGTCATCGTTGAAGCCGCGCCATGGACACCCTCGCCGAAATCGGCGCTTGTGGCCGTTTCGGCCA
15976 GCCATCCAACCAGAAGCCTCAGTAGGCACCACCCCCGGCACTGGGGACTACCACTGTATCCTTGAACTTCCCCGC CGGTAGGTTGGTCTTCGGAGTCATCCGTGGTGGGGGCCGTGACCCCTGATGGTGACATAGGAACTTGAAGGGGCG -2 G D L W F G & Y A G G G A S P V V V T D K F K G A
16051 CAGCTCGCGCAGCCCGCGCATCAGCACCGTGGTATCCACACCCACCGCCACAAACGCCGCACCCAGCTCGATGTA GTCGAGCGCGTCGGGCGCGTAGTCGTGGCACCATAGGTGTGGGTGGCGGTGTTTGCGGCGTGGGTCGAGCTACAT -2 L E R L G R M L V T T D V G V A V F A A G L E I Y
16126 GCGTCGCGCCAGTTTCTCGTCCGCGCTGAGAATGCCGGCGGCTTTGCCCGCCTTGCCAATGCGCACGATTGCGTC CGCAGCGCGGTCAAAGAGCAGGCGCGACTCTTACGGCCGCCGAAACGGGCGGAACGGTTACGCGTGCTAACGCAG -2 R R A L K E D A S L I G A A K G A K G I R V I A D
16201 TTCAATCGCCGCCTGCACCTCCGGGTGCCCGGGGTTGCCGCGATGCCCCATGGCCGCACTCAGGTCTGCAGGCCC AAGTTAGCGGCGGACGTGGAGGCCCACGGGCCCCAACGGCGCTACGGGGTACCGGCGTGAGTCCAGACGTCCGGG -2 E I A A Q V E P H G P N G R H G M A A S L D A P G
16276 GATGAACACGCCATCCACACCTTCCACTGCAACGATCTCGTCCAGGTTGGCCAGGCCTTCCTTGTTCTCGATCTG CTACTTGTGCGGTAGGTGTGGAAGGTGACGTTGCTAGAGCAGGTCCAACCGGTCCGGAAGGAACAAGAGCTAGAC -2 I F V G D V G E V A V I E D L N A L G E K N E I Q
16351 CACCAGCAGGCACATTTGCTCATCGGCGTGGTCCAGGTAACCGGGGAGGGTGTTCCAGCGCGAAGCCCGCGCCAG GTGGTCGTCCGTGTAAACGAGTAGCCGCACCAGGTCCATTGGCCCCTCCCACAAGGTCGCGCTTCGGGCGCGGTC -2 V L L C M Q E D A H D L Y G P L T N W R S A R A L
16426 CGCGCTGCCCACCCCGCGAATGCCCTTGGGCGGGTAATGCATGGCCTTGACCAGTTGCCGCGCCTGTTCGGCAGT GCGCGACGGGTGGGGCGCTTACGGGAACCCGCCCATTACGTACCGGAACTGGTCAACGGCGCGGACAAGCCGTCA -2 A S G V G R I G K P P Y H M A K V L Q R A Q E A T
16501 TTCCACCATCGGCACCAGCAAGGTTTGTGCGCCGATATCCAGCACCTGCTTGATCAGCGCGGTATCGCCGATCAC AAGGTGGTAGCCGTGGTCGTTCCAAACACGCGGCTATAGGTCGTGGACGAACTAGTCGCGCCATAGCGGCTAGTG -2 E V M P V L L T Q A G I D L V Q K I L A T D G I V
16576 CGGGCGGATCACTGCCTGGCTGGGGTAGGGTGCCACCGCCTGCAACTGGGCGAGCATGCCGCGCAGGTCGTTGGG GCCCGCCTAGTGACGGACCGACCCCATCCCACGGTGGCGGACGTTGACCCGCTCGTACGGCGCGTCCAGCAACCC -2 P R I V A Q S P Y P A V A Q L Q A L M G R L D N P
16651 CGCGTGTTCGCCGTCGATCAGCAGCCAGTCGAAACCGGCATTGGCCGCCAGCTCGGCGCAGTAGGCATCGGCCAG
GCGCACAAGCGGCAGCTAGTCGTCGGTCAGCTTTGGCCGTAACCGGCGGTCGAGCCGCGTCATCCGTAGCCGGTC -2 A H E G D I L L W D F G A N A A L E A C Y A D A L
16726 GCCGAGCCACAGGCCGATTTGCGGTTCACCGCTGTGCAGGCGTCGCTTGAAGTGGTTGATGGGCATGTCCATGAG CGGCTCGGTGTCCGGCTAAACGCCAAGTGGCGACACGTCCGCAGCGAACTTCACCAACTACCCGTACAGGTACTC -2 G L W L G I Q P E G S H L R R K F H N I P M D M
hpaI
16801 CAGGTCCTTAAACGAAGCGGCAGGCGATGGAGCCGAGCATGTCGTAGTCGACGTGGAAGGTGTCACCTGGGCGAG GTCCAGGAATTTGCTTCGCCGTCCGCTACCTCGGCTCGTACAGCATCAGCTGCACCTTCCACAGTGGACCCGCTC -1 # V F R C A I S G L M D Y D V H F T D G P R A
16876 CGGCGACCGGGCGGGTGAACGAACCCCCAAGGATGATCTGGCCGGGCTGCAAGGTGACGTCGTACGGCGCCAGTT GCCGCTGGCCCGCCCACTTGCTTGGGGGTTCCTACTAGACCGGCCCGACGTTCCACTGCAGCATGCCGCGGTCAA -1 A V P R T F S G G L I I Q G P Q L T V D Y P A L K
16951 TGTTGGCCAGCCAGGCAACGCCTTTGGCCGGGTGGTTGAGCACGGCAGCGCTGACCCCGGATTCCTCGATCACGC ACAACCGGTCGGTCCGTTGCGGAAACCGGCCCACCAACTCGTGCCGTCGCGACTGGGGCCTAAGGAGCTAGTGCG -1 N A L W A V G K A P H N L V A A S V G S E E I V G
17026 CATTGCGGTAGAGCACCGCCGGCACTTTGCGCAGGTCGATTTCGGTGGGGCGCACGGCCCGCCCGCCCATCACCA GTAACGCCATCTCGTGGCGGCCGTGAAACGCGTCCAGCTAAAGCCACCCCGCGTGCCGGGCGGGCGGGTAGTGGT -1 N R Y L V A P V K R L D I E T P R V A R G G M V V
17101 CGCCGGCATTGGCGGCGTTGTCGGAGATGGTGTCGAACACCTTGCGGGTGGCCTGGGTTTGCGGGTCCACCTGCT GCGGCCGTAACCGCCGCAACAGCCTCTACCACAGCTTGTGGAACGCCCACCGGACCCAAACGCCCAGGTGGACGA -1 G A N A A N D S I T D F V K R T A Q T Q P D V Q Q
17176 GGATGCGCGCGTCAATGATTTCCAGCGCCGGGATCACCCACTCGGTGGCGTCCAGCACATCAAACACGGTGATGT CCTACGCGCGCAGTTACTAAAGGTCGCGGCCCTAGTGGGTGAGCCACCGCAGGTCGTGTAGTTTGTGCCACTACA -1 I R A D I I E L A P I V W E T A D L V D F V T I N
17251 TCGGGCCCTTCAGCGGCTTGCCGAGGATGAACGCCAACTCCACTTCAACCCGCGGCACGATGAAGCGCTCGAAGG AGCCCGGGAAGTCGCCGAACGGCTCCTACTTGCGGTTGAGGTGAAGTTGGGCGCCGTGCTACTTCGCGAGCTTCC -1 P G K L P K G L I F A L E V E V R P V I F R E F P
17326 GGATGTCGCTGCCTTCGTCGAACAGCATGTCGTCGAGCAAGGCGCCGTAGTCGGGCTCGGTGATGTTCGACGATA CCTACAGCGACGGAAGCAGCTTGTCGTACAGCAGCTCGTTCCGCGGCATCAGCCCGAGCCACTACAAGCTGCTAT -1 I D S G E D F L M D D L L A G Y D P E T I N S S V
17401 CCTGCATGGCGCGCGAGGTCAGGCCGATCTTGTGGCCCACCAGCTTGCGCCCGGCGGCGATCTTTTTTGCCACCC GGACGTACCGCGCGCTCCAGTCCGGCTAGAACACCGGGTGGTCGAACGCGGGCCGCCGCTAGAAAAAACGGTGGG -1 Q M A R S T L G I K H G V L K R G A A I K K A V W
17476 AGGCGCGCTGGATGGCGTAGGCGTCTTCGATGGTGATTGCCGGTTGCTCCAGCGAGAACTGGCGCACTTGCTCGC TCCGCGCGACCTACCGCATCCGCAGAAGCTACCACTAACGGCCAACGAGGTCGCTCTTGACCGCGTGAACGAGCG -1 A R Q I A Y A D E I T I A P Q E L S F Q R V Q E R
17551 GGGAGCGTTCGGCCTGGTCGAGGCGGTCGGCGGCGTGCTGGATGAAAGCGTTGTCTAGCATGGGGGCGGTCTCTT CCCTCGCAAGCCGGACCAGCTCCGCCAGCCGCCGCACGACCTACTTTCGCAACAGATCGTACCCCCGCCAGAGAA -1 S R E A Q D L R D A A H Q I F A N D L M
hpaH
17626 GATTCAAGGGTTGACGATGGCAGCCTGGGTGCGCAACACCAGCAGGCCGCCCAGGGCGATGAAGACGGCGAGTAC CTAAGTTCCCAACTGCTACCGTCGGACCCACGCGTTGTGGTCGTCCGGCGGGTCCCGCTACTTCTGCCGCTCATG -2 & P N V I A A Q T R L V L L G G L A I F V A L V
17701 GTACAGAGCAAGGCTGGCGCTGTGGGTGGTGTCGCGCACCCAGCCGATGAAGTAGGGCGTGAAGAACGAGGCGAT CATGTCTCGTTCCGACCGCGACACCCACCACAGCGCGTGGGTCGGCTACTTCATCCCGCACTTCTTGCTCCGCTA -2 Y L A L S A S H T T D R V W G I F Y P T F F S A I
17776 GCTGCCCAGCGAGCTGATCAGGGCAATGCCGGCGGCCTGGGTACGGGCGTTGAGGAACGCCGGCGGCAGTTGCCA CGACGGGTCGCTCGACTAGTCCCGTTACGGCCGCCGGACCCATGCCCGCAACTCCTTGCGGCCGCCGTCAACGGT -2 S G L S S I L A I G A A Q T R A N L F A P P L Q W
17851 GAACATCGGCAGCGCAGCGCTGGCGCCCATGCCGGCCAGCACCAGGCCGGCCATTACCGGCAGCGCCTGCTCGGG CTTGTAGCCGTCGCGTCGCGACCGCGGGTACGGCCGGTCGTGGTCCGGCCGGTAATGGCCGTCGCGGACGAGCCC -2 F M P L A A S A G M G A L V L G A M V P L A Q E P
17926 GGCAATGGCCGCAATAGCGATGCCGATGGCAGCCATCAGCAGCGGTACGCACAGGTGCCAGCGGCGTTCGCGTTG CCGTTACCGGCGTTATCGCTACGGCTACCGTCGGTAGTCGTCGCCATGCGTGTCCACGGTCGCCGCAAGCGCAAC
- 2 A I A A I A I G I A A M L L P V C L H W R R E R Q
18001 GCGGTCGCTGGAGCGGCCGCACGCCAGCATGAACACGCAGCCGGCCACGTACGGCACAGCGCTGAGCAGGCCGAC CGCCAGCGACCTCGCCGGCGTGCGGTCGTACTTGTGCGTCGGCCGGTGCATGCCGTGTCGCGACTCGTCCGGCTG -2 R D S S R G C A L M F V C G A V Y P V A S L L G V
18076 ACTGGCGTCGCTGGCCACACCGGCACTGTGAATCAGGCTGGGCATCCAGAACGCAAGGGTATTCACCGCCAGCAT TGACCGCAGCGACCGGTGTGGCCGTGACACTTAGTCCGACCCGTAGGTCTTGCGTTCCCATAAGTGGCGGTCGTA -2 S A D S A V G A S H I L S P M W F A L T N V A L M
18151 CACCGCGCAATACACGGCCACCAACAGCCACAGCGCACGGCTTGCGAAAATGGCGCCGAACGAGGTTACGGGCTT GTGGCGCGTTATGTGCCGGTGGTTGTCGGTGTCGCGTGCCGAACGCTTTTACCGCGGCTTGCTCCAATGCCCGAA -2 V A C Y V A V L L W L A R S A F I A G F S T V P K
18226 GCGCTGTTCTTCCTCACCGAATTGCGCGCGCAGCGTGGCTTTCTGCTGCTCATCCAGCCAGCTCACCCGCTCGAA CGCGACAAGAAGGAGTGGCTTAACGCGCGCGTCGCACCGAAAGACGACGAGTAGGTCGGTCGAGTGGGCGAGCTT -2 R Q E E E G F Q A R L T A K Q Q E D L W S V R E F
18301 GTGCTCCGGCAAAACGGCCAGTACCACCAGGCCCAGCAACACCACCGGCGCCCCTTCGAGCAGGAACATCCACTG CACGAGGCCGTTTTGCCGGTCATGGTGGTCCGGGTCGTTGTGGTGGCCGCGGGGAAGCTCGTCCTTGTAGGTGAC -2 H E P L V A L V V L G L L V V P A G E L L F M W Q
18376 CCAGCCACGCAGCCCGCCCGTGTCGTGCATGAAGGCCAGTATGGCCCCGGACACTGGCCCGCCGACCACTCCGGC GGTCGGTGCGTCGGGCGGGCACAGCACGTACTTCCGGTCATACCGGGGCCTGTGACCGGGCGGCTGGTGAGGCCG -2 W G R L G G T D H M F A L I A G S V P G G V V G A
18451 CAACGGCACGGCAATGGCGAACAGCGCGGTGACCTGGGCGCGGCGCCCGGCCGGGTACCAGCGGTTGAGGTAAAC GTTGCCGTGCCGTTACCGCTTGTCGCGCCACTGGACCCGCGCCGCGGGCCGGCCCATGGTCGCCAACTCCATTTG -2 L P V A I A F L A T V Q A R R G A P Y W R N L Y V
18526 CAGAATGCCCGGGAAGAACCCGGCCTCGGCCGCGCCCAGGGCAAAGCGCAACAGGTAGAACGCGCTGCTGCTTTC GTCTTACGGGCCCTTCTTGGGCCGGAGCCGGCGCGGGTCCCGTTTCGCGTTGTCCATCTTGCGCGACGACGAAAG -2 L I G P F F G A E A A G L A F R L L Y F A S S S E
18601 GATCAGCAGCATGCTGGTCGACAACAGCCCCCACACCACCATCAGGCAGGCGATCCAGCGGCGTGGGCCAACGCG CTAGTCGTCGTACGACCAGCTGTTGTCGGGGGTGTGGTGGTAGTCCGTCCGCTAGGTCGCCGCACCCGGTTGCGC -2 I L L M S T S L L G W V V M L C A I W R R P G V R
18676 GTCGAGCATCAGGTTGCTGGGGACGCCGAACAGCGCATAGGCAATGAAGAACAGCCCGGCACCCAGGCCATAGAC CAGCTCGTAGTCCAACGACCCCTGCGGCTTGTCGCGTATCCGTTACTTCTTGTCGGGCCGTGGGTCCGGTATCTG -2 D L M L N S P V G F L A Y A I F F L G A G L G Y V
18751 CGTGTCGGACAAATGCAGGTCCTGGCTCATCTGCATCTTGGCGAAGCCAATGTTGATGCGGTCCAGGTGGGCGAA GCACAGCCTGTTTACGTCCAGGACCGAGTAGACGTAGAACCGCTTCGGTTACAACTACGCCAGGTCCACCCGCTT -2 T D S L H L D Q S M Q M K A F G I N I R D L H A F
18826 CAGGTAGCACACCAGCAGCAGCGGCATCAGCCGCCAGGTGACTGCCCGATGGGTACTGTCGGCCCGTTCAACGTG GTCCATCGTGTGGTCGTCGTCGCCGTAGTCGGCGGTCCACTGACGGGCTACCCATGACAGCCGGGCAAGTTGCAC -2 L Y C V L L L P M L R W T V A R H T S D A R E V H
18901 TGCCTCGCGCGGCGAGGCTTGTTCGAGTGTGCTCATGTTTTTGTACTTATTCTGTAATGAGTCGGGGAGGGCGTG ACGGAGCGCGCCGCTCCGAACAAGCTCACACGAGTACAAAAACATGAATAAGACATTACTCAGCCCCTCCCGCAC -2 A E R P S A Q E L T S M
hpaX
18976 GTTTGAGCCGGCGCGCTAGCGGTTGAACAGTGGGTGCAAGGTGCTGTGCTTGGCGTCGTAGACCTGGGCGGTGCT CAAACTCGGCCGCGCGATCGCCAACTTGTCACCCACGTTCCACGACACGAACCGCAGCATCTGGACCCGCCACGA @ R N F L P H L T S H K A D Y V Q A T S
19051 GTGGTCGATCTGCACGGTGATGCCGATCGGGCGCTGTTGCAGCAGTGGGTCCAGGCGCGCTTTCAACACTGCCAG CACCAGCTAGACGTGCCACTACGGCTAGCCCGCGACAACGTCGTCACCCAGGTCCGCGCGAAAGTTGTGACGGTC -2 H D I Q V T I G I P R Q Q L L P D L R A K L V A L
19126 CAAGCTGTCGCCCACTGTTTTGTGCACCTCGGCGCTACGGCCGGTAGCCATGCGCAGGTTGGCGTACAGAAAGCC GTTCGACAGCGGGTGACAAAACACGTGGAGCCGCGATGCCGGCCATCGGTACGCGTCCAACCGCATGTCTTTCGG -2 L S D G V T K H V E A S R G T A M R L N A Y L F G
19201 GTATTCGCCTTTGCCGTCGGCCACCGCGCAATGGGCGGCGGGGTAGGCCAGCACGCGTGTACCGCCAGTGGGGAA CATAAGCGGAAACGGCAGCCGGTGGCGCGTTACCCGCCGCCCCATCCGGTCGTGCGCACATGGCGGTCACCCCTT -2 Y E G K G D A V A C H A A P Y A L V R T G G T P F
19276 CACGGCTTTGCCTTCGGCATCGCGCTGTTCGAGCATGGTGTCGGCCAGGGCGCGGCACAGGCCGGGGATGTCGGC
GTGCCGAAACGGAAGCCGTAGCGCGACAAGCTCGTACCACAGCCGGTCCCGCGCCGTGTCCGGCCCCTACAGCCG -2 V A K G E A D R Q E L M T D A L A R C L G P I D A
19351 GTCGGTTTCCAGGTCGGGGGTATAGAGCAGAACCAGGTGTGGCATGGGGGCCTCCTCGGTGAGGGGCGGCTGGCC CAGCCAAAGGTCCAGCCCCCATATCTCGTCTTGGTCCACACCGTACCCCCGGAGGAGCCACTCCCCGCCGACCGG -2 D T E L D P T Y L L V L H P M
hpaF
19426 ACCCGCCAGGGCGACCAGCCGCGAACGGGTGGGTTACAGGCGGCTGGTGGGCACCACGGCGGCCGGGTTGGCGGC TGGGCGGTCCCGCTGGTCGGCGCTTGCCCACCCAATGTCCGCCGACCACCCGTGGTGCCGCCGGCCCAACCGCCG # L R S T P V V A A P N A A
19501 CTGGGCAGCGGGGATGGCACCACCGTCCTGCGGGGTGACCGGGAAGATCGCGTTGATCTGGCCGGTGCCGGAAGA GACCCGTCGCCCCTACCGTGGTGGCAGGACGCCCCACTGGCCCTTCTAGCGCAACTAGACCGGCCACGGCCTTCT -2 Q A A P I A G G D Q P T V P F I A N I Q G T G S S
19576 GCCGAAGTAGGGCGTGACCACTTCGGCCTTGCCGTCGTAATCGGACCAGCCCAGCGCACCCAGCAGCATTGCCGT CGGCTTCATCCCGCACTGGTGAAGCCGGAACGGCAGCATTAGCCTGGTCGGGTCGCGTGGGTCGTCGTAACGGCA -2 G F Y P T V V E A K G D Y D S W G L A G L L M A T
19651 GTCGTGCATGAAGCCTTCACCGTGGCCTTTGGCGGCGTACTCCGGCAGCATCCCGCAGAACGCTTCCCACTCGCC CAGCACGTACTTCGGAAGTGGCACCGGAAACCGCCGCATGAGGCCGTCGTAGGGCGTCTTGCGAAGGGTGAGCGG -2 D H M F G E G H G K A A Y E P L M G C F A E W E G
19726 GTCCTGCCACATTTGCACCACACGGTGGTCGAGGGTTTCGAGGAACGGGCTCCACACCTTGGTGGCAAAGTCCGG CAGGACGGTGTAAACGTGGTGTGCCACCAGCTCCCAAAGCTCCTTGCCCGAGGTGTGGAACCACCGTTTCAGGCC -2 D Q W M Q V V R H D L T E L F P S W V K T A F D P
19801 CGCCTGGCCGTTCTGCGCGAAGCGGTGCGACAGCGAGCCGCTGGCCAGGAACGCCACGGTGCCGTCGTAGTGGTC GCGGACCGGCAAGACGCGCTTCGCCACGCTGTCGCTCGGCGACCGGTCCTTGCGGTGCCACGGCAGCATCACCAG -2 A Q G N Q A F R H S L S G S A L F A V T G D Y H D
19876 TTCTACTGCCTTGCGCATGGCCCAGCCCAGGCGGGCACTGTCGGCCAGGTAGTGCGAGGTGCACAGGGCCGAGAC AAGATGACGGAACGCGTACCGGGTCGGGTCCGCCCGTGACAGCCGGTCCATCACGCTCCACGTGTCCCGGCTCTG -2 E V A K R M A W G L R A S D A L Y H S T C L A S V
19951 CGAGACCACTTTGAAGTGCTGGTCCTGGTTCATGTAGCGCATGGGCACCAGGGTGCCGTATTCCGGGGCGAGGGT GCTCTGGTGAAACTTCACGACCAGGACCAAGTACATCGCGTACCCGTGGTCCCACGGCATAAGGCCCCGCTCCCA -2 S V V K F H Q D Q N M Y R M P V L T G Y E P A L T
20026 GGTGGCGTGGTGGGCCATGGTTTCGACGTTGAAGCGGTTGCACTCCTCGGCCAGCAGCTTGCCCAGCTCGGGATT CCACCGCACCACCCGGTACCAAAGCTGCAACTTCGCCAACGTGAGGAGCCGGTCGTCGAACGGGTCGAGCCCTAA -2 T A H H A M T E V N F R N C E E A L L K G L E P N
20101 GCCGGGGAATGCGTAGGGCATGTTGCTGATGAAGTGCGGCAGTTCGTTGCTGGTGTACACGCCCTCGAAATGCGG CGGCCCCTTACGCATCCCGTACAACGACTACTTCACGCCGTCAAGCAACGACCACATGTGCGGGAGCTTTACGCC -2 G P F A Y P M N S I F H P L E N S T Y V G E F H P
20176 CCCGCACAGCACGTGGTAGTTGGCGTTGACCAGCCAGTGCGTGTCGAACACGACGATGGTGTCCACGCCCAGCTC GGGCGTGTCGTGCACCATCAACCGCAACTGGTCGGTCACGCACAGCTTGTGCTGCTACCACAGGTGCGGGTCGAG -2 G C L V H Y N A N V L W H T D F V V I T D V G L E
20251 ACGGCAACGGCGGCTGATTTCGTGATGCCCGTCGATGGCCGCCTGGCGAAAGCCTTGGCGCGGGCCTGGCAGTTC TGCCGTTGCCGCCGACTAAAGCACTACGGGCAGCTACCGGCGGACCGCTTTCGGAACCGCGCCCGGACCGTCAAG -2 R C R R S I E H H G D I A A Q R F G Q R P G P L E
20326 GGACATGTACATGGACGGTACATGGGTAATCTTGGCAGTGAGAGCGAGTTTGCCCATGGGGGTCTCCGATAAGAC CCTGTACATGTACCTGCCATGTACCCATTAGAACCGTCACTCTCGCTCAAACGGGTACCCCCAGAGGCTATTCTG -2 S M Y M S P V H T I K A T L A L K G M
hpaD
20401 GCTGTTGTTGTTTTGGGGCTGACCCGGTCCCTTGTAGGAGCGGCCTTGTTCCGGGATGGGGCGCACAGCGGCCCC CGACAACAACAAAACCCCGACTGGGCCAGGGAACATCCTCGCCGGAACAAGGCCCTACCCCGCGTGTCGCCGGGG
20476 GGCGATATCTGCGGCGAGGCTGAAATCCAGGGGCCGCTGCGCGCCCCATCGCGGGCACAAGGCCGCTCCTACACC CCGCTATAGACGCCGCTCCGACTTTAGGTCCCCGGCGACGCGCGGGGTAGCGCCCGTGTTCCGGCGAGGATGTGG
20551 CGGGCGGTGTAAACCGCACAGAGGGTTAGATGCCCCAGCGAGGAATGTGGTGATTACCCATGGAAATACACACGT GCCCGCCACATTTGGCGTGTCTCCCAATCTACGGGGTCGCTCCTTACACCACTAATGGGTACCTTTATGTGTGCA # I G W R P I H H N G M S I C V N
20626 TCTTGATCTCTGCAAAGACCTCGAAGCTGTACTGCCCGCCCTCACGCCCGGTACCGGAACCTTTCACGCCGCCGA AGAACTAGAGACGTTTCTGGAGCTTCGACATGACGGGCGGGAGTGCGGGCCATGGCCTTGGAAAGTGCGGCGGCT
- 1 K I E A F V E F S Y Q G G E R G T G S G K V G G F
20701 ACGGCTGGCGCAGGTCGCGTACGTTCTGGCTGTTGATGAACACCATGCCGGCCTCGATGCCACGGGCCAGGCGAT TGCCGACCGCGTCCAGCGCATGCAAGACCGACAACTACTTGTGGTACGGCCGGAGCTACGGTGCCCGGTCCGCTA -1 P Q R L D R V N Q S N I F V M G A E I G R A L R H
20776 GGGCTTTGCCGATGTCCTGGGTCCAGATGTACGAGGCCAGGCCATACTCGGTGTCGTTGGCCAGTTGCAGCGCCT CCCGAAACGGCTACAGGACCCAGGTCTACATGCTCCGGTCCGGTATGAGCCACAGCAACCGGTCAACGTCGCGGA -1 A K G I D Q T W I Y S A L G Y E T D N A L Q L A E
20851 CGGCTTCGTCCTTGAACGGGATCAGGCACACCACCGGGCCAAAGATTTCTTCCTGGGCAATGCGCATCTTGTTGT GCCGAAGCAGGAACTTGCCCTAGTCCGTGTGGTGGCCCGGTTTCTAAAGAAGGACCCGTTACGCGTAGAACAACA -1 A E D K F P I L C V V P G F I E E Q A I R M K N N
20926 TCACGTCGGCGAATACGGTGGGCTGGATGAACTGCCCCTTGGCCAGGTGCGCAGGCAGGTTGGCCGGGCGCTCCA AGTGCAGCCGCTTATGCCACCCGACCTACTTGACGGGGAACCGGTCCACGCGTCCGTCCAACCGGCCCGCGAGGT -1 V D A F V T P Q I F Q G K A L H A P L N A P R E L
21001 GGCCCCCGGCGACCAGGCGTGCACCTTCTTCGATGCCAATGCGGATGTACCCGGTGACCTTGTCATAGTGCTGCT CCGGGGGCCGCTGGTCCGCACGTGGAAGAAGCTACGGTTACGCCTACATGGGCCACTGGAACAGTATCACGACGA -1 G G A V L R A G E E I G I R I Y G T V K D Y H Q Q
21076 GGGTGATCATCGAACCGACCTGGGTTTTCGGGTCGGTCGGGTCACCTACGATCAGGCGCTTGGCGCGCGCCGCAA CCCACTAGTAGCTTGGCTGGACCCAAAAGCCCAGCCAGCCCAGTGGATGCTAGTCCGCGAACCGCGCGCGGCGTT -1 T I M S G V Q T K P D T P D G V I L R K A R A A F
21151 ACTCTGCGACAAACTGCGGGTACACGCTTTCCTGGATGAAGATGCGGCTGCCGGCGGTGCAGCGCTCGCCGTTCA TGAGACGCTGTTTGACGCCCATGTGCGAAAGGACCTACTTCTACGCCGACGGCCGCCACGTCGCGAGCGGCAAGT -1 E A V F Q P Y V S E Q I F I R S G A T C R E G N L
21226 GCGAGAAGATGGTGAACAGCGCGGCGTCCAGCGCACGCTCAAGGTCTGCGTCTTCGAAGATCAGCACGGGCGACT CGCTCTTCTACCACTTGTCGCGCCGCAGGTCGCGTGCGAGTTCCAGACGCAGAAGCTTCTAGTCGTGCCCGCTGA -1 S F I T F L A A D L A R E L D A D E F I L V P S K
21301 TGCCGCCCAGTTCCATCGAGTACTTTTTAAGGCCTGCGGTCTGCATGATCTTCTTGCCGGTGGCGGTACCGCCGG ACGGCGGGTCAAGGTAGCTCATGAAAAATTCCGGACGCCAGACGTACTAGAAGAACGGCCACCGCCATGGCGGCC -1 G G L E M S Y K K L G A T Q M I K K G T A T G G T
21376 TGAAGGAAATGGCGCGCACATCGGGGTGGCGGACCAGGGCATCGCCGGCGGTAGCGCCGTAACCCTGGATCACGT ACTTCCTTTACCGCGCGTGTAGCCCCACCGCCTGGTCCCGTAGCGGCCGCCATCGCGGCATTGGGACCTAGTGCA -1 F S I A R V D P H R V L A D G A T A G Y G Q I V N
21451 TCAGCACCCCGTTGGGGATGCCGGCTTCTACCGCCAGGCGGCCCAGTTCGTTGGCGGTCAGAGGCGACAGCTCGC AGTCGTGGGGCAACCCCTACGGCCGAAGATGGCGGTCCGCCGGGTCAAGCAACCGCCAGTCTCCGCTGTCGAGCG -1 L V G N P I G A E V A L R G L E N A T L P S L E S
21526 TCATCTTCAGCACGGCGGTGTTGCCCAGCGCCAGGCACGGCGCAGTCTTCCAGGTAGCCGTCATGAACGGCACGT AGTAGAAGTCGTGCCGCCACAACGGGTCGCGGTCCGTGCCGCGTCAGAAGGTCCATCGGCAGTACTTGCCGTGCA -1 M K L V A T N G L A L C P A T K W T A T M F P V N
21601 TCCATGGGCTTACCAGGCCGCACACACCCACCGGCTGGTACAGGGTGTAGTTGAGCATCTGGTCGTCGACCGGGT AGGTACCCGAATGGTCCGGCGTGTGTGGGTGGCCGACCATGTCCCACATCAACTCGTAGACCAGCAGCTGGCCCA -1 W P S V L G C V G V P Q Y L T Y N L M Q D D V P Y
21676 AGGTATGGCCGTCCATGCGCGTGCACACTTCGGCGAAGAAGTCGAAGTTGTGCGAGGCACGCGGGATCAGCACGT TCCATACCGGCAGGTACGCGCACGTGTGAAGCCGCTTCTTCAGCTTCAACACGCTCCGTGCGCCCTAGTCGTGCA -1 T H G D M R T C V E A F F D F N H S A R P I L V N
21751 TCTTGGTCTGGTGGATCGGCAGGCCGGTGTCGAGGGTTTCCAGCTCGGCGAGTTTCGGCACGTTCTGCTCAATCA AGAACCAGACCACCTAGCCGTCCGGCCACAGCTCCCAAAGGTCGAGCCGCTCAAAGCCGTGCAAGACGAGTTAGT -1 K T Q H I P L G T D L T E L E A L K P V N Q E I L
21826 GCTCACCCAGCTTGCGCATCAGCCGGGCACGTTCCTTGGCCGGGGTGTTGGCCCACTTGGGGAAGGCTTCCTTGG CGAGTGGGTCGAACGCGTAGTCGGCCCGTGCAAGGAACCGGCCCCACAACCGGGTGAACCCCTTCCGAAGGAACC -1 E G L K R M L R A R E K A P T N A W K P F A E K A
21901 CCGCAGCCACAGCCTGGGCCACTTCCTCGGCGCCGCCGCTGGCGACTTCGCAGATGGCGTCGCCGGTGGCCGGGT GGCGTCGGTGTCGGACCCGGTGAAGGAGCCGCGGCGGCGACCGCTGAAGCGTCTACCGCAGCGGCCACCGGCCCA -1 A A V A Q A V E E A G G S A V E C I A D G T A P N
21976 TGTAGTTGACGAAGGTGTCTTTGCTCTCGACCTCACGGCCGTTGATCCAGTGCTTGATCATGCTGCTCATGCCTT ACATCAACTGCTTCCACAGAAACGAGAGCTGGAGTGCCGGCAACTAGGTCACGAACTAGTACGACGAGTACGGAA -1 Y N V F T D K S E V E R G N I W H K I M
hpaE
- 2 & A K 22051 GTTGTTCTTGAAGAAGTCAGCTTCGCTGACGATACGGTTGACCAGGCGACCGACGCCTTCCACTTCCACCACCAC CAACAAGAACTTCTTCAGTCGAAGCGACTGCTATGCCAACTGGTCCGCTGGCTGCGGAAGGTGAAGGTGGTGGTG -2 N N K F F D A E S V I R N V L R G V G E V E V V V
22126 TTCGTCACCCGGCACCACATCGGCCAGGCCTTCTGGCGTGCCGGTGGCGATCATGTCGCCCGGTTGCAGGGTCAT AAGCAGTGGGCCGTGGTGTAGCCGGTCCGGAAGACCGCACGGCCACCGCTAGTACAGCGGGCCAACGTCCCAGTA -2 E D G P V V D A L G E P T G T A I M D G P Q L T M
22201 GAAGCTGGAGAAGTATTCGATGAGGTGCGGGATGTCGAAGATCATGTCCGCGGTGGTGCCTTCCTGCTTCAGCTC CTTCGACCTCTTCATAAGCTACTCCACGCCCTACAGCTTCTAGTACAGGCGCCACCACGGAAGGACGAAGTCGAG -2 F S S F Y E I L H P I D F I M D A T T G E Q K L E
22276 ACCGTTGATCCAGGTGCGCAGCTTCAGGTTGCTGACGTCTGGCACATCGGCCGCATCGACGATCCACGGGCCGAC TGGCAACTAGGTCCACGCGTCGAAGTCCAACGACTGCAGACCGTGTAGCCGGCGTAGCTGCTAGGTGCCCGGCTG -2 G N I W T R L K L N S V D P V D A A D V I W P G V
22351 CGGGGTGGTGGCATCGCGGTTTTTCACCCGCAGGTTGGGGCGGTAGTAGTTTTCCAGGTAGTCGCGGATGGCGTA GCCCCACCACCGTAGCGCCAAAAAGTGGGCGTCCAACCCCGCCATCATCAAAAGGTCCATCAGCGCCTACCGCAT -2 P T T A D R N K V R L N P R Y Y N E L Y D R I A Y
22426 GTCGTTGCACACGGTGTAGCCGGCAACGTAGGCCAGGGCGTCCTCACGCTTGACGTTCTTCGCCGCTTTGCCGAT CAGCAACGTGTGCCACATCGGCCGTTGCATCCGGTCCCGCAGGAGTGCGAACTGCAAGAAGCGGCGAAACGGCTA -2 D N C V T Y G A V Y A L A D E R K V N K A A K G I
22501 CACCGCCACCAGCTCGCACTCGTAGTGCATGTATTCGACGTTGTCCGGGCGCCAGGTGACCTGGATGTGGCCGGT GTGGCGGTGGTCGAGCGTGAGCATCACGTACATAAGCTGCAACAGGCCCGCGGTCCACTGGACCTACACCGGCCA -2 V A V L E C E Y H M Y E V N D P R W T V Q I H G T
22576 GTAGGTGCCTGGCGACTTGATGAAAGCCAACGGTTCGGTGGGCGGCGCGAAGGCCAGCTCCCTGGCGTGGTCGGC CATCCACGGACCGCTGAACTACTTTCGGTTGCCAAGCCACCCGCCGCGCTTCCGGTCGAGGGACCGCACCAGCCG -2 Y T G P S K I F A L P E T P P A F A L E R A H D A
22651 GTAGTTCAGGCCCAGGGCGAACATGCTGCCGGTGGCGGGTGGCAGCCAGGTGACCTGGTCCTGATGGACCAGGCG CATCAAGTCCGGGTCCCGCTTGTACGACGGCCACCGCCCACCGTCGGTCCACTGGACCAGGACTACCTGGTCCGC -2 Y N L G L A F M S G T A P P L W T V Q D Q H V L R
22726 GCCGTCGGCAAGGCGCAGGTGATCGTCTTCGACCGTGACATCGTGGGCCTGGCCGTCGAACTGGATACGGGCGTG CGGCAGCCGTTCCGCGTCCACTAGCAGAAGCTGGCACTGTAGCACCCGGACCGGCAGCTTGACCTATGCCCGCAC -2 G D A L R L H D D E V T V D H A Q G D F Q I R A H
22801 TTTCACAGGTAATTCCTCACTCGGCGACGATGTGGTTGGTCAGCTTGCCCAGGCCGTCGATCTCGATGTCGACGC AAAGTGTCCATTAAGGAGTGAGCCGCTGCTACACCAACCAGTCGAACGGGTCCGGCAGCTAGAGCTACAGCTGCG -1 & E A V I H N T L K G L G D I E I D V R -2 K V
hpaG2
22876 GGTCACCTGGCTGTACATCGACGCGGCCCTCGGGGGTTCCGGTGATCAGGATGTCGCCGGCGTGCAGGGTCATGA CCAGTGGACCGACATGTAGCTGCGCCGGGAGCCCCCAAGGCCACTAGTCCTACAGCGGCCGCACGTCCCAGTACT -1 D G P Q V D V R G E P T G T I L I D G A H L T M F
22951 ACTCGCTGATTTCGGCAATCAGCTGCGCCACCGTGCGTACGCAGTTGGCGGTGTTGTTGTGCTGGCGCAGTTCGC TGAGCGACTAAAGCCGTTAGTCGACGCGGTGGCACGCATGCGTCAACCGCCACAACAACACGACCGCGTCAAGCG -1 E S I E A I L Q A V T R V C N A T N N H Q R L E G
23026 CGTTCACATACAGGCGCAGGCCCAGGGCATCGGGGTTGGCCACTTGGCTGGCGGGCACCAGTTCAGGGCCGACCG GCAAGTGTATGTCCGCGTCCGGGTCCCGTAGCCCCAACCGGTGAACCGACCGCCCGTGGTCAAGTCCCGGCTGGC -1 N V Y L R L G L A D P N A V Q S A P V L E P G V P
23101 GGCAAAAACCATCACGGCACTTGGCCTTGACTGCAGGGCGGTAGTAGCTGGCTTCGGGCAGGCTCACTTCGTTGA CCGTTTTTGGTAGTGCCGTGAACCGGAACTGACGTCCCGCCATCATCGACCGAAGCCCGTCCGAGTGAAGCAACT -1 C F G D R C K A K V A P R Y Y S A E P L S V E N V
23176 CGATGGTGTAGCCCGCCACATGCTCCAGGGCATCGGCCACGCTGACGCGGCTGGCGTCCTTGCCAATCACCACTC GCTACCACATCGGGCGGTGTACGAGGTCCCGTAGCCGGTGCGACTGCGCCGACCGCAGGAACGGTTAGTGGTGAG -1 I T Y G A V H E L A D A V S V R S A D K G I V V G
23251 CCAGCGCCGGGCCGGGTTGCACGCGCTGCACGCCGGCCGGGAATACCACCTGGCCTTCATGCTGGTTGCGGGTGT GGTCGCGGCCCGGCCCAACGTGCGCGACGTGCGGCCGGCCCTTATGGTGGACCGGAAGTACGACCAACGCCCACA -1 L A P G P Q V R Q V G A P F V V Q G E H Q N R T N
23326 TCGGGGTCTTGACGAACAACACCGGCTTGACCGGCAGTTGCTTGTACGGTGCTTCCACGAACGCCGCTTGGTGCT AGCCCCAGAACTGCTTGTTGTGGCCGAACTGGCCGTCAACGAACATGCCACGAAGGTGCTTGCGGCGAACCACGA
- 1 P T K V F L V P K V P L Q K Y P A E V F A A Q H Q
23401 GCTGCAGCAAACCCTGGTAGTTCAGCGCGACGCCGAACAGGGTGCCGCTGGCAACGTCAAGCAGGGCATGGCTCA CGACGTCGTTTGGGACCATCAAGTCGCGCTGCGGCTTGTCCCACGGCGACCGTTGCAGTTCGTCCCGTACCGAGT -1 Q L L G Q Y N L A V G F L T G S A V D L L A H S M
hpaG1
23476 TGCTCTTCTCCTGGCAGTGCAGGGCGGTGGCCGTCCTGCGGATTTCGTTAATGTGTTAATGTTATAGTTAATATG ACGAGAAGAGGACCGTCACGTCCCGCCACCGGCAGGACGCCTAAAGCAATTACACAATTACAATATCAATTATAC
23551 TTAACGATGGTCAAGGGGTGGCCAGTGGCGCCTGCCGGCAAGGCAAGGCACCATGGGCCATCGTCAACAGGGTCA AATTGCTACCAGTTCCCCACCGGTCACCGCGGACGGCCGTTCCGTTCCGTGGTACCCGGTAGCAGTTGTCCCAGT
hpaA
+2 M S D R H P I P N I N I G Q V Y D Q 23626 AGCGATTTGCGAGCAAGCAGCCATGAGCGACCGGCATCCGATACCGAACATCAACATTGGCCAGGTTTACGACCA TCGCTAAACGCTCGTTCGTCGGTACTCGCTGGCCGTAGGCTATGGCTTGTAGTTGTAACCGGTCCAAATGCTGGT
+2 R Y S D S E V H Y D R L G N L A G F F G R N M P V 23701 GCGCTACAGCGACAGCGAGGTGCATTACGACCGGCTGGGCAACCTGGCGGGCTTTTTCGGGCGCAACATGCCGGT CGCGATGTCGCTGTCGCTCCACGTAATGCTGGCCGACCCGTTGGACCGCCCGAAAAAGCCCGCGTTGTACGGCCA
+2 H R H D R F F Q V H Y V K S G T V R V Y L D D Q Q23776 GCACCGGCATGACCGGTTTTTCCAGGTGCATTACGTGAAGTCGGGCACAGTACGGGTGTATCTGGATGACCAGCA
+2 Y I E A G P M F F L T P P T V A H A F V T E A D S 23851 GTACATCGAGGCCGGGCCGATGTTCTTCCTCACGCCACCCACGGTGGCGCACGCGTTCGTCACCGAAGCTGACAG CATGTAGCTCCGGCCCGGCTACAAGAAGGAGTGCGGTGGGTGCCACCGCGTGCGCAAGCAGTGGCTTCGACTGTC
+2 D G H V L T V R Q Q L V W Q L I E A D A S L L P A 23926 CGACGGGCATGTGCTGACGGTGCGCCAGCAACTGGTGTGGCAATTGATCGAAGCCGACGCCAGCCTGCTGCCGGC GCTGCCCGTACACGACTGCCACGCGGTCGTTGACCACACCGTTAACTAGCTTCGGCTGCGGTCGGACGACGGCCG
+2 G M Q V Q P A C V A L G N L P A E Y K A E A Q R L 24001 GGGCATGCAGGTGCAGCCAGCCTGTGTGGCGCTGGGCAACCTGCCGGCCGAATACAAGGCCGAGGCGCAGCGCCT CCCGTACGTCCACGTCGGTCGGACACACCGCGACCCGTTGGACGGCCGGCTTATGTTCCGGCTCCGCGTCGCGGA
+2 Q G W L D A L S D E F A T Q Q P G R E A A L Q S L 24076 GCAAGGCTGGCTGGACGCGTTGAGTGACGAGTTTGCCACGCAGCAACCGGGTCGCGAGGCGGCGTTGCAGTCGCT CGTTCCGACCGACCTGCGCAACTCACTGCTCAAACGGTGCGTCGTTGGCCCAGCGCTCCGCCGCAACGTCAGCGA
+2 T R L I M I S L L R L C P N S L E S T P A R H E D 24151 GACCCGCCTGATCATGATCAGCCTGCTGCGGCTGTGCCCCAACTCGCTGGAATCGACCCCGGCGCGGCATGAAGA CTGGGCGGACTAGTACTAGTCGGACGACGCCGACACGGGGTTGAGCGACCTTAGCTGGGGCCGCGCCGTACTTCT
+2 L K I F H R F N A L I E A H Y L E H W P L A R Y A 24226 CCTGAAGATCTTCCACCGTTTCAATGCCCTGATCGAAGCGCATTACCTTGAGCATTGGCCGCTGGCCCGCTACGC GGACTTCTAGAAGGTGGCAAAGTTACGGGACTAGCTTCGCGTAATGGAACTCGTAACCGGCGACCGGGCGATGCG
+2 Q Q I G V T E A R L N D V C R R I A D L P S K R L 24301 GCAGCAGATTGGCGTGACCGAGGCACGGCTGAACGATGTGTGCCGGCGCATCGCCGACTTGCCATCCAAGCGCCT CGTCGTCTAACCGCACTGGCTCCGTGCCGACTTGCTACACACGGCCGCGTAGCGGCTGAACGGTAGGTTCGCGGA
+2 V L E R L M Q E A K R L L L F S G S T A N E I C Y 24376 GGTGCTGGAACGGCTGATGCAGGAGGCCAAGCGTTTGCTGTTGTTTTCCGGCAGCACGGCCAACGAAATCTGTTA CCACGACCTTGCCGACTACGTCCTCCGGTTCGCAAACGACAACAAAAGGCCGTCGTGCCGGTTGCTTTAGACAAT
+2 Q L G F K D P A Y F S R F F N R Y A K L T P G E Y24451 CCAGCTCGGCTTCAAGGATCCGGCCTATTTCAGCCGCTTCTTCAACCGCTACGCCAAGCTCACACCCGGGGAGTA GGTCGAGCCGAAGTTCCTAGGCCGGATAAAGTCGGCGAAGAAGTTGGCGATGCGGTTCGAGTGTGGGCCCCTCAT
+2 R Q R Q A E L Q &24526 CCGCCAGCGGCAGGCAGAATTGCAGTGAAATGGCCATGGCGGCTCACCCGGGTGCTGTTGTTGTTTACAGCGGAT GGCGGTCGCCGTCCGTCTTAACGTCACTTTACCGGTACCGCCGAGTGGGCCCACGACAACAACAAATGTCGCCTA
24601 GGTCGCAGCCCGCGCGCCGGGCTTGAATGGGTTTTCCGTGGAACAGATTGCACTTTCCATCGTGCATGCCCTTAA CCAGCGTCGGGCGCGCGGCCCGAACTTACCCAAAAGGCACCTTGTCTAACGTGAAAGGTAGCACGTACGGGAATT
hpaR
+2 M T K T Q P S L T L S L L Q24676 ATTCGTGAATTGAGAAAAAGCCACAGGTTTGACCATGACCAAGACGCAACCTTCGCTCACGCTAAGCCTGTTGCA TAAGCACTTAACTCTTTTTCGGTGTCCAAACTGGTACTGGTTCTGCGTTGGAAGCGAGTGCGATTCGGACAACGT +2 A R E A A M A F F R P L L N Q H D L T E Q Q W R V24751 GGCCCGAGAAGCCGCGATGGCATTTTTCAGGCCGCTGTTGAACCAGCACGACCTGACCGAGCAGCAATGGCGGGT CCGGGCTCTTCGGCGCTACCGTAAAAAGTCCGGCGACAACTTGGTCGTGCTGGACTGGCTCGTCGTTACCGCCCA
+2 I R I L K Q H G E L E N Y Q L A E L A C I L K P S24826 AATCCGCATCCTCAAGCAGCACGGCGAGCTGGAGAATTATCAGTTGGCGGAACTGGCCTGCATCCTCAAGCCGAG TTAGGCGTAGGAGTTCGTCGTGCCGCTCGACCTCTTAATAGTCAACCGCCTTGACCGGACGTAGGAGTTCGGCTC
+2 M T G V L G R L E R D G L V R R Q K A A Q D Q R R24901 CATGACCGGGGTACTGGGGCGCCTGGAGCGAGACGGGCTGGTGCGGCGGCAGAAGGCCGCGCAGGACCAGCGACG GTACTGGCCCCATGACCCCGCGGACCTCGCTCTGCCCGACCACGCCGCCGTCTTCCGGCGCGTCCTGGTCGCTGC
+2 V F V S L T E R G E A C F A S M K E G M E A N Y Q24976 GGTGTTCGTCAGCCTGACCGAAAGAGGGGAGGCGTGCTTTGCCTCGATGAAGGAAGGCATGGAGGCCAACTACCA CCACAAGCAGTCGGACTGGCTTTCTCCCCTCCGCACGAAACGGAGCTACTTCCTTCCGTACCTCCGGTTGATGGT
+2 K I Q A Q F G E E K L Q Q L M G L L N D L K R I A25051 GAAGATTCAGGCGCAGTTTGGTGAAGAGAAGCTGCAGCAGCTGATGGGGTTGTTGAATGACCTGAAGCGCATCGC CTTCTAAGTCCGCGTCAAACCACTTCTCTTCGACGTCGTCGACTACCCCAACAACTTACTGGACTTCGCGTAGCG
+2 P # 25126 GCCATAA CGGTATT
P. putida silvestre Mutante A2
mM mM
5
Mutante A0
mM
4 3 2 1
mM Mutante A7 mM tynR::pK18mob
5
Figura 10
Figura 11
Figura 12
Figura 13
Figura 14 Figura 15
Figura 16
LISTADO DE SECUENCIAS
<110> BIOGES STARTERS S.A.
<120> NUEVA HIDROXIFENILACETALDEHÍDO DESHIDROGENASA, ÁCIDO NUCLEICO QUE LA CODIFICA Y VECTORES Y MICROORGANISMOS RECOMBINANTES QUELA EXPRESAN
<130> P-101091
<160> 45
<170> PatentIn version 3.3
<210> 1
<211> 25132
<212> DNA
<213> Pseudomonas putida U
<220>
<221> misc_feature
<222> (1) .. (25132)
<223> Secuencia que contiene los cluster tyn y hpa
<400> 1 tcaggcgaaa cgctcgaagc ggtacggtga cgggtcgatc agcggggtgg cctgggccac 60 caggtctgcc gccagctggc cagcagcagg cgaggtgccg aagccatgcc cggaaaagcc 120 ggtggccagg gtcaggcccg gaatactggc caccgggccg atgaccgggt tggagtcggg 180 ggtgacgtca atcgtgccgg cccaggcgct ggcgatacgg gcctgttcga acaccggcca 240 ggccgctttc aggttgcgca tggcctcgtc gttgagggcc gggttggcgt gcgggtcttg 300 tacccgtaca cgctcgaagg gggttacatc cgttgccttc cagcgccggg ccagggccag 360 gtccttgaag aagtacttgc caaagctgat gcgcaaaaag tcccgctggg cacgcagctg 420 gggcaggtaa cgcttgccca gcagcaggtg atcgagggtg aggaaggcgt ccagcgcgcc 480 gcgctgggtg atgatgtagc cgccgtcctt gtgcttgcgg aaggaaaaat ctggtgcgcc 540 cacggcgatg tcggttggcc cgtccatggg ctctgtgcgc agcacggaac aggtcagcgg 600 caaggtcggc aggttgatgc ccaggttgcc gaggaacttg cgcgaccaca ggccaccggc 660 cagcaacacc tggtcgcagc ggatttcacc ttgctcggtg accaccccgc tgacacggcc 720 ggctgcggtg accagcgtgc gcaccgcgca gttctccact accactgcac ctttggcgat 780 cgccgcccgg gcgatggcgc tggcggccag ggtcggttcg gcgcgggcgt cggagggggt 840 gaagatgcca cctgcccaat ccgcccgacc acccggcacc atccgggtga tttcccgcgt 900 gctcagcagg cgcgaatcca ggcccagcgc ctcgacgctt ttcagccagc cttcatgcat 960 gcccatctgc gtgtcgttac ggccgatgaa catgatgccg gcttgccgat agccaacgtc 1020 gctgccaacc cgtgcgggca tctcggccca cagccgatca gccgccagtg ccaggggaat 1080 gtcatgggcg tggcggttgg tcttgcgcac ccagcccagg ttgcgcgacg actgctcccc 1140 agcgatgcgc cccttctcca gcaccaccac cggtatgttg cgttcggcga ggctcagtgc 1200 ggcggtgagg ccgataatgc cgccaccgat gatcaccacg gtagtggcgt cggggtggcg 1260 ggtgctggtt tgcacagggg cgatcgtggg agacatggct ttactctttg ttgtgcgtgc 1320 agggggagtg ttcagcgcca gccagcagcc tcactggcca aggcggatca gggtcacttg 1380 cgcttgcccc gcaccgcggt aggcggtgac ctccagctcg accttgtaaa cggtggagcc 1440 cagcggcggg caggtgaccg tggtggccgg gtcgatgccg cggaacttct cgccgatcac 1500 gtccatgacc cgtggtacat cggcagggtc ctggatgaac acgcgcgagt tgatgacatc 1560 ggccaggctg gcatcgactg cggccagcgc ggtttcgatg ttggcgaaca cctggtgggt 1620 ctgttcgatg acgtcctctg gaatgacctg ggtctgcggg ttgcgtccgg cggtgttgga 1680 gacgtgaatc cagttgtcca ccgccaccag gcgggagtag ctggccatgg cttcgaactt 1740 ggagccggtt ttcagtttga tgatctgtgt catgggcttt gccttgttat ccggttgcgg 1800 ggatcagctg agaacggggg tttcccagag gttgagcttt acgccgatgc cttgctcgag 1860 cgccttgcgg tacaccacgg tgccccaggc cacgtcttcg acgggcatgc cgcccaccga 1920 catcaggatg atttcgtcgt catgcaggcg gcccggtgcg tcgccgctga tgatcttgcc 1980 gatgtcttcc acctgctcgg cggccagcgt gccttcggca atcatgtcca tgaagcgcac 2040 acctaccagc ggtacgtggt tgtgcgcagg cttgggcagc tcttcgaacc aggcctcgta 2100 gaggccggtg ttgtccacca ccttgcgcac gtcgtcctgc tccatgccgg cgtcgatact 2160 gcacggggct ggcatggcca ggaacgcgcc aggcttgacc cactcgcggc gcaccagcgg 2220 gtactggctg gggtcgccga cttcgcccga gctgcagtag ctgaccaggt cggaaccgcg 2280 taccacttct tccagggttt ccaccacctg gacatgagtg atttgcggga agctggtttt 2340 cacccaggcg acgaaggcat ccaggttctt ctggccacgg cccttgacct tgagggtgtc 2400 gatcagcggg cagacggcca tgaacgcagc gaccgtggtc ttgcccatca cccccgggcc 2460 ggccaggccg atcaccttgg cgtccttgcg cgccaggtgg cgggcgccga cgcccgggat 2520 ggcgccggtg cggtaggccg acagcaggtt ggccgacatg tgtgccagtg gcgcgccggt 2580 gtcggcatcg ttgagggtga acatcaggat cgagcggggc aggcctttct cacggttggc 2640 gatgttcgag ccgtaccact tggcgcctgc ggtctggaag ttgccgccga ggtacgccgg 2700 catcgccatc atgcgccggt cggcggtggg cttgggcatg ttggggaatg gcgagtgctc 2760 ggggaaggta atcatcgcgc cgtgcgagtc gctgttcggg ccggccatgc ggtagtcacc 2820 ctggtacagc aggccgaaca tttcttccat ggtgtcgaca caggccggca tgtcggtgac 2880 gccggcacgg atcatgtcct gctcggacag gtagatgaag tcaattctgg tatcgagggt 2940 catggcgggt ctcgcagggc tggctgccgt cggatttgtt gttggtttcg aggcaaccag 3000 tttcgctaac gactggtagg tcgtcttgtg tctgcctgcc agccgagttg accgtcagtg 3060 ccagggcttc aatggcccgc gagcgagaag ctggccgggg tgtggcgcag gctgagggcg 3120 gtcagcaggc acaccaccag ggtgcacagg gccagcagcg cggcccatgc ggtcgggccg 3180 tggttgagta ccactgcggc cagcggggcg gcgccggcag acgccgacag ctggatggcg 3240 cccagcagcg ctgcggtgga acccagtgcc ttttcttgcg aggccatcac cagcgacatc 3300 agcgtcgact cggctatccc caggccgaac agggctatca ccatgccgcc ggccacacct 3360 ggcagcccca ggccggtcag tgcaccgagc aggctgatgc aggcaccgcc ggccatgcac 3420 agcacgccca cccgagtcaa ggtattgagg cccagccggc tgatcaggtg gctggccgtc 3480 atggcgccga gcaggatcga caccccggtg gcgccaaaca gcaggccgaa ggcctgggcg 3540 ctcaggccgt agtgggcctg gtacaccagg gtggcaccgc cgatgtaggc gaacaggaag 3600 aagaataccg cagcaaccgc cagggtcggg cgcaggaagc ggcggtcggc gaggatggcc 3660 aggtaggtgc tgcaggcgtg gcccaggcgc aggggttcgc gtttgctggg cggcagggtt 3720 tcgggcaggt tcagcaggct gttgaccagc accgtcacgc ccatgccggc gagtaccagc 3780 attactgcac gccagccgaa atgtgcgtcg atcacgccgc ccagggcagg tgccaggatc 3840 ggtgcgacgc cttcgatggt catcagcagg gcgaacagtt tggtcgcggc cacgccctgg 3900 ctcacatcac gcaccatgct catgatcacc accagggtca gcgcactgcc caggccctgg 3960 aaaaagcgca gcatgatcag ggtgtcgagg ctgggggctg cggctgcgcc cagcgagcac 4020 aggatgaaca gcagcaggcc ggccagcagc ggcttgcgcc ggccataagc gtcgacgatg 4080 gggccgaaga tcagctggcc ggcgcccatg gccagcagga agaaggtcag tgtcagctgt 4140 acgcgggtga agctagcctg atagtggctg gcgatttccg gcaggctcga caggtacatg 4200 tcgacggcgg aagggccgag ggcgccgatc aggcctaggc ccagggcgaa gctgaagggt 4260 atgggagggg agggattggc ttgcatggtt ttctctggct gatttttcgc ctaccgaccg 4320 gtaggtttgc gaatattatt cgccgagtcg gccaaggtca aacccttccg caaggccact 4380 gattcctgtg gggagcgggc atgcccgcga acaccggcaa agccggtgcc accgagtcgc 4440 cttcttcgcg ggcatgcccg ctcccacatt gaccgcagag gttggttacc gtggttgcgt 4500 cagaacggca cagccacggt cagctggcta tacacattgg taccattccc gcccacctgg 4560 ttgccgccgt tgctctcgtc cttgcgcggc tggtaaaggc ccaccagcgg gctgattatc 4620 aggtgctcgt tgactgccca ttccacatac aggtccagct cccgcgcatc gaggttgagg 4680 ctttcgcggg tgcgtacggt gtcgaagtcg aagtacagcg ccccgactgt gagattttcc 4740 agcggtgtcg ccttcacgcc cacatggtgg atacccgtgt tgctgttgaa ggggccggcg 4800 tagttggcag cgacttcacc ctggaaccag gtgccgtaac cgctggacag gccgctgaac 4860 agcgcgtccc agcctgccga gtagcgggtg tagcggtagg taacctgcgg tgcccacggc 4920 aggtcggcga aggtgtagcc ggcctgcagg taccaggctt gctcggggcc gtcggtcttg 4980 tcctgccagg cgtattcgaa ggcgaaactg gcattgtcga tgccagcgtt gccttcgccg 5040 cgcacgctat acacgtccat gccttcgcgg gctttctgaa agtcgctggc ccattggtcg 5100 gtgacgtcga tgccgtgaat ccaggtcagc ccgagggtgc ccaaggcttg ggtgtagtcc 5160 agcgtgccgg cggccagttc ggtttcggcc tgggcgcggt tgtcggattt cagccacagc 5220 aggctgccat gcaggccatc gctgcccccc aggcgcagca ttgcggtgcg gtcgaaggcg 5280 tggcgggcgg ccaggtagta ggccccgccg cggtccagcg caccgtcggc gacgccgttg 5340 cccaggttcg ggccgtcgtc gttgatcaaa aaaccactgc ccaggcgaat ggtctggcgg 5400 ccggcggaaa cgtccactcc atccttgccc agcaccggga acaggtcggc cgagcgccag 5460 ccgaggaagg cgtcttcgat cttggtggtg cgttcggagc catcggtgtt gccggccgca 5520 tcgccatcgc cccaggtggc cgagctcacc cagttcaggc tgccgtacag cgtgccgttg 5580 ccggccaggc cctggtcacc gctgaggcca tacttgataa agccttcacg ccaggtcgaa 5640 ccccctgtgg tgccgtcgta gttcttgcgg ctgttgaaca tgccccatac cgccagcatg 5700 tcggcgttca ggtggctgtc atcgtcggcg tacagctcaa cggccggcgc ggcctggctg 5760 gccagcaagg ttgccagggc caggctggac agcgtctgtg gtttgaccat ttgcacatcc 5820 ctcgtttgtt ctcggccacc ttcacagggg cctttgttgt tcgggggcac cctcggttct 5880 ggcgaggggc catcgcggtt ggcggcgatg gcctattagg gcgtgtgcgg tggggcgggg 5940 tcttgttcgt ggctgccaag gcgcttgcac gccttggcca caggcgcggt cagtagcgga 6000 tcatcaccga cttgagctcg gtgaagtcat cgatgaaggc cgagccgaac tcgcggccaa 6060 tgccggaagc cttgatgccc ccaaacggta cagccgggtc gagcagggtg tgcatgttga 6120 cccacagggt accggcctgg atttgcggga tcatgcgcat ggccttgccc aggtcgttgg 6180 tccacaggct ggcgctgagg ccgtagggcg aggcgttcat caggtgcagc agttcgtctt 6240 cgtcgtcata aggcaggaag gtcgccacag ggccgaaggt ttcctgggtg agcagggtgt 6300 cgcaggctga ccgggcgagg attaccgtgg gttcgacgaa acagccgggg ccgtcgccca 6360 gggtgccgcc gtgaatgatc tggctgcctt cggcgcgggc gatggcgaac agttcggcca 6420 gcttctgctg gtgcggcttg ttggccacgg ggccgaactg ggtggcctcg tccagtggcg 6480 agccgatttt cagttggccc aggcgctggg acagggcgtc cagcagcggg tcgatgcgcg 6540 agcggtgcac atagaagcgc tcgcccgcgg cgcagatttg ccccgagtgc aggaagccgg 6600 cctcgatgat gccgtccaca gccttgtcgg ttgccacgtc gggcaggaag gccaccgcgt 6660 tcttgccgcc cagttccagt gtcgcacggg tcagcttggc gcccatggca gcctggccta 6720 cggcgatgcc agtgggcacg gagccggtga acgagacctt gtcggtacct gcgtgctcga 6780 tcagtgcctt gcccaccagg ccaccaccgg tcagcacgtt cagtgcaccg gccggcaggc 6840 ctgcttcggt ggccagttcg gcaatgcgca gcagcgtcag cggggtgaat tcgctgggct 6900 tgaggataat gctgcagccg gttgtcaggg ccgaggccag cttccagatg gcgatcatgc 6960 tggcgaagtt ccacggcacg atgcccacca ccacgccaat cggctcgcgc agggtgaagg 7020 cgctgtagcg ctcaccggcg aacgagggca gcgacggggt gatggtctgg ccggtgatct 7080 tggtcgccca gccggcgtag tagcgcagga agtgcgcggc ctgctgtact tcgaacgcac 7140 gggaaatgcc gatgagcttg ccggattgca aggtttccag ctgcgccagt tcttcgcggt 7200 tggcttccag caggtcggcc agcttgaaca gcactgcggc gcgggcggcg gggctggtgt 7260 gcgaccaggc ggtaaagcct tggcgcgagg agctgacggc atggtcgaca tcggcctggt 7320 tggcgtcggc gatgtgggcg atggtctggc cgttggccgg gttgaccacg gcaatgttcg 7380 acgacgactg gctggcgagg tgctggccgt ggatgaacac gccatgctcg cgggccagga 7440 aggccgtgac ggcaggtagg agggtgatgt cgctcatgca gactccgggg cagttggcca 7500 aagtttgcag cttaataagc ggggcagtgc ggtgcttgtg cctgcgtgac aggtgcatga 7560 ctgtggctgc caaccgcact gggtaagcct tgtgggagcg gccttgtgtc gcgatagggc 7620 cgcagagcgg ccccggcgat gttggcggcg aagctgaaaa tgctggggcc gcttcgcgcc 7680 cctatcgcga cgcaaggccg ctcccacaaa aaaagcgagc gtaggccggg ctgattgctg 7740 gcaggcagca acaagcccgg cggcagccat cggcaagacg ccatgccacc ggcagcgcac 7800 agtaatcact cgttcaacgc cacaaaaaca agccggggca tacgatgtca ctcaataaca 7860 agctcaccga gcacctcaac cgcggcactg tcggtttccc caccgcactg gccagcactg 7920 tcgggctgat catggccagc ccggtgatcc tcaccgcgac catgggcttt ggcatcggcg 7980 gcagcgcctt cgccgtggcc atggtcatcg ccgcactgat gatgctggcg cagtccacca 8040 cctttgccga ggctgcgtcg atcctgccga ccacgggctc ggtatacgac tacatcaact 8100 gtggcatggg ccgtttcttc gccattaccg gcacgctgtc ggcctacctg atcgtgcatg 8160 tgttcgccgg taccgccgaa accatcctgt cgggggtgat ggcgctggtg aacttcgagc 8220 acctcaatac cctggcggaa tccgccggcg gttcgtggct gctgggggtg tgcttcgtgg 8280 tggcgtttgc ggtgctcaat gcctttggcg tcagcgcctt cagccgcgcg gaagtggtcc 8340 tcaccttcgg catgtggacc accttgatgg tgttcggcgt gcttggcctg atcgccgcac 8400 ccgcagtgga actggacggc ccgttcggcg tgtcgctggt gggcaccgac ctgatgacca 8460 tcctctcgct ggtcggcatg gccatgttca tgttcgttgg ctgcgagttc gtcacgccgc 8520 ttgcccccga actgcgtcgc tcggcctggg tgctgccgcg ggccatggcg ctgggcctgt 8580 ttggcgtggc cagctgcatg ttcatctacg gagcggcgat gaagcgccag gtggaaaacg 8640 tggtgctgga tgccgccagt ggcgtgcacc tgctggacac gcccatggcc atcccgcgct 8700 tcgccgagca ggtgatgggt gatattggcc cagtgtggct gggtatcggc ttcctgttcg 8760 ccggcgcggc caccatcaac acgctgatgg ccggtgtgcc acgcattctt tacggcatgg 8820 cggtggacgg cgcgttgccc aaggtgttca cctacctgca cccgcgcttc aagacgccgc 8880 tgctgtgcat cctggtggtg gcgttgatcc cttgcctgca tgcctggtac ctgggcggca 8940 acccggacaa catcctgcac ctggtgctgg ccgccgtgtg cgcctggagc accgcctacc 9000 tgctggtgac cctgtcggtg gtgatattgc gcatccgccg cccagacctg ccgcgtgcct 9060 accgctcgcc gctgttcccg ttgccgcaga tattctccag tagcggtatc ctcatcggca 9120 tggcgttcat cacaccgccg ggcatgaacc ctgccgatgt ctacgtgccg ttcgccatca 9180 tgcttggcgc cactgcggcc tatgcattgt tctggacgct gtgggtgcag aaggtcaacc 9240 cgttcaagcc ggcgcgggtc gaggatgtgc tcgagaaaga gtttgctgcc gagcctggcc 9300 acgccgtgga gcacgtgctg catgatcaga aatttgcgtg aacgcttgct ggcgccccga 9360 gcgccttcag gctatcgccc aggcgccacg ctggcatgcc tggcgcgcaa cctggggcag 9420 cagaacctgg tggcggccgg ggtgatccac gacccggccc agggttggca ggccacggtg 9480 cacgaacgcg tcgaggccca cctgctgatg cacatcgtca cctgtgagtt ccagctgcag 9540 ttgcctgctc cgcaaggggg cgaggtcagc ctggagctgc gccataccgg tgcgcttcgc 9600 cgtgccggcc tggcctgtgt gtaccgcaag ggcgaccggg cgcgcttcgc ccgactgcgc 9660 gaccggttgc tgcagcaggc cgcactggtg gcggcgctga tgccgctgga tttcaagcgc 9720 ctgaccttgg cctggcgcga cggccaatgg ttgctgaccc tggagcacat gggcggtagc 9780 gaagtggtca accgcatgcc agcgtttcgc cgctacatcc ccatcagccc gcaacagcgg 9840 gcgcacctga tggccagcct ggcccagttc aacactttgc tacctaacct ttgacgcaaa 9900 ctggcatacg ccttgctgta tcaagcgacg aatgatgaca gttgtgcgca catagataac 9960 atgttaacaa tgtgcgcata acaacaaatc ctgcgtcgag ggcagccatg catactcaac 10020 aatccaaccg tcaggggctg gaacgctgga ccacggccat gcaacagatc tgtggccgtt 10080 tcgagacgga acttgcgtcc aatcactcgc tgttcatcgg cgaggtttct accttttccc 10140 gtgccggctt gccgctggcc aacctgcgca ccaatgccgg caacatccgc cggctgggcg 10200 aaaacccgac ccttgacgat gaccagcatt gtttcctggt cagccagcgt gcggggcatt 10260 ccaccgtgtc ccaggggggc atgcaggtca gcctggcgcc gggtgagctg ctgctgatgg 10320 attcggtcgg gcgctgcgaa atcaccccca gtgggttgat cgaacatgtc tcgctggccc 10380 tgtcgcgtga gcaggtacgc aagtatgtgc aaggcagcgg cccgatgttt ggcaagatct 10440 cctcgagcaa cgcctgcggg cgcatgctgc atgtgctgat ggaccaactg tgcaaggacg 10500 gcaatgtaag cggtgatggg gcccagggcg acgcgctgca gaccgccttc attgccctgc 10560 tggagccagg cttcgagcgc catggcgaag cgctgggcaa ccttggggcc ttgaacgggg 10620 ccaacctgcg gggctacgtg cagcaggtga tcgacgagtc cctgtcacag cccgggctga 10680 ccccgtccaa cctggccggt cgcctgaaca tctcggtgcg tcacctgtac cggctgttcg 10740 aggaggaggg cgatagtgtg tgccgctaca ttcagcgggc gcgcctgaag cgcagtgcgg 10800 atgacctggc caacccgttc ttcaggagcg agtcgattac ctcgattgcc tacaagtggg 10860 ggtttaccga ctcggcgcat ttcagccgct cgttcaagaa acagttcgaa cgctcgccca 10920 aggactaccg ggcgcaggcg atggtttgag tgtgatggtg ctgcttgtgc gggcctcatc 10980 gccggcaagt cacttggcgg cggttcagcg acggccgttg aagtagcccg acagctggtg 11040 cacggtcttg ccggcagtga gcagcagcgg gcggaaatgg tccttgccga ggatgcgcgc 11100 atgcttgacc gagctgacca ggtcatagcg cttcgatccc tcctgcatac cctcggcgag 11160 tatcttgcaa atgatgtggc tgggcgtgac gccaaagccg gagtagccct gcacatagaa 11220 agcgttgggg cggttgtcga gggtgcctat ctgcggaaac aggttggcac tggtggccat 11280 cgggccgccc caggccaggt cgatgcgcac gtctttcagg taggggaaaa tcttcagcat 11340 cagcgcgcgg ttccacgcct tcaggtccag cgggaagtgc tcgacgaagg gcgtggcggc 11400 gccaaacagc aggcggttct cgcgggtgac ccggtagtag tcgatcaccg ggcggatgtc 11460 gctgtaggcc ccgcgtatcg ggctgatgcg ctcgatcagc tcatccggca atggctcggt 11520 catcatctgg aaggcatagg tgtttatagt gcgtgcgtgc agctgcggct ccagcttgtt 11580 gaggaagctg tcgcacgccc acagcagctt gctggcgcgt accgagccac ggccggtgcg 11640 taccgtgatg cgctcgccgt aggtcacttc cagggccggg ctgtgttcga agatgcgcgc 11700 accatggccc accagtgcct gcgcttcgcc cagcagcagg ttcagggaat gcacatggcc 11760 accgcccatg tgcatcaggg cgctgctgta ggcgttgctg ccgatgatct ggcgcacttc 11820 gctgccaccg agaaaacgga tctcgtcgcg ggtattgatc gccttgaacg ccttctccca 11880 tttgcgcagg gtctgttcct ggcggcggtt gaagcccatg tagccatagc cgtggcagaa 11940 gtcggcgtcg atggcgtagc gggcgatgcg gtccttgatg atgccggcgc ccagttcgct 12000 gatttcgaaa atatccctca cgccctgatc accgacgctg ctgcggatct tctccaggtc 12060 gtggccgatg cccgccatga tctgcccgcc gttgcgcccg ctaccgccgt agcccagata 12120 acggccctcg agcacgacga tattggtcac gccttgttcc gccagctcca gggcggtgtt 12180 aatgccggag aaaccgccac cgatcaccac gacatcggcc tcgatgtcgc gttccagggt 12240 tgggaagctc aggttgtact tcttggtcgc cgagtagtag gtggggctct cgagggtgat 12300 catgacgccg cctgctgact ggaaatgggt agaaatcatt ctattaatgt attaatgatt 12360 gtgcactggc atactcgccg gtttgctatt tccagcctcc ttgagcccgc atgaccacac 12420 cgagaccctc cctgaccctg accttgctgc aggcgcgcga agccaccatg gcgttcttcc 12480 gcccggcgct gaatgcccat gacctgaccg agcagcaatg gcgggtaatc cgtatcctgc 12540 gccagcaagg cgagctggaa agccatcagt tggcggagct ggcctgtatc ctcaaaccca 12600 gtatgagcgg ggtgctcaag cgcctggagc gtgacggcat cgtagcgcgg cgcaagtcgc 12660 cggaggacca gcgccgggtg ttcatcagcc tgaccgaggc cggccagcaa gcgtttctgg 12720 cgatgagcga ggagatgacc cgcaactacg acaagatcct cgcccagttt ggcgatgaca 12780 agctgcagca gctgatgcag ctgctgggtg aaatgaagaa gatcaaaccc tgacgcgcca 12840 ggcgtcagcg gttgagtgac agcgagtctt ccagcacttt cagcagtgct gccgcgcgcc 12900 gctcataggc gtcggggcct gcgtacatca gctctacata caggctgtcg atgatgccca 12960 ggtaggcatc ggcatacagc gccaggcggc tgtgctgctc atgcgcccag ccgtggcgag 13020 cttgcagggc cacgctgaac ccttcgcgta tgccgtccag gtactgttca aagcccgaag 13080 tgacaatcgg cttgatgccc gccgggggca ggaacgccgt gcgcaacacg aagcgcagtt 13140 gggccgagtc gcgataacgt tcggccaggt gcagggccag ccagtgcccc gccgccaggc 13200 cgtcgcgggc ttcctgcgca aagccgtgct cgacaaaggc cgtttcctgc acaagcgcac 13260 gctggaacac ctccacgaac aaggcgtcct tgttggcgaa atgcgcatac agcgatgcct 13320 tgcgcatgcc cgccaactgg gcgatttcgt tcagcgaaga ggcgtcataa ccgtactcgg 13380 cgaagtggcc gacggcggca tcgcacacac gcaccgcaga aggggaaagg tctttcaaca 13440 gcatcactcc gtcaggggcg cggcgggccg cgcgcgtctt gagggtggga ttgtggtgat 13500 cgaaaatgca cgggtcaatg cttgtcgcaa ggcaatttcc gggcgccatg gaaagtgcaa 13560 tgttcccctc gtaacgtgca ttcctccacc caatcgccgc tcacatactg atcgcgtctt 13620 cgaatccaat aagaaagaga ccgctcatga aaaagccaaa ccccctgctg gaagacctga 13680 agtccgtcct gccgaccatt gccgccaatg ccatgcgtgc agagcaggac cgcagtgtgc 13740 cggcagagaa tatcgccttg ctgaaaagca tcggcatgca ccgcgctttc ttgcccaaac 13800 acttcggcgg catggaaatc accctgccgg agttcgccca gtgcatcgcc ttgctggcgg 13860 gggcctgcgc cagcacagcc tgggccatga gcctgctgtg cacccacagc caccagatgg 13920 caatgttctc gcccaagcta caacaggagg tgtggggtag cgacccggat gctaccgcca 13980 gcagcagtat cgcgccgttc ggccgcactg aagaggttga gggtggcgtg tcgttcagcg 14040 gcgaaatggg ctggagttcc ggttgcgacc acgccgaatg ggcgattctc ggtttccgcc 14100 gcaagaatgc cgaaggcgct caggattact gcttcgccat cctgcctcgc agtgactatg 14160 aaatccgtga tgactggtat gccgtgggca tgcgcggcag cggcagcaag accctgatcg 14220 tgcgtgatgc cttcgtgccc gagcaccgca tccagaaggc caaggacatg atggagggca 14280 agtcggcggg ctttggtttg taccccgaca gcaagatttt cttcgccccg tatcgcccgt 14340 attttgccag cggcttctcc acggtcagct tgggcgttgc cgagcgcatg ctggaggtgt 14400 tccgcgagaa aacccgcaac cgcgtgcgtg cctacaccgg tgctgccgtg ggcgccgcca 14460 ccccggcgct gatgcgcctg gccgagtcga cccatcaggt ggccgctgcc cgggcattgc 14520 tggaaaagag ctgggacgag attgccgagc acagtgcccg tcacgaatac ccgtcgcgtg 14580 gcacgctggc gttctggcgt accaaccagg gctacgccgt gaagatgtgc atccaggccg 14640 tcgaccgcct gatggaagcg gccggtggtg gcgcctggtt cgagagcaac gaactgcagc 14700 ggctgttccg cgattcgcac atgaccggtg cccatgccta caccgattac gacgtgtgtg 14760 cgcaaatcct cggccgcgag ctgatgggcc tggagcctga cccggcgatg gtctgagccg 14820 ccacttgttt tcacccatcc cctacaagca caacaacaaa cagggcaggc tgccaggcct 14880 gcccgggagt cttgcatgtc caaagaaacc ttcgattcac gtgccttccg ccgcgccctg 14940 ggcaacttcg ccaccggcgt gaccgtggtg actgccgccg gccccagtgg ccgcaaggtc 15000 ggcgttaccg ccaacagctt caactcggtg tcgctggacc cggcgctgat cctgtggagc 15060 atcgacaagc gctccaccag ccatgaagtg ttcgaagagg cctcgcactt tgccgtgaac 15120 attctggctg cggaccagat cgacctgtcc aacaactttg cccgcccgaa ggaagatcgc 15180 tttgccggta tcgactacga gaccggcact ggcggcgcgc cgttgttcgc cgattgcgcg 15240 gcgcgctttg agtgtgaaaa gtaccagcag ctggacggtg gcgatcactg gatcctggtg 15300 ggcaaggtag tggcctttga tgactttggc cgctcgccgc tgctgtatca ccagggcgcc 15360 tattcaatgg tgctgccgca tacccgcatg acccaaggcg cagaggggca ggcaccgagc 15420 agccacttcc agggccgcct gcagcacaac ctgtactacc tgatgaccca ggcgctgcgt 15480 gcctaccagg ctgactacca gccacgccag ctgtgtaccg gcctgcgcac cagcgaggca 15540 cgcatgctga tggtgctgga gaacgatgcg ggcctgagcc tgaacgacct gcaacgcgaa 15600 gtggcgatgc cggcgcggga gatcgaggaa gcggttgcca acctcaagcg caaagggctg 15660 attgccgatg acgaagggcg agtgcggcta tcggtgaagg gcgtggacga gaccgaggcg 15720 ttgtggacca ttgcccggca acagcaggac aaggtgttcg ggcagttcag tgaacagcag 15780 ctggagactt tcaagaccgt gctcaaggcc cttatcaaca tctgaacacg ctttgggatg 15840 gcaccggctg ttttggatgg caccggctgt gccggtgttc gcggatgaac ccgctcccac 15900 aggtccagcg ccagtagcaa cttcggcgcg gtacctgtgg gagcggcttt agccgcgaac 15960 accggcaaag ccggtgccat ccaaccagaa gcctcagtag gcaccacccc cggcactggg 16020 gactaccact gtatccttga acttccccgc cagctcgcgc agcccgcgca tcagcaccgt 16080 ggtatccaca cccaccgcca caaacgccgc acccagctcg atgtagcgtc gcgccagttt 16140 ctcgtccgcg ctgagaatgc cggcggcttt gcccgccttg ccaatgcgca cgattgcgtc 16200 ttcaatcgcc gcctgcacct ccgggtgccc ggggttgccg cgatgcccca tggccgcact 16260 caggtctgca ggcccgatga acacgccatc cacaccttcc actgcaacga tctcgtccag 16320 gttggccagg ccttccttgt tctcgatctg caccagcagg cacatttgct catcggcgtg 16380 gtccaggtaa ccggggaggg tgttccagcg cgaagcccgc gccagcgcgc tgcccacccc 16440 gcgaatgccc ttgggcgggt aatgcatggc cttgaccagt tgccgcgcct gttcggcagt 16500 ttccaccatc ggcaccagca aggtttgtgc gccgatatcc agcacctgct tgatcagcgc 16560 ggtatcgccg atcaccgggc ggatcactgc ctggctgggg tagggtgcca ccgcctgcaa 16620 ctgggcgagc atgccgcgca ggtcgttggg cgcgtgttcg ccgtcgatca gcagccagtc 16680 gaaaccggca ttggccgcca gctcggcgca gtaggcatcg gccaggccga gccacaggcc 16740 gatttgcggt tcaccgctgt gcaggcgtcg cttgaagtgg ttgatgggca tgtccatgag 16800 caggtcctta aacgaagcgg caggcgatgg agccgagcat gtcgtagtcg acgtggaagg 16860 tgtcacctgg gcgagcggcg accgggcggg tgaacgaacc cccaaggatg atctggccgg 16920 gctgcaaggt gacgtcgtac ggcgccagtt tgttggccag ccaggcaacg cctttggccg 16980 ggtggttgag cacggcagcg ctgaccccgg attcctcgat cacgccattg cggtagagca 17040 ccgccggcac tttgcgcagg tcgatttcgg tggggcgcac ggcccgcccg cccatcacca 17100 cgccggcatt ggcggcgttg tcggagatgg tgtcgaacac cttgcgggtg gcctgggttt 17160 gcgggtccac ctgctggatg cgcgcgtcaa tgatttccag cgccgggatc acccactcgg 17220 tggcgtccag cacatcaaac acggtgatgt tcgggccctt cagcggcttg ccgaggatga 17280 acgccaactc cacttcaacc cgcggcacga tgaagcgctc gaaggggatg tcgctgcctt 17340 cgtcgaacag catgtcgtcg agcaaggcgc cgtagtcggg ctcggtgatg ttcgacgata 17400 cctgcatggc gcgcgaggtc aggccgatct tgtggcccac cagcttgcgc ccggcggcga 17460 tcttttttgc cacccaggcg cgctggatgg cgtaggcgtc ttcgatggtg attgccggtt 17520 gctccagcga gaactggcgc acttgctcgc gggagcgttc ggcctggtcg aggcggtcgg 17580 cggcgtgctg gatgaaagcg ttgtctagca tgggggcggt ctcttgattc aagggttgac 17640 gatggcagcc tgggtgcgca acaccagcag gccgcccagg gcgatgaaga cggcgagtac 17700 gtacagagca aggctggcgc tgtgggtggt gtcgcgcacc cagccgatga agtagggcgt 17760 gaagaacgag gcgatgctgc ccagcgagct gatcagggca atgccggcgg cctgggtacg 17820 ggcgttgagg aacgccggcg gcagttgcca gaacatcggc agcgcagcgc tggcgcccat 17880 gccggccagc accaggccgg ccattaccgg cagcgcctgc tcgggggcaa tggccgcaat 17940 agcgatgccg atggcagcca tcagcagcgg tacgcacagg tgccagcggc gttcgcgttg 18000 gcggtcgctg gagcggccgc acgccagcat gaacacgcag ccggccacgt acggcacagc 18060 gctgagcagg ccgacactgg cgtcgctggc cacaccggca ctgtgaatca ggctgggcat 18120 ccagaacgca agggtattca ccgccagcat caccgcgcaa tacacggcca ccaacagcca 18180 cagcgcacgg cttgcgaaaa tggcgccgaa cgaggttacg ggcttgcgct gttcttcctc 18240 accgaattgc gcgcgcagcg tggctttctg ctgctcatcc agccagctca cccgctcgaa 18300 gtgctccggc aaaacggcca gtaccaccag gcccagcaac accaccggcg ccccttcgag 18360 caggaacatc cactgccagc cacgcagccc gcccgtgtcg tgcatgaagg ccagtatggc 18420 cccggacact ggcccgccga ccactccggc caacggcacg gcaatggcga acagcgcggt 18480 gacctgggcg cggcgcccgg ccgggtacca gcggttgagg taaaccagaa tgcccgggaa 18540 gaacccggcc tcggccgcgc ccagggcaaa gcgcaacagg tagaacgcgc tgctgctttc 18600 gatcagcagc atgctggtcg acaacagccc ccacaccacc atcaggcagg cgatccagcg 18660 gcgtgggcca acgcggtcga gcatcaggtt gctggggacg ccgaacagcg cataggcaat 18720 gaagaacagc ccggcaccca ggccatagac cgtgtcggac aaatgcaggt cctggctcat 18780 ctgcatcttg gcgaagccaa tgttgatgcg gtccaggtgg gcgaacaggt agcacaccag 18840 cagcagcggc atcagccgcc aggtgactgc ccgatgggta ctgtcggccc gttcaacgtg 18900 tgcctcgcgc ggcgaggctt gttcgagtgt gctcatgttt ttgtacttat tctgtaatga 18960 gtcggggagg gcgtggtttg agccggcgcg ctagcggttg aacagtgggt gcaaggtgct 19020 gtgcttggcg tcgtagacct gggcggtgct gtggtcgatc tgcacggtga tgccgatcgg 19080 gcgctgttgc agcagtgggt ccaggcgcgc tttcaacact gccagcaagc tgtcgcccac 19140 tgttttgtgc acctcggcgc tacggccggt agccatgcgc aggttggcgt acagaaagcc 19200 gtattcgcct ttgccgtcgg ccaccgcgca atgggcggcg gggtaggcca gcacgcgtgt 19260 accgccagtg gggaacacgg ctttgccttc ggcatcgcgc tgttcgagca tggtgtcggc 19320 cagggcgcgg cacaggccgg ggatgtcggc gtcggtttcc aggtcggggg tatagagcag 19380 aaccaggtgt ggcatggggg cctcctcggt gaggggcggc tggccacccg ccagggcgac 19440 cagccgcgaa cgggtgggtt acaggcggct ggtgggcacc acggcggccg ggttggcggc 19500 ctgggcagcg gggatggcac caccgtcctg cggggtgacc gggaagatcg cgttgatctg 19560 gccggtgccg gaagagccga agtagggcgt gaccacttcg gccttgccgt cgtaatcgga 19620 ccagcccagc gcacccagca gcattgccgt gtcgtgcatg aagccttcac cgtggccttt 19680 ggcggcgtac tccggcagca tcccgcagaa cgcttcccac tcgccgtcct gccacatttg 19740 caccacacgg tggtcgaggg tttcgaggaa cgggctccac accttggtgg caaagtccgg 19800 cgcctggccg ttctgcgcga agcggtgcga cagcgagccg ctggccagga acgccacggt 19860 gccgtcgtag tggtcttcta ctgccttgcg catggcccag cccaggcggg cactgtcggc 19920 caggtagtgc gaggtgcaca gggccgagac cgagaccact ttgaagtgct ggtcctggtt 19980 catgtagcgc atgggcacca gggtgccgta ttccggggcg agggtggtgg cgtggtgggc 20040 catggtttcg acgttgaagc ggttgcactc ctcggccagc agcttgccca gctcgggatt 20100 gccggggaat gcgtagggca tgttgctgat gaagtgcggc agttcgttgc tggtgtacac 20160 gccctcgaaa tgcggcccgc acagcacgtg gtagttggcg ttgaccagcc agtgcgtgtc 20220 gaacacgacg atggtgtcca cgcccagctc acggcaacgg cggctgattt cgtgatgccc 20280 gtcgatggcc gcctggcgaa agccttggcg cgggcctggc agttcggaca tgtacatgga 20340 cggtacatgg gtaatcttgg cagtgagagc gagtttgccc atgggggtct ccgataagac 20400 gctgttgttg ttttggggct gacccggtcc cttgtaggag cggccttgtt ccgggatggg 20460 gcgcacagcg gccccggcga tatctgcggc gaggctgaaa tccaggggcc gctgcgcgcc 20520 ccatcgcggg cacaaggccg ctcctacacc cgggcggtgt aaaccgcaca gagggttaga 20580 tgccccagcg aggaatgtgg tgattaccca tggaaataca cacgttcttg atctctgcaa 20640 agacctcgaa gctgtactgc ccgccctcac gcccggtacc ggaacctttc acgccgccga 20700 acggctggcg caggtcgcgt acgttctggc tgttgatgaa caccatgccg gcctcgatgc 20760 cacgggccag gcgatgggct ttgccgatgt cctgggtcca gatgtacgag gccaggccat 20820 actcggtgtc gttggccagt tgcagcgcct cggcttcgtc cttgaacggg atcaggcaca 20880 ccaccgggcc aaagatttct tcctgggcaa tgcgcatctt gttgttcacg tcggcgaata 20940 cggtgggctg gatgaactgc cccttggcca ggtgcgcagg caggttggcc gggcgctcca 21000 ggcccccggc gaccaggcgt gcaccttctt cgatgccaat gcggatgtac ccggtgacct 21060 tgtcatagtg ctgctgggtg atcatcgaac cgacctgggt tttcgggtcg gtcgggtcac 21120 ctacgatcag gcgcttggcg cgcgccgcaa actctgcgac aaactgcggg tacacgcttt 21180 cctggatgaa gatgcggctg ccggcggtgc agcgctcgcc gttcagcgag aagatggtga 21240 acagcgcggc gtccagcgca cgctcaaggt ctgcgtcttc gaagatcagc acgggcgact 21300 tgccgcccag ttccatcgag tactttttaa ggcctgcggt ctgcatgatc ttcttgccgg 21360 tggcggtacc gccggtgaag gaaatggcgc gcacatcggg gtggcggacc agggcatcgc 21420 cggcggtagc gccgtaaccc tggatcacgt tcagcacccc gttggggatg ccggcttcta 21480 ccgccaggcg gcccagttcg ttggcggtca gaggcgacag ctcgctcatc ttcagcacgg 21540 cggtgttgcc cagcgccagg cacggcgcag tcttccaggt agccgtcatg aacggcacgt 21600 tccatgggct taccaggccg cacacaccca ccggctggta cagggtgtag ttgagcatct 21660 ggtcgtcgac cgggtaggta tggccgtcca tgcgcgtgca cacttcggcg aagaagtcga 21720 agttgtgcga ggcacgcggg atcagcacgt tcttggtctg gtggatcggc aggccggtgt 21780 cgagggtttc cagctcggcg agtttcggca cgttctgctc aatcagctca cccagcttgc 21840 gcatcagccg ggcacgttcc ttggccgggg tgttggccca cttggggaag gcttccttgg 21900 ccgcagccac agcctgggcc acttcctcgg cgccgccgct ggcgacttcg cagatggcgt 21960 cgccggtggc cgggttgtag ttgacgaagg tgtctttgct ctcgacctca cggccgttga 22020 tccagtgctt gatcatgctg ctcatgcctt gttgttcttg aagaagtcag cttcgctgac 22080 gatacggttg accaggcgac cgacgccttc cacttccacc accacttcgt cacccggcac 22140 cacatcggcc aggccttctg gcgtgccggt ggcgatcatg tcgcccggtt gcagggtcat 22200 gaagctggag aagtattcga tgaggtgcgg gatgtcgaag atcatgtccg cggtggtgcc 22260 ttcctgcttc agctcaccgt tgatccaggt gcgcagcttc aggttgctga cgtctggcac 22320 atcggccgca tcgacgatcc acgggccgac cggggtggtg gcatcgcggt ttttcacccg 22380 caggttgggg cggtagtagt tttccaggta gtcgcggatg gcgtagtcgt tgcacacggt 22440 gtagccggca acgtaggcca gggcgtcctc acgcttgacg ttcttcgccg ctttgccgat 22500 caccgccacc agctcgcact cgtagtgcat gtattcgacg ttgtccgggc gccaggtgac 22560 ctggatgtgg ccggtgtagg tgcctggcga cttgatgaaa gccaacggtt cggtgggcgg 22620 cgcgaaggcc agctccctgg cgtggtcggc gtagttcagg cccagggcga acatgctgcc 22680 ggtggcgggt ggcagccagg tgacctggtc ctgatggacc aggcggccgt cggcaaggcg 22740 caggtgatcg tcttcgaccg tgacatcgtg ggcctggccg tcgaactgga tacgggcgtg 22800 tttcacaggt aattcctcac tcggcgacga tgtggttggt cagcttgccc aggccgtcga 22860 tctcgatgtc gacgcggtca cctggctgta catcgacgcg gccctcgggg gttccggtga 22920 tcaggatgtc gccggcgtgc agggtcatga actcgctgat ttcggcaatc agctgcgcca 22980 ccgtgcgtac gcagttggcg gtgttgttgt gctggcgcag ttcgccgttc acatacaggc 23040 gcaggcccag ggcatcgggg ttggccactt ggctggcggg caccagttca gggccgaccg 23100 ggcaaaaacc atcacggcac ttggccttga ctgcagggcg gtagtagctg gcttcgggca 23160 ggctcacttc gttgacgatg gtgtagcccg ccacatgctc cagggcatcg gccacgctga 23220 cgcggctggc gtccttgcca atcaccactc ccagcgccgg gccgggttgc acgcgctgca 23280 cgccggccgg gaataccacc tggccttcat gctggttgcg ggtgttcggg gtcttgacga 23340 acaacaccgg cttgaccggc agttgcttgt acggtgcttc cacgaacgcc gcttggtgct 23400 gctgcagcaa accctggtag ttcagcgcga cgccgaacag ggtgccgctg gcaacgtcaa 23460 gcagggcatg gctcatgctc ttctcctggc agtgcagggc ggtggccgtc ctgcggattt 23520 cgttaatgtg ttaatgttat agttaatatg ttaacgatgg tcaaggggtg gccagtggcg 23580 cctgccggca aggcaaggca ccatgggcca tcgtcaacag ggtcaagcga tttgcgagca 23640 agcagccatg agcgaccggc atccgatacc gaacatcaac attggccagg tttacgacca 23700 gcgctacagc gacagcgagg tgcattacga ccggctgggc aacctggcgg gctttttcgg 23760 gcgcaacatg ccggtgcacc ggcatgaccg gtttttccag gtgcattacg tgaagtcggg 23820 cacagtacgg gtgtatctgg atgaccagca gtacatcgag gccgggccga tgttcttcct 23880 cacgccaccc acggtggcgc acgcgttcgt caccgaagct gacagcgacg ggcatgtgct 23940 gacggtgcgc cagcaactgg tgtggcaatt gatcgaagcc gacgccagcc tgctgccggc 24000 gggcatgcag gtgcagccag cctgtgtggc gctgggcaac ctgccggccg aatacaaggc 24060 cgaggcgcag cgcctgcaag gctggctgga cgcgttgagt gacgagtttg ccacgcagca 24120 accgggtcgc gaggcggcgt tgcagtcgct gacccgcctg atcatgatca gcctgctgcg 24180 gctgtgcccc aactcgctgg aatcgacccc ggcgcggcat gaagacctga agatcttcca 24240 ccgtttcaat gccctgatcg aagcgcatta ccttgagcat tggccgctgg cccgctacgc 24300 gcagcagatt ggcgtgaccg aggcacggct gaacgatgtg tgccggcgca tcgccgactt 24360 gccatccaag cgcctggtgc tggaacggct gatgcaggag gccaagcgtt tgctgttgtt 24420 ttccggcagc acggccaacg aaatctgtta ccagctcggc ttcaaggatc cggcctattt 24480 cagccgcttc ttcaaccgct acgccaagct cacacccggg gagtaccgcc agcggcaggc 24540 agaattgcag tgaaatggcc atggcggctc acccgggtgc tgttgttgtt tacagcggat 24600 ggtcgcagcc cgcgcgccgg gcttgaatgg gttttccgtg gaacagattg cactttccat 24660 cgtgcatgcc cttaaattcg tgaattgaga aaaagccaca ggtttgacca tgaccaagac 24720 gcaaccttcg ctcacgctaa gcctgttgca ggcccgagaa gccgcgatgg catttttcag 24780 gccgctgttg aaccagcacg acctgaccga gcagcaatgg cgggtaatcc gcatcctcaa 24840 gcagcacggc gagctggaga attatcagtt ggcggaactg gcctgcatcc tcaagccgag 24900 catgaccggg gtactggggc gcctggagcg agacgggctg gtgcggcggc agaaggccgc 24960 gcaggaccag cgacgggtgt tcgtcagcct gaccgaaaga ggggaggcgt gctttgcctc 25020 gatgaaggaa ggcatggagg ccaactacca gaagattcag gcgcagtttg gtgaagagaa 25080 gctgcagcag ctgatggggt tgttgaatga cctgaagcgc atcgcgccat aa 25132
<210> 2
<211> 12339
<212> DNA
<213> Pseudomonas putida U
<220>
<221> misc_feature
<222> (1) .. (12339)
<223> Cluster tyn
<400> 2 tcaggcgaaa cgctcgaagc ggtacggtga cgggtcgatc agcggggtgg cctgggccac 60 caggtctgcc gccagctggc cagcagcagg cgaggtgccg aagccatgcc cggaaaagcc 120 ggtggccagg gtcaggcccg gaatactggc caccgggccg atgaccgggt tggagtcggg 180 ggtgacgtca atcgtgccgg cccaggcgct ggcgatacgg gcctgttcga acaccggcca 240 ggccgctttc aggttgcgca tggcctcgtc gttgagggcc gggttggcgt gcgggtcttg 300 tacccgtaca cgctcgaagg gggttacatc cgttgccttc cagcgccggg ccagggccag 360 gtccttgaag aagtacttgc caaagctgat gcgcaaaaag tcccgctggg cacgcagctg 420 gggcaggtaa cgcttgccca gcagcaggtg atcgagggtg aggaaggcgt ccagcgcgcc 480 gcgctgggtg atgatgtagc cgccgtcctt gtgcttgcgg aaggaaaaat ctggtgcgcc 540 cacggcgatg tcggttggcc cgtccatggg ctctgtgcgc agcacggaac aggtcagcgg 600 caaggtcggc aggttgatgc ccaggttgcc gaggaacttg cgcgaccaca ggccaccggc 660 cagcaacacc tggtcgcagc ggatttcacc ttgctcggtg accaccccgc tgacacggcc 720 ggctgcggtg accagcgtgc gcaccgcgca gttctccact accactgcac ctttggcgat 780 cgccgcccgg gcgatggcgc tggcggccag ggtcggttcg gcgcgggcgt cggagggggt 840 gaagatgcca cctgcccaat ccgcccgacc acccggcacc atccgggtga tttcccgcgt 900 gctcagcagg cgcgaatcca ggcccagcgc ctcgacgctt ttcagccagc cttcatgcat 960 gcccatctgc gtgtcgttac ggccgatgaa catgatgccg gcttgccgat agccaacgtc 1020 gctgccaacc cgtgcgggca tctcggccca cagccgatca gccgccagtg ccaggggaat 1080 gtcatgggcg tggcggttgg tcttgcgcac ccagcccagg ttgcgcgacg actgctcccc 1140 agcgatgcgc cccttctcca gcaccaccac cggtatgttg cgttcggcga ggctcagtgc 1200 ggcggtgagg ccgataatgc cgccaccgat gatcaccacg gtagtggcgt cggggtggcg 1260 ggtgctggtt tgcacagggg cgatcgtggg agacatggct ttactctttg ttgtgcgtgc 1320 agggggagtg ttcagcgcca gccagcagcc tcactggcca aggcggatca gggtcacttg 1380 cgcttgcccc gcaccgcggt aggcggtgac ctccagctcg accttgtaaa cggtggagcc 1440 cagcggcggg caggtgaccg tggtggccgg gtcgatgccg cggaacttct cgccgatcac 1500 gtccatgacc cgtggtacat cggcagggtc ctggatgaac acgcgcgagt tgatgacatc 1560 ggccaggctg gcatcgactg cggccagcgc ggtttcgatg ttggcgaaca cctggtgggt 1620 ctgttcgatg acgtcctctg gaatgacctg ggtctgcggg ttgcgtccgg cggtgttgga 1680 gacgtgaatc cagttgtcca ccgccaccag gcgggagtag ctggccatgg cttcgaactt 1740 ggagccggtt ttcagtttga tgatctgtgt catgggcttt gccttgttat ccggttgcgg 1800 ggatcagctg agaacggggg tttcccagag gttgagcttt acgccgatgc cttgctcgag 1860 cgccttgcgg tacaccacgg tgccccaggc cacgtcttcg acgggcatgc cgcccaccga 1920 catcaggatg atttcgtcgt catgcaggcg gcccggtgcg tcgccgctga tgatcttgcc 1980 gatgtcttcc acctgctcgg cggccagcgt gccttcggca atcatgtcca tgaagcgcac 2040 acctaccagc ggtacgtggt tgtgcgcagg cttgggcagc tcttcgaacc aggcctcgta 2100 gaggccggtg ttgtccacca ccttgcgcac gtcgtcctgc tccatgccgg cgtcgatact 2160 gcacggggct ggcatggcca ggaacgcgcc aggcttgacc cactcgcggc gcaccagcgg 2220 gtactggctg gggtcgccga cttcgcccga gctgcagtag ctgaccaggt cggaaccgcg 2280 taccacttct tccagggttt ccaccacctg gacatgagtg atttgcggga agctggtttt 2340 cacccaggcg acgaaggcat ccaggttctt ctggccacgg cccttgacct tgagggtgtc 2400 gatcagcggg cagacggcca tgaacgcagc gaccgtggtc ttgcccatca cccccgggcc 2460 ggccaggccg atcaccttgg cgtccttgcg cgccaggtgg cgggcgccga cgcccgggat 2520 ggcgccggtg cggtaggccg acagcaggtt ggccgacatg tgtgccagtg gcgcgccggt 2580 gtcggcatcg ttgagggtga acatcaggat cgagcggggc aggcctttct cacggttggc 2640 gatgttcgag ccgtaccact tggcgcctgc ggtctggaag ttgccgccga ggtacgccgg 2700 catcgccatc atgcgccggt cggcggtggg cttgggcatg ttggggaatg gcgagtgctc 2760 ggggaaggta atcatcgcgc cgtgcgagtc gctgttcggg ccggccatgc ggtagtcacc 2820 ctggtacagc aggccgaaca tttcttccat ggtgtcgaca caggccggca tgtcggtgac 2880 gccggcacgg atcatgtcct gctcggacag gtagatgaag tcaattctgg tatcgagggt 2940 catggcgggt ctcgcagggc tggctgccgt cggatttgtt gttggtttcg aggcaaccag 3000 tttcgctaac gactggtagg tcgtcttgtg tctgcctgcc agccgagttg accgtcagtg 3060 ccagggcttc aatggcccgc gagcgagaag ctggccgggg tgtggcgcag gctgagggcg 3120 gtcagcaggc acaccaccag ggtgcacagg gccagcagcg cggcccatgc ggtcgggccg 3180 tggttgagta ccactgcggc cagcggggcg gcgccggcag acgccgacag ctggatggcg 3240 cccagcagcg ctgcggtgga acccagtgcc ttttcttgcg aggccatcac cagcgacatc 3300 agcgtcgact cggctatccc caggccgaac agggctatca ccatgccgcc ggccacacct 3360 ggcagcccca ggccggtcag tgcaccgagc aggctgatgc aggcaccgcc ggccatgcac 3420 agcacgccca cccgagtcaa ggtattgagg cccagccggc tgatcaggtg gctggccgtc 3480 atggcgccga gcaggatcga caccccggtg gcgccaaaca gcaggccgaa ggcctgggcg 3540 ctcaggccgt agtgggcctg gtacaccagg gtggcaccgc cgatgtaggc gaacaggaag 3600 aagaataccg cagcaaccgc cagggtcggg cgcaggaagc ggcggtcggc gaggatggcc 3660 aggtaggtgc tgcaggcgtg gcccaggcgc aggggttcgc gtttgctggg cggcagggtt 3720 tcgggcaggt tcagcaggct gttgaccagc accgtcacgc ccatgccggc gagtaccagc 3780 attactgcac gccagccgaa atgtgcgtcg atcacgccgc ccagggcagg tgccaggatc 3840 ggtgcgacgc cttcgatggt catcagcagg gcgaacagtt tggtcgcggc cacgccctgg 3900 ctcacatcac gcaccatgct catgatcacc accagggtca gcgcactgcc caggccctgg 3960 aaaaagcgca gcatgatcag ggtgtcgagg ctgggggctg cggctgcgcc cagcgagcac 4020 aggatgaaca gcagcaggcc ggccagcagc ggcttgcgcc ggccataagc gtcgacgatg 4080 gggccgaaga tcagctggcc ggcgcccatg gccagcagga agaaggtcag tgtcagctgt 4140 acgcgggtga agctagcctg atagtggctg gcgatttccg gcaggctcga caggtacatg 4200 tcgacggcgg aagggccgag ggcgccgatc aggcctaggc ccagggcgaa gctgaagggt 4260 atgggagggg agggattggc ttgcatggtt ttctctggct gatttttcgc ctaccgaccg 4320 gtaggtttgc gaatattatt cgccgagtcg gccaaggtca aacccttccg caaggccact 4380 gattcctgtg gggagcgggc atgcccgcga acaccggcaa agccggtgcc accgagtcgc 4440 cttcttcgcg ggcatgcccg ctcccacatt gaccgcagag gttggttacc gtggttgcgt 4500 cagaacggca cagccacggt cagctggcta tacacattgg taccattccc gcccacctgg 4560 ttgccgccgt tgctctcgtc cttgcgcggc tggtaaaggc ccaccagcgg gctgattatc 4620 aggtgctcgt tgactgccca ttccacatac aggtccagct cccgcgcatc gaggttgagg 4680 ctttcgcggg tgcgtacggt gtcgaagtcg aagtacagcg ccccgactgt gagattttcc 4740 agcggtgtcg ccttcacgcc cacatggtgg atacccgtgt tgctgttgaa ggggccggcg 4800 tagttggcag cgacttcacc ctggaaccag gtgccgtaac cgctggacag gccgctgaac 4860 agcgcgtccc agcctgccga gtagcgggtg tagcggtagg taacctgcgg tgcccacggc 4920 aggtcggcga aggtgtagcc ggcctgcagg taccaggctt gctcggggcc gtcggtcttg 4980 tcctgccagg cgtattcgaa ggcgaaactg gcattgtcga tgccagcgtt gccttcgccg 5040 cgcacgctat acacgtccat gccttcgcgg gctttctgaa agtcgctggc ccattggtcg 5100 gtgacgtcga tgccgtgaat ccaggtcagc ccgagggtgc ccaaggcttg ggtgtagtcc 5160 agcgtgccgg cggccagttc ggtttcggcc tgggcgcggt tgtcggattt cagccacagc 5220 aggctgccat gcaggccatc gctgcccccc aggcgcagca ttgcggtgcg gtcgaaggcg 5280 tggcgggcgg ccaggtagta ggccccgccg cggtccagcg caccgtcggc gacgccgttg 5340 cccaggttcg ggccgtcgtc gttgatcaaa aaaccactgc ccaggcgaat ggtctggcgg 5400 ccggcggaaa cgtccactcc atccttgccc agcaccggga acaggtcggc cgagcgccag 5460 ccgaggaagg cgtcttcgat cttggtggtg cgttcggagc catcggtgtt gccggccgca 5520 tcgccatcgc cccaggtggc cgagctcacc cagttcaggc tgccgtacag cgtgccgttg 5580 ccggccaggc cctggtcacc gctgaggcca tacttgataa agccttcacg ccaggtcgaa 5640 ccccctgtgg tgccgtcgta gttcttgcgg ctgttgaaca tgccccatac cgccagcatg 5700 tcggcgttca ggtggctgtc atcgtcggcg tacagctcaa cggccggcgc ggcctggctg 5760 gccagcaagg ttgccagggc caggctggac agcgtctgtg gtttgaccat ttgcacatcc 5820 ctcgtttgtt ctcggccacc ttcacagggg cctttgttgt tcgggggcac cctcggttct 5880 ggcgaggggc catcgcggtt ggcggcgatg gcctattagg gcgtgtgcgg tggggcgggg 5940 tcttgttcgt ggctgccaag gcgcttgcac gccttggcca caggcgcggt cagtagcgga 6000 tcatcaccga cttgagctcg gtgaagtcat cgatgaaggc cgagccgaac tcgcggccaa 6060 tgccggaagc cttgatgccc ccaaacggta cagccgggtc gagcagggtg tgcatgttga 6120 cccacagggt accggcctgg atttgcggga tcatgcgcat ggccttgccc aggtcgttgg 6180 tccacaggct ggcgctgagg ccgtagggcg aggcgttcat caggtgcagc agttcgtctt 6240 cgtcgtcata aggcaggaag gtcgccacag ggccgaaggt ttcctgggtg agcagggtgt 6300 cgcaggctga ccgggcgagg attaccgtgg gttcgacgaa acagccgggg ccgtcgccca 6360 gggtgccgcc gtgaatgatc tggctgcctt cggcgcgggc gatggcgaac agttcggcca 6420 gcttctgctg gtgcggcttg ttggccacgg ggccgaactg ggtggcctcg tccagtggcg 6480 agccgatttt cagttggccc aggcgctggg acagggcgtc cagcagcggg tcgatgcgcg 6540 agcggtgcac atagaagcgc tcgcccgcgg cgcagatttg ccccgagtgc aggaagccgg 6600 cctcgatgat gccgtccaca gccttgtcgg ttgccacgtc gggcaggaag gccaccgcgt 6660 tcttgccgcc cagttccagt gtcgcacggg tcagcttggc gcccatggca gcctggccta 6720 cggcgatgcc agtgggcacg gagccggtga acgagacctt gtcggtacct gcgtgctcga 6780 tcagtgcctt gcccaccagg ccaccaccgg tcagcacgtt cagtgcaccg gccggcaggc 6840 ctgcttcggt ggccagttcg gcaatgcgca gcagcgtcag cggggtgaat tcgctgggct 6900 tgaggataat gctgcagccg gttgtcaggg ccgaggccag cttccagatg gcgatcatgc 6960 tggcgaagtt ccacggcacg atgcccacca ccacgccaat cggctcgcgc agggtgaagg 7020 cgctgtagcg ctcaccggcg aacgagggca gcgacggggt gatggtctgg ccggtgatct 7080 tggtcgccca gccggcgtag tagcgcagga agtgcgcggc ctgctgtact tcgaacgcac 7140 gggaaatgcc gatgagcttg ccggattgca aggtttccag ctgcgccagt tcttcgcggt 7200 tggcttccag caggtcggcc agcttgaaca gcactgcggc gcgggcggcg gggctggtgt 7260 gcgaccaggc ggtaaagcct tggcgcgagg agctgacggc atggtcgaca tcggcctggt 7320 tggcgtcggc gatgtgggcg atggtctggc cgttggccgg gttgaccacg gcaatgttcg 7380 acgacgactg gctggcgagg tgctggccgt ggatgaacac gccatgctcg cgggccagga 7440 aggccgtgac ggcaggtagg agggtgatgt cgctcatgca gactccgggg cagttggcca 7500 aagtttgcag cttaataagc ggggcagtgc ggtgcttgtg cctgcgtgac aggtgcatga 7560 ctgtggctgc caaccgcact gggtaagcct tgtgggagcg gccttgtgtc gcgatagggc 7620 cgcagagcgg ccccggcgat gttggcggcg aagctgaaaa tgctggggcc gcttcgcgcc 7680 cctatcgcga cgcaaggccg ctcccacaaa aaaagcgagc gtaggccggg ctgattgctg 7740 gcaggcagca acaagcccgg cggcagccat cggcaagacg ccatgccacc ggcagcgcac 7800 agtaatcact cgttcaacgc cacaaaaaca agccggggca tacgatgtca ctcaataaca 7860 agctcaccga gcacctcaac cgcggcactg tcggtttccc caccgcactg gccagcactg 7920 tcgggctgat catggccagc ccggtgatcc tcaccgcgac catgggcttt ggcatcggcg 7980 gcagcgcctt cgccgtggcc atggtcatcg ccgcactgat gatgctggcg cagtccacca 8040 cctttgccga ggctgcgtcg atcctgccga ccacgggctc ggtatacgac tacatcaact 8100 gtggcatggg ccgtttcttc gccattaccg gcacgctgtc ggcctacctg atcgtgcatg 8160 tgttcgccgg taccgccgaa accatcctgt cgggggtgat ggcgctggtg aacttcgagc 8220 acctcaatac cctggcggaa tccgccggcg gttcgtggct gctgggggtg tgcttcgtgg 8280 tggcgtttgc ggtgctcaat gcctttggcg tcagcgcctt cagccgcgcg gaagtggtcc 8340 tcaccttcgg catgtggacc accttgatgg tgttcggcgt gcttggcctg atcgccgcac 8400 ccgcagtgga actggacggc ccgttcggcg tgtcgctggt gggcaccgac ctgatgacca 8460 tcctctcgct ggtcggcatg gccatgttca tgttcgttgg ctgcgagttc gtcacgccgc 8520 ttgcccccga actgcgtcgc tcggcctggg tgctgccgcg ggccatggcg ctgggcctgt 8580 ttggcgtggc cagctgcatg ttcatctacg gagcggcgat gaagcgccag gtggaaaacg 8640 tggtgctgga tgccgccagt ggcgtgcacc tgctggacac gcccatggcc atcccgcgct 8700 tcgccgagca ggtgatgggt gatattggcc cagtgtggct gggtatcggc ttcctgttcg 8760 ccggcgcggc caccatcaac acgctgatgg ccggtgtgcc acgcattctt tacggcatgg 8820 cggtggacgg cgcgttgccc aaggtgttca cctacctgca cccgcgcttc aagacgccgc 8880 tgctgtgcat cctggtggtg gcgttgatcc cttgcctgca tgcctggtac ctgggcggca 8940 acccggacaa catcctgcac ctggtgctgg ccgccgtgtg cgcctggagc accgcctacc 9000 tgctggtgac cctgtcggtg gtgatattgc gcatccgccg cccagacctg ccgcgtgcct 9060 accgctcgcc gctgttcccg ttgccgcaga tattctccag tagcggtatc ctcatcggca 9120 tggcgttcat cacaccgccg ggcatgaacc ctgccgatgt ctacgtgccg ttcgccatca 9180 tgcttggcgc cactgcggcc tatgcattgt tctggacgct gtgggtgcag aaggtcaacc 9240 cgttcaagcc ggcgcgggtc gaggatgtgc tcgagaaaga gtttgctgcc gagcctggcc 9300 acgccgtgga gcacgtgctg catgatcaga aatttgcgtg aacgcttgct ggcgccccga 9360 gcgccttcag gctatcgccc aggcgccacg ctggcatgcc tggcgcgcaa cctggggcag 9420 cagaacctgg tggcggccgg ggtgatccac gacccggccc agggttggca ggccacggtg 9480 cacgaacgcg tcgaggccca cctgctgatg cacatcgtca cctgtgagtt ccagctgcag 9540 ttgcctgctc cgcaaggggg cgaggtcagc ctggagctgc gccataccgg tgcgcttcgc 9600 cgtgccggcc tggcctgtgt gtaccgcaag ggcgaccggg cgcgcttcgc ccgactgcgc 9660 gaccggttgc tgcagcaggc cgcactggtg gcggcgctga tgccgctgga tttcaagcgc 9720 ctgaccttgg cctggcgcga cggccaatgg ttgctgaccc tggagcacat gggcggtagc 9780 gaagtggtca accgcatgcc agcgtttcgc cgctacatcc ccatcagccc gcaacagcgg 9840 gcgcacctga tggccagcct ggcccagttc aacactttgc tacctaacct ttgacgcaaa 9900 ctggcatacg ccttgctgta tcaagcgacg aatgatgaca gttgtgcgca catagataac 9960 atgttaacaa tgtgcgcata acaacaaatc ctgcgtcgag ggcagccatg catactcaac 10020 aatccaaccg tcaggggctg gaacgctgga ccacggccat gcaacagatc tgtggccgtt 10080 tcgagacgga acttgcgtcc aatcactcgc tgttcatcgg cgaggtttct accttttccc 10140 gtgccggctt gccgctggcc aacctgcgca ccaatgccgg caacatccgc cggctgggcg 10200 aaaacccgac ccttgacgat gaccagcatt gtttcctggt cagccagcgt gcggggcatt 10260 ccaccgtgtc ccaggggggc atgcaggtca gcctggcgcc gggtgagctg ctgctgatgg 10320 attcggtcgg gcgctgcgaa atcaccccca gtgggttgat cgaacatgtc tcgctggccc 10380 tgtcgcgtga gcaggtacgc aagtatgtgc aaggcagcgg cccgatgttt ggcaagatct 10440 cctcgagcaa cgcctgcggg cgcatgctgc atgtgctgat ggaccaactg tgcaaggacg 10500 gcaatgtaag cggtgatggg gcccagggcg acgcgctgca gaccgccttc attgccctgc 10560 tggagccagg cttcgagcgc catggcgaag cgctgggcaa ccttggggcc ttgaacgggg 10620 ccaacctgcg gggctacgtg cagcaggtga tcgacgagtc cctgtcacag cccgggctga 10680 ccccgtccaa cctggccggt cgcctgaaca tctcggtgcg tcacctgtac cggctgttcg 10740 aggaggaggg cgatagtgtg tgccgctaca ttcagcgggc gcgcctgaag cgcagtgcgg 10800 atgacctggc caacccgttc ttcaggagcg agtcgattac ctcgattgcc tacaagtggg 10860 ggtttaccga ctcggcgcat ttcagccgct cgttcaagaa acagttcgaa cgctcgccca 10920 aggactaccg ggcgcaggcg atggtttgag tgtgatggtg ctgcttgtgc gggcctcatc 10980 gccggcaagt cacttggcgg cggttcagcg acggccgttg aagtagcccg acagctggtg 11040 cacggtcttg ccggcagtga gcagcagcgg gcggaaatgg tccttgccga ggatgcgcgc 11100 atgcttgacc gagctgacca ggtcatagcg cttcgatccc tcctgcatac cctcggcgag 11160 tatcttgcaa atgatgtggc tgggcgtgac gccaaagccg gagtagccct gcacatagaa 11220 agcgttgggg cggttgtcga gggtgcctat ctgcggaaac aggttggcac tggtggccat 11280 cgggccgccc caggccaggt cgatgcgcac gtctttcagg taggggaaaa tcttcagcat 11340 cagcgcgcgg ttccacgcct tcaggtccag cgggaagtgc tcgacgaagg gcgtggcggc 11400 gccaaacagc aggcggttct cgcgggtgac ccggtagtag tcgatcaccg ggcggatgtc 11460 gctgtaggcc ccgcgtatcg ggctgatgcg ctcgatcagc tcatccggca atggctcggt 11520 catcatctgg aaggcatagg tgtttatagt gcgtgcgtgc agctgcggct ccagcttgtt 11580 gaggaagctg tcgcacgccc acagcagctt gctggcgcgt accgagccac ggccggtgcg 11640 taccgtgatg cgctcgccgt aggtcacttc cagggccggg ctgtgttcga agatgcgcgc 11700 accatggccc accagtgcct gcgcttcgcc cagcagcagg ttcagggaat gcacatggcc 11760 accgcccatg tgcatcaggg cgctgctgta ggcgttgctg ccgatgatct ggcgcacttc 11820 gctgccaccg agaaaacgga tctcgtcgcg ggtattgatc gccttgaacg ccttctccca 11880 tttgcgcagg gtctgttcct ggcggcggtt gaagcccatg tagccatagc cgtggcagaa 11940 gtcggcgtcg atggcgtagc gggcgatgcg gtccttgatg atgccggcgc ccagttcgct 12000 gatttcgaaa atatccctca cgccctgatc accgacgctg ctgcggatct tctccaggtc 12060 gtggccgatg cccgccatga tctgcccgcc gttgcgcccg ctaccgccgt agcccagata 12120 acggccctcg agcacgacga tattggtcac gccttgttcc gccagctcca gggcggtgtt 12180 aatgccggag aaaccgccac cgatcaccac gacatcggcc tcgatgtcgc gttccagggt 12240 tgggaagctc aggttgtact tcttggtcgc cgagtagtag gtggggctct cgagggtgat 12300 catgacgccg cctgctgact ggaaatgggt agaaatcat 12339
<210> 3
<211> 1296
<212> DNA
<213> Pseudomonas putida U
<220>
<221> CDS
<222> (1) .. (1296)
<223> Secuencia codificante de TynA
<400> 3 atg tct ccc acg atc gcc cct gtg caa acc agc acc cgc cac ccc gac 48 Met Ser Pro Thr Ile Ala Pro Val Gln Thr Ser Thr Arg His Pro Asp1 5 10 15
gcc act acc gtg gtg atc atc ggt ggc ggc att atc ggc ctc acc gcc 96 Ala Thr Thr Val Val Ile Ile Gly Gly Gly Ile Ile Gly Leu Thr Ala
gca ctg agc ctc gcc gaa cgc aac ata ccg gtg gtg gtg ctg gag aag 144 Ala Leu Ser Leu Ala Glu Arg Asn Ile Pro Val Val Val Leu Glu Lys
ggg cgc atc gct ggg gag cag tcg tcg cgc aac ctg ggc tgg gtg cgc 192 Gly Arg Ile Ala Gly Glu Gln Ser Ser Arg Asn Leu Gly Trp Val Arg
50 55 60
aag acc aac cgc cac gcc cat gac att ccc ctg gca ctg gcg gct gat 240 Lys Thr Asn Arg His Ala His Asp Ile Pro Leu Ala Leu Ala Ala Asp65 70 75 80
cgg ctg tgg gcc gag atg ccc gca cgg gtt ggc agc gac gtt ggc tat 288 Arg Leu Trp Ala Glu Met Pro Ala Arg Val Gly Ser Asp Val Gly Tyr
85 90 95
cgg caa gcc ggc atc atg ttc atc ggc cgt aac gac acg cag atg ggc 336 Arg Gln Ala Gly Ile Met Phe Ile Gly Arg Asn Asp Thr Gln Met Gly
100 105 110
atg cat gaa ggc tgg ctg aaa agc gtc gag gcg ctg ggc ctg gat tcg 384 Met His Glu Gly Trp Leu Lys Ser Val Glu Ala Leu Gly Leu Asp Ser
115 120 125 cgc ctg ctg agc acg cgg gaa atc acc cgg atg gtg ccg ggt ggt cgg 432 Arg Leu Leu Ser Thr Arg Glu Ile Thr Arg Met Val Pro Gly Gly Arg
130 135 140
gcg gat tgg gca ggt ggc atc ttc acc ccc tcc gac gcc cgc gcc gaa 480 Ala Asp Trp Ala Gly Gly Ile Phe Thr Pro Ser Asp Ala Arg Ala Glu145 150 155 160
ccg acc ctg gcc gcc agc gcc atc gcc cgg gcg gcg atc gcc aaa ggt 528 Pro Thr Leu Ala Ala Ser Ala Ile Ala Arg Ala Ala Ile Ala Lys Gly
165 170 175
gca gtg gta gtg gag aac tgc gcg gtg cgc acg ctg gtc acc gca gcc 576 Ala Val Val Val Glu Asn Cys Ala Val Arg Thr Leu Val Thr Ala Ala
180 185 190
ggc cgt gtc agc ggg gtg gtc acc gag caa ggt gaa atc cgc tgc gac 624 Gly Arg Val Ser Gly Val Val Thr Glu Gln Gly Glu Ile Arg Cys Asp
195 200 205
cag gtg ttg ctg gcc ggt ggc ctg tgg tcg cgc aag ttc ctc ggc aac 672 Gln Val Leu Leu Ala Gly Gly Leu Trp Ser Arg Lys Phe Leu Gly Asn
210 215 220
ctg ggc atc aac ctg ccg acc ttg ccg ctg acc tgt tcc gtg ctg cgc 720 Leu Gly Ile Asn Leu Pro Thr Leu Pro Leu Thr Cys Ser Val Leu Arg225 230 235 240
aca gag ccc atg gac ggg cca acc gac atc gcc gtg ggc gca cca gat 768 Thr Glu Pro Met Asp Gly Pro Thr Asp Ile Ala Val Gly Ala Pro Asp
245 250 255
ttt tcc ttc cgc aag cac aag gac ggc ggc tac atc atc acc cag cgc 816 Phe Ser Phe Arg Lys His Lys Asp Gly Gly Tyr Ile Ile Thr Gln Arg
260 265 270
ggc gcg ctg gac gcc ttc ctc acc ctc gat cac ctg ctg ctg ggc aag 864 Gly Ala Leu Asp Ala Phe Leu Thr Leu Asp His Leu Leu Leu Gly Lys
275 280 285
cgt tac ctg ccc cag ctg cgt gcc cag cgg gac ttt ttg cgc atc agc 912 Arg Tyr Leu Pro Gln Leu Arg Ala Gln Arg Asp Phe Leu Arg Ile Ser
290 295 300
ttt ggc aag tac ttc ttc aag gac ctg gcc ctg gcc cgg cgc tgg aag 960 Phe Gly Lys Tyr Phe Phe Lys Asp Leu Ala Leu Ala Arg Arg Trp Lys305 310 315 320
gca acg gat gta acc ccc ttc gag cgt gta cgg gta caa gac ccg cac 1008 Ala Thr Asp Val Thr Pro Phe Glu Arg Val Arg Val Gln Asp Pro His
325 330 335
gcc aac ccg gcc ctc aac gac gag gcc atg cgc aac ctg aaa gcg gcc 1056 Ala Asn Pro Ala Leu Asn Asp Glu Ala Met Arg Asn Leu Lys Ala Ala
340 345 350
tgg ccg gtg ttc gaa cag gcc cgt atc gcc agc gcc tgg gcc ggc acg 1104 Trp Pro Val Phe Glu Gln Ala Arg Ile Ala Ser Ala Trp Ala Gly Thr
355 360 365 att gac gtc acc ccc gac tcc aac ccg gtc atc ggc ccg gtg gcc agtIle Asp Val Thr Pro Asp Ser Asn Pro Val Ile Gly Pro Val Ala Ser370 375 380 1152 att ccg ggc ctg acc ctg gcc acc ggc ttt tcc ggg cat ggc ttc ggcIle Pro Gly Leu Thr Leu Ala Thr Gly Phe Ser Gly His Gly Phe Gly385 390 395 400 1200 acc tcg cct gct gct ggc cag ctg gcg gca gac ctg gtg gcc cag gccThr Ser Pro Ala Ala Gly Gln Leu Ala Ala Asp Leu Val Ala Gln Ala405 410 415 1248 acc ccg ctg atc gac ccg tca ccg tac cgc ttc gag cgt ttc gcc tgaThr Pro Leu Ile Asp Pro Ser Pro Tyr Arg Phe Glu Arg Phe Ala420 425 430 1296 <210> 4 <211> 431 <212> PRT <213> Pseudomonas putida U <400> 4 Met Ser Pro Thr Ile Ala Pro Val Gln Thr Ser Thr Arg His Pro Asp1 5 10 15 Ala Thr Thr Val Val Ile Ile Gly Gly Gly Ile Ile Gly Leu Thr Ala20 25 30 Ala Leu Ser Leu Ala Glu Arg Asn Ile Pro Val Val Val Leu Glu Lys35 40 45 Gly Arg Ile Ala Gly Glu Gln Ser Ser Arg Asn Leu Gly Trp Val Arg50 55 60 Lys Thr Asn Arg His Ala His Asp Ile Pro Leu Ala Leu Ala Ala Asp65 70 75 80 Arg Leu Trp Ala Glu Met Pro Ala Arg Val Gly Ser Asp Val Gly Tyr85 90 95 Arg Gln Ala Gly Ile Met Phe Ile Gly Arg Asn Asp Thr Gln Met Gly100 105 110 Met His Glu Gly Trp Leu Lys Ser Val Glu Ala Leu Gly Leu Asp Ser115 120 125 Arg Leu Leu Ser Thr Arg Glu Ile Thr Arg Met Val Pro Gly Gly Arg 130 135 140 Ala Asp Trp Ala Gly Gly Ile Phe Thr Pro Ser Asp Ala Arg Ala Glu145 150 155 160 Pro Thr Leu Ala Ala Ser Ala Ile Ala Arg Ala Ala Ile Ala Lys Gly165 170 175Ala Val Val Val Glu Asn Cys Ala Val Arg Thr Leu Val Thr Ala Ala180 185 190
Gly Arg Val Ser Gly Val Val Thr Glu Gln Gly Glu Ile Arg Cys Asp195 200 205
Gln Val Leu Leu Ala Gly Gly Leu Trp Ser Arg Lys Phe Leu Gly Asn210 215 220
Leu Gly Ile Asn Leu Pro Thr Leu Pro Leu Thr Cys Ser Val Leu Arg225 230 235 240
Thr Glu Pro Met Asp Gly Pro Thr Asp Ile Ala Val Gly Ala Pro Asp245 250 255
Phe Ser Phe Arg Lys His Lys Asp Gly Gly Tyr Ile Ile Thr Gln Arg260 265 270
Gly Ala Leu Asp Ala Phe Leu Thr Leu Asp His Leu Leu Leu Gly Lys275 280 285
Arg Tyr Leu Pro Gln Leu Arg Ala Gln Arg Asp Phe Leu Arg Ile Ser290 295 300
Phe Gly Lys Tyr Phe Phe Lys Asp Leu Ala Leu Ala Arg Arg Trp Lys305 310 315 320
Ala Thr Asp Val Thr Pro Phe Glu Arg Val Arg Val Gln Asp Pro His325 330 335
Ala Asn Pro Ala Leu Asn Asp Glu Ala Met Arg Asn Leu Lys Ala Ala340 345 350
Trp Pro Val Phe Glu Gln Ala Arg Ile Ala Ser Ala Trp Ala Gly Thr355 360 365
Ile Asp Val Thr Pro Asp Ser Asn Pro Val Ile Gly Pro Val Ala Ser370 375 380
Ile Pro Gly Leu Thr Leu Ala Thr Gly Phe Ser Gly His Gly Phe Gly385 390 395 400
Thr Ser Pro Ala Ala Gly Gln Leu Ala Ala Asp Leu Val Ala Gln Ala405 410 415
Thr Pro Leu Ile Asp Pro Ser Pro Tyr Arg Phe Glu Arg Phe Ala420 425 430
<210> 5
<211> 1140
<212> DNA
<213> Pseudomonas putida U
<220>
<221> CDS
<222> (1) .. (1140)
<223> TynB <400> 5 atg acc ctc gat acc aga att gac ttc atc tac ctg tcc gag cag gac 48 Met Thr Leu Asp Thr Arg Ile Asp Phe Ile Tyr Leu Ser Glu Gln Asp1 5 10 15
atg atc cgt gcc ggc gtc acc gac atg ccg gcc tgt gtc gac acc atg 96 Met Ile Arg Ala Gly Val Thr Asp Met Pro Ala Cys Val Asp Thr Met
gaa gaa atg ttc ggc ctg ctg tac cag ggt gac tac cgc atg gcc ggc 144 Glu Glu Met Phe Gly Leu Leu Tyr Gln Gly Asp Tyr Arg Met Ala Gly
ccg aac agc gac tcg cac ggc gcg atg att acc ttc ccc gag cac tcg 192 Pro Asn Ser Asp Ser His Gly Ala Met Ile Thr Phe Pro Glu His Ser
50 55 60
cca ttc ccc aac atg ccc aag ccc acc gcc gac cgg cgc atg atg gcg 240 Pro Phe Pro Asn Met Pro Lys Pro Thr Ala Asp Arg Arg Met Met Ala65 70 75 80
atg ccg gcg tac ctc ggc ggc aac ttc cag acc gca ggc gcc aag tgg 288 Met Pro Ala Tyr Leu Gly Gly Asn Phe Gln Thr Ala Gly Ala Lys Trp
85 90 95
tac ggc tcg aac atc gcc aac cgt gag aaa ggc ctg ccc cgc tcg atc 336 Tyr Gly Ser Asn Ile Ala Asn Arg Glu Lys Gly Leu Pro Arg Ser Ile
100 105 110
ctg atg ttc acc ctc aac gat gcc gac acc ggc gcg cca ctg gca cac 384 Leu Met Phe Thr Leu Asn Asp Ala Asp Thr Gly Ala Pro Leu Ala His
115 120 125
atg tcg gcc aac ctg ctg tcg gcc tac cgc acc ggc gcc atc ccg ggc 432 Met Ser Ala Asn Leu Leu Ser Ala Tyr Arg Thr Gly Ala Ile Pro Gly
130 135 140
gtc ggc gcc cgc cac ctg gcg cgc aag gac gcc aag gtg atc ggc ctg 480 Val Gly Ala Arg His Leu Ala Arg Lys Asp Ala Lys Val Ile Gly Leu145 150 155 160
gcc ggc ccg ggg gtg atg ggc aag acc acg gtc gct gcg ttc atg gcc 528 Ala Gly Pro Gly Val Met Gly Lys Thr Thr Val Ala Ala Phe Met Ala
165 170 175
gtc tgc ccg ctg atc gac acc ctc aag gtc aag ggc cgt ggc cag aag 576 Val Cys Pro Leu Ile Asp Thr Leu Lys Val Lys Gly Arg Gly Gln Lys
180 185 190
aac ctg gat gcc ttc gtc gcc tgg gtg aaa acc agc ttc ccg caa atc 624 Asn Leu Asp Ala Phe Val Ala Trp Val Lys Thr Ser Phe Pro Gln Ile
195 200 205
act cat gtc cag gtg gtg gaa acc ctg gaa gaa gtg gta cgc ggt tcc 672 Thr His Val Gln Val Val Glu Thr Leu Glu Glu Val Val Arg Gly Ser
210 215 220
gac ctg gtc agc tac tgc agc tcg ggc gaa gtc ggc gac ccc agc cag 720 Asp Leu Val Ser Tyr Cys Ser Ser Gly Glu Val Gly Asp Pro Ser Gln 225 230 235 240
tac ccg ctg gtg cgc cgc gag tgg gtc aag cct ggc gcg ttc ctg gcc 768 Tyr Pro Leu Val Arg Arg Glu Trp Val Lys Pro Gly Ala Phe Leu Ala
245 250 255
atg cca gcc ccg tgc agt atc gac gcc ggc atg gag cag gac gac gtg 816 Met Pro Ala Pro Cys Ser Ile Asp Ala Gly Met Glu Gln Asp Asp Val
260 265 270
cgc aag gtg gtg gac aac acc ggc ctc tac gag gcc tgg ttc gaa gag 864 Arg Lys Val Val Asp Asn Thr Gly Leu Tyr Glu Ala Trp Phe Glu Glu
275 280 285
ctg ccc aag cct gcg cac aac cac gta ccg ctg gta ggt gtg cgc ttc 912 Leu Pro Lys Pro Ala His Asn His Val Pro Leu Val Gly Val Arg Phe
290 295 300
atg gac atg att gcc gaa ggc acg ctg gcc gcc gag cag gtg gaa gac 960 Met Asp Met Ile Ala Glu Gly Thr Leu Ala Ala Glu Gln Val Glu Asp305 310 315 320
atc ggc aag atc atc agc ggc gac gca ccg ggc cgc ctg cat gac gac 1008 Ile Gly Lys Ile Ile Ser Gly Asp Ala Pro Gly Arg Leu His Asp Asp
325 330 335
gaa atc atc ctg atg tcg gtg ggc ggc atg ccc gtc gaa gac gtg gcc 1056 Glu Ile Ile Leu Met Ser Val Gly Gly Met Pro Val Glu Asp Val Ala
340 345 350
tgg ggc acc gtg gtg tac cgc aag gcg ctc gag caa ggc atc ggc gta 1104 Trp Gly Thr Val Val Tyr Arg Lys Ala Leu Glu Gln Gly Ile Gly Val
355 360 365
aag ctc aac ctc tgg gaa acc ccc gtt ctc agc tga 1140 Lys Leu Asn Leu Trp Glu Thr Pro Val Leu Ser
370 375
<210> 6
<211> 379
<212> PRT
<213> Pseudomonas putida U
<400> 6
Met Thr Leu Asp Thr Arg Ile Asp Phe Ile Tyr Leu Ser Glu Gln Asp1 5 10 15
Met Ile Arg Ala Gly Val Thr Asp Met Pro Ala Cys Val Asp Thr Met20 25 30
Glu Glu Met Phe Gly Leu Leu Tyr Gln Gly Asp Tyr Arg Met Ala Gly35 40 45
Pro Asn Ser Asp Ser His Gly Ala Met Ile Thr Phe Pro Glu His Ser50 55 60
Pro Phe Pro Asn Met Pro Lys Pro Thr Ala Asp Arg Arg Met Met Ala 65 70 75 80
Met Pro Ala Tyr Leu Gly Gly Asn Phe Gln Thr Ala Gly Ala Lys Trp85 90 95
Tyr Gly Ser Asn Ile Ala Asn Arg Glu Lys Gly Leu Pro Arg Ser Ile100 105 110
Leu Met Phe Thr Leu Asn Asp Ala Asp Thr Gly Ala Pro Leu Ala His115 120 125
Met Ser Ala Asn Leu Leu Ser Ala Tyr Arg Thr Gly Ala Ile Pro Gly130 135 140
Val Gly Ala Arg His Leu Ala Arg Lys Asp Ala Lys Val Ile Gly Leu145 150 155 160
Ala Gly Pro Gly Val Met Gly Lys Thr Thr Val Ala Ala Phe Met Ala165 170 175
Val Cys Pro Leu Ile Asp Thr Leu Lys Val Lys Gly Arg Gly Gln Lys180 185 190
Asn Leu Asp Ala Phe Val Ala Trp Val Lys Thr Ser Phe Pro Gln Ile195 200 205
Thr His Val Gln Val Val Glu Thr Leu Glu Glu Val Val Arg Gly Ser210 215 220
Asp Leu Val Ser Tyr Cys Ser Ser Gly Glu Val Gly Asp Pro Ser Gln225 230 235 240
Tyr Pro Leu Val Arg Arg Glu Trp Val Lys Pro Gly Ala Phe Leu Ala245 250 255
Met Pro Ala Pro Cys Ser Ile Asp Ala Gly Met Glu Gln Asp Asp Val260 265 270
Arg Lys Val Val Asp Asn Thr Gly Leu Tyr Glu Ala Trp Phe Glu Glu275 280 285
Leu Pro Lys Pro Ala His Asn His Val Pro Leu Val Gly Val Arg Phe290 295 300
Met Asp Met Ile Ala Glu Gly Thr Leu Ala Ala Glu Gln Val Glu Asp305 310 315 320
Ile Gly Lys Ile Ile Ser Gly Asp Ala Pro Gly Arg Leu His Asp Asp325 330 335
Glu Ile Ile Leu Met Ser Val Gly Gly Met Pro Val Glu Asp Val Ala340 345 350
Trp Gly Thr Val Val Tyr Arg Lys Ala Leu Glu Gln Gly Ile Gly Val355 360 365
Lys Leu Asn Leu Trp Glu Thr Pro Val Leu Ser370 375
<210> 7
<211> 1488
<212> DNA
<213> Pseudomonas putida U
<220>
<221> CDS
<222> (1) .. (1488)
<223> TynC
<400> 7 atg agc gac atc acc ctc cta cct gcc gtc acg gcc ttc ctg gcc cgc 48 Met Ser Asp Ile Thr Leu Leu Pro Ala Val Thr Ala Phe Leu Ala Arg1 5 10 15
gag cat ggc gtg ttc atc cac ggc cag cac ctc gcc agc cag tcg tcg 96 Glu His Gly Val Phe Ile His Gly Gln His Leu Ala Ser Gln Ser Ser
tcg aac att gcc gtg gtc aac ccg gcc aac ggc cag acc atc gcc cac 144 Ser Asn Ile Ala Val Val Asn Pro Ala Asn Gly Gln Thr Ile Ala His
atc gcc gac gcc aac cag gcc gat gtc gac cat gcc gtc agc tcc tcg 192 Ile Ala Asp Ala Asn Gln Ala Asp Val Asp His Ala Val Ser Ser Ser
50 55 60
cgc caa ggc ttt acc gcc tgg tcg cac acc agc ccc gcc gcc cgc gcc 240 Arg Gln Gly Phe Thr Ala Trp Ser His Thr Ser Pro Ala Ala Arg Ala65 70 75 80
gca gtg ctg ttc aag ctg gcc gac ctg ctg gaa gcc aac cgc gaa gaa 288 Ala Val Leu Phe Lys Leu Ala Asp Leu Leu Glu Ala Asn Arg Glu Glu
85 90 95
ctg gcg cag ctg gaa acc ttg caa tcc ggc aag ctc atc ggc att tcc 336 Leu Ala Gln Leu Glu Thr Leu Gln Ser Gly Lys Leu Ile Gly Ile Ser
100 105 110
cgt gcg ttc gaa gta cag cag gcc gcg cac ttc ctg cgc tac tac gcc 384 Arg Ala Phe Glu Val Gln Gln Ala Ala His Phe Leu Arg Tyr Tyr Ala
115 120 125
ggc tgg gcg acc aag atc acc ggc cag acc atc acc ccg tcg ctg ccc 432 Gly Trp Ala Thr Lys Ile Thr Gly Gln Thr Ile Thr Pro Ser Leu Pro
130 135 140
tcg ttc gcc ggt gag cgc tac agc gcc ttc acc ctg cgc gag ccg att 480 Ser Phe Ala Gly Glu Arg Tyr Ser Ala Phe Thr Leu Arg Glu Pro Ile145 150 155 160
ggc gtg gtg gtg ggc atc gtg ccg tgg aac ttc gcc agc atg atc gcc 528 Gly Val Val Val Gly Ile Val Pro Trp Asn Phe Ala Ser Met Ile Ala
165 170 175
atc tgg aag ctg gcc tcg gcc ctg aca acc ggc tgc agc att atc ctc 576 Ile Trp Lys Leu Ala Ser Ala Leu Thr Thr Gly Cys Ser Ile Ile Leu
180 185 190 aag ccc agc gaa ttc acc ccg ctg acg ctg ctg cgc att gcc gaa ctg 624 Lys Pro Ser Glu Phe Thr Pro Leu Thr Leu Leu Arg Ile Ala Glu Leu
195 200 205
gcc acc gaa gca ggc ctg ccg gcc ggt gca ctg aac gtg ctg acc ggt 672 Ala Thr Glu Ala Gly Leu Pro Ala Gly Ala Leu Asn Val Leu Thr Gly
210 215 220
ggt ggc ctg gtg ggc aag gca ctg atc gag cac gca ggt acc gac aag 720 Gly Gly Leu Val Gly Lys Ala Leu Ile Glu His Ala Gly Thr Asp Lys225 230 235 240
gtc tcg ttc acc ggc tcc gtg ccc act ggc atc gcc gta ggc cag gct 768 Val Ser Phe Thr Gly Ser Val Pro Thr Gly Ile Ala Val Gly Gln Ala
245 250 255
gcc atg ggc gcc aag ctg acc cgt gcg aca ctg gaa ctg ggc ggc aag 816 Ala Met Gly Ala Lys Leu Thr Arg Ala Thr Leu Glu Leu Gly Gly Lys
260 265 270
aac gcg gtg gcc ttc ctg ccc gac gtg gca acc gac aag gct gtg gac 864 Asn Ala Val Ala Phe Leu Pro Asp Val Ala Thr Asp Lys Ala Val Asp
275 280 285
ggc atc atc gag gcc ggc ttc ctg cac tcg ggg caa atc tgc gcc gcg 912 Gly Ile Ile Glu Ala Gly Phe Leu His Ser Gly Gln Ile Cys Ala Ala
290 295 300
ggc gag cgc ttc tat gtg cac cgc tcg cgc atc gac ccg ctg ctg gac 960 Gly Glu Arg Phe Tyr Val His Arg Ser Arg Ile Asp Pro Leu Leu Asp305 310 315 320
gcc ctg tcc cag cgc ctg ggc caa ctg aaa atc ggc tcg cca ctg gac 1008 Ala Leu Ser Gln Arg Leu Gly Gln Leu Lys Ile Gly Ser Pro Leu Asp
325 330 335
gag gcc acc cag ttc ggc ccc gtg gcc aac aag ccg cac cag cag aag 1056 Glu Ala Thr Gln Phe Gly Pro Val Ala Asn Lys Pro His Gln Gln Lys
340 345 350
ctg gcc gaa ctg ttc gcc atc gcc cgc gcc gaa ggc agc cag atc att 1104 Leu Ala Glu Leu Phe Ala Ile Ala Arg Ala Glu Gly Ser Gln Ile Ile
355 360 365
cac ggc ggc acc ctg ggc gac ggc ccc ggc tgt ttc gtc gaa ccc acg 1152 His Gly Gly Thr Leu Gly Asp Gly Pro Gly Cys Phe Val Glu Pro Thr
370 375 380
gta atc ctc gcc cgg tca gcc tgc gac acc ctg ctc acc cag gaa acc 1200 Val Ile Leu Ala Arg Ser Ala Cys Asp Thr Leu Leu Thr Gln Glu Thr385 390 395 400
ttc ggc cct gtg gcg acc ttc ctg cct tat gac gac gaa gac gaa ctg 1248 Phe Gly Pro Val Ala Thr Phe Leu Pro Tyr Asp Asp Glu Asp Glu Leu
405 410 415
ctg cac ctg atg aac gcc tcg ccc tac ggc ctc agc gcc agc ctg tgg 1296 Leu His Leu Met Asn Ala Ser Pro Tyr Gly Leu Ser Ala Ser Leu Trp
420 425 430 acc aac gac ctg ggc aag gcc atg cgc atg atc ccg caa atc cag gccThr Asn Asp Leu Gly Lys Ala Met Arg Met Ile Pro Gln Ile Gln Ala435 440 445 1344 ggt acc ctg tgg gtc aac atg cac acc ctg ctc gac ccg gct gta ccgGly Thr Leu Trp Val Asn Met His Thr Leu Leu Asp Pro Ala Val Pro 450 455 460 1392 ttt ggg ggc atc aag gct tcc ggc att ggc cgc gag ttc ggc tcg gccPhe Gly Gly Ile Lys Ala Ser Gly Ile Gly Arg Glu Phe Gly Ser Ala465 470 475 480 1440 ttc atc gat gac ttc acc gag ctc aag tcg gtg atg atc cgc tac tgaPhe Ile Asp Asp Phe Thr Glu Leu Lys Ser Val Met Ile Arg Tyr485 490 495 1488 <210> 8 <211> 495 <212> PRT <213> Pseudomonas putida U <400> 8 Met Ser Asp Ile Thr Leu Leu Pro Ala Val Thr Ala Phe Leu Ala Arg1 5 10 15 Glu His Gly Val Phe Ile His Gly Gln His Leu Ala Ser Gln Ser Ser20 25 30 Ser Asn Ile Ala Val Val Asn Pro Ala Asn Gly Gln Thr Ile Ala His35 40 45 Ile Ala Asp Ala Asn Gln Ala Asp Val Asp His Ala Val Ser Ser Ser50 55 60 Arg Gln Gly Phe Thr Ala Trp Ser His Thr Ser Pro Ala Ala Arg Ala65 70 75 80 Ala Val Leu Phe Lys Leu Ala Asp Leu Leu Glu Ala Asn Arg Glu Glu85 90 95 Leu Ala Gln Leu Glu Thr Leu Gln Ser Gly Lys Leu Ile Gly Ile Ser100 105 110 Arg Ala Phe Glu Val Gln Gln Ala Ala His Phe Leu Arg Tyr Tyr Ala115 120 125 Gly Trp Ala Thr Lys Ile Thr Gly Gln Thr Ile Thr Pro Ser Leu Pro130 135 140 Ser Phe Ala Gly Glu Arg Tyr Ser Ala Phe Thr Leu Arg Glu Pro Ile145 150 155 160 Gly Val Val Val Gly Ile Val Pro Trp Asn Phe Ala Ser Met Ile Ala165 170 175Ile Trp Lys Leu Ala Ser Ala Leu Thr Thr Gly Cys Ser Ile Ile Leu180 185 190
Lys Pro Ser Glu Phe Thr Pro Leu Thr Leu Leu Arg Ile Ala Glu Leu195 200 205
Ala Thr Glu Ala Gly Leu Pro Ala Gly Ala Leu Asn Val Leu Thr Gly210 215 220
Gly Gly Leu Val Gly Lys Ala Leu Ile Glu His Ala Gly Thr Asp Lys225 230 235 240
Val Ser Phe Thr Gly Ser Val Pro Thr Gly Ile Ala Val Gly Gln Ala245 250 255
Ala Met Gly Ala Lys Leu Thr Arg Ala Thr Leu Glu Leu Gly Gly Lys260 265 270
Asn Ala Val Ala Phe Leu Pro Asp Val Ala Thr Asp Lys Ala Val Asp275 280 285
Gly Ile Ile Glu Ala Gly Phe Leu His Ser Gly Gln Ile Cys Ala Ala290 295 300
Gly Glu Arg Phe Tyr Val His Arg Ser Arg Ile Asp Pro Leu Leu Asp305 310 315 320
Ala Leu Ser Gln Arg Leu Gly Gln Leu Lys Ile Gly Ser Pro Leu Asp325 330 335
Glu Ala Thr Gln Phe Gly Pro Val Ala Asn Lys Pro His Gln Gln Lys340 345 350
Leu Ala Glu Leu Phe Ala Ile Ala Arg Ala Glu Gly Ser Gln Ile Ile355 360 365
His Gly Gly Thr Leu Gly Asp Gly Pro Gly Cys Phe Val Glu Pro Thr370 375 380
Val Ile Leu Ala Arg Ser Ala Cys Asp Thr Leu Leu Thr Gln Glu Thr385 390 395 400
Phe Gly Pro Val Ala Thr Phe Leu Pro Tyr Asp Asp Glu Asp Glu Leu405 410 415
Leu His Leu Met Asn Ala Ser Pro Tyr Gly Leu Ser Ala Ser Leu Trp420 425 430
Thr Asn Asp Leu Gly Lys Ala Met Arg Met Ile Pro Gln Ile Gln Ala435 440 445
Gly Thr Leu Trp Val Asn Met His Thr Leu Leu Asp Pro Ala Val Pro450 455 460
Phe Gly Gly Ile Lys Ala Ser Gly Ile Gly Arg Glu Phe Gly Ser Ala465 470 475 480
Phe Ile Asp Asp Phe Thr Glu Leu Lys Ser Val Met Ile Arg Tyr
485 490 495
<210> 9
<211> 942
<212> DNA
<213> Pseudomonas putida U
<220>
<221> CDS
<222> (1) .. (942)
<223> TynR
<400> 9 atg cat act caa caa tcc aac cgt cag ggg ctg gaa cgc tgg acc acg 48 Met His Thr Gln Gln Ser Asn Arg Gln Gly Leu Glu Arg Trp Thr Thr1 5 10 15
gcc atg caa cag atc tgt ggc cgt ttc gag acg gaa ctt gcg tcc aat 96 Ala Met Gln Gln Ile Cys Gly Arg Phe Glu Thr Glu Leu Ala Ser Asn
cac tcg ctg ttc atc ggc gag gtt tct acc ttt tcc cgt gcc ggc ttg 144 His Ser Leu Phe Ile Gly Glu Val Ser Thr Phe Ser Arg Ala Gly Leu
ccg ctg gcc aac ctg cgc acc aat gcc ggc aac atc cgc cgg ctg ggc 192 Pro Leu Ala Asn Leu Arg Thr Asn Ala Gly Asn Ile Arg Arg Leu Gly
50 55 60
gaa aac ccg acc ctt gac gat gac cag cat tgt ttc ctg gtc agc cag 240 Glu Asn Pro Thr Leu Asp Asp Asp Gln His Cys Phe Leu Val Ser Gln65 70 75 80
cgt gcg ggg cat tcc acc gtg tcc cag ggg ggc atg cag gtc agc ctg 288 Arg Ala Gly His Ser Thr Val Ser Gln Gly Gly Met Gln Val Ser Leu
85 90 95
gcg ccg ggt gag ctg ctg ctg atg gat tcg gtc ggg cgc tgc gaa atc 336 Ala Pro Gly Glu Leu Leu Leu Met Asp Ser Val Gly Arg Cys Glu Ile
100 105 110
acc ccc agt ggg ttg atc gaa cat gtc tcg ctg gcc ctg tcg cgt gag 384 Thr Pro Ser Gly Leu Ile Glu His Val Ser Leu Ala Leu Ser Arg Glu
115 120 125
cag gta cgc aag tat gtg caa ggc agc ggc ccg atg ttt ggc aag atc 432 Gln Val Arg Lys Tyr Val Gln Gly Ser Gly Pro Met Phe Gly Lys Ile
130 135 140
tcc tcg agc aac gcc tgc ggg cgc atg ctg cat gtg ctg atg gac caa 480 Ser Ser Ser Asn Ala Cys Gly Arg Met Leu His Val Leu Met Asp Gln145 150 155 160
ctg tgc aag gac ggc aat gta agc ggt gat ggg gcc cag ggc gac gcg 528 Leu Cys Lys Asp Gly Asn Val Ser Gly Asp Gly Ala Gln Gly Asp Ala
165 170 175
ctg cag acc gcc ttc att gcc ctg ctg gag cca ggc ttc gag cgc cat 576
Leu Gln Thr Ala Phe Ile Ala Leu Leu Glu Pro Gly Phe Glu Arg His180 185 190
ggc gaa gcg ctg ggc aac ctt ggg gcc ttg aac ggg gcc aac ctg cgg 624 Gly Glu Ala Leu Gly Asn Leu Gly Ala Leu Asn Gly Ala Asn Leu Arg
195 200 205
ggc tac gtg cag cag gtg atc gac gag tcc ctg tca cag ccc ggg ctg 672 Gly Tyr Val Gln Gln Val Ile Asp Glu Ser Leu Ser Gln Pro Gly Leu
210 215 220
acc ccg tcc aac ctg gcc ggt cgc ctg aac atc tcg gtg cgt cac ctg 720 Thr Pro Ser Asn Leu Ala Gly Arg Leu Asn Ile Ser Val Arg His Leu225 230 235 240
tac cgg ctg ttc gag gag gag ggc gat agt gtg tgc cgc tac att cag 768 Tyr Arg Leu Phe Glu Glu Glu Gly Asp Ser Val Cys Arg Tyr Ile Gln
245 250 255
cgg gcg cgc ctg aag cgc agt gcg gat gac ctg gcc aac ccg ttc ttc 816 Arg Ala Arg Leu Lys Arg Ser Ala Asp Asp Leu Ala Asn Pro Phe Phe
260 265 270
agg agc gag tcg att acc tcg att gcc tac aag tgg ggg ttt acc gac 864 Arg Ser Glu Ser Ile Thr Ser Ile Ala Tyr Lys Trp Gly Phe Thr Asp
275 280 285
tcg gcg cat ttc agc cgc tcg ttc aag aaa cag ttc gaa cgc tcg ccc 912 Ser Ala His Phe Ser Arg Ser Phe Lys Lys Gln Phe Glu Arg Ser Pro
290 295 300
aag gac tac cgg gcg cag gcg atg gtt tga 942 Lys Asp Tyr Arg Ala Gln Ala Met Val305 310
<210> 10
<211> 313
<212> PRT
<213> Pseudomonas putida U
<400> 10
Met His Thr Gln Gln Ser Asn Arg Gln Gly Leu Glu Arg Trp Thr Thr1 5 10 15
Ala Met Gln Gln Ile Cys Gly Arg Phe Glu Thr Glu Leu Ala Ser Asn20 25 30
His Ser Leu Phe Ile Gly Glu Val Ser Thr Phe Ser Arg Ala Gly Leu35 40 45
Pro Leu Ala Asn Leu Arg Thr Asn Ala Gly Asn Ile Arg Arg Leu Gly50 55 60
Glu Asn Pro Thr Leu Asp Asp Asp Gln His Cys Phe Leu Val Ser Gln65 70 75 80
Arg Ala Gly His Ser Thr Val Ser Gln Gly Gly Met Gln Val Ser Leu
85 90 95
Ala Pro Gly Glu Leu Leu Leu Met Asp Ser Val Gly Arg Cys Glu Ile100 105 110
Thr Pro Ser Gly Leu Ile Glu His Val Ser Leu Ala Leu Ser Arg Glu115 120 125
Gln Val Arg Lys Tyr Val Gln Gly Ser Gly Pro Met Phe Gly Lys Ile130 135 140
Ser Ser Ser Asn Ala Cys Gly Arg Met Leu His Val Leu Met Asp Gln145 150 155 160
Leu Cys Lys Asp Gly Asn Val Ser Gly Asp Gly Ala Gln Gly Asp Ala165 170 175
Leu Gln Thr Ala Phe Ile Ala Leu Leu Glu Pro Gly Phe Glu Arg His180 185 190
Gly Glu Ala Leu Gly Asn Leu Gly Ala Leu Asn Gly Ala Asn Leu Arg195 200 205
Gly Tyr Val Gln Gln Val Ile Asp Glu Ser Leu Ser Gln Pro Gly Leu 210 215 220
Thr Pro Ser Asn Leu Ala Gly Arg Leu Asn Ile Ser Val Arg His Leu225 230 235 240
Tyr Arg Leu Phe Glu Glu Glu Gly Asp Ser Val Cys Arg Tyr Ile Gln245 250 255
Arg Ala Arg Leu Lys Arg Ser Ala Asp Asp Leu Ala Asn Pro Phe Phe260 265 270
Arg Ser Glu Ser Ile Thr Ser Ile Ala Tyr Lys Trp Gly Phe Thr Asp275 280 285
Ser Ala His Phe Ser Arg Ser Phe Lys Lys Gln Phe Glu Arg Ser Pro290 295 300
Lys Asp Tyr Arg Ala Gln Ala Met Val305 310
<210> 11
<211> 1335
<212> DNA
<213> Pseudomonas putida U
<220>
<221> CDS
<222> (1) .. (1335)
<223> TynD
<400> 11 atg att tct acc cat ttc cag tca gca ggc ggc gtc atg atc acc ctc 48 Met Ile Ser Thr His Phe Gln Ser Ala Gly Gly Val Met Ile Thr Leu1 5 10 15 gag agc ccc acc tac tac tcg gcg acc aag aag tac aac ctg agc ttc 96 Glu Ser Pro Thr Tyr Tyr Ser Ala Thr Lys Lys Tyr Asn Leu Ser Phe
cca acc ctg gaa cgc gac atc gag gcc gat gtc gtg gtg atc ggt ggc 144 Pro Thr Leu Glu Arg Asp Ile Glu Ala Asp Val Val Val Ile Gly Gly
ggt ttc tcc ggc att aac acc gcc ctg gag ctg gcg gaa caa ggc gtg 192 Gly Phe Ser Gly Ile Asn Thr Ala Leu Glu Leu Ala Glu Gln Gly Val
50 55 60
acc aat atc gtc gtg ctc gag ggc cgt tat ctg ggc tac ggc ggt agc 240 Thr Asn Ile Val Val Leu Glu Gly Arg Tyr Leu Gly Tyr Gly Gly Ser65 70 75 80
ggg cgc aac ggc ggg cag atc atg gcg ggc atc ggc cac gac ctg gag 288 Gly Arg Asn Gly Gly Gln Ile Met Ala Gly Ile Gly His Asp Leu Glu
85 90 95
aag atc cgc agc agc gtc ggt gat cag ggc gtg agg gat att ttc gaa 336 Lys Ile Arg Ser Ser Val Gly Asp Gln Gly Val Arg Asp Ile Phe Glu
100 105 110
atc agc gaa ctg ggc gcc ggc atc atc aag gac cgc atc gcc cgc tac 384 Ile Ser Glu Leu Gly Ala Gly Ile Ile Lys Asp Arg Ile Ala Arg Tyr
115 120 125
gcc atc gac gcc gac ttc tgc cac ggc tat ggc tac atg ggc ttc aac 432 Ala Ile Asp Ala Asp Phe Cys His Gly Tyr Gly Tyr Met Gly Phe Asn
130 135 140
cgc cgc cag gaa cag acc ctg cgc aaa tgg gag aag gcg ttc aag gcg 480 Arg Arg Gln Glu Gln Thr Leu Arg Lys Trp Glu Lys Ala Phe Lys Ala145 150 155 160
atc aat acc cgc gac gag atc cgt ttt ctc ggt ggc agc gaa gtg cgc 528 Ile Asn Thr Arg Asp Glu Ile Arg Phe Leu Gly Gly Ser Glu Val Arg
165 170 175
cag atc atc ggc agc aac gcc tac agc agc gcc ctg atg cac atg ggc 576 Gln Ile Ile Gly Ser Asn Ala Tyr Ser Ser Ala Leu Met His Met Gly
180 185 190
ggt ggc cat gtg cat tcc ctg aac ctg ctg ctg ggc gaa gcg cag gca 624 Gly Gly His Val His Ser Leu Asn Leu Leu Leu Gly Glu Ala Gln Ala
195 200 205
ctg gtg ggc cat ggt gcg cgc atc ttc gaa cac agc ccg gcc ctg gaa 672 Leu Val Gly His Gly Ala Arg Ile Phe Glu His Ser Pro Ala Leu Glu
210 215 220
gtg acc tac ggc gag cgc atc acg gta cgc acc ggc cgt ggc tcg gta 720 Val Thr Tyr Gly Glu Arg Ile Thr Val Arg Thr Gly Arg Gly Ser Val225 230 235 240
cgc gcc agc aag ctg ctg tgg gcg tgc gac agc ttc ctc aac aag ctg 768 Arg Ala Ser Lys Leu Leu Trp Ala Cys Asp Ser Phe Leu Asn Lys Leu
245 250 255
gag ccg cag ctg cac gca cgc act ata aac acc tat gcc ttc cag atg 816 Glu Pro Gln Leu His Ala Arg Thr Ile Asn Thr Tyr Ala Phe Gln Met
260 265 270
atg acc gag cca ttg ccg gat gag ctg atc gag cgc atc agc ccg ata 864 Met Thr Glu Pro Leu Pro Asp Glu Leu Ile Glu Arg Ile Ser Pro Ile
275 280 285
cgc ggg gcc tac agc gac atc cgc ccg gtg atc gac tac tac cgg gtc 912 Arg Gly Ala Tyr Ser Asp Ile Arg Pro Val Ile Asp Tyr Tyr Arg Val
290 295 300
acc cgc gag aac cgc ctg ctg ttt ggc gcc gcc acg ccc ttc gtc gag 960 Thr Arg Glu Asn Arg Leu Leu Phe Gly Ala Ala Thr Pro Phe Val Glu305 310 315 320
cac ttc ccg ctg gac ctg aag gcg tgg aac cgc gcg ctg atg ctg aag 1008 His Phe Pro Leu Asp Leu Lys Ala Trp Asn Arg Ala Leu Met Leu Lys
325 330 335
att ttc ccc tac ctg aaa gac gtg cgc atc gac ctg gcc tgg ggc ggc 1056 Ile Phe Pro Tyr Leu Lys Asp Val Arg Ile Asp Leu Ala Trp Gly Gly
340 345 350
ccg atg gcc acc agt gcc aac ctg ttt ccg cag ata ggc acc ctc gac 1104 Pro Met Ala Thr Ser Ala Asn Leu Phe Pro Gln Ile Gly Thr Leu Asp
355 360 365
aac cgc ccc aac gct ttc tat gtg cag ggc tac tcc ggc ttt ggc gtc 1152 Asn Arg Pro Asn Ala Phe Tyr Val Gln Gly Tyr Ser Gly Phe Gly Val
370 375 380
acg ccc agc cac atc att tgc aag ata ctc gcc gag ggt atg cag gag 1200 Thr Pro Ser His Ile Ile Cys Lys Ile Leu Ala Glu Gly Met Gln Glu385 390 395 400
gga tcg aag cgc tat gac ctg gtc agc tcg gtc aag cat gcg cgc atc 1248 Gly Ser Lys Arg Tyr Asp Leu Val Ser Ser Val Lys His Ala Arg Ile
405 410 415
ctc ggc aag gac cat ttc cgc ccg ctg ctg ctc act gcc ggc aag acc 1296 Leu Gly Lys Asp His Phe Arg Pro Leu Leu Leu Thr Ala Gly Lys Thr
420 425 430
gtg cac cag ctg tcg ggc tac ttc aac ggc cgt cgc tga 1335 Val His Gln Leu Ser Gly Tyr Phe Asn Gly Arg Arg
435 440
<210> 12
<211> 444
<212> PRT
<213> Pseudomonas putida U
<400> 12
Met Ile Ser Thr His Phe Gln Ser Ala Gly Gly Val Met Ile Thr Leu 1 5 10 15
Glu Ser Pro Thr Tyr Tyr Ser Ala Thr Lys Lys Tyr Asn Leu Ser Phe20 25 30
Pro Thr Leu Glu Arg Asp Ile Glu Ala Asp Val Val Val Ile Gly Gly35 40 45
Gly Phe Ser Gly Ile Asn Thr Ala Leu Glu Leu Ala Glu Gln Gly Val50 55 60
Thr Asn Ile Val Val Leu Glu Gly Arg Tyr Leu Gly Tyr Gly Gly Ser65 70 75 80
Gly Arg Asn Gly Gly Gln Ile Met Ala Gly Ile Gly His Asp Leu Glu85 90 95
Lys Ile Arg Ser Ser Val Gly Asp Gln Gly Val Arg Asp Ile Phe Glu100 105 110
Ile Ser Glu Leu Gly Ala Gly Ile Ile Lys Asp Arg Ile Ala Arg Tyr115 120 125
Ala Ile Asp Ala Asp Phe Cys His Gly Tyr Gly Tyr Met Gly Phe Asn130 135 140
Arg Arg Gln Glu Gln Thr Leu Arg Lys Trp Glu Lys Ala Phe Lys Ala145 150 155 160
Ile Asn Thr Arg Asp Glu Ile Arg Phe Leu Gly Gly Ser Glu Val Arg165 170 175
Gln Ile Ile Gly Ser Asn Ala Tyr Ser Ser Ala Leu Met His Met Gly180 185 190
Gly Gly His Val His Ser Leu Asn Leu Leu Leu Gly Glu Ala Gln Ala195 200 205
Leu Val Gly His Gly Ala Arg Ile Phe Glu His Ser Pro Ala Leu Glu210 215 220
Val Thr Tyr Gly Glu Arg Ile Thr Val Arg Thr Gly Arg Gly Ser Val225 230 235 240
Arg Ala Ser Lys Leu Leu Trp Ala Cys Asp Ser Phe Leu Asn Lys Leu245 250 255
Glu Pro Gln Leu His Ala Arg Thr Ile Asn Thr Tyr Ala Phe Gln Met260 265 270
Met Thr Glu Pro Leu Pro Asp Glu Leu Ile Glu Arg Ile Ser Pro Ile275 280 285
Arg Gly Ala Tyr Ser Asp Ile Arg Pro Val Ile Asp Tyr Tyr Arg Val290 295 300
Thr Arg Glu Asn Arg Leu Leu Phe Gly Ala Ala Thr Pro Phe Val Glu305 310 315 320
His Phe Pro Leu Asp Leu Lys Ala Trp Asn Arg Ala Leu Met Leu Lys325 330 335
Ile Phe Pro Tyr Leu Lys Asp Val Arg Ile Asp Leu Ala Trp Gly Gly340 345 350
Pro Met Ala Thr Ser Ala Asn Leu Phe Pro Gln Ile Gly Thr Leu Asp355 360 365
Asn Arg Pro Asn Ala Phe Tyr Val Gln Gly Tyr Ser Gly Phe Gly Val370 375 380
Thr Pro Ser His Ile Ile Cys Lys Ile Leu Ala Glu Gly Met Gln Glu385 390 395 400
Gly Ser Lys Arg Tyr Asp Leu Val Ser Ser Val Lys His Ala Arg Ile405 410 415
Leu Gly Lys Asp His Phe Arg Pro Leu Leu Leu Thr Ala Gly Lys Thr420 425 430
Val His Gln Leu Ser Gly Tyr Phe Asn Gly Arg Arg435 440
<210> 13
<211> 1218
<212> DNA
<213> Pseudomonas putida U
<220> <221> CDS
<222> (1) .. (1218)
<223> TynF
<400> 13 atg caa gcc aat ccc tcc cct ccc ata ccc ttc agc ttc gcc ctg ggc 48 Met Gln Ala Asn Pro Ser Pro Pro Ile Pro Phe Ser Phe Ala Leu Gly1 5 10 15
cta ggc ctg atc ggc gcc ctc ggc cct tcc gcc gtc gac atg tac ctg 96 Leu Gly Leu Ile Gly Ala Leu Gly Pro Ser Ala Val Asp Met Tyr Leu
tcg agc ctg ccg gaa atc gcc agc cac tat cag gct agc ttc acc cgc 144 Ser Ser Leu Pro Glu Ile Ala Ser His Tyr Gln Ala Ser Phe Thr Arg
gta cag ctg aca ctg acc ttc ttc ctg ctg gcc atg ggc gcc ggc cag 192 Val Gln Leu Thr Leu Thr Phe Phe Leu Leu Ala Met Gly Ala Gly Gln
50 55 60
ctg atc ttc ggc ccc atc gtc gac gct tat ggc cgg cgc aag ccg ctg 240 Leu Ile Phe Gly Pro Ile Val Asp Ala Tyr Gly Arg Arg Lys Pro Leu65 70 75 80
ctg gcc ggc ctg ctg ctg ttc atc ctg tgc tcg ctg ggc gca gcc gca 288 Leu Ala Gly Leu Leu Leu Phe Ile Leu Cys Ser Leu Gly Ala Ala Ala
85 90 95
gcc ccc agc ctc gac acc ctg atc atg ctg cgc ttt ttc cag ggc ctg 336 Ala Pro Ser Leu Asp Thr Leu Ile Met Leu Arg Phe Phe Gln Gly Leu
100 105 110
ggc agt gcg ctg acc ctg gtg gtg atc atg agc atg gtg cgt gat gtg 384 Gly Ser Ala Leu Thr Leu Val Val Ile Met Ser Met Val Arg Asp Val
115 120 125
agc cag ggc gtg gcc gcg acc aaa ctg ttc gcc ctg ctg atg acc atc 432 Ser Gln Gly Val Ala Ala Thr Lys Leu Phe Ala Leu Leu Met Thr Ile
130 135 140
gaa ggc gtc gca ccg atc ctg gca cct gcc ctg ggc ggc gtg atc gac 480 Glu Gly Val Ala Pro Ile Leu Ala Pro Ala Leu Gly Gly Val Ile Asp145 150 155 160
gca cat ttc ggc tgg cgt gca gta atg ctg gta ctc gcc ggc atg ggc 528 Ala His Phe Gly Trp Arg Ala Val Met Leu Val Leu Ala Gly Met Gly
165 170 175
gtg acg gtg ctg gtc aac agc ctg ctg aac ctg ccc gaa acc ctg ccg 576 Val Thr Val Leu Val Asn Ser Leu Leu Asn Leu Pro Glu Thr Leu Pro
180 185 190
ccc agc aaa cgc gaa ccc ctg cgc ctg ggc cac gcc tgc agc acc tac 624 Pro Ser Lys Arg Glu Pro Leu Arg Leu Gly His Ala Cys Ser Thr Tyr
195 200 205
ctg gcc atc ctc gcc gac cgc cgc ttc ctg cgc ccg acc ctg gcg gtt 672 Leu Ala Ile Leu Ala Asp Arg Arg Phe Leu Arg Pro Thr Leu Ala Val 210 215 220
gct gcg gta ttc ttc ttc ctg ttc gcc tac atc ggc ggt gcc acc ctg 720 Ala Ala Val Phe Phe Phe Leu Phe Ala Tyr Ile Gly Gly Ala Thr Leu225 230 235 240
gtg tac cag gcc cac tac ggc ctg agc gcc cag gcc ttc ggc ctg ctg 768 Val Tyr Gln Ala His Tyr Gly Leu Ser Ala Gln Ala Phe Gly Leu Leu
245 250 255
ttt ggc gcc acc ggg gtg tcg atc ctg ctc ggc gcc atg acg gcc agc 816 Phe Gly Ala Thr Gly Val Ser Ile Leu Leu Gly Ala Met Thr Ala Ser
260 265 270
cac ctg atc agc cgg ctg ggc ctc aat acc ttg act cgg gtg ggc gtg 864 His Leu Ile Ser Arg Leu Gly Leu Asn Thr Leu Thr Arg Val Gly Val
275 280 285
ctg tgc atg gcc ggc ggt gcc tgc atc agc ctg ctc ggt gca ctg acc 912 Leu Cys Met Ala Gly Gly Ala Cys Ile Ser Leu Leu Gly Ala Leu Thr
290 295 300
ggc ctg ggg ctg cca ggt gtg gcc ggc ggc atg gtg ata gcc ctg ttc 960 Gly Leu Gly Leu Pro Gly Val Ala Gly Gly Met Val Ile Ala Leu Phe305 310 315 320
ggc ctg ggg ata gcc gag tcg acg ctg atg tcg ctg gtg atg gcc tcg 1008 Gly Leu Gly Ile Ala Glu Ser Thr Leu Met Ser Leu Val Met Ala Ser
325 330 335
caa gaa aag gca ctg ggt tcc acc gca gcg ctg ctg ggc gcc atc cag 1056 Gln Glu Lys Ala Leu Gly Ser Thr Ala Ala Leu Leu Gly Ala Ile Gln
340 345 350
ctg tcg gcg tct gcc ggc gcc gcc ccg ctg gcc gca gtg gta ctc aac 1104 Leu Ser Ala Ser Ala Gly Ala Ala Pro Leu Ala Ala Val Val Leu Asn
355 360 365
cac ggc ccg acc gca tgg gcc gcg ctg ctg gcc ctg tgc acc ctg gtg 1152 His Gly Pro Thr Ala Trp Ala Ala Leu Leu Ala Leu Cys Thr Leu Val
370 375 380
gtg tgc ctg ctg acc gcc ctc agc ctg cgc cac acc ccg gcc agc ttc 1200 Val Cys Leu Leu Thr Ala Leu Ser Leu Arg His Thr Pro Ala Ser Phe385 390 395 400
tcg ctc gcg ggc cat tga 1218 Ser Leu Ala Gly His
<210> 14
<211> 405
<212> PRT
<213> Pseudomonas putida U
<400> 14
Met Gln Ala Asn Pro Ser Pro Pro Ile Pro Phe Ser Phe Ala Leu Gly 1 5 10 15
Leu Gly Leu Ile Gly Ala Leu Gly Pro Ser Ala Val Asp Met Tyr Leu20 25 30
Ser Ser Leu Pro Glu Ile Ala Ser His Tyr Gln Ala Ser Phe Thr Arg35 40 45
Val Gln Leu Thr Leu Thr Phe Phe Leu Leu Ala Met Gly Ala Gly Gln50 55 60
Leu Ile Phe Gly Pro Ile Val Asp Ala Tyr Gly Arg Arg Lys Pro Leu65 70 75 80
Leu Ala Gly Leu Leu Leu Phe Ile Leu Cys Ser Leu Gly Ala Ala Ala85 90 95
Ala Pro Ser Leu Asp Thr Leu Ile Met Leu Arg Phe Phe Gln Gly Leu100 105 110
Gly Ser Ala Leu Thr Leu Val Val Ile Met Ser Met Val Arg Asp Val115 120 125
Ser Gln Gly Val Ala Ala Thr Lys Leu Phe Ala Leu Leu Met Thr Ile130 135 140
Glu Gly Val Ala Pro Ile Leu Ala Pro Ala Leu Gly Gly Val Ile Asp145 150 155 160
Ala His Phe Gly Trp Arg Ala Val Met Leu Val Leu Ala Gly Met Gly165 170 175
Val Thr Val Leu Val Asn Ser Leu Leu Asn Leu Pro Glu Thr Leu Pro 180 185 190
Pro Ser Lys Arg Glu Pro Leu Arg Leu Gly His Ala Cys Ser Thr Tyr195 200 205
Leu Ala Ile Leu Ala Asp Arg Arg Phe Leu Arg Pro Thr Leu Ala Val210 215 220
Ala Ala Val Phe Phe Phe Leu Phe Ala Tyr Ile Gly Gly Ala Thr Leu225 230 235 240
Val Tyr Gln Ala His Tyr Gly Leu Ser Ala Gln Ala Phe Gly Leu Leu245 250 255
Phe Gly Ala Thr Gly Val Ser Ile Leu Leu Gly Ala Met Thr Ala Ser260 265 270
His Leu Ile Ser Arg Leu Gly Leu Asn Thr Leu Thr Arg Val Gly Val275 280 285
Leu Cys Met Ala Gly Gly Ala Cys Ile Ser Leu Leu Gly Ala Leu Thr290 295 300
Gly Leu Gly Leu Pro Gly Val Ala Gly Gly Met Val Ile Ala Leu Phe305 310 315 320
Gly Leu Gly Ile Ala Glu Ser Thr Leu Met Ser Leu Val Met Ala Ser325 330 335
Gln Glu Lys Ala Leu Gly Ser Thr Ala Ala Leu Leu Gly Ala Ile Gln340 345 350
Leu Ser Ala Ser Ala Gly Ala Ala Pro Leu Ala Ala Val Val Leu Asn355 360 365
His Gly Pro Thr Ala Trp Ala Ala Leu Leu Ala Leu Cys Thr Leu Val370 375 380
Val Cys Leu Leu Thr Ala Leu Ser Leu Arg His Thr Pro Ala Ser Phe385 390 395 400
Ser Leu Ala Gly His405
<210> 15
<211> 1311
<212> DNA
<213> Pseudomonas putida U
<220>
<221> CDS
<222> (1) .. (1311)
<223> TynE
<400> 15 atg gtc aaa cca cag acg ctg tcc agc ctg gcc ctg gca acc ttg ctg 48 Met Val Lys Pro Gln Thr Leu Ser Ser Leu Ala Leu Ala Thr Leu Leu1 5 10 15
gcc agc cag gcc gcg ccg gcc gtt gag ctg tac gcc gac gat gac agc 96 Ala Ser Gln Ala Ala Pro Ala Val Glu Leu Tyr Ala Asp Asp Asp Ser
cac ctg aac gcc gac atg ctg gcg gta tgg ggc atg ttc aac agc cgc 144 His Leu Asn Ala Asp Met Leu Ala Val Trp Gly Met Phe Asn Ser Arg
aag aac tac gac ggc acc aca ggg ggt tcg acc tgg cgt gaa ggc ttt 192 Lys Asn Tyr Asp Gly Thr Thr Gly Gly Ser Thr Trp Arg Glu Gly Phe
50 55 60
atc aag tat ggc ctc agc ggt gac cag ggc ctg gcc ggc aac ggc acg 240 Ile Lys Tyr Gly Leu Ser Gly Asp Gln Gly Leu Ala Gly Asn Gly Thr65 70 75 80
ctg tac ggc agc ctg aac tgg gtg agc tcg gcc acc tgg ggc gat ggc 288 Leu Tyr Gly Ser Leu Asn Trp Val Ser Ser Ala Thr Trp Gly Asp Gly
85 90 95
gat gcg gcc ggc aac acc gat ggc tcc gaa cgc acc acc aag atc gaa 336 Asp Ala Ala Gly Asn Thr Asp Gly Ser Glu Arg Thr Thr Lys Ile Glu
100 105 110
gac gcc ttc ctc ggc tgg cgc tcg gcc gac ctg ttc ccg gtg ctg ggc 384
Asp Ala Phe Leu Gly Trp Arg Ser Ala Asp Leu Phe Pro Val Leu Gly115 120 125
aag gat gga gtg gac gtt tcc gcc ggc cgc cag acc att cgc ctg ggc 432 Lys Asp Gly Val Asp Val Ser Ala Gly Arg Gln Thr Ile Arg Leu Gly
130 135 140
agt ggt ttt ttg atc aac gac gac ggc ccg aac ctg ggc aac ggc gtc 480 Ser Gly Phe Leu Ile Asn Asp Asp Gly Pro Asn Leu Gly Asn Gly Val145 150 155 160
gcc gac ggt gcg ctg gac cgc ggc ggg gcc tac tac ctg gcc gcc cgc 528 Ala Asp Gly Ala Leu Asp Arg Gly Gly Ala Tyr Tyr Leu Ala Ala Arg
165 170 175
cac gcc ttc gac cgc acc gca atg ctg cgc ctg ggg ggc agc gat ggc 576 His Ala Phe Asp Arg Thr Ala Met Leu Arg Leu Gly Gly Ser Asp Gly
180 185 190
ctg cat ggc agc ctg ctg tgg ctg aaa tcc gac aac cgc gcc cag gcc 624 Leu His Gly Ser Leu Leu Trp Leu Lys Ser Asp Asn Arg Ala Gln Ala
195 200 205
gaa acc gaa ctg gcc gcc ggc acg ctg gac tac acc caa gcc ttg ggc 672 Glu Thr Glu Leu Ala Ala Gly Thr Leu Asp Tyr Thr Gln Ala Leu Gly
210 215 220
acc ctc ggg ctg acc tgg att cac ggc atc gac gtc acc gac caa tgg 720 Thr Leu Gly Leu Thr Trp Ile His Gly Ile Asp Val Thr Asp Gln Trp225 230 235 240
gcc agc gac ttt cag aaa gcc cgc gaa ggc atg gac gtg tat agc gtg 768 Ala Ser Asp Phe Gln Lys Ala Arg Glu Gly Met Asp Val Tyr Ser Val
245 250 255
cgc ggc gaa ggc aac gct ggc atc gac aat gcc agt ttc gcc ttc gaa 816 Arg Gly Glu Gly Asn Ala Gly Ile Asp Asn Ala Ser Phe Ala Phe Glu
260 265 270
tac gcc tgg cag gac aag acc gac ggc ccc gag caa gcc tgg tac ctg 864 Tyr Ala Trp Gln Asp Lys Thr Asp Gly Pro Glu Gln Ala Trp Tyr Leu
275 280 285
cag gcc ggc tac acc ttc gcc gac ctg ccg tgg gca ccg cag gtt acc 912 Gln Ala Gly Tyr Thr Phe Ala Asp Leu Pro Trp Ala Pro Gln Val Thr
290 295 300
tac cgc tac acc cgc tac tcg gca ggc tgg gac gcg ctg ttc agc ggc 960 Tyr Arg Tyr Thr Arg Tyr Ser Ala Gly Trp Asp Ala Leu Phe Ser Gly305 310 315 320
ctg tcc agc ggt tac ggc acc tgg ttc cag ggt gaa gtc gct gcc aac 1008 Leu Ser Ser Gly Tyr Gly Thr Trp Phe Gln Gly Glu Val Ala Ala Asn
325 330 335
tac gcc ggc ccc ttc aac agc aac acg ggt atc cac cat gtg ggc gtg 1056 Tyr Ala Gly Pro Phe Asn Ser Asn Thr Gly Ile His His Val Gly Val
340 345 350
aag gcg aca ccg ctg gaa aat ctc aca gtc ggg gcg ctg tac ttc gacLys Ala Thr Pro Leu Glu Asn Leu Thr Val Gly Ala Leu Tyr Phe Asp355 360 365 1104 ttc gac acc gta cgc acc cgc gaa agc ctc aac ctc gat gcg cgg gagPhe Asp Thr Val Arg Thr Arg Glu Ser Leu Asn Leu Asp Ala Arg Glu370 375 380 1152 ctg gac ctg tat gtg gaa tgg gca gtc aac gag cac ctg ata atc agcLeu Asp Leu Tyr Val Glu Trp Ala Val Asn Glu His Leu Ile Ile Ser385 390 395 400 1200 ccg ctg gtg ggc ctt tac cag ccg cgc aag gac gag agc aac ggc ggcPro Leu Val Gly Leu Tyr Gln Pro Arg Lys Asp Glu Ser Asn Gly Gly405 410 415 1248 aac cag gtg ggc ggg aat ggt acc aat gtg tat agc cag ctg acc gtgAsn Gln Val Gly Gly Asn Gly Thr Asn Val Tyr Ser Gln Leu Thr Val420 425 430 1296 gct gtg ccg ttc tgaAla Val Pro Phe 1311 435 <210> 16 <211> 436 <212> PRT <213> Pseudomonas putida U <400> 16 Met Val Lys Pro Gln Thr Leu Ser Ser Leu Ala Leu Ala Thr Leu Leu1 5 10 15 Ala Ser Gln Ala Ala Pro Ala Val Glu Leu Tyr Ala Asp Asp Asp Ser20 25 30 His Leu Asn Ala Asp Met Leu Ala Val Trp Gly Met Phe Asn Ser Arg35 40 45 Lys Asn Tyr Asp Gly Thr Thr Gly Gly Ser Thr Trp Arg Glu Gly Phe50 55 60 Ile Lys Tyr Gly Leu Ser Gly Asp Gln Gly Leu Ala Gly Asn Gly Thr65 70 75 80 Leu Tyr Gly Ser Leu Asn Trp Val Ser Ser Ala Thr Trp Gly Asp Gly85 90 95 Asp Ala Ala Gly Asn Thr Asp Gly Ser Glu Arg Thr Thr Lys Ile Glu100 105 110 Asp Ala Phe Leu Gly Trp Arg Ser Ala Asp Leu Phe Pro Val Leu Gly115 120 125 Lys Asp Gly Val Asp Val Ser Ala Gly Arg Gln Thr Ile Arg Leu Gly130 135 140Ser Gly Phe Leu Ile Asn Asp Asp Gly Pro Asn Leu Gly Asn Gly Val145 150 155 160
Ala Asp Gly Ala Leu Asp Arg Gly Gly Ala Tyr Tyr Leu Ala Ala Arg165 170 175
His Ala Phe Asp Arg Thr Ala Met Leu Arg Leu Gly Gly Ser Asp Gly180 185 190
Leu His Gly Ser Leu Leu Trp Leu Lys Ser Asp Asn Arg Ala Gln Ala195 200 205
Glu Thr Glu Leu Ala Ala Gly Thr Leu Asp Tyr Thr Gln Ala Leu Gly210 215 220
Thr Leu Gly Leu Thr Trp Ile His Gly Ile Asp Val Thr Asp Gln Trp225 230 235 240
Ala Ser Asp Phe Gln Lys Ala Arg Glu Gly Met Asp Val Tyr Ser Val245 250 255
Arg Gly Glu Gly Asn Ala Gly Ile Asp Asn Ala Ser Phe Ala Phe Glu260 265 270
Tyr Ala Trp Gln Asp Lys Thr Asp Gly Pro Glu Gln Ala Trp Tyr Leu275 280 285
Gln Ala Gly Tyr Thr Phe Ala Asp Leu Pro Trp Ala Pro Gln Val Thr290 295 300
Tyr Arg Tyr Thr Arg Tyr Ser Ala Gly Trp Asp Ala Leu Phe Ser Gly305 310 315 320
Leu Ser Ser Gly Tyr Gly Thr Trp Phe Gln Gly Glu Val Ala Ala Asn325 330 335
Tyr Ala Gly Pro Phe Asn Ser Asn Thr Gly Ile His His Val Gly Val340 345 350
Lys Ala Thr Pro Leu Glu Asn Leu Thr Val Gly Ala Leu Tyr Phe Asp355 360 365
Phe Asp Thr Val Arg Thr Arg Glu Ser Leu Asn Leu Asp Ala Arg Glu370 375 380
Leu Asp Leu Tyr Val Glu Trp Ala Val Asn Glu His Leu Ile Ile Ser385 390 395 400
Pro Leu Val Gly Leu Tyr Gln Pro Arg Lys Asp Glu Ser Asn Gly Gly405 410 415
Asn Gln Val Gly Gly Asn Gly Thr Asn Val Tyr Ser Gln Leu Thr Val420 425 430
Ala Val Pro Phe 435
<210> 17
<211> 1497
<212> DNA
<213> Pseudomonas putida U
<220>
<221> CDS
<222> (1) .. (1497)
<223> TynG
<400> 17 atg tca ctc aat aac aag ctc acc gag cac ctc aac cgc ggc act gtc 48 Met Ser Leu Asn Asn Lys Leu Thr Glu His Leu Asn Arg Gly Thr Val1 5 10 15
ggt ttc ccc acc gca ctg gcc agc act gtc ggg ctg atc atg gcc agc 96 Gly Phe Pro Thr Ala Leu Ala Ser Thr Val Gly Leu Ile Met Ala Ser
ccg gtg atc ctc acc gcg acc atg ggc ttt ggc atc ggc ggc agc gcc 144 Pro Val Ile Leu Thr Ala Thr Met Gly Phe Gly Ile Gly Gly Ser Ala
ttc gcc gtg gcc atg gtc atc gcc gca ctg atg atg ctg gcg cag tcc 192 Phe Ala Val Ala Met Val Ile Ala Ala Leu Met Met Leu Ala Gln Ser
50 55 60
acc acc ttt gcc gag gct gcg tcg atc ctg ccg acc acg ggc tcg gta 240 Thr Thr Phe Ala Glu Ala Ala Ser Ile Leu Pro Thr Thr Gly Ser Val65 70 75 80
tac gac tac atc aac tgt ggc atg ggc cgt ttc ttc gcc att acc ggc 288 Tyr Asp Tyr Ile Asn Cys Gly Met Gly Arg Phe Phe Ala Ile Thr Gly
85 90 95
acg ctg tcg gcc tac ctg atc gtg cat gtg ttc gcc ggt acc gcc gaa 336 Thr Leu Ser Ala Tyr Leu Ile Val His Val Phe Ala Gly Thr Ala Glu
100 105 110
acc atc ctg tcg ggg gtg atg gcg ctg gtg aac ttc gag cac ctc aat 384 Thr Ile Leu Ser Gly Val Met Ala Leu Val Asn Phe Glu His Leu Asn
115 120 125
acc ctg gcg gaa tcc gcc ggc ggt tcg tgg ctg ctg ggg gtg tgc ttc 432 Thr Leu Ala Glu Ser Ala Gly Gly Ser Trp Leu Leu Gly Val Cys Phe
130 135 140
gtg gtg gcg ttt gcg gtg ctc aat gcc ttt ggc gtc agc gcc ttc agc 480 Val Val Ala Phe Ala Val Leu Asn Ala Phe Gly Val Ser Ala Phe Ser145 150 155 160
cgc gcg gaa gtg gtc ctc acc ttc ggc atg tgg acc acc ttg atg gtg 528 Arg Ala Glu Val Val Leu Thr Phe Gly Met Trp Thr Thr Leu Met Val
165 170 175
ttc ggc gtg ctt ggc ctg atc gcc gca ccc gca gtg gaa ctg gac ggc 576 Phe Gly Val Leu Gly Leu Ile Ala Ala Pro Ala Val Glu Leu Asp Gly
180 185 190
ccg ttc ggc gtg tcg ctg gtg ggc acc gac ctg atg acc atc ctc tcg 624
Pro Phe Gly Val Ser Leu Val Gly Thr Asp Leu Met Thr Ile Leu Ser195 200 205
ctg gtc ggc atg gcc atg ttc atg ttc gtt ggc tgc gag ttc gtc acg 672 Leu Val Gly Met Ala Met Phe Met Phe Val Gly Cys Glu Phe Val Thr
210 215 220
ccg ctt gcc ccc gaa ctg cgt cgc tcg gcc tgg gtg ctg ccg cgg gcc 720 Pro Leu Ala Pro Glu Leu Arg Arg Ser Ala Trp Val Leu Pro Arg Ala225 230 235 240
atg gcg ctg ggc ctg ttt ggc gtg gcc agc tgc atg ttc atc tac gga 768 Met Ala Leu Gly Leu Phe Gly Val Ala Ser Cys Met Phe Ile Tyr Gly
245 250 255
gcg gcg atg aag cgc cag gtg gaa aac gtg gtg ctg gat gcc gcc agt 816 Ala Ala Met Lys Arg Gln Val Glu Asn Val Val Leu Asp Ala Ala Ser
260 265 270
ggc gtg cac ctg ctg gac acg ccc atg gcc atc ccg cgc ttc gcc gag 864 Gly Val His Leu Leu Asp Thr Pro Met Ala Ile Pro Arg Phe Ala Glu
275 280 285
cag gtg atg ggt gat att ggc cca gtg tgg ctg ggt atc ggc ttc ctg 912 Gln Val Met Gly Asp Ile Gly Pro Val Trp Leu Gly Ile Gly Phe Leu
290 295 300
ttc gcc ggc gcg gcc acc atc aac acg ctg atg gcc ggt gtg cca cgc 960 Phe Ala Gly Ala Ala Thr Ile Asn Thr Leu Met Ala Gly Val Pro Arg305 310 315 320
att ctt tac ggc atg gcg gtg gac ggc gcg ttg ccc aag gtg ttc acc 1008 Ile Leu Tyr Gly Met Ala Val Asp Gly Ala Leu Pro Lys Val Phe Thr
325 330 335
tac ctg cac ccg cgc ttc aag acg ccg ctg ctg tgc atc ctg gtg gtg 1056 Tyr Leu His Pro Arg Phe Lys Thr Pro Leu Leu Cys Ile Leu Val Val
340 345 350
gcg ttg atc cct tgc ctg cat gcc tgg tac ctg ggc ggc aac ccg gac 1104 Ala Leu Ile Pro Cys Leu His Ala Trp Tyr Leu Gly Gly Asn Pro Asp
355 360 365
aac atc ctg cac ctg gtg ctg gcc gcc gtg tgc gcc tgg agc acc gcc 1152 Asn Ile Leu His Leu Val Leu Ala Ala Val Cys Ala Trp Ser Thr Ala
370 375 380
tac ctg ctg gtg acc ctg tcg gtg gtg ata ttg cgc atc cgc cgc cca 1200 Tyr Leu Leu Val Thr Leu Ser Val Val Ile Leu Arg Ile Arg Arg Pro385 390 395 400
gac ctg ccg cgt gcc tac cgc tcg ccg ctg ttc ccg ttg ccg cag ata 1248 Asp Leu Pro Arg Ala Tyr Arg Ser Pro Leu Phe Pro Leu Pro Gln Ile
405 410 415
ttc tcc agt agc ggt atc ctc atc ggc atg gcg ttc atc aca ccg ccg 1296 Phe Ser Ser Ser Gly Ile Leu Ile Gly Met Ala Phe Ile Thr Pro Pro
420 425 430 ggc atg aac cct gcc gat gtc tac gtg ccg ttc gcc atc atg ctt ggc 1344 Gly Met Asn Pro Ala Asp Val Tyr Val Pro Phe Ala Ile Met Leu Gly
435 440 445
gcc act gcg gcc tat gca ttg ttc tgg acg ctg tgg gtg cag aag gtc 1392 Ala Thr Ala Ala Tyr Ala Leu Phe Trp Thr Leu Trp Val Gln Lys Val
450 455 460
aac ccg ttc aag ccg gcg cgg gtc gag gat gtg ctc gag aaa gag ttt 1440 Asn Pro Phe Lys Pro Ala Arg Val Glu Asp Val Leu Glu Lys Glu Phe465 470 475 480
gct gcc gag cct ggc cac gcc gtg gag cac gtg ctg cat gat cag aaa 1488 Ala Ala Glu Pro Gly His Ala Val Glu His Val Leu His Asp Gln Lys
485 490 495
ttt gcg tga 1497 Phe Ala 165 170 175
<210> 18 <211> 498 <212> PRT <213> Pseudomonas putida U <400> 18 Met Ser Leu Asn Asn Lys Leu Thr Glu His Leu Asn Arg Gly Thr Val1 5 10 15 Gly Phe Pro Thr Ala Leu Ala Ser Thr Val Gly Leu Ile Met Ala Ser20 25 30 Pro Val Ile Leu Thr Ala Thr Met Gly Phe Gly Ile Gly Gly Ser Ala35 40 45 Phe Ala Val Ala Met Val Ile Ala Ala Leu Met Met Leu Ala Gln Ser 50 55 60 Thr Thr Phe Ala Glu Ala Ala Ser Ile Leu Pro Thr Thr Gly Ser Val65 70 75 80 Tyr Asp Tyr Ile Asn Cys Gly Met Gly Arg Phe Phe Ala Ile Thr Gly85 90 95 Thr Leu Ser Ala Tyr Leu Ile Val His Val Phe Ala Gly Thr Ala Glu100 105 110 Thr Ile Leu Ser Gly Val Met Ala Leu Val Asn Phe Glu His Leu Asn115 120 125 Thr Leu Ala Glu Ser Ala Gly Gly Ser Trp Leu Leu Gly Val Cys Phe130 135 140 Val Val Ala Phe Ala Val Leu Asn Ala Phe Gly Val Ser Ala Phe Ser145 150 155 160 Arg Ala Glu Val Val Leu Thr Phe Gly Met Trp Thr Thr Leu Met ValPhe Gly Val Leu Gly Leu Ile Ala Ala Pro Ala Val Glu Leu Asp Gly180 185 190
Pro Phe Gly Val Ser Leu Val Gly Thr Asp Leu Met Thr Ile Leu Ser195 200 205
Leu Val Gly Met Ala Met Phe Met Phe Val Gly Cys Glu Phe Val Thr210 215 220
Pro Leu Ala Pro Glu Leu Arg Arg Ser Ala Trp Val Leu Pro Arg Ala225 230 235 240
Met Ala Leu Gly Leu Phe Gly Val Ala Ser Cys Met Phe Ile Tyr Gly245 250 255
Ala Ala Met Lys Arg Gln Val Glu Asn Val Val Leu Asp Ala Ala Ser260 265 270
Gly Val His Leu Leu Asp Thr Pro Met Ala Ile Pro Arg Phe Ala Glu275 280 285
Gln Val Met Gly Asp Ile Gly Pro Val Trp Leu Gly Ile Gly Phe Leu290 295 300
Phe Ala Gly Ala Ala Thr Ile Asn Thr Leu Met Ala Gly Val Pro Arg305 310 315 320
Ile Leu Tyr Gly Met Ala Val Asp Gly Ala Leu Pro Lys Val Phe Thr325 330 335
Tyr Leu His Pro Arg Phe Lys Thr Pro Leu Leu Cys Ile Leu Val Val340 345 350
Ala Leu Ile Pro Cys Leu His Ala Trp Tyr Leu Gly Gly Asn Pro Asp355 360 365
Asn Ile Leu His Leu Val Leu Ala Ala Val Cys Ala Trp Ser Thr Ala370 375 380
Tyr Leu Leu Val Thr Leu Ser Val Val Ile Leu Arg Ile Arg Arg Pro385 390 395 400
Asp Leu Pro Arg Ala Tyr Arg Ser Pro Leu Phe Pro Leu Pro Gln Ile405 410 415
Phe Ser Ser Ser Gly Ile Leu Ile Gly Met Ala Phe Ile Thr Pro Pro420 425 430
Gly Met Asn Pro Ala Asp Val Tyr Val Pro Phe Ala Ile Met Leu Gly435 440 445
Ala Thr Ala Ala Tyr Ala Leu Phe Trp Thr Leu Trp Val Gln Lys Val450 455 460
Asn Pro Phe Lys Pro Ala Arg Val Glu Asp Val Leu Glu Lys Glu Phe465 470 475 480
Ala Ala Glu Pro Gly His Ala Val Glu His Val Leu His Asp Gln Lys485 490 495
Phe Ala
<210> 19
<211> 1170
<212> DNA
<213> Pseudomonas putida U
<220>
<221> CDS
<222> (1) .. (1170)
<223> HpaB
<400> 19 atg aaa aag cca aac ccc ctg ctg gaa gac ctg aag tcc gtc ctg ccg 48 Met Lys Lys Pro Asn Pro Leu Leu Glu Asp Leu Lys Ser Val Leu Pro1 5 10 15
acc att gcc gcc aat gcc atg cgt gca gag cag gac cgc agt gtg ccg 96 Thr Ile Ala Ala Asn Ala Met Arg Ala Glu Gln Asp Arg Ser Val Pro
gca gag aat atc gcc ttg ctg aaa agc atc ggc atg cac cgc gct ttc 144 Ala Glu Asn Ile Ala Leu Leu Lys Ser Ile Gly Met His Arg Ala Phe
ttg ccc aaa cac ttc ggc ggc atg gaa atc acc ctg ccg gag ttc gcc 192 Leu Pro Lys His Phe Gly Gly Met Glu Ile Thr Leu Pro Glu Phe Ala
50 55 60
cag tgc atc gcc ttg ctg gcg ggg gcc tgc gcc agc aca gcc tgg gcc 240 Gln Cys Ile Ala Leu Leu Ala Gly Ala Cys Ala Ser Thr Ala Trp Ala65 70 75 80
atg agc ctg ctg tgc acc cac agc cac cag atg gca atg ttc tcg ccc 288 Met Ser Leu Leu Cys Thr His Ser His Gln Met Ala Met Phe Ser Pro
85 90 95
aag cta caa cag gag gtg tgg ggt agc gac ccg gat gct acc gcc agc 336 Lys Leu Gln Gln Glu Val Trp Gly Ser Asp Pro Asp Ala Thr Ala Ser
100 105 110
agc agt atc gcg ccg ttc ggc cgc act gaa gag gtt gag ggt ggc gtg 384 Ser Ser Ile Ala Pro Phe Gly Arg Thr Glu Glu Val Glu Gly Gly Val
115 120 125
tcg ttc agc ggc gaa atg ggc tgg agt tcc ggt tgc gac cac gcc gaa 432 Ser Phe Ser Gly Glu Met Gly Trp Ser Ser Gly Cys Asp His Ala Glu
130 135 140
tgg gcg att ctc ggt ttc cgc cgc aag aat gcc gaa ggc gct cag gat 480 Trp Ala Ile Leu Gly Phe Arg Arg Lys Asn Ala Glu Gly Ala Gln Asp145 150 155 160
tac tgc ttc gcc atc ctg cct cgc agt gac tat gaa atc cgt gat gac 528
Tyr Cys Phe Ala Ile Leu Pro Arg Ser Asp Tyr Glu Ile Arg Asp Asp165 170 175
tgg tat gcc gtg ggc atg cgc ggc agc ggc agc aag acc ctg atc gtg 576 Trp Tyr Ala Val Gly Met Arg Gly Ser Gly Ser Lys Thr Leu Ile Val
180 185 190
cgt gat gcc ttc gtg ccc gag cac cgc atc cag aag gcc aag gac atg 624 Arg Asp Ala Phe Val Pro Glu His Arg Ile Gln Lys Ala Lys Asp Met
195 200 205
atg gag ggc aag tcg gcg ggc ttt ggt ttg tac ccc gac agc aag att 672 Met Glu Gly Lys Ser Ala Gly Phe Gly Leu Tyr Pro Asp Ser Lys Ile
210 215 220
ttc ttc gcc ccg tat cgc ccg tat ttt gcc agc ggc ttc tcc acg gtc 720 Phe Phe Ala Pro Tyr Arg Pro Tyr Phe Ala Ser Gly Phe Ser Thr Val225 230 235 240
agc ttg ggc gtt gcc gag cgc atg ctg gag gtg ttc cgc gag aaa acc 768 Ser Leu Gly Val Ala Glu Arg Met Leu Glu Val Phe Arg Glu Lys Thr
245 250 255
cgc aac cgc gtg cgt gcc tac acc ggt gct gcc gtg ggc gcc gcc acc 816 Arg Asn Arg Val Arg Ala Tyr Thr Gly Ala Ala Val Gly Ala Ala Thr
260 265 270
ccg gcg ctg atg cgc ctg gcc gag tcg acc cat cag gtg gcc gct gcc 864 Pro Ala Leu Met Arg Leu Ala Glu Ser Thr His Gln Val Ala Ala Ala
275 280 285
cgg gca ttg ctg gaa aag agc tgg gac gag att gcc gag cac agt gcc 912 Arg Ala Leu Leu Glu Lys Ser Trp Asp Glu Ile Ala Glu His Ser Ala
290 295 300
cgt cac gaa tac ccg tcg cgt ggc acg ctg gcg ttc tgg cgt acc aac 960 Arg His Glu Tyr Pro Ser Arg Gly Thr Leu Ala Phe Trp Arg Thr Asn305 310 315 320
cag ggc tac gcc gtg aag atg tgc atc cag gcc gtc gac cgc ctg atg 1008 Gln Gly Tyr Ala Val Lys Met Cys Ile Gln Ala Val Asp Arg Leu Met
325 330 335
gaa gcg gcc ggt ggt ggc gcc tgg ttc gag agc aac gaa ctg cag cgg 1056 Glu Ala Ala Gly Gly Gly Ala Trp Phe Glu Ser Asn Glu Leu Gln Arg
340 345 350
ctg ttc cgc gat tcg cac atg acc ggt gcc cat gcc tac acc gat tac 1104 Leu Phe Arg Asp Ser His Met Thr Gly Ala His Ala Tyr Thr Asp Tyr
355 360 365
gac gtg tgt gcg caa atc ctc ggc cgc gag ctg atg ggc ctg gag cct 1152 Asp Val Cys Ala Gln Ile Leu Gly Arg Glu Leu Met Gly Leu Glu Pro
370 375 380
gac ccg gcg atg gtc tga 1170 Asp Pro Ala Met Val385
<210> 20
<211> 389
<212> PRT
<213> Pseudomonas putida U
<400> 20
Met Lys Lys Pro Asn Pro Leu Leu Glu Asp Leu Lys Ser Val Leu Pro1 5 10 15
Thr Ile Ala Ala Asn Ala Met Arg Ala Glu Gln Asp Arg Ser Val Pro20 25 30
Ala Glu Asn Ile Ala Leu Leu Lys Ser Ile Gly Met His Arg Ala Phe35 40 45
Leu Pro Lys His Phe Gly Gly Met Glu Ile Thr Leu Pro Glu Phe Ala50 55 60
Gln Cys Ile Ala Leu Leu Ala Gly Ala Cys Ala Ser Thr Ala Trp Ala65 70 75 80
Met Ser Leu Leu Cys Thr His Ser His Gln Met Ala Met Phe Ser Pro85 90 95
Lys Leu Gln Gln Glu Val Trp Gly Ser Asp Pro Asp Ala Thr Ala Ser100 105 110
Ser Ser Ile Ala Pro Phe Gly Arg Thr Glu Glu Val Glu Gly Gly Val115 120 125
Ser Phe Ser Gly Glu Met Gly Trp Ser Ser Gly Cys Asp His Ala Glu130 135 140
Trp Ala Ile Leu Gly Phe Arg Arg Lys Asn Ala Glu Gly Ala Gln Asp145 150 155 160
Tyr Cys Phe Ala Ile Leu Pro Arg Ser Asp Tyr Glu Ile Arg Asp Asp165 170 175
Trp Tyr Ala Val Gly Met Arg Gly Ser Gly Ser Lys Thr Leu Ile Val180 185 190
Arg Asp Ala Phe Val Pro Glu His Arg Ile Gln Lys Ala Lys Asp Met195 200 205
Met Glu Gly Lys Ser Ala Gly Phe Gly Leu Tyr Pro Asp Ser Lys Ile210 215 220
Phe Phe Ala Pro Tyr Arg Pro Tyr Phe Ala Ser Gly Phe Ser Thr Val225 230 235 240
Ser Leu Gly Val Ala Glu Arg Met Leu Glu Val Phe Arg Glu Lys Thr245 250 255
Arg Asn Arg Val Arg Ala Tyr Thr Gly Ala Ala Val Gly Ala Ala Thr260 265 270
Pro Ala Leu Met Arg Leu Ala Glu Ser Thr His Gln Val Ala Ala Ala275 280 285 Arg Ala Leu Leu Glu Lys Ser Trp Asp Glu Ile Ala Glu His Ser Ala 290 295 300 Arg His Glu Tyr Pro Ser Arg Gly Thr Leu Ala Phe Trp Arg Thr Asn305 310 315 320 Gln Gly Tyr Ala Val Lys Met Cys Ile Gln Ala Val Asp Arg Leu Met325 330 335 Glu Ala Ala Gly Gly Gly Ala Trp Phe Glu Ser Asn Glu Leu Gln Arg340 345 350 Leu Phe Arg Asp Ser His Met Thr Gly Ala His Ala Tyr Thr Asp Tyr355 360 365 Asp Val Cys Ala Gln Ile Leu Gly Arg Glu Leu Met Gly Leu Glu Pro370 375 380 Asp Pro Ala Met Val385 <210> 21 <211> 930 <212> DNA <213> Pseudmonas putida U <220> <221> CDS <222> (1) .. (930) <223> hpaC <400> 21 atg tcc aaa gaa acc ttc gat tca cgt gcc ttc cgc cgc gcc ctg ggcMet Ser Lys Glu Thr Phe Asp Ser Arg Ala Phe Arg Arg Ala Leu Gly1 5 10 15 48 aac ttc gcc acc ggc gtg acc gtg gtg act gcc gcc ggc ccc agt ggcAsn Phe Ala Thr Gly Val Thr Val Val Thr Ala Ala Gly Pro Ser Gly20 25 30 96 cgc aag gtc ggc gtt acc gcc aac agc ttc aac tcg gtg tcg ctg gacArg Lys Val Gly Val Thr Ala Asn Ser Phe Asn Ser Val Ser Leu Asp35 40 45 144 ccg gcg ctg atc ctg tgg agc atc gac aag cgc tcc acc agc cat gaaPro Ala Leu Ile Leu Trp Ser Ile Asp Lys Arg Ser Thr Ser His Glu50 55 60 192 gtg ttc gaa gag gcc tcg cac ttt gcc gtg aac att ctg gct gcg gacVal Phe Glu Glu Ala Ser His Phe Ala Val Asn Ile Leu Ala Ala Asp65 70 75 80 240 cag atc gac ctg tcc aac aac ttt gcc cgc ccg aag gaa gat cgc ttt Gln Ile Asp Leu Ser Asn Asn Phe Ala Arg Pro Lys Glu Asp Arg Phe85 90 95 288gcc ggt atc gac tac gag acc ggc act ggc ggc gcg ccg ttg ttc gcc 336 Ala Gly Ile Asp Tyr Glu Thr Gly Thr Gly Gly Ala Pro Leu Phe Ala
100 105 110
gat tgc gcg gcg cgc ttt gag tgt gaa aag tac cag cag ctg gac ggt 384 Asp Cys Ala Ala Arg Phe Glu Cys Glu Lys Tyr Gln Gln Leu Asp Gly
115 120 125
ggc gat cac tgg atc ctg gtg ggc aag gta gtg gcc ttt gat gac ttt 432 Gly Asp His Trp Ile Leu Val Gly Lys Val Val Ala Phe Asp Asp Phe
130 135 140
ggc cgc tcg ccg ctg ctg tat cac cag ggc gcc tat tca atg gtg ctg 480 Gly Arg Ser Pro Leu Leu Tyr His Gln Gly Ala Tyr Ser Met Val Leu145 150 155 160
ccg cat acc cgc atg acc caa ggc gca gag ggg cag gca ccg agc agc 528 Pro His Thr Arg Met Thr Gln Gly Ala Glu Gly Gln Ala Pro Ser Ser
165 170 175
cac ttc cag ggc cgc ctg cag cac aac ctg tac tac ctg atg acc cag 576 His Phe Gln Gly Arg Leu Gln His Asn Leu Tyr Tyr Leu Met Thr Gln
180 185 190
gcg ctg cgt gcc tac cag gct gac tac cag cca cgc cag ctg tgt acc 624 Ala Leu Arg Ala Tyr Gln Ala Asp Tyr Gln Pro Arg Gln Leu Cys Thr
195 200 205
ggc ctg cgc acc agc gag gca cgc atg ctg atg gtg ctg gag aac gat 672 Gly Leu Arg Thr Ser Glu Ala Arg Met Leu Met Val Leu Glu Asn Asp
210 215 220
gcg ggc ctg agc ctg aac gac ctg caa cgc gaa gtg gcg atg ccg gcg 720 Ala Gly Leu Ser Leu Asn Asp Leu Gln Arg Glu Val Ala Met Pro Ala225 230 235 240
cgg gag atc gag gaa gcg gtt gcc aac ctc aag cgc aaa ggg ctg att 768 Arg Glu Ile Glu Glu Ala Val Ala Asn Leu Lys Arg Lys Gly Leu Ile
245 250 255
gcc gat gac gaa ggg cga gtg cgg cta tcg gtg aag ggc gtg gac gag 816 Ala Asp Asp Glu Gly Arg Val Arg Leu Ser Val Lys Gly Val Asp Glu
260 265 270
acc gag gcg ttg tgg acc att gcc cgg caa cag cag gac aag gtg ttc 864 Thr Glu Ala Leu Trp Thr Ile Ala Arg Gln Gln Gln Asp Lys Val Phe
275 280 285
ggg cag ttc agt gaa cag cag ctg gag act ttc aag acc gtg ctc aag 912 Gly Gln Phe Ser Glu Gln Gln Leu Glu Thr Phe Lys Thr Val Leu Lys
290 295 300
gcc ctt atc aac atc tga 930 Ala Leu Ile Asn Ile 305
<210> 22
<211> 309
<212> PRT
<213> Pseudmonas putida U
<400> 22 Met Ser Lys Glu Thr Phe Asp Ser Arg Ala Phe Arg Arg Ala Leu Gly1 5 10 15
Asn Phe Ala Thr Gly Val Thr Val Val Thr Ala Ala Gly Pro Ser Gly20 25 30
Arg Lys Val Gly Val Thr Ala Asn Ser Phe Asn Ser Val Ser Leu Asp35 40 45
Pro Ala Leu Ile Leu Trp Ser Ile Asp Lys Arg Ser Thr Ser His Glu50 55 60
Val Phe Glu Glu Ala Ser His Phe Ala Val Asn Ile Leu Ala Ala Asp65 70 75 80
Gln Ile Asp Leu Ser Asn Asn Phe Ala Arg Pro Lys Glu Asp Arg Phe85 90 95
Ala Gly Ile Asp Tyr Glu Thr Gly Thr Gly Gly Ala Pro Leu Phe Ala100 105 110
Asp Cys Ala Ala Arg Phe Glu Cys Glu Lys Tyr Gln Gln Leu Asp Gly115 120 125
Gly Asp His Trp Ile Leu Val Gly Lys Val Val Ala Phe Asp Asp Phe130 135 140
Gly Arg Ser Pro Leu Leu Tyr His Gln Gly Ala Tyr Ser Met Val Leu145 150 155 160
Pro His Thr Arg Met Thr Gln Gly Ala Glu Gly Gln Ala Pro Ser Ser165 170 175
His Phe Gln Gly Arg Leu Gln His Asn Leu Tyr Tyr Leu Met Thr Gln180 185 190
Ala Leu Arg Ala Tyr Gln Ala Asp Tyr Gln Pro Arg Gln Leu Cys Thr195 200 205
Gly Leu Arg Thr Ser Glu Ala Arg Met Leu Met Val Leu Glu Asn Asp210 215 220
Ala Gly Leu Ser Leu Asn Asp Leu Gln Arg Glu Val Ala Met Pro Ala225 230 235 240
Arg Glu Ile Glu Glu Ala Val Ala Asn Leu Lys Arg Lys Gly Leu Ile245 250 255
Ala Asp Asp Glu Gly Arg Val Arg Leu Ser Val Lys Gly Val Asp Glu260 265 270
Thr Glu Ala Leu Trp Thr Ile Ala Arg Gln Gln Gln Asp Lys Val Phe275 280 285
Gly Gln Phe Ser Glu Gln Gln Leu Glu Thr Phe Lys Thr Val Leu Lys290 295 300
Ala Leu Ile Asn Ile 305
<210> 23
<211> 924
<212> DNA
<213> Pseudomonas putida U
<220>
<221> CDS
<222> (1) .. (924)
<223> hpaD
<400> 23 atg ggc aaa ctc gct ctc act gcc aag att acc cat gta ccg tcc atg 48 Met Gly Lys Leu Ala Leu Thr Ala Lys Ile Thr His Val Pro Ser Met1 5 10 15
tac atg tcc gaa ctg cca ggc ccg cgc caa ggc ttt cgc cag gcg gcc 96 Tyr Met Ser Glu Leu Pro Gly Pro Arg Gln Gly Phe Arg Gln Ala Ala
atc gac ggg cat cac gaa atc agc cgc cgt tgc cgt gag ctg ggc gtg 144 Ile Asp Gly His His Glu Ile Ser Arg Arg Cys Arg Glu Leu Gly Val
gac acc atc gtc gtg ttc gac acg cac tgg ctg gtc aac gcc aac tac 192 Asp Thr Ile Val Val Phe Asp Thr His Trp Leu Val Asn Ala Asn Tyr
50 55 60
cac gtg ctg tgc ggg ccg cat ttc gag ggc gtg tac acc agc aac gaa 240 His Val Leu Cys Gly Pro His Phe Glu Gly Val Tyr Thr Ser Asn Glu65 70 75 80
ctg ccg cac ttc atc agc aac atg ccc tac gca ttc ccc ggc aat ccc 288 Leu Pro His Phe Ile Ser Asn Met Pro Tyr Ala Phe Pro Gly Asn Pro
85 90 95
gag ctg ggc aag ctg ctg gcc gag gag tgc aac cgc ttc aac gtc gaa 336 Glu Leu Gly Lys Leu Leu Ala Glu Glu Cys Asn Arg Phe Asn Val Glu
100 105 110
acc atg gcc cac cac gcc acc acc ctc gcc ccg gaa tac ggc acc ctg 384 Thr Met Ala His His Ala Thr Thr Leu Ala Pro Glu Tyr Gly Thr Leu
115 120 125
gtg ccc atg cgc tac atg aac cag gac cag cac ttc aaa gtg gtc tcg 432 Val Pro Met Arg Tyr Met Asn Gln Asp Gln His Phe Lys Val Val Ser
130 135 140
gtc tcg gcc ctg tgc acc tcg cac tac ctg gcc gac agt gcc cgc ctg 480 Val Ser Ala Leu Cys Thr Ser His Tyr Leu Ala Asp Ser Ala Arg Leu145 150 155 160 ggc tgg gcc atg cgc aag gca gta gaa gac cac tac gac ggc acc gtg 528 Gly Trp Ala Met Arg Lys Ala Val Glu Asp His Tyr Asp Gly Thr Val
165 170 175
gcg ttc ctg gcc agc ggc tcg ctg tcg cac cgc ttc gcg cag aac ggc 576 Ala Phe Leu Ala Ser Gly Ser Leu Ser His Arg Phe Ala Gln Asn Gly
180 185 190
cag gcg ccg gac ttt gcc acc aag gtg tgg agc ccg ttc ctc gaa acc 624 Gln Ala Pro Asp Phe Ala Thr Lys Val Trp Ser Pro Phe Leu Glu Thr
195 200 205
ctc gac cac cgt gtg gtg caa atg tgg cag gac ggc gag tgg gaa gcg 672 Leu Asp His Arg Val Val Gln Met Trp Gln Asp Gly Glu Trp Glu Ala
210 215 220
ttc tgc ggg atg ctg ccg gag tac gcc gcc aaa ggc cac ggt gaa ggc 720 Phe Cys Gly Met Leu Pro Glu Tyr Ala Ala Lys Gly His Gly Glu Gly225 230 235 240
ttc atg cac gac acg gca atg ctg ctg ggt gcg ctg ggc tgg tcc gat 768 Phe Met His Asp Thr Ala Met Leu Leu Gly Ala Leu Gly Trp Ser Asp
245 250 255
tac gac ggc aag gcc gaa gtg gtc acg ccc tac ttc ggc tct tcc ggc 816 Tyr Asp Gly Lys Ala Glu Val Val Thr Pro Tyr Phe Gly Ser Ser Gly
260 265 270
acc ggc cag atc aac gcg atc ttc ccg gtc acc ccg cag gac ggt ggt 864 Thr Gly Gln Ile Asn Ala Ile Phe Pro Val Thr Pro Gln Asp Gly Gly
275 280 285
gcc atc ccc gct gcc cag gcc gcc aac ccg gcc gcc gtg gtg ccc acc 912 Ala Ile Pro Ala Ala Gln Ala Ala Asn Pro Ala Ala Val Val Pro Thr
290 295 300
agc cgc ctg taa 924 Ser Arg Leu305
<210> 24
<211> 307
<212> PRT
<213> Pseudomonas putida U
<400> 24
Met Gly Lys Leu Ala Leu Thr Ala Lys Ile Thr His Val Pro Ser Met 1 5 10 15 Tyr Met Ser Glu Leu Pro Gly Pro Arg Gln Gly Phe Arg Gln Ala Ala
Ile Asp Gly His His Glu Ile Ser Arg Arg Cys Arg Glu Leu Gly Val35 40 45
Asp Thr Ile Val Val Phe Asp Thr His Trp Leu Val Asn Ala Asn Tyr50 55 60 His Val Leu Cys Gly Pro His Phe Glu Gly Val Tyr Thr Ser Asn Glu65 70 75 80
Leu Pro His Phe Ile Ser Asn Met Pro Tyr Ala Phe Pro Gly Asn Pro85 90 95
Glu Leu Gly Lys Leu Leu Ala Glu Glu Cys Asn Arg Phe Asn Val Glu100 105 110
Thr Met Ala His His Ala Thr Thr Leu Ala Pro Glu Tyr Gly Thr Leu115 120 125
Val Pro Met Arg Tyr Met Asn Gln Asp Gln His Phe Lys Val Val Ser130 135 140
Val Ser Ala Leu Cys Thr Ser His Tyr Leu Ala Asp Ser Ala Arg Leu145 150 155 160
Gly Trp Ala Met Arg Lys Ala Val Glu Asp His Tyr Asp Gly Thr Val165 170 175
Ala Phe Leu Ala Ser Gly Ser Leu Ser His Arg Phe Ala Gln Asn Gly180 185 190
Gln Ala Pro Asp Phe Ala Thr Lys Val Trp Ser Pro Phe Leu Glu Thr195 200 205
Leu Asp His Arg Val Val Gln Met Trp Gln Asp Gly Glu Trp Glu Ala210 215 220
Phe Cys Gly Met Leu Pro Glu Tyr Ala Ala Lys Gly His Gly Glu Gly225 230 235 240
Phe Met His Asp Thr Ala Met Leu Leu Gly Ala Leu Gly Trp Ser Asp245 250 255
Tyr Asp Gly Lys Ala Glu Val Val Thr Pro Tyr Phe Gly Ser Ser Gly260 265 270
Thr Gly Gln Ile Asn Ala Ile Phe Pro Val Thr Pro Gln Asp Gly Gly275 280 285
Ala Ile Pro Ala Ala Gln Ala Ala Asn Pro Ala Ala Val Val Pro Thr 290 295 300
Ser Arg Leu305
<210> 25
<211> 1461
<212> DNA
<213> Pseudomonas putida U
<220>
<221> CDS
<222> (1) .. (1461)
<223> hpaE <400> 25 atg atc aag cac tgg atc aac ggc cgt gag gtc gag agc aaa gac acc 48 Met Ile Lys His Trp Ile Asn Gly Arg Glu Val Glu Ser Lys Asp Thr1 5 10 15
ttc gtc aac tac aac ccg gcc acc ggc gac gcc atc tgc gaa gtc gcc 96 Phe Val Asn Tyr Asn Pro Ala Thr Gly Asp Ala Ile Cys Glu Val Ala
agc ggc ggc gcc gag gaa gtg gcc cag gct gtg gct gcg gcc aag gaa 144 Ser Gly Gly Ala Glu Glu Val Ala Gln Ala Val Ala Ala Ala Lys Glu
gcc ttc ccc aag tgg gcc aac acc ccg gcc aag gaa cgt gcc cgg ctg 192 Ala Phe Pro Lys Trp Ala Asn Thr Pro Ala Lys Glu Arg Ala Arg Leu
50 55 60
atg cgc aag ctg ggt gag ctg att gag cag aac gtg ccg aaa ctc gcc 240 Met Arg Lys Leu Gly Glu Leu Ile Glu Gln Asn Val Pro Lys Leu Ala65 70 75 80
gag ctg gaa acc ctc gac acc ggc ctg ccg atc cac cag acc aag aac 288 Glu Leu Glu Thr Leu Asp Thr Gly Leu Pro Ile His Gln Thr Lys Asn
85 90 95
gtg ctg atc ccg cgt gcc tcg cac aac ttc gac ttc ttc gcc gaa gtg 336 Val Leu Ile Pro Arg Ala Ser His Asn Phe Asp Phe Phe Ala Glu Val
100 105 110
tgc acg cgc atg gac ggc cat acc tac ccg gtc gac gac cag atg ctc 384 Cys Thr Arg Met Asp Gly His Thr Tyr Pro Val Asp Asp Gln Met Leu
115 120 125
aac tac acc ctg tac cag ccg gtg ggt gtg tgc ggc ctg gta agc cca 432 Asn Tyr Thr Leu Tyr Gln Pro Val Gly Val Cys Gly Leu Val Ser Pro
130 135 140
tgg aac gtg ccg ttc atg acg gct acc tgg aag act gcg ccg tgc ctg 480 Trp Asn Val Pro Phe Met Thr Ala Thr Trp Lys Thr Ala Pro Cys Leu145 150 155 160
gcg ctg ggc aac acc gcc gtg ctg aag atg agc gag ctg tcg cct ctg 528 Ala Leu Gly Asn Thr Ala Val Leu Lys Met Ser Glu Leu Ser Pro Leu
165 170 175
acc gcc aac gaa ctg ggc cgc ctg gcg gta gaa gcc ggc atc ccc aac 576 Thr Ala Asn Glu Leu Gly Arg Leu Ala Val Glu Ala Gly Ile Pro Asn
180 185 190
ggg gtg ctg aac gtg atc cag ggt tac ggc gct acc gcc ggc gat gcc 624 Gly Val Leu Asn Val Ile Gln Gly Tyr Gly Ala Thr Ala Gly Asp Ala
195 200 205
ctg gtc cgc cac ccc gat gtg cgc gcc att tcc ttc acc ggc ggt acc 672 Leu Val Arg His Pro Asp Val Arg Ala Ile Ser Phe Thr Gly Gly Thr
210 215 220
gcc acc ggc aag aag atc atg cag acc gca ggc ctt aaa aag tac tcg 720 Ala Thr Gly Lys Lys Ile Met Gln Thr Ala Gly Leu Lys Lys Tyr Ser225 230 235 240
atg gaa ctg ggc ggc aag tcg ccc gtg ctg atc ttc gaa gac gca gac 768 Met Glu Leu Gly Gly Lys Ser Pro Val Leu Ile Phe Glu Asp Ala Asp
245 250 255
ctt gag cgt gcg ctg gac gcc gcg ctg ttc acc atc ttc tcg ctg aac 816 Leu Glu Arg Ala Leu Asp Ala Ala Leu Phe Thr Ile Phe Ser Leu Asn
260 265 270
ggc gag cgc tgc acc gcc ggc agc cgc atc ttc atc cag gaa agc gtg 864 Gly Glu Arg Cys Thr Ala Gly Ser Arg Ile Phe Ile Gln Glu Ser Val
275 280 285
tac ccg cag ttt gtc gca gag ttt gcg gcg cgc gcc aag cgc ctg atc 912 Tyr Pro Gln Phe Val Ala Glu Phe Ala Ala Arg Ala Lys Arg Leu Ile
290 295 300
gta ggt gac ccg acc gac ccg aaa acc cag gtc ggt tcg atg atc acc 960 Val Gly Asp Pro Thr Asp Pro Lys Thr Gln Val Gly Ser Met Ile Thr305 310 315 320
cag cag cac tat gac aag gtc acc ggg tac atc cgc att ggc atc gaa 1008 Gln Gln His Tyr Asp Lys Val Thr Gly Tyr Ile Arg Ile Gly Ile Glu
325 330 335
gaa ggt gca cgc ctg gtc gcc ggg ggc ctg gag cgc ccg gcc aac ctg 1056 Glu Gly Ala Arg Leu Val Ala Gly Gly Leu Glu Arg Pro Ala Asn Leu
340 345 350
cct gcg cac ctg gcc aag ggg cag ttc atc cag ccc acc gta ttc gcc 1104 Pro Ala His Leu Ala Lys Gly Gln Phe Ile Gln Pro Thr Val Phe Ala
355 360 365
gac gtg aac aac aag atg cgc att gcc cag gaa gaa atc ttt ggc ccg 1152 Asp Val Asn Asn Lys Met Arg Ile Ala Gln Glu Glu Ile Phe Gly Pro
370 375 380
gtg gtg tgc ctg atc ccg ttc aag gac gaa gcc gag gcg ctg caa ctg 1200 Val Val Cys Leu Ile Pro Phe Lys Asp Glu Ala Glu Ala Leu Gln Leu385 390 395 400
gcc aac gac acc gag tat ggc ctg gcc tcg tac atc tgg acc cag gac 1248 Ala Asn Asp Thr Glu Tyr Gly Leu Ala Ser Tyr Ile Trp Thr Gln Asp
405 410 415
atc ggc aaa gcc cat cgc ctg gcc cgt ggc atc gag gcc ggc atg gtg 1296 Ile Gly Lys Ala His Arg Leu Ala Arg Gly Ile Glu Ala Gly Met Val
420 425 430
ttc atc aac agc cag aac gta cgc gac ctg cgc cag ccg ttc ggc ggc 1344 Phe Ile Asn Ser Gln Asn Val Arg Asp Leu Arg Gln Pro Phe Gly Gly
435 440 445
gtg aaa ggt tcc ggt acc ggg cgt gag ggc ggg cag tac agc ttc gag 1392 Val Lys Gly Ser Gly Thr Gly Arg Glu Gly Gly Gln Tyr Ser Phe Glu
450 455 460
gtc ttt gca gag atc aag aac gtg tgt att tcc atg ggt aat cac cacVal Phe Ala Glu Ile Lys Asn Val Cys Ile Ser Met Gly Asn His His465 470 475 480 1440 att cct cgc tgg ggc atc taaIle Pro Arg Trp Gly Ile485 1461 <210> 26 <211> 486 <212> PRT <213> Pseudomonas putida U <400> 26 Met Ile Lys His Trp Ile Asn Gly Arg Glu Val Glu Ser Lys Asp Thr1 5 10 15 Phe Val Asn Tyr Asn Pro Ala Thr Gly Asp Ala Ile Cys Glu Val Ala20 25 30 Ser Gly Gly Ala Glu Glu Val Ala Gln Ala Val Ala Ala Ala Lys Glu35 40 45 Ala Phe Pro Lys Trp Ala Asn Thr Pro Ala Lys Glu Arg Ala Arg Leu50 55 60 Met Arg Lys Leu Gly Glu Leu Ile Glu Gln Asn Val Pro Lys Leu Ala65 70 75 80 Glu Leu Glu Thr Leu Asp Thr Gly Leu Pro Ile His Gln Thr Lys Asn85 90 95 Val Leu Ile Pro Arg Ala Ser His Asn Phe Asp Phe Phe Ala Glu Val100 105 110 Cys Thr Arg Met Asp Gly His Thr Tyr Pro Val Asp Asp Gln Met Leu115 120 125 Asn Tyr Thr Leu Tyr Gln Pro Val Gly Val Cys Gly Leu Val Ser Pro130 135 140 Trp Asn Val Pro Phe Met Thr Ala Thr Trp Lys Thr Ala Pro Cys Leu145 150 155 160 Ala Leu Gly Asn Thr Ala Val Leu Lys Met Ser Glu Leu Ser Pro Leu165 170 175 Thr Ala Asn Glu Leu Gly Arg Leu Ala Val Glu Ala Gly Ile Pro Asn180 185 190 Gly Val Leu Asn Val Ile Gln Gly Tyr Gly Ala Thr Ala Gly Asp Ala195 200 205 Leu Val Arg His Pro Asp Val Arg Ala Ile Ser Phe Thr Gly Gly Thr210 215 220 Ala Thr Gly Lys Lys Ile Met Gln Thr Ala Gly Leu Lys Lys Tyr Ser225 230 235 240
Met Glu Leu Gly Gly Lys Ser Pro Val Leu Ile Phe Glu Asp Ala Asp245 250 255
Leu Glu Arg Ala Leu Asp Ala Ala Leu Phe Thr Ile Phe Ser Leu Asn260 265 270
Gly Glu Arg Cys Thr Ala Gly Ser Arg Ile Phe Ile Gln Glu Ser Val275 280 285
Tyr Pro Gln Phe Val Ala Glu Phe Ala Ala Arg Ala Lys Arg Leu Ile290 295 300
Val Gly Asp Pro Thr Asp Pro Lys Thr Gln Val Gly Ser Met Ile Thr305 310 315 320
Gln Gln His Tyr Asp Lys Val Thr Gly Tyr Ile Arg Ile Gly Ile Glu325 330 335
Glu Gly Ala Arg Leu Val Ala Gly Gly Leu Glu Arg Pro Ala Asn Leu340 345 350
Pro Ala His Leu Ala Lys Gly Gln Phe Ile Gln Pro Thr Val Phe Ala355 360 365
Asp Val Asn Asn Lys Met Arg Ile Ala Gln Glu Glu Ile Phe Gly Pro 370 375 380
Val Val Cys Leu Ile Pro Phe Lys Asp Glu Ala Glu Ala Leu Gln Leu385 390 395 400
Ala Asn Asp Thr Glu Tyr Gly Leu Ala Ser Tyr Ile Trp Thr Gln Asp405 410 415
Ile Gly Lys Ala His Arg Leu Ala Arg Gly Ile Glu Ala Gly Met Val420 425 430
Phe Ile Asn Ser Gln Asn Val Arg Asp Leu Arg Gln Pro Phe Gly Gly435 440 445
Val Lys Gly Ser Gly Thr Gly Arg Glu Gly Gly Gln Tyr Ser Phe Glu450 455 460
Val Phe Ala Glu Ile Lys Asn Val Cys Ile Ser Met Gly Asn His His465 470 475 480
Ile Pro Arg Trp Gly Ile485
<210> 27
<211> 405
<212> DNA
<213> Pseudomonas putida U
<220>
<221> CDS
<222> (1) .. (405)
<223> hpaF
<400> 27 atg cca cac ctg gtt ctg ctc tat acc ccc gac ctg gaa acc gac gcc 48 Met Pro His Leu Val Leu Leu Tyr Thr Pro Asp Leu Glu Thr Asp Ala1 5 10 15
gac atc ccc ggc ctg tgc cgc gcc ctg gcc gac acc atg ctc gaa cag 96 Asp Ile Pro Gly Leu Cys Arg Ala Leu Ala Asp Thr Met Leu Glu Gln
cgc gat gcc gaa ggc aaa gcc gtg ttc ccc act ggc ggt aca cgc gtg 144 Arg Asp Ala Glu Gly Lys Ala Val Phe Pro Thr Gly Gly Thr Arg Val
ctg gcc tac ccc gcc gcc cat tgc gcg gtg gcc gac ggc aaa ggc gaa 192 Leu Ala Tyr Pro Ala Ala His Cys Ala Val Ala Asp Gly Lys Gly Glu
50 55 60
tac ggc ttt ctg tac gcc aac ctg cgc atg gct acc ggc cgt agc gcc 240 Tyr Gly Phe Leu Tyr Ala Asn Leu Arg Met Ala Thr Gly Arg Ser Ala65 70 75 80
gag gtg cac aaa aca gtg ggc gac agc ttg ctg gca gtg ttg aaa gcg 288 Glu Val His Lys Thr Val Gly Asp Ser Leu Leu Ala Val Leu Lys Ala
85 90 95
cgc ctg gac cca ctg ctg caa cag cgc ccg atc ggc atc acc gtg cag 336 Arg Leu Asp Pro Leu Leu Gln Gln Arg Pro Ile Gly Ile Thr Val Gln
100 105 110
atc gac cac agc acc gcc cag gtc tac gac gcc aag cac agc acc ttg 384 Ile Asp His Ser Thr Ala Gln Val Tyr Asp Ala Lys His Ser Thr Leu
115 120 125
cac cca ctg ttc aac cgc tag 405 His Pro Leu Phe Asn Arg
<210> 28
<211> 134
<212> PRT
<213> Pseudomonas putida U
<400> 28 Met Pro His Leu Val Leu Leu Tyr Thr Pro Asp Leu Glu Thr Asp Ala1 5 10 15
Asp Ile Pro Gly Leu Cys Arg Ala Leu Ala Asp Thr Met Leu Glu Gln20 25 30
Arg Asp Ala Glu Gly Lys Ala Val Phe Pro Thr Gly Gly Thr Arg Val35 40 45
Leu Ala Tyr Pro Ala Ala His Cys Ala Val Ala Asp Gly Lys Gly Glu50 55 60
Tyr Gly Phe Leu Tyr Ala Asn Leu Arg Met Ala Thr Gly Arg Ser Ala 65 70 75 80
Glu Val His Lys Thr Val Gly Asp Ser Leu Leu Ala Val Leu Lys Ala85 90 95
Arg Leu Asp Pro Leu Leu Gln Gln Arg Pro Ile Gly Ile Thr Val Gln100 105 110
Ile Asp His Ser Thr Ala Gln Val Tyr Asp Ala Lys His Ser Thr Leu115 120 125
His Pro Leu Phe Asn Arg130
<210> 29
<211> 660
<212> DNA
<213> Pseudomonas putida U
<220>
<221> CDS
<222> (1) .. (660)
<223> hpaG1
<400> 29 atg agc cat gcc ctg ctt gac gtt gcc agc ggc acc ctg ttc ggc gtc 48 Met Ser His Ala Leu Leu Asp Val Ala Ser Gly Thr Leu Phe Gly Val1 5 10 15
gcg ctg aac tac cag ggt ttg ctg cag cag cac caa gcg gcg ttc gtg 96 Ala Leu Asn Tyr Gln Gly Leu Leu Gln Gln His Gln Ala Ala Phe Val
gaa gca ccg tac aag caa ctg ccg gtc aag ccg gtg ttg ttc gtc aag 144 Glu Ala Pro Tyr Lys Gln Leu Pro Val Lys Pro Val Leu Phe Val Lys
acc ccg aac acc cgc aac cag cat gaa ggc cag gtg gta ttc ccg gcc 192 Thr Pro Asn Thr Arg Asn Gln His Glu Gly Gln Val Val Phe Pro Ala
50 55 60
ggc gtg cag cgc gtg caa ccc ggc ccg gcg ctg gga gtg gtg att ggc 240 Gly Val Gln Arg Val Gln Pro Gly Pro Ala Leu Gly Val Val Ile Gly65 70 75 80
aag gac gcc agc cgc gtc agc gtg gcc gat gcc ctg gag cat gtg gcg 288 Lys Asp Ala Ser Arg Val Ser Val Ala Asp Ala Leu Glu His Val Ala
85 90 95
ggc tac acc atc gtc aac gaa gtg agc ctg ccc gaa gcc agc tac tac 336 Gly Tyr Thr Ile Val Asn Glu Val Ser Leu Pro Glu Ala Ser Tyr Tyr
100 105 110
cgc cct gca gtc aag gcc aag tgc cgt gat ggt ttt tgc ccg gtc ggc 384 Arg Pro Ala Val Lys Ala Lys Cys Arg Asp Gly Phe Cys Pro Val Gly
115 120 125
cct gaa ctg gtg ccc gcc agc caa gtg gcc aac ccc gat gcc ctg ggcPro Glu Leu Val Pro Ala Ser Gln Val Ala Asn Pro Asp Ala Leu Gly130 135 140 432 ctg cgc ctg tat gtg aac ggc gaa ctg cgc cag cac aac aac acc gccLeu Arg Leu Tyr Val Asn Gly Glu Leu Arg Gln His Asn Asn Thr Ala145 150 155 160 480 aac tgc gta cgc acg gtg gcg cag ctg att gcc gaa atc agc gag ttcAsn Cys Val Arg Thr Val Ala Gln Leu Ile Ala Glu Ile Ser Glu Phe165 170 175 528 atg acc ctg cac gcc ggc gac atc ctg atc acc gga acc ccc gag ggcMet Thr Leu His Ala Gly Asp Ile Leu Ile Thr Gly Thr Pro Glu Gly180 185 190 576 cgc gtc gat gta cag cca ggt gac cgc gtc gac atc gag atc gac ggcArg Val Asp Val Gln Pro Gly Asp Arg Val Asp Ile Glu Ile Asp Gly195 200 205 624 ctg ggc aag ctg acc aac cac atc gtc gcc gag tgaLeu Gly Lys Leu Thr Asn His Ile Val Ala Glu210 215 660 <210> 30 <211> 219 <212> PRT <213> Pseudomonas putida U <400> 30 Met Ser His Ala Leu Leu Asp Val Ala Ser Gly Thr Leu Phe Gly Val1 5 10 15 Ala Leu Asn Tyr Gln Gly Leu Leu Gln Gln His Gln Ala Ala Phe Val20 25 30 Glu Ala Pro Tyr Lys Gln Leu Pro Val Lys Pro Val Leu Phe Val Lys35 40 45 Thr Pro Asn Thr Arg Asn Gln His Glu Gly Gln Val Val Phe Pro Ala50 55 60 Gly Val Gln Arg Val Gln Pro Gly Pro Ala Leu Gly Val Val Ile Gly65 70 75 80 Lys Asp Ala Ser Arg Val Ser Val Ala Asp Ala Leu Glu His Val Ala85 90 95 Gly Tyr Thr Ile Val Asn Glu Val Ser Leu Pro Glu Ala Ser Tyr Tyr100 105 110 Arg Pro Ala Val Lys Ala Lys Cys Arg Asp Gly Phe Cys Pro Val Gly115 120 125 Pro Glu Leu Val Pro Ala Ser Gln Val Ala Asn Pro Asp Ala Leu Gly130 135 140Leu Arg Leu Tyr Val Asn Gly Glu Leu Arg Gln His Asn Asn Thr Ala145 150 155 160
Asn Cys Val Arg Thr Val Ala Gln Leu Ile Ala Glu Ile Ser Glu Phe165 170 175
Met Thr Leu His Ala Gly Asp Ile Leu Ile Thr Gly Thr Pro Glu Gly180 185 190
Arg Val Asp Val Gln Pro Gly Asp Arg Val Asp Ile Glu Ile Asp Gly195 200 205
Leu Gly Lys Leu Thr Asn His Ile Val Ala Glu210 215
<210> 31
<211> 765
<212> DNA
<213> Pseudomonas putida U
<220>
<221> CDS
<222> (1) .. (765)
<223> hpaG2
<400> 31 gtg aaa cac gcc cgt atc cag ttc gac ggc cag gcc cac gat gtc acg 48 Val Lys His Ala Arg Ile Gln Phe Asp Gly Gln Ala His Asp Val Thr1 5 10 15
gtc gaa gac gat cac ctg cgc ctt gcc gac ggc cgc ctg gtc cat cag 96 Val Glu Asp Asp His Leu Arg Leu Ala Asp Gly Arg Leu Val His Gln
gac cag gtc acc tgg ctg cca ccc gcc acc ggc agc atg ttc gcc ctg 144 Asp Gln Val Thr Trp Leu Pro Pro Ala Thr Gly Ser Met Phe Ala Leu
ggc ctg aac tac gcc gac cac gcc agg gag ctg gcc ttc gcg ccg ccc 192 Gly Leu Asn Tyr Ala Asp His Ala Arg Glu Leu Ala Phe Ala Pro Pro
50 55 60
acc gaa ccg ttg gct ttc atc aag tcg cca ggc acc tac acc ggc cac 240 Thr Glu Pro Leu Ala Phe Ile Lys Ser Pro Gly Thr Tyr Thr Gly His65 70 75 80
atc cag gtc acc tgg cgc ccg gac aac gtc gaa tac atg cac tac gag 288 Ile Gln Val Thr Trp Arg Pro Asp Asn Val Glu Tyr Met His Tyr Glu
85 90 95
tgc gag ctg gtg gcg gtg atc ggc aaa gcg gcg aag aac gtc aag cgt 336 Cys Glu Leu Val Ala Val Ile Gly Lys Ala Ala Lys Asn Val Lys Arg
100 105 110
gag gac gcc ctg gcc tac gtt gcc ggc tac acc gtg tgc aac gac tac 384 Glu Asp Ala Leu Ala Tyr Val Ala Gly Tyr Thr Val Cys Asn Asp Tyr
115 120 125 gcc atc cgc gac tac ctg gaa aac tac tac cgc ccc aac ctg cgg gtg 432 Ala Ile Arg Asp Tyr Leu Glu Asn Tyr Tyr Arg Pro Asn Leu Arg Val
130 135 140
aaa aac cgc gat gcc acc acc ccg gtc ggc ccg tgg atc gtc gat gcg 480 Lys Asn Arg Asp Ala Thr Thr Pro Val Gly Pro Trp Ile Val Asp Ala145 150 155 160
gcc gat gtg cca gac gtc agc aac ctg aag ctg cgc acc tgg atc aac 528 Ala Asp Val Pro Asp Val Ser Asn Leu Lys Leu Arg Thr Trp Ile Asn
165 170 175
ggt gag ctg aag cag gaa ggc acc acc gcg gac atg atc ttc gac atc 576 Gly Glu Leu Lys Gln Glu Gly Thr Thr Ala Asp Met Ile Phe Asp Ile
180 185 190
ccg cac ctc atc gaa tac ttc tcc agc ttc atg acc ctg caa ccg ggc 624 Pro His Leu Ile Glu Tyr Phe Ser Ser Phe Met Thr Leu Gln Pro Gly
195 200 205
gac atg atc gcc acc ggc acg cca gaa ggc ctg gcc gat gtg gtg ccg 672 Asp Met Ile Ala Thr Gly Thr Pro Glu Gly Leu Ala Asp Val Val Pro
210 215 220
ggt gac gaa gtg gtg gtg gaa gtg gaa ggc gtc ggt cgc ctg gtc aac 720 Gly Asp Glu Val Val Val Glu Val Glu Gly Val Gly Arg Leu Val Asn225 230 235 240
cgt atc gtc agc gaa gct gac ttc ttc aag aac aac aag gca tga 765 Arg Ile Val Ser Glu Ala Asp Phe Phe Lys Asn Asn Lys Ala
245 250
<210> 32
<211> 254
<212> PRT
<213> Pseudomonas putida U
<400> 32
Val Lys His Ala Arg Ile Gln Phe Asp Gly Gln Ala His Asp Val Thr1 5 10 15
Val Glu Asp Asp His Leu Arg Leu Ala Asp Gly Arg Leu Val His Gln20 25 30
Asp Gln Val Thr Trp Leu Pro Pro Ala Thr Gly Ser Met Phe Ala Leu35 40 45
Gly Leu Asn Tyr Ala Asp His Ala Arg Glu Leu Ala Phe Ala Pro Pro50 55 60
Thr Glu Pro Leu Ala Phe Ile Lys Ser Pro Gly Thr Tyr Thr Gly His65 70 75 80
Ile Gln Val Thr Trp Arg Pro Asp Asn Val Glu Tyr Met His Tyr Glu85 90 95
Cys Glu Leu Val Ala Val Ile Gly Lys Ala Ala Lys Asn Val Lys Arg
100 105 110 Glu Asp Ala Leu Ala Tyr Val Ala Gly Tyr Thr Val Cys Asn Asp Tyr115 120 125 Ala Ile Arg Asp Tyr Leu Glu Asn Tyr Tyr Arg Pro Asn Leu Arg Val130 135 140 Lys Asn Arg Asp Ala Thr Thr Pro Val Gly Pro Trp Ile Val Asp Ala145 150 155 160 Ala Asp Val Pro Asp Val Ser Asn Leu Lys Leu Arg Thr Trp Ile Asn165 170 175 Gly Glu Leu Lys Gln Glu Gly Thr Thr Ala Asp Met Ile Phe Asp Ile180 185 190 Pro His Leu Ile Glu Tyr Phe Ser Ser Phe Met Thr Leu Gln Pro Gly 195 200 205 Asp Met Ile Ala Thr Gly Thr Pro Glu Gly Leu Ala Asp Val Val Pro210 215 220 Gly Asp Glu Val Val Val Glu Val Glu Gly Val Gly Arg Leu Val Asn225 230 235 240 Arg Ile Val Ser Glu Ala Asp Phe Phe Lys Asn Asn Lys Ala245 250 <210> 33 <211> 804 <212> DNA <213> Pseudomonas putida U <220> <221> CDS <222> (1) .. (804) <223> hpaH <400> 33 atg cta gac aac gct ttc atc cag cac gcc gcc gac cgc ctc gac cagMet Leu Asp Asn Ala Phe Ile Gln His Ala Ala Asp Arg Leu Asp Gln1 5 10 15 48 gcc gaa cgc tcc cgc gag caa gtg cgc cag ttc tcg ctg gag caa ccgAla Glu Arg Ser Arg Glu Gln Val Arg Gln Phe Ser Leu Glu Gln Pro20 25 30 96 gca atc acc atc gaa gac gcc tac gcc atc cag cgc gcc tgg gtg gcaAla Ile Thr Ile Glu Asp Ala Tyr Ala Ile Gln Arg Ala Trp Val Ala35 40 45 144 aaa aag atc gcc gcc ggg cgc aag ctg gtg ggc cac aag atc ggc ctgLys Lys Ile Ala Ala Gly Arg Lys Leu Val Gly His Lys Ile Gly Leu50 55 60 192 acc tcg cgc gcc atg cag gta tcg tcg aac atc acc gag ccc gac tacThr Ser Arg Ala Met Gln Val Ser Ser Asn Ile Thr Glu Pro Asp Tyr 24065 70 75 80
ggc gcc ttg ctc gac gac atg ctg ttc gac gaa ggc agc gac atc ccc 288 Gly Ala Leu Leu Asp Asp Met Leu Phe Asp Glu Gly Ser Asp Ile Pro
85 90 95
ttc gag cgc ttc atc gtg ccg cgg gtt gaa gtg gag ttg gcg ttc atc 336 Phe Glu Arg Phe Ile Val Pro Arg Val Glu Val Glu Leu Ala Phe Ile
100 105 110
ctc ggc aag ccg ctg aag ggc ccg aac atc acc gtg ttt gat gtg ctg 384 Leu Gly Lys Pro Leu Lys Gly Pro Asn Ile Thr Val Phe Asp Val Leu
115 120 125
gac gcc acc gag tgg gtg atc ccg gcg ctg gaa atc att gac gcg cgc 432 Asp Ala Thr Glu Trp Val Ile Pro Ala Leu Glu Ile Ile Asp Ala Arg
130 135 140
atc cag cag gtg gac ccg caa acc cag gcc acc cgc aag gtg ttc gac 480 Ile Gln Gln Val Asp Pro Gln Thr Gln Ala Thr Arg Lys Val Phe Asp145 150 155 160
acc atc tcc gac aac gcc gcc aat gcc ggc gtg gtg atg ggc ggg cgg 528 Thr Ile Ser Asp Asn Ala Ala Asn Ala Gly Val Val Met Gly Gly Arg
165 170 175
gcc gtg cgc ccc acc gaa atc gac ctg cgc aaa gtg ccg gcg gtg ctc 576 Ala Val Arg Pro Thr Glu Ile Asp Leu Arg Lys Val Pro Ala Val Leu
180 185 190
tac cgc aat ggc gtg atc gag gaa tcc ggg gtc agc gct gcc gtg ctc 624 Tyr Arg Asn Gly Val Ile Glu Glu Ser Gly Val Ser Ala Ala Val Leu
195 200 205
aac cac ccg gcc aaa ggc gtt gcc tgg ctg gcc aac aaa ctg gcg ccg 672 Asn His Pro Ala Lys Gly Val Ala Trp Leu Ala Asn Lys Leu Ala Pro
210 215 220
tac gac gtc acc ttg cag ccc ggc cag atc atc ctt ggg ggt tcg ttc 720 Tyr Asp Val Thr Leu Gln Pro Gly Gln Ile Ile Leu Gly Gly Ser Phe225 230 235 240
acc cgc ccg gtc gcc gct cgc cca ggt gac acc ttc cac gtc gac tac 768 Thr Arg Pro Val Ala Ala Arg Pro Gly Asp Thr Phe His Val Asp Tyr
245 250 255
gac atg ctc ggc tcc atc gcc tgc cgc ttc gtt taa 804 Asp Met Leu Gly Ser Ile Ala Cys Arg Phe Val
260 265
<210> 34
<211> 267
<212> PRT
<213> Pseudomonas putida U
<400> 34 Met Leu Asp Asn Ala Phe Ile Gln His Ala Ala Asp Arg Leu Asp Gln1 5 10 15
Ala Glu Arg Ser Arg Glu Gln Val Arg Gln Phe Ser Leu Glu Gln Pro20 25 30
Ala Ile Thr Ile Glu Asp Ala Tyr Ala Ile Gln Arg Ala Trp Val Ala35 40 45
Lys Lys Ile Ala Ala Gly Arg Lys Leu Val Gly His Lys Ile Gly Leu50 55 60
Thr Ser Arg Ala Met Gln Val Ser Ser Asn Ile Thr Glu Pro Asp Tyr65 70 75 80
Gly Ala Leu Leu Asp Asp Met Leu Phe Asp Glu Gly Ser Asp Ile Pro85 90 95
Phe Glu Arg Phe Ile Val Pro Arg Val Glu Val Glu Leu Ala Phe Ile100 105 110
Leu Gly Lys Pro Leu Lys Gly Pro Asn Ile Thr Val Phe Asp Val Leu115 120 125
Asp Ala Thr Glu Trp Val Ile Pro Ala Leu Glu Ile Ile Asp Ala Arg130 135 140
Ile Gln Gln Val Asp Pro Gln Thr Gln Ala Thr Arg Lys Val Phe Asp145 150 155 160
Thr Ile Ser Asp Asn Ala Ala Asn Ala Gly Val Val Met Gly Gly Arg165 170 175
Ala Val Arg Pro Thr Glu Ile Asp Leu Arg Lys Val Pro Ala Val Leu180 185 190
Tyr Arg Asn Gly Val Ile Glu Glu Ser Gly Val Ser Ala Ala Val Leu195 200 205
Asn His Pro Ala Lys Gly Val Ala Trp Leu Ala Asn Lys Leu Ala Pro 210 215 220
Tyr Asp Val Thr Leu Gln Pro Gly Gln Ile Ile Leu Gly Gly Ser Phe225 230 235 240
Thr Arg Pro Val Ala Ala Arg Pro Gly Asp Thr Phe His Val Asp Tyr245 250 255
Asp Met Leu Gly Ser Ile Ala Cys Arg Phe Val260 265
<210> 35
<211> 804
<212> DNA
<213> Pseudomonas putida U
<220>
<221> CDS
<222> (1) .. (804)
<223> hpaI <400> 35 atg gac atg ccc atc aac cac ttc aag cga cgc ctg cac agc ggt gaa 48 Met Asp Met Pro Ile Asn His Phe Lys Arg Arg Leu His Ser Gly Glu1 5 10 15
ccg caa atc ggc ctg tgg ctc ggc ctg gcc gat gcc tac tgc gcc gag 96 Pro Gln Ile Gly Leu Trp Leu Gly Leu Ala Asp Ala Tyr Cys Ala Glu
ctg gcg gcc aat gcc ggt ttc gac tgg ctg ctg atc gac ggc gaa cac 144 Leu Ala Ala Asn Ala Gly Phe Asp Trp Leu Leu Ile Asp Gly Glu His
gcg ccc aac gac ctg cgc ggc atg ctc gcc cag ttg cag gcg gtg gca 192 Ala Pro Asn Asp Leu Arg Gly Met Leu Ala Gln Leu Gln Ala Val Ala
50 55 60
ccc tac ccc agc cag gca gtg atc cgc ccg gtg atc ggc gat acc gcg 240 Pro Tyr Pro Ser Gln Ala Val Ile Arg Pro Val Ile Gly Asp Thr Ala65 70 75 80
ctg atc aag cag gtg ctg gat atc ggc gca caa acc ttg ctg gtg ccg 288 Leu Ile Lys Gln Val Leu Asp Ile Gly Ala Gln Thr Leu Leu Val Pro
85 90 95
atg gtg gaa act gcc gaa cag gcg cgg caa ctg gtc aag gcc atg cat 336 Met Val Glu Thr Ala Glu Gln Ala Arg Gln Leu Val Lys Ala Met His
100 105 110
tac ccg ccc aag ggc att cgc ggg gtg ggc agc gcg ctg gcg cgg gct 384 Tyr Pro Pro Lys Gly Ile Arg Gly Val Gly Ser Ala Leu Ala Arg Ala
115 120 125
tcg cgc tgg aac acc ctc ccc ggt tac ctg gac cac gcc gat gag caa 432 Ser Arg Trp Asn Thr Leu Pro Gly Tyr Leu Asp His Ala Asp Glu Gln
130 135 140
atg tgc ctg ctg gtg cag atc gag aac aag gaa ggc ctg gcc aac ctg 480 Met Cys Leu Leu Val Gln Ile Glu Asn Lys Glu Gly Leu Ala Asn Leu145 150 155 160
gac gag atc gtt gca gtg gaa ggt gtg gat ggc gtg ttc atc ggg cct 528 Asp Glu Ile Val Ala Val Glu Gly Val Asp Gly Val Phe Ile Gly Pro
165 170 175
gca gac ctg agt gcg gcc atg ggg cat cgc ggc aac ccc ggg cac ccg 576 Ala Asp Leu Ser Ala Ala Met Gly His Arg Gly Asn Pro Gly His Pro
180 185 190
gag gtg cag gcg gcg att gaa gac gca atc gtg cgc att ggc aag gcg 624 Glu Val Gln Ala Ala Ile Glu Asp Ala Ile Val Arg Ile Gly Lys Ala
195 200 205
ggc aaa gcc gcc ggc att ctc agc gcg gac gag aaa ctg gcg cga cgc 672 Gly Lys Ala Ala Gly Ile Leu Ser Ala Asp Glu Lys Leu Ala Arg Arg
210 215 220
tac atc gag ctg ggt gcg gcg ttt gtg gcg gtg ggt gtg gat acc acg 720
Tyr Ile Glu Leu Gly Ala Ala Phe Val Ala Val Gly Val Asp Thr Thr225 230 235 240 gtg ctg atg cgc ggg ctg cgc gag ctg gcg ggg aag ttc aag gat acaVal Leu Met Arg Gly Leu Arg Glu Leu Ala Gly Lys Phe Lys Asp Thr245 250 255 768 gtg gta gtc ccc agt gcc ggg ggt ggt gcc tac tgaVal Val Val Pro Ser Ala Gly Gly Gly Ala Tyr260 265 804 <210> 36 <211> 267 <212> PRT <213> Pseudomonas putida U <400> 36 Met Asp Met Pro Ile Asn His Phe Lys Arg Arg Leu His Ser Gly Glu1 5 10 15 Pro Gln Ile Gly Leu Trp Leu Gly Leu Ala Asp Ala Tyr Cys Ala Glu20 25 30 Leu Ala Ala Asn Ala Gly Phe Asp Trp Leu Leu Ile Asp Gly Glu His35 40 45 Ala Pro Asn Asp Leu Arg Gly Met Leu Ala Gln Leu Gln Ala Val Ala50 55 60 Pro Tyr Pro Ser Gln Ala Val Ile Arg Pro Val Ile Gly Asp Thr Ala65 70 75 80 Leu Ile Lys Gln Val Leu Asp Ile Gly Ala Gln Thr Leu Leu Val Pro85 90 95 Met Val Glu Thr Ala Glu Gln Ala Arg Gln Leu Val Lys Ala Met His100 105 110 Tyr Pro Pro Lys Gly Ile Arg Gly Val Gly Ser Ala Leu Ala Arg Ala115 120 125 Ser Arg Trp Asn Thr Leu Pro Gly Tyr Leu Asp His Ala Asp Glu Gln130 135 140 Met Cys Leu Leu Val Gln Ile Glu Asn Lys Glu Gly Leu Ala Asn Leu145 150 155 160 Asp Glu Ile Val Ala Val Glu Gly Val Asp Gly Val Phe Ile Gly Pro165 170 175 Ala Asp Leu Ser Ala Ala Met Gly His Arg Gly Asn Pro Gly His Pro180 185 190 Glu Val Gln Ala Ala Ile Glu Asp Ala Ile Val Arg Ile Gly Lys Ala195 200 205 Gly Lys Ala Ala Gly Ile Leu Ser Ala Asp Glu Lys Leu Ala Arg Arg210 215 220Tyr Ile Glu Leu Gly Ala Ala Phe Val Ala Val Gly Val Asp Thr Thr225 230 235 240
Val Leu Met Arg Gly Leu Arg Glu Leu Ala Gly Lys Phe Lys Asp Thr245 250 255
Val Val Val Pro Ser Ala Gly Gly Gly Ala Tyr260 265
<210> 37
<211> 906
<212> DNA
<213> Pseudomonas putida U
<220>
<221> CDS
<222> (1) .. (906)
<223> hpaA
<400> 37 atg agc gac cgg cat ccg ata ccg aac atc aac att ggc cag gtt tac 48 Met Ser Asp Arg His Pro Ile Pro Asn Ile Asn Ile Gly Gln Val Tyr1 5 10 15
gac cag cgc tac agc gac agc gag gtg cat tac gac cgg ctg ggc aac 96 Asp Gln Arg Tyr Ser Asp Ser Glu Val His Tyr Asp Arg Leu Gly Asn
ctg gcg ggc ttt ttc ggg cgc aac atg ccg gtg cac cgg cat gac cgg 144 Leu Ala Gly Phe Phe Gly Arg Asn Met Pro Val His Arg His Asp Arg
ttt ttc cag gtg cat tac gtg aag tcg ggc aca gta cgg gtg tat ctg 192 Phe Phe Gln Val His Tyr Val Lys Ser Gly Thr Val Arg Val Tyr Leu
50 55 60
gat gac cag cag tac atc gag gcc ggg ccg atg ttc ttc ctc acg cca 240 Asp Asp Gln Gln Tyr Ile Glu Ala Gly Pro Met Phe Phe Leu Thr Pro65 70 75 80
ccc acg gtg gcg cac gcg ttc gtc acc gaa gct gac agc gac ggg cat 288 Pro Thr Val Ala His Ala Phe Val Thr Glu Ala Asp Ser Asp Gly His
85 90 95
gtg ctg acg gtg cgc cag caa ctg gtg tgg caa ttg atc gaa gcc gac 336 Val Leu Thr Val Arg Gln Gln Leu Val Trp Gln Leu Ile Glu Ala Asp
100 105 110
gcc agc ctg ctg ccg gcg ggc atg cag gtg cag cca gcc tgt gtg gcg 384 Ala Ser Leu Leu Pro Ala Gly Met Gln Val Gln Pro Ala Cys Val Ala
115 120 125
ctg ggc aac ctg ccg gcc gaa tac aag gcc gag gcg cag cgc ctg caa 432 Leu Gly Asn Leu Pro Ala Glu Tyr Lys Ala Glu Ala Gln Arg Leu Gln
130 135 140
ggc tgg ctg gac gcg ttg agt gac gag ttt gcc acg cag caa ccg ggt 480 Gly Trp Leu Asp Ala Leu Ser Asp Glu Phe Ala Thr Gln Gln Pro Gly145 150 155 160
cgc gag gcg gcg ttg cag tcg ctg acc cgc ctg atc atg atc agc ctg 528 Arg Glu Ala Ala Leu Gln Ser Leu Thr Arg Leu Ile Met Ile Ser Leu
165 170 175
ctg cgg ctg tgc ccc aac tcg ctg gaa tcg acc ccg gcg cgg cat gaa 576 Leu Arg Leu Cys Pro Asn Ser Leu Glu Ser Thr Pro Ala Arg His Glu
180 185 190
gac ctg aag atc ttc cac cgt ttc aat gcc ctg atc gaa gcg cat tac 624 Asp Leu Lys Ile Phe His Arg Phe Asn Ala Leu Ile Glu Ala His Tyr
195 200 205
ctt gag cat tgg ccg ctg gcc cgc tac gcg cag cag att ggc gtg acc 672 Leu Glu His Trp Pro Leu Ala Arg Tyr Ala Gln Gln Ile Gly Val Thr
210 215 220
gag gca cgg ctg aac gat gtg tgc cgg cgc atc gcc gac ttg cca tcc 720 Glu Ala Arg Leu Asn Asp Val Cys Arg Arg Ile Ala Asp Leu Pro Ser225 230 235 240
aag cgc ctg gtg ctg gaa cgg ctg atg cag gag gcc aag cgt ttg ctg 768 Lys Arg Leu Val Leu Glu Arg Leu Met Gln Glu Ala Lys Arg Leu Leu
245 250 255
ttg ttt tcc ggc agc acg gcc aac gaa atc tgt tac cag ctc ggc ttc 816 Leu Phe Ser Gly Ser Thr Ala Asn Glu Ile Cys Tyr Gln Leu Gly Phe
260 265 270
aag gat ccg gcc tat ttc agc cgc ttc ttc aac cgc tac gcc aag ctc 864 Lys Asp Pro Ala Tyr Phe Ser Arg Phe Phe Asn Arg Tyr Ala Lys Leu
275 280 285
aca ccc ggg gag tac cgc cag cgg cag gca gaa ttg cag tga 906 Thr Pro Gly Glu Tyr Arg Gln Arg Gln Ala Glu Leu Gln
290 295 300
<210> 38
<211> 301
<212> PRT
<213> Pseudomonas putida U
<400> 38
Met Ser Asp Arg His Pro Ile Pro Asn Ile Asn Ile Gly Gln Val Tyr1 5 10 15
Asp Gln Arg Tyr Ser Asp Ser Glu Val His Tyr Asp Arg Leu Gly Asn20 25 30
Leu Ala Gly Phe Phe Gly Arg Asn Met Pro Val His Arg His Asp Arg35 40 45
Phe Phe Gln Val His Tyr Val Lys Ser Gly Thr Val Arg Val Tyr Leu50 55 60
Asp Asp Gln Gln Tyr Ile Glu Ala Gly Pro Met Phe Phe Leu Thr Pro65 70 75 80
Pro Thr Val Ala His Ala Phe Val Thr Glu Ala Asp Ser Asp Gly His85 90 95
Val Leu Thr Val Arg Gln Gln Leu Val Trp Gln Leu Ile Glu Ala Asp100 105 110
Ala Ser Leu Leu Pro Ala Gly Met Gln Val Gln Pro Ala Cys Val Ala115 120 125
Leu Gly Asn Leu Pro Ala Glu Tyr Lys Ala Glu Ala Gln Arg Leu Gln130 135 140
Gly Trp Leu Asp Ala Leu Ser Asp Glu Phe Ala Thr Gln Gln Pro Gly145 150 155 160
Arg Glu Ala Ala Leu Gln Ser Leu Thr Arg Leu Ile Met Ile Ser Leu 165 170 175
Leu Arg Leu Cys Pro Asn Ser Leu Glu Ser Thr Pro Ala Arg His Glu180 185 190
Asp Leu Lys Ile Phe His Arg Phe Asn Ala Leu Ile Glu Ala His Tyr195 200 205
Leu Glu His Trp Pro Leu Ala Arg Tyr Ala Gln Gln Ile Gly Val Thr210 215 220
Glu Ala Arg Leu Asn Asp Val Cys Arg Arg Ile Ala Asp Leu Pro Ser225 230 235 240
Lys Arg Leu Val Leu Glu Arg Leu Met Gln Glu Ala Lys Arg Leu Leu245 250 255
Leu Phe Ser Gly Ser Thr Ala Asn Glu Ile Cys Tyr Gln Leu Gly Phe260 265 270
Lys Asp Pro Ala Tyr Phe Ser Arg Phe Phe Asn Arg Tyr Ala Lys Leu275 280 285
Thr Pro Gly Glu Tyr Arg Gln Arg Gln Ala Glu Leu Gln290 295 300
<210> 39
<211> 1308
<212> DNA
<213> Pseudomonas putida U
<220>
<221> CDS
<222> (1) .. (1308)
<223> hpaX
<400> 39 atg agc aca ctc gaa caa gcc tcg ccg cgc gag gca cac gtt gaa cgg 48 Met Ser Thr Leu Glu Gln Ala Ser Pro Arg Glu Ala His Val Glu Arg 1 5 10 15
gcc gac agt acc cat cgg gca gtc acc tgg cgg ctg atg ccg ctg ctg 96 Ala Asp Ser Thr His Arg Ala Val Thr Trp Arg Leu Met Pro Leu Leu
ctg gtg tgc tac ctg ttc gcc cac ctg gac cgc atc aac att ggc ttc 144 Leu Val Cys Tyr Leu Phe Ala His Leu Asp Arg Ile Asn Ile Gly Phe
gcc aag atg cag atg agc cag gac ctg cat ttg tcc gac acg gtc tat 192 Ala Lys Met Gln Met Ser Gln Asp Leu His Leu Ser Asp Thr Val Tyr
50 55 60
ggc ctg ggt gcc ggg ctg ttc ttc att gcc tat gcg ctg ttc ggc gtc 240 Gly Leu Gly Ala Gly Leu Phe Phe Ile Ala Tyr Ala Leu Phe Gly Val65 70 75 80
ccc agc aac ctg atg ctc gac cgc gtt ggc cca cgc cgc tgg atc gcc 288 Pro Ser Asn Leu Met Leu Asp Arg Val Gly Pro Arg Arg Trp Ile Ala
85 90 95
tgc ctg atg gtg gtg tgg ggg ctg ttg tcg acc agc atg ctg ctg atc 336 Cys Leu Met Val Val Trp Gly Leu Leu Ser Thr Ser Met Leu Leu Ile
100 105 110
gaa agc agc agc gcg ttc tac ctg ttg cgc ttt gcc ctg ggc gcg gcc 384 Glu Ser Ser Ser Ala Phe Tyr Leu Leu Arg Phe Ala Leu Gly Ala Ala
115 120 125
gag gcc ggg ttc ttc ccg ggc att ctg gtt tac ctc aac cgc tgg tac 432 Glu Ala Gly Phe Phe Pro Gly Ile Leu Val Tyr Leu Asn Arg Trp Tyr
130 135 140
ccg gcc ggg cgc cgc gcc cag gtc acc gcg ctg ttc gcc att gcc gtg 480 Pro Ala Gly Arg Arg Ala Gln Val Thr Ala Leu Phe Ala Ile Ala Val 145 150 155 160
ccg ttg gcc gga gtg gtc ggc ggg cca gtg tcc ggg gcc ata ctg gcc 528 Pro Leu Ala Gly Val Val Gly Gly Pro Val Ser Gly Ala Ile Leu Ala
165 170 175
ttc atg cac gac acg ggc ggg ctg cgt ggc tgg cag tgg atg ttc ctg 576 Phe Met His Asp Thr Gly Gly Leu Arg Gly Trp Gln Trp Met Phe Leu
180 185 190
ctc gaa ggg gcg ccg gtg gtg ttg ctg ggc ctg gtg gta ctg gcc gtt 624 Leu Glu Gly Ala Pro Val Val Leu Leu Gly Leu Val Val Leu Ala Val
195 200 205
ttg ccg gag cac ttc gag cgg gtg agc tgg ctg gat gag cag cag aaa 672 Leu Pro Glu His Phe Glu Arg Val Ser Trp Leu Asp Glu Gln Gln Lys
210 215 220
gcc acg ctg cgc gcg caa ttc ggt gag gaa gaa cag cgc aag ccc gta 720 Ala Thr Leu Arg Ala Gln Phe Gly Glu Glu Glu Gln Arg Lys Pro Val225 230 235 240
acc tcg ttc ggc gcc att ttc gca agc cgt gcg ctg tgg ctg ttg gtg 768
Thr Ser Phe Gly Ala Ile Phe Ala Ser Arg Ala Leu Trp Leu Leu Val245 250 255
gcc gtg tat tgc gcg gtg atg ctg gcg gtg aat acc ctt gcg ttc tgg 816 Ala Val Tyr Cys Ala Val Met Leu Ala Val Asn Thr Leu Ala Phe Trp
260 265 270
atg ccc agc ctg att cac agt gcc ggt gtg gcc agc gac gcc agt gtc 864 Met Pro Ser Leu Ile His Ser Ala Gly Val Ala Ser Asp Ala Ser Val
275 280 285
ggc ctg ctc agc gct gtg ccg tac gtg gcc ggc tgc gtg ttc atg ctg 912 Gly Leu Leu Ser Ala Val Pro Tyr Val Ala Gly Cys Val Phe Met Leu
290 295 300
gcg tgc ggc cgc tcc agc gac cgc caa cgc gaa cgc cgc tgg cac ctg 960 Ala Cys Gly Arg Ser Ser Asp Arg Gln Arg Glu Arg Arg Trp His Leu305 310 315 320
tgc gta ccg ctg ctg atg gct gcc atc ggc atc gct att gcg gcc att 1008 Cys Val Pro Leu Leu Met Ala Ala Ile Gly Ile Ala Ile Ala Ala Ile
325 330 335
gcc ccc gag cag gcg ctg ccg gta atg gcc ggc ctg gtg ctg gcc ggc 1056 Ala Pro Glu Gln Ala Leu Pro Val Met Ala Gly Leu Val Leu Ala Gly
340 345 350
atg ggc gcc agc gct gcg ctg ccg atg ttc tgg caa ctg ccg ccg gcg 1104 Met Gly Ala Ser Ala Ala Leu Pro Met Phe Trp Gln Leu Pro Pro Ala
355 360 365
ttc ctc aac gcc cgt acc cag gcc gcc ggc att gcc ctg atc agc tcg 1152 Phe Leu Asn Ala Arg Thr Gln Ala Ala Gly Ile Ala Leu Ile Ser Ser
370 375 380
ctg ggc agc atc gcc tcg ttc ttc acg ccc tac ttc atc ggc tgg gtg 1200 Leu Gly Ser Ile Ala Ser Phe Phe Thr Pro Tyr Phe Ile Gly Trp Val385 390 395 400
cgc gac acc acc cac agc gcc agc ctt gct ctg tac gta ctc gcc gtc 1248 Arg Asp Thr Thr His Ser Ala Ser Leu Ala Leu Tyr Val Leu Ala Val
405 410 415
ttc atc gcc ctg ggc ggc ctg ctg gtg ttg cgc acc cag gct gcc atc 1296 Phe Ile Ala Leu Gly Gly Leu Leu Val Leu Arg Thr Gln Ala Ala Ile
420 425 430
gtc aac cct tga 1308 Val Asn Pro
<210> 40
<211> 435
<212> PRT
<213> Pseudomonas putida U
<400> 40 Met Ser Thr Leu Glu Gln Ala Ser Pro Arg Glu Ala His Val Glu Arg1 5 10 15
Ala Asp Ser Thr His Arg Ala Val Thr Trp Arg Leu Met Pro Leu Leu20 25 30
Leu Val Cys Tyr Leu Phe Ala His Leu Asp Arg Ile Asn Ile Gly Phe35 40 45
Ala Lys Met Gln Met Ser Gln Asp Leu His Leu Ser Asp Thr Val Tyr50 55 60
Gly Leu Gly Ala Gly Leu Phe Phe Ile Ala Tyr Ala Leu Phe Gly Val65 70 75 80
Pro Ser Asn Leu Met Leu Asp Arg Val Gly Pro Arg Arg Trp Ile Ala85 90 95
Cys Leu Met Val Val Trp Gly Leu Leu Ser Thr Ser Met Leu Leu Ile100 105 110
Glu Ser Ser Ser Ala Phe Tyr Leu Leu Arg Phe Ala Leu Gly Ala Ala115 120 125
Glu Ala Gly Phe Phe Pro Gly Ile Leu Val Tyr Leu Asn Arg Trp Tyr130 135 140
Pro Ala Gly Arg Arg Ala Gln Val Thr Ala Leu Phe Ala Ile Ala Val145 150 155 160
Pro Leu Ala Gly Val Val Gly Gly Pro Val Ser Gly Ala Ile Leu Ala165 170 175
Phe Met His Asp Thr Gly Gly Leu Arg Gly Trp Gln Trp Met Phe Leu180 185 190
Leu Glu Gly Ala Pro Val Val Leu Leu Gly Leu Val Val Leu Ala Val195 200 205
Leu Pro Glu His Phe Glu Arg Val Ser Trp Leu Asp Glu Gln Gln Lys210 215 220
Ala Thr Leu Arg Ala Gln Phe Gly Glu Glu Glu Gln Arg Lys Pro Val225 230 235 240
Thr Ser Phe Gly Ala Ile Phe Ala Ser Arg Ala Leu Trp Leu Leu Val 245 250 255
Ala Val Tyr Cys Ala Val Met Leu Ala Val Asn Thr Leu Ala Phe Trp260 265 270
Met Pro Ser Leu Ile His Ser Ala Gly Val Ala Ser Asp Ala Ser Val275 280 285
Gly Leu Leu Ser Ala Val Pro Tyr Val Ala Gly Cys Val Phe Met Leu290 295 300
Ala Cys Gly Arg Ser Ser Asp Arg Gln Arg Glu Arg Arg Trp His Leu305 310 315 320
Cys Val Pro Leu Leu Met Ala Ala Ile Gly Ile Ala Ile Ala Ala Ile325 330 335 Ala Pro Glu Gln Ala Leu Pro Val Met Ala Gly Leu Val Leu Ala Gly340 345 350 Met Gly Ala Ser Ala Ala Leu Pro Met Phe Trp Gln Leu Pro Pro Ala355 360 365 Phe Leu Asn Ala Arg Thr Gln Ala Ala Gly Ile Ala Leu Ile Ser Ser370 375 380 Leu Gly Ser Ile Ala Ser Phe Phe Thr Pro Tyr Phe Ile Gly Trp Val385 390 395 400 Arg Asp Thr Thr His Ser Ala Ser Leu Ala Leu Tyr Val Leu Ala Val405 410 415 Phe Ile Ala Leu Gly Gly Leu Leu Val Leu Arg Thr Gln Ala Ala Ile420 425 430 Val Asn Pro 435 <210> 41 <211> 423 <212> DNA <213> Pseudpmonas putida U <220> <221> CDS <222> (1) .. (423) <223> hpaR1 <400> 41 atg acc aca ccg aga ccc tcc ctg acc ctg acc ttg ctg cag gcg cgcMet Thr Thr Pro Arg Pro Ser Leu Thr Leu Thr Leu Leu Gln Ala Arg1 5 10 15 48 gaa gcc acc atg gcg ttc ttc cgc ccg gcg ctg aat gcc cat gac ctgGlu Ala Thr Met Ala Phe Phe Arg Pro Ala Leu Asn Ala His Asp Leu20 25 30 96 acc gag cag caa tgg cgg gta atc cgt atc ctg cgc cag caa ggc gagThr Glu Gln Gln Trp Arg Val Ile Arg Ile Leu Arg Gln Gln Gly Glu35 40 45 144 ctg gaa agc cat cag ttg gcg gag ctg gcc tgt atc ctc aaa ccc agtLeu Glu Ser His Gln Leu Ala Glu Leu Ala Cys Ile Leu Lys Pro Ser50 55 60 192 atg agc ggg gtg ctc aag cgc ctg gag cgt gac ggc atc gta gcg cggMet Ser Gly Val Leu Lys Arg Leu Glu Arg Asp Gly Ile Val Ala Arg65 70 75 80 240 cgc aag tcg ccg gag gac cag cgc cgg gtg ttc atc agc ctg acc gagArg Lys Ser Pro Glu Asp Gln Arg Arg Val Phe Ile Ser Leu Thr Glu 28885 90 95
gcc ggc cag caa gcg ttt ctg gcg atg agc gag gag atg acc cgc aacAla Gly Gln Gln Ala Phe Leu Ala Met Ser Glu Glu Met Thr Arg Asn100 105 110 336 tac gac aag atc ctc gcc cag ttt ggc gat gac aag ctg cag cag ctgTyr Asp Lys Ile Leu Ala Gln Phe Gly Asp Asp Lys Leu Gln Gln Leu115 120 125 384 atg cag ctg ctg ggt gaa atg aag aag atc aaa ccc tgaMet Gln Leu Leu Gly Glu Met Lys Lys Ile Lys Pro130 135 140 423 <210> <211> <212> <213> <400> 42 140 PRT Pseudpmonas putida U 42 Met Thr Thr Pro Arg Pro Ser Leu Thr Leu Thr Leu Leu Gln Ala Arg1 5 10 15 Glu Ala Thr Met Ala Phe Phe Arg Pro Ala Leu Asn Ala His Asp Leu20 25 30 Thr Glu Gln Gln Trp Arg Val Ile Arg Ile Leu Arg Gln Gln Gly Glu35 40 45 Leu Glu Ser His Gln Leu Ala Glu Leu Ala Cys Ile Leu Lys Pro Ser50 55 60 Met Ser Gly Val Leu Lys Arg Leu Glu Arg Asp Gly Ile Val Ala Arg65 70 75 80 Arg Lys Ser Pro Glu Asp Gln Arg Arg Val Phe Ile Ser Leu Thr Glu85 90 95 Ala Gly Gln Gln Ala Phe Leu Ala Met Ser Glu Glu Met Thr Arg Asn100 105 110 Tyr Asp Lys Ile Leu Ala Gln Phe Gly Asp Asp Lys Leu Gln Gln Leu115 120 125 Met Gln Leu Leu Gly Glu Met Lys Lys Ile Lys Pro130 135 140 <210> <211> <212> <213> <220> <221> <222> <223> 43 423 DNA Pseudomonas putida U CDS (1) .. (423) hpaR2<400> 43 atg acc aag acg caa cct tcg ctc acg cta agc ctg ttg cag gcc cga 48 Met Thr Lys Thr Gln Pro Ser Leu Thr Leu Ser Leu Leu Gln Ala Arg1 5 10 15
gaa gcc gcg atg gca ttt ttc agg ccg ctg ttg aac cag cac gac ctg 96 Glu Ala Ala Met Ala Phe Phe Arg Pro Leu Leu Asn Gln His Asp Leu
acc gag cag caa tgg cgg gta atc cgc atc ctc aag cag cac ggc gag 144 Thr Glu Gln Gln Trp Arg Val Ile Arg Ile Leu Lys Gln His Gly Glu
ctg gag aat tat cag ttg gcg gaa ctg gcc tgc atc ctc aag ccg agc 192 Leu Glu Asn Tyr Gln Leu Ala Glu Leu Ala Cys Ile Leu Lys Pro Ser
50 55 60
atg acc ggg gta ctg ggg cgc ctg gag cga gac ggg ctg gtg cgg cgg 240 Met Thr Gly Val Leu Gly Arg Leu Glu Arg Asp Gly Leu Val Arg Arg65 70 75 80
cag aag gcc gcg cag gac cag cga cgg gtg ttc gtc agc ctg acc gaa 288 Gln Lys Ala Ala Gln Asp Gln Arg Arg Val Phe Val Ser Leu Thr Glu
85 90 95
aga ggg gag gcg tgc ttt gcc tcg atg aag gaa ggc atg gag gcc aac 336 Arg Gly Glu Ala Cys Phe Ala Ser Met Lys Glu Gly Met Glu Ala Asn
100 105 110
tac cag aag att cag gcg cag ttt ggt gaa gag aag ctg cag cag ctg 384 Tyr Gln Lys Ile Gln Ala Gln Phe Gly Glu Glu Lys Leu Gln Gln Leu
115 120 125
atg ggg ttg ttg aat gac ctg aag cgc atc gcg cca taa 423 Met Gly Leu Leu Asn Asp Leu Lys Arg Ile Ala Pro
130 135 140
<210> 44
<211> 140
<212> PRT
<213> Pseudomonas putida U
<400> 44
Met Thr Lys Thr Gln Pro Ser Leu Thr Leu Ser Leu Leu Gln Ala Arg1 5 10 15
Glu Ala Ala Met Ala Phe Phe Arg Pro Leu Leu Asn Gln His Asp Leu20 25 30
Thr Glu Gln Gln Trp Arg Val Ile Arg Ile Leu Lys Gln His Gly Glu35 40 45
Leu Glu Asn Tyr Gln Leu Ala Glu Leu Ala Cys Ile Leu Lys Pro Ser50 55 60
Met Thr Gly Val Leu Gly Arg Leu Glu Arg Asp Gly Leu Val Arg Arg 65 70 75 80
Gln Lys Ala Ala Gln Asp Gln Arg Arg Val Phe Val Ser Leu Thr Glu85 90 95
Arg Gly Glu Ala Cys Phe Ala Ser Met Lys Glu Gly Met Glu Ala Asn100 105 110
Tyr Gln Lys Ile Gln Ala Gln Phe Gly Glu Glu Lys Leu Gln Gln Leu115 120 125
Met Gly Leu Leu Asn Asp Leu Lys Arg Ile Ala Pro130 135 140
<210> 45
<211> 12722
<212> DNA
<213> Pseudomonas putida U
<220>
<221> misc_feature
<222> (1) .. (12722)
<223> cluster hpa
<400> 45 atgaccacac cgagaccctc cctgaccctg accttgctgc aggcgcgcga agccaccatg 60 gcgttcttcc gcccggcgct gaatgcccat gacctgaccg agcagcaatg gcgggtaatc 120 cgtatcctgc gccagcaagg cgagctggaa agccatcagt tggcggagct ggcctgtatc 180 ctcaaaccca gtatgagcgg ggtgctcaag cgcctggagc gtgacggcat cgtagcgcgg 240 cgcaagtcgc cggaggacca gcgccgggtg ttcatcagcc tgaccgaggc cggccagcaa 300 gcgtttctgg cgatgagcga ggagatgacc cgcaactacg acaagatcct cgcccagttt 360 ggcgatgaca agctgcagca gctgatgcag ctgctgggtg aaatgaagaa gatcaaaccc 420 tgacgcgcca ggcgtcagcg gttgagtgac agcgagtctt ccagcacttt cagcagtgct 480 gccgcgcgcc gctcataggc gtcggggcct gcgtacatca gctctacata caggctgtcg 540 atgatgccca ggtaggcatc ggcatacagc gccaggcggc tgtgctgctc atgcgcccag 600 ccgtggcgag cttgcagggc cacgctgaac ccttcgcgta tgccgtccag gtactgttca 660 aagcccgaag tgacaatcgg cttgatgccc gccgggggca ggaacgccgt gcgcaacacg 720 aagcgcagtt gggccgagtc gcgataacgt tcggccaggt gcagggccag ccagtgcccc 780 gccgccaggc cgtcgcgggc ttcctgcgca aagccgtgct cgacaaaggc cgtttcctgc 840 acaagcgcac gctggaacac ctccacgaac aaggcgtcct tgttggcgaa atgcgcatac 900 agcgatgcct tgcgcatgcc cgccaactgg gcgatttcgt tcagcgaaga ggcgtcataa 960 ccgtactcgg cgaagtggcc gacggcggca tcgcacacac gcaccgcaga aggggaaagg 1020 tctttcaaca gcatcactcc gtcaggggcg cggcgggccg cgcgcgtctt gagggtggga 1080 ttgtggtgat cgaaaatgca cgggtcaatg cttgtcgcaa ggcaatttcc gggcgccatg 1140 gaaagtgcaa tgttcccctc gtaacgtgca ttcctccacc caatcgccgc tcacatactg 1200 atcgcgtctt cgaatccaat aagaaagaga ccgctcatga aaaagccaaa ccccctgctg 1260 gaagacctga agtccgtcct gccgaccatt gccgccaatg ccatgcgtgc agagcaggac 1320 cgcagtgtgc cggcagagaa tatcgccttg ctgaaaagca tcggcatgca ccgcgctttc 1380 ttgcccaaac acttcggcgg catggaaatc accctgccgg agttcgccca gtgcatcgcc 1440 ttgctggcgg gggcctgcgc cagcacagcc tgggccatga gcctgctgtg cacccacagc 1500 caccagatgg caatgttctc gcccaagcta caacaggagg tgtggggtag cgacccggat 1560 gctaccgcca gcagcagtat cgcgccgttc ggccgcactg aagaggttga gggtggcgtg 1620 tcgttcagcg gcgaaatggg ctggagttcc ggttgcgacc acgccgaatg ggcgattctc 1680 ggtttccgcc gcaagaatgc cgaaggcgct caggattact gcttcgccat cctgcctcgc 1740 agtgactatg aaatccgtga tgactggtat gccgtgggca tgcgcggcag cggcagcaag 1800 accctgatcg tgcgtgatgc cttcgtgccc gagcaccgca tccagaaggc caaggacatg 1860 atggagggca agtcggcggg ctttggtttg taccccgaca gcaagatttt cttcgccccg 1920 tatcgcccgt attttgccag cggcttctcc acggtcagct tgggcgttgc cgagcgcatg 1980 ctggaggtgt tccgcgagaa aacccgcaac cgcgtgcgtg cctacaccgg tgctgccgtg 2040 ggcgccgcca ccccggcgct gatgcgcctg gccgagtcga cccatcaggt ggccgctgcc 2100 cgggcattgc tggaaaagag ctgggacgag attgccgagc acagtgcccg tcacgaatac 2160 ccgtcgcgtg gcacgctggc gttctggcgt accaaccagg gctacgccgt gaagatgtgc 2220 atccaggccg tcgaccgcct gatggaagcg gccggtggtg gcgcctggtt cgagagcaac 2280 gaactgcagc ggctgttccg cgattcgcac atgaccggtg cccatgccta caccgattac 2340 gacgtgtgtg cgcaaatcct cggccgcgag ctgatgggcc tggagcctga cccggcgatg 2400 gtctgagccg ccacttgttt tcacccatcc cctacaagca caacaacaaa cagggcaggc 2460 tgccaggcct gcccgggagt cttgcatgtc caaagaaacc ttcgattcac gtgccttccg 2520 ccgcgccctg ggcaacttcg ccaccggcgt gaccgtggtg actgccgccg gccccagtgg 2580 ccgcaaggtc ggcgttaccg ccaacagctt caactcggtg tcgctggacc cggcgctgat 2640 cctgtggagc atcgacaagc gctccaccag ccatgaagtg ttcgaagagg cctcgcactt 2700 tgccgtgaac attctggctg cggaccagat cgacctgtcc aacaactttg cccgcccgaa 2760 ggaagatcgc tttgccggta tcgactacga gaccggcact ggcggcgcgc cgttgttcgc 2820 cgattgcgcg gcgcgctttg agtgtgaaaa gtaccagcag ctggacggtg gcgatcactg 2880 gatcctggtg ggcaaggtag tggcctttga tgactttggc cgctcgccgc tgctgtatca 2940 ccagggcgcc tattcaatgg tgctgccgca tacccgcatg acccaaggcg cagaggggca 3000 ggcaccgagc agccacttcc agggccgcct gcagcacaac ctgtactacc tgatgaccca 3060 ggcgctgcgt gcctaccagg ctgactacca gccacgccag ctgtgtaccg gcctgcgcac 3120 cagcgaggca cgcatgctga tggtgctgga gaacgatgcg ggcctgagcc tgaacgacct 3180 gcaacgcgaa gtggcgatgc cggcgcggga gatcgaggaa gcggttgcca acctcaagcg 3240 caaagggctg attgccgatg acgaagggcg agtgcggcta tcggtgaagg gcgtggacga 3300 gaccgaggcg ttgtggacca ttgcccggca acagcaggac aaggtgttcg ggcagttcag 3360 tgaacagcag ctggagactt tcaagaccgt gctcaaggcc cttatcaaca tctgaacacg 3420 ctttgggatg gcaccggctg ttttggatgg caccggctgt gccggtgttc gcggatgaac 3480 ccgctcccac aggtccagcg ccagtagcaa cttcggcgcg gtacctgtgg gagcggcttt 3540 agccgcgaac accggcaaag ccggtgccat ccaaccagaa gcctcagtag gcaccacccc 3600 cggcactggg gactaccact gtatccttga acttccccgc cagctcgcgc agcccgcgca 3660 tcagcaccgt ggtatccaca cccaccgcca caaacgccgc acccagctcg atgtagcgtc 3720 gcgccagttt ctcgtccgcg ctgagaatgc cggcggcttt gcccgccttg ccaatgcgca 3780 cgattgcgtc ttcaatcgcc gcctgcacct ccgggtgccc ggggttgccg cgatgcccca 3840 tggccgcact caggtctgca ggcccgatga acacgccatc cacaccttcc actgcaacga 3900 tctcgtccag gttggccagg ccttccttgt tctcgatctg caccagcagg cacatttgct 3960 catcggcgtg gtccaggtaa ccggggaggg tgttccagcg cgaagcccgc gccagcgcgc 4020 tgcccacccc gcgaatgccc ttgggcgggt aatgcatggc cttgaccagt tgccgcgcct 4080 gttcggcagt ttccaccatc ggcaccagca aggtttgtgc gccgatatcc agcacctgct 4140 tgatcagcgc ggtatcgccg atcaccgggc ggatcactgc ctggctgggg tagggtgcca 4200 ccgcctgcaa ctgggcgagc atgccgcgca ggtcgttggg cgcgtgttcg ccgtcgatca 4260 gcagccagtc gaaaccggca ttggccgcca gctcggcgca gtaggcatcg gccaggccga 4320 gccacaggcc gatttgcggt tcaccgctgt gcaggcgtcg cttgaagtgg ttgatgggca 4380 tgtccatgag caggtcctta aacgaagcgg caggcgatgg agccgagcat gtcgtagtcg 4440 acgtggaagg tgtcacctgg gcgagcggcg accgggcggg tgaacgaacc cccaaggatg 4500 atctggccgg gctgcaaggt gacgtcgtac ggcgccagtt tgttggccag ccaggcaacg 4560 cctttggccg ggtggttgag cacggcagcg ctgaccccgg attcctcgat cacgccattg 4620 cggtagagca ccgccggcac tttgcgcagg tcgatttcgg tggggcgcac ggcccgcccg 4680 cccatcacca cgccggcatt ggcggcgttg tcggagatgg tgtcgaacac cttgcgggtg 4740 gcctgggttt gcgggtccac ctgctggatg cgcgcgtcaa tgatttccag cgccgggatc 4800 acccactcgg tggcgtccag cacatcaaac acggtgatgt tcgggccctt cagcggcttg 4860 ccgaggatga acgccaactc cacttcaacc cgcggcacga tgaagcgctc gaaggggatg 4920 tcgctgcctt cgtcgaacag catgtcgtcg agcaaggcgc cgtagtcggg ctcggtgatg 4980 ttcgacgata cctgcatggc gcgcgaggtc aggccgatct tgtggcccac cagcttgcgc 5040 ccggcggcga tcttttttgc cacccaggcg cgctggatgg cgtaggcgtc ttcgatggtg 5100 attgccggtt gctccagcga gaactggcgc acttgctcgc gggagcgttc ggcctggtcg 5160 aggcggtcgg cggcgtgctg gatgaaagcg ttgtctagca tgggggcggt ctcttgattc 5220 aagggttgac gatggcagcc tgggtgcgca acaccagcag gccgcccagg gcgatgaaga 5280 cggcgagtac gtacagagca aggctggcgc tgtgggtggt gtcgcgcacc cagccgatga 5340 agtagggcgt gaagaacgag gcgatgctgc ccagcgagct gatcagggca atgccggcgg 5400 cctgggtacg ggcgttgagg aacgccggcg gcagttgcca gaacatcggc agcgcagcgc 5460 tggcgcccat gccggccagc accaggccgg ccattaccgg cagcgcctgc tcgggggcaa 5520 tggccgcaat agcgatgccg atggcagcca tcagcagcgg tacgcacagg tgccagcggc 5580 gttcgcgttg gcggtcgctg gagcggccgc acgccagcat gaacacgcag ccggccacgt 5640 acggcacagc gctgagcagg ccgacactgg cgtcgctggc cacaccggca ctgtgaatca 5700 ggctgggcat ccagaacgca agggtattca ccgccagcat caccgcgcaa tacacggcca 5760 ccaacagcca cagcgcacgg cttgcgaaaa tggcgccgaa cgaggttacg ggcttgcgct 5820 gttcttcctc accgaattgc gcgcgcagcg tggctttctg ctgctcatcc agccagctca 5880 cccgctcgaa gtgctccggc aaaacggcca gtaccaccag gcccagcaac accaccggcg 5940 ccccttcgag caggaacatc cactgccagc cacgcagccc gcccgtgtcg tgcatgaagg 6000 ccagtatggc cccggacact ggcccgccga ccactccggc caacggcacg gcaatggcga 6060 acagcgcggt gacctgggcg cggcgcccgg ccgggtacca gcggttgagg taaaccagaa 6120 tgcccgggaa gaacccggcc tcggccgcgc ccagggcaaa gcgcaacagg tagaacgcgc 6180 tgctgctttc gatcagcagc atgctggtcg acaacagccc ccacaccacc atcaggcagg 6240 cgatccagcg gcgtgggcca acgcggtcga gcatcaggtt gctggggacg ccgaacagcg 6300 cataggcaat gaagaacagc ccggcaccca ggccatagac cgtgtcggac aaatgcaggt 6360 cctggctcat ctgcatcttg gcgaagccaa tgttgatgcg gtccaggtgg gcgaacaggt 6420 agcacaccag cagcagcggc atcagccgcc aggtgactgc ccgatgggta ctgtcggccc 6480 gttcaacgtg tgcctcgcgc ggcgaggctt gttcgagtgt gctcatgttt ttgtacttat 6540 tctgtaatga gtcggggagg gcgtggtttg agccggcgcg ctagcggttg aacagtgggt 6600 gcaaggtgct gtgcttggcg tcgtagacct gggcggtgct gtggtcgatc tgcacggtga 6660 tgccgatcgg gcgctgttgc agcagtgggt ccaggcgcgc tttcaacact gccagcaagc 6720 tgtcgcccac tgttttgtgc acctcggcgc tacggccggt agccatgcgc aggttggcgt 6780 acagaaagcc gtattcgcct ttgccgtcgg ccaccgcgca atgggcggcg gggtaggcca 6840 gcacgcgtgt accgccagtg gggaacacgg ctttgccttc ggcatcgcgc tgttcgagca 6900 tggtgtcggc cagggcgcgg cacaggccgg ggatgtcggc gtcggtttcc aggtcggggg 6960 tatagagcag aaccaggtgt ggcatggggg cctcctcggt gaggggcggc tggccacccg 7020 ccagggcgac cagccgcgaa cgggtgggtt acaggcggct ggtgggcacc acggcggccg 7080 ggttggcggc ctgggcagcg gggatggcac caccgtcctg cggggtgacc gggaagatcg 7140 cgttgatctg gccggtgccg gaagagccga agtagggcgt gaccacttcg gccttgccgt 7200 cgtaatcgga ccagcccagc gcacccagca gcattgccgt gtcgtgcatg aagccttcac 7260 cgtggccttt ggcggcgtac tccggcagca tcccgcagaa cgcttcccac tcgccgtcct 7320 gccacatttg caccacacgg tggtcgaggg tttcgaggaa cgggctccac accttggtgg 7380 caaagtccgg cgcctggccg ttctgcgcga agcggtgcga cagcgagccg ctggccagga 7440 acgccacggt gccgtcgtag tggtcttcta ctgccttgcg catggcccag cccaggcggg 7500 cactgtcggc caggtagtgc gaggtgcaca gggccgagac cgagaccact ttgaagtgct 7560 ggtcctggtt catgtagcgc atgggcacca gggtgccgta ttccggggcg agggtggtgg 7620 cgtggtgggc catggtttcg acgttgaagc ggttgcactc ctcggccagc agcttgccca 7680 gctcgggatt gccggggaat gcgtagggca tgttgctgat gaagtgcggc agttcgttgc 7740 tggtgtacac gccctcgaaa tgcggcccgc acagcacgtg gtagttggcg ttgaccagcc 7800 agtgcgtgtc gaacacgacg atggtgtcca cgcccagctc acggcaacgg cggctgattt 7860 cgtgatgccc gtcgatggcc gcctggcgaa agccttggcg cgggcctggc agttcggaca 7920 tgtacatgga cggtacatgg gtaatcttgg cagtgagagc gagtttgccc atgggggtct 7980 ccgataagac gctgttgttg ttttggggct gacccggtcc cttgtaggag cggccttgtt 8040 ccgggatggg gcgcacagcg gccccggcga tatctgcggc gaggctgaaa tccaggggcc 8100 gctgcgcgcc ccatcgcggg cacaaggccg ctcctacacc cgggcggtgt aaaccgcaca 8160 gagggttaga tgccccagcg aggaatgtgg tgattaccca tggaaataca cacgttcttg 8220 atctctgcaa agacctcgaa gctgtactgc ccgccctcac gcccggtacc ggaacctttc 8280 acgccgccga acggctggcg caggtcgcgt acgttctggc tgttgatgaa caccatgccg 8340 gcctcgatgc cacgggccag gcgatgggct ttgccgatgt cctgggtcca gatgtacgag 8400 gccaggccat actcggtgtc gttggccagt tgcagcgcct cggcttcgtc cttgaacggg 8460 atcaggcaca ccaccgggcc aaagatttct tcctgggcaa tgcgcatctt gttgttcacg 8520 tcggcgaata cggtgggctg gatgaactgc cccttggcca ggtgcgcagg caggttggcc 8580 gggcgctcca ggcccccggc gaccaggcgt gcaccttctt cgatgccaat gcggatgtac 8640 ccggtgacct tgtcatagtg ctgctgggtg atcatcgaac cgacctgggt tttcgggtcg 8700 gtcgggtcac ctacgatcag gcgcttggcg cgcgccgcaa actctgcgac aaactgcggg 8760 tacacgcttt cctggatgaa gatgcggctg ccggcggtgc agcgctcgcc gttcagcgag 8820 aagatggtga acagcgcggc gtccagcgca cgctcaaggt ctgcgtcttc gaagatcagc 8880 acgggcgact tgccgcccag ttccatcgag tactttttaa ggcctgcggt ctgcatgatc 8940 ttcttgccgg tggcggtacc gccggtgaag gaaatggcgc gcacatcggg gtggcggacc 9000 agggcatcgc cggcggtagc gccgtaaccc tggatcacgt tcagcacccc gttggggatg 9060 ccggcttcta ccgccaggcg gcccagttcg ttggcggtca gaggcgacag ctcgctcatc 9120 ttcagcacgg cggtgttgcc cagcgccagg cacggcgcag tcttccaggt agccgtcatg 9180 aacggcacgt tccatgggct taccaggccg cacacaccca ccggctggta cagggtgtag 9240 ttgagcatct ggtcgtcgac cgggtaggta tggccgtcca tgcgcgtgca cacttcggcg 9300 aagaagtcga agttgtgcga ggcacgcggg atcagcacgt tcttggtctg gtggatcggc 9360 aggccggtgt cgagggtttc cagctcggcg agtttcggca cgttctgctc aatcagctca 9420 cccagcttgc gcatcagccg ggcacgttcc ttggccgggg tgttggccca cttggggaag 9480 gcttccttgg ccgcagccac agcctgggcc acttcctcgg cgccgccgct ggcgacttcg 9540 cagatggcgt cgccggtggc cgggttgtag ttgacgaagg tgtctttgct ctcgacctca 9600 cggccgttga tccagtgctt gatcatgctg ctcatgcctt gttgttcttg aagaagtcag 9660 cttcgctgac gatacggttg accaggcgac cgacgccttc cacttccacc accacttcgt 9720 cacccggcac cacatcggcc aggccttctg gcgtgccggt ggcgatcatg tcgcccggtt 9780 gcagggtcat gaagctggag aagtattcga tgaggtgcgg gatgtcgaag atcatgtccg 9840 cggtggtgcc ttcctgcttc agctcaccgt tgatccaggt gcgcagcttc aggttgctga 9900 cgtctggcac atcggccgca tcgacgatcc acgggccgac cggggtggtg gcatcgcggt 9960 ttttcacccg caggttgggg cggtagtagt tttccaggta gtcgcggatg gcgtagtcgt 10020 tgcacacggt gtagccggca acgtaggcca gggcgtcctc acgcttgacg ttcttcgccg 10080 ctttgccgat caccgccacc agctcgcact cgtagtgcat gtattcgacg ttgtccgggc 10140 gccaggtgac ctggatgtgg ccggtgtagg tgcctggcga cttgatgaaa gccaacggtt 10200 cggtgggcgg cgcgaaggcc agctccctgg cgtggtcggc gtagttcagg cccagggcga 10260 acatgctgcc ggtggcgggt ggcagccagg tgacctggtc ctgatggacc aggcggccgt 10320 cggcaaggcg caggtgatcg tcttcgaccg tgacatcgtg ggcctggccg tcgaactgga 10380 tacgggcgtg tttcacaggt aattcctcac tcggcgacga tgtggttggt cagcttgccc 10440 aggccgtcga tctcgatgtc gacgcggtca cctggctgta catcgacgcg gccctcgggg 10500 gttccggtga tcaggatgtc gccggcgtgc agggtcatga actcgctgat ttcggcaatc 10560 agctgcgcca ccgtgcgtac gcagttggcg gtgttgttgt gctggcgcag ttcgccgttc 10620 acatacaggc gcaggcccag ggcatcgggg ttggccactt ggctggcggg caccagttca 10680 gggccgaccg ggcaaaaacc atcacggcac ttggccttga ctgcagggcg gtagtagctg 10740 gcttcgggca ggctcacttc gttgacgatg gtgtagcccg ccacatgctc cagggcatcg 10800 gccacgctga cgcggctggc gtccttgcca atcaccactc ccagcgccgg gccgggttgc 10860 acgcgctgca cgccggccgg gaataccacc tggccttcat gctggttgcg ggtgttcggg 10920 gtcttgacga acaacaccgg cttgaccggc agttgcttgt acggtgcttc cacgaacgcc 10980 gcttggtgct gctgcagcaa accctggtag ttcagcgcga cgccgaacag ggtgccgctg 11040 gcaacgtcaa gcagggcatg gctcatgctc ttctcctggc agtgcagggc ggtggccgtc 11100 ctgcggattt cgttaatgtg ttaatgttat agttaatatg ttaacgatgg tcaaggggtg 11160 gccagtggcg cctgccggca aggcaaggca ccatgggcca tcgtcaacag ggtcaagcga 11220 tttgcgagca agcagccatg agcgaccggc atccgatacc gaacatcaac attggccagg 11280 tttacgacca gcgctacagc gacagcgagg tgcattacga ccggctgggc aacctggcgg 11340 gctttttcgg gcgcaacatg ccggtgcacc ggcatgaccg gtttttccag gtgcattacg 11400 tgaagtcggg cacagtacgg gtgtatctgg atgaccagca gtacatcgag gccgggccga 11460 tgttcttcct cacgccaccc acggtggcgc acgcgttcgt caccgaagct gacagcgacg 11520 ggcatgtgct gacggtgcgc cagcaactgg tgtggcaatt gatcgaagcc gacgccagcc 11580 tgctgccggc gggcatgcag gtgcagccag cctgtgtggc gctgggcaac ctgccggccg 11640 aatacaaggc cgaggcgcag cgcctgcaag gctggctgga cgcgttgagt gacgagtttg 11700 ccacgcagca accgggtcgc gaggcggcgt tgcagtcgct gacccgcctg atcatgatca 11760 gcctgctgcg gctgtgcccc aactcgctgg aatcgacccc ggcgcggcat gaagacctga 11820 agatcttcca ccgtttcaat gccctgatcg aagcgcatta ccttgagcat tggccgctgg 11880 cccgctacgc gcagcagatt ggcgtgaccg aggcacggct gaacgatgtg tgccggcgca 11940 tcgccgactt gccatccaag cgcctggtgc tggaacggct gatgcaggag gccaagcgtt 12000 tgctgttgtt ttccggcagc acggccaacg aaatctgtta ccagctcggc ttcaaggatc 12060 cggcctattt cagccgcttc ttcaaccgct acgccaagct cacacccggg gagtaccgcc 12120 agcggcaggc agaattgcag tgaaatggcc atggcggctc acccgggtgc tgttgttgtt 12180 tacagcggat ggtcgcagcc cgcgcgccgg gcttgaatgg gttttccgtg gaacagattg 12240 cactttccat cgtgcatgcc cttaaattcg tgaattgaga aaaagccaca ggtttgacca 12300 tgaccaagac gcaaccttcg ctcacgctaa gcctgttgca ggcccgagaa gccgcgatgg 12360 catttttcag gccgctgttg aaccagcacg acctgaccga gcagcaatgg cgggtaatcc 12420 gcatcctcaa gcagcacggc gagctggaga attatcagtt ggcggaactg gcctgcatcc 12480 tcaagccgag catgaccggg gtactggggc gcctggagcg agacgggctg gtgcggcggc 12540 agaaggccgc gcaggaccag cgacgggtgt tcgtcagcct gaccgaaaga ggggaggcgt 12600 gctttgcctc gatgaaggaa ggcatggagg ccaactacca gaagattcag gcgcagtttg 12660 gtgaagagaa gctgcagcag ctgatggggt tgttgaatga cctgaagcgc atcgcgccat 12720 aa 12722
Patentes similares o relacionadas:
Procedimiento para la producción de aceituna en polvo, del 17 de Agosto de 2016, de TUBITAK: Procedimiento para la obtención de la aceituna en polvo, caracterizado porque se muelen aceitunas enteras deshuesadas y secas en presencia de […]
Materiales de unión a fosfato y sus usos, del 13 de Julio de 2016, de MEDICAL RESEARCH COUNCIL: Una composicion de hierro ferrico para uso en un metodo de tratamiento de hiperfosfatemia, en donde la composicion de hierro ferrico es un […]
Alimento para bebés y lactantes que contiene aceitunas enteras, del 12 de Febrero de 2016, de TUBITAK: Alimento para bebés o lactantes que contiene aceitunas libres de sal y no químicamente tratadas que cubre las siguientes etapas de procesamiento: a. las aceitunas se ponen […]
Aparato y procedimiento de desaceitado para la fabricación de chips de patata con bajo contenido en aceite, del 1 de Julio de 2015, de Frito-Lay Trading Company GmbH (100.0%): Aparato para desaceitar rodajas de patata, comprendiendo dicho aparato un transportador longitudinal alargado provisto de un extremo aguas arriba y […]
Composiciones ingeribles que contienen un aceite odorífero, del 8 de Abril de 2015, de R.P. SCHERER TECHNOLOGIES, LLC: Una cápsula de gel blando para la administración de un aceite odorífero y que tiene un olor reducido tras la ingestión, conteniendo dicha cápsula […]
Sistema mejorado de obtención de la anchoa para su consumo, del 29 de Enero de 2015, de MADRIGAL BALLESTER, Guillermo: El sistema mejorado de obtención de la anchoa para su consumo, se desarrolla en fases sucesivas hasta obtener una anchoa apta para su consumo, sin sal y baja en sal. Para […]
Método de preparación de un producto tratado con calor, del 14 de Enero de 2015, de NOVOZYMES A/S: Método para preparar un producto tratado con calor, que comprende las fases secuenciales de: a) proveer una materia prima que comprende hidrato […]
PROCEDIMIENTO PARA REDUCIR EL AMARGOR Y PICOR DE UN ACEITE DE OLIVA, del 29 de Septiembre de 2014, de ASOCIACIÓN EMPRESARIAL DE INVESTIGACIÓN CENTRO TECNOLÓGICO NACIONAL AGROALIMENTARIO "EXTREMADURA" (CTAEX): La invención es un procedimiento para reducir los atributos de "amargor" y "picante" en un aceite de oliva, que comprende introducir oxígeno en dicho […]