Sophie

Sophie

distrib > Mandriva > 2008.1 > i586 > by-pkgid > 81f8aa2eaa90f484dfc9928bfae72fc4 > files > 30

hmmer-2.3.2-4mdv2008.1.i586.rpm

ID   7LES_DROME     STANDARD;      PRT;  2554 AA.
AC   P13368;
DT   01-JAN-1990 (Rel. 13, Created)
DT   01-JAN-1990 (Rel. 13, Last sequence update)
DT   01-NOV-1997 (Rel. 35, Last annotation update)
DE   SEVENLESS PROTEIN (EC 2.7.1.112).
GN   SEV.
OS   Drosophila melanogaster (Fruit fly).
OC   Eukaryota; Metazoa; Arthropoda; Tracheata; Hexapoda; Insecta;
OC   Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;
OC   Ephydroidea; Drosophilidae; Drosophila.
RN   [1]
RP   SEQUENCE FROM N.A.
RC   STRAIN=CANTON-S;
RX   MEDLINE; 88282538.
RA   BASLER K., HAFEN E.;
RT   "Control of photoreceptor cell fate by the sevenless protein requires
RT   a functional tyrosine kinase domain.";
RL   Cell 54:299-311(1988).
RN   [2]
RP   SEQUENCE FROM N.A.
RC   STRAIN=OREGON-R;
RX   MEDLINE; 88329706.
RA   BOWTELL D.L.L., SIMON M.A., RUBIN G.M.;
RT   "Nucleotide sequence and structure of the sevenless gene of
RT   Drosophila melanogaster.";
RL   Genes Dev. 2:620-634(1988).
RN   [3]
RP   IDENTIFICATION OF FN-III REPEATS.
RX   MEDLINE; 90199889.
RA   NORTON P.A., HYNES R.O., RESS D.J.G.;
RT   "Sevenless: seven found?";
RL   Cell 61:15-16(1990).
CC   -!- FUNCTION: RECEPTOR FOR AN EXTRACELLULAR SIGNAL REQUIRED TO
CC       INSTRUCT A CELL TO DIFFERENTIATE INTO A R7 PHOTORECEPTOR. THE
CC       LIGAND FOR SEV IS THE BOSS (BRIDE OF SEVENLESS) PROTEIN ON THE
CC       SURFACE OF THE NEIGHBORING R8 CELL.
CC   -!- CATALYTIC ACTIVITY: ATP + A PROTEIN TYROSINE = ADP +
CC       PROTEIN TYROSINE PHOSPHATE.
CC   -!- SUBUNIT: MAY FORM A COMPLEX WITH DRK AND SOS.
CC   -!- SIMILARITY: BELONGS TO THE INSULIN RECEPTOR FAMILY OF TYROSINE-
CC       PROTEIN KINASES. SEVENLESS SUBFAMILY.
CC   -!- SIMILARITY: CONTAINS 7 FIBRONECTIN TYPE III-LIKE DOMAINS.
CC   -!- CAUTION: UNCLEAR WHETHER THE POTENTIAL MEMBRANE SPANNING REGION
CC       NEAR THE N-TERMINUS IS PRESENT AS A TRANSMEMBRANE DOMAIN IN THE
CC       NATIVE PROTEIN OR SERVES AS A CLEAVED SIGNAL SEQUENCE.
CC   --------------------------------------------------------------------------
CC   This SWISS-PROT entry is copyright. It is produced through a collaboration
CC   between  the Swiss Institute of Bioinformatics  and the  EMBL outstation -
CC   the European Bioinformatics Institute.  There are no  restrictions on  its
CC   use  by  non-profit  institutions as long  as its content  is  in  no  way
CC   modified and this statement is not removed.  Usage  by  and for commercial
CC   entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC   or send an email to license@isb-sib.ch).
CC   --------------------------------------------------------------------------
DR   EMBL; X13666; CAA31960.1; ALT_INIT.
DR   EMBL; J03158; AAA28882.1; -.
DR   PIR; A28912; TVFF7L.
DR   HSSP; P11362; 1FGI.
DR   FLYBASE; FBgn0003366; sev.
DR   PFAM; PF00041; fn3; 6.
DR   PFAM; PF00069; pkinase; 1.
DR   PROSITE; PS00107; PROTEIN_KINASE_ATP; 1.
DR   PROSITE; PS00109; PROTEIN_KINASE_TYR; 1.
DR   PROSITE; PS00239; RECEPTOR_TYR_KIN_II; 1.
DR   PROSITE; PS50011; PROTEIN_KINASE_DOM; 1.
KW   Transferase; Tyrosine-protein kinase; Transmembrane; ATP-binding;
KW   Phosphorylation; Receptor; Vision; Repeat.
FT   DOMAIN        1   2123       EXTRACELLULAR (POTENTIAL).
FT   TRANSMEM    102    122       POTENTIAL.
FT   TRANSMEM   2124   2147       POTENTIAL.
FT   DOMAIN     2148   2554       CYTOPLASMIC (POTENTIAL).
FT   DOMAIN      311    431       FIBRONECTIN TYPE-III.
FT   DOMAIN      436    528       FIBRONECTIN TYPE-III.
FT   DOMAIN      822    921       FIBRONECTIN TYPE-III.
FT   DOMAIN     1298   1392       FIBRONECTIN TYPE-III.
FT   DOMAIN     1680   1794       FIBRONECTIN TYPE-III.
FT   DOMAIN     1797   1897       FIBRONECTIN TYPE-III.
FT   DOMAIN     1898   1988       FIBRONECTIN TYPE-III.
FT   DOMAIN     2038   2046       POLY-ARG.
FT   DOMAIN     2209   2485       PROTEIN KINASE.
FT   NP_BIND    2215   2223       ATP (BY SIMILARITY).
FT   BINDING    2242   2242       ATP (BY SIMILARITY).
FT   MUTAGEN    2242   2242       K->M: INACTIVATES THE PROTEIN.
FT   MOD_RES    2380   2380       PHOSPHORYLATION (AUTO-) (BY SIMILARITY).
FT   CARBOHYD     30     30       POTENTIAL.
FT   CARBOHYD    129    129       POTENTIAL.
FT   CARBOHYD    481    481       POTENTIAL.
FT   CARBOHYD    505    505       POTENTIAL.
FT   CARBOHYD    617    617       POTENTIAL.
FT   CARBOHYD    647    647       POTENTIAL.
FT   CARBOHYD    966    966       POTENTIAL.
FT   CARBOHYD   1228   1228       POTENTIAL.
FT   CARBOHYD   1313   1313       POTENTIAL.
FT   CARBOHYD   1353   1353       POTENTIAL.
FT   CARBOHYD   1550   1550       POTENTIAL.
FT   CARBOHYD   1557   1557       POTENTIAL.
FT   CARBOHYD   1639   1639       POTENTIAL.
FT   CARBOHYD   1725   1725       POTENTIAL.
FT   CARBOHYD   1756   1756       POTENTIAL.
FT   CARBOHYD   1804   1804       POTENTIAL.
FT   CARBOHYD   1889   1889       POTENTIAL.
FT   CARBOHYD   1947   1947       POTENTIAL.
FT   CARBOHYD   2073   2073       POTENTIAL.
FT   VARIANT     392    392       M -> V.
FT   VARIANT    1668   1668       A -> V.
FT   VARIANT    1703   1703       N -> H.
FT   VARIANT    1730   1730       R -> K.
FT   VARIANT    1731   1731       G -> E.
FT   VARIANT    1741   1741       V -> M.
FT   VARIANT    2271   2271       R -> C.
FT   CONFLICT   1823   1823       E -> Q (IN REF. 2).
SQ   SEQUENCE   2554 AA;  287107 MW;  1143D891 CRC32;
     MTMFWQQNVD HQSDEQDKQA KGAAPTKRLN ISFNVKIAVN VNTKMTTTHI NQQAPGTSSS
     SSNSQNASPS KIVVRQQSSS FDLRQQLARL GRQLASGQDG HGGISTILII NLLLLILLSI
     CCDVCRSHNY TVHQSPEPVS KDQMRLLRPK LDSDVVEKVA IWHKHAAAAP PSIVEGIAIS
     SRPQSTMAHH PDDRDRDRDP SEEQHGVDER MVLERVTRDC VQRCIVEEDL FLDEFGIQCE
     KADNGEKCYK TRCTKGCAQW YRALKELESC QEACLSLQFY PYDMPCIGAC EMAQRDYWHL
     QRLAISHLVE RTQPQLERAP RADGQSTPLT IRWAMHFPEH YLASRPFNIQ YQFVDHHGEE
     LDLEQEDQDA SGETGSSAWF NLADYDCDEY YMCEILEALI PYTQYRFRFE LPFGENRDEV
     LYSPATPAYQ TPPEGAPISA PVIEHLMGLD DSHLAVHWHP GRFTNGPIEG YRLRLSSSEG
     NATSEQLVPA GRGSYIFSQL QAGTNYTLAL SMINKQGEGP VAKGFVQTHS ARNEKPAKDL
     TESVLLVGRR AVMWQSLEPA GENSMIYQSQ EELADIAWSK REQQLWLLNV HGELRSLKFE
     SGQMVSPAQQ LKLDLGNISS GRWVPRRLSF DWLHHRLYFA MESPERNQSS FQIISTDLLG
     ESAQKVGESF DLPVEQLEVD ALNGWIFWRN EESLWRQDLH GRMIHRLLRI RQPGWFLVQP
     QHFIIHLMLP QEGKFLEISY DGGFKHPLPL PPPSNGAGNG PASSHWQSFA LLGRSLLLPD
     SGQLILVEQQ GQAASPSASW PLKNLPDCWA VILLVPESQP LTSAGGKPHS LKALLGAQAA
     KISWKEPERN PYQSADAARS WSYELEVLDV ASQSAFSIRN IRGPIFGLQR LQPDNLYQLR
     VRAINVDGEP GEWTEPLAAR TWPLGPHRLR WASRQGSVIH TNELGEGLEV QQEQLERLPG
     PMTMVNESVG YYVTGDGLLH CINLVHSQWG CPISEPLQHV GSVTYDWRGG RVYWTDLARN
     CVVRMDPWSG SRELLPVFEA NFLALDPRQG HLYYATSSQL SRHGSTPDEA VTYYRVNGLE
     GSIASFVLDT QQDQLFWLVK GSGALRLYRA PLTAGGDSLQ MIQQIKGVFQ AVPDSLQLLR
     PLGALLWLER SGRRARLVRL AAPLDVMELP TPDQASPASA LQLLDPQPLP PRDEGVIPMT
     VLPDSVRLDD GHWDDFHVRW QPSTSGGNHS VSYRLLLEFG QRLQTLDLST PFARLTQLPQ
     AQLQLKISIT PRTAWRSGDT TRVQLTTPPV APSQPRRLRV FVERLATALQ EANVSAVLRW
     DAPEQGQEAP MQALEYHISC WVGSELHEEL RLNQSALEAR VEHLQPDQTY HFQVEARVAA
     TGAAAGAASH ALHVAPEVQA VPRVLYANAE FIGELDLDTR NRRRLVHTAS PVEHLVGIEG
     EQRLLWVNEH VELLTHVPGS APAKLARMRA EVLALAVDWI QRIVYWAELD ATAPQAAIIY
     RLDLCNFEGK ILQGERVWST PRGRLLKDLV ALPQAQSLIW LEYEQGSPRN GSLRGRNLTD
     GSELEWATVQ PLIRLHAGSL EPGSETLNLV DNQGKLCVYD VARQLCTASA LRAQLNLLGE
     DSIAGQLAQD SGYLYAVKNW SIRAYGRRRQ QLEYTVELEP EEVRLLQAHN YQAYPPKNCL
     LLPSSGGSLL KATDCEEQRC LLNLPMITAS EDCPLPIPGV RYQLNLTLAR GPGSEEHDHG
     VEPLGQWLLG AGESLNLTDL LPFTRYRVSG ILSSFYQKKL ALPTLVLAPL ELLTASATPS
     PPRNFSVRVL SPRELEVSWL PPEQLRSESV YYTLHWQQEL DGENVQDRRE WEAHERRLET
     AGTHRLTGIK PGSGYSLWVQ AHATPTKSNS SERLHVRSFA ELPELQLLEL GPYSLSLTWA
     GTPDPLGSLQ LECRSSAEQL RRNVAGNHTK MVVEPLQPRT RYQCRLLLGY AATPGAPLYH
     GTAEVYETLG DAPSQPGKPQ LEHIAEEVFR VTWTAARGNG APIALYNLEA LQARSDIRRR
     RRRRRRNSGG SLEQLPWAEE PVVVEDQWLD FCNTTELSCI VKSLHSSRLL LFRVRARSLE
     HGWGPYSEES ERVAEPFVSP EKRGSLVLAI IAPAAIVSSC VLALVLVRKV QKRRLRAKKL
     LQQSRPSIWS NLSTLQTQQQ LMAVRNRAFS TTLSDADIAL LPQINWSQLK LLRFLGSGAF
     GEVYEGQLKT EDSEEPQRVA IKSLRKGASE FAELLQEAQL MSNFKHENIV RLVGICFDTE
     SISLIMEHME AGDLLSYLRA ARATSTQEPQ PTAGLSLSEL LAMCIDVANG CSYLEDMHFV
     HRDLACRNCL VTESTGSTDR RRTVKIGDFG LARDIYKSDY YRKEGEGLLP VRWMSPESLV
     DGLFTTQSDV WAFGVLCWEI LTLGQQPYAA RNNFEVLAHV KEGGRLQQPP MCTEKLYSLL
     LLCWRTDPWE RPSFRRCYNT LHAISTDLRR TQMASATADT VVSCSRPEFK VRFDGQPLEE
     HREHNERPED ENLTLREVPL KDKQLYANEG VSRL
//