current user: public

If you have questions about the server, please let us know.

Query: [A] COG5107 Pre-mRNA 3`-end processing (cleavage and polyadenylation) factor, from VFDB

Results of FFAS03 search in VFDB
Master-slave alignment(slide right to see more) does not show gaps in the query sequence, use ali links to display alignment between query and templates.
    .   10    .   20    .   30    .   40    .   50    .   60    .   70    .   80    .   90    .  100    .  110    .  120    .  130    .  140    .  150    .  160    .  170    .  180    .  190    .  200    .  210    .  220    .  230    .  240    .  250    .  260    .  270    .  280    .  290    .  300    .  310    .  320    .  330    .  340    .  350    .  360    .  370    .  380    .  390    .  400    .  410    .  420    .  430    .  440    .  450    .  460    .  470    .  480    .  490    .  500    .  510    .  520    .  530    .  540    .  550    .  560    .  570    .  580    .  590    .  600    .  610    .  620    .  630    .  640    .  650    .  660    .  670    .
# Score Template Links and tools%idFirst MSSSTTPDLLYPSADKVAEPSDNIHGDELRLRERIKDNPTNILSYFQLIQYLETQESYAKVREVYEQFHNTFPFYSPAWTLQLKGELARDEFETVEKILAQCLSGKLENNDLSLWSTYLDYIRRKNNLITGGQEARAVIVKAFQLVMQKCAIFEPKSSSFWNEYLNFLEQWKPFNKWEEQQRIDMLREFYKKMLCVPFDNLEKMWNRYTQWEQEINSLTARKFIGELSAEYMKARSLYQEWLNVTNGLKRASPINLRTANKKNIPQPGTSDSNIQQLQIWLNWIKWERENKLMLSEDMLSQRISYVYKQGIQYMIFSAEMWYDYSMYISENSDRQNILYTALLANPDSPSLTFKLSECYELDNDSESVSNCFDKCTQTLLSQYKKIASDVNSGEDNNTEYEQELLYKQREKLTFVFCVYMNTMKRISGLSAARTVFGKCRKLKRILTHDVYVENAYLEFQNQNDYKTAFKVLELGLKYFQNDGVYINKYLDFLIFLNKDSQIKTLFETSVEKVQDLTQLKEIYKKMISYESKFGNLNNVYSLEKRFFERFPQENLIEVFTSRYQIQNSNLIKKLELTYMYNEEEDSYFSSGNGDGHHGSYNMSSSDRKRLMEETGNNGNFSNKKFKRDSELPTEVLDLLSVIPKRQYFNTNLLDAQKLVNFLNDQVEIPTVESTKSGLast
1 -155.000[A] COG5107 Pre-mRNA 3`-end processing (cleavage and polyadenylation) factor  ali  100  1MSSSTTPDLLYPSADKVAEPSDNIHGDELRLRERIKDNPTNILSYFQLIQYLETQESYAKVREVYEQFHNTFPFYSPAWTLQLKGELARDEFETVEKILAQCLSGKLENNDLSLWSTYLDYIRRKNNLITGGQEARAVIVKAFQLVMQKCAIFEPKSSSFWNEYLNFLEQWKPFNKWEEQQRIDMLREFYKKMLCVPFDNLEKMWNRYTQWEQEINSLTARKFIGELSAEYMKARSLYQEWLNVTNGLKRASPINLRTANKKNIPQPGTSDSNIQQLQIWLNWIKWERENKLMLSEDMLSQRISYVYKQGIQYMIFSAEMWYDYSMYISENSDRQNILYTALLANPDSPSLTFKLSECYELDNDSESVSNCFDKCTQTLLSQYKKIASDVNSGEDNNTEYEQELLYKQREKLTFVFCVYMNTMKRISGLSAARTVFGKCRKLKRILTHDVYVENAYLEFQNQNDYKTAFKVLELGLKYFQNDGVYINKYLDFLIFLNKDSQIKTLFETSVEKVQDLTQLKEIYKKMISYESKFGNLNNVYSLEKRFFERFPQENLIEVFTSRYQIQNSNLIKKLELTYMYNEEEDSYFSSGNGDGHHGSYNMSSSDRKRLMEETGNNGNFSNKKFKRDSELPTEVLDLLSVIPKRQYFNTNLLDAQKLVNFLNDQVEIPTVESTKSG 677
2 -95.000[A] KOG0128 RNA-binding protein SART3 (RRM superfamily)  ali follow..  11  1MSDVDMES------GSDDSGMEDLDEEIQKIKQKMIDDSQSVVLANQLLILLRKNGDFDELDIKRRQFVEWAPLNPLNWKNWIEDFQNRKPEPSVAEVEEMFEKALFDENDVTIWVERAMYAYKVAN-DKNKKEDFKFCRDVCSKALENLGTRYDSGGHIWLIFLEYEMSYLKNNAPDYQRLADQVFALFERALHCPTDQLEDVYVLAEQFCTEFKQ---HHKLEELKKTYNSTMRQKEQLSKFEELIQQ----------------------EETKKQGLKQFFDHEKKSGI-------PSRIKMAHERLVSELDDDEEAWIAYGAWADIEQVAVKVYSRALRHCPYSFVLHQQALLAFERDRPNEEIDALWERARSNVINSAEEGRSLYRTYAFLLRRRIHLTGSS-------------DYSPMAEVFDEGAALLREWFSMAWDTTADYRQMQAYFYASLMKNMDKCRNIWNDILASFGRFAGKWIEAVRLERQFGDKENARKYLNKALNSVS--DNINEIYMYYVQFEREEGTLAELDLVLEKVNSQVAHRAIRPQKKVSEKPAPAPKSKQDHIQKRTSGGEPIVKKVKGDDGGFKAPLPPSNAKSSSAVSSSNASSTPAPGSFAVQK-EDARTIFVSNLDFT-------TTEDEIRQAIEGVASIRFARKANS. 627
3 -90.300[A] KOG1914 mRNA cleavage and polyadenylation factor I complex, subunit RNA14  ali follow..  22  2.......................SGLSMRNPERRIETNPFDVDAWNLLLREHQS-RPIDQERDFYESLVKQFPNSGRYWKAYIEHELRSKNFENVEKLFSRCLVSVL---NIDLWKCYIHYVFETKGQR---DQYREEMAKAYDFALEKVGMD-VQAYSIFTEYIAFLKKVPAVGQYAENQRITAVRKIYQKALATPMHNLELIWNDYCTYEKAINITLAEKLIAERGKEYQNARRVEKDLQQMTRGLNRQAVSVPP----------KGTATEFKQVELWKNLIAWEKTNPLQTEEGQHARRVVYTYEQSLLCLGYYPDIWYEAAMFLQEALETISLYERAITGLKESKLLYFAYADFQEEHKQFEAVKNIYDRLLG-----------------------------IEHINPTLTYVQLMRFIRRSEGPNNARLVFKRAREDKR-TGYQVFVAAALLEYNCMKDKEVAIRVFKLGLKKYENEPEFGLAYADFLSNLNEDNNTRVVFERILTSSKPADKSIRIWDRFLDFESCVGDLASILKVEKRRKTAYEEANHSMLVIDRYKFMDLMPCSGEQLKLIGYNALKGTESIAGPSFVGSKNVPPDISQMIPFKPRVNCTASFHPVPGGVFPPPQSVAHLMSLLPPPTCFIGPFINVELLCNMINNM-QLPNVSYPKS. 658
4 -84.400[A] KOG1258 mRNA processing protein  ali follow..  13  2...............DYGDIYIAEETEWDKYNRQINKNPDDFDAWEGLVRASEHKQAINTLRSVYDRFLGKYPLLFGYWKKYADFEFFVAGAEASEHIYERGIAGIPH--SVDLWTNYCAFKMETN-------GDANEVRELFMQGANMVGLDF-LSHPFWDKYLEFE---------ERQERPDNVFQLLERLIHIPLHQYARYFERFVQVSQSQPIQQLLPPDVLASIRADVTRRIYNIHLQIFQKVQLETAKRWTFESEIKRPYFHVKELDEAQLVNWRKYLDFEEVEGD-------FQRICHLYERCLITCALYDEFWFRYARWMSAQNDVSIIYERASCIFASRPGIRVQYALFEESQGNIASAKAIYQSILTQLPGNLEAVLG---------------------------WVGLERRNAPNYDLTNAHAVLRSIINCNTGITEVLITEDIKLVWKIEGDIELARNMFLQNAPALLDCRHFWISFLRFELEQPHHARVSNVMEMIRNKTRPPRTIMDLTKLYMEYLCHQSNDPSVLQEYLLIRDVFGPFSVRESHWKKLDEGQDLKQVSTRLLSTNGHPGISVNEAKIKSGESPYEKYYRLQGVVETVTSGVAANGSS....................................................... 612
5 -65.600[D] KOG1915 Cell cycle control protein (crooked neck)  ali follow..  11  1MASGKDSDRNLGYMTRKDAELKLPRMTQVK-EARERQEAEFRPPNQTITDSAELSDYRLRRRKEFEDQIRRARLNTQVWVKYADFEMKNKSVNEARNVWDRAVSLLPR--VDQLWYKFIHMEEKLGN--------IAGARQILERWIHC-----SPDQQAWLCFIKFE---------LKYNEIECARSIYERFVLC--HPKVSAYIRYAKFEMKHGQVELAMKVFERAKKELDDEEAEILFVAFAEFEEQYKFALDQI-------------PKGRAENLYSKFVAFEKQNGDKEGIEDIIGKRRCQYEDEVRKNPLNYDSWFDFVRLEETVDRIREIYERAVANVPPPIYLWINYAFFAEVTEDVESTRDVYRACLKLIPHSKFSFAK--------------------------IWLLAAQHEIRQLNLTGARQILGNAIGKAPKD--KIFKKYIEIELQL-RNIDRCRKLYERYLEWSPGNCYAWRKYAEFEMSLAETERTRAIFELAISQP-ALDMPELLWKTYIDFEISEGELERTRALYERLLDRTKHCKVWVDFAKFEASAAEHKEDEEEEDAIERKKDGIKRAREIFDRANTYNKDSTPELKEERAMLLEDWLNMETGFGKLGDVRVVQSKLPKKVKKRKLYLFPEESETTSLKILEAAHKWKKQKVKA.. 653
6 -55.600[A] KOG2047 mRNA splicing factor  ali follow..  11  4..................SKDLYPSQEDLLYEEELLRNQFSLKLWWRYLIA-KAESPFKKRFIIYERALKALPGSYKLWYAYLNLPVTHPQYDSLNNTFERGLVTMHK--MPRIWVMYLQTLTVQQLI--------TRTRRTFDRALCAL--PVTQHDRIWEPYLVFVSQNGIPIETSKSERWQESAERLASVLNIKGKTKHKLWLELCELLVHHANVISGLNVDAIIRGGIRKFMLWTSLADYYIRKNLLEKARDIYEEGMMK-----VVTVRDFSVIFDVYSRFEESTVAKKMEELMNRRPALANSVLLRQNPHNVEQWHRRVKIFEGNAKQILTYTEAVRAVGKPHTLWVAFAKLYENHKDLVNTRVIFDKAVQVNYKTVDHL--------------------------ASVWCEWAEMELRHKNFKGALELMRRATAVPTVEVRRLWSFYVDLEESL-GTLESTRAVYEKILDLRIATPQIIMNYAFLLEENKYFEDAFKVYERGVKSMAPSDAVRTLYLQYAKLEEDYGLAKRAMKVYEEATKKVPEGQKLEMYE..................................................................................................................... 664
7 -47.400[A] KOG0495 HAT repeat protein  ali follow..  258VDPKGYLTDLQSMIPTYGGDINDIKKARLLLKSVRETNPNHPPAWIASARLEEVTGKVQMARNLIMRGCEMNIQSEDLWLEAARLQ----PPDTAKAVIAQAARHIPT--SVRIWIKAADLESET-----------KAKRRVFRKALEHI----PNSVRLWKAAVELE-NPDDARILLSRAVECCNTSVLNKAREN-IPTDRQIWTTAAKLEEANGNIHMVEKIIDRSLTSLTVNGVEINRDQFQEAIEAEKSGAVNCCQSIVKAVIGIGVEEEDRKQTWIDDAEFCAKENA-------FECARAVYAHALQIFPSKKSIWLRAAYFEKNHESLEALLQRAVAHCPKSEILWLMGAKSKWMAGDVPAARGILSLAFQANPNSEDAAVKLESENSEYERARRLLAKARGSAPTPRVMMKSARLEWALEKFDEALRLLEEAVEVFPDFP-KLWMMKGQIEEQQ-RRTDDAAATYTLGLKKCPTSIPLWILSANLEERKGVLTKARSILERGRLRNP---KVAVLWLEAIRVELRAGLKEIASTMMARALQECPNAGELWAEAIFMETKPQRKTKSVDALKKCEHDPHVLLAVSKLFWSEHKFSKCRDWFNRTVKIDPDLGDA-ELLHGTEAQQQEVLDRCISAEPTSKNIQNWQFKTPEVLRAVVRELSIP........ 912
8 -44.200[R] KOG2396 HAT (Half-A-TPR) repeat-containing protein  ali follow..  10  26................TRAEIAEIVKQRRKFEYRLKRPSPLKEDFIAYIDYEVKLDELARIVEIYRLATMRYKGDINLWFRYLEF-CKQKRHGRMKKALAQAIRFHPK--VAGVWIYAASWEFDRN-------LNVTAARALMLNGLRVC----SNSEDLWVEYLRMELDVTEKVDFLKEKGSNVLQTIYSGAVEAIPSSFDRFLEILEATDLAHSDEMRNTILSDLKRDFCNEPEYWNWLARHEMSGCISNEAGLEFANPQMQKAIQVFEEGLQTVSMFEIYINFGDENEISSLSNPIISHIINVYQKADETGCLTEELADEYVSLYLKLEKTHEAEKLCSEKFAGSAKLWLSRVSIEPSKADFQTVFELLSNALRKVPISESFAHQRTYLDKLVEMSILSATKSHGSDHVFSLASTVVKFVLETKGAHSARKIYKRFLALPGPSL-VLYKGCIEIETNLISGLSNARKLYDSAVASYGQDVELWKNYYSLETKLGTSETANGVYWRARKTLNESAD............................................................................................................................................................... 645
9 -40.600[R] KOG1124 FOG: TPR repeat  ali follow..  10  130LSNIAKKMPIFEPERSESSSSSSAAAAARAQERPLAVNL---DLSLYKAKVLARNFRYKDAEKILEKCIAYWPEDGRPYVALGKILSKQSKLAEARILYEKGCQSTQG-ENSYIWQCWAVLENRLGNV--------RRARELFDAATVA-----KKHVAAWHGWANLEI---------KQGNISKARNLLAKGLKFCGRN-EYIYQTLALLEAKAGRYEQARYLFKQATI-----------------------------------------CNSRSCASWLAWAQLEIQQER-------YPAARKLFEKAVQASPKNRFAWHVWGVFVGNVERGRKLLKIGHALNPRDPVLLQSLGLLEYKHSSANLARALLRRASELDPRHQP------------------------------VWIAWGWMEWKEGNTTTARELYQRALSIDANSASRCLQAWGVLEQRA-GNLSAARRLFRSSLNINSQSYVTWMTWAQLEEDQGDTERAEEIRNLYFQQ------RTEVVDDASWVTGFLDIIDPALDTVKRLLNFGQNNDN.......................................................................................................................... 578
10 -27.600[R] COG5191 Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat  ali follow..  26................NRDEINNIIKTRRVFEEKLARRQVKLNDFLSYIQYEINLETLRAKRHK------------RLNITGKITISDYAGPRKVLFLFLRATNKFFG--DVTLWLDYIHYAQKI--------KAVNIVGKICVAALQK-----PNNAELWVVACDHEF--------SINANVSAARALMNRALRL-NQENPVIWAAYFRLELSYMTKLFARILTGNISSKTENITNGVSEDTIGSLSSDTIQLPMVSMEEFLGSSSSEVRKNDSDLNISDDIGNISSKEQQTQKFANVLLQIILNSRKNLSL--------------------------------QNYVGFFVSVLDALFECFDVPVVQYMYQENIIGICNEHFEHFKNESGEIYGVLLHRWCFLEIFI-------------KLRGAYPSNDSGISGKGIFGLKKNDPRITSMVLTDP-GFVDDLQAIVEKYQSISSDFKTKKIFYSFFVKTLHAISSSSAAESSIALALHMLVINKLKILTFDDSLQEIYKEAQIQSGTFMSNATLS................................................................................................................................. 488
11 -23.100[B] KOG1156 N-terminal acetyltransferase  ali follow..  73LRNDLKSHVCWHVYGLLQRSDKKYDEAIKCYRNALKWDKDNLQILRDLSLLQIQMRDLEGYRETRYQLLQLRPAQRASWIGYAIAYHLLEDYEMAAKILEEFRKTQQDYEYSELLLYQNQVLREAGLY--------REALEHLCTYEKQ----ICDKLAVEETKGELLLQL---------CRLEDAADVYRGLQERNPENWAYY-ERLKIYEEAWTKYPRGLVPRRLPLNFLSGEKFKECLDKFLRMNFSKGCPPVFNTLRSLYKDKEKVAIIEELVVGYETSLKSCRLFNPNDDGKEEPPTTL---------------LWVQYYDKIGQPSIALEYINTAIESTPTLIELFLVKAKIYKHAGNIKEAARWMDEAQAL------------------------------DTADRFINSKCAKYMLKANLIKEAEEMCSKFTREGTSAVENLNEMQCMWFQTECAQAYKAMNKFGEALKKCHEIERHFIEITDDQFDFHTYCMRKITLRS--------------YVDLLKLEDVLRQFKAARIAIEIYLKLHDNPLTDENKEHEADTANMSDKELKKLRNKQRRAQKKAQIEEEKKNAEKEKQQRNQKKKKDDDDEEIGGPKEELIPEKLAKVETPLEEAIKFLTPLKNLVKNKIETEKFLLMLQSVKRAFAI...... 701
12 -20.900[DO] KOG1174 Anaphase-promoting complex (APC), subunit 7  ali follow..  17...............SNVRLLSSLLLTMSNNNPELFSPPQKYQLLVYHADSLFHDKEYRNAVSKYTMALQQ-PSEIEVKYKMAECYTMLKQDKDAIAILDGIPS---RQRTPKINMMLANLYKKAGRE--------RPSVTSYKEVLRQCPLALDAILGLLSLSVKGAEVASMTMNVQTVPNLDWLSVWIKAYAFVHTGDNSRAISTICSLEKKSLLRDNVDLLGSLADLYFRAGDNKNSVLKFEQAQM----------------------LDLYLIKGMDVY-------GYLLAREGRLEDVENLGCRLFNISDQHAEPWVVHSFYSKRYSRALYLGAKAIQLNSNSVQALLLKGAALRNMGRVQEAIIHFREAIRL------------------------------APCRLDCYEGLIECYLASNSIREAMVMANNVYKTLGANA-QTLTLLATVCLEDPVTQEKAKTLLDKALTQRPDYIKAVVKKAELLSREQKYEDGIALLRNALANQSDCVLHRILGDFLVA----VNEYQEAMDQYSIALSLDPNDQKSLEGMQKMEKEESPTDATQEEDVDDMEGSGEEGDLEGSDSEA................................................................................ 552
13 -20.700[T] KOG4162 Predicted calmodulin-binding protein  ali follow..  359........................AEALAVRDTVLSQSPEFRQARQHAMGATVRWGLVQLLNESFEKALKFSFGEQHVWRQYGLSLMAAEKHSHALRVLQESMKLTPSDPLPCLLASRLCY---------ESLETVKQGLDYAQQALKREVKGLRPSSQLFVGIGHQQLAIQSNLKSERDACHKLALDALERAVQFDGNDHLAEYY------------------------------------------------------------------------------------SLQYALLGQLAEALVHIRFALALRMEHAPCLHLFALLLRRPREALGVVEDALHEFPDNLQLLHVKAHLQLHLEDAETALGTVQHMLAVWRDVYEAQLAGEEEKHSDTKSGVHLAHSSQMSDQIEIWLLLADVYLRIDQPNEALNCIHEASQIYPLSHQIMFMRGQVHVYL--EQWFDAKQCFLNAVAANPNHTEALRALGEAHLVLGEPRLAEKMLKDAAK----DPSCPKIWFALGKVMEILGDFHASADCFATSLQLEPSCPVL......................................................................................................................... 851
14 -20.200[DO] KOG1155 Anaphase-promoting complex (APC), Cdc23 subunit  ali follow..  52.QRGSSSIRRRFSTNESISTPLPSVGFSQAATPLPEEDEAIDGDIYLLAKSYFDCREYRRASHMLRDQVSKKSLFLRYYALYLAGEKRKEENRELVSLERD-LSALRRTGAIDSFGLYL-------GVVLKEKGNESLARASLVESVNS-------YPWNWSAWSELQSLCTSIEILNSLNLNNHWMKEF--FLGNAYQELRMHTESLAKYEYLQGIFSFSNYIQAQTAKAQYSLREFDQVEIMFEELLR---------------------NDPYRVEDMDLY-------SNVLYAKEACAALSYLAHKVFLTDKYRPESCCNYYSLKGQHEKAVMYFRRALKLNKKYLSAWTLMGHEYVEMKNTPAAIDAYRRAVDI------------------------------NPTDYRAWYGLGQAYEMMGMPFYALHYFRKSIFFLPNDSRLWIAMAKCYQTEQLYMLEEAIKCYKRAVNCTDTEGIALNQLAKLHQKLGRNEEAAYYFEKDLERMDAEGLEGPALVFLATHFKNHKKFEEAEVYCTRLLDYSGPEKE.......................................................................................................................... 553
15 -20.100[R] KOG1129 TPR repeat-containing protein  ali follow..  5...................SDSTYVDELENEDMGLAEDQNVIAPNARPGTSFARPKTSAKGVNPILRPTTN-AGRPLSGVVRPQSSFKSGSMDQAVRTARTAKTARAVSSTS--------ARNMRLGTASMAAGADGEFVNLARLNIDKYAADPQVNRQLFEYVFY------------YLNDIRVAHQIAGTASKAAGFEDYYWKNQLAKCYLRLGMLQDATKQLQSSLEQKKLIETFALLSKAYNRVDQPMAA----------------------LKTYSAGLEVFPENARVQEALGEYDESVKLYKRVLDAESNNIEAIACVYYYGGKPELAMRYYRRILQMGVSSPELFLNIGLCCMAAQQFDFALSSILRAQST----------------------------MTDDVAADVWYNIGQILVDIGDLVSAARSFRIALSHDPDHSESLV-----ILKHREGKIDEARSLYSSATSKNPYMFEGNYNLGLVSFTQGKYHECRELIEKALA----------AFPEHEHCKKILNHLKPLYES....................................................................................................................................... 457
16 -19.900[U] KOG0547 Translocase of outer mitochondrial membrane complex, subunit TOM70/TOM72  ali follow..  121.SQRQAYAVQLKNRGNHFFTAKNFNEAIKYYQYAIELDPNEPVFYSNISACYISTGDLEKVIEFTTKALEIKPDHSKALLRRASANESLGNFTD----------AMFDLSVLSLNGDFDGASIEPMLERNLNKQAMKVLNENLSKDEGRG-SQVLPSNTSLASFFGIFDSHLEVSSVNTSSNYDTAYALLSDALQRLYSATDEGY------------LVANDLLTKSTDMYHSLLSANTVDDPLRENAALALCYT----------FHFLKNNLLDAQVLLQESINLHPTPALTLADKENSQEFFKFFQKAVDLNPEYPPTYYHRGQ-LQDYKNAKEDFQKAQSLNPENVYPYIQLACLLYKQGKFTESEAFFNETKLKFPTLPE------------------------------VPTFFAEILTDRGDFDTAIKQYDIAKRLEEVQEKIHVGIGPLITQLDEEKFNAAIKLLTKACELDPRSEQAKIGLAQLKLQMEKIDEAIELFEDSAILARTMDEK---FAEAAKIQKRLRADPIISAKMELTLARYRAKGML......................................................................................................................... 639
17 -18.300[DO] KOG1173 Anaphase-promoting complex (APC), Cdc16 subunit  ali follow..  109IQNVEVEVMTTSLINQPVDASSGCYLESNSVFGGEENHRNELLSSIYLMKVYEALDNRGMAMDFYVQALHKSIYCFEALEALVQHEMLMAWEEF-------------------------ELMHHLPLAQQSSEADAKFILKLYESRLK--------------KY----------YELISARNAEEMSPIVNPDILQFIKEFTARVQQSGSSDTQMPKVSAVKVPLTPSQFMSPAQKVLEDLKAPTFSLQTSLSKASSLIDASHRSMFDSSSRRRSRDHDTDTLIPIAECLDRVQRSDCDYKQCLKILNELLKVDPFHNTALTIQIA-NGDFNRLFYVAHKLVDRYPDKAISWYAVGCYYDMIGKSDPARRYLSKATAL------------------------------DRLYGPAWLAYGHSFANENEHEQAMAAYFKATQLMRGCHLPLL--YIGVECGLTKNLELAEKFFLQAMNIAPLDVYVLHELGVIKYEYEFFDGAATIFQCTVDIVKQRAKSEPLFINLGHSLRKVHKYEEALYNFQYALLLKPQDPAIEAFHKSLALNRDCIVTSTILKSCIEDLMDDSATIDEICSAALRDVAKNITANSRRVLNSDKFNGMKLKFDEEEEFANSDSNMVVEM................................... 714
18 -18.100[S] KOG4340 Uncharacterized conserved protein  ali follow..  10  1MAGLSGAQIPDGEFTALVYRLIRDARYAEAVQLLGQLHPELEQYRLYQAQALYKACLYPEATRVAFLLLDNPAYHSRVLRLQAAIKYSEGDLPGSRSLVEQLLSGEGGEESGG--------GQVNLGCLLYKEGQYEAACSKFSATLQASGYQPDLSYNLALAYYS-------------SRQYASALKHIAEIIERGIRQHPELGVGMTTEGFDVRSVGNTLVLHQLKAAIEYQLRNYEVAQETLTDMPPRAEEELDPVTLHNQALMNMDARPTEGFEKLQFLLQQNPFPP------ETFGNLLLLYCK------------------YEYFDLAADVLAENAHLTYKTPYLYDFLDALITCQTAPEEAFIKLDGLAGMLTEQLRRLTKQVQEARHNRDKKAVNEYDETMEKYIPVLMAQAKIYWNLENYPMVEKVFRKSVEFCNDHD-VWKLNVAHVLFMQENKYKEAIGFYEPIVKKHYDNAIVLANLCVSYIMTSQNEEAEELMRKIEKEEEQLSYDDIVNLVIGTLYCAKGNYEFGISRVIKSLEPYNKK............................................................................................................................ 540
19 -18.100[O] KOG0548 Molecular co-chaperone STI1  ali follow..  14...................SSGDFTTAINHFTEAIALAPTNHVLFSNRSAAHASLHQYAEALSDAKETIKLKPYWPKGYSRLGAAHLGLNQFELAVTAYKKGLDVDPTNEAL----------KSGLADAEASVARSRAAPNPFGDAFQG------------WTKLTSDPSTRGFLQQPDFVNMMQEIQKNPSSLNLYLKDQRVMQSLGVLLNVKFRPPPPQGDEAEVPESDMGQSSSNEPEVEKKREPEPEPEPEVTEEKEKKERKEKAKK----------------KELGNAAYKKKDFETAIQHYSTAIEIDDEDISYLTNRAA-MGKYNECIEDCNKAVERGRELRSDYKMVARALTRKGKMAKCSKDYEPAIEAFQKALTEHRNPDTLKRLNDAERAKKEWEQKDPKLGDEEREKGNDFFKEQKYPEAIKHYTEAIKRNPNDHKAYSNRAA---YTKLGAMPEGLKDAEKCIELDPTFSKGYSRKAAVQFFLKEYDNAMETYQAGLE----DPSNQELLDGVKRCVQQINKANRGDLTPEELKERQAKGMQDPEIQNILTDPVMRQVLSD....................................................................................................... 539
20 -17.600[D] KOG1126 DNA-binding cell division cycle control protein  ali follow..  82........KQGISAVEACRSNWRSIQPNINDSISSRGHPDASCMLDVLGTMYKKAGFLKKATDCFVEAVSINPYNFSAFQNLTAIGVPLDANNPYLTAMKGFEKSQTNATASVPEPSFLKKSKESSSSSNKFSVSESIANSYSNSSISAF---------------------TKWFDRVDASELPGSEKERHQSLKLSQSQTSKNLLAFNDAQKADSNNRDTSLKSHFVEPRTQALRPGARLTYKLREARSSKRGESTPQ--------SFREEDNNLMELLKLF-----GKGVYLLAQYKLREALNCFQSLPIEQQNTPFVLAITYFELVDYEKSEEVFQKLRDLSPSRVKDMEVFSTALWHLQKSVPLSYLAHETLET------------------------------NPYSPESWCILANCFSLQREHSQALKCINRAIQLDPTFEYAYTLQ----EHSANEEYEKSKTSFRKAIRVNVRHYNAWYGLGMVYLKTGRNDQADFHFQRAAE----NPNNSVLITCIGMIYERCKDYKKALDFYD..................................................................................................................................... 557
21 -17.200[U] KOG2376 Signal recognition particle, subunit Srp72  ali follow..  7................................ENPKTSTPAIEDLFTSLHKHIKDTKYEEAVKVADQVLSIVPTDEDAIRCKVVALIKDDKFDDYLIKDVKINGALSFPIDLGFHKAYCLYRENKL----------DEALVCLKGLERESKTLLLEAQILN-----------------CLGKVDACVDVYQKLNKSGIKLI-----EVNLVAALIRAGKASQVLESLKIRPTTTYQLAYNTACSLIENSNYVDAEQLLLTAMRIGQETLTEGDYSDDYIETQLAPISVQLAYVQQVLGQTQESKSSYVDIIKRNLADESLALAVNNLVKDISDGLRKFDLLKDKDSQNFAIYANRVLLLLHANKMDQARELCATLPG------------------------------MFPESVIPTLLQAAVLVRENKAAKAEELLGQCAENFPEKSKLVLLARAQI-AASASHPHVAAESLSK-IPDIQHLPATVATIVALRERAGDNDGATAVLDSAIRSMTDSNMLRILMPVAAAFKLRHGQEEEASRLYEEIVKNHNSTDALVGLVTTLARVNVEKAEAYEKQLKPLPEKTSGAKPIEGISAASLSQEEVKKEKVKRKRKPKYPKGFDLENSGPTPDPER-------LP.................................. 586
22 -16.500[R] KOG1128 Uncharacterized conserved protein, contains TPR repeats  ali follow..  251.....NDDTLLEQVAITEQGARVDGRTLNACQLSCL-----LWIARHESATHRHDVLVHERCSPVLDTVIAAR---RYWSIQAAALLARAELERGRRQVDRSCTQSE------------LVVKLQQGVDDPVLIKDRLLRTSYILASGSEALLILEKLEMWDGVIDCYKQL---------GQMDKAETLIRRLIEQKPNDLGDITRNLEYFTKELSDDRNARAHRSLGHLLLMDKKFEEAYKHLRRSLE-----------------------QPIQLGTWFNA-------GYCAWKLENFKESTQCYHRCVSLQPDHFEAWNNLSA-HGQKPKAWKLLQEALKYNYEHPNVWENYMLLSVDVGEFSQAIQAYHRLLDMNKRGADDEVLELIAQTLLRREAEISMDESEDKA---------NEAENRKEKEEMIKLLARISANHQTLSPKTLRVYALL-KKPSVLSSETRTEFEKYVRLLEKSLAAANGKLTWPKEEKLALEVVETAVRLAEDRLE------LAKFIASDTSVKEASAKVRLSLRGILTRLDKDSGSRVSGDETEKLQEIVEVAKSLLDS.................................................................................................. 784
23 -16.500[NU] COG3063 Tfp pilus assembly protein PilF  ali follow..  11  39............................................................................................................................................................................................................................................................................................................................KTQLAMEYMR-GQDYRQATASIEDALKSDPKNELAWLVRAEIYQYLKVNDKAQESFRQALSIKPDSAEINNNY--------------------------------LCGRLNRPAESMAYFDKALADPTYPTPYIANLNKGICSAKQGQFGLAEAYLKRSLAAQPQFPPAFKELARTKMLAGQLGDADYYFKKYQSRVEVLQADD------WKIAKALGNAQAAYEYEAQLQANFPYSEELQTVLTGQ.................................................................................................................. 253
24 -14.600[S] COG3898 Uncharacterized membrane-bound protein  ali follow..  58.........................................................................................................................................................................................LWWLIRSLWNSP--------YTISRYFRVRRRDRGYQALSTGMIAAGAGDGA-LARKKTKEAAKLIRSDQEPLIHLLEAQASLLEGDHEGARQKFESMLDDPEM-----RLLGLRGLYLEAERL-------------------GDRNAARHYAGRAAAVAPQLAWAAESTLEELTARGDWDGALKLVDAQKSTRQIERDAANRRRAVLLTAKAQSLADEANKLQPDFAPAAVAAAAALFKQNDVRKGSKILETAWRAEPHPE----IAELYTHARPGDAVLDRLNRAKKLQEMKKNHAESSMTVARAALDAQDFSTARSEAEAAIRMDRREGAYLLLADI---EEAETGDQGKVRQLLSKAVRAPRDPAEWRAPMERLGQLIDSRDEGTTVPVIEALAKPASEKPIDAIEPVAAGSETTGKGTDRSEDQVTPVTAAAFAAVPADAEAVEPAEELTRLPPGVDPDEEAEKSPRRFRLF............... 531
25 -14.500[V] KOG0624 dsRNA-activated protein kinase inhibitor P58, contains TPR and DnaJ domains  ali follow..  19FAGTAEEVAKHLELGSQFLARAQFADALTQYHAAIELDPKSYQAIYRRATTYLAMGRGKAAIVDLERVLELKPDFYGARIQRGNILLKQGELEAAEADFNIVLNHDSSNNDVQEKTALIEQHRQLRHQIKSAGGDCATVEEYINHIIEI----------------DASLYRMRAKCLEERGELKKAIHDMRIVSKLSTDSTDTMFETLYTVGDLEESLNVIRECLKLNPDHKSCYPFYKKLRKVVKSLESMKKKVE-------------NSDWMACLEEGQKTMKFDPTPSVQLNVFRITNRCQR----------------------AGHISEAIAECNEILNDDPSDADILCERAEAHILDEDYDSAIEDYQKATEVNPDHREAKEGLEHAKRLKTQAGKRYKILGVKRNASKREITKAYRKLAQKWHPDNFSDEEEKKKAEKKFIDIAAAKEVLQDEEKRRQFDQGVDPLDPEAQRQGGG................................................................................................................................................................................................... 460
26 -14.500[O] COG4235 Cytochrome c biogenesis factor  ali follow..  10  52TAQAPALLDRALDPKADPLNEEEMSRLALGMRTQLQKNPGDIEGWIMLGRVGMALGNASIATDAYATAYRLDPKNSDAALGYAEALTRSSDNRLGGELLRQLVRTDHSNIRVLSMYAFNAF----------EQQRFGEAVAAWEMMLKLLPANDTRRAVIERSIAQAMQHLSPQESK.................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................... 221
27 -14.300[U] COG5010 Flp pilus assembly protein TadD, contains TPR repeats  ali follow..  11  36RKELTTGSIPKLTRPVQSMNATELAAAAERIGQAYERNPKDREAGLNYANLLRMTGRNEQALAVMQQVAIYHPADREVLGAYGKAQAAAGQLEQALATISRAQTPDRPDWKL----------KSAEGAILDQLGRSAEARLRYREALDLKPNEPSVLSNLGMSYLL-------------TKDLRTAETYLKSAASQPRQNLALAVGLQGRFQEAETIAGQELTSEQAEANVAYLRSVLSQQGAWKQLAKAD.......................................................................................................................................................................................................................................................................................................................................................................................................................................... 269
28 -14.100[G] COG2956 Predicted N-acetylglucosaminyl transferase  ali follow..  49........................................................................................................................................................................................................................................................................................................................................NQQDKAVDLFLDMLKEDTGTVEAHLTLGNLFRSRGEVDRAIRIHQTLMESASLTYEQRLLAIQQLGRDRAEDMFNQLTDETDFRIGALQQLLQIYQATSEWQKAIDVAERLVKLGKDKQAHFYCELAL-QHMASDDLDRAMTLLKKGAAADKNSARVSIMMGRVFMAKGEYAKAVESLQRVISQ--DRELVSETLEMLQTCYQQLGKTAEWAEFLQRAVEENTGADAELMLADIIEARDGSEAAQVYITRQLQRHPTMRVFHKLMDYHLNEAEEGRAKESLMVLRDMVGEKVRSKPRYRCQKCGFTAYTLYWHCPSCRAWSTPIRGLDGL................... 389
29 -13.100[Z] KOG1840 Kinesin light chain  ali follow..  115...............SNVGVGGMRKKKVGGTKLQNGNEEPSSELLNQARNLVSSGDSTHKALELTHRAAKLF--WIMCLHVTAAVHCKLKEYNEAIPVLQRSVEIPVVEEGEEHALAKFAGLMQLGDTYAMVGQ-LESSISCYTEGLNVLGENDPRVGETCRYL---------AEALVQALRFDEAQQVCETALSI-RRLMGLICETKGDHENALEHLVLASMAMAANGQESEVAFVDTSIGDSYLSLSRFDEAICAYQKSLTALKTAKGENHPAVGSVYIRLADLYNRTGKVREAKSYCENALRIYESHNEIASGLTDISVICES-MNEVEQAITLLQKALKI-IMIAGIEAQMGVLYYMMGKYMESYNTFKSAISKLRATGKK---------------QSTFFGIALNQMGLACIQLDAIEEAVELFEEAKCILEQECGPYHPETLGLYSNLAGA-YDAIGRLDDAIKLLGHVVGANPVTEDEKRRLAQLLKEAGNVTGRKAKSLKTLI...................................................................................................................................................................... 639
30 -12.900[S] KOG3060 Uncharacterized conserved protein  ali follow..  11  78SLEFPGSLRVMKFKAMRYEALEQYDEADEVLDAIIAKDETNAAPRKRKIAILKARGRRLEAIKELNEYLKKFMSDQEAWHELCNMYLAEGEFGKAAFCMEEVLLHNPH--SHLIHQRLAEIRYTM-----GGVENMESARTYYSQALK---LNPHNLRALYGIYLCCNHLDNSRAVSSKRKELQKLSQWALEQL................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................... 261
31 -12.400[O] KOG0550 Molecular chaperone (DnaJ superfamily)  ali follow..  2....................TEVETTHMNAGTESQQEPAELAEKQKAIGNAFYKEKKYAEAIKAYTEAIDLGSDSALYYSNRAATYMQIGEFELALCDAKQSDRIKPDVPKTQSRIRQAYEGLSILNEAEVKNKQAGLALNALDRLQRRIDSTTQPPMSWMYL---------KAQVYIFQNDMDRAQKIAHDVLRLNPKNVEALVLRMYYSGENAKAITHFQEALKLDPDCTTAKTLFKQVRKLENTKNQGNDLFR-------------QGNYQDAYEKYSEALQIDPDNKET--------VAKLYMNRATVLLR------------LKRPEEALSDSDNALAIDSSYLKGLKVRAKAHEALEKWEEAVRDVQSAIEL------------------------------DASDANLRQELRRLQLELKKSKRKDHYKILGVSKEATDI-EIKKAYR----------KLALVYHPDKNAGNLEAEARFKEVGEAYTILSDPESRRR-FDSGVD...................................................................................................................................................................... 415
32 -12.100[S] KOG4648 Uncharacterized conserved protein, contains LRR repeats  ali follow..  1...........................MTSANKAIELQLQVKQNAEELQDFMRDLENWEKDIKQKDMELRRQNGVPEENLNFRKKKKGKAKESSKKTREENTKNRIKSYDYEAWAKLDVDRILDELDKDDSTHESLSQESESEEDGIHV-GNKYFKQGKYDEAIDCYTKGMDADPYNPRLKKFAVAESDCNLAVALNRSYTKAYSRRGAARFALQKLEEAKKDYERVLELEPNNFEATNELRKISQALASKENSYPKEADIVIKSTEGERKQIEAQQN--KQQAISEKDRGNGFFKEGKYERAIECYTRGIAADGANALLPANRAM-IQKYEEAEKDCTQAILLDGSYSKAFARRGTARTFLGKLNEAKQDFETVLLLEPGNKQAVTELSKIKKELIEKGHWDDVIDNPPHPGSTKPLKKVIIEETGNLIQTIDVPDSTTAAAPENNPINLAN---IAATGTTSKKNSSQDVLFPTSDTPRAKVLKIEEVSDTSSLQPQASLKQDVCQSYSEKMPIEIEQK--ANSFQLESDFRQLKSSPDMLYQYLKQIEPSLYPKLFQK.................................................................................................................... 584
33 -12.000[R] KOG4197 FOG: PPR repeat  ali follow..  48.........................DQFKQLHSQSITRGVAPNPTFKLFVFWCSRGHVSYAYKLFVKIPEP---DVVVWNNMIKGWSKVDCDGEGVRLYLNMLKEGVTPDSH-TFPFLLNGLKRDGGALACGKKLHCHVVKFMDMARGVFDRRCKEDVFSWNLMI---------SGYNRMKEYEESIELLVEMERNLVSPTSVTLLLVLSACSKVKDKDLCKRVHEYVSECDIAVRIFRSMKARDVISWTSIVKGYVERGNLKLARTYFDQMPVRDRISWTIMIDGYLRAGCFNESLEIFREMQ---SAGMIPDEFTMVSVLTACAHLGSLEIGEWIKTYIDKNIKNDVVVGNALIDMYFKCGCSEKAQKVFHDM---------------------------------DQRDKFTWTAMVVGLANNGQGQEAIKVFFQMQDMSIQPDDITYLGVLSA--NHSGMVDQARKFFAKMRSDHRIEPSLYGCMVDMLGRAGLVKEAYEILRK-----MPMNPNSIVWGALLGASRLHNDEPMAELAAKKILELEPDNGVYALLCNIYAGCKRWKDLREVRRKIVDVAIKKTPGFSLIEVNGFAHEFVAGDKSHLQSEEIYMKLEELAQESTFAAYLPDTSELLFEAGDA----------YSVANRFVRLSGHPGTKNWKS. 684
34 -11.900[R] KOG1125 TPR repeat-containing protein  ali follow..  147ISTDAEQRGQPLRVPETSSLDLDIQTQLEKWDD-VKFHGDRNTKGHPMAERKSSSSRTGSKELLWSSEHRSQP---ELSGGKSALNSESASELELVAPTQARLTKEHRW-------------------GSALLSRNHSLEEEFERAKAAVESDTEFWDKMQAEWEEMARRISENQEAQNQVTISASEKGYYFHTENPFKDWPGAFEE-----------------------------------------------------------------------------------GLKRLKEGDLPVTILFMEAAILQDPGDAEAWQFL---NENEQAAIVALQRCLELQPNNLKALMALAVSYTNTGHQQDACDALKNWIKQNPKRMSKSPVDSSVLEGVKELYLEAAHQNGDMIDPDLQTGLGVLFHLSGEFNRAIDAFNAALTVRPEDYSLWNRLGATLA---GDRSEEAVEAYTRALEIQPGFIRSRYNLGISCINLGAYREAVSNFLTALSLQRKSRNSGNIWAALRIALSLMDQPELFQAANLGDLDVLLRAFNLD........................................................................................................................ 625
35 -11.700[U] KOG3081 Vesicle coat complex COPI, epsilon subunit  ali follow..  21.........................................................................................................................................................................................................................................................................................................GAYQTAINNSEIANLSPENAVERDCLVFRSYIALGSYQLVISEIDESAATPLQAVKLLAMYLSTPQNKESTISSLKEWLADSTIGNNDTLRLIAGIIFMHEEDYNETLKHHAGGTMDLYALNVQIFIKMHRAEYAEKQLRVMQQIDEDHTLTQLASAWLNLAVGGSKIQEAYLIFEDFSEKYPMTCLILNGKAVCCMQMGNFDEAETLLLEALN--KDAKDPETLANLVVCSLHV----KSSSRHLSQLKLSHPEHILVKRVSSAEDNFE.............................................................................................................. 286
36 -11.300[R] COG4785 Lipoprotein NlpI, contains TPR repeats  ali follow..  51LASRALTDDERAQL--LYDSLGLRALARNDFSQALAIRPDMPEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAHLNRGIALYYGGRDKLAQDDLLAFYQDDPNDPFRSLWLYLAEQKLDEKQAKEVLKQHFEKSQTLMERLKADATDNTSLAEHLSETNFYL------GKYYLSLGDLDSATALFKLAVAN---------HNFVEHRYALLELSLLGQDQDDLAESDQQ........................................................................................................................................................................................................................................................................................................................................................................................................................................................... 294
37 -11.200[O] KOG0530 Protein farnesyltransferase, alpha subunit/protein geranylgeranyltransferase type I, alpha subunit  ali follow..  10  3.DSDIPSSTLYKDNVDWKDITPIYPSKEEEVAVKIAVTEDFTDAFAYFRAILIKNEKSDRVMALLEDCIRLNPANYTVW-----LTELGWDLKKEMRYLSDIIQESPK--NYQVWHH------RRFIVETIGESAVNDELHFCSEVIRD-----NKNYHAWQ-HRQWV--------------VRTFKVPLEKELTFALHMLLLDNRNNSAYNY-------RYFLMTLYDKTEDASQLDIEINLAKKFIENIPNNESAWNYLAGLLITNGVTSNSDVVSFVEDLYETTPEE-------------KRSPFLLAFIADMMLENIENQKSAEESAGRAKKLYKDLQSIDPVRVNYYRHQSLLAQTMLIKAQTKVTAK................................................................................................................................................................................................................................................................................................................ 328
38 -11.200[S] KOG4507 Uncharacterized conserved protein, contains TPR repeats  ali follow..  185LSAPLLPKEDPIFTYLSKRLGRSIDDIGHLIHEGLQKNTSSWVLYNMASFYWRIKNEPYQVVECAMRALHFSSRHNKALVNLANVLHRAHFSADAAVVVHAALDDSDFFTSYTLGNIYAM------------LGEYNHSVLCYDHALQA----------------GFEQAIKRKHAVLCQQKLEQKLEAQHRSLQRTLNELKEYQKQHDHYLRQQEILEKHKLIQEEQILHETQMAKEAQLGNHQICRLVNQQHSLHCRWDQPVRYHRGDIFENVDYVQFGEDSSTSSMMSVNFDVQSNQSDINDSVKSSPVAHSILWIWGRDSDAYRDKQHILWPKRADCTESYPRVPVGGELPTYFLPPENKGLRIHELSSDDYSTEEEAQTDFRKSHTLSYLVKELEVRMDLKAKMPDDHARKILLSRINNYTIPEEEIGSFLFHAINKPNAPIWLIL-NEAGLYWRAVGNSTFAIACLQRALNLAPLQYQ-LVNLANLLIHYGLHLDATKLLLQALAINSSEPLTFSLGNAYLA----LKNISGALEAFRQALKLTTKCPECENFLYNITSSVCSGNCHEKTLDNSRDAAEEEPSERGTEEDPVFSVENSGRDSDALRLESTVVEESNGSDEMENSDETKMSEEILALVDEFQQAWPLEGFGGALEMKGRRLDLQGIRVLKKG 870
39 -11.200[DO] KOG4322 Anaphase-promoting complex (APC), subunit 5  ali follow..  10  69.....................QGPDITLSKLYKLIEESCPQLANSVQIRIKLMAEGELKDMEQFFDDLSDSFSGTEGLFLRHMILAYSKLSFSQVFKLYTALQQYFQNGEK----EFFLSQQASLLKNDETKALTPASLQKELNNLLKF------------AHYLSYLNNLRVQ---DVFSSTHSLLHYFDRLIRYAALNLAALHCRFGHYQQAELALQEAIRIAQESNDHVCLQHCLSWLYVLGQKRSDSYVLLEHSVKKAVHFGLPYLASLGIQSLVQQRAFAGKTANKDALKDSDLLHWKHSLSELIDISIAQKTAIWRLYGRSTMALQQAQMLLSVQQNNTESFAVALCHLAELHAEQGCFAAASEVLKHLKERFPPNSQHIQFDRAMNDGKYHLADSLVTGITALNSIEGVYRKAVVLQAQNQMSEAHKLLQKLLVHCQKLKNTEMVISVLLSYWRSSSPTIALPMLLQALALSKEASETVLNLAFAQLILGIPEQALSLLHMAIEPILADGAILDKGRAMFLVPKKAEALEAAIENLNEAKNYFAKVDCKERIRD.................................................................................................................... 712
40 -10.600[H] COG3071 Uncharacterized enzyme of heme biosynthesis  ali follow..  58.........................................................................................................................................................................................VEWVLRRIFRTG--------ARTRGWFLGRKRTRARNQMKAALIKLAEGD--FLQVEKLLTRNADHAEQPMVNYLLAAEAAQQRGDEFRTN-----QYLERAAEVADGDQLPVNITRVRIQLAQ-------------------GHIHAARHGVDRLLDQAPRHPEVLRLAEQAYLRSGAYRSLLDILPAMSKTQIHTPEEVAALEQQAYIGIMNQCMKDQSRKVRNEIPLQVALAEHLIECDDHDVAQKIILDSLKHQYDE--RLALLIPRLK---AGNPEPLEKSLRQQIKQHGATPLLNSTLGQLMLKHGEWEKASEAFKAALAQRPDGY----DYAWLADALDKLHRPEDAAQARREGLLLTLRQNGE......................................................................................................................... 397
41 -10.200[TDBLU] COG5032 Phosphatidylinositol kinase and protein kinases of the PI-3 kinase family  ali follow..  1441VSLNFHDYSFDQQLLLHENSGT--DSALSCYEIIIQKDPENKKAKIGLLNSMLQSGHYESLVLSLDSFIINDN-IEASWRSILKKCLSKSNLESFEAKLGSIFYQYLRKDSFAELTERLQPLYVANTGAHSAYDCYDILSKLFSRIAETDGIVSDNLDIVLRRRLSQVAP-----YGKFKHQILSTHLVGYEKFENT-KKTAEIYLEIARISRKNGQFQRAFNAILKAMDLDKPLATIEHAQSELNFSLNNNMFDLVDEHEERPKNRKETLGNPLKGKVFLKLTKWLGKAG-----QLGLKDLETYYHKAVEIYSECENTHYYLGHHRVLVTRIINEFGRSLESMPKLLTLWLDFGAEELRLSKDDGEKYFREHIIS----------------------------------------------SRKKSLELMNSNVCRLSMK---PQYFFLVALSQMISRVCHPYKILEHIIANVVASYPGETLWQLMATIKSTSQKRSLRGKSILNVLHSRKLSMSSKVDIKAL----------SQSAILITEKLINL................................................................................................................................ 1989
42 -10.200[O] COG5536 Protein prenyltransferase, alpha subunit  ali follow..  12  35ICYTTGYEQGMAYFRAIMAKKEYSLRALNLTGFLIMNNPAHYTVWAYRFQILNHTPSIDNELEWLDEIAEDFQKNYQVWHHRQKILSLTKNYERELEFTKKMFEI--DSKNYHVW-SYRVWILQN--------NDYSQELKLTNELLEK-----IYNNSAWNHRFYVLFETSKVVSWSLEEELN----YLKDKILFAPDN-QSAWNYLCGVLDKSGPSKLDNLIANLRKNLPALHKPLLEFLAMYEPSSSEEIYQKL-----------ANEVDVPHAALWT............................................................................................................................................................................................................................................................................................................................................................................................................ 286
43 -10.200[S] KOG4814 Uncharacterized conserved protein  ali follow..  5..............................................EVVENLVTNDNSPNIPEAIDRLFSDI-----------NINRESMAEITDIQI---------EEMAVNLW----NWALTIGGGWLVNEEQKIRLHYV---ACKLLSMCEASFASEQSIQRLIMMNMRIGKEWLDAGNFLIADECFQAAVAS----LEQLYVKLIQRSSPEADLTMEKITVEAQGDFQRASMCVLQCKDMLMRLPQMTSSLHHLCYNFGVETQKNNKYE-ESSFWLSQSYDIGKMDKKSTGPEMLAKVLRLLATNYLD------------WDDTKYYDKALNAVNLANKEHLSSPGLFLKMKILLKGETSNEELLEAVMEILDFCLNIAKLLMDHERESVGFHFLTIIHERFKSSENIGKVLILHTDMLLQRKEELLAKEKIEEILTAESMNWLHNILWRQAASSFEVQNYTDALQWYYYSLRFYSTDTKLQRNMACCYLNLQQLDKAKEAVAEAERH-----DPRNVFTQFYIFK-IEGNSERALQAIITLENILTDEES.......................................................................................................................... 503
44 -10.000[A] KOG4206 Spliceosomal protein snRNP-U1A/U2B  ali follow..  3.............................................................................................................................................................................................................................................................................................................................................................................................................................................................................TADIPPNQSIYIQNLNERIKKEELKRSLYCLFSQFGRILDVVALKTPKLRGQ-----------AWVTFSEVTAAGHAVRQMQNFPFY--------KPMRLQYAKAKSDCLAKAEGTFVPKDKKRKQEEKVERKREDSQRPNTANGPSANGPSANNGVPAPSFQPSGQETMPPNNILFIQNLPHE-------TTSMMLQLLFEQYPGFKEIRMIDA. 192
45 -9.930[S] KOG2796 Uncharacterized conserved protein  ali follow..  17..............................................................................................................................................................................NADSVEQSFVGLKQLWRAAVDLCGRLLTAHGQGYGKSGLLTSHTTDSLQLWFVRLALLVKLGLFQNAEMEFEPFGNLDQPDLYYEY-----YPHVYPGRRGSMVPFSMRILHAELQQYLGNPQESLDRLHKVKTVCSKILANLEQ----AEDGGMSSVTQEGRQASIRLWRSRLGRVMYSMANCLLLMKDYVLAVDAYHSVIKY-----------------------------YPEQEPQLLSGIGRISLQIGDIKTAEKYFQDVEKVTQKLDGLQGKIMVLMNHLGQNNFAEAHRFFTEILRMDPRNAVANNNAAVCLLYLGKLKDSLRQLEAMVQQDPRHYLHESVLFNLTTMYESSRSMQKKQALLEAVAGKEGDSFNTQCLK..................................................................................................................... 377
46 -9.920[R] COG0457 FOG: TPR repeat  ali follow..  394LHSFSFMDVDDKGKCSMHQLMRKSLKDYQDQDLKKEVHNFMLDFYSNQLKDIDIKEITPEHEIALTEAFYH-EDLLKWFISVSDPFNRAAFWQLITPMYEEMLQILEAKHGPE--VTTLNILDVLYYKMGEYKKALQFSERALAIGETILGVQHPDVATSLDNL---------AGLYESMGNYKQALQLSERALEIYEKVLGP---QHRDVAITLDNL---AGLYESMGEYEKALIFYQRTIEIKEKVLGPQHSNFATSLDNLAVLYRQMGEYEKALQLSQRALEIYEKGPQHPDIATTLNNIALLYDS------------------MGDYQKTLPLYQRALEI-LGIATTLNNLAGFYRRVGDYEKALSLSQRSLEI----------------------DEKVLGSQHPDVARTLNSLALIYENIGDYEKALAFYQRSLDIREKVLGPQHPDVGRT-YEIMGDHEKALTLYQRTIEQHPDVATILNNLAGLHYRIGEYKKALPLYQRALD------VEKKLGQNQPNSVVIKNNYNRLLSKMSENEKK................................................................................................................................ 914
47 -9.770[S] COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats  ali follow..  11  46MLPETGADRHGHTLLMRLQDKLNPERHLRKLTEELAIAE-TNQNHYALANELARLGRYHEAVPHYQQALSGIF-EAAMMLSLAQAQFAIQEFAACQQTLEDVMRYNPDFQSADGHLLFARTLAAQEKY--------ADAESEFEVLISY--------------YPGPQARIYYAEMLAKMSRLREANEQYVAVVDTAKRSRPHYRKHHREWIKTANERLKQSVVQ.................................................................................................................................................................................................................................................................................................................................................................................................................................................................... 248
48 -9.490[K] COG5290 IkappaB kinase complex, IKAP component  ali follow..  689............NSTDFKPLPLVEEGVEDERVRAIERGS-------ILVSVIPSKSSVGNLETIYPRIMV---------LAEVRKNIMAKRYKEAFIVCRT-LDILHDYAPELFIENLEVFINQIGRVD-SCLSEDDVTKTKYKETLYSGISKSFGMEP--APLTEMQIYMKKKMFDPKTSKVNKICDAVLNVLLSNPEYKKKYLQTIITAYASQNPQNLSAALKLISENSEEKDSCVTYLCFLQDVNVVYKSALSLYDVSLALLVAQKSQMDPREYLPFLQELQDNEPLRRKFLIDDYLGN----YEKALEHLSEDGNVSEEVIDYVESHDLYKHGLALYRYDSEKQNVIYNIYAKHLSSNQMYTDAAVAYKEAMGAYQSAKR-------------------------------WREAMSIAVQPEEVESVAEELISSLTFEHR-----YVDAADIQLEYLDNVKEAVALYCKA--------YRYDIASLVAIKAKKDELLEEVVDPGLGEGFG-IIAELLADCKGQINSQLRRLRELRA------------KKEENPYAFYGQETEQADDVSVAPSETSTQESFFTRYTGKTGGTAKTGASRRTAKNKRREERKRARGKKGTIYEEEYLVQSVGRLIERLNQTKP------DAVRVVEGLCRRNMREQAHQIQK. 1294
49 -9.220[A] COG5104 Splicing factor  ali follow..  169.FVQQKDKRQKRSNDYQHENYDTYEAAERAFFKFLDSHNVNSWTWEQTVRELCDAKGYYVMKDPWHR--------KCAFDAYIQSDAEKNRVTKIRKEFIEMLKSSDKIHSYTLWRTVKNEFSS--------HPAFNATSSETEQ------------QQLFFEYKQ---KLLEDEKQLEKDRRKEALDDFCSLLRNMNFEPYTRWSVAQAKFDQDPRYTRNSNMKYLSKLDALV------FEDHVKHLEREYILDKQ-----------------------------KQKKEKHRIERKNRDAFRALLQDLRVQKKITLRTWKELYPIIKDDPRYLNLLGQ-----------------------SGSTPLDLFWDTIVDLENMYREKRNLVLDCLEVLQISVDDTSNIPEIIARLSEKLKDREESEAVTEDLIEEVVNRLRDKAIHKKAE--------KRADERRIRRKIDNLRSAIKYLKPPISADASYDEIRPLISILPEFAAL--------HSEEHRMAAFDKYIRRLREKRELEKQYQNRRGYYDVGKDESYLANSARPHSGYEDGRLEYSADLASKSNRNEINTMQDVQENSISHVTATQPAVKNIVDDAESSEEGEIR...................................................... 695
50 -9.100[S] COG1747 Uncharacterized N-terminal domain of the transcription elongation factor GreA  ali follow..  2.............................DTRDLTAYSAEKFKELDRIIAEAKRQSILDVLKGICDEHLAHSKIIALYISGIISLSKQLLDDSCLVTLLTIFGDN-----------QIVEHLCTRVLEYGESKLALRALGECYK-----------------------------------TSGNEQLYDVWERLVRIDYEEAEITRVLADKYEQEGNKEKATEFYKKALYRFIARRQ---------------------------------------------------------------NAAIKEVWTKLVALIPDDVEFFYREQKKISEKLGEGRGSV-----------LMQDVYVYYKENEDWTTCINILKHILEHDEKDVWARKEIIENFRC----------KYRGHSQLEEYLKISNISQSWRNVFEAINDFEKHISFDE-RIAKVCNDELLIDFAKRRAHTMLLKMAISALQTLGKE------HIWVLKSVLKRQDLAAKIR---------QDPEWALKVIIT---SFDNNCNLKKVKQELVPSLLSVGEWTSWSTKAR----KILKESTGFAANPSNIDFYTVRSCPVSLEEKLAVEFKAQKNFFARIDILNTFMDKADTDSDAFREMFDYFNTFLRAFSVVDGNVIAAYLVVTRVST............. 500

FFAS is supported by the NIH grant R01-GM087218-01
1 4 2 3 1 8   jobs submitted since Jan 1, 2011
Comments and questions to: webmaster

Selected papers from Godzik Lab
Ying Zhang, Ines Thiele, Dana Weekes, Zhanwen Li, Lukasz Jaroszewski, Krzysztof Ginalski, Ashley Deacon, John Wooley, Scott Lesley, Ian Wilson, Bernhard Palsson, Andrei Osterman, Adam Godzik. Three-Dimensional Structural View of the Central Metabolic Network of Thermotoga maritima. Science. 2009 Sep 18;325(5947):1544-9.

Mayya Sedova, Mallika Iyer, Zhanwen Li, Lukasz Jaroszewski, Kai W Post, Thomas Hrabe, Eduard Porta-Pardo, Adam Godzik Cancer3D 2.0:: interactive analysis of 3D patterns of cancer mutations in cancer subsets. Nucleic Acids Research, gky1098 2018; Published on November 8 2018.

Luz JG, Hassig CA, Pickle C, Godzik A., Meyer BJ, Wilson IA. XOL-1, primary determinant of sexual fate in C. elegans, is a GHMP kinase family member and a structural prototype for a class of developmental regulators. Genes Dev. 2003 Apr 15;17(8):977-90. Epub 2003 Apr 02.