## date Wed May 20 17:55:45 2009
## source-version: geneid_v1.3 -- geneid@imim.es
# Sequence chrUn_gl000223 - Length = 180455 bps
# Optimal Gene Structure. 4 genes. Score = 986.75
# Gene 1 (Reverse). 3 exons. 232 aa. Score = 62.85
Internal 136 567 54.15 - 2 1 -0.99 4.59 36.36 59.81 AA 88:232 chrUn_gl000223_1
Internal 22518 22613 1.67 - 2 1 4.94 2.42 1.52 4.88 AA 56: 88 chrUn_gl000223_1
First 22850 23015 7.02 - 0 1 4.98 -0.26 6.41 13.56 AA 1: 56 chrUn_gl000223_1
>chrUn_gl000223_1|SGP_v1.0_predicted_cDNA_1|694_NN
ATGCGTGCTGGTCATGCAAATATCTGTCCGTCATTTCAGGGGTCAGTGACATTCAGAGAT
GTGGCCATAGACTTCTCCCAGGAGGAGTGGAAATGGCTTCAGCCTGCTCAAAGAGATTTG
TACAGATGTGTAATGTTGGAGAACTATGGCCATCTGGTCTCACTGGGTCTTTCCATTTCT
AAGCCAGATGTGGTTTCCTTATTGGAGCAAGGGAAAGAACCCTGGCTGGGGAAAAGGGAA
GTGAAAAGAGATCTGTTTTCAGTTTCAGAGTCAAGTGGTGAGATCAAAGACTTTTCACCA
AAAAATGTCATTTATGATGACTCATCCCAGTATTTGATCATGGAAAGAATTCTAAGTCAA
GGCCCTGTGTATTCCAGTTTTAAAGGAGGCTGGAAATGCAAGGATCATACTGAGATGCTG
CAAGAAAATCAGGGATGTATTAGGAAAGTAACAGTCTCTCATCAAGAAGCCCTGGCTCAA
CATATGAATATCAGTACTGTGGAGAGGCCCTATGGATGCCATGAATGTGGAAAAACTTTT
GGTCGACGCTTTTCCCTGGTGTTACACCAGAGGACTCATACTGGAGAGAAACCATATGCA
TGTAAGGAATGTGGCAAAACCTTTAGCCAGATTTCAAACCTTGTGAAACACCAAATGATA
CATACTGGAAAGAAACCCCATGAGTGTAAGGACT
>chrUn_gl000223_1|SGP_v1.0_predicted_protein_1|232_AA
MRAGHANICPSFQGSVTFRDVAIDFSQEEWKWLQPAQRDLYRCVMLENYGHLVSLGLSIS
KPDVVSLLEQGKEPWLGKREVKRDLFSVSESSGEIKDFSPKNVIYDDSSQYLIMERILSQ
GPVYSSFKGGWKCKDHTEMLQENQGCIRKVTVSHQEALAQHMNISTVERPYGCHECGKTF
GRRFSLVLHQRTHTGEKPYACKECGKTFSQISNLVKHQMIHTGKKPHECKDt
# Gene 2 (Reverse). 4 exons. 739 aa. Score = 447.39
Terminal 47147 49125 428.83 - 2 0 0.00 4.36 224.23 670.58 AA 80:739 chrUn_gl000223_2
Internal 57231 57326 16.75 - 2 1 4.94 4.91 9.62 6.00 AA 48: 80 chrUn_gl000223_2
Internal 57998 58124 5.08 - 0 1 1.49 1.44 8.69 6.45 AA 6: 48 chrUn_gl000223_2
First 64597 64611 -3.27 - 0 0 4.39 0.75 1.06 -5.27 AA 1: 5 chrUn_gl000223_2
>chrUn_gl000223_2|SGP_v1.0_predicted_cDNA_2|2217_NN
ATGACCATGTTACAGGAGTCATTCTCATTTGACGATTTATCTGTGGACTTCACCCAAAAG
GAGTGGCAGCTACTGGATCCCTCTCAGAAGAATTTATACAAGGATGTGATGTTGGAGAAC
TATAGCAGCCTAGTGTCACTGGGGTATGAAGTTATGAAACCAGATGTCATCTTCAAATTG
GAGCAAGGAGAAGAGCCGTGGGTAGGAGATGGAGAAATTCCAAGTTCAGATTCTCCAGAA
GTCTGGAAAGTAGATGGTAACATGATGTGGCACCAGGATAACCAAGACAAGCTTAAAATT
ATAAAAAGAGGTCATGAATGTGATGCATTTGGAAAAAATTTCAATCTGAACATGAACTTT
GTTCCTTTAAGGAAATCAAACAGTGAAGGTGACTTAGATGGATTGATTTTAAAACATCAT
TTAGATTTGCTTATTCCAAAAGGAGATTATGGAAAAGCAGAATCAGATGACTTTAATGTG
TTTGATAATTTTTTTCTCCATTCCAAGCCTGAGGATACTGATACCTGGTTAAAATACTAT
GACTGTGATAAATATAAAGAGAGCTATAAAAAGTCACAGATTATCATATATCATAGAAAT
CGTTTAGGGGAGAAACTCTATGAATGCAGTGAATGTAGGAAGCGCTTCAGTAAGAAACCA
AGTCTCATTAAACATCAGAGCAGACATATAAGAGACATAGCCTTTGGCTGTGGTAATTGT
GGCAAAACCTTTCCCCAGAAGTCTCAGTTTATTACACATCACAGAACTCATACAGGAGAA
AAACCTTATAATTGTAGCCAGTGTGGGAAAGCCTTCTCCCAAAAGTCACAGCTCACATCC
CATCAGCGGACACATACAGGAGAGAAACCTTATGAGTGTGGTGAATGTGGGAAAGCCTTC
TCCCGGAAGTCACATCTCATATCGCATTGGAGAACACACACAGGAGAGAAACCCTATGGA
TGCAATGAATGTGGGAGGGCCTTTAGTGAAAAGTCCAATCTCATTAACCATCAGAGAATT
CATACCGGTGAGAAGCCTTTTGAATGCAGGGAATGTGGGAAAGCCTTCAGCAGGAAGTCA
CAACTCGTTACACATCACAGAACTCACACAGGAACAAAACCCTTTGGATGTAGTGATTGT
AGAAAAGCATTCTTTGAGAAGTCAGAGCTTATTAGACATCAGACAATTCATACTGGAGAG
AAACCCTATGAATGCAGCGAGTGTAGGAAAGCATTTAGAGAGAGGTCGAGTCTCATTAAT
CATCAGAGAACACATACAGGAGAGAAACCTCATGGATGCATTCAGTGTGGGAAGGCCTTC
TCCCAGAAGTCACATCTCATATCACATCAGATGACACACACAGGAGAAAAACCCTTTATA
TGCAGTAAATGTGGGAAAGCCTTCAGCAGGAAATCACAGCTCGTTAGACATCAGAGAACT
CATACGGGAGAAAAACCGTATGAATGCAGTGAGTGTGGGAAAGCTTTCAGTGAAAAATTA
AGTCTCACTAATCATCAAAGAATTCATACAGGAGAAAAACCATATGTATGCAGTGAATGT
GGGAAAGCCTTTTGTCAGAAGTCACATCTCATATCACATCAGAGGACACATACAGGGGAG
AAACCCTATGAATGCAGTGAATGTGGGAAGGCCTTTGGTGAGAAGTCAAGTCTTGCAACT
CATCAGAGAACTCATACTGGAGAAAAACCGTATGAATGCAGGGACTGTGAAAAAGCTTTC
TCCCAGAAATCACAGCTAAATACCCATCAGAGAATTCACACTGGAGAGAAACCCTATGAA
TGCAGTCTTTGTAGGAAAGCTTTTTTTGAGAAGTCGGAGCTAATTAGACATCTGAGAACT
CATACAGGAGAAAAACCTTATGAATGCAATGAATGTAGAAAAGCCTTCAGGGAGAAGTCA
AGTCTCATCAATCATCAGAGAATACATACAGGAGAGAAGCCTTTTGAATGCAGTGAGTGT
GGCAAAGCTTTCTCTCGGAAGTCACACCTTATACCACATCAAAGGACACATACGGGTGAG
AAACCCTATGGATGCAGTGAATGTAGGAAGGCCTTCTCTCAGAAGTCACAGCTGGTTAAT
CATCAGAGAATTCATACAGGAGAGAAGCCTTATCGATGCATTGAATGTGGGAAAGCTTTC
TCACAGAAGTCACAGCTCATCAATCATCAGAGAACTCATACAGTAAAAAAATCCTAG
>chrUn_gl000223_2|SGP_v1.0_predicted_protein_2|739_AA
MTMLQESFSFDDLSVDFTQKEWQLLDPSQKNLYKDVMLENYSSLVSLGYEVMKPDVIFKL
EQGEEPWVGDGEIPSSDSPEVWKVDGNMMWHQDNQDKLKIIKRGHECDAFGKNFNLNMNF
VPLRKSNSEGDLDGLILKHHLDLLIPKGDYGKAESDDFNVFDNFFLHSKPEDTDTWLKYY
DCDKYKESYKKSQIIIYHRNRLGEKLYECSECRKRFSKKPSLIKHQSRHIRDIAFGCGNC
GKTFPQKSQFITHHRTHTGEKPYNCSQCGKAFSQKSQLTSHQRTHTGEKPYECGECGKAF
SRKSHLISHWRTHTGEKPYGCNECGRAFSEKSNLINHQRIHTGEKPFECRECGKAFSRKS
QLVTHHRTHTGTKPFGCSDCRKAFFEKSELIRHQTIHTGEKPYECSECRKAFRERSSLIN
HQRTHTGEKPHGCIQCGKAFSQKSHLISHQMTHTGEKPFICSKCGKAFSRKSQLVRHQRT
HTGEKPYECSECGKAFSEKLSLTNHQRIHTGEKPYVCSECGKAFCQKSHLISHQRTHTGE
KPYECSECGKAFGEKSSLATHQRTHTGEKPYECRDCEKAFSQKSQLNTHQRIHTGEKPYE
CSLCRKAFFEKSELIRHLRTHTGEKPYECNECRKAFREKSSLINHQRIHTGEKPFECSEC
GKAFSRKSHLIPHQRTHTGEKPYGCSECRKAFSQKSQLVNHQRIHTGEKPYRCIECGKAF
SQKSQLINHQRTHTVKKS*
# Gene 3 (Reverse). 4 exons. 534 aa. Score = 249.14
Terminal 94597 95942 232.72 - 2 0 0.00 6.30 141.94 216.55 AA 86:534 chrUn_gl000223_3
Internal 98546 98641 5.06 - 2 1 2.75 3.76 5.12 3.55 AA 54: 86 chrUn_gl000223_3
Internal 98910 99036 10.06 - 0 1 4.98 1.61 7.90 9.18 AA 12: 54 chrUn_gl000223_3
First 119205 119237 1.30 - 0 0 4.98 2.36 2.09 -11.54 AA 1: 11 chrUn_gl000223_3
>chrUn_gl000223_3|SGP_v1.0_predicted_cDNA_3|1602_NN
ATGGCCACCAGTTTCCGGACAGCTTCGTGCTGGGGATTATTGTCATTCAAGGATATATCT
ATGGAGTTCACCTGGGATGAATGGCAGCTACTGGATTCTACACAGAAGTACCTGTACAGA
GATGTGATATTGGAAAACTATCATAACCTGATATCAGTGGGGTATCATGGTACCAAGCCT
GACTTAATCTTCAAGTTGGAACAAGGAGAAGATCCATGGATAATAAATGCCAAAATTTCC
AGGCAGAGCTGTCCAGATGGCTGGGAAGAATGGTACCAGAACAATCAAGATGAGCTTGAG
AGTATTGAAAGAAGCTATGCTTGTAGTGTGTTGGGAAGACTTAATCTGAGCAAAACCCAT
GATTCTTCAAGACAGAGACTCTATAACACACGTGGAAAAAGTTTGACACAAAACTCAGCT
CCAAGCAGAAGTTATTTAAGAAAGAATCCTGATAAGTTTCATGGTTATGAAGAACCATAT
TTTCTTAAGCATCAAAGAGCTCATAGCATAGAAAAAAACTGTGTGTGTAGTGAATGTGGG
AAAGCTTTTCGTTGTAAGTCACAGCTCATTGTACATCTCAGAATTCATACAGGAGAGAGA
CCTTATGAATGCAGTAAATGTGAAAGAGCCTTCAGTGCCAAGTCAAACCTTAATGCTCAT
CAGAGAGTTCATACAGGAGAAAAACCCTACTCATGTAGTGAGTGCGAGAAGGTCTTCTCT
TTCAGGTCACAGCTCATTGTCCATCAGGAAATTCACACAGGAGGGAAACCCTATGGCTGC
AGTGAATGTGGGAAAGCCTACAGTTGGAAATCACAGCTTCTTTTACACCAGAGAAGTCAC
ACAGGAGTGAAACCGTATGAATGCAGCGAATGTGGGAAAGCCTTTAGTTTGAAGTCTCCA
TTCGTTGTACACCAGAGAACTCATACAGGAGTGAAACCCCATAAATGCAGTGAATGTGGG
AAAGCCTTTAGGAGTAAGTCCTATCTCCTTGTTCACATCCGAATGCATACAGGAGAAAAA
CCCTATCAATGCAGTGATTGTGGGAAAGCCTTCAATATGAAGACACAACTCATTGTACAT
CAGGGAGTTCACACAGGAAATAATCCTTATCAATGCGGTGAATGTGGGAAAGCCTTTGGT
AGGAAGGAACAGCTCACTGCACATCTGAGAGCTCATGCAGGAGAGAAGCCCTATGGATGC
AGTGAATGTGGGAAGGCTTTCAGCAGCAAGTCATACCTTGTTATACATAGGAGAACACAC
ACCGGAGAGAGACCCTATGAATGTAGTTTGTGTGAGAGAGCCTTTTGTGGAAAATCACAG
CTGATTATACATCAGAGAACTCATTCAACTGAGAAGCCCTATGAATGCAATGAATGTGAA
AAAGCCTACCCTAGGAAGGCATCACTTCAGATACACCAGAAAACTCATTCGGGAGAGAAA
CCTTTTAAATGCAGTGAATGTGGAAAAGCCTTCACTCAGAAGTCATCTCTCAGTGAACAT
CAGAGAGTTCACACCGGAGAGAAACCATGGAAATGCTCTGAATGTGGGAAATCCTTCTGT
TGGAATTCAGGGCTTCGTATACATCGGAAGACTCATAAATGA
>chrUn_gl000223_3|SGP_v1.0_predicted_protein_3|534_AA
MATSFRTASCWGLLSFKDISMEFTWDEWQLLDSTQKYLYRDVILENYHNLISVGYHGTKP
DLIFKLEQGEDPWIINAKISRQSCPDGWEEWYQNNQDELESIERSYACSVLGRLNLSKTH
DSSRQRLYNTRGKSLTQNSAPSRSYLRKNPDKFHGYEEPYFLKHQRAHSIEKNCVCSECG
KAFRCKSQLIVHLRIHTGERPYECSKCERAFSAKSNLNAHQRVHTGEKPYSCSECEKVFS
FRSQLIVHQEIHTGGKPYGCSECGKAYSWKSQLLLHQRSHTGVKPYECSECGKAFSLKSP
FVVHQRTHTGVKPHKCSECGKAFRSKSYLLVHIRMHTGEKPYQCSDCGKAFNMKTQLIVH
QGVHTGNNPYQCGECGKAFGRKEQLTAHLRAHAGEKPYGCSECGKAFSSKSYLVIHRRTH
TGERPYECSLCERAFCGKSQLIIHQRTHSTEKPYECNECEKAYPRKASLQIHQKTHSGEK
PFKCSECGKAFTQKSSLSEHQRVHTGEKPWKCSECGKSFCWNSGLRIHRKTHK*
# Gene 4 (Forward). 2 exons. 481 aa. Score = 227.37
First 172980 173142 8.57 + 0 1 -1.25 3.72 9.02 20.89 AA 1: 55 chrUn_gl000223_4
Internal 179013 180293 218.80 + 2 1 5.49 3.89 115.01 312.93 AA 55:481 chrUn_gl000223_4
>chrUn_gl000223_4|SGP_v1.0_predicted_cDNA_4|1444_NN
ATGAGAACCACTTCAGGTTTAACAGGGATTATGCTTTTCCAGATATCATTTGAGGATGTG
GCTGTGGATTTCACGCTGGAGGAATGGCAGCTACTTAATCCTACTCAGAAGAACTTGTAC
AGAGATGTGATGTTGGAGAACTATAGCAATCTGGTTTTCTTGGAAGTCTGGCTAGATAAT
CCCAAAATGTGGCTCCGAGATAATCAAGACAACCTTAAAAGTATGGAGAGAGGCCATAAA
TATGATGTTTTTGGAAAAATATTTAATTCAAGCATAAACATTGTTCATGTAGGACTGCGA
TCCCATAAATGTGGCACAGGAGAAAAAAGTTTGAAATGTCCTTTTGATTTGCTTATTCCA
AAAAATAATTGTGAAAGAAAGAAAATTGATGAACTCAATAAGAAATTATTGTTCTGTATC
AAACCTGGCAGAACCCATGGTGGGATAAAATACTGTGATTGCAGTACATGTAGAAAATCC
AGCAACGAAGAGCCATGGCTCACTGCTAATCACATAACACACACAGGAGTCTATTTATGC
ATGGAATGTGGCAGATTTTTTAACAAGAAGTCACAACTTGTTATACACCAGAGAACTCAT
ACAGGAGAGAAGCCCTATCAATGCAGTGAGTGTGGAAAAGCCTTTTCACAGAAGTCACTG
CTCACGGTTCATCAAAGAACTCACTCAGGAGAAAAACCGCATGGGTGCAGCGAATGTCAG
AAAGCTTTTAGTAGGAAGTCACTCCTCATTTTACATCAGAGAATTCATACTGGAGAGAAG
CCGTATGGATGCAGTGAATGTGGAAAAGCCTTCAGTAGGAAGTCGCAGCTTAAAAGACAT
CAGATAACGCACACAATAGAGAAACCCTACAGTTGCAGTGAGTGTGGGAAAGCATTCTCC
CAGAAATTAAAACTCATCACACATCAGAGAGCGCACACAGGAGAGAAACCCTATCCATGT
AGTCACTGTGGAAAAGCCTTCTTTTGGAAGTCGCAGCTGATTACTCATCAGAGGACCCAC
ACAGGGAAGAAACCTTACGGATGTGGTGAGTGTCAAAAAGCCTTCAGCAGGAACTCACTT
CTCATTAGGCATCAGAGGATTCATACAGGAGAGAAGCCCTACGAATGCAACGAATGTGGT
GAAGCCTTCATCAGAAAACCACAGCTGATTAAACATCAGATAACTCACACAGGAGAGAAG
AACTATCGATGCAGTGATTGTGAGGAGGCCTTCTTTAAGAAGTCAGAGTTAATAAGACAT
CAAAAAATTCACTTAGGAGAGAAACCATATGGATGCATTCAATGTGGGAAAACCTTCTTT
GGGAAGTCCCAGCTCCTAACGCATCACAGAACACACACTGGGGAGAAGCCTTATGAATGC
AGTGAGTGTGGGAAGGCCTTCACCCAGAAGTCAAGCCTGATATCACATCAGAGAACACAT
ACAG
>chrUn_gl000223_4|SGP_v1.0_predicted_protein_4|481_AA
MRTTSGLTGIMLFQISFEDVAVDFTLEEWQLLNPTQKNLYRDVMLENYSNLVFLEVWLDN
PKMWLRDNQDNLKSMERGHKYDVFGKIFNSSINIVHVGLRSHKCGTGEKSLKCPFDLLIP
KNNCERKKIDELNKKLLFCIKPGRTHGGIKYCDCSTCRKSSNEEPWLTANHITHTGVYLC
MECGRFFNKKSQLVIHQRTHTGEKPYQCSECGKAFSQKSLLTVHQRTHSGEKPHGCSECQ
KAFSRKSLLILHQRIHTGEKPYGCSECGKAFSRKSQLKRHQITHTIEKPYSCSECGKAFS
QKLKLITHQRAHTGEKPYPCSHCGKAFFWKSQLITHQRTHTGKKPYGCGECQKAFSRNSL
LIRHQRIHTGEKPYECNECGEAFIRKPQLIKHQITHTGEKNYRCSDCEEAFFKKSELIRH
QKIHLGEKPYGCIQCGKTFFGKSQLLTHHRTHTGEKPYECSECGKAFTQKSSLISHQRTH
Tag