Position
Chromosome | chr17 |
Start | 7568900 |
End | 7608908 |
Strand | + |
view in Gbrowse: chr17:7568900..7608908
Most similar sequence in NCBI nr database
Accession | Description | E-value | Score |
---|---|---|---|
XP_028043810.1 | ATP-binding cassette sub-family A member 3-like isoform X2 | 0.0 | 3398 |
ORF sequence (5106 bp)
ATGATGGCCTTAGAAAAATTACAACTACTTATTTGGAAAAATGTCCTTCTACAATACAGACGCAAATGGCAGACATTATTTGAAATAGCAACTCCTATATTTTTTTGCTTTTTTCTAATACTTATGAGATATTTAGTGCCGCCCAAAGCAATACCGGCTAAAACTTACACATCCTTTGATGTCACCTATTTCAATAAGACAAGATTGCTTAGGGGAAATCTGTCATCAGCTGACATGGTATTGGCATACTCACCAATAAATCAATTGACTGAAAAAGTTGCAATCAATGCTATGGCTGAACTTGCTCGAGATTCAATTGATCCTGATATATTTTTTTTTATATCACTTGCAATTTCATACACTCCTAAAGGATATAAAAATGCCAAAGATATGGAAGCAGCACTAATACAGCCCAATGCTATGAATAGCATTTTGCTTGGTATACAATTTGATGATGAAATCGCTAATGCTACGTCTTGGCCAGATGATATAAAAATAACTTTTAGGTTTCCTGCTGTTATGAGATCGACAACGTCTGATCATCAAATGCAATTGAGCTGGCAAACAAACTTACTATTTCCATTGTTTCCGAATTCTGGTCCACGAGAACCAGATGATCAATATGGAGGAACTTCGCCAGGTTACTTCTCGGAACTATTCTTGTTCATGCAGCACGCAATTTCCAAAAGCATCGTAAAAGAGAAGACCGGGAAAAACATTGATACCAAAATATATCTACAGCGTTTACCTCAACTTGAATCTAGGGTGGATCAGCTACTGTTCATTTTACAAAGATTTGTATCCATAGCTATAATGCTGGGCTTCGTCTATACATTCGTGAATACAGTCAGGGCTGTCACAACTGAGAAAGAACTGCAGCTAAAGGAAACAATGGCAATTATGGGTCTACCGTCTTGGCTGCACTGGTTAGCGTGGTTCATCAAACAATTTTCATTCTTACTTGTGACTATTGTCATAATGGTTATATTGTTTAAGATTCCGTTATCCGAAACTGAAACAGGAGTGTCGTACTCCGTGTTCACCTACACGCCGTGGTCGGTCCTTTTATTCTTTTTGATTCTATTCGTCATTGCATCATTGACATTCTCTTTTATGATAAGCGTTTTCTTCACTAGAGCAAACACCGCCGCGTCTTTTATGGCTCTCATATGGTTCGCAACGTTCGCAGTCTTTATGTTCACCCAAATGATTAACGAGACCATGTCTGCCCCAGTTAAAATAACACTATGTTTGTTATCAAACACTGCCATTGGATACGCATTCCAGATGATTATCATTGCCGAAGGAACGATGCAAGGTTTACAATGGGCCAATTTCTTTGAGCCGATATCGTACAGTGAAAACTTTCAACCGGCTCATGTAGCCTTCATGCTAATATTGGACTCCGTGATGTACATGGCGATCGCCGTGTATGTGGAGAACATACGACCCGGAATGTACGGAGTGCCGTTGCCTTGGTACTATCCTTTCACTGTTAGTTATTGGTGGCCGAACAGAATATCTGACACAAATCGAGAGAAAACATCTGACGATCTTGAATATAATGATGCATTATTGTCGGTGGTACACGACGAGGAACCCAAAGGCGTTCCCATTGGCGTAAACATTCAAAATCTCACAAAAAGATATAAGGGACGGGGGAAAGCTGTCGATAATTTAAATCTTAGACTTTATGAAAACGAGATTACAGTACTTCTGGGTCACAATGGAGCGGGGAAAACGACAACGATTTCTATGTTAACAGGAATGGTCCCTCCAACATCAGGATCGGTAACGATCAATGGATATGATATCGTGACGGAAACGGAAAAAGCTAGACGTTCTCTTGGAATATGTCCTCAACATAATGTACTGTTCCCTGACCTGACCGTAGCTGAACATTTAATTTTTTATTCAAAGCTAAAGGGGATTCCCGAATCAGAAATCAATGAAGAAATAGATCATTTCGTCAAACTTCTCGAATTGGATGACAAGAGGCACGCGGCGGCATCGAGTCTGTCGGGCGGGCAGAAGCGGCGGCTGTCGGCGGGCTGCGCGCTGTGCGGGCGCTCGCGCGTGGTGCTGCTGGACGAGCCCACGTCGGGCCTGGACCCGGCGGCGCGGCGCGCGCTGTGGGACCTGCTGCAGCGCGAGAAGCGCGGCCGCACCGTGCTGCTGACCACGCACTTCATGGACGAGGCGGACGTGCTGGCCGACCGCGTGGCCGTGCTCGCCGCCGGCCGCCTGGCCTGTCTGGGCTCCCCCTACTTCCTCAAGCGCCACTACGGACTCGGCTACAAGCTCGCGCTCGTCAAGGACGCCGCCTGCCAAGTCGATCTTGTCACGGAATTCTTCAAAACATATGTCCCTAATCTCAAGCAAAATTCAAATATCGGCTCAGAATTAACTTACATTTTACCTAGTGAGAGTGTAAGTAAATTTCCCGAAATGCTCAAAAAACTTGAAGAAAAGAAAGAGTCTTTATGTATATCAAGCTACGGGTTATCAGTGACTAGCCTTGAGGAAGTTTTTATGAAAGCTGGAATCGAAGATAACAATGTTGAAATAAAAGAAACTAAAGGAGATATCGAAATGATTGATATGAATGGGGACATGTTGAATAAATATTTTATTTCAGAGGACAATGAGCCTTTACATAAAACGCAAGGGTTTCATTTACTGAAGAATCACATAAAAGCTATGTTCTTGAAACTGATGTACAACACATTGAGAAATAAGGCACTCGCTGCGATACAAATAATATGGCCAATAATAAACATTATTTTATCGATGATAGTTTCACTATCTTGGAAATTCTTGAATGTATTGCCCCCGCTTGAACTAAGCCTGGAGAGTGGATTTAAAGGAACCGAGACATTAGTATCGCAAGGTAACGATTTGAGAGACGGCAGTACTGAAGCTAATGTCATGATGGCATACAAAGATTATTTCAAACGATCAACGTACCCCGGCTTGAAGTTATTGGATGTCGGAACTTCTAACTTAAAAAACGTCTATTTGAAACTTATCGCAGAAGATCAATCTCGAGTACGATATGAAGATCTAGTGGGAGCTACATTTCGTAATAACAGCATAACAGCCTGGTTCAGTAACTACGGTCTTCATGATTCTGCTATCTCGCTTTCACTAGTCGAGAACGCAATAATTCGTTCTCTGTCACCCAATACAACTTTGACATTTGTCAACCATCCGCTACCATATTCAGTTGAAGGAATGGTTCAAGTAATGTCTACCGGAACAAACACCGCCTTCATGTTTTCTTTTAGCCTGGGATTTTGTATAGCGGTTATAAGCTCTTTTCTGGTTCTCTTTGTAATTAAAGAGCGCATCAGCGGTGCGAAACTTCTCCAAAGAGTATCAGGAGTGCGACCGGTAGTAATGTGGAGTACTGCCCTCATTTGGGATTGGATTTGGTTGTTTCTGAACCACATTTGCATTATAGTCACTATTGCGTGCTTCCAAGAAATGGGAATGTCGACGCCTGCTGAACTTGGTCGAATTTTATTAGTTCTGATGGTGTTTTCATTGGCAATTATACCGTTGCACTACCTCGCATCGTTCTGTTTCGAAGAAGCCGCCACCGGTTTCAGTAAAATGGTGTTTGTAAATATATTTTGTGGTTCAATGTTGTTCCTTGTCACTGAAGTATTACGGATGCCTTTCATAAATGCTGCTGCTTACGCAGAAATACTTGAGTATCCATTTTCATTGTTACCAATCTACTGTGTCAGCAAGAGTGTCAGGGAAATGGTGACATCTTCAATAAAGATTAAAGCCTGCGACAGCTTATGCAACCAATTAAATTATAAAAATTGCACACGACTAACTATATGCAATGAACTAGACATATCCATGTGTTGTATTGAGGATAATCCATTTTTAGGGTGGAAGGAACCGGGTATTGCAAGATATCTATTTACTATGATAGTTGTAGCGACTGTGTCATTTGCAATATTGCTAGCCAAGGAATACGAACTTTGGAACAAGACTATGATGTTATCTGGTACAAAACCAAAATCTAATGAGAGTAAAAAGGTTGAAGTAAATGCAGAAGTTGAAGATGATGATGTTGTGGAGGAAAAACAGCGTGTTCTAGCAATGACAAGTAGTGAGGTCACCGCACACAGCCTCGTGTGTCGCGAGCTGAGCAAGCGCTACCGGCGCCTCGTAGCCGTCGACCGACTCACGTTCGCGGTGCGCGGCGGAGAGTGCTTCGGCCTGCTCGGAGTCAACGGCGCCGGCAAGACCAGCACCTTCCGCATGCTGACCGGCGACGCACGCGTGTCGGACGGCGACGCGCTCGTGCACGGACACTCCGTGCGAGCACACGTGCAGGACGTGCACCGCCTCATTGGTTACTGCCCCCAATTCGATGCACTGTTTGACAATTTAACCGCAAGGGAGATATTGAAGATTTTCTGTTTGCTGCGTGGCATTCCTACGTCAATAGGCGAAACTCATGCCATTCATCTTGCTAAACAATTGGGATTCATAAAGCACTATGACAAGAAGGTTCGGGAATGTAGTGGTGGAACAAAACGTAAAATCAGTACAGCGGTCGCGTTGCTCGGTGATTACCCAGTTATATTCCTGGATGAGCCTACGACAGGCATGGATCCGGCGTCGAAGCGGCTCGTGTGGCGCGGCATCAGCAGCGCGGTGGGCGGCGGGCGCAGCGTGGTGCTGACGTCACACAGCATGGAGGAGTGCGAGGCTCTCTGCTCCAAGCTCACCGTCATGGTCAACGGCAGGCTCTGCTGTCTCGGCTCGCTGCAACATCTCAAGAGCAAATTCTCACAGGGATACACAATAATCGTGAAATGTAAATCGGGTCCAAATCGAGACGCAGCAGTGCTAGACGTCCACAACTATATGACTACAAATTTTGTTGGTGCTAACCTCATCGAGACGTACCTGGGCATGAGCACGTACCACGTGTCGTCGGCGGGGCTGCCGTGGTGGCGCGTGTTCAGTGCGCTCGAACTAGCGCGGGACTCGCTGCCGCTTGATGACTACTCGGTCGCGCAGACAACACTCGAGCAAGTTTTCCTCGCATTTACAAAGCTCCAACGTCCTATAAATTAA
Protein sequence (1701 aa)
MMALEKLQLLIWKNVLLQYRRKWQTLFEIATPIFFCFFLILMRYLVPPKAIPAKTYTSFDVTYFNKTRLLRGNLSSADMVLAYSPINQLTEKVAINAMAELARDSIDPDIFFFISLAISYTPKGYKNAKDMEAALIQPNAMNSILLGIQFDDEIANATSWPDDIKITFRFPAVMRSTTSDHQMQLSWQTNLLFPLFPNSGPREPDDQYGGTSPGYFSELFLFMQHAISKSIVKEKTGKNIDTKIYLQRLPQLESRVDQLLFILQRFVSIAIMLGFVYTFVNTVRAVTTEKELQLKETMAIMGLPSWLHWLAWFIKQFSFLLVTIVIMVILFKIPLSETETGVSYSVFTYTPWSVLLFFLILFVIASLTFSFMISVFFTRANTAASFMALIWFATFAVFMFTQMINETMSAPVKITLCLLSNTAIGYAFQMIIIAEGTMQGLQWANFFEPISYSENFQPAHVAFMLILDSVMYMAIAVYVENIRPGMYGVPLPWYYPFTVSYWWPNRISDTNREKTSDDLEYNDALLSVVHDEEPKGVPIGVNIQNLTKRYKGRGKAVDNLNLRLYENEITVLLGHNGAGKTTTISMLTGMVPPTSGSVTINGYDIVTETEKARRSLGICPQHNVLFPDLTVAEHLIFYSKLKGIPESEINEEIDHFVKLLELDDKRHAAASSLSGGQKRRLSAGCALCGRSRVVLLDEPTSGLDPAARRALWDLLQREKRGRTVLLTTHFMDEADVLADRVAVLAAGRLACLGSPYFLKRHYGLGYKLALVKDAACQVDLVTEFFKTYVPNLKQNSNIGSELTYILPSESVSKFPEMLKKLEEKKESLCISSYGLSVTSLEEVFMKAGIEDNNVEIKETKGDIEMIDMNGDMLNKYFISEDNEPLHKTQGFHLLKNHIKAMFLKLMYNTLRNKALAAIQIIWPIINIILSMIVSLSWKFLNVLPPLELSLESGFKGTETLVSQGNDLRDGSTEANVMMAYKDYFKRSTYPGLKLLDVGTSNLKNVYLKLIAEDQSRVRYEDLVGATFRNNSITAWFSNYGLHDSAISLSLVENAIIRSLSPNTTLTFVNHPLPYSVEGMVQVMSTGTNTAFMFSFSLGFCIAVISSFLVLFVIKERISGAKLLQRVSGVRPVVMWSTALIWDWIWLFLNHICIIVTIACFQEMGMSTPAELGRILLVLMVFSLAIIPLHYLASFCFEEAATGFSKMVFVNIFCGSMLFLVTEVLRMPFINAAAYAEILEYPFSLLPIYCVSKSVREMVTSSIKIKACDSLCNQLNYKNCTRLTICNELDISMCCIEDNPFLGWKEPGIARYLFTMIVVATVSFAILLAKEYELWNKTMMLSGTKPKSNESKKVEVNAEVEDDDVVEEKQRVLAMTSSEVTAHSLVCRELSKRYRRLVAVDRLTFAVRGGECFGLLGVNGAGKTSTFRMLTGDARVSDGDALVHGHSVRAHVQDVHRLIGYCPQFDALFDNLTAREILKIFCLLRGIPTSIGETHAIHLAKQLGFIKHYDKKVRECSGGTKRKISTAVALLGDYPVIFLDEPTTGMDPASKRLVWRGISSAVGGGRSVVLTSHSMEECEALCSKLTVMVNGRLCCLGSLQHLKSKFSQGYTIIVKCKSGPNRDAAVLDVHNYMTTNFVGANLIETYLGMSTYHVSSAGLPWWRVFSALELARDSLPLDDYSVAQTTLEQVFLAFTKLQRPIN
Domains and motifs
Database | ID | Description | Start | End | Evalue | InterPro ID |
---|---|---|---|---|---|---|
PANTHER | PTHR19229 | - | 7 | 1690 | 0.0 | IPR026082 |
Pfam | PF12698 | ABC-2 family transporter protein | 24 | 477 | 3.2e-17 | - |
SUPERFAMILY | SSF52540 | - | 527 | 772 | 1.5e-61 | IPR027417 |
Gene3D | 3.40.50.300 | - | 538 | 772 | 8.8e-71 | - |
ProSiteProfiles | PS50893 | ATP-binding cassette, ABC transporter-type domain profile. | 541 | 771 | 21.349 | IPR003439 |
CDD | cd03263 | ABC_subfamily_A | 543 | 759 | 5.4e-113 | - |
Pfam | PF00005 | ABC transporter | 557 | 701 | 5.8e-31 | IPR003439 |
SMART | SM00382 | - | 566 | 748 | 7.9e-09 | IPR003593 |
Pfam | PF12698 | ABC-2 family transporter protein | 989 | 1327 | 2.7e-30 | - |
Gene3D | 3.40.50.300 | - | 1372 | 1616 | 2.1e-55 | - |
CDD | cd03263 | ABC_subfamily_A | 1384 | 1602 | 1.2e-97 | - |
ProSiteProfiles | PS50893 | ATP-binding cassette, ABC transporter-type domain profile. | 1384 | 1614 | 16.54 | IPR003439 |
SUPERFAMILY | SSF52540 | - | 1384 | 1605 | 2.0e-48 | IPR027417 |
Pfam | PF00005 | ABC transporter | 1400 | 1542 | 8.5e-20 | IPR003439 |
SMART | SM00382 | - | 1408 | 1593 | 1.1e-03 | IPR003593 |
InterPro assignment
InterPro ID | InterPro description |
---|---|
IPR003439 | ABC transporter-like |
IPR003593 | AAA+ ATPase domain |
IPR026082 | ABC transporter A |
IPR027417 | P-loop containing nucleoside triphosphate hydrolase |
Gene ontology (GO) assignment
GO category | GO ID | GO description |
---|---|---|
molecular function | GO:0005524 | ATP binding |
cellular component | GO:0016021 | integral component of membrane |
molecular function | GO:0016887 | ATPase activity |
molecular function | GO:0042626 | ATPase-coupled transmembrane transporter activity |
biological process | GO:0055085 | transmembrane transport |
Species | Accession |
---|---|
Danaus plexippus | DPOGS200378 |
Heliconius melpomene | HMEL005382g1.t1 |
Manduca sexta | XP_030022238.1 |
Plutella xylostella | g742.t1 |
Spodoptera frugiperda (corn) | GSSPFG00014947001-PA |
Spodoptera frugiperda (rice) | SFRICE015440-PA |
Acyrthosiphon pisum | XP_003246127.1 XP_029344491.1 XP_029344492.1 |
Aedes aegypti | AAEL008386-PC |
Anopheles gambiae | AGAP006379-PA AGAP012156-PA |
Apis mellifera | XP_397465.5 |
Drosophila melanogaster | FBpp0304765 |
Tribolium castaneum | XP_008199153.1 XP_015840355.1 |
Homo sapiens | NP_001080.2 |
Mus musculus | NP_001034670.1 NP_038883.2 XP_006524433.1 |