Position
Chromosome | chr17 |
Start | 7517823 |
End | 7548912 |
Strand | + |
view in Gbrowse: chr17:7517823..7548912
Most similar sequence in NCBI nr database
Accession | Description | E-value | Score |
---|---|---|---|
XP_028043812.1 | ATP-binding cassette sub-family A member 3 isoform X1 | 0.0 | 3518 |
ORF sequence (5106 bp)
ATGTCAGCTGTTGCAAAACTCAAACTTCTCATCTGGAAAAATGTTCTTCTCCAAAAAAGACACAAGTGGCAAACCATTTTTGAAATAGCATCACCAGTTATATTTTCTTTATTTTTAATATTAACAAGATGCCTCGTAGACCCAAAATCTAAACCTGCTATTACTTATTCACCATTTCAACCAACATATCTCAACATTACTGGTAGAAACCTAGGAAATTTAACAGCAGCTAAAACAGGAACACTTGCATTCTCTCCAGAAAACCCTTTAACAAGAAATGTTGTAAGGGATGCTATTGCAATGGTTGCAAATGACAACTTAAGTTTCCTTTTTACTTTTATATTTGACTCAAATATTCTACCTCAGCCTAAAGGATATAAGAATGCTAAAGATATGGAACTGGCACTTACACAACCCAATGCAATGAATCAAATTTTAGTTGGAATCCAATTTGATGATGTAATGGCCAATGCTACAGAATGGCCTGAAAACATAACTGTAAGATTTAGATTTCCAGCTGTAATGAGAACTCCGATGATTGAGCATCCACTGAGAGCTAGTTGGAGGACTAATTTACTCTTTCCATTGTTTCCAAGGCCAGGGCCAAGAGATGCAGACGATATGTATGGTGGAAAAACTCCTGGATATTCACCAGAAATGTTTTTAGCGGTACAGCATGCAGTGTCACAAGAAATAATCAAACAAAAAACTGGTAAAGCTATAAATACCAAAGTATACCTACAAAGATTTCCTCAGGTGGCATACAGAGAAGACGAATTGTTGATAGCTTTAGAGAGATTTATATCTATGATAATAATGCTTTGCTTTGCCTACACATTTGTGAACACCGTAAAAATGGTAACAAATGAAAAAGAATTACAACTCAAGGAAACTATGATAATAATGGGTCTACCCTCATGGCTCCATTGGCTTGCTTGGTTTATAAAACAATTCTCATTTCTTTTAATATCAGTACTTCTAATTGTGATCTTGCTAAAGATTCCTTTTAATCACACTGAAGATGGTCAAGGCTACTCTGTGCTCACATTTACACCTTGGTCTGTTCTATTTTTCTTTATGGTGCTTTACGTGATGGCATCACTTGCATTTTCTTTCATGATCAGCGTCTTTTTCAATAAAGCCAACACCGCCTCTTCGTTTATGGGTCTGGCGTGGTTTTCTACTTATGCCGTTTTCATGTTGACGCAAGTGCTGTACGAAGATATAAGTCTTACTACAAAATTACTTCTTAGTCTTATATCGAATACCGCCATGGGTTACGCCTTCCAGATGATTATAATGTGTGAAGGCACATCCAGAGGACTTCAATGGAACGAGTTTTTCTCTCCAATATCATATCACGACAAATTTCAACCGGGGCATGTTATGTTGATGTTGATGCTGGACACAATATTGTACATGTTAATTGCTATGTATGTCGAAAAAATACGACCGGGGAAATTCGGTGTACCGATGCCGTGGTATTTTCCATTCACGAAAAAATTTTGGTCGACAAATAAGTCGAGGATCGCTGCTGCAAAAACGGAAGAATCTAGCGATACAGCGTACCATGATGCTCTCCTGAAAGTTGTTCACGACGAAGAACCTAAAGATTCGCCCGTAGGTGTCGACATTCAAAATCTCACAAAGGTTTATAAAGGCAGAAAGGCGGCCGTTGATAATTTAACGTTAAGATTATATGAAAACGAAATCACCGTATTGCTCGGTCACAATGGGGCTGGCAAAACTACAACGATATCGATGTTAATCGGTATGATTCCTCCGACCTCCGGTACGGCTACGATAAGCGGCTACAATATAGTTACGGAAACTGAAATGGCACGTAGTTCTATAGGCATATGTCCGCAGCACAATGTCCTCTTCCCCGATCTTACTGTAGCGGAGCACATAGAATTTTACGCAAGATTAAAAGGAGTTTCGAACAATGAAATCCAGAAAGAGGTTGATCATTTCGTCAAACTTCTTGAGCTTGAGGAAAAAAGGCACGCGGCGGCGTCGAGTCTGTCTGGCGGGCAGAAGCGGCGGCTGTCGGCGGGCTGCGCGCTGTGCGGGCGCTCGCGCGTGGTGCTGCTGGACGAGCCCACGTCGGGCCTGGACCCGGCGGCGCGGCGCGCGCTGTGGGACCTGCTGCAGCGCGAGAAGCGCGGCCGCACCGTGCTGCTGACCACGCACTTCATGGACGAGGCGGACGTGCTGGCCGACCGCGTGGCCGTGCTCGCCGCCGGCCGCCTGGCCTGTCTGGGCTCCCCCTACTTCCTCAAGCGCCACTACGGACTCGGCTACAAGCTCGCGCTCGTCAAGGACGCCGCCTGCCAAGTCGATCTTGTTACAGAATTCTTCAAAACATACATCCCGGATATAAAGGAAAATACTAATATAGGATCGGAACTGACTTACATCCTACCTAACGAATATGTTAGTAAATTTCCCGACATGTTCAAAGATTTAGAAGAAAAAAGAGAATTGTTGAAAATATCTAGTTATGGTCTGTCATTGGCCAGCCTCGAAGAAGTGTTTATGAAAGCTGGTGCAGAATATAGTACAGAAAACGAGCCTAAAAGAAACCATCACAACATAGTTCCTTATAGCGATGATGCTATTGCGCCCATAGAATCTCAAAATTATAGTCATCTCGATTCTTCGGAGAAAGTCCGTGGTTTTAAACTTCTCAAAAATCATATAAAAGCAATGTTTTTGAAGTTAGCATACAACACTATGAGGAATAAACTGGTGGCTTTTATACAATTTGTGAGCCCCGTGATCAATATAATACTTTCTGTTATAATTTCGAGATCCTGGAAGTTCCTTTCGCAACTACCCCCACTAACGCTTCGTTTAGAGAGCGGATTTAAAACCACTCAAACTCTAATATCACAAATTCTTAATATAACAGAAGAGAGTATAGAAGCCAGAGCTCTGAAGGCCTATAAAAACTACTTTAAGCTCTCCACTTATCCTGGAATGACTTTGACGGACATTGGTACAATGGACATGGGAAAATTTTACTTCAAGCTGAGTGATTCCGATCTTCCTCGCGTAAAATACGAAACATTGGTCGGCGCTACATTTGGCCAAAACAAAATAACTGCTTGGTTCAGTAATTATGGATACCACGATTCTGCTATCTCGCTCGCACTTGTGAATGACGCCATACTGCGTGCCCTGTCGCCTGGAAGTTCCTTGAGAATCGTCAACTTTCCACTGCCGTATTCGATAGAAAACCTGGTTGAAGTAATGGCTACCGGTAGCAGCATGGGATTTCAATTCGCATTTAATATTGGATTCTGTATGGCTTTCGTGACTTCGTTTCTCGTCCTTTTTTCGGTTAAAGAGCGGCGTAGCGGGGCCAAGCTCTTGCAGCGCGTGTCGGGCGTGCGGCCGGCCGTCCTGTGGCTCAGTGCGCTTGTGTGGGACTGGCTGTGGCTGTTCCTCATCTACTTGTGTATTGTCTTTACTCTCGCTTGTTTCCACGAGAAGACGCTCTCCACGCCTCAGGAATTAGGGCGGGTGTTACTAGTTCTCGTAGTGTTTTCACTGGCAATAATACCAATACACTATCTGGCATCATTCTACTTTGACTCTGCTGCGACTGGATTTTCGAAAATGTGCTTCGTCAACGTATTCAGTGGTTGTATGCCGTTTTTAATTACTGAAGTATTAAGATTACCACAAGTCGCAAGTCCTTTTTATGCACATCTGTTTGATTGGATTTTTTCGCCGCTACCTATTTATTGTATCAGCCGGAGTTTCAGAGACATGAGTGTATCGTCATTTTCATTGTTGGCCTGCGAAGGCCTCTGCGACCAATTACGTATTGAGAATTGCACACGACACACAATATGCAACCAGCTCAACCTTACCGTGTGTTGTATTGAAGACGATCCTTTTATGAAATGGAGCGAACCTGGAATCGGAAGATACTTGTTTACTATGTTCGCTGTCGGTACAGTAACGTTCAGCATACTACTCGTGAAGGAATATGAACTTCTTGCCAAGCTCATGTACAAACCTCGTGATAAAATGTCGCAAACGAAAACCTCTGTAGAGGAGGCAGTCGAAGACGATGACGTCGCCACGGAGAGACGGCGCGTACTGGCATTGTCGCGTAATGAGGTCACCGCGTTCAGCCTCGTGTGTCGCGAGCTGAGCAAGCGCTACCGGCGCCTCGTAGCCGTCGACCGACTCACGTTCGCGGTGCGCGGCGGAGAGTGCTTCGGCCTGCTCGGAGTCAACGGCGCCGGCAAGACCAGCACCTTCCGCATGCTGACCGGCGACGCACGCGTGTCGGACGGCGACGCGCTCGTGCACGGACACTCGGTGCGAGCACATGTGCAGGACGTGCACCGCCTCATTGGTTATTGCCCCCAGTTCGACGCATTATTCGATAACCTTACTGCGAGAGAGACCCTCCAGATCTTTTGTTTGTTACGCGGTATACCTCAGCGAGTTGGTGATGTACTTGCTCATCGTCTCGCTGTTGAACTGGGATTTGTCGTACACTACGATAAAAAGGTTCATGAGTGCAGCGGTGGAACAAAAAGAAAAATAAGTACTGCTGTAGCTTTGTTAAGCGCTTCGTCGTTAGTTTTCCTGGATGAACCAACAACAGGCATGGATCCTGCGTCGAAGCGGCTCGTGTGGCGCGGCATCAGCAGCGCGGTGGGCGGCGGGCGCAGCGTGGTGCTGACGTCACACAGCATGGAGGAGTGCGAGGCTCTCTGCTCCAAGCTCACCGTCATGGTCAACGGCAGGCTCTGCTGTCTCGGCTCGCTGCAACATCTCAAGAGCAAATTCTCGCAGGGCTATACGATTGTTATTAAATGTCGATCGGGAACTGACAGGGATAGCGATATCACACGCATCGACCAATATATGAAGAAAAATTTCAATGAAGTTAAACTCGTCGAGACGTACCTGGGCATGAGCACGTACCACGTGTCGTCGGCGGGGCTGCCGTGGTGGCGCGTGTTCAGTGCGCTCGAACTAGCGCGGGACTCGCTGCCGCTTGATGACTACTCGGTCGCGCAGACAACACTCGAGCAAGTTTTTCTCGCATTTACAAACCAACAGCGTTCACTAGATTAA
Protein sequence (1701 aa)
MSAVAKLKLLIWKNVLLQKRHKWQTIFEIASPVIFSLFLILTRCLVDPKSKPAITYSPFQPTYLNITGRNLGNLTAAKTGTLAFSPENPLTRNVVRDAIAMVANDNLSFLFTFIFDSNILPQPKGYKNAKDMELALTQPNAMNQILVGIQFDDVMANATEWPENITVRFRFPAVMRTPMIEHPLRASWRTNLLFPLFPRPGPRDADDMYGGKTPGYSPEMFLAVQHAVSQEIIKQKTGKAINTKVYLQRFPQVAYREDELLIALERFISMIIMLCFAYTFVNTVKMVTNEKELQLKETMIIMGLPSWLHWLAWFIKQFSFLLISVLLIVILLKIPFNHTEDGQGYSVLTFTPWSVLFFFMVLYVMASLAFSFMISVFFNKANTASSFMGLAWFSTYAVFMLTQVLYEDISLTTKLLLSLISNTAMGYAFQMIIMCEGTSRGLQWNEFFSPISYHDKFQPGHVMLMLMLDTILYMLIAMYVEKIRPGKFGVPMPWYFPFTKKFWSTNKSRIAAAKTEESSDTAYHDALLKVVHDEEPKDSPVGVDIQNLTKVYKGRKAAVDNLTLRLYENEITVLLGHNGAGKTTTISMLIGMIPPTSGTATISGYNIVTETEMARSSIGICPQHNVLFPDLTVAEHIEFYARLKGVSNNEIQKEVDHFVKLLELEEKRHAAASSLSGGQKRRLSAGCALCGRSRVVLLDEPTSGLDPAARRALWDLLQREKRGRTVLLTTHFMDEADVLADRVAVLAAGRLACLGSPYFLKRHYGLGYKLALVKDAACQVDLVTEFFKTYIPDIKENTNIGSELTYILPNEYVSKFPDMFKDLEEKRELLKISSYGLSLASLEEVFMKAGAEYSTENEPKRNHHNIVPYSDDAIAPIESQNYSHLDSSEKVRGFKLLKNHIKAMFLKLAYNTMRNKLVAFIQFVSPVINIILSVIISRSWKFLSQLPPLTLRLESGFKTTQTLISQILNITEESIEARALKAYKNYFKLSTYPGMTLTDIGTMDMGKFYFKLSDSDLPRVKYETLVGATFGQNKITAWFSNYGYHDSAISLALVNDAILRALSPGSSLRIVNFPLPYSIENLVEVMATGSSMGFQFAFNIGFCMAFVTSFLVLFSVKERRSGAKLLQRVSGVRPAVLWLSALVWDWLWLFLIYLCIVFTLACFHEKTLSTPQELGRVLLVLVVFSLAIIPIHYLASFYFDSAATGFSKMCFVNVFSGCMPFLITEVLRLPQVASPFYAHLFDWIFSPLPIYCISRSFRDMSVSSFSLLACEGLCDQLRIENCTRHTICNQLNLTVCCIEDDPFMKWSEPGIGRYLFTMFAVGTVTFSILLVKEYELLAKLMYKPRDKMSQTKTSVEEAVEDDDVATERRRVLALSRNEVTAFSLVCRELSKRYRRLVAVDRLTFAVRGGECFGLLGVNGAGKTSTFRMLTGDARVSDGDALVHGHSVRAHVQDVHRLIGYCPQFDALFDNLTARETLQIFCLLRGIPQRVGDVLAHRLAVELGFVVHYDKKVHECSGGTKRKISTAVALLSASSLVFLDEPTTGMDPASKRLVWRGISSAVGGGRSVVLTSHSMEECEALCSKLTVMVNGRLCCLGSLQHLKSKFSQGYTIVIKCRSGTDRDSDITRIDQYMKKNFNEVKLVETYLGMSTYHVSSAGLPWWRVFSALELARDSLPLDDYSVAQTTLEQVFLAFTNQQRSLD
Domains and motifs
Database | ID | Description | Start | End | Evalue | InterPro ID |
---|---|---|---|---|---|---|
PANTHER | PTHR19229 | - | 7 | 1690 | 0.0 | IPR026082 |
Pfam | PF12698 | ABC-2 family transporter protein | 223 | 479 | 2.3e-20 | - |
Gene3D | 3.40.50.300 | - | 533 | 774 | 6.6e-69 | - |
CDD | cd03263 | ABC_subfamily_A | 543 | 761 | 5.1e-112 | - |
ProSiteProfiles | PS50893 | ATP-binding cassette, ABC transporter-type domain profile. | 543 | 773 | 20.908 | IPR003439 |
SUPERFAMILY | SSF52540 | - | 543 | 764 | 2.8e-59 | IPR027417 |
Pfam | PF00005 | ABC transporter | 560 | 703 | 2.6e-29 | IPR003439 |
SMART | SM00382 | - | 568 | 750 | 1.7e-09 | IPR003593 |
Pfam | PF12698 | ABC-2 family transporter protein | 920 | 1329 | 1.1e-31 | - |
Gene3D | 3.40.50.300 | - | 1373 | 1616 | 9.1e-55 | - |
SUPERFAMILY | SSF52540 | - | 1383 | 1608 | 1.3e-47 | IPR027417 |
CDD | cd03263 | ABC_subfamily_A | 1384 | 1602 | 3.6e-97 | - |
ProSiteProfiles | PS50893 | ATP-binding cassette, ABC transporter-type domain profile. | 1384 | 1614 | 16.827 | IPR003439 |
Pfam | PF00005 | ABC transporter | 1400 | 1542 | 4.7e-20 | IPR003439 |
SMART | SM00382 | - | 1408 | 1593 | 7.1e-04 | IPR003593 |
InterPro assignment
InterPro ID | InterPro description |
---|---|
IPR003439 | ABC transporter-like |
IPR003593 | AAA+ ATPase domain |
IPR026082 | ABC transporter A |
IPR027417 | P-loop containing nucleoside triphosphate hydrolase |
Gene ontology (GO) assignment
GO category | GO ID | GO description |
---|---|---|
molecular function | GO:0005524 | ATP binding |
cellular component | GO:0016021 | integral component of membrane |
molecular function | GO:0016887 | ATPase activity |
molecular function | GO:0042626 | ATPase-coupled transmembrane transporter activity |
biological process | GO:0055085 | transmembrane transport |
Species | Accession |
---|---|
Danaus plexippus | DPOGS200378 |
Heliconius melpomene | HMEL005382g1.t1 |
Manduca sexta | XP_030022238.1 |
Plutella xylostella | g742.t1 |
Spodoptera frugiperda (corn) | GSSPFG00014947001-PA |
Spodoptera frugiperda (rice) | SFRICE015440-PA |
Acyrthosiphon pisum | XP_003246127.1 XP_029344491.1 XP_029344492.1 |
Aedes aegypti | AAEL008386-PC |
Anopheles gambiae | AGAP006379-PA AGAP012156-PA |
Apis mellifera | XP_397465.5 |
Drosophila melanogaster | FBpp0304765 |
Tribolium castaneum | XP_008199153.1 XP_015840355.1 |
Homo sapiens | NP_001080.2 |
Mus musculus | NP_001034670.1 NP_038883.2 XP_006524433.1 |