KWMTBOMO07383

Position

Chromosome	Chr12
Start	11622974
End	11656359
Strand	-


Most similar sequence in NCBI nr database

Accession	Description	E-value	Score
XP_012546133.1	multidrug resistance-associated protein 1 isoform X2	0.0	3122

ORF sequence (4551 bp)

ATGTCTTACAACTCCACGCTTGACAGCTTTTGCGGGACACCGTTTTGGAACAGTACTGCCACCTGGTACACGGATAATCCTGAACTAACACCATGTTTCCAGCAGACTGTCCTCATTTGGACACCCTGTCTCTATCTCTGGGTGTTTGCATTCCTTGACTTATATTACATATTCAATAGTAAGGAACGAAATATACCATGGAACATACTGAACATCACGAAATTGCTAGTAACATGTCTTTTAATAGTTTTGAAATTTGTGGACTTGGGTGTAGCAGTCCATTTATCTTCAAACGGAGAAAAGGAAGTCTACAATGCGAATTATTACAGTCCTGTAATAAAAATATTAACATTCGGCTTATCAGCAACATTGCTATTCTATAACAGAAAGTATGGTATGAGAGCATCTGGAGTGCTCTTTTTCTTCTGGTTACTGCTCGTCGTAGCTGGAATACCCCAGCTAAGATCCGAAATTATCGATCACAAGAATTTAGATGACGACGAAAACGTCAAATACAATTTTATTAGTTACATGGTCTACTATCCTTTGATAGTTCTCATGTTCATACTTAATTGTTTTGCCGATTTGCCACCTAAGGACACGCCTTATAAGTATCAAAAGAATCAATGTCCGGAGAATGCAGCGGGATTCCCGAGCCGCCTCACGTTTAGCTGGTTCGATCCGCTCGCCCTGACTGGATTCCGCAGGAGTCTTGTTGAGAACGATCTGTGGGCCCTGAACCCACCGGACTCTTCGAAAGAATGCGTTCCGAAATTCGATAAATTTTGGGAAAGATCTCTGAAGAAACGCGAACTATCGAACGGTACTAAGGCGACATACCGAAAAACTTCAGCCAGTGTCAACTTCAAGCCAGAAAATGAAAAGAAGCCAGCCTCGATATTACCAGCACTGTGTCTCGCATTTGGAGGACAGTTTCTTTTTGGTTCTATCTTGAAGTTAATCAACGATATTCTCATGTTTATATCTCCACAGCTCCTCAAATTATTAATAAGTTTCGTTAAGAACGATGAACCGGACTGGAAAGGTTACGCATACGCCGTCGCGCTGCTGCTGTGTGCCATTACTCAAACAATGCTATTGGCGCACTACTTCACTAGGATGTATCTAGTGGGTATGAGGATCAGAACAGCGTTGACGAGCGCCATCTACAGAAAATCCCTACGTTTGTCGAACTCGGCTAGAAAGGAGTCAACCGTTGGAGAAATTGTCAATTTGATGTCTGTCGATGCTCAGAGATTCGTTGAGTTAACGGCATATTTGAACATGATATGGTCGGCGCCTCTGCAGATCGCCCTCGCCCTATATTTCTTGTGGGCTATCTTAGGTCCGTCAGTTCTCGCTGGTTTGGCGGTAATGATCATTCTTATACCGGTGAACGGTCTAATAGCGAACAGGGTCAAGACATTGCAAATAAAACAGATGAAGTACAAGGATGAGAGAGTTAAACTTATGAACGAAGTGCTCAATGGAATTAAGGTTCTGAAAATGTACGCTTGGGAGCCTAGCTTCGAAGATCAGATCCTGCAAATAAGGAACAAGGAAATGCACGTACTGAAACAAACGGCGTATCTAAATTCTGCGACCTCTTTCATTTGGTCATGTGCTCCGTTTTTGGTTTCACTGATGTCGTTCGGATGCTTTGTATTGGTGAACGATAAGGAAACTCTGGACTCTGAGAAGGCTTTCGTTGCATTATCACTTTTCAACATACTGCGTTTTCCACTGTCAATGCTGCCTAACGTTATATCGAATGTAGTACAAACGTCGGTAGGCATAAAGAGATTGAACAAATTCATGAACTGCGATGAGCTCGACATTAGTTCCGTCGATCACGACAAGAAAGAGCCCAGTCCAATTGTTATCGAAAACGGGAATTTCACGTGGGGCGAAAAAGATGCCGATCCTGTATTGAAGAACATTAATTTGAATGTACCGCGTGGCTCTTTGGTGGCCATAGTCGGAGCTGTAGGTTCCGGGAA
GAGTTCATTGCTCGCTGCAATGCTTGGCGAGATGAACAAGATATCAGGCAGAGTGAATACACACGGCAGCATAGCGTACGTCCCGCAACAGGCCTGGATCCAAAATGCCACGCTACAAGATAATATACTCTTTGGGAAGCCTCTGCAGCAGCAGTCTTATAATAACGTTATTAATGTGTGCGCCCTCAAACCCGACTTTGACGTGTTACCTGGAGGAGATCAGACCGAAATCGGAGAGAAAGGTATAAACTTATCGGGCGGTCAGAAGCAGCGCGTGTCGCTGGCGCGCGCCGTGTACCACGAGGCGGACAACTACCTGCTGGACGACCCGCTGAGCGCCGTGGACTCGCACGTCGGCAAGCACATCTTCGACAAGGTCATCGGGCCCGCCGGCCTGCTCAAGGACAGGACGCGCGTCTGGGTCACGCACAACGTGTCCTACCTCGCGCAGACCGACCTGGTCGTCGTGCTGCGCGACGGACAGGTCTCCGAGGCCGGCTCCTACCAGCACCTGCTCGAGAAGAAGGGCGCCTTCGCTGACTTCCTGCTTCATCATCTGAGCGACATCGAGAAAACTTCGCCTGACGAGTTAGATGTTCTCAAGCAAGACCTGGAAACAAAACTCGGTACAGAATTCCAGAACAAACTGCAAAGGGCTCGCTCGCTCTCCGAAAGTACTTCAGAGTCGGAACAAACTCCCGCAGGTGATAGAGCTGGCAGCGTGAAACAAAAGACTCCTGACGCCCTCACTCAAAGCAATCTCAAAGAAAAGAACAAACTGATTGAAGCCGAGAAAGCGGAAACAGGAAGTGTGAAATGGAGTATATATAAGCACTACTTGATGAGCGTCGGTGTTTTCGCGTCCGTCGTGACGATTCTGATGAATCTGATCCTGCAAGTGTTCCAAGTCGGCTCCAACTACTGGCTGGCGGAGTGGTCCAGTGATTCGAAAATCATTGTTAACGGAACAGTGGACAGAGCAAAAAGGGACATGTACCTCGGTGTGTATGGAGCGCTCGGAGCAGGACAGGTGCTGTCGGTGTCGGTGTCGTCGCTGGCGCTATACCTGGGCACGCTGGCGGCGGCGCGCGCCCTCCACGCGACTCTGCTGGCCGGCGTGCTGCGGGCGCCGTCCATCGGCTTCTTCGACTGCACGCCCGTCGGCCGCGTGCTCAACCGCTTCAGCAAGGACGTTGACGTGCTCGACAACACGCTGCCCATGACGCTGCGGGGCTGGACGTCCTGCTTCTTCTCGGTACTGGGAACGCTGTTCGTGATAAGCGTGTCGACGCCCTTATTCTTAGTGATCGTGGTCCCGGTGGGTCTGATCTACTACGTGATCCAGAGGTTCTACGTGGCGACGTCCCGGCAACTCAAACGGCTGGAGTCGATATCGCGATCGCCCATCTATTCGCATTTCGGCGAGAGTATCACCGGGGCGTCCACAATACGAGCCTACGGTGTAACTGATAGATTTATTGAAGAAAGCGAAAGTCGAGTGGATCACAATCAGTCGTGCTATTACCCGAGCTGCATCGCGAATCGGTGGCTGGCCATCCGTCTGGAAATGATTGGAAACTTCATTATATTCAGCGCGGCTGTGTTCGCGGTGCTCGGTAGAAACTCTATCTATCCGGGTATTGTAGGACTCTCGATCAGCTATGCGCTACAGATCACTCAAACACTCAATTGGTTGGTCAGAATGACCTCTGAAGTTGAAACGAATATTGTCGCCGTCGAAAGGATAAAAGAGTACGCGGAAACCGAACAGGAGGCCGCGTGGAACTTGGAGAAAGGGCCCGGCGCAACGTGGCCCGAGACGGGAGCGCTACAGCTGGAGCAGCTGACGCTGGAGTACCGCCCCGGCGAGCCGGCGCTGCGTGACGTCACGTGCACGGTGGCGCCGCGCGACAAGCTCGGCATCGTGGGACGCACCGGCGCCGGCAAGTCTACTCTCACGCTCGGGCTCTTCAGGATCGTGGAGGCAGCCGCCGGACGCA
TCCTGATCGACGGCGTCGACATCGCGACCCTCGGGCTACACCAGCTGCGGTCGCGCATCACCATCATCCCGCAGGAGCCGATCCTGTTCTCGGGCACGCTGCGCTCCAACCTGGACCCGTTCGAGGCGTACAGCGACGAGGAGATCTGGCGCGCGCTGGAACACGCCCACCTCCGTGCCTTCGTGCAGGGGCTGCCGGCCGGGCTGCGGCACGAGGTGGCGGAGGGCGGCGAGAACCTGTCGGTGGGACAGCGCCAGCTCGTGTGCCTGGCGCGCGCGCTGCTACGCAAGACGCCGCTGCTGGTGCTGGACGAGGCCACCGCGGCCGTCGACCTCGAGACTGACGACCTCATCCAGAAGACGATCCGCTCGGAGTTCGCGTCGTGCACGGTGCTCACTATCGCGCACCGCCTCAACACCATCATGGACTCCACCAAAGTGATGGTGCTCGACCGCGGGCAGCTCGTAGAATATGCGGCCCCTCAACAACTACTTAACGATAAAAACTCCATTTTCTACTCCATGGCTAAAGATGCCGGAATAGTTAACTAA

Protein sequence (1516 aa)

MSYNSTLDSFCGTPFWNSTATWYTDNPELTPCFQQTVLIWTPCLYLWVFAFLDLYYIFNSKERNIPWNILNITKLLVTCLLIVLKFVDLGVAVHLSSNGEKEVYNANYYSPVIKILTFGLSATLLFYNRKYGMRASGVLFFFWLLLVVAGIPQLRSEIIDHKNLDDDENVKYNFISYMVYYPLIVLMFILNCFADLPPKDTPYKYQKNQCPENAAGFPSRLTFSWFDPLALTGFRRSLVENDLWALNPPDSSKECVPKFDKFWERSLKKRELSNGTKATYRKTSASVNFKPENEKKPASILPALCLAFGGQFLFGSILKLINDILMFISPQLLKLLISFVKNDEPDWKGYAYAVALLLCAITQTMLLAHYFTRMYLVGMRIRTALTSAIYRKSLRLSNSARKESTVGEIVNLMSVDAQRFVELTAYLNMIWSAPLQIALALYFLWAILGPSVLAGLAVMIILIPVNGLIANRVKTLQIKQMKYKDERVKLMNEVLNGIKVLKMYAWEPSFEDQILQIRNKEMHVLKQTAYLNSATSFIWSCAPFLVSLMSFGCFVLVNDKETLDSEKAFVALSLFNILRFPLSMLPNVISNVVQTSVGIKRLNKFMNCDELDISSVDHDKKEPSPIVIENGNFTWGEKDADPVLKNINLNVPRGSLVAIVGAVGSGKSSLLAAMLGEMNKISGRVNTHGSIAYVPQQAWIQNATLQDNILFGKPLQQQSYNNVINVCALKPDFDVLPGGDQTEIGEKGINLSGGQKQRVSLARAVYHEADNYLLDDPLSAVDSHVGKHIFDKVIGPAGLLKDRTRVWVTHNVSYLAQTDLVVVLRDGQVSEAGSYQHLLEKKGAFADFLLHHLSDIEKTSPDELDVLKQDLETKLGTEFQNKLQRARSLSESTSESEQTPAGDRAGSVKQKTPDALTQSNLKEKNKLIEAEKAETGSVKWSIYKHYLMSVGVFASVVTILMNLILQVFQVGSNYWLAEWSSDSKIIVNGTVDRAKRDMYLGVYGALGAGQVLSVSVSSLALYLGTLAAARALHATLLAGVLRAPSIGFFDCTPVGRVLNRFSKDVDVLDNTLPMTLRGWTSCFFSVLGTLFVISVSTPLFLVIVVPVGLIYYVIQRFYVATSRQLKRLESISRSPIYSHFGESITGASTIRAYGVTDRFIEESESRVDHNQSCYYPSCIANRWLAIRLEMIGNFIIFSAAVFAVLGRNSIYPGIVGLSISYALQITQTLNWLVRMTSEVETNIVAVERIKEYAETEQEAAWNLEKGPGATWPETGALQLEQLTLEYRPGEPALRDVTCTVAPRDKLGIVGRTGAGKSTLTLGLFRIVEAAAGRILIDGVDIATLGLHQLRSRITIIPQEPILFSGTLRSNLDPFEAYSDEEIWRALEHAHLRAFVQGLPAGLRHEVAEGGENLSVGQRQLVCLARALLRKTPLLVLDEATAAVDLETDDLIQKTIRSEFASCTVLTIAHRLNTIMDSTKVMVLDRGQLVEYAAPQQLLNDKNSIFYSMAKDAGIVN
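
The stated lengths (4551 bp ORF, 1516 aa protein) are consistent if the ORF includes one stop codon: 3 x (1516 + 1) = 4551. The following is a minimal sketch of that check in Python, assuming the standard genetic code; the function names are illustrative and not part of the database. The two short fragments are copied from the start and end of the ORF sequence above.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def expected_orf_length(protein_length: int, include_stop: bool = True) -> int:
    """Length in bp of an ORF encoding protein_length residues."""
    codons = protein_length + (1 if include_stop else 0)
    return 3 * codons


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Length check against the values stated in the record.
print(expected_orf_length(1516))  # 4551

# First and last 27 nt of the ORF, copied from the sequence above.
orf_head = "ATGTCTTACAACTCCACGCTTGACAGC"
orf_tail = "GCTAAAGATGCCGGAATAGTTAACTAA"
print(translate(orf_head))  # MSYNSTLDS
print(translate(orf_tail))  # AKDAGIVN*
```

The translated fragments match the first and last residues of the protein sequence above, with the final TAA read as the stop codon.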

Corresponding sequences in KAIKObase version 1

BMgn010331
BMgn015914

Domains and motifs

Database	ID	Description	Start	End	E-value	InterPro ID
TIGRFAM	TIGR00957	MRP_assoc_pro: multi drug resistance-associated protein (MRP)	12	1514	0.0	IPR005292
PANTHER	PTHR24223	-	54	1515	0.0	-
PANTHER	PTHR24223:SF350	-	54	1515	0.0	-
SUPERFAMILY	SSF90123	-	304	617	1.7e-42	IPR036640
Gene3D	1.20.1560.10	-	306	609	4.2e-28	IPR036640
ProSiteProfiles	PS50929	ABC transporter integral membrane type-1 fused domain profile.	313	594	40.477	IPR011527
CDD	cd18595	ABC_6TM_MRP1_2_3_6_D1_like	314	602	4.4e-174	-
Pfam	PF00664	ABC transporter transmembrane region	314	582	5.3e-43	IPR011527
Gene3D	3.40.50.300	-	624	856	3.6e-70	-
SUPERFAMILY	SSF52540	-	624	849	1.7e-57	IPR027417
CDD	cd03250	ABCC_MRP_domain1	626	828	5.3e-117	-
ProSiteProfiles	PS50893	ATP-binding cassette, ABC transporter-type domain profile.	626	851	22.329	IPR003439
Pfam	PF00005	ABC transporter	644	778	8.8e-21	IPR003439
SMART	SM00382	-	653	828	2.8e-05	IPR003593
ProSitePatterns	PS00211	ABC transporters family signature.	751	765	-	IPR017871
Gene3D	1.20.1560.10	-	919	1260	6.4e-43	IPR036640
CDD	cd18603	ABC_6TM_MRP1_2_3_6_D2_like	955	1253	3.2e-175	-
ProSiteProfiles	PS50929	ABC transporter integral membrane type-1 fused domain profile.	956	1241	36.556	IPR011527
SUPERFAMILY	SSF90123	-	956	1258	6.7e-54	IPR036640
Pfam	PF00664	ABC transporter transmembrane region	958	1228	1.1e-37	IPR011527
Gene3D	3.40.50.300	-	1261	1511	7.8e-85	-
SUPERFAMILY	SSF52540	-	1269	1508	8.8e-69	IPR027417
CDD	cd03244	ABCC_MRP_domain2	1275	1494	8.0e-128	-
ProSiteProfiles	PS50893	ATP-binding cassette, ABC transporter-type domain profile.	1277	1510	18.267	IPR003439
Pfam	PF00005	ABC transporter	1293	1441	2.5e-28	IPR003439
SMART	SM00382	-	1302	1487	1.6e-11	IPR003593
ProSitePatterns	PS00211	ABC transporters family signature.	1413	1427	-	IPR017871
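
The Pfam rows in the table above can be used to read off the domain order along the protein. The following is a rough sketch in Python, assuming the Pfam Start/End values are 314-582, 644-778, 958-1228 and 1293-1441 (read from the table) and that coordinates are 1-based and inclusive; the `architecture` helper is hypothetical and only sorts hits and checks them for bounds and overlap.

```python
PROTEIN_LENGTH = 1516  # aa, from the "Protein sequence" heading

# (Pfam ID, start, end) as read from the Domains and motifs table.
PFAM_HITS = [
    ("PF00005", 644, 778),    # ABC transporter
    ("PF00664", 314, 582),    # ABC transporter transmembrane region
    ("PF00005", 1293, 1441),  # ABC transporter
    ("PF00664", 958, 1228),   # ABC transporter transmembrane region
]


def architecture(hits, protein_length):
    """Return domain IDs in N- to C-terminal order.

    Raises ValueError if a hit lies outside the protein or if two
    consecutive hits overlap.
    """
    ordered = sorted(hits, key=lambda hit: hit[1])
    for name, start, end in ordered:
        if not 1 <= start <= end <= protein_length:
            raise ValueError(f"{name} {start}-{end} is outside the protein")
    for left, right in zip(ordered, ordered[1:]):
        if right[1] <= left[2]:
            raise ValueError(f"{left[0]} and {right[0]} overlap")
    return [name for name, _, _ in ordered]


print(" > ".join(architecture(PFAM_HITS, PROTEIN_LENGTH)))
```

Under these assumptions the order is transmembrane region, ABC transporter, transmembrane region, ABC transporter.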

InterPro assignment

InterPro ID	InterPro description
IPR003439	ABC transporter-like
IPR003593	AAA+ ATPase domain
IPR005292	Multi drug resistance-associated protein
IPR011527	ABC transporter type 1, transmembrane domain
IPR017871	ABC transporter, conserved site
IPR027417	P-loop containing nucleoside triphosphate hydrolase
IPR036640	ABC transporter type 1, transmembrane domain superfamily

Gene ontology (GO) assignment

GO category	GO ID	GO description
molecular function	GO:0005524	ATP binding
cellular component	GO:0016021	integral component of membrane
molecular function	GO:0016887	ATPase activity
molecular function	GO:0022857	transmembrane transporter activity
molecular function	GO:0042626	ATPase-coupled transmembrane transporter activity
biological process	GO:0055085	transmembrane transport

Species	Accession
Danaus plexippus	DPOGS208494
Heliconius melpomene	HMEL008762g1.t1
Manduca sexta	XP_030027496.1
Plutella xylostella	g12142.t2
Spodoptera frugiperda (corn)	GSSPFG00009633001.2-PA
Spodoptera frugiperda (rice)	SFRICE017661-PA
Spodoptera frugiperda (rice)	SFRICE032051-PA
Acyrthosiphon pisum	XP_003243122.1
Aedes aegypti	AAEL004743-PO
Anopheles gambiae	AGAP009835-PA
Anopheles gambiae	AGAP009835-PB
Apis mellifera	XP_006558751.1
Apis mellifera	XP_026295849.1
Apis mellifera	XP_026295850.1
Apis mellifera	XP_026295851.1
Drosophila melanogaster	FBpp0089069
Tribolium castaneum	XP_008197283.1
Tribolium castaneum	XP_008197292.1
Homo sapiens	NP_004987.2
Mus musculus	NP_032602.1

The expression data were obtained from RNA-seq of ten tissues/locations.
Three replicates were sequenced for each tissue/location.

The Y axis is the abundance value in transcripts per million (TPM).
The X axis shows the tissues/locations, with the following abbreviations:
ASG: anterior silk gland; FB: fat body; MG: midgut;
MSG_A: middle silk gland (anterior); MSG_M: middle silk gland (middle); MSG_P: middle silk gland (posterior);
MT: Malpighian tubules; OV: ovary; PSG: posterior silk gland; TT: testis

For more details, please refer to the article "Reference transcriptome data in silkworm Bombyx mori" (Yokoi et al., 2019).
Expression data (TPM) can be obtained from the National Bioscience Database Center.

Expression data of KWMTBOMO07383:

Expression data of alternative splicing isoform(s) of KWMTBOMO07383:

MSTRG.4010.1 (position: Chr12:11621905..11657374)

MSTRG.4010.2 (position: Chr12:11621991..11657195)

MSTRG.4010.3 (position: Chr12:11621991..11633939)

MSTRG.4010.4 (position: Chr12:11622103..11657333)

MSTRG.4010.6 (position: Chr12:11638641..11654776)
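
The isoform positions listed above use the form `Chr12:start..end`. The following is a minimal sketch in Python of parsing such strings and comparing each isoform span with the gene locus (Chr12:11622974..11656359, from the Position section). It assumes coordinates are 1-based and inclusive, which the page does not state; `Region`, `parse_region` and `overlaps` are illustrative names.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: 1-based, inclusive coordinates.
        return self.end - self.start + 1


_REGION_RE = re.compile(r"^(\w+):(\d+)\.\.(\d+)$")


def parse_region(text: str) -> Region:
    """Parse a 'Chr12:11622974..11656359' style position string."""
    match = _REGION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised region: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is after end in {text!r}")
    return Region(chrom, start, end)


def overlaps(a: Region, b: Region) -> bool:
    """True if two regions on the same chromosome share at least one base."""
    return a.chrom == b.chrom and a.start <= b.end and b.start <= a.end


GENE = parse_region("Chr12:11622974..11656359")
ISOFORMS = {
    "MSTRG.4010.1": "Chr12:11621905..11657374",
    "MSTRG.4010.2": "Chr12:11621991..11657195",
    "MSTRG.4010.3": "Chr12:11621991..11633939",
    "MSTRG.4010.4": "Chr12:11622103..11657333",
    "MSTRG.4010.6": "Chr12:11638641..11654776",
}

print("gene", GENE.length)
for name, text in ISOFORMS.items():
    region = parse_region(text)
    print(name, region.length, overlaps(region, GENE))
```

Under the inclusive-coordinate assumption, the gene locus spans 33386 bp and every listed isoform overlaps it.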

The same expression data are also shown on a log10 scale, for KWMTBOMO07383 and for each alternative splicing isoform listed above.

The Y axis is the abundance value with log10 conversion (the value of transcripts per million (TPM) plus 1 is used to avoid negative output).
The X axis, the tissue/location abbreviations and the data source are the same as for the TPM plots above.
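
The log10 conversion described for the Y axis, log10(TPM + 1), can be sketched in Python as follows. The function name is illustrative, and the TPM values in the example are made up to show the behaviour; they are not expression values for this gene.

```python
import math


def log10_tpm(tpm: float) -> float:
    """Return log10(TPM + 1); adding 1 keeps the result non-negative."""
    if tpm < 0:
        raise ValueError("TPM cannot be negative")
    return math.log10(tpm + 1.0)


# Hypothetical TPM values, for illustration only.
for tpm in (0.0, 0.5, 9.0, 99.0):
    print(tpm, round(log10_tpm(tpm), 3))
```

Without the added 1, a TPM of 0 would be undefined and any TPM below 1 would give a negative value; with it, a TPM of 0 maps to 0.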