FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:NAA20-MYO1E (FusionGDB2 ID:56960)

Fusion Gene Summary for NAA20-MYO1E

check button Fusion gene summary
Fusion gene informationFusion gene name: NAA20-MYO1E
Fusion gene ID: 56960
HgeneTgene
Gene symbol

NAA20

MYO1E

Gene ID

51126

4643

Gene nameN-alpha-acetyltransferase 20, NatB catalytic subunitmyosin IE
SynonymsNAT3|NAT3P|NAT5|NAT5P|dJ1002M8.1FSGS6|HuncM-IC|MYO1C
Cytomap

20p11.23

15q22.2

Type of geneprotein-codingprotein-coding
DescriptionN-alpha-acetyltransferase 20N-acetyltransferase 3 homologN-acetyltransferase 5 (ARD1 homolog, S. cerevisiae)N-acetyltransferase 5 (GCN5-related, putative)N-acetyltransferase 5, ARD1 subunit (arrest-defective 1, S. cerevisiae, homolog)N-terminal acetyunconventional myosin-IeMYO1E variant proteinmyosin-ICunconventional myosin 1E
Modification date2020032020200313
UniProtAcc

P61599

Q12965

Ensembl transtripts involved in fusion geneENST00000484480, ENST00000310450, 
ENST00000334982, ENST00000398602, 
ENST00000288235, ENST00000558814, 
Fusion gene scores* DoF score8 X 7 X 3=16810 X 11 X 5=550
# samples 812
** MAII scorelog2(8/168*10)=-1.0703893278914
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(12/550*10)=-2.1963972128035
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: NAA20 [Title/Abstract] AND MYO1E [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointNAA20(20003124)-MYO1E(59553708), # samples:1
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID

check buttonFusion gene breakpoints across NAA20 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across MYO1E (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4STADTCGA-HU-A4GJ-01ANAA20chr20

20003124

+MYO1Echr15

59553708

-


Top

Fusion Gene ORF analysis for NAA20-MYO1E

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
3UTR-3CDSENST00000484480ENST00000288235NAA20chr20

20003124

+MYO1Echr15

59553708

-
3UTR-intronENST00000484480ENST00000558814NAA20chr20

20003124

+MYO1Echr15

59553708

-
5CDS-intronENST00000310450ENST00000558814NAA20chr20

20003124

+MYO1Echr15

59553708

-
5CDS-intronENST00000334982ENST00000558814NAA20chr20

20003124

+MYO1Echr15

59553708

-
5CDS-intronENST00000398602ENST00000558814NAA20chr20

20003124

+MYO1Echr15

59553708

-
In-frameENST00000310450ENST00000288235NAA20chr20

20003124

+MYO1Echr15

59553708

-
In-frameENST00000334982ENST00000288235NAA20chr20

20003124

+MYO1Echr15

59553708

-
In-frameENST00000398602ENST00000288235NAA20chr20

20003124

+MYO1Echr15

59553708

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000334982NAA20chr2020003124+ENST00000288235MYO1Echr1559553708-60053592935381169
ENST00000310450NAA20chr2020003124+ENST00000288235MYO1Echr1559553708-58111658733441085
ENST00000398602NAA20chr2020003124+ENST00000288235MYO1Echr1559553708-632367761738561079

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000334982ENST00000288235NAA20chr2020003124+MYO1Echr1559553708-0.0005289980.999471
ENST00000310450ENST00000288235NAA20chr2020003124+MYO1Echr1559553708-0.0004230590.999577
ENST00000398602ENST00000288235NAA20chr2020003124+MYO1Echr1559553708-0.0007379650.99926203

Top

Fusion Genomic Features for NAA20-MYO1E


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for NAA20-MYO1E


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr20:20003124/chr15:59553708)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
NAA20

P61599

MYO1E

Q12965

FUNCTION: Catalytic subunit of the NatB complex which catalyzes acetylation of the N-terminal methionine residues of peptides beginning with Met-Asp, Met-Glu, Met-Asn and Met-Gln. Proteins with cell cycle functions are overrepresented in the pool of NatB substrates. Required for maintaining the structure and function of actomyosin fibers and for proper cellular migration. {ECO:0000269|PubMed:18570629}.FUNCTION: Myosins are actin-based motor molecules with ATPase activity. Unconventional myosins serve in intracellular movements. Their highly divergent tails bind to membranous compartments, which are then moved relative to actin filaments. Binds to membranes containing anionic phospholipids via its tail domain. Required for normal morphology of the glomerular basement membrane, normal development of foot processes by kidney podocytes and normal kidney function. In dendritic cells, may control the movement of class II-containing cytoplasmic vesicles along the actin cytoskeleton by connecting them with the actin network via ARL14EP and ARL14. {ECO:0000269|PubMed:11940582, ECO:0000269|PubMed:17257598, ECO:0000269|PubMed:20860408}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneMYO1Echr20:20003124chr15:59553708ENST000002882351281051_1108491109.0DomainSH3
TgeneMYO1Echr20:20003124chr15:59553708ENST00000288235128695_724491109.0DomainIQ
TgeneMYO1Echr20:20003124chr15:59553708ENST00000288235128730_922491109.0DomainTH1
TgeneMYO1Echr20:20003124chr15:59553708ENST00000288235128112_119491109.0Nucleotide bindingATP
TgeneMYO1Echr20:20003124chr15:59553708ENST00000288235128581_591491109.0RegionActin-binding

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneNAA20chr20:20003124chr15:59553708ENST00000310450+252_15726112.0DomainN-acetyltransferase
HgeneNAA20chr20:20003124chr15:59553708ENST00000334982+262_15726179.0DomainN-acetyltransferase
TgeneMYO1Echr20:20003124chr15:59553708ENST0000028823512819_692491109.0DomainMyosin motor


Top

Fusion Gene Sequence for NAA20-MYO1E


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>56960_56960_1_NAA20-MYO1E_NAA20_chr20_20003124_ENST00000310450_MYO1E_chr15_59553708_ENST00000288235_length(transcript)=5811nt_BP=165nt
GGTTCCGCGGCGGCCACGCTCCGGGAGACTTCCGGCAGGGCGGGCGCGGGGTCTTGGCGAACGGTCTTCGGAAGCGGCGGCGGCGCGATG
ACCACGCTACGGGCCTTTACCTGCGACGACCTGTTCCGCTTCAACAACATTAACTTGGATCCACTTACAGAAACTACATATATAGGATCT
GTATTAATCTCAGTCAACCCTTTCAAGCAGATGCCATATTTTGGGGAAAAGGAAATTGAAATGTACCAAGGAGCGGCACAGTATGAAAAC
CCACCACATATCTATGCCCTTGCAGATAATATGTACAGAAACATGATCATTGACAGAGAGAACCAGTGCGTCATTATCAGTGGTGAAAGT
GGTGCTGGAAAAACAGTGGCTGCCAAATATATCATGAGCTACATCTCCAGAGTGTCTGGAGGAGGGACCAAAGTCCAGCACGTGAAGGAC
ATTATCCTGCAGTCCAACCCGCTGCTGGAGGCCTTCGGGAACGCCAAGACCGTCCGGAACAACAACTCCAGCCGATTTGGAAAATACTTT
GAAATCCAGTTCAGTCCAGGTGGGGAACCAGATGGTGGAAAGATCTCCAACTTCCTTCTGGAAAAATCTAGGGTGGTGATGAGGAACCCA
GGAGAGCGGAGTTTTCACATATTTTACCAGCTCATCGAGGGCGCCTCTGCAGAGCAGAAACACAGCCTTGGCATCACCAGCATGGACTAT
TATTACTACCTGAGCCTCTCGGGCTCATACAAGGTTGATGACATTGACGACAGGCGGGAGTTTCAGGAAACTCTGCACGCCATGAATGTG
ATTGGGATCTTTGCAGAAGAGCAAACGCTGGTGTTGCAGATAGTGGCGGGTATTCTCCACCTGGGAAACATCAGCTTCAAAGAAGTTGGC
AACTACGCGGCTGTGGAGAGTGAAGAGTTTTTAGCTTTTCCTGCATATCTGCTAGGGATAAACCAGGACCGGTTGAAAGAAAAGCTAACA
AGCCGGCAGATGGATAGCAAGTGGGGAGGCAAATCCGAATCCATCCACGTGACCCTCAACGTAGAGCAGGCCTGTTACACCCGGGATGCG
CTCGCCAAGGCCCTGCACGCCCGGGTCTTTGATTTCTTGGTAGATTCCATCAATAAAGCCATGGAGAAAGACCATGAAGAATACAACATT
GGCGTCCTAGACATCTATGGCTTTGAAATATTCCAGAAAAATGGCTTTGAACAGTTTTGTATCAATTTTGTTAATGAAAAACTGCAGCAG
ATTTTTATTGAACTGACATTAAAGGCAGAACAGGAAGAATATGTTCAAGAGGGAATAAGATGGACACCCATTGAGTACTTTAATAATAAA
ATCGTATGTGACCTCATAGAGAACAAAGTGAACCCTCCTGGCATCATGAGCATCCTGGATGACGTGTGCGCCACGATGCATGCGGTGGGT
GAGGGGGCAGATCAGACGCTGCTCCAGAAACTTCAGATGCAGATTGGGAGTCATGAGCACTTCAACAGTTGGAACCAAGGCTTCATCATT
CATCATTATGCTGGGAAGGTATCCTATGACATGGATGGCTTTTGTGAAAGGAACCGGGATGTGCTTTTTATGGATCTCATCGAGCTTATG
CAGAGCAGCGAGCTGCCTTTCATAAAGTCTTTATTTCCGGAAAATCTGCAGGCTGACAAGAAAGGGCGCCCAACTACTGCCGGAAGCAAA
ATAAAGAAACAAGCCAATGACCTTGTGAGCACCCTGATGAAATGTACGCCCCACTACATTCGCTGCATCAAGCCAAACGAAACCAAGAAG
CCCAGAGACTGGGAGGAAAGCAGGGTAAAGCATCAAGTCGAATATTTGGGTCTGAAAGAGAACATTCGAGTGAGAAGAGCTGGCTATGCC
TATCGGCGCATCTTCCAAAAATTCCTACAGAGGTATGCCATTCTGACCAAAGCCACCTGGCCTTCTTGGCAGGGAGAGGAGAAGCAAGGC
GTCCTGCACCTGCTGCAGTCGGTCAACATGGACAGCGACCAGTTCCAGCTGGGGAGGAGTAAAGTGTTCATCAAAGCCCCCGAGTCTCTA
TTTCTTTTAGAAGAGATGAGAGAGAGAAAGTATGATGGGTATGCTCGAGTGATACAGAAATCATGGAGGAAATTCGTGGCCCGGAAGAAA
TACGTTCAAATGAGAGAAGAAGCCTCAGACCTCTTATTGAACAAGAAGGAGAGAAGGAGAAACAGTATTAACAGGAACTTTATAGGGGAT
TATATTGGGATGGAAGAGCACCCAGAACTCCAGCAGTTCGTGGGCAAGAGGGAGAAGATTGATTTCGCAGACACAGTCACCAAGTATGAC
AGGAGGTTCAAGGGTGTAAAGCGAGACCTGCTCCTTACCCCAAAGTGCTTGTACTTAATCGGACGAGAAAAAGTCAAACAGGGCCCAGAC
AAGGGCCTGGTGAAAGAAGTCCTGAAGCGGAAAATCGAGATAGAACGGATCTTGTCTGTGTCCCTCAGTACTATGCAGGATGACATTTTT
ATTCTCCATGAGCAAGAGTATGACAGTTTGCTTGAATCTGTCTTCAAAACTGAATTCCTAAGCCTCTTAGCAAAGCGTTACGAGGAGAAG
ACCCAGAAGCAACTACCTCTGAAATTCAGCAATACGCTTGAACTGAAGTTGAAAAAGGAAAACTGGGGCCCCTGGAGTGCAGGGGGCTCC
CGGCAAGTGCAGTTCCACCAAGGGTTTGGGGACCTGGCTGTCCTCAAGCCCAGTAACAAAGTGCTGCAGGTCAGCATCGGACCTGGACTG
CCCAAGAACTCCCGTCCTACCAGAAGGAACACTACCCAAAATACAGGTTATTCCAGTGGGACTCAAAATGCCAACTACCCAGTGAGAGCT
GCCCCTCCTCCCCCAGGATACCATCAGAACGGAGTCATCAGAAACCAGTATGTGCCATATCCCCATGCTCCTGGAAGCCAGAGGTCCAAT
CAGAAAAGCCTGTACACCTCCATGGCCCGCCCGCCCTTGCCTCGGCAGCAGTCTACCAGTTCAGACCGAGTGTCACAGACGCCAGAGAGC
CTGGATTTCCTCAAGGTCCCGGACCAGGGAGCTGCAGGGGTCAGGAGACAAACAACCAGTCGGCCTCCCCCAGCAGGGGGCAGACCCAAG
CCCCAGCCCAAGCCCAAGCCTCAGGTGCCACAGTGCAAGGCTTTGTATGCCTATGACGCTCAGGACACAGACGAACTCAGCTTTAATGCC
AATGACATTATTGATATTATCAAAGAAGATCCTTCTGGCTGGTGGACGGGTCGACTACGAGGCAAGCAGGGCCTGTTCCCCAACAACTAT
GTGACCAAGATCTGAGGTGCCCGTGACTCTGACACATGGGGCAGAGGAGCTCCAGGCACAGACCAGGGGAGGGGATATTTAGGGGCTCCC
CTTACAATCCACAATGAGCAATTGCTTCTCCAAGGCCTGGAGCTATTCTGGTACCTTCCCCATGGAGGACACTGAAAAGGCTGGGTTGGG
GACAGGGAGTATCACTCCATAAGTGATCCTAAAAGGTAGCCTCTTCATAGGAACCCAGGAGGACAAAACCACCATGCATTAAGATTTATT
TATTGTATTTAAACCTGGTGAGAGGACAAGTGAGGTCTGCTCAGACCTTGTAGGCTTCTATCAAAACAGCACCCTGCTTGCTCACCAGGC
CTAGAGAATGGCTGTAGGTGGCCGCTGACAAGTGCCTTTAGTTGAAGAGCACATTTCTTTCATCTCTCTTGTCCATACCTGATAGACACA
TTCCTCTCTGCCACCTTCCTTCAGGGAGGACCCGCCCTCTGCAGACTGGGCTTAGCGTGAGCAGGCACTTCCCATGTACGTGCCAAGGGT
AAGCTGGCCTGCTGAGCCCAGGGCGACAGAGGGGCACTGGTTTACACTTTGCCGGGACCATCAGGGCCGCCAAGCAGGTCAGGGGCTGGG
GGCTGGGGGCTGGGCTGCTGGCTTTGCTTTCTCTGGGTCTTCAATTAGAATGTGGCTGGCCCATATTGGTTTGTGTTTAAATGCTGTACT
TACTACAAGAAGGATCTTTTTTCAAGCTGTACATTTATAAAAACAGATCATATACTGTATATATAAAAATCTTGAGATGGTAGAAACATG
TATGAATGTACTAAGTAGTATTCCACTGTACTCATTCATAAGGTAGGTTTTCTTACAAAACTCACACCAGGTACTTAAAGATGTGCTCTG
CTTTTTTCCAACTACGGAGTGTCACTGCTTTCTAGGTCAGTCCCTGCAGACTCTTCTCAACTCTTTCCCTATAGGAAACTTACTCCGCGT
CCTGCCCCCACCTCCTAAATAAATAAAGGAATCGGCGAACACCTTCTTCTTTTATGACATTTGTTATGGGTTGAATTGTGTGCCTCCAAA
GAAGATGTTGGAGTCCCAGCCACCAGTACCTCACAATATGACCTTATTTGGAAATAGAGTCTTTATGGAGCAAACAAACAAAACTAATTA
AAATGAAGTCATTAGAGTGGGTCTTAATCCAATATAATGGTGTCCTGATGAAAAGGGGAAATTTGGACACAGAGACCAACATGCCAGAGG
AAAGACCATGTGAAAAGGCAGGGAGAAGACAGCCATTCACAAGCCCTGGACAGAGGCCTGGAACAGATTCTCCCTCACACCCTCACAAGG
AACCAGCCCTGCCCACCTTGGTCTTGGACTTCCAACCTCCAGAACTGTGAGGCAATACATTTCGCTGTTTAAGCCACTCGGTTTGTGGTA
GTTTGTTACGGCAGCCCTAACAAACTAAAACAATGTTTTCTTGCCATAATAGTGAAAAGCTGGAATCAGTGTAATGCCCATCAATAATGG
AATGGTTGTAAGTAGAGACTGCCATTTCTCTACCTTTCCTCCTCTGGGTGCCATAGCCATTCTTAAGCTACTGGTAACTTAGAGTTTGAG
CAGCATACACCTTTGAGGCTTTCAGGATCCTCAAGTCCATGGGAAAAAATTATCAGGGCTTACGTATAAGGGAAAAAAATTGATCTAGAT
TTGAAACGAAGATACATTTAATTCTGCTGTGTTTTACTTCAAGAACAGGGAGCAAACTCCATTCTGTAACAGTAACGAATAATGTTAAAA
GGCTACTTCAGTTCCATGTTGCGACCAGAGAGGAAAGTAGAGCCATGATGCGGTGAGTTGAGGAAAGGTAATCCTGTTGACACTAGTGGT
CTCTAAAGTACTTTCACATTCATGGACTCCCCGCAGTGGGAGGAAGATACCACCAGAACTGGGACAGAGGAGCATGCTGGAGTCCCCACA
GCCACCAGCACTCAGGCTGCGTTCTCAGGGCAGGAACACCAGGACCATCTTCCCTTGTTATCATAAGCCCCTGGCTGGCTGGGACAAGCC
TTTGTTTTTCCTCCTGATTGATTGATTGGCATTTTACACTTGTGTGTAACTGGGAACCATCGGGCTCTAGGTCACAAAACAGGGACTGAG
TTAAACCCCCACGAGTAACACCCTCATCCATGGAAATCACAGACCAAGCCCTTAGCACCTGCATCGTGTGCCCAGCCAGTGCCAGGCTCA
AGGAGATGCCCAACAAATGGTGCTGAAATAGGTAAGTGACCCACAGGGGAAGGAGACTTAGATCTGCGTAGAGATGCTGAAATGCTGACA
CTATTAAGAAAAGTTTGGAGAAATAGTGGTTTATCTTAAACTTATTTGTATATATTTAGAAATTCAACATGGGGACTACCAGAAAAATCT

>56960_56960_1_NAA20-MYO1E_NAA20_chr20_20003124_ENST00000310450_MYO1E_chr15_59553708_ENST00000288235_length(amino acids)=1085AA_BP=26
MTTLRAFTCDDLFRFNNINLDPLTETTYIGSVLISVNPFKQMPYFGEKEIEMYQGAAQYENPPHIYALADNMYRNMIIDRENQCVIISGE
SGAGKTVAAKYIMSYISRVSGGGTKVQHVKDIILQSNPLLEAFGNAKTVRNNNSSRFGKYFEIQFSPGGEPDGGKISNFLLEKSRVVMRN
PGERSFHIFYQLIEGASAEQKHSLGITSMDYYYYLSLSGSYKVDDIDDRREFQETLHAMNVIGIFAEEQTLVLQIVAGILHLGNISFKEV
GNYAAVESEEFLAFPAYLLGINQDRLKEKLTSRQMDSKWGGKSESIHVTLNVEQACYTRDALAKALHARVFDFLVDSINKAMEKDHEEYN
IGVLDIYGFEIFQKNGFEQFCINFVNEKLQQIFIELTLKAEQEEYVQEGIRWTPIEYFNNKIVCDLIENKVNPPGIMSILDDVCATMHAV
GEGADQTLLQKLQMQIGSHEHFNSWNQGFIIHHYAGKVSYDMDGFCERNRDVLFMDLIELMQSSELPFIKSLFPENLQADKKGRPTTAGS
KIKKQANDLVSTLMKCTPHYIRCIKPNETKKPRDWEESRVKHQVEYLGLKENIRVRRAGYAYRRIFQKFLQRYAILTKATWPSWQGEEKQ
GVLHLLQSVNMDSDQFQLGRSKVFIKAPESLFLLEEMRERKYDGYARVIQKSWRKFVARKKYVQMREEASDLLLNKKERRRNSINRNFIG
DYIGMEEHPELQQFVGKREKIDFADTVTKYDRRFKGVKRDLLLTPKCLYLIGREKVKQGPDKGLVKEVLKRKIEIERILSVSLSTMQDDI
FILHEQEYDSLLESVFKTEFLSLLAKRYEEKTQKQLPLKFSNTLELKLKKENWGPWSAGGSRQVQFHQGFGDLAVLKPSNKVLQVSIGPG
LPKNSRPTRRNTTQNTGYSSGTQNANYPVRAAPPPPGYHQNGVIRNQYVPYPHAPGSQRSNQKSLYTSMARPPLPRQQSTSSDRVSQTPE
SLDFLKVPDQGAAGVRRQTTSRPPPAGGRPKPQPKPKPQVPQCKALYAYDAQDTDELSFNANDIIDIIKEDPSGWWTGRLRGKQGLFPNN

--------------------------------------------------------------
>56960_56960_2_NAA20-MYO1E_NAA20_chr20_20003124_ENST00000334982_MYO1E_chr15_59553708_ENST00000288235_length(transcript)=6005nt_BP=359nt
TTTCCTTCCCAGGTGTCGCCGCCTGCTTTCTGCAGCCGCTGCCTGGGAGGGGTTCCGAGGTTCTGGGCAGCCACAGCCCCGGAGGCCTTA
CAGGAGCGACCTGGCAATTCCCGCCGCCGGCCGCCAGAGGAAGCTGCTTTGCTGCTGTCGTTGCCCAGCAACTCGTAGCCCGGAAGCAGT
ACCCCGTCTCGCTCGGTTCCGCGGCGGCCACGCTCCGGGAGACTTCCGGCAGGGCGGGCGCGGGGTCTTGGCGAACGGTCTTCGGAAGCG
GCGGCGGCGCGATGACCACGCTACGGGCCTTTACCTGCGACGACCTGTTCCGCTTCAACAACATTAACTTGGATCCACTTACAGAAACTA
CATATATAGGATCTGTATTAATCTCAGTCAACCCTTTCAAGCAGATGCCATATTTTGGGGAAAAGGAAATTGAAATGTACCAAGGAGCGG
CACAGTATGAAAACCCACCACATATCTATGCCCTTGCAGATAATATGTACAGAAACATGATCATTGACAGAGAGAACCAGTGCGTCATTA
TCAGTGGTGAAAGTGGTGCTGGAAAAACAGTGGCTGCCAAATATATCATGAGCTACATCTCCAGAGTGTCTGGAGGAGGGACCAAAGTCC
AGCACGTGAAGGACATTATCCTGCAGTCCAACCCGCTGCTGGAGGCCTTCGGGAACGCCAAGACCGTCCGGAACAACAACTCCAGCCGAT
TTGGAAAATACTTTGAAATCCAGTTCAGTCCAGGTGGGGAACCAGATGGTGGAAAGATCTCCAACTTCCTTCTGGAAAAATCTAGGGTGG
TGATGAGGAACCCAGGAGAGCGGAGTTTTCACATATTTTACCAGCTCATCGAGGGCGCCTCTGCAGAGCAGAAACACAGCCTTGGCATCA
CCAGCATGGACTATTATTACTACCTGAGCCTCTCGGGCTCATACAAGGTTGATGACATTGACGACAGGCGGGAGTTTCAGGAAACTCTGC
ACGCCATGAATGTGATTGGGATCTTTGCAGAAGAGCAAACGCTGGTGTTGCAGATAGTGGCGGGTATTCTCCACCTGGGAAACATCAGCT
TCAAAGAAGTTGGCAACTACGCGGCTGTGGAGAGTGAAGAGTTTTTAGCTTTTCCTGCATATCTGCTAGGGATAAACCAGGACCGGTTGA
AAGAAAAGCTAACAAGCCGGCAGATGGATAGCAAGTGGGGAGGCAAATCCGAATCCATCCACGTGACCCTCAACGTAGAGCAGGCCTGTT
ACACCCGGGATGCGCTCGCCAAGGCCCTGCACGCCCGGGTCTTTGATTTCTTGGTAGATTCCATCAATAAAGCCATGGAGAAAGACCATG
AAGAATACAACATTGGCGTCCTAGACATCTATGGCTTTGAAATATTCCAGAAAAATGGCTTTGAACAGTTTTGTATCAATTTTGTTAATG
AAAAACTGCAGCAGATTTTTATTGAACTGACATTAAAGGCAGAACAGGAAGAATATGTTCAAGAGGGAATAAGATGGACACCCATTGAGT
ACTTTAATAATAAAATCGTATGTGACCTCATAGAGAACAAAGTGAACCCTCCTGGCATCATGAGCATCCTGGATGACGTGTGCGCCACGA
TGCATGCGGTGGGTGAGGGGGCAGATCAGACGCTGCTCCAGAAACTTCAGATGCAGATTGGGAGTCATGAGCACTTCAACAGTTGGAACC
AAGGCTTCATCATTCATCATTATGCTGGGAAGGTATCCTATGACATGGATGGCTTTTGTGAAAGGAACCGGGATGTGCTTTTTATGGATC
TCATCGAGCTTATGCAGAGCAGCGAGCTGCCTTTCATAAAGTCTTTATTTCCGGAAAATCTGCAGGCTGACAAGAAAGGGCGCCCAACTA
CTGCCGGAAGCAAAATAAAGAAACAAGCCAATGACCTTGTGAGCACCCTGATGAAATGTACGCCCCACTACATTCGCTGCATCAAGCCAA
ACGAAACCAAGAAGCCCAGAGACTGGGAGGAAAGCAGGGTAAAGCATCAAGTCGAATATTTGGGTCTGAAAGAGAACATTCGAGTGAGAA
GAGCTGGCTATGCCTATCGGCGCATCTTCCAAAAATTCCTACAGAGGTATGCCATTCTGACCAAAGCCACCTGGCCTTCTTGGCAGGGAG
AGGAGAAGCAAGGCGTCCTGCACCTGCTGCAGTCGGTCAACATGGACAGCGACCAGTTCCAGCTGGGGAGGAGTAAAGTGTTCATCAAAG
CCCCCGAGTCTCTATTTCTTTTAGAAGAGATGAGAGAGAGAAAGTATGATGGGTATGCTCGAGTGATACAGAAATCATGGAGGAAATTCG
TGGCCCGGAAGAAATACGTTCAAATGAGAGAAGAAGCCTCAGACCTCTTATTGAACAAGAAGGAGAGAAGGAGAAACAGTATTAACAGGA
ACTTTATAGGGGATTATATTGGGATGGAAGAGCACCCAGAACTCCAGCAGTTCGTGGGCAAGAGGGAGAAGATTGATTTCGCAGACACAG
TCACCAAGTATGACAGGAGGTTCAAGGGTGTAAAGCGAGACCTGCTCCTTACCCCAAAGTGCTTGTACTTAATCGGACGAGAAAAAGTCA
AACAGGGCCCAGACAAGGGCCTGGTGAAAGAAGTCCTGAAGCGGAAAATCGAGATAGAACGGATCTTGTCTGTGTCCCTCAGTACTATGC
AGGATGACATTTTTATTCTCCATGAGCAAGAGTATGACAGTTTGCTTGAATCTGTCTTCAAAACTGAATTCCTAAGCCTCTTAGCAAAGC
GTTACGAGGAGAAGACCCAGAAGCAACTACCTCTGAAATTCAGCAATACGCTTGAACTGAAGTTGAAAAAGGAAAACTGGGGCCCCTGGA
GTGCAGGGGGCTCCCGGCAAGTGCAGTTCCACCAAGGGTTTGGGGACCTGGCTGTCCTCAAGCCCAGTAACAAAGTGCTGCAGGTCAGCA
TCGGACCTGGACTGCCCAAGAACTCCCGTCCTACCAGAAGGAACACTACCCAAAATACAGGTTATTCCAGTGGGACTCAAAATGCCAACT
ACCCAGTGAGAGCTGCCCCTCCTCCCCCAGGATACCATCAGAACGGAGTCATCAGAAACCAGTATGTGCCATATCCCCATGCTCCTGGAA
GCCAGAGGTCCAATCAGAAAAGCCTGTACACCTCCATGGCCCGCCCGCCCTTGCCTCGGCAGCAGTCTACCAGTTCAGACCGAGTGTCAC
AGACGCCAGAGAGCCTGGATTTCCTCAAGGTCCCGGACCAGGGAGCTGCAGGGGTCAGGAGACAAACAACCAGTCGGCCTCCCCCAGCAG
GGGGCAGACCCAAGCCCCAGCCCAAGCCCAAGCCTCAGGTGCCACAGTGCAAGGCTTTGTATGCCTATGACGCTCAGGACACAGACGAAC
TCAGCTTTAATGCCAATGACATTATTGATATTATCAAAGAAGATCCTTCTGGCTGGTGGACGGGTCGACTACGAGGCAAGCAGGGCCTGT
TCCCCAACAACTATGTGACCAAGATCTGAGGTGCCCGTGACTCTGACACATGGGGCAGAGGAGCTCCAGGCACAGACCAGGGGAGGGGAT
ATTTAGGGGCTCCCCTTACAATCCACAATGAGCAATTGCTTCTCCAAGGCCTGGAGCTATTCTGGTACCTTCCCCATGGAGGACACTGAA
AAGGCTGGGTTGGGGACAGGGAGTATCACTCCATAAGTGATCCTAAAAGGTAGCCTCTTCATAGGAACCCAGGAGGACAAAACCACCATG
CATTAAGATTTATTTATTGTATTTAAACCTGGTGAGAGGACAAGTGAGGTCTGCTCAGACCTTGTAGGCTTCTATCAAAACAGCACCCTG
CTTGCTCACCAGGCCTAGAGAATGGCTGTAGGTGGCCGCTGACAAGTGCCTTTAGTTGAAGAGCACATTTCTTTCATCTCTCTTGTCCAT
ACCTGATAGACACATTCCTCTCTGCCACCTTCCTTCAGGGAGGACCCGCCCTCTGCAGACTGGGCTTAGCGTGAGCAGGCACTTCCCATG
TACGTGCCAAGGGTAAGCTGGCCTGCTGAGCCCAGGGCGACAGAGGGGCACTGGTTTACACTTTGCCGGGACCATCAGGGCCGCCAAGCA
GGTCAGGGGCTGGGGGCTGGGGGCTGGGCTGCTGGCTTTGCTTTCTCTGGGTCTTCAATTAGAATGTGGCTGGCCCATATTGGTTTGTGT
TTAAATGCTGTACTTACTACAAGAAGGATCTTTTTTCAAGCTGTACATTTATAAAAACAGATCATATACTGTATATATAAAAATCTTGAG
ATGGTAGAAACATGTATGAATGTACTAAGTAGTATTCCACTGTACTCATTCATAAGGTAGGTTTTCTTACAAAACTCACACCAGGTACTT
AAAGATGTGCTCTGCTTTTTTCCAACTACGGAGTGTCACTGCTTTCTAGGTCAGTCCCTGCAGACTCTTCTCAACTCTTTCCCTATAGGA
AACTTACTCCGCGTCCTGCCCCCACCTCCTAAATAAATAAAGGAATCGGCGAACACCTTCTTCTTTTATGACATTTGTTATGGGTTGAAT
TGTGTGCCTCCAAAGAAGATGTTGGAGTCCCAGCCACCAGTACCTCACAATATGACCTTATTTGGAAATAGAGTCTTTATGGAGCAAACA
AACAAAACTAATTAAAATGAAGTCATTAGAGTGGGTCTTAATCCAATATAATGGTGTCCTGATGAAAAGGGGAAATTTGGACACAGAGAC
CAACATGCCAGAGGAAAGACCATGTGAAAAGGCAGGGAGAAGACAGCCATTCACAAGCCCTGGACAGAGGCCTGGAACAGATTCTCCCTC
ACACCCTCACAAGGAACCAGCCCTGCCCACCTTGGTCTTGGACTTCCAACCTCCAGAACTGTGAGGCAATACATTTCGCTGTTTAAGCCA
CTCGGTTTGTGGTAGTTTGTTACGGCAGCCCTAACAAACTAAAACAATGTTTTCTTGCCATAATAGTGAAAAGCTGGAATCAGTGTAATG
CCCATCAATAATGGAATGGTTGTAAGTAGAGACTGCCATTTCTCTACCTTTCCTCCTCTGGGTGCCATAGCCATTCTTAAGCTACTGGTA
ACTTAGAGTTTGAGCAGCATACACCTTTGAGGCTTTCAGGATCCTCAAGTCCATGGGAAAAAATTATCAGGGCTTACGTATAAGGGAAAA
AAATTGATCTAGATTTGAAACGAAGATACATTTAATTCTGCTGTGTTTTACTTCAAGAACAGGGAGCAAACTCCATTCTGTAACAGTAAC
GAATAATGTTAAAAGGCTACTTCAGTTCCATGTTGCGACCAGAGAGGAAAGTAGAGCCATGATGCGGTGAGTTGAGGAAAGGTAATCCTG
TTGACACTAGTGGTCTCTAAAGTACTTTCACATTCATGGACTCCCCGCAGTGGGAGGAAGATACCACCAGAACTGGGACAGAGGAGCATG
CTGGAGTCCCCACAGCCACCAGCACTCAGGCTGCGTTCTCAGGGCAGGAACACCAGGACCATCTTCCCTTGTTATCATAAGCCCCTGGCT
GGCTGGGACAAGCCTTTGTTTTTCCTCCTGATTGATTGATTGGCATTTTACACTTGTGTGTAACTGGGAACCATCGGGCTCTAGGTCACA
AAACAGGGACTGAGTTAAACCCCCACGAGTAACACCCTCATCCATGGAAATCACAGACCAAGCCCTTAGCACCTGCATCGTGTGCCCAGC
CAGTGCCAGGCTCAAGGAGATGCCCAACAAATGGTGCTGAAATAGGTAAGTGACCCACAGGGGAAGGAGACTTAGATCTGCGTAGAGATG
CTGAAATGCTGACACTATTAAGAAAAGTTTGGAGAAATAGTGGTTTATCTTAAACTTATTTGTATATATTTAGAAATTCAACATGGGGAC

>56960_56960_2_NAA20-MYO1E_NAA20_chr20_20003124_ENST00000334982_MYO1E_chr15_59553708_ENST00000288235_length(amino acids)=1169AA_BP=110
MQPLPGRGSEVLGSHSPGGLTGATWQFPPPAARGSCFAAVVAQQLVARKQYPVSLGSAAATLRETSGRAGAGSWRTVFGSGGGAMTTLRA
FTCDDLFRFNNINLDPLTETTYIGSVLISVNPFKQMPYFGEKEIEMYQGAAQYENPPHIYALADNMYRNMIIDRENQCVIISGESGAGKT
VAAKYIMSYISRVSGGGTKVQHVKDIILQSNPLLEAFGNAKTVRNNNSSRFGKYFEIQFSPGGEPDGGKISNFLLEKSRVVMRNPGERSF
HIFYQLIEGASAEQKHSLGITSMDYYYYLSLSGSYKVDDIDDRREFQETLHAMNVIGIFAEEQTLVLQIVAGILHLGNISFKEVGNYAAV
ESEEFLAFPAYLLGINQDRLKEKLTSRQMDSKWGGKSESIHVTLNVEQACYTRDALAKALHARVFDFLVDSINKAMEKDHEEYNIGVLDI
YGFEIFQKNGFEQFCINFVNEKLQQIFIELTLKAEQEEYVQEGIRWTPIEYFNNKIVCDLIENKVNPPGIMSILDDVCATMHAVGEGADQ
TLLQKLQMQIGSHEHFNSWNQGFIIHHYAGKVSYDMDGFCERNRDVLFMDLIELMQSSELPFIKSLFPENLQADKKGRPTTAGSKIKKQA
NDLVSTLMKCTPHYIRCIKPNETKKPRDWEESRVKHQVEYLGLKENIRVRRAGYAYRRIFQKFLQRYAILTKATWPSWQGEEKQGVLHLL
QSVNMDSDQFQLGRSKVFIKAPESLFLLEEMRERKYDGYARVIQKSWRKFVARKKYVQMREEASDLLLNKKERRRNSINRNFIGDYIGME
EHPELQQFVGKREKIDFADTVTKYDRRFKGVKRDLLLTPKCLYLIGREKVKQGPDKGLVKEVLKRKIEIERILSVSLSTMQDDIFILHEQ
EYDSLLESVFKTEFLSLLAKRYEEKTQKQLPLKFSNTLELKLKKENWGPWSAGGSRQVQFHQGFGDLAVLKPSNKVLQVSIGPGLPKNSR
PTRRNTTQNTGYSSGTQNANYPVRAAPPPPGYHQNGVIRNQYVPYPHAPGSQRSNQKSLYTSMARPPLPRQQSTSSDRVSQTPESLDFLK

--------------------------------------------------------------
>56960_56960_3_NAA20-MYO1E_NAA20_chr20_20003124_ENST00000398602_MYO1E_chr15_59553708_ENST00000288235_length(transcript)=6323nt_BP=677nt
CGGCGCGATGACCACGCTACGGGCCTTTACCTGCGACGACCTGTTCCGCTTCAACAACATGTGAGTGACACGCGCCTAGGGCCAGCCGCT
CTTGGCCGGGCCTCGCTTCCCGGCCCCGCCAGCTCCGGCCCCAGCCGCGCCTTCCCGGCCGGACCCCAAGCCCGGACGGGGACCCGCGAG
AACCACCCCCCGCCGGCCAACGTGGGCGCCTCCCTGGGGCCACAGGGTCCAGGGAGAGGTGGGCCTGGAGTCGACGCACGCCGGCCTTGG
CGACCTTGGCCGAGGTGCTCAAACTTTGTCTCTGGACCCTGCGAGCTGGCAGGTTCTGGGGCAGCGCTCGGGCCTCCGCTCGTGGTCGCT
GTCAGGAGGCGCTGGGGTGATGGGGCGAGCTGGAGACGCGGCCTGGAGCCCACGACCTGGGCTTCTCGCCCTGCCCTACTGCTTGCTGGT
TCGTGGCCGGGCCGCGCCTCTTCTGGGCAGAGGGCACCGCCGACGGCCAGGCCGCAGAGGGGGCTTGCGAGGATCGGAAGCTGGTCAGCA
GCTGTTGGGATGTGTACATCGTGAAGCCCACCTGGTGGCCGTGGTTAAATTGTATCCAAATGAAATGACACTGATCACTGCGTCCCCTGC
AGCAGATGCTGTCTCAGTCATGTAACTTGGATCCACTTACAGAAACTACATATATAGGATCTGTATTAATCTCAGTCAACCCTTTCAAGC
AGATGCCATATTTTGGGGAAAAGGAAATTGAAATGTACCAAGGAGCGGCACAGTATGAAAACCCACCACATATCTATGCCCTTGCAGATA
ATATGTACAGAAACATGATCATTGACAGAGAGAACCAGTGCGTCATTATCAGTGGTGAAAGTGGTGCTGGAAAAACAGTGGCTGCCAAAT
ATATCATGAGCTACATCTCCAGAGTGTCTGGAGGAGGGACCAAAGTCCAGCACGTGAAGGACATTATCCTGCAGTCCAACCCGCTGCTGG
AGGCCTTCGGGAACGCCAAGACCGTCCGGAACAACAACTCCAGCCGATTTGGAAAATACTTTGAAATCCAGTTCAGTCCAGGTGGGGAAC
CAGATGGTGGAAAGATCTCCAACTTCCTTCTGGAAAAATCTAGGGTGGTGATGAGGAACCCAGGAGAGCGGAGTTTTCACATATTTTACC
AGCTCATCGAGGGCGCCTCTGCAGAGCAGAAACACAGCCTTGGCATCACCAGCATGGACTATTATTACTACCTGAGCCTCTCGGGCTCAT
ACAAGGTTGATGACATTGACGACAGGCGGGAGTTTCAGGAAACTCTGCACGCCATGAATGTGATTGGGATCTTTGCAGAAGAGCAAACGC
TGGTGTTGCAGATAGTGGCGGGTATTCTCCACCTGGGAAACATCAGCTTCAAAGAAGTTGGCAACTACGCGGCTGTGGAGAGTGAAGAGT
TTTTAGCTTTTCCTGCATATCTGCTAGGGATAAACCAGGACCGGTTGAAAGAAAAGCTAACAAGCCGGCAGATGGATAGCAAGTGGGGAG
GCAAATCCGAATCCATCCACGTGACCCTCAACGTAGAGCAGGCCTGTTACACCCGGGATGCGCTCGCCAAGGCCCTGCACGCCCGGGTCT
TTGATTTCTTGGTAGATTCCATCAATAAAGCCATGGAGAAAGACCATGAAGAATACAACATTGGCGTCCTAGACATCTATGGCTTTGAAA
TATTCCAGAAAAATGGCTTTGAACAGTTTTGTATCAATTTTGTTAATGAAAAACTGCAGCAGATTTTTATTGAACTGACATTAAAGGCAG
AACAGGAAGAATATGTTCAAGAGGGAATAAGATGGACACCCATTGAGTACTTTAATAATAAAATCGTATGTGACCTCATAGAGAACAAAG
TGAACCCTCCTGGCATCATGAGCATCCTGGATGACGTGTGCGCCACGATGCATGCGGTGGGTGAGGGGGCAGATCAGACGCTGCTCCAGA
AACTTCAGATGCAGATTGGGAGTCATGAGCACTTCAACAGTTGGAACCAAGGCTTCATCATTCATCATTATGCTGGGAAGGTATCCTATG
ACATGGATGGCTTTTGTGAAAGGAACCGGGATGTGCTTTTTATGGATCTCATCGAGCTTATGCAGAGCAGCGAGCTGCCTTTCATAAAGT
CTTTATTTCCGGAAAATCTGCAGGCTGACAAGAAAGGGCGCCCAACTACTGCCGGAAGCAAAATAAAGAAACAAGCCAATGACCTTGTGA
GCACCCTGATGAAATGTACGCCCCACTACATTCGCTGCATCAAGCCAAACGAAACCAAGAAGCCCAGAGACTGGGAGGAAAGCAGGGTAA
AGCATCAAGTCGAATATTTGGGTCTGAAAGAGAACATTCGAGTGAGAAGAGCTGGCTATGCCTATCGGCGCATCTTCCAAAAATTCCTAC
AGAGGTATGCCATTCTGACCAAAGCCACCTGGCCTTCTTGGCAGGGAGAGGAGAAGCAAGGCGTCCTGCACCTGCTGCAGTCGGTCAACA
TGGACAGCGACCAGTTCCAGCTGGGGAGGAGTAAAGTGTTCATCAAAGCCCCCGAGTCTCTATTTCTTTTAGAAGAGATGAGAGAGAGAA
AGTATGATGGGTATGCTCGAGTGATACAGAAATCATGGAGGAAATTCGTGGCCCGGAAGAAATACGTTCAAATGAGAGAAGAAGCCTCAG
ACCTCTTATTGAACAAGAAGGAGAGAAGGAGAAACAGTATTAACAGGAACTTTATAGGGGATTATATTGGGATGGAAGAGCACCCAGAAC
TCCAGCAGTTCGTGGGCAAGAGGGAGAAGATTGATTTCGCAGACACAGTCACCAAGTATGACAGGAGGTTCAAGGGTGTAAAGCGAGACC
TGCTCCTTACCCCAAAGTGCTTGTACTTAATCGGACGAGAAAAAGTCAAACAGGGCCCAGACAAGGGCCTGGTGAAAGAAGTCCTGAAGC
GGAAAATCGAGATAGAACGGATCTTGTCTGTGTCCCTCAGTACTATGCAGGATGACATTTTTATTCTCCATGAGCAAGAGTATGACAGTT
TGCTTGAATCTGTCTTCAAAACTGAATTCCTAAGCCTCTTAGCAAAGCGTTACGAGGAGAAGACCCAGAAGCAACTACCTCTGAAATTCA
GCAATACGCTTGAACTGAAGTTGAAAAAGGAAAACTGGGGCCCCTGGAGTGCAGGGGGCTCCCGGCAAGTGCAGTTCCACCAAGGGTTTG
GGGACCTGGCTGTCCTCAAGCCCAGTAACAAAGTGCTGCAGGTCAGCATCGGACCTGGACTGCCCAAGAACTCCCGTCCTACCAGAAGGA
ACACTACCCAAAATACAGGTTATTCCAGTGGGACTCAAAATGCCAACTACCCAGTGAGAGCTGCCCCTCCTCCCCCAGGATACCATCAGA
ACGGAGTCATCAGAAACCAGTATGTGCCATATCCCCATGCTCCTGGAAGCCAGAGGTCCAATCAGAAAAGCCTGTACACCTCCATGGCCC
GCCCGCCCTTGCCTCGGCAGCAGTCTACCAGTTCAGACCGAGTGTCACAGACGCCAGAGAGCCTGGATTTCCTCAAGGTCCCGGACCAGG
GAGCTGCAGGGGTCAGGAGACAAACAACCAGTCGGCCTCCCCCAGCAGGGGGCAGACCCAAGCCCCAGCCCAAGCCCAAGCCTCAGGTGC
CACAGTGCAAGGCTTTGTATGCCTATGACGCTCAGGACACAGACGAACTCAGCTTTAATGCCAATGACATTATTGATATTATCAAAGAAG
ATCCTTCTGGCTGGTGGACGGGTCGACTACGAGGCAAGCAGGGCCTGTTCCCCAACAACTATGTGACCAAGATCTGAGGTGCCCGTGACT
CTGACACATGGGGCAGAGGAGCTCCAGGCACAGACCAGGGGAGGGGATATTTAGGGGCTCCCCTTACAATCCACAATGAGCAATTGCTTC
TCCAAGGCCTGGAGCTATTCTGGTACCTTCCCCATGGAGGACACTGAAAAGGCTGGGTTGGGGACAGGGAGTATCACTCCATAAGTGATC
CTAAAAGGTAGCCTCTTCATAGGAACCCAGGAGGACAAAACCACCATGCATTAAGATTTATTTATTGTATTTAAACCTGGTGAGAGGACA
AGTGAGGTCTGCTCAGACCTTGTAGGCTTCTATCAAAACAGCACCCTGCTTGCTCACCAGGCCTAGAGAATGGCTGTAGGTGGCCGCTGA
CAAGTGCCTTTAGTTGAAGAGCACATTTCTTTCATCTCTCTTGTCCATACCTGATAGACACATTCCTCTCTGCCACCTTCCTTCAGGGAG
GACCCGCCCTCTGCAGACTGGGCTTAGCGTGAGCAGGCACTTCCCATGTACGTGCCAAGGGTAAGCTGGCCTGCTGAGCCCAGGGCGACA
GAGGGGCACTGGTTTACACTTTGCCGGGACCATCAGGGCCGCCAAGCAGGTCAGGGGCTGGGGGCTGGGGGCTGGGCTGCTGGCTTTGCT
TTCTCTGGGTCTTCAATTAGAATGTGGCTGGCCCATATTGGTTTGTGTTTAAATGCTGTACTTACTACAAGAAGGATCTTTTTTCAAGCT
GTACATTTATAAAAACAGATCATATACTGTATATATAAAAATCTTGAGATGGTAGAAACATGTATGAATGTACTAAGTAGTATTCCACTG
TACTCATTCATAAGGTAGGTTTTCTTACAAAACTCACACCAGGTACTTAAAGATGTGCTCTGCTTTTTTCCAACTACGGAGTGTCACTGC
TTTCTAGGTCAGTCCCTGCAGACTCTTCTCAACTCTTTCCCTATAGGAAACTTACTCCGCGTCCTGCCCCCACCTCCTAAATAAATAAAG
GAATCGGCGAACACCTTCTTCTTTTATGACATTTGTTATGGGTTGAATTGTGTGCCTCCAAAGAAGATGTTGGAGTCCCAGCCACCAGTA
CCTCACAATATGACCTTATTTGGAAATAGAGTCTTTATGGAGCAAACAAACAAAACTAATTAAAATGAAGTCATTAGAGTGGGTCTTAAT
CCAATATAATGGTGTCCTGATGAAAAGGGGAAATTTGGACACAGAGACCAACATGCCAGAGGAAAGACCATGTGAAAAGGCAGGGAGAAG
ACAGCCATTCACAAGCCCTGGACAGAGGCCTGGAACAGATTCTCCCTCACACCCTCACAAGGAACCAGCCCTGCCCACCTTGGTCTTGGA
CTTCCAACCTCCAGAACTGTGAGGCAATACATTTCGCTGTTTAAGCCACTCGGTTTGTGGTAGTTTGTTACGGCAGCCCTAACAAACTAA
AACAATGTTTTCTTGCCATAATAGTGAAAAGCTGGAATCAGTGTAATGCCCATCAATAATGGAATGGTTGTAAGTAGAGACTGCCATTTC
TCTACCTTTCCTCCTCTGGGTGCCATAGCCATTCTTAAGCTACTGGTAACTTAGAGTTTGAGCAGCATACACCTTTGAGGCTTTCAGGAT
CCTCAAGTCCATGGGAAAAAATTATCAGGGCTTACGTATAAGGGAAAAAAATTGATCTAGATTTGAAACGAAGATACATTTAATTCTGCT
GTGTTTTACTTCAAGAACAGGGAGCAAACTCCATTCTGTAACAGTAACGAATAATGTTAAAAGGCTACTTCAGTTCCATGTTGCGACCAG
AGAGGAAAGTAGAGCCATGATGCGGTGAGTTGAGGAAAGGTAATCCTGTTGACACTAGTGGTCTCTAAAGTACTTTCACATTCATGGACT
CCCCGCAGTGGGAGGAAGATACCACCAGAACTGGGACAGAGGAGCATGCTGGAGTCCCCACAGCCACCAGCACTCAGGCTGCGTTCTCAG
GGCAGGAACACCAGGACCATCTTCCCTTGTTATCATAAGCCCCTGGCTGGCTGGGACAAGCCTTTGTTTTTCCTCCTGATTGATTGATTG
GCATTTTACACTTGTGTGTAACTGGGAACCATCGGGCTCTAGGTCACAAAACAGGGACTGAGTTAAACCCCCACGAGTAACACCCTCATC
CATGGAAATCACAGACCAAGCCCTTAGCACCTGCATCGTGTGCCCAGCCAGTGCCAGGCTCAAGGAGATGCCCAACAAATGGTGCTGAAA
TAGGTAAGTGACCCACAGGGGAAGGAGACTTAGATCTGCGTAGAGATGCTGAAATGCTGACACTATTAAGAAAAGTTTGGAGAAATAGTG
GTTTATCTTAAACTTATTTGTATATATTTAGAAATTCAACATGGGGACTACCAGAAAAATCTGGGATAAATAACATTTCCCTTTTTTTCC

>56960_56960_3_NAA20-MYO1E_NAA20_chr20_20003124_ENST00000398602_MYO1E_chr15_59553708_ENST00000288235_length(amino acids)=1079AA_BP=20
MRPLQQMLSQSCNLDPLTETTYIGSVLISVNPFKQMPYFGEKEIEMYQGAAQYENPPHIYALADNMYRNMIIDRENQCVIISGESGAGKT
VAAKYIMSYISRVSGGGTKVQHVKDIILQSNPLLEAFGNAKTVRNNNSSRFGKYFEIQFSPGGEPDGGKISNFLLEKSRVVMRNPGERSF
HIFYQLIEGASAEQKHSLGITSMDYYYYLSLSGSYKVDDIDDRREFQETLHAMNVIGIFAEEQTLVLQIVAGILHLGNISFKEVGNYAAV
ESEEFLAFPAYLLGINQDRLKEKLTSRQMDSKWGGKSESIHVTLNVEQACYTRDALAKALHARVFDFLVDSINKAMEKDHEEYNIGVLDI
YGFEIFQKNGFEQFCINFVNEKLQQIFIELTLKAEQEEYVQEGIRWTPIEYFNNKIVCDLIENKVNPPGIMSILDDVCATMHAVGEGADQ
TLLQKLQMQIGSHEHFNSWNQGFIIHHYAGKVSYDMDGFCERNRDVLFMDLIELMQSSELPFIKSLFPENLQADKKGRPTTAGSKIKKQA
NDLVSTLMKCTPHYIRCIKPNETKKPRDWEESRVKHQVEYLGLKENIRVRRAGYAYRRIFQKFLQRYAILTKATWPSWQGEEKQGVLHLL
QSVNMDSDQFQLGRSKVFIKAPESLFLLEEMRERKYDGYARVIQKSWRKFVARKKYVQMREEASDLLLNKKERRRNSINRNFIGDYIGME
EHPELQQFVGKREKIDFADTVTKYDRRFKGVKRDLLLTPKCLYLIGREKVKQGPDKGLVKEVLKRKIEIERILSVSLSTMQDDIFILHEQ
EYDSLLESVFKTEFLSLLAKRYEEKTQKQLPLKFSNTLELKLKKENWGPWSAGGSRQVQFHQGFGDLAVLKPSNKVLQVSIGPGLPKNSR
PTRRNTTQNTGYSSGTQNANYPVRAAPPPPGYHQNGVIRNQYVPYPHAPGSQRSNQKSLYTSMARPPLPRQQSTSSDRVSQTPESLDFLK

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for NAA20-MYO1E


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for NAA20-MYO1E


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for NAA20-MYO1E


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource