Center for Computational Systems Medicine

Fusion gene: VPS41-CEP41 (FusionGDB2 ID: 98431)

Fusion Gene Summary for VPS41-CEP41

Fusion gene summary

Fusion gene information
Fusion gene name: VPS41-CEP41
Fusion gene ID: 98431

Hgene (5'-gene)
Gene symbol: VPS41
Gene ID: 27072
Gene name: VPS41 subunit of HOPS complex
Synonyms: HVPS41|HVSP41|hVps41p
Cytomap: 7p14.1
Type of gene: protein-coding
Description: vacuolar protein sorting-associated protein 41 homolog; S53; VPS41, HOPS complex subunit; vacuolar assembly protein 41; vacuolar protein sorting 41 homolog
Modification date: 20200313
UniProtAcc: .
Ensembl transcripts involved in fusion gene: ENST00000310301, ENST00000395969, ENST00000466017
Fusion gene scores: * DoF score 15 X 14 X 9 = 1890; # samples 17; ** MAII score log2(17/1890*10) = -3.47477958297073
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0

Tgene (3'-gene)
Gene symbol: CEP41
Gene ID: 95681
Gene name: centrosomal protein 41
Synonyms: JBTS15|TSGA14
Cytomap: 7q32.2
Type of gene: protein-coding
Description: centrosomal protein of 41 kDa; centrosomal protein 41 kDa; centrosomal protein 41kDa; testis specific protein A14; testis specific, 14; testis-specific gene A14 protein
Modification date: 20200313
UniProtAcc: Q9BYV8
Ensembl transcripts involved in fusion gene: ENST00000495702, ENST00000223208, ENST00000343969, ENST00000489512, ENST00000541543
Fusion gene scores: * DoF score 5 X 4 X 5 = 100; # samples 6; ** MAII score log2(6/100*10) = -0.736965594166206
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0

Context
PubMed: VPS41 [Title/Abstract] AND CEP41 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint: VPS41(38794302)-CEP41(130067859), # samples: 2
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
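
The two score definitions above can be checked with a few lines of Python. This is only a minimal sketch: the function names are hypothetical (they are not part of FusionGDB), and the input counts are the ones listed for VPS41 and CEP41 on this page.

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency = # partners x # break points x # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index = log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def is_peginpcfg(dof: int, maii: float) -> bool:
    """Criterion quoted on this page for a 'possibly effective' gene: DoF > 8 and MAII < 0."""
    return dof > 8 and maii < 0


# Counts as listed in the fusion gene scores for this entry.
vps41_dof = dof_score(15, 14, 9)
cep41_dof = dof_score(5, 4, 5)
vps41_maii = maii_score(17, vps41_dof)
cep41_maii = maii_score(6, cep41_dof)

print(vps41_dof, round(vps41_maii, 4), is_peginpcfg(vps41_dof, vps41_maii))
print(cep41_dof, round(cep41_maii, 4), is_peginpcfg(cep41_dof, cep41_maii))
```

Both partner genes satisfy the DoF>8 and MAII<0 criterion with these inputs, which agrees with the peGinPCFGs label shown for each of them.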

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
Partner | Gene | GO ID | GO term | PubMed ID

Fusion gene breakpoints across VPS41 (5'-gene)
[Figure: gene structure of VPS41 with fusion breakpoints, shown as a UCSC Genome Browser custom track]

Fusion gene breakpoints across CEP41 (3'-gene)
[Figure: gene structure of CEP41 with fusion breakpoints, shown as a UCSC Genome Browser custom track]

Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinates were lifted over to hg19.
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
ChimerDB4 | ACC | TCGA-OR-A5KZ-01A | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | -



Fusion Gene ORF analysis for VPS41-CEP41

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
5CDS-5UTR | ENST00000310301 | ENST00000495702 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | -
5CDS-5UTR | ENST00000395969 | ENST00000495702 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | -
In-frame | ENST00000310301 | ENST00000223208 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | -
In-frame | ENST00000310301 | ENST00000343969 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | -
In-frame | ENST00000310301 | ENST00000489512 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | -
In-frame | ENST00000310301 | ENST00000541543 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | -
In-frame | ENST00000395969 | ENST00000223208 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | -
In-frame | ENST00000395969 | ENST00000343969 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | -
In-frame | ENST00000395969 | ENST00000489512 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | -
In-frame | ENST00000395969 | ENST00000541543 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | -
intron-3CDS | ENST00000466017 | ENST00000223208 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | -
intron-3CDS | ENST00000466017 | ENST00000343969 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | -
intron-3CDS | ENST00000466017 | ENST00000489512 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | -
intron-3CDS | ENST00000466017 | ENST00000541543 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | -
intron-5UTR | ENST00000466017 | ENST00000495702 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | -

ORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
Henst | Hgene | Hchr | Hbp | Hstrand | Tenst | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids)
ENST00000310301 | VPS41 | chr7 | 38794302 | - | ENST00000223208 | CEP41 | chr7 | 130067859 | - | 8052 | 1843 | 55 | 2931 | 958
ENST00000310301 | VPS41 | chr7 | 38794302 | - | ENST00000541543 | CEP41 | chr7 | 130067859 | - | 3028 | 1843 | 55 | 2667 | 870
ENST00000310301 | VPS41 | chr7 | 38794302 | - | ENST00000343969 | CEP41 | chr7 | 130067859 | - | 2868 | 1843 | 55 | 2715 | 886
ENST00000310301 | VPS41 | chr7 | 38794302 | - | ENST00000489512 | CEP41 | chr7 | 130067859 | - | 4881 | 1843 | 55 | 1974 | 639
ENST00000395969 | VPS41 | chr7 | 38794302 | - | ENST00000223208 | CEP41 | chr7 | 130067859 | - | 7922 | 1713 | 0 | 2801 | 933
ENST00000395969 | VPS41 | chr7 | 38794302 | - | ENST00000541543 | CEP41 | chr7 | 130067859 | - | 2898 | 1713 | 0 | 2537 | 845
ENST00000395969 | VPS41 | chr7 | 38794302 | - | ENST00000343969 | CEP41 | chr7 | 130067859 | - | 2738 | 1713 | 0 | 2585 | 861
ENST00000395969 | VPS41 | chr7 | 38794302 | - | ENST00000489512 | CEP41 | chr7 | 130067859 | - | 4751 | 1713 | 0 | 1844 | 614
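
The predicted start, predicted stop, and amino acid length columns can be cross-checked with simple arithmetic. The sketch below assumes, as an interpretation not stated on the page, that start and stop are inclusive transcript coordinates and that the stop coordinate covers the stop codon. The helper name `orf_aa_length` is hypothetical.

```python
def orf_aa_length(start: int, stop: int) -> int:
    """Amino acid count for an ORF spanning start..stop (inclusive), stop codon excluded.

    Assumption: both coordinates are inclusive and the span ends with the stop codon.
    """
    n_nt = stop - start + 1
    if n_nt % 3 != 0:
        raise ValueError("ORF span is not a multiple of three nucleotides")
    return n_nt // 3 - 1  # drop the stop codon


# (predicted start, predicted stop, reported amino acid length) from the ORFfinder table.
rows = [
    (55, 2931, 958),
    (55, 2667, 870),
    (55, 2715, 886),
    (55, 1974, 639),
    (0, 2801, 933),
    (0, 2537, 845),
    (0, 2585, 861),
    (0, 1844, 614),
]

mismatches = [row for row in rows if orf_aa_length(row[0], row[1]) != row[2]]
print(mismatches)
```

Under these assumptions every row of the table is internally consistent, so the list of mismatches is empty.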

DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on a convolutional neural network trained by comparison with real Ribo-seq data. If the no-coding score < 0.5 and the coding score > 0.5, the in-frame fusion transcript is predicted as likely to be translated.
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score
ENST00000310301 | ENST00000223208 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | - | 9.20E-05 | 0.999908
ENST00000310301 | ENST00000541543 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | - | 0.000895766 | 0.9991042
ENST00000310301 | ENST00000343969 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | - | 0.001194335 | 0.99880564
ENST00000310301 | ENST00000489512 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | - | 0.000195573 | 0.99980444
ENST00000395969 | ENST00000223208 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | - | 0.000106985 | 0.99989295
ENST00000395969 | ENST00000541543 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | - | 0.001253356 | 0.99874663
ENST00000395969 | ENST00000343969 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | - | 0.001635445 | 0.9983645
ENST00000395969 | ENST00000489512 | VPS41 | chr7 | 38794302 | - | CEP41 | chr7 | 130067859 | - | 0.000242037 | 0.99975795
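
The DeepORF decision rule quoted above (no-coding score < 0.5 and coding score > 0.5) can be applied to the listed scores as follows. This is an illustrative sketch only; `likely_translated` is a hypothetical helper and does not call DeepORF itself.

```python
def likely_translated(no_coding_score: float, coding_score: float) -> bool:
    """Apply the page's threshold rule to one pair of DeepORF scores."""
    return no_coding_score < 0.5 and coding_score > 0.5


# (Henst, Tenst) -> (no-coding score, coding score), copied from the DeepORF table.
scores = {
    ("ENST00000310301", "ENST00000223208"): (9.20e-05, 0.999908),
    ("ENST00000310301", "ENST00000541543"): (0.000895766, 0.9991042),
    ("ENST00000310301", "ENST00000343969"): (0.001194335, 0.99880564),
    ("ENST00000310301", "ENST00000489512"): (0.000195573, 0.99980444),
    ("ENST00000395969", "ENST00000223208"): (0.000106985, 0.99989295),
    ("ENST00000395969", "ENST00000541543"): (0.001253356, 0.99874663),
    ("ENST00000395969", "ENST00000343969"): (0.001635445, 0.9983645),
    ("ENST00000395969", "ENST00000489512"): (0.000242037, 0.99975795),
}

calls = {pair: likely_translated(*pair_scores) for pair, pair_scores in scores.items()}
for (henst, tenst), call in calls.items():
    print(henst, tenst, "likely translated" if call else "not predicted as translated")
```

With the scores listed for this fusion gene, all eight in-frame transcripts pass the rule.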


Fusion Genomic Features for VPS41-CEP41


FusionAI prediction of the potential fusion gene breakpoint based on the pre-mRNA sequence context (+/- 5kb for each partner gene, 20kb of sequence in total). FusionAI is a fusion gene breakpoint classifier based on a convolutional neural network trained by comparing the fusion-positive and fusion-negative sequence contexts of ~20K fusion genes. From this, we can estimate how likely each position in the 20kb genomic sequence is to be used as a gene fusion breakpoint.
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint)

Distribution of 44 human genomic feature loci across the 20kb fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci across five broad categories: virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in the help page.
[Figure: genomic feature distribution]

Distribution of 44 human genomic feature loci across the 20kb fusion breakpoint regions that overlap the regions with the top 1% feature importance scores. More details are in the help page.


Fusion Protein Features for VPS41-CEP41


Four levels of functional features of fusion genes
Go to the FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr7:38794302/chr7:130067859)
- FGviewer provides online visualization of which protein functional features are retained, across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to find the hyperlink to the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
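
The direct FGviewer link shown above appears to be built from the two breakpoint coordinates. The sketch below reproduces that URL pattern. The pattern is inferred from this single link and is an assumption, and `fgviewer_url` is a hypothetical helper, not an FGviewer API.

```python
def fgviewer_url(h_chr: str, h_bp: int, t_chr: str, t_bp: int) -> str:
    """Build a link in the form <base>/<Hchr>:<Hbp>/<Tchr>:<Tbp>.

    Assumption: the pattern is inferred from the one link shown on this page.
    """
    base = "https://ccsmweb.uth.edu/FGviewer"
    return f"{base}/{h_chr}:{h_bp}/{t_chr}:{t_bp}"


# Most frequent breakpoint of VPS41-CEP41.
url = fgviewer_url("chr7", 38794302, "chr7", 130067859)
print(url)
```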

Main function of each fusion partner protein (from UniProt)
Hgene: .
Tgene: CEP41 (Q9BYV8)

FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

FUNCTION: Required during ciliogenesis for tubulin glutamylation in cilium. Probably acts by participating in the transport of TTLL6, a tubulin polyglutamylase, between the basal body and the cilium. {ECO:0000269|PubMed:22246503}.

Retention analysis result of each fusion partner protein across 39 protein features of UniProt: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, we show only the retention information for the 13 regional features. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the breakpoint is located before the CDS.

- In-frame and retained protein features among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | VPS41 | chr7:38794302 | chr7:130067859 | ENST00000310301 | - | 21 | 29 | 18_27 | 596 | 855.0 | Compositional bias | Note=Poly-Glu
Tgene | CEP41 | chr7:38794302 | chr7:130067859 | ENST00000223208 | | 0 | 11 | 169_266 | 11 | 374.0 | Domain | Rhodanese
Tgene | CEP41 | chr7:38794302 | chr7:130067859 | ENST00000343969 | | 0 | 10 | 169_266 | 11 | 302.0 | Domain | Rhodanese
Tgene | CEP41 | chr7:38794302 | chr7:130067859 | ENST00000489512 | | 0 | 3 | 169_266 | 11 | 55.0 | Domain | Rhodanese
Tgene | CEP41 | chr7:38794302 | chr7:130067859 | ENST00000541543 | | 0 | 9 | 169_266 | 11 | 286.0 | Domain | Rhodanese

- In-frame and not-retained protein features among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | VPS41 | chr7:38794302 | chr7:130067859 | ENST00000310301 | - | 21 | 29 | 568_712 | 596 | 855.0 | Repeat | Note=CHCR
Hgene | VPS41 | chr7:38794302 | chr7:130067859 | ENST00000310301 | - | 21 | 29 | 791_839 | 596 | 855.0 | Zinc finger | RING-type; atypical
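
One way to read the retained and not-retained tables is as an interval test against the breakpoint position in the protein (BPloci). The sketch below assumes, and this is an assumption rather than FusionGDB's documented rule, that the 5' partner (Hgene) keeps the residues up to BPloci, the 3' partner (Tgene) keeps the residues from BPloci onward, and a feature counts as retained only when it lies entirely in the kept part. The function name is hypothetical.

```python
def feature_retained(partner: str, feature_start: int, feature_end: int, bp_loci: int) -> bool:
    """Return True if the feature lies entirely within the part of the protein kept in the fusion.

    Assumed rule: Hgene keeps residues up to bp_loci; Tgene keeps residues from bp_loci onward.
    """
    if partner == "Hgene":
        return feature_end <= bp_loci
    if partner == "Tgene":
        return feature_start >= bp_loci
    raise ValueError(f"unknown partner: {partner!r}")


# (partner, feature, start, end, BPloci) taken from the feature tables for this entry.
features = [
    ("Hgene", "Compositional bias (Poly-Glu)", 18, 27, 596),
    ("Hgene", "Repeat (CHCR)", 568, 712, 596),
    ("Hgene", "Zinc finger (RING-type; atypical)", 791, 839, 596),
    ("Tgene", "Domain (Rhodanese)", 169, 266, 11),
]

results = {name: feature_retained(p, s, e, bp) for p, name, s, e, bp in features}
for name, kept in results.items():
    print(name, "retained" if kept else "not retained")
```

Under this assumed rule the sketch reproduces the split shown in the two tables: the Poly-Glu region and the Rhodanese domain are retained, while the CHCR repeat (which spans the breakpoint) and the RING-type zinc finger are not.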



Fusion Gene Sequence for VPS41-CEP41


For in-frame fusion transcripts, we provide the fusion transcript sequences and the fusion amino acid sequences. To obtain the fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among all predicted ones.
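
The idea of choosing the longest ORF can be illustrated with a small pure-Python scan. This is not ORFfinder and does not reproduce its options (for example partial ORFs, alternative start codons, or reverse-strand search). It is a minimal sketch that assumes ATG start codons, the three standard stop codons, and the forward strand only. The function name and the toy sequence are made up for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf(seq: str):
    """Return (start, end, orf) for the longest ATG..stop ORF on the forward strand.

    start is 0-based, end is exclusive and includes the stop codon.
    Returns None when no complete ORF is found.
    """
    seq = seq.upper()
    best = None
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first start codon of a new candidate ORF
            elif start is not None and codon in STOP_CODONS:
                end = i + 3
                if best is None or (end - start) > (best[1] - best[0]):
                    best = (start, end, seq[start:end])
                start = None  # look for the next ORF in this frame
    return best


# Toy sequence with two ORFs in different frames; the longer one is selected.
toy = "CCATGAAATTTTAGGGATGCCCTGA"
print(longest_orf(toy))  # → (2, 14, 'ATGAAATTTTAG')
```

The selected ORF would then be translated to give the amino acid sequence listed under each transcript.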
>98431_98431_1_VPS41-CEP41_VPS41_chr7_38794302_ENST00000310301_CEP41_chr7_130067859_ENST00000223208_length(transcript)=8052nt_BP=1843nt
AAGTGTGTGGTTTCCTCTTATTTCCGGGTCTGTCAGGTGACTCTCCCGTGGCGCCATGGCGGAAGCAGAGGAGCAGGAAACTGGGTCCCT
TGAAGAATCTACAGATGAGTCTGAGGAAGAAGAGAGCGAAGAGGAACCCAAGCTGAAGTATGAAAGGCTTTCCAATGGGGTAACTGAAAT
ACTTCAGAAGGATGCAGCTAGCTGCATGACAGTCCATGACAAGTTTTTGGCATTGGGCACACATTATGGCAAGGTTTATTTACTTGATGT
CCAGGGGAACATCACTCAGAAGTTTGATGTAAGTCCTGTGAAGATAAATCAGATTAGCTTGGATGAAAGTGGAGAGCACATGGGTGTGTG
TTCAGAGGATGGCAAGGTGCAGGTATTTGGACTGTATTCTGGAGAAGAATTTCACGAGACTTTTGACTGTCCCATTAAAATTATTGCTGT
GCACCCACATTTCGTGAGATCCAGTTGCAAGCAGTTTGTGACCGGAGGGAAGAAGCTGCTACTGTTTGAACGGTCTTGGATGAACAGATG
GAAGTCTGCTGTTCTGCATGAAGGGGAAGGGAACATAAGGAGTGTGAAGTGGAGAGGCCATCTGATTGCTTGGGCCAATAATATGGGTGT
GAAGATTTTTGACATCATCTCAAAGCAAAGAATCACCAATGTGCCCCGGGATGATATAAGTCTTCGCCCAGACATGTATCCCTGCAGCCT
CTGCTGGAAGGACAATGTGACACTGATTATTGGCTGGGGGACTTCTGTCAAGGTGTGCTCAGTGAAGGAACGGCATGCCAGTGAAATGAG
GGATTTGCCAAGTCGATATGTTGAAATAGTGTCTCAGTTTGAAACTGAATTCTACATCAGTGGACTTGCACCTCTCTGTGATCAGCTTGT
TGTACTTTCGTATGTAAAGGAGATTTCAGAAAAAACGGAAAGAGAATACTGTGCCAGGCCTAGACTGGACATCATCCAGCCACTTTCTGA
GACTTGTGAAGAGATCTCTTCTGATGCTTTGACAGTCAGAGGCTTTCAGGAGAATGAATGTAGAGATTATCATTTAGAATACTCTGAAGG
GGAATCACTTTTTTACATCGTGAGTCCGAGAGATGTTGTAGTGGCCAAGGAACGAGACCAAGATGATCACATTGACTGGCTCCTTGAAAA
GAAGAAATATGAAGAAGCATTGATGGCAGCTGAAATTAGCCAAAAAAATATTAAAAGACATAAGATTCTGGATATTGGCTTGGCATATAT
AAATCACCTGGTGGAGAGAGGAGACTATGACATAGCAGCACGCAAATGCCAGAAAATTCTTGGGAAAAATGCAGCACTCTGGGAATATGA
AGTTTATAAATTTAAAGAAATTGGACAGCTTAAGGCTATTAGTCCTTATTTGCCAAGAGGTGATCCAGTTCTGAAACCACTCATCTATGA
AATGATCTTACATGAATTTTTGGAGAGTGATTATGAGGGTTTTGCCACATTGATCCGAGAATGGCCTGGAGATCTGTATAATAATTCAGT
CATAGTTCAAGCAGTTCGGGATCATTTGAAGAAAGATAGTCAGAACAAGACTTTACTTAAAACCCTGGCAGAATTGTACACCTATGACAA
GAACTATGGCAATGCTCTGGAAATATACTTAACATTAAGACATAAAGACGTTTTTCAGTTGATCCACAAGCATAATCTTTTCAGTTCTAT
CAAGGATAAAATTGTTTTATTAATGGATTTTGATTCAGAGAAAGCTGTTGACATGCTTTTGGACAATGAAGATAAAATTTCAATTAAAAA
GGTAGTGGAAGAATTGGAAGACAGACCAGAGCTACAGCATGTGTATCTGATGAAAAGGATACCACAGAACCCAAGATACCAGCATATCAA
ATCAAGACTGGACACTGGTAACAGTATGACTAAATATACTGAGAAGCTCGAAGAGATTAAGAAAAATTATAGATACAAAAAAGATGAGCT
TTTCAAGAGACTAAAAGTTACAACTTTTGCCCAGCTGATCATCCAAGTTGCTTCCCTCTCTGATCAAACACTGGAAGTGACAGCTGAGGA
GATTCAAAGGCTGGAAGACAATGATTCTGCAGCTTCAGACCCTGATGCTGAAACCACTGCCAGGACCAATGGGAAAGGAAATCCAGGTGA
GCAGTCGCCGAGCCCTGAGCAGTTCATAAACAACGCAGGAGCAGGGGACTCCAGCCGCTCAACTCTTCAGAGTGTCATCAGTGGTGTTGG
GGAACTGGATCTAGACAAAGGGCCAGTGAAGAAAGCAGAGCCCCATACCAAAGACAAACCTTATCCTGACTGCCCCTTCCTGCTGCTAGA
TGTGCGTGATAGAGATTCTTACCAGCAGTGCCACATTGTTGGAGCTTACAGTTACCCAATTGCAACTCTGTCTAGAACAATGAACCCTTA
TTCAAATGATATTCTTGAATATAAAAATGCCCATGGCAAGATCATCATTCTGTATGACGATGATGAAAGGCTGGCCAGTCAGGCGGCCAC
CACCATGTGCGAGCGTGGATTTGAAAACCTCTTCATGCTTTCCGGAGGTCTAAAAGTCTTAGCTCAGAAATTCCCGGAAGGACTGATTAC
TGGTTCCCTGCCAGCATCTTGCCAGCAGGCCCTTCCTCCTGGGTCTGCCCGGAAACGATCCAGCCCCAAAGGGCCACCCCTACCAGCTGA
GAATAAATGGAGATTTACCCCAGAAGACTTAAAAAAGATAGAATATTATCTGGAAGAGGAGCAAGGGCCTGCAGATCATCCTAGCCGACT
GAACCAAGCTAACTCCTCCGGAAGAGAGTCCAAGGTGCCTGGTGCCCGAAGCGCTCAGAATCTGCCAGGTGGCGGCCCCGCCAGCCACTC
AAACCCCCGCTCCCTCAGCAGTGGTCACCTGCAAGGCAAACCCTGGAAGTAAAGACTTTGTCTCACTTAGGCAAATAAATGTTCTTCCTC
TTTTCAGAACCCCTGAGCATTTCCCAAGTTGGGTCATTTCCAGAAACTTCTGCAGAGGAAAGACCATGACATATGTATGGGATAGGCCTC
TTCCCTGTCCCTGTCTCCAGTTCCTCTCCCCTCAGGAGGCCACCTCAGGAAGGATTTTGACAGGTGACCATAAAAACCAGAGGTGTGACA
GGCTCTAGTCTCCTCCCTGTTGTTTACAGCATATTTAAGTCTTGTGAAACTGTATAGGAAAAAAGTATCTACGTTTTTAGTTTTTTTGTT
TTGTTTTTTTTTAATAAGGTCCAGCTTGTTGGGTCTCTCTGTGTTGTTTTGTAAATACTTCAGTCACATCTGCCCGTGTGCCTGTCCCTG
CCACCTTTTCATTCACTGTTCTTGACTTCATGAAGGCTTCTCCGGGCAGTCTTGGTGTGAGAAGCTGTTTCCAAGGGTGCAGATGAGCCT
TAGGTTCCAGCCGCCTGCAGACCCCACCCCAGCAGGCTCACTCAGCAAGGAGCTCTCTGCCCAGCATATTGCAGGCCCTGTTTTGAGTAT
GGAAGCCAGTGCCTGTGTACTCACTGAAATTGAAGATGAGGAAAGTAGCTGTACACTCACTGAATGCTCCCCCTTACTAGATATTTCCTG
GAGCCAGAAAGGTATGCATGTGGGTGTCTTCACACCGGGGAGGAGGGCCTCTCATGGGAAAGCCCTGGCCACCACACCGGCCTGTGCCCC
TTGAAGCCCACCAAAGCGGCCCTCACTTGTGGTCAGTATATCAGTTATGACGCCCATTGCCCAGCTTCAGTCCATCCATGTTAGATGGAC
AGAAATTATGGCCAGTTGAAAATACCAGCTTTGGTTGGACAACTGTGGACACACAAGGTGAAGAGGACTCCGAAGTCCTTTGTCAGGGCT
GACAACCTCGTAAGCCCTTGCTTAGAAATACAGTATTAGTCTAATTGAGTAATTAGTGCAATTTCCTGCTTACTTTTCATTCTCATGACT
GAACTGTGATTAGGAAGTTGTGATTATAGATTCTGGTTTTGGCCGGAATTTTGAATCAGCATTAATTGAATTGCTAAATGACTGACATTC
ATTCCATTTAATTGGGGGAACAAAAGGCCTCAGGTAAGGATGAGGAACTCTGAAATCAGATGGAAAAGAGCGGTGTTAATTTTTATGGTC
TGTGATCGTAGCTGTGATAAGGGACTGAGGAATAAATTGTGCTCTTTGTCATGGCAACCAGCTTCTGAAAAGCCCACTGAAAATTGCCTG
TCCTGCTGGTAACTGCTACGGGGTAAGATTTGCCTTAACAGTACTATTTTCTCGCCACCAAAAAAAAAAAAAAAAAAAAAAAAAAATGCA
CCACAGTATTTCTAGCATGGGGGCTGTGTTTGTATGAGAAATAAACGTAATAAATATCTCATAGAGACATATGGAAAAATAACTTTCAGA
TTCAGCCCAGTTCTGTTTTAGAGTGTGTTTATTCTTCTCTACTTGATTTCCAAAGTGCAACATTTTCCGATGCTTTAGAAATCAAACAAA
CCAGGGACATTGTTCAGATGTCAAGCCATGCCCAATTTTCCACAAGATTCAAGAATCTTGTATAAAATTCAGCCAACGTACACATAGCTT
TAATGAGGAGCCTGTCATGTTTCCCCATAAATTTATTGCCTGAGAACTTAGTTCAGCCTTTGCTAATGCCAAAATGCTCTGGCTTTGTAT
TTTCTTTACAGCATAGATAGAAAAATGCACATTTTTCCACACTCAGCTTTCCCCTAGCATGGACAAGATTTTCAGCCATTTTTGCCACAT
ATACATTTTTAAGGAAAAAAGATTTTTCTCTGTAAGAAAGTTCTGGTTATGCTGTTTTAAAGGTGACTTGTCAGGAGTTGAGACTTCCCT
GCCGGATTCTATTTTGAAAGTAAATGGTCTTCCCTCCTTGTTCCGATTCTGCGTTCCCATCGTCAGACAACTTTGGAGTATTAGAAACCA
CTGTATATATGTGGAAAGCCAGGTCAGCCAGACCTGTTAGAATTGGTGTGCACTCACCTGAGAGATCTGGCAGGTTGGATATATTTATGT
GTATTTCTCCACAGTGCTTGCTTTGCCCTGTTGGTAAGGATTTTAAATAACCATGCTCAAAAGAGCTGTTCTAATCTGCGTTTTGCATGT
TAAGTGTTAATATCAAACATTCTTTACGTGCTCGAGGTATTGCTTTTAACATTCTACTTTGCCAGTTTCTTCATTAGATTAATTGACATG
TATTATTTAAATGACCAGTGATGCTTTGTGCAATTATGAATGTTGAAGATTAAAGTACATAGTTACTAATTTGTCGTTTGCTATTAATAT
GCTGAAAACTGCCAACTTCTCTCTTCTTTTCTGTCGAGATGATTTGGGGGAGCCACAGGAGACTGGTGTGATTTTTGCTGCATCTCCTAG
GAAAGCATTTTTTAAAAAAAATAAATGAATCAGGAAATCAGTCCAATTAGGGCAGGGGGCCTCAGCTCTCCAGTCAGAAAGCCTGGATTT
CTTTTCCTGCTCAGGCTGGGACTGAAGCCACCTTCAACAACTGGATCATGGCTTCCTACCAGCGTCTCAGGGGTTGACTAGCTGCCCTTG
TCTGGGGCTTGTGAACCCTGAGACAGAAGGTGCTTCATCGATGTACAACTACAGCACCCTGAACAGCAGTGATGGCCAAAGTTTAAATAA
TACCTTAAATGCTTTAAAGGGGTTTGTGTTTAAGGAAGGGAGAAAAGAAAAAAAGAAAGGAAGGGAGAAAGAAGCAAGGAAATGGGAAGA
AGGGAGGGCAGAGAGAAGAAAAGAAGCAGAAAGCGAGAGAGAAAGGAGAAAGCTACATTACTTATTTGAAAACAAAAGGAACACCCTGGG
CTGTAATAATGTCAGGCTCAATCTCTTGAAAAAGTATGGAATGATTTAAATGGCCTGCATTCATTTTATCTTTTATCTTTTTTTTTTTTT
TTAACCTTTAGTGGTTAGCCAGGACCAGCACGATCATATTGGGCTTGGTATAAATCCGAATGAAAAGAGACCAAATAACATTCATTAGTT
GCTCAGGGATTTTTCCTGTGGTGCTATTTAAATTATACAAAAATTCTTAAGACTTTAGGCTACTCGACCAAGAAAACAGAACAAAACAAA
AAATCGTGTTTTCTCTAATTCCCTTGTGGAATGTAAGTGAAATCAGAGTCCTAGGCTAGGAAGAAATACGTAGGTAATTTTTCTTGTGTT
GGTTTTGGTTTCTGTCATGTTGTTTATTGGCTATAGATTCTGTCTTTTATGTTATCTGACTTTTTTAGAGCGAATAATTAGTTTCTGTCC
ACCTGGATTTAAATCCATGACCACCTTCTTGCTCTACTCTGAAGATAATCAGTAAGAACCTTTCTTTCTCCAGTTCTAAAACGTTCTCAG
TGTCTTTAATGTGTGTTTATTTTCTTCCCAATTCTTTCAAAGATTTAACTCCCACGATACTTTTTTTTCCCCAGGAAACACCGCAAATGT
GTGGAATATAATTCACCAGTTTAATTATGTGAGCATGTTGAGTACTTACATGCAGGTCCATTAATTTTTCACTAACAATTATTTTTTCAT
GCAAAAGCAATAAATAACATTGTGCTTCCAAAATGTTCAGAATACATTTGGGTAGTAATACTTTCCTAGATTTACAATAATTATTGAAAT
TATTATTATGACATCTTTAAATGGATACACAGGTCAGTTACAACATAAAAAATGTAATGGTGGAAATTTGTCAGCCTTGAATTCAGGCAG
AACAGGATTCGGTGCATGATTTTATGTGTTTTCGAAACGGTGTCTGTCACATTGTGATCCCCTGATGGCTCCCCTCTCTGTTGCTGATCC
TCTTTGTTCTGTACAGAAGCAAATTCTCACCTGTGTAACATCCTGAAGCACCTGGTAAAATGTGAGGCAAAGAGAGGCCACTTCTCAAAT
GCTGTGGACGGATGGGCTGCATCTTTTAAAGGATATCAATATGTCTTCCTGTTGCAATTATTTTGACTATAAGCTCCCAGAGAGTGAAAA
TCACCAACCCTGTGACAGTGGCCCTTAATAAGTGTTCATGAATAAATGAATTGAACCTGTCAAGATTGAAGTTTGGATGTGATGCCCACT
GTGGTGGCCACTCAGTGCTAGTCTGTCATTCTGGAGACCCAGAAAGCTCGTTATCTTCTGTCCCCCTTGTGTATCCTGCCTTTGTGGGCG
AGGCATTCAAAACCCTGAGGTTTTTAGATCTCCCTCTACAGGAAGTACCCAGAGAGCTGCTGGGGGTGTTATTACCCTGTCTCTGCCAGG
CTTAAAGTATCCTCCCAAATTCAGCACGCAAAGGTCACACACCACCCCCATCTTAAGAGTAGGTTTTCTCTTTTGTTTCAAATCTTGAAG
ATTTCCAGAAAATAATAATACTAGCCAATGTTTGTGGAGAGCTTACTTTGCTCCTAGATTCTTCATTATATGTTTTGACCCATTTAATTT
CCACAACAGTCTGATGAGGCAGGCACCCCCATTTTCAGATGAGGAGACTGAGGTTCGGGTAGAAATTAAGTGACTCAGGTTTGCCTTCAG
TCTCTGACCAAGGGTTTGAAAAGCCAGGAGCCAGCCTGAGAACTGTGGCTCCAGAGGGTGTTCTTTCAGCCACTCCGCTCTATGCTTCTC
ACTCTGGGTGGGCACAACCATGTTCTCCTTTGGTGATGCCTCTAACTTACCGTGAAAATGTACCTTTCCCTTCGCTATTGGCTTCCCTTC
CCTCCTAGTCAGCCGAGATTCTTTTGAAAACTTTCCTCCGCTTGCCTGCACAAAAGGCGATGGAAATTCAGGAACTGAAACATCTGCTCT
GGGGAATGCGTATTTCCACATTTCCACCGCCTGTGTCTGCTGTCTTATCTTGAAGACAGGTGCTCCAGGGCTTCCGAGGTTATTTTGTCT
GTTAATGGACACCTTGCAAAGTACCACTTAAGGAATGAGAATTACAAACTTTTAATTATATTGTAGGGGGAAAAAAGTAGGCTGTTTTCC

>98431_98431_1_VPS41-CEP41_VPS41_chr7_38794302_ENST00000310301_CEP41_chr7_130067859_ENST00000223208_length(amino acids)=958AA_BP=596
MAEAEEQETGSLEESTDESEEEESEEEPKLKYERLSNGVTEILQKDAASCMTVHDKFLALGTHYGKVYLLDVQGNITQKFDVSPVKINQI
SLDESGEHMGVCSEDGKVQVFGLYSGEEFHETFDCPIKIIAVHPHFVRSSCKQFVTGGKKLLLFERSWMNRWKSAVLHEGEGNIRSVKWR
GHLIAWANNMGVKIFDIISKQRITNVPRDDISLRPDMYPCSLCWKDNVTLIIGWGTSVKVCSVKERHASEMRDLPSRYVEIVSQFETEFY
ISGLAPLCDQLVVLSYVKEISEKTEREYCARPRLDIIQPLSETCEEISSDALTVRGFQENECRDYHLEYSEGESLFYIVSPRDVVVAKER
DQDDHIDWLLEKKKYEEALMAAEISQKNIKRHKILDIGLAYINHLVERGDYDIAARKCQKILGKNAALWEYEVYKFKEIGQLKAISPYLP
RGDPVLKPLIYEMILHEFLESDYEGFATLIREWPGDLYNNSVIVQAVRDHLKKDSQNKTLLKTLAELYTYDKNYGNALEIYLTLRHKDVF
QLIHKHNLFSSIKDKIVLLMDFDSEKAVDMLLDNEDKISIKKVVEELEDRPELQHVYLMKRIPQNPRYQHIKSRLDTGNSMTKYTEKLEE
IKKNYRYKKDELFKRLKVTTFAQLIIQVASLSDQTLEVTAEEIQRLEDNDSAASDPDAETTARTNGKGNPGEQSPSPEQFINNAGAGDSS
RSTLQSVISGVGELDLDKGPVKKAEPHTKDKPYPDCPFLLLDVRDRDSYQQCHIVGAYSYPIATLSRTMNPYSNDILEYKNAHGKIIILY
DDDERLASQAATTMCERGFENLFMLSGGLKVLAQKFPEGLITGSLPASCQQALPPGSARKRSSPKGPPLPAENKWRFTPEDLKKIEYYLE

--------------------------------------------------------------
>98431_98431_2_VPS41-CEP41_VPS41_chr7_38794302_ENST00000310301_CEP41_chr7_130067859_ENST00000343969_length(transcript)=2868nt_BP=1843nt
AAGTGTGTGGTTTCCTCTTATTTCCGGGTCTGTCAGGTGACTCTCCCGTGGCGCCATGGCGGAAGCAGAGGAGCAGGAAACTGGGTCCCT
TGAAGAATCTACAGATGAGTCTGAGGAAGAAGAGAGCGAAGAGGAACCCAAGCTGAAGTATGAAAGGCTTTCCAATGGGGTAACTGAAAT
ACTTCAGAAGGATGCAGCTAGCTGCATGACAGTCCATGACAAGTTTTTGGCATTGGGCACACATTATGGCAAGGTTTATTTACTTGATGT
CCAGGGGAACATCACTCAGAAGTTTGATGTAAGTCCTGTGAAGATAAATCAGATTAGCTTGGATGAAAGTGGAGAGCACATGGGTGTGTG
TTCAGAGGATGGCAAGGTGCAGGTATTTGGACTGTATTCTGGAGAAGAATTTCACGAGACTTTTGACTGTCCCATTAAAATTATTGCTGT
GCACCCACATTTCGTGAGATCCAGTTGCAAGCAGTTTGTGACCGGAGGGAAGAAGCTGCTACTGTTTGAACGGTCTTGGATGAACAGATG
GAAGTCTGCTGTTCTGCATGAAGGGGAAGGGAACATAAGGAGTGTGAAGTGGAGAGGCCATCTGATTGCTTGGGCCAATAATATGGGTGT
GAAGATTTTTGACATCATCTCAAAGCAAAGAATCACCAATGTGCCCCGGGATGATATAAGTCTTCGCCCAGACATGTATCCCTGCAGCCT
CTGCTGGAAGGACAATGTGACACTGATTATTGGCTGGGGGACTTCTGTCAAGGTGTGCTCAGTGAAGGAACGGCATGCCAGTGAAATGAG
GGATTTGCCAAGTCGATATGTTGAAATAGTGTCTCAGTTTGAAACTGAATTCTACATCAGTGGACTTGCACCTCTCTGTGATCAGCTTGT
TGTACTTTCGTATGTAAAGGAGATTTCAGAAAAAACGGAAAGAGAATACTGTGCCAGGCCTAGACTGGACATCATCCAGCCACTTTCTGA
GACTTGTGAAGAGATCTCTTCTGATGCTTTGACAGTCAGAGGCTTTCAGGAGAATGAATGTAGAGATTATCATTTAGAATACTCTGAAGG
GGAATCACTTTTTTACATCGTGAGTCCGAGAGATGTTGTAGTGGCCAAGGAACGAGACCAAGATGATCACATTGACTGGCTCCTTGAAAA
GAAGAAATATGAAGAAGCATTGATGGCAGCTGAAATTAGCCAAAAAAATATTAAAAGACATAAGATTCTGGATATTGGCTTGGCATATAT
AAATCACCTGGTGGAGAGAGGAGACTATGACATAGCAGCACGCAAATGCCAGAAAATTCTTGGGAAAAATGCAGCACTCTGGGAATATGA
AGTTTATAAATTTAAAGAAATTGGACAGCTTAAGGCTATTAGTCCTTATTTGCCAAGAGGTGATCCAGTTCTGAAACCACTCATCTATGA
AATGATCTTACATGAATTTTTGGAGAGTGATTATGAGGGTTTTGCCACATTGATCCGAGAATGGCCTGGAGATCTGTATAATAATTCAGT
CATAGTTCAAGCAGTTCGGGATCATTTGAAGAAAGATAGTCAGAACAAGACTTTACTTAAAACCCTGGCAGAATTGTACACCTATGACAA
GAACTATGGCAATGCTCTGGAAATATACTTAACATTAAGACATAAAGACGTTTTTCAGTTGATCCACAAGCATAATCTTTTCAGTTCTAT
CAAGGATAAAATTGTTTTATTAATGGATTTTGATTCAGAGAAAGCTGTTGACATGCTTTTGGACAATGAAGATAAAATTTCAATTAAAAA
GGTAGTGGAAGAATTGGAAGACAGACCAGAGCTACAGCATGTGTATCTGATGAAAAGGATACCACAGAACCCAAGATACCAGCATATCAA
ATCAAGACTGGACACTGGTAACAGTATGACTAAATATACTGAGAAGCTCGAAGAGATTAAGAAAAATTATAGATACAAAAAAGATGAGCT
TTTCAAGAGACTAAAAGTTACAACTTTTGCCCAGCTGATCATCCAAGTTGCTTCCCTCTCTGATCAAACACTGGAAGTGACAGCTGAGGA
GATTCAAAGGCTGGAAGACAATGATTCTGCAGCTTCAGACCCTGATGCTGAAACCACTGCCAGGACCAATGGGAAAGGAAATCCAGGTGA
GCAGTCGCCGAGCCCTGAGCAGTTCATAAACAACGCAGGAGCAGGGGACTCCAGCCGCTCAACTCTTCAGAGTGTCATCAGTGGTGTTGG
GGAACTGGATCTAGACAAAGGGCCAGTGAAGAAAGCAGAGCCCCATACCAAAGACAAACCTTATCCTGACTGCCCCTTCCTGCTGCTAGA
TGTGCGTGATAGAGATTCTTACCAGCAGTGCCACATTGTTGGAGCTTACAGTTACCCAATTGCAACTCTGTCTAGAACAATGAACCCTTA
TTCAAATGATATTCTTGAATATAAAAATGCCCATGGCAAGATCATCATTCTGTATGACGATGATGAAAGGCTGGCCAGTCAGGCGGCCAC
CACCATGTGCGAGCGTGGATTTGAAAACCTCTTCATGCTTTCCGGAGGCCGACTGAACCAAGCTAACTCCTCCGGAAGAGAGTCCAAGGT
GCCTGGTGCCCGAAGCGCTCAGAATCTGCCAGGTGGCGGCCCCGCCAGCCACTCAAACCCCCGCTCCCTCAGCAGTGGTCACCTGCAAGG
CAAACCCTGGAAGTAAAGACTTTGTCTCACTTAGGCAAATAAATGTTCTTCCTCTTTTCAGAACCCCTGAGCATTTCCCAAGTTGGGTCA

>98431_98431_2_VPS41-CEP41_VPS41_chr7_38794302_ENST00000310301_CEP41_chr7_130067859_ENST00000343969_length(amino acids)=886AA_BP=596
MAEAEEQETGSLEESTDESEEEESEEEPKLKYERLSNGVTEILQKDAASCMTVHDKFLALGTHYGKVYLLDVQGNITQKFDVSPVKINQI
SLDESGEHMGVCSEDGKVQVFGLYSGEEFHETFDCPIKIIAVHPHFVRSSCKQFVTGGKKLLLFERSWMNRWKSAVLHEGEGNIRSVKWR
GHLIAWANNMGVKIFDIISKQRITNVPRDDISLRPDMYPCSLCWKDNVTLIIGWGTSVKVCSVKERHASEMRDLPSRYVEIVSQFETEFY
ISGLAPLCDQLVVLSYVKEISEKTEREYCARPRLDIIQPLSETCEEISSDALTVRGFQENECRDYHLEYSEGESLFYIVSPRDVVVAKER
DQDDHIDWLLEKKKYEEALMAAEISQKNIKRHKILDIGLAYINHLVERGDYDIAARKCQKILGKNAALWEYEVYKFKEIGQLKAISPYLP
RGDPVLKPLIYEMILHEFLESDYEGFATLIREWPGDLYNNSVIVQAVRDHLKKDSQNKTLLKTLAELYTYDKNYGNALEIYLTLRHKDVF
QLIHKHNLFSSIKDKIVLLMDFDSEKAVDMLLDNEDKISIKKVVEELEDRPELQHVYLMKRIPQNPRYQHIKSRLDTGNSMTKYTEKLEE
IKKNYRYKKDELFKRLKVTTFAQLIIQVASLSDQTLEVTAEEIQRLEDNDSAASDPDAETTARTNGKGNPGEQSPSPEQFINNAGAGDSS
RSTLQSVISGVGELDLDKGPVKKAEPHTKDKPYPDCPFLLLDVRDRDSYQQCHIVGAYSYPIATLSRTMNPYSNDILEYKNAHGKIIILY

--------------------------------------------------------------
>98431_98431_3_VPS41-CEP41_VPS41_chr7_38794302_ENST00000310301_CEP41_chr7_130067859_ENST00000489512_length(transcript)=4881nt_BP=1843nt
AAGTGTGTGGTTTCCTCTTATTTCCGGGTCTGTCAGGTGACTCTCCCGTGGCGCCATGGCGGAAGCAGAGGAGCAGGAAACTGGGTCCCT
TGAAGAATCTACAGATGAGTCTGAGGAAGAAGAGAGCGAAGAGGAACCCAAGCTGAAGTATGAAAGGCTTTCCAATGGGGTAACTGAAAT
ACTTCAGAAGGATGCAGCTAGCTGCATGACAGTCCATGACAAGTTTTTGGCATTGGGCACACATTATGGCAAGGTTTATTTACTTGATGT
CCAGGGGAACATCACTCAGAAGTTTGATGTAAGTCCTGTGAAGATAAATCAGATTAGCTTGGATGAAAGTGGAGAGCACATGGGTGTGTG
TTCAGAGGATGGCAAGGTGCAGGTATTTGGACTGTATTCTGGAGAAGAATTTCACGAGACTTTTGACTGTCCCATTAAAATTATTGCTGT
GCACCCACATTTCGTGAGATCCAGTTGCAAGCAGTTTGTGACCGGAGGGAAGAAGCTGCTACTGTTTGAACGGTCTTGGATGAACAGATG
GAAGTCTGCTGTTCTGCATGAAGGGGAAGGGAACATAAGGAGTGTGAAGTGGAGAGGCCATCTGATTGCTTGGGCCAATAATATGGGTGT
GAAGATTTTTGACATCATCTCAAAGCAAAGAATCACCAATGTGCCCCGGGATGATATAAGTCTTCGCCCAGACATGTATCCCTGCAGCCT
CTGCTGGAAGGACAATGTGACACTGATTATTGGCTGGGGGACTTCTGTCAAGGTGTGCTCAGTGAAGGAACGGCATGCCAGTGAAATGAG
GGATTTGCCAAGTCGATATGTTGAAATAGTGTCTCAGTTTGAAACTGAATTCTACATCAGTGGACTTGCACCTCTCTGTGATCAGCTTGT
TGTACTTTCGTATGTAAAGGAGATTTCAGAAAAAACGGAAAGAGAATACTGTGCCAGGCCTAGACTGGACATCATCCAGCCACTTTCTGA
GACTTGTGAAGAGATCTCTTCTGATGCTTTGACAGTCAGAGGCTTTCAGGAGAATGAATGTAGAGATTATCATTTAGAATACTCTGAAGG
GGAATCACTTTTTTACATCGTGAGTCCGAGAGATGTTGTAGTGGCCAAGGAACGAGACCAAGATGATCACATTGACTGGCTCCTTGAAAA
GAAGAAATATGAAGAAGCATTGATGGCAGCTGAAATTAGCCAAAAAAATATTAAAAGACATAAGATTCTGGATATTGGCTTGGCATATAT
AAATCACCTGGTGGAGAGAGGAGACTATGACATAGCAGCACGCAAATGCCAGAAAATTCTTGGGAAAAATGCAGCACTCTGGGAATATGA
AGTTTATAAATTTAAAGAAATTGGACAGCTTAAGGCTATTAGTCCTTATTTGCCAAGAGGTGATCCAGTTCTGAAACCACTCATCTATGA
AATGATCTTACATGAATTTTTGGAGAGTGATTATGAGGGTTTTGCCACATTGATCCGAGAATGGCCTGGAGATCTGTATAATAATTCAGT
CATAGTTCAAGCAGTTCGGGATCATTTGAAGAAAGATAGTCAGAACAAGACTTTACTTAAAACCCTGGCAGAATTGTACACCTATGACAA
GAACTATGGCAATGCTCTGGAAATATACTTAACATTAAGACATAAAGACGTTTTTCAGTTGATCCACAAGCATAATCTTTTCAGTTCTAT
CAAGGATAAAATTGTTTTATTAATGGATTTTGATTCAGAGAAAGCTGTTGACATGCTTTTGGACAATGAAGATAAAATTTCAATTAAAAA
GGTAGTGGAAGAATTGGAAGACAGACCAGAGCTACAGCATGTGTATCTGATGAAAAGGATACCACAGAACCCAAGATACCAGCATATCAA
ATCAAGACTGGACACTGGTGCCTGTGTGTATCTTACCAGCTCCCCAGCACTGCCAGACTGTGCCATGAATGGATTATGCTTCTGATGAAA
TCCCTGGCTGCGTTTTGTTCAGTCACCACCGCAGGGTTCTTGTATATTCTCAAGCACCCTCTCTGGGTTTCTGTTTTGCAGCACTGCAGG
CTGATTAAGGGACACTTTCCAAACCACAGGGCTCTCCCTTTATTAGTCCCCACAGTCTTTCATTGGCTGCATTGTCCTCCCATTTCTATC
TATATTCAATCTAATTTCCAATTTCTTTAGCTTTCTGAAATTCTGTAGGCTTTCTCCACTCAGTCTTTTCTAGCCTTATTTTTGTCTTTT
TAAAAATTCACCTTCCTTTTATTATCTCTAGACAAACTATTCATTCCCTCTTTTCTTCTTTAGGCTTTACTTTTCCTGAATATTTGCTAA
TAGCCTTGACTGTAAGCATCAAACTCATTATAGACTAGGTACTCTTCATTTTTATTTGTTGTTTAATTCTGTTTCTTCCGTCAAAGGAAG
AAATATATGTGAAAGGCTTTTATAAACCAGAAAGCACTATACACACATATCACTTATTGAGGTTAACAAAGGACCACTGTGCCTCTTTTC
ATGCTTTCTGCTCTTCCAAGCCAAGAGAACTTCTCCATAGAGGAGGGAAATATTGAAGAAAAATAAAGCCATGGAGTTGTCTCTTAGCAA
CTGAGTTCGTGCCTTATAAATCTCATCTACTTTAGATTACTGCTGCACAAAATCACTTGCTTTTCTTATTGGCTCAGTATAAATAATGTT
ATTAAATTATAGGAAAACAATGGAAAAAATCTAATGAATGACTCATTCACTGAATCAGTTGCACTCTGTGTTTGCACAGCACTGTTGCTA
TAGTCGGGTTGTTTCTTCCTCTTGATTCTAATTGTAAAGTTTTGTTATATATCTTTTATCACAACTTTAGTTGTAAGTAAACAAAGTAGA
AGCTAAATGCTACCTTATACATTTATAGGTTGCTTAAGTCGTGAAGCAATATGAGTCATATATGTAGCACATAACATATACTGTATATAA
GTCAAGTTGTACATAGTTTAAAGAAAATCAGGTGTGCAGAAACTCAGACCAAATTGGGAAGTATTTGCATTATAAAATAACTCATTATAA
GGATTTTTCAGTTAGTAGATCTCCATAGTAAATTTCATTCAGCAAATGTGTATTTTTCCTATTTATTTATTTATTTATTTATTGAGATGA
AGTCTCGCTCTGTTGCCCAAGCTGGAGTGCAGTGGCGTGATCTTGGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGCGATTCTTCTGCC
TCAGCCTCCTGAGTAGCTGGGACTACAGGTGCGCACCACCATGCCTGGCTAATTTTTATATTTTTAGTAGAGACGGGGTTTTACTATGTT
GGTCAGGCTGGTCTCGAACTCCTGACCTTGTGATCCTCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGTGCCAG
GTCTAGTTATTTATTTATTTATTGTAGAGATGGGCTCTCACTGTGTTGGCCAGGGTGGTCTTGAACTCCTGGCCTCGAGCAATCCTTCCA
CCTCAGTCTCCCAAAGTGCTGGGATTACAGGTGTGAGCTATCACACCTGGCCATTTTTTTTTTTTTTTTTTTTTTTGAGATAGAGTCTCA
CTCTGTCACCCGCATTGTAGTGCAGTGGCGCATTCTTGCTCACTGCAACCTCCCCTTCCTGGGCTCAGGGGATCCTCCTACCTCAGCCTC
TCAAGTAGCTGGGACTACAGGGATGTGCCACCATGCCTGGCTAGTTTTTGTGTATGTGTGTGTTTTGTAGAGACGAGGTCTCACTATTTT
GCCCAGGCTGGTCCGGAACTCCTGGGCTCAAGCAATCCTGCCTCCTCAGCCTCCCAACGTGCTGGGATTGCAGGCATGATCCAGTGCACC
TGACCATCATTCAGCAAGTATTTTTGAGTGTCTCCTATGCTAGACAATGGAAGGGATAAAGATTATTCATACGTGTGTGTGTGTGTGTGT
GTGTACGTGCCCATAAATAGTCCTTGGCTTTCTGTCTAGTAAGGTAGGTAAGACAATACAGTGATAAATGCCTTGTGAGTTGTAAGCCGA
GTGCTATAGAGGCTCAGAGGAGGGAAAGGTCACTTCCTGCTGAGTGAGCTCTTACGAGTATGTGGCAAGGTGATACGTTAGAGAATGCAA
GACTAAGAGTTTGTCAAGTTAAGTAATTAAGGCAACATTAAGAGGTAGTGAGTTGGTGTCCAGTGCTAGAGCAAGGAGTTTCAGCTTTAG
TTAGGAAGCAATGGGGAGCCACTGACTTTTTTTTTTCTTTTTGATCAAAAAGTCTTATCTGGGAAAATGTATCTGGCAGTAATGTGTAAT
TTCAATTTTTTTTAAAAAAAATTGGTTCTTGGATAGTCTTTTAGGCTTCTTTTGTATGAAACAAAGCCATGTTTATAAATAAGTGAATTA
TGAAATGGATTCATTTCCAATCTTTTTAAAAGGTAGTCAGAAACTATCTTCAGATGCAATTAGATACAGTAAGATAGGCTGCATAGTTAA
GATGTAAGCAGTTTACACAGTGGGATAAAAATGCTACTTTTCATACTTTCTCAAACCATTTTTAGGGGTATTTTTGTGTCAGCCATCCTG
CTATCTCATTTGTCCAATTAAAGCAAGGAAGAAATCCTCCATTCTCCTTTTTTCCCCATTCTTGGATAGAATGAATGTCTATAGTGAAAA
GAACTCATCAGTAGTCCTCTCAGATTTGGTCTGAATTTCACCAAATATTTGTATAATTGCAAAAAACTTGTTTCTCAGCCTTCTCCTTTA
GGGGGAAAACAAATAGAGGGCATGATGGCCTACCTCCCTGCATTGTGGTGAGGGTCAGCTGATACAAAATAAAGTGTTAAGTGTAATTAC

>98431_98431_3_VPS41-CEP41_VPS41_chr7_38794302_ENST00000310301_CEP41_chr7_130067859_ENST00000489512_length(amino acids)=639AA_BP=596
MAEAEEQETGSLEESTDESEEEESEEEPKLKYERLSNGVTEILQKDAASCMTVHDKFLALGTHYGKVYLLDVQGNITQKFDVSPVKINQI
SLDESGEHMGVCSEDGKVQVFGLYSGEEFHETFDCPIKIIAVHPHFVRSSCKQFVTGGKKLLLFERSWMNRWKSAVLHEGEGNIRSVKWR
GHLIAWANNMGVKIFDIISKQRITNVPRDDISLRPDMYPCSLCWKDNVTLIIGWGTSVKVCSVKERHASEMRDLPSRYVEIVSQFETEFY
ISGLAPLCDQLVVLSYVKEISEKTEREYCARPRLDIIQPLSETCEEISSDALTVRGFQENECRDYHLEYSEGESLFYIVSPRDVVVAKER
DQDDHIDWLLEKKKYEEALMAAEISQKNIKRHKILDIGLAYINHLVERGDYDIAARKCQKILGKNAALWEYEVYKFKEIGQLKAISPYLP
RGDPVLKPLIYEMILHEFLESDYEGFATLIREWPGDLYNNSVIVQAVRDHLKKDSQNKTLLKTLAELYTYDKNYGNALEIYLTLRHKDVF
QLIHKHNLFSSIKDKIVLLMDFDSEKAVDMLLDNEDKISIKKVVEELEDRPELQHVYLMKRIPQNPRYQHIKSRLDTGACVYLTSSPALP

--------------------------------------------------------------
>98431_98431_4_VPS41-CEP41_VPS41_chr7_38794302_ENST00000310301_CEP41_chr7_130067859_ENST00000541543_length(transcript)=3028nt_BP=1843nt
AAGTGTGTGGTTTCCTCTTATTTCCGGGTCTGTCAGGTGACTCTCCCGTGGCGCCATGGCGGAAGCAGAGGAGCAGGAAACTGGGTCCCT
TGAAGAATCTACAGATGAGTCTGAGGAAGAAGAGAGCGAAGAGGAACCCAAGCTGAAGTATGAAAGGCTTTCCAATGGGGTAACTGAAAT
ACTTCAGAAGGATGCAGCTAGCTGCATGACAGTCCATGACAAGTTTTTGGCATTGGGCACACATTATGGCAAGGTTTATTTACTTGATGT
CCAGGGGAACATCACTCAGAAGTTTGATGTAAGTCCTGTGAAGATAAATCAGATTAGCTTGGATGAAAGTGGAGAGCACATGGGTGTGTG
TTCAGAGGATGGCAAGGTGCAGGTATTTGGACTGTATTCTGGAGAAGAATTTCACGAGACTTTTGACTGTCCCATTAAAATTATTGCTGT
GCACCCACATTTCGTGAGATCCAGTTGCAAGCAGTTTGTGACCGGAGGGAAGAAGCTGCTACTGTTTGAACGGTCTTGGATGAACAGATG
GAAGTCTGCTGTTCTGCATGAAGGGGAAGGGAACATAAGGAGTGTGAAGTGGAGAGGCCATCTGATTGCTTGGGCCAATAATATGGGTGT
GAAGATTTTTGACATCATCTCAAAGCAAAGAATCACCAATGTGCCCCGGGATGATATAAGTCTTCGCCCAGACATGTATCCCTGCAGCCT
CTGCTGGAAGGACAATGTGACACTGATTATTGGCTGGGGGACTTCTGTCAAGGTGTGCTCAGTGAAGGAACGGCATGCCAGTGAAATGAG
GGATTTGCCAAGTCGATATGTTGAAATAGTGTCTCAGTTTGAAACTGAATTCTACATCAGTGGACTTGCACCTCTCTGTGATCAGCTTGT
TGTACTTTCGTATGTAAAGGAGATTTCAGAAAAAACGGAAAGAGAATACTGTGCCAGGCCTAGACTGGACATCATCCAGCCACTTTCTGA
GACTTGTGAAGAGATCTCTTCTGATGCTTTGACAGTCAGAGGCTTTCAGGAGAATGAATGTAGAGATTATCATTTAGAATACTCTGAAGG
GGAATCACTTTTTTACATCGTGAGTCCGAGAGATGTTGTAGTGGCCAAGGAACGAGACCAAGATGATCACATTGACTGGCTCCTTGAAAA
GAAGAAATATGAAGAAGCATTGATGGCAGCTGAAATTAGCCAAAAAAATATTAAAAGACATAAGATTCTGGATATTGGCTTGGCATATAT
AAATCACCTGGTGGAGAGAGGAGACTATGACATAGCAGCACGCAAATGCCAGAAAATTCTTGGGAAAAATGCAGCACTCTGGGAATATGA
AGTTTATAAATTTAAAGAAATTGGACAGCTTAAGGCTATTAGTCCTTATTTGCCAAGAGGTGATCCAGTTCTGAAACCACTCATCTATGA
AATGATCTTACATGAATTTTTGGAGAGTGATTATGAGGGTTTTGCCACATTGATCCGAGAATGGCCTGGAGATCTGTATAATAATTCAGT
CATAGTTCAAGCAGTTCGGGATCATTTGAAGAAAGATAGTCAGAACAAGACTTTACTTAAAACCCTGGCAGAATTGTACACCTATGACAA
GAACTATGGCAATGCTCTGGAAATATACTTAACATTAAGACATAAAGACGTTTTTCAGTTGATCCACAAGCATAATCTTTTCAGTTCTAT
CAAGGATAAAATTGTTTTATTAATGGATTTTGATTCAGAGAAAGCTGTTGACATGCTTTTGGACAATGAAGATAAAATTTCAATTAAAAA
GGTAGTGGAAGAATTGGAAGACAGACCAGAGCTACAGCATGTGTATCTGATGAAAAGGATACCACAGAACCCAAGATACCAGCATATCAA
ATCAAGACTGGACACTGATTATAGATACAAAAAAGATGAGCTTTTCAAGAGACTAAAAGTTACAACTTTTGCCCAGCTGATCATCCAAGT
TGCTTCCCTCTCTGATCAAACACTGGAAGTGACAGCTGAGGAGATTCAAAGGCTGGAAGACAATGATTCTGCAGCTTCAGACCCTGATGC
TGAAACCACTGCCAGGACCAATGGGAAAGGAAATCCAGGTGAGCAGTCGCCGAGCCCTGAGCAGTTCATAAACAACGCAGGAGCAGGGGA
CTCCAGCCGCTCAACTCTTCAGAGTGTCATCAGTGGTGTTGGGGAACTGGATCTAGACAAAGGGCCAGTGAAGAAAGCAGAGCCCCATAC
CAAAGACAAACCTTATCCTGACTGCCCCTTCCTGCTGCTAGATGTGCGTGATAGAGATTCTTACCAGCAGTGCCACATTGTTGGAGCTTA
CAGTTACCCAATTGCAACTCTGTCTAGAACAATGAACCCTTATTCAAATGATATTCTTGAATATAAAAATGCCCATGGCAAGATCATCAT
TCTGTATGACGATGATGAAAGGCTGGCCAGTCAGGCGGCCACCACCATGTGCGAGCGTGGATTTGAAAACCTCTTCATGCTTTCCGGAGG
CCGACTGAACCAAGCTAACTCCTCCGGAAGAGAGTCCAAGGTGCCTGGTGCCCGAAGCGCTCAGAATCTGCCAGGTGGCGGCCCCGCCAG
CCACTCAAACCCCCGCTCCCTCAGCAGTGGTCACCTGCAAGGCAAACCCTGGAAGTAAAGACTTTGTCTCACTTAGGCAAATAAATGTTC
TTCCTCTTTTCAGAACCCCTGAGCATTTCCCAAGTTGGGTCATTTCCAGAAACTTCTGCAGAGGAAAGACCATGACATATGTATGGGATA
GGCCTCTTCCCTGTCCCTGTCTCCAGTTCCTCTCCCCTCAGGAGGCCACCTCAGGAAGGATTTTGACAGGTGACCATAAAAACCAGAGGT
GTGACAGGCTCTAGTCTCCTCCCTGTTGTTTACAGCATATTTAAGTCTTGTGAAACTGTATAGGAAAAAAGTATCTACGTTTTTAGTTTT

>98431_98431_4_VPS41-CEP41_VPS41_chr7_38794302_ENST00000310301_CEP41_chr7_130067859_ENST00000541543_length(amino acids)=870AA_BP=596
MAEAEEQETGSLEESTDESEEEESEEEPKLKYERLSNGVTEILQKDAASCMTVHDKFLALGTHYGKVYLLDVQGNITQKFDVSPVKINQI
SLDESGEHMGVCSEDGKVQVFGLYSGEEFHETFDCPIKIIAVHPHFVRSSCKQFVTGGKKLLLFERSWMNRWKSAVLHEGEGNIRSVKWR
GHLIAWANNMGVKIFDIISKQRITNVPRDDISLRPDMYPCSLCWKDNVTLIIGWGTSVKVCSVKERHASEMRDLPSRYVEIVSQFETEFY
ISGLAPLCDQLVVLSYVKEISEKTEREYCARPRLDIIQPLSETCEEISSDALTVRGFQENECRDYHLEYSEGESLFYIVSPRDVVVAKER
DQDDHIDWLLEKKKYEEALMAAEISQKNIKRHKILDIGLAYINHLVERGDYDIAARKCQKILGKNAALWEYEVYKFKEIGQLKAISPYLP
RGDPVLKPLIYEMILHEFLESDYEGFATLIREWPGDLYNNSVIVQAVRDHLKKDSQNKTLLKTLAELYTYDKNYGNALEIYLTLRHKDVF
QLIHKHNLFSSIKDKIVLLMDFDSEKAVDMLLDNEDKISIKKVVEELEDRPELQHVYLMKRIPQNPRYQHIKSRLDTDYRYKKDELFKRL
KVTTFAQLIIQVASLSDQTLEVTAEEIQRLEDNDSAASDPDAETTARTNGKGNPGEQSPSPEQFINNAGAGDSSRSTLQSVISGVGELDL
DKGPVKKAEPHTKDKPYPDCPFLLLDVRDRDSYQQCHIVGAYSYPIATLSRTMNPYSNDILEYKNAHGKIIILYDDDERLASQAATTMCE

--------------------------------------------------------------
>98431_98431_5_VPS41-CEP41_VPS41_chr7_38794302_ENST00000395969_CEP41_chr7_130067859_ENST00000223208_length(transcript)=7922nt_BP=1713nt
ATGGCGGAAGCAGAGGAGCAGGAAACTGGGTCCCTTGAAGAATCTACAGATGAGTCTGAGGAAGAAGAGAGCGAAGAGGAACCCAAGCTG
AAGTATGAAAGGCTTTCCAATGGGGTAACTGAAATACTTCAGAAGGATGCAGCTAGCTGCATGACAGTCCATGACAAGTTTTTGGCATTG
GGCACACATTATGGCAAGGTTTATTTACTTGATGTCCAGGGGAACATCACTCAGAAGTTTGATGTAGTGCAGGTATTTGGACTGTATTCT
GGAGAAGAATTTCACGAGACTTTTGACTGTCCCATTAAAATTATTGCTGTGCACCCACATTTCGTGAGATCCAGTTGCAAGCAGTTTGTG
ACCGGAGGGAAGAAGCTGCTACTGTTTGAACGGTCTTGGATGAACAGATGGAAGTCTGCTGTTCTGCATGAAGGGGAAGGGAACATAAGG
AGTGTGAAGTGGAGAGGCCATCTGATTGCTTGGGCCAATAATATGGGTGTGAAGATTTTTGACATCATCTCAAAGCAAAGAATCACCAAT
GTGCCCCGGGATGATATAAGTCTTCGCCCAGACATGTATCCCTGCAGCCTCTGCTGGAAGGACAATGTGACACTGATTATTGGCTGGGGG
ACTTCTGTCAAGGTGTGCTCAGTGAAGGAACGGCATGCCAGTGAAATGAGGGATTTGCCAAGTCGATATGTTGAAATAGTGTCTCAGTTT
GAAACTGAATTCTACATCAGTGGACTTGCACCTCTCTGTGATCAGCTTGTTGTACTTTCGTATGTAAAGGAGATTTCAGAAAAAACGGAA
AGAGAATACTGTGCCAGGCCTAGACTGGACATCATCCAGCCACTTTCTGAGACTTGTGAAGAGATCTCTTCTGATGCTTTGACAGTCAGA
GGCTTTCAGGAGAATGAATGTAGAGATTATCATTTAGAATACTCTGAAGGGGAATCACTTTTTTACATCGTGAGTCCGAGAGATGTTGTA
GTGGCCAAGGAACGAGACCAAGATGATCACATTGACTGGCTCCTTGAAAAGAAGAAATATGAAGAAGCATTGATGGCAGCTGAAATTAGC
CAAAAAAATATTAAAAGACATAAGATTCTGGATATTGGCTTGGCATATATAAATCACCTGGTGGAGAGAGGAGACTATGACATAGCAGCA
CGCAAATGCCAGAAAATTCTTGGGAAAAATGCAGCACTCTGGGAATATGAAGTTTATAAATTTAAAGAAATTGGACAGCTTAAGGCTATT
AGTCCTTATTTGCCAAGAGGTGATCCAGTTCTGAAACCACTCATCTATGAAATGATCTTACATGAATTTTTGGAGAGTGATTATGAGGGT
TTTGCCACATTGATCCGAGAATGGCCTGGAGATCTGTATAATAATTCAGTCATAGTTCAAGCAGTTCGGGATCATTTGAAGAAAGATAGT
CAGAACAAGACTTTACTTAAAACCCTGGCAGAATTGTACACCTATGACAAGAACTATGGCAATGCTCTGGAAATATACTTAACATTAAGA
CATAAAGACGTTTTTCAGTTGATCCACAAGCATAATCTTTTCAGTTCTATCAAGGATAAAATTGTTTTATTAATGGATTTTGATTCAGAG
AAAGCTGTTGACATGCTTTTGGACAATGAAGATAAAATTTCAATTAAAAAGGTAGTGGAAGAATTGGAAGACAGACCAGAGCTACAGCAT
GTGTATCTGATGAAAAGGATACCACAGAACCCAAGATACCAGCATATCAAATCAAGACTGGACACTGGTAACAGTATGACTAAATATACT
GAGAAGCTCGAAGAGATTAAGAAAAATTATAGATACAAAAAAGATGAGCTTTTCAAGAGACTAAAAGTTACAACTTTTGCCCAGCTGATC
ATCCAAGTTGCTTCCCTCTCTGATCAAACACTGGAAGTGACAGCTGAGGAGATTCAAAGGCTGGAAGACAATGATTCTGCAGCTTCAGAC
CCTGATGCTGAAACCACTGCCAGGACCAATGGGAAAGGAAATCCAGGTGAGCAGTCGCCGAGCCCTGAGCAGTTCATAAACAACGCAGGA
GCAGGGGACTCCAGCCGCTCAACTCTTCAGAGTGTCATCAGTGGTGTTGGGGAACTGGATCTAGACAAAGGGCCAGTGAAGAAAGCAGAG
CCCCATACCAAAGACAAACCTTATCCTGACTGCCCCTTCCTGCTGCTAGATGTGCGTGATAGAGATTCTTACCAGCAGTGCCACATTGTT
GGAGCTTACAGTTACCCAATTGCAACTCTGTCTAGAACAATGAACCCTTATTCAAATGATATTCTTGAATATAAAAATGCCCATGGCAAG
ATCATCATTCTGTATGACGATGATGAAAGGCTGGCCAGTCAGGCGGCCACCACCATGTGCGAGCGTGGATTTGAAAACCTCTTCATGCTT
TCCGGAGGTCTAAAAGTCTTAGCTCAGAAATTCCCGGAAGGACTGATTACTGGTTCCCTGCCAGCATCTTGCCAGCAGGCCCTTCCTCCT
GGGTCTGCCCGGAAACGATCCAGCCCCAAAGGGCCACCCCTACCAGCTGAGAATAAATGGAGATTTACCCCAGAAGACTTAAAAAAGATA
GAATATTATCTGGAAGAGGAGCAAGGGCCTGCAGATCATCCTAGCCGACTGAACCAAGCTAACTCCTCCGGAAGAGAGTCCAAGGTGCCT
GGTGCCCGAAGCGCTCAGAATCTGCCAGGTGGCGGCCCCGCCAGCCACTCAAACCCCCGCTCCCTCAGCAGTGGTCACCTGCAAGGCAAA
CCCTGGAAGTAAAGACTTTGTCTCACTTAGGCAAATAAATGTTCTTCCTCTTTTCAGAACCCCTGAGCATTTCCCAAGTTGGGTCATTTC
CAGAAACTTCTGCAGAGGAAAGACCATGACATATGTATGGGATAGGCCTCTTCCCTGTCCCTGTCTCCAGTTCCTCTCCCCTCAGGAGGC
CACCTCAGGAAGGATTTTGACAGGTGACCATAAAAACCAGAGGTGTGACAGGCTCTAGTCTCCTCCCTGTTGTTTACAGCATATTTAAGT
CTTGTGAAACTGTATAGGAAAAAAGTATCTACGTTTTTAGTTTTTTTGTTTTGTTTTTTTTTAATAAGGTCCAGCTTGTTGGGTCTCTCT
GTGTTGTTTTGTAAATACTTCAGTCACATCTGCCCGTGTGCCTGTCCCTGCCACCTTTTCATTCACTGTTCTTGACTTCATGAAGGCTTC
TCCGGGCAGTCTTGGTGTGAGAAGCTGTTTCCAAGGGTGCAGATGAGCCTTAGGTTCCAGCCGCCTGCAGACCCCACCCCAGCAGGCTCA
CTCAGCAAGGAGCTCTCTGCCCAGCATATTGCAGGCCCTGTTTTGAGTATGGAAGCCAGTGCCTGTGTACTCACTGAAATTGAAGATGAG
GAAAGTAGCTGTACACTCACTGAATGCTCCCCCTTACTAGATATTTCCTGGAGCCAGAAAGGTATGCATGTGGGTGTCTTCACACCGGGG
AGGAGGGCCTCTCATGGGAAAGCCCTGGCCACCACACCGGCCTGTGCCCCTTGAAGCCCACCAAAGCGGCCCTCACTTGTGGTCAGTATA
TCAGTTATGACGCCCATTGCCCAGCTTCAGTCCATCCATGTTAGATGGACAGAAATTATGGCCAGTTGAAAATACCAGCTTTGGTTGGAC
AACTGTGGACACACAAGGTGAAGAGGACTCCGAAGTCCTTTGTCAGGGCTGACAACCTCGTAAGCCCTTGCTTAGAAATACAGTATTAGT
CTAATTGAGTAATTAGTGCAATTTCCTGCTTACTTTTCATTCTCATGACTGAACTGTGATTAGGAAGTTGTGATTATAGATTCTGGTTTT
GGCCGGAATTTTGAATCAGCATTAATTGAATTGCTAAATGACTGACATTCATTCCATTTAATTGGGGGAACAAAAGGCCTCAGGTAAGGA
TGAGGAACTCTGAAATCAGATGGAAAAGAGCGGTGTTAATTTTTATGGTCTGTGATCGTAGCTGTGATAAGGGACTGAGGAATAAATTGT
GCTCTTTGTCATGGCAACCAGCTTCTGAAAAGCCCACTGAAAATTGCCTGTCCTGCTGGTAACTGCTACGGGGTAAGATTTGCCTTAACA
GTACTATTTTCTCGCCACCAAAAAAAAAAAAAAAAAAAAAAAAAAATGCACCACAGTATTTCTAGCATGGGGGCTGTGTTTGTATGAGAA
ATAAACGTAATAAATATCTCATAGAGACATATGGAAAAATAACTTTCAGATTCAGCCCAGTTCTGTTTTAGAGTGTGTTTATTCTTCTCT
ACTTGATTTCCAAAGTGCAACATTTTCCGATGCTTTAGAAATCAAACAAACCAGGGACATTGTTCAGATGTCAAGCCATGCCCAATTTTC
CACAAGATTCAAGAATCTTGTATAAAATTCAGCCAACGTACACATAGCTTTAATGAGGAGCCTGTCATGTTTCCCCATAAATTTATTGCC
TGAGAACTTAGTTCAGCCTTTGCTAATGCCAAAATGCTCTGGCTTTGTATTTTCTTTACAGCATAGATAGAAAAATGCACATTTTTCCAC
ACTCAGCTTTCCCCTAGCATGGACAAGATTTTCAGCCATTTTTGCCACATATACATTTTTAAGGAAAAAAGATTTTTCTCTGTAAGAAAG
TTCTGGTTATGCTGTTTTAAAGGTGACTTGTCAGGAGTTGAGACTTCCCTGCCGGATTCTATTTTGAAAGTAAATGGTCTTCCCTCCTTG
TTCCGATTCTGCGTTCCCATCGTCAGACAACTTTGGAGTATTAGAAACCACTGTATATATGTGGAAAGCCAGGTCAGCCAGACCTGTTAG
AATTGGTGTGCACTCACCTGAGAGATCTGGCAGGTTGGATATATTTATGTGTATTTCTCCACAGTGCTTGCTTTGCCCTGTTGGTAAGGA
TTTTAAATAACCATGCTCAAAAGAGCTGTTCTAATCTGCGTTTTGCATGTTAAGTGTTAATATCAAACATTCTTTACGTGCTCGAGGTAT
TGCTTTTAACATTCTACTTTGCCAGTTTCTTCATTAGATTAATTGACATGTATTATTTAAATGACCAGTGATGCTTTGTGCAATTATGAA
TGTTGAAGATTAAAGTACATAGTTACTAATTTGTCGTTTGCTATTAATATGCTGAAAACTGCCAACTTCTCTCTTCTTTTCTGTCGAGAT
GATTTGGGGGAGCCACAGGAGACTGGTGTGATTTTTGCTGCATCTCCTAGGAAAGCATTTTTTAAAAAAAATAAATGAATCAGGAAATCA
GTCCAATTAGGGCAGGGGGCCTCAGCTCTCCAGTCAGAAAGCCTGGATTTCTTTTCCTGCTCAGGCTGGGACTGAAGCCACCTTCAACAA
CTGGATCATGGCTTCCTACCAGCGTCTCAGGGGTTGACTAGCTGCCCTTGTCTGGGGCTTGTGAACCCTGAGACAGAAGGTGCTTCATCG
ATGTACAACTACAGCACCCTGAACAGCAGTGATGGCCAAAGTTTAAATAATACCTTAAATGCTTTAAAGGGGTTTGTGTTTAAGGAAGGG
AGAAAAGAAAAAAAGAAAGGAAGGGAGAAAGAAGCAAGGAAATGGGAAGAAGGGAGGGCAGAGAGAAGAAAAGAAGCAGAAAGCGAGAGA
GAAAGGAGAAAGCTACATTACTTATTTGAAAACAAAAGGAACACCCTGGGCTGTAATAATGTCAGGCTCAATCTCTTGAAAAAGTATGGA
ATGATTTAAATGGCCTGCATTCATTTTATCTTTTATCTTTTTTTTTTTTTTTAACCTTTAGTGGTTAGCCAGGACCAGCACGATCATATT
GGGCTTGGTATAAATCCGAATGAAAAGAGACCAAATAACATTCATTAGTTGCTCAGGGATTTTTCCTGTGGTGCTATTTAAATTATACAA
AAATTCTTAAGACTTTAGGCTACTCGACCAAGAAAACAGAACAAAACAAAAAATCGTGTTTTCTCTAATTCCCTTGTGGAATGTAAGTGA
AATCAGAGTCCTAGGCTAGGAAGAAATACGTAGGTAATTTTTCTTGTGTTGGTTTTGGTTTCTGTCATGTTGTTTATTGGCTATAGATTC
TGTCTTTTATGTTATCTGACTTTTTTAGAGCGAATAATTAGTTTCTGTCCACCTGGATTTAAATCCATGACCACCTTCTTGCTCTACTCT
GAAGATAATCAGTAAGAACCTTTCTTTCTCCAGTTCTAAAACGTTCTCAGTGTCTTTAATGTGTGTTTATTTTCTTCCCAATTCTTTCAA
AGATTTAACTCCCACGATACTTTTTTTTCCCCAGGAAACACCGCAAATGTGTGGAATATAATTCACCAGTTTAATTATGTGAGCATGTTG
AGTACTTACATGCAGGTCCATTAATTTTTCACTAACAATTATTTTTTCATGCAAAAGCAATAAATAACATTGTGCTTCCAAAATGTTCAG
AATACATTTGGGTAGTAATACTTTCCTAGATTTACAATAATTATTGAAATTATTATTATGACATCTTTAAATGGATACACAGGTCAGTTA
CAACATAAAAAATGTAATGGTGGAAATTTGTCAGCCTTGAATTCAGGCAGAACAGGATTCGGTGCATGATTTTATGTGTTTTCGAAACGG
TGTCTGTCACATTGTGATCCCCTGATGGCTCCCCTCTCTGTTGCTGATCCTCTTTGTTCTGTACAGAAGCAAATTCTCACCTGTGTAACA
TCCTGAAGCACCTGGTAAAATGTGAGGCAAAGAGAGGCCACTTCTCAAATGCTGTGGACGGATGGGCTGCATCTTTTAAAGGATATCAAT
ATGTCTTCCTGTTGCAATTATTTTGACTATAAGCTCCCAGAGAGTGAAAATCACCAACCCTGTGACAGTGGCCCTTAATAAGTGTTCATG
AATAAATGAATTGAACCTGTCAAGATTGAAGTTTGGATGTGATGCCCACTGTGGTGGCCACTCAGTGCTAGTCTGTCATTCTGGAGACCC
AGAAAGCTCGTTATCTTCTGTCCCCCTTGTGTATCCTGCCTTTGTGGGCGAGGCATTCAAAACCCTGAGGTTTTTAGATCTCCCTCTACA
GGAAGTACCCAGAGAGCTGCTGGGGGTGTTATTACCCTGTCTCTGCCAGGCTTAAAGTATCCTCCCAAATTCAGCACGCAAAGGTCACAC
ACCACCCCCATCTTAAGAGTAGGTTTTCTCTTTTGTTTCAAATCTTGAAGATTTCCAGAAAATAATAATACTAGCCAATGTTTGTGGAGA
GCTTACTTTGCTCCTAGATTCTTCATTATATGTTTTGACCCATTTAATTTCCACAACAGTCTGATGAGGCAGGCACCCCCATTTTCAGAT
GAGGAGACTGAGGTTCGGGTAGAAATTAAGTGACTCAGGTTTGCCTTCAGTCTCTGACCAAGGGTTTGAAAAGCCAGGAGCCAGCCTGAG
AACTGTGGCTCCAGAGGGTGTTCTTTCAGCCACTCCGCTCTATGCTTCTCACTCTGGGTGGGCACAACCATGTTCTCCTTTGGTGATGCC
TCTAACTTACCGTGAAAATGTACCTTTCCCTTCGCTATTGGCTTCCCTTCCCTCCTAGTCAGCCGAGATTCTTTTGAAAACTTTCCTCCG
CTTGCCTGCACAAAAGGCGATGGAAATTCAGGAACTGAAACATCTGCTCTGGGGAATGCGTATTTCCACATTTCCACCGCCTGTGTCTGC
TGTCTTATCTTGAAGACAGGTGCTCCAGGGCTTCCGAGGTTATTTTGTCTGTTAATGGACACCTTGCAAAGTACCACTTAAGGAATGAGA
ATTACAAACTTTTAATTATATTGTAGGGGGAAAAAAGTAGGCTGTTTTCCTGATAGGTCTAGCCATTCATTCAGTAAACCATATTGATAT

>98431_98431_5_VPS41-CEP41_VPS41_chr7_38794302_ENST00000395969_CEP41_chr7_130067859_ENST00000223208_length(amino acids)=933AA_BP=571
MAEAEEQETGSLEESTDESEEEESEEEPKLKYERLSNGVTEILQKDAASCMTVHDKFLALGTHYGKVYLLDVQGNITQKFDVVQVFGLYS
GEEFHETFDCPIKIIAVHPHFVRSSCKQFVTGGKKLLLFERSWMNRWKSAVLHEGEGNIRSVKWRGHLIAWANNMGVKIFDIISKQRITN
VPRDDISLRPDMYPCSLCWKDNVTLIIGWGTSVKVCSVKERHASEMRDLPSRYVEIVSQFETEFYISGLAPLCDQLVVLSYVKEISEKTE
REYCARPRLDIIQPLSETCEEISSDALTVRGFQENECRDYHLEYSEGESLFYIVSPRDVVVAKERDQDDHIDWLLEKKKYEEALMAAEIS
QKNIKRHKILDIGLAYINHLVERGDYDIAARKCQKILGKNAALWEYEVYKFKEIGQLKAISPYLPRGDPVLKPLIYEMILHEFLESDYEG
FATLIREWPGDLYNNSVIVQAVRDHLKKDSQNKTLLKTLAELYTYDKNYGNALEIYLTLRHKDVFQLIHKHNLFSSIKDKIVLLMDFDSE
KAVDMLLDNEDKISIKKVVEELEDRPELQHVYLMKRIPQNPRYQHIKSRLDTGNSMTKYTEKLEEIKKNYRYKKDELFKRLKVTTFAQLI
IQVASLSDQTLEVTAEEIQRLEDNDSAASDPDAETTARTNGKGNPGEQSPSPEQFINNAGAGDSSRSTLQSVISGVGELDLDKGPVKKAE
PHTKDKPYPDCPFLLLDVRDRDSYQQCHIVGAYSYPIATLSRTMNPYSNDILEYKNAHGKIIILYDDDERLASQAATTMCERGFENLFML
SGGLKVLAQKFPEGLITGSLPASCQQALPPGSARKRSSPKGPPLPAENKWRFTPEDLKKIEYYLEEEQGPADHPSRLNQANSSGRESKVP

--------------------------------------------------------------
>98431_98431_6_VPS41-CEP41_VPS41_chr7_38794302_ENST00000395969_CEP41_chr7_130067859_ENST00000343969_length(transcript)=2738nt_BP=1713nt
ATGGCGGAAGCAGAGGAGCAGGAAACTGGGTCCCTTGAAGAATCTACAGATGAGTCTGAGGAAGAAGAGAGCGAAGAGGAACCCAAGCTG
AAGTATGAAAGGCTTTCCAATGGGGTAACTGAAATACTTCAGAAGGATGCAGCTAGCTGCATGACAGTCCATGACAAGTTTTTGGCATTG
GGCACACATTATGGCAAGGTTTATTTACTTGATGTCCAGGGGAACATCACTCAGAAGTTTGATGTAGTGCAGGTATTTGGACTGTATTCT
GGAGAAGAATTTCACGAGACTTTTGACTGTCCCATTAAAATTATTGCTGTGCACCCACATTTCGTGAGATCCAGTTGCAAGCAGTTTGTG
ACCGGAGGGAAGAAGCTGCTACTGTTTGAACGGTCTTGGATGAACAGATGGAAGTCTGCTGTTCTGCATGAAGGGGAAGGGAACATAAGG
AGTGTGAAGTGGAGAGGCCATCTGATTGCTTGGGCCAATAATATGGGTGTGAAGATTTTTGACATCATCTCAAAGCAAAGAATCACCAAT
GTGCCCCGGGATGATATAAGTCTTCGCCCAGACATGTATCCCTGCAGCCTCTGCTGGAAGGACAATGTGACACTGATTATTGGCTGGGGG
ACTTCTGTCAAGGTGTGCTCAGTGAAGGAACGGCATGCCAGTGAAATGAGGGATTTGCCAAGTCGATATGTTGAAATAGTGTCTCAGTTT
GAAACTGAATTCTACATCAGTGGACTTGCACCTCTCTGTGATCAGCTTGTTGTACTTTCGTATGTAAAGGAGATTTCAGAAAAAACGGAA
AGAGAATACTGTGCCAGGCCTAGACTGGACATCATCCAGCCACTTTCTGAGACTTGTGAAGAGATCTCTTCTGATGCTTTGACAGTCAGA
GGCTTTCAGGAGAATGAATGTAGAGATTATCATTTAGAATACTCTGAAGGGGAATCACTTTTTTACATCGTGAGTCCGAGAGATGTTGTA
GTGGCCAAGGAACGAGACCAAGATGATCACATTGACTGGCTCCTTGAAAAGAAGAAATATGAAGAAGCATTGATGGCAGCTGAAATTAGC
CAAAAAAATATTAAAAGACATAAGATTCTGGATATTGGCTTGGCATATATAAATCACCTGGTGGAGAGAGGAGACTATGACATAGCAGCA
CGCAAATGCCAGAAAATTCTTGGGAAAAATGCAGCACTCTGGGAATATGAAGTTTATAAATTTAAAGAAATTGGACAGCTTAAGGCTATT
AGTCCTTATTTGCCAAGAGGTGATCCAGTTCTGAAACCACTCATCTATGAAATGATCTTACATGAATTTTTGGAGAGTGATTATGAGGGT
TTTGCCACATTGATCCGAGAATGGCCTGGAGATCTGTATAATAATTCAGTCATAGTTCAAGCAGTTCGGGATCATTTGAAGAAAGATAGT
CAGAACAAGACTTTACTTAAAACCCTGGCAGAATTGTACACCTATGACAAGAACTATGGCAATGCTCTGGAAATATACTTAACATTAAGA
CATAAAGACGTTTTTCAGTTGATCCACAAGCATAATCTTTTCAGTTCTATCAAGGATAAAATTGTTTTATTAATGGATTTTGATTCAGAG
AAAGCTGTTGACATGCTTTTGGACAATGAAGATAAAATTTCAATTAAAAAGGTAGTGGAAGAATTGGAAGACAGACCAGAGCTACAGCAT
GTGTATCTGATGAAAAGGATACCACAGAACCCAAGATACCAGCATATCAAATCAAGACTGGACACTGGTAACAGTATGACTAAATATACT
GAGAAGCTCGAAGAGATTAAGAAAAATTATAGATACAAAAAAGATGAGCTTTTCAAGAGACTAAAAGTTACAACTTTTGCCCAGCTGATC
ATCCAAGTTGCTTCCCTCTCTGATCAAACACTGGAAGTGACAGCTGAGGAGATTCAAAGGCTGGAAGACAATGATTCTGCAGCTTCAGAC
CCTGATGCTGAAACCACTGCCAGGACCAATGGGAAAGGAAATCCAGGTGAGCAGTCGCCGAGCCCTGAGCAGTTCATAAACAACGCAGGA
GCAGGGGACTCCAGCCGCTCAACTCTTCAGAGTGTCATCAGTGGTGTTGGGGAACTGGATCTAGACAAAGGGCCAGTGAAGAAAGCAGAG
CCCCATACCAAAGACAAACCTTATCCTGACTGCCCCTTCCTGCTGCTAGATGTGCGTGATAGAGATTCTTACCAGCAGTGCCACATTGTT
GGAGCTTACAGTTACCCAATTGCAACTCTGTCTAGAACAATGAACCCTTATTCAAATGATATTCTTGAATATAAAAATGCCCATGGCAAG
ATCATCATTCTGTATGACGATGATGAAAGGCTGGCCAGTCAGGCGGCCACCACCATGTGCGAGCGTGGATTTGAAAACCTCTTCATGCTT
TCCGGAGGCCGACTGAACCAAGCTAACTCCTCCGGAAGAGAGTCCAAGGTGCCTGGTGCCCGAAGCGCTCAGAATCTGCCAGGTGGCGGC
CCCGCCAGCCACTCAAACCCCCGCTCCCTCAGCAGTGGTCACCTGCAAGGCAAACCCTGGAAGTAAAGACTTTGTCTCACTTAGGCAAAT
AAATGTTCTTCCTCTTTTCAGAACCCCTGAGCATTTCCCAAGTTGGGTCATTTCCAGAAACTTCTGCAGAGGAAAGACCATGACATATGT

>98431_98431_6_VPS41-CEP41_VPS41_chr7_38794302_ENST00000395969_CEP41_chr7_130067859_ENST00000343969_length(amino acids)=861AA_BP=571
MAEAEEQETGSLEESTDESEEEESEEEPKLKYERLSNGVTEILQKDAASCMTVHDKFLALGTHYGKVYLLDVQGNITQKFDVVQVFGLYS
GEEFHETFDCPIKIIAVHPHFVRSSCKQFVTGGKKLLLFERSWMNRWKSAVLHEGEGNIRSVKWRGHLIAWANNMGVKIFDIISKQRITN
VPRDDISLRPDMYPCSLCWKDNVTLIIGWGTSVKVCSVKERHASEMRDLPSRYVEIVSQFETEFYISGLAPLCDQLVVLSYVKEISEKTE
REYCARPRLDIIQPLSETCEEISSDALTVRGFQENECRDYHLEYSEGESLFYIVSPRDVVVAKERDQDDHIDWLLEKKKYEEALMAAEIS
QKNIKRHKILDIGLAYINHLVERGDYDIAARKCQKILGKNAALWEYEVYKFKEIGQLKAISPYLPRGDPVLKPLIYEMILHEFLESDYEG
FATLIREWPGDLYNNSVIVQAVRDHLKKDSQNKTLLKTLAELYTYDKNYGNALEIYLTLRHKDVFQLIHKHNLFSSIKDKIVLLMDFDSE
KAVDMLLDNEDKISIKKVVEELEDRPELQHVYLMKRIPQNPRYQHIKSRLDTGNSMTKYTEKLEEIKKNYRYKKDELFKRLKVTTFAQLI
IQVASLSDQTLEVTAEEIQRLEDNDSAASDPDAETTARTNGKGNPGEQSPSPEQFINNAGAGDSSRSTLQSVISGVGELDLDKGPVKKAE
PHTKDKPYPDCPFLLLDVRDRDSYQQCHIVGAYSYPIATLSRTMNPYSNDILEYKNAHGKIIILYDDDERLASQAATTMCERGFENLFML

--------------------------------------------------------------
>98431_98431_7_VPS41-CEP41_VPS41_chr7_38794302_ENST00000395969_CEP41_chr7_130067859_ENST00000489512_length(transcript)=4751nt_BP=1713nt
ATGGCGGAAGCAGAGGAGCAGGAAACTGGGTCCCTTGAAGAATCTACAGATGAGTCTGAGGAAGAAGAGAGCGAAGAGGAACCCAAGCTG
AAGTATGAAAGGCTTTCCAATGGGGTAACTGAAATACTTCAGAAGGATGCAGCTAGCTGCATGACAGTCCATGACAAGTTTTTGGCATTG
GGCACACATTATGGCAAGGTTTATTTACTTGATGTCCAGGGGAACATCACTCAGAAGTTTGATGTAGTGCAGGTATTTGGACTGTATTCT
GGAGAAGAATTTCACGAGACTTTTGACTGTCCCATTAAAATTATTGCTGTGCACCCACATTTCGTGAGATCCAGTTGCAAGCAGTTTGTG
ACCGGAGGGAAGAAGCTGCTACTGTTTGAACGGTCTTGGATGAACAGATGGAAGTCTGCTGTTCTGCATGAAGGGGAAGGGAACATAAGG
AGTGTGAAGTGGAGAGGCCATCTGATTGCTTGGGCCAATAATATGGGTGTGAAGATTTTTGACATCATCTCAAAGCAAAGAATCACCAAT
GTGCCCCGGGATGATATAAGTCTTCGCCCAGACATGTATCCCTGCAGCCTCTGCTGGAAGGACAATGTGACACTGATTATTGGCTGGGGG
ACTTCTGTCAAGGTGTGCTCAGTGAAGGAACGGCATGCCAGTGAAATGAGGGATTTGCCAAGTCGATATGTTGAAATAGTGTCTCAGTTT
GAAACTGAATTCTACATCAGTGGACTTGCACCTCTCTGTGATCAGCTTGTTGTACTTTCGTATGTAAAGGAGATTTCAGAAAAAACGGAA
AGAGAATACTGTGCCAGGCCTAGACTGGACATCATCCAGCCACTTTCTGAGACTTGTGAAGAGATCTCTTCTGATGCTTTGACAGTCAGA
GGCTTTCAGGAGAATGAATGTAGAGATTATCATTTAGAATACTCTGAAGGGGAATCACTTTTTTACATCGTGAGTCCGAGAGATGTTGTA
GTGGCCAAGGAACGAGACCAAGATGATCACATTGACTGGCTCCTTGAAAAGAAGAAATATGAAGAAGCATTGATGGCAGCTGAAATTAGC
CAAAAAAATATTAAAAGACATAAGATTCTGGATATTGGCTTGGCATATATAAATCACCTGGTGGAGAGAGGAGACTATGACATAGCAGCA
CGCAAATGCCAGAAAATTCTTGGGAAAAATGCAGCACTCTGGGAATATGAAGTTTATAAATTTAAAGAAATTGGACAGCTTAAGGCTATT
AGTCCTTATTTGCCAAGAGGTGATCCAGTTCTGAAACCACTCATCTATGAAATGATCTTACATGAATTTTTGGAGAGTGATTATGAGGGT
TTTGCCACATTGATCCGAGAATGGCCTGGAGATCTGTATAATAATTCAGTCATAGTTCAAGCAGTTCGGGATCATTTGAAGAAAGATAGT
CAGAACAAGACTTTACTTAAAACCCTGGCAGAATTGTACACCTATGACAAGAACTATGGCAATGCTCTGGAAATATACTTAACATTAAGA
CATAAAGACGTTTTTCAGTTGATCCACAAGCATAATCTTTTCAGTTCTATCAAGGATAAAATTGTTTTATTAATGGATTTTGATTCAGAG
AAAGCTGTTGACATGCTTTTGGACAATGAAGATAAAATTTCAATTAAAAAGGTAGTGGAAGAATTGGAAGACAGACCAGAGCTACAGCAT
GTGTATCTGATGAAAAGGATACCACAGAACCCAAGATACCAGCATATCAAATCAAGACTGGACACTGGTGCCTGTGTGTATCTTACCAGC
TCCCCAGCACTGCCAGACTGTGCCATGAATGGATTATGCTTCTGATGAAATCCCTGGCTGCGTTTTGTTCAGTCACCACCGCAGGGTTCT
TGTATATTCTCAAGCACCCTCTCTGGGTTTCTGTTTTGCAGCACTGCAGGCTGATTAAGGGACACTTTCCAAACCACAGGGCTCTCCCTT
TATTAGTCCCCACAGTCTTTCATTGGCTGCATTGTCCTCCCATTTCTATCTATATTCAATCTAATTTCCAATTTCTTTAGCTTTCTGAAA
TTCTGTAGGCTTTCTCCACTCAGTCTTTTCTAGCCTTATTTTTGTCTTTTTAAAAATTCACCTTCCTTTTATTATCTCTAGACAAACTAT
TCATTCCCTCTTTTCTTCTTTAGGCTTTACTTTTCCTGAATATTTGCTAATAGCCTTGACTGTAAGCATCAAACTCATTATAGACTAGGT
ACTCTTCATTTTTATTTGTTGTTTAATTCTGTTTCTTCCGTCAAAGGAAGAAATATATGTGAAAGGCTTTTATAAACCAGAAAGCACTAT
ACACACATATCACTTATTGAGGTTAACAAAGGACCACTGTGCCTCTTTTCATGCTTTCTGCTCTTCCAAGCCAAGAGAACTTCTCCATAG
AGGAGGGAAATATTGAAGAAAAATAAAGCCATGGAGTTGTCTCTTAGCAACTGAGTTCGTGCCTTATAAATCTCATCTACTTTAGATTAC
TGCTGCACAAAATCACTTGCTTTTCTTATTGGCTCAGTATAAATAATGTTATTAAATTATAGGAAAACAATGGAAAAAATCTAATGAATG
ACTCATTCACTGAATCAGTTGCACTCTGTGTTTGCACAGCACTGTTGCTATAGTCGGGTTGTTTCTTCCTCTTGATTCTAATTGTAAAGT
TTTGTTATATATCTTTTATCACAACTTTAGTTGTAAGTAAACAAAGTAGAAGCTAAATGCTACCTTATACATTTATAGGTTGCTTAAGTC
GTGAAGCAATATGAGTCATATATGTAGCACATAACATATACTGTATATAAGTCAAGTTGTACATAGTTTAAAGAAAATCAGGTGTGCAGA
AACTCAGACCAAATTGGGAAGTATTTGCATTATAAAATAACTCATTATAAGGATTTTTCAGTTAGTAGATCTCCATAGTAAATTTCATTC
AGCAAATGTGTATTTTTCCTATTTATTTATTTATTTATTTATTGAGATGAAGTCTCGCTCTGTTGCCCAAGCTGGAGTGCAGTGGCGTGA
TCTTGGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGCGATTCTTCTGCCTCAGCCTCCTGAGTAGCTGGGACTACAGGTGCGCACCACC
ATGCCTGGCTAATTTTTATATTTTTAGTAGAGACGGGGTTTTACTATGTTGGTCAGGCTGGTCTCGAACTCCTGACCTTGTGATCCTCCC
GCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGTGCCAGGTCTAGTTATTTATTTATTTATTGTAGAGATGGGCTCTCA
CTGTGTTGGCCAGGGTGGTCTTGAACTCCTGGCCTCGAGCAATCCTTCCACCTCAGTCTCCCAAAGTGCTGGGATTACAGGTGTGAGCTA
TCACACCTGGCCATTTTTTTTTTTTTTTTTTTTTTTGAGATAGAGTCTCACTCTGTCACCCGCATTGTAGTGCAGTGGCGCATTCTTGCT
CACTGCAACCTCCCCTTCCTGGGCTCAGGGGATCCTCCTACCTCAGCCTCTCAAGTAGCTGGGACTACAGGGATGTGCCACCATGCCTGG
CTAGTTTTTGTGTATGTGTGTGTTTTGTAGAGACGAGGTCTCACTATTTTGCCCAGGCTGGTCCGGAACTCCTGGGCTCAAGCAATCCTG
CCTCCTCAGCCTCCCAACGTGCTGGGATTGCAGGCATGATCCAGTGCACCTGACCATCATTCAGCAAGTATTTTTGAGTGTCTCCTATGC
TAGACAATGGAAGGGATAAAGATTATTCATACGTGTGTGTGTGTGTGTGTGTGTACGTGCCCATAAATAGTCCTTGGCTTTCTGTCTAGT
AAGGTAGGTAAGACAATACAGTGATAAATGCCTTGTGAGTTGTAAGCCGAGTGCTATAGAGGCTCAGAGGAGGGAAAGGTCACTTCCTGC
TGAGTGAGCTCTTACGAGTATGTGGCAAGGTGATACGTTAGAGAATGCAAGACTAAGAGTTTGTCAAGTTAAGTAATTAAGGCAACATTA
AGAGGTAGTGAGTTGGTGTCCAGTGCTAGAGCAAGGAGTTTCAGCTTTAGTTAGGAAGCAATGGGGAGCCACTGACTTTTTTTTTTCTTT
TTGATCAAAAAGTCTTATCTGGGAAAATGTATCTGGCAGTAATGTGTAATTTCAATTTTTTTTAAAAAAAATTGGTTCTTGGATAGTCTT
TTAGGCTTCTTTTGTATGAAACAAAGCCATGTTTATAAATAAGTGAATTATGAAATGGATTCATTTCCAATCTTTTTAAAAGGTAGTCAG
AAACTATCTTCAGATGCAATTAGATACAGTAAGATAGGCTGCATAGTTAAGATGTAAGCAGTTTACACAGTGGGATAAAAATGCTACTTT
TCATACTTTCTCAAACCATTTTTAGGGGTATTTTTGTGTCAGCCATCCTGCTATCTCATTTGTCCAATTAAAGCAAGGAAGAAATCCTCC
ATTCTCCTTTTTTCCCCATTCTTGGATAGAATGAATGTCTATAGTGAAAAGAACTCATCAGTAGTCCTCTCAGATTTGGTCTGAATTTCA
CCAAATATTTGTATAATTGCAAAAAACTTGTTTCTCAGCCTTCTCCTTTAGGGGGAAAACAAATAGAGGGCATGATGGCCTACCTCCCTG

>98431_98431_7_VPS41-CEP41_VPS41_chr7_38794302_ENST00000395969_CEP41_chr7_130067859_ENST00000489512_length(amino acids)=614AA_BP=571
MAEAEEQETGSLEESTDESEEEESEEEPKLKYERLSNGVTEILQKDAASCMTVHDKFLALGTHYGKVYLLDVQGNITQKFDVVQVFGLYS
GEEFHETFDCPIKIIAVHPHFVRSSCKQFVTGGKKLLLFERSWMNRWKSAVLHEGEGNIRSVKWRGHLIAWANNMGVKIFDIISKQRITN
VPRDDISLRPDMYPCSLCWKDNVTLIIGWGTSVKVCSVKERHASEMRDLPSRYVEIVSQFETEFYISGLAPLCDQLVVLSYVKEISEKTE
REYCARPRLDIIQPLSETCEEISSDALTVRGFQENECRDYHLEYSEGESLFYIVSPRDVVVAKERDQDDHIDWLLEKKKYEEALMAAEIS
QKNIKRHKILDIGLAYINHLVERGDYDIAARKCQKILGKNAALWEYEVYKFKEIGQLKAISPYLPRGDPVLKPLIYEMILHEFLESDYEG
FATLIREWPGDLYNNSVIVQAVRDHLKKDSQNKTLLKTLAELYTYDKNYGNALEIYLTLRHKDVFQLIHKHNLFSSIKDKIVLLMDFDSE

--------------------------------------------------------------
>98431_98431_8_VPS41-CEP41_VPS41_chr7_38794302_ENST00000395969_CEP41_chr7_130067859_ENST00000541543_length(transcript)=2898nt_BP=1713nt
ATGGCGGAAGCAGAGGAGCAGGAAACTGGGTCCCTTGAAGAATCTACAGATGAGTCTGAGGAAGAAGAGAGCGAAGAGGAACCCAAGCTG
AAGTATGAAAGGCTTTCCAATGGGGTAACTGAAATACTTCAGAAGGATGCAGCTAGCTGCATGACAGTCCATGACAAGTTTTTGGCATTG
GGCACACATTATGGCAAGGTTTATTTACTTGATGTCCAGGGGAACATCACTCAGAAGTTTGATGTAGTGCAGGTATTTGGACTGTATTCT
GGAGAAGAATTTCACGAGACTTTTGACTGTCCCATTAAAATTATTGCTGTGCACCCACATTTCGTGAGATCCAGTTGCAAGCAGTTTGTG
ACCGGAGGGAAGAAGCTGCTACTGTTTGAACGGTCTTGGATGAACAGATGGAAGTCTGCTGTTCTGCATGAAGGGGAAGGGAACATAAGG
AGTGTGAAGTGGAGAGGCCATCTGATTGCTTGGGCCAATAATATGGGTGTGAAGATTTTTGACATCATCTCAAAGCAAAGAATCACCAAT
GTGCCCCGGGATGATATAAGTCTTCGCCCAGACATGTATCCCTGCAGCCTCTGCTGGAAGGACAATGTGACACTGATTATTGGCTGGGGG
ACTTCTGTCAAGGTGTGCTCAGTGAAGGAACGGCATGCCAGTGAAATGAGGGATTTGCCAAGTCGATATGTTGAAATAGTGTCTCAGTTT
GAAACTGAATTCTACATCAGTGGACTTGCACCTCTCTGTGATCAGCTTGTTGTACTTTCGTATGTAAAGGAGATTTCAGAAAAAACGGAA
AGAGAATACTGTGCCAGGCCTAGACTGGACATCATCCAGCCACTTTCTGAGACTTGTGAAGAGATCTCTTCTGATGCTTTGACAGTCAGA
GGCTTTCAGGAGAATGAATGTAGAGATTATCATTTAGAATACTCTGAAGGGGAATCACTTTTTTACATCGTGAGTCCGAGAGATGTTGTA
GTGGCCAAGGAACGAGACCAAGATGATCACATTGACTGGCTCCTTGAAAAGAAGAAATATGAAGAAGCATTGATGGCAGCTGAAATTAGC
CAAAAAAATATTAAAAGACATAAGATTCTGGATATTGGCTTGGCATATATAAATCACCTGGTGGAGAGAGGAGACTATGACATAGCAGCA
CGCAAATGCCAGAAAATTCTTGGGAAAAATGCAGCACTCTGGGAATATGAAGTTTATAAATTTAAAGAAATTGGACAGCTTAAGGCTATT
AGTCCTTATTTGCCAAGAGGTGATCCAGTTCTGAAACCACTCATCTATGAAATGATCTTACATGAATTTTTGGAGAGTGATTATGAGGGT
TTTGCCACATTGATCCGAGAATGGCCTGGAGATCTGTATAATAATTCAGTCATAGTTCAAGCAGTTCGGGATCATTTGAAGAAAGATAGT
CAGAACAAGACTTTACTTAAAACCCTGGCAGAATTGTACACCTATGACAAGAACTATGGCAATGCTCTGGAAATATACTTAACATTAAGA
CATAAAGACGTTTTTCAGTTGATCCACAAGCATAATCTTTTCAGTTCTATCAAGGATAAAATTGTTTTATTAATGGATTTTGATTCAGAG
AAAGCTGTTGACATGCTTTTGGACAATGAAGATAAAATTTCAATTAAAAAGGTAGTGGAAGAATTGGAAGACAGACCAGAGCTACAGCAT
GTGTATCTGATGAAAAGGATACCACAGAACCCAAGATACCAGCATATCAAATCAAGACTGGACACTGATTATAGATACAAAAAAGATGAG
CTTTTCAAGAGACTAAAAGTTACAACTTTTGCCCAGCTGATCATCCAAGTTGCTTCCCTCTCTGATCAAACACTGGAAGTGACAGCTGAG
GAGATTCAAAGGCTGGAAGACAATGATTCTGCAGCTTCAGACCCTGATGCTGAAACCACTGCCAGGACCAATGGGAAAGGAAATCCAGGT
GAGCAGTCGCCGAGCCCTGAGCAGTTCATAAACAACGCAGGAGCAGGGGACTCCAGCCGCTCAACTCTTCAGAGTGTCATCAGTGGTGTT
GGGGAACTGGATCTAGACAAAGGGCCAGTGAAGAAAGCAGAGCCCCATACCAAAGACAAACCTTATCCTGACTGCCCCTTCCTGCTGCTA
GATGTGCGTGATAGAGATTCTTACCAGCAGTGCCACATTGTTGGAGCTTACAGTTACCCAATTGCAACTCTGTCTAGAACAATGAACCCT
TATTCAAATGATATTCTTGAATATAAAAATGCCCATGGCAAGATCATCATTCTGTATGACGATGATGAAAGGCTGGCCAGTCAGGCGGCC
ACCACCATGTGCGAGCGTGGATTTGAAAACCTCTTCATGCTTTCCGGAGGCCGACTGAACCAAGCTAACTCCTCCGGAAGAGAGTCCAAG
GTGCCTGGTGCCCGAAGCGCTCAGAATCTGCCAGGTGGCGGCCCCGCCAGCCACTCAAACCCCCGCTCCCTCAGCAGTGGTCACCTGCAA
GGCAAACCCTGGAAGTAAAGACTTTGTCTCACTTAGGCAAATAAATGTTCTTCCTCTTTTCAGAACCCCTGAGCATTTCCCAAGTTGGGT
CATTTCCAGAAACTTCTGCAGAGGAAAGACCATGACATATGTATGGGATAGGCCTCTTCCCTGTCCCTGTCTCCAGTTCCTCTCCCCTCA
GGAGGCCACCTCAGGAAGGATTTTGACAGGTGACCATAAAAACCAGAGGTGTGACAGGCTCTAGTCTCCTCCCTGTTGTTTACAGCATAT
TTAAGTCTTGTGAAACTGTATAGGAAAAAAGTATCTACGTTTTTAGTTTTTTTGTTTTGTTTTTTTTTAATAAGGTCCAGCTTGTTGGGT

>98431_98431_8_VPS41-CEP41_VPS41_chr7_38794302_ENST00000395969_CEP41_chr7_130067859_ENST00000541543_length(amino acids)=845AA_BP=571
MAEAEEQETGSLEESTDESEEEESEEEPKLKYERLSNGVTEILQKDAASCMTVHDKFLALGTHYGKVYLLDVQGNITQKFDVVQVFGLYS
GEEFHETFDCPIKIIAVHPHFVRSSCKQFVTGGKKLLLFERSWMNRWKSAVLHEGEGNIRSVKWRGHLIAWANNMGVKIFDIISKQRITN
VPRDDISLRPDMYPCSLCWKDNVTLIIGWGTSVKVCSVKERHASEMRDLPSRYVEIVSQFETEFYISGLAPLCDQLVVLSYVKEISEKTE
REYCARPRLDIIQPLSETCEEISSDALTVRGFQENECRDYHLEYSEGESLFYIVSPRDVVVAKERDQDDHIDWLLEKKKYEEALMAAEIS
QKNIKRHKILDIGLAYINHLVERGDYDIAARKCQKILGKNAALWEYEVYKFKEIGQLKAISPYLPRGDPVLKPLIYEMILHEFLESDYEG
FATLIREWPGDLYNNSVIVQAVRDHLKKDSQNKTLLKTLAELYTYDKNYGNALEIYLTLRHKDVFQLIHKHNLFSSIKDKIVLLMDFDSE
KAVDMLLDNEDKISIKKVVEELEDRPELQHVYLMKRIPQNPRYQHIKSRLDTDYRYKKDELFKRLKVTTFAQLIIQVASLSDQTLEVTAE
EIQRLEDNDSAASDPDAETTARTNGKGNPGEQSPSPEQFINNAGAGDSSRSTLQSVISGVGELDLDKGPVKKAEPHTKDKPYPDCPFLLL
DVRDRDSYQQCHIVGAYSYPIATLSRTMNPYSNDILEYKNAHGKIIILYDDDERLASQAATTMCERGFENLFMLSGGRLNQANSSGRESK

--------------------------------------------------------------
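
The FASTA headers above pack the fusion ID, both partner genes, their breakpoints, the Ensembl transcripts, the sequence length, and the breakpoint position (BP) into one underscore-delimited string. The following is a minimal sketch of how such a header could be split into fields. The field layout is inferred from the records on this page and is an assumption, not a documented specification; `parse_header` and `aa_breakpoint` are hypothetical helper names. The sketch also assumes gene symbols contain no underscores. The second helper illustrates how the nucleotide BP and the amino-acid BP in these records relate, assuming the coding sequence starts at the first ATG of the 5' transcript (position 56 for ENST00000310301, position 1 for ENST00000395969, as read from the sequences above).

```python
import re

# Header layout inferred from the records on this page (assumption, not a spec).
HEADER_RE = re.compile(
    r"^>(?P<fusion_id>\d+)_\d+_(?P<index>\d+)_"
    r"(?P<fusion>[^_]+)_"
    r"(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)_"
    r"length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?:nt|AA)_"
    r"BP=(?P<bp>\d+)(?:nt)?$"
)

INT_FIELDS = ("fusion_id", "index", "hbp", "tbp", "length", "bp")


def parse_header(line: str) -> dict:
    """Split one FASTA header line into named fields (hypothetical helper)."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    for name in INT_FIELDS:
        fields[name] = int(fields[name])
    return fields


def aa_breakpoint(bp_nt: int, cds_start: int = 1) -> int:
    """Number of complete codons between the CDS start and the breakpoint.

    cds_start is the 1-based transcript position of the start codon's A
    (an input the caller must supply; it is not part of the header).
    """
    return (bp_nt - cds_start + 1) // 3


header = (">98431_98431_4_VPS41-CEP41_VPS41_chr7_38794302_ENST00000310301_"
          "CEP41_chr7_130067859_ENST00000541543_length(amino acids)=870AA_BP=596")
record = parse_header(header)
print(record["hgene"], record["tgene"], record["length"], record["bp"])
print(aa_breakpoint(1843, cds_start=56), aa_breakpoint(1713, cds_start=1))
```

With the values from this page, a 1843 nt breakpoint and a 55 nt 5' UTR give 596 codons, and a 1713 nt breakpoint with no 5' UTR gives 571 codons, which agrees with the BP values in the corresponding amino-acid headers.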


Fusion Gene PPI Analysis for VPS41-CEP41


Go to ChiPPI (Chimeric Protein-Protein Interactions) to see the chimeric PPI interactions on the ChiPPI page.


Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene | Hgene's interactors | Tgene | Tgene's interactors


- Retained PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with
Hgene | VPS41 | chr7:38794302 | chr7:130067859 | ENST00000310301 | - | 21 | 29 | 1_540 | 596.0 | 855.0 | ARL8B
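
The row above lists an interaction feature spanning residues 1_540 and a breakpoint at residue 596.0 for the 5' partner. One plausible reading, offered here as an assumption rather than the database's documented rule, is that an interaction counts as retained when its feature interval lies entirely within the part of the Hgene protein kept in the fusion (residues 1 through BPloci). A minimal sketch of that check, with `feature_retained_in_hgene` as a hypothetical helper name:

```python
def feature_retained_in_hgene(feature_loci: str, bp_loci: float) -> bool:
    """Return True if the feature interval lies within residues 1..bp_loci.

    Assumption: the 5' partner (Hgene) contributes its N-terminal residues
    up to the breakpoint, so a feature is kept only if it ends at or before
    bp_loci. feature_loci uses the "start_end" form seen in the table.
    """
    start, end = (int(part) for part in feature_loci.split("_"))
    return 1 <= start <= end <= bp_loci


# Values from the VPS41 row: feature 1_540, breakpoint at residue 596.
print(feature_retained_in_hgene("1_540", 596.0))
```

For a 3' partner (Tgene) the retained region would instead run from the breakpoint to the C-terminus, so the comparison would need to be reversed; that case is not shown here.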


- Lost PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with


- Retained PPIs, but lost function due to frame-shift fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with



Related Drugs for VPS41-CEP41


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status


Related Diseases for VPS41-CEP41


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source