FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:GOLIM4-NCEH1 (FusionGDB2 ID:33949)

Fusion Gene Summary for GOLIM4-NCEH1

check button Fusion gene summary
Fusion gene informationFusion gene name: GOLIM4-NCEH1
Fusion gene ID: 33949
HgeneTgene
Gene symbol

GOLIM4

NCEH1

Gene ID

27333

57552

Gene namegolgi integral membrane protein 4neutral cholesterol ester hydrolase 1
SynonymsGIMPC|GOLPH4|GPP130|P138AADACL1|NCEH
Cytomap

3q26.2

3q26.31

Type of geneprotein-codingprotein-coding
DescriptionGolgi integral membrane protein 4130 kDa golgi-localized phosphoproteincis Golgi-localized calcium-binding proteingolgi integral membrane protein, cisgolgi phosphoprotein 4golgi phosphoprotein of 130 kDagolgi-localized phosphoprotein of 130 kDatypeneutral cholesterol ester hydrolase 1acetylalkylglycerol acetylhydrolasealkylacetylglycerol acetylhydrolasearylacetamide deacetylase-like 1
Modification date2020031320200313
UniProtAcc

O00461

Q6PIU2

Ensembl transtripts involved in fusion geneENST00000309027, ENST00000470487, 
ENST00000543711, ENST00000273512, 
ENST00000475381, ENST00000538775, 
Fusion gene scores* DoF score14 X 11 X 10=154012 X 11 X 9=1188
# samples 1817
** MAII scorelog2(18/1540*10)=-3.09686153925259
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(17/1188*10)=-2.80492818466306
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: GOLIM4 [Title/Abstract] AND NCEH1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointGOLIM4(167812887)-NCEH1(172365904), # samples:2
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID

check buttonFusion gene breakpoints across GOLIM4 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across NCEH1 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4LUADTCGA-38-4630-01AGOLIM4chr3

167812887

-NCEH1chr3

172365904

-


Top

Fusion Gene ORF analysis for GOLIM4-NCEH1

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000309027ENST00000543711GOLIM4chr3

167812887

-NCEH1chr3

172365904

-
5CDS-intronENST00000470487ENST00000543711GOLIM4chr3

167812887

-NCEH1chr3

172365904

-
In-frameENST00000309027ENST00000273512GOLIM4chr3

167812887

-NCEH1chr3

172365904

-
In-frameENST00000309027ENST00000475381GOLIM4chr3

167812887

-NCEH1chr3

172365904

-
In-frameENST00000309027ENST00000538775GOLIM4chr3

167812887

-NCEH1chr3

172365904

-
In-frameENST00000470487ENST00000273512GOLIM4chr3

167812887

-NCEH1chr3

172365904

-
In-frameENST00000470487ENST00000475381GOLIM4chr3

167812887

-NCEH1chr3

172365904

-
In-frameENST00000470487ENST00000538775GOLIM4chr3

167812887

-NCEH1chr3

172365904

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000470487GOLIM4chr3167812887-ENST00000475381NCEH1chr3172365904-51928778831965360
ENST00000470487GOLIM4chr3167812887-ENST00000273512NCEH1chr3172365904-47968778831965360
ENST00000470487GOLIM4chr3167812887-ENST00000538775NCEH1chr3172365904-48208778831989368
ENST00000309027GOLIM4chr3167812887-ENST00000475381NCEH1chr3172365904-45612462521334360
ENST00000309027GOLIM4chr3167812887-ENST00000273512NCEH1chr3172365904-41652462521334360
ENST00000309027GOLIM4chr3167812887-ENST00000538775NCEH1chr3172365904-41892462521358368

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000470487ENST00000475381GOLIM4chr3167812887-NCEH1chr3172365904-0.0002759440.9997241
ENST00000470487ENST00000273512GOLIM4chr3167812887-NCEH1chr3172365904-0.0002434810.9997565
ENST00000470487ENST00000538775GOLIM4chr3167812887-NCEH1chr3172365904-0.0002468330.9997532
ENST00000309027ENST00000475381GOLIM4chr3167812887-NCEH1chr3172365904-0.0002413260.99975866
ENST00000309027ENST00000273512GOLIM4chr3167812887-NCEH1chr3172365904-0.0002127840.9997873
ENST00000309027ENST00000538775GOLIM4chr3167812887-NCEH1chr3172365904-0.0002167790.9997832

Top

Fusion Genomic Features for GOLIM4-NCEH1


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for GOLIM4-NCEH1


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr3:167812887/chr3:172365904)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
GOLIM4

O00461

NCEH1

Q6PIU2

FUNCTION: Plays a role in endosome to Golgi protein trafficking; mediates protein transport along the late endosome-bypass pathway from the early endosome to the Golgi. {ECO:0000269|PubMed:15331763}.FUNCTION: Hydrolyzes 2-acetyl monoalkylglycerol ether, the penultimate precursor of the pathway for de novo synthesis of platelet-activating factor (PubMed:17052608). May be responsible for cholesterol ester hydrolysis in macrophages (By similarity). Also involved in organ detoxification by hydrolyzing exogenous organophosphorus compounds (By similarity). May contribute to cancer pathogenesis by promoting tumor cell migration (PubMed:17052608). {ECO:0000250|UniProtKB:Q8BLF1, ECO:0000269|PubMed:17052608}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneGOLIM4chr3:167812887chr3:172365904ENST00000470487-1162_1262697.0Topological domainCytoplasmic
HgeneGOLIM4chr3:167812887chr3:172365904ENST00000470487-11613_3362697.0TransmembraneHelical%3B Signal-anchor for type II membrane protein
TgeneNCEH1chr3:167812887chr3:172365904ENST0000047538105113_11546409.0MotifInvolved in the stabilization of the negatively charged intermediate by the formation of the oxyanion hole
TgeneNCEH1chr3:167812887chr3:172365904ENST0000054371104113_1150276.0MotifInvolved in the stabilization of the negatively charged intermediate by the formation of the oxyanion hole
TgeneNCEH1chr3:167812887chr3:172365904ENST00000543711041_40276.0Topological domainCytoplasmic
TgeneNCEH1chr3:167812887chr3:172365904ENST000005437110426_4080276.0Topological domainLumenal
TgeneNCEH1chr3:167812887chr3:172365904ENST00000543711045_250276.0TransmembraneHelical%3B Signal-anchor for type II membrane protein

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneGOLIM4chr3:167812887chr3:172365904ENST00000470487-11635_24462697.0Coiled coilOntology_term=ECO:0000255
HgeneGOLIM4chr3:167812887chr3:172365904ENST00000470487-116311_68162697.0Compositional biasNote=Glu-rich
HgeneGOLIM4chr3:167812887chr3:172365904ENST00000470487-116404_51362697.0Compositional biasNote=Gln-rich
HgeneGOLIM4chr3:167812887chr3:172365904ENST00000470487-116176_24862697.0RegionNote=Golgi targeting
HgeneGOLIM4chr3:167812887chr3:172365904ENST00000470487-11638_10762697.0RegionNote=Golgi targeting
HgeneGOLIM4chr3:167812887chr3:172365904ENST00000470487-11680_17562697.0RegionNote=Endosome targeting
HgeneGOLIM4chr3:167812887chr3:172365904ENST00000470487-11634_69662697.0Topological domainLumenal
TgeneNCEH1chr3:167812887chr3:172365904ENST00000475381051_446409.0Topological domainCytoplasmic
TgeneNCEH1chr3:167812887chr3:172365904ENST000004753810526_40846409.0Topological domainLumenal
TgeneNCEH1chr3:167812887chr3:172365904ENST00000475381055_2546409.0TransmembraneHelical%3B Signal-anchor for type II membrane protein


Top

Fusion Gene Sequence for GOLIM4-NCEH1


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>33949_33949_1_GOLIM4-NCEH1_GOLIM4_chr3_167812887_ENST00000309027_NCEH1_chr3_172365904_ENST00000273512_length(transcript)=4165nt_BP=246nt
GCTCGGACCAAGGGGAGGAGACGGCGGGGGCGGCCGCGGCTTTGGGTCCAGGCGGGACTATGGGAAACGGGATGTGCTCCCGAAAGCAGA
AGCGGATTTTCCAGACGCTGCTGCTGCTGACCGTCGTGTTCGGCTTTCTCTACGGCGCGATGCTCTACTACGAGCTGCAGACGCAGCTGC
GGAAAGCCGAGGCGGTGGCGCTCAAGTACCAGCAGCACCAGGAGTCCCTCTCCGCCCAGTTACAAGAGTAACCTGATCCACTACCTGGGA
CTGAGCCATCACCTGCTGGCACTGAATTTTATCATTGTTTCTTTTGGCAAAAAAAGCGCGTGGTCTTCTGCCCAAGTGAAGGTGACCGAC
ACAGACTTTGATGGTGTGGAAGTCAGAGTGTTTGAAGGCCCTCCGAAGCCCGAAGAGCCACTGAAACGCAGCGTCGTTTATATCCACGGA
GGAGGCTGGGCCTTGGCAAGTGCAAAAATCAGGTATTATGATGAGCTGTGTACAGCAATGGCTGAGGAATTGAATGCTGTCATTGTTTCC
ATTGAATACAGGCTAGTTCCAAAGGTTTATTTTCCTGAGCAAATTCATGATGTTGTACGGGCCACAAAGTATTTCCTGAAGCCAGAAGTC
TTACAGAAGTATATGGTTGATCCAGGCAGAATTTGCATTTCTGGTGACAGTGCTGGTGGAAATCTGGCTGCTGCCCTTGGACAACAGTTT
ACTCAAGATGCCAGCCTAAAAAATAAGCTCAAACTACAAGCTTTAATTTATCCAGTTCTTCAAGCTTTAGATTTTAACACACCATCTTAT
CAGCAAAATGTGAACACCCCAATCCTGCCCCGCTATGTCATGGTGAAGTATTGGGTGGACTACTTCAAAGGCAACTATGACTTTGTGCAG
GCAATGATCGTTAACAATCACACTTCACTTGATGTGGAAGAGGCTGCTGCTGTCAGGGCCCGTCTAAACTGGACATCCCTCTTGCCTGCA
TCCTTCACAAAGAACTACAAGCCTGTTGTACAGACCACAGGCAATGCCAGGATTGTCCAGGAGCTTCCTCAGTTGCTGGATGCCCGCTCC
GCCCCACTCATTGCAGACCAGGCAGTGCTGCAGCTCCTCCCAAAGACCTACATTCTGACGTGTGAGCATGATGTCCTCAGAGACGATGGC
ATCATGTATGCCAAGCGTTTGGAGAGTGCCGGTGTGGAGGTGACCCTGGATCACTTTGAGGATGGCTTTCACGGATGTATGATTTTCACT
AGCTGGCCCACCAACTTCTCAGTGGGAATCCGGACTAGGAATAGTTACATCAAGTGGCTAGATCAAAACCTGTAAAGGAGCAAAACTTCC
AGAAGCCTCGAGCCCCTCTTGACCTCCTACACCTGCTTTGGAAAGACATGCACTTTTTAGTTGACTAATTCTTCCTCCCATTCCCCTCTA
CTTGCGAGTTATGGAATTTCTATTCCATAACTGAAGTCTTTATGATAACCTAATTTTTAAAAATGAATTTGACTAACTTAAGTGCAAAAC
ATGTAAATTTGGTTCCCAGAGTGGGCCAATCTCTCTGTTCTTGTTATCTTAGCCAACTATACTGATACCTACAGCTACAGAAAGCAGGAC
TAGGAACTGGAAATAACTTTGGGTCCTGCCTTCATTAGGACGTTCTTTTTAGAAGCAGTTCTTCCAGCTCTGGATCATAGAGTGACCTTT
AATAAGTTAAAAAAACGAGGACTCCTTAATTCTGCTAGAGTTAACCTTGAGTTCAGAGCAGTATTAAATGCGTGCACTTTCAGGTCAGTA
CTGGGGACCAAGTACCCTCTGGTCTTTTGTGAATGGATGGTTTTGTTTCCTATGGGAATTTTGGCAAAGGTTTTCTGGAAAGAACAAGTT
TCTCAAAGGACTTTCTTCCTCTAGAATGTTCATTTTATGAGATCGCTATCTGTAAGTCCAGTTGGATTACAGGAATACTTGAAAGTTACT
TTCTACCACTATTAGAAAATATGAAGTCGCATGCACTGGATATCTATATATCATTAGGTTTTTGTTGTGTTTTTGGTTATGCTGTCCCCC
TTCTCCTTGGGGAGATATTTGGGAGCAAACTTATTTAGATTTAGAGTAAACTTTTCATTATAGAGCAAGTAAAAACAGACAAATGAAACA
ACCTAGTGTTTCACATAAAAATACTTCTGACATAAAGTACCAAGAGCAGTGTGAATATACTTGGCATAGTCAAAAAAGAAAATACATTTA
ATATTAGTTCAAAATTGTTAAAAATACCTTTAGAAGGTCTAGTCTATTATTGAAAACTCAATTTTTTCACTTATATGGCTTTAAAATGGA
GCTATTTTGCTACAATATAATGTATTGTTTATTTTTTTAAGTTATTTAATGTTAATATACATAGCTAGACTTAAGGTTTTTCAGAAAGAT
GTCCATAATAAATATTAAAAACAATGGTATTTTTTAAAAAACTGCCTTAGGGTTTTAAAACCTTCCCTACAGTTATAACCACGTGTAATT
TTGTGGAAATGATATAACAGCTATTAATACTACTATAACATAGGCATAAATATTTTCGTGTTTATATGCATATACAAGTTAAAATAATTA
GAAACTATGACTGCGCCTAGTAAAGTCATCTAGGTTTATAGTTCAGTAGCTTAGGCAAGGCACACACTGCTCATCTCCGCTTTTTAGGGT
CAGAGGAACACAAGCTCATGTTCTGAGTGAAGGGCGTACACTGGCACCTGGTGTTGCCTAGATCCCCCATCTCCTCCTTCCAGCCAGGTC
TGGAAGTTTCAACAGCCCAAGCTTAACTTCATGTAAAGTCTTCACTGCCAGTGGGAACATCTTTGACACAACAAGACACTCCAATTGTGA
TTTGAGTTGAGGATCTCTGCCTGCCTTCCTGCCGTCCTTCCTTCTTCCCCGATCCATGCTACTTTTAGGGGCTGCGGAGAGCAGCAGCAG
AGCTGAGTAATGATACAGGGCACCACGGAGAGAAAGTAGAACCATTTCACTCCTGGGAAGATGGGGTATTTCCCACTTCCAGCAACGAAA
TAACAAATGAAAAGTTGCATACTTATTGATGTATTGTATGAGCCAGTAGCATTTTATGTACAAAACAGAAGTCAATGCAACAGTATGTAT
GTGTGCCTGTGTGTGTATAAAAATAACCATTGAAGCTAACTTGCTAATGTACTTAGGCAAGCCACTTCCCATCTCTGGGCCTCGTCTTTC
CTCCCTCTAAAATCAAAGAGCTGAATTATGTGATCCTTGAGGTCTCTTCCACTTATAATACCAACTGTCTTGTCAGACTGGCAAATTATA
TTGGCCTCTCCTTATGTGGTGGTTTTTTTGGTAGGTCATAGTTCCTTATACACAGACACCTGCATCATCGAAGGTCTTTTTTTCCTAAAA
AAAAAAAATGGGATTTTAGTTCTTATTCTGTGATAACTATCCTCCTCATATAATACTATTCTTTTTGACACCATTTGAAGGAACCAATAT
TTGGACCTTATTTTGAGGTTGTCTGTCTCGAAGAAAAAGAAAATAAAATGTATAGGCAGGGTTCCTTCAATTGGCATTTTCCCCAGAATT
GTGAGCCAAAGCCTATAGTAATTGCAGACAGCAAATGATTCCGGATCTCTAAAAGGCTCTCTCAGATGAAAAGGGAGTAAAGGAAAAAAG
AGGTCAACCACTGTTTCTGATAATGTACTTGAGTTTCATTGTTCTTTTAGTTTGTATTCTTATAAAAAATGTTTACACTCTGCAGATTGA
TTTTTTTTTTTTAGTACTGTGGCTTTCTTTTCCTATTTTATGAAAAAAATGATAATCTTTTTGTAAAATTGTCTGTGAAATATAAACATT
AATATATAAAGAAAAACCTTGAAGTGCTGTATAGTGAAGTATAAATTAATGTTTTATTGATTTGTGAAGAATTTAAGACTATTATATAAT
TATCTTGGTGGATCTATTTTATGCATGACCTTTTAACCTTTGACTTTGCTTATTTCCCACTACGAAGGGGAAGGTAGATTTTATGAATGA
TTTTAATAGCAAATATATTTTATAAAGTGAAAATCCAGTGTGGAGGTAGCAAAGCATCTATCTATTCTGAATCATGTTTGGAAATAAAAT

>33949_33949_1_GOLIM4-NCEH1_GOLIM4_chr3_167812887_ENST00000309027_NCEH1_chr3_172365904_ENST00000273512_length(amino acids)=360AA_BP=
MIHYLGLSHHLLALNFIIVSFGKKSAWSSAQVKVTDTDFDGVEVRVFEGPPKPEEPLKRSVVYIHGGGWALASAKIRYYDELCTAMAEEL
NAVIVSIEYRLVPKVYFPEQIHDVVRATKYFLKPEVLQKYMVDPGRICISGDSAGGNLAAALGQQFTQDASLKNKLKLQALIYPVLQALD
FNTPSYQQNVNTPILPRYVMVKYWVDYFKGNYDFVQAMIVNNHTSLDVEEAAAVRARLNWTSLLPASFTKNYKPVVQTTGNARIVQELPQ
LLDARSAPLIADQAVLQLLPKTYILTCEHDVLRDDGIMYAKRLESAGVEVTLDHFEDGFHGCMIFTSWPTNFSVGIRTRNSYIKWLDQNL

--------------------------------------------------------------
>33949_33949_2_GOLIM4-NCEH1_GOLIM4_chr3_167812887_ENST00000309027_NCEH1_chr3_172365904_ENST00000475381_length(transcript)=4561nt_BP=246nt
GCTCGGACCAAGGGGAGGAGACGGCGGGGGCGGCCGCGGCTTTGGGTCCAGGCGGGACTATGGGAAACGGGATGTGCTCCCGAAAGCAGA
AGCGGATTTTCCAGACGCTGCTGCTGCTGACCGTCGTGTTCGGCTTTCTCTACGGCGCGATGCTCTACTACGAGCTGCAGACGCAGCTGC
GGAAAGCCGAGGCGGTGGCGCTCAAGTACCAGCAGCACCAGGAGTCCCTCTCCGCCCAGTTACAAGAGTAACCTGATCCACTACCTGGGA
CTGAGCCATCACCTGCTGGCACTGAATTTTATCATTGTTTCTTTTGGCAAAAAAAGCGCGTGGTCTTCTGCCCAAGTGAAGGTGACCGAC
ACAGACTTTGATGGTGTGGAAGTCAGAGTGTTTGAAGGCCCTCCGAAGCCCGAAGAGCCACTGAAACGCAGCGTCGTTTATATCCACGGA
GGAGGCTGGGCCTTGGCAAGTGCAAAAATCAGGTATTATGATGAGCTGTGTACAGCAATGGCTGAGGAATTGAATGCTGTCATTGTTTCC
ATTGAATACAGGCTAGTTCCAAAGGTTTATTTTCCTGAGCAAATTCATGATGTTGTACGGGCCACAAAGTATTTCCTGAAGCCAGAAGTC
TTACAGAAGTATATGGTTGATCCAGGCAGAATTTGCATTTCTGGTGACAGTGCTGGTGGAAATCTGGCTGCTGCCCTTGGACAACAGTTT
ACTCAAGATGCCAGCCTAAAAAATAAGCTCAAACTACAAGCTTTAATTTATCCAGTTCTTCAAGCTTTAGATTTTAACACACCATCTTAT
CAGCAAAATGTGAACACCCCAATCCTGCCCCGCTATGTCATGGTGAAGTATTGGGTGGACTACTTCAAAGGCAACTATGACTTTGTGCAG
GCAATGATCGTTAACAATCACACTTCACTTGATGTGGAAGAGGCTGCTGCTGTCAGGGCCCGTCTAAACTGGACATCCCTCTTGCCTGCA
TCCTTCACAAAGAACTACAAGCCTGTTGTACAGACCACAGGCAATGCCAGGATTGTCCAGGAGCTTCCTCAGTTGCTGGATGCCCGCTCC
GCCCCACTCATTGCAGACCAGGCAGTGCTGCAGCTCCTCCCAAAGACCTACATTCTGACGTGTGAGCATGATGTCCTCAGAGACGATGGC
ATCATGTATGCCAAGCGTTTGGAGAGTGCCGGTGTGGAGGTGACCCTGGATCACTTTGAGGATGGCTTTCACGGATGTATGATTTTCACT
AGCTGGCCCACCAACTTCTCAGTGGGAATCCGGACTAGGAATAGTTACATCAAGTGGCTAGATCAAAACCTGTAAAGGAGCAAAACTTCC
AGAAGCCTCGAGCCCCTCTTGACCTCCTACACCTGCTTTGGAAAGACATGCACTTTTTAGTTGACTAATTCTTCCTCCCATTCCCCTCTA
CTTGCGAGTTATGGAATTTCTATTCCATAACTGAAGTCTTTATGATAACCTAATTTTTAAAAATGAATTTGACTAACTTAAGTGCAAAAC
ATGTAAATTTGGTTCCCAGAGTGGGCCAATCTCTCTGTTCTTGTTATCTTAGCCAACTATACTGATACCTACAGCTACAGAAAGCAGGAC
TAGGAACTGGAAATAACTTTGGGTCCTGCCTTCATTAGGACGTTCTTTTTAGAAGCAGTTCTTCCAGCTCTGGATCATAGAGTGACCTTT
AATAAGTTAAAAAAACGAGGACTCCTTAATTCTGCTAGAGTTAACCTTGAGTTCAGAGCAGTATTAAATGCGTGCACTTTCAGGTCAGTA
CTGGGGACCAAGTACCCTCTGGTCTTTTGTGAATGGATGGTTTTGTTTCCTATGGGAATTTTGGCAAAGGTTTTCTGGAAAGAACAAGTT
TCTCAAAGGACTTTCTTCCTCTAGAATGTTCATTTTATGAGATCGCTATCTGTAAGTCCAGTTGGATTACAGGAATACTTGAAAGTTACT
TTCTACCACTATTAGAAAATATGAAGTCGCATGCACTGGATATCTATATATCATTAGGTTTTTGTTGTGTTTTTGGTTATGCTGTCCCCC
TTCTCCTTGGGGAGATATTTGGGAGCAAACTTATTTAGATTTAGAGTAAACTTTTCATTATAGAGCAAGTAAAAACAGACAAATGAAACA
ACCTAGTGTTTCACATAAAAATACTTCTGACATAAAGTACCAAGAGCAGTGTGAATATACTTGGCATAGTCAAAAAAGAAAATACATTTA
ATATTAGTTCAAAATTGTTAAAAATACCTTTAGAAGGTCTAGTCTATTATTGAAAACTCAATTTTTTCACTTATATGGCTTTAAAATGGA
GCTATTTTGCTACAATATAATGTATTGTTTATTTTTTTAAGTTATTTAATGTTAATATACATAGCTAGACTTAAGGTTTTTCAGAAAGAT
GTCCATAATAAATATTAAAAACAATGGTATTTTTTAAAAAACTGCCTTAGGGTTTTAAAACCTTCCCTACAGTTATAACCACGTGTAATT
TTGTGGAAATGATATAACAGCTATTAATACTACTATAACATAGGCATAAATATTTTCGTGTTTATATGCATATACAAGTTAAAATAATTA
GAAACTATGACTGCGCCTAGTAAAGTCATCTAGGTTTATAGTTCAGTAGCTTAGGCAAGGCACACACTGCTCATCTCCGCTTTTTAGGGT
CAGAGGAACACAAGCTCATGTTCTGAGTGAAGGGCGTACACTGGCACCTGGTGTTGCCTAGATCCCCCATCTCCTCCTTCCAGCCAGGTC
TGGAAGTTTCAACAGCCCAAGCTTAACTTCATGTAAAGTCTTCACTGCCAGTGGGAACATCTTTGACACAACAAGACACTCCAATTGTGA
TTTGAGTTGAGGATCTCTGCCTGCCTTCCTGCCGTCCTTCCTTCTTCCCCGATCCATGCTACTTTTAGGGGCTGCGGAGAGCAGCAGCAG
AGCTGAGTAATGATACAGGGCACCACGGAGAGAAAGTAGAACCATTTCACTCCTGGGAAGATGGGGTATTTCCCACTTCCAGCAACGAAA
TAACAAATGAAAAGTTGCATACTTATTGATGTATTGTATGAGCCAGTAGCATTTTATGTACAAAACAGAAGTCAATGCAACAGTATGTAT
GTGTGCCTGTGTGTGTATAAAAATAACCATTGAAGCTAACTTGCTAATGTACTTAGGCAAGCCACTTCCCATCTCTGGGCCTCGTCTTTC
CTCCCTCTAAAATCAAAGAGCTGAATTATGTGATCCTTGAGGTCTCTTCCACTTATAATACCAACTGTCTTGTCAGACTGGCAAATTATA
TTGGCCTCTCCTTATGTGGTGGTTTTTTTGGTAGGTCATAGTTCCTTATACACAGACACCTGCATCATCGAAGGTCTTTTTTTCCTAAAA
AAAAAAAATGGGATTTTAGTTCTTATTCTGTGATAACTATCCTCCTCATATAATACTATTCTTTTTGACACCATTTGAAGGAACCAATAT
TTGGACCTTATTTTGAGGTTGTCTGTCTCGAAGAAAAAGAAAATAAAATGTATAGGCAGGGTTCCTTCAATTGGCATTTTCCCCAGAATT
GTGAGCCAAAGCCTATAGTAATTGCAGACAGCAAATGATTCCGGATCTCTAAAAGGCTCTCTCAGATGAAAAGGGAGTAAAGGAAAAAAG
AGGTCAACCACTGTTTCTGATAATGTACTTGAGTTTCATTGTTCTTTTAGTTTGTATTCTTATAAAAAATGTTTACACTCTGCAGATTGA
TTTTTTTTTTTTAGTACTGTGGCTTTCTTTTCCTATTTTATGAAAAAAATGATAATCTTTTTGTAAAATTGTCTGTGAAATATAAACATT
AATATATAAAGAAAAACCTTGAAGTGCTGTATAGTGAAGTATAAATTAATGTTTTATTGATTTGTGAAGAATTTAAGACTATTATATAAT
TATCTTGGTGGATCTATTTTATGCATGACCTTTTAACCTTTGACTTTGCTTATTTCCCACTACGAAGGGGAAGGTAGATTTTATGAATGA
TTTTAATAGCAAATATATTTTATAAAGTGAAAATCCAGTGTGGAGGTAGCAAAGCATCTATCTATTCTGAATCATGTTTGGAAATAAAAT
TGCTCCATCTGGGAATGTGCTTTCATTTTCCCGTCTCATTTTCTGTTTCCCATTTGAAAACAGTGTTCTCTTCTCTTCTTTCTGTATGCC
AAATTGCCAGCCACAATTCTTTCCCAATCTTCTGCCCAGTGGAGTACTCCAGCTGTCTTTCCTTTAGAGAGGTGGTTCAGTGGCCTCCTT
GATGACATTTGTTTCATGTCTGGATACAGTATGTGTTGCCCTCCTTTGGACTGGCTAGAATATTCATAAAAGCCAGGCAGGCCCTGGGAC
TATTTTGGGACCTTCAATACACATGTTAGGAAAAGGATGATCATAATGCCAAGAGTTTGAGCTGAATTGTTTCAGCCAAAGCATCACTAT

>33949_33949_2_GOLIM4-NCEH1_GOLIM4_chr3_167812887_ENST00000309027_NCEH1_chr3_172365904_ENST00000475381_length(amino acids)=360AA_BP=
MIHYLGLSHHLLALNFIIVSFGKKSAWSSAQVKVTDTDFDGVEVRVFEGPPKPEEPLKRSVVYIHGGGWALASAKIRYYDELCTAMAEEL
NAVIVSIEYRLVPKVYFPEQIHDVVRATKYFLKPEVLQKYMVDPGRICISGDSAGGNLAAALGQQFTQDASLKNKLKLQALIYPVLQALD
FNTPSYQQNVNTPILPRYVMVKYWVDYFKGNYDFVQAMIVNNHTSLDVEEAAAVRARLNWTSLLPASFTKNYKPVVQTTGNARIVQELPQ
LLDARSAPLIADQAVLQLLPKTYILTCEHDVLRDDGIMYAKRLESAGVEVTLDHFEDGFHGCMIFTSWPTNFSVGIRTRNSYIKWLDQNL

--------------------------------------------------------------
>33949_33949_3_GOLIM4-NCEH1_GOLIM4_chr3_167812887_ENST00000309027_NCEH1_chr3_172365904_ENST00000538775_length(transcript)=4189nt_BP=246nt
GCTCGGACCAAGGGGAGGAGACGGCGGGGGCGGCCGCGGCTTTGGGTCCAGGCGGGACTATGGGAAACGGGATGTGCTCCCGAAAGCAGA
AGCGGATTTTCCAGACGCTGCTGCTGCTGACCGTCGTGTTCGGCTTTCTCTACGGCGCGATGCTCTACTACGAGCTGCAGACGCAGCTGC
GGAAAGCCGAGGCGGTGGCGCTCAAGTACCAGCAGCACCAGGAGTCCCTCTCCGCCCAGTTACAAGAGTAACCTGATCCACTACCTGGGA
CTGAGCCATCACCTGCTGGCACTGAATTTTATCATTGTTTCTTTTGGCAAAAAAAGCGCGTGGTCTTCTGCCCAAGTGAAGGTGACCGAC
ACAGACTTTGATGGTGTGGAAGTCAGAGTGTTTGAAGGCCCTCCGAAGCCCGAAGAGCCACTGAAACGCAGCGTCGTTTATATCCACGGA
GGAGGCTGGGCCTTGGCAAGTGCAAGTGCGTCCTGGTCACCTTCAGATGAAATCAGGTATTATGATGAGCTGTGTACAGCAATGGCTGAG
GAATTGAATGCTGTCATTGTTTCCATTGAATACAGGCTAGTTCCAAAGGTTTATTTTCCTGAGCAAATTCATGATGTTGTACGGGCCACA
AAGTATTTCCTGAAGCCAGAAGTCTTACAGAAGTATATGGTTGATCCAGGCAGAATTTGCATTTCTGGTGACAGTGCTGGTGGAAATCTG
GCTGCTGCCCTTGGACAACAGTTTACTCAAGATGCCAGCCTAAAAAATAAGCTCAAACTACAAGCTTTAATTTATCCAGTTCTTCAAGCT
TTAGATTTTAACACACCATCTTATCAGCAAAATGTGAACACCCCAATCCTGCCCCGCTATGTCATGGTGAAGTATTGGGTGGACTACTTC
AAAGGCAACTATGACTTTGTGCAGGCAATGATCGTTAACAATCACACTTCACTTGATGTGGAAGAGGCTGCTGCTGTCAGGGCCCGTCTA
AACTGGACATCCCTCTTGCCTGCATCCTTCACAAAGAACTACAAGCCTGTTGTACAGACCACAGGCAATGCCAGGATTGTCCAGGAGCTT
CCTCAGTTGCTGGATGCCCGCTCCGCCCCACTCATTGCAGACCAGGCAGTGCTGCAGCTCCTCCCAAAGACCTACATTCTGACGTGTGAG
CATGATGTCCTCAGAGACGATGGCATCATGTATGCCAAGCGTTTGGAGAGTGCCGGTGTGGAGGTGACCCTGGATCACTTTGAGGATGGC
TTTCACGGATGTATGATTTTCACTAGCTGGCCCACCAACTTCTCAGTGGGAATCCGGACTAGGAATAGTTACATCAAGTGGCTAGATCAA
AACCTGTAAAGGAGCAAAACTTCCAGAAGCCTCGAGCCCCTCTTGACCTCCTACACCTGCTTTGGAAAGACATGCACTTTTTAGTTGACT
AATTCTTCCTCCCATTCCCCTCTACTTGCGAGTTATGGAATTTCTATTCCATAACTGAAGTCTTTATGATAACCTAATTTTTAAAAATGA
ATTTGACTAACTTAAGTGCAAAACATGTAAATTTGGTTCCCAGAGTGGGCCAATCTCTCTGTTCTTGTTATCTTAGCCAACTATACTGAT
ACCTACAGCTACAGAAAGCAGGACTAGGAACTGGAAATAACTTTGGGTCCTGCCTTCATTAGGACGTTCTTTTTAGAAGCAGTTCTTCCA
GCTCTGGATCATAGAGTGACCTTTAATAAGTTAAAAAAACGAGGACTCCTTAATTCTGCTAGAGTTAACCTTGAGTTCAGAGCAGTATTA
AATGCGTGCACTTTCAGGTCAGTACTGGGGACCAAGTACCCTCTGGTCTTTTGTGAATGGATGGTTTTGTTTCCTATGGGAATTTTGGCA
AAGGTTTTCTGGAAAGAACAAGTTTCTCAAAGGACTTTCTTCCTCTAGAATGTTCATTTTATGAGATCGCTATCTGTAAGTCCAGTTGGA
TTACAGGAATACTTGAAAGTTACTTTCTACCACTATTAGAAAATATGAAGTCGCATGCACTGGATATCTATATATCATTAGGTTTTTGTT
GTGTTTTTGGTTATGCTGTCCCCCTTCTCCTTGGGGAGATATTTGGGAGCAAACTTATTTAGATTTAGAGTAAACTTTTCATTATAGAGC
AAGTAAAAACAGACAAATGAAACAACCTAGTGTTTCACATAAAAATACTTCTGACATAAAGTACCAAGAGCAGTGTGAATATACTTGGCA
TAGTCAAAAAAGAAAATACATTTAATATTAGTTCAAAATTGTTAAAAATACCTTTAGAAGGTCTAGTCTATTATTGAAAACTCAATTTTT
TCACTTATATGGCTTTAAAATGGAGCTATTTTGCTACAATATAATGTATTGTTTATTTTTTTAAGTTATTTAATGTTAATATACATAGCT
AGACTTAAGGTTTTTCAGAAAGATGTCCATAATAAATATTAAAAACAATGGTATTTTTTAAAAAACTGCCTTAGGGTTTTAAAACCTTCC
CTACAGTTATAACCACGTGTAATTTTGTGGAAATGATATAACAGCTATTAATACTACTATAACATAGGCATAAATATTTTCGTGTTTATA
TGCATATACAAGTTAAAATAATTAGAAACTATGACTGCGCCTAGTAAAGTCATCTAGGTTTATAGTTCAGTAGCTTAGGCAAGGCACACA
CTGCTCATCTCCGCTTTTTAGGGTCAGAGGAACACAAGCTCATGTTCTGAGTGAAGGGCGTACACTGGCACCTGGTGTTGCCTAGATCCC
CCATCTCCTCCTTCCAGCCAGGTCTGGAAGTTTCAACAGCCCAAGCTTAACTTCATGTAAAGTCTTCACTGCCAGTGGGAACATCTTTGA
CACAACAAGACACTCCAATTGTGATTTGAGTTGAGGATCTCTGCCTGCCTTCCTGCCGTCCTTCCTTCTTCCCCGATCCATGCTACTTTT
AGGGGCTGCGGAGAGCAGCAGCAGAGCTGAGTAATGATACAGGGCACCACGGAGAGAAAGTAGAACCATTTCACTCCTGGGAAGATGGGG
TATTTCCCACTTCCAGCAACGAAATAACAAATGAAAAGTTGCATACTTATTGATGTATTGTATGAGCCAGTAGCATTTTATGTACAAAAC
AGAAGTCAATGCAACAGTATGTATGTGTGCCTGTGTGTGTATAAAAATAACCATTGAAGCTAACTTGCTAATGTACTTAGGCAAGCCACT
TCCCATCTCTGGGCCTCGTCTTTCCTCCCTCTAAAATCAAAGAGCTGAATTATGTGATCCTTGAGGTCTCTTCCACTTATAATACCAACT
GTCTTGTCAGACTGGCAAATTATATTGGCCTCTCCTTATGTGGTGGTTTTTTTGGTAGGTCATAGTTCCTTATACACAGACACCTGCATC
ATCGAAGGTCTTTTTTTCCTAAAAAAAAAAAATGGGATTTTAGTTCTTATTCTGTGATAACTATCCTCCTCATATAATACTATTCTTTTT
GACACCATTTGAAGGAACCAATATTTGGACCTTATTTTGAGGTTGTCTGTCTCGAAGAAAAAGAAAATAAAATGTATAGGCAGGGTTCCT
TCAATTGGCATTTTCCCCAGAATTGTGAGCCAAAGCCTATAGTAATTGCAGACAGCAAATGATTCCGGATCTCTAAAAGGCTCTCTCAGA
TGAAAAGGGAGTAAAGGAAAAAAGAGGTCAACCACTGTTTCTGATAATGTACTTGAGTTTCATTGTTCTTTTAGTTTGTATTCTTATAAA
AAATGTTTACACTCTGCAGATTGATTTTTTTTTTTTAGTACTGTGGCTTTCTTTTCCTATTTTATGAAAAAAATGATAATCTTTTTGTAA
AATTGTCTGTGAAATATAAACATTAATATATAAAGAAAAACCTTGAAGTGCTGTATAGTGAAGTATAAATTAATGTTTTATTGATTTGTG
AAGAATTTAAGACTATTATATAATTATCTTGGTGGATCTATTTTATGCATGACCTTTTAACCTTTGACTTTGCTTATTTCCCACTACGAA
GGGGAAGGTAGATTTTATGAATGATTTTAATAGCAAATATATTTTATAAAGTGAAAATCCAGTGTGGAGGTAGCAAAGCATCTATCTATT

>33949_33949_3_GOLIM4-NCEH1_GOLIM4_chr3_167812887_ENST00000309027_NCEH1_chr3_172365904_ENST00000538775_length(amino acids)=368AA_BP=
MIHYLGLSHHLLALNFIIVSFGKKSAWSSAQVKVTDTDFDGVEVRVFEGPPKPEEPLKRSVVYIHGGGWALASASASWSPSDEIRYYDEL
CTAMAEELNAVIVSIEYRLVPKVYFPEQIHDVVRATKYFLKPEVLQKYMVDPGRICISGDSAGGNLAAALGQQFTQDASLKNKLKLQALI
YPVLQALDFNTPSYQQNVNTPILPRYVMVKYWVDYFKGNYDFVQAMIVNNHTSLDVEEAAAVRARLNWTSLLPASFTKNYKPVVQTTGNA
RIVQELPQLLDARSAPLIADQAVLQLLPKTYILTCEHDVLRDDGIMYAKRLESAGVEVTLDHFEDGFHGCMIFTSWPTNFSVGIRTRNSY

--------------------------------------------------------------
>33949_33949_4_GOLIM4-NCEH1_GOLIM4_chr3_167812887_ENST00000470487_NCEH1_chr3_172365904_ENST00000273512_length(transcript)=4796nt_BP=877nt
AATGAGCAAGGAGGCCGAGTGGGACTTCCTCCCGGAATCCCGTTGGCCAGAATAGCCGGGCCGTGGGTGACACGTAAGTTGGGCAGGAGG
TGGCGGGGCGGCAGAGGCACCAGCCGACCCGTCAGTGACACCGCTGTGCCGTCCCCAAAACCAGCCGAGACAGCTGGCCCCCACCCTTCC
ACCCATTGGGCAGGCCGCACGGGGGCGCGGCCCGGAGTCCTGGTCCCTTTGTTGGGCGCGCACCCCCTCCCTTAGGTGGCAACAAAGTCG
TGCAGTGGGAGCCGCCGCGATAGGGCGGGGAGTGGCCAGGGCGGGACTCCAAGAACTGCCCGGGGGCAGCGGGGCCAAAAAGTGGGAAGA
AGGAAAAAAGGCAGGAGGCATCTGGGGACAGGCGCGAGGGCAGCCGGCTCTGAAGTATGCGGAGGGCCTCCTCCCGGCCCCGGGCATTCG
CGGAGAACGAGCCTCGCAGAAGTTTGGCTGCAGCTGCCCGGGCGGCGTCGATGGCTGCGCGCCCCGCGCCGCGCGGGGGCTGAGCGGGCG
CCACTTCCCCTCCGGGCCGGCTTTTGTGTCTGGCATCTCCTCCTCATGCTGCGTCTGGCCACCTACTGCGGCGGCCGCTGCTGAGACGCT
CGCTCGGACCAAGGGGAGGAGACGGCGGGGGCGGCCGCGGCTTTGGGTCCAGGCGGGACTATGGGAAACGGGATGTGCTCCCGAAAGCAG
AAGCGGATTTTCCAGACGCTGCTGCTGCTGACCGTCGTGTTCGGCTTTCTCTACGGCGCGATGCTCTACTACGAGCTGCAGACGCAGCTG
CGGAAAGCCGAGGCGGTGGCGCTCAAGTACCAGCAGCACCAGGAGTCCCTCTCCGCCCAGTTACAAGAGTAACCTGATCCACTACCTGGG
ACTGAGCCATCACCTGCTGGCACTGAATTTTATCATTGTTTCTTTTGGCAAAAAAAGCGCGTGGTCTTCTGCCCAAGTGAAGGTGACCGA
CACAGACTTTGATGGTGTGGAAGTCAGAGTGTTTGAAGGCCCTCCGAAGCCCGAAGAGCCACTGAAACGCAGCGTCGTTTATATCCACGG
AGGAGGCTGGGCCTTGGCAAGTGCAAAAATCAGGTATTATGATGAGCTGTGTACAGCAATGGCTGAGGAATTGAATGCTGTCATTGTTTC
CATTGAATACAGGCTAGTTCCAAAGGTTTATTTTCCTGAGCAAATTCATGATGTTGTACGGGCCACAAAGTATTTCCTGAAGCCAGAAGT
CTTACAGAAGTATATGGTTGATCCAGGCAGAATTTGCATTTCTGGTGACAGTGCTGGTGGAAATCTGGCTGCTGCCCTTGGACAACAGTT
TACTCAAGATGCCAGCCTAAAAAATAAGCTCAAACTACAAGCTTTAATTTATCCAGTTCTTCAAGCTTTAGATTTTAACACACCATCTTA
TCAGCAAAATGTGAACACCCCAATCCTGCCCCGCTATGTCATGGTGAAGTATTGGGTGGACTACTTCAAAGGCAACTATGACTTTGTGCA
GGCAATGATCGTTAACAATCACACTTCACTTGATGTGGAAGAGGCTGCTGCTGTCAGGGCCCGTCTAAACTGGACATCCCTCTTGCCTGC
ATCCTTCACAAAGAACTACAAGCCTGTTGTACAGACCACAGGCAATGCCAGGATTGTCCAGGAGCTTCCTCAGTTGCTGGATGCCCGCTC
CGCCCCACTCATTGCAGACCAGGCAGTGCTGCAGCTCCTCCCAAAGACCTACATTCTGACGTGTGAGCATGATGTCCTCAGAGACGATGG
CATCATGTATGCCAAGCGTTTGGAGAGTGCCGGTGTGGAGGTGACCCTGGATCACTTTGAGGATGGCTTTCACGGATGTATGATTTTCAC
TAGCTGGCCCACCAACTTCTCAGTGGGAATCCGGACTAGGAATAGTTACATCAAGTGGCTAGATCAAAACCTGTAAAGGAGCAAAACTTC
CAGAAGCCTCGAGCCCCTCTTGACCTCCTACACCTGCTTTGGAAAGACATGCACTTTTTAGTTGACTAATTCTTCCTCCCATTCCCCTCT
ACTTGCGAGTTATGGAATTTCTATTCCATAACTGAAGTCTTTATGATAACCTAATTTTTAAAAATGAATTTGACTAACTTAAGTGCAAAA
CATGTAAATTTGGTTCCCAGAGTGGGCCAATCTCTCTGTTCTTGTTATCTTAGCCAACTATACTGATACCTACAGCTACAGAAAGCAGGA
CTAGGAACTGGAAATAACTTTGGGTCCTGCCTTCATTAGGACGTTCTTTTTAGAAGCAGTTCTTCCAGCTCTGGATCATAGAGTGACCTT
TAATAAGTTAAAAAAACGAGGACTCCTTAATTCTGCTAGAGTTAACCTTGAGTTCAGAGCAGTATTAAATGCGTGCACTTTCAGGTCAGT
ACTGGGGACCAAGTACCCTCTGGTCTTTTGTGAATGGATGGTTTTGTTTCCTATGGGAATTTTGGCAAAGGTTTTCTGGAAAGAACAAGT
TTCTCAAAGGACTTTCTTCCTCTAGAATGTTCATTTTATGAGATCGCTATCTGTAAGTCCAGTTGGATTACAGGAATACTTGAAAGTTAC
TTTCTACCACTATTAGAAAATATGAAGTCGCATGCACTGGATATCTATATATCATTAGGTTTTTGTTGTGTTTTTGGTTATGCTGTCCCC
CTTCTCCTTGGGGAGATATTTGGGAGCAAACTTATTTAGATTTAGAGTAAACTTTTCATTATAGAGCAAGTAAAAACAGACAAATGAAAC
AACCTAGTGTTTCACATAAAAATACTTCTGACATAAAGTACCAAGAGCAGTGTGAATATACTTGGCATAGTCAAAAAAGAAAATACATTT
AATATTAGTTCAAAATTGTTAAAAATACCTTTAGAAGGTCTAGTCTATTATTGAAAACTCAATTTTTTCACTTATATGGCTTTAAAATGG
AGCTATTTTGCTACAATATAATGTATTGTTTATTTTTTTAAGTTATTTAATGTTAATATACATAGCTAGACTTAAGGTTTTTCAGAAAGA
TGTCCATAATAAATATTAAAAACAATGGTATTTTTTAAAAAACTGCCTTAGGGTTTTAAAACCTTCCCTACAGTTATAACCACGTGTAAT
TTTGTGGAAATGATATAACAGCTATTAATACTACTATAACATAGGCATAAATATTTTCGTGTTTATATGCATATACAAGTTAAAATAATT
AGAAACTATGACTGCGCCTAGTAAAGTCATCTAGGTTTATAGTTCAGTAGCTTAGGCAAGGCACACACTGCTCATCTCCGCTTTTTAGGG
TCAGAGGAACACAAGCTCATGTTCTGAGTGAAGGGCGTACACTGGCACCTGGTGTTGCCTAGATCCCCCATCTCCTCCTTCCAGCCAGGT
CTGGAAGTTTCAACAGCCCAAGCTTAACTTCATGTAAAGTCTTCACTGCCAGTGGGAACATCTTTGACACAACAAGACACTCCAATTGTG
ATTTGAGTTGAGGATCTCTGCCTGCCTTCCTGCCGTCCTTCCTTCTTCCCCGATCCATGCTACTTTTAGGGGCTGCGGAGAGCAGCAGCA
GAGCTGAGTAATGATACAGGGCACCACGGAGAGAAAGTAGAACCATTTCACTCCTGGGAAGATGGGGTATTTCCCACTTCCAGCAACGAA
ATAACAAATGAAAAGTTGCATACTTATTGATGTATTGTATGAGCCAGTAGCATTTTATGTACAAAACAGAAGTCAATGCAACAGTATGTA
TGTGTGCCTGTGTGTGTATAAAAATAACCATTGAAGCTAACTTGCTAATGTACTTAGGCAAGCCACTTCCCATCTCTGGGCCTCGTCTTT
CCTCCCTCTAAAATCAAAGAGCTGAATTATGTGATCCTTGAGGTCTCTTCCACTTATAATACCAACTGTCTTGTCAGACTGGCAAATTAT
ATTGGCCTCTCCTTATGTGGTGGTTTTTTTGGTAGGTCATAGTTCCTTATACACAGACACCTGCATCATCGAAGGTCTTTTTTTCCTAAA
AAAAAAAAATGGGATTTTAGTTCTTATTCTGTGATAACTATCCTCCTCATATAATACTATTCTTTTTGACACCATTTGAAGGAACCAATA
TTTGGACCTTATTTTGAGGTTGTCTGTCTCGAAGAAAAAGAAAATAAAATGTATAGGCAGGGTTCCTTCAATTGGCATTTTCCCCAGAAT
TGTGAGCCAAAGCCTATAGTAATTGCAGACAGCAAATGATTCCGGATCTCTAAAAGGCTCTCTCAGATGAAAAGGGAGTAAAGGAAAAAA
GAGGTCAACCACTGTTTCTGATAATGTACTTGAGTTTCATTGTTCTTTTAGTTTGTATTCTTATAAAAAATGTTTACACTCTGCAGATTG
ATTTTTTTTTTTTAGTACTGTGGCTTTCTTTTCCTATTTTATGAAAAAAATGATAATCTTTTTGTAAAATTGTCTGTGAAATATAAACAT
TAATATATAAAGAAAAACCTTGAAGTGCTGTATAGTGAAGTATAAATTAATGTTTTATTGATTTGTGAAGAATTTAAGACTATTATATAA
TTATCTTGGTGGATCTATTTTATGCATGACCTTTTAACCTTTGACTTTGCTTATTTCCCACTACGAAGGGGAAGGTAGATTTTATGAATG
ATTTTAATAGCAAATATATTTTATAAAGTGAAAATCCAGTGTGGAGGTAGCAAAGCATCTATCTATTCTGAATCATGTTTGGAAATAAAA

>33949_33949_4_GOLIM4-NCEH1_GOLIM4_chr3_167812887_ENST00000470487_NCEH1_chr3_172365904_ENST00000273512_length(amino acids)=360AA_BP=
MIHYLGLSHHLLALNFIIVSFGKKSAWSSAQVKVTDTDFDGVEVRVFEGPPKPEEPLKRSVVYIHGGGWALASAKIRYYDELCTAMAEEL
NAVIVSIEYRLVPKVYFPEQIHDVVRATKYFLKPEVLQKYMVDPGRICISGDSAGGNLAAALGQQFTQDASLKNKLKLQALIYPVLQALD
FNTPSYQQNVNTPILPRYVMVKYWVDYFKGNYDFVQAMIVNNHTSLDVEEAAAVRARLNWTSLLPASFTKNYKPVVQTTGNARIVQELPQ
LLDARSAPLIADQAVLQLLPKTYILTCEHDVLRDDGIMYAKRLESAGVEVTLDHFEDGFHGCMIFTSWPTNFSVGIRTRNSYIKWLDQNL

--------------------------------------------------------------
>33949_33949_5_GOLIM4-NCEH1_GOLIM4_chr3_167812887_ENST00000470487_NCEH1_chr3_172365904_ENST00000475381_length(transcript)=5192nt_BP=877nt
AATGAGCAAGGAGGCCGAGTGGGACTTCCTCCCGGAATCCCGTTGGCCAGAATAGCCGGGCCGTGGGTGACACGTAAGTTGGGCAGGAGG
TGGCGGGGCGGCAGAGGCACCAGCCGACCCGTCAGTGACACCGCTGTGCCGTCCCCAAAACCAGCCGAGACAGCTGGCCCCCACCCTTCC
ACCCATTGGGCAGGCCGCACGGGGGCGCGGCCCGGAGTCCTGGTCCCTTTGTTGGGCGCGCACCCCCTCCCTTAGGTGGCAACAAAGTCG
TGCAGTGGGAGCCGCCGCGATAGGGCGGGGAGTGGCCAGGGCGGGACTCCAAGAACTGCCCGGGGGCAGCGGGGCCAAAAAGTGGGAAGA
AGGAAAAAAGGCAGGAGGCATCTGGGGACAGGCGCGAGGGCAGCCGGCTCTGAAGTATGCGGAGGGCCTCCTCCCGGCCCCGGGCATTCG
CGGAGAACGAGCCTCGCAGAAGTTTGGCTGCAGCTGCCCGGGCGGCGTCGATGGCTGCGCGCCCCGCGCCGCGCGGGGGCTGAGCGGGCG
CCACTTCCCCTCCGGGCCGGCTTTTGTGTCTGGCATCTCCTCCTCATGCTGCGTCTGGCCACCTACTGCGGCGGCCGCTGCTGAGACGCT
CGCTCGGACCAAGGGGAGGAGACGGCGGGGGCGGCCGCGGCTTTGGGTCCAGGCGGGACTATGGGAAACGGGATGTGCTCCCGAAAGCAG
AAGCGGATTTTCCAGACGCTGCTGCTGCTGACCGTCGTGTTCGGCTTTCTCTACGGCGCGATGCTCTACTACGAGCTGCAGACGCAGCTG
CGGAAAGCCGAGGCGGTGGCGCTCAAGTACCAGCAGCACCAGGAGTCCCTCTCCGCCCAGTTACAAGAGTAACCTGATCCACTACCTGGG
ACTGAGCCATCACCTGCTGGCACTGAATTTTATCATTGTTTCTTTTGGCAAAAAAAGCGCGTGGTCTTCTGCCCAAGTGAAGGTGACCGA
CACAGACTTTGATGGTGTGGAAGTCAGAGTGTTTGAAGGCCCTCCGAAGCCCGAAGAGCCACTGAAACGCAGCGTCGTTTATATCCACGG
AGGAGGCTGGGCCTTGGCAAGTGCAAAAATCAGGTATTATGATGAGCTGTGTACAGCAATGGCTGAGGAATTGAATGCTGTCATTGTTTC
CATTGAATACAGGCTAGTTCCAAAGGTTTATTTTCCTGAGCAAATTCATGATGTTGTACGGGCCACAAAGTATTTCCTGAAGCCAGAAGT
CTTACAGAAGTATATGGTTGATCCAGGCAGAATTTGCATTTCTGGTGACAGTGCTGGTGGAAATCTGGCTGCTGCCCTTGGACAACAGTT
TACTCAAGATGCCAGCCTAAAAAATAAGCTCAAACTACAAGCTTTAATTTATCCAGTTCTTCAAGCTTTAGATTTTAACACACCATCTTA
TCAGCAAAATGTGAACACCCCAATCCTGCCCCGCTATGTCATGGTGAAGTATTGGGTGGACTACTTCAAAGGCAACTATGACTTTGTGCA
GGCAATGATCGTTAACAATCACACTTCACTTGATGTGGAAGAGGCTGCTGCTGTCAGGGCCCGTCTAAACTGGACATCCCTCTTGCCTGC
ATCCTTCACAAAGAACTACAAGCCTGTTGTACAGACCACAGGCAATGCCAGGATTGTCCAGGAGCTTCCTCAGTTGCTGGATGCCCGCTC
CGCCCCACTCATTGCAGACCAGGCAGTGCTGCAGCTCCTCCCAAAGACCTACATTCTGACGTGTGAGCATGATGTCCTCAGAGACGATGG
CATCATGTATGCCAAGCGTTTGGAGAGTGCCGGTGTGGAGGTGACCCTGGATCACTTTGAGGATGGCTTTCACGGATGTATGATTTTCAC
TAGCTGGCCCACCAACTTCTCAGTGGGAATCCGGACTAGGAATAGTTACATCAAGTGGCTAGATCAAAACCTGTAAAGGAGCAAAACTTC
CAGAAGCCTCGAGCCCCTCTTGACCTCCTACACCTGCTTTGGAAAGACATGCACTTTTTAGTTGACTAATTCTTCCTCCCATTCCCCTCT
ACTTGCGAGTTATGGAATTTCTATTCCATAACTGAAGTCTTTATGATAACCTAATTTTTAAAAATGAATTTGACTAACTTAAGTGCAAAA
CATGTAAATTTGGTTCCCAGAGTGGGCCAATCTCTCTGTTCTTGTTATCTTAGCCAACTATACTGATACCTACAGCTACAGAAAGCAGGA
CTAGGAACTGGAAATAACTTTGGGTCCTGCCTTCATTAGGACGTTCTTTTTAGAAGCAGTTCTTCCAGCTCTGGATCATAGAGTGACCTT
TAATAAGTTAAAAAAACGAGGACTCCTTAATTCTGCTAGAGTTAACCTTGAGTTCAGAGCAGTATTAAATGCGTGCACTTTCAGGTCAGT
ACTGGGGACCAAGTACCCTCTGGTCTTTTGTGAATGGATGGTTTTGTTTCCTATGGGAATTTTGGCAAAGGTTTTCTGGAAAGAACAAGT
TTCTCAAAGGACTTTCTTCCTCTAGAATGTTCATTTTATGAGATCGCTATCTGTAAGTCCAGTTGGATTACAGGAATACTTGAAAGTTAC
TTTCTACCACTATTAGAAAATATGAAGTCGCATGCACTGGATATCTATATATCATTAGGTTTTTGTTGTGTTTTTGGTTATGCTGTCCCC
CTTCTCCTTGGGGAGATATTTGGGAGCAAACTTATTTAGATTTAGAGTAAACTTTTCATTATAGAGCAAGTAAAAACAGACAAATGAAAC
AACCTAGTGTTTCACATAAAAATACTTCTGACATAAAGTACCAAGAGCAGTGTGAATATACTTGGCATAGTCAAAAAAGAAAATACATTT
AATATTAGTTCAAAATTGTTAAAAATACCTTTAGAAGGTCTAGTCTATTATTGAAAACTCAATTTTTTCACTTATATGGCTTTAAAATGG
AGCTATTTTGCTACAATATAATGTATTGTTTATTTTTTTAAGTTATTTAATGTTAATATACATAGCTAGACTTAAGGTTTTTCAGAAAGA
TGTCCATAATAAATATTAAAAACAATGGTATTTTTTAAAAAACTGCCTTAGGGTTTTAAAACCTTCCCTACAGTTATAACCACGTGTAAT
TTTGTGGAAATGATATAACAGCTATTAATACTACTATAACATAGGCATAAATATTTTCGTGTTTATATGCATATACAAGTTAAAATAATT
AGAAACTATGACTGCGCCTAGTAAAGTCATCTAGGTTTATAGTTCAGTAGCTTAGGCAAGGCACACACTGCTCATCTCCGCTTTTTAGGG
TCAGAGGAACACAAGCTCATGTTCTGAGTGAAGGGCGTACACTGGCACCTGGTGTTGCCTAGATCCCCCATCTCCTCCTTCCAGCCAGGT
CTGGAAGTTTCAACAGCCCAAGCTTAACTTCATGTAAAGTCTTCACTGCCAGTGGGAACATCTTTGACACAACAAGACACTCCAATTGTG
ATTTGAGTTGAGGATCTCTGCCTGCCTTCCTGCCGTCCTTCCTTCTTCCCCGATCCATGCTACTTTTAGGGGCTGCGGAGAGCAGCAGCA
GAGCTGAGTAATGATACAGGGCACCACGGAGAGAAAGTAGAACCATTTCACTCCTGGGAAGATGGGGTATTTCCCACTTCCAGCAACGAA
ATAACAAATGAAAAGTTGCATACTTATTGATGTATTGTATGAGCCAGTAGCATTTTATGTACAAAACAGAAGTCAATGCAACAGTATGTA
TGTGTGCCTGTGTGTGTATAAAAATAACCATTGAAGCTAACTTGCTAATGTACTTAGGCAAGCCACTTCCCATCTCTGGGCCTCGTCTTT
CCTCCCTCTAAAATCAAAGAGCTGAATTATGTGATCCTTGAGGTCTCTTCCACTTATAATACCAACTGTCTTGTCAGACTGGCAAATTAT
ATTGGCCTCTCCTTATGTGGTGGTTTTTTTGGTAGGTCATAGTTCCTTATACACAGACACCTGCATCATCGAAGGTCTTTTTTTCCTAAA
AAAAAAAAATGGGATTTTAGTTCTTATTCTGTGATAACTATCCTCCTCATATAATACTATTCTTTTTGACACCATTTGAAGGAACCAATA
TTTGGACCTTATTTTGAGGTTGTCTGTCTCGAAGAAAAAGAAAATAAAATGTATAGGCAGGGTTCCTTCAATTGGCATTTTCCCCAGAAT
TGTGAGCCAAAGCCTATAGTAATTGCAGACAGCAAATGATTCCGGATCTCTAAAAGGCTCTCTCAGATGAAAAGGGAGTAAAGGAAAAAA
GAGGTCAACCACTGTTTCTGATAATGTACTTGAGTTTCATTGTTCTTTTAGTTTGTATTCTTATAAAAAATGTTTACACTCTGCAGATTG
ATTTTTTTTTTTTAGTACTGTGGCTTTCTTTTCCTATTTTATGAAAAAAATGATAATCTTTTTGTAAAATTGTCTGTGAAATATAAACAT
TAATATATAAAGAAAAACCTTGAAGTGCTGTATAGTGAAGTATAAATTAATGTTTTATTGATTTGTGAAGAATTTAAGACTATTATATAA
TTATCTTGGTGGATCTATTTTATGCATGACCTTTTAACCTTTGACTTTGCTTATTTCCCACTACGAAGGGGAAGGTAGATTTTATGAATG
ATTTTAATAGCAAATATATTTTATAAAGTGAAAATCCAGTGTGGAGGTAGCAAAGCATCTATCTATTCTGAATCATGTTTGGAAATAAAA
TTGCTCCATCTGGGAATGTGCTTTCATTTTCCCGTCTCATTTTCTGTTTCCCATTTGAAAACAGTGTTCTCTTCTCTTCTTTCTGTATGC
CAAATTGCCAGCCACAATTCTTTCCCAATCTTCTGCCCAGTGGAGTACTCCAGCTGTCTTTCCTTTAGAGAGGTGGTTCAGTGGCCTCCT
TGATGACATTTGTTTCATGTCTGGATACAGTATGTGTTGCCCTCCTTTGGACTGGCTAGAATATTCATAAAAGCCAGGCAGGCCCTGGGA
CTATTTTGGGACCTTCAATACACATGTTAGGAAAAGGATGATCATAATGCCAAGAGTTTGAGCTGAATTGTTTCAGCCAAAGCATCACTA

>33949_33949_5_GOLIM4-NCEH1_GOLIM4_chr3_167812887_ENST00000470487_NCEH1_chr3_172365904_ENST00000475381_length(amino acids)=360AA_BP=
MIHYLGLSHHLLALNFIIVSFGKKSAWSSAQVKVTDTDFDGVEVRVFEGPPKPEEPLKRSVVYIHGGGWALASAKIRYYDELCTAMAEEL
NAVIVSIEYRLVPKVYFPEQIHDVVRATKYFLKPEVLQKYMVDPGRICISGDSAGGNLAAALGQQFTQDASLKNKLKLQALIYPVLQALD
FNTPSYQQNVNTPILPRYVMVKYWVDYFKGNYDFVQAMIVNNHTSLDVEEAAAVRARLNWTSLLPASFTKNYKPVVQTTGNARIVQELPQ
LLDARSAPLIADQAVLQLLPKTYILTCEHDVLRDDGIMYAKRLESAGVEVTLDHFEDGFHGCMIFTSWPTNFSVGIRTRNSYIKWLDQNL

--------------------------------------------------------------
>33949_33949_6_GOLIM4-NCEH1_GOLIM4_chr3_167812887_ENST00000470487_NCEH1_chr3_172365904_ENST00000538775_length(transcript)=4820nt_BP=877nt
AATGAGCAAGGAGGCCGAGTGGGACTTCCTCCCGGAATCCCGTTGGCCAGAATAGCCGGGCCGTGGGTGACACGTAAGTTGGGCAGGAGG
TGGCGGGGCGGCAGAGGCACCAGCCGACCCGTCAGTGACACCGCTGTGCCGTCCCCAAAACCAGCCGAGACAGCTGGCCCCCACCCTTCC
ACCCATTGGGCAGGCCGCACGGGGGCGCGGCCCGGAGTCCTGGTCCCTTTGTTGGGCGCGCACCCCCTCCCTTAGGTGGCAACAAAGTCG
TGCAGTGGGAGCCGCCGCGATAGGGCGGGGAGTGGCCAGGGCGGGACTCCAAGAACTGCCCGGGGGCAGCGGGGCCAAAAAGTGGGAAGA
AGGAAAAAAGGCAGGAGGCATCTGGGGACAGGCGCGAGGGCAGCCGGCTCTGAAGTATGCGGAGGGCCTCCTCCCGGCCCCGGGCATTCG
CGGAGAACGAGCCTCGCAGAAGTTTGGCTGCAGCTGCCCGGGCGGCGTCGATGGCTGCGCGCCCCGCGCCGCGCGGGGGCTGAGCGGGCG
CCACTTCCCCTCCGGGCCGGCTTTTGTGTCTGGCATCTCCTCCTCATGCTGCGTCTGGCCACCTACTGCGGCGGCCGCTGCTGAGACGCT
CGCTCGGACCAAGGGGAGGAGACGGCGGGGGCGGCCGCGGCTTTGGGTCCAGGCGGGACTATGGGAAACGGGATGTGCTCCCGAAAGCAG
AAGCGGATTTTCCAGACGCTGCTGCTGCTGACCGTCGTGTTCGGCTTTCTCTACGGCGCGATGCTCTACTACGAGCTGCAGACGCAGCTG
CGGAAAGCCGAGGCGGTGGCGCTCAAGTACCAGCAGCACCAGGAGTCCCTCTCCGCCCAGTTACAAGAGTAACCTGATCCACTACCTGGG
ACTGAGCCATCACCTGCTGGCACTGAATTTTATCATTGTTTCTTTTGGCAAAAAAAGCGCGTGGTCTTCTGCCCAAGTGAAGGTGACCGA
CACAGACTTTGATGGTGTGGAAGTCAGAGTGTTTGAAGGCCCTCCGAAGCCCGAAGAGCCACTGAAACGCAGCGTCGTTTATATCCACGG
AGGAGGCTGGGCCTTGGCAAGTGCAAGTGCGTCCTGGTCACCTTCAGATGAAATCAGGTATTATGATGAGCTGTGTACAGCAATGGCTGA
GGAATTGAATGCTGTCATTGTTTCCATTGAATACAGGCTAGTTCCAAAGGTTTATTTTCCTGAGCAAATTCATGATGTTGTACGGGCCAC
AAAGTATTTCCTGAAGCCAGAAGTCTTACAGAAGTATATGGTTGATCCAGGCAGAATTTGCATTTCTGGTGACAGTGCTGGTGGAAATCT
GGCTGCTGCCCTTGGACAACAGTTTACTCAAGATGCCAGCCTAAAAAATAAGCTCAAACTACAAGCTTTAATTTATCCAGTTCTTCAAGC
TTTAGATTTTAACACACCATCTTATCAGCAAAATGTGAACACCCCAATCCTGCCCCGCTATGTCATGGTGAAGTATTGGGTGGACTACTT
CAAAGGCAACTATGACTTTGTGCAGGCAATGATCGTTAACAATCACACTTCACTTGATGTGGAAGAGGCTGCTGCTGTCAGGGCCCGTCT
AAACTGGACATCCCTCTTGCCTGCATCCTTCACAAAGAACTACAAGCCTGTTGTACAGACCACAGGCAATGCCAGGATTGTCCAGGAGCT
TCCTCAGTTGCTGGATGCCCGCTCCGCCCCACTCATTGCAGACCAGGCAGTGCTGCAGCTCCTCCCAAAGACCTACATTCTGACGTGTGA
GCATGATGTCCTCAGAGACGATGGCATCATGTATGCCAAGCGTTTGGAGAGTGCCGGTGTGGAGGTGACCCTGGATCACTTTGAGGATGG
CTTTCACGGATGTATGATTTTCACTAGCTGGCCCACCAACTTCTCAGTGGGAATCCGGACTAGGAATAGTTACATCAAGTGGCTAGATCA
AAACCTGTAAAGGAGCAAAACTTCCAGAAGCCTCGAGCCCCTCTTGACCTCCTACACCTGCTTTGGAAAGACATGCACTTTTTAGTTGAC
TAATTCTTCCTCCCATTCCCCTCTACTTGCGAGTTATGGAATTTCTATTCCATAACTGAAGTCTTTATGATAACCTAATTTTTAAAAATG
AATTTGACTAACTTAAGTGCAAAACATGTAAATTTGGTTCCCAGAGTGGGCCAATCTCTCTGTTCTTGTTATCTTAGCCAACTATACTGA
TACCTACAGCTACAGAAAGCAGGACTAGGAACTGGAAATAACTTTGGGTCCTGCCTTCATTAGGACGTTCTTTTTAGAAGCAGTTCTTCC
AGCTCTGGATCATAGAGTGACCTTTAATAAGTTAAAAAAACGAGGACTCCTTAATTCTGCTAGAGTTAACCTTGAGTTCAGAGCAGTATT
AAATGCGTGCACTTTCAGGTCAGTACTGGGGACCAAGTACCCTCTGGTCTTTTGTGAATGGATGGTTTTGTTTCCTATGGGAATTTTGGC
AAAGGTTTTCTGGAAAGAACAAGTTTCTCAAAGGACTTTCTTCCTCTAGAATGTTCATTTTATGAGATCGCTATCTGTAAGTCCAGTTGG
ATTACAGGAATACTTGAAAGTTACTTTCTACCACTATTAGAAAATATGAAGTCGCATGCACTGGATATCTATATATCATTAGGTTTTTGT
TGTGTTTTTGGTTATGCTGTCCCCCTTCTCCTTGGGGAGATATTTGGGAGCAAACTTATTTAGATTTAGAGTAAACTTTTCATTATAGAG
CAAGTAAAAACAGACAAATGAAACAACCTAGTGTTTCACATAAAAATACTTCTGACATAAAGTACCAAGAGCAGTGTGAATATACTTGGC
ATAGTCAAAAAAGAAAATACATTTAATATTAGTTCAAAATTGTTAAAAATACCTTTAGAAGGTCTAGTCTATTATTGAAAACTCAATTTT
TTCACTTATATGGCTTTAAAATGGAGCTATTTTGCTACAATATAATGTATTGTTTATTTTTTTAAGTTATTTAATGTTAATATACATAGC
TAGACTTAAGGTTTTTCAGAAAGATGTCCATAATAAATATTAAAAACAATGGTATTTTTTAAAAAACTGCCTTAGGGTTTTAAAACCTTC
CCTACAGTTATAACCACGTGTAATTTTGTGGAAATGATATAACAGCTATTAATACTACTATAACATAGGCATAAATATTTTCGTGTTTAT
ATGCATATACAAGTTAAAATAATTAGAAACTATGACTGCGCCTAGTAAAGTCATCTAGGTTTATAGTTCAGTAGCTTAGGCAAGGCACAC
ACTGCTCATCTCCGCTTTTTAGGGTCAGAGGAACACAAGCTCATGTTCTGAGTGAAGGGCGTACACTGGCACCTGGTGTTGCCTAGATCC
CCCATCTCCTCCTTCCAGCCAGGTCTGGAAGTTTCAACAGCCCAAGCTTAACTTCATGTAAAGTCTTCACTGCCAGTGGGAACATCTTTG
ACACAACAAGACACTCCAATTGTGATTTGAGTTGAGGATCTCTGCCTGCCTTCCTGCCGTCCTTCCTTCTTCCCCGATCCATGCTACTTT
TAGGGGCTGCGGAGAGCAGCAGCAGAGCTGAGTAATGATACAGGGCACCACGGAGAGAAAGTAGAACCATTTCACTCCTGGGAAGATGGG
GTATTTCCCACTTCCAGCAACGAAATAACAAATGAAAAGTTGCATACTTATTGATGTATTGTATGAGCCAGTAGCATTTTATGTACAAAA
CAGAAGTCAATGCAACAGTATGTATGTGTGCCTGTGTGTGTATAAAAATAACCATTGAAGCTAACTTGCTAATGTACTTAGGCAAGCCAC
TTCCCATCTCTGGGCCTCGTCTTTCCTCCCTCTAAAATCAAAGAGCTGAATTATGTGATCCTTGAGGTCTCTTCCACTTATAATACCAAC
TGTCTTGTCAGACTGGCAAATTATATTGGCCTCTCCTTATGTGGTGGTTTTTTTGGTAGGTCATAGTTCCTTATACACAGACACCTGCAT
CATCGAAGGTCTTTTTTTCCTAAAAAAAAAAAATGGGATTTTAGTTCTTATTCTGTGATAACTATCCTCCTCATATAATACTATTCTTTT
TGACACCATTTGAAGGAACCAATATTTGGACCTTATTTTGAGGTTGTCTGTCTCGAAGAAAAAGAAAATAAAATGTATAGGCAGGGTTCC
TTCAATTGGCATTTTCCCCAGAATTGTGAGCCAAAGCCTATAGTAATTGCAGACAGCAAATGATTCCGGATCTCTAAAAGGCTCTCTCAG
ATGAAAAGGGAGTAAAGGAAAAAAGAGGTCAACCACTGTTTCTGATAATGTACTTGAGTTTCATTGTTCTTTTAGTTTGTATTCTTATAA
AAAATGTTTACACTCTGCAGATTGATTTTTTTTTTTTAGTACTGTGGCTTTCTTTTCCTATTTTATGAAAAAAATGATAATCTTTTTGTA
AAATTGTCTGTGAAATATAAACATTAATATATAAAGAAAAACCTTGAAGTGCTGTATAGTGAAGTATAAATTAATGTTTTATTGATTTGT
GAAGAATTTAAGACTATTATATAATTATCTTGGTGGATCTATTTTATGCATGACCTTTTAACCTTTGACTTTGCTTATTTCCCACTACGA
AGGGGAAGGTAGATTTTATGAATGATTTTAATAGCAAATATATTTTATAAAGTGAAAATCCAGTGTGGAGGTAGCAAAGCATCTATCTAT

>33949_33949_6_GOLIM4-NCEH1_GOLIM4_chr3_167812887_ENST00000470487_NCEH1_chr3_172365904_ENST00000538775_length(amino acids)=368AA_BP=
MIHYLGLSHHLLALNFIIVSFGKKSAWSSAQVKVTDTDFDGVEVRVFEGPPKPEEPLKRSVVYIHGGGWALASASASWSPSDEIRYYDEL
CTAMAEELNAVIVSIEYRLVPKVYFPEQIHDVVRATKYFLKPEVLQKYMVDPGRICISGDSAGGNLAAALGQQFTQDASLKNKLKLQALI
YPVLQALDFNTPSYQQNVNTPILPRYVMVKYWVDYFKGNYDFVQAMIVNNHTSLDVEEAAAVRARLNWTSLLPASFTKNYKPVVQTTGNA
RIVQELPQLLDARSAPLIADQAVLQLLPKTYILTCEHDVLRDDGIMYAKRLESAGVEVTLDHFEDGFHGCMIFTSWPTNFSVGIRTRNSY

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for GOLIM4-NCEH1


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for GOLIM4-NCEH1


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for GOLIM4-NCEH1


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource