FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:CASC3-NFE2L1 (FusionGDB2 ID:13154)

Fusion Gene Summary for CASC3-NFE2L1

check button Fusion gene summary
Fusion gene informationFusion gene name: CASC3-NFE2L1
Fusion gene ID: 13154
HgeneTgene
Gene symbol

CASC3

NFE2L1

Gene ID

22794

4779

Gene nameCASC3 exon junction complex subunitnuclear factor, erythroid 2 like 1
SynonymsBTZ|MLN51LCR-F1|NRF1|TCF11
Cytomap

17q21.1

17q21.32

Type of geneprotein-codingprotein-coding
Descriptionprotein CASC3MLN 51barentszcancer susceptibility 3cancer susceptibility candidate 3cancer susceptibility candidate gene 3 proteinmetastatic lymph node 51metastatic lymph node gene 51 proteinprotein barentszendoplasmic reticulum membrane sensor NFE2L1NF-E2-related factor 1NFE2-related factor 1TCF-11locus control region-factor 1nuclear factor erythroid 2-related factor 1nuclear factor, erythroid derived 2, like 1protein NRF1, p120 formtranscription fa
Modification date2020031320200313
UniProtAcc

O15234

Q14494

Ensembl transtripts involved in fusion geneENST00000264645, ENST00000361665, 
ENST00000536222, ENST00000579481, 
ENST00000582155, ENST00000583378, 
ENST00000357480, ENST00000362042, 
ENST00000585291, 
Fusion gene scores* DoF score21 X 10 X 10=210011 X 9 X 5=495
# samples 2811
** MAII scorelog2(28/2100*10)=-2.90689059560852
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(11/495*10)=-2.16992500144231
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: CASC3 [Title/Abstract] AND NFE2L1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointCASC3(38325699)-NFE2L1(46133747), # samples:1
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneCASC3

GO:0000398

mRNA splicing, via spliceosome

29301961


check buttonFusion gene breakpoints across CASC3 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across NFE2L1 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4ESCATCGA-V5-A7RECASC3chr17

38325699

+NFE2L1chr17

46133747

+


Top

Fusion Gene ORF analysis for CASC3-NFE2L1

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000264645ENST00000361665CASC3chr17

38325699

+NFE2L1chr17

46133747

+
5CDS-intronENST00000264645ENST00000536222CASC3chr17

38325699

+NFE2L1chr17

46133747

+
5CDS-intronENST00000264645ENST00000579481CASC3chr17

38325699

+NFE2L1chr17

46133747

+
5CDS-intronENST00000264645ENST00000582155CASC3chr17

38325699

+NFE2L1chr17

46133747

+
5CDS-intronENST00000264645ENST00000583378CASC3chr17

38325699

+NFE2L1chr17

46133747

+
In-frameENST00000264645ENST00000357480CASC3chr17

38325699

+NFE2L1chr17

46133747

+
In-frameENST00000264645ENST00000362042CASC3chr17

38325699

+NFE2L1chr17

46133747

+
In-frameENST00000264645ENST00000585291CASC3chr17

38325699

+NFE2L1chr17

46133747

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000264645CASC3chr1738325699+ENST00000362042NFE2L1chr1746133747+5962231417241221316
ENST00000264645CASC3chr1738325699+ENST00000585291NFE2L1chr1746133747+5879231417240321286
ENST00000264645CASC3chr1738325699+ENST00000357480NFE2L1chr1746133747+5871231417240321286

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000264645ENST00000362042CASC3chr1738325699+NFE2L1chr1746133747+0.0014365990.99856347
ENST00000264645ENST00000585291CASC3chr1738325699+NFE2L1chr1746133747+0.0014077670.9985922
ENST00000264645ENST00000357480CASC3chr1738325699+NFE2L1chr1746133747+0.001427820.9985721

Top

Fusion Genomic Features for CASC3-NFE2L1


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
CASC3chr1738325699+NFE2L1chr1746133747+6.60E-091
CASC3chr1738325699+NFE2L1chr1746133747+6.60E-091

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.
genomic feature of top 1%

Top

Fusion Protein Features for CASC3-NFE2L1


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr17:38325699/chr17:46133747)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
CASC3

O15234

NFE2L1

Q14494

FUNCTION: Required for pre-mRNA splicing as component of the spliceosome (PubMed:28502770, PubMed:29301961). Core component of the splicing-dependent multiprotein exon junction complex (EJC) deposited at splice junctions on mRNAs. The EJC is a dynamic structure consisting of core proteins and several peripheral nuclear and cytoplasmic associated factors that join the complex only transiently either during EJC assembly or during subsequent mRNA metabolism. The EJC marks the position of the exon-exon junction in the mature mRNA for the gene expression machinery and the core components remain bound to spliced mRNAs throughout all stages of mRNA metabolism thereby influencing downstream processes including nuclear mRNA export, subcellular mRNA localization, translation efficiency and nonsense-mediated mRNA decay (NMD). Stimulates the ATPase and RNA-helicase activities of EIF4A3. Plays a role in the stress response by participating in cytoplasmic stress granules assembly and by favoring cell recovery following stress. Component of the dendritic ribonucleoprotein particles (RNPs) in hippocampal neurons. May play a role in mRNA transport. Binds spliced mRNA in sequence-independent manner, 20-24 nucleotides upstream of mRNA exon-exon junctions. Binds poly(G) and poly(U) RNA homopolymer. {ECO:0000269|PubMed:17375189, ECO:0000269|PubMed:17652158, ECO:0000269|PubMed:28502770, ECO:0000269|PubMed:29301961}.FUNCTION: [Endoplasmic reticulum membrane sensor NFE2L1]: Endoplasmic reticulum membrane sensor that translocates into the nucleus in response to various stresses to act as a transcription factor (PubMed:20932482, PubMed:24448410). Constitutes a precursor of the transcription factor NRF1 (By similarity). Able to detect various cellular stresses, such as cholesterol excess, oxidative stress or proteasome inhibition (PubMed:20932482). In response to stress, it is released from the endoplasmic reticulum membrane following cleavage by the protease DDI2 and translocates into the nucleus to form the transcription factor NRF1 (By similarity). Acts as a key sensor of cholesterol excess: in excess cholesterol conditions, the endoplasmic reticulum membrane form of the protein directly binds cholesterol via its CRAC motif, preventing cleavage and release of the transcription factor NRF1, thereby allowing expression of genes promoting cholesterol removal, such as CD36 (By similarity). Involved in proteasome homeostasis: in response to proteasome inhibition, it is released from the endoplasmic reticulum membrane, translocates to the nucleus and activates expression of genes encoding proteasome subunits (PubMed:20932482). {ECO:0000250|UniProtKB:Q61985, ECO:0000269|PubMed:20932482, ECO:0000269|PubMed:24448410}.; FUNCTION: [Transcription factor NRF1]: CNC-type bZIP family transcription factor that translocates to the nucleus and regulates expression of target genes in response to various stresses (PubMed:8932385, PubMed:9421508). Heterodimerizes with small-Maf proteins (MAFF, MAFG or MAFK) and binds DNA motifs including the antioxidant response elements (AREs), which regulate expression of genes involved in oxidative stress response (PubMed:8932385, PubMed:9421508). Activates or represses expression of target genes, depending on the context (PubMed:8932385, PubMed:9421508). Plays a key role in cholesterol homeostasis by acting as a sensor of cholesterol excess: in low cholesterol conditions, translocates into the nucleus and represses expression of genes involved in defense against cholesterol excess, such as CD36 (By similarity). In excess cholesterol conditions, the endoplasmic reticulum membrane form of the protein directly binds cholesterol via its CRAC motif, preventing cleavage and release of the transcription factor NRF1, thereby allowing expression of genes promoting cholesterol removal (By similarity). Critical for redox balance in response to oxidative stress: acts by binding the AREs motifs on promoters and mediating activation of oxidative stress response genes, such as GCLC, GCLM, GSS, MT1 and MT2 (By similarity). Plays an essential role during fetal liver hematopoiesis: probably has a protective function against oxidative stress and is involved in lipid homeostasis in the liver (By similarity). Involved in proteasome homeostasis: in response to proteasome inhibition, mediates the 'bounce-back' of proteasome subunits by translocating into the nucleus and activating expression of genes encoding proteasome subunits (PubMed:20932482). Also involved in regulating glucose flux (By similarity). Together with CEBPB; represses expression of DSPP during odontoblast differentiation (PubMed:15308669). In response to ascorbic acid induction, activates expression of SP7/Osterix in osteoblasts. {ECO:0000250|UniProtKB:Q61985, ECO:0000269|PubMed:15308669, ECO:0000269|PubMed:20932482, ECO:0000269|PubMed:8932385, ECO:0000269|PubMed:9421508}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneCASC3chr17:38325699chr17:46133747ENST00000264645+121495_1316961130.6666666666667Coiled coilOntology_term=ECO:0000255
HgeneCASC3chr17:38325699chr17:46133747ENST00000264645+1214392_3956961130.6666666666667Compositional biasNote=Poly-Pro
HgeneCASC3chr17:38325699chr17:46133747ENST00000264645+121441_466961130.6666666666667Compositional biasNote=Poly-Gly
HgeneCASC3chr17:38325699chr17:46133747ENST00000264645+1214425_4286961130.6666666666667Compositional biasNote=Poly-Pro
HgeneCASC3chr17:38325699chr17:46133747ENST00000264645+1214643_6486961130.6666666666667Compositional biasNote=Poly-Pro
HgeneCASC3chr17:38325699chr17:46133747ENST00000264645+1214692_6956961130.6666666666667Compositional biasNote=Poly-Pro
HgeneCASC3chr17:38325699chr17:46133747ENST00000264645+1214204_2106961130.6666666666667MotifNuclear localization signal 1
HgeneCASC3chr17:38325699chr17:46133747ENST00000264645+1214254_2626961130.6666666666667MotifNuclear localization signal 2
HgeneCASC3chr17:38325699chr17:46133747ENST00000264645+1214462_4666961130.6666666666667MotifNote=Nuclear export signal
HgeneCASC3chr17:38325699chr17:46133747ENST00000264645+1214137_2836961130.6666666666667RegionNote=Sufficient to form the EJC
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000035748015496_517170743.0Compositional biasNote=Poly-Ser
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000036204216496_517170773.0Compositional biasNote=Poly-Ser
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000058529126496_517170743.0Compositional biasNote=Poly-Ser
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000035748015654_717170743.0DomainbZIP
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000036204216654_717170773.0DomainbZIP
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000058529126654_717170743.0DomainbZIP
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000035748015476_480170743.0MotifDestruction motif
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000036204216476_480170773.0MotifDestruction motif
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000058529126476_480170743.0MotifDestruction motif
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000035748015191_199170743.0RegionCholesterol recognition/amino acid consensus (CRAC) region
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000035748015379_383170743.0RegionCPD
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000035748015656_675170743.0RegionBasic motif
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000035748015682_696170743.0RegionLeucine-zipper
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000036204216191_199170773.0RegionCholesterol recognition/amino acid consensus (CRAC) region
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000036204216379_383170773.0RegionCPD
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000036204216656_675170773.0RegionBasic motif
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000036204216682_696170773.0RegionLeucine-zipper
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000058529126191_199170743.0RegionCholesterol recognition/amino acid consensus (CRAC) region
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000058529126379_383170743.0RegionCPD
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000058529126656_675170743.0RegionBasic motif
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000058529126682_696170743.0RegionLeucine-zipper

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneCASC3chr17:38325699chr17:46133747ENST00000264645+1214377_7036961130.6666666666667RegionNote=Necessary for localization in cytoplasmic stress granules
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000035748015125_288170743.0Compositional biasNote=Asp/Glu-rich (acidic)
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000036204216125_288170773.0Compositional biasNote=Asp/Glu-rich (acidic)
TgeneNFE2L1chr17:38325699chr17:46133747ENST0000058529126125_288170743.0Compositional biasNote=Asp/Glu-rich (acidic)
TgeneNFE2L1chr17:38325699chr17:46133747ENST00000357480157_24170743.0TransmembraneHelical%3B Signal-anchor for type II membrane protein
TgeneNFE2L1chr17:38325699chr17:46133747ENST00000362042167_24170773.0TransmembraneHelical%3B Signal-anchor for type II membrane protein
TgeneNFE2L1chr17:38325699chr17:46133747ENST00000585291267_24170743.0TransmembraneHelical%3B Signal-anchor for type II membrane protein


Top

Fusion Gene Sequence for CASC3-NFE2L1


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>13154_13154_1_CASC3-NFE2L1_CASC3_chr17_38325699_ENST00000264645_NFE2L1_chr17_46133747_ENST00000357480_length(transcript)=5871nt_BP=2314nt
CACACACACACACACACACACACACACCCCAACACACACACACACACCCCAACACACACACACACACACACACACACACACACACACACA
CACACACACACACACACAGCGGGATGGCCGAGCGCCGCACGCGTAGCACGCCGGGACTAGCTATCCAGCCTCCCAGCAGCCTCTGCGACG
GGCGCGGTGCGTAAGTACCTCGCCGGTGGTGGCCGTTCTCCGTAAGATGGCGGACCGGCGGCGGCAGCGCGCTTCGCAAGACACCGAGGA
CGAGGAATCTGGTGCTTCGGGCTCCGACAGCGGCGGCTCCCCGTTGCGGGGAGGCGGGAGCTGCAGCGGTAGCGCCGGAGGCGGCGGCAG
CGGCTCTCTGCCTTCACAGCGCGGAGGCCGAACCGGGGCCCTTCATCTGCGGCGGGTGGAGAGCGGGGGCGCCAAGAGTGCTGAGGAGTC
GGAGTGTGAGAGTGAAGATGGCATTGAAGGTGATGCTGTTCTCTCGGATTATGAAAGTGCAGAAGACTCGGAAGGTGAAGAAGGTGAATA
CAGTGAAGAGGAAAACTCCAAAGTGGAGCTGAAATCAGAAGCTAATGATGCTGTTAATTCTTCAACAAAAGAAGAGAAGGGAGAAGAAAA
GCCTGACACCAAAAGCACTGTGACTGGAGAGAGGCAAAGTGGGGACGGACAGGAGAGCACAGAGCCTGTGGAGAACAAAGTGGGTAAAAA
GGGCCCTAAGCATTTGGATGATGATGAAGATCGGAAGAATCCAGCATACATACCTCGGAAAGGGCTCTTCTTTGAGCATGATCTTCGAGG
GCAAACTCAGGAGGAGGAAGTCAGACCCAAGGGGCGTCAGCGAAAGCTATGGAAGGATGAGGGTCGCTGGGAGCATGACAAGTTCCGGGA
AGATGAGCAGGCCCCAAAGTCCCGACAGGAGCTCATTGCTCTTTATGGTTATGACATTCGCTCAGCTCATAATCCTGATGACATCAAACC
TCGAAGAATCCGGAAACCCCGATATGGGAGTCCTCCACAAAGAGATCCAAACTGGAACGGTGAGCGGCTAAACAAGTCTCATCGCCACCA
GGGTCTTGGGGGCACCCTACCACCAAGGACATTTATTAACAGGAATGCTGCAGGTACCGGCCGTATGTCTGCACCCAGGAATTATTCTCG
ATCTGGGGGCTTCAAGGAAGGTCGTGCTGGTTTTAGGCCTGTGGAAGCTGGTGGGCAGCATGGTGGCCGGTCTGGTGAGACTGTTAAGCA
TGAGATTAGTTACCGGTCACGGCGCCTAGAGCAGACTTCTGTGAGGGATCCATCTCCAGAAGCAGATGCTCCAGTGCTTGGCAGTCCTGA
GAAGGAAGAGGCAGCCTCAGAGCCACCAGCTGCTGCTCCTGATGCTGCACCACCACCCCCTGATAGGCCCATTGAGAAGAAATCCTATTC
CCGGGCAAGAAGAACTCGAACCAAAGTTGGAGATGCAGTCAAGCTTGCAGAGGAGGTGCCCCCTCCTCCTGAAGGACTGATTCCAGCACC
TCCAGTCCCAGAAACCACCCCAACTCCACCTACTAAGACTGGGACCTGGGAAGCTCCGGTGGATTCTAGTACAAGTGGACTTGAGCAAGA
TGTGGCACAACTAAATATAGCAGAACAGAATTGGAGTCCGGGGCAGCCTTCTTTCCTGCAACCACGGGAACTTCGAGGTATGCCCAACCA
TATACACATGGGAGCAGGACCTCCACCTCAGTTTAACCGGATGGAAGAAATGGGTGTCCAGGGTGGTCGAGCCAAACGCTATTCATCCCA
GCGGCAAAGACCTGTGCCAGAGCCCCCCGCCCCTCCAGTGCATATCAGTATCATGGAGGGACATTACTATGATCCACTGCAGTTCCAGGG
ACCAATCTATACCCATGGTGACAGCCCTGCCCCGCTGCCTCCACAGGGCATGCTTGTGCAGCCAGGAATGAACCTTCCCCACCCAGGTTT
ACATCCCCACCAGACACCAGCTCCTCTGCCCAATCCAGGCCTCTATCCCCCACCAGTGTCCATGTCTCCAGGACAGCCACCACCTCAGCA
GTTGCTTGCTCCTACTTACTTTTCTGCTCCAGGCGTCATGAACTTTGGTAATCCCAGTTACCCTTATGCTCCAGGGGCACTGCCTCCCCC
ACCACCGCCTCATCTGTATCCTAATACACAGGCCCCATCACAGGTATATGGAGGAGTGACCTACTATAACCCCGCCCAGCAGCAGGTGCA
GCCAAAGCCCTCCCCACCCCGGAGGACTCCCCAGCCAGTCACCATCAAGCCCCCTCCACCTGAGGACATAGATCTGATTGACATCCTTTG
GCGACAGGATATTGATCTGGGGGCTGGGCGTGAGGTTTTTGACTATAGTCACCGCCAGAAGGAGCAGGATGTGGAGAAGGAGCTGCGAGA
TGGAGGCGAGCAGGACACCTGGGCAGGCGAGGGCGCGGAAGCTCTGGCACGGAACCTGCTAGTGGATGGAGAGACTGGGGAGAGCTTCCC
TGCACAGTTTCCAGCAGACATTTCCAGCATAACAGAAGCAGTGCCTAGTGAGAGTGAGCCCCCTGCTCTTCAAAACAACCTCTTGTCTCC
TCTTCTGACCGGGACAGAGTCACCATTTGATTTGGAACAGCAGTGGCAAGATCTCATGTCCATCATGGAAATGCAGGCCATGGAAGTGAA
CACATCAGCAAGTGAAATCCTGTACAGTGCCCCTCCTGGAGACCCACTGAGCACCAACTACAGCCTTGCCCCCAACACTCCCATCAATCA
GAATGTCAGCCTGCATCAGGCGTCCCTGGGGGGCTGCAGCCAGGACTTCTTACTCTTCAGCCCCGAGGTGGAAAGCCTGCCTGTGGCCAG
TAGCTCCACGCTGCTCCCGTTGGCCCCCAGCAATTCTACCAGCCTCAACTCCACCTTCGGCTCCACCAACCTGACAGGGCTCTTCTTTCC
ACCCCAGCTCAATGGCACAGCCAATGACACAGCAGGCCCAGAGCTGCCTGACCCTTTGGGGGGTCTGTTAGATGAAGCTATGTTGGATGA
GATCAGCCTTATGGACCTGGCCATTGAAGAAGGCTTTAACCCTGTGCAGGCCTCCCAGCTGGAGGAGGAATTTGACTCTGACTCAGGCCT
TTCCTTAGACTCGAGCCATAGCCCTTCTTCCCTAAGCAGCTCTGAAGGCAGTTCTTCCTCTTCTTCCTCCTCCTCTTCCTCTTCTTCCTC
TGCTTCTTCCTCTGCCTCTTCCTCCTTTTCTGAGGAAGGTGCGGTTGGCTACAGCTCTGACTCTGAGACCCTGGATCTGGAAGAGGCCGA
GGGTGCTGTGGGCTACCAGCCTGAGTATTCCAAGTTCTGCCGCATGAGCTACCAGGATCCAGCTCAGCTCTCATGCCTGCCCTACCTGGA
GCACGTGGGCCACAACCACACATACAACATGGCACCCAGTGCCCTGGACTCAGCCGACCTGCCACCACCCAGTGCCCTCAAGAAAGGCAG
CAAGGAGAAGCAGGCTGACTTCCTGGACAAGCAGATGAGCCGGGATGAGCACCGAGCCCGAGCCATGAAGATCCCTTTCACCAATGACAA
AATCATCAACCTGCCTGTGGAGGAGTTCAATGAACTGCTGTCCAAATACCAGTTGAGTGAAGCCCAGCTGAGCCTCATCCGAGACATCCG
GCGCCGGGGCAAGAACAAGATGGCGGCGCAGAACTGCCGCAAGCGCAAGCTGGACACCATCCTGAATCTGGAGCGTGATGTGGAGGACCT
GCAGCGTGACAAAGCCCGGCTGCTGCGGGAGAAAGTGGAGTTCCTGCGCTCCCTGCGACAGATGAAGCAGAAGGTCCAGAGCCTGTACCA
GGAGGTGTTTGGGCGGCTGCGAGATGAGAACGGACGACCCTACTCGCCCAGTCAGTATGCGCTCCAGTACGCCGGGGACGGCAGTGTCCT
CCTCATCCCCCGCACGATGGCCGACCAGCAGGCCCGGCGGCAGGAGAGGAAGCCAAAGGACCGGAGAAAGTGAGCCTGGGGAAGAAGGGG
GTTTGAAGCCCACCAAGACCGAAACTGGAGAAGGGCTGGACCTGGACCTGGACCTGGACCTACAGCGGGGACTTAAATGCCTTCTTATCC
AATATATCTTCTCAGATGGGATGACTGCGGGTCAGTGTACAGGAAGAGGCAGGCACTGGCTGGCTCAGCTCCACTCGGGTGGAGTGGAAG
TGGCCAGACCATTTAGACGGACAGGGTCCTCACCCTACCCCTTTCCTGTGAGGCAGGGGTGGTGGTGGAGTTGCTGGAGGTAGAGGAGCT
ATGTGGAGCAAAGGCCGACAGAGGGGAAGGAATGGACCTGTGAGAGGAAGGGAAGGTGGCAGAAAGTCTCATTTCAGGAAGGAGGGATAG
AAGGAAGGAAGGAAGGAACCCCCCCCCCCCCGAAAAAAAAATCAAAGCGGGAAGAAAATCAGAGGGAAGGTTAAGGTTGGCTCTGGCCAG
GATTCCAGGCAGCAGGTTGGAGTGACTGGTGGGCCTAGATCACTGGTGTGATAAACCCCAATTTTCACCCCGGGGGGGGTGGGGTACACA
GACACAGGGTGGGGGTGGGGAGGGACGGTGTTAACTCTTTCTGCTCCTTGCATTTTGACATCCCTGAAGGGGAGCTCTTGGATATCATTG
GCCATGTTTCAATCGAATGGAGCCACTGGGCCCCAACACTGGCTTTGAGATTTAGAGTCAAAGGGTAGAGTGAACAGGAAAGGGTCACGT
GGTCCCATGTTGCAACAGCCCCAACATCACGCATGTCATTCACTGCCTTGCCACTCCATCTCCCTCCGTGCTCCAGCCACCCCTGAGCTG
AGGCTCCCATTGTCTCCATCAGAGCCTGCATGTGTATGCCGTCCTCCCCTGGTCCGGTGTTTGTGTTCCCCACCCCTCACAGACTGCCTG
AGCTCTTCTGTAAGCTGGGGTAGGGTGATGGCAGTGCTCCGGGAACTGGGCCTGCAGCCTTCCTCTTCTGGGACTGCTGTGAGGCAGAGG
AATGATGGAGAATCTAGTGTAGCAGCCTCCAGGCAGGATTCAGCACAACACTGGGGAGTCACCCTTCCCTCGGGCCTCTGCCTACCAACA
ACTGGGCTTATCACTGGGAAAACACAAAAAATTACACAACCCAGCAACAACAAAAGAACTAGTCCTCTTAGAATTTCTTGCGCTTTGATT
TTTTTAGGGCTTGTGCCCTGTTTCACTTATAGGGTCTAGAATGCTTGTGTTGAGTAAAAAGGAGATGCCCAATATTCAAAGCTGCTAAAT
GTTCTCTTTGCCATAAAGACTCCGTGTAACTGTGTGAACACTTGGGATTTTTCTCCTCTGTCCCGAGGTCGTCGTCTGCTTTCTTTTTTG
GGTTTCTTTCTAGAAGATTGAGAAGTGCATATGACAGGCTGAGAGCACCTCCCCAAACACACAAGCTCTCAGCCACAGGCAGCTTCTCCA
CAGCCCCAGCTTCGCACAGGCTCCTGGAGGGCTGCCTGGGGGAGGCAGACATGGGAGTGCCAAGGTGGCCAGATGGTTCCAGGACTACAA
TGTCTTTATTTTTAACTGTTTGCCACTGCTGCCCTCACCCCTGCCCGGCTCTGGAGTACCGTCTGCCCCAGACAAGTGGGAGTGAAATGG
GGGTGGGGGGAAGCACTGATTCCCAGTTAGGGGGTGCCTAACTGAGCAGTAGGGATAGAAGGTGTGAACCTGGGAGTGCTTTTATAAATT
ATTTTCCTTGTAGATTTTATTTTTAATTTATCTCTGTGACCTGCCAGGGAGAGGGGAGAGAGAGAGAGATGCTGTTGAGCACATGACAAA

>13154_13154_1_CASC3-NFE2L1_CASC3_chr17_38325699_ENST00000264645_NFE2L1_chr17_46133747_ENST00000357480_length(amino acids)=1286AA_BP=996
MRRARCVSTSPVVAVLRKMADRRRQRASQDTEDEESGASGSDSGGSPLRGGGSCSGSAGGGGSGSLPSQRGGRTGALHLRRVESGGAKSA
EESECESEDGIEGDAVLSDYESAEDSEGEEGEYSEEENSKVELKSEANDAVNSSTKEEKGEEKPDTKSTVTGERQSGDGQESTEPVENKV
GKKGPKHLDDDEDRKNPAYIPRKGLFFEHDLRGQTQEEEVRPKGRQRKLWKDEGRWEHDKFREDEQAPKSRQELIALYGYDIRSAHNPDD
IKPRRIRKPRYGSPPQRDPNWNGERLNKSHRHQGLGGTLPPRTFINRNAAGTGRMSAPRNYSRSGGFKEGRAGFRPVEAGGQHGGRSGET
VKHEISYRSRRLEQTSVRDPSPEADAPVLGSPEKEEAASEPPAAAPDAAPPPPDRPIEKKSYSRARRTRTKVGDAVKLAEEVPPPPEGLI
PAPPVPETTPTPPTKTGTWEAPVDSSTSGLEQDVAQLNIAEQNWSPGQPSFLQPRELRGMPNHIHMGAGPPPQFNRMEEMGVQGGRAKRY
SSQRQRPVPEPPAPPVHISIMEGHYYDPLQFQGPIYTHGDSPAPLPPQGMLVQPGMNLPHPGLHPHQTPAPLPNPGLYPPPVSMSPGQPP
PQQLLAPTYFSAPGVMNFGNPSYPYAPGALPPPPPPHLYPNTQAPSQVYGGVTYYNPAQQQVQPKPSPPRRTPQPVTIKPPPPEDIDLID
ILWRQDIDLGAGREVFDYSHRQKEQDVEKELRDGGEQDTWAGEGAEALARNLLVDGETGESFPAQFPADISSITEAVPSESEPPALQNNL
LSPLLTGTESPFDLEQQWQDLMSIMEMQAMEVNTSASEILYSAPPGDPLSTNYSLAPNTPINQNVSLHQASLGGCSQDFLLFSPEVESLP
VASSSTLLPLAPSNSTSLNSTFGSTNLTGLFFPPQLNGTANDTAGPELPDPLGGLLDEAMLDEISLMDLAIEEGFNPVQASQLEEEFDSD
SGLSLDSSHSPSSLSSSEGSSSSSSSSSSSSSSASSSASSSFSEEGAVGYSSDSETLDLEEAEGAVGYQPEYSKFCRMSYQDPAQLSCLP
YLEHVGHNHTYNMAPSALDSADLPPPSALKKGSKEKQADFLDKQMSRDEHRARAMKIPFTNDKIINLPVEEFNELLSKYQLSEAQLSLIR
DIRRRGKNKMAAQNCRKRKLDTILNLERDVEDLQRDKARLLREKVEFLRSLRQMKQKVQSLYQEVFGRLRDENGRPYSPSQYALQYAGDG

--------------------------------------------------------------
>13154_13154_2_CASC3-NFE2L1_CASC3_chr17_38325699_ENST00000264645_NFE2L1_chr17_46133747_ENST00000362042_length(transcript)=5962nt_BP=2314nt
CACACACACACACACACACACACACACCCCAACACACACACACACACCCCAACACACACACACACACACACACACACACACACACACACA
CACACACACACACACACAGCGGGATGGCCGAGCGCCGCACGCGTAGCACGCCGGGACTAGCTATCCAGCCTCCCAGCAGCCTCTGCGACG
GGCGCGGTGCGTAAGTACCTCGCCGGTGGTGGCCGTTCTCCGTAAGATGGCGGACCGGCGGCGGCAGCGCGCTTCGCAAGACACCGAGGA
CGAGGAATCTGGTGCTTCGGGCTCCGACAGCGGCGGCTCCCCGTTGCGGGGAGGCGGGAGCTGCAGCGGTAGCGCCGGAGGCGGCGGCAG
CGGCTCTCTGCCTTCACAGCGCGGAGGCCGAACCGGGGCCCTTCATCTGCGGCGGGTGGAGAGCGGGGGCGCCAAGAGTGCTGAGGAGTC
GGAGTGTGAGAGTGAAGATGGCATTGAAGGTGATGCTGTTCTCTCGGATTATGAAAGTGCAGAAGACTCGGAAGGTGAAGAAGGTGAATA
CAGTGAAGAGGAAAACTCCAAAGTGGAGCTGAAATCAGAAGCTAATGATGCTGTTAATTCTTCAACAAAAGAAGAGAAGGGAGAAGAAAA
GCCTGACACCAAAAGCACTGTGACTGGAGAGAGGCAAAGTGGGGACGGACAGGAGAGCACAGAGCCTGTGGAGAACAAAGTGGGTAAAAA
GGGCCCTAAGCATTTGGATGATGATGAAGATCGGAAGAATCCAGCATACATACCTCGGAAAGGGCTCTTCTTTGAGCATGATCTTCGAGG
GCAAACTCAGGAGGAGGAAGTCAGACCCAAGGGGCGTCAGCGAAAGCTATGGAAGGATGAGGGTCGCTGGGAGCATGACAAGTTCCGGGA
AGATGAGCAGGCCCCAAAGTCCCGACAGGAGCTCATTGCTCTTTATGGTTATGACATTCGCTCAGCTCATAATCCTGATGACATCAAACC
TCGAAGAATCCGGAAACCCCGATATGGGAGTCCTCCACAAAGAGATCCAAACTGGAACGGTGAGCGGCTAAACAAGTCTCATCGCCACCA
GGGTCTTGGGGGCACCCTACCACCAAGGACATTTATTAACAGGAATGCTGCAGGTACCGGCCGTATGTCTGCACCCAGGAATTATTCTCG
ATCTGGGGGCTTCAAGGAAGGTCGTGCTGGTTTTAGGCCTGTGGAAGCTGGTGGGCAGCATGGTGGCCGGTCTGGTGAGACTGTTAAGCA
TGAGATTAGTTACCGGTCACGGCGCCTAGAGCAGACTTCTGTGAGGGATCCATCTCCAGAAGCAGATGCTCCAGTGCTTGGCAGTCCTGA
GAAGGAAGAGGCAGCCTCAGAGCCACCAGCTGCTGCTCCTGATGCTGCACCACCACCCCCTGATAGGCCCATTGAGAAGAAATCCTATTC
CCGGGCAAGAAGAACTCGAACCAAAGTTGGAGATGCAGTCAAGCTTGCAGAGGAGGTGCCCCCTCCTCCTGAAGGACTGATTCCAGCACC
TCCAGTCCCAGAAACCACCCCAACTCCACCTACTAAGACTGGGACCTGGGAAGCTCCGGTGGATTCTAGTACAAGTGGACTTGAGCAAGA
TGTGGCACAACTAAATATAGCAGAACAGAATTGGAGTCCGGGGCAGCCTTCTTTCCTGCAACCACGGGAACTTCGAGGTATGCCCAACCA
TATACACATGGGAGCAGGACCTCCACCTCAGTTTAACCGGATGGAAGAAATGGGTGTCCAGGGTGGTCGAGCCAAACGCTATTCATCCCA
GCGGCAAAGACCTGTGCCAGAGCCCCCCGCCCCTCCAGTGCATATCAGTATCATGGAGGGACATTACTATGATCCACTGCAGTTCCAGGG
ACCAATCTATACCCATGGTGACAGCCCTGCCCCGCTGCCTCCACAGGGCATGCTTGTGCAGCCAGGAATGAACCTTCCCCACCCAGGTTT
ACATCCCCACCAGACACCAGCTCCTCTGCCCAATCCAGGCCTCTATCCCCCACCAGTGTCCATGTCTCCAGGACAGCCACCACCTCAGCA
GTTGCTTGCTCCTACTTACTTTTCTGCTCCAGGCGTCATGAACTTTGGTAATCCCAGTTACCCTTATGCTCCAGGGGCACTGCCTCCCCC
ACCACCGCCTCATCTGTATCCTAATACACAGGCCCCATCACAGGTATATGGAGGAGTGACCTACTATAACCCCGCCCAGCAGCAGGTGCA
GCCAAAGCCCTCCCCACCCCGGAGGACTCCCCAGCCAGTCACCATCAAGCCCCCTCCACCTGAGGACATAGATCTGATTGACATCCTTTG
GCGACAGGATATTGATCTGGGGGCTGGGCGTGAGGTTTTTGACTATAGTCACCGCCAGAAGGAGCAGGATGTGGAGAAGGAGCTGCGAGA
TGGAGGCGAGCAGGACACCTGGGCAGGCGAGGGCGCGGAAGCTCTGGCACGGAACCTGCTAGTGGATGGAGAGACTGGGGAGAGCTTCCC
TGCACAGGTGCCTAGTGGGGAGGACCAGACGGCCCTGTCCCTGGAAGAGTGCCTTAGGCTGCTGGAAGCCACCTGCCCCTTTGGGGAGAA
TGCTGAGTTTCCAGCAGACATTTCCAGCATAACAGAAGCAGTGCCTAGTGAGAGTGAGCCCCCTGCTCTTCAAAACAACCTCTTGTCTCC
TCTTCTGACCGGGACAGAGTCACCATTTGATTTGGAACAGCAGTGGCAAGATCTCATGTCCATCATGGAAATGCAGGCCATGGAAGTGAA
CACATCAGCAAGTGAAATCCTGTACAGTGCCCCTCCTGGAGACCCACTGAGCACCAACTACAGCCTTGCCCCCAACACTCCCATCAATCA
GAATGTCAGCCTGCATCAGGCGTCCCTGGGGGGCTGCAGCCAGGACTTCTTACTCTTCAGCCCCGAGGTGGAAAGCCTGCCTGTGGCCAG
TAGCTCCACGCTGCTCCCGTTGGCCCCCAGCAATTCTACCAGCCTCAACTCCACCTTCGGCTCCACCAACCTGACAGGGCTCTTCTTTCC
ACCCCAGCTCAATGGCACAGCCAATGACACAGCAGGCCCAGAGCTGCCTGACCCTTTGGGGGGTCTGTTAGATGAAGCTATGTTGGATGA
GATCAGCCTTATGGACCTGGCCATTGAAGAAGGCTTTAACCCTGTGCAGGCCTCCCAGCTGGAGGAGGAATTTGACTCTGACTCAGGCCT
TTCCTTAGACTCGAGCCATAGCCCTTCTTCCCTAAGCAGCTCTGAAGGCAGTTCTTCCTCTTCTTCCTCCTCCTCTTCCTCTTCTTCCTC
TGCTTCTTCCTCTGCCTCTTCCTCCTTTTCTGAGGAAGGTGCGGTTGGCTACAGCTCTGACTCTGAGACCCTGGATCTGGAAGAGGCCGA
GGGTGCTGTGGGCTACCAGCCTGAGTATTCCAAGTTCTGCCGCATGAGCTACCAGGATCCAGCTCAGCTCTCATGCCTGCCCTACCTGGA
GCACGTGGGCCACAACCACACATACAACATGGCACCCAGTGCCCTGGACTCAGCCGACCTGCCACCACCCAGTGCCCTCAAGAAAGGCAG
CAAGGAGAAGCAGGCTGACTTCCTGGACAAGCAGATGAGCCGGGATGAGCACCGAGCCCGAGCCATGAAGATCCCTTTCACCAATGACAA
AATCATCAACCTGCCTGTGGAGGAGTTCAATGAACTGCTGTCCAAATACCAGTTGAGTGAAGCCCAGCTGAGCCTCATCCGAGACATCCG
GCGCCGGGGCAAGAACAAGATGGCGGCGCAGAACTGCCGCAAGCGCAAGCTGGACACCATCCTGAATCTGGAGCGTGATGTGGAGGACCT
GCAGCGTGACAAAGCCCGGCTGCTGCGGGAGAAAGTGGAGTTCCTGCGCTCCCTGCGACAGATGAAGCAGAAGGTCCAGAGCCTGTACCA
GGAGGTGTTTGGGCGGCTGCGAGATGAGAACGGACGACCCTACTCGCCCAGTCAGTATGCGCTCCAGTACGCCGGGGACGGCAGTGTCCT
CCTCATCCCCCGCACGATGGCCGACCAGCAGGCCCGGCGGCAGGAGAGGAAGCCAAAGGACCGGAGAAAGTGAGCCTGGGGAAGAAGGGG
GTTTGAAGCCCACCAAGACCGAAACTGGAGAAGGGCTGGACCTGGACCTGGACCTGGACCTACAGCGGGGACTTAAATGCCTTCTTATCC
AATATATCTTCTCAGATGGGATGACTGCGGGTCAGTGTACAGGAAGAGGCAGGCACTGGCTGGCTCAGCTCCACTCGGGTGGAGTGGAAG
TGGCCAGACCATTTAGACGGACAGGGTCCTCACCCTACCCCTTTCCTGTGAGGCAGGGGTGGTGGTGGAGTTGCTGGAGGTAGAGGAGCT
ATGTGGAGCAAAGGCCGACAGAGGGGAAGGAATGGACCTGTGAGAGGAAGGGAAGGTGGCAGAAAGTCTCATTTCAGGAAGGAGGGATAG
AAGGAAGGAAGGAAGGAACCCCCCCCCCCCCGAAAAAAAAATCAAAGCGGGAAGAAAATCAGAGGGAAGGTTAAGGTTGGCTCTGGCCAG
GATTCCAGGCAGCAGGTTGGAGTGACTGGTGGGCCTAGATCACTGGTGTGATAAACCCCAATTTTCACCCCGGGGGGGGTGGGGTACACA
GACACAGGGTGGGGGTGGGGAGGGACGGTGTTAACTCTTTCTGCTCCTTGCATTTTGACATCCCTGAAGGGGAGCTCTTGGATATCATTG
GCCATGTTTCAATCGAATGGAGCCACTGGGCCCCAACACTGGCTTTGAGATTTAGAGTCAAAGGGTAGAGTGAACAGGAAAGGGTCACGT
GGTCCCATGTTGCAACAGCCCCAACATCACGCATGTCATTCACTGCCTTGCCACTCCATCTCCCTCCGTGCTCCAGCCACCCCTGAGCTG
AGGCTCCCATTGTCTCCATCAGAGCCTGCATGTGTATGCCGTCCTCCCCTGGTCCGGTGTTTGTGTTCCCCACCCCTCACAGACTGCCTG
AGCTCTTCTGTAAGCTGGGGTAGGGTGATGGCAGTGCTCCGGGAACTGGGCCTGCAGCCTTCCTCTTCTGGGACTGCTGTGAGGCAGAGG
AATGATGGAGAATCTAGTGTAGCAGCCTCCAGGCAGGATTCAGCACAACACTGGGGAGTCACCCTTCCCTCGGGCCTCTGCCTACCAACA
ACTGGGCTTATCACTGGGAAAACACAAAAAATTACACAACCCAGCAACAACAAAAGAACTAGTCCTCTTAGAATTTCTTGCGCTTTGATT
TTTTTAGGGCTTGTGCCCTGTTTCACTTATAGGGTCTAGAATGCTTGTGTTGAGTAAAAAGGAGATGCCCAATATTCAAAGCTGCTAAAT
GTTCTCTTTGCCATAAAGACTCCGTGTAACTGTGTGAACACTTGGGATTTTTCTCCTCTGTCCCGAGGTCGTCGTCTGCTTTCTTTTTTG
GGTTTCTTTCTAGAAGATTGAGAAGTGCATATGACAGGCTGAGAGCACCTCCCCAAACACACAAGCTCTCAGCCACAGGCAGCTTCTCCA
CAGCCCCAGCTTCGCACAGGCTCCTGGAGGGCTGCCTGGGGGAGGCAGACATGGGAGTGCCAAGGTGGCCAGATGGTTCCAGGACTACAA
TGTCTTTATTTTTAACTGTTTGCCACTGCTGCCCTCACCCCTGCCCGGCTCTGGAGTACCGTCTGCCCCAGACAAGTGGGAGTGAAATGG
GGGTGGGGGGAAGCACTGATTCCCAGTTAGGGGGTGCCTAACTGAGCAGTAGGGATAGAAGGTGTGAACCTGGGAGTGCTTTTATAAATT
ATTTTCCTTGTAGATTTTATTTTTAATTTATCTCTGTGACCTGCCAGGGAGAGGGGAGAGAGAGAGAGATGCTGTTGAGCACATGACAAA

>13154_13154_2_CASC3-NFE2L1_CASC3_chr17_38325699_ENST00000264645_NFE2L1_chr17_46133747_ENST00000362042_length(amino acids)=1316AA_BP=1026
MRRARCVSTSPVVAVLRKMADRRRQRASQDTEDEESGASGSDSGGSPLRGGGSCSGSAGGGGSGSLPSQRGGRTGALHLRRVESGGAKSA
EESECESEDGIEGDAVLSDYESAEDSEGEEGEYSEEENSKVELKSEANDAVNSSTKEEKGEEKPDTKSTVTGERQSGDGQESTEPVENKV
GKKGPKHLDDDEDRKNPAYIPRKGLFFEHDLRGQTQEEEVRPKGRQRKLWKDEGRWEHDKFREDEQAPKSRQELIALYGYDIRSAHNPDD
IKPRRIRKPRYGSPPQRDPNWNGERLNKSHRHQGLGGTLPPRTFINRNAAGTGRMSAPRNYSRSGGFKEGRAGFRPVEAGGQHGGRSGET
VKHEISYRSRRLEQTSVRDPSPEADAPVLGSPEKEEAASEPPAAAPDAAPPPPDRPIEKKSYSRARRTRTKVGDAVKLAEEVPPPPEGLI
PAPPVPETTPTPPTKTGTWEAPVDSSTSGLEQDVAQLNIAEQNWSPGQPSFLQPRELRGMPNHIHMGAGPPPQFNRMEEMGVQGGRAKRY
SSQRQRPVPEPPAPPVHISIMEGHYYDPLQFQGPIYTHGDSPAPLPPQGMLVQPGMNLPHPGLHPHQTPAPLPNPGLYPPPVSMSPGQPP
PQQLLAPTYFSAPGVMNFGNPSYPYAPGALPPPPPPHLYPNTQAPSQVYGGVTYYNPAQQQVQPKPSPPRRTPQPVTIKPPPPEDIDLID
ILWRQDIDLGAGREVFDYSHRQKEQDVEKELRDGGEQDTWAGEGAEALARNLLVDGETGESFPAQVPSGEDQTALSLEECLRLLEATCPF
GENAEFPADISSITEAVPSESEPPALQNNLLSPLLTGTESPFDLEQQWQDLMSIMEMQAMEVNTSASEILYSAPPGDPLSTNYSLAPNTP
INQNVSLHQASLGGCSQDFLLFSPEVESLPVASSSTLLPLAPSNSTSLNSTFGSTNLTGLFFPPQLNGTANDTAGPELPDPLGGLLDEAM
LDEISLMDLAIEEGFNPVQASQLEEEFDSDSGLSLDSSHSPSSLSSSEGSSSSSSSSSSSSSSASSSASSSFSEEGAVGYSSDSETLDLE
EAEGAVGYQPEYSKFCRMSYQDPAQLSCLPYLEHVGHNHTYNMAPSALDSADLPPPSALKKGSKEKQADFLDKQMSRDEHRARAMKIPFT
NDKIINLPVEEFNELLSKYQLSEAQLSLIRDIRRRGKNKMAAQNCRKRKLDTILNLERDVEDLQRDKARLLREKVEFLRSLRQMKQKVQS

--------------------------------------------------------------
>13154_13154_3_CASC3-NFE2L1_CASC3_chr17_38325699_ENST00000264645_NFE2L1_chr17_46133747_ENST00000585291_length(transcript)=5879nt_BP=2314nt
CACACACACACACACACACACACACACCCCAACACACACACACACACCCCAACACACACACACACACACACACACACACACACACACACA
CACACACACACACACACAGCGGGATGGCCGAGCGCCGCACGCGTAGCACGCCGGGACTAGCTATCCAGCCTCCCAGCAGCCTCTGCGACG
GGCGCGGTGCGTAAGTACCTCGCCGGTGGTGGCCGTTCTCCGTAAGATGGCGGACCGGCGGCGGCAGCGCGCTTCGCAAGACACCGAGGA
CGAGGAATCTGGTGCTTCGGGCTCCGACAGCGGCGGCTCCCCGTTGCGGGGAGGCGGGAGCTGCAGCGGTAGCGCCGGAGGCGGCGGCAG
CGGCTCTCTGCCTTCACAGCGCGGAGGCCGAACCGGGGCCCTTCATCTGCGGCGGGTGGAGAGCGGGGGCGCCAAGAGTGCTGAGGAGTC
GGAGTGTGAGAGTGAAGATGGCATTGAAGGTGATGCTGTTCTCTCGGATTATGAAAGTGCAGAAGACTCGGAAGGTGAAGAAGGTGAATA
CAGTGAAGAGGAAAACTCCAAAGTGGAGCTGAAATCAGAAGCTAATGATGCTGTTAATTCTTCAACAAAAGAAGAGAAGGGAGAAGAAAA
GCCTGACACCAAAAGCACTGTGACTGGAGAGAGGCAAAGTGGGGACGGACAGGAGAGCACAGAGCCTGTGGAGAACAAAGTGGGTAAAAA
GGGCCCTAAGCATTTGGATGATGATGAAGATCGGAAGAATCCAGCATACATACCTCGGAAAGGGCTCTTCTTTGAGCATGATCTTCGAGG
GCAAACTCAGGAGGAGGAAGTCAGACCCAAGGGGCGTCAGCGAAAGCTATGGAAGGATGAGGGTCGCTGGGAGCATGACAAGTTCCGGGA
AGATGAGCAGGCCCCAAAGTCCCGACAGGAGCTCATTGCTCTTTATGGTTATGACATTCGCTCAGCTCATAATCCTGATGACATCAAACC
TCGAAGAATCCGGAAACCCCGATATGGGAGTCCTCCACAAAGAGATCCAAACTGGAACGGTGAGCGGCTAAACAAGTCTCATCGCCACCA
GGGTCTTGGGGGCACCCTACCACCAAGGACATTTATTAACAGGAATGCTGCAGGTACCGGCCGTATGTCTGCACCCAGGAATTATTCTCG
ATCTGGGGGCTTCAAGGAAGGTCGTGCTGGTTTTAGGCCTGTGGAAGCTGGTGGGCAGCATGGTGGCCGGTCTGGTGAGACTGTTAAGCA
TGAGATTAGTTACCGGTCACGGCGCCTAGAGCAGACTTCTGTGAGGGATCCATCTCCAGAAGCAGATGCTCCAGTGCTTGGCAGTCCTGA
GAAGGAAGAGGCAGCCTCAGAGCCACCAGCTGCTGCTCCTGATGCTGCACCACCACCCCCTGATAGGCCCATTGAGAAGAAATCCTATTC
CCGGGCAAGAAGAACTCGAACCAAAGTTGGAGATGCAGTCAAGCTTGCAGAGGAGGTGCCCCCTCCTCCTGAAGGACTGATTCCAGCACC
TCCAGTCCCAGAAACCACCCCAACTCCACCTACTAAGACTGGGACCTGGGAAGCTCCGGTGGATTCTAGTACAAGTGGACTTGAGCAAGA
TGTGGCACAACTAAATATAGCAGAACAGAATTGGAGTCCGGGGCAGCCTTCTTTCCTGCAACCACGGGAACTTCGAGGTATGCCCAACCA
TATACACATGGGAGCAGGACCTCCACCTCAGTTTAACCGGATGGAAGAAATGGGTGTCCAGGGTGGTCGAGCCAAACGCTATTCATCCCA
GCGGCAAAGACCTGTGCCAGAGCCCCCCGCCCCTCCAGTGCATATCAGTATCATGGAGGGACATTACTATGATCCACTGCAGTTCCAGGG
ACCAATCTATACCCATGGTGACAGCCCTGCCCCGCTGCCTCCACAGGGCATGCTTGTGCAGCCAGGAATGAACCTTCCCCACCCAGGTTT
ACATCCCCACCAGACACCAGCTCCTCTGCCCAATCCAGGCCTCTATCCCCCACCAGTGTCCATGTCTCCAGGACAGCCACCACCTCAGCA
GTTGCTTGCTCCTACTTACTTTTCTGCTCCAGGCGTCATGAACTTTGGTAATCCCAGTTACCCTTATGCTCCAGGGGCACTGCCTCCCCC
ACCACCGCCTCATCTGTATCCTAATACACAGGCCCCATCACAGGTATATGGAGGAGTGACCTACTATAACCCCGCCCAGCAGCAGGTGCA
GCCAAAGCCCTCCCCACCCCGGAGGACTCCCCAGCCAGTCACCATCAAGCCCCCTCCACCTGAGGACATAGATCTGATTGACATCCTTTG
GCGACAGGATATTGATCTGGGGGCTGGGCGTGAGGTTTTTGACTATAGTCACCGCCAGAAGGAGCAGGATGTGGAGAAGGAGCTGCGAGA
TGGAGGCGAGCAGGACACCTGGGCAGGCGAGGGCGCGGAAGCTCTGGCACGGAACCTGCTAGTGGATGGAGAGACTGGGGAGAGCTTCCC
TGCACAGTTTCCAGCAGACATTTCCAGCATAACAGAAGCAGTGCCTAGTGAGAGTGAGCCCCCTGCTCTTCAAAACAACCTCTTGTCTCC
TCTTCTGACCGGGACAGAGTCACCATTTGATTTGGAACAGCAGTGGCAAGATCTCATGTCCATCATGGAAATGCAGGCCATGGAAGTGAA
CACATCAGCAAGTGAAATCCTGTACAGTGCCCCTCCTGGAGACCCACTGAGCACCAACTACAGCCTTGCCCCCAACACTCCCATCAATCA
GAATGTCAGCCTGCATCAGGCGTCCCTGGGGGGCTGCAGCCAGGACTTCTTACTCTTCAGCCCCGAGGTGGAAAGCCTGCCTGTGGCCAG
TAGCTCCACGCTGCTCCCGTTGGCCCCCAGCAATTCTACCAGCCTCAACTCCACCTTCGGCTCCACCAACCTGACAGGGCTCTTCTTTCC
ACCCCAGCTCAATGGCACAGCCAATGACACAGCAGGCCCAGAGCTGCCTGACCCTTTGGGGGGTCTGTTAGATGAAGCTATGTTGGATGA
GATCAGCCTTATGGACCTGGCCATTGAAGAAGGCTTTAACCCTGTGCAGGCCTCCCAGCTGGAGGAGGAATTTGACTCTGACTCAGGCCT
TTCCTTAGACTCGAGCCATAGCCCTTCTTCCCTAAGCAGCTCTGAAGGCAGTTCTTCCTCTTCTTCCTCCTCCTCTTCCTCTTCTTCCTC
TGCTTCTTCCTCTGCCTCTTCCTCCTTTTCTGAGGAAGGTGCGGTTGGCTACAGCTCTGACTCTGAGACCCTGGATCTGGAAGAGGCCGA
GGGTGCTGTGGGCTACCAGCCTGAGTATTCCAAGTTCTGCCGCATGAGCTACCAGGATCCAGCTCAGCTCTCATGCCTGCCCTACCTGGA
GCACGTGGGCCACAACCACACATACAACATGGCACCCAGTGCCCTGGACTCAGCCGACCTGCCACCACCCAGTGCCCTCAAGAAAGGCAG
CAAGGAGAAGCAGGCTGACTTCCTGGACAAGCAGATGAGCCGGGATGAGCACCGAGCCCGAGCCATGAAGATCCCTTTCACCAATGACAA
AATCATCAACCTGCCTGTGGAGGAGTTCAATGAACTGCTGTCCAAATACCAGTTGAGTGAAGCCCAGCTGAGCCTCATCCGAGACATCCG
GCGCCGGGGCAAGAACAAGATGGCGGCGCAGAACTGCCGCAAGCGCAAGCTGGACACCATCCTGAATCTGGAGCGTGATGTGGAGGACCT
GCAGCGTGACAAAGCCCGGCTGCTGCGGGAGAAAGTGGAGTTCCTGCGCTCCCTGCGACAGATGAAGCAGAAGGTCCAGAGCCTGTACCA
GGAGGTGTTTGGGCGGCTGCGAGATGAGAACGGACGACCCTACTCGCCCAGTCAGTATGCGCTCCAGTACGCCGGGGACGGCAGTGTCCT
CCTCATCCCCCGCACGATGGCCGACCAGCAGGCCCGGCGGCAGGAGAGGAAGCCAAAGGACCGGAGAAAGTGAGCCTGGGGAAGAAGGGG
GTTTGAAGCCCACCAAGACCGAAACTGGAGAAGGGCTGGACCTGGACCTGGACCTGGACCTACAGCGGGGACTTAAATGCCTTCTTATCC
AATATATCTTCTCAGATGGGATGACTGCGGGTCAGTGTACAGGAAGAGGCAGGCACTGGCTGGCTCAGCTCCACTCGGGTGGAGTGGAAG
TGGCCAGACCATTTAGACGGACAGGGTCCTCACCCTACCCCTTTCCTGTGAGGCAGGGGTGGTGGTGGAGTTGCTGGAGGTAGAGGAGCT
ATGTGGAGCAAAGGCCGACAGAGGGGAAGGAATGGACCTGTGAGAGGAAGGGAAGGTGGCAGAAAGTCTCATTTCAGGAAGGAGGGATAG
AAGGAAGGAAGGAAGGAACCCCCCCCCCCCCGAAAAAAAAATCAAAGCGGGAAGAAAATCAGAGGGAAGGTTAAGGTTGGCTCTGGCCAG
GATTCCAGGCAGCAGGTTGGAGTGACTGGTGGGCCTAGATCACTGGTGTGATAAACCCCAATTTTCACCCCGGGGGGGGTGGGGTACACA
GACACAGGGTGGGGGTGGGGAGGGACGGTGTTAACTCTTTCTGCTCCTTGCATTTTGACATCCCTGAAGGGGAGCTCTTGGATATCATTG
GCCATGTTTCAATCGAATGGAGCCACTGGGCCCCAACACTGGCTTTGAGATTTAGAGTCAAAGGGTAGAGTGAACAGGAAAGGGTCACGT
GGTCCCATGTTGCAACAGCCCCAACATCACGCATGTCATTCACTGCCTTGCCACTCCATCTCCCTCCGTGCTCCAGCCACCCCTGAGCTG
AGGCTCCCATTGTCTCCATCAGAGCCTGCATGTGTATGCCGTCCTCCCCTGGTCCGGTGTTTGTGTTCCCCACCCCTCACAGACTGCCTG
AGCTCTTCTGTAAGCTGGGGTAGGGTGATGGCAGTGCTCCGGGAACTGGGCCTGCAGCCTTCCTCTTCTGGGACTGCTGTGAGGCAGAGG
AATGATGGAGAATCTAGTGTAGCAGCCTCCAGGCAGGATTCAGCACAACACTGGGGAGTCACCCTTCCCTCGGGCCTCTGCCTACCAACA
ACTGGGCTTATCACTGGGAAAACACAAAAAATTACACAACCCAGCAACAACAAAAGAACTAGTCCTCTTAGAATTTCTTGCGCTTTGATT
TTTTTAGGGCTTGTGCCCTGTTTCACTTATAGGGTCTAGAATGCTTGTGTTGAGTAAAAAGGAGATGCCCAATATTCAAAGCTGCTAAAT
GTTCTCTTTGCCATAAAGACTCCGTGTAACTGTGTGAACACTTGGGATTTTTCTCCTCTGTCCCGAGGTCGTCGTCTGCTTTCTTTTTTG
GGTTTCTTTCTAGAAGATTGAGAAGTGCATATGACAGGCTGAGAGCACCTCCCCAAACACACAAGCTCTCAGCCACAGGCAGCTTCTCCA
CAGCCCCAGCTTCGCACAGGCTCCTGGAGGGCTGCCTGGGGGAGGCAGACATGGGAGTGCCAAGGTGGCCAGATGGTTCCAGGACTACAA
TGTCTTTATTTTTAACTGTTTGCCACTGCTGCCCTCACCCCTGCCCGGCTCTGGAGTACCGTCTGCCCCAGACAAGTGGGAGTGAAATGG
GGGTGGGGGGAAGCACTGATTCCCAGTTAGGGGGTGCCTAACTGAGCAGTAGGGATAGAAGGTGTGAACCTGGGAGTGCTTTTATAAATT
ATTTTCCTTGTAGATTTTATTTTTAATTTATCTCTGTGACCTGCCAGGGAGAGGGGAGAGAGAGAGAGATGCTGTTGAGCACATGACAAA

>13154_13154_3_CASC3-NFE2L1_CASC3_chr17_38325699_ENST00000264645_NFE2L1_chr17_46133747_ENST00000585291_length(amino acids)=1286AA_BP=996
MRRARCVSTSPVVAVLRKMADRRRQRASQDTEDEESGASGSDSGGSPLRGGGSCSGSAGGGGSGSLPSQRGGRTGALHLRRVESGGAKSA
EESECESEDGIEGDAVLSDYESAEDSEGEEGEYSEEENSKVELKSEANDAVNSSTKEEKGEEKPDTKSTVTGERQSGDGQESTEPVENKV
GKKGPKHLDDDEDRKNPAYIPRKGLFFEHDLRGQTQEEEVRPKGRQRKLWKDEGRWEHDKFREDEQAPKSRQELIALYGYDIRSAHNPDD
IKPRRIRKPRYGSPPQRDPNWNGERLNKSHRHQGLGGTLPPRTFINRNAAGTGRMSAPRNYSRSGGFKEGRAGFRPVEAGGQHGGRSGET
VKHEISYRSRRLEQTSVRDPSPEADAPVLGSPEKEEAASEPPAAAPDAAPPPPDRPIEKKSYSRARRTRTKVGDAVKLAEEVPPPPEGLI
PAPPVPETTPTPPTKTGTWEAPVDSSTSGLEQDVAQLNIAEQNWSPGQPSFLQPRELRGMPNHIHMGAGPPPQFNRMEEMGVQGGRAKRY
SSQRQRPVPEPPAPPVHISIMEGHYYDPLQFQGPIYTHGDSPAPLPPQGMLVQPGMNLPHPGLHPHQTPAPLPNPGLYPPPVSMSPGQPP
PQQLLAPTYFSAPGVMNFGNPSYPYAPGALPPPPPPHLYPNTQAPSQVYGGVTYYNPAQQQVQPKPSPPRRTPQPVTIKPPPPEDIDLID
ILWRQDIDLGAGREVFDYSHRQKEQDVEKELRDGGEQDTWAGEGAEALARNLLVDGETGESFPAQFPADISSITEAVPSESEPPALQNNL
LSPLLTGTESPFDLEQQWQDLMSIMEMQAMEVNTSASEILYSAPPGDPLSTNYSLAPNTPINQNVSLHQASLGGCSQDFLLFSPEVESLP
VASSSTLLPLAPSNSTSLNSTFGSTNLTGLFFPPQLNGTANDTAGPELPDPLGGLLDEAMLDEISLMDLAIEEGFNPVQASQLEEEFDSD
SGLSLDSSHSPSSLSSSEGSSSSSSSSSSSSSSASSSASSSFSEEGAVGYSSDSETLDLEEAEGAVGYQPEYSKFCRMSYQDPAQLSCLP
YLEHVGHNHTYNMAPSALDSADLPPPSALKKGSKEKQADFLDKQMSRDEHRARAMKIPFTNDKIINLPVEEFNELLSKYQLSEAQLSLIR
DIRRRGKNKMAAQNCRKRKLDTILNLERDVEDLQRDKARLLREKVEFLRSLRQMKQKVQSLYQEVFGRLRDENGRPYSPSQYALQYAGDG

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for CASC3-NFE2L1


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for CASC3-NFE2L1


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for CASC3-NFE2L1


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource