|
Fusion Gene Summary | |
Fusion Gene ORF analysis | |
Fusion Genomic Features | |
Fusion Protein Features | |
Fusion Gene Sequence | |
Fusion Gene PPI analysis | |
Related Drugs | |
Related Diseases |
Fusion gene:CASC3-NFE2L1 (FusionGDB2 ID:13154) |
Fusion Gene Summary for CASC3-NFE2L1 |
Fusion gene summary |
Fusion gene information | Fusion gene name: CASC3-NFE2L1 | Fusion gene ID: 13154 | Hgene | Tgene | Gene symbol | CASC3 | NFE2L1 | Gene ID | 22794 | 4779 |
Gene name | CASC3 exon junction complex subunit | nuclear factor, erythroid 2 like 1 | |
Synonyms | BTZ|MLN51 | LCR-F1|NRF1|TCF11 | |
Cytomap | 17q21.1 | 17q21.32 | |
Type of gene | protein-coding | protein-coding | |
Description | protein CASC3MLN 51barentszcancer susceptibility 3cancer susceptibility candidate 3cancer susceptibility candidate gene 3 proteinmetastatic lymph node 51metastatic lymph node gene 51 proteinprotein barentsz | endoplasmic reticulum membrane sensor NFE2L1NF-E2-related factor 1NFE2-related factor 1TCF-11locus control region-factor 1nuclear factor erythroid 2-related factor 1nuclear factor, erythroid derived 2, like 1protein NRF1, p120 formtranscription fa | |
Modification date | 20200313 | 20200313 | |
UniProtAcc | O15234 | Q14494 | |
Ensembl transtripts involved in fusion gene | ENST00000264645, | ENST00000361665, ENST00000536222, ENST00000579481, ENST00000582155, ENST00000583378, ENST00000357480, ENST00000362042, ENST00000585291, | |
Fusion gene scores | * DoF score | 21 X 10 X 10=2100 | 11 X 9 X 5=495 |
# samples | 28 | 11 | |
** MAII score | log2(28/2100*10)=-2.90689059560852 possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0 | log2(11/495*10)=-2.16992500144231 possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0 | |
Context | PubMed: CASC3 [Title/Abstract] AND NFE2L1 [Title/Abstract] AND fusion [Title/Abstract] | ||
Most frequent breakpoint | CASC3(38325699)-NFE2L1(46133747), # samples:1 | ||
Anticipated loss of major functional domain due to fusion event. |
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types ** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10) |
Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez |
Partner | Gene | GO ID | GO term | PubMed ID |
Hgene | CASC3 | GO:0000398 | mRNA splicing, via spliceosome | 29301961 |
Fusion gene breakpoints across CASC3 (5'-gene) * Click on the image to open the UCSC genome browser with custom track showing this image in a new window. |
Fusion gene breakpoints across NFE2L1 (3'-gene) * Click on the image to open the UCSC genome browser with custom track showing this image in a new window. |
Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0) * All genome coordinats were lifted-over on hg19. * Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser. |
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
ChimerDB4 | ESCA | TCGA-V5-A7RE | CASC3 | chr17 | 38325699 | + | NFE2L1 | chr17 | 46133747 | + |
Top |
Fusion Gene ORF analysis for CASC3-NFE2L1 |
Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure. * Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser. |
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
5CDS-intron | ENST00000264645 | ENST00000361665 | CASC3 | chr17 | 38325699 | + | NFE2L1 | chr17 | 46133747 | + |
5CDS-intron | ENST00000264645 | ENST00000536222 | CASC3 | chr17 | 38325699 | + | NFE2L1 | chr17 | 46133747 | + |
5CDS-intron | ENST00000264645 | ENST00000579481 | CASC3 | chr17 | 38325699 | + | NFE2L1 | chr17 | 46133747 | + |
5CDS-intron | ENST00000264645 | ENST00000582155 | CASC3 | chr17 | 38325699 | + | NFE2L1 | chr17 | 46133747 | + |
5CDS-intron | ENST00000264645 | ENST00000583378 | CASC3 | chr17 | 38325699 | + | NFE2L1 | chr17 | 46133747 | + |
In-frame | ENST00000264645 | ENST00000357480 | CASC3 | chr17 | 38325699 | + | NFE2L1 | chr17 | 46133747 | + |
In-frame | ENST00000264645 | ENST00000362042 | CASC3 | chr17 | 38325699 | + | NFE2L1 | chr17 | 46133747 | + |
In-frame | ENST00000264645 | ENST00000585291 | CASC3 | chr17 | 38325699 | + | NFE2L1 | chr17 | 46133747 | + |
ORFfinder result based on the fusion transcript sequence of in-frame fusion genes. |
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids) |
ENST00000264645 | CASC3 | chr17 | 38325699 | + | ENST00000362042 | NFE2L1 | chr17 | 46133747 | + | 5962 | 2314 | 172 | 4122 | 1316 |
ENST00000264645 | CASC3 | chr17 | 38325699 | + | ENST00000585291 | NFE2L1 | chr17 | 46133747 | + | 5879 | 2314 | 172 | 4032 | 1286 |
ENST00000264645 | CASC3 | chr17 | 38325699 | + | ENST00000357480 | NFE2L1 | chr17 | 46133747 | + | 5871 | 2314 | 172 | 4032 | 1286 |
DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated. |
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score |
ENST00000264645 | ENST00000362042 | CASC3 | chr17 | 38325699 | + | NFE2L1 | chr17 | 46133747 | + | 0.001436599 | 0.99856347 |
ENST00000264645 | ENST00000585291 | CASC3 | chr17 | 38325699 | + | NFE2L1 | chr17 | 46133747 | + | 0.001407767 | 0.9985922 |
ENST00000264645 | ENST00000357480 | CASC3 | chr17 | 38325699 | + | NFE2L1 | chr17 | 46133747 | + | 0.00142782 | 0.9985721 |
Top |
Fusion Genomic Features for CASC3-NFE2L1 |
FusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints. |
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint) |
CASC3 | chr17 | 38325699 | + | NFE2L1 | chr17 | 46133747 | + | 6.60E-09 | 1 |
CASC3 | chr17 | 38325699 | + | NFE2L1 | chr17 | 46133747 | + | 6.60E-09 | 1 |
Distribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page. |
Distribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page. |
Top |
Fusion Protein Features for CASC3-NFE2L1 |
Four levels of functional features of fusion genes Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr17:38325699/chr17:46133747) - FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels. - How to search 1. Put your fusion gene symbol. 2. Press the tab key until there will be shown the breakpoint information filled. 4. Go down and press 'Search' tab twice. 4. Go down to have the hyperlink of the search result. 5. Click the hyperlink. 6. See the FGviewer result for your fusion gene. |
Main function of each fusion partner protein. (from UniProt) |
Hgene | Tgene |
CASC3 | NFE2L1 |
FUNCTION: Required for pre-mRNA splicing as component of the spliceosome (PubMed:28502770, PubMed:29301961). Core component of the splicing-dependent multiprotein exon junction complex (EJC) deposited at splice junctions on mRNAs. The EJC is a dynamic structure consisting of core proteins and several peripheral nuclear and cytoplasmic associated factors that join the complex only transiently either during EJC assembly or during subsequent mRNA metabolism. The EJC marks the position of the exon-exon junction in the mature mRNA for the gene expression machinery and the core components remain bound to spliced mRNAs throughout all stages of mRNA metabolism thereby influencing downstream processes including nuclear mRNA export, subcellular mRNA localization, translation efficiency and nonsense-mediated mRNA decay (NMD). Stimulates the ATPase and RNA-helicase activities of EIF4A3. Plays a role in the stress response by participating in cytoplasmic stress granules assembly and by favoring cell recovery following stress. Component of the dendritic ribonucleoprotein particles (RNPs) in hippocampal neurons. May play a role in mRNA transport. Binds spliced mRNA in sequence-independent manner, 20-24 nucleotides upstream of mRNA exon-exon junctions. Binds poly(G) and poly(U) RNA homopolymer. {ECO:0000269|PubMed:17375189, ECO:0000269|PubMed:17652158, ECO:0000269|PubMed:28502770, ECO:0000269|PubMed:29301961}. | FUNCTION: [Endoplasmic reticulum membrane sensor NFE2L1]: Endoplasmic reticulum membrane sensor that translocates into the nucleus in response to various stresses to act as a transcription factor (PubMed:20932482, PubMed:24448410). Constitutes a precursor of the transcription factor NRF1 (By similarity). Able to detect various cellular stresses, such as cholesterol excess, oxidative stress or proteasome inhibition (PubMed:20932482). In response to stress, it is released from the endoplasmic reticulum membrane following cleavage by the protease DDI2 and translocates into the nucleus to form the transcription factor NRF1 (By similarity). Acts as a key sensor of cholesterol excess: in excess cholesterol conditions, the endoplasmic reticulum membrane form of the protein directly binds cholesterol via its CRAC motif, preventing cleavage and release of the transcription factor NRF1, thereby allowing expression of genes promoting cholesterol removal, such as CD36 (By similarity). Involved in proteasome homeostasis: in response to proteasome inhibition, it is released from the endoplasmic reticulum membrane, translocates to the nucleus and activates expression of genes encoding proteasome subunits (PubMed:20932482). {ECO:0000250|UniProtKB:Q61985, ECO:0000269|PubMed:20932482, ECO:0000269|PubMed:24448410}.; FUNCTION: [Transcription factor NRF1]: CNC-type bZIP family transcription factor that translocates to the nucleus and regulates expression of target genes in response to various stresses (PubMed:8932385, PubMed:9421508). Heterodimerizes with small-Maf proteins (MAFF, MAFG or MAFK) and binds DNA motifs including the antioxidant response elements (AREs), which regulate expression of genes involved in oxidative stress response (PubMed:8932385, PubMed:9421508). Activates or represses expression of target genes, depending on the context (PubMed:8932385, PubMed:9421508). Plays a key role in cholesterol homeostasis by acting as a sensor of cholesterol excess: in low cholesterol conditions, translocates into the nucleus and represses expression of genes involved in defense against cholesterol excess, such as CD36 (By similarity). In excess cholesterol conditions, the endoplasmic reticulum membrane form of the protein directly binds cholesterol via its CRAC motif, preventing cleavage and release of the transcription factor NRF1, thereby allowing expression of genes promoting cholesterol removal (By similarity). Critical for redox balance in response to oxidative stress: acts by binding the AREs motifs on promoters and mediating activation of oxidative stress response genes, such as GCLC, GCLM, GSS, MT1 and MT2 (By similarity). Plays an essential role during fetal liver hematopoiesis: probably has a protective function against oxidative stress and is involved in lipid homeostasis in the liver (By similarity). Involved in proteasome homeostasis: in response to proteasome inhibition, mediates the 'bounce-back' of proteasome subunits by translocating into the nucleus and activating expression of genes encoding proteasome subunits (PubMed:20932482). Also involved in regulating glucose flux (By similarity). Together with CEBPB; represses expression of DSPP during odontoblast differentiation (PubMed:15308669). In response to ascorbic acid induction, activates expression of SP7/Osterix in osteoblasts. {ECO:0000250|UniProtKB:Q61985, ECO:0000269|PubMed:15308669, ECO:0000269|PubMed:20932482, ECO:0000269|PubMed:8932385, ECO:0000269|PubMed:9421508}. |
Retention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at * Minus value of BPloci means that the break pointn is located before the CDS. |
- In-frame and retained protein feature among the 13 regional features. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
Hgene | CASC3 | chr17:38325699 | chr17:46133747 | ENST00000264645 | + | 12 | 14 | 95_131 | 696 | 1130.6666666666667 | Coiled coil | Ontology_term=ECO:0000255 |
Hgene | CASC3 | chr17:38325699 | chr17:46133747 | ENST00000264645 | + | 12 | 14 | 392_395 | 696 | 1130.6666666666667 | Compositional bias | Note=Poly-Pro |
Hgene | CASC3 | chr17:38325699 | chr17:46133747 | ENST00000264645 | + | 12 | 14 | 41_46 | 696 | 1130.6666666666667 | Compositional bias | Note=Poly-Gly |
Hgene | CASC3 | chr17:38325699 | chr17:46133747 | ENST00000264645 | + | 12 | 14 | 425_428 | 696 | 1130.6666666666667 | Compositional bias | Note=Poly-Pro |
Hgene | CASC3 | chr17:38325699 | chr17:46133747 | ENST00000264645 | + | 12 | 14 | 643_648 | 696 | 1130.6666666666667 | Compositional bias | Note=Poly-Pro |
Hgene | CASC3 | chr17:38325699 | chr17:46133747 | ENST00000264645 | + | 12 | 14 | 692_695 | 696 | 1130.6666666666667 | Compositional bias | Note=Poly-Pro |
Hgene | CASC3 | chr17:38325699 | chr17:46133747 | ENST00000264645 | + | 12 | 14 | 204_210 | 696 | 1130.6666666666667 | Motif | Nuclear localization signal 1 |
Hgene | CASC3 | chr17:38325699 | chr17:46133747 | ENST00000264645 | + | 12 | 14 | 254_262 | 696 | 1130.6666666666667 | Motif | Nuclear localization signal 2 |
Hgene | CASC3 | chr17:38325699 | chr17:46133747 | ENST00000264645 | + | 12 | 14 | 462_466 | 696 | 1130.6666666666667 | Motif | Note=Nuclear export signal |
Hgene | CASC3 | chr17:38325699 | chr17:46133747 | ENST00000264645 | + | 12 | 14 | 137_283 | 696 | 1130.6666666666667 | Region | Note=Sufficient to form the EJC |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000357480 | 1 | 5 | 496_517 | 170 | 743.0 | Compositional bias | Note=Poly-Ser | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000362042 | 1 | 6 | 496_517 | 170 | 773.0 | Compositional bias | Note=Poly-Ser | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000585291 | 2 | 6 | 496_517 | 170 | 743.0 | Compositional bias | Note=Poly-Ser | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000357480 | 1 | 5 | 654_717 | 170 | 743.0 | Domain | bZIP | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000362042 | 1 | 6 | 654_717 | 170 | 773.0 | Domain | bZIP | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000585291 | 2 | 6 | 654_717 | 170 | 743.0 | Domain | bZIP | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000357480 | 1 | 5 | 476_480 | 170 | 743.0 | Motif | Destruction motif | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000362042 | 1 | 6 | 476_480 | 170 | 773.0 | Motif | Destruction motif | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000585291 | 2 | 6 | 476_480 | 170 | 743.0 | Motif | Destruction motif | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000357480 | 1 | 5 | 191_199 | 170 | 743.0 | Region | Cholesterol recognition/amino acid consensus (CRAC) region | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000357480 | 1 | 5 | 379_383 | 170 | 743.0 | Region | CPD | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000357480 | 1 | 5 | 656_675 | 170 | 743.0 | Region | Basic motif | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000357480 | 1 | 5 | 682_696 | 170 | 743.0 | Region | Leucine-zipper | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000362042 | 1 | 6 | 191_199 | 170 | 773.0 | Region | Cholesterol recognition/amino acid consensus (CRAC) region | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000362042 | 1 | 6 | 379_383 | 170 | 773.0 | Region | CPD | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000362042 | 1 | 6 | 656_675 | 170 | 773.0 | Region | Basic motif | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000362042 | 1 | 6 | 682_696 | 170 | 773.0 | Region | Leucine-zipper | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000585291 | 2 | 6 | 191_199 | 170 | 743.0 | Region | Cholesterol recognition/amino acid consensus (CRAC) region | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000585291 | 2 | 6 | 379_383 | 170 | 743.0 | Region | CPD | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000585291 | 2 | 6 | 656_675 | 170 | 743.0 | Region | Basic motif | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000585291 | 2 | 6 | 682_696 | 170 | 743.0 | Region | Leucine-zipper |
- In-frame and not-retained protein feature among the 13 regional features. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
Hgene | CASC3 | chr17:38325699 | chr17:46133747 | ENST00000264645 | + | 12 | 14 | 377_703 | 696 | 1130.6666666666667 | Region | Note=Necessary for localization in cytoplasmic stress granules |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000357480 | 1 | 5 | 125_288 | 170 | 743.0 | Compositional bias | Note=Asp/Glu-rich (acidic) | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000362042 | 1 | 6 | 125_288 | 170 | 773.0 | Compositional bias | Note=Asp/Glu-rich (acidic) | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000585291 | 2 | 6 | 125_288 | 170 | 743.0 | Compositional bias | Note=Asp/Glu-rich (acidic) | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000357480 | 1 | 5 | 7_24 | 170 | 743.0 | Transmembrane | Helical%3B Signal-anchor for type II membrane protein | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000362042 | 1 | 6 | 7_24 | 170 | 773.0 | Transmembrane | Helical%3B Signal-anchor for type II membrane protein | |
Tgene | NFE2L1 | chr17:38325699 | chr17:46133747 | ENST00000585291 | 2 | 6 | 7_24 | 170 | 743.0 | Transmembrane | Helical%3B Signal-anchor for type II membrane protein |
Top |
Fusion Gene Sequence for CASC3-NFE2L1 |
For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones. |
>13154_13154_1_CASC3-NFE2L1_CASC3_chr17_38325699_ENST00000264645_NFE2L1_chr17_46133747_ENST00000357480_length(transcript)=5871nt_BP=2314nt CACACACACACACACACACACACACACCCCAACACACACACACACACCCCAACACACACACACACACACACACACACACACACACACACA CACACACACACACACACAGCGGGATGGCCGAGCGCCGCACGCGTAGCACGCCGGGACTAGCTATCCAGCCTCCCAGCAGCCTCTGCGACG GGCGCGGTGCGTAAGTACCTCGCCGGTGGTGGCCGTTCTCCGTAAGATGGCGGACCGGCGGCGGCAGCGCGCTTCGCAAGACACCGAGGA CGAGGAATCTGGTGCTTCGGGCTCCGACAGCGGCGGCTCCCCGTTGCGGGGAGGCGGGAGCTGCAGCGGTAGCGCCGGAGGCGGCGGCAG CGGCTCTCTGCCTTCACAGCGCGGAGGCCGAACCGGGGCCCTTCATCTGCGGCGGGTGGAGAGCGGGGGCGCCAAGAGTGCTGAGGAGTC GGAGTGTGAGAGTGAAGATGGCATTGAAGGTGATGCTGTTCTCTCGGATTATGAAAGTGCAGAAGACTCGGAAGGTGAAGAAGGTGAATA CAGTGAAGAGGAAAACTCCAAAGTGGAGCTGAAATCAGAAGCTAATGATGCTGTTAATTCTTCAACAAAAGAAGAGAAGGGAGAAGAAAA GCCTGACACCAAAAGCACTGTGACTGGAGAGAGGCAAAGTGGGGACGGACAGGAGAGCACAGAGCCTGTGGAGAACAAAGTGGGTAAAAA GGGCCCTAAGCATTTGGATGATGATGAAGATCGGAAGAATCCAGCATACATACCTCGGAAAGGGCTCTTCTTTGAGCATGATCTTCGAGG GCAAACTCAGGAGGAGGAAGTCAGACCCAAGGGGCGTCAGCGAAAGCTATGGAAGGATGAGGGTCGCTGGGAGCATGACAAGTTCCGGGA AGATGAGCAGGCCCCAAAGTCCCGACAGGAGCTCATTGCTCTTTATGGTTATGACATTCGCTCAGCTCATAATCCTGATGACATCAAACC TCGAAGAATCCGGAAACCCCGATATGGGAGTCCTCCACAAAGAGATCCAAACTGGAACGGTGAGCGGCTAAACAAGTCTCATCGCCACCA GGGTCTTGGGGGCACCCTACCACCAAGGACATTTATTAACAGGAATGCTGCAGGTACCGGCCGTATGTCTGCACCCAGGAATTATTCTCG ATCTGGGGGCTTCAAGGAAGGTCGTGCTGGTTTTAGGCCTGTGGAAGCTGGTGGGCAGCATGGTGGCCGGTCTGGTGAGACTGTTAAGCA TGAGATTAGTTACCGGTCACGGCGCCTAGAGCAGACTTCTGTGAGGGATCCATCTCCAGAAGCAGATGCTCCAGTGCTTGGCAGTCCTGA GAAGGAAGAGGCAGCCTCAGAGCCACCAGCTGCTGCTCCTGATGCTGCACCACCACCCCCTGATAGGCCCATTGAGAAGAAATCCTATTC CCGGGCAAGAAGAACTCGAACCAAAGTTGGAGATGCAGTCAAGCTTGCAGAGGAGGTGCCCCCTCCTCCTGAAGGACTGATTCCAGCACC TCCAGTCCCAGAAACCACCCCAACTCCACCTACTAAGACTGGGACCTGGGAAGCTCCGGTGGATTCTAGTACAAGTGGACTTGAGCAAGA TGTGGCACAACTAAATATAGCAGAACAGAATTGGAGTCCGGGGCAGCCTTCTTTCCTGCAACCACGGGAACTTCGAGGTATGCCCAACCA TATACACATGGGAGCAGGACCTCCACCTCAGTTTAACCGGATGGAAGAAATGGGTGTCCAGGGTGGTCGAGCCAAACGCTATTCATCCCA GCGGCAAAGACCTGTGCCAGAGCCCCCCGCCCCTCCAGTGCATATCAGTATCATGGAGGGACATTACTATGATCCACTGCAGTTCCAGGG ACCAATCTATACCCATGGTGACAGCCCTGCCCCGCTGCCTCCACAGGGCATGCTTGTGCAGCCAGGAATGAACCTTCCCCACCCAGGTTT ACATCCCCACCAGACACCAGCTCCTCTGCCCAATCCAGGCCTCTATCCCCCACCAGTGTCCATGTCTCCAGGACAGCCACCACCTCAGCA GTTGCTTGCTCCTACTTACTTTTCTGCTCCAGGCGTCATGAACTTTGGTAATCCCAGTTACCCTTATGCTCCAGGGGCACTGCCTCCCCC ACCACCGCCTCATCTGTATCCTAATACACAGGCCCCATCACAGGTATATGGAGGAGTGACCTACTATAACCCCGCCCAGCAGCAGGTGCA GCCAAAGCCCTCCCCACCCCGGAGGACTCCCCAGCCAGTCACCATCAAGCCCCCTCCACCTGAGGACATAGATCTGATTGACATCCTTTG GCGACAGGATATTGATCTGGGGGCTGGGCGTGAGGTTTTTGACTATAGTCACCGCCAGAAGGAGCAGGATGTGGAGAAGGAGCTGCGAGA TGGAGGCGAGCAGGACACCTGGGCAGGCGAGGGCGCGGAAGCTCTGGCACGGAACCTGCTAGTGGATGGAGAGACTGGGGAGAGCTTCCC TGCACAGTTTCCAGCAGACATTTCCAGCATAACAGAAGCAGTGCCTAGTGAGAGTGAGCCCCCTGCTCTTCAAAACAACCTCTTGTCTCC TCTTCTGACCGGGACAGAGTCACCATTTGATTTGGAACAGCAGTGGCAAGATCTCATGTCCATCATGGAAATGCAGGCCATGGAAGTGAA CACATCAGCAAGTGAAATCCTGTACAGTGCCCCTCCTGGAGACCCACTGAGCACCAACTACAGCCTTGCCCCCAACACTCCCATCAATCA GAATGTCAGCCTGCATCAGGCGTCCCTGGGGGGCTGCAGCCAGGACTTCTTACTCTTCAGCCCCGAGGTGGAAAGCCTGCCTGTGGCCAG TAGCTCCACGCTGCTCCCGTTGGCCCCCAGCAATTCTACCAGCCTCAACTCCACCTTCGGCTCCACCAACCTGACAGGGCTCTTCTTTCC ACCCCAGCTCAATGGCACAGCCAATGACACAGCAGGCCCAGAGCTGCCTGACCCTTTGGGGGGTCTGTTAGATGAAGCTATGTTGGATGA GATCAGCCTTATGGACCTGGCCATTGAAGAAGGCTTTAACCCTGTGCAGGCCTCCCAGCTGGAGGAGGAATTTGACTCTGACTCAGGCCT TTCCTTAGACTCGAGCCATAGCCCTTCTTCCCTAAGCAGCTCTGAAGGCAGTTCTTCCTCTTCTTCCTCCTCCTCTTCCTCTTCTTCCTC TGCTTCTTCCTCTGCCTCTTCCTCCTTTTCTGAGGAAGGTGCGGTTGGCTACAGCTCTGACTCTGAGACCCTGGATCTGGAAGAGGCCGA GGGTGCTGTGGGCTACCAGCCTGAGTATTCCAAGTTCTGCCGCATGAGCTACCAGGATCCAGCTCAGCTCTCATGCCTGCCCTACCTGGA GCACGTGGGCCACAACCACACATACAACATGGCACCCAGTGCCCTGGACTCAGCCGACCTGCCACCACCCAGTGCCCTCAAGAAAGGCAG CAAGGAGAAGCAGGCTGACTTCCTGGACAAGCAGATGAGCCGGGATGAGCACCGAGCCCGAGCCATGAAGATCCCTTTCACCAATGACAA AATCATCAACCTGCCTGTGGAGGAGTTCAATGAACTGCTGTCCAAATACCAGTTGAGTGAAGCCCAGCTGAGCCTCATCCGAGACATCCG GCGCCGGGGCAAGAACAAGATGGCGGCGCAGAACTGCCGCAAGCGCAAGCTGGACACCATCCTGAATCTGGAGCGTGATGTGGAGGACCT GCAGCGTGACAAAGCCCGGCTGCTGCGGGAGAAAGTGGAGTTCCTGCGCTCCCTGCGACAGATGAAGCAGAAGGTCCAGAGCCTGTACCA GGAGGTGTTTGGGCGGCTGCGAGATGAGAACGGACGACCCTACTCGCCCAGTCAGTATGCGCTCCAGTACGCCGGGGACGGCAGTGTCCT CCTCATCCCCCGCACGATGGCCGACCAGCAGGCCCGGCGGCAGGAGAGGAAGCCAAAGGACCGGAGAAAGTGAGCCTGGGGAAGAAGGGG GTTTGAAGCCCACCAAGACCGAAACTGGAGAAGGGCTGGACCTGGACCTGGACCTGGACCTACAGCGGGGACTTAAATGCCTTCTTATCC AATATATCTTCTCAGATGGGATGACTGCGGGTCAGTGTACAGGAAGAGGCAGGCACTGGCTGGCTCAGCTCCACTCGGGTGGAGTGGAAG TGGCCAGACCATTTAGACGGACAGGGTCCTCACCCTACCCCTTTCCTGTGAGGCAGGGGTGGTGGTGGAGTTGCTGGAGGTAGAGGAGCT ATGTGGAGCAAAGGCCGACAGAGGGGAAGGAATGGACCTGTGAGAGGAAGGGAAGGTGGCAGAAAGTCTCATTTCAGGAAGGAGGGATAG AAGGAAGGAAGGAAGGAACCCCCCCCCCCCCGAAAAAAAAATCAAAGCGGGAAGAAAATCAGAGGGAAGGTTAAGGTTGGCTCTGGCCAG GATTCCAGGCAGCAGGTTGGAGTGACTGGTGGGCCTAGATCACTGGTGTGATAAACCCCAATTTTCACCCCGGGGGGGGTGGGGTACACA GACACAGGGTGGGGGTGGGGAGGGACGGTGTTAACTCTTTCTGCTCCTTGCATTTTGACATCCCTGAAGGGGAGCTCTTGGATATCATTG GCCATGTTTCAATCGAATGGAGCCACTGGGCCCCAACACTGGCTTTGAGATTTAGAGTCAAAGGGTAGAGTGAACAGGAAAGGGTCACGT GGTCCCATGTTGCAACAGCCCCAACATCACGCATGTCATTCACTGCCTTGCCACTCCATCTCCCTCCGTGCTCCAGCCACCCCTGAGCTG AGGCTCCCATTGTCTCCATCAGAGCCTGCATGTGTATGCCGTCCTCCCCTGGTCCGGTGTTTGTGTTCCCCACCCCTCACAGACTGCCTG AGCTCTTCTGTAAGCTGGGGTAGGGTGATGGCAGTGCTCCGGGAACTGGGCCTGCAGCCTTCCTCTTCTGGGACTGCTGTGAGGCAGAGG AATGATGGAGAATCTAGTGTAGCAGCCTCCAGGCAGGATTCAGCACAACACTGGGGAGTCACCCTTCCCTCGGGCCTCTGCCTACCAACA ACTGGGCTTATCACTGGGAAAACACAAAAAATTACACAACCCAGCAACAACAAAAGAACTAGTCCTCTTAGAATTTCTTGCGCTTTGATT TTTTTAGGGCTTGTGCCCTGTTTCACTTATAGGGTCTAGAATGCTTGTGTTGAGTAAAAAGGAGATGCCCAATATTCAAAGCTGCTAAAT GTTCTCTTTGCCATAAAGACTCCGTGTAACTGTGTGAACACTTGGGATTTTTCTCCTCTGTCCCGAGGTCGTCGTCTGCTTTCTTTTTTG GGTTTCTTTCTAGAAGATTGAGAAGTGCATATGACAGGCTGAGAGCACCTCCCCAAACACACAAGCTCTCAGCCACAGGCAGCTTCTCCA CAGCCCCAGCTTCGCACAGGCTCCTGGAGGGCTGCCTGGGGGAGGCAGACATGGGAGTGCCAAGGTGGCCAGATGGTTCCAGGACTACAA TGTCTTTATTTTTAACTGTTTGCCACTGCTGCCCTCACCCCTGCCCGGCTCTGGAGTACCGTCTGCCCCAGACAAGTGGGAGTGAAATGG GGGTGGGGGGAAGCACTGATTCCCAGTTAGGGGGTGCCTAACTGAGCAGTAGGGATAGAAGGTGTGAACCTGGGAGTGCTTTTATAAATT ATTTTCCTTGTAGATTTTATTTTTAATTTATCTCTGTGACCTGCCAGGGAGAGGGGAGAGAGAGAGAGATGCTGTTGAGCACATGACAAA >13154_13154_1_CASC3-NFE2L1_CASC3_chr17_38325699_ENST00000264645_NFE2L1_chr17_46133747_ENST00000357480_length(amino acids)=1286AA_BP=996 MRRARCVSTSPVVAVLRKMADRRRQRASQDTEDEESGASGSDSGGSPLRGGGSCSGSAGGGGSGSLPSQRGGRTGALHLRRVESGGAKSA EESECESEDGIEGDAVLSDYESAEDSEGEEGEYSEEENSKVELKSEANDAVNSSTKEEKGEEKPDTKSTVTGERQSGDGQESTEPVENKV GKKGPKHLDDDEDRKNPAYIPRKGLFFEHDLRGQTQEEEVRPKGRQRKLWKDEGRWEHDKFREDEQAPKSRQELIALYGYDIRSAHNPDD IKPRRIRKPRYGSPPQRDPNWNGERLNKSHRHQGLGGTLPPRTFINRNAAGTGRMSAPRNYSRSGGFKEGRAGFRPVEAGGQHGGRSGET VKHEISYRSRRLEQTSVRDPSPEADAPVLGSPEKEEAASEPPAAAPDAAPPPPDRPIEKKSYSRARRTRTKVGDAVKLAEEVPPPPEGLI PAPPVPETTPTPPTKTGTWEAPVDSSTSGLEQDVAQLNIAEQNWSPGQPSFLQPRELRGMPNHIHMGAGPPPQFNRMEEMGVQGGRAKRY SSQRQRPVPEPPAPPVHISIMEGHYYDPLQFQGPIYTHGDSPAPLPPQGMLVQPGMNLPHPGLHPHQTPAPLPNPGLYPPPVSMSPGQPP PQQLLAPTYFSAPGVMNFGNPSYPYAPGALPPPPPPHLYPNTQAPSQVYGGVTYYNPAQQQVQPKPSPPRRTPQPVTIKPPPPEDIDLID ILWRQDIDLGAGREVFDYSHRQKEQDVEKELRDGGEQDTWAGEGAEALARNLLVDGETGESFPAQFPADISSITEAVPSESEPPALQNNL LSPLLTGTESPFDLEQQWQDLMSIMEMQAMEVNTSASEILYSAPPGDPLSTNYSLAPNTPINQNVSLHQASLGGCSQDFLLFSPEVESLP VASSSTLLPLAPSNSTSLNSTFGSTNLTGLFFPPQLNGTANDTAGPELPDPLGGLLDEAMLDEISLMDLAIEEGFNPVQASQLEEEFDSD SGLSLDSSHSPSSLSSSEGSSSSSSSSSSSSSSASSSASSSFSEEGAVGYSSDSETLDLEEAEGAVGYQPEYSKFCRMSYQDPAQLSCLP YLEHVGHNHTYNMAPSALDSADLPPPSALKKGSKEKQADFLDKQMSRDEHRARAMKIPFTNDKIINLPVEEFNELLSKYQLSEAQLSLIR DIRRRGKNKMAAQNCRKRKLDTILNLERDVEDLQRDKARLLREKVEFLRSLRQMKQKVQSLYQEVFGRLRDENGRPYSPSQYALQYAGDG -------------------------------------------------------------- >13154_13154_2_CASC3-NFE2L1_CASC3_chr17_38325699_ENST00000264645_NFE2L1_chr17_46133747_ENST00000362042_length(transcript)=5962nt_BP=2314nt CACACACACACACACACACACACACACCCCAACACACACACACACACCCCAACACACACACACACACACACACACACACACACACACACA CACACACACACACACACAGCGGGATGGCCGAGCGCCGCACGCGTAGCACGCCGGGACTAGCTATCCAGCCTCCCAGCAGCCTCTGCGACG GGCGCGGTGCGTAAGTACCTCGCCGGTGGTGGCCGTTCTCCGTAAGATGGCGGACCGGCGGCGGCAGCGCGCTTCGCAAGACACCGAGGA CGAGGAATCTGGTGCTTCGGGCTCCGACAGCGGCGGCTCCCCGTTGCGGGGAGGCGGGAGCTGCAGCGGTAGCGCCGGAGGCGGCGGCAG CGGCTCTCTGCCTTCACAGCGCGGAGGCCGAACCGGGGCCCTTCATCTGCGGCGGGTGGAGAGCGGGGGCGCCAAGAGTGCTGAGGAGTC GGAGTGTGAGAGTGAAGATGGCATTGAAGGTGATGCTGTTCTCTCGGATTATGAAAGTGCAGAAGACTCGGAAGGTGAAGAAGGTGAATA CAGTGAAGAGGAAAACTCCAAAGTGGAGCTGAAATCAGAAGCTAATGATGCTGTTAATTCTTCAACAAAAGAAGAGAAGGGAGAAGAAAA GCCTGACACCAAAAGCACTGTGACTGGAGAGAGGCAAAGTGGGGACGGACAGGAGAGCACAGAGCCTGTGGAGAACAAAGTGGGTAAAAA GGGCCCTAAGCATTTGGATGATGATGAAGATCGGAAGAATCCAGCATACATACCTCGGAAAGGGCTCTTCTTTGAGCATGATCTTCGAGG GCAAACTCAGGAGGAGGAAGTCAGACCCAAGGGGCGTCAGCGAAAGCTATGGAAGGATGAGGGTCGCTGGGAGCATGACAAGTTCCGGGA AGATGAGCAGGCCCCAAAGTCCCGACAGGAGCTCATTGCTCTTTATGGTTATGACATTCGCTCAGCTCATAATCCTGATGACATCAAACC TCGAAGAATCCGGAAACCCCGATATGGGAGTCCTCCACAAAGAGATCCAAACTGGAACGGTGAGCGGCTAAACAAGTCTCATCGCCACCA GGGTCTTGGGGGCACCCTACCACCAAGGACATTTATTAACAGGAATGCTGCAGGTACCGGCCGTATGTCTGCACCCAGGAATTATTCTCG ATCTGGGGGCTTCAAGGAAGGTCGTGCTGGTTTTAGGCCTGTGGAAGCTGGTGGGCAGCATGGTGGCCGGTCTGGTGAGACTGTTAAGCA TGAGATTAGTTACCGGTCACGGCGCCTAGAGCAGACTTCTGTGAGGGATCCATCTCCAGAAGCAGATGCTCCAGTGCTTGGCAGTCCTGA GAAGGAAGAGGCAGCCTCAGAGCCACCAGCTGCTGCTCCTGATGCTGCACCACCACCCCCTGATAGGCCCATTGAGAAGAAATCCTATTC CCGGGCAAGAAGAACTCGAACCAAAGTTGGAGATGCAGTCAAGCTTGCAGAGGAGGTGCCCCCTCCTCCTGAAGGACTGATTCCAGCACC TCCAGTCCCAGAAACCACCCCAACTCCACCTACTAAGACTGGGACCTGGGAAGCTCCGGTGGATTCTAGTACAAGTGGACTTGAGCAAGA TGTGGCACAACTAAATATAGCAGAACAGAATTGGAGTCCGGGGCAGCCTTCTTTCCTGCAACCACGGGAACTTCGAGGTATGCCCAACCA TATACACATGGGAGCAGGACCTCCACCTCAGTTTAACCGGATGGAAGAAATGGGTGTCCAGGGTGGTCGAGCCAAACGCTATTCATCCCA GCGGCAAAGACCTGTGCCAGAGCCCCCCGCCCCTCCAGTGCATATCAGTATCATGGAGGGACATTACTATGATCCACTGCAGTTCCAGGG ACCAATCTATACCCATGGTGACAGCCCTGCCCCGCTGCCTCCACAGGGCATGCTTGTGCAGCCAGGAATGAACCTTCCCCACCCAGGTTT ACATCCCCACCAGACACCAGCTCCTCTGCCCAATCCAGGCCTCTATCCCCCACCAGTGTCCATGTCTCCAGGACAGCCACCACCTCAGCA GTTGCTTGCTCCTACTTACTTTTCTGCTCCAGGCGTCATGAACTTTGGTAATCCCAGTTACCCTTATGCTCCAGGGGCACTGCCTCCCCC ACCACCGCCTCATCTGTATCCTAATACACAGGCCCCATCACAGGTATATGGAGGAGTGACCTACTATAACCCCGCCCAGCAGCAGGTGCA GCCAAAGCCCTCCCCACCCCGGAGGACTCCCCAGCCAGTCACCATCAAGCCCCCTCCACCTGAGGACATAGATCTGATTGACATCCTTTG GCGACAGGATATTGATCTGGGGGCTGGGCGTGAGGTTTTTGACTATAGTCACCGCCAGAAGGAGCAGGATGTGGAGAAGGAGCTGCGAGA TGGAGGCGAGCAGGACACCTGGGCAGGCGAGGGCGCGGAAGCTCTGGCACGGAACCTGCTAGTGGATGGAGAGACTGGGGAGAGCTTCCC TGCACAGGTGCCTAGTGGGGAGGACCAGACGGCCCTGTCCCTGGAAGAGTGCCTTAGGCTGCTGGAAGCCACCTGCCCCTTTGGGGAGAA TGCTGAGTTTCCAGCAGACATTTCCAGCATAACAGAAGCAGTGCCTAGTGAGAGTGAGCCCCCTGCTCTTCAAAACAACCTCTTGTCTCC TCTTCTGACCGGGACAGAGTCACCATTTGATTTGGAACAGCAGTGGCAAGATCTCATGTCCATCATGGAAATGCAGGCCATGGAAGTGAA CACATCAGCAAGTGAAATCCTGTACAGTGCCCCTCCTGGAGACCCACTGAGCACCAACTACAGCCTTGCCCCCAACACTCCCATCAATCA GAATGTCAGCCTGCATCAGGCGTCCCTGGGGGGCTGCAGCCAGGACTTCTTACTCTTCAGCCCCGAGGTGGAAAGCCTGCCTGTGGCCAG TAGCTCCACGCTGCTCCCGTTGGCCCCCAGCAATTCTACCAGCCTCAACTCCACCTTCGGCTCCACCAACCTGACAGGGCTCTTCTTTCC ACCCCAGCTCAATGGCACAGCCAATGACACAGCAGGCCCAGAGCTGCCTGACCCTTTGGGGGGTCTGTTAGATGAAGCTATGTTGGATGA GATCAGCCTTATGGACCTGGCCATTGAAGAAGGCTTTAACCCTGTGCAGGCCTCCCAGCTGGAGGAGGAATTTGACTCTGACTCAGGCCT TTCCTTAGACTCGAGCCATAGCCCTTCTTCCCTAAGCAGCTCTGAAGGCAGTTCTTCCTCTTCTTCCTCCTCCTCTTCCTCTTCTTCCTC TGCTTCTTCCTCTGCCTCTTCCTCCTTTTCTGAGGAAGGTGCGGTTGGCTACAGCTCTGACTCTGAGACCCTGGATCTGGAAGAGGCCGA GGGTGCTGTGGGCTACCAGCCTGAGTATTCCAAGTTCTGCCGCATGAGCTACCAGGATCCAGCTCAGCTCTCATGCCTGCCCTACCTGGA GCACGTGGGCCACAACCACACATACAACATGGCACCCAGTGCCCTGGACTCAGCCGACCTGCCACCACCCAGTGCCCTCAAGAAAGGCAG CAAGGAGAAGCAGGCTGACTTCCTGGACAAGCAGATGAGCCGGGATGAGCACCGAGCCCGAGCCATGAAGATCCCTTTCACCAATGACAA AATCATCAACCTGCCTGTGGAGGAGTTCAATGAACTGCTGTCCAAATACCAGTTGAGTGAAGCCCAGCTGAGCCTCATCCGAGACATCCG GCGCCGGGGCAAGAACAAGATGGCGGCGCAGAACTGCCGCAAGCGCAAGCTGGACACCATCCTGAATCTGGAGCGTGATGTGGAGGACCT GCAGCGTGACAAAGCCCGGCTGCTGCGGGAGAAAGTGGAGTTCCTGCGCTCCCTGCGACAGATGAAGCAGAAGGTCCAGAGCCTGTACCA GGAGGTGTTTGGGCGGCTGCGAGATGAGAACGGACGACCCTACTCGCCCAGTCAGTATGCGCTCCAGTACGCCGGGGACGGCAGTGTCCT CCTCATCCCCCGCACGATGGCCGACCAGCAGGCCCGGCGGCAGGAGAGGAAGCCAAAGGACCGGAGAAAGTGAGCCTGGGGAAGAAGGGG GTTTGAAGCCCACCAAGACCGAAACTGGAGAAGGGCTGGACCTGGACCTGGACCTGGACCTACAGCGGGGACTTAAATGCCTTCTTATCC AATATATCTTCTCAGATGGGATGACTGCGGGTCAGTGTACAGGAAGAGGCAGGCACTGGCTGGCTCAGCTCCACTCGGGTGGAGTGGAAG TGGCCAGACCATTTAGACGGACAGGGTCCTCACCCTACCCCTTTCCTGTGAGGCAGGGGTGGTGGTGGAGTTGCTGGAGGTAGAGGAGCT ATGTGGAGCAAAGGCCGACAGAGGGGAAGGAATGGACCTGTGAGAGGAAGGGAAGGTGGCAGAAAGTCTCATTTCAGGAAGGAGGGATAG AAGGAAGGAAGGAAGGAACCCCCCCCCCCCCGAAAAAAAAATCAAAGCGGGAAGAAAATCAGAGGGAAGGTTAAGGTTGGCTCTGGCCAG GATTCCAGGCAGCAGGTTGGAGTGACTGGTGGGCCTAGATCACTGGTGTGATAAACCCCAATTTTCACCCCGGGGGGGGTGGGGTACACA GACACAGGGTGGGGGTGGGGAGGGACGGTGTTAACTCTTTCTGCTCCTTGCATTTTGACATCCCTGAAGGGGAGCTCTTGGATATCATTG GCCATGTTTCAATCGAATGGAGCCACTGGGCCCCAACACTGGCTTTGAGATTTAGAGTCAAAGGGTAGAGTGAACAGGAAAGGGTCACGT GGTCCCATGTTGCAACAGCCCCAACATCACGCATGTCATTCACTGCCTTGCCACTCCATCTCCCTCCGTGCTCCAGCCACCCCTGAGCTG AGGCTCCCATTGTCTCCATCAGAGCCTGCATGTGTATGCCGTCCTCCCCTGGTCCGGTGTTTGTGTTCCCCACCCCTCACAGACTGCCTG AGCTCTTCTGTAAGCTGGGGTAGGGTGATGGCAGTGCTCCGGGAACTGGGCCTGCAGCCTTCCTCTTCTGGGACTGCTGTGAGGCAGAGG AATGATGGAGAATCTAGTGTAGCAGCCTCCAGGCAGGATTCAGCACAACACTGGGGAGTCACCCTTCCCTCGGGCCTCTGCCTACCAACA ACTGGGCTTATCACTGGGAAAACACAAAAAATTACACAACCCAGCAACAACAAAAGAACTAGTCCTCTTAGAATTTCTTGCGCTTTGATT TTTTTAGGGCTTGTGCCCTGTTTCACTTATAGGGTCTAGAATGCTTGTGTTGAGTAAAAAGGAGATGCCCAATATTCAAAGCTGCTAAAT GTTCTCTTTGCCATAAAGACTCCGTGTAACTGTGTGAACACTTGGGATTTTTCTCCTCTGTCCCGAGGTCGTCGTCTGCTTTCTTTTTTG GGTTTCTTTCTAGAAGATTGAGAAGTGCATATGACAGGCTGAGAGCACCTCCCCAAACACACAAGCTCTCAGCCACAGGCAGCTTCTCCA CAGCCCCAGCTTCGCACAGGCTCCTGGAGGGCTGCCTGGGGGAGGCAGACATGGGAGTGCCAAGGTGGCCAGATGGTTCCAGGACTACAA TGTCTTTATTTTTAACTGTTTGCCACTGCTGCCCTCACCCCTGCCCGGCTCTGGAGTACCGTCTGCCCCAGACAAGTGGGAGTGAAATGG GGGTGGGGGGAAGCACTGATTCCCAGTTAGGGGGTGCCTAACTGAGCAGTAGGGATAGAAGGTGTGAACCTGGGAGTGCTTTTATAAATT ATTTTCCTTGTAGATTTTATTTTTAATTTATCTCTGTGACCTGCCAGGGAGAGGGGAGAGAGAGAGAGATGCTGTTGAGCACATGACAAA >13154_13154_2_CASC3-NFE2L1_CASC3_chr17_38325699_ENST00000264645_NFE2L1_chr17_46133747_ENST00000362042_length(amino acids)=1316AA_BP=1026 MRRARCVSTSPVVAVLRKMADRRRQRASQDTEDEESGASGSDSGGSPLRGGGSCSGSAGGGGSGSLPSQRGGRTGALHLRRVESGGAKSA EESECESEDGIEGDAVLSDYESAEDSEGEEGEYSEEENSKVELKSEANDAVNSSTKEEKGEEKPDTKSTVTGERQSGDGQESTEPVENKV GKKGPKHLDDDEDRKNPAYIPRKGLFFEHDLRGQTQEEEVRPKGRQRKLWKDEGRWEHDKFREDEQAPKSRQELIALYGYDIRSAHNPDD IKPRRIRKPRYGSPPQRDPNWNGERLNKSHRHQGLGGTLPPRTFINRNAAGTGRMSAPRNYSRSGGFKEGRAGFRPVEAGGQHGGRSGET VKHEISYRSRRLEQTSVRDPSPEADAPVLGSPEKEEAASEPPAAAPDAAPPPPDRPIEKKSYSRARRTRTKVGDAVKLAEEVPPPPEGLI PAPPVPETTPTPPTKTGTWEAPVDSSTSGLEQDVAQLNIAEQNWSPGQPSFLQPRELRGMPNHIHMGAGPPPQFNRMEEMGVQGGRAKRY SSQRQRPVPEPPAPPVHISIMEGHYYDPLQFQGPIYTHGDSPAPLPPQGMLVQPGMNLPHPGLHPHQTPAPLPNPGLYPPPVSMSPGQPP PQQLLAPTYFSAPGVMNFGNPSYPYAPGALPPPPPPHLYPNTQAPSQVYGGVTYYNPAQQQVQPKPSPPRRTPQPVTIKPPPPEDIDLID ILWRQDIDLGAGREVFDYSHRQKEQDVEKELRDGGEQDTWAGEGAEALARNLLVDGETGESFPAQVPSGEDQTALSLEECLRLLEATCPF GENAEFPADISSITEAVPSESEPPALQNNLLSPLLTGTESPFDLEQQWQDLMSIMEMQAMEVNTSASEILYSAPPGDPLSTNYSLAPNTP INQNVSLHQASLGGCSQDFLLFSPEVESLPVASSSTLLPLAPSNSTSLNSTFGSTNLTGLFFPPQLNGTANDTAGPELPDPLGGLLDEAM LDEISLMDLAIEEGFNPVQASQLEEEFDSDSGLSLDSSHSPSSLSSSEGSSSSSSSSSSSSSSASSSASSSFSEEGAVGYSSDSETLDLE EAEGAVGYQPEYSKFCRMSYQDPAQLSCLPYLEHVGHNHTYNMAPSALDSADLPPPSALKKGSKEKQADFLDKQMSRDEHRARAMKIPFT NDKIINLPVEEFNELLSKYQLSEAQLSLIRDIRRRGKNKMAAQNCRKRKLDTILNLERDVEDLQRDKARLLREKVEFLRSLRQMKQKVQS -------------------------------------------------------------- >13154_13154_3_CASC3-NFE2L1_CASC3_chr17_38325699_ENST00000264645_NFE2L1_chr17_46133747_ENST00000585291_length(transcript)=5879nt_BP=2314nt CACACACACACACACACACACACACACCCCAACACACACACACACACCCCAACACACACACACACACACACACACACACACACACACACA CACACACACACACACACAGCGGGATGGCCGAGCGCCGCACGCGTAGCACGCCGGGACTAGCTATCCAGCCTCCCAGCAGCCTCTGCGACG GGCGCGGTGCGTAAGTACCTCGCCGGTGGTGGCCGTTCTCCGTAAGATGGCGGACCGGCGGCGGCAGCGCGCTTCGCAAGACACCGAGGA CGAGGAATCTGGTGCTTCGGGCTCCGACAGCGGCGGCTCCCCGTTGCGGGGAGGCGGGAGCTGCAGCGGTAGCGCCGGAGGCGGCGGCAG CGGCTCTCTGCCTTCACAGCGCGGAGGCCGAACCGGGGCCCTTCATCTGCGGCGGGTGGAGAGCGGGGGCGCCAAGAGTGCTGAGGAGTC GGAGTGTGAGAGTGAAGATGGCATTGAAGGTGATGCTGTTCTCTCGGATTATGAAAGTGCAGAAGACTCGGAAGGTGAAGAAGGTGAATA CAGTGAAGAGGAAAACTCCAAAGTGGAGCTGAAATCAGAAGCTAATGATGCTGTTAATTCTTCAACAAAAGAAGAGAAGGGAGAAGAAAA GCCTGACACCAAAAGCACTGTGACTGGAGAGAGGCAAAGTGGGGACGGACAGGAGAGCACAGAGCCTGTGGAGAACAAAGTGGGTAAAAA GGGCCCTAAGCATTTGGATGATGATGAAGATCGGAAGAATCCAGCATACATACCTCGGAAAGGGCTCTTCTTTGAGCATGATCTTCGAGG GCAAACTCAGGAGGAGGAAGTCAGACCCAAGGGGCGTCAGCGAAAGCTATGGAAGGATGAGGGTCGCTGGGAGCATGACAAGTTCCGGGA AGATGAGCAGGCCCCAAAGTCCCGACAGGAGCTCATTGCTCTTTATGGTTATGACATTCGCTCAGCTCATAATCCTGATGACATCAAACC TCGAAGAATCCGGAAACCCCGATATGGGAGTCCTCCACAAAGAGATCCAAACTGGAACGGTGAGCGGCTAAACAAGTCTCATCGCCACCA GGGTCTTGGGGGCACCCTACCACCAAGGACATTTATTAACAGGAATGCTGCAGGTACCGGCCGTATGTCTGCACCCAGGAATTATTCTCG ATCTGGGGGCTTCAAGGAAGGTCGTGCTGGTTTTAGGCCTGTGGAAGCTGGTGGGCAGCATGGTGGCCGGTCTGGTGAGACTGTTAAGCA TGAGATTAGTTACCGGTCACGGCGCCTAGAGCAGACTTCTGTGAGGGATCCATCTCCAGAAGCAGATGCTCCAGTGCTTGGCAGTCCTGA GAAGGAAGAGGCAGCCTCAGAGCCACCAGCTGCTGCTCCTGATGCTGCACCACCACCCCCTGATAGGCCCATTGAGAAGAAATCCTATTC CCGGGCAAGAAGAACTCGAACCAAAGTTGGAGATGCAGTCAAGCTTGCAGAGGAGGTGCCCCCTCCTCCTGAAGGACTGATTCCAGCACC TCCAGTCCCAGAAACCACCCCAACTCCACCTACTAAGACTGGGACCTGGGAAGCTCCGGTGGATTCTAGTACAAGTGGACTTGAGCAAGA TGTGGCACAACTAAATATAGCAGAACAGAATTGGAGTCCGGGGCAGCCTTCTTTCCTGCAACCACGGGAACTTCGAGGTATGCCCAACCA TATACACATGGGAGCAGGACCTCCACCTCAGTTTAACCGGATGGAAGAAATGGGTGTCCAGGGTGGTCGAGCCAAACGCTATTCATCCCA GCGGCAAAGACCTGTGCCAGAGCCCCCCGCCCCTCCAGTGCATATCAGTATCATGGAGGGACATTACTATGATCCACTGCAGTTCCAGGG ACCAATCTATACCCATGGTGACAGCCCTGCCCCGCTGCCTCCACAGGGCATGCTTGTGCAGCCAGGAATGAACCTTCCCCACCCAGGTTT ACATCCCCACCAGACACCAGCTCCTCTGCCCAATCCAGGCCTCTATCCCCCACCAGTGTCCATGTCTCCAGGACAGCCACCACCTCAGCA GTTGCTTGCTCCTACTTACTTTTCTGCTCCAGGCGTCATGAACTTTGGTAATCCCAGTTACCCTTATGCTCCAGGGGCACTGCCTCCCCC ACCACCGCCTCATCTGTATCCTAATACACAGGCCCCATCACAGGTATATGGAGGAGTGACCTACTATAACCCCGCCCAGCAGCAGGTGCA GCCAAAGCCCTCCCCACCCCGGAGGACTCCCCAGCCAGTCACCATCAAGCCCCCTCCACCTGAGGACATAGATCTGATTGACATCCTTTG GCGACAGGATATTGATCTGGGGGCTGGGCGTGAGGTTTTTGACTATAGTCACCGCCAGAAGGAGCAGGATGTGGAGAAGGAGCTGCGAGA TGGAGGCGAGCAGGACACCTGGGCAGGCGAGGGCGCGGAAGCTCTGGCACGGAACCTGCTAGTGGATGGAGAGACTGGGGAGAGCTTCCC TGCACAGTTTCCAGCAGACATTTCCAGCATAACAGAAGCAGTGCCTAGTGAGAGTGAGCCCCCTGCTCTTCAAAACAACCTCTTGTCTCC TCTTCTGACCGGGACAGAGTCACCATTTGATTTGGAACAGCAGTGGCAAGATCTCATGTCCATCATGGAAATGCAGGCCATGGAAGTGAA CACATCAGCAAGTGAAATCCTGTACAGTGCCCCTCCTGGAGACCCACTGAGCACCAACTACAGCCTTGCCCCCAACACTCCCATCAATCA GAATGTCAGCCTGCATCAGGCGTCCCTGGGGGGCTGCAGCCAGGACTTCTTACTCTTCAGCCCCGAGGTGGAAAGCCTGCCTGTGGCCAG TAGCTCCACGCTGCTCCCGTTGGCCCCCAGCAATTCTACCAGCCTCAACTCCACCTTCGGCTCCACCAACCTGACAGGGCTCTTCTTTCC ACCCCAGCTCAATGGCACAGCCAATGACACAGCAGGCCCAGAGCTGCCTGACCCTTTGGGGGGTCTGTTAGATGAAGCTATGTTGGATGA GATCAGCCTTATGGACCTGGCCATTGAAGAAGGCTTTAACCCTGTGCAGGCCTCCCAGCTGGAGGAGGAATTTGACTCTGACTCAGGCCT TTCCTTAGACTCGAGCCATAGCCCTTCTTCCCTAAGCAGCTCTGAAGGCAGTTCTTCCTCTTCTTCCTCCTCCTCTTCCTCTTCTTCCTC TGCTTCTTCCTCTGCCTCTTCCTCCTTTTCTGAGGAAGGTGCGGTTGGCTACAGCTCTGACTCTGAGACCCTGGATCTGGAAGAGGCCGA GGGTGCTGTGGGCTACCAGCCTGAGTATTCCAAGTTCTGCCGCATGAGCTACCAGGATCCAGCTCAGCTCTCATGCCTGCCCTACCTGGA GCACGTGGGCCACAACCACACATACAACATGGCACCCAGTGCCCTGGACTCAGCCGACCTGCCACCACCCAGTGCCCTCAAGAAAGGCAG CAAGGAGAAGCAGGCTGACTTCCTGGACAAGCAGATGAGCCGGGATGAGCACCGAGCCCGAGCCATGAAGATCCCTTTCACCAATGACAA AATCATCAACCTGCCTGTGGAGGAGTTCAATGAACTGCTGTCCAAATACCAGTTGAGTGAAGCCCAGCTGAGCCTCATCCGAGACATCCG GCGCCGGGGCAAGAACAAGATGGCGGCGCAGAACTGCCGCAAGCGCAAGCTGGACACCATCCTGAATCTGGAGCGTGATGTGGAGGACCT GCAGCGTGACAAAGCCCGGCTGCTGCGGGAGAAAGTGGAGTTCCTGCGCTCCCTGCGACAGATGAAGCAGAAGGTCCAGAGCCTGTACCA GGAGGTGTTTGGGCGGCTGCGAGATGAGAACGGACGACCCTACTCGCCCAGTCAGTATGCGCTCCAGTACGCCGGGGACGGCAGTGTCCT CCTCATCCCCCGCACGATGGCCGACCAGCAGGCCCGGCGGCAGGAGAGGAAGCCAAAGGACCGGAGAAAGTGAGCCTGGGGAAGAAGGGG GTTTGAAGCCCACCAAGACCGAAACTGGAGAAGGGCTGGACCTGGACCTGGACCTGGACCTACAGCGGGGACTTAAATGCCTTCTTATCC AATATATCTTCTCAGATGGGATGACTGCGGGTCAGTGTACAGGAAGAGGCAGGCACTGGCTGGCTCAGCTCCACTCGGGTGGAGTGGAAG TGGCCAGACCATTTAGACGGACAGGGTCCTCACCCTACCCCTTTCCTGTGAGGCAGGGGTGGTGGTGGAGTTGCTGGAGGTAGAGGAGCT ATGTGGAGCAAAGGCCGACAGAGGGGAAGGAATGGACCTGTGAGAGGAAGGGAAGGTGGCAGAAAGTCTCATTTCAGGAAGGAGGGATAG AAGGAAGGAAGGAAGGAACCCCCCCCCCCCCGAAAAAAAAATCAAAGCGGGAAGAAAATCAGAGGGAAGGTTAAGGTTGGCTCTGGCCAG GATTCCAGGCAGCAGGTTGGAGTGACTGGTGGGCCTAGATCACTGGTGTGATAAACCCCAATTTTCACCCCGGGGGGGGTGGGGTACACA GACACAGGGTGGGGGTGGGGAGGGACGGTGTTAACTCTTTCTGCTCCTTGCATTTTGACATCCCTGAAGGGGAGCTCTTGGATATCATTG GCCATGTTTCAATCGAATGGAGCCACTGGGCCCCAACACTGGCTTTGAGATTTAGAGTCAAAGGGTAGAGTGAACAGGAAAGGGTCACGT GGTCCCATGTTGCAACAGCCCCAACATCACGCATGTCATTCACTGCCTTGCCACTCCATCTCCCTCCGTGCTCCAGCCACCCCTGAGCTG AGGCTCCCATTGTCTCCATCAGAGCCTGCATGTGTATGCCGTCCTCCCCTGGTCCGGTGTTTGTGTTCCCCACCCCTCACAGACTGCCTG AGCTCTTCTGTAAGCTGGGGTAGGGTGATGGCAGTGCTCCGGGAACTGGGCCTGCAGCCTTCCTCTTCTGGGACTGCTGTGAGGCAGAGG AATGATGGAGAATCTAGTGTAGCAGCCTCCAGGCAGGATTCAGCACAACACTGGGGAGTCACCCTTCCCTCGGGCCTCTGCCTACCAACA ACTGGGCTTATCACTGGGAAAACACAAAAAATTACACAACCCAGCAACAACAAAAGAACTAGTCCTCTTAGAATTTCTTGCGCTTTGATT TTTTTAGGGCTTGTGCCCTGTTTCACTTATAGGGTCTAGAATGCTTGTGTTGAGTAAAAAGGAGATGCCCAATATTCAAAGCTGCTAAAT GTTCTCTTTGCCATAAAGACTCCGTGTAACTGTGTGAACACTTGGGATTTTTCTCCTCTGTCCCGAGGTCGTCGTCTGCTTTCTTTTTTG GGTTTCTTTCTAGAAGATTGAGAAGTGCATATGACAGGCTGAGAGCACCTCCCCAAACACACAAGCTCTCAGCCACAGGCAGCTTCTCCA CAGCCCCAGCTTCGCACAGGCTCCTGGAGGGCTGCCTGGGGGAGGCAGACATGGGAGTGCCAAGGTGGCCAGATGGTTCCAGGACTACAA TGTCTTTATTTTTAACTGTTTGCCACTGCTGCCCTCACCCCTGCCCGGCTCTGGAGTACCGTCTGCCCCAGACAAGTGGGAGTGAAATGG GGGTGGGGGGAAGCACTGATTCCCAGTTAGGGGGTGCCTAACTGAGCAGTAGGGATAGAAGGTGTGAACCTGGGAGTGCTTTTATAAATT ATTTTCCTTGTAGATTTTATTTTTAATTTATCTCTGTGACCTGCCAGGGAGAGGGGAGAGAGAGAGAGATGCTGTTGAGCACATGACAAA >13154_13154_3_CASC3-NFE2L1_CASC3_chr17_38325699_ENST00000264645_NFE2L1_chr17_46133747_ENST00000585291_length(amino acids)=1286AA_BP=996 MRRARCVSTSPVVAVLRKMADRRRQRASQDTEDEESGASGSDSGGSPLRGGGSCSGSAGGGGSGSLPSQRGGRTGALHLRRVESGGAKSA EESECESEDGIEGDAVLSDYESAEDSEGEEGEYSEEENSKVELKSEANDAVNSSTKEEKGEEKPDTKSTVTGERQSGDGQESTEPVENKV GKKGPKHLDDDEDRKNPAYIPRKGLFFEHDLRGQTQEEEVRPKGRQRKLWKDEGRWEHDKFREDEQAPKSRQELIALYGYDIRSAHNPDD IKPRRIRKPRYGSPPQRDPNWNGERLNKSHRHQGLGGTLPPRTFINRNAAGTGRMSAPRNYSRSGGFKEGRAGFRPVEAGGQHGGRSGET VKHEISYRSRRLEQTSVRDPSPEADAPVLGSPEKEEAASEPPAAAPDAAPPPPDRPIEKKSYSRARRTRTKVGDAVKLAEEVPPPPEGLI PAPPVPETTPTPPTKTGTWEAPVDSSTSGLEQDVAQLNIAEQNWSPGQPSFLQPRELRGMPNHIHMGAGPPPQFNRMEEMGVQGGRAKRY SSQRQRPVPEPPAPPVHISIMEGHYYDPLQFQGPIYTHGDSPAPLPPQGMLVQPGMNLPHPGLHPHQTPAPLPNPGLYPPPVSMSPGQPP PQQLLAPTYFSAPGVMNFGNPSYPYAPGALPPPPPPHLYPNTQAPSQVYGGVTYYNPAQQQVQPKPSPPRRTPQPVTIKPPPPEDIDLID ILWRQDIDLGAGREVFDYSHRQKEQDVEKELRDGGEQDTWAGEGAEALARNLLVDGETGESFPAQFPADISSITEAVPSESEPPALQNNL LSPLLTGTESPFDLEQQWQDLMSIMEMQAMEVNTSASEILYSAPPGDPLSTNYSLAPNTPINQNVSLHQASLGGCSQDFLLFSPEVESLP VASSSTLLPLAPSNSTSLNSTFGSTNLTGLFFPPQLNGTANDTAGPELPDPLGGLLDEAMLDEISLMDLAIEEGFNPVQASQLEEEFDSD SGLSLDSSHSPSSLSSSEGSSSSSSSSSSSSSSASSSASSSFSEEGAVGYSSDSETLDLEEAEGAVGYQPEYSKFCRMSYQDPAQLSCLP YLEHVGHNHTYNMAPSALDSADLPPPSALKKGSKEKQADFLDKQMSRDEHRARAMKIPFTNDKIINLPVEEFNELLSKYQLSEAQLSLIR DIRRRGKNKMAAQNCRKRKLDTILNLERDVEDLQRDKARLLREKVEFLRSLRQMKQKVQSLYQEVFGRLRDENGRPYSPSQYALQYAGDG -------------------------------------------------------------- |
Top |
Fusion Gene PPI Analysis for CASC3-NFE2L1 |
Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in |
Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160) |
Hgene | Hgene's interactors | Tgene | Tgene's interactors |
- Retained PPIs in in-frame fusion. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with |
- Lost PPIs in in-frame fusion. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with |
- Retained PPIs, but lost function due to frame-shift fusion. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with |
Top |
Related Drugs for CASC3-NFE2L1 |
Drugs targeting genes involved in this fusion gene. (DrugBank Version 5.1.8 2021-05-08) |
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status |
Top |
Related Diseases for CASC3-NFE2L1 |
Diseases associated with fusion partners. (DisGeNet 4.0) |
Partner | Gene | Disease ID | Disease name | # pubmeds | Source |