FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:CUX1-YEATS2 (FusionGDB2 ID:20746)

Fusion Gene Summary for CUX1-YEATS2

check button Fusion gene summary
Fusion gene informationFusion gene name: CUX1-YEATS2
Fusion gene ID: 20746
HgeneTgene
Gene symbol

CUX1

YEATS2

Gene ID

1523

55689

Gene namecut like homeobox 1YEATS domain containing 2
SynonymsCASP|CDP|CDP/Cut|CDP1|COY1|CUTL1|CUX|Clox|Cux/CDP|GDDI|GOLIM6|Nbla10317|p100|p110|p200|p75FAME4
Cytomap

7q22.1

3q27.1

Type of geneprotein-codingprotein-coding
Descriptionprotein CASPHomeobox protein cut-like 1CCAAT displacement proteinCUX1 gene Alternatively Spliced Productcut homologgolgi integral membrane protein 6homeobox protein cux-1putative protein product of Nbla10317YEATS domain-containing protein 2
Modification date2020032020200313
UniProtAcc

P39880

.
Ensembl transtripts involved in fusion geneENST00000292538, ENST00000360264, 
ENST00000393824, ENST00000425244, 
ENST00000437600, ENST00000547394, 
ENST00000292535, ENST00000546411, 
ENST00000549414, ENST00000550008, 
ENST00000556210, ENST00000560541, 
ENST00000305135, 
Fusion gene scores* DoF score32 X 26 X 13=108169 X 8 X 5=360
# samples 379
** MAII scorelog2(37/10816*10)=-4.86949797576587
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(9/360*10)=-2
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: CUX1 [Title/Abstract] AND YEATS2 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointCUX1(101459373)-YEATS2(183490093), # samples:2
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
TgeneYEATS2

GO:0000122

negative regulation of transcription by RNA polymerase II

18838386

TgeneYEATS2

GO:0043966

histone H3 acetylation

18838386

TgeneYEATS2

GO:0045892

negative regulation of transcription, DNA-templated

18838386


check buttonFusion gene breakpoints across CUX1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across YEATS2 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4PRADTCGA-ZG-A9L1-01ACUX1chr7

101459373

-YEATS2chr3

183490093

+
ChimerDB4PRADTCGA-ZG-A9L1-01ACUX1chr7

101459373

+YEATS2chr3

183490093

+


Top

Fusion Gene ORF analysis for CUX1-YEATS2

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
In-frameENST00000292538ENST00000305135CUX1chr7

101459373

+YEATS2chr3

183490093

+
In-frameENST00000360264ENST00000305135CUX1chr7

101459373

+YEATS2chr3

183490093

+
In-frameENST00000393824ENST00000305135CUX1chr7

101459373

+YEATS2chr3

183490093

+
In-frameENST00000425244ENST00000305135CUX1chr7

101459373

+YEATS2chr3

183490093

+
In-frameENST00000437600ENST00000305135CUX1chr7

101459373

+YEATS2chr3

183490093

+
In-frameENST00000547394ENST00000305135CUX1chr7

101459373

+YEATS2chr3

183490093

+
intron-3CDSENST00000292535ENST00000305135CUX1chr7

101459373

+YEATS2chr3

183490093

+
intron-3CDSENST00000546411ENST00000305135CUX1chr7

101459373

+YEATS2chr3

183490093

+
intron-3CDSENST00000549414ENST00000305135CUX1chr7

101459373

+YEATS2chr3

183490093

+
intron-3CDSENST00000550008ENST00000305135CUX1chr7

101459373

+YEATS2chr3

183490093

+
intron-3CDSENST00000556210ENST00000305135CUX1chr7

101459373

+YEATS2chr3

183490093

+
intron-3CDSENST00000560541ENST00000305135CUX1chr7

101459373

+YEATS2chr3

183490093

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000437600CUX1chr7101459373+ENST00000305135YEATS2chr3183490093+47794153522736794
ENST00000292538CUX1chr7101459373+ENST00000305135YEATS2chr3183490093+445389262410794
ENST00000393824CUX1chr7101459373+ENST00000305135YEATS2chr3183490093+445086232407794
ENST00000547394CUX1chr7101459373+ENST00000305135YEATS2chr3183490093+444985222406794
ENST00000360264CUX1chr7101459373+ENST00000305135YEATS2chr3183490093+444783202404794
ENST00000425244CUX1chr7101459373+ENST00000305135YEATS2chr3183490093+443773102394794

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000437600ENST00000305135CUX1chr7101459373+YEATS2chr3183490093+0.0056760380.99432397
ENST00000292538ENST00000305135CUX1chr7101459373+YEATS2chr3183490093+0.0047098940.99529016
ENST00000393824ENST00000305135CUX1chr7101459373+YEATS2chr3183490093+0.0047123590.99528766
ENST00000547394ENST00000305135CUX1chr7101459373+YEATS2chr3183490093+0.0047000450.99529994
ENST00000360264ENST00000305135CUX1chr7101459373+YEATS2chr3183490093+0.0046761870.99532384
ENST00000425244ENST00000305135CUX1chr7101459373+YEATS2chr3183490093+0.0047266710.9952734

Top

Fusion Genomic Features for CUX1-YEATS2


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
CUX1chr7101459373+YEATS2chr3183490092+0.00042630.99957365
CUX1chr7101459373+YEATS2chr3183490092+0.00042630.99957365

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.
genomic feature of top 1%

Top

Fusion Protein Features for CUX1-YEATS2


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr7:101459373/chr3:183490093)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
CUX1

P39880

.
FUNCTION: Transcription factor involved in the control of neuronal differentiation in the brain. Regulates dendrite development and branching, and dendritic spine formation in cortical layers II-III. Also involved in the control of synaptogenesis. In addition, it has probably a broad role in mammalian development as a repressor of developmentally regulated gene expression. May act by preventing binding of positively-activing CCAAT factors to promoters. Component of nf-munr repressor; binds to the matrix attachment regions (MARs) (5' and 3') of the immunoglobulin heavy chain enhancer. Represses T-cell receptor (TCR) beta enhancer function by binding to MARbeta, an ATC-rich DNA sequence located upstream of the TCR beta enhancer. Binds to the TH enhancer; may require the basic helix-loop-helix protein TCF4 as a coactivator. {ECO:0000250|UniProtKB:P53564}.; FUNCTION: [CDP/Cux p110]: Plays a role in cell cycle progression, in particular at the G1/S transition. As cells progress into S phase, a fraction of CUX1 molecules is proteolytically processed into N-terminally truncated proteins of 110 kDa. While CUX1 only transiently binds to DNA and carries the CCAAT-displacement activity, CDP/Cux p110 makes a stable interaction with DNA and stimulates expression of genes such as POLA1. {ECO:0000269|PubMed:15099520}.FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneYEATS2chr7:101459373chr3:183490093ENST000003051351431794_8426491423.0Compositional biasNote=Gly-rich

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneCUX1chr7:101459373chr3:183490093ENST00000292535+12456_40701506.0Coiled coilOntology_term=ECO:0000255
HgeneCUX1chr7:101459373chr3:183490093ENST00000292538+123502_55621679.0Coiled coilOntology_term=ECO:0000255
HgeneCUX1chr7:101459373chr3:183490093ENST00000292538+12367_45021679.0Coiled coilOntology_term=ECO:0000255
HgeneCUX1chr7:101459373chr3:183490093ENST00000360264+12456_407211517.0Coiled coilOntology_term=ECO:0000255
HgeneCUX1chr7:101459373chr3:183490093ENST00000393824+122502_55621640.0Coiled coilOntology_term=ECO:0000255
HgeneCUX1chr7:101459373chr3:183490093ENST00000393824+12267_45021640.0Coiled coilOntology_term=ECO:0000255
HgeneCUX1chr7:101459373chr3:183490093ENST00000425244+12256_40721633.0Coiled coilOntology_term=ECO:0000255
HgeneCUX1chr7:101459373chr3:183490093ENST00000437600+123502_55621677.0Coiled coilOntology_term=ECO:0000255
HgeneCUX1chr7:101459373chr3:183490093ENST00000437600+12367_45021677.0Coiled coilOntology_term=ECO:0000255
HgeneCUX1chr7:101459373chr3:183490093ENST00000546411+12456_40701404.0Coiled coilOntology_term=ECO:0000255
HgeneCUX1chr7:101459373chr3:183490093ENST00000547394+122502_55621663.0Coiled coilOntology_term=ECO:0000255
HgeneCUX1chr7:101459373chr3:183490093ENST00000547394+12267_45021663.0Coiled coilOntology_term=ECO:0000255
HgeneCUX1chr7:101459373chr3:183490093ENST00000549414+12356_40701484.0Coiled coilOntology_term=ECO:0000255
HgeneCUX1chr7:101459373chr3:183490093ENST00000550008+12256_40701450.0Coiled coilOntology_term=ECO:0000255
HgeneCUX1chr7:101459373chr3:183490093ENST00000556210+12256_40701348.0Coiled coilOntology_term=ECO:0000255
HgeneCUX1chr7:101459373chr3:183490093ENST00000292535+1241406_143801506.0Compositional biasNote=Ala-rich
HgeneCUX1chr7:101459373chr3:183490093ENST00000360264+1241406_1438211517.0Compositional biasNote=Ala-rich
HgeneCUX1chr7:101459373chr3:183490093ENST00000425244+1221406_143821633.0Compositional biasNote=Ala-rich
HgeneCUX1chr7:101459373chr3:183490093ENST00000546411+1241406_143801404.0Compositional biasNote=Ala-rich
HgeneCUX1chr7:101459373chr3:183490093ENST00000549414+1231406_143801484.0Compositional biasNote=Ala-rich
HgeneCUX1chr7:101459373chr3:183490093ENST00000550008+1221406_143801450.0Compositional biasNote=Ala-rich
HgeneCUX1chr7:101459373chr3:183490093ENST00000556210+1221406_143801348.0Compositional biasNote=Ala-rich
HgeneCUX1chr7:101459373chr3:183490093ENST00000292535+1241117_120401506.0DNA bindingCUT 3
HgeneCUX1chr7:101459373chr3:183490093ENST00000292535+1241244_130301506.0DNA bindingHomeobox
HgeneCUX1chr7:101459373chr3:183490093ENST00000292535+124542_62901506.0DNA bindingCUT 1
HgeneCUX1chr7:101459373chr3:183490093ENST00000292535+124934_102101506.0DNA bindingCUT 2
HgeneCUX1chr7:101459373chr3:183490093ENST00000360264+1241117_1204211517.0DNA bindingCUT 3
HgeneCUX1chr7:101459373chr3:183490093ENST00000360264+1241244_1303211517.0DNA bindingHomeobox
HgeneCUX1chr7:101459373chr3:183490093ENST00000360264+124542_629211517.0DNA bindingCUT 1
HgeneCUX1chr7:101459373chr3:183490093ENST00000360264+124934_1021211517.0DNA bindingCUT 2
HgeneCUX1chr7:101459373chr3:183490093ENST00000425244+1221117_120421633.0DNA bindingCUT 3
HgeneCUX1chr7:101459373chr3:183490093ENST00000425244+1221244_130321633.0DNA bindingHomeobox
HgeneCUX1chr7:101459373chr3:183490093ENST00000425244+122542_62921633.0DNA bindingCUT 1
HgeneCUX1chr7:101459373chr3:183490093ENST00000425244+122934_102121633.0DNA bindingCUT 2
HgeneCUX1chr7:101459373chr3:183490093ENST00000546411+1241117_120401404.0DNA bindingCUT 3
HgeneCUX1chr7:101459373chr3:183490093ENST00000546411+1241244_130301404.0DNA bindingHomeobox
HgeneCUX1chr7:101459373chr3:183490093ENST00000546411+124542_62901404.0DNA bindingCUT 1
HgeneCUX1chr7:101459373chr3:183490093ENST00000546411+124934_102101404.0DNA bindingCUT 2
HgeneCUX1chr7:101459373chr3:183490093ENST00000549414+1231117_120401484.0DNA bindingCUT 3
HgeneCUX1chr7:101459373chr3:183490093ENST00000549414+1231244_130301484.0DNA bindingHomeobox
HgeneCUX1chr7:101459373chr3:183490093ENST00000549414+123542_62901484.0DNA bindingCUT 1
HgeneCUX1chr7:101459373chr3:183490093ENST00000549414+123934_102101484.0DNA bindingCUT 2
HgeneCUX1chr7:101459373chr3:183490093ENST00000550008+1221117_120401450.0DNA bindingCUT 3
HgeneCUX1chr7:101459373chr3:183490093ENST00000550008+1221244_130301450.0DNA bindingHomeobox
HgeneCUX1chr7:101459373chr3:183490093ENST00000550008+122542_62901450.0DNA bindingCUT 1
HgeneCUX1chr7:101459373chr3:183490093ENST00000550008+122934_102101450.0DNA bindingCUT 2
HgeneCUX1chr7:101459373chr3:183490093ENST00000556210+1221117_120401348.0DNA bindingCUT 3
HgeneCUX1chr7:101459373chr3:183490093ENST00000556210+1221244_130301348.0DNA bindingHomeobox
HgeneCUX1chr7:101459373chr3:183490093ENST00000556210+122542_62901348.0DNA bindingCUT 1
HgeneCUX1chr7:101459373chr3:183490093ENST00000556210+122934_102101348.0DNA bindingCUT 2
HgeneCUX1chr7:101459373chr3:183490093ENST00000292538+1231_61921679.0Topological domainCytoplasmic
HgeneCUX1chr7:101459373chr3:183490093ENST00000292538+123641_67821679.0Topological domainLumenal
HgeneCUX1chr7:101459373chr3:183490093ENST00000393824+1221_61921640.0Topological domainCytoplasmic
HgeneCUX1chr7:101459373chr3:183490093ENST00000393824+122641_67821640.0Topological domainLumenal
HgeneCUX1chr7:101459373chr3:183490093ENST00000437600+1231_61921677.0Topological domainCytoplasmic
HgeneCUX1chr7:101459373chr3:183490093ENST00000437600+123641_67821677.0Topological domainLumenal
HgeneCUX1chr7:101459373chr3:183490093ENST00000547394+1221_61921663.0Topological domainCytoplasmic
HgeneCUX1chr7:101459373chr3:183490093ENST00000547394+122641_67821663.0Topological domainLumenal
HgeneCUX1chr7:101459373chr3:183490093ENST00000292538+123620_64021679.0TransmembraneHelical%3B Anchor for type IV membrane protein
HgeneCUX1chr7:101459373chr3:183490093ENST00000393824+122620_64021640.0TransmembraneHelical%3B Anchor for type IV membrane protein
HgeneCUX1chr7:101459373chr3:183490093ENST00000437600+123620_64021677.0TransmembraneHelical%3B Anchor for type IV membrane protein
HgeneCUX1chr7:101459373chr3:183490093ENST00000547394+122620_64021663.0TransmembraneHelical%3B Anchor for type IV membrane protein
TgeneYEATS2chr7:101459373chr3:183490093ENST00000305135143147_806491423.0Coiled coilOntology_term=ECO:0000255
TgeneYEATS2chr7:101459373chr3:183490093ENST000003051351431200_3456491423.0DomainYEATS
TgeneYEATS2chr7:101459373chr3:183490093ENST000003051351431259_2616491423.0RegionHistone H3K27cr binding
TgeneYEATS2chr7:101459373chr3:183490093ENST000003051351431282_2846491423.0RegionHistone H3K27cr binding


Top

Fusion Gene Sequence for CUX1-YEATS2


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>20746_20746_1_CUX1-YEATS2_CUX1_chr7_101459373_ENST00000292538_YEATS2_chr3_183490093_ENST00000305135_length(transcript)=4453nt_BP=89nt
CTCACTCCGTCTCAATATGTCTCAAGATGGCGGCCAATGTGGGATCGATGTTTCAATATTGGAAGCGCTTTGATTTACAGCAGCTGCAGG
TTGTCGGGGTACCAGTTGGGTCTGCTTTACCTTCAACAGTGAAGCAGGCTGTGGCGATCAGTGGTGGCCAGATCCTGGTAGCCAAGGCCA
GCTCTTCTGTCTCCAAAGCAGTTGGGCCAAAGCAAGTTGTAACCCAAGGAGTTGCCAAAGCAATTGTGAGTGGAGGTGGAGGAACCATTG
TTGCTCAGCCAGTGCAGACCTTAACCAAGGCCCAGGTTACTGCCGCTGGTCCTCAGAAGAGTGGATCCCAGGGTTCAGTAATGGCAACGT
TGCAGCTACCAGCCACTAATTTGGCCAACTTGGCAAATTTGCCTCCTGGCACTAAACTCTACCTAACTACAAACAGCAAGAACCCTTCAG
GAAAAGGAAAACTGCTGCTGATCCCTCAAGGAGCCATCCTGCGAGCTACGAACAATGCTAATCTCCAGTCTGGCTCAGCTGCCAGTGGTG
GGAGTGGTGCCGGAGGAGGAGGAGGAGGAGGAGGAGGAGGCGGCAGTGGCAGCGGTGGAGGCGGCAGCACAGGAGGAGGAGGAGGAACAG
CAGGAGGAGGAACTCAAAGTACTGCTGGCCCTGGAGGGATATCTCAGCACCTGACTTACACATCTTACATCCTCAAGCAAACTCCCCAGG
GCACATTTTTAGTTGGCCAGCCATCACCCCAGACTTCTGGAAAACAACTCACCACTGGGTCAGTGGTCCAAGGAACACTGGGAGTCAGCA
CATCTTCTGCACAAGGACAACAAACGCTAAAAGTCATCTCTGGACAGAAAACCACATTGTTTACACAGGCAGCCCATGGAGGACAGGCAT
CTCTAATGAAAATATCCGATAGCACCTTGAAGACTGTGCCAGCCACCTCACAGCTCTCGAAGCCTGGAACCACAATGCTGAGAGTAGCAG
GAGGGGTTATCACAACTGCCACTTCCCCTGCCGTGGCCCTCTCAGCAAACGGTCCTGCACAACAGTCTGAAGGAATGGCTCCCGTGTCTT
CATCTACGGTCAGTTCTGTAACGAAAACTTCTGGGCAGCAGCAAGTGTGTGTGAGCCAGGCCACCGTGGGAACCTGCAAGGCTGCCACCC
CCACCGTCGTCAGCGCCACGTCCCTCGTGCCTACACCAAACCCCATCTCTGGGAAAGCCACAGTATCCGGACTGTTAAAGATTCACTCCA
GTCAGTCCAGTCCGCAGCAGGCCGTCCTGACGATTCCCAGCCAGCTCAAACCACTCAGCGTAAACACATCTGGAGGGGTGCAGACGATCC
TGATGCCTGTGAATAAAGTGGTTCAGTCATTTTCTACCAGCAAGCCACCTGCCATTCTGCCTGTAGCTGCCCCAACTCCAGTTGTCCCCA
GCTCTGCTCCAGCAGCTGTTGCAAAAGTGAAGACTGAACCAGAAACACCTGGACCGAGTTGCCTCTCTCAGGAGGGTCAGACAGCAGTGA
AAACAGAAGAAAGTTCTGAGCTGGGAAACTATGTCATTAAGATAGACCATTTAGAAACTATCCAGCAACTCCTAACTGCAGTAGTAAAGA
AGATTCCATTAATCACTGCAAAAAGTGAAGATGCCAGCTGCTTTTCTGCAAAGTCTGTGGAGCAGTACTATGGCTGGAACATTGGAAAAA
GGAGAGCCGCTGAGTGGCAAAGAGCAATGACAATGCGAAAAGTCTTACAAGAAATCCTGGAGAAGAATCCGAGATTTCACCACCTGACTC
CCCTCAAAACCAAGCACATCGCTCACTGGTGCCGCTGTCATGGCTACACCCCACCGGACCCTGAGAGCCTGAGGAATGACGGGGACTCCA
TCGAGGACGTGCTGACCCAGATCGACAGCGAGCCCGAGTGCCCATCATCATTCTCCTCTGCTGACAACCTCTGCCGCAAACTGGAGGACC
TGCAACAGTTCCAGAAAAGGGAACCCGAGAATGAGGAGGAGGTGGACATCCTCAGCCTCTCCGAGCCAGTGAAGATAAACATCAAGAAGG
AGCAGGAAGAGAAACAAGAGGAAGTCAAGTTCTACCTGCCACCAACCCCAGGGTCTGAATTTATTGGGGATGTCACACAGAAGATTGGGA
TCACCCTGCAGCCCGTGGCACTCCACAGGAACGTGTATGCGTCCGTGGTGGAGGACATGATCCTGAAGGCTACAGAACAGCTGGTGAATG
ATATCCTGAGACAGGCTTTGGCAGTTGGATACCAGACAGCTTCTCACAACAGGATTCCCAAAGAAATTACAGTGAGTAATATTCACCAGG
CCATTTGCAACATTCCTTTTCTGGACTTCCTCACAAACAAACACATGGGAATATTGAATGAGGACCAGTGAGCGGAGTGAGGTGCCCTGG
AGAAGCAGGCTTTGAAGGCACAGCGAAGCTGTAACTGAGGACCCTGCTGCTCGGGAAGGAGGTGGTTTCCAGTGTGACTCGGCATGTCAT
GGCTACCCAACCTTTGCCGCTGCCTGTTCCCACGTGTCACCAGCACGCTGCACTCCAGATGAAATCCTCCTAGGACAGGAGTTTGTTTCC
TGAGTGTGGAGTGAGGCTGTCAGTGGATCCGTGCTTTGTCGGCCAGCGTTTCTGCAGTCTTTGTAAAGGCCCCACGAGAGCGGGCCAGGC
CGTGTGCCTCAGGCCCTTCTCCCTGGGTGTGCTTAAGGGGGCTCCTTGGGCCCGCCTCCCCAGGAGGTAGAAAATGAGTGGCAGGCTAGA
GATTTCACCCATTTTGTGGGCTGGAGTTACCAGTAGCTCCAGCAGTTACCCTGAAGAGAGATTGGGCTTCAGCCTTCAGCAGGTGGTTCT
CTCCCATGCCTGGCCTTGGTGTGGAGGGGCTGTACTCTGAGCCCAAGTGAGTCAGCTATAGGAAGAGGCCATACCTAGAGCCAAGAACCA
TGAAGGCCTGAGAGACGGCAGACTGAGCAGAATTCCTTTTTTGAGCACGAGAGCATTACTAGAACCATTGTCAAAGCAGTGGCAAGGGAC
GGAGAGGTCCCAACAGGAGTCAGGAAGAGGTTTGATTATAACCAAGAAAACTCACTATGCTAGGAATAGACTGTGTGCACCAGTCCCAGA
CACTTGGCAGAAGTGTAGCAGCGTTACACATGTGTGCGAAGCAGATCGCAGGTTCCACGCCATCTGCATGGCCTGCAGGAGCTTCTGCTG
CTGACCCCATGCTGAGTGGCCAGTGGGGAGCGGCGCCCGGCAGGCTCTTCTGGGGTCGTCTGTCCTATCCGTGGATTGTATATACTCTTC
TCTGTTAAGGAGTTTTTCCCAAGAAGAAAAGTATTTAAAAGAAATACCAGTGAGTGCCTTAAAGTTGGAGAAGTAACTGCCCATGCCCAG
AAATAAGGATGCCAGTGCCCAGAAGCAGTGAGATTAGTCTGTGTCCACAAGCAGAGGCCCCCTCGATGGGAGGGAGTGGCAGGCAGGAGA
AGGTGGCGCTGCCAGGTGCCCGGGTCTATTGGAGGCGCCCCATCTCAGACTTCCTAACACAGCCTGTGTGGAAGGCAGAACAAAGAATGC
ATGCCCAGTCAGAAATCTGTTCTATTCTGCTCCAGGAAAATCGGAAACCTGTGAGTCAGAGTCAGAGAAACTTACCCAAGCAACGTAATT
CCTGTTTTCATGGGTCCTGTAGATGTTTGAGTCAGGAAGGTAAGGCGGGGAGTGACTGAATAAACTCTGCCTTTTAAATTGAGCATCTGG
GCCGGGCATGGTGGCTCACGCCTGTAATCCCAGCACTCTGGGAGGTCGAGGTGGGTGGGTCACCTGAGGTTGGGAGTTCGAGACCAGCCC
GACCAACATGGTGAAACCCCGTCTCTACTAAAAATACAGAAAATTAGCTGGGCATGGTGGTGTGTGCCTGTAATTCCAGCTACTCGGGAG
GCTGAGGCAGGAAGAATCACTTGAACCCAGGAGGCGGAGGTTGCAGTGTGCCAAGATCATACCACTGCACTCCAGCCCTGGTGACAGAGG
AGACCCCGTCTCAAAAATTGATTGATCAATTCAGCATCTGAGGGCTGCAAGTACAGAAGGAATCTATTCTCAGCAGGGCATAGGGCACGC
ACTGGCTTAACAGTTTAGTATATAAGGCTCAAATAGTCTATACCTGAACTGCTATAAGCAAGGTCGATAGGGAAGTGGATAGATTGCTTC
AGCAAAGTGAACTGTGAGATCTCCAGGACAGAGGGAGAAAGATCTGATCCAAATGAGAACAGATTGGTTATTGCAGGTATCACAGCCTAA
AGAAATTATCTTTTTGCAAAAGAAATATTAAATGATTTAGCAGTCTCCACGTGTGTTAATGTTTCAAACGTGTATCATAATGTGTATAAT

>20746_20746_1_CUX1-YEATS2_CUX1_chr7_101459373_ENST00000292538_YEATS2_chr3_183490093_ENST00000305135_length(amino acids)=794AA_BP=111
MAANVGSMFQYWKRFDLQQLQVVGVPVGSALPSTVKQAVAISGGQILVAKASSSVSKAVGPKQVVTQGVAKAIVSGGGGTIVAQPVQTLT
KAQVTAAGPQKSGSQGSVMATLQLPATNLANLANLPPGTKLYLTTNSKNPSGKGKLLLIPQGAILRATNNANLQSGSAASGGSGAGGGGG
GGGGGGSGSGGGGSTGGGGGTAGGGTQSTAGPGGISQHLTYTSYILKQTPQGTFLVGQPSPQTSGKQLTTGSVVQGTLGVSTSSAQGQQT
LKVISGQKTTLFTQAAHGGQASLMKISDSTLKTVPATSQLSKPGTTMLRVAGGVITTATSPAVALSANGPAQQSEGMAPVSSSTVSSVTK
TSGQQQVCVSQATVGTCKAATPTVVSATSLVPTPNPISGKATVSGLLKIHSSQSSPQQAVLTIPSQLKPLSVNTSGGVQTILMPVNKVVQ
SFSTSKPPAILPVAAPTPVVPSSAPAAVAKVKTEPETPGPSCLSQEGQTAVKTEESSELGNYVIKIDHLETIQQLLTAVVKKIPLITAKS
EDASCFSAKSVEQYYGWNIGKRRAAEWQRAMTMRKVLQEILEKNPRFHHLTPLKTKHIAHWCRCHGYTPPDPESLRNDGDSIEDVLTQID
SEPECPSSFSSADNLCRKLEDLQQFQKREPENEEEVDILSLSEPVKINIKKEQEEKQEEVKFYLPPTPGSEFIGDVTQKIGITLQPVALH

--------------------------------------------------------------
>20746_20746_2_CUX1-YEATS2_CUX1_chr7_101459373_ENST00000360264_YEATS2_chr3_183490093_ENST00000305135_length(transcript)=4447nt_BP=83nt
CCGTCTCAATATGTCTCAAGATGGCGGCCAATGTGGGATCGATGTTTCAATATTGGAAGCGCTTTGATTTACAGCAGCTGCAGGTTGTCG
GGGTACCAGTTGGGTCTGCTTTACCTTCAACAGTGAAGCAGGCTGTGGCGATCAGTGGTGGCCAGATCCTGGTAGCCAAGGCCAGCTCTT
CTGTCTCCAAAGCAGTTGGGCCAAAGCAAGTTGTAACCCAAGGAGTTGCCAAAGCAATTGTGAGTGGAGGTGGAGGAACCATTGTTGCTC
AGCCAGTGCAGACCTTAACCAAGGCCCAGGTTACTGCCGCTGGTCCTCAGAAGAGTGGATCCCAGGGTTCAGTAATGGCAACGTTGCAGC
TACCAGCCACTAATTTGGCCAACTTGGCAAATTTGCCTCCTGGCACTAAACTCTACCTAACTACAAACAGCAAGAACCCTTCAGGAAAAG
GAAAACTGCTGCTGATCCCTCAAGGAGCCATCCTGCGAGCTACGAACAATGCTAATCTCCAGTCTGGCTCAGCTGCCAGTGGTGGGAGTG
GTGCCGGAGGAGGAGGAGGAGGAGGAGGAGGAGGCGGCAGTGGCAGCGGTGGAGGCGGCAGCACAGGAGGAGGAGGAGGAACAGCAGGAG
GAGGAACTCAAAGTACTGCTGGCCCTGGAGGGATATCTCAGCACCTGACTTACACATCTTACATCCTCAAGCAAACTCCCCAGGGCACAT
TTTTAGTTGGCCAGCCATCACCCCAGACTTCTGGAAAACAACTCACCACTGGGTCAGTGGTCCAAGGAACACTGGGAGTCAGCACATCTT
CTGCACAAGGACAACAAACGCTAAAAGTCATCTCTGGACAGAAAACCACATTGTTTACACAGGCAGCCCATGGAGGACAGGCATCTCTAA
TGAAAATATCCGATAGCACCTTGAAGACTGTGCCAGCCACCTCACAGCTCTCGAAGCCTGGAACCACAATGCTGAGAGTAGCAGGAGGGG
TTATCACAACTGCCACTTCCCCTGCCGTGGCCCTCTCAGCAAACGGTCCTGCACAACAGTCTGAAGGAATGGCTCCCGTGTCTTCATCTA
CGGTCAGTTCTGTAACGAAAACTTCTGGGCAGCAGCAAGTGTGTGTGAGCCAGGCCACCGTGGGAACCTGCAAGGCTGCCACCCCCACCG
TCGTCAGCGCCACGTCCCTCGTGCCTACACCAAACCCCATCTCTGGGAAAGCCACAGTATCCGGACTGTTAAAGATTCACTCCAGTCAGT
CCAGTCCGCAGCAGGCCGTCCTGACGATTCCCAGCCAGCTCAAACCACTCAGCGTAAACACATCTGGAGGGGTGCAGACGATCCTGATGC
CTGTGAATAAAGTGGTTCAGTCATTTTCTACCAGCAAGCCACCTGCCATTCTGCCTGTAGCTGCCCCAACTCCAGTTGTCCCCAGCTCTG
CTCCAGCAGCTGTTGCAAAAGTGAAGACTGAACCAGAAACACCTGGACCGAGTTGCCTCTCTCAGGAGGGTCAGACAGCAGTGAAAACAG
AAGAAAGTTCTGAGCTGGGAAACTATGTCATTAAGATAGACCATTTAGAAACTATCCAGCAACTCCTAACTGCAGTAGTAAAGAAGATTC
CATTAATCACTGCAAAAAGTGAAGATGCCAGCTGCTTTTCTGCAAAGTCTGTGGAGCAGTACTATGGCTGGAACATTGGAAAAAGGAGAG
CCGCTGAGTGGCAAAGAGCAATGACAATGCGAAAAGTCTTACAAGAAATCCTGGAGAAGAATCCGAGATTTCACCACCTGACTCCCCTCA
AAACCAAGCACATCGCTCACTGGTGCCGCTGTCATGGCTACACCCCACCGGACCCTGAGAGCCTGAGGAATGACGGGGACTCCATCGAGG
ACGTGCTGACCCAGATCGACAGCGAGCCCGAGTGCCCATCATCATTCTCCTCTGCTGACAACCTCTGCCGCAAACTGGAGGACCTGCAAC
AGTTCCAGAAAAGGGAACCCGAGAATGAGGAGGAGGTGGACATCCTCAGCCTCTCCGAGCCAGTGAAGATAAACATCAAGAAGGAGCAGG
AAGAGAAACAAGAGGAAGTCAAGTTCTACCTGCCACCAACCCCAGGGTCTGAATTTATTGGGGATGTCACACAGAAGATTGGGATCACCC
TGCAGCCCGTGGCACTCCACAGGAACGTGTATGCGTCCGTGGTGGAGGACATGATCCTGAAGGCTACAGAACAGCTGGTGAATGATATCC
TGAGACAGGCTTTGGCAGTTGGATACCAGACAGCTTCTCACAACAGGATTCCCAAAGAAATTACAGTGAGTAATATTCACCAGGCCATTT
GCAACATTCCTTTTCTGGACTTCCTCACAAACAAACACATGGGAATATTGAATGAGGACCAGTGAGCGGAGTGAGGTGCCCTGGAGAAGC
AGGCTTTGAAGGCACAGCGAAGCTGTAACTGAGGACCCTGCTGCTCGGGAAGGAGGTGGTTTCCAGTGTGACTCGGCATGTCATGGCTAC
CCAACCTTTGCCGCTGCCTGTTCCCACGTGTCACCAGCACGCTGCACTCCAGATGAAATCCTCCTAGGACAGGAGTTTGTTTCCTGAGTG
TGGAGTGAGGCTGTCAGTGGATCCGTGCTTTGTCGGCCAGCGTTTCTGCAGTCTTTGTAAAGGCCCCACGAGAGCGGGCCAGGCCGTGTG
CCTCAGGCCCTTCTCCCTGGGTGTGCTTAAGGGGGCTCCTTGGGCCCGCCTCCCCAGGAGGTAGAAAATGAGTGGCAGGCTAGAGATTTC
ACCCATTTTGTGGGCTGGAGTTACCAGTAGCTCCAGCAGTTACCCTGAAGAGAGATTGGGCTTCAGCCTTCAGCAGGTGGTTCTCTCCCA
TGCCTGGCCTTGGTGTGGAGGGGCTGTACTCTGAGCCCAAGTGAGTCAGCTATAGGAAGAGGCCATACCTAGAGCCAAGAACCATGAAGG
CCTGAGAGACGGCAGACTGAGCAGAATTCCTTTTTTGAGCACGAGAGCATTACTAGAACCATTGTCAAAGCAGTGGCAAGGGACGGAGAG
GTCCCAACAGGAGTCAGGAAGAGGTTTGATTATAACCAAGAAAACTCACTATGCTAGGAATAGACTGTGTGCACCAGTCCCAGACACTTG
GCAGAAGTGTAGCAGCGTTACACATGTGTGCGAAGCAGATCGCAGGTTCCACGCCATCTGCATGGCCTGCAGGAGCTTCTGCTGCTGACC
CCATGCTGAGTGGCCAGTGGGGAGCGGCGCCCGGCAGGCTCTTCTGGGGTCGTCTGTCCTATCCGTGGATTGTATATACTCTTCTCTGTT
AAGGAGTTTTTCCCAAGAAGAAAAGTATTTAAAAGAAATACCAGTGAGTGCCTTAAAGTTGGAGAAGTAACTGCCCATGCCCAGAAATAA
GGATGCCAGTGCCCAGAAGCAGTGAGATTAGTCTGTGTCCACAAGCAGAGGCCCCCTCGATGGGAGGGAGTGGCAGGCAGGAGAAGGTGG
CGCTGCCAGGTGCCCGGGTCTATTGGAGGCGCCCCATCTCAGACTTCCTAACACAGCCTGTGTGGAAGGCAGAACAAAGAATGCATGCCC
AGTCAGAAATCTGTTCTATTCTGCTCCAGGAAAATCGGAAACCTGTGAGTCAGAGTCAGAGAAACTTACCCAAGCAACGTAATTCCTGTT
TTCATGGGTCCTGTAGATGTTTGAGTCAGGAAGGTAAGGCGGGGAGTGACTGAATAAACTCTGCCTTTTAAATTGAGCATCTGGGCCGGG
CATGGTGGCTCACGCCTGTAATCCCAGCACTCTGGGAGGTCGAGGTGGGTGGGTCACCTGAGGTTGGGAGTTCGAGACCAGCCCGACCAA
CATGGTGAAACCCCGTCTCTACTAAAAATACAGAAAATTAGCTGGGCATGGTGGTGTGTGCCTGTAATTCCAGCTACTCGGGAGGCTGAG
GCAGGAAGAATCACTTGAACCCAGGAGGCGGAGGTTGCAGTGTGCCAAGATCATACCACTGCACTCCAGCCCTGGTGACAGAGGAGACCC
CGTCTCAAAAATTGATTGATCAATTCAGCATCTGAGGGCTGCAAGTACAGAAGGAATCTATTCTCAGCAGGGCATAGGGCACGCACTGGC
TTAACAGTTTAGTATATAAGGCTCAAATAGTCTATACCTGAACTGCTATAAGCAAGGTCGATAGGGAAGTGGATAGATTGCTTCAGCAAA
GTGAACTGTGAGATCTCCAGGACAGAGGGAGAAAGATCTGATCCAAATGAGAACAGATTGGTTATTGCAGGTATCACAGCCTAAAGAAAT
TATCTTTTTGCAAAAGAAATATTAAATGATTTAGCAGTCTCCACGTGTGTTAATGTTTCAAACGTGTATCATAATGTGTATAATTGTGTA

>20746_20746_2_CUX1-YEATS2_CUX1_chr7_101459373_ENST00000360264_YEATS2_chr3_183490093_ENST00000305135_length(amino acids)=794AA_BP=111
MAANVGSMFQYWKRFDLQQLQVVGVPVGSALPSTVKQAVAISGGQILVAKASSSVSKAVGPKQVVTQGVAKAIVSGGGGTIVAQPVQTLT
KAQVTAAGPQKSGSQGSVMATLQLPATNLANLANLPPGTKLYLTTNSKNPSGKGKLLLIPQGAILRATNNANLQSGSAASGGSGAGGGGG
GGGGGGSGSGGGGSTGGGGGTAGGGTQSTAGPGGISQHLTYTSYILKQTPQGTFLVGQPSPQTSGKQLTTGSVVQGTLGVSTSSAQGQQT
LKVISGQKTTLFTQAAHGGQASLMKISDSTLKTVPATSQLSKPGTTMLRVAGGVITTATSPAVALSANGPAQQSEGMAPVSSSTVSSVTK
TSGQQQVCVSQATVGTCKAATPTVVSATSLVPTPNPISGKATVSGLLKIHSSQSSPQQAVLTIPSQLKPLSVNTSGGVQTILMPVNKVVQ
SFSTSKPPAILPVAAPTPVVPSSAPAAVAKVKTEPETPGPSCLSQEGQTAVKTEESSELGNYVIKIDHLETIQQLLTAVVKKIPLITAKS
EDASCFSAKSVEQYYGWNIGKRRAAEWQRAMTMRKVLQEILEKNPRFHHLTPLKTKHIAHWCRCHGYTPPDPESLRNDGDSIEDVLTQID
SEPECPSSFSSADNLCRKLEDLQQFQKREPENEEEVDILSLSEPVKINIKKEQEEKQEEVKFYLPPTPGSEFIGDVTQKIGITLQPVALH

--------------------------------------------------------------
>20746_20746_3_CUX1-YEATS2_CUX1_chr7_101459373_ENST00000393824_YEATS2_chr3_183490093_ENST00000305135_length(transcript)=4450nt_BP=86nt
ACTCCGTCTCAATATGTCTCAAGATGGCGGCCAATGTGGGATCGATGTTTCAATATTGGAAGCGCTTTGATTTACAGCAGCTGCAGGTTG
TCGGGGTACCAGTTGGGTCTGCTTTACCTTCAACAGTGAAGCAGGCTGTGGCGATCAGTGGTGGCCAGATCCTGGTAGCCAAGGCCAGCT
CTTCTGTCTCCAAAGCAGTTGGGCCAAAGCAAGTTGTAACCCAAGGAGTTGCCAAAGCAATTGTGAGTGGAGGTGGAGGAACCATTGTTG
CTCAGCCAGTGCAGACCTTAACCAAGGCCCAGGTTACTGCCGCTGGTCCTCAGAAGAGTGGATCCCAGGGTTCAGTAATGGCAACGTTGC
AGCTACCAGCCACTAATTTGGCCAACTTGGCAAATTTGCCTCCTGGCACTAAACTCTACCTAACTACAAACAGCAAGAACCCTTCAGGAA
AAGGAAAACTGCTGCTGATCCCTCAAGGAGCCATCCTGCGAGCTACGAACAATGCTAATCTCCAGTCTGGCTCAGCTGCCAGTGGTGGGA
GTGGTGCCGGAGGAGGAGGAGGAGGAGGAGGAGGAGGCGGCAGTGGCAGCGGTGGAGGCGGCAGCACAGGAGGAGGAGGAGGAACAGCAG
GAGGAGGAACTCAAAGTACTGCTGGCCCTGGAGGGATATCTCAGCACCTGACTTACACATCTTACATCCTCAAGCAAACTCCCCAGGGCA
CATTTTTAGTTGGCCAGCCATCACCCCAGACTTCTGGAAAACAACTCACCACTGGGTCAGTGGTCCAAGGAACACTGGGAGTCAGCACAT
CTTCTGCACAAGGACAACAAACGCTAAAAGTCATCTCTGGACAGAAAACCACATTGTTTACACAGGCAGCCCATGGAGGACAGGCATCTC
TAATGAAAATATCCGATAGCACCTTGAAGACTGTGCCAGCCACCTCACAGCTCTCGAAGCCTGGAACCACAATGCTGAGAGTAGCAGGAG
GGGTTATCACAACTGCCACTTCCCCTGCCGTGGCCCTCTCAGCAAACGGTCCTGCACAACAGTCTGAAGGAATGGCTCCCGTGTCTTCAT
CTACGGTCAGTTCTGTAACGAAAACTTCTGGGCAGCAGCAAGTGTGTGTGAGCCAGGCCACCGTGGGAACCTGCAAGGCTGCCACCCCCA
CCGTCGTCAGCGCCACGTCCCTCGTGCCTACACCAAACCCCATCTCTGGGAAAGCCACAGTATCCGGACTGTTAAAGATTCACTCCAGTC
AGTCCAGTCCGCAGCAGGCCGTCCTGACGATTCCCAGCCAGCTCAAACCACTCAGCGTAAACACATCTGGAGGGGTGCAGACGATCCTGA
TGCCTGTGAATAAAGTGGTTCAGTCATTTTCTACCAGCAAGCCACCTGCCATTCTGCCTGTAGCTGCCCCAACTCCAGTTGTCCCCAGCT
CTGCTCCAGCAGCTGTTGCAAAAGTGAAGACTGAACCAGAAACACCTGGACCGAGTTGCCTCTCTCAGGAGGGTCAGACAGCAGTGAAAA
CAGAAGAAAGTTCTGAGCTGGGAAACTATGTCATTAAGATAGACCATTTAGAAACTATCCAGCAACTCCTAACTGCAGTAGTAAAGAAGA
TTCCATTAATCACTGCAAAAAGTGAAGATGCCAGCTGCTTTTCTGCAAAGTCTGTGGAGCAGTACTATGGCTGGAACATTGGAAAAAGGA
GAGCCGCTGAGTGGCAAAGAGCAATGACAATGCGAAAAGTCTTACAAGAAATCCTGGAGAAGAATCCGAGATTTCACCACCTGACTCCCC
TCAAAACCAAGCACATCGCTCACTGGTGCCGCTGTCATGGCTACACCCCACCGGACCCTGAGAGCCTGAGGAATGACGGGGACTCCATCG
AGGACGTGCTGACCCAGATCGACAGCGAGCCCGAGTGCCCATCATCATTCTCCTCTGCTGACAACCTCTGCCGCAAACTGGAGGACCTGC
AACAGTTCCAGAAAAGGGAACCCGAGAATGAGGAGGAGGTGGACATCCTCAGCCTCTCCGAGCCAGTGAAGATAAACATCAAGAAGGAGC
AGGAAGAGAAACAAGAGGAAGTCAAGTTCTACCTGCCACCAACCCCAGGGTCTGAATTTATTGGGGATGTCACACAGAAGATTGGGATCA
CCCTGCAGCCCGTGGCACTCCACAGGAACGTGTATGCGTCCGTGGTGGAGGACATGATCCTGAAGGCTACAGAACAGCTGGTGAATGATA
TCCTGAGACAGGCTTTGGCAGTTGGATACCAGACAGCTTCTCACAACAGGATTCCCAAAGAAATTACAGTGAGTAATATTCACCAGGCCA
TTTGCAACATTCCTTTTCTGGACTTCCTCACAAACAAACACATGGGAATATTGAATGAGGACCAGTGAGCGGAGTGAGGTGCCCTGGAGA
AGCAGGCTTTGAAGGCACAGCGAAGCTGTAACTGAGGACCCTGCTGCTCGGGAAGGAGGTGGTTTCCAGTGTGACTCGGCATGTCATGGC
TACCCAACCTTTGCCGCTGCCTGTTCCCACGTGTCACCAGCACGCTGCACTCCAGATGAAATCCTCCTAGGACAGGAGTTTGTTTCCTGA
GTGTGGAGTGAGGCTGTCAGTGGATCCGTGCTTTGTCGGCCAGCGTTTCTGCAGTCTTTGTAAAGGCCCCACGAGAGCGGGCCAGGCCGT
GTGCCTCAGGCCCTTCTCCCTGGGTGTGCTTAAGGGGGCTCCTTGGGCCCGCCTCCCCAGGAGGTAGAAAATGAGTGGCAGGCTAGAGAT
TTCACCCATTTTGTGGGCTGGAGTTACCAGTAGCTCCAGCAGTTACCCTGAAGAGAGATTGGGCTTCAGCCTTCAGCAGGTGGTTCTCTC
CCATGCCTGGCCTTGGTGTGGAGGGGCTGTACTCTGAGCCCAAGTGAGTCAGCTATAGGAAGAGGCCATACCTAGAGCCAAGAACCATGA
AGGCCTGAGAGACGGCAGACTGAGCAGAATTCCTTTTTTGAGCACGAGAGCATTACTAGAACCATTGTCAAAGCAGTGGCAAGGGACGGA
GAGGTCCCAACAGGAGTCAGGAAGAGGTTTGATTATAACCAAGAAAACTCACTATGCTAGGAATAGACTGTGTGCACCAGTCCCAGACAC
TTGGCAGAAGTGTAGCAGCGTTACACATGTGTGCGAAGCAGATCGCAGGTTCCACGCCATCTGCATGGCCTGCAGGAGCTTCTGCTGCTG
ACCCCATGCTGAGTGGCCAGTGGGGAGCGGCGCCCGGCAGGCTCTTCTGGGGTCGTCTGTCCTATCCGTGGATTGTATATACTCTTCTCT
GTTAAGGAGTTTTTCCCAAGAAGAAAAGTATTTAAAAGAAATACCAGTGAGTGCCTTAAAGTTGGAGAAGTAACTGCCCATGCCCAGAAA
TAAGGATGCCAGTGCCCAGAAGCAGTGAGATTAGTCTGTGTCCACAAGCAGAGGCCCCCTCGATGGGAGGGAGTGGCAGGCAGGAGAAGG
TGGCGCTGCCAGGTGCCCGGGTCTATTGGAGGCGCCCCATCTCAGACTTCCTAACACAGCCTGTGTGGAAGGCAGAACAAAGAATGCATG
CCCAGTCAGAAATCTGTTCTATTCTGCTCCAGGAAAATCGGAAACCTGTGAGTCAGAGTCAGAGAAACTTACCCAAGCAACGTAATTCCT
GTTTTCATGGGTCCTGTAGATGTTTGAGTCAGGAAGGTAAGGCGGGGAGTGACTGAATAAACTCTGCCTTTTAAATTGAGCATCTGGGCC
GGGCATGGTGGCTCACGCCTGTAATCCCAGCACTCTGGGAGGTCGAGGTGGGTGGGTCACCTGAGGTTGGGAGTTCGAGACCAGCCCGAC
CAACATGGTGAAACCCCGTCTCTACTAAAAATACAGAAAATTAGCTGGGCATGGTGGTGTGTGCCTGTAATTCCAGCTACTCGGGAGGCT
GAGGCAGGAAGAATCACTTGAACCCAGGAGGCGGAGGTTGCAGTGTGCCAAGATCATACCACTGCACTCCAGCCCTGGTGACAGAGGAGA
CCCCGTCTCAAAAATTGATTGATCAATTCAGCATCTGAGGGCTGCAAGTACAGAAGGAATCTATTCTCAGCAGGGCATAGGGCACGCACT
GGCTTAACAGTTTAGTATATAAGGCTCAAATAGTCTATACCTGAACTGCTATAAGCAAGGTCGATAGGGAAGTGGATAGATTGCTTCAGC
AAAGTGAACTGTGAGATCTCCAGGACAGAGGGAGAAAGATCTGATCCAAATGAGAACAGATTGGTTATTGCAGGTATCACAGCCTAAAGA
AATTATCTTTTTGCAAAAGAAATATTAAATGATTTAGCAGTCTCCACGTGTGTTAATGTTTCAAACGTGTATCATAATGTGTATAATTGT

>20746_20746_3_CUX1-YEATS2_CUX1_chr7_101459373_ENST00000393824_YEATS2_chr3_183490093_ENST00000305135_length(amino acids)=794AA_BP=111
MAANVGSMFQYWKRFDLQQLQVVGVPVGSALPSTVKQAVAISGGQILVAKASSSVSKAVGPKQVVTQGVAKAIVSGGGGTIVAQPVQTLT
KAQVTAAGPQKSGSQGSVMATLQLPATNLANLANLPPGTKLYLTTNSKNPSGKGKLLLIPQGAILRATNNANLQSGSAASGGSGAGGGGG
GGGGGGSGSGGGGSTGGGGGTAGGGTQSTAGPGGISQHLTYTSYILKQTPQGTFLVGQPSPQTSGKQLTTGSVVQGTLGVSTSSAQGQQT
LKVISGQKTTLFTQAAHGGQASLMKISDSTLKTVPATSQLSKPGTTMLRVAGGVITTATSPAVALSANGPAQQSEGMAPVSSSTVSSVTK
TSGQQQVCVSQATVGTCKAATPTVVSATSLVPTPNPISGKATVSGLLKIHSSQSSPQQAVLTIPSQLKPLSVNTSGGVQTILMPVNKVVQ
SFSTSKPPAILPVAAPTPVVPSSAPAAVAKVKTEPETPGPSCLSQEGQTAVKTEESSELGNYVIKIDHLETIQQLLTAVVKKIPLITAKS
EDASCFSAKSVEQYYGWNIGKRRAAEWQRAMTMRKVLQEILEKNPRFHHLTPLKTKHIAHWCRCHGYTPPDPESLRNDGDSIEDVLTQID
SEPECPSSFSSADNLCRKLEDLQQFQKREPENEEEVDILSLSEPVKINIKKEQEEKQEEVKFYLPPTPGSEFIGDVTQKIGITLQPVALH

--------------------------------------------------------------
>20746_20746_4_CUX1-YEATS2_CUX1_chr7_101459373_ENST00000425244_YEATS2_chr3_183490093_ENST00000305135_length(transcript)=4437nt_BP=73nt
ATGTCTCAAGATGGCGGCCAATGTGGGATCGATGTTTCAATATTGGAAGCGCTTTGATTTACAGCAGCTGCAGGTTGTCGGGGTACCAGT
TGGGTCTGCTTTACCTTCAACAGTGAAGCAGGCTGTGGCGATCAGTGGTGGCCAGATCCTGGTAGCCAAGGCCAGCTCTTCTGTCTCCAA
AGCAGTTGGGCCAAAGCAAGTTGTAACCCAAGGAGTTGCCAAAGCAATTGTGAGTGGAGGTGGAGGAACCATTGTTGCTCAGCCAGTGCA
GACCTTAACCAAGGCCCAGGTTACTGCCGCTGGTCCTCAGAAGAGTGGATCCCAGGGTTCAGTAATGGCAACGTTGCAGCTACCAGCCAC
TAATTTGGCCAACTTGGCAAATTTGCCTCCTGGCACTAAACTCTACCTAACTACAAACAGCAAGAACCCTTCAGGAAAAGGAAAACTGCT
GCTGATCCCTCAAGGAGCCATCCTGCGAGCTACGAACAATGCTAATCTCCAGTCTGGCTCAGCTGCCAGTGGTGGGAGTGGTGCCGGAGG
AGGAGGAGGAGGAGGAGGAGGAGGCGGCAGTGGCAGCGGTGGAGGCGGCAGCACAGGAGGAGGAGGAGGAACAGCAGGAGGAGGAACTCA
AAGTACTGCTGGCCCTGGAGGGATATCTCAGCACCTGACTTACACATCTTACATCCTCAAGCAAACTCCCCAGGGCACATTTTTAGTTGG
CCAGCCATCACCCCAGACTTCTGGAAAACAACTCACCACTGGGTCAGTGGTCCAAGGAACACTGGGAGTCAGCACATCTTCTGCACAAGG
ACAACAAACGCTAAAAGTCATCTCTGGACAGAAAACCACATTGTTTACACAGGCAGCCCATGGAGGACAGGCATCTCTAATGAAAATATC
CGATAGCACCTTGAAGACTGTGCCAGCCACCTCACAGCTCTCGAAGCCTGGAACCACAATGCTGAGAGTAGCAGGAGGGGTTATCACAAC
TGCCACTTCCCCTGCCGTGGCCCTCTCAGCAAACGGTCCTGCACAACAGTCTGAAGGAATGGCTCCCGTGTCTTCATCTACGGTCAGTTC
TGTAACGAAAACTTCTGGGCAGCAGCAAGTGTGTGTGAGCCAGGCCACCGTGGGAACCTGCAAGGCTGCCACCCCCACCGTCGTCAGCGC
CACGTCCCTCGTGCCTACACCAAACCCCATCTCTGGGAAAGCCACAGTATCCGGACTGTTAAAGATTCACTCCAGTCAGTCCAGTCCGCA
GCAGGCCGTCCTGACGATTCCCAGCCAGCTCAAACCACTCAGCGTAAACACATCTGGAGGGGTGCAGACGATCCTGATGCCTGTGAATAA
AGTGGTTCAGTCATTTTCTACCAGCAAGCCACCTGCCATTCTGCCTGTAGCTGCCCCAACTCCAGTTGTCCCCAGCTCTGCTCCAGCAGC
TGTTGCAAAAGTGAAGACTGAACCAGAAACACCTGGACCGAGTTGCCTCTCTCAGGAGGGTCAGACAGCAGTGAAAACAGAAGAAAGTTC
TGAGCTGGGAAACTATGTCATTAAGATAGACCATTTAGAAACTATCCAGCAACTCCTAACTGCAGTAGTAAAGAAGATTCCATTAATCAC
TGCAAAAAGTGAAGATGCCAGCTGCTTTTCTGCAAAGTCTGTGGAGCAGTACTATGGCTGGAACATTGGAAAAAGGAGAGCCGCTGAGTG
GCAAAGAGCAATGACAATGCGAAAAGTCTTACAAGAAATCCTGGAGAAGAATCCGAGATTTCACCACCTGACTCCCCTCAAAACCAAGCA
CATCGCTCACTGGTGCCGCTGTCATGGCTACACCCCACCGGACCCTGAGAGCCTGAGGAATGACGGGGACTCCATCGAGGACGTGCTGAC
CCAGATCGACAGCGAGCCCGAGTGCCCATCATCATTCTCCTCTGCTGACAACCTCTGCCGCAAACTGGAGGACCTGCAACAGTTCCAGAA
AAGGGAACCCGAGAATGAGGAGGAGGTGGACATCCTCAGCCTCTCCGAGCCAGTGAAGATAAACATCAAGAAGGAGCAGGAAGAGAAACA
AGAGGAAGTCAAGTTCTACCTGCCACCAACCCCAGGGTCTGAATTTATTGGGGATGTCACACAGAAGATTGGGATCACCCTGCAGCCCGT
GGCACTCCACAGGAACGTGTATGCGTCCGTGGTGGAGGACATGATCCTGAAGGCTACAGAACAGCTGGTGAATGATATCCTGAGACAGGC
TTTGGCAGTTGGATACCAGACAGCTTCTCACAACAGGATTCCCAAAGAAATTACAGTGAGTAATATTCACCAGGCCATTTGCAACATTCC
TTTTCTGGACTTCCTCACAAACAAACACATGGGAATATTGAATGAGGACCAGTGAGCGGAGTGAGGTGCCCTGGAGAAGCAGGCTTTGAA
GGCACAGCGAAGCTGTAACTGAGGACCCTGCTGCTCGGGAAGGAGGTGGTTTCCAGTGTGACTCGGCATGTCATGGCTACCCAACCTTTG
CCGCTGCCTGTTCCCACGTGTCACCAGCACGCTGCACTCCAGATGAAATCCTCCTAGGACAGGAGTTTGTTTCCTGAGTGTGGAGTGAGG
CTGTCAGTGGATCCGTGCTTTGTCGGCCAGCGTTTCTGCAGTCTTTGTAAAGGCCCCACGAGAGCGGGCCAGGCCGTGTGCCTCAGGCCC
TTCTCCCTGGGTGTGCTTAAGGGGGCTCCTTGGGCCCGCCTCCCCAGGAGGTAGAAAATGAGTGGCAGGCTAGAGATTTCACCCATTTTG
TGGGCTGGAGTTACCAGTAGCTCCAGCAGTTACCCTGAAGAGAGATTGGGCTTCAGCCTTCAGCAGGTGGTTCTCTCCCATGCCTGGCCT
TGGTGTGGAGGGGCTGTACTCTGAGCCCAAGTGAGTCAGCTATAGGAAGAGGCCATACCTAGAGCCAAGAACCATGAAGGCCTGAGAGAC
GGCAGACTGAGCAGAATTCCTTTTTTGAGCACGAGAGCATTACTAGAACCATTGTCAAAGCAGTGGCAAGGGACGGAGAGGTCCCAACAG
GAGTCAGGAAGAGGTTTGATTATAACCAAGAAAACTCACTATGCTAGGAATAGACTGTGTGCACCAGTCCCAGACACTTGGCAGAAGTGT
AGCAGCGTTACACATGTGTGCGAAGCAGATCGCAGGTTCCACGCCATCTGCATGGCCTGCAGGAGCTTCTGCTGCTGACCCCATGCTGAG
TGGCCAGTGGGGAGCGGCGCCCGGCAGGCTCTTCTGGGGTCGTCTGTCCTATCCGTGGATTGTATATACTCTTCTCTGTTAAGGAGTTTT
TCCCAAGAAGAAAAGTATTTAAAAGAAATACCAGTGAGTGCCTTAAAGTTGGAGAAGTAACTGCCCATGCCCAGAAATAAGGATGCCAGT
GCCCAGAAGCAGTGAGATTAGTCTGTGTCCACAAGCAGAGGCCCCCTCGATGGGAGGGAGTGGCAGGCAGGAGAAGGTGGCGCTGCCAGG
TGCCCGGGTCTATTGGAGGCGCCCCATCTCAGACTTCCTAACACAGCCTGTGTGGAAGGCAGAACAAAGAATGCATGCCCAGTCAGAAAT
CTGTTCTATTCTGCTCCAGGAAAATCGGAAACCTGTGAGTCAGAGTCAGAGAAACTTACCCAAGCAACGTAATTCCTGTTTTCATGGGTC
CTGTAGATGTTTGAGTCAGGAAGGTAAGGCGGGGAGTGACTGAATAAACTCTGCCTTTTAAATTGAGCATCTGGGCCGGGCATGGTGGCT
CACGCCTGTAATCCCAGCACTCTGGGAGGTCGAGGTGGGTGGGTCACCTGAGGTTGGGAGTTCGAGACCAGCCCGACCAACATGGTGAAA
CCCCGTCTCTACTAAAAATACAGAAAATTAGCTGGGCATGGTGGTGTGTGCCTGTAATTCCAGCTACTCGGGAGGCTGAGGCAGGAAGAA
TCACTTGAACCCAGGAGGCGGAGGTTGCAGTGTGCCAAGATCATACCACTGCACTCCAGCCCTGGTGACAGAGGAGACCCCGTCTCAAAA
ATTGATTGATCAATTCAGCATCTGAGGGCTGCAAGTACAGAAGGAATCTATTCTCAGCAGGGCATAGGGCACGCACTGGCTTAACAGTTT
AGTATATAAGGCTCAAATAGTCTATACCTGAACTGCTATAAGCAAGGTCGATAGGGAAGTGGATAGATTGCTTCAGCAAAGTGAACTGTG
AGATCTCCAGGACAGAGGGAGAAAGATCTGATCCAAATGAGAACAGATTGGTTATTGCAGGTATCACAGCCTAAAGAAATTATCTTTTTG
CAAAAGAAATATTAAATGATTTAGCAGTCTCCACGTGTGTTAATGTTTCAAACGTGTATCATAATGTGTATAATTGTGTAACAAAATTGT

>20746_20746_4_CUX1-YEATS2_CUX1_chr7_101459373_ENST00000425244_YEATS2_chr3_183490093_ENST00000305135_length(amino acids)=794AA_BP=111
MAANVGSMFQYWKRFDLQQLQVVGVPVGSALPSTVKQAVAISGGQILVAKASSSVSKAVGPKQVVTQGVAKAIVSGGGGTIVAQPVQTLT
KAQVTAAGPQKSGSQGSVMATLQLPATNLANLANLPPGTKLYLTTNSKNPSGKGKLLLIPQGAILRATNNANLQSGSAASGGSGAGGGGG
GGGGGGSGSGGGGSTGGGGGTAGGGTQSTAGPGGISQHLTYTSYILKQTPQGTFLVGQPSPQTSGKQLTTGSVVQGTLGVSTSSAQGQQT
LKVISGQKTTLFTQAAHGGQASLMKISDSTLKTVPATSQLSKPGTTMLRVAGGVITTATSPAVALSANGPAQQSEGMAPVSSSTVSSVTK
TSGQQQVCVSQATVGTCKAATPTVVSATSLVPTPNPISGKATVSGLLKIHSSQSSPQQAVLTIPSQLKPLSVNTSGGVQTILMPVNKVVQ
SFSTSKPPAILPVAAPTPVVPSSAPAAVAKVKTEPETPGPSCLSQEGQTAVKTEESSELGNYVIKIDHLETIQQLLTAVVKKIPLITAKS
EDASCFSAKSVEQYYGWNIGKRRAAEWQRAMTMRKVLQEILEKNPRFHHLTPLKTKHIAHWCRCHGYTPPDPESLRNDGDSIEDVLTQID
SEPECPSSFSSADNLCRKLEDLQQFQKREPENEEEVDILSLSEPVKINIKKEQEEKQEEVKFYLPPTPGSEFIGDVTQKIGITLQPVALH

--------------------------------------------------------------
>20746_20746_5_CUX1-YEATS2_CUX1_chr7_101459373_ENST00000437600_YEATS2_chr3_183490093_ENST00000305135_length(transcript)=4779nt_BP=415nt
TGGCCGGCGCGACCGCGGACTTTGGGGGGGGTTGTCCCGAGGGGCGGGGGTCGCCGGTCGGCCCGCGGAGGGGAGGGGGAGGGCGGCGGT
GGGGGGGTTGGGGGAGCAGGGGCCGGGGCCGCGCGCGCGAGGCCGGGCCGCGGCCGCCGCGCCCTGGGGCTGCGACTCCGGCTCCGGCTC
GGGCTCGCCATGGCTGCGGCGGTGCCGTGAGGAGGAGTCCGCGTCCTCGGTGGCGGCGGCGGCGGCCGGCTCCAGGCCGGGTTTTGGCGC
CGCCCGCCTGCTGCCTCCTGGCGGCTCCTGAACTCCAGCCCCCTCTCTATCAGCCGCTCACTCCGTCTCAATATGTCTCAAGATGGCGGC
CAATGTGGGATCGATGTTTCAATATTGGAAGCGCTTTGATTTACAGCAGCTGCAGGTTGTCGGGGTACCAGTTGGGTCTGCTTTACCTTC
AACAGTGAAGCAGGCTGTGGCGATCAGTGGTGGCCAGATCCTGGTAGCCAAGGCCAGCTCTTCTGTCTCCAAAGCAGTTGGGCCAAAGCA
AGTTGTAACCCAAGGAGTTGCCAAAGCAATTGTGAGTGGAGGTGGAGGAACCATTGTTGCTCAGCCAGTGCAGACCTTAACCAAGGCCCA
GGTTACTGCCGCTGGTCCTCAGAAGAGTGGATCCCAGGGTTCAGTAATGGCAACGTTGCAGCTACCAGCCACTAATTTGGCCAACTTGGC
AAATTTGCCTCCTGGCACTAAACTCTACCTAACTACAAACAGCAAGAACCCTTCAGGAAAAGGAAAACTGCTGCTGATCCCTCAAGGAGC
CATCCTGCGAGCTACGAACAATGCTAATCTCCAGTCTGGCTCAGCTGCCAGTGGTGGGAGTGGTGCCGGAGGAGGAGGAGGAGGAGGAGG
AGGAGGCGGCAGTGGCAGCGGTGGAGGCGGCAGCACAGGAGGAGGAGGAGGAACAGCAGGAGGAGGAACTCAAAGTACTGCTGGCCCTGG
AGGGATATCTCAGCACCTGACTTACACATCTTACATCCTCAAGCAAACTCCCCAGGGCACATTTTTAGTTGGCCAGCCATCACCCCAGAC
TTCTGGAAAACAACTCACCACTGGGTCAGTGGTCCAAGGAACACTGGGAGTCAGCACATCTTCTGCACAAGGACAACAAACGCTAAAAGT
CATCTCTGGACAGAAAACCACATTGTTTACACAGGCAGCCCATGGAGGACAGGCATCTCTAATGAAAATATCCGATAGCACCTTGAAGAC
TGTGCCAGCCACCTCACAGCTCTCGAAGCCTGGAACCACAATGCTGAGAGTAGCAGGAGGGGTTATCACAACTGCCACTTCCCCTGCCGT
GGCCCTCTCAGCAAACGGTCCTGCACAACAGTCTGAAGGAATGGCTCCCGTGTCTTCATCTACGGTCAGTTCTGTAACGAAAACTTCTGG
GCAGCAGCAAGTGTGTGTGAGCCAGGCCACCGTGGGAACCTGCAAGGCTGCCACCCCCACCGTCGTCAGCGCCACGTCCCTCGTGCCTAC
ACCAAACCCCATCTCTGGGAAAGCCACAGTATCCGGACTGTTAAAGATTCACTCCAGTCAGTCCAGTCCGCAGCAGGCCGTCCTGACGAT
TCCCAGCCAGCTCAAACCACTCAGCGTAAACACATCTGGAGGGGTGCAGACGATCCTGATGCCTGTGAATAAAGTGGTTCAGTCATTTTC
TACCAGCAAGCCACCTGCCATTCTGCCTGTAGCTGCCCCAACTCCAGTTGTCCCCAGCTCTGCTCCAGCAGCTGTTGCAAAAGTGAAGAC
TGAACCAGAAACACCTGGACCGAGTTGCCTCTCTCAGGAGGGTCAGACAGCAGTGAAAACAGAAGAAAGTTCTGAGCTGGGAAACTATGT
CATTAAGATAGACCATTTAGAAACTATCCAGCAACTCCTAACTGCAGTAGTAAAGAAGATTCCATTAATCACTGCAAAAAGTGAAGATGC
CAGCTGCTTTTCTGCAAAGTCTGTGGAGCAGTACTATGGCTGGAACATTGGAAAAAGGAGAGCCGCTGAGTGGCAAAGAGCAATGACAAT
GCGAAAAGTCTTACAAGAAATCCTGGAGAAGAATCCGAGATTTCACCACCTGACTCCCCTCAAAACCAAGCACATCGCTCACTGGTGCCG
CTGTCATGGCTACACCCCACCGGACCCTGAGAGCCTGAGGAATGACGGGGACTCCATCGAGGACGTGCTGACCCAGATCGACAGCGAGCC
CGAGTGCCCATCATCATTCTCCTCTGCTGACAACCTCTGCCGCAAACTGGAGGACCTGCAACAGTTCCAGAAAAGGGAACCCGAGAATGA
GGAGGAGGTGGACATCCTCAGCCTCTCCGAGCCAGTGAAGATAAACATCAAGAAGGAGCAGGAAGAGAAACAAGAGGAAGTCAAGTTCTA
CCTGCCACCAACCCCAGGGTCTGAATTTATTGGGGATGTCACACAGAAGATTGGGATCACCCTGCAGCCCGTGGCACTCCACAGGAACGT
GTATGCGTCCGTGGTGGAGGACATGATCCTGAAGGCTACAGAACAGCTGGTGAATGATATCCTGAGACAGGCTTTGGCAGTTGGATACCA
GACAGCTTCTCACAACAGGATTCCCAAAGAAATTACAGTGAGTAATATTCACCAGGCCATTTGCAACATTCCTTTTCTGGACTTCCTCAC
AAACAAACACATGGGAATATTGAATGAGGACCAGTGAGCGGAGTGAGGTGCCCTGGAGAAGCAGGCTTTGAAGGCACAGCGAAGCTGTAA
CTGAGGACCCTGCTGCTCGGGAAGGAGGTGGTTTCCAGTGTGACTCGGCATGTCATGGCTACCCAACCTTTGCCGCTGCCTGTTCCCACG
TGTCACCAGCACGCTGCACTCCAGATGAAATCCTCCTAGGACAGGAGTTTGTTTCCTGAGTGTGGAGTGAGGCTGTCAGTGGATCCGTGC
TTTGTCGGCCAGCGTTTCTGCAGTCTTTGTAAAGGCCCCACGAGAGCGGGCCAGGCCGTGTGCCTCAGGCCCTTCTCCCTGGGTGTGCTT
AAGGGGGCTCCTTGGGCCCGCCTCCCCAGGAGGTAGAAAATGAGTGGCAGGCTAGAGATTTCACCCATTTTGTGGGCTGGAGTTACCAGT
AGCTCCAGCAGTTACCCTGAAGAGAGATTGGGCTTCAGCCTTCAGCAGGTGGTTCTCTCCCATGCCTGGCCTTGGTGTGGAGGGGCTGTA
CTCTGAGCCCAAGTGAGTCAGCTATAGGAAGAGGCCATACCTAGAGCCAAGAACCATGAAGGCCTGAGAGACGGCAGACTGAGCAGAATT
CCTTTTTTGAGCACGAGAGCATTACTAGAACCATTGTCAAAGCAGTGGCAAGGGACGGAGAGGTCCCAACAGGAGTCAGGAAGAGGTTTG
ATTATAACCAAGAAAACTCACTATGCTAGGAATAGACTGTGTGCACCAGTCCCAGACACTTGGCAGAAGTGTAGCAGCGTTACACATGTG
TGCGAAGCAGATCGCAGGTTCCACGCCATCTGCATGGCCTGCAGGAGCTTCTGCTGCTGACCCCATGCTGAGTGGCCAGTGGGGAGCGGC
GCCCGGCAGGCTCTTCTGGGGTCGTCTGTCCTATCCGTGGATTGTATATACTCTTCTCTGTTAAGGAGTTTTTCCCAAGAAGAAAAGTAT
TTAAAAGAAATACCAGTGAGTGCCTTAAAGTTGGAGAAGTAACTGCCCATGCCCAGAAATAAGGATGCCAGTGCCCAGAAGCAGTGAGAT
TAGTCTGTGTCCACAAGCAGAGGCCCCCTCGATGGGAGGGAGTGGCAGGCAGGAGAAGGTGGCGCTGCCAGGTGCCCGGGTCTATTGGAG
GCGCCCCATCTCAGACTTCCTAACACAGCCTGTGTGGAAGGCAGAACAAAGAATGCATGCCCAGTCAGAAATCTGTTCTATTCTGCTCCA
GGAAAATCGGAAACCTGTGAGTCAGAGTCAGAGAAACTTACCCAAGCAACGTAATTCCTGTTTTCATGGGTCCTGTAGATGTTTGAGTCA
GGAAGGTAAGGCGGGGAGTGACTGAATAAACTCTGCCTTTTAAATTGAGCATCTGGGCCGGGCATGGTGGCTCACGCCTGTAATCCCAGC
ACTCTGGGAGGTCGAGGTGGGTGGGTCACCTGAGGTTGGGAGTTCGAGACCAGCCCGACCAACATGGTGAAACCCCGTCTCTACTAAAAA
TACAGAAAATTAGCTGGGCATGGTGGTGTGTGCCTGTAATTCCAGCTACTCGGGAGGCTGAGGCAGGAAGAATCACTTGAACCCAGGAGG
CGGAGGTTGCAGTGTGCCAAGATCATACCACTGCACTCCAGCCCTGGTGACAGAGGAGACCCCGTCTCAAAAATTGATTGATCAATTCAG
CATCTGAGGGCTGCAAGTACAGAAGGAATCTATTCTCAGCAGGGCATAGGGCACGCACTGGCTTAACAGTTTAGTATATAAGGCTCAAAT
AGTCTATACCTGAACTGCTATAAGCAAGGTCGATAGGGAAGTGGATAGATTGCTTCAGCAAAGTGAACTGTGAGATCTCCAGGACAGAGG
GAGAAAGATCTGATCCAAATGAGAACAGATTGGTTATTGCAGGTATCACAGCCTAAAGAAATTATCTTTTTGCAAAAGAAATATTAAATG
ATTTAGCAGTCTCCACGTGTGTTAATGTTTCAAACGTGTATCATAATGTGTATAATTGTGTAACAAAATTGTCTACAATAAATCTTTTGG

>20746_20746_5_CUX1-YEATS2_CUX1_chr7_101459373_ENST00000437600_YEATS2_chr3_183490093_ENST00000305135_length(amino acids)=794AA_BP=111
MAANVGSMFQYWKRFDLQQLQVVGVPVGSALPSTVKQAVAISGGQILVAKASSSVSKAVGPKQVVTQGVAKAIVSGGGGTIVAQPVQTLT
KAQVTAAGPQKSGSQGSVMATLQLPATNLANLANLPPGTKLYLTTNSKNPSGKGKLLLIPQGAILRATNNANLQSGSAASGGSGAGGGGG
GGGGGGSGSGGGGSTGGGGGTAGGGTQSTAGPGGISQHLTYTSYILKQTPQGTFLVGQPSPQTSGKQLTTGSVVQGTLGVSTSSAQGQQT
LKVISGQKTTLFTQAAHGGQASLMKISDSTLKTVPATSQLSKPGTTMLRVAGGVITTATSPAVALSANGPAQQSEGMAPVSSSTVSSVTK
TSGQQQVCVSQATVGTCKAATPTVVSATSLVPTPNPISGKATVSGLLKIHSSQSSPQQAVLTIPSQLKPLSVNTSGGVQTILMPVNKVVQ
SFSTSKPPAILPVAAPTPVVPSSAPAAVAKVKTEPETPGPSCLSQEGQTAVKTEESSELGNYVIKIDHLETIQQLLTAVVKKIPLITAKS
EDASCFSAKSVEQYYGWNIGKRRAAEWQRAMTMRKVLQEILEKNPRFHHLTPLKTKHIAHWCRCHGYTPPDPESLRNDGDSIEDVLTQID
SEPECPSSFSSADNLCRKLEDLQQFQKREPENEEEVDILSLSEPVKINIKKEQEEKQEEVKFYLPPTPGSEFIGDVTQKIGITLQPVALH

--------------------------------------------------------------
>20746_20746_6_CUX1-YEATS2_CUX1_chr7_101459373_ENST00000547394_YEATS2_chr3_183490093_ENST00000305135_length(transcript)=4449nt_BP=85nt
CTCCGTCTCAATATGTCTCAAGATGGCGGCCAATGTGGGATCGATGTTTCAATATTGGAAGCGCTTTGATTTACAGCAGCTGCAGGTTGT
CGGGGTACCAGTTGGGTCTGCTTTACCTTCAACAGTGAAGCAGGCTGTGGCGATCAGTGGTGGCCAGATCCTGGTAGCCAAGGCCAGCTC
TTCTGTCTCCAAAGCAGTTGGGCCAAAGCAAGTTGTAACCCAAGGAGTTGCCAAAGCAATTGTGAGTGGAGGTGGAGGAACCATTGTTGC
TCAGCCAGTGCAGACCTTAACCAAGGCCCAGGTTACTGCCGCTGGTCCTCAGAAGAGTGGATCCCAGGGTTCAGTAATGGCAACGTTGCA
GCTACCAGCCACTAATTTGGCCAACTTGGCAAATTTGCCTCCTGGCACTAAACTCTACCTAACTACAAACAGCAAGAACCCTTCAGGAAA
AGGAAAACTGCTGCTGATCCCTCAAGGAGCCATCCTGCGAGCTACGAACAATGCTAATCTCCAGTCTGGCTCAGCTGCCAGTGGTGGGAG
TGGTGCCGGAGGAGGAGGAGGAGGAGGAGGAGGAGGCGGCAGTGGCAGCGGTGGAGGCGGCAGCACAGGAGGAGGAGGAGGAACAGCAGG
AGGAGGAACTCAAAGTACTGCTGGCCCTGGAGGGATATCTCAGCACCTGACTTACACATCTTACATCCTCAAGCAAACTCCCCAGGGCAC
ATTTTTAGTTGGCCAGCCATCACCCCAGACTTCTGGAAAACAACTCACCACTGGGTCAGTGGTCCAAGGAACACTGGGAGTCAGCACATC
TTCTGCACAAGGACAACAAACGCTAAAAGTCATCTCTGGACAGAAAACCACATTGTTTACACAGGCAGCCCATGGAGGACAGGCATCTCT
AATGAAAATATCCGATAGCACCTTGAAGACTGTGCCAGCCACCTCACAGCTCTCGAAGCCTGGAACCACAATGCTGAGAGTAGCAGGAGG
GGTTATCACAACTGCCACTTCCCCTGCCGTGGCCCTCTCAGCAAACGGTCCTGCACAACAGTCTGAAGGAATGGCTCCCGTGTCTTCATC
TACGGTCAGTTCTGTAACGAAAACTTCTGGGCAGCAGCAAGTGTGTGTGAGCCAGGCCACCGTGGGAACCTGCAAGGCTGCCACCCCCAC
CGTCGTCAGCGCCACGTCCCTCGTGCCTACACCAAACCCCATCTCTGGGAAAGCCACAGTATCCGGACTGTTAAAGATTCACTCCAGTCA
GTCCAGTCCGCAGCAGGCCGTCCTGACGATTCCCAGCCAGCTCAAACCACTCAGCGTAAACACATCTGGAGGGGTGCAGACGATCCTGAT
GCCTGTGAATAAAGTGGTTCAGTCATTTTCTACCAGCAAGCCACCTGCCATTCTGCCTGTAGCTGCCCCAACTCCAGTTGTCCCCAGCTC
TGCTCCAGCAGCTGTTGCAAAAGTGAAGACTGAACCAGAAACACCTGGACCGAGTTGCCTCTCTCAGGAGGGTCAGACAGCAGTGAAAAC
AGAAGAAAGTTCTGAGCTGGGAAACTATGTCATTAAGATAGACCATTTAGAAACTATCCAGCAACTCCTAACTGCAGTAGTAAAGAAGAT
TCCATTAATCACTGCAAAAAGTGAAGATGCCAGCTGCTTTTCTGCAAAGTCTGTGGAGCAGTACTATGGCTGGAACATTGGAAAAAGGAG
AGCCGCTGAGTGGCAAAGAGCAATGACAATGCGAAAAGTCTTACAAGAAATCCTGGAGAAGAATCCGAGATTTCACCACCTGACTCCCCT
CAAAACCAAGCACATCGCTCACTGGTGCCGCTGTCATGGCTACACCCCACCGGACCCTGAGAGCCTGAGGAATGACGGGGACTCCATCGA
GGACGTGCTGACCCAGATCGACAGCGAGCCCGAGTGCCCATCATCATTCTCCTCTGCTGACAACCTCTGCCGCAAACTGGAGGACCTGCA
ACAGTTCCAGAAAAGGGAACCCGAGAATGAGGAGGAGGTGGACATCCTCAGCCTCTCCGAGCCAGTGAAGATAAACATCAAGAAGGAGCA
GGAAGAGAAACAAGAGGAAGTCAAGTTCTACCTGCCACCAACCCCAGGGTCTGAATTTATTGGGGATGTCACACAGAAGATTGGGATCAC
CCTGCAGCCCGTGGCACTCCACAGGAACGTGTATGCGTCCGTGGTGGAGGACATGATCCTGAAGGCTACAGAACAGCTGGTGAATGATAT
CCTGAGACAGGCTTTGGCAGTTGGATACCAGACAGCTTCTCACAACAGGATTCCCAAAGAAATTACAGTGAGTAATATTCACCAGGCCAT
TTGCAACATTCCTTTTCTGGACTTCCTCACAAACAAACACATGGGAATATTGAATGAGGACCAGTGAGCGGAGTGAGGTGCCCTGGAGAA
GCAGGCTTTGAAGGCACAGCGAAGCTGTAACTGAGGACCCTGCTGCTCGGGAAGGAGGTGGTTTCCAGTGTGACTCGGCATGTCATGGCT
ACCCAACCTTTGCCGCTGCCTGTTCCCACGTGTCACCAGCACGCTGCACTCCAGATGAAATCCTCCTAGGACAGGAGTTTGTTTCCTGAG
TGTGGAGTGAGGCTGTCAGTGGATCCGTGCTTTGTCGGCCAGCGTTTCTGCAGTCTTTGTAAAGGCCCCACGAGAGCGGGCCAGGCCGTG
TGCCTCAGGCCCTTCTCCCTGGGTGTGCTTAAGGGGGCTCCTTGGGCCCGCCTCCCCAGGAGGTAGAAAATGAGTGGCAGGCTAGAGATT
TCACCCATTTTGTGGGCTGGAGTTACCAGTAGCTCCAGCAGTTACCCTGAAGAGAGATTGGGCTTCAGCCTTCAGCAGGTGGTTCTCTCC
CATGCCTGGCCTTGGTGTGGAGGGGCTGTACTCTGAGCCCAAGTGAGTCAGCTATAGGAAGAGGCCATACCTAGAGCCAAGAACCATGAA
GGCCTGAGAGACGGCAGACTGAGCAGAATTCCTTTTTTGAGCACGAGAGCATTACTAGAACCATTGTCAAAGCAGTGGCAAGGGACGGAG
AGGTCCCAACAGGAGTCAGGAAGAGGTTTGATTATAACCAAGAAAACTCACTATGCTAGGAATAGACTGTGTGCACCAGTCCCAGACACT
TGGCAGAAGTGTAGCAGCGTTACACATGTGTGCGAAGCAGATCGCAGGTTCCACGCCATCTGCATGGCCTGCAGGAGCTTCTGCTGCTGA
CCCCATGCTGAGTGGCCAGTGGGGAGCGGCGCCCGGCAGGCTCTTCTGGGGTCGTCTGTCCTATCCGTGGATTGTATATACTCTTCTCTG
TTAAGGAGTTTTTCCCAAGAAGAAAAGTATTTAAAAGAAATACCAGTGAGTGCCTTAAAGTTGGAGAAGTAACTGCCCATGCCCAGAAAT
AAGGATGCCAGTGCCCAGAAGCAGTGAGATTAGTCTGTGTCCACAAGCAGAGGCCCCCTCGATGGGAGGGAGTGGCAGGCAGGAGAAGGT
GGCGCTGCCAGGTGCCCGGGTCTATTGGAGGCGCCCCATCTCAGACTTCCTAACACAGCCTGTGTGGAAGGCAGAACAAAGAATGCATGC
CCAGTCAGAAATCTGTTCTATTCTGCTCCAGGAAAATCGGAAACCTGTGAGTCAGAGTCAGAGAAACTTACCCAAGCAACGTAATTCCTG
TTTTCATGGGTCCTGTAGATGTTTGAGTCAGGAAGGTAAGGCGGGGAGTGACTGAATAAACTCTGCCTTTTAAATTGAGCATCTGGGCCG
GGCATGGTGGCTCACGCCTGTAATCCCAGCACTCTGGGAGGTCGAGGTGGGTGGGTCACCTGAGGTTGGGAGTTCGAGACCAGCCCGACC
AACATGGTGAAACCCCGTCTCTACTAAAAATACAGAAAATTAGCTGGGCATGGTGGTGTGTGCCTGTAATTCCAGCTACTCGGGAGGCTG
AGGCAGGAAGAATCACTTGAACCCAGGAGGCGGAGGTTGCAGTGTGCCAAGATCATACCACTGCACTCCAGCCCTGGTGACAGAGGAGAC
CCCGTCTCAAAAATTGATTGATCAATTCAGCATCTGAGGGCTGCAAGTACAGAAGGAATCTATTCTCAGCAGGGCATAGGGCACGCACTG
GCTTAACAGTTTAGTATATAAGGCTCAAATAGTCTATACCTGAACTGCTATAAGCAAGGTCGATAGGGAAGTGGATAGATTGCTTCAGCA
AAGTGAACTGTGAGATCTCCAGGACAGAGGGAGAAAGATCTGATCCAAATGAGAACAGATTGGTTATTGCAGGTATCACAGCCTAAAGAA
ATTATCTTTTTGCAAAAGAAATATTAAATGATTTAGCAGTCTCCACGTGTGTTAATGTTTCAAACGTGTATCATAATGTGTATAATTGTG

>20746_20746_6_CUX1-YEATS2_CUX1_chr7_101459373_ENST00000547394_YEATS2_chr3_183490093_ENST00000305135_length(amino acids)=794AA_BP=111
MAANVGSMFQYWKRFDLQQLQVVGVPVGSALPSTVKQAVAISGGQILVAKASSSVSKAVGPKQVVTQGVAKAIVSGGGGTIVAQPVQTLT
KAQVTAAGPQKSGSQGSVMATLQLPATNLANLANLPPGTKLYLTTNSKNPSGKGKLLLIPQGAILRATNNANLQSGSAASGGSGAGGGGG
GGGGGGSGSGGGGSTGGGGGTAGGGTQSTAGPGGISQHLTYTSYILKQTPQGTFLVGQPSPQTSGKQLTTGSVVQGTLGVSTSSAQGQQT
LKVISGQKTTLFTQAAHGGQASLMKISDSTLKTVPATSQLSKPGTTMLRVAGGVITTATSPAVALSANGPAQQSEGMAPVSSSTVSSVTK
TSGQQQVCVSQATVGTCKAATPTVVSATSLVPTPNPISGKATVSGLLKIHSSQSSPQQAVLTIPSQLKPLSVNTSGGVQTILMPVNKVVQ
SFSTSKPPAILPVAAPTPVVPSSAPAAVAKVKTEPETPGPSCLSQEGQTAVKTEESSELGNYVIKIDHLETIQQLLTAVVKKIPLITAKS
EDASCFSAKSVEQYYGWNIGKRRAAEWQRAMTMRKVLQEILEKNPRFHHLTPLKTKHIAHWCRCHGYTPPDPESLRNDGDSIEDVLTQID
SEPECPSSFSSADNLCRKLEDLQQFQKREPENEEEVDILSLSEPVKINIKKEQEEKQEEVKFYLPPTPGSEFIGDVTQKIGITLQPVALH

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for CUX1-YEATS2


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for CUX1-YEATS2


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for CUX1-YEATS2


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource