FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:CRLS1-NFX1 (FusionGDB2 ID:19520)

Fusion Gene Summary for CRLS1-NFX1

check button Fusion gene summary
Fusion gene informationFusion gene name: CRLS1-NFX1
Fusion gene ID: 19520
HgeneTgene
Gene symbol

CRLS1

NFX1

Gene ID

54675

4799

Gene namecardiolipin synthase 1nuclear transcription factor, X-box binding 1
SynonymsC20orf155|CLS|CLS1|GCD10|dJ967N21.6NFX2|TEG-42|Tex42
Cytomap

20p12.3

9p13.3

Type of geneprotein-codingprotein-coding
Descriptioncardiolipin synthase (CMP-forming)transcriptional repressor NF-X1nuclear transcription factor, X box-binding protein 1
Modification date2020031320200313
UniProtAcc

Q9UJA2

ZNFX1

Ensembl transtripts involved in fusion geneENST00000378863, ENST00000378868, 
ENST00000452938, ENST00000464921, 
ENST00000463421, ENST00000318524, 
ENST00000379521, ENST00000379540, 
Fusion gene scores* DoF score8 X 8 X 5=32014 X 12 X 9=1512
# samples 815
** MAII scorelog2(8/320*10)=-2
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(15/1512*10)=-3.33342373372519
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: CRLS1 [Title/Abstract] AND NFX1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointCRLS1(5996136)-NFX1(33328579), # samples:3
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
TgeneNFX1

GO:0000122

negative regulation of transcription by RNA polymerase II

7964459

TgeneNFX1

GO:0051865

protein autoubiquitination

10500182


check buttonFusion gene breakpoints across CRLS1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across NFX1 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4KIRCTCGA-BP-5199-01ACRLS1chr20

5996136

-NFX1chr9

33328579

+
ChimerDB4KIRCTCGA-BP-5199-01ACRLS1chr20

5996136

+NFX1chr9

33328579

+


Top

Fusion Gene ORF analysis for CRLS1-NFX1

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000378863ENST00000463421CRLS1chr20

5996136

+NFX1chr9

33328579

+
5CDS-intronENST00000378868ENST00000463421CRLS1chr20

5996136

+NFX1chr9

33328579

+
5CDS-intronENST00000452938ENST00000463421CRLS1chr20

5996136

+NFX1chr9

33328579

+
In-frameENST00000378863ENST00000318524CRLS1chr20

5996136

+NFX1chr9

33328579

+
In-frameENST00000378863ENST00000379521CRLS1chr20

5996136

+NFX1chr9

33328579

+
In-frameENST00000378863ENST00000379540CRLS1chr20

5996136

+NFX1chr9

33328579

+
In-frameENST00000378868ENST00000318524CRLS1chr20

5996136

+NFX1chr9

33328579

+
In-frameENST00000378868ENST00000379521CRLS1chr20

5996136

+NFX1chr9

33328579

+
In-frameENST00000378868ENST00000379540CRLS1chr20

5996136

+NFX1chr9

33328579

+
In-frameENST00000452938ENST00000318524CRLS1chr20

5996136

+NFX1chr9

33328579

+
In-frameENST00000452938ENST00000379521CRLS1chr20

5996136

+NFX1chr9

33328579

+
In-frameENST00000452938ENST00000379540CRLS1chr20

5996136

+NFX1chr9

33328579

+
intron-3CDSENST00000464921ENST00000318524CRLS1chr20

5996136

+NFX1chr9

33328579

+
intron-3CDSENST00000464921ENST00000379521CRLS1chr20

5996136

+NFX1chr9

33328579

+
intron-3CDSENST00000464921ENST00000379540CRLS1chr20

5996136

+NFX1chr9

33328579

+
intron-intronENST00000464921ENST00000463421CRLS1chr20

5996136

+NFX1chr9

33328579

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000378863CRLS1chr205996136+ENST00000379540NFX1chr933328579+33677311572187676
ENST00000378863CRLS1chr205996136+ENST00000379521NFX1chr933328579+23737311571899580
ENST00000378863CRLS1chr205996136+ENST00000318524NFX1chr933328579+23107311571326389
ENST00000452938CRLS1chr205996136+ENST00000379540NFX1chr933328579+3307671972127676
ENST00000452938CRLS1chr205996136+ENST00000379521NFX1chr933328579+2313671971839580
ENST00000452938CRLS1chr205996136+ENST00000318524NFX1chr933328579+2250671971266389
ENST00000378868CRLS1chr205996136+ENST00000379540NFX1chr933328579+30634271501883577
ENST00000378868CRLS1chr205996136+ENST00000379521NFX1chr933328579+20694271501595481
ENST00000378868CRLS1chr205996136+ENST00000318524NFX1chr933328579+20064271501022290

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000378863ENST00000379540CRLS1chr205996136+NFX1chr933328579+0.0077193780.9922806
ENST00000378863ENST00000379521CRLS1chr205996136+NFX1chr933328579+0.0137431090.9862569
ENST00000378863ENST00000318524CRLS1chr205996136+NFX1chr933328579+0.077488670.92251134
ENST00000452938ENST00000379540CRLS1chr205996136+NFX1chr933328579+0.0088853960.9911146
ENST00000452938ENST00000379521CRLS1chr205996136+NFX1chr933328579+0.0174788070.98252124
ENST00000452938ENST00000318524CRLS1chr205996136+NFX1chr933328579+0.083883780.91611624
ENST00000378868ENST00000379540CRLS1chr205996136+NFX1chr933328579+0.0032394890.99676055
ENST00000378868ENST00000379521CRLS1chr205996136+NFX1chr933328579+0.0037700570.99622995
ENST00000378868ENST00000318524CRLS1chr205996136+NFX1chr933328579+0.0069043390.9930957

Top

Fusion Genomic Features for CRLS1-NFX1


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
CRLS1chr205996136+NFX1chr933328578+1.60E-060.99999845
CRLS1chr205996136+NFX1chr933328578+1.60E-060.99999845

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.
genomic feature of top 1%

Top

Fusion Protein Features for CRLS1-NFX1


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr20:5996136/chr9:33328579)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
CRLS1

Q9UJA2

NFX1

ZNFX1

FUNCTION: Catalyzes the synthesis of cardiolipin (CL) (diphosphatidylglycerol) by specifically transferring a phosphatidyl group from CDP-diacylglycerol to phosphatidylglycerol (PG). CL is a key phospholipid in mitochondrial membranes and plays important roles in maintaining the functional integrity and dynamics of mitochondria under both optimal and stress conditions. {ECO:0000269|PubMed:16547353, ECO:0000269|PubMed:16678169}.1918

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneCRLS1chr20:5996136chr9:33328579ENST00000378863+37109_129191302.0TransmembraneHelical
HgeneCRLS1chr20:5996136chr9:33328579ENST00000378863+37133_153191302.0TransmembraneHelical
TgeneNFX1chr20:5996136chr9:33328579ENST000003185248161084_1089635834.0Compositional biasNote=Poly-Pro
TgeneNFX1chr20:5996136chr9:33328579ENST000003795218211084_10896351025.0Compositional biasNote=Poly-Pro
TgeneNFX1chr20:5996136chr9:33328579ENST000003795408241084_10896351121.0Compositional biasNote=Poly-Pro
TgeneNFX1chr20:5996136chr9:33328579ENST00000318524816994_1062635834.0DomainR3H
TgeneNFX1chr20:5996136chr9:33328579ENST00000379521821994_10626351025.0DomainR3H
TgeneNFX1chr20:5996136chr9:33328579ENST00000379540824994_10626351121.0DomainR3H
TgeneNFX1chr20:5996136chr9:33328579ENST00000318524816694_713635834.0Zinc fingerNote=NF-X1-type 5
TgeneNFX1chr20:5996136chr9:33328579ENST00000318524816721_740635834.0Zinc fingerNote=NF-X1-type 6
TgeneNFX1chr20:5996136chr9:33328579ENST00000318524816832_854635834.0Zinc fingerNote=NF-X1-type 7
TgeneNFX1chr20:5996136chr9:33328579ENST00000318524816863_884635834.0Zinc fingerNote=NF-X1-type 8
TgeneNFX1chr20:5996136chr9:33328579ENST00000379521821694_7136351025.0Zinc fingerNote=NF-X1-type 5
TgeneNFX1chr20:5996136chr9:33328579ENST00000379521821721_7406351025.0Zinc fingerNote=NF-X1-type 6
TgeneNFX1chr20:5996136chr9:33328579ENST00000379521821832_8546351025.0Zinc fingerNote=NF-X1-type 7
TgeneNFX1chr20:5996136chr9:33328579ENST00000379521821863_8846351025.0Zinc fingerNote=NF-X1-type 8
TgeneNFX1chr20:5996136chr9:33328579ENST00000379540824694_7136351121.0Zinc fingerNote=NF-X1-type 5
TgeneNFX1chr20:5996136chr9:33328579ENST00000379540824721_7406351121.0Zinc fingerNote=NF-X1-type 6
TgeneNFX1chr20:5996136chr9:33328579ENST00000379540824832_8546351121.0Zinc fingerNote=NF-X1-type 7
TgeneNFX1chr20:5996136chr9:33328579ENST00000379540824863_8846351121.0Zinc fingerNote=NF-X1-type 8

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneCRLS1chr20:5996136chr9:33328579ENST00000378863+37190_212191302.0TransmembraneHelical
HgeneCRLS1chr20:5996136chr9:33328579ENST00000378863+37250_270191302.0TransmembraneHelical
HgeneCRLS1chr20:5996136chr9:33328579ENST00000378863+37272_292191302.0TransmembraneHelical
HgeneCRLS1chr20:5996136chr9:33328579ENST00000378868+37109_12992203.0TransmembraneHelical
HgeneCRLS1chr20:5996136chr9:33328579ENST00000378868+37133_15392203.0TransmembraneHelical
HgeneCRLS1chr20:5996136chr9:33328579ENST00000378868+37190_21292203.0TransmembraneHelical
HgeneCRLS1chr20:5996136chr9:33328579ENST00000378868+37250_27092203.0TransmembraneHelical
HgeneCRLS1chr20:5996136chr9:33328579ENST00000378868+37272_29292203.0TransmembraneHelical
TgeneNFX1chr20:5996136chr9:33328579ENST00000318524816358_409635834.0Zinc fingerRING-type%3B atypical
TgeneNFX1chr20:5996136chr9:33328579ENST00000318524816453_471635834.0Zinc fingerNote=NF-X1-type 1
TgeneNFX1chr20:5996136chr9:33328579ENST00000318524816506_525635834.0Zinc fingerNote=NF-X1-type 2
TgeneNFX1chr20:5996136chr9:33328579ENST00000318524816567_586635834.0Zinc fingerNote=NF-X1-type 3
TgeneNFX1chr20:5996136chr9:33328579ENST00000318524816632_655635834.0Zinc fingerNote=NF-X1-type 4
TgeneNFX1chr20:5996136chr9:33328579ENST00000379521821358_4096351025.0Zinc fingerRING-type%3B atypical
TgeneNFX1chr20:5996136chr9:33328579ENST00000379521821453_4716351025.0Zinc fingerNote=NF-X1-type 1
TgeneNFX1chr20:5996136chr9:33328579ENST00000379521821506_5256351025.0Zinc fingerNote=NF-X1-type 2
TgeneNFX1chr20:5996136chr9:33328579ENST00000379521821567_5866351025.0Zinc fingerNote=NF-X1-type 3
TgeneNFX1chr20:5996136chr9:33328579ENST00000379521821632_6556351025.0Zinc fingerNote=NF-X1-type 4
TgeneNFX1chr20:5996136chr9:33328579ENST00000379540824358_4096351121.0Zinc fingerRING-type%3B atypical
TgeneNFX1chr20:5996136chr9:33328579ENST00000379540824453_4716351121.0Zinc fingerNote=NF-X1-type 1
TgeneNFX1chr20:5996136chr9:33328579ENST00000379540824506_5256351121.0Zinc fingerNote=NF-X1-type 2
TgeneNFX1chr20:5996136chr9:33328579ENST00000379540824567_5866351121.0Zinc fingerNote=NF-X1-type 3
TgeneNFX1chr20:5996136chr9:33328579ENST00000379540824632_6556351121.0Zinc fingerNote=NF-X1-type 4


Top

Fusion Gene Sequence for CRLS1-NFX1


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>19520_19520_1_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000378863_NFX1_chr9_33328579_ENST00000318524_length(transcript)=2310nt_BP=731nt
TGTATAAGTGGAGTGTGCTGGGGTGTGTAAAGTAGTATGGAGGCAGCGGTAGCCCAGTGTCTGAGTGGTTGCCGGGTCTCCATGGAGAAG
CGGCTCGCCAGTGTCCCAGGCTGCTGAGCTCTCGCCGCCCGAGACCCCGCGGCGCGGCCGCAGGGCCATGCTAGCCTTGCGCGTGGCGCG
CGGCTCGTGGGGGGCCCTGCGCGGCGCCGCTTGGGCTCCGGGAACGCGGCCGAGTAAGCGACGCGCCTGCTGGGCCCTGCTGCCGCCCGT
GCCCTGCTGCTTGGGCTGCCTGGCCGAACGCTGGAGGCTGCGTCCGGCCGCTCTTGGCTTGCGGCTGCCCGGGATCGGCCAGCGGAACCA
CTGTTCGGGCGCGGGGAAGGCGGCTCCCAGGCCAGCGGCCGGAGCGGGCGCCGCTGCCGAAGCCCCGGGCGGCCAGTGGGGCCCGGCGAG
CACCCCCAGCCTGTATGAAAACCCATGGACAATCCCGAATATGTTGTCAATGACGAGAATTGGCTTGGCCCCAGTTCTGGGCTATTTGAT
TATTGAAGAAGATTTTAATATTGCACTAGGAGTTTTTGCTTTAGCTGGACTAACAGATTTGTTGGATGGATTTATTGCTCGAAACTGGGC
CAATCAAAGATCAGCTTTGGGAAGTGCTCTTGATCCACTTGCTGATAAAATACTTATCAGTATCTTATATGTTAGCTTGACCTATGCAGA
TCTTATTCCAGATTTCATTCATACCTGTGAAAAGCTCTGCCATGAAGGAGACTGTGGACCATGCTCTCGCACATCAGTTATTTCCTGCAG
ATGCTCTTTCAGAACAAAGGAGCTTCCATGTACCAGTCTCAAAAGTGAAGATGCTACATTTATGTGTGACAAGCGGTGTAACAAGAAACG
GTTGTGTGGACGGCATAAATGTAATGAGATATGCTGTGTGGATAAGGAGCACAAGTGTCCTTTGATTTGTGGGAGGAAACTCCGTTGTGG
CCTTCATAGGTGTGAAGAACCTTGTCATCGTGGAAACTGCCAGACATGCTGGCAAGCCAGTTTTGATGAATTAACCTGCCATTGTGGTGC
ATCAGTGATTTACCCTCCAGTTCCCTGTGGTACTAGGCCCCCTGAATGTACCCAAACCTGCGCTAGAGTCCATGAGTGTGACCATCCAGT
ATATCATTCTTGTCATAGTGAGGAGAAGTGTCCCCCTTGCACTTTCCTAACTCAGAAGTGGTGCATGGGCAAGCATGAGCAGTCCCACTA
CTGGGCGTCTACCCAGAAGAAAAGAAGTCATTATATGAAAAAGATACCTGCACACGCATGTTTATAGCAGCACAATTCACAATTGCAAAA
ATGTGGAACCAGCCCAACTGCCCATCAGTCAACAAGTGGATAAAGAAATTGTGGTGTATCTATACGTACCATGGAATACTACTCAGCCAG
GAACGAAATAATGGCATTCACAGCAACCTGGATGGATTTGGAGACCATTATTCTAAGTGAAGTAACTCAGGAATGGAAAACCAAACATCG
TATGTTCTCAATTATAAGAGGGAGCTAAGCTATGAGGACGCAAAGGCATGAGAATGATACAATGGACTTTGGGGACTCTGGGGAAAGGAC
GGGAGGCGGGTGAGGGATAAAAAACTACACACTGGGTCCGAGCACGGTGGCTCATGCCTATAATCCCAGCACTTTGGGAGGCCAAGGCAG
GTAGATTATGAGGTCGGGAGTTCGAAACCAGCCTGGCCAACAAGGTGAGACCCCCCCCATCTCTACTAAAATAATTAGCCGGGCCTGGTG
GCGCATCCCTGTAATCCCAGCTACTTTACTTGGGAGGCTGAGGCAGGAGAATTGCTTGAACTCGGGAGGCGGATGTTGCAGTGAGCCAAT
ATCGCACCACTGCACTCCAGCCTGGGCAACAGAGCAAGACTCCGTCTCGAAAAAAAAAATAAATTTTAAAAACTACACATTGGGTACAAT
GTACACTCCTCGGGTGATGGGTGCACCAAAATCTCAAAAATCACTACTAAGGAACTTATCCATGTAACCAAACACCACCTGTTCCCCAAG
AACTATTGAAATAAGAAAAAAGAAAAAAAAAAAATCACTGATTGCAGGTCACCCTAACAGATATAATAATGAAAAAGTCTGAGATGTTGC
GAGAATTACCAAAATGTGATGCAGACACGAAATGAGCATGTGCTGTTGGAAAAGTGGAACAGATAGACTTGCTGGAGACAGGGTTGCCAC

>19520_19520_1_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000378863_NFX1_chr9_33328579_ENST00000318524_length(amino acids)=389AA_BP=189
MLALRVARGSWGALRGAAWAPGTRPSKRRACWALLPPVPCCLGCLAERWRLRPAALGLRLPGIGQRNHCSGAGKAAPRPAAGAGAAAEAP
GGQWGPASTPSLYENPWTIPNMLSMTRIGLAPVLGYLIIEEDFNIALGVFALAGLTDLLDGFIARNWANQRSALGSALDPLADKILISIL
YVSLTYADLIPDFIHTCEKLCHEGDCGPCSRTSVISCRCSFRTKELPCTSLKSEDATFMCDKRCNKKRLCGRHKCNEICCVDKEHKCPLI
CGRKLRCGLHRCEEPCHRGNCQTCWQASFDELTCHCGASVIYPPVPCGTRPPECTQTCARVHECDHPVYHSCHSEEKCPPCTFLTQKWCM

--------------------------------------------------------------
>19520_19520_2_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000378863_NFX1_chr9_33328579_ENST00000379521_length(transcript)=2373nt_BP=731nt
TGTATAAGTGGAGTGTGCTGGGGTGTGTAAAGTAGTATGGAGGCAGCGGTAGCCCAGTGTCTGAGTGGTTGCCGGGTCTCCATGGAGAAG
CGGCTCGCCAGTGTCCCAGGCTGCTGAGCTCTCGCCGCCCGAGACCCCGCGGCGCGGCCGCAGGGCCATGCTAGCCTTGCGCGTGGCGCG
CGGCTCGTGGGGGGCCCTGCGCGGCGCCGCTTGGGCTCCGGGAACGCGGCCGAGTAAGCGACGCGCCTGCTGGGCCCTGCTGCCGCCCGT
GCCCTGCTGCTTGGGCTGCCTGGCCGAACGCTGGAGGCTGCGTCCGGCCGCTCTTGGCTTGCGGCTGCCCGGGATCGGCCAGCGGAACCA
CTGTTCGGGCGCGGGGAAGGCGGCTCCCAGGCCAGCGGCCGGAGCGGGCGCCGCTGCCGAAGCCCCGGGCGGCCAGTGGGGCCCGGCGAG
CACCCCCAGCCTGTATGAAAACCCATGGACAATCCCGAATATGTTGTCAATGACGAGAATTGGCTTGGCCCCAGTTCTGGGCTATTTGAT
TATTGAAGAAGATTTTAATATTGCACTAGGAGTTTTTGCTTTAGCTGGACTAACAGATTTGTTGGATGGATTTATTGCTCGAAACTGGGC
CAATCAAAGATCAGCTTTGGGAAGTGCTCTTGATCCACTTGCTGATAAAATACTTATCAGTATCTTATATGTTAGCTTGACCTATGCAGA
TCTTATTCCAGATTTCATTCATACCTGTGAAAAGCTCTGCCATGAAGGAGACTGTGGACCATGCTCTCGCACATCAGTTATTTCCTGCAG
ATGCTCTTTCAGAACAAAGGAGCTTCCATGTACCAGTCTCAAAAGTGAAGATGCTACATTTATGTGTGACAAGCGGTGTAACAAGAAACG
GTTGTGTGGACGGCATAAATGTAATGAGATATGCTGTGTGGATAAGGAGCACAAGTGTCCTTTGATTTGTGGGAGGAAACTCCGTTGTGG
CCTTCATAGGTGTGAAGAACCTTGTCATCGTGGAAACTGCCAGACATGCTGGCAAGCCAGTTTTGATGAATTAACCTGCCATTGTGGTGC
ATCAGTGATTTACCCTCCAGTTCCCTGTGGTACTAGGCCCCCTGAATGTACCCAAACCTGCGCTAGAGTCCATGAGTGTGACCATCCAGT
ATATCATTCTTGTCATAGTGAGGAGAAGTGTCCCCCTTGCACTTTCCTAACTCAGAAGTGGTGCATGGGCAAGCATGAGTTTCGGAGCAA
CATCCCCTGTCACCTGGTTGATATCTCTTGCGGATTACCCTGCAGTGCCACGCTACCATGTGGGATGCACAAATGTCAGAGACTCTGTCA
CAAAGGGGAGTGTCTTGTGGATGAGCCCTGCAAGCAGCCCTGCACCACCCCCAGAGCTGACTGTGGTCACCCGTGTATGGCACCCTGCCA
TACCAGCTCACCCTGCCCTGTGACTGCTTGTAAAGCTAAGGTAGAGCTACAGTGTGAATGTGGACGAAGAAAAGAGATGGTGATTTGCTC
TGAAGCATCTAGTACTTATCAAAGAATAGCTGCAATCTCCATGGCCTCTAAGATAACAGACATGCAGCTTGGAGGTTCAGTGGAGATCAG
CAAGTTAATTACCAAAAAGGAAGTTCATCAAGCCAGGCTGGAGTGTGATGAGGAGTGTTCAGCCTTGGAAAGGAAAAAGAGATTAGCAGA
GGCATTTCATATCAGTGAGGATTCTGATCCTTTCAATATACGTTCTTCAGGGTCAAAATTCAGTGATAGTTTGAAAGAAGATGCCAGGAA
GGACTTAAAGTTTGTCAGTGACGTTGAGAAGGAAATGGAAACCCTCGTGGAGGCCGTGAATAAGGTTGAAGTCGAAACATCCCACTGGAC
ATTTCTCTAAGGCCAGCTTGATGAAAAAAAAAAAGCAATCAAGTCCTGGAAACACTTTCAACCTAGAGTAGTTGTAGGAAAAAGTCACAA
CTTCTTTGAGGATCCTTATCTTTCTCAAAACACAGCCATTATAACATCTGTGGAAGCCTGTCCTGGTTCAGGACAGTTGACTGCAGAAAA
GATCTACAGTCGGCTGGGCACAGTGTCTCATGCCTGTAATGCCAGCACTTTGGGAGGCCAAGGCAGGCAGATCACTTGAGGTCAGGAGTT
CTAGATCAGCTTGGCCAACATGACGAAACCCTGTCTCTACTAAAAATATAAAAATTAGCTGGGTGTGGTGGCGGGCACCTGTTATTCCAG
CTACTTGCAAGGCTTAGGCAGGAGAATTCCTTGAACCCGGGAGGCAGAGGTTGCTGTGAGCTGAGATTCTGCCACTGCACTCCAGCCTGG

>19520_19520_2_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000378863_NFX1_chr9_33328579_ENST00000379521_length(amino acids)=580AA_BP=189
MLALRVARGSWGALRGAAWAPGTRPSKRRACWALLPPVPCCLGCLAERWRLRPAALGLRLPGIGQRNHCSGAGKAAPRPAAGAGAAAEAP
GGQWGPASTPSLYENPWTIPNMLSMTRIGLAPVLGYLIIEEDFNIALGVFALAGLTDLLDGFIARNWANQRSALGSALDPLADKILISIL
YVSLTYADLIPDFIHTCEKLCHEGDCGPCSRTSVISCRCSFRTKELPCTSLKSEDATFMCDKRCNKKRLCGRHKCNEICCVDKEHKCPLI
CGRKLRCGLHRCEEPCHRGNCQTCWQASFDELTCHCGASVIYPPVPCGTRPPECTQTCARVHECDHPVYHSCHSEEKCPPCTFLTQKWCM
GKHEFRSNIPCHLVDISCGLPCSATLPCGMHKCQRLCHKGECLVDEPCKQPCTTPRADCGHPCMAPCHTSSPCPVTACKAKVELQCECGR
RKEMVICSEASSTYQRIAAISMASKITDMQLGGSVEISKLITKKEVHQARLECDEECSALERKKRLAEAFHISEDSDPFNIRSSGSKFSD

--------------------------------------------------------------
>19520_19520_3_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000378863_NFX1_chr9_33328579_ENST00000379540_length(transcript)=3367nt_BP=731nt
TGTATAAGTGGAGTGTGCTGGGGTGTGTAAAGTAGTATGGAGGCAGCGGTAGCCCAGTGTCTGAGTGGTTGCCGGGTCTCCATGGAGAAG
CGGCTCGCCAGTGTCCCAGGCTGCTGAGCTCTCGCCGCCCGAGACCCCGCGGCGCGGCCGCAGGGCCATGCTAGCCTTGCGCGTGGCGCG
CGGCTCGTGGGGGGCCCTGCGCGGCGCCGCTTGGGCTCCGGGAACGCGGCCGAGTAAGCGACGCGCCTGCTGGGCCCTGCTGCCGCCCGT
GCCCTGCTGCTTGGGCTGCCTGGCCGAACGCTGGAGGCTGCGTCCGGCCGCTCTTGGCTTGCGGCTGCCCGGGATCGGCCAGCGGAACCA
CTGTTCGGGCGCGGGGAAGGCGGCTCCCAGGCCAGCGGCCGGAGCGGGCGCCGCTGCCGAAGCCCCGGGCGGCCAGTGGGGCCCGGCGAG
CACCCCCAGCCTGTATGAAAACCCATGGACAATCCCGAATATGTTGTCAATGACGAGAATTGGCTTGGCCCCAGTTCTGGGCTATTTGAT
TATTGAAGAAGATTTTAATATTGCACTAGGAGTTTTTGCTTTAGCTGGACTAACAGATTTGTTGGATGGATTTATTGCTCGAAACTGGGC
CAATCAAAGATCAGCTTTGGGAAGTGCTCTTGATCCACTTGCTGATAAAATACTTATCAGTATCTTATATGTTAGCTTGACCTATGCAGA
TCTTATTCCAGATTTCATTCATACCTGTGAAAAGCTCTGCCATGAAGGAGACTGTGGACCATGCTCTCGCACATCAGTTATTTCCTGCAG
ATGCTCTTTCAGAACAAAGGAGCTTCCATGTACCAGTCTCAAAAGTGAAGATGCTACATTTATGTGTGACAAGCGGTGTAACAAGAAACG
GTTGTGTGGACGGCATAAATGTAATGAGATATGCTGTGTGGATAAGGAGCACAAGTGTCCTTTGATTTGTGGGAGGAAACTCCGTTGTGG
CCTTCATAGGTGTGAAGAACCTTGTCATCGTGGAAACTGCCAGACATGCTGGCAAGCCAGTTTTGATGAATTAACCTGCCATTGTGGTGC
ATCAGTGATTTACCCTCCAGTTCCCTGTGGTACTAGGCCCCCTGAATGTACCCAAACCTGCGCTAGAGTCCATGAGTGTGACCATCCAGT
ATATCATTCTTGTCATAGTGAGGAGAAGTGTCCCCCTTGCACTTTCCTAACTCAGAAGTGGTGCATGGGCAAGCATGAGTTTCGGAGCAA
CATCCCCTGTCACCTGGTTGATATCTCTTGCGGATTACCCTGCAGTGCCACGCTACCATGTGGGATGCACAAATGTCAGAGACTCTGTCA
CAAAGGGGAGTGTCTTGTGGATGAGCCCTGCAAGCAGCCCTGCACCACCCCCAGAGCTGACTGTGGTCACCCGTGTATGGCACCCTGCCA
TACCAGCTCACCCTGCCCTGTGACTGCTTGTAAAGCTAAGGTAGAGCTACAGTGTGAATGTGGACGAAGAAAAGAGATGGTGATTTGCTC
TGAAGCATCTAGTACTTATCAAAGAATAGCTGCAATCTCCATGGCCTCTAAGATAACAGACATGCAGCTTGGAGGTTCAGTGGAGATCAG
CAAGTTAATTACCAAAAAGGAAGTTCATCAAGCCAGGCTGGAGTGTGATGAGGAGTGTTCAGCCTTGGAAAGGAAAAAGAGATTAGCAGA
GGCATTTCATATCAGTGAGGATTCTGATCCTTTCAATATACGTTCTTCAGGGTCAAAATTCAGTGATAGTTTGAAAGAAGATGCCAGGAA
GGACTTAAAGTTTGTCAGTGACGTTGAGAAGGAAATGGAAACCCTCGTGGAGGCCGTGAATAAGGGAAAGAATAGTAAGAAAAGCCACAG
CTTCCCTCCCATGAACAGAGACCACCGCCGGATCATCCATGACTTGGCCCAAGTTTATGGCCTGGAGAGCGTGAGCTATGACAGTGAACC
GAAGCGCAATGTGGTGGTCACTGCCATCAGGGGGAAGTCCGTTTGTCCTCCTACCACGCTGACAGGTGTGCTTGAAAGGGAAATGCAGGC
ACGGCCTCCACCACCGATTCCTCATCACAGACATCAGTCAGACAAGAATCCTGGGAGCAGTAATTTACAGAAAATAACCAAGGAGCCAAT
AATTGACTATTTTGACGTCCAGGACTAAGAAGATCATGATGCACTTAGATAAAAGAATGATTAGGTATAGTGGAGACTTATTTGCCAGCA
GATAAATCATGCCCGTTCCCCTCTGCCTGGCAGAATCACAGTCTCACATACTGTCTTGTACTGACACATCCAAAGCATGAGTGTGTCAGA
AATCCCTTGTCTATTCCTGTCTGTATAAAGTGTTTCATTATGACCAGATCTCTGATTGTATGGTCACTAGGTATGCAATCACGCATTCAA
AGAGGCTCTTTACACCATCACTGTGATTGCTCTGAGAGTTGAGGGACTATTGGGCTTTATTTGGACAAACCAAACTTTTAGCCTGAAACC
AACTTTATGCCACTAAGTCATAGCCTCAGTTGTCCCAGTTATTTGTCCTCCTGAAAATGCCTGAAACATCAGACAGACATTGCTTGCTTT
ACCCAAACTGATCAAAATCTTTAGGAGCACAAATGAATTTTTTAGTCTGAAATACCAAATAATGAATTGGTATACCATATCCGGAATCAC
ACATGTTATCTTAAACCCAGCCATCATACCTAAGTCTTTTGCCAAAACCTCTCATAGGTATATCTAGCTGAACTTATTTTGGCATTTTCA
ATGTGATCAGTTCTAGACCTAGAAGGGGGTCAGGCTGCTTTACAGAATTCTATTTCCTTAAGTCCCTGGCACTTCTCATACCACATCACT
GAACCTGTTCAGTAACAATCAGTTTGGCCGTCCCCCATGATGGTAGGAAATATAGAGAGCAAGTTCTTCTGCCAGGGTCACACTGTGGTC
TCTGAACTGACCAGTATATCCCTAACTCCTCTTTGATAGAGAAAGAGTCTCAAATGGACAACTGTCCTGTGTTGCTTTCCCTAGGCCTTC
AGCAGCCTATTGGCTCTCCCTGCCTCTGAGCTCTGGACTCTGTTTGAATATTCCAAGTAGTATATGGACAGTCCAGGGCTTATGCCCAGC
AGCCCACTGGAGGCATTCTTCAGGCTCCTTTAAGGCAGGTGCATTGATAGTTCCATTAGTGTGACCCTTGCATTGGCACCCCTCCAGCCT
GGAGGCCAGGCTTCCAGCAACTTCCTTCTGCCCTAGAGCAAGCCATGAGCCCCAGAGCAGTAGCAGGAGACTTGAGAAGTAGAGTGACAA

>19520_19520_3_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000378863_NFX1_chr9_33328579_ENST00000379540_length(amino acids)=676AA_BP=189
MLALRVARGSWGALRGAAWAPGTRPSKRRACWALLPPVPCCLGCLAERWRLRPAALGLRLPGIGQRNHCSGAGKAAPRPAAGAGAAAEAP
GGQWGPASTPSLYENPWTIPNMLSMTRIGLAPVLGYLIIEEDFNIALGVFALAGLTDLLDGFIARNWANQRSALGSALDPLADKILISIL
YVSLTYADLIPDFIHTCEKLCHEGDCGPCSRTSVISCRCSFRTKELPCTSLKSEDATFMCDKRCNKKRLCGRHKCNEICCVDKEHKCPLI
CGRKLRCGLHRCEEPCHRGNCQTCWQASFDELTCHCGASVIYPPVPCGTRPPECTQTCARVHECDHPVYHSCHSEEKCPPCTFLTQKWCM
GKHEFRSNIPCHLVDISCGLPCSATLPCGMHKCQRLCHKGECLVDEPCKQPCTTPRADCGHPCMAPCHTSSPCPVTACKAKVELQCECGR
RKEMVICSEASSTYQRIAAISMASKITDMQLGGSVEISKLITKKEVHQARLECDEECSALERKKRLAEAFHISEDSDPFNIRSSGSKFSD
SLKEDARKDLKFVSDVEKEMETLVEAVNKGKNSKKSHSFPPMNRDHRRIIHDLAQVYGLESVSYDSEPKRNVVVTAIRGKSVCPPTTLTG

--------------------------------------------------------------
>19520_19520_4_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000378868_NFX1_chr9_33328579_ENST00000318524_length(transcript)=2006nt_BP=427nt
AGTTGTACAGCATGGGTTGAGGTGGCCAGATCCTGGCAGGGGTCTCAACTCCACTGATAGCTAAAGCATGTTGGGGTTTGAATGCTGGAT
ACTTTGAAGTTGCCATATCCTGACTGAAGTCCTTCCCAGAACGGCAGTAGTTGGTCGAGTATGCCACAGTATGAAAACCCATGGACAATC
CCGAATATGTTGTCAATGACGAGAATTGGCTTGGCCCCAGTTCTGGGCTATTTGATTATTGAAGAAGATTTTAATATTGCACTAGGAGTT
TTTGCTTTAGCTGGACTAACAGATTTGTTGGATGGATTTATTGCTCGAAACTGGGCCAATCAAAGATCAGCTTTGGGAAGTGCTCTTGAT
CCACTTGCTGATAAAATACTTATCAGTATCTTATATGTTAGCTTGACCTATGCAGATCTTATTCCAGATTTCATTCATACCTGTGAAAAG
CTCTGCCATGAAGGAGACTGTGGACCATGCTCTCGCACATCAGTTATTTCCTGCAGATGCTCTTTCAGAACAAAGGAGCTTCCATGTACC
AGTCTCAAAAGTGAAGATGCTACATTTATGTGTGACAAGCGGTGTAACAAGAAACGGTTGTGTGGACGGCATAAATGTAATGAGATATGC
TGTGTGGATAAGGAGCACAAGTGTCCTTTGATTTGTGGGAGGAAACTCCGTTGTGGCCTTCATAGGTGTGAAGAACCTTGTCATCGTGGA
AACTGCCAGACATGCTGGCAAGCCAGTTTTGATGAATTAACCTGCCATTGTGGTGCATCAGTGATTTACCCTCCAGTTCCCTGTGGTACT
AGGCCCCCTGAATGTACCCAAACCTGCGCTAGAGTCCATGAGTGTGACCATCCAGTATATCATTCTTGTCATAGTGAGGAGAAGTGTCCC
CCTTGCACTTTCCTAACTCAGAAGTGGTGCATGGGCAAGCATGAGCAGTCCCACTACTGGGCGTCTACCCAGAAGAAAAGAAGTCATTAT
ATGAAAAAGATACCTGCACACGCATGTTTATAGCAGCACAATTCACAATTGCAAAAATGTGGAACCAGCCCAACTGCCCATCAGTCAACA
AGTGGATAAAGAAATTGTGGTGTATCTATACGTACCATGGAATACTACTCAGCCAGGAACGAAATAATGGCATTCACAGCAACCTGGATG
GATTTGGAGACCATTATTCTAAGTGAAGTAACTCAGGAATGGAAAACCAAACATCGTATGTTCTCAATTATAAGAGGGAGCTAAGCTATG
AGGACGCAAAGGCATGAGAATGATACAATGGACTTTGGGGACTCTGGGGAAAGGACGGGAGGCGGGTGAGGGATAAAAAACTACACACTG
GGTCCGAGCACGGTGGCTCATGCCTATAATCCCAGCACTTTGGGAGGCCAAGGCAGGTAGATTATGAGGTCGGGAGTTCGAAACCAGCCT
GGCCAACAAGGTGAGACCCCCCCCATCTCTACTAAAATAATTAGCCGGGCCTGGTGGCGCATCCCTGTAATCCCAGCTACTTTACTTGGG
AGGCTGAGGCAGGAGAATTGCTTGAACTCGGGAGGCGGATGTTGCAGTGAGCCAATATCGCACCACTGCACTCCAGCCTGGGCAACAGAG
CAAGACTCCGTCTCGAAAAAAAAAATAAATTTTAAAAACTACACATTGGGTACAATGTACACTCCTCGGGTGATGGGTGCACCAAAATCT
CAAAAATCACTACTAAGGAACTTATCCATGTAACCAAACACCACCTGTTCCCCAAGAACTATTGAAATAAGAAAAAAGAAAAAAAAAAAA
TCACTGATTGCAGGTCACCCTAACAGATATAATAATGAAAAAGTCTGAGATGTTGCGAGAATTACCAAAATGTGATGCAGACACGAAATG
AGCATGTGCTGTTGGAAAAGTGGAACAGATAGACTTGCTGGAGACAGGGTTGCCACAAAGTTTCAATTTGTAAAAAATGTGATATCTATG

>19520_19520_4_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000378868_NFX1_chr9_33328579_ENST00000318524_length(amino acids)=290AA_BP=90
MPQYENPWTIPNMLSMTRIGLAPVLGYLIIEEDFNIALGVFALAGLTDLLDGFIARNWANQRSALGSALDPLADKILISILYVSLTYADL
IPDFIHTCEKLCHEGDCGPCSRTSVISCRCSFRTKELPCTSLKSEDATFMCDKRCNKKRLCGRHKCNEICCVDKEHKCPLICGRKLRCGL
HRCEEPCHRGNCQTCWQASFDELTCHCGASVIYPPVPCGTRPPECTQTCARVHECDHPVYHSCHSEEKCPPCTFLTQKWCMGKHEQSHYW

--------------------------------------------------------------
>19520_19520_5_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000378868_NFX1_chr9_33328579_ENST00000379521_length(transcript)=2069nt_BP=427nt
AGTTGTACAGCATGGGTTGAGGTGGCCAGATCCTGGCAGGGGTCTCAACTCCACTGATAGCTAAAGCATGTTGGGGTTTGAATGCTGGAT
ACTTTGAAGTTGCCATATCCTGACTGAAGTCCTTCCCAGAACGGCAGTAGTTGGTCGAGTATGCCACAGTATGAAAACCCATGGACAATC
CCGAATATGTTGTCAATGACGAGAATTGGCTTGGCCCCAGTTCTGGGCTATTTGATTATTGAAGAAGATTTTAATATTGCACTAGGAGTT
TTTGCTTTAGCTGGACTAACAGATTTGTTGGATGGATTTATTGCTCGAAACTGGGCCAATCAAAGATCAGCTTTGGGAAGTGCTCTTGAT
CCACTTGCTGATAAAATACTTATCAGTATCTTATATGTTAGCTTGACCTATGCAGATCTTATTCCAGATTTCATTCATACCTGTGAAAAG
CTCTGCCATGAAGGAGACTGTGGACCATGCTCTCGCACATCAGTTATTTCCTGCAGATGCTCTTTCAGAACAAAGGAGCTTCCATGTACC
AGTCTCAAAAGTGAAGATGCTACATTTATGTGTGACAAGCGGTGTAACAAGAAACGGTTGTGTGGACGGCATAAATGTAATGAGATATGC
TGTGTGGATAAGGAGCACAAGTGTCCTTTGATTTGTGGGAGGAAACTCCGTTGTGGCCTTCATAGGTGTGAAGAACCTTGTCATCGTGGA
AACTGCCAGACATGCTGGCAAGCCAGTTTTGATGAATTAACCTGCCATTGTGGTGCATCAGTGATTTACCCTCCAGTTCCCTGTGGTACT
AGGCCCCCTGAATGTACCCAAACCTGCGCTAGAGTCCATGAGTGTGACCATCCAGTATATCATTCTTGTCATAGTGAGGAGAAGTGTCCC
CCTTGCACTTTCCTAACTCAGAAGTGGTGCATGGGCAAGCATGAGTTTCGGAGCAACATCCCCTGTCACCTGGTTGATATCTCTTGCGGA
TTACCCTGCAGTGCCACGCTACCATGTGGGATGCACAAATGTCAGAGACTCTGTCACAAAGGGGAGTGTCTTGTGGATGAGCCCTGCAAG
CAGCCCTGCACCACCCCCAGAGCTGACTGTGGTCACCCGTGTATGGCACCCTGCCATACCAGCTCACCCTGCCCTGTGACTGCTTGTAAA
GCTAAGGTAGAGCTACAGTGTGAATGTGGACGAAGAAAAGAGATGGTGATTTGCTCTGAAGCATCTAGTACTTATCAAAGAATAGCTGCA
ATCTCCATGGCCTCTAAGATAACAGACATGCAGCTTGGAGGTTCAGTGGAGATCAGCAAGTTAATTACCAAAAAGGAAGTTCATCAAGCC
AGGCTGGAGTGTGATGAGGAGTGTTCAGCCTTGGAAAGGAAAAAGAGATTAGCAGAGGCATTTCATATCAGTGAGGATTCTGATCCTTTC
AATATACGTTCTTCAGGGTCAAAATTCAGTGATAGTTTGAAAGAAGATGCCAGGAAGGACTTAAAGTTTGTCAGTGACGTTGAGAAGGAA
ATGGAAACCCTCGTGGAGGCCGTGAATAAGGTTGAAGTCGAAACATCCCACTGGACATTTCTCTAAGGCCAGCTTGATGAAAAAAAAAAA
GCAATCAAGTCCTGGAAACACTTTCAACCTAGAGTAGTTGTAGGAAAAAGTCACAACTTCTTTGAGGATCCTTATCTTTCTCAAAACACA
GCCATTATAACATCTGTGGAAGCCTGTCCTGGTTCAGGACAGTTGACTGCAGAAAAGATCTACAGTCGGCTGGGCACAGTGTCTCATGCC
TGTAATGCCAGCACTTTGGGAGGCCAAGGCAGGCAGATCACTTGAGGTCAGGAGTTCTAGATCAGCTTGGCCAACATGACGAAACCCTGT
CTCTACTAAAAATATAAAAATTAGCTGGGTGTGGTGGCGGGCACCTGTTATTCCAGCTACTTGCAAGGCTTAGGCAGGAGAATTCCTTGA

>19520_19520_5_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000378868_NFX1_chr9_33328579_ENST00000379521_length(amino acids)=481AA_BP=90
MPQYENPWTIPNMLSMTRIGLAPVLGYLIIEEDFNIALGVFALAGLTDLLDGFIARNWANQRSALGSALDPLADKILISILYVSLTYADL
IPDFIHTCEKLCHEGDCGPCSRTSVISCRCSFRTKELPCTSLKSEDATFMCDKRCNKKRLCGRHKCNEICCVDKEHKCPLICGRKLRCGL
HRCEEPCHRGNCQTCWQASFDELTCHCGASVIYPPVPCGTRPPECTQTCARVHECDHPVYHSCHSEEKCPPCTFLTQKWCMGKHEFRSNI
PCHLVDISCGLPCSATLPCGMHKCQRLCHKGECLVDEPCKQPCTTPRADCGHPCMAPCHTSSPCPVTACKAKVELQCECGRRKEMVICSE
ASSTYQRIAAISMASKITDMQLGGSVEISKLITKKEVHQARLECDEECSALERKKRLAEAFHISEDSDPFNIRSSGSKFSDSLKEDARKD

--------------------------------------------------------------
>19520_19520_6_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000378868_NFX1_chr9_33328579_ENST00000379540_length(transcript)=3063nt_BP=427nt
AGTTGTACAGCATGGGTTGAGGTGGCCAGATCCTGGCAGGGGTCTCAACTCCACTGATAGCTAAAGCATGTTGGGGTTTGAATGCTGGAT
ACTTTGAAGTTGCCATATCCTGACTGAAGTCCTTCCCAGAACGGCAGTAGTTGGTCGAGTATGCCACAGTATGAAAACCCATGGACAATC
CCGAATATGTTGTCAATGACGAGAATTGGCTTGGCCCCAGTTCTGGGCTATTTGATTATTGAAGAAGATTTTAATATTGCACTAGGAGTT
TTTGCTTTAGCTGGACTAACAGATTTGTTGGATGGATTTATTGCTCGAAACTGGGCCAATCAAAGATCAGCTTTGGGAAGTGCTCTTGAT
CCACTTGCTGATAAAATACTTATCAGTATCTTATATGTTAGCTTGACCTATGCAGATCTTATTCCAGATTTCATTCATACCTGTGAAAAG
CTCTGCCATGAAGGAGACTGTGGACCATGCTCTCGCACATCAGTTATTTCCTGCAGATGCTCTTTCAGAACAAAGGAGCTTCCATGTACC
AGTCTCAAAAGTGAAGATGCTACATTTATGTGTGACAAGCGGTGTAACAAGAAACGGTTGTGTGGACGGCATAAATGTAATGAGATATGC
TGTGTGGATAAGGAGCACAAGTGTCCTTTGATTTGTGGGAGGAAACTCCGTTGTGGCCTTCATAGGTGTGAAGAACCTTGTCATCGTGGA
AACTGCCAGACATGCTGGCAAGCCAGTTTTGATGAATTAACCTGCCATTGTGGTGCATCAGTGATTTACCCTCCAGTTCCCTGTGGTACT
AGGCCCCCTGAATGTACCCAAACCTGCGCTAGAGTCCATGAGTGTGACCATCCAGTATATCATTCTTGTCATAGTGAGGAGAAGTGTCCC
CCTTGCACTTTCCTAACTCAGAAGTGGTGCATGGGCAAGCATGAGTTTCGGAGCAACATCCCCTGTCACCTGGTTGATATCTCTTGCGGA
TTACCCTGCAGTGCCACGCTACCATGTGGGATGCACAAATGTCAGAGACTCTGTCACAAAGGGGAGTGTCTTGTGGATGAGCCCTGCAAG
CAGCCCTGCACCACCCCCAGAGCTGACTGTGGTCACCCGTGTATGGCACCCTGCCATACCAGCTCACCCTGCCCTGTGACTGCTTGTAAA
GCTAAGGTAGAGCTACAGTGTGAATGTGGACGAAGAAAAGAGATGGTGATTTGCTCTGAAGCATCTAGTACTTATCAAAGAATAGCTGCA
ATCTCCATGGCCTCTAAGATAACAGACATGCAGCTTGGAGGTTCAGTGGAGATCAGCAAGTTAATTACCAAAAAGGAAGTTCATCAAGCC
AGGCTGGAGTGTGATGAGGAGTGTTCAGCCTTGGAAAGGAAAAAGAGATTAGCAGAGGCATTTCATATCAGTGAGGATTCTGATCCTTTC
AATATACGTTCTTCAGGGTCAAAATTCAGTGATAGTTTGAAAGAAGATGCCAGGAAGGACTTAAAGTTTGTCAGTGACGTTGAGAAGGAA
ATGGAAACCCTCGTGGAGGCCGTGAATAAGGGAAAGAATAGTAAGAAAAGCCACAGCTTCCCTCCCATGAACAGAGACCACCGCCGGATC
ATCCATGACTTGGCCCAAGTTTATGGCCTGGAGAGCGTGAGCTATGACAGTGAACCGAAGCGCAATGTGGTGGTCACTGCCATCAGGGGG
AAGTCCGTTTGTCCTCCTACCACGCTGACAGGTGTGCTTGAAAGGGAAATGCAGGCACGGCCTCCACCACCGATTCCTCATCACAGACAT
CAGTCAGACAAGAATCCTGGGAGCAGTAATTTACAGAAAATAACCAAGGAGCCAATAATTGACTATTTTGACGTCCAGGACTAAGAAGAT
CATGATGCACTTAGATAAAAGAATGATTAGGTATAGTGGAGACTTATTTGCCAGCAGATAAATCATGCCCGTTCCCCTCTGCCTGGCAGA
ATCACAGTCTCACATACTGTCTTGTACTGACACATCCAAAGCATGAGTGTGTCAGAAATCCCTTGTCTATTCCTGTCTGTATAAAGTGTT
TCATTATGACCAGATCTCTGATTGTATGGTCACTAGGTATGCAATCACGCATTCAAAGAGGCTCTTTACACCATCACTGTGATTGCTCTG
AGAGTTGAGGGACTATTGGGCTTTATTTGGACAAACCAAACTTTTAGCCTGAAACCAACTTTATGCCACTAAGTCATAGCCTCAGTTGTC
CCAGTTATTTGTCCTCCTGAAAATGCCTGAAACATCAGACAGACATTGCTTGCTTTACCCAAACTGATCAAAATCTTTAGGAGCACAAAT
GAATTTTTTAGTCTGAAATACCAAATAATGAATTGGTATACCATATCCGGAATCACACATGTTATCTTAAACCCAGCCATCATACCTAAG
TCTTTTGCCAAAACCTCTCATAGGTATATCTAGCTGAACTTATTTTGGCATTTTCAATGTGATCAGTTCTAGACCTAGAAGGGGGTCAGG
CTGCTTTACAGAATTCTATTTCCTTAAGTCCCTGGCACTTCTCATACCACATCACTGAACCTGTTCAGTAACAATCAGTTTGGCCGTCCC
CCATGATGGTAGGAAATATAGAGAGCAAGTTCTTCTGCCAGGGTCACACTGTGGTCTCTGAACTGACCAGTATATCCCTAACTCCTCTTT
GATAGAGAAAGAGTCTCAAATGGACAACTGTCCTGTGTTGCTTTCCCTAGGCCTTCAGCAGCCTATTGGCTCTCCCTGCCTCTGAGCTCT
GGACTCTGTTTGAATATTCCAAGTAGTATATGGACAGTCCAGGGCTTATGCCCAGCAGCCCACTGGAGGCATTCTTCAGGCTCCTTTAAG
GCAGGTGCATTGATAGTTCCATTAGTGTGACCCTTGCATTGGCACCCCTCCAGCCTGGAGGCCAGGCTTCCAGCAACTTCCTTCTGCCCT
AGAGCAAGCCATGAGCCCCAGAGCAGTAGCAGGAGACTTGAGAAGTAGAGTGACAAAAACAAGCACTTAATTAAATTATAAAATTTAACT

>19520_19520_6_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000378868_NFX1_chr9_33328579_ENST00000379540_length(amino acids)=577AA_BP=90
MPQYENPWTIPNMLSMTRIGLAPVLGYLIIEEDFNIALGVFALAGLTDLLDGFIARNWANQRSALGSALDPLADKILISILYVSLTYADL
IPDFIHTCEKLCHEGDCGPCSRTSVISCRCSFRTKELPCTSLKSEDATFMCDKRCNKKRLCGRHKCNEICCVDKEHKCPLICGRKLRCGL
HRCEEPCHRGNCQTCWQASFDELTCHCGASVIYPPVPCGTRPPECTQTCARVHECDHPVYHSCHSEEKCPPCTFLTQKWCMGKHEFRSNI
PCHLVDISCGLPCSATLPCGMHKCQRLCHKGECLVDEPCKQPCTTPRADCGHPCMAPCHTSSPCPVTACKAKVELQCECGRRKEMVICSE
ASSTYQRIAAISMASKITDMQLGGSVEISKLITKKEVHQARLECDEECSALERKKRLAEAFHISEDSDPFNIRSSGSKFSDSLKEDARKD
LKFVSDVEKEMETLVEAVNKGKNSKKSHSFPPMNRDHRRIIHDLAQVYGLESVSYDSEPKRNVVVTAIRGKSVCPPTTLTGVLEREMQAR

--------------------------------------------------------------
>19520_19520_7_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000452938_NFX1_chr9_33328579_ENST00000318524_length(transcript)=2250nt_BP=671nt
CTGAGTGGTTGCCGGGTCTCCATGGAGAAGCGGCTCGCCAGTGTCCCAGGCTGCTGAGCTCTCGCCGCCCGAGACCCCGCGGCGCGGCCG
CAGGGCCATGCTAGCCTTGCGCGTGGCGCGCGGCTCGTGGGGGGCCCTGCGCGGCGCCGCTTGGGCTCCGGGAACGCGGCCGAGTAAGCG
ACGCGCCTGCTGGGCCCTGCTGCCGCCCGTGCCCTGCTGCTTGGGCTGCCTGGCCGAACGCTGGAGGCTGCGTCCGGCCGCTCTTGGCTT
GCGGCTGCCCGGGATCGGCCAGCGGAACCACTGTTCGGGCGCGGGGAAGGCGGCTCCCAGGCCAGCGGCCGGAGCGGGCGCCGCTGCCGA
AGCCCCGGGCGGCCAGTGGGGCCCGGCGAGCACCCCCAGCCTGTATGAAAACCCATGGACAATCCCGAATATGTTGTCAATGACGAGAAT
TGGCTTGGCCCCAGTTCTGGGCTATTTGATTATTGAAGAAGATTTTAATATTGCACTAGGAGTTTTTGCTTTAGCTGGACTAACAGATTT
GTTGGATGGATTTATTGCTCGAAACTGGGCCAATCAAAGATCAGCTTTGGGAAGTGCTCTTGATCCACTTGCTGATAAAATACTTATCAG
TATCTTATATGTTAGCTTGACCTATGCAGATCTTATTCCAGATTTCATTCATACCTGTGAAAAGCTCTGCCATGAAGGAGACTGTGGACC
ATGCTCTCGCACATCAGTTATTTCCTGCAGATGCTCTTTCAGAACAAAGGAGCTTCCATGTACCAGTCTCAAAAGTGAAGATGCTACATT
TATGTGTGACAAGCGGTGTAACAAGAAACGGTTGTGTGGACGGCATAAATGTAATGAGATATGCTGTGTGGATAAGGAGCACAAGTGTCC
TTTGATTTGTGGGAGGAAACTCCGTTGTGGCCTTCATAGGTGTGAAGAACCTTGTCATCGTGGAAACTGCCAGACATGCTGGCAAGCCAG
TTTTGATGAATTAACCTGCCATTGTGGTGCATCAGTGATTTACCCTCCAGTTCCCTGTGGTACTAGGCCCCCTGAATGTACCCAAACCTG
CGCTAGAGTCCATGAGTGTGACCATCCAGTATATCATTCTTGTCATAGTGAGGAGAAGTGTCCCCCTTGCACTTTCCTAACTCAGAAGTG
GTGCATGGGCAAGCATGAGCAGTCCCACTACTGGGCGTCTACCCAGAAGAAAAGAAGTCATTATATGAAAAAGATACCTGCACACGCATG
TTTATAGCAGCACAATTCACAATTGCAAAAATGTGGAACCAGCCCAACTGCCCATCAGTCAACAAGTGGATAAAGAAATTGTGGTGTATC
TATACGTACCATGGAATACTACTCAGCCAGGAACGAAATAATGGCATTCACAGCAACCTGGATGGATTTGGAGACCATTATTCTAAGTGA
AGTAACTCAGGAATGGAAAACCAAACATCGTATGTTCTCAATTATAAGAGGGAGCTAAGCTATGAGGACGCAAAGGCATGAGAATGATAC
AATGGACTTTGGGGACTCTGGGGAAAGGACGGGAGGCGGGTGAGGGATAAAAAACTACACACTGGGTCCGAGCACGGTGGCTCATGCCTA
TAATCCCAGCACTTTGGGAGGCCAAGGCAGGTAGATTATGAGGTCGGGAGTTCGAAACCAGCCTGGCCAACAAGGTGAGACCCCCCCCAT
CTCTACTAAAATAATTAGCCGGGCCTGGTGGCGCATCCCTGTAATCCCAGCTACTTTACTTGGGAGGCTGAGGCAGGAGAATTGCTTGAA
CTCGGGAGGCGGATGTTGCAGTGAGCCAATATCGCACCACTGCACTCCAGCCTGGGCAACAGAGCAAGACTCCGTCTCGAAAAAAAAAAT
AAATTTTAAAAACTACACATTGGGTACAATGTACACTCCTCGGGTGATGGGTGCACCAAAATCTCAAAAATCACTACTAAGGAACTTATC
CATGTAACCAAACACCACCTGTTCCCCAAGAACTATTGAAATAAGAAAAAAGAAAAAAAAAAAATCACTGATTGCAGGTCACCCTAACAG
ATATAATAATGAAAAAGTCTGAGATGTTGCGAGAATTACCAAAATGTGATGCAGACACGAAATGAGCATGTGCTGTTGGAAAAGTGGAAC
AGATAGACTTGCTGGAGACAGGGTTGCCACAAAGTTTCAATTTGTAAAAAATGTGATATCTATGAAATGCAATAAAGTGAAGTCCAATAA

>19520_19520_7_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000452938_NFX1_chr9_33328579_ENST00000318524_length(amino acids)=389AA_BP=189
MLALRVARGSWGALRGAAWAPGTRPSKRRACWALLPPVPCCLGCLAERWRLRPAALGLRLPGIGQRNHCSGAGKAAPRPAAGAGAAAEAP
GGQWGPASTPSLYENPWTIPNMLSMTRIGLAPVLGYLIIEEDFNIALGVFALAGLTDLLDGFIARNWANQRSALGSALDPLADKILISIL
YVSLTYADLIPDFIHTCEKLCHEGDCGPCSRTSVISCRCSFRTKELPCTSLKSEDATFMCDKRCNKKRLCGRHKCNEICCVDKEHKCPLI
CGRKLRCGLHRCEEPCHRGNCQTCWQASFDELTCHCGASVIYPPVPCGTRPPECTQTCARVHECDHPVYHSCHSEEKCPPCTFLTQKWCM

--------------------------------------------------------------
>19520_19520_8_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000452938_NFX1_chr9_33328579_ENST00000379521_length(transcript)=2313nt_BP=671nt
CTGAGTGGTTGCCGGGTCTCCATGGAGAAGCGGCTCGCCAGTGTCCCAGGCTGCTGAGCTCTCGCCGCCCGAGACCCCGCGGCGCGGCCG
CAGGGCCATGCTAGCCTTGCGCGTGGCGCGCGGCTCGTGGGGGGCCCTGCGCGGCGCCGCTTGGGCTCCGGGAACGCGGCCGAGTAAGCG
ACGCGCCTGCTGGGCCCTGCTGCCGCCCGTGCCCTGCTGCTTGGGCTGCCTGGCCGAACGCTGGAGGCTGCGTCCGGCCGCTCTTGGCTT
GCGGCTGCCCGGGATCGGCCAGCGGAACCACTGTTCGGGCGCGGGGAAGGCGGCTCCCAGGCCAGCGGCCGGAGCGGGCGCCGCTGCCGA
AGCCCCGGGCGGCCAGTGGGGCCCGGCGAGCACCCCCAGCCTGTATGAAAACCCATGGACAATCCCGAATATGTTGTCAATGACGAGAAT
TGGCTTGGCCCCAGTTCTGGGCTATTTGATTATTGAAGAAGATTTTAATATTGCACTAGGAGTTTTTGCTTTAGCTGGACTAACAGATTT
GTTGGATGGATTTATTGCTCGAAACTGGGCCAATCAAAGATCAGCTTTGGGAAGTGCTCTTGATCCACTTGCTGATAAAATACTTATCAG
TATCTTATATGTTAGCTTGACCTATGCAGATCTTATTCCAGATTTCATTCATACCTGTGAAAAGCTCTGCCATGAAGGAGACTGTGGACC
ATGCTCTCGCACATCAGTTATTTCCTGCAGATGCTCTTTCAGAACAAAGGAGCTTCCATGTACCAGTCTCAAAAGTGAAGATGCTACATT
TATGTGTGACAAGCGGTGTAACAAGAAACGGTTGTGTGGACGGCATAAATGTAATGAGATATGCTGTGTGGATAAGGAGCACAAGTGTCC
TTTGATTTGTGGGAGGAAACTCCGTTGTGGCCTTCATAGGTGTGAAGAACCTTGTCATCGTGGAAACTGCCAGACATGCTGGCAAGCCAG
TTTTGATGAATTAACCTGCCATTGTGGTGCATCAGTGATTTACCCTCCAGTTCCCTGTGGTACTAGGCCCCCTGAATGTACCCAAACCTG
CGCTAGAGTCCATGAGTGTGACCATCCAGTATATCATTCTTGTCATAGTGAGGAGAAGTGTCCCCCTTGCACTTTCCTAACTCAGAAGTG
GTGCATGGGCAAGCATGAGTTTCGGAGCAACATCCCCTGTCACCTGGTTGATATCTCTTGCGGATTACCCTGCAGTGCCACGCTACCATG
TGGGATGCACAAATGTCAGAGACTCTGTCACAAAGGGGAGTGTCTTGTGGATGAGCCCTGCAAGCAGCCCTGCACCACCCCCAGAGCTGA
CTGTGGTCACCCGTGTATGGCACCCTGCCATACCAGCTCACCCTGCCCTGTGACTGCTTGTAAAGCTAAGGTAGAGCTACAGTGTGAATG
TGGACGAAGAAAAGAGATGGTGATTTGCTCTGAAGCATCTAGTACTTATCAAAGAATAGCTGCAATCTCCATGGCCTCTAAGATAACAGA
CATGCAGCTTGGAGGTTCAGTGGAGATCAGCAAGTTAATTACCAAAAAGGAAGTTCATCAAGCCAGGCTGGAGTGTGATGAGGAGTGTTC
AGCCTTGGAAAGGAAAAAGAGATTAGCAGAGGCATTTCATATCAGTGAGGATTCTGATCCTTTCAATATACGTTCTTCAGGGTCAAAATT
CAGTGATAGTTTGAAAGAAGATGCCAGGAAGGACTTAAAGTTTGTCAGTGACGTTGAGAAGGAAATGGAAACCCTCGTGGAGGCCGTGAA
TAAGGTTGAAGTCGAAACATCCCACTGGACATTTCTCTAAGGCCAGCTTGATGAAAAAAAAAAAGCAATCAAGTCCTGGAAACACTTTCA
ACCTAGAGTAGTTGTAGGAAAAAGTCACAACTTCTTTGAGGATCCTTATCTTTCTCAAAACACAGCCATTATAACATCTGTGGAAGCCTG
TCCTGGTTCAGGACAGTTGACTGCAGAAAAGATCTACAGTCGGCTGGGCACAGTGTCTCATGCCTGTAATGCCAGCACTTTGGGAGGCCA
AGGCAGGCAGATCACTTGAGGTCAGGAGTTCTAGATCAGCTTGGCCAACATGACGAAACCCTGTCTCTACTAAAAATATAAAAATTAGCT
GGGTGTGGTGGCGGGCACCTGTTATTCCAGCTACTTGCAAGGCTTAGGCAGGAGAATTCCTTGAACCCGGGAGGCAGAGGTTGCTGTGAG

>19520_19520_8_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000452938_NFX1_chr9_33328579_ENST00000379521_length(amino acids)=580AA_BP=189
MLALRVARGSWGALRGAAWAPGTRPSKRRACWALLPPVPCCLGCLAERWRLRPAALGLRLPGIGQRNHCSGAGKAAPRPAAGAGAAAEAP
GGQWGPASTPSLYENPWTIPNMLSMTRIGLAPVLGYLIIEEDFNIALGVFALAGLTDLLDGFIARNWANQRSALGSALDPLADKILISIL
YVSLTYADLIPDFIHTCEKLCHEGDCGPCSRTSVISCRCSFRTKELPCTSLKSEDATFMCDKRCNKKRLCGRHKCNEICCVDKEHKCPLI
CGRKLRCGLHRCEEPCHRGNCQTCWQASFDELTCHCGASVIYPPVPCGTRPPECTQTCARVHECDHPVYHSCHSEEKCPPCTFLTQKWCM
GKHEFRSNIPCHLVDISCGLPCSATLPCGMHKCQRLCHKGECLVDEPCKQPCTTPRADCGHPCMAPCHTSSPCPVTACKAKVELQCECGR
RKEMVICSEASSTYQRIAAISMASKITDMQLGGSVEISKLITKKEVHQARLECDEECSALERKKRLAEAFHISEDSDPFNIRSSGSKFSD

--------------------------------------------------------------
>19520_19520_9_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000452938_NFX1_chr9_33328579_ENST00000379540_length(transcript)=3307nt_BP=671nt
CTGAGTGGTTGCCGGGTCTCCATGGAGAAGCGGCTCGCCAGTGTCCCAGGCTGCTGAGCTCTCGCCGCCCGAGACCCCGCGGCGCGGCCG
CAGGGCCATGCTAGCCTTGCGCGTGGCGCGCGGCTCGTGGGGGGCCCTGCGCGGCGCCGCTTGGGCTCCGGGAACGCGGCCGAGTAAGCG
ACGCGCCTGCTGGGCCCTGCTGCCGCCCGTGCCCTGCTGCTTGGGCTGCCTGGCCGAACGCTGGAGGCTGCGTCCGGCCGCTCTTGGCTT
GCGGCTGCCCGGGATCGGCCAGCGGAACCACTGTTCGGGCGCGGGGAAGGCGGCTCCCAGGCCAGCGGCCGGAGCGGGCGCCGCTGCCGA
AGCCCCGGGCGGCCAGTGGGGCCCGGCGAGCACCCCCAGCCTGTATGAAAACCCATGGACAATCCCGAATATGTTGTCAATGACGAGAAT
TGGCTTGGCCCCAGTTCTGGGCTATTTGATTATTGAAGAAGATTTTAATATTGCACTAGGAGTTTTTGCTTTAGCTGGACTAACAGATTT
GTTGGATGGATTTATTGCTCGAAACTGGGCCAATCAAAGATCAGCTTTGGGAAGTGCTCTTGATCCACTTGCTGATAAAATACTTATCAG
TATCTTATATGTTAGCTTGACCTATGCAGATCTTATTCCAGATTTCATTCATACCTGTGAAAAGCTCTGCCATGAAGGAGACTGTGGACC
ATGCTCTCGCACATCAGTTATTTCCTGCAGATGCTCTTTCAGAACAAAGGAGCTTCCATGTACCAGTCTCAAAAGTGAAGATGCTACATT
TATGTGTGACAAGCGGTGTAACAAGAAACGGTTGTGTGGACGGCATAAATGTAATGAGATATGCTGTGTGGATAAGGAGCACAAGTGTCC
TTTGATTTGTGGGAGGAAACTCCGTTGTGGCCTTCATAGGTGTGAAGAACCTTGTCATCGTGGAAACTGCCAGACATGCTGGCAAGCCAG
TTTTGATGAATTAACCTGCCATTGTGGTGCATCAGTGATTTACCCTCCAGTTCCCTGTGGTACTAGGCCCCCTGAATGTACCCAAACCTG
CGCTAGAGTCCATGAGTGTGACCATCCAGTATATCATTCTTGTCATAGTGAGGAGAAGTGTCCCCCTTGCACTTTCCTAACTCAGAAGTG
GTGCATGGGCAAGCATGAGTTTCGGAGCAACATCCCCTGTCACCTGGTTGATATCTCTTGCGGATTACCCTGCAGTGCCACGCTACCATG
TGGGATGCACAAATGTCAGAGACTCTGTCACAAAGGGGAGTGTCTTGTGGATGAGCCCTGCAAGCAGCCCTGCACCACCCCCAGAGCTGA
CTGTGGTCACCCGTGTATGGCACCCTGCCATACCAGCTCACCCTGCCCTGTGACTGCTTGTAAAGCTAAGGTAGAGCTACAGTGTGAATG
TGGACGAAGAAAAGAGATGGTGATTTGCTCTGAAGCATCTAGTACTTATCAAAGAATAGCTGCAATCTCCATGGCCTCTAAGATAACAGA
CATGCAGCTTGGAGGTTCAGTGGAGATCAGCAAGTTAATTACCAAAAAGGAAGTTCATCAAGCCAGGCTGGAGTGTGATGAGGAGTGTTC
AGCCTTGGAAAGGAAAAAGAGATTAGCAGAGGCATTTCATATCAGTGAGGATTCTGATCCTTTCAATATACGTTCTTCAGGGTCAAAATT
CAGTGATAGTTTGAAAGAAGATGCCAGGAAGGACTTAAAGTTTGTCAGTGACGTTGAGAAGGAAATGGAAACCCTCGTGGAGGCCGTGAA
TAAGGGAAAGAATAGTAAGAAAAGCCACAGCTTCCCTCCCATGAACAGAGACCACCGCCGGATCATCCATGACTTGGCCCAAGTTTATGG
CCTGGAGAGCGTGAGCTATGACAGTGAACCGAAGCGCAATGTGGTGGTCACTGCCATCAGGGGGAAGTCCGTTTGTCCTCCTACCACGCT
GACAGGTGTGCTTGAAAGGGAAATGCAGGCACGGCCTCCACCACCGATTCCTCATCACAGACATCAGTCAGACAAGAATCCTGGGAGCAG
TAATTTACAGAAAATAACCAAGGAGCCAATAATTGACTATTTTGACGTCCAGGACTAAGAAGATCATGATGCACTTAGATAAAAGAATGA
TTAGGTATAGTGGAGACTTATTTGCCAGCAGATAAATCATGCCCGTTCCCCTCTGCCTGGCAGAATCACAGTCTCACATACTGTCTTGTA
CTGACACATCCAAAGCATGAGTGTGTCAGAAATCCCTTGTCTATTCCTGTCTGTATAAAGTGTTTCATTATGACCAGATCTCTGATTGTA
TGGTCACTAGGTATGCAATCACGCATTCAAAGAGGCTCTTTACACCATCACTGTGATTGCTCTGAGAGTTGAGGGACTATTGGGCTTTAT
TTGGACAAACCAAACTTTTAGCCTGAAACCAACTTTATGCCACTAAGTCATAGCCTCAGTTGTCCCAGTTATTTGTCCTCCTGAAAATGC
CTGAAACATCAGACAGACATTGCTTGCTTTACCCAAACTGATCAAAATCTTTAGGAGCACAAATGAATTTTTTAGTCTGAAATACCAAAT
AATGAATTGGTATACCATATCCGGAATCACACATGTTATCTTAAACCCAGCCATCATACCTAAGTCTTTTGCCAAAACCTCTCATAGGTA
TATCTAGCTGAACTTATTTTGGCATTTTCAATGTGATCAGTTCTAGACCTAGAAGGGGGTCAGGCTGCTTTACAGAATTCTATTTCCTTA
AGTCCCTGGCACTTCTCATACCACATCACTGAACCTGTTCAGTAACAATCAGTTTGGCCGTCCCCCATGATGGTAGGAAATATAGAGAGC
AAGTTCTTCTGCCAGGGTCACACTGTGGTCTCTGAACTGACCAGTATATCCCTAACTCCTCTTTGATAGAGAAAGAGTCTCAAATGGACA
ACTGTCCTGTGTTGCTTTCCCTAGGCCTTCAGCAGCCTATTGGCTCTCCCTGCCTCTGAGCTCTGGACTCTGTTTGAATATTCCAAGTAG
TATATGGACAGTCCAGGGCTTATGCCCAGCAGCCCACTGGAGGCATTCTTCAGGCTCCTTTAAGGCAGGTGCATTGATAGTTCCATTAGT
GTGACCCTTGCATTGGCACCCCTCCAGCCTGGAGGCCAGGCTTCCAGCAACTTCCTTCTGCCCTAGAGCAAGCCATGAGCCCCAGAGCAG

>19520_19520_9_CRLS1-NFX1_CRLS1_chr20_5996136_ENST00000452938_NFX1_chr9_33328579_ENST00000379540_length(amino acids)=676AA_BP=189
MLALRVARGSWGALRGAAWAPGTRPSKRRACWALLPPVPCCLGCLAERWRLRPAALGLRLPGIGQRNHCSGAGKAAPRPAAGAGAAAEAP
GGQWGPASTPSLYENPWTIPNMLSMTRIGLAPVLGYLIIEEDFNIALGVFALAGLTDLLDGFIARNWANQRSALGSALDPLADKILISIL
YVSLTYADLIPDFIHTCEKLCHEGDCGPCSRTSVISCRCSFRTKELPCTSLKSEDATFMCDKRCNKKRLCGRHKCNEICCVDKEHKCPLI
CGRKLRCGLHRCEEPCHRGNCQTCWQASFDELTCHCGASVIYPPVPCGTRPPECTQTCARVHECDHPVYHSCHSEEKCPPCTFLTQKWCM
GKHEFRSNIPCHLVDISCGLPCSATLPCGMHKCQRLCHKGECLVDEPCKQPCTTPRADCGHPCMAPCHTSSPCPVTACKAKVELQCECGR
RKEMVICSEASSTYQRIAAISMASKITDMQLGGSVEISKLITKKEVHQARLECDEECSALERKKRLAEAFHISEDSDPFNIRSSGSKFSD
SLKEDARKDLKFVSDVEKEMETLVEAVNKGKNSKKSHSFPPMNRDHRRIIHDLAQVYGLESVSYDSEPKRNVVVTAIRGKSVCPPTTLTG

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for CRLS1-NFX1


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with
TgeneNFX1chr20:5996136chr9:33328579ENST000003185248169_26635.3333333333334834.0PABPC1 and PABC4
TgeneNFX1chr20:5996136chr9:33328579ENST000003795218219_26635.33333333333341025.0PABPC1 and PABC4
TgeneNFX1chr20:5996136chr9:33328579ENST000003795408249_26635.33333333333341121.0PABPC1 and PABC4


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for CRLS1-NFX1


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for CRLS1-NFX1


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource