FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:SULF1-TERF1 (FusionGDB2 ID:88037)

Fusion Gene Summary for SULF1-TERF1

check button Fusion gene summary
Fusion gene informationFusion gene name: SULF1-TERF1
Fusion gene ID: 88037
HgeneTgene
Gene symbol

SULF1

TERF1

Gene ID

23213

7013

Gene namesulfatase 1telomeric repeat binding factor 1
SynonymsSULF-1PIN2|TRBF1|TRF|TRF1|hTRF1-AS|t-TRF1
Cytomap

8q13.2-q13.3

8q21.11

Type of geneprotein-codingprotein-coding
Descriptionextracellular sulfatase Sulf-1sulfatase FPtelomeric repeat-binding factor 1NIMA-interacting protein 2TTAGGG repeat-binding factor 1telomeric protein Pin2/TRF1telomeric repeat binding factor (NIMA-interacting) 1
Modification date2020031320200327
UniProtAcc.

TINF2

Ensembl transtripts involved in fusion geneENST00000521946, ENST00000260128, 
ENST00000402687, ENST00000419716, 
ENST00000458141, 
ENST00000276602, 
ENST00000276603, ENST00000520783, 
Fusion gene scores* DoF score13 X 15 X 5=9754 X 4 X 3=48
# samples 165
** MAII scorelog2(16/975*10)=-2.60733031374961
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(5/48*10)=0.0588936890535686
effective Gene in Pan-Cancer Fusion Genes (eGinPCFGs).
DoF>8 and MAII>0
Context

PubMed: SULF1 [Title/Abstract] AND TERF1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointSULF1(70533486)-TERF1(73958196), # samples:1
SULF1(70533486)-TERF1(73958195), # samples:1
Anticipated loss of major functional domain due to fusion event.SULF1-TERF1 seems lost the major protein functional domain in Tgene partner, which is a transcription factor due to the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneSULF1

GO:0001937

negative regulation of endothelial cell proliferation

16778174

HgeneSULF1

GO:0016525

negative regulation of angiogenesis

16778174

HgeneSULF1

GO:0030177

positive regulation of Wnt signaling pathway

19520866

HgeneSULF1

GO:0030201

heparan sulfate proteoglycan metabolic process

18687675|19666466|19822709

HgeneSULF1

GO:0048010

vascular endothelial growth factor receptor signaling pathway

16778174

TgeneTERF1

GO:0007004

telomere maintenance via telomerase

23685356

TgeneTERF1

GO:0008156

negative regulation of DNA replication

11327863

TgeneTERF1

GO:1905778

negative regulation of exonuclease activity

15200954

TgeneTERF1

GO:1905839

negative regulation of telomeric D-loop disassembly

15200954


check buttonFusion gene breakpoints across SULF1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across TERF1 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4SARCTCGA-DX-A7ET-01ASULF1chr8

70533486

+TERF1chr8

73958196

+
ChimerDB4SARCTCGA-DX-A7ETSULF1chr8

70533486

+TERF1chr8

73958195

+


Top

Fusion Gene ORF analysis for SULF1-TERF1

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
3UTR-3CDSENST00000521946ENST00000276602SULF1chr8

70533486

+TERF1chr8

73958196

+
3UTR-3CDSENST00000521946ENST00000276602SULF1chr8

70533486

+TERF1chr8

73958195

+
3UTR-3CDSENST00000521946ENST00000276603SULF1chr8

70533486

+TERF1chr8

73958196

+
3UTR-3CDSENST00000521946ENST00000276603SULF1chr8

70533486

+TERF1chr8

73958195

+
3UTR-intronENST00000521946ENST00000520783SULF1chr8

70533486

+TERF1chr8

73958196

+
3UTR-intronENST00000521946ENST00000520783SULF1chr8

70533486

+TERF1chr8

73958195

+
5CDS-intronENST00000260128ENST00000520783SULF1chr8

70533486

+TERF1chr8

73958196

+
5CDS-intronENST00000260128ENST00000520783SULF1chr8

70533486

+TERF1chr8

73958195

+
5CDS-intronENST00000402687ENST00000520783SULF1chr8

70533486

+TERF1chr8

73958196

+
5CDS-intronENST00000402687ENST00000520783SULF1chr8

70533486

+TERF1chr8

73958195

+
5CDS-intronENST00000419716ENST00000520783SULF1chr8

70533486

+TERF1chr8

73958196

+
5CDS-intronENST00000419716ENST00000520783SULF1chr8

70533486

+TERF1chr8

73958195

+
5CDS-intronENST00000458141ENST00000520783SULF1chr8

70533486

+TERF1chr8

73958196

+
5CDS-intronENST00000458141ENST00000520783SULF1chr8

70533486

+TERF1chr8

73958195

+
Frame-shiftENST00000260128ENST00000276603SULF1chr8

70533486

+TERF1chr8

73958196

+
Frame-shiftENST00000260128ENST00000276603SULF1chr8

70533486

+TERF1chr8

73958195

+
Frame-shiftENST00000402687ENST00000276603SULF1chr8

70533486

+TERF1chr8

73958196

+
Frame-shiftENST00000402687ENST00000276603SULF1chr8

70533486

+TERF1chr8

73958195

+
Frame-shiftENST00000419716ENST00000276603SULF1chr8

70533486

+TERF1chr8

73958196

+
Frame-shiftENST00000419716ENST00000276603SULF1chr8

70533486

+TERF1chr8

73958195

+
Frame-shiftENST00000458141ENST00000276603SULF1chr8

70533486

+TERF1chr8

73958196

+
Frame-shiftENST00000458141ENST00000276603SULF1chr8

70533486

+TERF1chr8

73958195

+
In-frameENST00000260128ENST00000276602SULF1chr8

70533486

+TERF1chr8

73958196

+
In-frameENST00000260128ENST00000276602SULF1chr8

70533486

+TERF1chr8

73958195

+
In-frameENST00000402687ENST00000276602SULF1chr8

70533486

+TERF1chr8

73958196

+
In-frameENST00000402687ENST00000276602SULF1chr8

70533486

+TERF1chr8

73958195

+
In-frameENST00000419716ENST00000276602SULF1chr8

70533486

+TERF1chr8

73958196

+
In-frameENST00000419716ENST00000276602SULF1chr8

70533486

+TERF1chr8

73958195

+
In-frameENST00000458141ENST00000276602SULF1chr8

70533486

+TERF1chr8

73958196

+
In-frameENST00000458141ENST00000276602SULF1chr8

70533486

+TERF1chr8

73958195

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000260128SULF1chr870533486+ENST00000276602TERF1chr873958196+447323117172381554
ENST00000458141SULF1chr870533486+ENST00000276602TERF1chr873958196+431121495552219554
ENST00000402687SULF1chr870533486+ENST00000276602TERF1chr873958196+451123497552419554
ENST00000419716SULF1chr870533486+ENST00000276602TERF1chr873958196+431721555612225554
ENST00000260128SULF1chr870533486+ENST00000276602TERF1chr873958195+447323117172381554
ENST00000458141SULF1chr870533486+ENST00000276602TERF1chr873958195+431121495552219554
ENST00000402687SULF1chr870533486+ENST00000276602TERF1chr873958195+451123497552419554
ENST00000419716SULF1chr870533486+ENST00000276602TERF1chr873958195+431721555612225554

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000260128ENST00000276602SULF1chr870533486+TERF1chr873958196+0.0002894020.9997106
ENST00000458141ENST00000276602SULF1chr870533486+TERF1chr873958196+0.0002946890.99970526
ENST00000402687ENST00000276602SULF1chr870533486+TERF1chr873958196+0.0002903240.99970967
ENST00000419716ENST00000276602SULF1chr870533486+TERF1chr873958196+0.0003124140.99968755
ENST00000260128ENST00000276602SULF1chr870533486+TERF1chr873958195+0.0002894020.9997106
ENST00000458141ENST00000276602SULF1chr870533486+TERF1chr873958195+0.0002946890.99970526
ENST00000402687ENST00000276602SULF1chr870533486+TERF1chr873958195+0.0002903240.99970967
ENST00000419716ENST00000276602SULF1chr870533486+TERF1chr873958195+0.0003124140.99968755

Top

Fusion Genomic Features for SULF1-TERF1


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
SULF1chr870533486+TERF1chr873958195+0.0005361910.99946386
SULF1chr870533486+TERF1chr873958195+0.0005361910.99946386
SULF1chr870533486+TERF1chr873958195+0.0005361910.99946386
SULF1chr870533486+TERF1chr873958195+0.0005361910.99946386

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.
genomic feature of top 1%

Top

Fusion Protein Features for SULF1-TERF1


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr8:70533486/chr8:73958196)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
.TERF1

TINF2

FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.451

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneTERF1chr8:70533486chr8:73958195ENST0000027660279403_428361420.0DNA bindingH-T-H motif
TgeneTERF1chr8:70533486chr8:73958195ENST00000276603810403_428381440.0DNA bindingH-T-H motif
TgeneTERF1chr8:70533486chr8:73958196ENST0000027660279403_428361420.0DNA bindingH-T-H motif
TgeneTERF1chr8:70533486chr8:73958196ENST00000276603810403_428381440.0DNA bindingH-T-H motif
TgeneTERF1chr8:70533486chr8:73958195ENST0000027660279375_432361420.0DomainHTH myb-type
TgeneTERF1chr8:70533486chr8:73958196ENST0000027660279375_432361420.0DomainHTH myb-type

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneTERF1chr8:70533486chr8:73958195ENST00000276602792_71361420.0Compositional biasNote=Asp/Glu-rich (acidic)
TgeneTERF1chr8:70533486chr8:73958195ENST000002766027955_62361420.0Compositional biasNote=Poly-Glu
TgeneTERF1chr8:70533486chr8:73958195ENST000002766038102_71381440.0Compositional biasNote=Asp/Glu-rich (acidic)
TgeneTERF1chr8:70533486chr8:73958195ENST0000027660381055_62381440.0Compositional biasNote=Poly-Glu
TgeneTERF1chr8:70533486chr8:73958196ENST00000276602792_71361420.0Compositional biasNote=Asp/Glu-rich (acidic)
TgeneTERF1chr8:70533486chr8:73958196ENST000002766027955_62361420.0Compositional biasNote=Poly-Glu
TgeneTERF1chr8:70533486chr8:73958196ENST000002766038102_71381440.0Compositional biasNote=Asp/Glu-rich (acidic)
TgeneTERF1chr8:70533486chr8:73958196ENST0000027660381055_62381440.0Compositional biasNote=Poly-Glu
TgeneTERF1chr8:70533486chr8:73958195ENST00000276603810375_432381440.0DomainHTH myb-type
TgeneTERF1chr8:70533486chr8:73958196ENST00000276603810375_432381440.0DomainHTH myb-type
TgeneTERF1chr8:70533486chr8:73958195ENST0000027660279337_356361420.0MotifNuclear localization signal
TgeneTERF1chr8:70533486chr8:73958195ENST00000276603810337_356381440.0MotifNuclear localization signal
TgeneTERF1chr8:70533486chr8:73958196ENST0000027660279337_356361420.0MotifNuclear localization signal
TgeneTERF1chr8:70533486chr8:73958196ENST00000276603810337_356381440.0MotifNuclear localization signal
TgeneTERF1chr8:70533486chr8:73958195ENST000002766027958_268361420.0RegionNote=TRFH dimerization
TgeneTERF1chr8:70533486chr8:73958195ENST0000027660381058_268381440.0RegionNote=TRFH dimerization
TgeneTERF1chr8:70533486chr8:73958196ENST000002766027958_268361420.0RegionNote=TRFH dimerization
TgeneTERF1chr8:70533486chr8:73958196ENST0000027660381058_268381440.0RegionNote=TRFH dimerization


Top

Fusion Gene Sequence for SULF1-TERF1


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>88037_88037_1_SULF1-TERF1_SULF1_chr8_70533486_ENST00000260128_TERF1_chr8_73958195_ENST00000276602_length(transcript)=4473nt_BP=2311nt
AGGTTACTTGACTGGGAGTTCTCAGACCTCCAGTTTCAGCCCTGCCCTCAGCCTCCAATCCGTAAGAGACACCCAGCCCCAGCAATTGGA
TTGGGCAGCCCGTCTTGACACACCACTGTGCTGAGTGCTTGAGGACGTGTTTCAACAGATGGTTGGGGTTAGTGTGTGTCATCACATTCG
AGTGGGGATTAAGAGAAGGAAGGCTGCCTTGCTGGAGCTGTGTGGTCTTCTCCAAGTGAGAGTCGCAGGCAATAGAACTACTTTGCTTTT
GGAGGAAAAGGAGGAATTCATTTTCAGCAGACACAAGAAAAGCAGTTTTTTTTTCAGGGATTCTTCACTTCTCTTGAACAAGGAACTCAC
TCAGAGACTAACACAAAGGAAGTAATTTCTTACCTGGTCATTATTTAGTCTACAATAAGTTCATCCTTCTTCAGTGTGACCAGTAAATTC
TTCCCATACTCTTGAAGAGAGCATAATTGGAATGGAGAGGTGCTGACGGCCACCCACCATCATCTAAAGAAGATAAACTTGGCAAATGAC
ATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTGAATACCTCTGAGAATAGAGATTGA
TTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTCAGTTTTGCAACATTGGACCAAATACAATG
AAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGGGAAGCCTCTGTTCGACTGTCAGATCCCCGAGGTTCAGA
GGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGCTTACCGATGATCAAGATGTGGAGCTGGGGTCCCTGCAA
GTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCAATGCCTTTGTGACTACACCCATGTGCTGCCCGTCACGG
TCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCTTCCCCCTCGTGGCAGGCCATGCAT
GAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCTTTTTTGGAAAATACCTCAATGAATATAATGGCAGCTAC
ATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTTTGTCGCAATGGCATCAAAGAAAAG
CATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGAGCATTAATTACTTCAAAATGTCTAAGAGAATGTATCCC
CATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAGTTTTCTAAACTGTACCCCAATGCT
TCCCAACACATAACTCCTAGTTATAACTATGCACCAAATATGGATAAACACTGGATTATGCAGTACACAGGACCAATGCTGCCCATCCAC
ATGGAATTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGTGGATGATTCTGTGGAGAGGCTGTATAACATGCTCGTG
GAGACGGGGGAGCTGGAGAATACTTACATCATTTACACCGCCGACCATGGTTACCATATTGGGCAGTTTGGACTGGTCAAGGGGAAATCC
ATGCCATATGACTTTGATATTCGTGTGCCTTTTTTTATTCGTGGTCCAAGTGTAGAACCAGGATCAATAGTCCCACAGATCGTTCTCAAC
ATTGACTTGGCCCCCACGATCCTGGATATTGCTGGGCTCGACACACCTCCTGATGTGGACGGCAAGTCTGTCCTCAAACTTCTGGACCCA
GAAAAGCCAGGTAACAGGTTTCGAACAAACAAGAAGGCCAAAATTTGGCGTGATACATTCCTAGTGGAAAGAGGCAAATTTCTACGTAAG
AAGGAAGAATCCAGCAAGAATATCCAACAGTCAAATCACTTGCCCAAATATGAACGGGTCAAAGAACTATGCCAGCAGGCCAGGTACCAG
ACAGCCTGTGAACAACCGGGGCAGAAGTGGCAATGCATTGAGGATACATCTGGCAAGCTTCGAATTCACAAGTGTAAAGGACCCAGTGAC
CTGCTCACAGTCCGGCAGAGCACGCGGAACCTCTACGCTCGCGGCTTCCATGACAAAGACAAAGAGTGCAGTTGTAGGGAGTCTGGTTAC
CGTGCCAGCAGAAGCCAAAGAAAGAGTCAACGGCAATTCTTGAGAAACCAGGGGACTCCAAGCATGGCTTTGGGAAGAAGACAAGAATTT
GAGATCTGGCGTGAGGAAATATGGAGAGGGAAACTGGTCTAAAATACTGTTGCATTATAAATTCAACAACCGGACAAGTGTCATGTTAAA
AGACAGATGGAGGACCATGAAGAAACTAAAACTGATTTCCTCAGACAGCGAAGACTGATTGTGTTTGTAAAAGCTTGATGAAAGGACAGT
TAAGTATTTTGATCACTGCATTTTGTTTGAAACTTGTGTCATTGATGTAATTTAAAACTTTTGTTTAAAGCATTACAGTATTTTTCTGTG
ACCATCAATTAATGAGGGTTTGTGCTACCAGAGTTAAAGCATATGCTATCATTGTATTCTTTAAGAACCTTATTTTGATAAAATGTAAAT
TTGTTGAACCCTGCCACATTTAGTATCCCCACCCCCAAATCCTGTTCCAATGAAAAAATTAAAACCTGATACGAAAAAAAAAAAATTCCA
GTTAACCTATTTTGTGTCTGTAGGCTGACCTCAACCCTGTAACGTAACCCATTAAAATGAATTTCTTTTTTTTTAAGACAGAGTTTCTCT
CTGTTGCCCAGGCTGGAGTGCAGTGGCGCAATTTCAGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCC
TGAGTAGCTGGGATTACAGGCACACACCACCACGCCCAGCTAATTTTTGTATTTTTAGTAGAGGCGGGGTTTCACCATGCTGGTCAGGAT
GTTCTCCAACTCCTGACTTCATGATCCACCCACCTCGGCCTCCCAAAGTGCTGAGATTACAGACGTGAGCCACTGCGTCCTGCCTAAAAT
GAATTTTCTAGATGATTGAATAACAGTAGTCCTTTGATAGAAGATAATGACTTGGTTTATGGCCTTAATATACTACTTAATTACTTAAGA
TGTTTATTAATAGAATGATAAATGTACAGAGTAACCTATAAGCATGACATACTTTTGCTTTCAGTAGTTTCATGTAAAGAAAAAAACTTG
AAAATAGTAATACCTGAGTACCCATGGGAATAATAGACACTGGGGAGGTAGGGTGGGGAGCGGGAGCAAGAGCTGAAAAACTTACCTACT
GGGGACTGTGCTCACTACCTGGGTGACAGGATCATACGTACCCCAAACCTCAACATCACACAGTATACTCAGCTAACAAACCTGCCCATG
TGTTTCCTGAATCTAAAATAAAAATCGAAATAATTTTTTTAAAAAAGAAAAAGACAATAGTATTACCCATGGGACAAAATTTGTACTATT
AGCAAGAATCATTTTGTGTCTCATTTAGAAACAATTTGACTTTTGTTCCAGTGTTTAAACTTTGACAAAAATGGTTTTGAATAGATCTTT
ATAACCTGATGCCATAAATACAAGATTCTCTGATACCTTCATTTAATATATCAATATTGGGCCTAAAACAGTATTCTGTAAAGCTTAAAT
TGGTATTAACTATGATCATCTTGATGTCTATGATAGATAATAAACAAGGTCATACATACCTTACTAAACAATTTTGGTTTTTCACCAACA
TTTTATTCTTTAAAAGATTTAGACTAACAGAATTATTTAGCATTTCGAGTCATGTGCTTTATTTAGCAAGTGAGTAAAAATATTGGAATA
TTGAAGTATTTGCATAAAAAATCAAATGGTGGTGTTTTGTAATCTCTATTGTATTTCCTATTAAGGTTTCATATATTACTTTCCCATTGT
TCCTGAATTTGTTATCCTATATATAAACAGAAACATGGATGAGTACCATATACTTCCTGACTCCTTTATCTTGTTACACCACAAGATACT
GCACTATTTATTGGAGATATTTTGATGATTAGTGCCAAGGCTGGCATCATGAGCCAAAATAGACTTGTTAAGTCAACCTCATTCCAAACA
CCTGCTATAGATCATCAAATCTAAGTTGCCATTAATTGTAAGATGTCCTGTTATTTCACACACACAACTAGAAAAGAAAAATCGGTATCA
ATTATAGTATGGTGCTTTCTTATTTTGAATTTTACTTATTAGAAGAGCTATTTTAGATGTATTTAAACATAAGATTTTTATCACATAACT

>88037_88037_1_SULF1-TERF1_SULF1_chr8_70533486_ENST00000260128_TERF1_chr8_73958195_ENST00000276602_length(amino acids)=554AA_BP=532
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPI
HMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGSIVPQIVL
NIDLAPTILDIAGLDTPPDVDGKSVLKLLDPEKPGNRFRTNKKAKIWRDTFLVERGKFLRKKEESSKNIQQSNHLPKYERVKELCQQARY
QTACEQPGQKWQCIEDTSGKLRIHKCKGPSDLLTVRQSTRNLYARGFHDKDKECSCRESGYRASRSQRKSQRQFLRNQGTPSMALGRRQE

--------------------------------------------------------------
>88037_88037_2_SULF1-TERF1_SULF1_chr8_70533486_ENST00000260128_TERF1_chr8_73958196_ENST00000276602_length(transcript)=4473nt_BP=2311nt
AGGTTACTTGACTGGGAGTTCTCAGACCTCCAGTTTCAGCCCTGCCCTCAGCCTCCAATCCGTAAGAGACACCCAGCCCCAGCAATTGGA
TTGGGCAGCCCGTCTTGACACACCACTGTGCTGAGTGCTTGAGGACGTGTTTCAACAGATGGTTGGGGTTAGTGTGTGTCATCACATTCG
AGTGGGGATTAAGAGAAGGAAGGCTGCCTTGCTGGAGCTGTGTGGTCTTCTCCAAGTGAGAGTCGCAGGCAATAGAACTACTTTGCTTTT
GGAGGAAAAGGAGGAATTCATTTTCAGCAGACACAAGAAAAGCAGTTTTTTTTTCAGGGATTCTTCACTTCTCTTGAACAAGGAACTCAC
TCAGAGACTAACACAAAGGAAGTAATTTCTTACCTGGTCATTATTTAGTCTACAATAAGTTCATCCTTCTTCAGTGTGACCAGTAAATTC
TTCCCATACTCTTGAAGAGAGCATAATTGGAATGGAGAGGTGCTGACGGCCACCCACCATCATCTAAAGAAGATAAACTTGGCAAATGAC
ATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTGAATACCTCTGAGAATAGAGATTGA
TTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTCAGTTTTGCAACATTGGACCAAATACAATG
AAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGGGAAGCCTCTGTTCGACTGTCAGATCCCCGAGGTTCAGA
GGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGCTTACCGATGATCAAGATGTGGAGCTGGGGTCCCTGCAA
GTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCAATGCCTTTGTGACTACACCCATGTGCTGCCCGTCACGG
TCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCTTCCCCCTCGTGGCAGGCCATGCAT
GAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCTTTTTTGGAAAATACCTCAATGAATATAATGGCAGCTAC
ATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTTTGTCGCAATGGCATCAAAGAAAAG
CATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGAGCATTAATTACTTCAAAATGTCTAAGAGAATGTATCCC
CATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAGTTTTCTAAACTGTACCCCAATGCT
TCCCAACACATAACTCCTAGTTATAACTATGCACCAAATATGGATAAACACTGGATTATGCAGTACACAGGACCAATGCTGCCCATCCAC
ATGGAATTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGTGGATGATTCTGTGGAGAGGCTGTATAACATGCTCGTG
GAGACGGGGGAGCTGGAGAATACTTACATCATTTACACCGCCGACCATGGTTACCATATTGGGCAGTTTGGACTGGTCAAGGGGAAATCC
ATGCCATATGACTTTGATATTCGTGTGCCTTTTTTTATTCGTGGTCCAAGTGTAGAACCAGGATCAATAGTCCCACAGATCGTTCTCAAC
ATTGACTTGGCCCCCACGATCCTGGATATTGCTGGGCTCGACACACCTCCTGATGTGGACGGCAAGTCTGTCCTCAAACTTCTGGACCCA
GAAAAGCCAGGTAACAGGTTTCGAACAAACAAGAAGGCCAAAATTTGGCGTGATACATTCCTAGTGGAAAGAGGCAAATTTCTACGTAAG
AAGGAAGAATCCAGCAAGAATATCCAACAGTCAAATCACTTGCCCAAATATGAACGGGTCAAAGAACTATGCCAGCAGGCCAGGTACCAG
ACAGCCTGTGAACAACCGGGGCAGAAGTGGCAATGCATTGAGGATACATCTGGCAAGCTTCGAATTCACAAGTGTAAAGGACCCAGTGAC
CTGCTCACAGTCCGGCAGAGCACGCGGAACCTCTACGCTCGCGGCTTCCATGACAAAGACAAAGAGTGCAGTTGTAGGGAGTCTGGTTAC
CGTGCCAGCAGAAGCCAAAGAAAGAGTCAACGGCAATTCTTGAGAAACCAGGGGACTCCAAGCATGGCTTTGGGAAGAAGACAAGAATTT
GAGATCTGGCGTGAGGAAATATGGAGAGGGAAACTGGTCTAAAATACTGTTGCATTATAAATTCAACAACCGGACAAGTGTCATGTTAAA
AGACAGATGGAGGACCATGAAGAAACTAAAACTGATTTCCTCAGACAGCGAAGACTGATTGTGTTTGTAAAAGCTTGATGAAAGGACAGT
TAAGTATTTTGATCACTGCATTTTGTTTGAAACTTGTGTCATTGATGTAATTTAAAACTTTTGTTTAAAGCATTACAGTATTTTTCTGTG
ACCATCAATTAATGAGGGTTTGTGCTACCAGAGTTAAAGCATATGCTATCATTGTATTCTTTAAGAACCTTATTTTGATAAAATGTAAAT
TTGTTGAACCCTGCCACATTTAGTATCCCCACCCCCAAATCCTGTTCCAATGAAAAAATTAAAACCTGATACGAAAAAAAAAAAATTCCA
GTTAACCTATTTTGTGTCTGTAGGCTGACCTCAACCCTGTAACGTAACCCATTAAAATGAATTTCTTTTTTTTTAAGACAGAGTTTCTCT
CTGTTGCCCAGGCTGGAGTGCAGTGGCGCAATTTCAGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCC
TGAGTAGCTGGGATTACAGGCACACACCACCACGCCCAGCTAATTTTTGTATTTTTAGTAGAGGCGGGGTTTCACCATGCTGGTCAGGAT
GTTCTCCAACTCCTGACTTCATGATCCACCCACCTCGGCCTCCCAAAGTGCTGAGATTACAGACGTGAGCCACTGCGTCCTGCCTAAAAT
GAATTTTCTAGATGATTGAATAACAGTAGTCCTTTGATAGAAGATAATGACTTGGTTTATGGCCTTAATATACTACTTAATTACTTAAGA
TGTTTATTAATAGAATGATAAATGTACAGAGTAACCTATAAGCATGACATACTTTTGCTTTCAGTAGTTTCATGTAAAGAAAAAAACTTG
AAAATAGTAATACCTGAGTACCCATGGGAATAATAGACACTGGGGAGGTAGGGTGGGGAGCGGGAGCAAGAGCTGAAAAACTTACCTACT
GGGGACTGTGCTCACTACCTGGGTGACAGGATCATACGTACCCCAAACCTCAACATCACACAGTATACTCAGCTAACAAACCTGCCCATG
TGTTTCCTGAATCTAAAATAAAAATCGAAATAATTTTTTTAAAAAAGAAAAAGACAATAGTATTACCCATGGGACAAAATTTGTACTATT
AGCAAGAATCATTTTGTGTCTCATTTAGAAACAATTTGACTTTTGTTCCAGTGTTTAAACTTTGACAAAAATGGTTTTGAATAGATCTTT
ATAACCTGATGCCATAAATACAAGATTCTCTGATACCTTCATTTAATATATCAATATTGGGCCTAAAACAGTATTCTGTAAAGCTTAAAT
TGGTATTAACTATGATCATCTTGATGTCTATGATAGATAATAAACAAGGTCATACATACCTTACTAAACAATTTTGGTTTTTCACCAACA
TTTTATTCTTTAAAAGATTTAGACTAACAGAATTATTTAGCATTTCGAGTCATGTGCTTTATTTAGCAAGTGAGTAAAAATATTGGAATA
TTGAAGTATTTGCATAAAAAATCAAATGGTGGTGTTTTGTAATCTCTATTGTATTTCCTATTAAGGTTTCATATATTACTTTCCCATTGT
TCCTGAATTTGTTATCCTATATATAAACAGAAACATGGATGAGTACCATATACTTCCTGACTCCTTTATCTTGTTACACCACAAGATACT
GCACTATTTATTGGAGATATTTTGATGATTAGTGCCAAGGCTGGCATCATGAGCCAAAATAGACTTGTTAAGTCAACCTCATTCCAAACA
CCTGCTATAGATCATCAAATCTAAGTTGCCATTAATTGTAAGATGTCCTGTTATTTCACACACACAACTAGAAAAGAAAAATCGGTATCA
ATTATAGTATGGTGCTTTCTTATTTTGAATTTTACTTATTAGAAGAGCTATTTTAGATGTATTTAAACATAAGATTTTTATCACATAACT

>88037_88037_2_SULF1-TERF1_SULF1_chr8_70533486_ENST00000260128_TERF1_chr8_73958196_ENST00000276602_length(amino acids)=554AA_BP=532
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPI
HMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGSIVPQIVL
NIDLAPTILDIAGLDTPPDVDGKSVLKLLDPEKPGNRFRTNKKAKIWRDTFLVERGKFLRKKEESSKNIQQSNHLPKYERVKELCQQARY
QTACEQPGQKWQCIEDTSGKLRIHKCKGPSDLLTVRQSTRNLYARGFHDKDKECSCRESGYRASRSQRKSQRQFLRNQGTPSMALGRRQE

--------------------------------------------------------------
>88037_88037_3_SULF1-TERF1_SULF1_chr8_70533486_ENST00000402687_TERF1_chr8_73958195_ENST00000276602_length(transcript)=4511nt_BP=2349nt
TTTGTCTGGCAGCGTTGGTTCTATGGGGGTGTGTGATGATAATAATACGCGGGCTTATATAACCGTCTTCATCTTGCGAGCACTTCGCAG
ACCGTCGCTAATGAATCTTGGGGCCGGTGTCGGGCCGGGGCGGCTTGATCGGCAACTAGGAAACCCCAGGCGCAGAGGCCAGGAGCGAGG
GCAGCGAGGATCAGAGGCCAGGCCTTCCCGGCTGCCGGCGCTCCTCGGAGGTCAGGGCAGATGAGGAACATGACTCTCCCCCTTCGGAGG
AGGAAGGAAGTCCCGCTGCCACCTTATCTCTGCTCCTCTGCCTCCTCCCTGTTCCCAGAGCTTTTTCTCTAGAGAAGATTTTGAAGGCGG
CTTTTGGATTCTTCACTTCTCTTGAACAAGGAACTCACTCAGAGACTAACACAAAGGAAGTAATTTCTTACCTGGTCATTATTTAGTCTA
CAATAAGTTCATCCTTCTTCAGTGTGACCAGTAAATTCTTCCCATACTCTTGAAGAGAGCATAATTGGAATGGAGAGGTGCTGACGGCCA
CCCACCATCATCTAAAGAAGATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTAT
CTGCAGATGTTCTGAATACCTCTGAGAATAGAGATTGATTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGA
CATTTTGTCAGTTTTGCAACATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGG
GAAGCCTCTGTTCGACTGTCAGATCCCCGAGGTTCAGAGGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGC
TTACCGATGATCAAGATGTGGAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCA
ATGCCTTTGTGACTACACCCATGTGCTGCCCGTCACGGTCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACA
ACGAGAACTGCTCTTCCCCCTCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCT
TTTTTGGAAAATACCTCAATGAATATAATGGCAGCTACATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCT
ATAATTACACTGTTTGTCGCAATGGCATCAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGA
GCATTAATTACTTCAAAATGTCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGG
ACTCAGCCCCACAGTTTTCTAAACTGTACCCCAATGCTTCCCAACACATAACTCCTAGTTATAACTATGCACCAAATATGGATAAACACT
GGATTATGCAGTACACAGGACCAATGCTGCCCATCCACATGGAATTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAG
TGGATGATTCTGTGGAGAGGCTGTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATACTTACATCATTTACACCGCCGACCATGGTT
ACCATATTGGGCAGTTTGGACTGGTCAAGGGGAAATCCATGCCATATGACTTTGATATTCGTGTGCCTTTTTTTATTCGTGGTCCAAGTG
TAGAACCAGGATCAATAGTCCCACAGATCGTTCTCAACATTGACTTGGCCCCCACGATCCTGGATATTGCTGGGCTCGACACACCTCCTG
ATGTGGACGGCAAGTCTGTCCTCAAACTTCTGGACCCAGAAAAGCCAGGTAACAGGTTTCGAACAAACAAGAAGGCCAAAATTTGGCGTG
ATACATTCCTAGTGGAAAGAGGCAAATTTCTACGTAAGAAGGAAGAATCCAGCAAGAATATCCAACAGTCAAATCACTTGCCCAAATATG
AACGGGTCAAAGAACTATGCCAGCAGGCCAGGTACCAGACAGCCTGTGAACAACCGGGGCAGAAGTGGCAATGCATTGAGGATACATCTG
GCAAGCTTCGAATTCACAAGTGTAAAGGACCCAGTGACCTGCTCACAGTCCGGCAGAGCACGCGGAACCTCTACGCTCGCGGCTTCCATG
ACAAAGACAAAGAGTGCAGTTGTAGGGAGTCTGGTTACCGTGCCAGCAGAAGCCAAAGAAAGAGTCAACGGCAATTCTTGAGAAACCAGG
GGACTCCAAGCATGGCTTTGGGAAGAAGACAAGAATTTGAGATCTGGCGTGAGGAAATATGGAGAGGGAAACTGGTCTAAAATACTGTTG
CATTATAAATTCAACAACCGGACAAGTGTCATGTTAAAAGACAGATGGAGGACCATGAAGAAACTAAAACTGATTTCCTCAGACAGCGAA
GACTGATTGTGTTTGTAAAAGCTTGATGAAAGGACAGTTAAGTATTTTGATCACTGCATTTTGTTTGAAACTTGTGTCATTGATGTAATT
TAAAACTTTTGTTTAAAGCATTACAGTATTTTTCTGTGACCATCAATTAATGAGGGTTTGTGCTACCAGAGTTAAAGCATATGCTATCAT
TGTATTCTTTAAGAACCTTATTTTGATAAAATGTAAATTTGTTGAACCCTGCCACATTTAGTATCCCCACCCCCAAATCCTGTTCCAATG
AAAAAATTAAAACCTGATACGAAAAAAAAAAAATTCCAGTTAACCTATTTTGTGTCTGTAGGCTGACCTCAACCCTGTAACGTAACCCAT
TAAAATGAATTTCTTTTTTTTTAAGACAGAGTTTCTCTCTGTTGCCCAGGCTGGAGTGCAGTGGCGCAATTTCAGCTCACTGCAACCTCT
GCCTCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGCACACACCACCACGCCCAGCTAATTTTTGTAT
TTTTAGTAGAGGCGGGGTTTCACCATGCTGGTCAGGATGTTCTCCAACTCCTGACTTCATGATCCACCCACCTCGGCCTCCCAAAGTGCT
GAGATTACAGACGTGAGCCACTGCGTCCTGCCTAAAATGAATTTTCTAGATGATTGAATAACAGTAGTCCTTTGATAGAAGATAATGACT
TGGTTTATGGCCTTAATATACTACTTAATTACTTAAGATGTTTATTAATAGAATGATAAATGTACAGAGTAACCTATAAGCATGACATAC
TTTTGCTTTCAGTAGTTTCATGTAAAGAAAAAAACTTGAAAATAGTAATACCTGAGTACCCATGGGAATAATAGACACTGGGGAGGTAGG
GTGGGGAGCGGGAGCAAGAGCTGAAAAACTTACCTACTGGGGACTGTGCTCACTACCTGGGTGACAGGATCATACGTACCCCAAACCTCA
ACATCACACAGTATACTCAGCTAACAAACCTGCCCATGTGTTTCCTGAATCTAAAATAAAAATCGAAATAATTTTTTTAAAAAAGAAAAA
GACAATAGTATTACCCATGGGACAAAATTTGTACTATTAGCAAGAATCATTTTGTGTCTCATTTAGAAACAATTTGACTTTTGTTCCAGT
GTTTAAACTTTGACAAAAATGGTTTTGAATAGATCTTTATAACCTGATGCCATAAATACAAGATTCTCTGATACCTTCATTTAATATATC
AATATTGGGCCTAAAACAGTATTCTGTAAAGCTTAAATTGGTATTAACTATGATCATCTTGATGTCTATGATAGATAATAAACAAGGTCA
TACATACCTTACTAAACAATTTTGGTTTTTCACCAACATTTTATTCTTTAAAAGATTTAGACTAACAGAATTATTTAGCATTTCGAGTCA
TGTGCTTTATTTAGCAAGTGAGTAAAAATATTGGAATATTGAAGTATTTGCATAAAAAATCAAATGGTGGTGTTTTGTAATCTCTATTGT
ATTTCCTATTAAGGTTTCATATATTACTTTCCCATTGTTCCTGAATTTGTTATCCTATATATAAACAGAAACATGGATGAGTACCATATA
CTTCCTGACTCCTTTATCTTGTTACACCACAAGATACTGCACTATTTATTGGAGATATTTTGATGATTAGTGCCAAGGCTGGCATCATGA
GCCAAAATAGACTTGTTAAGTCAACCTCATTCCAAACACCTGCTATAGATCATCAAATCTAAGTTGCCATTAATTGTAAGATGTCCTGTT
ATTTCACACACACAACTAGAAAAGAAAAATCGGTATCAATTATAGTATGGTGCTTTCTTATTTTGAATTTTACTTATTAGAAGAGCTATT
TTAGATGTATTTAAACATAAGATTTTTATCACATAACTTGTGGATACAAAAAAAGAGAAATTGTAAGCAAAATAAGTTGGGTAATATATT

>88037_88037_3_SULF1-TERF1_SULF1_chr8_70533486_ENST00000402687_TERF1_chr8_73958195_ENST00000276602_length(amino acids)=554AA_BP=532
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPI
HMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGSIVPQIVL
NIDLAPTILDIAGLDTPPDVDGKSVLKLLDPEKPGNRFRTNKKAKIWRDTFLVERGKFLRKKEESSKNIQQSNHLPKYERVKELCQQARY
QTACEQPGQKWQCIEDTSGKLRIHKCKGPSDLLTVRQSTRNLYARGFHDKDKECSCRESGYRASRSQRKSQRQFLRNQGTPSMALGRRQE

--------------------------------------------------------------
>88037_88037_4_SULF1-TERF1_SULF1_chr8_70533486_ENST00000402687_TERF1_chr8_73958196_ENST00000276602_length(transcript)=4511nt_BP=2349nt
TTTGTCTGGCAGCGTTGGTTCTATGGGGGTGTGTGATGATAATAATACGCGGGCTTATATAACCGTCTTCATCTTGCGAGCACTTCGCAG
ACCGTCGCTAATGAATCTTGGGGCCGGTGTCGGGCCGGGGCGGCTTGATCGGCAACTAGGAAACCCCAGGCGCAGAGGCCAGGAGCGAGG
GCAGCGAGGATCAGAGGCCAGGCCTTCCCGGCTGCCGGCGCTCCTCGGAGGTCAGGGCAGATGAGGAACATGACTCTCCCCCTTCGGAGG
AGGAAGGAAGTCCCGCTGCCACCTTATCTCTGCTCCTCTGCCTCCTCCCTGTTCCCAGAGCTTTTTCTCTAGAGAAGATTTTGAAGGCGG
CTTTTGGATTCTTCACTTCTCTTGAACAAGGAACTCACTCAGAGACTAACACAAAGGAAGTAATTTCTTACCTGGTCATTATTTAGTCTA
CAATAAGTTCATCCTTCTTCAGTGTGACCAGTAAATTCTTCCCATACTCTTGAAGAGAGCATAATTGGAATGGAGAGGTGCTGACGGCCA
CCCACCATCATCTAAAGAAGATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTAT
CTGCAGATGTTCTGAATACCTCTGAGAATAGAGATTGATTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGA
CATTTTGTCAGTTTTGCAACATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGG
GAAGCCTCTGTTCGACTGTCAGATCCCCGAGGTTCAGAGGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGC
TTACCGATGATCAAGATGTGGAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCA
ATGCCTTTGTGACTACACCCATGTGCTGCCCGTCACGGTCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACA
ACGAGAACTGCTCTTCCCCCTCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCT
TTTTTGGAAAATACCTCAATGAATATAATGGCAGCTACATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCT
ATAATTACACTGTTTGTCGCAATGGCATCAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGA
GCATTAATTACTTCAAAATGTCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGG
ACTCAGCCCCACAGTTTTCTAAACTGTACCCCAATGCTTCCCAACACATAACTCCTAGTTATAACTATGCACCAAATATGGATAAACACT
GGATTATGCAGTACACAGGACCAATGCTGCCCATCCACATGGAATTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAG
TGGATGATTCTGTGGAGAGGCTGTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATACTTACATCATTTACACCGCCGACCATGGTT
ACCATATTGGGCAGTTTGGACTGGTCAAGGGGAAATCCATGCCATATGACTTTGATATTCGTGTGCCTTTTTTTATTCGTGGTCCAAGTG
TAGAACCAGGATCAATAGTCCCACAGATCGTTCTCAACATTGACTTGGCCCCCACGATCCTGGATATTGCTGGGCTCGACACACCTCCTG
ATGTGGACGGCAAGTCTGTCCTCAAACTTCTGGACCCAGAAAAGCCAGGTAACAGGTTTCGAACAAACAAGAAGGCCAAAATTTGGCGTG
ATACATTCCTAGTGGAAAGAGGCAAATTTCTACGTAAGAAGGAAGAATCCAGCAAGAATATCCAACAGTCAAATCACTTGCCCAAATATG
AACGGGTCAAAGAACTATGCCAGCAGGCCAGGTACCAGACAGCCTGTGAACAACCGGGGCAGAAGTGGCAATGCATTGAGGATACATCTG
GCAAGCTTCGAATTCACAAGTGTAAAGGACCCAGTGACCTGCTCACAGTCCGGCAGAGCACGCGGAACCTCTACGCTCGCGGCTTCCATG
ACAAAGACAAAGAGTGCAGTTGTAGGGAGTCTGGTTACCGTGCCAGCAGAAGCCAAAGAAAGAGTCAACGGCAATTCTTGAGAAACCAGG
GGACTCCAAGCATGGCTTTGGGAAGAAGACAAGAATTTGAGATCTGGCGTGAGGAAATATGGAGAGGGAAACTGGTCTAAAATACTGTTG
CATTATAAATTCAACAACCGGACAAGTGTCATGTTAAAAGACAGATGGAGGACCATGAAGAAACTAAAACTGATTTCCTCAGACAGCGAA
GACTGATTGTGTTTGTAAAAGCTTGATGAAAGGACAGTTAAGTATTTTGATCACTGCATTTTGTTTGAAACTTGTGTCATTGATGTAATT
TAAAACTTTTGTTTAAAGCATTACAGTATTTTTCTGTGACCATCAATTAATGAGGGTTTGTGCTACCAGAGTTAAAGCATATGCTATCAT
TGTATTCTTTAAGAACCTTATTTTGATAAAATGTAAATTTGTTGAACCCTGCCACATTTAGTATCCCCACCCCCAAATCCTGTTCCAATG
AAAAAATTAAAACCTGATACGAAAAAAAAAAAATTCCAGTTAACCTATTTTGTGTCTGTAGGCTGACCTCAACCCTGTAACGTAACCCAT
TAAAATGAATTTCTTTTTTTTTAAGACAGAGTTTCTCTCTGTTGCCCAGGCTGGAGTGCAGTGGCGCAATTTCAGCTCACTGCAACCTCT
GCCTCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGCACACACCACCACGCCCAGCTAATTTTTGTAT
TTTTAGTAGAGGCGGGGTTTCACCATGCTGGTCAGGATGTTCTCCAACTCCTGACTTCATGATCCACCCACCTCGGCCTCCCAAAGTGCT
GAGATTACAGACGTGAGCCACTGCGTCCTGCCTAAAATGAATTTTCTAGATGATTGAATAACAGTAGTCCTTTGATAGAAGATAATGACT
TGGTTTATGGCCTTAATATACTACTTAATTACTTAAGATGTTTATTAATAGAATGATAAATGTACAGAGTAACCTATAAGCATGACATAC
TTTTGCTTTCAGTAGTTTCATGTAAAGAAAAAAACTTGAAAATAGTAATACCTGAGTACCCATGGGAATAATAGACACTGGGGAGGTAGG
GTGGGGAGCGGGAGCAAGAGCTGAAAAACTTACCTACTGGGGACTGTGCTCACTACCTGGGTGACAGGATCATACGTACCCCAAACCTCA
ACATCACACAGTATACTCAGCTAACAAACCTGCCCATGTGTTTCCTGAATCTAAAATAAAAATCGAAATAATTTTTTTAAAAAAGAAAAA
GACAATAGTATTACCCATGGGACAAAATTTGTACTATTAGCAAGAATCATTTTGTGTCTCATTTAGAAACAATTTGACTTTTGTTCCAGT
GTTTAAACTTTGACAAAAATGGTTTTGAATAGATCTTTATAACCTGATGCCATAAATACAAGATTCTCTGATACCTTCATTTAATATATC
AATATTGGGCCTAAAACAGTATTCTGTAAAGCTTAAATTGGTATTAACTATGATCATCTTGATGTCTATGATAGATAATAAACAAGGTCA
TACATACCTTACTAAACAATTTTGGTTTTTCACCAACATTTTATTCTTTAAAAGATTTAGACTAACAGAATTATTTAGCATTTCGAGTCA
TGTGCTTTATTTAGCAAGTGAGTAAAAATATTGGAATATTGAAGTATTTGCATAAAAAATCAAATGGTGGTGTTTTGTAATCTCTATTGT
ATTTCCTATTAAGGTTTCATATATTACTTTCCCATTGTTCCTGAATTTGTTATCCTATATATAAACAGAAACATGGATGAGTACCATATA
CTTCCTGACTCCTTTATCTTGTTACACCACAAGATACTGCACTATTTATTGGAGATATTTTGATGATTAGTGCCAAGGCTGGCATCATGA
GCCAAAATAGACTTGTTAAGTCAACCTCATTCCAAACACCTGCTATAGATCATCAAATCTAAGTTGCCATTAATTGTAAGATGTCCTGTT
ATTTCACACACACAACTAGAAAAGAAAAATCGGTATCAATTATAGTATGGTGCTTTCTTATTTTGAATTTTACTTATTAGAAGAGCTATT
TTAGATGTATTTAAACATAAGATTTTTATCACATAACTTGTGGATACAAAAAAAGAGAAATTGTAAGCAAAATAAGTTGGGTAATATATT

>88037_88037_4_SULF1-TERF1_SULF1_chr8_70533486_ENST00000402687_TERF1_chr8_73958196_ENST00000276602_length(amino acids)=554AA_BP=532
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPI
HMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGSIVPQIVL
NIDLAPTILDIAGLDTPPDVDGKSVLKLLDPEKPGNRFRTNKKAKIWRDTFLVERGKFLRKKEESSKNIQQSNHLPKYERVKELCQQARY
QTACEQPGQKWQCIEDTSGKLRIHKCKGPSDLLTVRQSTRNLYARGFHDKDKECSCRESGYRASRSQRKSQRQFLRNQGTPSMALGRRQE

--------------------------------------------------------------
>88037_88037_5_SULF1-TERF1_SULF1_chr8_70533486_ENST00000419716_TERF1_chr8_73958195_ENST00000276602_length(transcript)=4317nt_BP=2155nt
GTGATGATAATAATACGCGGGCTTATATAACCGTCTTCATCTTGCGAGCACTTCGCAGACCGTCGCTAATGAATCTTGGGGCCGGTGTCG
GGCCGGGGCGGCTTGATCGGCAACTAGGAAACCCCAGGCGCAGAGGCCAGGAGCGAGGGCAGCGAGGATCAGAGGCCAGGCCTTCCCGGC
TGCCGGCGCTCCTCGGAGGTCAGGGCAGATGAGGAACATGACTCTCCCCCTTCGGAGGAGGAAGGAAGTCCCGCTGCCACCTTATCTCTG
CTCCTCTGCCTCCTCCCTGTTCCCAGAGCTTTTTCTCTAGAGAAGATTTTGAAGGCGGCTTTTGTGCTGACGGCCACCCACCATCATCTA
AAGAAGATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTG
AATACCTCTGAGAATAGAGATTGATTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTCAGTTT
TGCAACATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGGGAAGCCTCTGTTCG
ACTGTCAGATCCCCGAGGTTCAGAGGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGCTTACCGATGATCAA
GATGTGGAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCAATGCCTTTGTGACT
ACACCCATGTGCTGCCCGTCACGGTCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCT
TCCCCCTCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCTTTTTTGGAAAATAC
CTCAATGAATATAATGGCAGCTACATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTT
TGTCGCAATGGCATCAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGAGCATTAATTACTTC
AAAATGTCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAG
TTTTCTAAACTGTACCCCAATGCTTCCCAACACATAACTCCTAGTTATAACTATGCACCAAATATGGATAAACACTGGATTATGCAGTAC
ACAGGACCAATGCTGCCCATCCACATGGAATTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGTGGATGATTCTGTG
GAGAGGCTGTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATACTTACATCATTTACACCGCCGACCATGGTTACCATATTGGGCAG
TTTGGACTGGTCAAGGGGAAATCCATGCCATATGACTTTGATATTCGTGTGCCTTTTTTTATTCGTGGTCCAAGTGTAGAACCAGGATCA
ATAGTCCCACAGATCGTTCTCAACATTGACTTGGCCCCCACGATCCTGGATATTGCTGGGCTCGACACACCTCCTGATGTGGACGGCAAG
TCTGTCCTCAAACTTCTGGACCCAGAAAAGCCAGGTAACAGGTTTCGAACAAACAAGAAGGCCAAAATTTGGCGTGATACATTCCTAGTG
GAAAGAGGCAAATTTCTACGTAAGAAGGAAGAATCCAGCAAGAATATCCAACAGTCAAATCACTTGCCCAAATATGAACGGGTCAAAGAA
CTATGCCAGCAGGCCAGGTACCAGACAGCCTGTGAACAACCGGGGCAGAAGTGGCAATGCATTGAGGATACATCTGGCAAGCTTCGAATT
CACAAGTGTAAAGGACCCAGTGACCTGCTCACAGTCCGGCAGAGCACGCGGAACCTCTACGCTCGCGGCTTCCATGACAAAGACAAAGAG
TGCAGTTGTAGGGAGTCTGGTTACCGTGCCAGCAGAAGCCAAAGAAAGAGTCAACGGCAATTCTTGAGAAACCAGGGGACTCCAAGCATG
GCTTTGGGAAGAAGACAAGAATTTGAGATCTGGCGTGAGGAAATATGGAGAGGGAAACTGGTCTAAAATACTGTTGCATTATAAATTCAA
CAACCGGACAAGTGTCATGTTAAAAGACAGATGGAGGACCATGAAGAAACTAAAACTGATTTCCTCAGACAGCGAAGACTGATTGTGTTT
GTAAAAGCTTGATGAAAGGACAGTTAAGTATTTTGATCACTGCATTTTGTTTGAAACTTGTGTCATTGATGTAATTTAAAACTTTTGTTT
AAAGCATTACAGTATTTTTCTGTGACCATCAATTAATGAGGGTTTGTGCTACCAGAGTTAAAGCATATGCTATCATTGTATTCTTTAAGA
ACCTTATTTTGATAAAATGTAAATTTGTTGAACCCTGCCACATTTAGTATCCCCACCCCCAAATCCTGTTCCAATGAAAAAATTAAAACC
TGATACGAAAAAAAAAAAATTCCAGTTAACCTATTTTGTGTCTGTAGGCTGACCTCAACCCTGTAACGTAACCCATTAAAATGAATTTCT
TTTTTTTTAAGACAGAGTTTCTCTCTGTTGCCCAGGCTGGAGTGCAGTGGCGCAATTTCAGCTCACTGCAACCTCTGCCTCCCAGGTTCA
AGTGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGCACACACCACCACGCCCAGCTAATTTTTGTATTTTTAGTAGAGGCG
GGGTTTCACCATGCTGGTCAGGATGTTCTCCAACTCCTGACTTCATGATCCACCCACCTCGGCCTCCCAAAGTGCTGAGATTACAGACGT
GAGCCACTGCGTCCTGCCTAAAATGAATTTTCTAGATGATTGAATAACAGTAGTCCTTTGATAGAAGATAATGACTTGGTTTATGGCCTT
AATATACTACTTAATTACTTAAGATGTTTATTAATAGAATGATAAATGTACAGAGTAACCTATAAGCATGACATACTTTTGCTTTCAGTA
GTTTCATGTAAAGAAAAAAACTTGAAAATAGTAATACCTGAGTACCCATGGGAATAATAGACACTGGGGAGGTAGGGTGGGGAGCGGGAG
CAAGAGCTGAAAAACTTACCTACTGGGGACTGTGCTCACTACCTGGGTGACAGGATCATACGTACCCCAAACCTCAACATCACACAGTAT
ACTCAGCTAACAAACCTGCCCATGTGTTTCCTGAATCTAAAATAAAAATCGAAATAATTTTTTTAAAAAAGAAAAAGACAATAGTATTAC
CCATGGGACAAAATTTGTACTATTAGCAAGAATCATTTTGTGTCTCATTTAGAAACAATTTGACTTTTGTTCCAGTGTTTAAACTTTGAC
AAAAATGGTTTTGAATAGATCTTTATAACCTGATGCCATAAATACAAGATTCTCTGATACCTTCATTTAATATATCAATATTGGGCCTAA
AACAGTATTCTGTAAAGCTTAAATTGGTATTAACTATGATCATCTTGATGTCTATGATAGATAATAAACAAGGTCATACATACCTTACTA
AACAATTTTGGTTTTTCACCAACATTTTATTCTTTAAAAGATTTAGACTAACAGAATTATTTAGCATTTCGAGTCATGTGCTTTATTTAG
CAAGTGAGTAAAAATATTGGAATATTGAAGTATTTGCATAAAAAATCAAATGGTGGTGTTTTGTAATCTCTATTGTATTTCCTATTAAGG
TTTCATATATTACTTTCCCATTGTTCCTGAATTTGTTATCCTATATATAAACAGAAACATGGATGAGTACCATATACTTCCTGACTCCTT
TATCTTGTTACACCACAAGATACTGCACTATTTATTGGAGATATTTTGATGATTAGTGCCAAGGCTGGCATCATGAGCCAAAATAGACTT
GTTAAGTCAACCTCATTCCAAACACCTGCTATAGATCATCAAATCTAAGTTGCCATTAATTGTAAGATGTCCTGTTATTTCACACACACA
ACTAGAAAAGAAAAATCGGTATCAATTATAGTATGGTGCTTTCTTATTTTGAATTTTACTTATTAGAAGAGCTATTTTAGATGTATTTAA

>88037_88037_5_SULF1-TERF1_SULF1_chr8_70533486_ENST00000419716_TERF1_chr8_73958195_ENST00000276602_length(amino acids)=554AA_BP=532
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPI
HMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGSIVPQIVL
NIDLAPTILDIAGLDTPPDVDGKSVLKLLDPEKPGNRFRTNKKAKIWRDTFLVERGKFLRKKEESSKNIQQSNHLPKYERVKELCQQARY
QTACEQPGQKWQCIEDTSGKLRIHKCKGPSDLLTVRQSTRNLYARGFHDKDKECSCRESGYRASRSQRKSQRQFLRNQGTPSMALGRRQE

--------------------------------------------------------------
>88037_88037_6_SULF1-TERF1_SULF1_chr8_70533486_ENST00000419716_TERF1_chr8_73958196_ENST00000276602_length(transcript)=4317nt_BP=2155nt
GTGATGATAATAATACGCGGGCTTATATAACCGTCTTCATCTTGCGAGCACTTCGCAGACCGTCGCTAATGAATCTTGGGGCCGGTGTCG
GGCCGGGGCGGCTTGATCGGCAACTAGGAAACCCCAGGCGCAGAGGCCAGGAGCGAGGGCAGCGAGGATCAGAGGCCAGGCCTTCCCGGC
TGCCGGCGCTCCTCGGAGGTCAGGGCAGATGAGGAACATGACTCTCCCCCTTCGGAGGAGGAAGGAAGTCCCGCTGCCACCTTATCTCTG
CTCCTCTGCCTCCTCCCTGTTCCCAGAGCTTTTTCTCTAGAGAAGATTTTGAAGGCGGCTTTTGTGCTGACGGCCACCCACCATCATCTA
AAGAAGATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTG
AATACCTCTGAGAATAGAGATTGATTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTCAGTTT
TGCAACATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGGGAAGCCTCTGTTCG
ACTGTCAGATCCCCGAGGTTCAGAGGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGCTTACCGATGATCAA
GATGTGGAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCAATGCCTTTGTGACT
ACACCCATGTGCTGCCCGTCACGGTCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCT
TCCCCCTCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCTTTTTTGGAAAATAC
CTCAATGAATATAATGGCAGCTACATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTT
TGTCGCAATGGCATCAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGAGCATTAATTACTTC
AAAATGTCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAG
TTTTCTAAACTGTACCCCAATGCTTCCCAACACATAACTCCTAGTTATAACTATGCACCAAATATGGATAAACACTGGATTATGCAGTAC
ACAGGACCAATGCTGCCCATCCACATGGAATTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGTGGATGATTCTGTG
GAGAGGCTGTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATACTTACATCATTTACACCGCCGACCATGGTTACCATATTGGGCAG
TTTGGACTGGTCAAGGGGAAATCCATGCCATATGACTTTGATATTCGTGTGCCTTTTTTTATTCGTGGTCCAAGTGTAGAACCAGGATCA
ATAGTCCCACAGATCGTTCTCAACATTGACTTGGCCCCCACGATCCTGGATATTGCTGGGCTCGACACACCTCCTGATGTGGACGGCAAG
TCTGTCCTCAAACTTCTGGACCCAGAAAAGCCAGGTAACAGGTTTCGAACAAACAAGAAGGCCAAAATTTGGCGTGATACATTCCTAGTG
GAAAGAGGCAAATTTCTACGTAAGAAGGAAGAATCCAGCAAGAATATCCAACAGTCAAATCACTTGCCCAAATATGAACGGGTCAAAGAA
CTATGCCAGCAGGCCAGGTACCAGACAGCCTGTGAACAACCGGGGCAGAAGTGGCAATGCATTGAGGATACATCTGGCAAGCTTCGAATT
CACAAGTGTAAAGGACCCAGTGACCTGCTCACAGTCCGGCAGAGCACGCGGAACCTCTACGCTCGCGGCTTCCATGACAAAGACAAAGAG
TGCAGTTGTAGGGAGTCTGGTTACCGTGCCAGCAGAAGCCAAAGAAAGAGTCAACGGCAATTCTTGAGAAACCAGGGGACTCCAAGCATG
GCTTTGGGAAGAAGACAAGAATTTGAGATCTGGCGTGAGGAAATATGGAGAGGGAAACTGGTCTAAAATACTGTTGCATTATAAATTCAA
CAACCGGACAAGTGTCATGTTAAAAGACAGATGGAGGACCATGAAGAAACTAAAACTGATTTCCTCAGACAGCGAAGACTGATTGTGTTT
GTAAAAGCTTGATGAAAGGACAGTTAAGTATTTTGATCACTGCATTTTGTTTGAAACTTGTGTCATTGATGTAATTTAAAACTTTTGTTT
AAAGCATTACAGTATTTTTCTGTGACCATCAATTAATGAGGGTTTGTGCTACCAGAGTTAAAGCATATGCTATCATTGTATTCTTTAAGA
ACCTTATTTTGATAAAATGTAAATTTGTTGAACCCTGCCACATTTAGTATCCCCACCCCCAAATCCTGTTCCAATGAAAAAATTAAAACC
TGATACGAAAAAAAAAAAATTCCAGTTAACCTATTTTGTGTCTGTAGGCTGACCTCAACCCTGTAACGTAACCCATTAAAATGAATTTCT
TTTTTTTTAAGACAGAGTTTCTCTCTGTTGCCCAGGCTGGAGTGCAGTGGCGCAATTTCAGCTCACTGCAACCTCTGCCTCCCAGGTTCA
AGTGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGCACACACCACCACGCCCAGCTAATTTTTGTATTTTTAGTAGAGGCG
GGGTTTCACCATGCTGGTCAGGATGTTCTCCAACTCCTGACTTCATGATCCACCCACCTCGGCCTCCCAAAGTGCTGAGATTACAGACGT
GAGCCACTGCGTCCTGCCTAAAATGAATTTTCTAGATGATTGAATAACAGTAGTCCTTTGATAGAAGATAATGACTTGGTTTATGGCCTT
AATATACTACTTAATTACTTAAGATGTTTATTAATAGAATGATAAATGTACAGAGTAACCTATAAGCATGACATACTTTTGCTTTCAGTA
GTTTCATGTAAAGAAAAAAACTTGAAAATAGTAATACCTGAGTACCCATGGGAATAATAGACACTGGGGAGGTAGGGTGGGGAGCGGGAG
CAAGAGCTGAAAAACTTACCTACTGGGGACTGTGCTCACTACCTGGGTGACAGGATCATACGTACCCCAAACCTCAACATCACACAGTAT
ACTCAGCTAACAAACCTGCCCATGTGTTTCCTGAATCTAAAATAAAAATCGAAATAATTTTTTTAAAAAAGAAAAAGACAATAGTATTAC
CCATGGGACAAAATTTGTACTATTAGCAAGAATCATTTTGTGTCTCATTTAGAAACAATTTGACTTTTGTTCCAGTGTTTAAACTTTGAC
AAAAATGGTTTTGAATAGATCTTTATAACCTGATGCCATAAATACAAGATTCTCTGATACCTTCATTTAATATATCAATATTGGGCCTAA
AACAGTATTCTGTAAAGCTTAAATTGGTATTAACTATGATCATCTTGATGTCTATGATAGATAATAAACAAGGTCATACATACCTTACTA
AACAATTTTGGTTTTTCACCAACATTTTATTCTTTAAAAGATTTAGACTAACAGAATTATTTAGCATTTCGAGTCATGTGCTTTATTTAG
CAAGTGAGTAAAAATATTGGAATATTGAAGTATTTGCATAAAAAATCAAATGGTGGTGTTTTGTAATCTCTATTGTATTTCCTATTAAGG
TTTCATATATTACTTTCCCATTGTTCCTGAATTTGTTATCCTATATATAAACAGAAACATGGATGAGTACCATATACTTCCTGACTCCTT
TATCTTGTTACACCACAAGATACTGCACTATTTATTGGAGATATTTTGATGATTAGTGCCAAGGCTGGCATCATGAGCCAAAATAGACTT
GTTAAGTCAACCTCATTCCAAACACCTGCTATAGATCATCAAATCTAAGTTGCCATTAATTGTAAGATGTCCTGTTATTTCACACACACA
ACTAGAAAAGAAAAATCGGTATCAATTATAGTATGGTGCTTTCTTATTTTGAATTTTACTTATTAGAAGAGCTATTTTAGATGTATTTAA

>88037_88037_6_SULF1-TERF1_SULF1_chr8_70533486_ENST00000419716_TERF1_chr8_73958196_ENST00000276602_length(amino acids)=554AA_BP=532
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPI
HMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGSIVPQIVL
NIDLAPTILDIAGLDTPPDVDGKSVLKLLDPEKPGNRFRTNKKAKIWRDTFLVERGKFLRKKEESSKNIQQSNHLPKYERVKELCQQARY
QTACEQPGQKWQCIEDTSGKLRIHKCKGPSDLLTVRQSTRNLYARGFHDKDKECSCRESGYRASRSQRKSQRQFLRNQGTPSMALGRRQE

--------------------------------------------------------------
>88037_88037_7_SULF1-TERF1_SULF1_chr8_70533486_ENST00000458141_TERF1_chr8_73958195_ENST00000276602_length(transcript)=4311nt_BP=2149nt
AGGTTACTTGACTGGGAGTTCTCAGACCTCCAGTTTCAGCCCTGCCCTCAGCCTCCAATCCGTAAGAGACACCCAGCCCCAGCAATTGGA
TTGGGCAGCCCGTCTTGACACACCACTGTGCTGAGTGCTTGAGGACGTGTTTCAACAGATGGTTGGGGTTAGTGTGTGTCATCACATTCG
AGTGGGGATTAAGAGAAGGAAGGCTGCCTTGCTGGAGCTGTGTGGTCTTCTCCAAGTGAGAGTCGCAGGCAATAGAACTACTTTGCTTTT
GGAGGAAAAGGAGGAATTCATTTTCAGCAGACACAAGAAAAGCAGTTTTTTTTTCAGGTGCTGACGGCCACCCACCATCATCTAAAGAAG
ATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTGAATACC
TCTGAGAATAGAGATTGATTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTCAGTTTTGCAAC
ATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGGGAAGCCTCTGTTCGACTGTC
AGATCCCCGAGGTTCAGAGGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGCTTACCGATGATCAAGATGTG
GAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCAATGCCTTTGTGACTACACCC
ATGTGCTGCCCGTCACGGTCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCTTCCCCC
TCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCTTTTTTGGAAAATACCTCAAT
GAATATAATGGCAGCTACATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTTTGTCGC
AATGGCATCAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGAGCATTAATTACTTCAAAATG
TCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAGTTTTCT
AAACTGTACCCCAATGCTTCCCAACACATAACTCCTAGTTATAACTATGCACCAAATATGGATAAACACTGGATTATGCAGTACACAGGA
CCAATGCTGCCCATCCACATGGAATTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGTGGATGATTCTGTGGAGAGG
CTGTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATACTTACATCATTTACACCGCCGACCATGGTTACCATATTGGGCAGTTTGGA
CTGGTCAAGGGGAAATCCATGCCATATGACTTTGATATTCGTGTGCCTTTTTTTATTCGTGGTCCAAGTGTAGAACCAGGATCAATAGTC
CCACAGATCGTTCTCAACATTGACTTGGCCCCCACGATCCTGGATATTGCTGGGCTCGACACACCTCCTGATGTGGACGGCAAGTCTGTC
CTCAAACTTCTGGACCCAGAAAAGCCAGGTAACAGGTTTCGAACAAACAAGAAGGCCAAAATTTGGCGTGATACATTCCTAGTGGAAAGA
GGCAAATTTCTACGTAAGAAGGAAGAATCCAGCAAGAATATCCAACAGTCAAATCACTTGCCCAAATATGAACGGGTCAAAGAACTATGC
CAGCAGGCCAGGTACCAGACAGCCTGTGAACAACCGGGGCAGAAGTGGCAATGCATTGAGGATACATCTGGCAAGCTTCGAATTCACAAG
TGTAAAGGACCCAGTGACCTGCTCACAGTCCGGCAGAGCACGCGGAACCTCTACGCTCGCGGCTTCCATGACAAAGACAAAGAGTGCAGT
TGTAGGGAGTCTGGTTACCGTGCCAGCAGAAGCCAAAGAAAGAGTCAACGGCAATTCTTGAGAAACCAGGGGACTCCAAGCATGGCTTTG
GGAAGAAGACAAGAATTTGAGATCTGGCGTGAGGAAATATGGAGAGGGAAACTGGTCTAAAATACTGTTGCATTATAAATTCAACAACCG
GACAAGTGTCATGTTAAAAGACAGATGGAGGACCATGAAGAAACTAAAACTGATTTCCTCAGACAGCGAAGACTGATTGTGTTTGTAAAA
GCTTGATGAAAGGACAGTTAAGTATTTTGATCACTGCATTTTGTTTGAAACTTGTGTCATTGATGTAATTTAAAACTTTTGTTTAAAGCA
TTACAGTATTTTTCTGTGACCATCAATTAATGAGGGTTTGTGCTACCAGAGTTAAAGCATATGCTATCATTGTATTCTTTAAGAACCTTA
TTTTGATAAAATGTAAATTTGTTGAACCCTGCCACATTTAGTATCCCCACCCCCAAATCCTGTTCCAATGAAAAAATTAAAACCTGATAC
GAAAAAAAAAAAATTCCAGTTAACCTATTTTGTGTCTGTAGGCTGACCTCAACCCTGTAACGTAACCCATTAAAATGAATTTCTTTTTTT
TTAAGACAGAGTTTCTCTCTGTTGCCCAGGCTGGAGTGCAGTGGCGCAATTTCAGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGTGAT
TCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGCACACACCACCACGCCCAGCTAATTTTTGTATTTTTAGTAGAGGCGGGGTTT
CACCATGCTGGTCAGGATGTTCTCCAACTCCTGACTTCATGATCCACCCACCTCGGCCTCCCAAAGTGCTGAGATTACAGACGTGAGCCA
CTGCGTCCTGCCTAAAATGAATTTTCTAGATGATTGAATAACAGTAGTCCTTTGATAGAAGATAATGACTTGGTTTATGGCCTTAATATA
CTACTTAATTACTTAAGATGTTTATTAATAGAATGATAAATGTACAGAGTAACCTATAAGCATGACATACTTTTGCTTTCAGTAGTTTCA
TGTAAAGAAAAAAACTTGAAAATAGTAATACCTGAGTACCCATGGGAATAATAGACACTGGGGAGGTAGGGTGGGGAGCGGGAGCAAGAG
CTGAAAAACTTACCTACTGGGGACTGTGCTCACTACCTGGGTGACAGGATCATACGTACCCCAAACCTCAACATCACACAGTATACTCAG
CTAACAAACCTGCCCATGTGTTTCCTGAATCTAAAATAAAAATCGAAATAATTTTTTTAAAAAAGAAAAAGACAATAGTATTACCCATGG
GACAAAATTTGTACTATTAGCAAGAATCATTTTGTGTCTCATTTAGAAACAATTTGACTTTTGTTCCAGTGTTTAAACTTTGACAAAAAT
GGTTTTGAATAGATCTTTATAACCTGATGCCATAAATACAAGATTCTCTGATACCTTCATTTAATATATCAATATTGGGCCTAAAACAGT
ATTCTGTAAAGCTTAAATTGGTATTAACTATGATCATCTTGATGTCTATGATAGATAATAAACAAGGTCATACATACCTTACTAAACAAT
TTTGGTTTTTCACCAACATTTTATTCTTTAAAAGATTTAGACTAACAGAATTATTTAGCATTTCGAGTCATGTGCTTTATTTAGCAAGTG
AGTAAAAATATTGGAATATTGAAGTATTTGCATAAAAAATCAAATGGTGGTGTTTTGTAATCTCTATTGTATTTCCTATTAAGGTTTCAT
ATATTACTTTCCCATTGTTCCTGAATTTGTTATCCTATATATAAACAGAAACATGGATGAGTACCATATACTTCCTGACTCCTTTATCTT
GTTACACCACAAGATACTGCACTATTTATTGGAGATATTTTGATGATTAGTGCCAAGGCTGGCATCATGAGCCAAAATAGACTTGTTAAG
TCAACCTCATTCCAAACACCTGCTATAGATCATCAAATCTAAGTTGCCATTAATTGTAAGATGTCCTGTTATTTCACACACACAACTAGA
AAAGAAAAATCGGTATCAATTATAGTATGGTGCTTTCTTATTTTGAATTTTACTTATTAGAAGAGCTATTTTAGATGTATTTAAACATAA

>88037_88037_7_SULF1-TERF1_SULF1_chr8_70533486_ENST00000458141_TERF1_chr8_73958195_ENST00000276602_length(amino acids)=554AA_BP=532
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPI
HMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGSIVPQIVL
NIDLAPTILDIAGLDTPPDVDGKSVLKLLDPEKPGNRFRTNKKAKIWRDTFLVERGKFLRKKEESSKNIQQSNHLPKYERVKELCQQARY
QTACEQPGQKWQCIEDTSGKLRIHKCKGPSDLLTVRQSTRNLYARGFHDKDKECSCRESGYRASRSQRKSQRQFLRNQGTPSMALGRRQE

--------------------------------------------------------------
>88037_88037_8_SULF1-TERF1_SULF1_chr8_70533486_ENST00000458141_TERF1_chr8_73958196_ENST00000276602_length(transcript)=4311nt_BP=2149nt
AGGTTACTTGACTGGGAGTTCTCAGACCTCCAGTTTCAGCCCTGCCCTCAGCCTCCAATCCGTAAGAGACACCCAGCCCCAGCAATTGGA
TTGGGCAGCCCGTCTTGACACACCACTGTGCTGAGTGCTTGAGGACGTGTTTCAACAGATGGTTGGGGTTAGTGTGTGTCATCACATTCG
AGTGGGGATTAAGAGAAGGAAGGCTGCCTTGCTGGAGCTGTGTGGTCTTCTCCAAGTGAGAGTCGCAGGCAATAGAACTACTTTGCTTTT
GGAGGAAAAGGAGGAATTCATTTTCAGCAGACACAAGAAAAGCAGTTTTTTTTTCAGGTGCTGACGGCCACCCACCATCATCTAAAGAAG
ATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTGAATACC
TCTGAGAATAGAGATTGATTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTCAGTTTTGCAAC
ATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGGGAAGCCTCTGTTCGACTGTC
AGATCCCCGAGGTTCAGAGGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGCTTACCGATGATCAAGATGTG
GAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCAATGCCTTTGTGACTACACCC
ATGTGCTGCCCGTCACGGTCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCTTCCCCC
TCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCTTTTTTGGAAAATACCTCAAT
GAATATAATGGCAGCTACATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTTTGTCGC
AATGGCATCAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGAGCATTAATTACTTCAAAATG
TCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAGTTTTCT
AAACTGTACCCCAATGCTTCCCAACACATAACTCCTAGTTATAACTATGCACCAAATATGGATAAACACTGGATTATGCAGTACACAGGA
CCAATGCTGCCCATCCACATGGAATTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGTGGATGATTCTGTGGAGAGG
CTGTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATACTTACATCATTTACACCGCCGACCATGGTTACCATATTGGGCAGTTTGGA
CTGGTCAAGGGGAAATCCATGCCATATGACTTTGATATTCGTGTGCCTTTTTTTATTCGTGGTCCAAGTGTAGAACCAGGATCAATAGTC
CCACAGATCGTTCTCAACATTGACTTGGCCCCCACGATCCTGGATATTGCTGGGCTCGACACACCTCCTGATGTGGACGGCAAGTCTGTC
CTCAAACTTCTGGACCCAGAAAAGCCAGGTAACAGGTTTCGAACAAACAAGAAGGCCAAAATTTGGCGTGATACATTCCTAGTGGAAAGA
GGCAAATTTCTACGTAAGAAGGAAGAATCCAGCAAGAATATCCAACAGTCAAATCACTTGCCCAAATATGAACGGGTCAAAGAACTATGC
CAGCAGGCCAGGTACCAGACAGCCTGTGAACAACCGGGGCAGAAGTGGCAATGCATTGAGGATACATCTGGCAAGCTTCGAATTCACAAG
TGTAAAGGACCCAGTGACCTGCTCACAGTCCGGCAGAGCACGCGGAACCTCTACGCTCGCGGCTTCCATGACAAAGACAAAGAGTGCAGT
TGTAGGGAGTCTGGTTACCGTGCCAGCAGAAGCCAAAGAAAGAGTCAACGGCAATTCTTGAGAAACCAGGGGACTCCAAGCATGGCTTTG
GGAAGAAGACAAGAATTTGAGATCTGGCGTGAGGAAATATGGAGAGGGAAACTGGTCTAAAATACTGTTGCATTATAAATTCAACAACCG
GACAAGTGTCATGTTAAAAGACAGATGGAGGACCATGAAGAAACTAAAACTGATTTCCTCAGACAGCGAAGACTGATTGTGTTTGTAAAA
GCTTGATGAAAGGACAGTTAAGTATTTTGATCACTGCATTTTGTTTGAAACTTGTGTCATTGATGTAATTTAAAACTTTTGTTTAAAGCA
TTACAGTATTTTTCTGTGACCATCAATTAATGAGGGTTTGTGCTACCAGAGTTAAAGCATATGCTATCATTGTATTCTTTAAGAACCTTA
TTTTGATAAAATGTAAATTTGTTGAACCCTGCCACATTTAGTATCCCCACCCCCAAATCCTGTTCCAATGAAAAAATTAAAACCTGATAC
GAAAAAAAAAAAATTCCAGTTAACCTATTTTGTGTCTGTAGGCTGACCTCAACCCTGTAACGTAACCCATTAAAATGAATTTCTTTTTTT
TTAAGACAGAGTTTCTCTCTGTTGCCCAGGCTGGAGTGCAGTGGCGCAATTTCAGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGTGAT
TCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGCACACACCACCACGCCCAGCTAATTTTTGTATTTTTAGTAGAGGCGGGGTTT
CACCATGCTGGTCAGGATGTTCTCCAACTCCTGACTTCATGATCCACCCACCTCGGCCTCCCAAAGTGCTGAGATTACAGACGTGAGCCA
CTGCGTCCTGCCTAAAATGAATTTTCTAGATGATTGAATAACAGTAGTCCTTTGATAGAAGATAATGACTTGGTTTATGGCCTTAATATA
CTACTTAATTACTTAAGATGTTTATTAATAGAATGATAAATGTACAGAGTAACCTATAAGCATGACATACTTTTGCTTTCAGTAGTTTCA
TGTAAAGAAAAAAACTTGAAAATAGTAATACCTGAGTACCCATGGGAATAATAGACACTGGGGAGGTAGGGTGGGGAGCGGGAGCAAGAG
CTGAAAAACTTACCTACTGGGGACTGTGCTCACTACCTGGGTGACAGGATCATACGTACCCCAAACCTCAACATCACACAGTATACTCAG
CTAACAAACCTGCCCATGTGTTTCCTGAATCTAAAATAAAAATCGAAATAATTTTTTTAAAAAAGAAAAAGACAATAGTATTACCCATGG
GACAAAATTTGTACTATTAGCAAGAATCATTTTGTGTCTCATTTAGAAACAATTTGACTTTTGTTCCAGTGTTTAAACTTTGACAAAAAT
GGTTTTGAATAGATCTTTATAACCTGATGCCATAAATACAAGATTCTCTGATACCTTCATTTAATATATCAATATTGGGCCTAAAACAGT
ATTCTGTAAAGCTTAAATTGGTATTAACTATGATCATCTTGATGTCTATGATAGATAATAAACAAGGTCATACATACCTTACTAAACAAT
TTTGGTTTTTCACCAACATTTTATTCTTTAAAAGATTTAGACTAACAGAATTATTTAGCATTTCGAGTCATGTGCTTTATTTAGCAAGTG
AGTAAAAATATTGGAATATTGAAGTATTTGCATAAAAAATCAAATGGTGGTGTTTTGTAATCTCTATTGTATTTCCTATTAAGGTTTCAT
ATATTACTTTCCCATTGTTCCTGAATTTGTTATCCTATATATAAACAGAAACATGGATGAGTACCATATACTTCCTGACTCCTTTATCTT
GTTACACCACAAGATACTGCACTATTTATTGGAGATATTTTGATGATTAGTGCCAAGGCTGGCATCATGAGCCAAAATAGACTTGTTAAG
TCAACCTCATTCCAAACACCTGCTATAGATCATCAAATCTAAGTTGCCATTAATTGTAAGATGTCCTGTTATTTCACACACACAACTAGA
AAAGAAAAATCGGTATCAATTATAGTATGGTGCTTTCTTATTTTGAATTTTACTTATTAGAAGAGCTATTTTAGATGTATTTAAACATAA

>88037_88037_8_SULF1-TERF1_SULF1_chr8_70533486_ENST00000458141_TERF1_chr8_73958196_ENST00000276602_length(amino acids)=554AA_BP=532
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPI
HMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFFIRGPSVEPGSIVPQIVL
NIDLAPTILDIAGLDTPPDVDGKSVLKLLDPEKPGNRFRTNKKAKIWRDTFLVERGKFLRKKEESSKNIQQSNHLPKYERVKELCQQARY
QTACEQPGQKWQCIEDTSGKLRIHKCKGPSDLLTVRQSTRNLYARGFHDKDKECSCRESGYRASRSQRKSQRQFLRNQGTPSMALGRRQE

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for SULF1-TERF1


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with
TgeneTERF1chr8:70533486chr8:73958195ENST0000027660279265_378361.0420.0RLIM
TgeneTERF1chr8:70533486chr8:73958195ENST00000276603810265_378381.0440.0RLIM
TgeneTERF1chr8:70533486chr8:73958196ENST0000027660279265_378361.0420.0RLIM
TgeneTERF1chr8:70533486chr8:73958196ENST00000276603810265_378381.0440.0RLIM


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for SULF1-TERF1


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for SULF1-TERF1


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource