Center for Computational Systems Medicine
Fusion gene: SULF1-RPS29 (FusionGDB2 ID: 88036)

Fusion Gene Summary for SULF1-RPS29

Fusion gene summary
Fusion gene information
Fusion gene name: SULF1-RPS29
Fusion gene ID: 88036

Field | Hgene | Tgene
Gene symbol | SULF1 | RPS29
Gene ID | 23213 | 6235
Gene name | sulfatase 1 | ribosomal protein S29
Synonyms | SULF-1 | DBA13, S29, uS14
Cytomap | 8q13.2-q13.3 | 14q21.3
Type of gene | protein-coding | protein-coding
Description | extracellular sulfatase Sulf-1; sulfatase FP | 40S ribosomal protein S29; small ribosomal subunit protein uS14
Modification date | 20200313 | 20200313
UniProtAcc | . | .
Ensembl transcripts involved in fusion gene | ENST00000521946, ENST00000260128, ENST00000402687, ENST00000419716, ENST00000458141 | ENST00000245458, ENST00000396020, ENST00000557111
Fusion gene scores: * DoF score | 13 X 15 X 5=975 | 16 X 14 X 5=1120
# samples | 16 | 17
** MAII score | log2(16/975*10)=-2.60733031374961 | log2(17/1120*10)=-2.71989208080726
Category (DoF>8 and MAII<0) | possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs) | possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs)
Context

PubMed: SULF1 [Title/Abstract] AND RPS29 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint: SULF1(70512988)-RPS29(50052767), # samples: 3
Anticipated loss of major functional domain due to fusion event: SULF1-RPS29 appears to have lost the major protein functional domain of the Tgene partner, which is a cell metabolism gene, because of the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
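
The two score definitions above can be reproduced with a few lines of Python. This is a minimal sketch, not FusionGDB's own code: the function names are hypothetical, and the input counts are the ones listed for SULF1 (Hgene) and RPS29 (Tgene) in the summary table.

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency = # partners X # break points X # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index = log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def is_peginpcfg(dof: int, maii: float) -> bool:
    """Criterion shown on the page for peGinPCFGs: DoF>8 and MAII<0."""
    return dof > 8 and maii < 0


# Counts taken from the summary table for this fusion gene.
hgene_dof = dof_score(13, 15, 5)        # SULF1
tgene_dof = dof_score(16, 14, 5)        # RPS29
hgene_maii = maii_score(16, hgene_dof)  # 16 samples
tgene_maii = maii_score(17, tgene_dof)  # 17 samples

print("SULF1", hgene_dof, hgene_maii, is_peginpcfg(hgene_dof, hgene_maii))
print("RPS29", tgene_dof, tgene_maii, is_peginpcfg(tgene_dof, tgene_maii))
```

Both partners satisfy DoF>8 and MAII<0, which matches the peGinPCFGs label given to each gene in the summary.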

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
Partner | Gene | GO ID | GO term | PubMed ID
Hgene | SULF1 | GO:0001937 | negative regulation of endothelial cell proliferation | 16778174
Hgene | SULF1 | GO:0016525 | negative regulation of angiogenesis | 16778174
Hgene | SULF1 | GO:0030177 | positive regulation of Wnt signaling pathway | 19520866
Hgene | SULF1 | GO:0030201 | heparan sulfate proteoglycan metabolic process | 18687675, 19666466, 19822709
Hgene | SULF1 | GO:0048010 | vascular endothelial growth factor receptor signaling pathway | 16778174
Tgene | RPS29 | GO:0002181 | cytoplasmic translation | 25957688


Fusion gene breakpoints across SULF1 (5'-gene)
[Image: gene structure with breakpoints; on the original page it links to a UCSC Genome Browser custom track]

Fusion gene breakpoints across RPS29 (3'-gene)
[Image: gene structure with breakpoints; on the original page it links to a UCSC Genome Browser custom track]

Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinates were lifted over to hg19.
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
ChimerDB4 | OV | TCGA-09-1667-01C | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | -
ChimerDB4 | OV | TCGA-09-1667-01C | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -
ChimerDB4 | OV | TCGA-09-1667 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | -
ChimerDB4 | OV | TCGA-09-1667 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -



Fusion Gene ORF analysis for SULF1-RPS29

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
3UTR-3CDS | ENST00000521946 | ENST00000245458 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -
3UTR-3CDS | ENST00000521946 | ENST00000396020 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -
3UTR-3CDS | ENST00000521946 | ENST00000396020 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | -
3UTR-5UTR | ENST00000521946 | ENST00000557111 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -
3UTR-intron | ENST00000521946 | ENST00000245458 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | -
3UTR-intron | ENST00000521946 | ENST00000557111 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | -
5CDS-5UTR | ENST00000260128 | ENST00000557111 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | -
5CDS-5UTR | ENST00000260128 | ENST00000557111 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -
5CDS-5UTR | ENST00000402687 | ENST00000557111 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | -
5CDS-5UTR | ENST00000402687 | ENST00000557111 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -
5CDS-5UTR | ENST00000419716 | ENST00000557111 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | -
5CDS-5UTR | ENST00000419716 | ENST00000557111 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -
5CDS-5UTR | ENST00000458141 | ENST00000557111 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | -
5CDS-5UTR | ENST00000458141 | ENST00000557111 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -
5CDS-intron | ENST00000260128 | ENST00000245458 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | -
5CDS-intron | ENST00000260128 | ENST00000557111 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | -
5CDS-intron | ENST00000402687 | ENST00000245458 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | -
5CDS-intron | ENST00000402687 | ENST00000557111 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | -
5CDS-intron | ENST00000419716 | ENST00000245458 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | -
5CDS-intron | ENST00000419716 | ENST00000557111 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | -
5CDS-intron | ENST00000458141 | ENST00000245458 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | -
5CDS-intron | ENST00000458141 | ENST00000557111 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | -
Frame-shift | ENST00000260128 | ENST00000245458 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -
Frame-shift | ENST00000260128 | ENST00000396020 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -
Frame-shift | ENST00000402687 | ENST00000245458 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -
Frame-shift | ENST00000402687 | ENST00000396020 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -
Frame-shift | ENST00000419716 | ENST00000245458 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -
Frame-shift | ENST00000419716 | ENST00000396020 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -
Frame-shift | ENST00000458141 | ENST00000245458 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -
Frame-shift | ENST00000458141 | ENST00000396020 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50052767 | -
In-frame | ENST00000260128 | ENST00000245458 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | -
In-frame | ENST00000260128 | ENST00000396020 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | -
In-frame | ENST00000260128 | ENST00000396020 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | -
In-frame | ENST00000402687 | ENST00000245458 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | -
In-frame | ENST00000402687 | ENST00000396020 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | -
In-frame | ENST00000402687 | ENST00000396020 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | -
In-frame | ENST00000419716 | ENST00000245458 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | -
In-frame | ENST00000419716 | ENST00000396020 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | -
In-frame | ENST00000419716 | ENST00000396020 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | -
In-frame | ENST00000458141 | ENST00000245458 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | -
In-frame | ENST00000458141 | ENST00000396020 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | -
In-frame | ENST00000458141 | ENST00000396020 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | -
intron-3CDS | ENST00000521946 | ENST00000245458 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | -
intron-3CDS | ENST00000521946 | ENST00000396020 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | -
intron-5UTR | ENST00000521946 | ENST00000557111 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | -

ORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
Henst | Hgene | Hchr | Hbp | Hstrand | Tenst | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids)
ENST00000260128 | SULF1 | chr8 | 70512988 | + | ENST00000396020 | RPS29 | chr14 | 50044571 | - | 2784 | 1602 | 717 | 1643 | 308
ENST00000458141 | SULF1 | chr8 | 70512988 | + | ENST00000396020 | RPS29 | chr14 | 50044571 | - | 2622 | 1440 | 555 | 1481 | 308
ENST00000402687 | SULF1 | chr8 | 70512988 | + | ENST00000396020 | RPS29 | chr14 | 50044571 | - | 2822 | 1640 | 755 | 1681 | 308
ENST00000419716 | SULF1 | chr8 | 70512988 | + | ENST00000396020 | RPS29 | chr14 | 50044571 | - | 2628 | 1446 | 561 | 1487 | 308
ENST00000260128 | SULF1 | chr8 | 70501376 | + | ENST00000396020 | RPS29 | chr14 | 50052767 | - | 2733 | 1451 | 717 | 1592 | 291
ENST00000260128 | SULF1 | chr8 | 70501376 | + | ENST00000245458 | RPS29 | chr14 | 50052767 | - | 1655 | 1451 | 717 | 1559 | 280
ENST00000458141 | SULF1 | chr8 | 70501376 | + | ENST00000396020 | RPS29 | chr14 | 50052767 | - | 2571 | 1289 | 555 | 1430 | 291
ENST00000458141 | SULF1 | chr8 | 70501376 | + | ENST00000245458 | RPS29 | chr14 | 50052767 | - | 1493 | 1289 | 555 | 1397 | 280
ENST00000402687 | SULF1 | chr8 | 70501376 | + | ENST00000396020 | RPS29 | chr14 | 50052767 | - | 2771 | 1489 | 755 | 1630 | 291
ENST00000402687 | SULF1 | chr8 | 70501376 | + | ENST00000245458 | RPS29 | chr14 | 50052767 | - | 1693 | 1489 | 755 | 1597 | 280
ENST00000419716 | SULF1 | chr8 | 70501376 | + | ENST00000396020 | RPS29 | chr14 | 50052767 | - | 2577 | 1295 | 561 | 1436 | 291
ENST00000419716 | SULF1 | chr8 | 70501376 | + | ENST00000245458 | RPS29 | chr14 | 50052767 | - | 1499 | 1295 | 561 | 1403 | 280
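
The predicted start, predicted stop, and amino-acid length columns of the ORFfinder table can be cross-checked against each other. The sketch below rests on an assumption inferred from the numbers rather than stated on the page: the start and stop positions are 1-based and inclusive, and the stop codon lies inside the [start, stop] span. The helper name `orf_aa_length` is hypothetical.

```python
# (Henst, Tenst, predicted start, predicted stop, reported aa length),
# copied from the ORFfinder table for the in-frame fusion transcripts.
ORF_ROWS = [
    ("ENST00000260128", "ENST00000396020", 717, 1643, 308),
    ("ENST00000458141", "ENST00000396020", 555, 1481, 308),
    ("ENST00000402687", "ENST00000396020", 755, 1681, 308),
    ("ENST00000419716", "ENST00000396020", 561, 1487, 308),
    ("ENST00000260128", "ENST00000396020", 717, 1592, 291),
    ("ENST00000260128", "ENST00000245458", 717, 1559, 280),
    ("ENST00000458141", "ENST00000396020", 555, 1430, 291),
    ("ENST00000458141", "ENST00000245458", 555, 1397, 280),
    ("ENST00000402687", "ENST00000396020", 755, 1630, 291),
    ("ENST00000402687", "ENST00000245458", 755, 1597, 280),
    ("ENST00000419716", "ENST00000396020", 561, 1436, 291),
    ("ENST00000419716", "ENST00000245458", 561, 1403, 280),
]


def orf_aa_length(start: int, stop: int) -> int:
    """Amino-acid length of an ORF spanning [start, stop] (assumed 1-based,
    inclusive, stop codon included in the span)."""
    span = stop - start + 1
    if span % 3 != 0:
        raise ValueError(f"ORF span {span} nt is not a multiple of 3")
    return span // 3 - 1  # subtract the stop codon


# Rows whose reported length disagrees with the coordinates (expected: none).
mismatches = [
    row for row in ORF_ROWS if orf_aa_length(row[2], row[3]) != row[4]
]
print("rows checked:", len(ORF_ROWS), "mismatches:", len(mismatches))
```

Under that assumption every row is internally consistent, which also supports the way the fused numeric columns were split in the table.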

DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding-potential classifier based on a convolutional neural network, built by comparison with real Ribo-seq data. If the no-coding score is < 0.5 and the coding score is > 0.5, the in-frame fusion transcript is predicted as likely to be translated.
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score
ENST00000260128 | ENST00000396020 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | - | 0.001398109 | 0.99860185
ENST00000458141 | ENST00000396020 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | - | 0.001712977 | 0.998287
ENST00000402687 | ENST00000396020 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | - | 0.001484984 | 0.99851507
ENST00000419716 | ENST00000396020 | SULF1 | chr8 | 70512988 | + | RPS29 | chr14 | 50044571 | - | 0.001910289 | 0.99808973
ENST00000260128 | ENST00000396020 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | - | 0.001177138 | 0.99882287
ENST00000260128 | ENST00000245458 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | - | 0.00079664 | 0.9992034
ENST00000458141 | ENST00000396020 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | - | 0.001321724 | 0.99867827
ENST00000458141 | ENST00000245458 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | - | 0.000942687 | 0.99905735
ENST00000402687 | ENST00000396020 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | - | 0.001136032 | 0.99886394
ENST00000402687 | ENST00000245458 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | - | 0.00082742 | 0.9991725
ENST00000419716 | ENST00000396020 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | - | 0.001459664 | 0.9985404
ENST00000419716 | ENST00000245458 | SULF1 | chr8 | 70501376 | + | RPS29 | chr14 | 50052767 | - | 0.001101493 | 0.9988985
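
The decision rule stated for DeepORF (no-coding score < 0.5 and coding score > 0.5) can be applied to the scores in the table as follows. This is only an illustration of the thresholding step; the function name `likely_translated` is hypothetical and the classifier itself is not reproduced here.

```python
def likely_translated(no_coding: float, coding: float, threshold: float = 0.5) -> bool:
    """Rule from the page: no-coding score < 0.5 and coding score > 0.5."""
    return no_coding < threshold and coding > threshold


# (no-coding score, coding score) pairs in the order of the DeepORF table.
DEEPORF_SCORES = [
    (0.001398109, 0.99860185),
    (0.001712977, 0.998287),
    (0.001484984, 0.99851507),
    (0.001910289, 0.99808973),
    (0.001177138, 0.99882287),
    (0.00079664, 0.9992034),
    (0.001321724, 0.99867827),
    (0.000942687, 0.99905735),
    (0.001136032, 0.99886394),
    (0.00082742, 0.9991725),
    (0.001459664, 0.9985404),
    (0.001101493, 0.9988985),
]

calls = [likely_translated(nc, c) for nc, c in DEEPORF_SCORES]
print(sum(calls), "of", len(calls), "in-frame transcripts pass the rule")
```

All twelve in-frame fusion transcripts listed for SULF1-RPS29 pass the rule.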


Fusion Genomic Features for SULF1-RPS29


FusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5 kb for each partner gene, 20 kb of sequence in total). FusionAI is a fusion gene breakpoint classifier based on a convolutional neural network that compares the fusion-positive and fusion-negative sequence contexts of ~20K fusion gene data. From this, we can estimate how likely each position in the 20 kb genomic sequence is to be used as a gene fusion breakpoint.
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint)

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions. We integrated 44 different types of human genomic feature loci in five broad categories: virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are on the help page.
[Image: genomic feature]

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions that overlap the regions with the top 1% feature importance scores. More details are on the help page.


Fusion Protein Features for SULF1-RPS29


Four levels of functional features of fusion genes
Go to the FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr8:70512988/chr14:50052767)
- FGviewer provides online visualization of the retention search of protein functional features across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the Tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to find the hyperlink to the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
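
The FGviewer link given for the most frequent breakpoint has the shape `.../FGviewer/<Hchr>:<Hbp>/<Tchr>:<Tbp>`. The sketch below rebuilds that one link from the breakpoint coordinates. Whether FGviewer accepts the same pattern for other breakpoints is an assumption, and `fgviewer_url` is a hypothetical helper, not part of any FGviewer API.

```python
FGVIEWER_BASE = "https://ccsmweb.uth.edu/FGviewer"


def fgviewer_url(hchr: str, hbp: int, tchr: str, tbp: int) -> str:
    """Build a link in the same shape as the one shown on the page.

    Assumption: the path is <Hchr>:<Hbp>/<Tchr>:<Tbp>, inferred from a
    single example link.
    """
    return f"{FGVIEWER_BASE}/{hchr}:{hbp}/{tchr}:{tbp}"


# Most frequent breakpoint of SULF1-RPS29 according to the summary section.
url = fgviewer_url("chr8", 70512988, "chr14", 50052767)
print(url)
```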

Main function of each fusion partner protein. (from UniProt)
Hgene | Tgene
. | .
Hgene: FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.
Tgene: FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

Retention analysis result of each fusion partner protein across 39 protein features of UniProt: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 regional features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the breakpoint is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note

- In-frame and not-retained protein feature among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note



Fusion Gene Sequence for SULF1-RPS29


For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To obtain the fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among all the predicted ones.
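
Each sequence record below starts with a header line that packs the fusion ID, partner genes, breakpoints, Ensembl transcripts, sequence length, and breakpoint position into one underscore-separated string. The parser below is a minimal sketch based only on the header shapes visible on this page. The field names are hypothetical, and it assumes gene symbols contain no underscore.

```python
import re

# Pattern inferred from the record headers on this page (an assumption,
# not a documented FusionGDB format).
HEADER_RE = re.compile(
    r"^>(?P<fusion_id>\d+)_(?P=fusion_id)_(?P<index>\d+)_(?P<fusion>[^_]+)"
    r"_(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)"
    r"_(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)"
    r"_length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?:nt|AA)"
    r"_BP=(?P<bp>\d+)(?:nt)?$"
)


def parse_header(line: str) -> dict:
    """Split one record header into named fields; numeric fields become int."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    record = match.groupdict()
    for key in ("fusion_id", "index", "hbp", "tbp", "length", "bp"):
        record[key] = int(record[key])
    return record


nt_header = (
    ">88036_88036_1_SULF1-RPS29_SULF1_chr8_70501376_ENST00000260128"
    "_RPS29_chr14_50052767_ENST00000245458_length(transcript)=1655nt_BP=1451nt"
)
aa_header = (
    ">88036_88036_1_SULF1-RPS29_SULF1_chr8_70501376_ENST00000260128"
    "_RPS29_chr14_50052767_ENST00000245458_length(amino acids)=280AA_BP=245"
)

nt_record = parse_header(nt_header)
aa_record = parse_header(aa_header)
print(nt_record["henst"], nt_record["tenst"], nt_record["length"], nt_record["bp"])
print(aa_record["kind"], aa_record["length"], aa_record["bp"])
```

Parsing the headers this way makes it straightforward to pair each transcript record with its amino-acid record through the shared `index`, `henst`, and `tenst` fields.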
>88036_88036_1_SULF1-RPS29_SULF1_chr8_70501376_ENST00000260128_RPS29_chr14_50052767_ENST00000245458_length(transcript)=1655nt_BP=1451nt
AGGTTACTTGACTGGGAGTTCTCAGACCTCCAGTTTCAGCCCTGCCCTCAGCCTCCAATCCGTAAGAGACACCCAGCCCCAGCAATTGGA
TTGGGCAGCCCGTCTTGACACACCACTGTGCTGAGTGCTTGAGGACGTGTTTCAACAGATGGTTGGGGTTAGTGTGTGTCATCACATTCG
AGTGGGGATTAAGAGAAGGAAGGCTGCCTTGCTGGAGCTGTGTGGTCTTCTCCAAGTGAGAGTCGCAGGCAATAGAACTACTTTGCTTTT
GGAGGAAAAGGAGGAATTCATTTTCAGCAGACACAAGAAAAGCAGTTTTTTTTTCAGGGATTCTTCACTTCTCTTGAACAAGGAACTCAC
TCAGAGACTAACACAAAGGAAGTAATTTCTTACCTGGTCATTATTTAGTCTACAATAAGTTCATCCTTCTTCAGTGTGACCAGTAAATTC
TTCCCATACTCTTGAAGAGAGCATAATTGGAATGGAGAGGTGCTGACGGCCACCCACCATCATCTAAAGAAGATAAACTTGGCAAATGAC
ATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTGAATACCTCTGAGAATAGAGATTGA
TTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTCAGTTTTGCAACATTGGACCAAATACAATG
AAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGGGAAGCCTCTGTTCGACTGTCAGATCCCCGAGGTTCAGA
GGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGCTTACCGATGATCAAGATGTGGAGCTGGGGTCCCTGCAA
GTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCAATGCCTTTGTGACTACACCCATGTGCTGCCCGTCACGG
TCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCTTCCCCCTCGTGGCAGGCCATGCAT
GAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCTTTTTTGGAAAATACCTCAATGAATATAATGGCAGCTAC
ATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTTTGTCGCAATGGCATCAAAGAAAAG
CATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGAGCATTAATTACTTCAAAATGTCTAAGAGAATGTATCCC
CATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAGTTTTCTAAACTGTACCCCAATGCT
TCCCAACACATTCGTGTCTGTTCAAACCGGCACGGTCTGATCCGGAAATATGGCCTCAATATGTGCCGCCAGTGTTTCCGTCAGTACGCG
AAGGATATCGGTTTCATTAAGTTGGACTAAATGCTCTTCCTTCAGAGGATTATCCGGGGCATCTACTCAATGAAAAACCATGATAATTCT

>88036_88036_1_SULF1-RPS29_SULF1_chr8_70501376_ENST00000260128_RPS29_chr14_50052767_ENST00000245458_length(amino acids)=280AA_BP=245
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHIRVCSNRHGLIRKYGLNMCRQCFRQY

--------------------------------------------------------------
>88036_88036_2_SULF1-RPS29_SULF1_chr8_70501376_ENST00000260128_RPS29_chr14_50052767_ENST00000396020_length(transcript)=2733nt_BP=1451nt
AGGTTACTTGACTGGGAGTTCTCAGACCTCCAGTTTCAGCCCTGCCCTCAGCCTCCAATCCGTAAGAGACACCCAGCCCCAGCAATTGGA
TTGGGCAGCCCGTCTTGACACACCACTGTGCTGAGTGCTTGAGGACGTGTTTCAACAGATGGTTGGGGTTAGTGTGTGTCATCACATTCG
AGTGGGGATTAAGAGAAGGAAGGCTGCCTTGCTGGAGCTGTGTGGTCTTCTCCAAGTGAGAGTCGCAGGCAATAGAACTACTTTGCTTTT
GGAGGAAAAGGAGGAATTCATTTTCAGCAGACACAAGAAAAGCAGTTTTTTTTTCAGGGATTCTTCACTTCTCTTGAACAAGGAACTCAC
TCAGAGACTAACACAAAGGAAGTAATTTCTTACCTGGTCATTATTTAGTCTACAATAAGTTCATCCTTCTTCAGTGTGACCAGTAAATTC
TTCCCATACTCTTGAAGAGAGCATAATTGGAATGGAGAGGTGCTGACGGCCACCCACCATCATCTAAAGAAGATAAACTTGGCAAATGAC
ATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTGAATACCTCTGAGAATAGAGATTGA
TTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTCAGTTTTGCAACATTGGACCAAATACAATG
AAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGGGAAGCCTCTGTTCGACTGTCAGATCCCCGAGGTTCAGA
GGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGCTTACCGATGATCAAGATGTGGAGCTGGGGTCCCTGCAA
GTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCAATGCCTTTGTGACTACACCCATGTGCTGCCCGTCACGG
TCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCTTCCCCCTCGTGGCAGGCCATGCAT
GAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCTTTTTTGGAAAATACCTCAATGAATATAATGGCAGCTAC
ATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTTTGTCGCAATGGCATCAAAGAAAAG
CATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGAGCATTAATTACTTCAAAATGTCTAAGAGAATGTATCCC
CATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAGTTTTCTAAACTGTACCCCAATGCT
TCCCAACACATTCGTGTCTGTTCAAACCGGCACGGTCTGATCCGGAAATATGGCCTCAATATGTGCCGCCAGTGTTTCCGTCAGTACGCG
AAGGATATCGGTTTCATTAAGAAAGACCTGAGCTGTCTTCCTTGGCACTGCCTATGGAGGTGACACCCATCTCCTCCATCATGGCCATCC
TGAGACCGCTCGCGAAGCCCAAGATCATCAAAAAGAGCACCAAGTTCACTGGGAACCAGTCAGACTGATATGTCAAAATTAAGGGTAACT
GGTGGAAACACAGAGGTATTAACAACAGGGTTCATAGAAGGTTTGAGGGCCAGATCTATGCCCAACATTGGTTATGGGAGAAACAAAAAG
ACAAAGCACATACTGCCCAGTGGCTTCTGGAAGTTCCTGGTCCACAACGTTAAGGAGCTGGAAGTACTGCTGGTGAGCAACAAATCTTAC
TGTGTTGAGATCACTCATGATGTTTCTTCCAAGAACTGCAAAGCCATCTTGGAAAGAGCAGCCCAGGTGGTCATCAGAGTCACCAATGCC
AATGCCAGCCTGCACAGTGCAGAAAGTGAATAGACAGTGAATGTGTTTGTTTTATTGGGGTTTAAATAAAACCAATAAAACTGTAAAAAC
AAAAACAACAAAAACCAGAGACCCTGGCTTTTGGACAGTTGTTCGTGTGGCTAATGCCCCCAACAGTTTTATTTTTTAAAATTTTATTTA
TGTATTGTTTTTGAGATGAAGTCTTCCTCTGACTGGAGTGCAGTGATGCAAGCTCAGCTCTCTGCAACTTCCACCTCCCAGGTTCAAGTG
ATTCTTCTGTCTCAGCCTCCCAAGTGGCTGGGATTACAGGTGCCCACTACCACACCTGACTAATTTTTGTATTTTTGTAGAGATGGGGTT
TCATCCTCTTGGCCAGGCTGGTTTCGAACTCCTGACCTCAAGGGATCCTCCCACCTCAGCTTCCCAAAGTGCTGGGATCACAGGCTTGTA
TTAGTTCATTTTCACACTGCTGATAAAGACATACCTAAGACTGGGAAGAAAAATAGGTTTATTGGACTCACAGTTCCATGTGGCTGGGGA
TGCCTCACAATTATGGTAGAAGGCAAAAGGCACTTCTTAGATGGCAGCGGCAAGCAGGAAATGAGGAAGACATGAAAGCAGATATCCCTT
ATAAAACCATCAGTTCTCGTGATACTTACTCACTACCATGAGAAAGTATGGGGGAAACCGCTCCCATGATTCAGTTATCTCCCACCGGGT

>88036_88036_2_SULF1-RPS29_SULF1_chr8_70501376_ENST00000260128_RPS29_chr14_50052767_ENST00000396020_length(amino acids)=291AA_BP=245
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHIRVCSNRHGLIRKYGLNMCRQCFRQY

--------------------------------------------------------------
>88036_88036_3_SULF1-RPS29_SULF1_chr8_70501376_ENST00000402687_RPS29_chr14_50052767_ENST00000245458_length(transcript)=1693nt_BP=1489nt
TTTGTCTGGCAGCGTTGGTTCTATGGGGGTGTGTGATGATAATAATACGCGGGCTTATATAACCGTCTTCATCTTGCGAGCACTTCGCAG
ACCGTCGCTAATGAATCTTGGGGCCGGTGTCGGGCCGGGGCGGCTTGATCGGCAACTAGGAAACCCCAGGCGCAGAGGCCAGGAGCGAGG
GCAGCGAGGATCAGAGGCCAGGCCTTCCCGGCTGCCGGCGCTCCTCGGAGGTCAGGGCAGATGAGGAACATGACTCTCCCCCTTCGGAGG
AGGAAGGAAGTCCCGCTGCCACCTTATCTCTGCTCCTCTGCCTCCTCCCTGTTCCCAGAGCTTTTTCTCTAGAGAAGATTTTGAAGGCGG
CTTTTGGATTCTTCACTTCTCTTGAACAAGGAACTCACTCAGAGACTAACACAAAGGAAGTAATTTCTTACCTGGTCATTATTTAGTCTA
CAATAAGTTCATCCTTCTTCAGTGTGACCAGTAAATTCTTCCCATACTCTTGAAGAGAGCATAATTGGAATGGAGAGGTGCTGACGGCCA
CCCACCATCATCTAAAGAAGATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTAT
CTGCAGATGTTCTGAATACCTCTGAGAATAGAGATTGATTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGA
CATTTTGTCAGTTTTGCAACATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGG
GAAGCCTCTGTTCGACTGTCAGATCCCCGAGGTTCAGAGGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGC
TTACCGATGATCAAGATGTGGAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCA
ATGCCTTTGTGACTACACCCATGTGCTGCCCGTCACGGTCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACA
ACGAGAACTGCTCTTCCCCCTCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCT
TTTTTGGAAAATACCTCAATGAATATAATGGCAGCTACATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCT
ATAATTACACTGTTTGTCGCAATGGCATCAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGA
GCATTAATTACTTCAAAATGTCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGG
ACTCAGCCCCACAGTTTTCTAAACTGTACCCCAATGCTTCCCAACACATTCGTGTCTGTTCAAACCGGCACGGTCTGATCCGGAAATATG
GCCTCAATATGTGCCGCCAGTGTTTCCGTCAGTACGCGAAGGATATCGGTTTCATTAAGTTGGACTAAATGCTCTTCCTTCAGAGGATTA

>88036_88036_3_SULF1-RPS29_SULF1_chr8_70501376_ENST00000402687_RPS29_chr14_50052767_ENST00000245458_length(amino acids)=280AA_BP=245
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHIRVCSNRHGLIRKYGLNMCRQCFRQY

--------------------------------------------------------------
>88036_88036_4_SULF1-RPS29_SULF1_chr8_70501376_ENST00000402687_RPS29_chr14_50052767_ENST00000396020_length(transcript)=2771nt_BP=1489nt
TTTGTCTGGCAGCGTTGGTTCTATGGGGGTGTGTGATGATAATAATACGCGGGCTTATATAACCGTCTTCATCTTGCGAGCACTTCGCAG
ACCGTCGCTAATGAATCTTGGGGCCGGTGTCGGGCCGGGGCGGCTTGATCGGCAACTAGGAAACCCCAGGCGCAGAGGCCAGGAGCGAGG
GCAGCGAGGATCAGAGGCCAGGCCTTCCCGGCTGCCGGCGCTCCTCGGAGGTCAGGGCAGATGAGGAACATGACTCTCCCCCTTCGGAGG
AGGAAGGAAGTCCCGCTGCCACCTTATCTCTGCTCCTCTGCCTCCTCCCTGTTCCCAGAGCTTTTTCTCTAGAGAAGATTTTGAAGGCGG
CTTTTGGATTCTTCACTTCTCTTGAACAAGGAACTCACTCAGAGACTAACACAAAGGAAGTAATTTCTTACCTGGTCATTATTTAGTCTA
CAATAAGTTCATCCTTCTTCAGTGTGACCAGTAAATTCTTCCCATACTCTTGAAGAGAGCATAATTGGAATGGAGAGGTGCTGACGGCCA
CCCACCATCATCTAAAGAAGATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTAT
CTGCAGATGTTCTGAATACCTCTGAGAATAGAGATTGATTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGA
CATTTTGTCAGTTTTGCAACATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGG
GAAGCCTCTGTTCGACTGTCAGATCCCCGAGGTTCAGAGGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGC
TTACCGATGATCAAGATGTGGAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCA
ATGCCTTTGTGACTACACCCATGTGCTGCCCGTCACGGTCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACA
ACGAGAACTGCTCTTCCCCCTCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCT
TTTTTGGAAAATACCTCAATGAATATAATGGCAGCTACATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCT
ATAATTACACTGTTTGTCGCAATGGCATCAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGA
GCATTAATTACTTCAAAATGTCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGG
ACTCAGCCCCACAGTTTTCTAAACTGTACCCCAATGCTTCCCAACACATTCGTGTCTGTTCAAACCGGCACGGTCTGATCCGGAAATATG
GCCTCAATATGTGCCGCCAGTGTTTCCGTCAGTACGCGAAGGATATCGGTTTCATTAAGAAAGACCTGAGCTGTCTTCCTTGGCACTGCC
TATGGAGGTGACACCCATCTCCTCCATCATGGCCATCCTGAGACCGCTCGCGAAGCCCAAGATCATCAAAAAGAGCACCAAGTTCACTGG
GAACCAGTCAGACTGATATGTCAAAATTAAGGGTAACTGGTGGAAACACAGAGGTATTAACAACAGGGTTCATAGAAGGTTTGAGGGCCA
GATCTATGCCCAACATTGGTTATGGGAGAAACAAAAAGACAAAGCACATACTGCCCAGTGGCTTCTGGAAGTTCCTGGTCCACAACGTTA
AGGAGCTGGAAGTACTGCTGGTGAGCAACAAATCTTACTGTGTTGAGATCACTCATGATGTTTCTTCCAAGAACTGCAAAGCCATCTTGG
AAAGAGCAGCCCAGGTGGTCATCAGAGTCACCAATGCCAATGCCAGCCTGCACAGTGCAGAAAGTGAATAGACAGTGAATGTGTTTGTTT
TATTGGGGTTTAAATAAAACCAATAAAACTGTAAAAACAAAAACAACAAAAACCAGAGACCCTGGCTTTTGGACAGTTGTTCGTGTGGCT
AATGCCCCCAACAGTTTTATTTTTTAAAATTTTATTTATGTATTGTTTTTGAGATGAAGTCTTCCTCTGACTGGAGTGCAGTGATGCAAG
CTCAGCTCTCTGCAACTTCCACCTCCCAGGTTCAAGTGATTCTTCTGTCTCAGCCTCCCAAGTGGCTGGGATTACAGGTGCCCACTACCA
CACCTGACTAATTTTTGTATTTTTGTAGAGATGGGGTTTCATCCTCTTGGCCAGGCTGGTTTCGAACTCCTGACCTCAAGGGATCCTCCC
ACCTCAGCTTCCCAAAGTGCTGGGATCACAGGCTTGTATTAGTTCATTTTCACACTGCTGATAAAGACATACCTAAGACTGGGAAGAAAA
ATAGGTTTATTGGACTCACAGTTCCATGTGGCTGGGGATGCCTCACAATTATGGTAGAAGGCAAAAGGCACTTCTTAGATGGCAGCGGCA
AGCAGGAAATGAGGAAGACATGAAAGCAGATATCCCTTATAAAACCATCAGTTCTCGTGATACTTACTCACTACCATGAGAAAGTATGGG

>88036_88036_4_SULF1-RPS29_SULF1_chr8_70501376_ENST00000402687_RPS29_chr14_50052767_ENST00000396020_length(amino acids)=291AA_BP=245
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHIRVCSNRHGLIRKYGLNMCRQCFRQY

--------------------------------------------------------------
>88036_88036_5_SULF1-RPS29_SULF1_chr8_70501376_ENST00000419716_RPS29_chr14_50052767_ENST00000245458_length(transcript)=1499nt_BP=1295nt
GTGATGATAATAATACGCGGGCTTATATAACCGTCTTCATCTTGCGAGCACTTCGCAGACCGTCGCTAATGAATCTTGGGGCCGGTGTCG
GGCCGGGGCGGCTTGATCGGCAACTAGGAAACCCCAGGCGCAGAGGCCAGGAGCGAGGGCAGCGAGGATCAGAGGCCAGGCCTTCCCGGC
TGCCGGCGCTCCTCGGAGGTCAGGGCAGATGAGGAACATGACTCTCCCCCTTCGGAGGAGGAAGGAAGTCCCGCTGCCACCTTATCTCTG
CTCCTCTGCCTCCTCCCTGTTCCCAGAGCTTTTTCTCTAGAGAAGATTTTGAAGGCGGCTTTTGTGCTGACGGCCACCCACCATCATCTA
AAGAAGATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTG
AATACCTCTGAGAATAGAGATTGATTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTCAGTTT
TGCAACATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGGGAAGCCTCTGTTCG
ACTGTCAGATCCCCGAGGTTCAGAGGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGCTTACCGATGATCAA
GATGTGGAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCAATGCCTTTGTGACT
ACACCCATGTGCTGCCCGTCACGGTCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCT
TCCCCCTCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCTTTTTTGGAAAATAC
CTCAATGAATATAATGGCAGCTACATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTT
TGTCGCAATGGCATCAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGAGCATTAATTACTTC
AAAATGTCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAG
TTTTCTAAACTGTACCCCAATGCTTCCCAACACATTCGTGTCTGTTCAAACCGGCACGGTCTGATCCGGAAATATGGCCTCAATATGTGC
CGCCAGTGTTTCCGTCAGTACGCGAAGGATATCGGTTTCATTAAGTTGGACTAAATGCTCTTCCTTCAGAGGATTATCCGGGGCATCTAC

>88036_88036_5_SULF1-RPS29_SULF1_chr8_70501376_ENST00000419716_RPS29_chr14_50052767_ENST00000245458_length(amino acids)=280AA_BP=245
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHIRVCSNRHGLIRKYGLNMCRQCFRQY

--------------------------------------------------------------
>88036_88036_6_SULF1-RPS29_SULF1_chr8_70501376_ENST00000419716_RPS29_chr14_50052767_ENST00000396020_length(transcript)=2577nt_BP=1295nt
GTGATGATAATAATACGCGGGCTTATATAACCGTCTTCATCTTGCGAGCACTTCGCAGACCGTCGCTAATGAATCTTGGGGCCGGTGTCG
GGCCGGGGCGGCTTGATCGGCAACTAGGAAACCCCAGGCGCAGAGGCCAGGAGCGAGGGCAGCGAGGATCAGAGGCCAGGCCTTCCCGGC
TGCCGGCGCTCCTCGGAGGTCAGGGCAGATGAGGAACATGACTCTCCCCCTTCGGAGGAGGAAGGAAGTCCCGCTGCCACCTTATCTCTG
CTCCTCTGCCTCCTCCCTGTTCCCAGAGCTTTTTCTCTAGAGAAGATTTTGAAGGCGGCTTTTGTGCTGACGGCCACCCACCATCATCTA
AAGAAGATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTG
AATACCTCTGAGAATAGAGATTGATTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTCAGTTT
TGCAACATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGGGAAGCCTCTGTTCG
ACTGTCAGATCCCCGAGGTTCAGAGGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGCTTACCGATGATCAA
GATGTGGAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCAATGCCTTTGTGACT
ACACCCATGTGCTGCCCGTCACGGTCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCT
TCCCCCTCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCTTTTTTGGAAAATAC
CTCAATGAATATAATGGCAGCTACATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTT
TGTCGCAATGGCATCAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGAGCATTAATTACTTC
AAAATGTCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAG
TTTTCTAAACTGTACCCCAATGCTTCCCAACACATTCGTGTCTGTTCAAACCGGCACGGTCTGATCCGGAAATATGGCCTCAATATGTGC
CGCCAGTGTTTCCGTCAGTACGCGAAGGATATCGGTTTCATTAAGAAAGACCTGAGCTGTCTTCCTTGGCACTGCCTATGGAGGTGACAC
CCATCTCCTCCATCATGGCCATCCTGAGACCGCTCGCGAAGCCCAAGATCATCAAAAAGAGCACCAAGTTCACTGGGAACCAGTCAGACT
GATATGTCAAAATTAAGGGTAACTGGTGGAAACACAGAGGTATTAACAACAGGGTTCATAGAAGGTTTGAGGGCCAGATCTATGCCCAAC
ATTGGTTATGGGAGAAACAAAAAGACAAAGCACATACTGCCCAGTGGCTTCTGGAAGTTCCTGGTCCACAACGTTAAGGAGCTGGAAGTA
CTGCTGGTGAGCAACAAATCTTACTGTGTTGAGATCACTCATGATGTTTCTTCCAAGAACTGCAAAGCCATCTTGGAAAGAGCAGCCCAG
GTGGTCATCAGAGTCACCAATGCCAATGCCAGCCTGCACAGTGCAGAAAGTGAATAGACAGTGAATGTGTTTGTTTTATTGGGGTTTAAA
TAAAACCAATAAAACTGTAAAAACAAAAACAACAAAAACCAGAGACCCTGGCTTTTGGACAGTTGTTCGTGTGGCTAATGCCCCCAACAG
TTTTATTTTTTAAAATTTTATTTATGTATTGTTTTTGAGATGAAGTCTTCCTCTGACTGGAGTGCAGTGATGCAAGCTCAGCTCTCTGCA
ACTTCCACCTCCCAGGTTCAAGTGATTCTTCTGTCTCAGCCTCCCAAGTGGCTGGGATTACAGGTGCCCACTACCACACCTGACTAATTT
TTGTATTTTTGTAGAGATGGGGTTTCATCCTCTTGGCCAGGCTGGTTTCGAACTCCTGACCTCAAGGGATCCTCCCACCTCAGCTTCCCA
AAGTGCTGGGATCACAGGCTTGTATTAGTTCATTTTCACACTGCTGATAAAGACATACCTAAGACTGGGAAGAAAAATAGGTTTATTGGA
CTCACAGTTCCATGTGGCTGGGGATGCCTCACAATTATGGTAGAAGGCAAAAGGCACTTCTTAGATGGCAGCGGCAAGCAGGAAATGAGG
AAGACATGAAAGCAGATATCCCTTATAAAACCATCAGTTCTCGTGATACTTACTCACTACCATGAGAAAGTATGGGGGAAACCGCTCCCA

>88036_88036_6_SULF1-RPS29_SULF1_chr8_70501376_ENST00000419716_RPS29_chr14_50052767_ENST00000396020_length(amino acids)=291AA_BP=245
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHIRVCSNRHGLIRKYGLNMCRQCFRQY

--------------------------------------------------------------
>88036_88036_7_SULF1-RPS29_SULF1_chr8_70501376_ENST00000458141_RPS29_chr14_50052767_ENST00000245458_length(transcript)=1493nt_BP=1289nt
AGGTTACTTGACTGGGAGTTCTCAGACCTCCAGTTTCAGCCCTGCCCTCAGCCTCCAATCCGTAAGAGACACCCAGCCCCAGCAATTGGA
TTGGGCAGCCCGTCTTGACACACCACTGTGCTGAGTGCTTGAGGACGTGTTTCAACAGATGGTTGGGGTTAGTGTGTGTCATCACATTCG
AGTGGGGATTAAGAGAAGGAAGGCTGCCTTGCTGGAGCTGTGTGGTCTTCTCCAAGTGAGAGTCGCAGGCAATAGAACTACTTTGCTTTT
GGAGGAAAAGGAGGAATTCATTTTCAGCAGACACAAGAAAAGCAGTTTTTTTTTCAGGTGCTGACGGCCACCCACCATCATCTAAAGAAG
ATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTGAATACC
TCTGAGAATAGAGATTGATTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTCAGTTTTGCAAC
ATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGGGAAGCCTCTGTTCGACTGTC
AGATCCCCGAGGTTCAGAGGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGCTTACCGATGATCAAGATGTG
GAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCAATGCCTTTGTGACTACACCC
ATGTGCTGCCCGTCACGGTCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCTTCCCCC
TCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCTTTTTTGGAAAATACCTCAAT
GAATATAATGGCAGCTACATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTTTGTCGC
AATGGCATCAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGAGCATTAATTACTTCAAAATG
TCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAGTTTTCT
AAACTGTACCCCAATGCTTCCCAACACATTCGTGTCTGTTCAAACCGGCACGGTCTGATCCGGAAATATGGCCTCAATATGTGCCGCCAG
TGTTTCCGTCAGTACGCGAAGGATATCGGTTTCATTAAGTTGGACTAAATGCTCTTCCTTCAGAGGATTATCCGGGGCATCTACTCAATG

>88036_88036_7_SULF1-RPS29_SULF1_chr8_70501376_ENST00000458141_RPS29_chr14_50052767_ENST00000245458_length(amino acids)=280AA_BP=245
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHIRVCSNRHGLIRKYGLNMCRQCFRQY

--------------------------------------------------------------
>88036_88036_8_SULF1-RPS29_SULF1_chr8_70501376_ENST00000458141_RPS29_chr14_50052767_ENST00000396020_length(transcript)=2571nt_BP=1289nt
AGGTTACTTGACTGGGAGTTCTCAGACCTCCAGTTTCAGCCCTGCCCTCAGCCTCCAATCCGTAAGAGACACCCAGCCCCAGCAATTGGA
TTGGGCAGCCCGTCTTGACACACCACTGTGCTGAGTGCTTGAGGACGTGTTTCAACAGATGGTTGGGGTTAGTGTGTGTCATCACATTCG
AGTGGGGATTAAGAGAAGGAAGGCTGCCTTGCTGGAGCTGTGTGGTCTTCTCCAAGTGAGAGTCGCAGGCAATAGAACTACTTTGCTTTT
GGAGGAAAAGGAGGAATTCATTTTCAGCAGACACAAGAAAAGCAGTTTTTTTTTCAGGTGCTGACGGCCACCCACCATCATCTAAAGAAG
ATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTGAATACC
TCTGAGAATAGAGATTGATTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTCAGTTTTGCAAC
ATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGGGAAGCCTCTGTTCGACTGTC
AGATCCCCGAGGTTCAGAGGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGCTTACCGATGATCAAGATGTG
GAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCAATGCCTTTGTGACTACACCC
ATGTGCTGCCCGTCACGGTCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCTTCCCCC
TCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCTTTTTTGGAAAATACCTCAAT
GAATATAATGGCAGCTACATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTTTGTCGC
AATGGCATCAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGAGCATTAATTACTTCAAAATG
TCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAGTTTTCT
AAACTGTACCCCAATGCTTCCCAACACATTCGTGTCTGTTCAAACCGGCACGGTCTGATCCGGAAATATGGCCTCAATATGTGCCGCCAG
TGTTTCCGTCAGTACGCGAAGGATATCGGTTTCATTAAGAAAGACCTGAGCTGTCTTCCTTGGCACTGCCTATGGAGGTGACACCCATCT
CCTCCATCATGGCCATCCTGAGACCGCTCGCGAAGCCCAAGATCATCAAAAAGAGCACCAAGTTCACTGGGAACCAGTCAGACTGATATG
TCAAAATTAAGGGTAACTGGTGGAAACACAGAGGTATTAACAACAGGGTTCATAGAAGGTTTGAGGGCCAGATCTATGCCCAACATTGGT
TATGGGAGAAACAAAAAGACAAAGCACATACTGCCCAGTGGCTTCTGGAAGTTCCTGGTCCACAACGTTAAGGAGCTGGAAGTACTGCTG
GTGAGCAACAAATCTTACTGTGTTGAGATCACTCATGATGTTTCTTCCAAGAACTGCAAAGCCATCTTGGAAAGAGCAGCCCAGGTGGTC
ATCAGAGTCACCAATGCCAATGCCAGCCTGCACAGTGCAGAAAGTGAATAGACAGTGAATGTGTTTGTTTTATTGGGGTTTAAATAAAAC
CAATAAAACTGTAAAAACAAAAACAACAAAAACCAGAGACCCTGGCTTTTGGACAGTTGTTCGTGTGGCTAATGCCCCCAACAGTTTTAT
TTTTTAAAATTTTATTTATGTATTGTTTTTGAGATGAAGTCTTCCTCTGACTGGAGTGCAGTGATGCAAGCTCAGCTCTCTGCAACTTCC
ACCTCCCAGGTTCAAGTGATTCTTCTGTCTCAGCCTCCCAAGTGGCTGGGATTACAGGTGCCCACTACCACACCTGACTAATTTTTGTAT
TTTTGTAGAGATGGGGTTTCATCCTCTTGGCCAGGCTGGTTTCGAACTCCTGACCTCAAGGGATCCTCCCACCTCAGCTTCCCAAAGTGC
TGGGATCACAGGCTTGTATTAGTTCATTTTCACACTGCTGATAAAGACATACCTAAGACTGGGAAGAAAAATAGGTTTATTGGACTCACA
GTTCCATGTGGCTGGGGATGCCTCACAATTATGGTAGAAGGCAAAAGGCACTTCTTAGATGGCAGCGGCAAGCAGGAAATGAGGAAGACA
TGAAAGCAGATATCCCTTATAAAACCATCAGTTCTCGTGATACTTACTCACTACCATGAGAAAGTATGGGGGAAACCGCTCCCATGATTC

>88036_88036_8_SULF1-RPS29_SULF1_chr8_70501376_ENST00000458141_RPS29_chr14_50052767_ENST00000396020_length(amino acids)=291AA_BP=245
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHIRVCSNRHGLIRKYGLNMCRQCFRQY

--------------------------------------------------------------
>88036_88036_9_SULF1-RPS29_SULF1_chr8_70512988_ENST00000260128_RPS29_chr14_50044571_ENST00000396020_length(transcript)=2784nt_BP=1602nt
AGGTTACTTGACTGGGAGTTCTCAGACCTCCAGTTTCAGCCCTGCCCTCAGCCTCCAATCCGTAAGAGACACCCAGCCCCAGCAATTGGA
TTGGGCAGCCCGTCTTGACACACCACTGTGCTGAGTGCTTGAGGACGTGTTTCAACAGATGGTTGGGGTTAGTGTGTGTCATCACATTCG
AGTGGGGATTAAGAGAAGGAAGGCTGCCTTGCTGGAGCTGTGTGGTCTTCTCCAAGTGAGAGTCGCAGGCAATAGAACTACTTTGCTTTT
GGAGGAAAAGGAGGAATTCATTTTCAGCAGACACAAGAAAAGCAGTTTTTTTTTCAGGGATTCTTCACTTCTCTTGAACAAGGAACTCAC
TCAGAGACTAACACAAAGGAAGTAATTTCTTACCTGGTCATTATTTAGTCTACAATAAGTTCATCCTTCTTCAGTGTGACCAGTAAATTC
TTCCCATACTCTTGAAGAGAGCATAATTGGAATGGAGAGGTGCTGACGGCCACCCACCATCATCTAAAGAAGATAAACTTGGCAAATGAC
ATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTGAATACCTCTGAGAATAGAGATTGA
TTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTCAGTTTTGCAACATTGGACCAAATACAATG
AAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGGGAAGCCTCTGTTCGACTGTCAGATCCCCGAGGTTCAGA
GGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGCTTACCGATGATCAAGATGTGGAGCTGGGGTCCCTGCAA
GTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCAATGCCTTTGTGACTACACCCATGTGCTGCCCGTCACGG
TCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCTTCCCCCTCGTGGCAGGCCATGCAT
GAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCTTTTTTGGAAAATACCTCAATGAATATAATGGCAGCTAC
ATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTTTGTCGCAATGGCATCAAAGAAAAG
CATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGAGCATTAATTACTTCAAAATGTCTAAGAGAATGTATCCC
CATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAGTTTTCTAAACTGTACCCCAATGCT
TCCCAACACATAACTCCTAGTTATAACTATGCACCAAATATGGATAAACACTGGATTATGCAGTACACAGGACCAATGCTGCCCATCCAC
ATGGAATTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGTGGATGATTCTGTGGAGAGGAAAGACCTGAGCTGTCTT
CCTTGGCACTGCCTATGGAGGTGACACCCATCTCCTCCATCATGGCCATCCTGAGACCGCTCGCGAAGCCCAAGATCATCAAAAAGAGCA
CCAAGTTCACTGGGAACCAGTCAGACTGATATGTCAAAATTAAGGGTAACTGGTGGAAACACAGAGGTATTAACAACAGGGTTCATAGAA
GGTTTGAGGGCCAGATCTATGCCCAACATTGGTTATGGGAGAAACAAAAAGACAAAGCACATACTGCCCAGTGGCTTCTGGAAGTTCCTG
GTCCACAACGTTAAGGAGCTGGAAGTACTGCTGGTGAGCAACAAATCTTACTGTGTTGAGATCACTCATGATGTTTCTTCCAAGAACTGC
AAAGCCATCTTGGAAAGAGCAGCCCAGGTGGTCATCAGAGTCACCAATGCCAATGCCAGCCTGCACAGTGCAGAAAGTGAATAGACAGTG
AATGTGTTTGTTTTATTGGGGTTTAAATAAAACCAATAAAACTGTAAAAACAAAAACAACAAAAACCAGAGACCCTGGCTTTTGGACAGT
TGTTCGTGTGGCTAATGCCCCCAACAGTTTTATTTTTTAAAATTTTATTTATGTATTGTTTTTGAGATGAAGTCTTCCTCTGACTGGAGT
GCAGTGATGCAAGCTCAGCTCTCTGCAACTTCCACCTCCCAGGTTCAAGTGATTCTTCTGTCTCAGCCTCCCAAGTGGCTGGGATTACAG
GTGCCCACTACCACACCTGACTAATTTTTGTATTTTTGTAGAGATGGGGTTTCATCCTCTTGGCCAGGCTGGTTTCGAACTCCTGACCTC
AAGGGATCCTCCCACCTCAGCTTCCCAAAGTGCTGGGATCACAGGCTTGTATTAGTTCATTTTCACACTGCTGATAAAGACATACCTAAG
ACTGGGAAGAAAAATAGGTTTATTGGACTCACAGTTCCATGTGGCTGGGGATGCCTCACAATTATGGTAGAAGGCAAAAGGCACTTCTTA
GATGGCAGCGGCAAGCAGGAAATGAGGAAGACATGAAAGCAGATATCCCTTATAAAACCATCAGTTCTCGTGATACTTACTCACTACCAT

>88036_88036_9_SULF1-RPS29_SULF1_chr8_70512988_ENST00000260128_RPS29_chr14_50044571_ENST00000396020_length(amino acids)=308AA_BP=
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPI

--------------------------------------------------------------
>88036_88036_10_SULF1-RPS29_SULF1_chr8_70512988_ENST00000402687_RPS29_chr14_50044571_ENST00000396020_length(transcript)=2822nt_BP=1640nt
TTTGTCTGGCAGCGTTGGTTCTATGGGGGTGTGTGATGATAATAATACGCGGGCTTATATAACCGTCTTCATCTTGCGAGCACTTCGCAG
ACCGTCGCTAATGAATCTTGGGGCCGGTGTCGGGCCGGGGCGGCTTGATCGGCAACTAGGAAACCCCAGGCGCAGAGGCCAGGAGCGAGG
GCAGCGAGGATCAGAGGCCAGGCCTTCCCGGCTGCCGGCGCTCCTCGGAGGTCAGGGCAGATGAGGAACATGACTCTCCCCCTTCGGAGG
AGGAAGGAAGTCCCGCTGCCACCTTATCTCTGCTCCTCTGCCTCCTCCCTGTTCCCAGAGCTTTTTCTCTAGAGAAGATTTTGAAGGCGG
CTTTTGGATTCTTCACTTCTCTTGAACAAGGAACTCACTCAGAGACTAACACAAAGGAAGTAATTTCTTACCTGGTCATTATTTAGTCTA
CAATAAGTTCATCCTTCTTCAGTGTGACCAGTAAATTCTTCCCATACTCTTGAAGAGAGCATAATTGGAATGGAGAGGTGCTGACGGCCA
CCCACCATCATCTAAAGAAGATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTAT
CTGCAGATGTTCTGAATACCTCTGAGAATAGAGATTGATTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGA
CATTTTGTCAGTTTTGCAACATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGG
GAAGCCTCTGTTCGACTGTCAGATCCCCGAGGTTCAGAGGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGC
TTACCGATGATCAAGATGTGGAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCA
ATGCCTTTGTGACTACACCCATGTGCTGCCCGTCACGGTCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACA
ACGAGAACTGCTCTTCCCCCTCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCT
TTTTTGGAAAATACCTCAATGAATATAATGGCAGCTACATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCT
ATAATTACACTGTTTGTCGCAATGGCATCAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGA
GCATTAATTACTTCAAAATGTCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGG
ACTCAGCCCCACAGTTTTCTAAACTGTACCCCAATGCTTCCCAACACATAACTCCTAGTTATAACTATGCACCAAATATGGATAAACACT
GGATTATGCAGTACACAGGACCAATGCTGCCCATCCACATGGAATTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAG
TGGATGATTCTGTGGAGAGGAAAGACCTGAGCTGTCTTCCTTGGCACTGCCTATGGAGGTGACACCCATCTCCTCCATCATGGCCATCCT
GAGACCGCTCGCGAAGCCCAAGATCATCAAAAAGAGCACCAAGTTCACTGGGAACCAGTCAGACTGATATGTCAAAATTAAGGGTAACTG
GTGGAAACACAGAGGTATTAACAACAGGGTTCATAGAAGGTTTGAGGGCCAGATCTATGCCCAACATTGGTTATGGGAGAAACAAAAAGA
CAAAGCACATACTGCCCAGTGGCTTCTGGAAGTTCCTGGTCCACAACGTTAAGGAGCTGGAAGTACTGCTGGTGAGCAACAAATCTTACT
GTGTTGAGATCACTCATGATGTTTCTTCCAAGAACTGCAAAGCCATCTTGGAAAGAGCAGCCCAGGTGGTCATCAGAGTCACCAATGCCA
ATGCCAGCCTGCACAGTGCAGAAAGTGAATAGACAGTGAATGTGTTTGTTTTATTGGGGTTTAAATAAAACCAATAAAACTGTAAAAACA
AAAACAACAAAAACCAGAGACCCTGGCTTTTGGACAGTTGTTCGTGTGGCTAATGCCCCCAACAGTTTTATTTTTTAAAATTTTATTTAT
GTATTGTTTTTGAGATGAAGTCTTCCTCTGACTGGAGTGCAGTGATGCAAGCTCAGCTCTCTGCAACTTCCACCTCCCAGGTTCAAGTGA
TTCTTCTGTCTCAGCCTCCCAAGTGGCTGGGATTACAGGTGCCCACTACCACACCTGACTAATTTTTGTATTTTTGTAGAGATGGGGTTT
CATCCTCTTGGCCAGGCTGGTTTCGAACTCCTGACCTCAAGGGATCCTCCCACCTCAGCTTCCCAAAGTGCTGGGATCACAGGCTTGTAT
TAGTTCATTTTCACACTGCTGATAAAGACATACCTAAGACTGGGAAGAAAAATAGGTTTATTGGACTCACAGTTCCATGTGGCTGGGGAT
GCCTCACAATTATGGTAGAAGGCAAAAGGCACTTCTTAGATGGCAGCGGCAAGCAGGAAATGAGGAAGACATGAAAGCAGATATCCCTTA
TAAAACCATCAGTTCTCGTGATACTTACTCACTACCATGAGAAAGTATGGGGGAAACCGCTCCCATGATTCAGTTATCTCCCACCGGGTC

>88036_88036_10_SULF1-RPS29_SULF1_chr8_70512988_ENST00000402687_RPS29_chr14_50044571_ENST00000396020_length(amino acids)=308AA_BP=
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPI

--------------------------------------------------------------
>88036_88036_11_SULF1-RPS29_SULF1_chr8_70512988_ENST00000419716_RPS29_chr14_50044571_ENST00000396020_length(transcript)=2628nt_BP=1446nt
GTGATGATAATAATACGCGGGCTTATATAACCGTCTTCATCTTGCGAGCACTTCGCAGACCGTCGCTAATGAATCTTGGGGCCGGTGTCG
GGCCGGGGCGGCTTGATCGGCAACTAGGAAACCCCAGGCGCAGAGGCCAGGAGCGAGGGCAGCGAGGATCAGAGGCCAGGCCTTCCCGGC
TGCCGGCGCTCCTCGGAGGTCAGGGCAGATGAGGAACATGACTCTCCCCCTTCGGAGGAGGAAGGAAGTCCCGCTGCCACCTTATCTCTG
CTCCTCTGCCTCCTCCCTGTTCCCAGAGCTTTTTCTCTAGAGAAGATTTTGAAGGCGGCTTTTGTGCTGACGGCCACCCACCATCATCTA
AAGAAGATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTG
AATACCTCTGAGAATAGAGATTGATTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTCAGTTT
TGCAACATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGGGAAGCCTCTGTTCG
ACTGTCAGATCCCCGAGGTTCAGAGGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGCTTACCGATGATCAA
GATGTGGAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCAATGCCTTTGTGACT
ACACCCATGTGCTGCCCGTCACGGTCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCT
TCCCCCTCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCTTTTTTGGAAAATAC
CTCAATGAATATAATGGCAGCTACATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTT
TGTCGCAATGGCATCAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGAGCATTAATTACTTC
AAAATGTCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAG
TTTTCTAAACTGTACCCCAATGCTTCCCAACACATAACTCCTAGTTATAACTATGCACCAAATATGGATAAACACTGGATTATGCAGTAC
ACAGGACCAATGCTGCCCATCCACATGGAATTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGTGGATGATTCTGTG
GAGAGGAAAGACCTGAGCTGTCTTCCTTGGCACTGCCTATGGAGGTGACACCCATCTCCTCCATCATGGCCATCCTGAGACCGCTCGCGA
AGCCCAAGATCATCAAAAAGAGCACCAAGTTCACTGGGAACCAGTCAGACTGATATGTCAAAATTAAGGGTAACTGGTGGAAACACAGAG
GTATTAACAACAGGGTTCATAGAAGGTTTGAGGGCCAGATCTATGCCCAACATTGGTTATGGGAGAAACAAAAAGACAAAGCACATACTG
CCCAGTGGCTTCTGGAAGTTCCTGGTCCACAACGTTAAGGAGCTGGAAGTACTGCTGGTGAGCAACAAATCTTACTGTGTTGAGATCACT
CATGATGTTTCTTCCAAGAACTGCAAAGCCATCTTGGAAAGAGCAGCCCAGGTGGTCATCAGAGTCACCAATGCCAATGCCAGCCTGCAC
AGTGCAGAAAGTGAATAGACAGTGAATGTGTTTGTTTTATTGGGGTTTAAATAAAACCAATAAAACTGTAAAAACAAAAACAACAAAAAC
CAGAGACCCTGGCTTTTGGACAGTTGTTCGTGTGGCTAATGCCCCCAACAGTTTTATTTTTTAAAATTTTATTTATGTATTGTTTTTGAG
ATGAAGTCTTCCTCTGACTGGAGTGCAGTGATGCAAGCTCAGCTCTCTGCAACTTCCACCTCCCAGGTTCAAGTGATTCTTCTGTCTCAG
CCTCCCAAGTGGCTGGGATTACAGGTGCCCACTACCACACCTGACTAATTTTTGTATTTTTGTAGAGATGGGGTTTCATCCTCTTGGCCA
GGCTGGTTTCGAACTCCTGACCTCAAGGGATCCTCCCACCTCAGCTTCCCAAAGTGCTGGGATCACAGGCTTGTATTAGTTCATTTTCAC
ACTGCTGATAAAGACATACCTAAGACTGGGAAGAAAAATAGGTTTATTGGACTCACAGTTCCATGTGGCTGGGGATGCCTCACAATTATG
GTAGAAGGCAAAAGGCACTTCTTAGATGGCAGCGGCAAGCAGGAAATGAGGAAGACATGAAAGCAGATATCCCTTATAAAACCATCAGTT
CTCGTGATACTTACTCACTACCATGAGAAAGTATGGGGGAAACCGCTCCCATGATTCAGTTATCTCCCACCGGGTCCCTCTCACAACATG

>88036_88036_11_SULF1-RPS29_SULF1_chr8_70512988_ENST00000419716_RPS29_chr14_50044571_ENST00000396020_length(amino acids)=308AA_BP=
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPI

--------------------------------------------------------------
>88036_88036_12_SULF1-RPS29_SULF1_chr8_70512988_ENST00000458141_RPS29_chr14_50044571_ENST00000396020_length(transcript)=2622nt_BP=1440nt
AGGTTACTTGACTGGGAGTTCTCAGACCTCCAGTTTCAGCCCTGCCCTCAGCCTCCAATCCGTAAGAGACACCCAGCCCCAGCAATTGGA
TTGGGCAGCCCGTCTTGACACACCACTGTGCTGAGTGCTTGAGGACGTGTTTCAACAGATGGTTGGGGTTAGTGTGTGTCATCACATTCG
AGTGGGGATTAAGAGAAGGAAGGCTGCCTTGCTGGAGCTGTGTGGTCTTCTCCAAGTGAGAGTCGCAGGCAATAGAACTACTTTGCTTTT
GGAGGAAAAGGAGGAATTCATTTTCAGCAGACACAAGAAAAGCAGTTTTTTTTTCAGGTGCTGACGGCCACCCACCATCATCTAAAGAAG
ATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTGCAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTGAATACC
TCTGAGAATAGAGATTGATTATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTCAGTTTTGCAAC
ATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGCACAGAATTGCTGGGAAGCCTCTGTTCGACTGTC
AGATCCCCGAGGTTCAGAGGACGGATACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGCTTACCGATGATCAAGATGTG
GAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGGCCACCTTCATCAATGCCTTTGTGACTACACCC
ATGTGCTGCCCGTCACGGTCCTCCATGCTCACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCTTCCCCC
TCGTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAACAGCCTTTTTTGGAAAATACCTCAAT
GAATATAATGGCAGCTACATCCCCCCTGGGTGGCGAGAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTTTGTCGC
AATGGCATCAAAGAAAAGCATGGATTTGATTATGCAAAGGACTACTTCACAGACTTAATCACTAACGAGAGCATTAATTACTTCAAAATG
TCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGCCACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAGTTTTCT
AAACTGTACCCCAATGCTTCCCAACACATAACTCCTAGTTATAACTATGCACCAAATATGGATAAACACTGGATTATGCAGTACACAGGA
CCAATGCTGCCCATCCACATGGAATTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGTGGATGATTCTGTGGAGAGG
AAAGACCTGAGCTGTCTTCCTTGGCACTGCCTATGGAGGTGACACCCATCTCCTCCATCATGGCCATCCTGAGACCGCTCGCGAAGCCCA
AGATCATCAAAAAGAGCACCAAGTTCACTGGGAACCAGTCAGACTGATATGTCAAAATTAAGGGTAACTGGTGGAAACACAGAGGTATTA
ACAACAGGGTTCATAGAAGGTTTGAGGGCCAGATCTATGCCCAACATTGGTTATGGGAGAAACAAAAAGACAAAGCACATACTGCCCAGT
GGCTTCTGGAAGTTCCTGGTCCACAACGTTAAGGAGCTGGAAGTACTGCTGGTGAGCAACAAATCTTACTGTGTTGAGATCACTCATGAT
GTTTCTTCCAAGAACTGCAAAGCCATCTTGGAAAGAGCAGCCCAGGTGGTCATCAGAGTCACCAATGCCAATGCCAGCCTGCACAGTGCA
GAAAGTGAATAGACAGTGAATGTGTTTGTTTTATTGGGGTTTAAATAAAACCAATAAAACTGTAAAAACAAAAACAACAAAAACCAGAGA
CCCTGGCTTTTGGACAGTTGTTCGTGTGGCTAATGCCCCCAACAGTTTTATTTTTTAAAATTTTATTTATGTATTGTTTTTGAGATGAAG
TCTTCCTCTGACTGGAGTGCAGTGATGCAAGCTCAGCTCTCTGCAACTTCCACCTCCCAGGTTCAAGTGATTCTTCTGTCTCAGCCTCCC
AAGTGGCTGGGATTACAGGTGCCCACTACCACACCTGACTAATTTTTGTATTTTTGTAGAGATGGGGTTTCATCCTCTTGGCCAGGCTGG
TTTCGAACTCCTGACCTCAAGGGATCCTCCCACCTCAGCTTCCCAAAGTGCTGGGATCACAGGCTTGTATTAGTTCATTTTCACACTGCT
GATAAAGACATACCTAAGACTGGGAAGAAAAATAGGTTTATTGGACTCACAGTTCCATGTGGCTGGGGATGCCTCACAATTATGGTAGAA
GGCAAAAGGCACTTCTTAGATGGCAGCGGCAAGCAGGAAATGAGGAAGACATGAAAGCAGATATCCCTTATAAAACCATCAGTTCTCGTG
ATACTTACTCACTACCATGAGAAAGTATGGGGGAAACCGCTCCCATGATTCAGTTATCTCCCACCGGGTCCCTCTCACAACATGTGGGGA

>88036_88036_12_SULF1-RPS29_SULF1_chr8_70512988_ENST00000458141_RPS29_chr14_50044571_ENST00000396020_length(amino acids)=308AA_BP=
MKYSCCALVLAVLGTELLGSLCSTVRSPRFRGRIQQERKNIRPNIILVLTDDQDVELGSLQVMNKTRKIMEHGGATFINAFVTTPMCCPS
RSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKYLNEYNGSYIPPGWREWLGLIKNSRFYNYTVCRNGIKE
KHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPI

--------------------------------------------------------------
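The record headers above pack several fields into one underscore-delimited line: fusion ID, record index, fusion name, each partner's gene symbol, chromosome, breakpoint and Ensembl transcript, then the sequence length and a BP value. The following is a minimal sketch of how such a header might be split into named fields. The field names (hgene, hbp, tgene, tbp and so on) and the function parse_header are hypothetical labels chosen for illustration, not part of FusionGDB, and the pattern assumes gene symbols contain no underscores. An empty BP value, as in some amino-acid headers above, is returned as None rather than guessed.

```python
import re

# Hypothetical pattern inferred from the header lines shown above;
# it is an assumption, not an official FusionGDB format definition.
HEADER_RE = re.compile(
    r"^>(?P<fusion_id>\d+)_\d+_(?P<index>\d+)_"
    r"(?P<fusion>[^_]+)_"
    r"(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)_"
    r"length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?:nt|AA)"
    r"_BP=(?P<bp>\d*)(?:nt)?$"
)


def parse_header(line):
    """Split one record header into a dict of named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognised header: " + line)
    fields = match.groupdict()
    # Convert the numeric fields; genomic breakpoints stay as plain integers.
    for key in ("index", "hbp", "tbp", "length"):
        fields[key] = int(fields[key])
    # The BP value is blank in some headers; keep that as None.
    fields["bp"] = int(fields["bp"]) if fields["bp"] else None
    return fields


if __name__ == "__main__":
    example = (
        ">88036_88036_9_SULF1-RPS29_SULF1_chr8_70512988_ENST00000260128_"
        "RPS29_chr14_50044571_ENST00000396020_length(transcript)=2784nt_BP=1602nt"
    )
    print(parse_header(example))
```

Keeping the blank BP as None avoids inventing a value the page does not provide; downstream code can decide how to treat those records.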


Fusion Gene PPI Analysis for SULF1-RPS29


Go to ChiPPI (Chimeric Protein-Protein Interactions) to see the chimeric PPI interactions on the ChiPPI page.


Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene | Hgene's interactors | Tgene | Tgene's interactors


- Retained PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with


- Lost PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with


- Retained PPIs, but lost function due to frame-shift fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with



Related Drugs for SULF1-RPS29


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status


Related Diseases for SULF1-RPS29


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source