FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:SSBP2-MSH3 (FusionGDB2 ID:86714)

Fusion Gene Summary for SSBP2-MSH3

check button Fusion gene summary
Fusion gene informationFusion gene name: SSBP2-MSH3
Fusion gene ID: 86714
HgeneTgene
Gene symbol

SSBP2

MSH3

Gene ID

23635

4437

Gene namesingle stranded DNA binding protein 2mutS homolog 3
SynonymsHSPC116|SOSS-B2DUP|FAP4|MRP1
Cytomap

5q14.1

5q14.1

Type of geneprotein-codingprotein-coding
Descriptionsingle-stranded DNA-binding protein 2sequence-specific single-stranded-DNA-binding protein 2DNA mismatch repair protein Msh3divergent upstream proteinepididymis secretory sperm binding proteinhMSH3mismatch repair protein 1
Modification date2020031320200327
UniProtAcc.

P20585

Ensembl transtripts involved in fusion geneENST00000320672, ENST00000505980, 
ENST00000509053, ENST00000514493, 
ENST00000515395, ENST00000510060, 
ENST00000512258, ENST00000265081, 
Fusion gene scores* DoF score12 X 13 X 6=9366 X 7 X 2=84
# samples 147
** MAII scorelog2(14/936*10)=-2.74108170263844
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(7/84*10)=-0.263034405833794
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: SSBP2 [Title/Abstract] AND MSH3 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointSSBP2(80733249)-MSH3(79974746), # samples:1
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
TgeneMSH3

GO:0006281

DNA repair

8942985

TgeneMSH3

GO:0045910

negative regulation of DNA recombination

17715146

TgeneMSH3

GO:0051096

positive regulation of helicase activity

17715146


check buttonFusion gene breakpoints across SSBP2 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across MSH3 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4STADTCGA-CG-5717-01ASSBP2chr5

80733249

-MSH3chr5

79974746

+


Top

Fusion Gene ORF analysis for SSBP2-MSH3

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000320672ENST00000512258SSBP2chr5

80733249

-MSH3chr5

79974746

+
5CDS-intronENST00000505980ENST00000512258SSBP2chr5

80733249

-MSH3chr5

79974746

+
5CDS-intronENST00000509053ENST00000512258SSBP2chr5

80733249

-MSH3chr5

79974746

+
5CDS-intronENST00000514493ENST00000512258SSBP2chr5

80733249

-MSH3chr5

79974746

+
5CDS-intronENST00000515395ENST00000512258SSBP2chr5

80733249

-MSH3chr5

79974746

+
5UTR-3CDSENST00000510060ENST00000265081SSBP2chr5

80733249

-MSH3chr5

79974746

+
5UTR-intronENST00000510060ENST00000512258SSBP2chr5

80733249

-MSH3chr5

79974746

+
In-frameENST00000320672ENST00000265081SSBP2chr5

80733249

-MSH3chr5

79974746

+
In-frameENST00000505980ENST00000265081SSBP2chr5

80733249

-MSH3chr5

79974746

+
In-frameENST00000509053ENST00000265081SSBP2chr5

80733249

-MSH3chr5

79974746

+
In-frameENST00000514493ENST00000265081SSBP2chr5

80733249

-MSH3chr5

79974746

+
In-frameENST00000515395ENST00000265081SSBP2chr5

80733249

-MSH3chr5

79974746

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000320672SSBP2chr580733249-ENST00000265081MSH3chr579974746+4007116821134081065
ENST00000509053SSBP2chr580733249-ENST00000265081MSH3chr579974746+3712873631131035
ENST00000514493SSBP2chr580733249-ENST00000265081MSH3chr579974746+3914107520833151035
ENST00000505980SSBP2chr580733249-ENST00000265081MSH3chr579974746+3741902531421045
ENST00000515395SSBP2chr580733249-ENST00000265081MSH3chr579974746+37859465531861043

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000320672ENST00000265081SSBP2chr580733249-MSH3chr579974746+0.0003470860.99965286
ENST00000509053ENST00000265081SSBP2chr580733249-MSH3chr579974746+0.0003175230.99968255
ENST00000514493ENST00000265081SSBP2chr580733249-MSH3chr579974746+0.0003601730.99963987
ENST00000505980ENST00000265081SSBP2chr580733249-MSH3chr579974746+0.0004353050.9995647
ENST00000515395ENST00000265081SSBP2chr580733249-MSH3chr579974746+0.0002533030.99974674

Top

Fusion Genomic Features for SSBP2-MSH3


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
SSBP2chr580733248-MSH3chr579974745+0.0003558550.99964416
SSBP2chr580733248-MSH3chr579974745+0.0003558550.99964416

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.
genomic feature of top 1%

Top

Fusion Protein Features for SSBP2-MSH3


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr5:80733249/chr5:79974746)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
.MSH3

P20585

FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.FUNCTION: Component of the post-replicative DNA mismatch repair system (MMR). Heterodimerizes with MSH2 to form MutS beta which binds to DNA mismatches thereby initiating DNA repair. When bound, the MutS beta heterodimer bends the DNA helix and shields approximately 20 base pairs. MutS beta recognizes large insertion-deletion loops (IDL) up to 13 nucleotides long. After mismatch binding, forms a ternary complex with the MutL alpha heterodimer, which is thought to be responsible for directing the downstream MMR events, including strand discrimination, excision, and resynthesis.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneSSBP2chr5:80733249chr5:79974746ENST00000320672-1517100_293319362.0Compositional biasNote=Pro-rich
HgeneSSBP2chr5:80733249chr5:79974746ENST00000320672-1517147_312319362.0Compositional biasNote=Gly-rich
HgeneSSBP2chr5:80733249chr5:79974746ENST00000505980-1416100_293299342.0Compositional biasNote=Pro-rich
HgeneSSBP2chr5:80733249chr5:79974746ENST00000515395-1416100_293297340.0Compositional biasNote=Pro-rich
HgeneSSBP2chr5:80733249chr5:79974746ENST00000320672-151718_50319362.0DomainLisH
HgeneSSBP2chr5:80733249chr5:79974746ENST00000505980-141618_50299342.0DomainLisH
HgeneSSBP2chr5:80733249chr5:79974746ENST00000509053-141518_50289299.0DomainLisH
HgeneSSBP2chr5:80733249chr5:79974746ENST00000514493-141618_50289332.0DomainLisH
HgeneSSBP2chr5:80733249chr5:79974746ENST00000515395-141618_50297340.0DomainLisH
TgeneMSH3chr5:80733249chr5:79974746ENST00000265081624896_9033911138.0Nucleotide bindingATP

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneSSBP2chr5:80733249chr5:79974746ENST00000505980-1416147_312299342.0Compositional biasNote=Gly-rich
HgeneSSBP2chr5:80733249chr5:79974746ENST00000509053-1415100_293289299.0Compositional biasNote=Pro-rich
HgeneSSBP2chr5:80733249chr5:79974746ENST00000509053-1415147_312289299.0Compositional biasNote=Gly-rich
HgeneSSBP2chr5:80733249chr5:79974746ENST00000514493-1416100_293289332.0Compositional biasNote=Pro-rich
HgeneSSBP2chr5:80733249chr5:79974746ENST00000514493-1416147_312289332.0Compositional biasNote=Gly-rich
HgeneSSBP2chr5:80733249chr5:79974746ENST00000515395-1416147_312297340.0Compositional biasNote=Gly-rich
TgeneMSH3chr5:80733249chr5:79974746ENST0000026508162451_623911138.0Compositional biasNote=Poly-Ala


Top

Fusion Gene Sequence for SSBP2-MSH3


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>86714_86714_1_SSBP2-MSH3_SSBP2_chr5_80733249_ENST00000320672_MSH3_chr5_79974746_ENST00000265081_length(transcript)=4007nt_BP=1168nt
GGCTTAGCCGCCACCGCGCGCTTCGGAAGGCCAGAGGGAGGGGGAGGCCTGTCAGTCTCGCGCGTTGCCTGGGCGAAGGGGGCGGAGCTT
TGGCGTGGGGCGGCCAATAGTGGGGGTGGCTGCGTGGGTCGCCATGGGGACGGGGCTGTTCCCGGGGAGGCTGTGATGGGTTGACAGGTG
CGTGACAGTGGGAGCTGCTCTCGGCACAAGCATGTACGGCAAAGGCAAGAGTAACAGCAGCGCCGTCCCGTCCGACAGCCAGGCCCGGGA
GAAGTTAGCACTCTACGTATATGAATATCTGCTCCATGTAGGAGCTCAGAAATCAGCTCAAACATTTTTATCAGAGATAAGATGGGAAAA
AAACATCACATTGGGGGAACCACCAGGATTCTTACATTCTTGGTGGTGTGTATTTTGGGATCTCTACTGTGCAGCTCCAGAGAGACGTGA
AACATGTGAACACTCAAGTGAAGCAAAAGCCTTCCATGATTACAGTGCTGCAGCAGCTCCCAGTCCAGTGCTAGGAAACATTCCCCCAGG
AGATGGCATGCCAGTAGGTCCTGTACCACCAGGGTTCTTTCAGCCTTTTATGTCACCTCGGTACCCTGGAGGTCCAAGGCCCCCATTGAG
GATACCTAATCAGGCACTTGGAGGTGTCCCAGGAAGTCAGCCATTACTCCCCAGTGGAATGGATCCAACTCGACAACAAGGACATCCAAA
TATGGGTGGGCCAATGCAGAGAATGACTCCTCCAAGAGGAATGGTGCCCTTAGGACCACAGAACTATGGAGGTGCAATGAGACCCCCACT
GAATGCTTTAGGTGGCCCTGGAATGCCTGGAATGAACATGGGTCCAGGTGGTGGTAGACCTTGGCCAAACCCAACAAATGCCAATTCAAT
ACCATACTCCTCAGCATCTCCTGGGAATTATGTAGGTCCTCCAGGAGGTGGAGGGCCACCAGGAACACCCATCATGCCTAGTCCAGCAGA
TTCAACCAACTCTGGTGATAACATGTATACTTTAATGAATGCAGTACCTCCTGGACCTAACAGACCTAATTTTCCAATGGGTCCTGGGTC
AGATGGTCCCATGGGTGGATTAGGAGGAATGGAGTCACATCACATGAATGGCTCTTTAGGCTCAGGAGATATGGACAGTATTTCCAAGGG
AGTGCAGCCTGCCACAGGCGAGGTTGTGTTTGATAGTTTCCAGGACTCTGCTTCTCGTTCAGAGCTAGAAACCCGGATGTCAAGCCTGCA
GCCAGTAGAGCTGCTGCTTCCTTCGGCCTTGTCCGAGCAAACAGAGGCGCTCATCCACAGAGCCACATCTGTTAGTGTGCAGGATGACAG
AATTCGAGTCGAAAGGATGGATAACATTTATTTTGAATACAGCCATGCTTTCCAGGCAGTTACAGAGTTTTATGCAAAAGATACAGTTGA
CATCAAAGGTTCTCAAATTATTTCTGGCATTGTTAACTTAGAGAAGCCTGTGATTTGCTCTTTGGCTGCCATCATAAAATACCTCAAAGA
ATTCAACTTGGAAAAGATGCTCTCCAAACCTGAGAATTTTAAACAGCTATCAAGTAAAATGGAATTTATGACAATTAATGGAACAACATT
AAGGAATCTGGAAATCCTACAGAATCAGACTGATATGAAAACCAAAGGAAGTTTGCTGTGGGTTTTAGACCACACTAAAACTTCATTTGG
GAGACGGAAGTTAAAGAAGTGGGTGACCCAGCCACTCCTTAAATTAAGGGAAATAAATGCCCGGCTTGATGCTGTATCGGAAGTTCTCCA
TTCAGAATCTAGTGTGTTTGGTCAGATAGAAAATCATCTACGTAAATTGCCCGACATAGAGAGGGGACTCTGTAGCATTTATCACAAAAA
ATGTTCTACCCAAGAGTTCTTCTTGATTGTCAAAACTTTATATCACCTAAAGTCAGAATTTCAAGCAATAATACCTGCTGTTAATTCCCA
CATTCAGTCAGACTTGCTCCGGACCGTTATTTTAGAAATTCCTGAACTCCTCAGTCCAGTGGAGCATTACTTAAAGATACTCAATGAACA
AGCTGCCAAAGTTGGGGATAAAACTGAATTATTTAAAGACCTTTCTGACTTCCCTTTAATAAAAAAGAGGAAGGATGAAATTCAAGGTGT
TATTGACGAGATCCGAATGCATTTGCAAGAAATACGAAAAATACTAAAAAATCCTTCTGCACAATATGTGACAGTATCAGGACAGGAGTT
TATGATAGAAATAAAGAACTCTGCTGTATCTTGTATACCAACTGATTGGGTAAAGGTTGGAAGCACAAAAGCTGTGAGCCGCTTTCACTC
TCCTTTTATTGTAGAAAATTACAGACATCTGAATCAGCTCCGGGAGCAGCTAGTCCTTGACTGCAGTGCTGAATGGCTTGATTTTCTAGA
GAAATTCAGTGAACATTATCACTCCTTGTGTAAAGCAGTGCATCACCTAGCAACTGTTGACTGCATTTTCTCCCTGGCCAAGGTCGCTAA
GCAAGGAGATTACTGCAGACCAACTGTACAAGAAGAAAGAAAAATTGTAATAAAAAATGGAAGGCACCCTGTGATTGATGTGTTGCTGGG
AGAACAGGATCAATATGTCCCAAATAATACAGATTTATCAGAGGACTCAGAGAGAGTAATGATAATTACCGGACCAAACATGGGTGGAAA
GAGCTCCTACATAAAACAAGTTGCATTGATTACCATCATGGCTCAGATTGGCTCCTATGTTCCTGCAGAAGAAGCGACAATTGGGATTGT
GGATGGCATTTTCACAAGGATGGGTGCTGCAGACAATATATATAAAGGACAGAGTACATTTATGGAAGAACTGACTGACACAGCAGAAAT
AATCAGAAAAGCAACATCACAGTCCTTGGTTATCTTGGATGAACTAGGAAGAGGGACGAGCACTCATGATGGAATTGCCATTGCCTATGC
TACACTTGAGTATTTCATCAGAGATGTGAAATCCTTAACCCTGTTTGTCACCCATTATCCGCCAGTTTGTGAACTAGAAAAAAATTACTC
ACACCAGGTGGGGAATTACCACATGGGATTCTTGGTCAGTGAGGATGAAAGCAAACTGGATCCAGGCGCAGCAGAACAAGTCCCTGATTT
TGTCACCTTCCTTTACCAAATAACTAGAGGAATTGCAGCAAGGAGTTATGGATTAAATGTGGCTAAACTAGCAGATGTTCCTGGAGAAAT
TTTGAAGAAAGCAGCTCACAAGTCAAAAGAGCTGGAAGGATTAATAAATACGAAAAGAAAGAGACTCAAGTATTTTGCAAAGTTATGGAC
GATGCATAATGCACAAGACCTGCAGAAGTGGACAGAGGAGTTCAACATGGAAGAAACACAGACTTCTCTTCTTCATTAAAATGAAGACTA
CATTTGTGAACAAAAAATGGAGAATTAAAAATACCAACTGTACAAAATAACTCTCCAGTAACAGCCTATCTTTGTGTGACATGTGAGCAT
AAAATTATGACCATGGTATATTCCTATTGGAAACAGAGAGGTTTTTCTGAAGACAGTCTTTTTCAAGTTTCTGTCTTCCTAACTTTTCTA
CGTATAAACACTCTTGAATAGACTTCCACTTTGTAATTAGAAAATTTTATGGACAGTAAGTCCAGTAAAGCCTTAAGTGGCAGAATATAA
TTCCCAAGCTTTTGGAGGGTGATATAAAAATTTACTTGATATTTTTATTTGTTTCAGTTCAGATAATTGGCAACTGGGTGAATCTGGCAG
GAATCTATCCATTGAACTAAAATAATTTTATTATGCAACCAGTTTATCCACCAAGAACATAAGAATTTTTTATAAGTAGAAAGAATTGGC
CAGGCATGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGTAGGCAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGG

>86714_86714_1_SSBP2-MSH3_SSBP2_chr5_80733249_ENST00000320672_MSH3_chr5_79974746_ENST00000265081_length(amino acids)=1065AA_BP=319
MYGKGKSNSSAVPSDSQAREKLALYVYEYLLHVGAQKSAQTFLSEIRWEKNITLGEPPGFLHSWWCVFWDLYCAAPERRETCEHSSEAKA
FHDYSAAAAPSPVLGNIPPGDGMPVGPVPPGFFQPFMSPRYPGGPRPPLRIPNQALGGVPGSQPLLPSGMDPTRQQGHPNMGGPMQRMTP
PRGMVPLGPQNYGGAMRPPLNALGGPGMPGMNMGPGGGRPWPNPTNANSIPYSSASPGNYVGPPGGGGPPGTPIMPSPADSTNSGDNMYT
LMNAVPPGPNRPNFPMGPGSDGPMGGLGGMESHHMNGSLGSGDMDSISKGVQPATGEVVFDSFQDSASRSELETRMSSLQPVELLLPSAL
SEQTEALIHRATSVSVQDDRIRVERMDNIYFEYSHAFQAVTEFYAKDTVDIKGSQIISGIVNLEKPVICSLAAIIKYLKEFNLEKMLSKP
ENFKQLSSKMEFMTINGTTLRNLEILQNQTDMKTKGSLLWVLDHTKTSFGRRKLKKWVTQPLLKLREINARLDAVSEVLHSESSVFGQIE
NHLRKLPDIERGLCSIYHKKCSTQEFFLIVKTLYHLKSEFQAIIPAVNSHIQSDLLRTVILEIPELLSPVEHYLKILNEQAAKVGDKTEL
FKDLSDFPLIKKRKDEIQGVIDEIRMHLQEIRKILKNPSAQYVTVSGQEFMIEIKNSAVSCIPTDWVKVGSTKAVSRFHSPFIVENYRHL
NQLREQLVLDCSAEWLDFLEKFSEHYHSLCKAVHHLATVDCIFSLAKVAKQGDYCRPTVQEERKIVIKNGRHPVIDVLLGEQDQYVPNNT
DLSEDSERVMIITGPNMGGKSSYIKQVALITIMAQIGSYVPAEEATIGIVDGIFTRMGAADNIYKGQSTFMEELTDTAEIIRKATSQSLV
ILDELGRGTSTHDGIAIAYATLEYFIRDVKSLTLFVTHYPPVCELEKNYSHQVGNYHMGFLVSEDESKLDPGAAEQVPDFVTFLYQITRG

--------------------------------------------------------------
>86714_86714_2_SSBP2-MSH3_SSBP2_chr5_80733249_ENST00000505980_MSH3_chr5_79974746_ENST00000265081_length(transcript)=3741nt_BP=902nt
CAAGCATGTACGGCAAAGGCAAGAGTAACAGCAGCGCCGTCCCGTCCGACAGCCAGGCCCGGGAGAAGTTAGCACTCTACGTATATGAAT
ATCTGCTCCATGTAGGAGCTCAGAAATCAGCTCAAACATTTTTATCAGAGATAAGATGGGAAAAAAACATCACATTGGGGGAACCACCAG
GATTCTTACATTCTTGGTGGTGTGTATTTTGGGATCTCTACTGTGCAGCTCCAGAGAGACGTGAAACATGTGAACACTCAAGTGAAGCAA
AAGCCTTCCATGATTACAGTGCTGCAGCAGCTCCCAGTCCAGTGCTAGGAAACATTCCCCCAGGAGATGGCATGCCAGTAGGTCCTGTAC
CACCAGGGTTCTTTCAGGCACTTGGAGGTGTCCCAGGAAGTCAGCCATTACTCCCCAGTGGAATGGATCCAACTCGACAACAAGGACATC
CAAATATGGGTGGGCCAATGCAGAGAATGACTCCTCCAAGAGGAATGGTGCCCTTAGGACCACAGAACTATGGAGGTGCAATGAGACCCC
CACTGAATGCTTTAGGTGGCCCTGGAATGCCTGGAATGAACATGGGTCCAGGTGGTGGTAGACCTTGGCCAAACCCAACAAATGCCAATT
CAATACCATACTCCTCAGCATCTCCTGGGAATTATGTAGGTCCTCCAGGAGGTGGAGGGCCACCAGGAACACCCATCATGCCTAGTCCAG
CAGATTCAACCAACTCTGGTGATAACATGTATACTTTAATGAATGCAGTACCTCCTGGACCTAACAGACCTAATTTTCCAATGGGTCCTG
GGTCAGATGGTCCCATGGGTGGATTAGGAGGAATGGAGTCACATCACATGAATGGCTCTTTAGGCTCAGGAGATATGGACAGTATTTCCA
AGGGAGTGCAGCCTGCCACAGGCGAGGTTGTGTTTGATAGTTTCCAGGACTCTGCTTCTCGTTCAGAGCTAGAAACCCGGATGTCAAGCC
TGCAGCCAGTAGAGCTGCTGCTTCCTTCGGCCTTGTCCGAGCAAACAGAGGCGCTCATCCACAGAGCCACATCTGTTAGTGTGCAGGATG
ACAGAATTCGAGTCGAAAGGATGGATAACATTTATTTTGAATACAGCCATGCTTTCCAGGCAGTTACAGAGTTTTATGCAAAAGATACAG
TTGACATCAAAGGTTCTCAAATTATTTCTGGCATTGTTAACTTAGAGAAGCCTGTGATTTGCTCTTTGGCTGCCATCATAAAATACCTCA
AAGAATTCAACTTGGAAAAGATGCTCTCCAAACCTGAGAATTTTAAACAGCTATCAAGTAAAATGGAATTTATGACAATTAATGGAACAA
CATTAAGGAATCTGGAAATCCTACAGAATCAGACTGATATGAAAACCAAAGGAAGTTTGCTGTGGGTTTTAGACCACACTAAAACTTCAT
TTGGGAGACGGAAGTTAAAGAAGTGGGTGACCCAGCCACTCCTTAAATTAAGGGAAATAAATGCCCGGCTTGATGCTGTATCGGAAGTTC
TCCATTCAGAATCTAGTGTGTTTGGTCAGATAGAAAATCATCTACGTAAATTGCCCGACATAGAGAGGGGACTCTGTAGCATTTATCACA
AAAAATGTTCTACCCAAGAGTTCTTCTTGATTGTCAAAACTTTATATCACCTAAAGTCAGAATTTCAAGCAATAATACCTGCTGTTAATT
CCCACATTCAGTCAGACTTGCTCCGGACCGTTATTTTAGAAATTCCTGAACTCCTCAGTCCAGTGGAGCATTACTTAAAGATACTCAATG
AACAAGCTGCCAAAGTTGGGGATAAAACTGAATTATTTAAAGACCTTTCTGACTTCCCTTTAATAAAAAAGAGGAAGGATGAAATTCAAG
GTGTTATTGACGAGATCCGAATGCATTTGCAAGAAATACGAAAAATACTAAAAAATCCTTCTGCACAATATGTGACAGTATCAGGACAGG
AGTTTATGATAGAAATAAAGAACTCTGCTGTATCTTGTATACCAACTGATTGGGTAAAGGTTGGAAGCACAAAAGCTGTGAGCCGCTTTC
ACTCTCCTTTTATTGTAGAAAATTACAGACATCTGAATCAGCTCCGGGAGCAGCTAGTCCTTGACTGCAGTGCTGAATGGCTTGATTTTC
TAGAGAAATTCAGTGAACATTATCACTCCTTGTGTAAAGCAGTGCATCACCTAGCAACTGTTGACTGCATTTTCTCCCTGGCCAAGGTCG
CTAAGCAAGGAGATTACTGCAGACCAACTGTACAAGAAGAAAGAAAAATTGTAATAAAAAATGGAAGGCACCCTGTGATTGATGTGTTGC
TGGGAGAACAGGATCAATATGTCCCAAATAATACAGATTTATCAGAGGACTCAGAGAGAGTAATGATAATTACCGGACCAAACATGGGTG
GAAAGAGCTCCTACATAAAACAAGTTGCATTGATTACCATCATGGCTCAGATTGGCTCCTATGTTCCTGCAGAAGAAGCGACAATTGGGA
TTGTGGATGGCATTTTCACAAGGATGGGTGCTGCAGACAATATATATAAAGGACAGAGTACATTTATGGAAGAACTGACTGACACAGCAG
AAATAATCAGAAAAGCAACATCACAGTCCTTGGTTATCTTGGATGAACTAGGAAGAGGGACGAGCACTCATGATGGAATTGCCATTGCCT
ATGCTACACTTGAGTATTTCATCAGAGATGTGAAATCCTTAACCCTGTTTGTCACCCATTATCCGCCAGTTTGTGAACTAGAAAAAAATT
ACTCACACCAGGTGGGGAATTACCACATGGGATTCTTGGTCAGTGAGGATGAAAGCAAACTGGATCCAGGCGCAGCAGAACAAGTCCCTG
ATTTTGTCACCTTCCTTTACCAAATAACTAGAGGAATTGCAGCAAGGAGTTATGGATTAAATGTGGCTAAACTAGCAGATGTTCCTGGAG
AAATTTTGAAGAAAGCAGCTCACAAGTCAAAAGAGCTGGAAGGATTAATAAATACGAAAAGAAAGAGACTCAAGTATTTTGCAAAGTTAT
GGACGATGCATAATGCACAAGACCTGCAGAAGTGGACAGAGGAGTTCAACATGGAAGAAACACAGACTTCTCTTCTTCATTAAAATGAAG
ACTACATTTGTGAACAAAAAATGGAGAATTAAAAATACCAACTGTACAAAATAACTCTCCAGTAACAGCCTATCTTTGTGTGACATGTGA
GCATAAAATTATGACCATGGTATATTCCTATTGGAAACAGAGAGGTTTTTCTGAAGACAGTCTTTTTCAAGTTTCTGTCTTCCTAACTTT
TCTACGTATAAACACTCTTGAATAGACTTCCACTTTGTAATTAGAAAATTTTATGGACAGTAAGTCCAGTAAAGCCTTAAGTGGCAGAAT
ATAATTCCCAAGCTTTTGGAGGGTGATATAAAAATTTACTTGATATTTTTATTTGTTTCAGTTCAGATAATTGGCAACTGGGTGAATCTG
GCAGGAATCTATCCATTGAACTAAAATAATTTTATTATGCAACCAGTTTATCCACCAAGAACATAAGAATTTTTTATAAGTAGAAAGAAT
TGGCCAGGCATGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGTAGGCAGATCACCTGAGGTCAGGAGTTCAAGACCAGC

>86714_86714_2_SSBP2-MSH3_SSBP2_chr5_80733249_ENST00000505980_MSH3_chr5_79974746_ENST00000265081_length(amino acids)=1045AA_BP=299
MYGKGKSNSSAVPSDSQAREKLALYVYEYLLHVGAQKSAQTFLSEIRWEKNITLGEPPGFLHSWWCVFWDLYCAAPERRETCEHSSEAKA
FHDYSAAAAPSPVLGNIPPGDGMPVGPVPPGFFQALGGVPGSQPLLPSGMDPTRQQGHPNMGGPMQRMTPPRGMVPLGPQNYGGAMRPPL
NALGGPGMPGMNMGPGGGRPWPNPTNANSIPYSSASPGNYVGPPGGGGPPGTPIMPSPADSTNSGDNMYTLMNAVPPGPNRPNFPMGPGS
DGPMGGLGGMESHHMNGSLGSGDMDSISKGVQPATGEVVFDSFQDSASRSELETRMSSLQPVELLLPSALSEQTEALIHRATSVSVQDDR
IRVERMDNIYFEYSHAFQAVTEFYAKDTVDIKGSQIISGIVNLEKPVICSLAAIIKYLKEFNLEKMLSKPENFKQLSSKMEFMTINGTTL
RNLEILQNQTDMKTKGSLLWVLDHTKTSFGRRKLKKWVTQPLLKLREINARLDAVSEVLHSESSVFGQIENHLRKLPDIERGLCSIYHKK
CSTQEFFLIVKTLYHLKSEFQAIIPAVNSHIQSDLLRTVILEIPELLSPVEHYLKILNEQAAKVGDKTELFKDLSDFPLIKKRKDEIQGV
IDEIRMHLQEIRKILKNPSAQYVTVSGQEFMIEIKNSAVSCIPTDWVKVGSTKAVSRFHSPFIVENYRHLNQLREQLVLDCSAEWLDFLE
KFSEHYHSLCKAVHHLATVDCIFSLAKVAKQGDYCRPTVQEERKIVIKNGRHPVIDVLLGEQDQYVPNNTDLSEDSERVMIITGPNMGGK
SSYIKQVALITIMAQIGSYVPAEEATIGIVDGIFTRMGAADNIYKGQSTFMEELTDTAEIIRKATSQSLVILDELGRGTSTHDGIAIAYA
TLEYFIRDVKSLTLFVTHYPPVCELEKNYSHQVGNYHMGFLVSEDESKLDPGAAEQVPDFVTFLYQITRGIAARSYGLNVAKLADVPGEI

--------------------------------------------------------------
>86714_86714_3_SSBP2-MSH3_SSBP2_chr5_80733249_ENST00000509053_MSH3_chr5_79974746_ENST00000265081_length(transcript)=3712nt_BP=873nt
ACAAGCATGTACGGCAAAGGCAAGAGTAACAGCAGCGCCGTCCCGTCCGACAGCCAGGCCCGGGAGAAGTTAGCACTCTACGTATATGAA
TATCTGCTCCATGTAGGAGCTCAGAAATCAGCTCAAACATTTTTATCAGAGATAAGATGGGAAAAAAACATCACATTGGGGGAACCACCA
GGATTCTTACATTCTTGGTGGTGTGTATTTTGGGATCTCTACTGTGCAGCTCCAGAGAGACGTGAAACATGTGAACACTCAAGTGAAGCA
AAAGCCTTCCATGATTACCCTTTTATGTCACCTCGGTACCCTGGAGGTCCAAGGCCCCCATTGAGGATACCTAATCAGGCACTTGGAGGT
GTCCCAGGAAGTCAGCCATTACTCCCCAGTGGAATGGATCCAACTCGACAACAAGGACATCCAAATATGGGTGGGCCAATGCAGAGAATG
ACTCCTCCAAGAGGAATGGTGCCCTTAGGACCACAGAACTATGGAGGTGCAATGAGACCCCCACTGAATGCTTTAGGTGGCCCTGGAATG
CCTGGAATGAACATGGGTCCAGGTGGTGGTAGACCTTGGCCAAACCCAACAAATGCCAATTCAATACCATACTCCTCAGCATCTCCTGGG
AATTATGTAGGTCCTCCAGGAGGTGGAGGGCCACCAGGAACACCCATCATGCCTAGTCCAGCAGATTCAACCAACTCTGGTGATAACATG
TATACTTTAATGAATGCAGTACCTCCTGGACCTAACAGACCTAATTTTCCAATGGGTCCTGGGTCAGATGGTCCCATGGGTGGATTAGGA
GGAATGGAGTCACATCACATGAATGGCTCTTTAGGCTCAGGAGATATGGACAGTATTTCCAAGGGAGTGCAGCCTGCCACAGGCGAGGTT
GTGTTTGATAGTTTCCAGGACTCTGCTTCTCGTTCAGAGCTAGAAACCCGGATGTCAAGCCTGCAGCCAGTAGAGCTGCTGCTTCCTTCG
GCCTTGTCCGAGCAAACAGAGGCGCTCATCCACAGAGCCACATCTGTTAGTGTGCAGGATGACAGAATTCGAGTCGAAAGGATGGATAAC
ATTTATTTTGAATACAGCCATGCTTTCCAGGCAGTTACAGAGTTTTATGCAAAAGATACAGTTGACATCAAAGGTTCTCAAATTATTTCT
GGCATTGTTAACTTAGAGAAGCCTGTGATTTGCTCTTTGGCTGCCATCATAAAATACCTCAAAGAATTCAACTTGGAAAAGATGCTCTCC
AAACCTGAGAATTTTAAACAGCTATCAAGTAAAATGGAATTTATGACAATTAATGGAACAACATTAAGGAATCTGGAAATCCTACAGAAT
CAGACTGATATGAAAACCAAAGGAAGTTTGCTGTGGGTTTTAGACCACACTAAAACTTCATTTGGGAGACGGAAGTTAAAGAAGTGGGTG
ACCCAGCCACTCCTTAAATTAAGGGAAATAAATGCCCGGCTTGATGCTGTATCGGAAGTTCTCCATTCAGAATCTAGTGTGTTTGGTCAG
ATAGAAAATCATCTACGTAAATTGCCCGACATAGAGAGGGGACTCTGTAGCATTTATCACAAAAAATGTTCTACCCAAGAGTTCTTCTTG
ATTGTCAAAACTTTATATCACCTAAAGTCAGAATTTCAAGCAATAATACCTGCTGTTAATTCCCACATTCAGTCAGACTTGCTCCGGACC
GTTATTTTAGAAATTCCTGAACTCCTCAGTCCAGTGGAGCATTACTTAAAGATACTCAATGAACAAGCTGCCAAAGTTGGGGATAAAACT
GAATTATTTAAAGACCTTTCTGACTTCCCTTTAATAAAAAAGAGGAAGGATGAAATTCAAGGTGTTATTGACGAGATCCGAATGCATTTG
CAAGAAATACGAAAAATACTAAAAAATCCTTCTGCACAATATGTGACAGTATCAGGACAGGAGTTTATGATAGAAATAAAGAACTCTGCT
GTATCTTGTATACCAACTGATTGGGTAAAGGTTGGAAGCACAAAAGCTGTGAGCCGCTTTCACTCTCCTTTTATTGTAGAAAATTACAGA
CATCTGAATCAGCTCCGGGAGCAGCTAGTCCTTGACTGCAGTGCTGAATGGCTTGATTTTCTAGAGAAATTCAGTGAACATTATCACTCC
TTGTGTAAAGCAGTGCATCACCTAGCAACTGTTGACTGCATTTTCTCCCTGGCCAAGGTCGCTAAGCAAGGAGATTACTGCAGACCAACT
GTACAAGAAGAAAGAAAAATTGTAATAAAAAATGGAAGGCACCCTGTGATTGATGTGTTGCTGGGAGAACAGGATCAATATGTCCCAAAT
AATACAGATTTATCAGAGGACTCAGAGAGAGTAATGATAATTACCGGACCAAACATGGGTGGAAAGAGCTCCTACATAAAACAAGTTGCA
TTGATTACCATCATGGCTCAGATTGGCTCCTATGTTCCTGCAGAAGAAGCGACAATTGGGATTGTGGATGGCATTTTCACAAGGATGGGT
GCTGCAGACAATATATATAAAGGACAGAGTACATTTATGGAAGAACTGACTGACACAGCAGAAATAATCAGAAAAGCAACATCACAGTCC
TTGGTTATCTTGGATGAACTAGGAAGAGGGACGAGCACTCATGATGGAATTGCCATTGCCTATGCTACACTTGAGTATTTCATCAGAGAT
GTGAAATCCTTAACCCTGTTTGTCACCCATTATCCGCCAGTTTGTGAACTAGAAAAAAATTACTCACACCAGGTGGGGAATTACCACATG
GGATTCTTGGTCAGTGAGGATGAAAGCAAACTGGATCCAGGCGCAGCAGAACAAGTCCCTGATTTTGTCACCTTCCTTTACCAAATAACT
AGAGGAATTGCAGCAAGGAGTTATGGATTAAATGTGGCTAAACTAGCAGATGTTCCTGGAGAAATTTTGAAGAAAGCAGCTCACAAGTCA
AAAGAGCTGGAAGGATTAATAAATACGAAAAGAAAGAGACTCAAGTATTTTGCAAAGTTATGGACGATGCATAATGCACAAGACCTGCAG
AAGTGGACAGAGGAGTTCAACATGGAAGAAACACAGACTTCTCTTCTTCATTAAAATGAAGACTACATTTGTGAACAAAAAATGGAGAAT
TAAAAATACCAACTGTACAAAATAACTCTCCAGTAACAGCCTATCTTTGTGTGACATGTGAGCATAAAATTATGACCATGGTATATTCCT
ATTGGAAACAGAGAGGTTTTTCTGAAGACAGTCTTTTTCAAGTTTCTGTCTTCCTAACTTTTCTACGTATAAACACTCTTGAATAGACTT
CCACTTTGTAATTAGAAAATTTTATGGACAGTAAGTCCAGTAAAGCCTTAAGTGGCAGAATATAATTCCCAAGCTTTTGGAGGGTGATAT
AAAAATTTACTTGATATTTTTATTTGTTTCAGTTCAGATAATTGGCAACTGGGTGAATCTGGCAGGAATCTATCCATTGAACTAAAATAA
TTTTATTATGCAACCAGTTTATCCACCAAGAACATAAGAATTTTTTATAAGTAGAAAGAATTGGCCAGGCATGGTGGCTCATGCCTGTAA
TCCCAGCACTTTGGGAGGCCAAGGTAGGCAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGCAAAACCCCATCTTTA

>86714_86714_3_SSBP2-MSH3_SSBP2_chr5_80733249_ENST00000509053_MSH3_chr5_79974746_ENST00000265081_length(amino acids)=1035AA_BP=289
MYGKGKSNSSAVPSDSQAREKLALYVYEYLLHVGAQKSAQTFLSEIRWEKNITLGEPPGFLHSWWCVFWDLYCAAPERRETCEHSSEAKA
FHDYPFMSPRYPGGPRPPLRIPNQALGGVPGSQPLLPSGMDPTRQQGHPNMGGPMQRMTPPRGMVPLGPQNYGGAMRPPLNALGGPGMPG
MNMGPGGGRPWPNPTNANSIPYSSASPGNYVGPPGGGGPPGTPIMPSPADSTNSGDNMYTLMNAVPPGPNRPNFPMGPGSDGPMGGLGGM
ESHHMNGSLGSGDMDSISKGVQPATGEVVFDSFQDSASRSELETRMSSLQPVELLLPSALSEQTEALIHRATSVSVQDDRIRVERMDNIY
FEYSHAFQAVTEFYAKDTVDIKGSQIISGIVNLEKPVICSLAAIIKYLKEFNLEKMLSKPENFKQLSSKMEFMTINGTTLRNLEILQNQT
DMKTKGSLLWVLDHTKTSFGRRKLKKWVTQPLLKLREINARLDAVSEVLHSESSVFGQIENHLRKLPDIERGLCSIYHKKCSTQEFFLIV
KTLYHLKSEFQAIIPAVNSHIQSDLLRTVILEIPELLSPVEHYLKILNEQAAKVGDKTELFKDLSDFPLIKKRKDEIQGVIDEIRMHLQE
IRKILKNPSAQYVTVSGQEFMIEIKNSAVSCIPTDWVKVGSTKAVSRFHSPFIVENYRHLNQLREQLVLDCSAEWLDFLEKFSEHYHSLC
KAVHHLATVDCIFSLAKVAKQGDYCRPTVQEERKIVIKNGRHPVIDVLLGEQDQYVPNNTDLSEDSERVMIITGPNMGGKSSYIKQVALI
TIMAQIGSYVPAEEATIGIVDGIFTRMGAADNIYKGQSTFMEELTDTAEIIRKATSQSLVILDELGRGTSTHDGIAIAYATLEYFIRDVK
SLTLFVTHYPPVCELEKNYSHQVGNYHMGFLVSEDESKLDPGAAEQVPDFVTFLYQITRGIAARSYGLNVAKLADVPGEILKKAAHKSKE

--------------------------------------------------------------
>86714_86714_4_SSBP2-MSH3_SSBP2_chr5_80733249_ENST00000514493_MSH3_chr5_79974746_ENST00000265081_length(transcript)=3914nt_BP=1075nt
TTAGCCGCCACCGCGCGCTTCGGAAGGCCAGAGGGAGGGGGAGGCCTGTCAGTCTCGCGCGTTGCCTGGGCGAAGGGGGCGGAGCTTTGG
CGTGGGGCGGCCAATAGTGGGGGTGGCTGCGTGGGTCGCCATGGGGACGGGGCTGTTCCCGGGGAGGCTGTGATGGGTTGACAGGTGCGT
GACAGTGGGAGCTGCTCTCGGCACAAGCATGTACGGCAAAGGCAAGAGTAACAGCAGCGCCGTCCCGTCCGACAGCCAGGCCCGGGAGAA
GTTAGCACTCTACGTATATGAATATCTGCTCCATGTAGGAGCTCAGAAATCAGCTCAAACATTTTTATCAGAGATAAGATGGGAAAAAAA
CATCACATTGGGGGAACCACCAGGATTCTTACATTCTTGGTGGTGTGTATTTTGGGATCTCTACTGTGCAGCTCCAGAGAGACGTGAAAC
ATGTGAACACTCAAGTGAAGCAAAAGCCTTCCATGATTACCCTTTTATGTCACCTCGGTACCCTGGAGGTCCAAGGCCCCCATTGAGGAT
ACCTAATCAGGCACTTGGAGGTGTCCCAGGAAGTCAGCCATTACTCCCCAGTGGAATGGATCCAACTCGACAACAAGGACATCCAAATAT
GGGTGGGCCAATGCAGAGAATGACTCCTCCAAGAGGAATGGTGCCCTTAGGACCACAGAACTATGGAGGTGCAATGAGACCCCCACTGAA
TGCTTTAGGTGGCCCTGGAATGCCTGGAATGAACATGGGTCCAGGTGGTGGTAGACCTTGGCCAAACCCAACAAATGCCAATTCAATACC
ATACTCCTCAGCATCTCCTGGGAATTATGTAGGTCCTCCAGGAGGTGGAGGGCCACCAGGAACACCCATCATGCCTAGTCCAGCAGATTC
AACCAACTCTGGTGATAACATGTATACTTTAATGAATGCAGTACCTCCTGGACCTAACAGACCTAATTTTCCAATGGGTCCTGGGTCAGA
TGGTCCCATGGGTGGATTAGGAGGAATGGAGTCACATCACATGAATGGCTCTTTAGGCTCAGGAGATATGGACAGTATTTCCAAGGGAGT
GCAGCCTGCCACAGGCGAGGTTGTGTTTGATAGTTTCCAGGACTCTGCTTCTCGTTCAGAGCTAGAAACCCGGATGTCAAGCCTGCAGCC
AGTAGAGCTGCTGCTTCCTTCGGCCTTGTCCGAGCAAACAGAGGCGCTCATCCACAGAGCCACATCTGTTAGTGTGCAGGATGACAGAAT
TCGAGTCGAAAGGATGGATAACATTTATTTTGAATACAGCCATGCTTTCCAGGCAGTTACAGAGTTTTATGCAAAAGATACAGTTGACAT
CAAAGGTTCTCAAATTATTTCTGGCATTGTTAACTTAGAGAAGCCTGTGATTTGCTCTTTGGCTGCCATCATAAAATACCTCAAAGAATT
CAACTTGGAAAAGATGCTCTCCAAACCTGAGAATTTTAAACAGCTATCAAGTAAAATGGAATTTATGACAATTAATGGAACAACATTAAG
GAATCTGGAAATCCTACAGAATCAGACTGATATGAAAACCAAAGGAAGTTTGCTGTGGGTTTTAGACCACACTAAAACTTCATTTGGGAG
ACGGAAGTTAAAGAAGTGGGTGACCCAGCCACTCCTTAAATTAAGGGAAATAAATGCCCGGCTTGATGCTGTATCGGAAGTTCTCCATTC
AGAATCTAGTGTGTTTGGTCAGATAGAAAATCATCTACGTAAATTGCCCGACATAGAGAGGGGACTCTGTAGCATTTATCACAAAAAATG
TTCTACCCAAGAGTTCTTCTTGATTGTCAAAACTTTATATCACCTAAAGTCAGAATTTCAAGCAATAATACCTGCTGTTAATTCCCACAT
TCAGTCAGACTTGCTCCGGACCGTTATTTTAGAAATTCCTGAACTCCTCAGTCCAGTGGAGCATTACTTAAAGATACTCAATGAACAAGC
TGCCAAAGTTGGGGATAAAACTGAATTATTTAAAGACCTTTCTGACTTCCCTTTAATAAAAAAGAGGAAGGATGAAATTCAAGGTGTTAT
TGACGAGATCCGAATGCATTTGCAAGAAATACGAAAAATACTAAAAAATCCTTCTGCACAATATGTGACAGTATCAGGACAGGAGTTTAT
GATAGAAATAAAGAACTCTGCTGTATCTTGTATACCAACTGATTGGGTAAAGGTTGGAAGCACAAAAGCTGTGAGCCGCTTTCACTCTCC
TTTTATTGTAGAAAATTACAGACATCTGAATCAGCTCCGGGAGCAGCTAGTCCTTGACTGCAGTGCTGAATGGCTTGATTTTCTAGAGAA
ATTCAGTGAACATTATCACTCCTTGTGTAAAGCAGTGCATCACCTAGCAACTGTTGACTGCATTTTCTCCCTGGCCAAGGTCGCTAAGCA
AGGAGATTACTGCAGACCAACTGTACAAGAAGAAAGAAAAATTGTAATAAAAAATGGAAGGCACCCTGTGATTGATGTGTTGCTGGGAGA
ACAGGATCAATATGTCCCAAATAATACAGATTTATCAGAGGACTCAGAGAGAGTAATGATAATTACCGGACCAAACATGGGTGGAAAGAG
CTCCTACATAAAACAAGTTGCATTGATTACCATCATGGCTCAGATTGGCTCCTATGTTCCTGCAGAAGAAGCGACAATTGGGATTGTGGA
TGGCATTTTCACAAGGATGGGTGCTGCAGACAATATATATAAAGGACAGAGTACATTTATGGAAGAACTGACTGACACAGCAGAAATAAT
CAGAAAAGCAACATCACAGTCCTTGGTTATCTTGGATGAACTAGGAAGAGGGACGAGCACTCATGATGGAATTGCCATTGCCTATGCTAC
ACTTGAGTATTTCATCAGAGATGTGAAATCCTTAACCCTGTTTGTCACCCATTATCCGCCAGTTTGTGAACTAGAAAAAAATTACTCACA
CCAGGTGGGGAATTACCACATGGGATTCTTGGTCAGTGAGGATGAAAGCAAACTGGATCCAGGCGCAGCAGAACAAGTCCCTGATTTTGT
CACCTTCCTTTACCAAATAACTAGAGGAATTGCAGCAAGGAGTTATGGATTAAATGTGGCTAAACTAGCAGATGTTCCTGGAGAAATTTT
GAAGAAAGCAGCTCACAAGTCAAAAGAGCTGGAAGGATTAATAAATACGAAAAGAAAGAGACTCAAGTATTTTGCAAAGTTATGGACGAT
GCATAATGCACAAGACCTGCAGAAGTGGACAGAGGAGTTCAACATGGAAGAAACACAGACTTCTCTTCTTCATTAAAATGAAGACTACAT
TTGTGAACAAAAAATGGAGAATTAAAAATACCAACTGTACAAAATAACTCTCCAGTAACAGCCTATCTTTGTGTGACATGTGAGCATAAA
ATTATGACCATGGTATATTCCTATTGGAAACAGAGAGGTTTTTCTGAAGACAGTCTTTTTCAAGTTTCTGTCTTCCTAACTTTTCTACGT
ATAAACACTCTTGAATAGACTTCCACTTTGTAATTAGAAAATTTTATGGACAGTAAGTCCAGTAAAGCCTTAAGTGGCAGAATATAATTC
CCAAGCTTTTGGAGGGTGATATAAAAATTTACTTGATATTTTTATTTGTTTCAGTTCAGATAATTGGCAACTGGGTGAATCTGGCAGGAA
TCTATCCATTGAACTAAAATAATTTTATTATGCAACCAGTTTATCCACCAAGAACATAAGAATTTTTTATAAGTAGAAAGAATTGGCCAG
GCATGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGTAGGCAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCA

>86714_86714_4_SSBP2-MSH3_SSBP2_chr5_80733249_ENST00000514493_MSH3_chr5_79974746_ENST00000265081_length(amino acids)=1035AA_BP=289
MYGKGKSNSSAVPSDSQAREKLALYVYEYLLHVGAQKSAQTFLSEIRWEKNITLGEPPGFLHSWWCVFWDLYCAAPERRETCEHSSEAKA
FHDYPFMSPRYPGGPRPPLRIPNQALGGVPGSQPLLPSGMDPTRQQGHPNMGGPMQRMTPPRGMVPLGPQNYGGAMRPPLNALGGPGMPG
MNMGPGGGRPWPNPTNANSIPYSSASPGNYVGPPGGGGPPGTPIMPSPADSTNSGDNMYTLMNAVPPGPNRPNFPMGPGSDGPMGGLGGM
ESHHMNGSLGSGDMDSISKGVQPATGEVVFDSFQDSASRSELETRMSSLQPVELLLPSALSEQTEALIHRATSVSVQDDRIRVERMDNIY
FEYSHAFQAVTEFYAKDTVDIKGSQIISGIVNLEKPVICSLAAIIKYLKEFNLEKMLSKPENFKQLSSKMEFMTINGTTLRNLEILQNQT
DMKTKGSLLWVLDHTKTSFGRRKLKKWVTQPLLKLREINARLDAVSEVLHSESSVFGQIENHLRKLPDIERGLCSIYHKKCSTQEFFLIV
KTLYHLKSEFQAIIPAVNSHIQSDLLRTVILEIPELLSPVEHYLKILNEQAAKVGDKTELFKDLSDFPLIKKRKDEIQGVIDEIRMHLQE
IRKILKNPSAQYVTVSGQEFMIEIKNSAVSCIPTDWVKVGSTKAVSRFHSPFIVENYRHLNQLREQLVLDCSAEWLDFLEKFSEHYHSLC
KAVHHLATVDCIFSLAKVAKQGDYCRPTVQEERKIVIKNGRHPVIDVLLGEQDQYVPNNTDLSEDSERVMIITGPNMGGKSSYIKQVALI
TIMAQIGSYVPAEEATIGIVDGIFTRMGAADNIYKGQSTFMEELTDTAEIIRKATSQSLVILDELGRGTSTHDGIAIAYATLEYFIRDVK
SLTLFVTHYPPVCELEKNYSHQVGNYHMGFLVSEDESKLDPGAAEQVPDFVTFLYQITRGIAARSYGLNVAKLADVPGEILKKAAHKSKE

--------------------------------------------------------------
>86714_86714_5_SSBP2-MSH3_SSBP2_chr5_80733249_ENST00000515395_MSH3_chr5_79974746_ENST00000265081_length(transcript)=3785nt_BP=946nt
GAGGCTGTGATGGGTTGACAGGTGCGTGACAGTGGGAGCTGCTCTCGGCACAAGCATGTACGGCAAAGGCAAGAGTAACAGCAGCGCCGT
CCCGTCCGACAGCCAGGCCCGGGAGAAGTTAGCACTCTACGTATATGAATATCTGCTCCATGTAGGAGCTCAGAAATCAGCTCAAACATT
TTTATCAGAGATAAGATGGGAAAAAAACATCACATTGGGGGAACCACCAGGATTCTTACATTCTTGGTGGTGTGTATTTTGGGATCTCTA
CTGTGCAGCTCCAGAGAGACGTGAAACATGTGAACACTCAAGTGAAGCAAAAGCCTTCCATGATTACCCTTTTATGTCACCTCGGTACCC
TGGAGGTCCAAGGCCCCCATTGAGGATACCTAATCAGGCACTTGGAGGTGTCCCAGGAAGTCAGCCATTACTCCCCAGTGGAATGGATCC
AACTCGACAACAAGGACATCCAAATATGGGTGGGCCAATGCAGAGAATGACTCCTCCAAGAGGAATGGTGCCCTTAGGACCACAGTCTGA
CCCTTGGTTATCATTACAGAACTATGGAGGTGCAATGAGACCCCCACTGAATGCTTTAGGTGGCCCTGGAATGCCTGGAATGAACATGGG
TCCAGGTGGTGGTAGACCTTGGCCAAACCCAACAAATGCCAATTCAATACCATACTCCTCAGCATCTCCTGGGAATTATGTAGGTCCTCC
AGGAGGTGGAGGGCCACCAGGAACACCCATCATGCCTAGTCCAGCAGATTCAACCAACTCTGGTGATAACATGTATACTTTAATGAATGC
AGTACCTCCTGGACCTAACAGACCTAATTTTCCAATGGGTCCTGGGTCAGATGGTCCCATGGGTGGATTAGGAGGAATGGAGTCACATCA
CATGAATGGCTCTTTAGGCTCAGGAGATATGGACAGTATTTCCAAGGGAGTGCAGCCTGCCACAGGCGAGGTTGTGTTTGATAGTTTCCA
GGACTCTGCTTCTCGTTCAGAGCTAGAAACCCGGATGTCAAGCCTGCAGCCAGTAGAGCTGCTGCTTCCTTCGGCCTTGTCCGAGCAAAC
AGAGGCGCTCATCCACAGAGCCACATCTGTTAGTGTGCAGGATGACAGAATTCGAGTCGAAAGGATGGATAACATTTATTTTGAATACAG
CCATGCTTTCCAGGCAGTTACAGAGTTTTATGCAAAAGATACAGTTGACATCAAAGGTTCTCAAATTATTTCTGGCATTGTTAACTTAGA
GAAGCCTGTGATTTGCTCTTTGGCTGCCATCATAAAATACCTCAAAGAATTCAACTTGGAAAAGATGCTCTCCAAACCTGAGAATTTTAA
ACAGCTATCAAGTAAAATGGAATTTATGACAATTAATGGAACAACATTAAGGAATCTGGAAATCCTACAGAATCAGACTGATATGAAAAC
CAAAGGAAGTTTGCTGTGGGTTTTAGACCACACTAAAACTTCATTTGGGAGACGGAAGTTAAAGAAGTGGGTGACCCAGCCACTCCTTAA
ATTAAGGGAAATAAATGCCCGGCTTGATGCTGTATCGGAAGTTCTCCATTCAGAATCTAGTGTGTTTGGTCAGATAGAAAATCATCTACG
TAAATTGCCCGACATAGAGAGGGGACTCTGTAGCATTTATCACAAAAAATGTTCTACCCAAGAGTTCTTCTTGATTGTCAAAACTTTATA
TCACCTAAAGTCAGAATTTCAAGCAATAATACCTGCTGTTAATTCCCACATTCAGTCAGACTTGCTCCGGACCGTTATTTTAGAAATTCC
TGAACTCCTCAGTCCAGTGGAGCATTACTTAAAGATACTCAATGAACAAGCTGCCAAAGTTGGGGATAAAACTGAATTATTTAAAGACCT
TTCTGACTTCCCTTTAATAAAAAAGAGGAAGGATGAAATTCAAGGTGTTATTGACGAGATCCGAATGCATTTGCAAGAAATACGAAAAAT
ACTAAAAAATCCTTCTGCACAATATGTGACAGTATCAGGACAGGAGTTTATGATAGAAATAAAGAACTCTGCTGTATCTTGTATACCAAC
TGATTGGGTAAAGGTTGGAAGCACAAAAGCTGTGAGCCGCTTTCACTCTCCTTTTATTGTAGAAAATTACAGACATCTGAATCAGCTCCG
GGAGCAGCTAGTCCTTGACTGCAGTGCTGAATGGCTTGATTTTCTAGAGAAATTCAGTGAACATTATCACTCCTTGTGTAAAGCAGTGCA
TCACCTAGCAACTGTTGACTGCATTTTCTCCCTGGCCAAGGTCGCTAAGCAAGGAGATTACTGCAGACCAACTGTACAAGAAGAAAGAAA
AATTGTAATAAAAAATGGAAGGCACCCTGTGATTGATGTGTTGCTGGGAGAACAGGATCAATATGTCCCAAATAATACAGATTTATCAGA
GGACTCAGAGAGAGTAATGATAATTACCGGACCAAACATGGGTGGAAAGAGCTCCTACATAAAACAAGTTGCATTGATTACCATCATGGC
TCAGATTGGCTCCTATGTTCCTGCAGAAGAAGCGACAATTGGGATTGTGGATGGCATTTTCACAAGGATGGGTGCTGCAGACAATATATA
TAAAGGACAGAGTACATTTATGGAAGAACTGACTGACACAGCAGAAATAATCAGAAAAGCAACATCACAGTCCTTGGTTATCTTGGATGA
ACTAGGAAGAGGGACGAGCACTCATGATGGAATTGCCATTGCCTATGCTACACTTGAGTATTTCATCAGAGATGTGAAATCCTTAACCCT
GTTTGTCACCCATTATCCGCCAGTTTGTGAACTAGAAAAAAATTACTCACACCAGGTGGGGAATTACCACATGGGATTCTTGGTCAGTGA
GGATGAAAGCAAACTGGATCCAGGCGCAGCAGAACAAGTCCCTGATTTTGTCACCTTCCTTTACCAAATAACTAGAGGAATTGCAGCAAG
GAGTTATGGATTAAATGTGGCTAAACTAGCAGATGTTCCTGGAGAAATTTTGAAGAAAGCAGCTCACAAGTCAAAAGAGCTGGAAGGATT
AATAAATACGAAAAGAAAGAGACTCAAGTATTTTGCAAAGTTATGGACGATGCATAATGCACAAGACCTGCAGAAGTGGACAGAGGAGTT
CAACATGGAAGAAACACAGACTTCTCTTCTTCATTAAAATGAAGACTACATTTGTGAACAAAAAATGGAGAATTAAAAATACCAACTGTA
CAAAATAACTCTCCAGTAACAGCCTATCTTTGTGTGACATGTGAGCATAAAATTATGACCATGGTATATTCCTATTGGAAACAGAGAGGT
TTTTCTGAAGACAGTCTTTTTCAAGTTTCTGTCTTCCTAACTTTTCTACGTATAAACACTCTTGAATAGACTTCCACTTTGTAATTAGAA
AATTTTATGGACAGTAAGTCCAGTAAAGCCTTAAGTGGCAGAATATAATTCCCAAGCTTTTGGAGGGTGATATAAAAATTTACTTGATAT
TTTTATTTGTTTCAGTTCAGATAATTGGCAACTGGGTGAATCTGGCAGGAATCTATCCATTGAACTAAAATAATTTTATTATGCAACCAG
TTTATCCACCAAGAACATAAGAATTTTTTATAAGTAGAAAGAATTGGCCAGGCATGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAG
GCCAAGGTAGGCAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGCAAAACCCCATCTTTACTAAAAATATAAAGTAC

>86714_86714_5_SSBP2-MSH3_SSBP2_chr5_80733249_ENST00000515395_MSH3_chr5_79974746_ENST00000265081_length(amino acids)=1043AA_BP=297
MYGKGKSNSSAVPSDSQAREKLALYVYEYLLHVGAQKSAQTFLSEIRWEKNITLGEPPGFLHSWWCVFWDLYCAAPERRETCEHSSEAKA
FHDYPFMSPRYPGGPRPPLRIPNQALGGVPGSQPLLPSGMDPTRQQGHPNMGGPMQRMTPPRGMVPLGPQSDPWLSLQNYGGAMRPPLNA
LGGPGMPGMNMGPGGGRPWPNPTNANSIPYSSASPGNYVGPPGGGGPPGTPIMPSPADSTNSGDNMYTLMNAVPPGPNRPNFPMGPGSDG
PMGGLGGMESHHMNGSLGSGDMDSISKGVQPATGEVVFDSFQDSASRSELETRMSSLQPVELLLPSALSEQTEALIHRATSVSVQDDRIR
VERMDNIYFEYSHAFQAVTEFYAKDTVDIKGSQIISGIVNLEKPVICSLAAIIKYLKEFNLEKMLSKPENFKQLSSKMEFMTINGTTLRN
LEILQNQTDMKTKGSLLWVLDHTKTSFGRRKLKKWVTQPLLKLREINARLDAVSEVLHSESSVFGQIENHLRKLPDIERGLCSIYHKKCS
TQEFFLIVKTLYHLKSEFQAIIPAVNSHIQSDLLRTVILEIPELLSPVEHYLKILNEQAAKVGDKTELFKDLSDFPLIKKRKDEIQGVID
EIRMHLQEIRKILKNPSAQYVTVSGQEFMIEIKNSAVSCIPTDWVKVGSTKAVSRFHSPFIVENYRHLNQLREQLVLDCSAEWLDFLEKF
SEHYHSLCKAVHHLATVDCIFSLAKVAKQGDYCRPTVQEERKIVIKNGRHPVIDVLLGEQDQYVPNNTDLSEDSERVMIITGPNMGGKSS
YIKQVALITIMAQIGSYVPAEEATIGIVDGIFTRMGAADNIYKGQSTFMEELTDTAEIIRKATSQSLVILDELGRGTSTHDGIAIAYATL
EYFIRDVKSLTLFVTHYPPVCELEKNYSHQVGNYHMGFLVSEDESKLDPGAAEQVPDFVTFLYQITRGIAARSYGLNVAKLADVPGEILK

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for SSBP2-MSH3


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with
TgeneMSH3chr5:80733249chr5:79974746ENST0000026508162475_297391.01138.0EXO1


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for SSBP2-MSH3


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for SSBP2-MSH3


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource