FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:INO80-THSD4 (FusionGDB2 ID:39771)

Fusion Gene Summary for INO80-THSD4

check button Fusion gene summary
Fusion gene informationFusion gene name: INO80-THSD4
Fusion gene ID: 39771
HgeneTgene
Gene symbol

INO80

THSD4

Gene ID

54617

79875

Gene nameINO80 complex ATPase subunitthrombospondin type 1 domain containing 4
SynonymsINO80A|INOC1ADAMTSL-6|ADAMTSL6|FVSY9334|PRO34005
Cytomap

15q15.1

15q23

Type of geneprotein-codingprotein-coding
Descriptionchromatin-remodeling ATPase INO80DNA helicase INO80DNA helicase-related INO80 complex homolog 1DNA helicase-related protein INO80INO80 complex subunit Ahomolog of yeast INO80putative DNA helicase INO80 complex homolog 1thrombospondin type-1 domain-containing protein 4A disintegrin and metalloproteinase with thrombospondin motifs-like protein 6ADAMTS-like protein 6thrombospondin, type I, domain containing 4
Modification date2020031320200313
UniProtAcc

Q9ULG1

.
Ensembl transtripts involved in fusion geneENST00000361937, ENST00000401393, 
ENST00000561244, 
ENST00000567838, 
ENST00000261862, ENST00000355327, 
ENST00000357769, 
Fusion gene scores* DoF score10 X 11 X 6=66030 X 19 X 13=7410
# samples 1131
** MAII scorelog2(11/660*10)=-2.58496250072116
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(31/7410*10)=-4.57913342191896
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: INO80 [Title/Abstract] AND THSD4 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointINO80(41313098)-THSD4(72020888), # samples:3
Anticipated loss of major functional domain due to fusion event.INO80-THSD4 seems lost the major protein functional domain in Hgene partner, which is a epigenetic factor due to the frame-shifted ORF.
INO80-THSD4 seems lost the major protein functional domain in Hgene partner, which is a essential gene due to the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID

check buttonFusion gene breakpoints across INO80 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across THSD4 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4OVTCGA-29-1702-01AINO80chr15

41313098

-THSD4chr15

72020888

+


Top

Fusion Gene ORF analysis for INO80-THSD4

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-3UTRENST00000361937ENST00000567838INO80chr15

41313098

-THSD4chr15

72020888

+
5CDS-3UTRENST00000401393ENST00000567838INO80chr15

41313098

-THSD4chr15

72020888

+
Frame-shiftENST00000361937ENST00000261862INO80chr15

41313098

-THSD4chr15

72020888

+
Frame-shiftENST00000361937ENST00000355327INO80chr15

41313098

-THSD4chr15

72020888

+
Frame-shiftENST00000361937ENST00000357769INO80chr15

41313098

-THSD4chr15

72020888

+
In-frameENST00000401393ENST00000261862INO80chr15

41313098

-THSD4chr15

72020888

+
In-frameENST00000401393ENST00000355327INO80chr15

41313098

-THSD4chr15

72020888

+
In-frameENST00000401393ENST00000357769INO80chr15

41313098

-THSD4chr15

72020888

+
intron-3CDSENST00000561244ENST00000261862INO80chr15

41313098

-THSD4chr15

72020888

+
intron-3CDSENST00000561244ENST00000355327INO80chr15

41313098

-THSD4chr15

72020888

+
intron-3CDSENST00000561244ENST00000357769INO80chr15

41313098

-THSD4chr15

72020888

+
intron-3UTRENST00000561244ENST00000567838INO80chr15

41313098

-THSD4chr15

72020888

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000401393INO80chr1541313098-ENST00000355327THSD4chr1572020888+11207349822451971657
ENST00000401393INO80chr1541313098-ENST00000261862THSD4chr1572020888+11207349822451971657
ENST00000401393INO80chr1541313098-ENST00000357769THSD4chr1572020888+6450349822451971657

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000401393ENST00000355327INO80chr1541313098-THSD4chr1572020888+0.0005605230.9994394
ENST00000401393ENST00000261862INO80chr1541313098-THSD4chr1572020888+0.0005605230.9994394
ENST00000401393ENST00000357769INO80chr1541313098-THSD4chr1572020888+0.0016825270.9983175

Top

Fusion Genomic Features for INO80-THSD4


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
INO80chr1541313097-THSD4chr1572020887+4.91E-070.9999995
INO80chr1541313097-THSD4chr1572020887+4.91E-070.9999995

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.
genomic feature of top 1%

Top

Fusion Protein Features for INO80-THSD4


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr15:41313098/chr15:72020888)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
INO80

Q9ULG1

.
FUNCTION: ATPase component of the chromatin remodeling INO80 complex which is involved in transcriptional regulation, DNA replication and DNA repair (PubMed:16230350, PubMed:16298340, PubMed:17721549, PubMed:20855601, PubMed:20237820). Binds DNA (PubMed:16298340, PubMed:21303910). As part of the INO80 complex, remodels chromatin by shifting nucleosomes (PubMed:16230350, PubMed:21303910). Regulates transcription upon recruitment by YY1 to YY1-activated genes, where it acts as an essential coactivator (PubMed:17721549). Involved in UV-damage excision DNA repair (PubMed:20855601). The contribution to DNA double-strand break repair appears to be largely indirect through transcriptional regulation (PubMed:20687897). Involved in DNA replication (PubMed:20237820). Required for microtubule assembly during mitosis thereby regulating chromosome segregation cycle (PubMed:20237820). {ECO:0000269|PubMed:16230350, ECO:0000269|PubMed:16298340, ECO:0000269|PubMed:17721549, ECO:0000269|PubMed:20237820, ECO:0000269|PubMed:20687897, ECO:0000269|PubMed:20855601, ECO:0000269|PubMed:21303910}.FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneINO80chr15:41313098chr15:72020888ENST00000361937-2636280_40510911557.0DomainDBINO
HgeneINO80chr15:41313098chr15:72020888ENST00000361937-2636530_70110911557.0DomainHelicase ATP-binding
HgeneINO80chr15:41313098chr15:72020888ENST00000401393-2637280_40510911872.0DomainDBINO
HgeneINO80chr15:41313098chr15:72020888ENST00000401393-2637530_70110911872.0DomainHelicase ATP-binding
HgeneINO80chr15:41313098chr15:72020888ENST00000361937-2636543_55010911557.0Nucleotide bindingATP
HgeneINO80chr15:41313098chr15:72020888ENST00000401393-2637543_55010911872.0Nucleotide bindingATP
HgeneINO80chr15:41313098chr15:72020888ENST00000361937-26361_26610911557.0RegionAssembles INO80 complex module with putative regulatory components INO80E%2C INO80F%2C UCHL5%2C NFRKB%2C MCRS1 and IN80D
HgeneINO80chr15:41313098chr15:72020888ENST00000361937-2636212_52610911557.0RegionAssembles INO80 complex module consisting of conserved components ACTR8%2C ACTL6A and YY1
HgeneINO80chr15:41313098chr15:72020888ENST00000401393-26371_26610911872.0RegionAssembles INO80 complex module with putative regulatory components INO80E%2C INO80F%2C UCHL5%2C NFRKB%2C MCRS1 and IN80D
HgeneINO80chr15:41313098chr15:72020888ENST00000401393-2637212_52610911872.0RegionAssembles INO80 complex module consisting of conserved components ACTR8%2C ACTL6A and YY1
TgeneTHSD4chr15:41313098chr15:72020888ENST00000261862617676_7374521019.0DomainTSP type-1 2
TgeneTHSD4chr15:41313098chr15:72020888ENST00000261862617739_7924521019.0DomainTSP type-1 3
TgeneTHSD4chr15:41313098chr15:72020888ENST00000261862617793_8514521019.0DomainTSP type-1 4
TgeneTHSD4chr15:41313098chr15:72020888ENST00000261862617852_9114521019.0DomainTSP type-1 5
TgeneTHSD4chr15:41313098chr15:72020888ENST00000261862617912_9684521019.0DomainTSP type-1 6
TgeneTHSD4chr15:41313098chr15:72020888ENST00000261862617971_10084521019.0DomainPLAC
TgeneTHSD4chr15:41313098chr15:72020888ENST00000355327718676_7374521019.0DomainTSP type-1 2
TgeneTHSD4chr15:41313098chr15:72020888ENST00000355327718739_7924521019.0DomainTSP type-1 3
TgeneTHSD4chr15:41313098chr15:72020888ENST00000355327718793_8514521019.0DomainTSP type-1 4
TgeneTHSD4chr15:41313098chr15:72020888ENST00000355327718852_9114521019.0DomainTSP type-1 5
TgeneTHSD4chr15:41313098chr15:72020888ENST00000355327718912_9684521019.0DomainTSP type-1 6
TgeneTHSD4chr15:41313098chr15:72020888ENST00000355327718971_10084521019.0DomainPLAC

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneINO80chr15:41313098chr15:72020888ENST00000361937-26361105_126010911557.0DomainHelicase C-terminal
HgeneINO80chr15:41313098chr15:72020888ENST00000401393-26371105_126010911872.0DomainHelicase C-terminal
HgeneINO80chr15:41313098chr15:72020888ENST00000361937-2636521_155610911557.0RegionAssembles INO80 complex module consisting of conserved components INO80B%2C INO80C%2C ACTR5%2C RVBL1%2C RVBL2
HgeneINO80chr15:41313098chr15:72020888ENST00000401393-2637521_155610911872.0RegionAssembles INO80 complex module consisting of conserved components INO80B%2C INO80C%2C ACTR5%2C RVBL1%2C RVBL2
TgeneTHSD4chr15:41313098chr15:72020888ENST0000026186261753_3074521019.0DomainTSP type-1 1
TgeneTHSD4chr15:41313098chr15:72020888ENST0000035532771853_3074521019.0DomainTSP type-1 1


Top

Fusion Gene Sequence for INO80-THSD4


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>39771_39771_1_INO80-THSD4_INO80_chr15_41313098_ENST00000401393_THSD4_chr15_72020888_ENST00000261862_length(transcript)=11207nt_BP=3498nt
GGTCCCAGGAGCCGCGGAGGGAGCGAGCTAGGAGCGTCCACGCCCAACGCAGTCACCGTCCCACGGCCTCAGAGAGCGAACCGCGGCTCC
ACCGTCGGCGGGGCGACCCCCCCCTCCGGACCCCGCCCGCACCCCGCCCCCCCTCCGCCGCCGTCGCGGCGGCGGGGCCAGGCGGCCCGA
GCCGTGCAGTCGGAGGTCCTTGTGCATGAAGACAGATTTGTTCTATGGCCTCGGAGTTGGGTGCCAGGGATGATGGAGGCTGCACTGAGC
TGGCAAAGCCCCTCTATCTTCAGTACTTGGAGAGGGCCCTCCGGTTGGACCATTTTCTGCGACAAACGTCAGCTATCTTCAATAGGAATA
TTTCTAGTGATGACAGTGAAGATGGACTGGATGACAGTAATCCATTATTGCCCCAGTCTGGGGATCCCTTAATACAAGTTAAGGAAGAAC
CTCCAAATTCATTGCTTGGTGAAACTTCTGGAGCAGGCAGTTCTGGAATGTTAAACACATATTCTCTGAATGGAGTTCTACAGTCAGAAT
CAAAATGTGATAAGGGGAATTTATATAATTTCTCTAAGCTGAAGAAAAGCAGAAAGTGGCTAAAGAGCATTCTGCTAAGTGATGAATCCA
GCGAGGCTGATTCTCAGAGTGAAGACGATGATGAAGAAGAACTCAATCTCAGCAGAGAAGAACTTCACAACATGCTTCGACTACACAAAT
ATAAGAAACTTCACCAAAATAAGTATAGTAAAGACAAGGAGTTGCAGCAATATCAGTACTACAGTGCAGGCCTGCTCTCCACATATGACC
CTTTCTATGAGCAACAACGGCACCTACTTGGACCCAAGAAAAAGAAATTTAAGGAGGAAAAGAAACTTAAAGCTAAGTTGAAAAAAGTGA
AGAAAAAAAGACGAAGAGATGAAGAACTTTCCTCTGAAGAATCCCCTCGTCGCCATCACCACCAGACCAAAGTCTTTGCCAAGTTTTCTC
ACGATGCACCTCCCCCTGGCACTAAGAAAAAGCACTTATCCATTGAGCAGCTGAATGCTCGTCGCAGGAAAGTATGGCTCAGCATTGTGA
AAAAGGAACTACCAAAGGCAAATAAGCAGAAAGCTTCAGCTCGTAACCTGTTTCTCACCAATAGCCGAAAGCTTGCTCACCAGTGCATGA
AGGAGGTGCGTCGAGCTGCCTTGCAGGCCCAGAAGAACTGTAAGGAAACCTTGCCTCGTGCCCGCCGCCTCACCAAGGAGATGCTTCTGT
ACTGGAAGAAATATGAGAAAGTAGAGAAGGAGCACCGCAAGAGAGCAGAGAAGGAAGCTTTGGAGCAGCGGAAGTTGGATGAGGAAATGC
GGGAGGCCAAGAGGCAACAGCGAAAACTCAACTTCTTAATTACCCAGACAGAGTTGTATGCCCATTTCATGAGTCGCAAACGAGATATGG
GTCATGATGGTATCCAGGAAGAAATCCTAAGGAAACTGGAAGACAGTTCTACCCAGAGACAAATCGATATAGGTGGAGGAGTGGTAGTTA
ACATCACACAGGAGGATTATGATAGTAACCATTTTAAAGCCCAGGCCCTGAAGAATGCTGAAAATGCTTACCATATTCACCAAGCTCGGA
CAAGGTCATTTGATGAAGATGCAAAAGAAAGTCGAGCAGCTGCCCTACGGGCAGCAAACAAGTCTGGCACTGGGTTTGGGGAGAGTTATA
GCCTGGCTAACCCATCTATCCGGGCTGGTGAGGATATTCCACAGCCCACAATTTTTAATGGCAAATTGAAAGGTTATCAACTGAAAGGCA
TGAATTGGTTGGCCAATCTATATGAACAGGGTATTAATGGCATTCTTGCTGATGAAATGGGCCTTGGTAAAACAGTACAGAGCATTGCCC
TTCTGGCCCATCTGGCTGAGAGAGAGAACATTTGGGGACCTTTCTTAATAATTTCACCTGCGTCTACACTTAACAATTGGCACCAGGAGT
TTACTAGATTTGTTCCTAAATTTAAGGTGCTACCATATTGGGGAAATCCTCATGATAGAAAAGTCATCAGAAGGTTCTGGAGTCAGAAGA
CCCTCTACACTCAGGATGCCCCCTTCCATGTGGTTATTACCAGCTATCAGCTGGTGGTGCAGGATGTAAAGTATTTCCAGCGGGTCAAGT
GGCAATACATGGTACTGGATGAGGCTCAGGCGCTCAAGAGTAGTTCCAGTGTTCGTTGGAAGATCCTCTTACAGTTCCAGTGTCGGAATC
GGCTTTTGCTAACCGGGACCCCAATTCAGAACACCATGGCAGAGCTTTGGGCTCTGCTGCATTTCATTATGCCAACATTATTTGATTCAC
ATGAGGAATTTAATGAATGGTTTTCCAAGGACATTGAGAGCCATGCCGAAAACAAATCTGCTATTGATGAGAATCAACTTTCTCGCTTAC
ACATGATTTTGAAGCCATTTATGCTGAGGAGAATCAAGAAAGATGTGGAAAATGAATTATCTGACAAGATTGAGATTCTAATGTATTGCC
AACTGACCAGCCGACAGAAGCTGCTATATCAGGCACTAAAGAACAAAATTTCCATTGAGGATTTATTGCAGTCTTCTATGGGCTCTACCC
AACAAGCACAGAACACCACCAGCAGCCTCATGAATCTGGTCATGCAGTTTAGGAAGGTGTGTAATCACCCGGAGTTATTTGAACGGCAAG
AAACTTGGTCTCCATTTCATATTTCCCTAAAGCCATACCACATTTCAAAGTTTATCTACCGTCATGGACAGATCAGGGTCTTCAATCATT
CACGAGACAGGTGGTTAAGGGTTCTTTCTCCATTTGCACCAGACTATATCCAACGGTCTCTCTTTCACAGAAAAGGTATTAATGAAGAAA
GCTGTTTCTCTTTCCTTCGCTTTATTGATATATCTCCAGCAGAAATGGCAAACCTTATGCTTCAGGGACTTTTGGCCAGATGGTTAGCTC
TTTTCCTGTCTCTGAAAGCCTCCTACAGGCTCCATCAGCTACGCTCCTGGGGAGCGCCAGAAGGGGAGAGCCACCAGAGATACCTGAGGA
ACAAGGATTTCCTTCTTGGGGTTAATTTTCCACTCTCCTTTCCAAACCTTTGCAGCTGCCCTTTGTTAAAGTCTCTTGTTTTCAGCAGCC
ACTGTAAAGCAGTGAGTGGCTACTCAGACCAGGTTGTCCATCAGCGGAGATCAGCTACCTCCTCGCTGCGTCGCTGCCTGCTCACTGAGC
TGCCATCTTTTTTGTGTGTGGCCAGTCCACGAGTTACCGCAGTGCCATTGGATTCTTACTGCAATGACCGAAGTGCAGAATATGAAAGGC
GAGTTCTGAAGGAAGGAGGGAGTCTGGCAGCCAAGCAGTGTTTGTTGAATGGGGCCCCTGAACTGGCTGCAGACTGGCTAAATAGACGAT
CACAGTTCTTCCCAGAGCCAGCTGGAGGTCTGTGGAGCATCAGACCTCAGAATGGCTGGTCTTTCATCAGGATTCCAGCCCTGAGAAGTC
GTTCTGGACGCTCCATCATCAATGGGAACTGGGCAATTGATCGACCAGGAAAATACGAGGGCGGAGGGACCATGTTCACCTACAAGCGTC
CAAATGAGATTTCGAGCACTGCCGGAGAGTCCTTTTTGGCGGAAGGTCCCACCAACGAGATCTTGGATGTCTACATGATACACCAGCAGC
CAAACCCAGGCGTGCACTACGAGTACGTGATCATGGGGACCAACGCCATCAGCCCCCAGGTGCCACCCCACAGGAGACCAGGGGAACCCT
TCAATGGCCAGATGGTGACAGAAGGCAGGAGCCAGGAGGAGGGAGAACAGAAAGGGAGGAACGAGGAGAAGGAAGACTTGCGTGGGGAGG
CCCCTGAGATGTTCACCTCAGAATCGGCACAGACCTTCCCAGTCAGGCATCCAGACAGATTTTCTCCCCATCGACCGGACAACTTGGTGC
CACCAGCACCGCAGCCCCCACGGCGCAGCCGGGATCACAACTGGAAGCAGCTTGGGACAACAGAATGTTCCACGACCTGTGGGAAAGGAT
CGCAGTACCCTATTTTCCGCTGTGTGCACAGAAGCACTCATGAAGAGGCTCCTGAGAGTTACTGTGACTCCAGCATGAAGCCGACCCCCG
AGGAGGAGCCCTGCAACATCTTCCCTTGCCCAGCCTTCTGGGACATCGGGGAGTGGTCTGAGTGCAGCAAGACCTGTGGCCTGGGCATGC
AGCACCGCCAGGTTCTGTGCCGCCAGGTGTACGCCAACCGCAGCCTGACGGTGCAGCCCTACCGCTGCCAGCACCTGGAGAAACCTGAGA
CCACCAGCACCTGCCAACTCAAGATCTGCAGCGAGTGGCAGATCCGGACCGACTGGACCTCGTGCTCGGTGCCCTGCGGCGTGGGACAGA
GGACCCGTGATGTGAAGTGTGTGAGCAACATTGGGGATGTGGTTGACGATGAGGAATGCAACATGAAGCTCCGGCCGAATGACATTGAGA
ACTGCGACATGGGACCCTGTGCCAAGAGCTGGTTCCTCACCGAGTGGAGCGAAAGGTGCTCAGCGGAGTGTGGGGCCGGAGTGCGGACAC
GCTCGGTGGTGTGCATGACCAACCATGTCAGCAGCCTGCCCCTGGAGGGCTGTGGGAACAACCGGCCGGCAGAGGCCACCCCATGTGACA
ACGGACCCTGCACGGGCAAGGTGGAGTGGTTTGCCGGGAGCTGGAGTCAGTGTTCCATCGAGTGTGGGAGCGGGACGCAACAGAGGGAGG
TGATTTGTGTTAGAAAGAATGCAGACACCTTTGAAGTGTTGGACCCCTCTGAATGTTCTTTCCTGGAGAAACCCCCCAGCCAGCAATCCT
GCCACCTCAAGCCTTGCGGAGCCAAATGGTTTAGCACCGAATGGAGCATGTGTTCCAAGAGCTGCCAGGGTGGCTTTCGGGTCCGGGAAG
TGCGGTGTCTGTCTGATGACATGACTCTAAGTAACCTCTGTGACCCTCAGTTGAAACCAGAAGAGAGAGAATCTTGTAACCCTCAGGACT
GTGTCCCTGAAGTTGATGAAAACTGCAAGGACAAGTACTACAACTGCAACGTGGTGGTCCAGGCAAGACTCTGTGTCTACAACTACTACA
AGACCGCCTGCTGTGCCTCCTGCACCCGTGTGGCCAACAGGCAGACGGGCTTCCTGGGGAGCAGATAACACTCCTGCACCCCCATCAGTA
GGGCAGCATCACTGCCTTCCCGGGGGCTTCAGCAGTGCGCCTGGCTGGCTGCTGCTCCACCACGGGCCCCCTGGCCCAGGCGCTGCCAAC
CAACTTAGTCACCACCCCTGCCTCCGGTGAATGCACCCCGTGGTACCCAGGGGCTTTTTACACAAGATGTTTGAAAGCCACAGTCAGTCC
TTTAAGCATCACCATGTACTGATGATCCCCTCCTTGGACCTGGCATCTGCTAATGGTGCCCTTTGAAAGTCAAGCAGTGGGAAGTACATG
GAGCTCTCAGCCCTGCTCCCATCTGGCACCTTCAAGTCAGCAGATGGGCCACTGACTGAGCACTGCCCCGTCCCTGGTGCTACTGGTCTT
TCTAAACTTAGCACCCTGGAGAGTCCAAGGAGGCAGCGCCCCCAACCCAGCGCCCCACTAAGCCTTGCTGACACGCGTGCATCCCTCTGT
GACCTCAGCCCAGATGTGCCTGTTTTCATTCTCAAAGACATTAGACTGTTTTCCTGCCCTATGACACAGATAGCTCACATGAATATTGTG
CTTTATTTAGCAGGTGTACTCACAGATACTAGCTCCTTAGCAGCTCACAACATCCCAGAATGGGAGGCAGGGGGTGACTCATTATCCCCA
TTTTACTGACAGGGAAACTGAGGCTCAACTTAAGTAATTGACCTGCCAGGTATATTCACCCATCCAGTGGAAGAGCTGAGTCCCCGCCCC
AGTCATCTACCAGTATCCAGCCTGGGGCCTGTACTTAGATGTGAAAGGTGCTGCTTCATTTCTGACCAAGAGACTGAGAAGTTTCCCAGA
ATGCAAACAAAGCCCAGGCCCCTGAAATCTTTCCGGTCAAGCCTTTATCCCAGCACTCAGTTGTTTTGGATGTCTGTTCCTACTTGCCCT
TACCCCCAAAGTTACAGATCCTAGTTACAGGACTCTGCCAGCTTTGTTAAACTGTCCGTGAGACAAGAAAGCCATTGGGGAAACCAGGTG
ATTGCCTGAAATTCTTACTCCGTTCCAAGTGCTGTTCCTCCCAGGAAATCAAAGGCCAGGGTCCTTATGGCCGTGGAGCCTTCCCGACCA
CAGAGCCAACTTGTGAAGCACACAGCTCTGCAGCCTGGGCTCTGCCCTGCCTCAGCCGCCTCCCCCACGCTCTTCACCACGTTCCTGGAG
AGTCCGGCCAACCTGTCCCAGCCGAAACACTGCTGTATTAGAAAAAGTCTCTTTCTGGTCTTTCTGGTTTTGTTTATGAATTTCCCTCTG
TGGCCACAAATTCCTCCCCTCCCCCATGACTCACAGTCCATATGGCCCACCCCCAGACTTGAGCACCAAGCTCTGCATTAATGCAGTTGG
CCTGCGACAAGGAGCTGTGGACCCTTCCCCATCTCTTCCAATTCACTTTCCCCAACTATCCAGTTCCAGAGGCCGCAGGCCTGGAAGGAT
GCAGTGCATATTGAAAGGTGGACCCTCTGAAAACAGTTAAGAGGAATATATGTATGTTTTACCCATTAAGAAAAAATGGCAAGCTAAACA
AATGTTAAACTTACAGAAAATTTGTCTTATGGTCCTGAGCATATTTCCCTTTTAGAGCAAGCCTGGATTCTTAGCAAAGTGTTTCCCCCA
TTTGCTCTTTTAGCTGACAAATCTGCCACTGTGATGATGGTTTGCAGCTTTTGGAAGCAGTATGGCAACCTGGCCTGACATGCTCTTTAG
GCTTCCACTAACCTGGGGCTTTCAGAAATTCTATTTGGCCTTTCTGTGGGTAGCTTTCCAGCTTCTCTTCTAGGGAGCCCCAGGCATCAT
TTCCCAAAAGCATCCCCATCTCCTGATTCTCTTGGAACTCCTACAGATAAGCATCCTGGCAGAGGCCCAGGCTCCCAAACCGACAAAGTG
AAAAGAGACCAGAGAGGCCAAGCATATTGACTGGTGCTGTTCAGGGCCTGCTCTTTTCCACTCACCACTTGTTTTGCTGCTTGTCACGAG
GAGAGTTGTTCCTGTATGTGGCTGCTCTCAGATCTTTCCAAGCAAGCCAGTCATTTGAAGAGGTTTTCTTTTCATGCTGGAGGGCAGGCT
AAGATCAATGAGTGGAAGAGAGAAAGGCTGTTTTAGCTCAAGTTAAAGGAACACCTTCTAGCCATCAAAGCCGCCCAACAGAGGCAAGGG
CCACCACACATGAGAGAGCGCTCTGTCCTTAAAGGGAATTCTCTGTTGAGTGGGAGGTGAACACCCTGGTTCTTCCAACTCAGGAATTCT
CGTGGCTGGGCTGGGTCAGTGATGGCTTTGTCTCTTTATGTCTAAAGTGCCCTATGGCTGCTGAAGGTTACCTAACCATTCTTTAAAAGG
AGAATGACCCTCCATGGGAATGGCCAGCCTGCCAACTGTGCAATTGAAGAAGACCCGATGGATCAACCCCATGTCTCCCTTGGGGAGAAA
GTGCATAAACCAGGGGTCTCTTTTTTTTTTTCAACAAACCATTGAGCTGTTCTTGGAGTTCATCTCTGGAGAGGTTATACATTATTAGAA
GTTTGATTATTATTATAGTTTGATCAATTTATTTGTCTTAGAGATCCAATTTTTACTAATTCCCTAGTTTTTTATTTCAGCATCTGAATG
TCTTTCTCCCTAGCACAGTGCATACAATCAGGGCCTTGGGTATTTCCAGTGATAACTTTCCTTGGAGAGGATCTAAGAAAAGCCCAGATT
TCGGTAGCCATCTCCCTCCAAATATGTCTCTTTCTGCTTTCTTAGTGCCCATTATTTCCCCCTCTCCTTTCTTCTGTCACTGCCATCTCC
TTCTTGGTCTTCCCATTGTTCTTTAACTGGCCGTAATGTGGAATTGATATTTACATTTTGATACGGTTTTTTTCTTGGCCTGTGTACGGG
ATTGCCTCATTTCCTGCTCTGAATTTTAAAATTAGATATTAAAGCTGTCATATGGTTTCCTCACAAAAGTCAACAAAGTCCAAACAAAAA
TAGTTTGCCGTTTTACTTTCATCCATTGAAAAAGGAAATTGTGCCTCTTGCAGCCTAGGCAAAGGACATTTAGTACTATCGATTCTTTCC
ACCCTCACGATGACTTGCGGTTCTCTCTGTAGAAAAGGGATGGCCTAAGAAATACAACTAAAAAAACAAACAAAAACACCAAAAGAAAAA
AAAAAGCCATTTAAAGCCAGCCACTAGAGGGAGTCAGTTCAGTTCCGTAAAGGTATGCTCAGTGCCCGCTGCCTGCAAGCTGTTGGGGAC
CCCAGGGAGGGCAAGGCAGCCTGTCCCCGCCCCCAGGGAACTAGAACATGACAAGAATTCTCCGCACTGTGCCTACCTGTCCCTTTACCT
TACCTCTCTGGCCCAGAGTTCTTGGAGGGTTTTTTCTTTATTTTCTTATGTACTCATCTACTTATTCTCAAAGTATTTAGCATTCAACAC
TCTTTTTGCTTTAAAAAGAATGGCCTTACAAAGGGACAGAAAAGAGAAGACACGAGCTTGGTGTATTTTCATCAAGTTATGTGGCAGAGA
AATCCAGATATTACCAGGACCTGTCTAAACAAATGTTGTGGGTTTTCTTTTCATTCGGATAGCCACTTTATAGTTGGAATATCAATTCTA
ATGAGGAGGAAGACATAAATATAAGTGGTAAAAAGAAACATGACTTCCCTTAAAACAGGCTGGATAATCTATATCAGCCTTGTGGGTGGA
GACTAGTATTTGATCCTTGCCATATAAAACATTTTAATATGGTTTACATGGGAAAATATCGATGGCTTCCTCACAAAATGTATGGGTGAC
GTGAAGTTGAAGAGCCAATGGCTTGGGTGACACGTGCTGGATCCAAAAAGATCAGGGAGACTAGAATAAAACTTGGATGTTAAAAATTCA
CCAGGAATCCACATAAAGTACTATATTTGGGCTAAAATGAAAAACTAAATACAAGGTGGGAGAGAGGCAAGAATTTCAGTTGACTAAGCT
CAGTGTGAGTCAAAGTGGGATGGAACCATGCAAAAACAAAACCCACAGACATGCAGGCTACGTGAGGAGAAAACAGTGGTGAGGATCACA
TCACATTGTGTTTGCATTTGCCGGAACCATACTTTAAGAAGAAAACCGATCATCTATAATAACATCAGTTTATCAATGCCCCGTCCTGAT
GAAGTGTGCAGACTCTCAGAAACAGCAGGAAGGACTTCATGAGAACCCTCAGGCTGGAGAAGGCACTAGGGCACAAGGAGAGCTCTCCTA
GGACCAGGACCAAGAAGCTACAGGCAGGCACAGTTTAGCTCCTGCAGAGACCCAGCTTTTCACAAGTTGGAGCCTTCCAGAGATAGAGGG
ACTGTGGTAGGTGGTGACCCACCCATCACTGGAGGTGGAAGCAGAGGCCGTTTGCCAGGGATGCTGGAGAGGGGATTCAAGCATCTGGCT
GGGCAACGTGATGCTCAGGGCCGTCTCCACTCAGGGCTTAGGGGAGTCTGTGAGTAGAAGAGCTTTAGGTGATTTGTTTGGTGGGGGAAG
GCAAGTACACAGCTATGCACTTTCCGTTTCTGACTTTTGCCACCCTGTCAGCCATGGGGAGCCCACTGTGGGACTGAAACCCTGAGCTGA
ATGCGGCCTCATGTCTCAGAGAAACACTGGCAAGTTGGTCAGAGCCGCCGTCTGCATCGAGGCGTAGCTGAGCGGCAGGATGGGGGGCTG
CCTGCCCAGGGTCTCTCACCGTGGTGTAAGCAGAGCCATGGCTTGCCTAGGACCCTATAGATACCATCACTCTTTCTCAGCTCGACTGGA
GTTTCTGCACCTTTGCAGGGGCAAAGTAACTCCCTGCACCCTGAACCACCCCCCATTCCTGTTCATTTCAGCAGATAATGATGGAGGGGG
GGGGTGTCCATCGTGCTGAGGGTGTGACCGCAAGAGGGTGAAAACTTCCAGCCAACTTTCTCAGTCCTTTCTCTTGCGAGAGGGAAGCCA
CCTGCTATACAAACTAATACCCCCTGCCTTGACCCCTTCCCCACGACTCAGTTGACAGAAGGATATACTTTGTTATAACTTATTATTTTG
TTCTCTGTAAATACAAGATGTTTATAGGAAATATGTATTCTGAACTCTATCTGCAGAATGAGTCACTACACCAAAATAGTTCTATTATTT
AGAATGTGTTAATTTTAAAGGGACCTGATAGGTATTTATTTACATATGCGATCCACATTTGTGTGAAAGCATGTGATCATACTAACCCAG
CCTCCTGGAATGTCGCTGTACGATGATTGATGTCTTTTTCTCAGTCCATAGTTACAATTGTTTAGTATGCTAATCAGTCCAGTTCCCTGA
GGTTTAAGATCAAATATAAATTACTCTGCTTTTCGACTCATTCAGGTAGCATTGTACCTGAACCTGATTGCTACTTTTTCATCTTAAATA
TTATATTTCCTCATCTAATCTGCCTTCCCCTCATCCACAGACATTTGGAGAAGGAAATGGGAGGGTGTCTGTTATCCCTTTCTCTTTGCT
TTGTCCCCGTTGTTAGACTGGCAGCGTCAGTTGCTCGGTGGGCTTGGTTAGAGCCGTGGGTGAGGCAGGTGGCTGGCGGGGACAGGGAGA
GGCTGAGAGGGAAGTGGTGGCATTTACTGCTCTGACACTTCCACTGTCCCTGCTGGGGATGCTGGGGCCAAGGCCTGTGGGGCCTGTGAA
CTGCACAGCCAGGAGCAAGGAACCCACTAAATACTCCGTCACCTCCATGTCCCCTCTACAGTGTTAAATTATTACATAAGCAGGTGAAAG
GTAGAAGGCGAATTATGTGAGTAAATATGGTCTGTTTTCTCTTCAGCAAAAATGACTATTTTTGTGTGTGACTAATTTATTTTTATTATT
GTAAAGATACAATAAACCGGTTGAAATATCTGCTTTGTTGACAAGCGTGTGCTTTCTCTGGCCTTATTCGCGTTCTGTTCTCCTGCAAAT

>39771_39771_1_INO80-THSD4_INO80_chr15_41313098_ENST00000401393_THSD4_chr15_72020888_ENST00000261862_length(amino acids)=1657AA_BP=1091
MASELGARDDGGCTELAKPLYLQYLERALRLDHFLRQTSAIFNRNISSDDSEDGLDDSNPLLPQSGDPLIQVKEEPPNSLLGETSGAGSS
GMLNTYSLNGVLQSESKCDKGNLYNFSKLKKSRKWLKSILLSDESSEADSQSEDDDEEELNLSREELHNMLRLHKYKKLHQNKYSKDKEL
QQYQYYSAGLLSTYDPFYEQQRHLLGPKKKKFKEEKKLKAKLKKVKKKRRRDEELSSEESPRRHHHQTKVFAKFSHDAPPPGTKKKHLSI
EQLNARRRKVWLSIVKKELPKANKQKASARNLFLTNSRKLAHQCMKEVRRAALQAQKNCKETLPRARRLTKEMLLYWKKYEKVEKEHRKR
AEKEALEQRKLDEEMREAKRQQRKLNFLITQTELYAHFMSRKRDMGHDGIQEEILRKLEDSSTQRQIDIGGGVVVNITQEDYDSNHFKAQ
ALKNAENAYHIHQARTRSFDEDAKESRAAALRAANKSGTGFGESYSLANPSIRAGEDIPQPTIFNGKLKGYQLKGMNWLANLYEQGINGI
LADEMGLGKTVQSIALLAHLAERENIWGPFLIISPASTLNNWHQEFTRFVPKFKVLPYWGNPHDRKVIRRFWSQKTLYTQDAPFHVVITS
YQLVVQDVKYFQRVKWQYMVLDEAQALKSSSSVRWKILLQFQCRNRLLLTGTPIQNTMAELWALLHFIMPTLFDSHEEFNEWFSKDIESH
AENKSAIDENQLSRLHMILKPFMLRRIKKDVENELSDKIEILMYCQLTSRQKLLYQALKNKISIEDLLQSSMGSTQQAQNTTSSLMNLVM
QFRKVCNHPELFERQETWSPFHISLKPYHISKFIYRHGQIRVFNHSRDRWLRVLSPFAPDYIQRSLFHRKGINEESCFSFLRFIDISPAE
MANLMLQGLLARWLALFLSLKASYRLHQLRSWGAPEGESHQRYLRNKDFLLGVNFPLSFPNLCSCPLLKSLVFSSHCKAVSGYSDQVVHQ
RRSATSSLRRCLLTELPSFLCVASPRVTAVPLDSYCNDRSAEYERRVLKEGGSLAAKQCLLNGAPELAADWLNRRSQFFPEPAGGLWSIR
PQNGWSFIRIPALRSRSGRSIINGNWAIDRPGKYEGGGTMFTYKRPNEISSTAGESFLAEGPTNEILDVYMIHQQPNPGVHYEYVIMGTN
AISPQVPPHRRPGEPFNGQMVTEGRSQEEGEQKGRNEEKEDLRGEAPEMFTSESAQTFPVRHPDRFSPHRPDNLVPPAPQPPRRSRDHNW
KQLGTTECSTTCGKGSQYPIFRCVHRSTHEEAPESYCDSSMKPTPEEEPCNIFPCPAFWDIGEWSECSKTCGLGMQHRQVLCRQVYANRS
LTVQPYRCQHLEKPETTSTCQLKICSEWQIRTDWTSCSVPCGVGQRTRDVKCVSNIGDVVDDEECNMKLRPNDIENCDMGPCAKSWFLTE
WSERCSAECGAGVRTRSVVCMTNHVSSLPLEGCGNNRPAEATPCDNGPCTGKVEWFAGSWSQCSIECGSGTQQREVICVRKNADTFEVLD
PSECSFLEKPPSQQSCHLKPCGAKWFSTEWSMCSKSCQGGFRVREVRCLSDDMTLSNLCDPQLKPEERESCNPQDCVPEVDENCKDKYYN

--------------------------------------------------------------
>39771_39771_2_INO80-THSD4_INO80_chr15_41313098_ENST00000401393_THSD4_chr15_72020888_ENST00000355327_length(transcript)=11207nt_BP=3498nt
GGTCCCAGGAGCCGCGGAGGGAGCGAGCTAGGAGCGTCCACGCCCAACGCAGTCACCGTCCCACGGCCTCAGAGAGCGAACCGCGGCTCC
ACCGTCGGCGGGGCGACCCCCCCCTCCGGACCCCGCCCGCACCCCGCCCCCCCTCCGCCGCCGTCGCGGCGGCGGGGCCAGGCGGCCCGA
GCCGTGCAGTCGGAGGTCCTTGTGCATGAAGACAGATTTGTTCTATGGCCTCGGAGTTGGGTGCCAGGGATGATGGAGGCTGCACTGAGC
TGGCAAAGCCCCTCTATCTTCAGTACTTGGAGAGGGCCCTCCGGTTGGACCATTTTCTGCGACAAACGTCAGCTATCTTCAATAGGAATA
TTTCTAGTGATGACAGTGAAGATGGACTGGATGACAGTAATCCATTATTGCCCCAGTCTGGGGATCCCTTAATACAAGTTAAGGAAGAAC
CTCCAAATTCATTGCTTGGTGAAACTTCTGGAGCAGGCAGTTCTGGAATGTTAAACACATATTCTCTGAATGGAGTTCTACAGTCAGAAT
CAAAATGTGATAAGGGGAATTTATATAATTTCTCTAAGCTGAAGAAAAGCAGAAAGTGGCTAAAGAGCATTCTGCTAAGTGATGAATCCA
GCGAGGCTGATTCTCAGAGTGAAGACGATGATGAAGAAGAACTCAATCTCAGCAGAGAAGAACTTCACAACATGCTTCGACTACACAAAT
ATAAGAAACTTCACCAAAATAAGTATAGTAAAGACAAGGAGTTGCAGCAATATCAGTACTACAGTGCAGGCCTGCTCTCCACATATGACC
CTTTCTATGAGCAACAACGGCACCTACTTGGACCCAAGAAAAAGAAATTTAAGGAGGAAAAGAAACTTAAAGCTAAGTTGAAAAAAGTGA
AGAAAAAAAGACGAAGAGATGAAGAACTTTCCTCTGAAGAATCCCCTCGTCGCCATCACCACCAGACCAAAGTCTTTGCCAAGTTTTCTC
ACGATGCACCTCCCCCTGGCACTAAGAAAAAGCACTTATCCATTGAGCAGCTGAATGCTCGTCGCAGGAAAGTATGGCTCAGCATTGTGA
AAAAGGAACTACCAAAGGCAAATAAGCAGAAAGCTTCAGCTCGTAACCTGTTTCTCACCAATAGCCGAAAGCTTGCTCACCAGTGCATGA
AGGAGGTGCGTCGAGCTGCCTTGCAGGCCCAGAAGAACTGTAAGGAAACCTTGCCTCGTGCCCGCCGCCTCACCAAGGAGATGCTTCTGT
ACTGGAAGAAATATGAGAAAGTAGAGAAGGAGCACCGCAAGAGAGCAGAGAAGGAAGCTTTGGAGCAGCGGAAGTTGGATGAGGAAATGC
GGGAGGCCAAGAGGCAACAGCGAAAACTCAACTTCTTAATTACCCAGACAGAGTTGTATGCCCATTTCATGAGTCGCAAACGAGATATGG
GTCATGATGGTATCCAGGAAGAAATCCTAAGGAAACTGGAAGACAGTTCTACCCAGAGACAAATCGATATAGGTGGAGGAGTGGTAGTTA
ACATCACACAGGAGGATTATGATAGTAACCATTTTAAAGCCCAGGCCCTGAAGAATGCTGAAAATGCTTACCATATTCACCAAGCTCGGA
CAAGGTCATTTGATGAAGATGCAAAAGAAAGTCGAGCAGCTGCCCTACGGGCAGCAAACAAGTCTGGCACTGGGTTTGGGGAGAGTTATA
GCCTGGCTAACCCATCTATCCGGGCTGGTGAGGATATTCCACAGCCCACAATTTTTAATGGCAAATTGAAAGGTTATCAACTGAAAGGCA
TGAATTGGTTGGCCAATCTATATGAACAGGGTATTAATGGCATTCTTGCTGATGAAATGGGCCTTGGTAAAACAGTACAGAGCATTGCCC
TTCTGGCCCATCTGGCTGAGAGAGAGAACATTTGGGGACCTTTCTTAATAATTTCACCTGCGTCTACACTTAACAATTGGCACCAGGAGT
TTACTAGATTTGTTCCTAAATTTAAGGTGCTACCATATTGGGGAAATCCTCATGATAGAAAAGTCATCAGAAGGTTCTGGAGTCAGAAGA
CCCTCTACACTCAGGATGCCCCCTTCCATGTGGTTATTACCAGCTATCAGCTGGTGGTGCAGGATGTAAAGTATTTCCAGCGGGTCAAGT
GGCAATACATGGTACTGGATGAGGCTCAGGCGCTCAAGAGTAGTTCCAGTGTTCGTTGGAAGATCCTCTTACAGTTCCAGTGTCGGAATC
GGCTTTTGCTAACCGGGACCCCAATTCAGAACACCATGGCAGAGCTTTGGGCTCTGCTGCATTTCATTATGCCAACATTATTTGATTCAC
ATGAGGAATTTAATGAATGGTTTTCCAAGGACATTGAGAGCCATGCCGAAAACAAATCTGCTATTGATGAGAATCAACTTTCTCGCTTAC
ACATGATTTTGAAGCCATTTATGCTGAGGAGAATCAAGAAAGATGTGGAAAATGAATTATCTGACAAGATTGAGATTCTAATGTATTGCC
AACTGACCAGCCGACAGAAGCTGCTATATCAGGCACTAAAGAACAAAATTTCCATTGAGGATTTATTGCAGTCTTCTATGGGCTCTACCC
AACAAGCACAGAACACCACCAGCAGCCTCATGAATCTGGTCATGCAGTTTAGGAAGGTGTGTAATCACCCGGAGTTATTTGAACGGCAAG
AAACTTGGTCTCCATTTCATATTTCCCTAAAGCCATACCACATTTCAAAGTTTATCTACCGTCATGGACAGATCAGGGTCTTCAATCATT
CACGAGACAGGTGGTTAAGGGTTCTTTCTCCATTTGCACCAGACTATATCCAACGGTCTCTCTTTCACAGAAAAGGTATTAATGAAGAAA
GCTGTTTCTCTTTCCTTCGCTTTATTGATATATCTCCAGCAGAAATGGCAAACCTTATGCTTCAGGGACTTTTGGCCAGATGGTTAGCTC
TTTTCCTGTCTCTGAAAGCCTCCTACAGGCTCCATCAGCTACGCTCCTGGGGAGCGCCAGAAGGGGAGAGCCACCAGAGATACCTGAGGA
ACAAGGATTTCCTTCTTGGGGTTAATTTTCCACTCTCCTTTCCAAACCTTTGCAGCTGCCCTTTGTTAAAGTCTCTTGTTTTCAGCAGCC
ACTGTAAAGCAGTGAGTGGCTACTCAGACCAGGTTGTCCATCAGCGGAGATCAGCTACCTCCTCGCTGCGTCGCTGCCTGCTCACTGAGC
TGCCATCTTTTTTGTGTGTGGCCAGTCCACGAGTTACCGCAGTGCCATTGGATTCTTACTGCAATGACCGAAGTGCAGAATATGAAAGGC
GAGTTCTGAAGGAAGGAGGGAGTCTGGCAGCCAAGCAGTGTTTGTTGAATGGGGCCCCTGAACTGGCTGCAGACTGGCTAAATAGACGAT
CACAGTTCTTCCCAGAGCCAGCTGGAGGTCTGTGGAGCATCAGACCTCAGAATGGCTGGTCTTTCATCAGGATTCCAGCCCTGAGAAGTC
GTTCTGGACGCTCCATCATCAATGGGAACTGGGCAATTGATCGACCAGGAAAATACGAGGGCGGAGGGACCATGTTCACCTACAAGCGTC
CAAATGAGATTTCGAGCACTGCCGGAGAGTCCTTTTTGGCGGAAGGTCCCACCAACGAGATCTTGGATGTCTACATGATACACCAGCAGC
CAAACCCAGGCGTGCACTACGAGTACGTGATCATGGGGACCAACGCCATCAGCCCCCAGGTGCCACCCCACAGGAGACCAGGGGAACCCT
TCAATGGCCAGATGGTGACAGAAGGCAGGAGCCAGGAGGAGGGAGAACAGAAAGGGAGGAACGAGGAGAAGGAAGACTTGCGTGGGGAGG
CCCCTGAGATGTTCACCTCAGAATCGGCACAGACCTTCCCAGTCAGGCATCCAGACAGATTTTCTCCCCATCGACCGGACAACTTGGTGC
CACCAGCACCGCAGCCCCCACGGCGCAGCCGGGATCACAACTGGAAGCAGCTTGGGACAACAGAATGTTCCACGACCTGTGGGAAAGGAT
CGCAGTACCCTATTTTCCGCTGTGTGCACAGAAGCACTCATGAAGAGGCTCCTGAGAGTTACTGTGACTCCAGCATGAAGCCGACCCCCG
AGGAGGAGCCCTGCAACATCTTCCCTTGCCCAGCCTTCTGGGACATCGGGGAGTGGTCTGAGTGCAGCAAGACCTGTGGCCTGGGCATGC
AGCACCGCCAGGTTCTGTGCCGCCAGGTGTACGCCAACCGCAGCCTGACGGTGCAGCCCTACCGCTGCCAGCACCTGGAGAAACCTGAGA
CCACCAGCACCTGCCAACTCAAGATCTGCAGCGAGTGGCAGATCCGGACCGACTGGACCTCGTGCTCGGTGCCCTGCGGCGTGGGACAGA
GGACCCGTGATGTGAAGTGTGTGAGCAACATTGGGGATGTGGTTGACGATGAGGAATGCAACATGAAGCTCCGGCCGAATGACATTGAGA
ACTGCGACATGGGACCCTGTGCCAAGAGCTGGTTCCTCACCGAGTGGAGCGAAAGGTGCTCAGCGGAGTGTGGGGCCGGAGTGCGGACAC
GCTCGGTGGTGTGCATGACCAACCATGTCAGCAGCCTGCCCCTGGAGGGCTGTGGGAACAACCGGCCGGCAGAGGCCACCCCATGTGACA
ACGGACCCTGCACGGGCAAGGTGGAGTGGTTTGCCGGGAGCTGGAGTCAGTGTTCCATCGAGTGTGGGAGCGGGACGCAACAGAGGGAGG
TGATTTGTGTTAGAAAGAATGCAGACACCTTTGAAGTGTTGGACCCCTCTGAATGTTCTTTCCTGGAGAAACCCCCCAGCCAGCAATCCT
GCCACCTCAAGCCTTGCGGAGCCAAATGGTTTAGCACCGAATGGAGCATGTGTTCCAAGAGCTGCCAGGGTGGCTTTCGGGTCCGGGAAG
TGCGGTGTCTGTCTGATGACATGACTCTAAGTAACCTCTGTGACCCTCAGTTGAAACCAGAAGAGAGAGAATCTTGTAACCCTCAGGACT
GTGTCCCTGAAGTTGATGAAAACTGCAAGGACAAGTACTACAACTGCAACGTGGTGGTCCAGGCAAGACTCTGTGTCTACAACTACTACA
AGACCGCCTGCTGTGCCTCCTGCACCCGTGTGGCCAACAGGCAGACGGGCTTCCTGGGGAGCAGATAACACTCCTGCACCCCCATCAGTA
GGGCAGCATCACTGCCTTCCCGGGGGCTTCAGCAGTGCGCCTGGCTGGCTGCTGCTCCACCACGGGCCCCCTGGCCCAGGCGCTGCCAAC
CAACTTAGTCACCACCCCTGCCTCCGGTGAATGCACCCCGTGGTACCCAGGGGCTTTTTACACAAGATGTTTGAAAGCCACAGTCAGTCC
TTTAAGCATCACCATGTACTGATGATCCCCTCCTTGGACCTGGCATCTGCTAATGGTGCCCTTTGAAAGTCAAGCAGTGGGAAGTACATG
GAGCTCTCAGCCCTGCTCCCATCTGGCACCTTCAAGTCAGCAGATGGGCCACTGACTGAGCACTGCCCCGTCCCTGGTGCTACTGGTCTT
TCTAAACTTAGCACCCTGGAGAGTCCAAGGAGGCAGCGCCCCCAACCCAGCGCCCCACTAAGCCTTGCTGACACGCGTGCATCCCTCTGT
GACCTCAGCCCAGATGTGCCTGTTTTCATTCTCAAAGACATTAGACTGTTTTCCTGCCCTATGACACAGATAGCTCACATGAATATTGTG
CTTTATTTAGCAGGTGTACTCACAGATACTAGCTCCTTAGCAGCTCACAACATCCCAGAATGGGAGGCAGGGGGTGACTCATTATCCCCA
TTTTACTGACAGGGAAACTGAGGCTCAACTTAAGTAATTGACCTGCCAGGTATATTCACCCATCCAGTGGAAGAGCTGAGTCCCCGCCCC
AGTCATCTACCAGTATCCAGCCTGGGGCCTGTACTTAGATGTGAAAGGTGCTGCTTCATTTCTGACCAAGAGACTGAGAAGTTTCCCAGA
ATGCAAACAAAGCCCAGGCCCCTGAAATCTTTCCGGTCAAGCCTTTATCCCAGCACTCAGTTGTTTTGGATGTCTGTTCCTACTTGCCCT
TACCCCCAAAGTTACAGATCCTAGTTACAGGACTCTGCCAGCTTTGTTAAACTGTCCGTGAGACAAGAAAGCCATTGGGGAAACCAGGTG
ATTGCCTGAAATTCTTACTCCGTTCCAAGTGCTGTTCCTCCCAGGAAATCAAAGGCCAGGGTCCTTATGGCCGTGGAGCCTTCCCGACCA
CAGAGCCAACTTGTGAAGCACACAGCTCTGCAGCCTGGGCTCTGCCCTGCCTCAGCCGCCTCCCCCACGCTCTTCACCACGTTCCTGGAG
AGTCCGGCCAACCTGTCCCAGCCGAAACACTGCTGTATTAGAAAAAGTCTCTTTCTGGTCTTTCTGGTTTTGTTTATGAATTTCCCTCTG
TGGCCACAAATTCCTCCCCTCCCCCATGACTCACAGTCCATATGGCCCACCCCCAGACTTGAGCACCAAGCTCTGCATTAATGCAGTTGG
CCTGCGACAAGGAGCTGTGGACCCTTCCCCATCTCTTCCAATTCACTTTCCCCAACTATCCAGTTCCAGAGGCCGCAGGCCTGGAAGGAT
GCAGTGCATATTGAAAGGTGGACCCTCTGAAAACAGTTAAGAGGAATATATGTATGTTTTACCCATTAAGAAAAAATGGCAAGCTAAACA
AATGTTAAACTTACAGAAAATTTGTCTTATGGTCCTGAGCATATTTCCCTTTTAGAGCAAGCCTGGATTCTTAGCAAAGTGTTTCCCCCA
TTTGCTCTTTTAGCTGACAAATCTGCCACTGTGATGATGGTTTGCAGCTTTTGGAAGCAGTATGGCAACCTGGCCTGACATGCTCTTTAG
GCTTCCACTAACCTGGGGCTTTCAGAAATTCTATTTGGCCTTTCTGTGGGTAGCTTTCCAGCTTCTCTTCTAGGGAGCCCCAGGCATCAT
TTCCCAAAAGCATCCCCATCTCCTGATTCTCTTGGAACTCCTACAGATAAGCATCCTGGCAGAGGCCCAGGCTCCCAAACCGACAAAGTG
AAAAGAGACCAGAGAGGCCAAGCATATTGACTGGTGCTGTTCAGGGCCTGCTCTTTTCCACTCACCACTTGTTTTGCTGCTTGTCACGAG
GAGAGTTGTTCCTGTATGTGGCTGCTCTCAGATCTTTCCAAGCAAGCCAGTCATTTGAAGAGGTTTTCTTTTCATGCTGGAGGGCAGGCT
AAGATCAATGAGTGGAAGAGAGAAAGGCTGTTTTAGCTCAAGTTAAAGGAACACCTTCTAGCCATCAAAGCCGCCCAACAGAGGCAAGGG
CCACCACACATGAGAGAGCGCTCTGTCCTTAAAGGGAATTCTCTGTTGAGTGGGAGGTGAACACCCTGGTTCTTCCAACTCAGGAATTCT
CGTGGCTGGGCTGGGTCAGTGATGGCTTTGTCTCTTTATGTCTAAAGTGCCCTATGGCTGCTGAAGGTTACCTAACCATTCTTTAAAAGG
AGAATGACCCTCCATGGGAATGGCCAGCCTGCCAACTGTGCAATTGAAGAAGACCCGATGGATCAACCCCATGTCTCCCTTGGGGAGAAA
GTGCATAAACCAGGGGTCTCTTTTTTTTTTTCAACAAACCATTGAGCTGTTCTTGGAGTTCATCTCTGGAGAGGTTATACATTATTAGAA
GTTTGATTATTATTATAGTTTGATCAATTTATTTGTCTTAGAGATCCAATTTTTACTAATTCCCTAGTTTTTTATTTCAGCATCTGAATG
TCTTTCTCCCTAGCACAGTGCATACAATCAGGGCCTTGGGTATTTCCAGTGATAACTTTCCTTGGAGAGGATCTAAGAAAAGCCCAGATT
TCGGTAGCCATCTCCCTCCAAATATGTCTCTTTCTGCTTTCTTAGTGCCCATTATTTCCCCCTCTCCTTTCTTCTGTCACTGCCATCTCC
TTCTTGGTCTTCCCATTGTTCTTTAACTGGCCGTAATGTGGAATTGATATTTACATTTTGATACGGTTTTTTTCTTGGCCTGTGTACGGG
ATTGCCTCATTTCCTGCTCTGAATTTTAAAATTAGATATTAAAGCTGTCATATGGTTTCCTCACAAAAGTCAACAAAGTCCAAACAAAAA
TAGTTTGCCGTTTTACTTTCATCCATTGAAAAAGGAAATTGTGCCTCTTGCAGCCTAGGCAAAGGACATTTAGTACTATCGATTCTTTCC
ACCCTCACGATGACTTGCGGTTCTCTCTGTAGAAAAGGGATGGCCTAAGAAATACAACTAAAAAAACAAACAAAAACACCAAAAGAAAAA
AAAAAGCCATTTAAAGCCAGCCACTAGAGGGAGTCAGTTCAGTTCCGTAAAGGTATGCTCAGTGCCCGCTGCCTGCAAGCTGTTGGGGAC
CCCAGGGAGGGCAAGGCAGCCTGTCCCCGCCCCCAGGGAACTAGAACATGACAAGAATTCTCCGCACTGTGCCTACCTGTCCCTTTACCT
TACCTCTCTGGCCCAGAGTTCTTGGAGGGTTTTTTCTTTATTTTCTTATGTACTCATCTACTTATTCTCAAAGTATTTAGCATTCAACAC
TCTTTTTGCTTTAAAAAGAATGGCCTTACAAAGGGACAGAAAAGAGAAGACACGAGCTTGGTGTATTTTCATCAAGTTATGTGGCAGAGA
AATCCAGATATTACCAGGACCTGTCTAAACAAATGTTGTGGGTTTTCTTTTCATTCGGATAGCCACTTTATAGTTGGAATATCAATTCTA
ATGAGGAGGAAGACATAAATATAAGTGGTAAAAAGAAACATGACTTCCCTTAAAACAGGCTGGATAATCTATATCAGCCTTGTGGGTGGA
GACTAGTATTTGATCCTTGCCATATAAAACATTTTAATATGGTTTACATGGGAAAATATCGATGGCTTCCTCACAAAATGTATGGGTGAC
GTGAAGTTGAAGAGCCAATGGCTTGGGTGACACGTGCTGGATCCAAAAAGATCAGGGAGACTAGAATAAAACTTGGATGTTAAAAATTCA
CCAGGAATCCACATAAAGTACTATATTTGGGCTAAAATGAAAAACTAAATACAAGGTGGGAGAGAGGCAAGAATTTCAGTTGACTAAGCT
CAGTGTGAGTCAAAGTGGGATGGAACCATGCAAAAACAAAACCCACAGACATGCAGGCTACGTGAGGAGAAAACAGTGGTGAGGATCACA
TCACATTGTGTTTGCATTTGCCGGAACCATACTTTAAGAAGAAAACCGATCATCTATAATAACATCAGTTTATCAATGCCCCGTCCTGAT
GAAGTGTGCAGACTCTCAGAAACAGCAGGAAGGACTTCATGAGAACCCTCAGGCTGGAGAAGGCACTAGGGCACAAGGAGAGCTCTCCTA
GGACCAGGACCAAGAAGCTACAGGCAGGCACAGTTTAGCTCCTGCAGAGACCCAGCTTTTCACAAGTTGGAGCCTTCCAGAGATAGAGGG
ACTGTGGTAGGTGGTGACCCACCCATCACTGGAGGTGGAAGCAGAGGCCGTTTGCCAGGGATGCTGGAGAGGGGATTCAAGCATCTGGCT
GGGCAACGTGATGCTCAGGGCCGTCTCCACTCAGGGCTTAGGGGAGTCTGTGAGTAGAAGAGCTTTAGGTGATTTGTTTGGTGGGGGAAG
GCAAGTACACAGCTATGCACTTTCCGTTTCTGACTTTTGCCACCCTGTCAGCCATGGGGAGCCCACTGTGGGACTGAAACCCTGAGCTGA
ATGCGGCCTCATGTCTCAGAGAAACACTGGCAAGTTGGTCAGAGCCGCCGTCTGCATCGAGGCGTAGCTGAGCGGCAGGATGGGGGGCTG
CCTGCCCAGGGTCTCTCACCGTGGTGTAAGCAGAGCCATGGCTTGCCTAGGACCCTATAGATACCATCACTCTTTCTCAGCTCGACTGGA
GTTTCTGCACCTTTGCAGGGGCAAAGTAACTCCCTGCACCCTGAACCACCCCCCATTCCTGTTCATTTCAGCAGATAATGATGGAGGGGG
GGGGTGTCCATCGTGCTGAGGGTGTGACCGCAAGAGGGTGAAAACTTCCAGCCAACTTTCTCAGTCCTTTCTCTTGCGAGAGGGAAGCCA
CCTGCTATACAAACTAATACCCCCTGCCTTGACCCCTTCCCCACGACTCAGTTGACAGAAGGATATACTTTGTTATAACTTATTATTTTG
TTCTCTGTAAATACAAGATGTTTATAGGAAATATGTATTCTGAACTCTATCTGCAGAATGAGTCACTACACCAAAATAGTTCTATTATTT
AGAATGTGTTAATTTTAAAGGGACCTGATAGGTATTTATTTACATATGCGATCCACATTTGTGTGAAAGCATGTGATCATACTAACCCAG
CCTCCTGGAATGTCGCTGTACGATGATTGATGTCTTTTTCTCAGTCCATAGTTACAATTGTTTAGTATGCTAATCAGTCCAGTTCCCTGA
GGTTTAAGATCAAATATAAATTACTCTGCTTTTCGACTCATTCAGGTAGCATTGTACCTGAACCTGATTGCTACTTTTTCATCTTAAATA
TTATATTTCCTCATCTAATCTGCCTTCCCCTCATCCACAGACATTTGGAGAAGGAAATGGGAGGGTGTCTGTTATCCCTTTCTCTTTGCT
TTGTCCCCGTTGTTAGACTGGCAGCGTCAGTTGCTCGGTGGGCTTGGTTAGAGCCGTGGGTGAGGCAGGTGGCTGGCGGGGACAGGGAGA
GGCTGAGAGGGAAGTGGTGGCATTTACTGCTCTGACACTTCCACTGTCCCTGCTGGGGATGCTGGGGCCAAGGCCTGTGGGGCCTGTGAA
CTGCACAGCCAGGAGCAAGGAACCCACTAAATACTCCGTCACCTCCATGTCCCCTCTACAGTGTTAAATTATTACATAAGCAGGTGAAAG
GTAGAAGGCGAATTATGTGAGTAAATATGGTCTGTTTTCTCTTCAGCAAAAATGACTATTTTTGTGTGTGACTAATTTATTTTTATTATT
GTAAAGATACAATAAACCGGTTGAAATATCTGCTTTGTTGACAAGCGTGTGCTTTCTCTGGCCTTATTCGCGTTCTGTTCTCCTGCAAAT

>39771_39771_2_INO80-THSD4_INO80_chr15_41313098_ENST00000401393_THSD4_chr15_72020888_ENST00000355327_length(amino acids)=1657AA_BP=1091
MASELGARDDGGCTELAKPLYLQYLERALRLDHFLRQTSAIFNRNISSDDSEDGLDDSNPLLPQSGDPLIQVKEEPPNSLLGETSGAGSS
GMLNTYSLNGVLQSESKCDKGNLYNFSKLKKSRKWLKSILLSDESSEADSQSEDDDEEELNLSREELHNMLRLHKYKKLHQNKYSKDKEL
QQYQYYSAGLLSTYDPFYEQQRHLLGPKKKKFKEEKKLKAKLKKVKKKRRRDEELSSEESPRRHHHQTKVFAKFSHDAPPPGTKKKHLSI
EQLNARRRKVWLSIVKKELPKANKQKASARNLFLTNSRKLAHQCMKEVRRAALQAQKNCKETLPRARRLTKEMLLYWKKYEKVEKEHRKR
AEKEALEQRKLDEEMREAKRQQRKLNFLITQTELYAHFMSRKRDMGHDGIQEEILRKLEDSSTQRQIDIGGGVVVNITQEDYDSNHFKAQ
ALKNAENAYHIHQARTRSFDEDAKESRAAALRAANKSGTGFGESYSLANPSIRAGEDIPQPTIFNGKLKGYQLKGMNWLANLYEQGINGI
LADEMGLGKTVQSIALLAHLAERENIWGPFLIISPASTLNNWHQEFTRFVPKFKVLPYWGNPHDRKVIRRFWSQKTLYTQDAPFHVVITS
YQLVVQDVKYFQRVKWQYMVLDEAQALKSSSSVRWKILLQFQCRNRLLLTGTPIQNTMAELWALLHFIMPTLFDSHEEFNEWFSKDIESH
AENKSAIDENQLSRLHMILKPFMLRRIKKDVENELSDKIEILMYCQLTSRQKLLYQALKNKISIEDLLQSSMGSTQQAQNTTSSLMNLVM
QFRKVCNHPELFERQETWSPFHISLKPYHISKFIYRHGQIRVFNHSRDRWLRVLSPFAPDYIQRSLFHRKGINEESCFSFLRFIDISPAE
MANLMLQGLLARWLALFLSLKASYRLHQLRSWGAPEGESHQRYLRNKDFLLGVNFPLSFPNLCSCPLLKSLVFSSHCKAVSGYSDQVVHQ
RRSATSSLRRCLLTELPSFLCVASPRVTAVPLDSYCNDRSAEYERRVLKEGGSLAAKQCLLNGAPELAADWLNRRSQFFPEPAGGLWSIR
PQNGWSFIRIPALRSRSGRSIINGNWAIDRPGKYEGGGTMFTYKRPNEISSTAGESFLAEGPTNEILDVYMIHQQPNPGVHYEYVIMGTN
AISPQVPPHRRPGEPFNGQMVTEGRSQEEGEQKGRNEEKEDLRGEAPEMFTSESAQTFPVRHPDRFSPHRPDNLVPPAPQPPRRSRDHNW
KQLGTTECSTTCGKGSQYPIFRCVHRSTHEEAPESYCDSSMKPTPEEEPCNIFPCPAFWDIGEWSECSKTCGLGMQHRQVLCRQVYANRS
LTVQPYRCQHLEKPETTSTCQLKICSEWQIRTDWTSCSVPCGVGQRTRDVKCVSNIGDVVDDEECNMKLRPNDIENCDMGPCAKSWFLTE
WSERCSAECGAGVRTRSVVCMTNHVSSLPLEGCGNNRPAEATPCDNGPCTGKVEWFAGSWSQCSIECGSGTQQREVICVRKNADTFEVLD
PSECSFLEKPPSQQSCHLKPCGAKWFSTEWSMCSKSCQGGFRVREVRCLSDDMTLSNLCDPQLKPEERESCNPQDCVPEVDENCKDKYYN

--------------------------------------------------------------
>39771_39771_3_INO80-THSD4_INO80_chr15_41313098_ENST00000401393_THSD4_chr15_72020888_ENST00000357769_length(transcript)=6450nt_BP=3498nt
GGTCCCAGGAGCCGCGGAGGGAGCGAGCTAGGAGCGTCCACGCCCAACGCAGTCACCGTCCCACGGCCTCAGAGAGCGAACCGCGGCTCC
ACCGTCGGCGGGGCGACCCCCCCCTCCGGACCCCGCCCGCACCCCGCCCCCCCTCCGCCGCCGTCGCGGCGGCGGGGCCAGGCGGCCCGA
GCCGTGCAGTCGGAGGTCCTTGTGCATGAAGACAGATTTGTTCTATGGCCTCGGAGTTGGGTGCCAGGGATGATGGAGGCTGCACTGAGC
TGGCAAAGCCCCTCTATCTTCAGTACTTGGAGAGGGCCCTCCGGTTGGACCATTTTCTGCGACAAACGTCAGCTATCTTCAATAGGAATA
TTTCTAGTGATGACAGTGAAGATGGACTGGATGACAGTAATCCATTATTGCCCCAGTCTGGGGATCCCTTAATACAAGTTAAGGAAGAAC
CTCCAAATTCATTGCTTGGTGAAACTTCTGGAGCAGGCAGTTCTGGAATGTTAAACACATATTCTCTGAATGGAGTTCTACAGTCAGAAT
CAAAATGTGATAAGGGGAATTTATATAATTTCTCTAAGCTGAAGAAAAGCAGAAAGTGGCTAAAGAGCATTCTGCTAAGTGATGAATCCA
GCGAGGCTGATTCTCAGAGTGAAGACGATGATGAAGAAGAACTCAATCTCAGCAGAGAAGAACTTCACAACATGCTTCGACTACACAAAT
ATAAGAAACTTCACCAAAATAAGTATAGTAAAGACAAGGAGTTGCAGCAATATCAGTACTACAGTGCAGGCCTGCTCTCCACATATGACC
CTTTCTATGAGCAACAACGGCACCTACTTGGACCCAAGAAAAAGAAATTTAAGGAGGAAAAGAAACTTAAAGCTAAGTTGAAAAAAGTGA
AGAAAAAAAGACGAAGAGATGAAGAACTTTCCTCTGAAGAATCCCCTCGTCGCCATCACCACCAGACCAAAGTCTTTGCCAAGTTTTCTC
ACGATGCACCTCCCCCTGGCACTAAGAAAAAGCACTTATCCATTGAGCAGCTGAATGCTCGTCGCAGGAAAGTATGGCTCAGCATTGTGA
AAAAGGAACTACCAAAGGCAAATAAGCAGAAAGCTTCAGCTCGTAACCTGTTTCTCACCAATAGCCGAAAGCTTGCTCACCAGTGCATGA
AGGAGGTGCGTCGAGCTGCCTTGCAGGCCCAGAAGAACTGTAAGGAAACCTTGCCTCGTGCCCGCCGCCTCACCAAGGAGATGCTTCTGT
ACTGGAAGAAATATGAGAAAGTAGAGAAGGAGCACCGCAAGAGAGCAGAGAAGGAAGCTTTGGAGCAGCGGAAGTTGGATGAGGAAATGC
GGGAGGCCAAGAGGCAACAGCGAAAACTCAACTTCTTAATTACCCAGACAGAGTTGTATGCCCATTTCATGAGTCGCAAACGAGATATGG
GTCATGATGGTATCCAGGAAGAAATCCTAAGGAAACTGGAAGACAGTTCTACCCAGAGACAAATCGATATAGGTGGAGGAGTGGTAGTTA
ACATCACACAGGAGGATTATGATAGTAACCATTTTAAAGCCCAGGCCCTGAAGAATGCTGAAAATGCTTACCATATTCACCAAGCTCGGA
CAAGGTCATTTGATGAAGATGCAAAAGAAAGTCGAGCAGCTGCCCTACGGGCAGCAAACAAGTCTGGCACTGGGTTTGGGGAGAGTTATA
GCCTGGCTAACCCATCTATCCGGGCTGGTGAGGATATTCCACAGCCCACAATTTTTAATGGCAAATTGAAAGGTTATCAACTGAAAGGCA
TGAATTGGTTGGCCAATCTATATGAACAGGGTATTAATGGCATTCTTGCTGATGAAATGGGCCTTGGTAAAACAGTACAGAGCATTGCCC
TTCTGGCCCATCTGGCTGAGAGAGAGAACATTTGGGGACCTTTCTTAATAATTTCACCTGCGTCTACACTTAACAATTGGCACCAGGAGT
TTACTAGATTTGTTCCTAAATTTAAGGTGCTACCATATTGGGGAAATCCTCATGATAGAAAAGTCATCAGAAGGTTCTGGAGTCAGAAGA
CCCTCTACACTCAGGATGCCCCCTTCCATGTGGTTATTACCAGCTATCAGCTGGTGGTGCAGGATGTAAAGTATTTCCAGCGGGTCAAGT
GGCAATACATGGTACTGGATGAGGCTCAGGCGCTCAAGAGTAGTTCCAGTGTTCGTTGGAAGATCCTCTTACAGTTCCAGTGTCGGAATC
GGCTTTTGCTAACCGGGACCCCAATTCAGAACACCATGGCAGAGCTTTGGGCTCTGCTGCATTTCATTATGCCAACATTATTTGATTCAC
ATGAGGAATTTAATGAATGGTTTTCCAAGGACATTGAGAGCCATGCCGAAAACAAATCTGCTATTGATGAGAATCAACTTTCTCGCTTAC
ACATGATTTTGAAGCCATTTATGCTGAGGAGAATCAAGAAAGATGTGGAAAATGAATTATCTGACAAGATTGAGATTCTAATGTATTGCC
AACTGACCAGCCGACAGAAGCTGCTATATCAGGCACTAAAGAACAAAATTTCCATTGAGGATTTATTGCAGTCTTCTATGGGCTCTACCC
AACAAGCACAGAACACCACCAGCAGCCTCATGAATCTGGTCATGCAGTTTAGGAAGGTGTGTAATCACCCGGAGTTATTTGAACGGCAAG
AAACTTGGTCTCCATTTCATATTTCCCTAAAGCCATACCACATTTCAAAGTTTATCTACCGTCATGGACAGATCAGGGTCTTCAATCATT
CACGAGACAGGTGGTTAAGGGTTCTTTCTCCATTTGCACCAGACTATATCCAACGGTCTCTCTTTCACAGAAAAGGTATTAATGAAGAAA
GCTGTTTCTCTTTCCTTCGCTTTATTGATATATCTCCAGCAGAAATGGCAAACCTTATGCTTCAGGGACTTTTGGCCAGATGGTTAGCTC
TTTTCCTGTCTCTGAAAGCCTCCTACAGGCTCCATCAGCTACGCTCCTGGGGAGCGCCAGAAGGGGAGAGCCACCAGAGATACCTGAGGA
ACAAGGATTTCCTTCTTGGGGTTAATTTTCCACTCTCCTTTCCAAACCTTTGCAGCTGCCCTTTGTTAAAGTCTCTTGTTTTCAGCAGCC
ACTGTAAAGCAGTGAGTGGCTACTCAGACCAGGTTGTCCATCAGCGGAGATCAGCTACCTCCTCGCTGCGTCGCTGCCTGCTCACTGAGC
TGCCATCTTTTTTGTGTGTGGCCAGTCCACGAGTTACCGCAGTGCCATTGGATTCTTACTGCAATGACCGAAGTGCAGAATATGAAAGGC
GAGTTCTGAAGGAAGGAGGGAGTCTGGCAGCCAAGCAGTGTTTGTTGAATGGGGCCCCTGAACTGGCTGCAGACTGGCTAAATAGACGAT
CACAGTTCTTCCCAGAGCCAGCTGGAGGTCTGTGGAGCATCAGACCTCAGAATGGCTGGTCTTTCATCAGGATTCCAGCCCTGAGAAGTC
GTTCTGGACGCTCCATCATCAATGGGAACTGGGCAATTGATCGACCAGGAAAATACGAGGGCGGAGGGACCATGTTCACCTACAAGCGTC
CAAATGAGATTTCGAGCACTGCCGGAGAGTCCTTTTTGGCGGAAGGTCCCACCAACGAGATCTTGGATGTCTACATGATACACCAGCAGC
CAAACCCAGGCGTGCACTACGAGTACGTGATCATGGGGACCAACGCCATCAGCCCCCAGGTGCCACCCCACAGGAGACCAGGGGAACCCT
TCAATGGCCAGATGGTGACAGAAGGCAGGAGCCAGGAGGAGGGAGAACAGAAAGGGAGGAACGAGGAGAAGGAAGACTTGCGTGGGGAGG
CCCCTGAGATGTTCACCTCAGAATCGGCACAGACCTTCCCAGTCAGGCATCCAGACAGATTTTCTCCCCATCGACCGGACAACTTGGTGC
CACCAGCACCGCAGCCCCCACGGCGCAGCCGGGATCACAACTGGAAGCAGCTTGGGACAACAGAATGTTCCACGACCTGTGGGAAAGGAT
CGCAGTACCCTATTTTCCGCTGTGTGCACAGAAGCACTCATGAAGAGGCTCCTGAGAGTTACTGTGACTCCAGCATGAAGCCGACCCCCG
AGGAGGAGCCCTGCAACATCTTCCCTTGCCCAGCCTTCTGGGACATCGGGGAGTGGTCTGAGTGCAGCAAGACCTGTGGCCTGGGCATGC
AGCACCGCCAGGTTCTGTGCCGCCAGGTGTACGCCAACCGCAGCCTGACGGTGCAGCCCTACCGCTGCCAGCACCTGGAGAAACCTGAGA
CCACCAGCACCTGCCAACTCAAGATCTGCAGCGAGTGGCAGATCCGGACCGACTGGACCTCGTGCTCGGTGCCCTGCGGCGTGGGACAGA
GGACCCGTGATGTGAAGTGTGTGAGCAACATTGGGGATGTGGTTGACGATGAGGAATGCAACATGAAGCTCCGGCCGAATGACATTGAGA
ACTGCGACATGGGACCCTGTGCCAAGAGCTGGTTCCTCACCGAGTGGAGCGAAAGGTGCTCAGCGGAGTGTGGGGCCGGAGTGCGGACAC
GCTCGGTGGTGTGCATGACCAACCATGTCAGCAGCCTGCCCCTGGAGGGCTGTGGGAACAACCGGCCGGCAGAGGCCACCCCATGTGACA
ACGGACCCTGCACGGGCAAGGTGGAGTGGTTTGCCGGGAGCTGGAGTCAGTGTTCCATCGAGTGTGGGAGCGGGACGCAACAGAGGGAGG
TGATTTGTGTTAGAAAGAATGCAGACACCTTTGAAGTGTTGGACCCCTCTGAATGTTCTTTCCTGGAGAAACCCCCCAGCCAGCAATCCT
GCCACCTCAAGCCTTGCGGAGCCAAATGGTTTAGCACCGAATGGAGCATGTGTTCCAAGAGCTGCCAGGGTGGCTTTCGGGTCCGGGAAG
TGCGGTGTCTGTCTGATGACATGACTCTAAGTAACCTCTGTGACCCTCAGTTGAAACCAGAAGAGAGAGAATCTTGTAACCCTCAGGACT
GTGTCCCTGAAGTTGATGAAAACTGCAAGGACAAGTACTACAACTGCAACGTGGTGGTCCAGGCAAGACTCTGTGTCTACAACTACTACA
AGACCGCCTGCTGTGCCTCCTGCACCCGTGTGGCCAACAGGCAGACGGGCTTCCTGGGGAGCAGATAACACTCCTGCACCCCCATCAGTA
GGGCAGCATCACTGCCTTCCCGGGGGCTTCAGCAGTGCGCCTGGCTGGCTGCTGCTCCACCACGGGCCCCCTGGCCCAGGCGCTGCCAAC
CAACTTAGTCACCACCCCTGCCTCCGGTGAATGCACCCCGTGGTACCCAGGGGCTTTTTACACAAGATGTTTGAAAGCCACAGTCAGTCC
TTTAAGCATCACCATGTACTGATGATCCCCTCCTTGGACCTGGCATCTGCTAATGGTGCCCTTTGAAAGTCAAGCAGTGGGAAGTACATG
GAGCTCTCAGCCCTGCTCCCATCTGGCACCTTCAAGTCAGCAGATGGGCCACTGACTGAGCACTGCCCCGTCCCTGGTGCTACTGGTCTT
TCTAAACTTAGCACCCTGGAGAGTCCAAGGAGGCAGCGCCCCCAACCCAGCGCCCCACTAAGCCTTGCTGACACGCGTGCATCCCTCTGT
GACCTCAGCCCAGATGTGCCTGTTTTCATTCTCAAAGACATTAGACTGTTTTCCTGCCCTATGACACAGATAGCTCACATGAATATTGTG
CTTTATTTAGCAGGTGTACTCACAGATACTAGCTCCTTAGCAGCTCACAACATCCCAGAATGGGAGGCAGGGGGTGACTCATTATCCCCA
TTTTACTGACAGGGAAACTGAGGCTCAACTTAAGTAATTGACCTGCCAGGTATATTCACCCATCCAGTGGAAGAGCTGAGTCCCCGCCCC
AGTCATCTACCAGTATCCAGCCTGGGGCCTGTACTTAGATGTGAAAGGTGCTGCTTCATTTCTGACCAAGAGACTGAGAAGTTTCCCAGA
ATGCAAACAAAGCCCAGGCCCCTGAAATCTTTCCGGTCAAGCCTTTATCCCAGCACTCAGTTGTTTTGGATGTCTGTTCCTACTTGCCCT
TACCCCCAAAGTTACAGATCCTAGTTACAGGACTCTGCCAGCTTTGTTAAACTGTCCGTGAGACAAGAAAGCCATTGGGGAAACCAGGTG
ATTGCCTGAAATTCTTACTCCGTTCCAAGTGCTGTTCCTCCCAGGAAATCAAAGGCCAGGGTCCTTATGGCCGTGGAGCCTTCCCGACCA
CAGAGCCAACTTGTGAAGCACACAGCTCTGCAGCCTGGGCTCTGCCCTGCCTCAGCCGCCTCCCCCACGCTCTTCACCACGTTCCTGGAG

>39771_39771_3_INO80-THSD4_INO80_chr15_41313098_ENST00000401393_THSD4_chr15_72020888_ENST00000357769_length(amino acids)=1657AA_BP=1091
MASELGARDDGGCTELAKPLYLQYLERALRLDHFLRQTSAIFNRNISSDDSEDGLDDSNPLLPQSGDPLIQVKEEPPNSLLGETSGAGSS
GMLNTYSLNGVLQSESKCDKGNLYNFSKLKKSRKWLKSILLSDESSEADSQSEDDDEEELNLSREELHNMLRLHKYKKLHQNKYSKDKEL
QQYQYYSAGLLSTYDPFYEQQRHLLGPKKKKFKEEKKLKAKLKKVKKKRRRDEELSSEESPRRHHHQTKVFAKFSHDAPPPGTKKKHLSI
EQLNARRRKVWLSIVKKELPKANKQKASARNLFLTNSRKLAHQCMKEVRRAALQAQKNCKETLPRARRLTKEMLLYWKKYEKVEKEHRKR
AEKEALEQRKLDEEMREAKRQQRKLNFLITQTELYAHFMSRKRDMGHDGIQEEILRKLEDSSTQRQIDIGGGVVVNITQEDYDSNHFKAQ
ALKNAENAYHIHQARTRSFDEDAKESRAAALRAANKSGTGFGESYSLANPSIRAGEDIPQPTIFNGKLKGYQLKGMNWLANLYEQGINGI
LADEMGLGKTVQSIALLAHLAERENIWGPFLIISPASTLNNWHQEFTRFVPKFKVLPYWGNPHDRKVIRRFWSQKTLYTQDAPFHVVITS
YQLVVQDVKYFQRVKWQYMVLDEAQALKSSSSVRWKILLQFQCRNRLLLTGTPIQNTMAELWALLHFIMPTLFDSHEEFNEWFSKDIESH
AENKSAIDENQLSRLHMILKPFMLRRIKKDVENELSDKIEILMYCQLTSRQKLLYQALKNKISIEDLLQSSMGSTQQAQNTTSSLMNLVM
QFRKVCNHPELFERQETWSPFHISLKPYHISKFIYRHGQIRVFNHSRDRWLRVLSPFAPDYIQRSLFHRKGINEESCFSFLRFIDISPAE
MANLMLQGLLARWLALFLSLKASYRLHQLRSWGAPEGESHQRYLRNKDFLLGVNFPLSFPNLCSCPLLKSLVFSSHCKAVSGYSDQVVHQ
RRSATSSLRRCLLTELPSFLCVASPRVTAVPLDSYCNDRSAEYERRVLKEGGSLAAKQCLLNGAPELAADWLNRRSQFFPEPAGGLWSIR
PQNGWSFIRIPALRSRSGRSIINGNWAIDRPGKYEGGGTMFTYKRPNEISSTAGESFLAEGPTNEILDVYMIHQQPNPGVHYEYVIMGTN
AISPQVPPHRRPGEPFNGQMVTEGRSQEEGEQKGRNEEKEDLRGEAPEMFTSESAQTFPVRHPDRFSPHRPDNLVPPAPQPPRRSRDHNW
KQLGTTECSTTCGKGSQYPIFRCVHRSTHEEAPESYCDSSMKPTPEEEPCNIFPCPAFWDIGEWSECSKTCGLGMQHRQVLCRQVYANRS
LTVQPYRCQHLEKPETTSTCQLKICSEWQIRTDWTSCSVPCGVGQRTRDVKCVSNIGDVVDDEECNMKLRPNDIENCDMGPCAKSWFLTE
WSERCSAECGAGVRTRSVVCMTNHVSSLPLEGCGNNRPAEATPCDNGPCTGKVEWFAGSWSQCSIECGSGTQQREVICVRKNADTFEVLD
PSECSFLEKPPSQQSCHLKPCGAKWFSTEWSMCSKSCQGGFRVREVRCLSDDMTLSNLCDPQLKPEERESCNPQDCVPEVDENCKDKYYN

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for INO80-THSD4


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for INO80-THSD4


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for INO80-THSD4


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource