FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:TGFBR1-COL15A1 (FusionGDB2 ID:90456)

Fusion Gene Summary for TGFBR1-COL15A1

check button Fusion gene summary
Fusion gene informationFusion gene name: TGFBR1-COL15A1
Fusion gene ID: 90456
HgeneTgene
Gene symbol

TGFBR1

COL15A1

Gene ID

7046

1306

Gene nametransforming growth factor beta receptor 1collagen type XV alpha 1 chain
SynonymsAAT5|ACVRLK4|ALK-5|ALK5|ESS1|LDS1|LDS1A|LDS2A|MSSE|SKR4|TBR-i|TBRI|TGFR-1|tbetaR-I-
Cytomap

9q22.33

9q22.33

Type of geneprotein-codingprotein-coding
DescriptionTGF-beta receptor type-1activin A receptor type II-like kinase, 53kDaactivin A receptor type II-like protein kinase of 53kDactivin receptor-like kinase 5mutant transforming growth factor beta receptor Iserine/threonine-protein kinase receptor R4trancollagen alpha-1(XV) chaincollagen XV, alpha-1 polypeptidecollagen type XV proteoglycancollagen, type XV, alpha 1endostatin-XVrestin
Modification date2020032920200313
UniProtAcc.

P39059

Ensembl transtripts involved in fusion geneENST00000374990, ENST00000374994, 
ENST00000550253, ENST00000552516, 
ENST00000467052, ENST00000375001, 
Fusion gene scores* DoF score4 X 4 X 3=486 X 7 X 3=126
# samples 47
** MAII scorelog2(4/48*10)=-0.263034405833794
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(7/126*10)=-0.84799690655495
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: TGFBR1 [Title/Abstract] AND COL15A1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointTGFBR1(101910066)-COL15A1(101777699), # samples:3
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneTGFBR1

GO:0000186

activation of MAPKK activity

18625725

HgeneTGFBR1

GO:0001837

epithelial to mesenchymal transition

15761148

HgeneTGFBR1

GO:0006355

regulation of transcription, DNA-templated

14517293

HgeneTGFBR1

GO:0006468

protein phosphorylation

12015308|12065756

HgeneTGFBR1

GO:0007165

signal transduction

14633705

HgeneTGFBR1

GO:0007179

transforming growth factor beta receptor signaling pathway

9389648|11157754

HgeneTGFBR1

GO:0010862

positive regulation of pathway-restricted SMAD protein phosphorylation

9311995|9389648

HgeneTGFBR1

GO:0018105

peptidyl-serine phosphorylation

15761148

HgeneTGFBR1

GO:0018107

peptidyl-threonine phosphorylation

19736306

HgeneTGFBR1

GO:0030307

positive regulation of cell growth

18625725

HgeneTGFBR1

GO:0030335

positive regulation of cell migration

19736306

HgeneTGFBR1

GO:0031396

regulation of protein ubiquitination

18758450

HgeneTGFBR1

GO:0045893

positive regulation of transcription, DNA-templated

9311995|9389648

HgeneTGFBR1

GO:0051897

positive regulation of protein kinase B signaling

18625725

HgeneTGFBR1

GO:0060389

pathway-restricted SMAD protein phosphorylation

11157754|12015308|18625725|19736306

HgeneTGFBR1

GO:0060391

positive regulation of SMAD protein signal transduction

9389648

HgeneTGFBR1

GO:0070723

response to cholesterol

17878231

HgeneTGFBR1

GO:0071560

cellular response to transforming growth factor beta stimulus

19494318

HgeneTGFBR1

GO:2001235

positive regulation of apoptotic signaling pathway

18758450


check buttonFusion gene breakpoints across TGFBR1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across COL15A1 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4LIHCTCGA-RC-A6M3-01ATGFBR1chr9

101910066

-COL15A1chr9

101777699

+
ChimerDB4LIHCTCGA-RC-A6M3-01ATGFBR1chr9

101910066

+COL15A1chr9

101777699

+


Top

Fusion Gene ORF analysis for TGFBR1-COL15A1

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000374990ENST00000467052TGFBR1chr9

101910066

+COL15A1chr9

101777699

+
5CDS-intronENST00000374994ENST00000467052TGFBR1chr9

101910066

+COL15A1chr9

101777699

+
5CDS-intronENST00000550253ENST00000467052TGFBR1chr9

101910066

+COL15A1chr9

101777699

+
5CDS-intronENST00000552516ENST00000467052TGFBR1chr9

101910066

+COL15A1chr9

101777699

+
In-frameENST00000374990ENST00000375001TGFBR1chr9

101910066

+COL15A1chr9

101777699

+
In-frameENST00000374994ENST00000375001TGFBR1chr9

101910066

+COL15A1chr9

101777699

+
In-frameENST00000550253ENST00000375001TGFBR1chr9

101910066

+COL15A1chr9

101777699

+
In-frameENST00000552516ENST00000375001TGFBR1chr9

101910066

+COL15A1chr9

101777699

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000374994TGFBR1chr9101910066+ENST00000375001COL15A1chr9101777699+5218150311743161399
ENST00000374990TGFBR1chr9101910066+ENST00000375001COL15A1chr9101777699+496512509540631322
ENST00000552516TGFBR1chr9101910066+ENST00000375001COL15A1chr9101777699+516914545642671403
ENST00000550253TGFBR1chr9101910066+ENST00000375001COL15A1chr9101777699+509113762941891386

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000374994ENST00000375001TGFBR1chr9101910066+COL15A1chr9101777699+0.0009844040.99901557
ENST00000374990ENST00000375001TGFBR1chr9101910066+COL15A1chr9101777699+0.0009885930.9990114
ENST00000552516ENST00000375001TGFBR1chr9101910066+COL15A1chr9101777699+0.0009144120.99908555
ENST00000550253ENST00000375001TGFBR1chr9101910066+COL15A1chr9101777699+0.0009619250.9990381

Top

Fusion Genomic Features for TGFBR1-COL15A1


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
TGFBR1chr9101910066+COL15A1chr9101777698+0.0028381150.99716187
TGFBR1chr9101910066+COL15A1chr9101777698+0.0028381150.99716187

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.
genomic feature of top 1%

Top

Fusion Protein Features for TGFBR1-COL15A1


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr9:101910066/chr9:101777699)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
.COL15A1

P39059

FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.FUNCTION: Structural protein that stabilizes microvessels and muscle cells, both in heart and in skeletal muscle. {ECO:0000269|PubMed:10049780}.; FUNCTION: Restin potently inhibits angiogenesis. {ECO:0000250}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000374990+78175_204385427.0DomainGS
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000374994+89175_204462504.0DomainGS
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000552516+89175_204466508.0DomainGS
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000374990+78193_194385427.0MotifNote=FKBP1A-binding
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000374994+89193_194462504.0MotifNote=FKBP1A-binding
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000552516+89193_194466508.0MotifNote=FKBP1A-binding
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000374990+78211_219385427.0Nucleotide bindingATP
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000374994+89211_219462504.0Nucleotide bindingATP
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000552516+89211_219466508.0Nucleotide bindingATP
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000374990+7834_126385427.0Topological domainExtracellular
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000374994+8934_126462504.0Topological domainExtracellular
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000552516+8934_126466508.0Topological domainExtracellular
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000374990+78127_147385427.0TransmembraneHelical
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000374994+89127_147462504.0TransmembraneHelical
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000552516+89127_147466508.0TransmembraneHelical
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842619_6804511389.0DomainNote=Collagen-like 1
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842681_7314511389.0DomainNote=Collagen-like 2
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842823_8654511389.0DomainNote=Collagen-like 3
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842879_9274511389.0DomainNote=Collagen-like 4
TgeneCOL15A1chr9:101910066chr9:101777699ENST000003750018421014_10274511389.0RegionNote=Nonhelical region 7 (NC7)
TgeneCOL15A1chr9:101910066chr9:101777699ENST000003750018421028_10454511389.0RegionNote=Triple-helical region 7 (COL7)
TgeneCOL15A1chr9:101910066chr9:101777699ENST000003750018421046_10524511389.0RegionNote=Nonhelical region 8 (NC8)
TgeneCOL15A1chr9:101910066chr9:101777699ENST000003750018421053_11074511389.0RegionNote=Triple-helical region 8 (COL8)
TgeneCOL15A1chr9:101910066chr9:101777699ENST000003750018421108_11174511389.0RegionNote=Nonhelical region 9 (NC9)
TgeneCOL15A1chr9:101910066chr9:101777699ENST000003750018421118_11324511389.0RegionNote=Triple-helical region 9 (COL9)
TgeneCOL15A1chr9:101910066chr9:101777699ENST000003750018421133_13884511389.0RegionNote=Nonhelical region 10 (NC10)
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842556_5734511389.0RegionNote=Triple-helical region 1 (COL1)
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842574_6184511389.0RegionNote=Nonhelical region 2 (NC2)
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842619_7324511389.0RegionNote=Triple-helical region 2 (COL2)
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842733_7634511389.0RegionNote=Nonhelical region 3 (NC3)
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842764_7984511389.0RegionNote=Triple-helical region 3 (COL3)
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842799_8224511389.0RegionNote=Nonhelical region 4 (NC4)
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842823_8674511389.0RegionNote=Triple-helical region 4 (COL4)
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842868_8784511389.0RegionNote=Nonhelical region 5 (NC5)
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842879_9494511389.0RegionNote=Triple-helical region 5 (COL5)
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842950_9834511389.0RegionNote=Nonhelical region 6 (NC6)
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842984_10134511389.0RegionNote=Triple-helical region 6 (COL6)
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842460_5094511389.0RepeatNote=3
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842510_5554511389.0RepeatNote=4

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000374990+78205_495385427.0DomainProtein kinase
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000374994+89205_495462504.0DomainProtein kinase
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000552516+89205_495466508.0DomainProtein kinase
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000374990+78148_503385427.0Topological domainCytoplasmic
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000374994+89148_503462504.0Topological domainCytoplasmic
HgeneTGFBR1chr9:101910066chr9:101777699ENST00000552516+89148_503466508.0Topological domainCytoplasmic
TgeneCOL15A1chr9:101910066chr9:101777699ENST0000037500184266_2494511389.0DomainNote=Laminin G-like
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842229_5554511389.0RegionNote=Nonhelical region 1 (NC1)
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842358_5554511389.0RegionNote=4 X tandem repeats
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842358_4084511389.0RepeatNote=1
TgeneCOL15A1chr9:101910066chr9:101777699ENST00000375001842409_4594511389.0RepeatNote=2


Top

Fusion Gene Sequence for TGFBR1-COL15A1


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>90456_90456_1_TGFBR1-COL15A1_TGFBR1_chr9_101910066_ENST00000374990_COL15A1_chr9_101777699_ENST00000375001_length(transcript)=4965nt_BP=1250nt
GCGGCGGCTAGGGAGGTGGGGCGAGGCGAGGTTTGCTGGGGTGAGGCAGCGGCGCGGCCGGGCCGGGCCGGGCCACAGGCGGTGGCGGCG
GGACCATGGAGGCGGCGGTCGCTGCTCCGCGTCCCCGGCTGCTCCTCCTCGTGCTGGCGGCGGCGGCGGCGGCGGCGGCGGCGCTGCTCC
CGGGGGCGACGGCGTTACAGTGTTTCTGCCACCTCTGTACAAAAGACAATTTTACTTGTGTGACAGATGGGCTCTGCTTTGTCTCTGTCA
CAGAGACCACAGACAAAGTTATACACAACAGCATGTGTATAGCTGAAATTGACTTAATTCCTCGAGATAGGCCGTTTGTATGTGCACCCT
CTTCAAAAACTGGGTCTGTGACTACAACATATTGCTGCAATCAGGACCATTGCAATAAAATAGAACTTCCAACTACTGGTTTACCATTGC
TTGTTCAGAGAACAATTGCGAGAACTATTGTGTTACAAGAAAGCATTGGCAAAGGTCGATTTGGAGAAGTTTGGAGAGGAAAGTGGCGGG
GAGAAGAAGTTGCTGTTAAGATATTCTCCTCTAGAGAAGAACGTTCGTGGTTCCGTGAGGCAGAGATTTATCAAACTGTAATGTTACGTC
ATGAAAACATCCTGGGATTTATAGCAGCAGACAATAAAGACAATGGTACTTGGACTCAGCTCTGGTTGGTGTCAGATTATCATGAGCATG
GATCCCTTTTTGATTACTTAAACAGATACACAGTTACTGTGGAAGGAATGATAAAACTTGCTCTGTCCACGGCGAGCGGTCTTGCCCATC
TTCACATGGAGATTGTTGGTACCCAAGGAAAGCCAGCCATTGCTCATAGAGATTTGAAATCAAAGAATATCTTGGTAAAGAAGAATGGAA
CTTGCTGTATTGCAGACTTAGGACTGGCAGTAAGACATGATTCAGCCACAGATACCATTGATATTGCTCCAAACCACAGAGTGGGAACAA
AAAGGTACATGGCCCCTGAAGTTCTCGATGATTCCATAAATATGAAACATTTTGAATCCTTCAAACGTGCTGACATCTATGCAATGGGCT
TAGTATTCTGGGAAATTGCTCGACGATGTTCCATTGGTGGAATTCATGAAGATTACCAACTGCCTTATTATGATCTTGTACCTTCTGACC
CATCAGTTGAAGAAATGAGAAAAGTTGTTTGTGAACAGAAGTTAAGGCCAAATATCCCAAACAGATGGCAGAGCTGTGAAGGTCCAAGCA
GTGAAGACAGTTTAACAACAGCTGCAGCTGCAACCGAAGTGTCCCTCAGTACTTTTGAGGATGAGGAAGCCAGTGGGGTCCCCACAGATG
GCCTGGCTCCCCTCACAGCCACCATGGCCCCTGAGCGGGCAGTCACTTCTGGTCCTGGTGATGAAGAAGACTTGGCAGCAGCCACAACAG
AGGAGCCCCTCATCACAGCTGGGGGTGAAGAGTCCGGCAGCCCTCCCCCTGATGGGCCACCGCTGCCCCTGCCCACAGTGGCTCCTGAAA
GATGGATCACTCCAGCTCAAAGAGAACATGTGGGAATGAAAGGACAGGCTGGGCCCAAAGGAGAAAAGGGTGATGCTGGGGAGGAGCTTC
CTGGCCCTCCTGAACCTTCTGGGCCTGTTGGACCCACGGCAGGAGCAGAAGCAGAGGGCTCTGGCCTAGGCTGGGGCTCGGACGTCGGCT
CTGGCTCTGGTGACCTGGTGGGCAGTGAGCAGCTGCTGAGAGGTCCTCCAGGACCCCCAGGGCCACCTGGCTTACCTGGGATTCCAGGAA
AACCAGGAACTGATGTTTTCATGGGACCCCCTGGATCTCCTGGAGAGGATGGACCTGCTGGTGAACCTGGGCCCCCGGGCCCTGAGGGAC
AGCCTGGAGTTGATGGAGCCACCGGCCTTCCCGGGATGAAAGGGGAGAAGGGAGCAAGAGGGCCTAATGGCTCAGTTGGTGAAAAGGGTG
ACCCTGGCAACAGAGGCTTACCTGGACCCCCGGGGAAAAAGGGACAAGCTGGCCCTCCTGGGGTCATGGGACCCCCAGGGCCTCCTGGAC
CCCCTGGGCCCCCAGGCCCTGGATGCACAATGGGACTTGGATTCGAGGATACCGAAGGCTCTGGAAGCACCCAGCTATTGAATGAACCCA
AACTCTCCAGACCAACGGCTGCAATTGGTCTCAAAGGAGAGAAAGGAGACCGGGGACCCAAGGGAGAAAGGGGGATGGATGGAGCCAGTA
TTGTGGGACCCCCTGGGCCGAGAGGGCCACCTGGGCACATCAAGGTCTTGTCTAATTCCTTGATCAATATCACCCATGGATTCATGAATT
TCTCGGACATTCCTGAGCTGGTGGGGCCTCCGGGGCCGGACGGGTTGCCTGGGCTGCCAGGATTTCCAGGTCCTAGAGGACCAAAAGGTG
ACACTGGTTTACCTGGCTTTCCAGGACTAAAAGGAGAACAGGGCGAGAAGGGAGAGCCGGGTGCCATCCTGACAGAGGACATTCCTCTGG
AAAGGCTGATGGGGAAAAAGGGTGAACCTGGAATGCATGGAGCCCCAGGACCAATGGGGCCCAAAGGACCACCAGGACATAAAGGAGAAT
TTGGCCTTCCCGGGCGACCTGGTCGCCCAGGACTGAATGGCCTCAAGGGTACCAAAGGAGATCCAGGGGTCATTATGCAGGGCCCACCTG
GCTTACCTGGCCCTCCAGGCCCCCCTGGGCCACCTGGAGCTGTGATTAACATCAAAGGAGCCATTTTCCCAATACCCGTCCGACCACACT
GCAAAATGCCAGTTGATACTGCTCATCCTGGGAGTCCAGAGCTCATCACTTTTCACGGTGTTAAAGGAGAGAAAGGATCCTGGGGTCTTC
CTGGCTCAAAGGGAGAAAAAGGCGACCAGGGAGCCCAGGGACCACCAGGTCCTCCACTTGATCTAGCTTACCTGAGACACTTTCTGAACA
ACTTGAAGGGGGAGAATGGAGACAAGGGGTTCAAAGGTGAAAAAGGAGAAAAAGGAGACATTAATGGCAGCTTCCTTATGTCTGGGCCTC
CAGGCCTGCCCGGAAATCCAGGCCCGGCTGGCCAAAAAGGGGAGACAGTCGTTGGGCCCCAAGGACCCCCAGGTGCTCCTGGTCTGCCTG
GGCCACCTGGCTTTGGAAGACCTGGTGATCCTGGGCCACCGGGGCCCCCGGGGCCACCAGGACCTCCAGCTATCCTGGGAGCAGCTGTGG
CCCTTCCAGGTCCCCCTGGCCCTCCAGGACAGCCAGGGCTTCCCGGATCCAGAAACCTGGTCACAGCATTCAGCAACATGGATGACATGC
TGCAGAAAGCGCATTTGGTTATAGAAGGAACATTCATCTACCTGAGGGACAGCACTGAGTTTTTCATTCGTGTTAGAGATGGCTGGAAAA
AATTACAGCTGGGAGAACTGATCCCCATTCCTGCCGACAGCCCTCCACCCCCTGCGCTTTCCAGCAACCCACATCAGCTTCTGCCTCCAC
CAAACCCTATTTCAAGTGCCAATTATGAGAAGCCTGCTCTGCATTTGGCTGCTCTGAACATGCCATTTTCTGGGGACATTCGAGCTGATT
TTCAGTGCTTCAAGCAGGCCAGAGCTGCAGGACTGTTGTCCACCTACCGAGCATTCTTATCTTCCCATTTGCAAGATCTGTCCACCATTG
TGAGGAAAGCAGAGAGATACAGCCTTCCCATAGTGAACCTCAAGGGCCAAGTACTTTTTAATAATTGGGACTCAATTTTTTCTGGCCACG
GAGGTCAGTTCAATATGCATATTCCAATATACTCCTTTGATGGTCGAGACATAATGACAGATCCTTCTTGGCCCCAGAAAGTCATTTGGC
ATGGCTCCAGCCCCCATGGCGTCCGCCTTGTGGATAACTACTGTGAAGCATGGCGAACCGCGGACACAGCGGTCACGGGACTTGCCTCCC
CGCTGAGCACGGGGAAGATTCTGGACCAGAAAGCATACAGCTGTGCTAATCGGCTAATTGTCCTATGTATCGAAAACAGTTTCATGACAG
ACGCTAGGAAGTAATGGCCTTCTGATGATTCTTAAAGAGTTTTCAATTTTTTCTTATGTGAAGAGTTGACACTGAAATCTAAAATGTTTA
ATTGTTGTAAATATTACAGTTTTTTTTTTTTACTACATATTCTTTACAACAGCAACCAAAGAAAACATACCTCAATACACTCAAAACTGA
AGACATAGAGGACTCAGATCAAAGACAAAATCTGATCCATATATTGGTGCTAGATTCTGCAGGAAACCCCAGCAGTGTGAACGCATCCCA
ACATAGGTTAAGAGCAAGTTGAAAACAAAGGCCATGGCATTCTGCCACTGCATCCTTCAGACAGTTATATCCTCCTTTTAAACCATTGTT
GTTGAGTGTAAGATGTCCTTCATGTTTTCTTATAAAGTCAGTGTTTAGAAATGTTACCCTTTCTAAGTTATATACAGATCAAATGCTTTT
TTCTTTCACGTACATCCATCATTTGCAACTGCTGTTCGTACACAGAAACAGGACTGCTCAAATGATCCTATTTGTATTTTCTGATGCTAT
CAGACTCTAATGTTTTTTTCCCTAAAATATTATTGCCATCATGCTTTAGGAATTTTATATTTTTACACAATCATATTTTAGTATGGTGTC
TGTTTATGTAACTCTGACTTGCTGGAAAAGTTGAAACTCCAAATAATCTGAAACTAGAAAAGAAATAGCACATAATTACTACCTTCCCCT
TGGCGGCTCTCCTCCCCAACCCCCACCCCACAATTTTATGACTTCCATTTGGCAATTGTTGAATTATAACTGCGACTGAAACAAACAGGT
TCATAGAGATGAATTTTCTGAGAAACATATATCTACATGTTGTATAATTGGATTTTTTTTCCATGTAAGTGAACATAAAAACATCTTTTC

>90456_90456_1_TGFBR1-COL15A1_TGFBR1_chr9_101910066_ENST00000374990_COL15A1_chr9_101777699_ENST00000375001_length(amino acids)=1322AA_BP=576
MEAAVAAPRPRLLLLVLAAAAAAAAALLPGATALQCFCHLCTKDNFTCVTDGLCFVSVTETTDKVIHNSMCIAEIDLIPRDRPFVCAPSS
KTGSVTTTYCCNQDHCNKIELPTTGLPLLVQRTIARTIVLQESIGKGRFGEVWRGKWRGEEVAVKIFSSREERSWFREAEIYQTVMLRHE
NILGFIAADNKDNGTWTQLWLVSDYHEHGSLFDYLNRYTVTVEGMIKLALSTASGLAHLHMEIVGTQGKPAIAHRDLKSKNILVKKNGTC
CIADLGLAVRHDSATDTIDIAPNHRVGTKRYMAPEVLDDSINMKHFESFKRADIYAMGLVFWEIARRCSIGGIHEDYQLPYYDLVPSDPS
VEEMRKVVCEQKLRPNIPNRWQSCEGPSSEDSLTTAAAATEVSLSTFEDEEASGVPTDGLAPLTATMAPERAVTSGPGDEEDLAAATTEE
PLITAGGEESGSPPPDGPPLPLPTVAPERWITPAQREHVGMKGQAGPKGEKGDAGEELPGPPEPSGPVGPTAGAEAEGSGLGWGSDVGSG
SGDLVGSEQLLRGPPGPPGPPGLPGIPGKPGTDVFMGPPGSPGEDGPAGEPGPPGPEGQPGVDGATGLPGMKGEKGARGPNGSVGEKGDP
GNRGLPGPPGKKGQAGPPGVMGPPGPPGPPGPPGPGCTMGLGFEDTEGSGSTQLLNEPKLSRPTAAIGLKGEKGDRGPKGERGMDGASIV
GPPGPRGPPGHIKVLSNSLINITHGFMNFSDIPELVGPPGPDGLPGLPGFPGPRGPKGDTGLPGFPGLKGEQGEKGEPGAILTEDIPLER
LMGKKGEPGMHGAPGPMGPKGPPGHKGEFGLPGRPGRPGLNGLKGTKGDPGVIMQGPPGLPGPPGPPGPPGAVINIKGAIFPIPVRPHCK
MPVDTAHPGSPELITFHGVKGEKGSWGLPGSKGEKGDQGAQGPPGPPLDLAYLRHFLNNLKGENGDKGFKGEKGEKGDINGSFLMSGPPG
LPGNPGPAGQKGETVVGPQGPPGAPGLPGPPGFGRPGDPGPPGPPGPPGPPAILGAAVALPGPPGPPGQPGLPGSRNLVTAFSNMDDMLQ
KAHLVIEGTFIYLRDSTEFFIRVRDGWKKLQLGELIPIPADSPPPPALSSNPHQLLPPPNPISSANYEKPALHLAALNMPFSGDIRADFQ
CFKQARAAGLLSTYRAFLSSHLQDLSTIVRKAERYSLPIVNLKGQVLFNNWDSIFSGHGGQFNMHIPIYSFDGRDIMTDPSWPQKVIWHG

--------------------------------------------------------------
>90456_90456_2_TGFBR1-COL15A1_TGFBR1_chr9_101910066_ENST00000374994_COL15A1_chr9_101777699_ENST00000375001_length(transcript)=5218nt_BP=1503nt
AAAGGGCCGGAGCGAGGCCGCCGCGGCGGCTAGGGAGGTGGGGCGAGGCGAGGTTTGCTGGGGTGAGGCAGCGGCGCGGCCGGGCCGGGC
CGGGCCACAGGCGGTGGCGGCGGGACCATGGAGGCGGCGGTCGCTGCTCCGCGTCCCCGGCTGCTCCTCCTCGTGCTGGCGGCGGCGGCG
GCGGCGGCGGCGGCGCTGCTCCCGGGGGCGACGGCGTTACAGTGTTTCTGCCACCTCTGTACAAAAGACAATTTTACTTGTGTGACAGAT
GGGCTCTGCTTTGTCTCTGTCACAGAGACCACAGACAAAGTTATACACAACAGCATGTGTATAGCTGAAATTGACTTAATTCCTCGAGAT
AGGCCGTTTGTATGTGCACCCTCTTCAAAAACTGGGTCTGTGACTACAACATATTGCTGCAATCAGGACCATTGCAATAAAATAGAACTT
CCAACTACTGTAAAGTCATCACCTGGCCTTGGTCCTGTGGAACTGGCAGCTGTCATTGCTGGACCAGTGTGCTTCGTCTGCATCTCACTC
ATGTTGATGGTCTATATCTGCCACAACCGCACTGTCATTCACCATCGAGTGCCAAATGAAGAGGACCCTTCATTAGATCGCCCTTTTATT
TCAGAGGGTACTACGTTGAAAGACTTAATTTATGATATGACAACGTCAGGTTCTGGCTCAGGTTTACCATTGCTTGTTCAGAGAACAATT
GCGAGAACTATTGTGTTACAAGAAAGCATTGGCAAAGGTCGATTTGGAGAAGTTTGGAGAGGAAAGTGGCGGGGAGAAGAAGTTGCTGTT
AAGATATTCTCCTCTAGAGAAGAACGTTCGTGGTTCCGTGAGGCAGAGATTTATCAAACTGTAATGTTACGTCATGAAAACATCCTGGGA
TTTATAGCAGCAGACAATAAAGACAATGGTACTTGGACTCAGCTCTGGTTGGTGTCAGATTATCATGAGCATGGATCCCTTTTTGATTAC
TTAAACAGATACACAGTTACTGTGGAAGGAATGATAAAACTTGCTCTGTCCACGGCGAGCGGTCTTGCCCATCTTCACATGGAGATTGTT
GGTACCCAAGGAAAGCCAGCCATTGCTCATAGAGATTTGAAATCAAAGAATATCTTGGTAAAGAAGAATGGAACTTGCTGTATTGCAGAC
TTAGGACTGGCAGTAAGACATGATTCAGCCACAGATACCATTGATATTGCTCCAAACCACAGAGTGGGAACAAAAAGGTACATGGCCCCT
GAAGTTCTCGATGATTCCATAAATATGAAACATTTTGAATCCTTCAAACGTGCTGACATCTATGCAATGGGCTTAGTATTCTGGGAAATT
GCTCGACGATGTTCCATTGGTGGAATTCATGAAGATTACCAACTGCCTTATTATGATCTTGTACCTTCTGACCCATCAGTTGAAGAAATG
AGAAAAGTTGTTTGTGAACAGAAGTTAAGGCCAAATATCCCAAACAGATGGCAGAGCTGTGAAGGTCCAAGCAGTGAAGACAGTTTAACA
ACAGCTGCAGCTGCAACCGAAGTGTCCCTCAGTACTTTTGAGGATGAGGAAGCCAGTGGGGTCCCCACAGATGGCCTGGCTCCCCTCACA
GCCACCATGGCCCCTGAGCGGGCAGTCACTTCTGGTCCTGGTGATGAAGAAGACTTGGCAGCAGCCACAACAGAGGAGCCCCTCATCACA
GCTGGGGGTGAAGAGTCCGGCAGCCCTCCCCCTGATGGGCCACCGCTGCCCCTGCCCACAGTGGCTCCTGAAAGATGGATCACTCCAGCT
CAAAGAGAACATGTGGGAATGAAAGGACAGGCTGGGCCCAAAGGAGAAAAGGGTGATGCTGGGGAGGAGCTTCCTGGCCCTCCTGAACCT
TCTGGGCCTGTTGGACCCACGGCAGGAGCAGAAGCAGAGGGCTCTGGCCTAGGCTGGGGCTCGGACGTCGGCTCTGGCTCTGGTGACCTG
GTGGGCAGTGAGCAGCTGCTGAGAGGTCCTCCAGGACCCCCAGGGCCACCTGGCTTACCTGGGATTCCAGGAAAACCAGGAACTGATGTT
TTCATGGGACCCCCTGGATCTCCTGGAGAGGATGGACCTGCTGGTGAACCTGGGCCCCCGGGCCCTGAGGGACAGCCTGGAGTTGATGGA
GCCACCGGCCTTCCCGGGATGAAAGGGGAGAAGGGAGCAAGAGGGCCTAATGGCTCAGTTGGTGAAAAGGGTGACCCTGGCAACAGAGGC
TTACCTGGACCCCCGGGGAAAAAGGGACAAGCTGGCCCTCCTGGGGTCATGGGACCCCCAGGGCCTCCTGGACCCCCTGGGCCCCCAGGC
CCTGGATGCACAATGGGACTTGGATTCGAGGATACCGAAGGCTCTGGAAGCACCCAGCTATTGAATGAACCCAAACTCTCCAGACCAACG
GCTGCAATTGGTCTCAAAGGAGAGAAAGGAGACCGGGGACCCAAGGGAGAAAGGGGGATGGATGGAGCCAGTATTGTGGGACCCCCTGGG
CCGAGAGGGCCACCTGGGCACATCAAGGTCTTGTCTAATTCCTTGATCAATATCACCCATGGATTCATGAATTTCTCGGACATTCCTGAG
CTGGTGGGGCCTCCGGGGCCGGACGGGTTGCCTGGGCTGCCAGGATTTCCAGGTCCTAGAGGACCAAAAGGTGACACTGGTTTACCTGGC
TTTCCAGGACTAAAAGGAGAACAGGGCGAGAAGGGAGAGCCGGGTGCCATCCTGACAGAGGACATTCCTCTGGAAAGGCTGATGGGGAAA
AAGGGTGAACCTGGAATGCATGGAGCCCCAGGACCAATGGGGCCCAAAGGACCACCAGGACATAAAGGAGAATTTGGCCTTCCCGGGCGA
CCTGGTCGCCCAGGACTGAATGGCCTCAAGGGTACCAAAGGAGATCCAGGGGTCATTATGCAGGGCCCACCTGGCTTACCTGGCCCTCCA
GGCCCCCCTGGGCCACCTGGAGCTGTGATTAACATCAAAGGAGCCATTTTCCCAATACCCGTCCGACCACACTGCAAAATGCCAGTTGAT
ACTGCTCATCCTGGGAGTCCAGAGCTCATCACTTTTCACGGTGTTAAAGGAGAGAAAGGATCCTGGGGTCTTCCTGGCTCAAAGGGAGAA
AAAGGCGACCAGGGAGCCCAGGGACCACCAGGTCCTCCACTTGATCTAGCTTACCTGAGACACTTTCTGAACAACTTGAAGGGGGAGAAT
GGAGACAAGGGGTTCAAAGGTGAAAAAGGAGAAAAAGGAGACATTAATGGCAGCTTCCTTATGTCTGGGCCTCCAGGCCTGCCCGGAAAT
CCAGGCCCGGCTGGCCAAAAAGGGGAGACAGTCGTTGGGCCCCAAGGACCCCCAGGTGCTCCTGGTCTGCCTGGGCCACCTGGCTTTGGA
AGACCTGGTGATCCTGGGCCACCGGGGCCCCCGGGGCCACCAGGACCTCCAGCTATCCTGGGAGCAGCTGTGGCCCTTCCAGGTCCCCCT
GGCCCTCCAGGACAGCCAGGGCTTCCCGGATCCAGAAACCTGGTCACAGCATTCAGCAACATGGATGACATGCTGCAGAAAGCGCATTTG
GTTATAGAAGGAACATTCATCTACCTGAGGGACAGCACTGAGTTTTTCATTCGTGTTAGAGATGGCTGGAAAAAATTACAGCTGGGAGAA
CTGATCCCCATTCCTGCCGACAGCCCTCCACCCCCTGCGCTTTCCAGCAACCCACATCAGCTTCTGCCTCCACCAAACCCTATTTCAAGT
GCCAATTATGAGAAGCCTGCTCTGCATTTGGCTGCTCTGAACATGCCATTTTCTGGGGACATTCGAGCTGATTTTCAGTGCTTCAAGCAG
GCCAGAGCTGCAGGACTGTTGTCCACCTACCGAGCATTCTTATCTTCCCATTTGCAAGATCTGTCCACCATTGTGAGGAAAGCAGAGAGA
TACAGCCTTCCCATAGTGAACCTCAAGGGCCAAGTACTTTTTAATAATTGGGACTCAATTTTTTCTGGCCACGGAGGTCAGTTCAATATG
CATATTCCAATATACTCCTTTGATGGTCGAGACATAATGACAGATCCTTCTTGGCCCCAGAAAGTCATTTGGCATGGCTCCAGCCCCCAT
GGCGTCCGCCTTGTGGATAACTACTGTGAAGCATGGCGAACCGCGGACACAGCGGTCACGGGACTTGCCTCCCCGCTGAGCACGGGGAAG
ATTCTGGACCAGAAAGCATACAGCTGTGCTAATCGGCTAATTGTCCTATGTATCGAAAACAGTTTCATGACAGACGCTAGGAAGTAATGG
CCTTCTGATGATTCTTAAAGAGTTTTCAATTTTTTCTTATGTGAAGAGTTGACACTGAAATCTAAAATGTTTAATTGTTGTAAATATTAC
AGTTTTTTTTTTTTACTACATATTCTTTACAACAGCAACCAAAGAAAACATACCTCAATACACTCAAAACTGAAGACATAGAGGACTCAG
ATCAAAGACAAAATCTGATCCATATATTGGTGCTAGATTCTGCAGGAAACCCCAGCAGTGTGAACGCATCCCAACATAGGTTAAGAGCAA
GTTGAAAACAAAGGCCATGGCATTCTGCCACTGCATCCTTCAGACAGTTATATCCTCCTTTTAAACCATTGTTGTTGAGTGTAAGATGTC
CTTCATGTTTTCTTATAAAGTCAGTGTTTAGAAATGTTACCCTTTCTAAGTTATATACAGATCAAATGCTTTTTTCTTTCACGTACATCC
ATCATTTGCAACTGCTGTTCGTACACAGAAACAGGACTGCTCAAATGATCCTATTTGTATTTTCTGATGCTATCAGACTCTAATGTTTTT
TTCCCTAAAATATTATTGCCATCATGCTTTAGGAATTTTATATTTTTACACAATCATATTTTAGTATGGTGTCTGTTTATGTAACTCTGA
CTTGCTGGAAAAGTTGAAACTCCAAATAATCTGAAACTAGAAAAGAAATAGCACATAATTACTACCTTCCCCTTGGCGGCTCTCCTCCCC
AACCCCCACCCCACAATTTTATGACTTCCATTTGGCAATTGTTGAATTATAACTGCGACTGAAACAAACAGGTTCATAGAGATGAATTTT

>90456_90456_2_TGFBR1-COL15A1_TGFBR1_chr9_101910066_ENST00000374994_COL15A1_chr9_101777699_ENST00000375001_length(amino acids)=1399AA_BP=653
MEAAVAAPRPRLLLLVLAAAAAAAAALLPGATALQCFCHLCTKDNFTCVTDGLCFVSVTETTDKVIHNSMCIAEIDLIPRDRPFVCAPSS
KTGSVTTTYCCNQDHCNKIELPTTVKSSPGLGPVELAAVIAGPVCFVCISLMLMVYICHNRTVIHHRVPNEEDPSLDRPFISEGTTLKDL
IYDMTTSGSGSGLPLLVQRTIARTIVLQESIGKGRFGEVWRGKWRGEEVAVKIFSSREERSWFREAEIYQTVMLRHENILGFIAADNKDN
GTWTQLWLVSDYHEHGSLFDYLNRYTVTVEGMIKLALSTASGLAHLHMEIVGTQGKPAIAHRDLKSKNILVKKNGTCCIADLGLAVRHDS
ATDTIDIAPNHRVGTKRYMAPEVLDDSINMKHFESFKRADIYAMGLVFWEIARRCSIGGIHEDYQLPYYDLVPSDPSVEEMRKVVCEQKL
RPNIPNRWQSCEGPSSEDSLTTAAAATEVSLSTFEDEEASGVPTDGLAPLTATMAPERAVTSGPGDEEDLAAATTEEPLITAGGEESGSP
PPDGPPLPLPTVAPERWITPAQREHVGMKGQAGPKGEKGDAGEELPGPPEPSGPVGPTAGAEAEGSGLGWGSDVGSGSGDLVGSEQLLRG
PPGPPGPPGLPGIPGKPGTDVFMGPPGSPGEDGPAGEPGPPGPEGQPGVDGATGLPGMKGEKGARGPNGSVGEKGDPGNRGLPGPPGKKG
QAGPPGVMGPPGPPGPPGPPGPGCTMGLGFEDTEGSGSTQLLNEPKLSRPTAAIGLKGEKGDRGPKGERGMDGASIVGPPGPRGPPGHIK
VLSNSLINITHGFMNFSDIPELVGPPGPDGLPGLPGFPGPRGPKGDTGLPGFPGLKGEQGEKGEPGAILTEDIPLERLMGKKGEPGMHGA
PGPMGPKGPPGHKGEFGLPGRPGRPGLNGLKGTKGDPGVIMQGPPGLPGPPGPPGPPGAVINIKGAIFPIPVRPHCKMPVDTAHPGSPEL
ITFHGVKGEKGSWGLPGSKGEKGDQGAQGPPGPPLDLAYLRHFLNNLKGENGDKGFKGEKGEKGDINGSFLMSGPPGLPGNPGPAGQKGE
TVVGPQGPPGAPGLPGPPGFGRPGDPGPPGPPGPPGPPAILGAAVALPGPPGPPGQPGLPGSRNLVTAFSNMDDMLQKAHLVIEGTFIYL
RDSTEFFIRVRDGWKKLQLGELIPIPADSPPPPALSSNPHQLLPPPNPISSANYEKPALHLAALNMPFSGDIRADFQCFKQARAAGLLST
YRAFLSSHLQDLSTIVRKAERYSLPIVNLKGQVLFNNWDSIFSGHGGQFNMHIPIYSFDGRDIMTDPSWPQKVIWHGSSPHGVRLVDNYC

--------------------------------------------------------------
>90456_90456_3_TGFBR1-COL15A1_TGFBR1_chr9_101910066_ENST00000550253_COL15A1_chr9_101777699_ENST00000375001_length(transcript)=5091nt_BP=1376nt
AGATTGACAGAAATGACTAATACAGGTATCTGGCCTCAAGGAGTTTACAGTTTCATAGGAGAAAGAAAAACAAAACTAGCAATTCTGCGT
TACAGTGTTTCTGCCACCTCTGTACAAAAGACAATTTTACTTGTGTGACAGATGGGCTCTGCTTTGTCTCTGTCACAGAGACCACAGACA
AAGTTATACACAACAGCATGTGTATAGCTGAAATTGACTTAATTCCTCGAGATAGGCCGTTTGTATGTGCACCCTCTTCAAAAACTGGGT
CTGTGACTACAACATATTGCTGCAATCAGGACCATTGCAATAAAATAGAACTTCCAACTACTGTAAAGTCATCACCTGGCCTTGGTCCTG
TGGAACTGGCAGCTGTCATTGCTGGACCAGTGTGCTTCGTCTGCATCTCACTCATGTTGATGGTCTATATCTGCCACAACCGCACTGTCA
TTCACCATCGAGTGCCAAATGAAGAGGACCCTTCATTAGATCGCCCTTTTATTTCAGAGGGTACTACGTTGAAAGACTTAATTTATGATA
TGACAACGTCAGGTTCTGGCTCAGGTTTACCATTGCTTGTTCAGAGAACAATTGCGAGAACTATTGTGTTACAAGAAAGCATTGGCAAAG
GTCGATTTGGAGAAGTTTGGAGAGGAAAGTGGCGGGGAGAAGAAGTTGCTGTTAAGATATTCTCCTCTAGAGAAGAACGTTCGTGGTTCC
GTGAGGCAGAGATTTATCAAACTGTAATGTTACGTCATGAAAACATCCTGGGATTTATAGCAGCAGACAATAAAGACAATGGTACTTGGA
CTCAGCTCTGGTTGGTGTCAGATTATCATGAGCATGGATCCCTTTTTGATTACTTAAACAGATACACAGTTACTGTGGAAGGAATGATAA
AACTTGCTCTGTCCACGGCGAGCGGTCTTGCCCATCTTCACATGGAGATTGTTGGTACCCAAGGAAAGCCAGCCATTGCTCATAGAGATT
TGAAATCAAAGAATATCTTGGTAAAGAAGAATGGAACTTGCTGTATTGCAGACTTAGGACTGGCAGTAAGACATGATTCAGCCACAGATA
CCATTGATATTGCTCCAAACCACAGAGTGGGAACAAAAAGGTACATGGCCCCTGAAGTTCTCGATGATTCCATAAATATGAAACATTTTG
AATCCTTCAAACGTGCTGACATCTATGCAATGGGCTTAGTATTCTGGGAAATTGCTCGACGATGTTCCATTGGTGGAATTCATGAAGATT
ACCAACTGCCTTATTATGATCTTGTACCTTCTGACCCATCAGTTGAAGAAATGAGAAAAGTTGTTTGTGAACAGAAGTTAAGGCCAAATA
TCCCAAACAGATGGCAGAGCTGTGAAGGTCCAAGCAGTGAAGACAGTTTAACAACAGCTGCAGCTGCAACCGAAGTGTCCCTCAGTACTT
TTGAGGATGAGGAAGCCAGTGGGGTCCCCACAGATGGCCTGGCTCCCCTCACAGCCACCATGGCCCCTGAGCGGGCAGTCACTTCTGGTC
CTGGTGATGAAGAAGACTTGGCAGCAGCCACAACAGAGGAGCCCCTCATCACAGCTGGGGGTGAAGAGTCCGGCAGCCCTCCCCCTGATG
GGCCACCGCTGCCCCTGCCCACAGTGGCTCCTGAAAGATGGATCACTCCAGCTCAAAGAGAACATGTGGGAATGAAAGGACAGGCTGGGC
CCAAAGGAGAAAAGGGTGATGCTGGGGAGGAGCTTCCTGGCCCTCCTGAACCTTCTGGGCCTGTTGGACCCACGGCAGGAGCAGAAGCAG
AGGGCTCTGGCCTAGGCTGGGGCTCGGACGTCGGCTCTGGCTCTGGTGACCTGGTGGGCAGTGAGCAGCTGCTGAGAGGTCCTCCAGGAC
CCCCAGGGCCACCTGGCTTACCTGGGATTCCAGGAAAACCAGGAACTGATGTTTTCATGGGACCCCCTGGATCTCCTGGAGAGGATGGAC
CTGCTGGTGAACCTGGGCCCCCGGGCCCTGAGGGACAGCCTGGAGTTGATGGAGCCACCGGCCTTCCCGGGATGAAAGGGGAGAAGGGAG
CAAGAGGGCCTAATGGCTCAGTTGGTGAAAAGGGTGACCCTGGCAACAGAGGCTTACCTGGACCCCCGGGGAAAAAGGGACAAGCTGGCC
CTCCTGGGGTCATGGGACCCCCAGGGCCTCCTGGACCCCCTGGGCCCCCAGGCCCTGGATGCACAATGGGACTTGGATTCGAGGATACCG
AAGGCTCTGGAAGCACCCAGCTATTGAATGAACCCAAACTCTCCAGACCAACGGCTGCAATTGGTCTCAAAGGAGAGAAAGGAGACCGGG
GACCCAAGGGAGAAAGGGGGATGGATGGAGCCAGTATTGTGGGACCCCCTGGGCCGAGAGGGCCACCTGGGCACATCAAGGTCTTGTCTA
ATTCCTTGATCAATATCACCCATGGATTCATGAATTTCTCGGACATTCCTGAGCTGGTGGGGCCTCCGGGGCCGGACGGGTTGCCTGGGC
TGCCAGGATTTCCAGGTCCTAGAGGACCAAAAGGTGACACTGGTTTACCTGGCTTTCCAGGACTAAAAGGAGAACAGGGCGAGAAGGGAG
AGCCGGGTGCCATCCTGACAGAGGACATTCCTCTGGAAAGGCTGATGGGGAAAAAGGGTGAACCTGGAATGCATGGAGCCCCAGGACCAA
TGGGGCCCAAAGGACCACCAGGACATAAAGGAGAATTTGGCCTTCCCGGGCGACCTGGTCGCCCAGGACTGAATGGCCTCAAGGGTACCA
AAGGAGATCCAGGGGTCATTATGCAGGGCCCACCTGGCTTACCTGGCCCTCCAGGCCCCCCTGGGCCACCTGGAGCTGTGATTAACATCA
AAGGAGCCATTTTCCCAATACCCGTCCGACCACACTGCAAAATGCCAGTTGATACTGCTCATCCTGGGAGTCCAGAGCTCATCACTTTTC
ACGGTGTTAAAGGAGAGAAAGGATCCTGGGGTCTTCCTGGCTCAAAGGGAGAAAAAGGCGACCAGGGAGCCCAGGGACCACCAGGTCCTC
CACTTGATCTAGCTTACCTGAGACACTTTCTGAACAACTTGAAGGGGGAGAATGGAGACAAGGGGTTCAAAGGTGAAAAAGGAGAAAAAG
GAGACATTAATGGCAGCTTCCTTATGTCTGGGCCTCCAGGCCTGCCCGGAAATCCAGGCCCGGCTGGCCAAAAAGGGGAGACAGTCGTTG
GGCCCCAAGGACCCCCAGGTGCTCCTGGTCTGCCTGGGCCACCTGGCTTTGGAAGACCTGGTGATCCTGGGCCACCGGGGCCCCCGGGGC
CACCAGGACCTCCAGCTATCCTGGGAGCAGCTGTGGCCCTTCCAGGTCCCCCTGGCCCTCCAGGACAGCCAGGGCTTCCCGGATCCAGAA
ACCTGGTCACAGCATTCAGCAACATGGATGACATGCTGCAGAAAGCGCATTTGGTTATAGAAGGAACATTCATCTACCTGAGGGACAGCA
CTGAGTTTTTCATTCGTGTTAGAGATGGCTGGAAAAAATTACAGCTGGGAGAACTGATCCCCATTCCTGCCGACAGCCCTCCACCCCCTG
CGCTTTCCAGCAACCCACATCAGCTTCTGCCTCCACCAAACCCTATTTCAAGTGCCAATTATGAGAAGCCTGCTCTGCATTTGGCTGCTC
TGAACATGCCATTTTCTGGGGACATTCGAGCTGATTTTCAGTGCTTCAAGCAGGCCAGAGCTGCAGGACTGTTGTCCACCTACCGAGCAT
TCTTATCTTCCCATTTGCAAGATCTGTCCACCATTGTGAGGAAAGCAGAGAGATACAGCCTTCCCATAGTGAACCTCAAGGGCCAAGTAC
TTTTTAATAATTGGGACTCAATTTTTTCTGGCCACGGAGGTCAGTTCAATATGCATATTCCAATATACTCCTTTGATGGTCGAGACATAA
TGACAGATCCTTCTTGGCCCCAGAAAGTCATTTGGCATGGCTCCAGCCCCCATGGCGTCCGCCTTGTGGATAACTACTGTGAAGCATGGC
GAACCGCGGACACAGCGGTCACGGGACTTGCCTCCCCGCTGAGCACGGGGAAGATTCTGGACCAGAAAGCATACAGCTGTGCTAATCGGC
TAATTGTCCTATGTATCGAAAACAGTTTCATGACAGACGCTAGGAAGTAATGGCCTTCTGATGATTCTTAAAGAGTTTTCAATTTTTTCT
TATGTGAAGAGTTGACACTGAAATCTAAAATGTTTAATTGTTGTAAATATTACAGTTTTTTTTTTTTACTACATATTCTTTACAACAGCA
ACCAAAGAAAACATACCTCAATACACTCAAAACTGAAGACATAGAGGACTCAGATCAAAGACAAAATCTGATCCATATATTGGTGCTAGA
TTCTGCAGGAAACCCCAGCAGTGTGAACGCATCCCAACATAGGTTAAGAGCAAGTTGAAAACAAAGGCCATGGCATTCTGCCACTGCATC
CTTCAGACAGTTATATCCTCCTTTTAAACCATTGTTGTTGAGTGTAAGATGTCCTTCATGTTTTCTTATAAAGTCAGTGTTTAGAAATGT
TACCCTTTCTAAGTTATATACAGATCAAATGCTTTTTTCTTTCACGTACATCCATCATTTGCAACTGCTGTTCGTACACAGAAACAGGAC
TGCTCAAATGATCCTATTTGTATTTTCTGATGCTATCAGACTCTAATGTTTTTTTCCCTAAAATATTATTGCCATCATGCTTTAGGAATT
TTATATTTTTACACAATCATATTTTAGTATGGTGTCTGTTTATGTAACTCTGACTTGCTGGAAAAGTTGAAACTCCAAATAATCTGAAAC
TAGAAAAGAAATAGCACATAATTACTACCTTCCCCTTGGCGGCTCTCCTCCCCAACCCCCACCCCACAATTTTATGACTTCCATTTGGCA
ATTGTTGAATTATAACTGCGACTGAAACAAACAGGTTCATAGAGATGAATTTTCTGAGAAACATATATCTACATGTTGTATAATTGGATT

>90456_90456_3_TGFBR1-COL15A1_TGFBR1_chr9_101910066_ENST00000550253_COL15A1_chr9_101777699_ENST00000375001_length(amino acids)=1386AA_BP=640
MASRSLQFHRRKKNKTSNSALQCFCHLCTKDNFTCVTDGLCFVSVTETTDKVIHNSMCIAEIDLIPRDRPFVCAPSSKTGSVTTTYCCNQ
DHCNKIELPTTVKSSPGLGPVELAAVIAGPVCFVCISLMLMVYICHNRTVIHHRVPNEEDPSLDRPFISEGTTLKDLIYDMTTSGSGSGL
PLLVQRTIARTIVLQESIGKGRFGEVWRGKWRGEEVAVKIFSSREERSWFREAEIYQTVMLRHENILGFIAADNKDNGTWTQLWLVSDYH
EHGSLFDYLNRYTVTVEGMIKLALSTASGLAHLHMEIVGTQGKPAIAHRDLKSKNILVKKNGTCCIADLGLAVRHDSATDTIDIAPNHRV
GTKRYMAPEVLDDSINMKHFESFKRADIYAMGLVFWEIARRCSIGGIHEDYQLPYYDLVPSDPSVEEMRKVVCEQKLRPNIPNRWQSCEG
PSSEDSLTTAAAATEVSLSTFEDEEASGVPTDGLAPLTATMAPERAVTSGPGDEEDLAAATTEEPLITAGGEESGSPPPDGPPLPLPTVA
PERWITPAQREHVGMKGQAGPKGEKGDAGEELPGPPEPSGPVGPTAGAEAEGSGLGWGSDVGSGSGDLVGSEQLLRGPPGPPGPPGLPGI
PGKPGTDVFMGPPGSPGEDGPAGEPGPPGPEGQPGVDGATGLPGMKGEKGARGPNGSVGEKGDPGNRGLPGPPGKKGQAGPPGVMGPPGP
PGPPGPPGPGCTMGLGFEDTEGSGSTQLLNEPKLSRPTAAIGLKGEKGDRGPKGERGMDGASIVGPPGPRGPPGHIKVLSNSLINITHGF
MNFSDIPELVGPPGPDGLPGLPGFPGPRGPKGDTGLPGFPGLKGEQGEKGEPGAILTEDIPLERLMGKKGEPGMHGAPGPMGPKGPPGHK
GEFGLPGRPGRPGLNGLKGTKGDPGVIMQGPPGLPGPPGPPGPPGAVINIKGAIFPIPVRPHCKMPVDTAHPGSPELITFHGVKGEKGSW
GLPGSKGEKGDQGAQGPPGPPLDLAYLRHFLNNLKGENGDKGFKGEKGEKGDINGSFLMSGPPGLPGNPGPAGQKGETVVGPQGPPGAPG
LPGPPGFGRPGDPGPPGPPGPPGPPAILGAAVALPGPPGPPGQPGLPGSRNLVTAFSNMDDMLQKAHLVIEGTFIYLRDSTEFFIRVRDG
WKKLQLGELIPIPADSPPPPALSSNPHQLLPPPNPISSANYEKPALHLAALNMPFSGDIRADFQCFKQARAAGLLSTYRAFLSSHLQDLS
TIVRKAERYSLPIVNLKGQVLFNNWDSIFSGHGGQFNMHIPIYSFDGRDIMTDPSWPQKVIWHGSSPHGVRLVDNYCEAWRTADTAVTGL

--------------------------------------------------------------
>90456_90456_4_TGFBR1-COL15A1_TGFBR1_chr9_101910066_ENST00000552516_COL15A1_chr9_101777699_ENST00000375001_length(transcript)=5169nt_BP=1454nt
GGTGAGGCAGCGGCGCGGCCGGGCCGGGCCGGGCCACAGGCGGTGGCGGCGGGACCATGGAGGCGGCGGTCGCTGCTCCGCGTCCCCGGC
TGCTCCTCCTCGTGCTGGCGGCGGCGGCGGCGGCGGCGGCGGCGCTGCTCCCGGGGGCGACGGCGTTACAGTGTTTCTGCCACCTCTGTA
CAAAAGACAATTTTACTTGTGTGACAGATGGGCTCTGCTTTGTCTCTGTCACAGAGACCACAGACAAAGTTATACACAACAGCATGTGTA
TAGCTGAAATTGACTTAATTCCTCGAGATAGGCCGTTTGTATGTGCACCCTCTTCAAAAACTGGGTCTGTGACTACAACATATTGCTGCA
ATCAGGACCATTGCAATAAAATAGAACTTCCAACTACTGGCCCTTTTTCAGTAAAGTCATCACCTGGCCTTGGTCCTGTGGAACTGGCAG
CTGTCATTGCTGGACCAGTGTGCTTCGTCTGCATCTCACTCATGTTGATGGTCTATATCTGCCACAACCGCACTGTCATTCACCATCGAG
TGCCAAATGAAGAGGACCCTTCATTAGATCGCCCTTTTATTTCAGAGGGTACTACGTTGAAAGACTTAATTTATGATATGACAACGTCAG
GTTCTGGCTCAGGTTTACCATTGCTTGTTCAGAGAACAATTGCGAGAACTATTGTGTTACAAGAAAGCATTGGCAAAGGTCGATTTGGAG
AAGTTTGGAGAGGAAAGTGGCGGGGAGAAGAAGTTGCTGTTAAGATATTCTCCTCTAGAGAAGAACGTTCGTGGTTCCGTGAGGCAGAGA
TTTATCAAACTGTAATGTTACGTCATGAAAACATCCTGGGATTTATAGCAGCAGACAATAAAGACAATGGTACTTGGACTCAGCTCTGGT
TGGTGTCAGATTATCATGAGCATGGATCCCTTTTTGATTACTTAAACAGATACACAGTTACTGTGGAAGGAATGATAAAACTTGCTCTGT
CCACGGCGAGCGGTCTTGCCCATCTTCACATGGAGATTGTTGGTACCCAAGGAAAGCCAGCCATTGCTCATAGAGATTTGAAATCAAAGA
ATATCTTGGTAAAGAAGAATGGAACTTGCTGTATTGCAGACTTAGGACTGGCAGTAAGACATGATTCAGCCACAGATACCATTGATATTG
CTCCAAACCACAGAGTGGGAACAAAAAGGTACATGGCCCCTGAAGTTCTCGATGATTCCATAAATATGAAACATTTTGAATCCTTCAAAC
GTGCTGACATCTATGCAATGGGCTTAGTATTCTGGGAAATTGCTCGACGATGTTCCATTGGTGGAATTCATGAAGATTACCAACTGCCTT
ATTATGATCTTGTACCTTCTGACCCATCAGTTGAAGAAATGAGAAAAGTTGTTTGTGAACAGAAGTTAAGGCCAAATATCCCAAACAGAT
GGCAGAGCTGTGAAGGTCCAAGCAGTGAAGACAGTTTAACAACAGCTGCAGCTGCAACCGAAGTGTCCCTCAGTACTTTTGAGGATGAGG
AAGCCAGTGGGGTCCCCACAGATGGCCTGGCTCCCCTCACAGCCACCATGGCCCCTGAGCGGGCAGTCACTTCTGGTCCTGGTGATGAAG
AAGACTTGGCAGCAGCCACAACAGAGGAGCCCCTCATCACAGCTGGGGGTGAAGAGTCCGGCAGCCCTCCCCCTGATGGGCCACCGCTGC
CCCTGCCCACAGTGGCTCCTGAAAGATGGATCACTCCAGCTCAAAGAGAACATGTGGGAATGAAAGGACAGGCTGGGCCCAAAGGAGAAA
AGGGTGATGCTGGGGAGGAGCTTCCTGGCCCTCCTGAACCTTCTGGGCCTGTTGGACCCACGGCAGGAGCAGAAGCAGAGGGCTCTGGCC
TAGGCTGGGGCTCGGACGTCGGCTCTGGCTCTGGTGACCTGGTGGGCAGTGAGCAGCTGCTGAGAGGTCCTCCAGGACCCCCAGGGCCAC
CTGGCTTACCTGGGATTCCAGGAAAACCAGGAACTGATGTTTTCATGGGACCCCCTGGATCTCCTGGAGAGGATGGACCTGCTGGTGAAC
CTGGGCCCCCGGGCCCTGAGGGACAGCCTGGAGTTGATGGAGCCACCGGCCTTCCCGGGATGAAAGGGGAGAAGGGAGCAAGAGGGCCTA
ATGGCTCAGTTGGTGAAAAGGGTGACCCTGGCAACAGAGGCTTACCTGGACCCCCGGGGAAAAAGGGACAAGCTGGCCCTCCTGGGGTCA
TGGGACCCCCAGGGCCTCCTGGACCCCCTGGGCCCCCAGGCCCTGGATGCACAATGGGACTTGGATTCGAGGATACCGAAGGCTCTGGAA
GCACCCAGCTATTGAATGAACCCAAACTCTCCAGACCAACGGCTGCAATTGGTCTCAAAGGAGAGAAAGGAGACCGGGGACCCAAGGGAG
AAAGGGGGATGGATGGAGCCAGTATTGTGGGACCCCCTGGGCCGAGAGGGCCACCTGGGCACATCAAGGTCTTGTCTAATTCCTTGATCA
ATATCACCCATGGATTCATGAATTTCTCGGACATTCCTGAGCTGGTGGGGCCTCCGGGGCCGGACGGGTTGCCTGGGCTGCCAGGATTTC
CAGGTCCTAGAGGACCAAAAGGTGACACTGGTTTACCTGGCTTTCCAGGACTAAAAGGAGAACAGGGCGAGAAGGGAGAGCCGGGTGCCA
TCCTGACAGAGGACATTCCTCTGGAAAGGCTGATGGGGAAAAAGGGTGAACCTGGAATGCATGGAGCCCCAGGACCAATGGGGCCCAAAG
GACCACCAGGACATAAAGGAGAATTTGGCCTTCCCGGGCGACCTGGTCGCCCAGGACTGAATGGCCTCAAGGGTACCAAAGGAGATCCAG
GGGTCATTATGCAGGGCCCACCTGGCTTACCTGGCCCTCCAGGCCCCCCTGGGCCACCTGGAGCTGTGATTAACATCAAAGGAGCCATTT
TCCCAATACCCGTCCGACCACACTGCAAAATGCCAGTTGATACTGCTCATCCTGGGAGTCCAGAGCTCATCACTTTTCACGGTGTTAAAG
GAGAGAAAGGATCCTGGGGTCTTCCTGGCTCAAAGGGAGAAAAAGGCGACCAGGGAGCCCAGGGACCACCAGGTCCTCCACTTGATCTAG
CTTACCTGAGACACTTTCTGAACAACTTGAAGGGGGAGAATGGAGACAAGGGGTTCAAAGGTGAAAAAGGAGAAAAAGGAGACATTAATG
GCAGCTTCCTTATGTCTGGGCCTCCAGGCCTGCCCGGAAATCCAGGCCCGGCTGGCCAAAAAGGGGAGACAGTCGTTGGGCCCCAAGGAC
CCCCAGGTGCTCCTGGTCTGCCTGGGCCACCTGGCTTTGGAAGACCTGGTGATCCTGGGCCACCGGGGCCCCCGGGGCCACCAGGACCTC
CAGCTATCCTGGGAGCAGCTGTGGCCCTTCCAGGTCCCCCTGGCCCTCCAGGACAGCCAGGGCTTCCCGGATCCAGAAACCTGGTCACAG
CATTCAGCAACATGGATGACATGCTGCAGAAAGCGCATTTGGTTATAGAAGGAACATTCATCTACCTGAGGGACAGCACTGAGTTTTTCA
TTCGTGTTAGAGATGGCTGGAAAAAATTACAGCTGGGAGAACTGATCCCCATTCCTGCCGACAGCCCTCCACCCCCTGCGCTTTCCAGCA
ACCCACATCAGCTTCTGCCTCCACCAAACCCTATTTCAAGTGCCAATTATGAGAAGCCTGCTCTGCATTTGGCTGCTCTGAACATGCCAT
TTTCTGGGGACATTCGAGCTGATTTTCAGTGCTTCAAGCAGGCCAGAGCTGCAGGACTGTTGTCCACCTACCGAGCATTCTTATCTTCCC
ATTTGCAAGATCTGTCCACCATTGTGAGGAAAGCAGAGAGATACAGCCTTCCCATAGTGAACCTCAAGGGCCAAGTACTTTTTAATAATT
GGGACTCAATTTTTTCTGGCCACGGAGGTCAGTTCAATATGCATATTCCAATATACTCCTTTGATGGTCGAGACATAATGACAGATCCTT
CTTGGCCCCAGAAAGTCATTTGGCATGGCTCCAGCCCCCATGGCGTCCGCCTTGTGGATAACTACTGTGAAGCATGGCGAACCGCGGACA
CAGCGGTCACGGGACTTGCCTCCCCGCTGAGCACGGGGAAGATTCTGGACCAGAAAGCATACAGCTGTGCTAATCGGCTAATTGTCCTAT
GTATCGAAAACAGTTTCATGACAGACGCTAGGAAGTAATGGCCTTCTGATGATTCTTAAAGAGTTTTCAATTTTTTCTTATGTGAAGAGT
TGACACTGAAATCTAAAATGTTTAATTGTTGTAAATATTACAGTTTTTTTTTTTTACTACATATTCTTTACAACAGCAACCAAAGAAAAC
ATACCTCAATACACTCAAAACTGAAGACATAGAGGACTCAGATCAAAGACAAAATCTGATCCATATATTGGTGCTAGATTCTGCAGGAAA
CCCCAGCAGTGTGAACGCATCCCAACATAGGTTAAGAGCAAGTTGAAAACAAAGGCCATGGCATTCTGCCACTGCATCCTTCAGACAGTT
ATATCCTCCTTTTAAACCATTGTTGTTGAGTGTAAGATGTCCTTCATGTTTTCTTATAAAGTCAGTGTTTAGAAATGTTACCCTTTCTAA
GTTATATACAGATCAAATGCTTTTTTCTTTCACGTACATCCATCATTTGCAACTGCTGTTCGTACACAGAAACAGGACTGCTCAAATGAT
CCTATTTGTATTTTCTGATGCTATCAGACTCTAATGTTTTTTTCCCTAAAATATTATTGCCATCATGCTTTAGGAATTTTATATTTTTAC
ACAATCATATTTTAGTATGGTGTCTGTTTATGTAACTCTGACTTGCTGGAAAAGTTGAAACTCCAAATAATCTGAAACTAGAAAAGAAAT
AGCACATAATTACTACCTTCCCCTTGGCGGCTCTCCTCCCCAACCCCCACCCCACAATTTTATGACTTCCATTTGGCAATTGTTGAATTA
TAACTGCGACTGAAACAAACAGGTTCATAGAGATGAATTTTCTGAGAAACATATATCTACATGTTGTATAATTGGATTTTTTTTCCATGT

>90456_90456_4_TGFBR1-COL15A1_TGFBR1_chr9_101910066_ENST00000552516_COL15A1_chr9_101777699_ENST00000375001_length(amino acids)=1403AA_BP=657
MEAAVAAPRPRLLLLVLAAAAAAAAALLPGATALQCFCHLCTKDNFTCVTDGLCFVSVTETTDKVIHNSMCIAEIDLIPRDRPFVCAPSS
KTGSVTTTYCCNQDHCNKIELPTTGPFSVKSSPGLGPVELAAVIAGPVCFVCISLMLMVYICHNRTVIHHRVPNEEDPSLDRPFISEGTT
LKDLIYDMTTSGSGSGLPLLVQRTIARTIVLQESIGKGRFGEVWRGKWRGEEVAVKIFSSREERSWFREAEIYQTVMLRHENILGFIAAD
NKDNGTWTQLWLVSDYHEHGSLFDYLNRYTVTVEGMIKLALSTASGLAHLHMEIVGTQGKPAIAHRDLKSKNILVKKNGTCCIADLGLAV
RHDSATDTIDIAPNHRVGTKRYMAPEVLDDSINMKHFESFKRADIYAMGLVFWEIARRCSIGGIHEDYQLPYYDLVPSDPSVEEMRKVVC
EQKLRPNIPNRWQSCEGPSSEDSLTTAAAATEVSLSTFEDEEASGVPTDGLAPLTATMAPERAVTSGPGDEEDLAAATTEEPLITAGGEE
SGSPPPDGPPLPLPTVAPERWITPAQREHVGMKGQAGPKGEKGDAGEELPGPPEPSGPVGPTAGAEAEGSGLGWGSDVGSGSGDLVGSEQ
LLRGPPGPPGPPGLPGIPGKPGTDVFMGPPGSPGEDGPAGEPGPPGPEGQPGVDGATGLPGMKGEKGARGPNGSVGEKGDPGNRGLPGPP
GKKGQAGPPGVMGPPGPPGPPGPPGPGCTMGLGFEDTEGSGSTQLLNEPKLSRPTAAIGLKGEKGDRGPKGERGMDGASIVGPPGPRGPP
GHIKVLSNSLINITHGFMNFSDIPELVGPPGPDGLPGLPGFPGPRGPKGDTGLPGFPGLKGEQGEKGEPGAILTEDIPLERLMGKKGEPG
MHGAPGPMGPKGPPGHKGEFGLPGRPGRPGLNGLKGTKGDPGVIMQGPPGLPGPPGPPGPPGAVINIKGAIFPIPVRPHCKMPVDTAHPG
SPELITFHGVKGEKGSWGLPGSKGEKGDQGAQGPPGPPLDLAYLRHFLNNLKGENGDKGFKGEKGEKGDINGSFLMSGPPGLPGNPGPAG
QKGETVVGPQGPPGAPGLPGPPGFGRPGDPGPPGPPGPPGPPAILGAAVALPGPPGPPGQPGLPGSRNLVTAFSNMDDMLQKAHLVIEGT
FIYLRDSTEFFIRVRDGWKKLQLGELIPIPADSPPPPALSSNPHQLLPPPNPISSANYEKPALHLAALNMPFSGDIRADFQCFKQARAAG
LLSTYRAFLSSHLQDLSTIVRKAERYSLPIVNLKGQVLFNNWDSIFSGHGGQFNMHIPIYSFDGRDIMTDPSWPQKVIWHGSSPHGVRLV

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for TGFBR1-COL15A1


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for TGFBR1-COL15A1


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for TGFBR1-COL15A1


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource