FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:COL6A2-MYH9 (FusionGDB2 ID:18436)

Fusion Gene Summary for COL6A2-MYH9

check button Fusion gene summary
Fusion gene informationFusion gene name: COL6A2-MYH9
Fusion gene ID: 18436
HgeneTgene
Gene symbol

COL6A2

MYH9

Gene ID

1292

4627

Gene namecollagen type VI alpha 2 chainmyosin heavy chain 9
SynonymsBTHLM1|PP3610|UCMD1BDPLT6|DFNA17|EPSTS|FTNS|MATINS|MHA|NMHC-II-A|NMMHC-IIA|NMMHCA
Cytomap

21q22.3

22q12.3

Type of geneprotein-codingprotein-coding
Descriptioncollagen alpha-2(VI) chaincollagen VI, alpha-2 polypeptidecollagen, type VI, alpha 2epididymis secretory sperm binding proteinhuman mRNA for collagen VI alpha-2 C-terminal globular domainmyosin-9cellular myosin heavy chain, type Amyosin, heavy chain 9, non-musclenon-muscle myosin heavy chain 9non-muscle myosin heavy chain Anon-muscle myosin heavy chain IIanon-muscle myosin heavy polypeptide 9nonmuscle myosin heavy chain II-A
Modification date2020032820200315
UniProtAcc.

P35579

Ensembl transtripts involved in fusion geneENST00000300527, ENST00000310645, 
ENST00000357838, ENST00000397763, 
ENST00000409416, ENST00000460886, 
ENST00000401701, ENST00000475726, 
ENST00000216181, 
Fusion gene scores* DoF score13 X 9 X 8=93644 X 46 X 15=30360
# samples 1356
** MAII scorelog2(13/936*10)=-2.84799690655495
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(56/30360*10)=-5.76060115335786
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: COL6A2 [Title/Abstract] AND MYH9 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointCOL6A2(47542072)-MYH9(36695088), # samples:1
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
TgeneMYH9

GO:0001525

angiogenesis

16403913

TgeneMYH9

GO:0001778

plasma membrane repair

27325790

TgeneMYH9

GO:0006509

membrane protein ectodomain proteolysis

16186248

TgeneMYH9

GO:0030048

actin filament-based movement

12237319|15845534

TgeneMYH9

GO:0031032

actomyosin structure organization

24072716


check buttonFusion gene breakpoints across COL6A2 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across MYH9 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4HNSCTCGA-HD-7831-01ACOL6A2chr21

47542072

+MYH9chr22

36695088

-


Top

Fusion Gene ORF analysis for COL6A2-MYH9

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000300527ENST00000401701COL6A2chr21

47542072

+MYH9chr22

36695088

-
5CDS-intronENST00000300527ENST00000475726COL6A2chr21

47542072

+MYH9chr22

36695088

-
5CDS-intronENST00000310645ENST00000401701COL6A2chr21

47542072

+MYH9chr22

36695088

-
5CDS-intronENST00000310645ENST00000475726COL6A2chr21

47542072

+MYH9chr22

36695088

-
5CDS-intronENST00000357838ENST00000401701COL6A2chr21

47542072

+MYH9chr22

36695088

-
5CDS-intronENST00000357838ENST00000475726COL6A2chr21

47542072

+MYH9chr22

36695088

-
5CDS-intronENST00000397763ENST00000401701COL6A2chr21

47542072

+MYH9chr22

36695088

-
5CDS-intronENST00000397763ENST00000475726COL6A2chr21

47542072

+MYH9chr22

36695088

-
5CDS-intronENST00000409416ENST00000401701COL6A2chr21

47542072

+MYH9chr22

36695088

-
5CDS-intronENST00000409416ENST00000475726COL6A2chr21

47542072

+MYH9chr22

36695088

-
In-frameENST00000300527ENST00000216181COL6A2chr21

47542072

+MYH9chr22

36695088

-
In-frameENST00000310645ENST00000216181COL6A2chr21

47542072

+MYH9chr22

36695088

-
In-frameENST00000357838ENST00000216181COL6A2chr21

47542072

+MYH9chr22

36695088

-
In-frameENST00000397763ENST00000216181COL6A2chr21

47542072

+MYH9chr22

36695088

-
In-frameENST00000409416ENST00000216181COL6A2chr21

47542072

+MYH9chr22

36695088

-
intron-3CDSENST00000460886ENST00000216181COL6A2chr21

47542072

+MYH9chr22

36695088

-
intron-intronENST00000460886ENST00000401701COL6A2chr21

47542072

+MYH9chr22

36695088

-
intron-intronENST00000460886ENST00000475726COL6A2chr21

47542072

+MYH9chr22

36695088

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000300527COL6A2chr2147542072+ENST00000216181MYH9chr2236695088-597016761745821521
ENST00000310645COL6A2chr2147542072+ENST00000216181MYH9chr2236695088-594816548245601492
ENST00000357838COL6A2chr2147542072+ENST00000216181MYH9chr2236695088-594816548245601492
ENST00000409416COL6A2chr2147542072+ENST00000216181MYH9chr2236695088-59291635045411513
ENST00000397763COL6A2chr2147542072+ENST00000216181MYH9chr2236695088-589315992745051492

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000300527ENST00000216181COL6A2chr2147542072+MYH9chr2236695088-0.017313420.98268664
ENST00000310645ENST00000216181COL6A2chr2147542072+MYH9chr2236695088-0.0171760820.98282397
ENST00000357838ENST00000216181COL6A2chr2147542072+MYH9chr2236695088-0.0171760820.98282397
ENST00000409416ENST00000216181COL6A2chr2147542072+MYH9chr2236695088-0.0170055680.98299444
ENST00000397763ENST00000216181COL6A2chr2147542072+MYH9chr2236695088-0.0168358590.98316413

Top

Fusion Genomic Features for COL6A2-MYH9


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for COL6A2-MYH9


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr21:47542072/chr22:36695088)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
.MYH9

P35579

FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.FUNCTION: Cellular myosin that appears to play a role in cytokinesis, cell shape, and specialized functions such as secretion and capping. Promotes also cell motility together with S100A4 (PubMed:16707441). During cell spreading, plays an important role in cytoskeleton reorganization, focal contacts formation (in the margins but not the central part of spreading cells), and lamellipodial retraction; this function is mechanically antagonized by MYH10 (PubMed:20052411). {ECO:0000250|UniProtKB:Q8VDD5, ECO:0000269|PubMed:16707441, ECO:0000269|PubMed:20052411}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000300527+192846_2345241020.0DomainVWFA 1
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000310645+192846_234524829.0DomainVWFA 1
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000357838+192846_234524919.0DomainVWFA 1
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000397763+182746_234524919.0DomainVWFA 1
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000409416+182746_234524829.0DomainVWFA 1
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000300527+1928366_3685241020.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000300527+1928426_4285241020.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000300527+1928489_4915241020.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000300527+1928498_5005241020.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000310645+1928366_368524829.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000310645+1928426_428524829.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000310645+1928489_491524829.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000310645+1928498_500524829.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000357838+1928366_368524919.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000357838+1928426_428524919.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000357838+1928489_491524919.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000357838+1928498_500524919.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000397763+1827366_368524919.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000397763+1827426_428524919.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000397763+1827489_491524919.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000397763+1827498_500524919.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000409416+1827366_368524829.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000409416+1827426_428524829.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000409416+1827489_491524829.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000409416+1827498_500524829.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000300527+192821_2565241020.0RegionNote=Nonhelical region
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000310645+192821_256524829.0RegionNote=Nonhelical region
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000357838+192821_256524919.0RegionNote=Nonhelical region
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000397763+182721_256524919.0RegionNote=Nonhelical region
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000409416+182721_256524829.0RegionNote=Nonhelical region

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000300527+1928615_8055241020.0DomainVWFA 2
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000300527+1928833_10145241020.0DomainVWFA 3
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000310645+1928615_805524829.0DomainVWFA 2
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000310645+1928833_1014524829.0DomainVWFA 3
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000357838+1928615_805524919.0DomainVWFA 2
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000357838+1928833_1014524919.0DomainVWFA 3
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000397763+1827615_805524919.0DomainVWFA 2
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000397763+1827833_1014524919.0DomainVWFA 3
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000409416+1827615_805524829.0DomainVWFA 2
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000409416+1827833_1014524829.0DomainVWFA 3
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000300527+1928539_5415241020.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000310645+1928539_541524829.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000357838+1928539_541524919.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000397763+1827539_541524919.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000409416+1827539_541524829.0MotifCell attachment site
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000300527+1928257_5905241020.0RegionNote=Triple-helical region
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000300527+1928591_10195241020.0RegionNote=Nonhelical region
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000310645+1928257_590524829.0RegionNote=Triple-helical region
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000310645+1928591_1019524829.0RegionNote=Nonhelical region
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000357838+1928257_590524919.0RegionNote=Triple-helical region
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000357838+1928591_1019524919.0RegionNote=Nonhelical region
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000397763+1827257_590524919.0RegionNote=Triple-helical region
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000397763+1827591_1019524919.0RegionNote=Nonhelical region
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000409416+1827257_590524829.0RegionNote=Triple-helical region
HgeneCOL6A2chr21:47542072chr22:36695088ENST00000409416+1827591_1019524829.0RegionNote=Nonhelical region
TgeneMYH9chr21:47542072chr22:36695088ENST000002161812241837_19269921961.0Coiled coilOntology_term=ECO:0000255
TgeneMYH9chr21:47542072chr22:36695088ENST00000216181224127_779921961.0DomainMyosin N-terminal SH3-like
TgeneMYH9chr21:47542072chr22:36695088ENST000002161812241779_8089921961.0DomainIQ
TgeneMYH9chr21:47542072chr22:36695088ENST00000216181224181_7769921961.0DomainMyosin motor
TgeneMYH9chr21:47542072chr22:36695088ENST000002161812241174_1819921961.0Nucleotide bindingATP
TgeneMYH9chr21:47542072chr22:36695088ENST000002161812241654_6769921961.0RegionNote=Actin-binding


Top

Fusion Gene Sequence for COL6A2-MYH9


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>18436_18436_1_COL6A2-MYH9_COL6A2_chr21_47542072_ENST00000300527_MYH9_chr22_36695088_ENST00000216181_length(transcript)=5970nt_BP=1676nt
CGGCCGCGGTTCCCTCCCTGCTGCTTACTCGGCGCCCGCGCCTCGGGCCGTCGGGAGCGGAGCCTCCTCGGGACCAGGACTTCAGGGCCA
CAGGTGCTGCCAAGATGCTCCAGGGCACCTGCTCCGTGCTCCTGCTCTGGGGAATCCTGGGGGCCATCCAGGCCCAGCAGCAGGAGGTCA
TCTCGCCGGACACTACCGAGAGAAACAACAACTGCCCAGAGAAGACCGACTGCCCCATCCACGTGTACTTCGTGCTGGACACCTCGGAGA
GCGTCACCATGCAGTCCCCCACGGACATCCTGCTCTTCCACATGAAGCAGTTCGTGCCGCAGTTCATCAGCCAGCTGCAGAACGAGTTCT
ACCTGGACCAGGTGGCGCTGAGCTGGCGCTACGGCGGCCTGCACTTCTCTGACCAGGTGGAGGTGTTCAGCCCACCGGGCAGCGACCGGG
CCTCCTTCATCAAGAACCTGCAGGGCATCAGCTCCTTCCGCCGCGGCACCTTCACCGACTGCGCGCTGGCCAACATGACGGAGCAGATCC
GGCAGGACCGCAGCAAGGGCACCGTCCACTTCGCCGTGGTCATCACCGACGGCCACGTCACCGGCAGCCCCTGCGGGGGCATCAAGCTGC
AGGCCGAGCGGGCCCGCGAGGAGGGCATCCGGCTCTTCGCCGTGGCCCCCAACCAGAACCTGAAGGAGCAGGGCCTGCGGGACATCGCCA
GCACGCCGCACGAGCTCTACCGCAACGACTACGCCACCATGCTGCCCGACTCCACCGAGATCGACCAGGACACCATCAACCGCATCATCA
AGGTCATGAAACACGAAGCCTACGGAGAGTGCTACAAGGTGAGCTGCCTGGAAATCCCTGGGCCCTCTGGCCCCAAGGGCTACCGTGGAC
AGAAGGGTGCCAAGGGCAACATGGGTGAGCCGGGAGAGCCTGGCCAGAAGGGAAGACAGGGAGACCCGGGCATCGAAGGCCCCATTGGAT
TCCCAGGACCCAAGGGCGTTCCTGGCTTCAAAGGAGAGAAGGGTGAATTTGGAGCCGACGGTCGCAAGGGGGCCCCTGGCCTGGCTGGCA
AGAACGGGACCGATGGACAGAAGGGCAAGCTGGGGCGCATCGGACCTCCTGGCTGCAAGGGAGACCCTGGAAACCGGGGCCCCGACGGTT
ACCCGGGGGAAGCAGGGAGTCCAGGGGAGCGAGGAGACCAAGGCGGCAAGGGGGACCCTGGCCGCCCAGGACGCAGAGGGCCCCCGGGAG
AAATCGGGGCCAAGGGAAGCAAGGGGTATCAAGGCAACAGTGGAGCCCCAGGAAGTCCTGGTGTGAAAGGAGCCAAGGGCGGGCCTGGGC
CCCGCGGACCCAAAGGCGAGCCGGGGCGCAGGGGAGACCCCGGCACCAAGGGCAGCCCAGGCAGCGATGGCCCCAAGGGGGAGAAGGGGG
ACCCTGGCCCTGAGGGGCCCCGCGGCCTGGCTGGAGAGGTTGGCAACAAAGGAGCCAAGGGAGACCGAGGCTTGCCTGGACCCAGAGGCC
CCCAGGGAGCTCTTGGGGAGCCCGGAAAGCAGGGATCTCGGGGAGACCCCGGTGATGCAGGACCCCGTGGAGACTCAGGACAGCCAGGCC
CCAAGGGAGACCCCGGCAGGCCTGGATTCAGCTACCCAGGACCCCGAGGAGCACCCGAAAAGAAACTGCTGGAAGACAGAATAGCTGAGT
TCACCACCAACCTCACAGAAGAGGAGGAGAAATCTAAGAGCCTCGCCAAGCTCAAGAACAAGCATGAGGCAATGATCACTGACTTGGAAG
AGCGCCTCCGCAGGGAGGAGAAGCAGCGACAGGAGCTGGAGAAGACCCGCCGGAAGCTGGAGGGAGACTCCACAGACCTCAGCGACCAGA
TCGCCGAGCTCCAGGCCCAGATCGCGGAGCTCAAGATGCAGCTGGCCAAGAAAGAGGAGGAGCTCCAGGCCGCCCTGGCCAGAGTGGAAG
AGGAAGCTGCCCAGAAGAACATGGCCCTCAAGAAGATCCGGGAGCTGGAATCTCAGATCTCTGAACTCCAGGAAGACCTGGAGTCTGAGC
GTGCTTCCAGGAATAAAGCTGAGAAGCAGAAACGGGACCTTGGGGAAGAGCTAGAGGCTCTGAAAACAGAGTTGGAGGACACGCTGGATT
CCACAGCTGCCCAGCAGGAGCTCAGGTCAAAACGTGAGCAGGAGGTGAACATCCTGAAGAAGACCCTGGAGGAGGAGGCCAAGACCCACG
AGGCCCAGATCCAGGAGATGAGGCAGAAGCACTCACAGGCCGTGGAGGAGCTGGCGGAGCAGCTGGAGCAGACGAAGCGGGTGAAAGCAA
ACCTCGAGAAGGCAAAGCAGACTCTGGAGAACGAGCGGGGGGAGCTGGCCAACGAGGTGAAGGTGCTGCTGCAGGGCAAAGGGGACTCGG
AGCACAAGCGCAAGAAAGTGGAGGCGCAGCTGCAGGAGCTGCAGGTCAAGTTCAACGAGGGAGAGCGCGTGCGCACAGAGCTGGCCGACA
AGGTCACCAAGCTGCAGGTGGAGCTGGACAACGTGACCGGGCTTCTCAGCCAGTCCGACAGCAAGTCCAGCAAGCTCACCAAGGACTTCT
CCGCGCTGGAGTCCCAGCTGCAGGACACTCAGGAGCTGCTGCAGGAGGAGAACCGGCAGAAGCTGAGCCTGAGCACCAAGCTCAAGCAGG
TGGAGGACGAGAAGAATTCCTTCCGGGAGCAGCTGGAGGAGGAGGAGGAGGCCAAGCACAACCTGGAGAAGCAGATCGCCACCCTCCATG
CCCAGGTGGCCGACATGAAAAAGAAGATGGAGGACAGTGTGGGGTGCCTGGAAACTGCTGAGGAGGTGAAGAGGAAGCTCCAGAAGGACC
TGGAGGGCCTGAGCCAGCGGCACGAGGAGAAGGTGGCCGCCTACGACAAGCTGGAGAAGACCAAGACGCGGCTGCAGCAGGAGCTGGACG
ACCTGCTGGTGGACCTGGACCACCAGCGCCAGAGCGCGTGCAACCTGGAGAAGAAGCAGAAGAAGTTTGACCAGCTCCTGGCGGAGGAGA
AGACCATCTCTGCCAAGTATGCAGAGGAGCGCGACCGGGCTGAGGCGGAGGCCCGAGAGAAGGAGACCAAGGCTCTGTCGCTGGCCCGGG
CCCTGGAGGAAGCCATGGAGCAGAAGGCGGAGCTGGAGCGGCTCAACAAGCAGTTCCGCACGGAGATGGAGGACCTTATGAGCTCCAAGG
ATGATGTGGGCAAGAGTGTCCACGAGCTGGAGAAGTCCAAGCGGGCCCTAGAGCAGCAGGTGGAGGAGATGAAGACGCAGCTGGAAGAGC
TGGAGGACGAGCTGCAGGCCACCGAAGATGCCAAGCTGCGGTTGGAGGTCAACCTGCAGGCCATGAAGGCCCAGTTCGAGCGGGACCTGC
AGGGCCGGGACGAGCAGAGCGAGGAGAAGAAGAAGCAGCTGGTCAGACAGGTGCGGGAGATGGAGGCAGAGCTGGAGGACGAGAGGAAGC
AGCGCTCGATGGCAGTGGCCGCCCGGAAGAAGCTGGAGATGGACCTGAAGGACCTGGAGGCGCACATCGACTCGGCCAACAAGAACCGGG
ACGAAGCCATCAAACAGCTGCGGAAGCTGCAGGCCCAGATGAAGGACTGCATGCGCGAGCTGGATGACACCCGCGCCTCTCGTGAGGAGA
TCCTGGCCCAGGCCAAAGAGAACGAGAAGAAGCTGAAGAGCATGGAGGCCGAGATGATCCAGTTGCAGGAGGAACTGGCAGCCGCGGAGC
GTGCCAAGCGCCAGGCCCAGCAGGAGCGGGATGAGCTGGCTGACGAGATCGCCAACAGCAGCGGCAAAGGAGCCCTGGCGTTAGAGGAGA
AGCGGCGTCTGGAGGCCCGCATCGCCCAGCTGGAGGAGGAGCTGGAGGAGGAGCAGGGCAACACGGAGCTGATCAACGACCGGCTGAAGA
AGGCCAACCTGCAGATCGACCAGATCAACACCGACCTGAACCTGGAGCGCAGCCACGCCCAGAAGAACGAGAATGCTCGGCAGCAGCTGG
AACGCCAGAACAAGGAGCTTAAGGTCAAGCTGCAGGAGATGGAGGGCACTGTCAAGTCCAAGTACAAGGCCTCCATCACCGCCCTCGAGG
CCAAGATTGCACAGCTGGAGGAGCAGCTGGACAACGAGACCAAGGAGCGCCAGGCAGCCTGCAAACAGGTGCGTCGGACCGAGAAGAAGC
TGAAGGATGTGCTGCTGCAGGTGGATGACGAGCGGAGGAACGCCGAGCAGTACAAGGACCAGGCCGACAAGGCATCTACCCGCCTGAAGC
AGCTCAAGCGGCAGCTGGAGGAGGCCGAAGAGGAGGCCCAGCGGGCCAACGCCTCCCGCCGGAAACTGCAGCGCGAGCTGGAGGACGCCA
CTGAGACGGCCGATGCCATGAACCGCGAAGTCAGCTCCCTAAAGAACAAGCTCAGGCGCGGGGACCTGCCGTTTGTCGTGCCCCGCCGAA
TGGCCCGGAAAGGCGCCGGGGATGGCTCCGACGAAGAGGTAGATGGCAAAGCGGATGGGGCTGAGGCCAAACCTGCCGAATAAGCCTCTT
CTCCTGCAGCCTGAGATGGATGGACAGACAGACACCACAGCCTCCCCTTCCCAGACCCCGCAGCACGCCTCTCCCCACCTTCTTGGGACT
GCTGTGAACATGCCTCCTCCTGCCCTCCGCCCCGTCCCCCCATCCCGTTTCCCTCCAGGTGTTGTTGAGGGCATTTGGCTTCCTCTGCTG
CATCCCCTTCCAGCTCCCTCCCCTGCTCAGAATCTGATACCAAAGAGACAGGGCCCGGGCCCAGGCAGAGAGCGACCAGCAGGCTCCTCA
GCCCTCTCTTGCCAAAAAGCACAAGATGTTGAGGCGAGCAGGGCAGGCCCCCGGGGAGGGGCCAGAGTTTTCTATGAATCTATTTTTCTT
CAGACTGAGGCCTTTTGGTAGTCGGAGCCCCCGCAGTCGTCAGCCTCCCTGACGTCTGCCACCAGCGCCCCCACTCCTCCTCCTTTCTTT
GCTGTTTGCAATCACACGTGGTGACCTCACACACCTCTGCCCCTTGGGCCTCCCACTCCCATGGCTCTGGGCGGTCCAGAAGGAGCAGGC
CCTGGGCCTCCACCTCTGTGCAGGGCACAGAAGGCTGGGGTGGGGGGAGGAGTGGATTCCTCCCCACCCTGTCCCAGGCAGCGCCACTGT
CCGCTGTCTCCCTCCTGATTCTAAAATGTCTCAAGTGCAATGCCCCCTCCCCTCCTTTACCGAGGACAGCCTGCCTCTGCCACAGCAAGG
CTGTCGGGGTCAAGCTGGAAAGGCCAGCAGCCTTCCAGTGGCTTCTCCCAACACTCTTGGGGACCAAATATATTTAATGGTTAAGGGACT
TGTCCCAAGTCTGACAGCCAGAGCGTTAGAGGGGCCAGCGGCCCTCCCAGGCGATCTTGTGTCTACTCTAGGACTGGGCCCGAGGGTGGT
TTACCTGCACCGTTGACTCAGTATAGTTTAAAAATCTGCCACCTGCACAGGTATTTTTGAAAGCAAAATAAGGTTTTCTTTTTTCCCCTT
TCTTGTAATAAATGATAAAATTCCGAGTCTTTCTCACTGCCTTTGTTTAGAAGAGAGTAGCTCGTCCTCACTGGTCTACACTGGTTGCCG
AATTTACTTGTATTCCTAACTGTTTTGTATATGCTGCATTGAGACTTACGGCAAGAAGGCATTTTTTTTTTTTAAAGGAAACAAACTCTC
AAATCATGAAGTGATATAAAAGCTGCATATGCCTACAAAGCTCTGAATTCAGGTCCCAGTTGCTGTCACAAAGGAGTGAGTGAAACTCCC
ACCCTACCCCCTTTTTTATATAATAAAAGTGCCTTAGCATGTGTTGCAGCTGTCACCACTACAGTAAGCTGGTTTACAGATGTTTTCCAC

>18436_18436_1_COL6A2-MYH9_COL6A2_chr21_47542072_ENST00000300527_MYH9_chr22_36695088_ENST00000216181_length(amino acids)=1521AA_BP=369
MLLTRRPRLGPSGAEPPRDQDFRATGAAKMLQGTCSVLLLWGILGAIQAQQQEVISPDTTERNNNCPEKTDCPIHVYFVLDTSESVTMQS
PTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDRSK
GTVHFAVVITDGHVTGSPCGGIKLQAERAREEGIRLFAVAPNQNLKEQGLRDIASTPHELYRNDYATMLPDSTEIDQDTINRIIKVMKHE
AYGECYKVSCLEIPGPSGPKGYRGQKGAKGNMGEPGEPGQKGRQGDPGIEGPIGFPGPKGVPGFKGEKGEFGADGRKGAPGLAGKNGTDG
QKGKLGRIGPPGCKGDPGNRGPDGYPGEAGSPGERGDQGGKGDPGRPGRRGPPGEIGAKGSKGYQGNSGAPGSPGVKGAKGGPGPRGPKG
EPGRRGDPGTKGSPGSDGPKGEKGDPGPEGPRGLAGEVGNKGAKGDRGLPGPRGPQGALGEPGKQGSRGDPGDAGPRGDSGQPGPKGDPG
RPGFSYPGPRGAPEKKLLEDRIAEFTTNLTEEEEKSKSLAKLKNKHEAMITDLEERLRREEKQRQELEKTRRKLEGDSTDLSDQIAELQA
QIAELKMQLAKKEEELQAALARVEEEAAQKNMALKKIRELESQISELQEDLESERASRNKAEKQKRDLGEELEALKTELEDTLDSTAAQQ
ELRSKREQEVNILKKTLEEEAKTHEAQIQEMRQKHSQAVEELAEQLEQTKRVKANLEKAKQTLENERGELANEVKVLLQGKGDSEHKRKK
VEAQLQELQVKFNEGERVRTELADKVTKLQVELDNVTGLLSQSDSKSSKLTKDFSALESQLQDTQELLQEENRQKLSLSTKLKQVEDEKN
SFREQLEEEEEAKHNLEKQIATLHAQVADMKKKMEDSVGCLETAEEVKRKLQKDLEGLSQRHEEKVAAYDKLEKTKTRLQQELDDLLVDL
DHQRQSACNLEKKQKKFDQLLAEEKTISAKYAEERDRAEAEAREKETKALSLARALEEAMEQKAELERLNKQFRTEMEDLMSSKDDVGKS
VHELEKSKRALEQQVEEMKTQLEELEDELQATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELEDERKQRSMAV
AARKKLEMDLKDLEAHIDSANKNRDEAIKQLRKLQAQMKDCMRELDDTRASREEILAQAKENEKKLKSMEAEMIQLQEELAAAERAKRQA
QQERDELADEIANSSGKGALALEEKRRLEARIAQLEEELEEEQGNTELINDRLKKANLQIDQINTDLNLERSHAQKNENARQQLERQNKE
LKVKLQEMEGTVKSKYKASITALEAKIAQLEEQLDNETKERQAACKQVRRTEKKLKDVLLQVDDERRNAEQYKDQADKASTRLKQLKRQL

--------------------------------------------------------------
>18436_18436_2_COL6A2-MYH9_COL6A2_chr21_47542072_ENST00000310645_MYH9_chr22_36695088_ENST00000216181_length(transcript)=5948nt_BP=1654nt
GCTTACTCGGCGCCCGCGCCTCGGGCCGTCGGGAGCGGAGCCTCCTCGGGACCAGGACTTCAGGGCCACAGGTGCTGCCAAGATGCTCCA
GGGCACCTGCTCCGTGCTCCTGCTCTGGGGAATCCTGGGGGCCATCCAGGCCCAGCAGCAGGAGGTCATCTCGCCGGACACTACCGAGAG
AAACAACAACTGCCCAGAGAAGACCGACTGCCCCATCCACGTGTACTTCGTGCTGGACACCTCGGAGAGCGTCACCATGCAGTCCCCCAC
GGACATCCTGCTCTTCCACATGAAGCAGTTCGTGCCGCAGTTCATCAGCCAGCTGCAGAACGAGTTCTACCTGGACCAGGTGGCGCTGAG
CTGGCGCTACGGCGGCCTGCACTTCTCTGACCAGGTGGAGGTGTTCAGCCCACCGGGCAGCGACCGGGCCTCCTTCATCAAGAACCTGCA
GGGCATCAGCTCCTTCCGCCGCGGCACCTTCACCGACTGCGCGCTGGCCAACATGACGGAGCAGATCCGGCAGGACCGCAGCAAGGGCAC
CGTCCACTTCGCCGTGGTCATCACCGACGGCCACGTCACCGGCAGCCCCTGCGGGGGCATCAAGCTGCAGGCCGAGCGGGCCCGCGAGGA
GGGCATCCGGCTCTTCGCCGTGGCCCCCAACCAGAACCTGAAGGAGCAGGGCCTGCGGGACATCGCCAGCACGCCGCACGAGCTCTACCG
CAACGACTACGCCACCATGCTGCCCGACTCCACCGAGATCGACCAGGACACCATCAACCGCATCATCAAGGTCATGAAACACGAAGCCTA
CGGAGAGTGCTACAAGGTGAGCTGCCTGGAAATCCCTGGGCCCTCTGGCCCCAAGGGCTACCGTGGACAGAAGGGTGCCAAGGGCAACAT
GGGTGAGCCGGGAGAGCCTGGCCAGAAGGGAAGACAGGGAGACCCGGGCATCGAAGGCCCCATTGGATTCCCAGGACCCAAGGGCGTTCC
TGGCTTCAAAGGAGAGAAGGGTGAATTTGGAGCCGACGGTCGCAAGGGGGCCCCTGGCCTGGCTGGCAAGAACGGGACCGATGGACAGAA
GGGCAAGCTGGGGCGCATCGGACCTCCTGGCTGCAAGGGAGACCCTGGAAACCGGGGCCCCGACGGTTACCCGGGGGAAGCAGGGAGTCC
AGGGGAGCGAGGAGACCAAGGCGGCAAGGGGGACCCTGGCCGCCCAGGACGCAGAGGGCCCCCGGGAGAAATCGGGGCCAAGGGAAGCAA
GGGGTATCAAGGCAACAGTGGAGCCCCAGGAAGTCCTGGTGTGAAAGGAGCCAAGGGCGGGCCTGGGCCCCGCGGACCCAAAGGCGAGCC
GGGGCGCAGGGGAGACCCCGGCACCAAGGGCAGCCCAGGCAGCGATGGCCCCAAGGGGGAGAAGGGGGACCCTGGCCCTGAGGGGCCCCG
CGGCCTGGCTGGAGAGGTTGGCAACAAAGGAGCCAAGGGAGACCGAGGCTTGCCTGGACCCAGAGGCCCCCAGGGAGCTCTTGGGGAGCC
CGGAAAGCAGGGATCTCGGGGAGACCCCGGTGATGCAGGACCCCGTGGAGACTCAGGACAGCCAGGCCCCAAGGGAGACCCCGGCAGGCC
TGGATTCAGCTACCCAGGACCCCGAGGAGCACCCGAAAAGAAACTGCTGGAAGACAGAATAGCTGAGTTCACCACCAACCTCACAGAAGA
GGAGGAGAAATCTAAGAGCCTCGCCAAGCTCAAGAACAAGCATGAGGCAATGATCACTGACTTGGAAGAGCGCCTCCGCAGGGAGGAGAA
GCAGCGACAGGAGCTGGAGAAGACCCGCCGGAAGCTGGAGGGAGACTCCACAGACCTCAGCGACCAGATCGCCGAGCTCCAGGCCCAGAT
CGCGGAGCTCAAGATGCAGCTGGCCAAGAAAGAGGAGGAGCTCCAGGCCGCCCTGGCCAGAGTGGAAGAGGAAGCTGCCCAGAAGAACAT
GGCCCTCAAGAAGATCCGGGAGCTGGAATCTCAGATCTCTGAACTCCAGGAAGACCTGGAGTCTGAGCGTGCTTCCAGGAATAAAGCTGA
GAAGCAGAAACGGGACCTTGGGGAAGAGCTAGAGGCTCTGAAAACAGAGTTGGAGGACACGCTGGATTCCACAGCTGCCCAGCAGGAGCT
CAGGTCAAAACGTGAGCAGGAGGTGAACATCCTGAAGAAGACCCTGGAGGAGGAGGCCAAGACCCACGAGGCCCAGATCCAGGAGATGAG
GCAGAAGCACTCACAGGCCGTGGAGGAGCTGGCGGAGCAGCTGGAGCAGACGAAGCGGGTGAAAGCAAACCTCGAGAAGGCAAAGCAGAC
TCTGGAGAACGAGCGGGGGGAGCTGGCCAACGAGGTGAAGGTGCTGCTGCAGGGCAAAGGGGACTCGGAGCACAAGCGCAAGAAAGTGGA
GGCGCAGCTGCAGGAGCTGCAGGTCAAGTTCAACGAGGGAGAGCGCGTGCGCACAGAGCTGGCCGACAAGGTCACCAAGCTGCAGGTGGA
GCTGGACAACGTGACCGGGCTTCTCAGCCAGTCCGACAGCAAGTCCAGCAAGCTCACCAAGGACTTCTCCGCGCTGGAGTCCCAGCTGCA
GGACACTCAGGAGCTGCTGCAGGAGGAGAACCGGCAGAAGCTGAGCCTGAGCACCAAGCTCAAGCAGGTGGAGGACGAGAAGAATTCCTT
CCGGGAGCAGCTGGAGGAGGAGGAGGAGGCCAAGCACAACCTGGAGAAGCAGATCGCCACCCTCCATGCCCAGGTGGCCGACATGAAAAA
GAAGATGGAGGACAGTGTGGGGTGCCTGGAAACTGCTGAGGAGGTGAAGAGGAAGCTCCAGAAGGACCTGGAGGGCCTGAGCCAGCGGCA
CGAGGAGAAGGTGGCCGCCTACGACAAGCTGGAGAAGACCAAGACGCGGCTGCAGCAGGAGCTGGACGACCTGCTGGTGGACCTGGACCA
CCAGCGCCAGAGCGCGTGCAACCTGGAGAAGAAGCAGAAGAAGTTTGACCAGCTCCTGGCGGAGGAGAAGACCATCTCTGCCAAGTATGC
AGAGGAGCGCGACCGGGCTGAGGCGGAGGCCCGAGAGAAGGAGACCAAGGCTCTGTCGCTGGCCCGGGCCCTGGAGGAAGCCATGGAGCA
GAAGGCGGAGCTGGAGCGGCTCAACAAGCAGTTCCGCACGGAGATGGAGGACCTTATGAGCTCCAAGGATGATGTGGGCAAGAGTGTCCA
CGAGCTGGAGAAGTCCAAGCGGGCCCTAGAGCAGCAGGTGGAGGAGATGAAGACGCAGCTGGAAGAGCTGGAGGACGAGCTGCAGGCCAC
CGAAGATGCCAAGCTGCGGTTGGAGGTCAACCTGCAGGCCATGAAGGCCCAGTTCGAGCGGGACCTGCAGGGCCGGGACGAGCAGAGCGA
GGAGAAGAAGAAGCAGCTGGTCAGACAGGTGCGGGAGATGGAGGCAGAGCTGGAGGACGAGAGGAAGCAGCGCTCGATGGCAGTGGCCGC
CCGGAAGAAGCTGGAGATGGACCTGAAGGACCTGGAGGCGCACATCGACTCGGCCAACAAGAACCGGGACGAAGCCATCAAACAGCTGCG
GAAGCTGCAGGCCCAGATGAAGGACTGCATGCGCGAGCTGGATGACACCCGCGCCTCTCGTGAGGAGATCCTGGCCCAGGCCAAAGAGAA
CGAGAAGAAGCTGAAGAGCATGGAGGCCGAGATGATCCAGTTGCAGGAGGAACTGGCAGCCGCGGAGCGTGCCAAGCGCCAGGCCCAGCA
GGAGCGGGATGAGCTGGCTGACGAGATCGCCAACAGCAGCGGCAAAGGAGCCCTGGCGTTAGAGGAGAAGCGGCGTCTGGAGGCCCGCAT
CGCCCAGCTGGAGGAGGAGCTGGAGGAGGAGCAGGGCAACACGGAGCTGATCAACGACCGGCTGAAGAAGGCCAACCTGCAGATCGACCA
GATCAACACCGACCTGAACCTGGAGCGCAGCCACGCCCAGAAGAACGAGAATGCTCGGCAGCAGCTGGAACGCCAGAACAAGGAGCTTAA
GGTCAAGCTGCAGGAGATGGAGGGCACTGTCAAGTCCAAGTACAAGGCCTCCATCACCGCCCTCGAGGCCAAGATTGCACAGCTGGAGGA
GCAGCTGGACAACGAGACCAAGGAGCGCCAGGCAGCCTGCAAACAGGTGCGTCGGACCGAGAAGAAGCTGAAGGATGTGCTGCTGCAGGT
GGATGACGAGCGGAGGAACGCCGAGCAGTACAAGGACCAGGCCGACAAGGCATCTACCCGCCTGAAGCAGCTCAAGCGGCAGCTGGAGGA
GGCCGAAGAGGAGGCCCAGCGGGCCAACGCCTCCCGCCGGAAACTGCAGCGCGAGCTGGAGGACGCCACTGAGACGGCCGATGCCATGAA
CCGCGAAGTCAGCTCCCTAAAGAACAAGCTCAGGCGCGGGGACCTGCCGTTTGTCGTGCCCCGCCGAATGGCCCGGAAAGGCGCCGGGGA
TGGCTCCGACGAAGAGGTAGATGGCAAAGCGGATGGGGCTGAGGCCAAACCTGCCGAATAAGCCTCTTCTCCTGCAGCCTGAGATGGATG
GACAGACAGACACCACAGCCTCCCCTTCCCAGACCCCGCAGCACGCCTCTCCCCACCTTCTTGGGACTGCTGTGAACATGCCTCCTCCTG
CCCTCCGCCCCGTCCCCCCATCCCGTTTCCCTCCAGGTGTTGTTGAGGGCATTTGGCTTCCTCTGCTGCATCCCCTTCCAGCTCCCTCCC
CTGCTCAGAATCTGATACCAAAGAGACAGGGCCCGGGCCCAGGCAGAGAGCGACCAGCAGGCTCCTCAGCCCTCTCTTGCCAAAAAGCAC
AAGATGTTGAGGCGAGCAGGGCAGGCCCCCGGGGAGGGGCCAGAGTTTTCTATGAATCTATTTTTCTTCAGACTGAGGCCTTTTGGTAGT
CGGAGCCCCCGCAGTCGTCAGCCTCCCTGACGTCTGCCACCAGCGCCCCCACTCCTCCTCCTTTCTTTGCTGTTTGCAATCACACGTGGT
GACCTCACACACCTCTGCCCCTTGGGCCTCCCACTCCCATGGCTCTGGGCGGTCCAGAAGGAGCAGGCCCTGGGCCTCCACCTCTGTGCA
GGGCACAGAAGGCTGGGGTGGGGGGAGGAGTGGATTCCTCCCCACCCTGTCCCAGGCAGCGCCACTGTCCGCTGTCTCCCTCCTGATTCT
AAAATGTCTCAAGTGCAATGCCCCCTCCCCTCCTTTACCGAGGACAGCCTGCCTCTGCCACAGCAAGGCTGTCGGGGTCAAGCTGGAAAG
GCCAGCAGCCTTCCAGTGGCTTCTCCCAACACTCTTGGGGACCAAATATATTTAATGGTTAAGGGACTTGTCCCAAGTCTGACAGCCAGA
GCGTTAGAGGGGCCAGCGGCCCTCCCAGGCGATCTTGTGTCTACTCTAGGACTGGGCCCGAGGGTGGTTTACCTGCACCGTTGACTCAGT
ATAGTTTAAAAATCTGCCACCTGCACAGGTATTTTTGAAAGCAAAATAAGGTTTTCTTTTTTCCCCTTTCTTGTAATAAATGATAAAATT
CCGAGTCTTTCTCACTGCCTTTGTTTAGAAGAGAGTAGCTCGTCCTCACTGGTCTACACTGGTTGCCGAATTTACTTGTATTCCTAACTG
TTTTGTATATGCTGCATTGAGACTTACGGCAAGAAGGCATTTTTTTTTTTTAAAGGAAACAAACTCTCAAATCATGAAGTGATATAAAAG
CTGCATATGCCTACAAAGCTCTGAATTCAGGTCCCAGTTGCTGTCACAAAGGAGTGAGTGAAACTCCCACCCTACCCCCTTTTTTATATA
ATAAAAGTGCCTTAGCATGTGTTGCAGCTGTCACCACTACAGTAAGCTGGTTTACAGATGTTTTCCACTGAGCATCACAATAAAGAGAAC

>18436_18436_2_COL6A2-MYH9_COL6A2_chr21_47542072_ENST00000310645_MYH9_chr22_36695088_ENST00000216181_length(amino acids)=1492AA_BP=340
MLQGTCSVLLLWGILGAIQAQQQEVISPDTTERNNNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQV
ALSWRYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDRSKGTVHFAVVITDGHVTGSPCGGIKLQAERA
REEGIRLFAVAPNQNLKEQGLRDIASTPHELYRNDYATMLPDSTEIDQDTINRIIKVMKHEAYGECYKVSCLEIPGPSGPKGYRGQKGAK
GNMGEPGEPGQKGRQGDPGIEGPIGFPGPKGVPGFKGEKGEFGADGRKGAPGLAGKNGTDGQKGKLGRIGPPGCKGDPGNRGPDGYPGEA
GSPGERGDQGGKGDPGRPGRRGPPGEIGAKGSKGYQGNSGAPGSPGVKGAKGGPGPRGPKGEPGRRGDPGTKGSPGSDGPKGEKGDPGPE
GPRGLAGEVGNKGAKGDRGLPGPRGPQGALGEPGKQGSRGDPGDAGPRGDSGQPGPKGDPGRPGFSYPGPRGAPEKKLLEDRIAEFTTNL
TEEEEKSKSLAKLKNKHEAMITDLEERLRREEKQRQELEKTRRKLEGDSTDLSDQIAELQAQIAELKMQLAKKEEELQAALARVEEEAAQ
KNMALKKIRELESQISELQEDLESERASRNKAEKQKRDLGEELEALKTELEDTLDSTAAQQELRSKREQEVNILKKTLEEEAKTHEAQIQ
EMRQKHSQAVEELAEQLEQTKRVKANLEKAKQTLENERGELANEVKVLLQGKGDSEHKRKKVEAQLQELQVKFNEGERVRTELADKVTKL
QVELDNVTGLLSQSDSKSSKLTKDFSALESQLQDTQELLQEENRQKLSLSTKLKQVEDEKNSFREQLEEEEEAKHNLEKQIATLHAQVAD
MKKKMEDSVGCLETAEEVKRKLQKDLEGLSQRHEEKVAAYDKLEKTKTRLQQELDDLLVDLDHQRQSACNLEKKQKKFDQLLAEEKTISA
KYAEERDRAEAEAREKETKALSLARALEEAMEQKAELERLNKQFRTEMEDLMSSKDDVGKSVHELEKSKRALEQQVEEMKTQLEELEDEL
QATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELEDERKQRSMAVAARKKLEMDLKDLEAHIDSANKNRDEAIK
QLRKLQAQMKDCMRELDDTRASREEILAQAKENEKKLKSMEAEMIQLQEELAAAERAKRQAQQERDELADEIANSSGKGALALEEKRRLE
ARIAQLEEELEEEQGNTELINDRLKKANLQIDQINTDLNLERSHAQKNENARQQLERQNKELKVKLQEMEGTVKSKYKASITALEAKIAQ
LEEQLDNETKERQAACKQVRRTEKKLKDVLLQVDDERRNAEQYKDQADKASTRLKQLKRQLEEAEEEAQRANASRRKLQRELEDATETAD

--------------------------------------------------------------
>18436_18436_3_COL6A2-MYH9_COL6A2_chr21_47542072_ENST00000357838_MYH9_chr22_36695088_ENST00000216181_length(transcript)=5948nt_BP=1654nt
GCTTACTCGGCGCCCGCGCCTCGGGCCGTCGGGAGCGGAGCCTCCTCGGGACCAGGACTTCAGGGCCACAGGTGCTGCCAAGATGCTCCA
GGGCACCTGCTCCGTGCTCCTGCTCTGGGGAATCCTGGGGGCCATCCAGGCCCAGCAGCAGGAGGTCATCTCGCCGGACACTACCGAGAG
AAACAACAACTGCCCAGAGAAGACCGACTGCCCCATCCACGTGTACTTCGTGCTGGACACCTCGGAGAGCGTCACCATGCAGTCCCCCAC
GGACATCCTGCTCTTCCACATGAAGCAGTTCGTGCCGCAGTTCATCAGCCAGCTGCAGAACGAGTTCTACCTGGACCAGGTGGCGCTGAG
CTGGCGCTACGGCGGCCTGCACTTCTCTGACCAGGTGGAGGTGTTCAGCCCACCGGGCAGCGACCGGGCCTCCTTCATCAAGAACCTGCA
GGGCATCAGCTCCTTCCGCCGCGGCACCTTCACCGACTGCGCGCTGGCCAACATGACGGAGCAGATCCGGCAGGACCGCAGCAAGGGCAC
CGTCCACTTCGCCGTGGTCATCACCGACGGCCACGTCACCGGCAGCCCCTGCGGGGGCATCAAGCTGCAGGCCGAGCGGGCCCGCGAGGA
GGGCATCCGGCTCTTCGCCGTGGCCCCCAACCAGAACCTGAAGGAGCAGGGCCTGCGGGACATCGCCAGCACGCCGCACGAGCTCTACCG
CAACGACTACGCCACCATGCTGCCCGACTCCACCGAGATCGACCAGGACACCATCAACCGCATCATCAAGGTCATGAAACACGAAGCCTA
CGGAGAGTGCTACAAGGTGAGCTGCCTGGAAATCCCTGGGCCCTCTGGCCCCAAGGGCTACCGTGGACAGAAGGGTGCCAAGGGCAACAT
GGGTGAGCCGGGAGAGCCTGGCCAGAAGGGAAGACAGGGAGACCCGGGCATCGAAGGCCCCATTGGATTCCCAGGACCCAAGGGCGTTCC
TGGCTTCAAAGGAGAGAAGGGTGAATTTGGAGCCGACGGTCGCAAGGGGGCCCCTGGCCTGGCTGGCAAGAACGGGACCGATGGACAGAA
GGGCAAGCTGGGGCGCATCGGACCTCCTGGCTGCAAGGGAGACCCTGGAAACCGGGGCCCCGACGGTTACCCGGGGGAAGCAGGGAGTCC
AGGGGAGCGAGGAGACCAAGGCGGCAAGGGGGACCCTGGCCGCCCAGGACGCAGAGGGCCCCCGGGAGAAATCGGGGCCAAGGGAAGCAA
GGGGTATCAAGGCAACAGTGGAGCCCCAGGAAGTCCTGGTGTGAAAGGAGCCAAGGGCGGGCCTGGGCCCCGCGGACCCAAAGGCGAGCC
GGGGCGCAGGGGAGACCCCGGCACCAAGGGCAGCCCAGGCAGCGATGGCCCCAAGGGGGAGAAGGGGGACCCTGGCCCTGAGGGGCCCCG
CGGCCTGGCTGGAGAGGTTGGCAACAAAGGAGCCAAGGGAGACCGAGGCTTGCCTGGACCCAGAGGCCCCCAGGGAGCTCTTGGGGAGCC
CGGAAAGCAGGGATCTCGGGGAGACCCCGGTGATGCAGGACCCCGTGGAGACTCAGGACAGCCAGGCCCCAAGGGAGACCCCGGCAGGCC
TGGATTCAGCTACCCAGGACCCCGAGGAGCACCCGAAAAGAAACTGCTGGAAGACAGAATAGCTGAGTTCACCACCAACCTCACAGAAGA
GGAGGAGAAATCTAAGAGCCTCGCCAAGCTCAAGAACAAGCATGAGGCAATGATCACTGACTTGGAAGAGCGCCTCCGCAGGGAGGAGAA
GCAGCGACAGGAGCTGGAGAAGACCCGCCGGAAGCTGGAGGGAGACTCCACAGACCTCAGCGACCAGATCGCCGAGCTCCAGGCCCAGAT
CGCGGAGCTCAAGATGCAGCTGGCCAAGAAAGAGGAGGAGCTCCAGGCCGCCCTGGCCAGAGTGGAAGAGGAAGCTGCCCAGAAGAACAT
GGCCCTCAAGAAGATCCGGGAGCTGGAATCTCAGATCTCTGAACTCCAGGAAGACCTGGAGTCTGAGCGTGCTTCCAGGAATAAAGCTGA
GAAGCAGAAACGGGACCTTGGGGAAGAGCTAGAGGCTCTGAAAACAGAGTTGGAGGACACGCTGGATTCCACAGCTGCCCAGCAGGAGCT
CAGGTCAAAACGTGAGCAGGAGGTGAACATCCTGAAGAAGACCCTGGAGGAGGAGGCCAAGACCCACGAGGCCCAGATCCAGGAGATGAG
GCAGAAGCACTCACAGGCCGTGGAGGAGCTGGCGGAGCAGCTGGAGCAGACGAAGCGGGTGAAAGCAAACCTCGAGAAGGCAAAGCAGAC
TCTGGAGAACGAGCGGGGGGAGCTGGCCAACGAGGTGAAGGTGCTGCTGCAGGGCAAAGGGGACTCGGAGCACAAGCGCAAGAAAGTGGA
GGCGCAGCTGCAGGAGCTGCAGGTCAAGTTCAACGAGGGAGAGCGCGTGCGCACAGAGCTGGCCGACAAGGTCACCAAGCTGCAGGTGGA
GCTGGACAACGTGACCGGGCTTCTCAGCCAGTCCGACAGCAAGTCCAGCAAGCTCACCAAGGACTTCTCCGCGCTGGAGTCCCAGCTGCA
GGACACTCAGGAGCTGCTGCAGGAGGAGAACCGGCAGAAGCTGAGCCTGAGCACCAAGCTCAAGCAGGTGGAGGACGAGAAGAATTCCTT
CCGGGAGCAGCTGGAGGAGGAGGAGGAGGCCAAGCACAACCTGGAGAAGCAGATCGCCACCCTCCATGCCCAGGTGGCCGACATGAAAAA
GAAGATGGAGGACAGTGTGGGGTGCCTGGAAACTGCTGAGGAGGTGAAGAGGAAGCTCCAGAAGGACCTGGAGGGCCTGAGCCAGCGGCA
CGAGGAGAAGGTGGCCGCCTACGACAAGCTGGAGAAGACCAAGACGCGGCTGCAGCAGGAGCTGGACGACCTGCTGGTGGACCTGGACCA
CCAGCGCCAGAGCGCGTGCAACCTGGAGAAGAAGCAGAAGAAGTTTGACCAGCTCCTGGCGGAGGAGAAGACCATCTCTGCCAAGTATGC
AGAGGAGCGCGACCGGGCTGAGGCGGAGGCCCGAGAGAAGGAGACCAAGGCTCTGTCGCTGGCCCGGGCCCTGGAGGAAGCCATGGAGCA
GAAGGCGGAGCTGGAGCGGCTCAACAAGCAGTTCCGCACGGAGATGGAGGACCTTATGAGCTCCAAGGATGATGTGGGCAAGAGTGTCCA
CGAGCTGGAGAAGTCCAAGCGGGCCCTAGAGCAGCAGGTGGAGGAGATGAAGACGCAGCTGGAAGAGCTGGAGGACGAGCTGCAGGCCAC
CGAAGATGCCAAGCTGCGGTTGGAGGTCAACCTGCAGGCCATGAAGGCCCAGTTCGAGCGGGACCTGCAGGGCCGGGACGAGCAGAGCGA
GGAGAAGAAGAAGCAGCTGGTCAGACAGGTGCGGGAGATGGAGGCAGAGCTGGAGGACGAGAGGAAGCAGCGCTCGATGGCAGTGGCCGC
CCGGAAGAAGCTGGAGATGGACCTGAAGGACCTGGAGGCGCACATCGACTCGGCCAACAAGAACCGGGACGAAGCCATCAAACAGCTGCG
GAAGCTGCAGGCCCAGATGAAGGACTGCATGCGCGAGCTGGATGACACCCGCGCCTCTCGTGAGGAGATCCTGGCCCAGGCCAAAGAGAA
CGAGAAGAAGCTGAAGAGCATGGAGGCCGAGATGATCCAGTTGCAGGAGGAACTGGCAGCCGCGGAGCGTGCCAAGCGCCAGGCCCAGCA
GGAGCGGGATGAGCTGGCTGACGAGATCGCCAACAGCAGCGGCAAAGGAGCCCTGGCGTTAGAGGAGAAGCGGCGTCTGGAGGCCCGCAT
CGCCCAGCTGGAGGAGGAGCTGGAGGAGGAGCAGGGCAACACGGAGCTGATCAACGACCGGCTGAAGAAGGCCAACCTGCAGATCGACCA
GATCAACACCGACCTGAACCTGGAGCGCAGCCACGCCCAGAAGAACGAGAATGCTCGGCAGCAGCTGGAACGCCAGAACAAGGAGCTTAA
GGTCAAGCTGCAGGAGATGGAGGGCACTGTCAAGTCCAAGTACAAGGCCTCCATCACCGCCCTCGAGGCCAAGATTGCACAGCTGGAGGA
GCAGCTGGACAACGAGACCAAGGAGCGCCAGGCAGCCTGCAAACAGGTGCGTCGGACCGAGAAGAAGCTGAAGGATGTGCTGCTGCAGGT
GGATGACGAGCGGAGGAACGCCGAGCAGTACAAGGACCAGGCCGACAAGGCATCTACCCGCCTGAAGCAGCTCAAGCGGCAGCTGGAGGA
GGCCGAAGAGGAGGCCCAGCGGGCCAACGCCTCCCGCCGGAAACTGCAGCGCGAGCTGGAGGACGCCACTGAGACGGCCGATGCCATGAA
CCGCGAAGTCAGCTCCCTAAAGAACAAGCTCAGGCGCGGGGACCTGCCGTTTGTCGTGCCCCGCCGAATGGCCCGGAAAGGCGCCGGGGA
TGGCTCCGACGAAGAGGTAGATGGCAAAGCGGATGGGGCTGAGGCCAAACCTGCCGAATAAGCCTCTTCTCCTGCAGCCTGAGATGGATG
GACAGACAGACACCACAGCCTCCCCTTCCCAGACCCCGCAGCACGCCTCTCCCCACCTTCTTGGGACTGCTGTGAACATGCCTCCTCCTG
CCCTCCGCCCCGTCCCCCCATCCCGTTTCCCTCCAGGTGTTGTTGAGGGCATTTGGCTTCCTCTGCTGCATCCCCTTCCAGCTCCCTCCC
CTGCTCAGAATCTGATACCAAAGAGACAGGGCCCGGGCCCAGGCAGAGAGCGACCAGCAGGCTCCTCAGCCCTCTCTTGCCAAAAAGCAC
AAGATGTTGAGGCGAGCAGGGCAGGCCCCCGGGGAGGGGCCAGAGTTTTCTATGAATCTATTTTTCTTCAGACTGAGGCCTTTTGGTAGT
CGGAGCCCCCGCAGTCGTCAGCCTCCCTGACGTCTGCCACCAGCGCCCCCACTCCTCCTCCTTTCTTTGCTGTTTGCAATCACACGTGGT
GACCTCACACACCTCTGCCCCTTGGGCCTCCCACTCCCATGGCTCTGGGCGGTCCAGAAGGAGCAGGCCCTGGGCCTCCACCTCTGTGCA
GGGCACAGAAGGCTGGGGTGGGGGGAGGAGTGGATTCCTCCCCACCCTGTCCCAGGCAGCGCCACTGTCCGCTGTCTCCCTCCTGATTCT
AAAATGTCTCAAGTGCAATGCCCCCTCCCCTCCTTTACCGAGGACAGCCTGCCTCTGCCACAGCAAGGCTGTCGGGGTCAAGCTGGAAAG
GCCAGCAGCCTTCCAGTGGCTTCTCCCAACACTCTTGGGGACCAAATATATTTAATGGTTAAGGGACTTGTCCCAAGTCTGACAGCCAGA
GCGTTAGAGGGGCCAGCGGCCCTCCCAGGCGATCTTGTGTCTACTCTAGGACTGGGCCCGAGGGTGGTTTACCTGCACCGTTGACTCAGT
ATAGTTTAAAAATCTGCCACCTGCACAGGTATTTTTGAAAGCAAAATAAGGTTTTCTTTTTTCCCCTTTCTTGTAATAAATGATAAAATT
CCGAGTCTTTCTCACTGCCTTTGTTTAGAAGAGAGTAGCTCGTCCTCACTGGTCTACACTGGTTGCCGAATTTACTTGTATTCCTAACTG
TTTTGTATATGCTGCATTGAGACTTACGGCAAGAAGGCATTTTTTTTTTTTAAAGGAAACAAACTCTCAAATCATGAAGTGATATAAAAG
CTGCATATGCCTACAAAGCTCTGAATTCAGGTCCCAGTTGCTGTCACAAAGGAGTGAGTGAAACTCCCACCCTACCCCCTTTTTTATATA
ATAAAAGTGCCTTAGCATGTGTTGCAGCTGTCACCACTACAGTAAGCTGGTTTACAGATGTTTTCCACTGAGCATCACAATAAAGAGAAC

>18436_18436_3_COL6A2-MYH9_COL6A2_chr21_47542072_ENST00000357838_MYH9_chr22_36695088_ENST00000216181_length(amino acids)=1492AA_BP=340
MLQGTCSVLLLWGILGAIQAQQQEVISPDTTERNNNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQV
ALSWRYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDRSKGTVHFAVVITDGHVTGSPCGGIKLQAERA
REEGIRLFAVAPNQNLKEQGLRDIASTPHELYRNDYATMLPDSTEIDQDTINRIIKVMKHEAYGECYKVSCLEIPGPSGPKGYRGQKGAK
GNMGEPGEPGQKGRQGDPGIEGPIGFPGPKGVPGFKGEKGEFGADGRKGAPGLAGKNGTDGQKGKLGRIGPPGCKGDPGNRGPDGYPGEA
GSPGERGDQGGKGDPGRPGRRGPPGEIGAKGSKGYQGNSGAPGSPGVKGAKGGPGPRGPKGEPGRRGDPGTKGSPGSDGPKGEKGDPGPE
GPRGLAGEVGNKGAKGDRGLPGPRGPQGALGEPGKQGSRGDPGDAGPRGDSGQPGPKGDPGRPGFSYPGPRGAPEKKLLEDRIAEFTTNL
TEEEEKSKSLAKLKNKHEAMITDLEERLRREEKQRQELEKTRRKLEGDSTDLSDQIAELQAQIAELKMQLAKKEEELQAALARVEEEAAQ
KNMALKKIRELESQISELQEDLESERASRNKAEKQKRDLGEELEALKTELEDTLDSTAAQQELRSKREQEVNILKKTLEEEAKTHEAQIQ
EMRQKHSQAVEELAEQLEQTKRVKANLEKAKQTLENERGELANEVKVLLQGKGDSEHKRKKVEAQLQELQVKFNEGERVRTELADKVTKL
QVELDNVTGLLSQSDSKSSKLTKDFSALESQLQDTQELLQEENRQKLSLSTKLKQVEDEKNSFREQLEEEEEAKHNLEKQIATLHAQVAD
MKKKMEDSVGCLETAEEVKRKLQKDLEGLSQRHEEKVAAYDKLEKTKTRLQQELDDLLVDLDHQRQSACNLEKKQKKFDQLLAEEKTISA
KYAEERDRAEAEAREKETKALSLARALEEAMEQKAELERLNKQFRTEMEDLMSSKDDVGKSVHELEKSKRALEQQVEEMKTQLEELEDEL
QATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELEDERKQRSMAVAARKKLEMDLKDLEAHIDSANKNRDEAIK
QLRKLQAQMKDCMRELDDTRASREEILAQAKENEKKLKSMEAEMIQLQEELAAAERAKRQAQQERDELADEIANSSGKGALALEEKRRLE
ARIAQLEEELEEEQGNTELINDRLKKANLQIDQINTDLNLERSHAQKNENARQQLERQNKELKVKLQEMEGTVKSKYKASITALEAKIAQ
LEEQLDNETKERQAACKQVRRTEKKLKDVLLQVDDERRNAEQYKDQADKASTRLKQLKRQLEEAEEEAQRANASRRKLQRELEDATETAD

--------------------------------------------------------------
>18436_18436_4_COL6A2-MYH9_COL6A2_chr21_47542072_ENST00000397763_MYH9_chr22_36695088_ENST00000216181_length(transcript)=5893nt_BP=1599nt
GACTTCAGGGCCACAGGTGCTGCCAAGATGCTCCAGGGCACCTGCTCCGTGCTCCTGCTCTGGGGAATCCTGGGGGCCATCCAGGCCCAG
CAGCAGGAGGTCATCTCGCCGGACACTACCGAGAGAAACAACAACTGCCCAGAGAAGACCGACTGCCCCATCCACGTGTACTTCGTGCTG
GACACCTCGGAGAGCGTCACCATGCAGTCCCCCACGGACATCCTGCTCTTCCACATGAAGCAGTTCGTGCCGCAGTTCATCAGCCAGCTG
CAGAACGAGTTCTACCTGGACCAGGTGGCGCTGAGCTGGCGCTACGGCGGCCTGCACTTCTCTGACCAGGTGGAGGTGTTCAGCCCACCG
GGCAGCGACCGGGCCTCCTTCATCAAGAACCTGCAGGGCATCAGCTCCTTCCGCCGCGGCACCTTCACCGACTGCGCGCTGGCCAACATG
ACGGAGCAGATCCGGCAGGACCGCAGCAAGGGCACCGTCCACTTCGCCGTGGTCATCACCGACGGCCACGTCACCGGCAGCCCCTGCGGG
GGCATCAAGCTGCAGGCCGAGCGGGCCCGCGAGGAGGGCATCCGGCTCTTCGCCGTGGCCCCCAACCAGAACCTGAAGGAGCAGGGCCTG
CGGGACATCGCCAGCACGCCGCACGAGCTCTACCGCAACGACTACGCCACCATGCTGCCCGACTCCACCGAGATCGACCAGGACACCATC
AACCGCATCATCAAGGTCATGAAACACGAAGCCTACGGAGAGTGCTACAAGGTGAGCTGCCTGGAAATCCCTGGGCCCTCTGGCCCCAAG
GGCTACCGTGGACAGAAGGGTGCCAAGGGCAACATGGGTGAGCCGGGAGAGCCTGGCCAGAAGGGAAGACAGGGAGACCCGGGCATCGAA
GGCCCCATTGGATTCCCAGGACCCAAGGGCGTTCCTGGCTTCAAAGGAGAGAAGGGTGAATTTGGAGCCGACGGTCGCAAGGGGGCCCCT
GGCCTGGCTGGCAAGAACGGGACCGATGGACAGAAGGGCAAGCTGGGGCGCATCGGACCTCCTGGCTGCAAGGGAGACCCTGGAAACCGG
GGCCCCGACGGTTACCCGGGGGAAGCAGGGAGTCCAGGGGAGCGAGGAGACCAAGGCGGCAAGGGGGACCCTGGCCGCCCAGGACGCAGA
GGGCCCCCGGGAGAAATCGGGGCCAAGGGAAGCAAGGGGTATCAAGGCAACAGTGGAGCCCCAGGAAGTCCTGGTGTGAAAGGAGCCAAG
GGCGGGCCTGGGCCCCGCGGACCCAAAGGCGAGCCGGGGCGCAGGGGAGACCCCGGCACCAAGGGCAGCCCAGGCAGCGATGGCCCCAAG
GGGGAGAAGGGGGACCCTGGCCCTGAGGGGCCCCGCGGCCTGGCTGGAGAGGTTGGCAACAAAGGAGCCAAGGGAGACCGAGGCTTGCCT
GGACCCAGAGGCCCCCAGGGAGCTCTTGGGGAGCCCGGAAAGCAGGGATCTCGGGGAGACCCCGGTGATGCAGGACCCCGTGGAGACTCA
GGACAGCCAGGCCCCAAGGGAGACCCCGGCAGGCCTGGATTCAGCTACCCAGGACCCCGAGGAGCACCCGAAAAGAAACTGCTGGAAGAC
AGAATAGCTGAGTTCACCACCAACCTCACAGAAGAGGAGGAGAAATCTAAGAGCCTCGCCAAGCTCAAGAACAAGCATGAGGCAATGATC
ACTGACTTGGAAGAGCGCCTCCGCAGGGAGGAGAAGCAGCGACAGGAGCTGGAGAAGACCCGCCGGAAGCTGGAGGGAGACTCCACAGAC
CTCAGCGACCAGATCGCCGAGCTCCAGGCCCAGATCGCGGAGCTCAAGATGCAGCTGGCCAAGAAAGAGGAGGAGCTCCAGGCCGCCCTG
GCCAGAGTGGAAGAGGAAGCTGCCCAGAAGAACATGGCCCTCAAGAAGATCCGGGAGCTGGAATCTCAGATCTCTGAACTCCAGGAAGAC
CTGGAGTCTGAGCGTGCTTCCAGGAATAAAGCTGAGAAGCAGAAACGGGACCTTGGGGAAGAGCTAGAGGCTCTGAAAACAGAGTTGGAG
GACACGCTGGATTCCACAGCTGCCCAGCAGGAGCTCAGGTCAAAACGTGAGCAGGAGGTGAACATCCTGAAGAAGACCCTGGAGGAGGAG
GCCAAGACCCACGAGGCCCAGATCCAGGAGATGAGGCAGAAGCACTCACAGGCCGTGGAGGAGCTGGCGGAGCAGCTGGAGCAGACGAAG
CGGGTGAAAGCAAACCTCGAGAAGGCAAAGCAGACTCTGGAGAACGAGCGGGGGGAGCTGGCCAACGAGGTGAAGGTGCTGCTGCAGGGC
AAAGGGGACTCGGAGCACAAGCGCAAGAAAGTGGAGGCGCAGCTGCAGGAGCTGCAGGTCAAGTTCAACGAGGGAGAGCGCGTGCGCACA
GAGCTGGCCGACAAGGTCACCAAGCTGCAGGTGGAGCTGGACAACGTGACCGGGCTTCTCAGCCAGTCCGACAGCAAGTCCAGCAAGCTC
ACCAAGGACTTCTCCGCGCTGGAGTCCCAGCTGCAGGACACTCAGGAGCTGCTGCAGGAGGAGAACCGGCAGAAGCTGAGCCTGAGCACC
AAGCTCAAGCAGGTGGAGGACGAGAAGAATTCCTTCCGGGAGCAGCTGGAGGAGGAGGAGGAGGCCAAGCACAACCTGGAGAAGCAGATC
GCCACCCTCCATGCCCAGGTGGCCGACATGAAAAAGAAGATGGAGGACAGTGTGGGGTGCCTGGAAACTGCTGAGGAGGTGAAGAGGAAG
CTCCAGAAGGACCTGGAGGGCCTGAGCCAGCGGCACGAGGAGAAGGTGGCCGCCTACGACAAGCTGGAGAAGACCAAGACGCGGCTGCAG
CAGGAGCTGGACGACCTGCTGGTGGACCTGGACCACCAGCGCCAGAGCGCGTGCAACCTGGAGAAGAAGCAGAAGAAGTTTGACCAGCTC
CTGGCGGAGGAGAAGACCATCTCTGCCAAGTATGCAGAGGAGCGCGACCGGGCTGAGGCGGAGGCCCGAGAGAAGGAGACCAAGGCTCTG
TCGCTGGCCCGGGCCCTGGAGGAAGCCATGGAGCAGAAGGCGGAGCTGGAGCGGCTCAACAAGCAGTTCCGCACGGAGATGGAGGACCTT
ATGAGCTCCAAGGATGATGTGGGCAAGAGTGTCCACGAGCTGGAGAAGTCCAAGCGGGCCCTAGAGCAGCAGGTGGAGGAGATGAAGACG
CAGCTGGAAGAGCTGGAGGACGAGCTGCAGGCCACCGAAGATGCCAAGCTGCGGTTGGAGGTCAACCTGCAGGCCATGAAGGCCCAGTTC
GAGCGGGACCTGCAGGGCCGGGACGAGCAGAGCGAGGAGAAGAAGAAGCAGCTGGTCAGACAGGTGCGGGAGATGGAGGCAGAGCTGGAG
GACGAGAGGAAGCAGCGCTCGATGGCAGTGGCCGCCCGGAAGAAGCTGGAGATGGACCTGAAGGACCTGGAGGCGCACATCGACTCGGCC
AACAAGAACCGGGACGAAGCCATCAAACAGCTGCGGAAGCTGCAGGCCCAGATGAAGGACTGCATGCGCGAGCTGGATGACACCCGCGCC
TCTCGTGAGGAGATCCTGGCCCAGGCCAAAGAGAACGAGAAGAAGCTGAAGAGCATGGAGGCCGAGATGATCCAGTTGCAGGAGGAACTG
GCAGCCGCGGAGCGTGCCAAGCGCCAGGCCCAGCAGGAGCGGGATGAGCTGGCTGACGAGATCGCCAACAGCAGCGGCAAAGGAGCCCTG
GCGTTAGAGGAGAAGCGGCGTCTGGAGGCCCGCATCGCCCAGCTGGAGGAGGAGCTGGAGGAGGAGCAGGGCAACACGGAGCTGATCAAC
GACCGGCTGAAGAAGGCCAACCTGCAGATCGACCAGATCAACACCGACCTGAACCTGGAGCGCAGCCACGCCCAGAAGAACGAGAATGCT
CGGCAGCAGCTGGAACGCCAGAACAAGGAGCTTAAGGTCAAGCTGCAGGAGATGGAGGGCACTGTCAAGTCCAAGTACAAGGCCTCCATC
ACCGCCCTCGAGGCCAAGATTGCACAGCTGGAGGAGCAGCTGGACAACGAGACCAAGGAGCGCCAGGCAGCCTGCAAACAGGTGCGTCGG
ACCGAGAAGAAGCTGAAGGATGTGCTGCTGCAGGTGGATGACGAGCGGAGGAACGCCGAGCAGTACAAGGACCAGGCCGACAAGGCATCT
ACCCGCCTGAAGCAGCTCAAGCGGCAGCTGGAGGAGGCCGAAGAGGAGGCCCAGCGGGCCAACGCCTCCCGCCGGAAACTGCAGCGCGAG
CTGGAGGACGCCACTGAGACGGCCGATGCCATGAACCGCGAAGTCAGCTCCCTAAAGAACAAGCTCAGGCGCGGGGACCTGCCGTTTGTC
GTGCCCCGCCGAATGGCCCGGAAAGGCGCCGGGGATGGCTCCGACGAAGAGGTAGATGGCAAAGCGGATGGGGCTGAGGCCAAACCTGCC
GAATAAGCCTCTTCTCCTGCAGCCTGAGATGGATGGACAGACAGACACCACAGCCTCCCCTTCCCAGACCCCGCAGCACGCCTCTCCCCA
CCTTCTTGGGACTGCTGTGAACATGCCTCCTCCTGCCCTCCGCCCCGTCCCCCCATCCCGTTTCCCTCCAGGTGTTGTTGAGGGCATTTG
GCTTCCTCTGCTGCATCCCCTTCCAGCTCCCTCCCCTGCTCAGAATCTGATACCAAAGAGACAGGGCCCGGGCCCAGGCAGAGAGCGACC
AGCAGGCTCCTCAGCCCTCTCTTGCCAAAAAGCACAAGATGTTGAGGCGAGCAGGGCAGGCCCCCGGGGAGGGGCCAGAGTTTTCTATGA
ATCTATTTTTCTTCAGACTGAGGCCTTTTGGTAGTCGGAGCCCCCGCAGTCGTCAGCCTCCCTGACGTCTGCCACCAGCGCCCCCACTCC
TCCTCCTTTCTTTGCTGTTTGCAATCACACGTGGTGACCTCACACACCTCTGCCCCTTGGGCCTCCCACTCCCATGGCTCTGGGCGGTCC
AGAAGGAGCAGGCCCTGGGCCTCCACCTCTGTGCAGGGCACAGAAGGCTGGGGTGGGGGGAGGAGTGGATTCCTCCCCACCCTGTCCCAG
GCAGCGCCACTGTCCGCTGTCTCCCTCCTGATTCTAAAATGTCTCAAGTGCAATGCCCCCTCCCCTCCTTTACCGAGGACAGCCTGCCTC
TGCCACAGCAAGGCTGTCGGGGTCAAGCTGGAAAGGCCAGCAGCCTTCCAGTGGCTTCTCCCAACACTCTTGGGGACCAAATATATTTAA
TGGTTAAGGGACTTGTCCCAAGTCTGACAGCCAGAGCGTTAGAGGGGCCAGCGGCCCTCCCAGGCGATCTTGTGTCTACTCTAGGACTGG
GCCCGAGGGTGGTTTACCTGCACCGTTGACTCAGTATAGTTTAAAAATCTGCCACCTGCACAGGTATTTTTGAAAGCAAAATAAGGTTTT
CTTTTTTCCCCTTTCTTGTAATAAATGATAAAATTCCGAGTCTTTCTCACTGCCTTTGTTTAGAAGAGAGTAGCTCGTCCTCACTGGTCT
ACACTGGTTGCCGAATTTACTTGTATTCCTAACTGTTTTGTATATGCTGCATTGAGACTTACGGCAAGAAGGCATTTTTTTTTTTTAAAG
GAAACAAACTCTCAAATCATGAAGTGATATAAAAGCTGCATATGCCTACAAAGCTCTGAATTCAGGTCCCAGTTGCTGTCACAAAGGAGT
GAGTGAAACTCCCACCCTACCCCCTTTTTTATATAATAAAAGTGCCTTAGCATGTGTTGCAGCTGTCACCACTACAGTAAGCTGGTTTAC

>18436_18436_4_COL6A2-MYH9_COL6A2_chr21_47542072_ENST00000397763_MYH9_chr22_36695088_ENST00000216181_length(amino acids)=1492AA_BP=340
MLQGTCSVLLLWGILGAIQAQQQEVISPDTTERNNNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQV
ALSWRYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDRSKGTVHFAVVITDGHVTGSPCGGIKLQAERA
REEGIRLFAVAPNQNLKEQGLRDIASTPHELYRNDYATMLPDSTEIDQDTINRIIKVMKHEAYGECYKVSCLEIPGPSGPKGYRGQKGAK
GNMGEPGEPGQKGRQGDPGIEGPIGFPGPKGVPGFKGEKGEFGADGRKGAPGLAGKNGTDGQKGKLGRIGPPGCKGDPGNRGPDGYPGEA
GSPGERGDQGGKGDPGRPGRRGPPGEIGAKGSKGYQGNSGAPGSPGVKGAKGGPGPRGPKGEPGRRGDPGTKGSPGSDGPKGEKGDPGPE
GPRGLAGEVGNKGAKGDRGLPGPRGPQGALGEPGKQGSRGDPGDAGPRGDSGQPGPKGDPGRPGFSYPGPRGAPEKKLLEDRIAEFTTNL
TEEEEKSKSLAKLKNKHEAMITDLEERLRREEKQRQELEKTRRKLEGDSTDLSDQIAELQAQIAELKMQLAKKEEELQAALARVEEEAAQ
KNMALKKIRELESQISELQEDLESERASRNKAEKQKRDLGEELEALKTELEDTLDSTAAQQELRSKREQEVNILKKTLEEEAKTHEAQIQ
EMRQKHSQAVEELAEQLEQTKRVKANLEKAKQTLENERGELANEVKVLLQGKGDSEHKRKKVEAQLQELQVKFNEGERVRTELADKVTKL
QVELDNVTGLLSQSDSKSSKLTKDFSALESQLQDTQELLQEENRQKLSLSTKLKQVEDEKNSFREQLEEEEEAKHNLEKQIATLHAQVAD
MKKKMEDSVGCLETAEEVKRKLQKDLEGLSQRHEEKVAAYDKLEKTKTRLQQELDDLLVDLDHQRQSACNLEKKQKKFDQLLAEEKTISA
KYAEERDRAEAEAREKETKALSLARALEEAMEQKAELERLNKQFRTEMEDLMSSKDDVGKSVHELEKSKRALEQQVEEMKTQLEELEDEL
QATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELEDERKQRSMAVAARKKLEMDLKDLEAHIDSANKNRDEAIK
QLRKLQAQMKDCMRELDDTRASREEILAQAKENEKKLKSMEAEMIQLQEELAAAERAKRQAQQERDELADEIANSSGKGALALEEKRRLE
ARIAQLEEELEEEQGNTELINDRLKKANLQIDQINTDLNLERSHAQKNENARQQLERQNKELKVKLQEMEGTVKSKYKASITALEAKIAQ
LEEQLDNETKERQAACKQVRRTEKKLKDVLLQVDDERRNAEQYKDQADKASTRLKQLKRQLEEAEEEAQRANASRRKLQRELEDATETAD

--------------------------------------------------------------
>18436_18436_5_COL6A2-MYH9_COL6A2_chr21_47542072_ENST00000409416_MYH9_chr22_36695088_ENST00000216181_length(transcript)=5929nt_BP=1635nt
CTGGGGGTGTCTGAGCGACCCCCACCCCTGTTGCAGGACTTCAGGGCCACAGGTGCTGCCAAGATGCTCCAGGGCACCTGCTCCGTGCTC
CTGCTCTGGGGAATCCTGGGGGCCATCCAGGCCCAGCAGCAGGAGGTCATCTCGCCGGACACTACCGAGAGAAACAACAACTGCCCAGAG
AAGACCGACTGCCCCATCCACGTGTACTTCGTGCTGGACACCTCGGAGAGCGTCACCATGCAGTCCCCCACGGACATCCTGCTCTTCCAC
ATGAAGCAGTTCGTGCCGCAGTTCATCAGCCAGCTGCAGAACGAGTTCTACCTGGACCAGGTGGCGCTGAGCTGGCGCTACGGCGGCCTG
CACTTCTCTGACCAGGTGGAGGTGTTCAGCCCACCGGGCAGCGACCGGGCCTCCTTCATCAAGAACCTGCAGGGCATCAGCTCCTTCCGC
CGCGGCACCTTCACCGACTGCGCGCTGGCCAACATGACGGAGCAGATCCGGCAGGACCGCAGCAAGGGCACCGTCCACTTCGCCGTGGTC
ATCACCGACGGCCACGTCACCGGCAGCCCCTGCGGGGGCATCAAGCTGCAGGCCGAGCGGGCCCGCGAGGAGGGCATCCGGCTCTTCGCC
GTGGCCCCCAACCAGAACCTGAAGGAGCAGGGCCTGCGGGACATCGCCAGCACGCCGCACGAGCTCTACCGCAACGACTACGCCACCATG
CTGCCCGACTCCACCGAGATCGACCAGGACACCATCAACCGCATCATCAAGGTCATGAAACACGAAGCCTACGGAGAGTGCTACAAGGTG
AGCTGCCTGGAAATCCCTGGGCCCTCTGGCCCCAAGGGCTACCGTGGACAGAAGGGTGCCAAGGGCAACATGGGTGAGCCGGGAGAGCCT
GGCCAGAAGGGAAGACAGGGAGACCCGGGCATCGAAGGCCCCATTGGATTCCCAGGACCCAAGGGCGTTCCTGGCTTCAAAGGAGAGAAG
GGTGAATTTGGAGCCGACGGTCGCAAGGGGGCCCCTGGCCTGGCTGGCAAGAACGGGACCGATGGACAGAAGGGCAAGCTGGGGCGCATC
GGACCTCCTGGCTGCAAGGGAGACCCTGGAAACCGGGGCCCCGACGGTTACCCGGGGGAAGCAGGGAGTCCAGGGGAGCGAGGAGACCAA
GGCGGCAAGGGGGACCCTGGCCGCCCAGGACGCAGAGGGCCCCCGGGAGAAATCGGGGCCAAGGGAAGCAAGGGGTATCAAGGCAACAGT
GGAGCCCCAGGAAGTCCTGGTGTGAAAGGAGCCAAGGGCGGGCCTGGGCCCCGCGGACCCAAAGGCGAGCCGGGGCGCAGGGGAGACCCC
GGCACCAAGGGCAGCCCAGGCAGCGATGGCCCCAAGGGGGAGAAGGGGGACCCTGGCCCTGAGGGGCCCCGCGGCCTGGCTGGAGAGGTT
GGCAACAAAGGAGCCAAGGGAGACCGAGGCTTGCCTGGACCCAGAGGCCCCCAGGGAGCTCTTGGGGAGCCCGGAAAGCAGGGATCTCGG
GGAGACCCCGGTGATGCAGGACCCCGTGGAGACTCAGGACAGCCAGGCCCCAAGGGAGACCCCGGCAGGCCTGGATTCAGCTACCCAGGA
CCCCGAGGAGCACCCGAAAAGAAACTGCTGGAAGACAGAATAGCTGAGTTCACCACCAACCTCACAGAAGAGGAGGAGAAATCTAAGAGC
CTCGCCAAGCTCAAGAACAAGCATGAGGCAATGATCACTGACTTGGAAGAGCGCCTCCGCAGGGAGGAGAAGCAGCGACAGGAGCTGGAG
AAGACCCGCCGGAAGCTGGAGGGAGACTCCACAGACCTCAGCGACCAGATCGCCGAGCTCCAGGCCCAGATCGCGGAGCTCAAGATGCAG
CTGGCCAAGAAAGAGGAGGAGCTCCAGGCCGCCCTGGCCAGAGTGGAAGAGGAAGCTGCCCAGAAGAACATGGCCCTCAAGAAGATCCGG
GAGCTGGAATCTCAGATCTCTGAACTCCAGGAAGACCTGGAGTCTGAGCGTGCTTCCAGGAATAAAGCTGAGAAGCAGAAACGGGACCTT
GGGGAAGAGCTAGAGGCTCTGAAAACAGAGTTGGAGGACACGCTGGATTCCACAGCTGCCCAGCAGGAGCTCAGGTCAAAACGTGAGCAG
GAGGTGAACATCCTGAAGAAGACCCTGGAGGAGGAGGCCAAGACCCACGAGGCCCAGATCCAGGAGATGAGGCAGAAGCACTCACAGGCC
GTGGAGGAGCTGGCGGAGCAGCTGGAGCAGACGAAGCGGGTGAAAGCAAACCTCGAGAAGGCAAAGCAGACTCTGGAGAACGAGCGGGGG
GAGCTGGCCAACGAGGTGAAGGTGCTGCTGCAGGGCAAAGGGGACTCGGAGCACAAGCGCAAGAAAGTGGAGGCGCAGCTGCAGGAGCTG
CAGGTCAAGTTCAACGAGGGAGAGCGCGTGCGCACAGAGCTGGCCGACAAGGTCACCAAGCTGCAGGTGGAGCTGGACAACGTGACCGGG
CTTCTCAGCCAGTCCGACAGCAAGTCCAGCAAGCTCACCAAGGACTTCTCCGCGCTGGAGTCCCAGCTGCAGGACACTCAGGAGCTGCTG
CAGGAGGAGAACCGGCAGAAGCTGAGCCTGAGCACCAAGCTCAAGCAGGTGGAGGACGAGAAGAATTCCTTCCGGGAGCAGCTGGAGGAG
GAGGAGGAGGCCAAGCACAACCTGGAGAAGCAGATCGCCACCCTCCATGCCCAGGTGGCCGACATGAAAAAGAAGATGGAGGACAGTGTG
GGGTGCCTGGAAACTGCTGAGGAGGTGAAGAGGAAGCTCCAGAAGGACCTGGAGGGCCTGAGCCAGCGGCACGAGGAGAAGGTGGCCGCC
TACGACAAGCTGGAGAAGACCAAGACGCGGCTGCAGCAGGAGCTGGACGACCTGCTGGTGGACCTGGACCACCAGCGCCAGAGCGCGTGC
AACCTGGAGAAGAAGCAGAAGAAGTTTGACCAGCTCCTGGCGGAGGAGAAGACCATCTCTGCCAAGTATGCAGAGGAGCGCGACCGGGCT
GAGGCGGAGGCCCGAGAGAAGGAGACCAAGGCTCTGTCGCTGGCCCGGGCCCTGGAGGAAGCCATGGAGCAGAAGGCGGAGCTGGAGCGG
CTCAACAAGCAGTTCCGCACGGAGATGGAGGACCTTATGAGCTCCAAGGATGATGTGGGCAAGAGTGTCCACGAGCTGGAGAAGTCCAAG
CGGGCCCTAGAGCAGCAGGTGGAGGAGATGAAGACGCAGCTGGAAGAGCTGGAGGACGAGCTGCAGGCCACCGAAGATGCCAAGCTGCGG
TTGGAGGTCAACCTGCAGGCCATGAAGGCCCAGTTCGAGCGGGACCTGCAGGGCCGGGACGAGCAGAGCGAGGAGAAGAAGAAGCAGCTG
GTCAGACAGGTGCGGGAGATGGAGGCAGAGCTGGAGGACGAGAGGAAGCAGCGCTCGATGGCAGTGGCCGCCCGGAAGAAGCTGGAGATG
GACCTGAAGGACCTGGAGGCGCACATCGACTCGGCCAACAAGAACCGGGACGAAGCCATCAAACAGCTGCGGAAGCTGCAGGCCCAGATG
AAGGACTGCATGCGCGAGCTGGATGACACCCGCGCCTCTCGTGAGGAGATCCTGGCCCAGGCCAAAGAGAACGAGAAGAAGCTGAAGAGC
ATGGAGGCCGAGATGATCCAGTTGCAGGAGGAACTGGCAGCCGCGGAGCGTGCCAAGCGCCAGGCCCAGCAGGAGCGGGATGAGCTGGCT
GACGAGATCGCCAACAGCAGCGGCAAAGGAGCCCTGGCGTTAGAGGAGAAGCGGCGTCTGGAGGCCCGCATCGCCCAGCTGGAGGAGGAG
CTGGAGGAGGAGCAGGGCAACACGGAGCTGATCAACGACCGGCTGAAGAAGGCCAACCTGCAGATCGACCAGATCAACACCGACCTGAAC
CTGGAGCGCAGCCACGCCCAGAAGAACGAGAATGCTCGGCAGCAGCTGGAACGCCAGAACAAGGAGCTTAAGGTCAAGCTGCAGGAGATG
GAGGGCACTGTCAAGTCCAAGTACAAGGCCTCCATCACCGCCCTCGAGGCCAAGATTGCACAGCTGGAGGAGCAGCTGGACAACGAGACC
AAGGAGCGCCAGGCAGCCTGCAAACAGGTGCGTCGGACCGAGAAGAAGCTGAAGGATGTGCTGCTGCAGGTGGATGACGAGCGGAGGAAC
GCCGAGCAGTACAAGGACCAGGCCGACAAGGCATCTACCCGCCTGAAGCAGCTCAAGCGGCAGCTGGAGGAGGCCGAAGAGGAGGCCCAG
CGGGCCAACGCCTCCCGCCGGAAACTGCAGCGCGAGCTGGAGGACGCCACTGAGACGGCCGATGCCATGAACCGCGAAGTCAGCTCCCTA
AAGAACAAGCTCAGGCGCGGGGACCTGCCGTTTGTCGTGCCCCGCCGAATGGCCCGGAAAGGCGCCGGGGATGGCTCCGACGAAGAGGTA
GATGGCAAAGCGGATGGGGCTGAGGCCAAACCTGCCGAATAAGCCTCTTCTCCTGCAGCCTGAGATGGATGGACAGACAGACACCACAGC
CTCCCCTTCCCAGACCCCGCAGCACGCCTCTCCCCACCTTCTTGGGACTGCTGTGAACATGCCTCCTCCTGCCCTCCGCCCCGTCCCCCC
ATCCCGTTTCCCTCCAGGTGTTGTTGAGGGCATTTGGCTTCCTCTGCTGCATCCCCTTCCAGCTCCCTCCCCTGCTCAGAATCTGATACC
AAAGAGACAGGGCCCGGGCCCAGGCAGAGAGCGACCAGCAGGCTCCTCAGCCCTCTCTTGCCAAAAAGCACAAGATGTTGAGGCGAGCAG
GGCAGGCCCCCGGGGAGGGGCCAGAGTTTTCTATGAATCTATTTTTCTTCAGACTGAGGCCTTTTGGTAGTCGGAGCCCCCGCAGTCGTC
AGCCTCCCTGACGTCTGCCACCAGCGCCCCCACTCCTCCTCCTTTCTTTGCTGTTTGCAATCACACGTGGTGACCTCACACACCTCTGCC
CCTTGGGCCTCCCACTCCCATGGCTCTGGGCGGTCCAGAAGGAGCAGGCCCTGGGCCTCCACCTCTGTGCAGGGCACAGAAGGCTGGGGT
GGGGGGAGGAGTGGATTCCTCCCCACCCTGTCCCAGGCAGCGCCACTGTCCGCTGTCTCCCTCCTGATTCTAAAATGTCTCAAGTGCAAT
GCCCCCTCCCCTCCTTTACCGAGGACAGCCTGCCTCTGCCACAGCAAGGCTGTCGGGGTCAAGCTGGAAAGGCCAGCAGCCTTCCAGTGG
CTTCTCCCAACACTCTTGGGGACCAAATATATTTAATGGTTAAGGGACTTGTCCCAAGTCTGACAGCCAGAGCGTTAGAGGGGCCAGCGG
CCCTCCCAGGCGATCTTGTGTCTACTCTAGGACTGGGCCCGAGGGTGGTTTACCTGCACCGTTGACTCAGTATAGTTTAAAAATCTGCCA
CCTGCACAGGTATTTTTGAAAGCAAAATAAGGTTTTCTTTTTTCCCCTTTCTTGTAATAAATGATAAAATTCCGAGTCTTTCTCACTGCC
TTTGTTTAGAAGAGAGTAGCTCGTCCTCACTGGTCTACACTGGTTGCCGAATTTACTTGTATTCCTAACTGTTTTGTATATGCTGCATTG
AGACTTACGGCAAGAAGGCATTTTTTTTTTTTAAAGGAAACAAACTCTCAAATCATGAAGTGATATAAAAGCTGCATATGCCTACAAAGC
TCTGAATTCAGGTCCCAGTTGCTGTCACAAAGGAGTGAGTGAAACTCCCACCCTACCCCCTTTTTTATATAATAAAAGTGCCTTAGCATG

>18436_18436_5_COL6A2-MYH9_COL6A2_chr21_47542072_ENST00000409416_MYH9_chr22_36695088_ENST00000216181_length(amino acids)=1513AA_BP=361
LGVSERPPPLLQDFRATGAAKMLQGTCSVLLLWGILGAIQAQQQEVISPDTTERNNNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFH
MKQFVPQFISQLQNEFYLDQVALSWRYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDRSKGTVHFAVV
ITDGHVTGSPCGGIKLQAERAREEGIRLFAVAPNQNLKEQGLRDIASTPHELYRNDYATMLPDSTEIDQDTINRIIKVMKHEAYGECYKV
SCLEIPGPSGPKGYRGQKGAKGNMGEPGEPGQKGRQGDPGIEGPIGFPGPKGVPGFKGEKGEFGADGRKGAPGLAGKNGTDGQKGKLGRI
GPPGCKGDPGNRGPDGYPGEAGSPGERGDQGGKGDPGRPGRRGPPGEIGAKGSKGYQGNSGAPGSPGVKGAKGGPGPRGPKGEPGRRGDP
GTKGSPGSDGPKGEKGDPGPEGPRGLAGEVGNKGAKGDRGLPGPRGPQGALGEPGKQGSRGDPGDAGPRGDSGQPGPKGDPGRPGFSYPG
PRGAPEKKLLEDRIAEFTTNLTEEEEKSKSLAKLKNKHEAMITDLEERLRREEKQRQELEKTRRKLEGDSTDLSDQIAELQAQIAELKMQ
LAKKEEELQAALARVEEEAAQKNMALKKIRELESQISELQEDLESERASRNKAEKQKRDLGEELEALKTELEDTLDSTAAQQELRSKREQ
EVNILKKTLEEEAKTHEAQIQEMRQKHSQAVEELAEQLEQTKRVKANLEKAKQTLENERGELANEVKVLLQGKGDSEHKRKKVEAQLQEL
QVKFNEGERVRTELADKVTKLQVELDNVTGLLSQSDSKSSKLTKDFSALESQLQDTQELLQEENRQKLSLSTKLKQVEDEKNSFREQLEE
EEEAKHNLEKQIATLHAQVADMKKKMEDSVGCLETAEEVKRKLQKDLEGLSQRHEEKVAAYDKLEKTKTRLQQELDDLLVDLDHQRQSAC
NLEKKQKKFDQLLAEEKTISAKYAEERDRAEAEAREKETKALSLARALEEAMEQKAELERLNKQFRTEMEDLMSSKDDVGKSVHELEKSK
RALEQQVEEMKTQLEELEDELQATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELEDERKQRSMAVAARKKLEM
DLKDLEAHIDSANKNRDEAIKQLRKLQAQMKDCMRELDDTRASREEILAQAKENEKKLKSMEAEMIQLQEELAAAERAKRQAQQERDELA
DEIANSSGKGALALEEKRRLEARIAQLEEELEEEQGNTELINDRLKKANLQIDQINTDLNLERSHAQKNENARQQLERQNKELKVKLQEM
EGTVKSKYKASITALEAKIAQLEEQLDNETKERQAACKQVRRTEKKLKDVLLQVDDERRNAEQYKDQADKASTRLKQLKRQLEEAEEEAQ

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for COL6A2-MYH9


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for COL6A2-MYH9


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for COL6A2-MYH9


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource