Fusion Gene Studies
in Kim Lab

Center for Computational Systems Medicine
Fusion Gene Summary
Fusion Gene ORF analysis
Fusion Genomic Features
Fusion Protein Features
Fusion Gene Sequence
Fusion Gene PPI analysis
Related Drugs
Related Diseases

Fusion gene: COL2A1-ZNF609 (FusionGDB2 ID: HG1280TG23060)

Fusion Gene Summary for COL2A1-ZNF609

Fusion gene summary
Fusion gene name: COL2A1-ZNF609
Fusion gene ID: hg1280tg23060

Field | Hgene | Tgene
Gene symbol | COL2A1 | ZNF609
Gene ID | 1280 | 23060
Gene name | collagen type II alpha 1 chain | zinc finger protein 609
Synonyms | ANFH|AOM|COL11A3|SEDC|STL1 | -
Cytomap | 12q13.11 | 15q22.31
Type of gene | protein-coding | protein-coding
Description | collagen alpha-1(II) chain; alpha-1 type II collagen; arthroophthalmopathy, progressive (Stickler syndrome); cartilage collagen; chondrocalcin; collagen II, alpha-1 polypeptide; collagen, type II, alpha 1 | zinc finger protein 609
Modification date | 20200328 | 20200313
UniProtAcc | P02458 | .
* DoF score | 5 X 6 X 2 = 60 | 12 X 15 X 3 = 540
# samples | 6 | 17
** MAII score | log2(6/60*10) = 0 | log2(17/540*10) = -1.66742466091313

Ensembl transcripts involved in fusion gene: ENST00000337299, ENST00000380518, ENST00000493991
Possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs): DoF>8 and MAII<0
Context

PubMed: COL2A1 [Title/Abstract] AND ZNF609 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint: COL2A1(48398020)-ZNF609(64915026), # samples: 2
Anticipated loss of major functional domain due to fusion event:
COL2A1-ZNF609 appears to have lost the major protein functional domain of the Hgene partner, which is a CGC gene, because the partially deleted in-frame ORF does not retain that domain.
COL2A1-ZNF609 appears to have lost the major protein functional domain of the Hgene partner, which is an essential gene, because the partially deleted in-frame ORF does not retain that domain.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
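
The two score definitions above can be checked numerically. The following is a minimal Python sketch, not FusionGDB2's own code: the function names `dof_score`, `maii_score`, and `meets_peginpcfg_rule` are hypothetical, and the inputs are the counts listed in the summary for COL2A1 (5 partners, 6 break points, 2 cancer types, 6 samples) and ZNF609 (12, 15, 3, 17).

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency: # partners X # break points X # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index: log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def meets_peginpcfg_rule(dof: int, maii: float) -> bool:
    """Threshold quoted on this page for peGinPCFGs: DoF>8 and MAII<0."""
    return dof > 8 and maii < 0


# Counts copied from the summary table (Hgene first, then Tgene).
for gene, counts, n_samples in [("COL2A1", (5, 6, 2), 6), ("ZNF609", (12, 15, 3), 17)]:
    dof = dof_score(*counts)
    print(gene, dof, maii_score(n_samples, dof))
```

The printed values can be compared against the DoF and MAII rows of the summary.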

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
Partner | Gene | GO ID | GO term | PubMed ID
Tgene | ZNF609 | GO:2000291 | regulation of myoblast proliferation | 28344082


Fusion gene breakpoints across COL2A1 (5'-gene)
[Figure: all structure]
Fusion gene breakpoints across ZNF609 (3'-gene)
[Figure: all structure]

Fusion gene information
* All genome coordinates were lifted over to hg19.
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
ChimerDB4 | PRAD | TCGA-EJ-5503-01A | COL2A1 | chr12 | 48398020 | - | ZNF609 | chr15 | 64915026 | +



Fusion Gene ORF analysis for COL2A1-ZNF609

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
5CDS-intron | ENST00000337299 | ENST00000416172 | COL2A1 | chr12 | 48398020 | - | ZNF609 | chr15 | 64915026 | +
5CDS-intron | ENST00000337299 | ENST00000559364 | COL2A1 | chr12 | 48398020 | - | ZNF609 | chr15 | 64915026 | +
5CDS-intron | ENST00000380518 | ENST00000416172 | COL2A1 | chr12 | 48398020 | - | ZNF609 | chr15 | 64915026 | +
5CDS-intron | ENST00000380518 | ENST00000559364 | COL2A1 | chr12 | 48398020 | - | ZNF609 | chr15 | 64915026 | +
In-frame | ENST00000337299 | ENST00000326648 | COL2A1 | chr12 | 48398020 | - | ZNF609 | chr15 | 64915026 | +
In-frame | ENST00000380518 | ENST00000326648 | COL2A1 | chr12 | 48398020 | - | ZNF609 | chr15 | 64915026 | +
intron-3CDS | ENST00000493991 | ENST00000326648 | COL2A1 | chr12 | 48398020 | - | ZNF609 | chr15 | 64915026 | +
intron-intron | ENST00000493991 | ENST00000416172 | COL2A1 | chr12 | 48398020 | - | ZNF609 | chr15 | 64915026 | +
intron-intron | ENST00000493991 | ENST00000559364 | COL2A1 | chr12 | 48398020 | - | ZNF609 | chr15 | 64915026 | +

ORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids)
ENST00000380518 | ENST00000326648 | COL2A1 | chr12 | 48398020 | - | ZNF609 | chr15 | 64915026 | + | 8118 | 250 | 220 | 3738 | 1172
ENST00000337299 | ENST00000326648 | COL2A1 | chr12 | 48398020 | - | ZNF609 | chr15 | 64915026 | + | 7953 | 85 | 55 | 3573 | 1172
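
The predicted start, predicted stop, and amino-acid length in the ORFfinder table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and stop are 1-based inclusive transcript coordinates and that the stop coordinate is the last base of the stop codon. That convention is an assumption, not something the page states. The helper name `orf_length_aa` is hypothetical.

```python
def orf_length_aa(start: int, stop: int) -> int:
    """Protein length implied by an ORF spanning start..stop (1-based, inclusive).

    Assumes the stop codon is included in the span, so one codon is subtracted.
    """
    n_nt = stop - start + 1
    if n_nt % 3 != 0:
        raise ValueError(f"ORF span of {n_nt} nt is not a multiple of 3")
    return n_nt // 3 - 1


# (predicted start, predicted stop) per Hgene transcript, copied from the table.
rows = {"ENST00000380518": (220, 3738), "ENST00000337299": (55, 3573)}
for henst, (start, stop) in rows.items():
    print(henst, orf_length_aa(start, stop))
```

Under that assumed convention, both rows can be compared with the "Seq length (amino acids)" column.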

DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on a convolutional neural network, built by comparison with real Ribo-seq data. If the no-coding score is < 0.5 and the coding score is > 0.5, the in-frame fusion transcript is predicted as likely to be translated.
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score
ENST00000380518 | ENST00000326648 | COL2A1 | chr12 | 48398020 | - | ZNF609 | chr15 | 64915026 | + | 0.001408927 | 0.998591
ENST00000337299 | ENST00000326648 | COL2A1 | chr12 | 48398020 | - | ZNF609 | chr15 | 64915026 | + | 0.001164637 | 0.9988354
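
The decision rule quoted above (no-coding score < 0.5 and coding score > 0.5) is easy to apply to the two rows. This is a minimal sketch of that rule only; it does not run DeepORF, and the function name `likely_translated` is hypothetical.

```python
def likely_translated(no_coding: float, coding: float, threshold: float = 0.5) -> bool:
    """Apply the rule stated on this page: no-coding < 0.5 and coding > 0.5."""
    return no_coding < threshold and coding > threshold


# (no-coding score, coding score) per Hgene transcript, copied from the table.
scores = {
    "ENST00000380518": (0.001408927, 0.998591),
    "ENST00000337299": (0.001164637, 0.9988354),
}
for henst, (no_coding, coding) in scores.items():
    print(henst, likely_translated(no_coding, coding))
```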


Fusion Genomic Features for COL2A1-ZNF609


FusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5 kb for each partner gene, 20 kb of sequence in total). FusionAI is a fusion gene breakpoint classifier based on a convolutional neural network, built by comparing the fusion-positive and fusion-negative sequence contexts of ~20K fusion genes. From this, we obtain the relative potential of each position in the 20 kb genomic sequence to be used as a gene fusion breakpoint.
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint)
COL2A1 | chr12 | 48398019 | - | ZNF609 | chr15 | 64915025 | + | 0.002376868 | 0.99762315
COL2A1 | chr12 | 48398019 | - | ZNF609 | chr15 | 64915025 | + | 0.002376868 | 0.99762315
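
To make the "+/- 5 kb per partner, 20 kb in total" description concrete, the sketch below computes the two flanking windows for the breakpoints in the table. It is an illustration under stated assumptions, not FusionAI's preprocessing: the half-open interval convention, the clamping at 0, and the names `Window` and `breakpoint_window` are all hypothetical.

```python
from typing import NamedTuple


class Window(NamedTuple):
    chrom: str
    start: int  # assumed half-open interval: [start, end)
    end: int

    @property
    def length(self) -> int:
        return self.end - self.start


def breakpoint_window(chrom: str, bp: int, flank: int = 5000) -> Window:
    """Return the region extending `flank` bases on each side of a breakpoint."""
    return Window(chrom, max(0, bp - flank), bp + flank)


# Breakpoint positions as listed in the FusionAI table.
h_win = breakpoint_window("chr12", 48398019)
t_win = breakpoint_window("chr15", 64915025)
print(h_win, t_win, h_win.length + t_win.length)
```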

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions. A total of 44 types of human genomic feature loci are integrated, in five broad categories: virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are on the help page.
[Figure: genomic feature]

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions that overlap the regions with the top 1% feature importance scores. More details are on the help page.
[Figure: genomic feature of top 1%]


Fusion Protein Features for COL2A1-ZNF609


Four levels of functional features of fusion genes
Go to the FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr12:48398020/chr15:64915026)
- FGviewer provides online visualization of the retention of protein functional features across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the Tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to find the hyperlink to the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
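
The FGviewer link given for the most frequent breakpoint appears to follow a `chr:position/chr:position` pattern. The sketch below rebuilds that one link from the breakpoint coordinates; the general URL pattern is inferred from this single example and is an assumption, and `fgviewer_url` is a hypothetical helper, not part of FGviewer.

```python
def fgviewer_url(
    hchr: str,
    hbp: int,
    tchr: str,
    tbp: int,
    base: str = "https://ccsmweb.uth.edu/FGviewer",
) -> str:
    """Build an FGviewer-style link from the 5' and 3' breakpoints.

    The path layout is inferred from the single link shown on this page.
    """
    return f"{base}/{hchr}:{hbp}/{tchr}:{tbp}"


print(fgviewer_url("chr12", 48398020, "chr15", 64915026))
```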

Main function of each fusion partner protein (from UniProt)

Hgene: COL2A1 (UniProtAcc P02458)
FUNCTION: Type II collagen is specific for cartilaginous tissues. It is essential for the normal embryonic development of the skeleton, for linear growth and for the ability of cartilage to resist compressive forces.

Tgene: (UniProtAcc .)
FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

Retention analysis result of each fusion partner protein across 39 protein features of UniProt: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 region features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the breakpoint is located before the CDS.
- In-frame and retained protein features among the 13 region features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Tgene | ZNF609 | chr12:48398020 | chr15:64915026 | ENST00000326648 |  | 0 | 9 | 1333_1336 | 249 | 2578.3333333333335 | Compositional bias | Note=Poly-Gly
Tgene | ZNF609 | chr12:48398020 | chr15:64915026 | ENST00000326648 |  | 0 | 9 | 1366_1371 | 249 | 2578.3333333333335 | Compositional bias | Note=Poly-His
Tgene | ZNF609 | chr12:48398020 | chr15:64915026 | ENST00000326648 |  | 0 | 9 | 725_735 | 249 | 2578.3333333333335 | Compositional bias | Note=Poly-Lys
Tgene | ZNF609 | chr12:48398020 | chr15:64915026 | ENST00000326648 |  | 0 | 9 | 952_955 | 249 | 2578.3333333333335 | Compositional bias | Note=Poly-Ser
Tgene | ZNF609 | chr12:48398020 | chr15:64915026 | ENST00000326648 |  | 0 | 9 | 495_520 | 249 | 2578.3333333333335 | Zinc finger | C2H2-type

- In-frame and not-retained protein features among the 13 region features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | COL2A1 | chr12:48398020 | chr15:64915026 | ENST00000337299 | - | 1 | 53 | 1253_1487 | 28 | 1419.0 | Domain | Fibrillar collagen NC1
Hgene | COL2A1 | chr12:48398020 | chr15:64915026 | ENST00000337299 | - | 1 | 53 | 32_90 | 28 | 1419.0 | Domain | VWFC
Hgene | COL2A1 | chr12:48398020 | chr15:64915026 | ENST00000380518 | - | 1 | 54 | 1253_1487 | 28 | 1488.0 | Domain | Fibrillar collagen NC1
Hgene | COL2A1 | chr12:48398020 | chr15:64915026 | ENST00000380518 | - | 1 | 54 | 32_90 | 28 | 1488.0 | Domain | VWFC
Hgene | COL2A1 | chr12:48398020 | chr15:64915026 | ENST00000337299 | - | 1 | 53 | 1215_1241 | 28 | 1419.0 | Region | Note=Nonhelical region (C-terminal)
Hgene | COL2A1 | chr12:48398020 | chr15:64915026 | ENST00000337299 | - | 1 | 53 | 201_1214 | 28 | 1419.0 | Region | Note=Triple-helical region
Hgene | COL2A1 | chr12:48398020 | chr15:64915026 | ENST00000380518 | - | 1 | 54 | 1215_1241 | 28 | 1488.0 | Region | Note=Nonhelical region (C-terminal)
Hgene | COL2A1 | chr12:48398020 | chr15:64915026 | ENST00000380518 | - | 1 | 54 | 201_1214 | 28 | 1488.0 | Region | Note=Triple-helical region
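
The retained and not-retained listings above are consistent with a simple positional rule: the 5' partner (Hgene) keeps what lies before its breakpoint and the 3' partner (Tgene) keeps what lies after it. The sketch below encodes that reading. It is an assumption about how the annotation could be derived, not the database's documented method; boundary handling in particular is a guess, and `is_retained` is a hypothetical name.

```python
def is_retained(partner: str, feature_start: int, feature_end: int, bp_loci: int) -> bool:
    """Decide whether a protein feature survives in the fusion protein.

    Assumed rule: an Hgene feature must end at or before the breakpoint,
    and a Tgene feature must start after the breakpoint (amino-acid positions).
    """
    if partner == "Hgene":
        return feature_end <= bp_loci
    if partner == "Tgene":
        return feature_start > bp_loci
    raise ValueError(f"unknown partner: {partner!r}")


# Feature loci and BPloci values copied from the two tables.
print(is_retained("Tgene", 495, 520, 249))  # ZNF609 C2H2-type zinc finger
print(is_retained("Hgene", 32, 90, 28))     # COL2A1 VWFC domain
```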



Fusion Gene Sequence for COL2A1-ZNF609


For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To obtain the fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among all predicted ones.
>18282_18282_1_COL2A1-ZNF609_COL2A1_chr12_48398020_ENST00000337299_ZNF609_chr15_64915026_ENST00000326648_length(transcript)=7953nt_BP=85nt
ATGATTCGCCTCGGGGCTCCCCAGACGCTGGTGCTGCTGACGCTGCTCGTCGCCGCTGTCCTTCGGTGTCAGGGCCAGGATGTCCATGGA
GTCCCCTGTTTCCACACCAGCAGTGCTGCCAATACACCTTTTGGTGCCAGTGGTCAACAATGACATCTCATCTCCCTGTGAGCAGATCAT
GGTTCGTACCCGATCAGTTGGGGTCAACACATGTGATGTGGCTCTGGCCACAGAGCCTGAGTGCTTGGGCCCCTGTGAACCTGGAACTAG
CGTCAACCTTGAAGGCATCGTGTGGCAGGAAACAGAAGATGGGATGTTGGTGGTAAATGTAACGTGGAGGAACAAGACATATGTAGGTAC
ACTCCTTGACTGCACACGACATGATTGGGCACCCCCAAGGTTCTGTGACTCCCCGACCAGTGACCTGGAAATGCGCAATGGCCGGGGTAG
AGGCAAACGCATGCGTCCCAACAGTAATACACCTGTCAATGAGACAGCCACAGCCTCTGACAGCAAAGGGACCAGTAACAGCAGCAAAAC
CCGGGCAGGAGCCAATAGCAAAGGCCGTCGGGGCAGCCAGAATTCTTCAGAGCACCGCCCACCTGCCAGCAGCACTTCTGAGGATGTCAA
GGCCAGCCCTTCCTCAGCTAATAAGCGGAAAAACAAACCCCTTTCAGACATGGAGCTGAATTCTAGCTCAGAGGACTCCAAAGGGAGCAA
GCGTGTCCGTACTAATTCCATGGGCTCAGCCACTGGCCCCCTTCCTGGGACAAAGGTAGAACCCACTGTTCTGGACAGAAACTGCCCCTC
CCCCGTCCTAATTGACTGTCCCCACCCAAACTGCAACAAGAAGTACAAGCACATCAATGGACTTAAGTACCACCAAGCTCATGCCCATAC
AGATGATGACAGCAAGCCGGAAGCGGATGGGGACAGTGAGTACGGAGAGGAACCTATTCTCCATGCAGATCTTGGGAGCTGCAACGGTGC
ATCTGTCTCACAAAAAGGTTCCTTGTCCCCTGCCCGCTCAGCTACCCCCAAAGTTCGACTTGTAGAGCCCCATAGCCCTTCTCCTTCAAG
CAAATTCAGCACAAAAGGCCTCTGTAAGAAAAAGTTGAGTGGGGAAGGGGACACAGACCTTGGGGCCTTATCCAATGATGGCTCTGATGA
TGGACCCTCAGTGATGGATGAAACAAGCAATGATGCCTTTGATTCTTTAGAAAGGAAGTGTATGGAAAAAGAAAAATGTAAAAAACCCTC
TAGTTTAAAACCTGAAAAGATTCCTTCCAAGAGCCTAAAGTCAGCCCGTCCCATTGCCCCTGCCATCCCCCCACAGCAAATCTACACCTT
CCAGACAGCCACCTTCACAGCAGCGAGCCCAGGCTCTTCCTCAGGCTTGACCGCCACAGTGGCACAAGCCATGCCCAACAGTCCCCAACT
CAAGCCCATTCAGCCCAAGCCCACTGTTATGGGAGAACCTTTCACAGTCAACCCTGCCTTGACTCCAGCCAAGGACAAGAAAAAGAAAGA
CAAAAAAAAGAAGGAATCTTCAAAGGAACTTGAAAGTCCTCTGACCCCTGGGAAGGTGTGTCGAGCAGAGGAAGGCAAAAGCCCATTCAG
GGAATCTTCAGGAGATGGGATGAAAATGGAGGGGCTCCTAAATGGCTCATCAGACCCCCACCAAAGCCGACTGGCTAGCATCAAGGCTGA
AGCCGACAAGATCTACAGTTTCACGGACAATGCCCCCAGCCCTTCCATTGGAGGCAGTAGCCGCCTTGAAAACACTACCCCTACTCAGCC
CCTGACTCCCTTACATGTGGTGACCCAGAATGGAGCTGAAGCCAGCTCAGTCAAAACCAACAGCCCTGCATACTCTGACATCTCTGATGC
TGGGGAGGATGGGGAGGGCAAGGTAGACAGTGTCAAATCAAAGGACGCCGAACAGTTGGTTAAAGAAGGGGCTAAGAAAACTCTTTTTCC
CCCTCAGCCTCAGAGCAAAGACTCACCATATTACCAAGGCTTTGAGAGTTACTATTCTCCAAGTTATGCACAGTCCAGCCCTGGGGCTCT
GAACCCCAGCAGCCAGGCAGGAGTGGAGAGCCAGGCCCTGAAGACAAAAAGGGATGAGGAACCTGAGAGCATAGAAGGGAAAGTGAAGAA
CGATATCTGTGAAGAAAAGAAGCCCGAGCTGAGCAGTTCCAGTCAGCAGCCCTCGGTCATCCAGCAGCGTCCCAATATGTACATGCAGTC
CCTGTACTACAACCAGTATGCCTATGTACCCCCCTATGGCTACAGCGACCAGAGTTACCACACCCACCTTCTGAGCACTAACACGGCTTA
CCGGCAGCAGTACGAAGAACAGCAGAAACGCCAGAGCTTAGAGCAGCAGCAGCGGGGAGTGGACAAGAAGGCAGAGATGGGCCTGAAGGA
GCGGGAGGCAGCACTCAAGGAAGAGTGGAAGCAAAAGCCGTCAATTCCACCAACTCTCACCAAGGCCCCCAGCCTGACAGACCTGGTGAA
ATCAGGACCTGGCAAGGCCAAGGAGCCAGGGGCTGACCCAGCCAAATCAGTCATCATTCCCAAGTTAGATGACTCTTCAAAACTCCCGGG
CCAGGCCCCTGAAGGCCTTAAAGTGAAGCTGAGTGATGCCAGCCACCTAAGCAAGGAGGCCTCTGAGGCCAAGACAGGTGCTGAGTGTGG
TCGACAGGCAGAGATGGATCCAATACTCTGGTACCGACAGGAGGCAGAGCCCCGGATGTGGACATATGTTTATCCTGCCAAGTACTCAGA
CATCAAGTCAGAGGATGAGCGGTGGAAGGAGGAGCGGGACCGCAAATTGAAGGAGGAAAGGAGTCGGAGTAAGGACTCTGTCCCCAAGGA
AGATGGGAAGGAAAGCACAAGTAGTGACTGCAAGCTGCCCACGTCAGAGGAGTCTCGCCTTGGGAGCAAGGAGCCCCGGCCAAGTGTCCA
TGTGCCTGTGTCCTCCCCACTTACCCAGCACCAGTCCTACATCCCCTACATGCACGGCTATTCCTACAGTCAGTCCTACGACCCCAACCA
CCCCAGCTACCGGAGCATGCCTGCTGTGATGATGCAGAACTACCCAGGTTCCTACCTGCCTTCCAGCTACTCTTTTTCCCCATATGGCAG
CAAGGTCTCAGGTGGTGAAGATGCTGACAAGGCACGAGCCAGCCCCAGTGTGACTTGTAAATCCAGCTCAGAGTCCAAAGCCCTGGACAT
CTTGCAGCAGCATGCCAGTCACTACAAGAGCAAGTCTCCCACGATAAGTGATAAAACTTCTCAGGAGAGAGATCGAGGAGGCTGTGGGGT
GGTTGGGGGTGGTGGCAGCTGTAGCAGCGTCGGGGGAGCAAGTGGGGGTGAACGGAGTGTTGACCGGCCCCGCACCTCTCCTTCCCAGCG
CCTGATGTCCACACACCACCACCACCACCACTTGGGGTACTCATTGCTCCCAGCACAGTACAACTTACCCTATGCAGCAGGGCTTTCTTC
TACAGCCATTGTTGCCAGCCAACAAGGCTCAACTCCCTCACTCTACCCACCCCCCAGGAGGTGAGAATGACACCAAGTGCCCGGATAAAG
TCAGCTTCACGGGCCCGGACTGGCTTACCCAAGGAGGTGCTGAAGGTGCCGTTTAGACATCAGTTAAATGGTGTTGATCATCCTGTTTGC
CGTTTCCACCATGACTGAAGGCAGACCCTTGGCTATCTCACCTCCACCAGACCTCCGGACTACCTGACCCTACCTCTTCCTCAGGAGCTG
GAGAGCTGGTACTTAGCAAAAATATTTATTCTCTCAGCCACAGTTATGACTATTGTGGCCTCTGTGGAGATGAAGGCACGGGAAGCAACC
AGGGGAACATGGCCTCAGCCCAGAGAAGCCACTGCTCTGTTCCCCAAGCCCTTGGTCTGCTGCTGGAGCAGTACCAGCCCCCCCGCCCAC
CAGGGAGGGACCCCCACCCCCAAGCACTGGGTAAGGTCTGAAGACAGCACAGCAGCCATACCCCTCACCATCATTACCACCATCACCAGA
TTCTGCATCTCCCTAGTGCTTTGCACCCTGGGAATTGGCAGCATGTGGAGGAACTAGAATCTCAGGAAAGAAATTGGGGGTTGTTTTCTA
CATAATTGTGAAAACAAGGTCTTCAAATGTGGAGACTTCTCCCCATTTACATGAGCACATATAAACGCTCACAACCTAGCCTGGAAAGGA
AGACCAAGGCATCTGCCCCAACATGGCCTTGAGCTGCCTGTGAGGCAGGGGGCAGGGGTTCCAACACCAGCACAGGGCTCCCCAGGGACA
CTGGGAGCAAGCTGGTGCTGGAGCATGAATGACGTCTGTGAAGTAGAACCTGCGTCCCCACTAAGTCCTGCTGCTTCTTATTCCCCAACT
CCTTGCCCTTTTCCCTTCCCTCCTAACCCCTTGGTGCCTTTCCCAGGGGGATCCCCACACTGGTCTTGCCTCTTCTTTTCCACTGCTTGG
CTCTTAAGCCTCAGGCAGATAAACTAGTATTCCCCCCAGCTTGGGGAACCTTGGAGTCTGCCAGGTCACCTTAGGGCAAGGCCCAGAAGG
CAGCCCCTGGGAGCACCCAGCAGTTCTTGGAGATGTCCTGTCATCTAGCCATCTGATATCTTCCTCATTTGAGGCCACAGATATATACAG
CCCAATTCCTCTGTCTACAAGTACATGATTTTATATAGCTCAGTCTATAACCTCCATGTGGGCCAATATAAGCTGTGTTTCTTGGTAACA
CATATTTTGTTTGAGGGGCCACTGGCCATGGGAGGTTATTTGTTCCTTAGACCCTGGAATAACACATCCAAGCCATTACTTATTAGAGTC
TCAGAATGTACTCAGTGGAGCTGTGCTTTGAGGCAGCCAACATTTCTCTGCTCTCCTTAGAAATGCAGTCTCCCAATGGAAGCTTTATAC
TCTTTGTACTGGGAAAGTGAGGATGATTTGGTAGCTTTATTGGGGTCATGTCTTCCCCAAGGTGTGGGGAGCTTAGCTTACTTGGCTTTT
GAGGTATCATCCCTCTGTTCTCCCCTCCTATCTTTCCATGACCCTCTGGATTGAGAGAGAGAGATAAAGACTGACAGACACCAGTGTAGG
CTGGAAAAGGGAGTGTGTGACCAGAGTGCCAAAAGTGACTAGGAGCAGGAACTTGGCTCCGACTCAGTTTGGAAAATGGGAAATAGGGGA
CAGTAAGCACAATGCCCAGTAGTAGTTGATTTCCAAGGACCCTGGAACCCTACACTTGAGAGGCTTAGGGTCACCATCTGCTCAAGAGGA
TCCCCTCTGATCTACAGGCCTTTTCCCTAGGTTTCTGCCTCCTCGTTTTTGTTCAAGTTGGGTTCTGAGTCCTCCCCAAAAACCATTGTT
TTAGACCTCTTGGCAGGGCCCCAAAACAGCCTCCCTCATACCCATCATTCCCTCTGCCTTCTGCTGCCCTCATGGGCAGTGCTCTGAGCA
GTGACCTCCCTTTCCTCCGTGGAAGTAGCTAGTGCAGACACCGTCATCCCACCCCACCTGAGTCACCCCAACCAAGAGGGTGACTGAATT
TCAGCCTGATTATGCCCTCCTGGGGCTCCTGTGAGGTGGAGCCAAGGTTCCCTCTCTGTTCCTGTTTGTTTTTAAATATTGTTGTGTGTT
TTGTATCTGTGGCACTGGCCTGCAGCATACTCTGTATATATTGTAAAGAAACCGTTAGGAGTAATTTTCTTTTGCATTGGGCAGGCATGG
CCCTGCATTCCTGCCCTTTCCACTCATTCTGTAACACAGAGGACGAACTTCTGTATTAGCTGGGCAGCCTTGGGTTCTCCAGAAGAGAAC
AGGTTTTTCTTTTCCTTTTTAATTTTTCTTCTTAAACATTTGGCTCTTTGATCCTCATATCCAAGTCTCCCCTGAAGAGTAGGAGCTGCT
CAGAAGAGCAGGTGAAAGCCACCATGGCAGATCCTGATGCCTGCCGGGCCTAGTCTTCCCTCTGAAATAACATGAAGCAGCAGCTGTGGA
GATTCTTGACAAGTGCTGAGTGAAAGATTTGCTGCCCACCTCTACATGGGGAGGAGAAACACAGGTGGGAGCTACCTGTGGCATCCATGA
CCTAGTCAGAGGGATGAGATGCTCAGCAGGGGTCCCCATCCTATCCCACCCCACAAACAAAGGCTGGAAAAATTTGCTACCAAGGGCCAA
GACCACCAGACCAAGCCTGTTTATGAGCCACCCCTGCCCAGGCCCTCACAGACATTGCTCACGGGGCTTCCCATAGAGGAGAAGCTAAAG
AGGGAGGGGGCCTCATCCCCAGATAGATCAGGCAAGGCTTGGAGAGCTGCTCTTTAGGATCCACATCAACTACTTCCTCATTTTAAGGTA
TGGCAGTTCCCTTCATCCCCTTTTCCTGCCTTGTACATGTACATGTATGAAATTTCCTTCTCTTACCGAACTCTCTCCACACATCACAAG
GTCAAAGAACCACACGCTTAGAAGGGTAAGAGGGCACCCTATGAAATGAAATGGTGATTTCTTGAGTCTCTTTTTTCCACGTTTAAGGGG
CCATGGCAGGACTTAGAGTTGCGAGTTAAGACTGCAGAGGGCTAGAGAATTATTTCATACAGGCTTTGAGGCCACCCATGTCACTTATCC
CGTATACCCTCTCACCATCCCCTTGTCTACTCTGATGCCCCCAAGATGCAACTGGGCAGCTAGTTGGCCCCATAATTCTGGGCCTTTGTT
GTTTGTTTTAATTACTTGGGCATCCCAGGAAGCTTTCCAGTGATCTCCTACCATGGGCCCCCCTCCTGGGATCAAGCCCCTCCCAGGCCC
TGTCCCCAGCCCCTCCTGCCCCAGCCCACCCGCTTGCCTTGGTGCTCAGCCCTCCCATTGGGAGCAGGTTGGGGCGAGCTGGAGGCCCGG
GCTGGAGGGGCAGTGTTGCTGTTCATAGATTTTGTTCCATTGGCGTTGCTCTGTTGAATTTAATTTCAGTCTTCCTGATTCTTCCCTTCT
GTAAAGTGTACATTACCAAGTTCCTTGTTTTTTTATATATATATATAAATATATATATATACAAACTGTACTCTTTTTGCCTTTGTACAT
TCAGGCAAGAAGAGAAAATAAATCTTTTTAAGAGACAATCACAAATCTGTGAGGGCTGCTGGTTATTTCTCCTGGAGTTTGCTGCTGAGC
TGCCTCTTCCTTCCTCCCAATTTTCCTGTTCTCCCTCAGCTCTCCTGATCTTCCTGGCCCTGCTCCATATGCATCCTCAGCTTCACTTTC
CCTGGCTGATGGCAAGCTGTTGAATCCAGTGTCCAGACTACCTGCCTTGTAACCCTTTTCTGCCCAGCATTGTTTTCTGGCTTGGCCACT
GGCTTAGCCCAGGAGCTTTACTCTGTGCCCTGGCCTCCCCTCTCTTCACCTTTAGATTTCCATTCACCGAAGTGGCTTTGGACCCCTGGG
TACTCTGGGACCTGTTTCCTGGAGGCCCTGGCTTGGGACACTCACCTGTGAAACTATGCAGCTGGGAGCTCTCTGCCTAAGAGTTTGCAC
TATTTAAACCTGCCTGGGAGTTAGGACGGATGGTTTTAGGAATGACCGGAAAACTACCCCTAAAACTCCCCCGACATTCCAGCCTCTAGA
ATGCTCTGATCCAGAGCTCAGTGGATGATTCCCAGCTGGTGGACTCCTGTGGCTACCCCATCAGAACAAGGGCTAAGGGTTTATGGGTCA
AGAGTATTTGATCAGAATTTTAAAGGGTGGTATACTCTGAAACACAGCCCAACCAAACCATTGTTTGGCCGCTTTCTCTTTTCCTCTACC
TTCCTCATCCCCACTTTTTTCCCTTTCTCTCTACTTCCTCTTCTTAATTGGCTTTGGAATTGAAATATATTTTTAAATTATTTGTTGTAT

>18282_18282_1_COL2A1-ZNF609_COL2A1_chr12_48398020_ENST00000337299_ZNF609_chr15_64915026_ENST00000326648_length(amino acids)=1172AA_BP=10
MSFGVRARMSMESPVSTPAVLPIHLLVPVVNNDISSPCEQIMVRTRSVGVNTCDVALATEPECLGPCEPGTSVNLEGIVWQETEDGMLVV
NVTWRNKTYVGTLLDCTRHDWAPPRFCDSPTSDLEMRNGRGRGKRMRPNSNTPVNETATASDSKGTSNSSKTRAGANSKGRRGSQNSSEH
RPPASSTSEDVKASPSSANKRKNKPLSDMELNSSSEDSKGSKRVRTNSMGSATGPLPGTKVEPTVLDRNCPSPVLIDCPHPNCNKKYKHI
NGLKYHQAHAHTDDDSKPEADGDSEYGEEPILHADLGSCNGASVSQKGSLSPARSATPKVRLVEPHSPSPSSKFSTKGLCKKKLSGEGDT
DLGALSNDGSDDGPSVMDETSNDAFDSLERKCMEKEKCKKPSSLKPEKIPSKSLKSARPIAPAIPPQQIYTFQTATFTAASPGSSSGLTA
TVAQAMPNSPQLKPIQPKPTVMGEPFTVNPALTPAKDKKKKDKKKKESSKELESPLTPGKVCRAEEGKSPFRESSGDGMKMEGLLNGSSD
PHQSRLASIKAEADKIYSFTDNAPSPSIGGSSRLENTTPTQPLTPLHVVTQNGAEASSVKTNSPAYSDISDAGEDGEGKVDSVKSKDAEQ
LVKEGAKKTLFPPQPQSKDSPYYQGFESYYSPSYAQSSPGALNPSSQAGVESQALKTKRDEEPESIEGKVKNDICEEKKPELSSSSQQPS
VIQQRPNMYMQSLYYNQYAYVPPYGYSDQSYHTHLLSTNTAYRQQYEEQQKRQSLEQQQRGVDKKAEMGLKEREAALKEEWKQKPSIPPT
LTKAPSLTDLVKSGPGKAKEPGADPAKSVIIPKLDDSSKLPGQAPEGLKVKLSDASHLSKEASEAKTGAECGRQAEMDPILWYRQEAEPR
MWTYVYPAKYSDIKSEDERWKEERDRKLKEERSRSKDSVPKEDGKESTSSDCKLPTSEESRLGSKEPRPSVHVPVSSPLTQHQSYIPYMH
GYSYSQSYDPNHPSYRSMPAVMMQNYPGSYLPSSYSFSPYGSKVSGGEDADKARASPSVTCKSSSESKALDILQQHASHYKSKSPTISDK
TSQERDRGGCGVVGGGGSCSSVGGASGGERSVDRPRTSPSQRLMSTHHHHHHLGYSLLPAQYNLPYAAGLSSTAIVASQQGSTPSLYPPP

--------------------------------------------------------------
>18282_18282_2_COL2A1-ZNF609_COL2A1_chr12_48398020_ENST00000380518_ZNF609_chr15_64915026_ENST00000326648_length(transcript)=8118nt_BP=250nt
GGGAGAAGACGCAGAGCGCTGCTGGGCTGCCGGGTCTCCCGCTTCCCCCTCCTGCTCCAAGGGCCTCCTGCATGAGGGCGCGGTAGAGAC
CCGGACCCGCGCCGTGCTCCTGCCGTTTCGCTGCGCTCCGCCCGGGCCCGGCTCAGCCAGGCCCCGCGGTGAGCCATGATTCGCCTCGGG
GCTCCCCAGACGCTGGTGCTGCTGACGCTGCTCGTCGCCGCTGTCCTTCGGTGTCAGGGCCAGGATGTCCATGGAGTCCCCTGTTTCCAC
ACCAGCAGTGCTGCCAATACACCTTTTGGTGCCAGTGGTCAACAATGACATCTCATCTCCCTGTGAGCAGATCATGGTTCGTACCCGATC
AGTTGGGGTCAACACATGTGATGTGGCTCTGGCCACAGAGCCTGAGTGCTTGGGCCCCTGTGAACCTGGAACTAGCGTCAACCTTGAAGG
CATCGTGTGGCAGGAAACAGAAGATGGGATGTTGGTGGTAAATGTAACGTGGAGGAACAAGACATATGTAGGTACACTCCTTGACTGCAC
ACGACATGATTGGGCACCCCCAAGGTTCTGTGACTCCCCGACCAGTGACCTGGAAATGCGCAATGGCCGGGGTAGAGGCAAACGCATGCG
TCCCAACAGTAATACACCTGTCAATGAGACAGCCACAGCCTCTGACAGCAAAGGGACCAGTAACAGCAGCAAAACCCGGGCAGGAGCCAA
TAGCAAAGGCCGTCGGGGCAGCCAGAATTCTTCAGAGCACCGCCCACCTGCCAGCAGCACTTCTGAGGATGTCAAGGCCAGCCCTTCCTC
AGCTAATAAGCGGAAAAACAAACCCCTTTCAGACATGGAGCTGAATTCTAGCTCAGAGGACTCCAAAGGGAGCAAGCGTGTCCGTACTAA
TTCCATGGGCTCAGCCACTGGCCCCCTTCCTGGGACAAAGGTAGAACCCACTGTTCTGGACAGAAACTGCCCCTCCCCCGTCCTAATTGA
CTGTCCCCACCCAAACTGCAACAAGAAGTACAAGCACATCAATGGACTTAAGTACCACCAAGCTCATGCCCATACAGATGATGACAGCAA
GCCGGAAGCGGATGGGGACAGTGAGTACGGAGAGGAACCTATTCTCCATGCAGATCTTGGGAGCTGCAACGGTGCATCTGTCTCACAAAA
AGGTTCCTTGTCCCCTGCCCGCTCAGCTACCCCCAAAGTTCGACTTGTAGAGCCCCATAGCCCTTCTCCTTCAAGCAAATTCAGCACAAA
AGGCCTCTGTAAGAAAAAGTTGAGTGGGGAAGGGGACACAGACCTTGGGGCCTTATCCAATGATGGCTCTGATGATGGACCCTCAGTGAT
GGATGAAACAAGCAATGATGCCTTTGATTCTTTAGAAAGGAAGTGTATGGAAAAAGAAAAATGTAAAAAACCCTCTAGTTTAAAACCTGA
AAAGATTCCTTCCAAGAGCCTAAAGTCAGCCCGTCCCATTGCCCCTGCCATCCCCCCACAGCAAATCTACACCTTCCAGACAGCCACCTT
CACAGCAGCGAGCCCAGGCTCTTCCTCAGGCTTGACCGCCACAGTGGCACAAGCCATGCCCAACAGTCCCCAACTCAAGCCCATTCAGCC
CAAGCCCACTGTTATGGGAGAACCTTTCACAGTCAACCCTGCCTTGACTCCAGCCAAGGACAAGAAAAAGAAAGACAAAAAAAAGAAGGA
ATCTTCAAAGGAACTTGAAAGTCCTCTGACCCCTGGGAAGGTGTGTCGAGCAGAGGAAGGCAAAAGCCCATTCAGGGAATCTTCAGGAGA
TGGGATGAAAATGGAGGGGCTCCTAAATGGCTCATCAGACCCCCACCAAAGCCGACTGGCTAGCATCAAGGCTGAAGCCGACAAGATCTA
CAGTTTCACGGACAATGCCCCCAGCCCTTCCATTGGAGGCAGTAGCCGCCTTGAAAACACTACCCCTACTCAGCCCCTGACTCCCTTACA
TGTGGTGACCCAGAATGGAGCTGAAGCCAGCTCAGTCAAAACCAACAGCCCTGCATACTCTGACATCTCTGATGCTGGGGAGGATGGGGA
GGGCAAGGTAGACAGTGTCAAATCAAAGGACGCCGAACAGTTGGTTAAAGAAGGGGCTAAGAAAACTCTTTTTCCCCCTCAGCCTCAGAG
CAAAGACTCACCATATTACCAAGGCTTTGAGAGTTACTATTCTCCAAGTTATGCACAGTCCAGCCCTGGGGCTCTGAACCCCAGCAGCCA
GGCAGGAGTGGAGAGCCAGGCCCTGAAGACAAAAAGGGATGAGGAACCTGAGAGCATAGAAGGGAAAGTGAAGAACGATATCTGTGAAGA
AAAGAAGCCCGAGCTGAGCAGTTCCAGTCAGCAGCCCTCGGTCATCCAGCAGCGTCCCAATATGTACATGCAGTCCCTGTACTACAACCA
GTATGCCTATGTACCCCCCTATGGCTACAGCGACCAGAGTTACCACACCCACCTTCTGAGCACTAACACGGCTTACCGGCAGCAGTACGA
AGAACAGCAGAAACGCCAGAGCTTAGAGCAGCAGCAGCGGGGAGTGGACAAGAAGGCAGAGATGGGCCTGAAGGAGCGGGAGGCAGCACT
CAAGGAAGAGTGGAAGCAAAAGCCGTCAATTCCACCAACTCTCACCAAGGCCCCCAGCCTGACAGACCTGGTGAAATCAGGACCTGGCAA
GGCCAAGGAGCCAGGGGCTGACCCAGCCAAATCAGTCATCATTCCCAAGTTAGATGACTCTTCAAAACTCCCGGGCCAGGCCCCTGAAGG
CCTTAAAGTGAAGCTGAGTGATGCCAGCCACCTAAGCAAGGAGGCCTCTGAGGCCAAGACAGGTGCTGAGTGTGGTCGACAGGCAGAGAT
GGATCCAATACTCTGGTACCGACAGGAGGCAGAGCCCCGGATGTGGACATATGTTTATCCTGCCAAGTACTCAGACATCAAGTCAGAGGA
TGAGCGGTGGAAGGAGGAGCGGGACCGCAAATTGAAGGAGGAAAGGAGTCGGAGTAAGGACTCTGTCCCCAAGGAAGATGGGAAGGAAAG
CACAAGTAGTGACTGCAAGCTGCCCACGTCAGAGGAGTCTCGCCTTGGGAGCAAGGAGCCCCGGCCAAGTGTCCATGTGCCTGTGTCCTC
CCCACTTACCCAGCACCAGTCCTACATCCCCTACATGCACGGCTATTCCTACAGTCAGTCCTACGACCCCAACCACCCCAGCTACCGGAG
CATGCCTGCTGTGATGATGCAGAACTACCCAGGTTCCTACCTGCCTTCCAGCTACTCTTTTTCCCCATATGGCAGCAAGGTCTCAGGTGG
TGAAGATGCTGACAAGGCACGAGCCAGCCCCAGTGTGACTTGTAAATCCAGCTCAGAGTCCAAAGCCCTGGACATCTTGCAGCAGCATGC
CAGTCACTACAAGAGCAAGTCTCCCACGATAAGTGATAAAACTTCTCAGGAGAGAGATCGAGGAGGCTGTGGGGTGGTTGGGGGTGGTGG
CAGCTGTAGCAGCGTCGGGGGAGCAAGTGGGGGTGAACGGAGTGTTGACCGGCCCCGCACCTCTCCTTCCCAGCGCCTGATGTCCACACA
CCACCACCACCACCACTTGGGGTACTCATTGCTCCCAGCACAGTACAACTTACCCTATGCAGCAGGGCTTTCTTCTACAGCCATTGTTGC
CAGCCAACAAGGCTCAACTCCCTCACTCTACCCACCCCCCAGGAGGTGAGAATGACACCAAGTGCCCGGATAAAGTCAGCTTCACGGGCC
CGGACTGGCTTACCCAAGGAGGTGCTGAAGGTGCCGTTTAGACATCAGTTAAATGGTGTTGATCATCCTGTTTGCCGTTTCCACCATGAC
TGAAGGCAGACCCTTGGCTATCTCACCTCCACCAGACCTCCGGACTACCTGACCCTACCTCTTCCTCAGGAGCTGGAGAGCTGGTACTTA
GCAAAAATATTTATTCTCTCAGCCACAGTTATGACTATTGTGGCCTCTGTGGAGATGAAGGCACGGGAAGCAACCAGGGGAACATGGCCT
CAGCCCAGAGAAGCCACTGCTCTGTTCCCCAAGCCCTTGGTCTGCTGCTGGAGCAGTACCAGCCCCCCCGCCCACCAGGGAGGGACCCCC
ACCCCCAAGCACTGGGTAAGGTCTGAAGACAGCACAGCAGCCATACCCCTCACCATCATTACCACCATCACCAGATTCTGCATCTCCCTA
GTGCTTTGCACCCTGGGAATTGGCAGCATGTGGAGGAACTAGAATCTCAGGAAAGAAATTGGGGGTTGTTTTCTACATAATTGTGAAAAC
AAGGTCTTCAAATGTGGAGACTTCTCCCCATTTACATGAGCACATATAAACGCTCACAACCTAGCCTGGAAAGGAAGACCAAGGCATCTG
CCCCAACATGGCCTTGAGCTGCCTGTGAGGCAGGGGGCAGGGGTTCCAACACCAGCACAGGGCTCCCCAGGGACACTGGGAGCAAGCTGG
TGCTGGAGCATGAATGACGTCTGTGAAGTAGAACCTGCGTCCCCACTAAGTCCTGCTGCTTCTTATTCCCCAACTCCTTGCCCTTTTCCC
TTCCCTCCTAACCCCTTGGTGCCTTTCCCAGGGGGATCCCCACACTGGTCTTGCCTCTTCTTTTCCACTGCTTGGCTCTTAAGCCTCAGG
CAGATAAACTAGTATTCCCCCCAGCTTGGGGAACCTTGGAGTCTGCCAGGTCACCTTAGGGCAAGGCCCAGAAGGCAGCCCCTGGGAGCA
CCCAGCAGTTCTTGGAGATGTCCTGTCATCTAGCCATCTGATATCTTCCTCATTTGAGGCCACAGATATATACAGCCCAATTCCTCTGTC
TACAAGTACATGATTTTATATAGCTCAGTCTATAACCTCCATGTGGGCCAATATAAGCTGTGTTTCTTGGTAACACATATTTTGTTTGAG
GGGCCACTGGCCATGGGAGGTTATTTGTTCCTTAGACCCTGGAATAACACATCCAAGCCATTACTTATTAGAGTCTCAGAATGTACTCAG
TGGAGCTGTGCTTTGAGGCAGCCAACATTTCTCTGCTCTCCTTAGAAATGCAGTCTCCCAATGGAAGCTTTATACTCTTTGTACTGGGAA
AGTGAGGATGATTTGGTAGCTTTATTGGGGTCATGTCTTCCCCAAGGTGTGGGGAGCTTAGCTTACTTGGCTTTTGAGGTATCATCCCTC
TGTTCTCCCCTCCTATCTTTCCATGACCCTCTGGATTGAGAGAGAGAGATAAAGACTGACAGACACCAGTGTAGGCTGGAAAAGGGAGTG
TGTGACCAGAGTGCCAAAAGTGACTAGGAGCAGGAACTTGGCTCCGACTCAGTTTGGAAAATGGGAAATAGGGGACAGTAAGCACAATGC
CCAGTAGTAGTTGATTTCCAAGGACCCTGGAACCCTACACTTGAGAGGCTTAGGGTCACCATCTGCTCAAGAGGATCCCCTCTGATCTAC
AGGCCTTTTCCCTAGGTTTCTGCCTCCTCGTTTTTGTTCAAGTTGGGTTCTGAGTCCTCCCCAAAAACCATTGTTTTAGACCTCTTGGCA
GGGCCCCAAAACAGCCTCCCTCATACCCATCATTCCCTCTGCCTTCTGCTGCCCTCATGGGCAGTGCTCTGAGCAGTGACCTCCCTTTCC
TCCGTGGAAGTAGCTAGTGCAGACACCGTCATCCCACCCCACCTGAGTCACCCCAACCAAGAGGGTGACTGAATTTCAGCCTGATTATGC
CCTCCTGGGGCTCCTGTGAGGTGGAGCCAAGGTTCCCTCTCTGTTCCTGTTTGTTTTTAAATATTGTTGTGTGTTTTGTATCTGTGGCAC
TGGCCTGCAGCATACTCTGTATATATTGTAAAGAAACCGTTAGGAGTAATTTTCTTTTGCATTGGGCAGGCATGGCCCTGCATTCCTGCC
CTTTCCACTCATTCTGTAACACAGAGGACGAACTTCTGTATTAGCTGGGCAGCCTTGGGTTCTCCAGAAGAGAACAGGTTTTTCTTTTCC
TTTTTAATTTTTCTTCTTAAACATTTGGCTCTTTGATCCTCATATCCAAGTCTCCCCTGAAGAGTAGGAGCTGCTCAGAAGAGCAGGTGA
AAGCCACCATGGCAGATCCTGATGCCTGCCGGGCCTAGTCTTCCCTCTGAAATAACATGAAGCAGCAGCTGTGGAGATTCTTGACAAGTG
CTGAGTGAAAGATTTGCTGCCCACCTCTACATGGGGAGGAGAAACACAGGTGGGAGCTACCTGTGGCATCCATGACCTAGTCAGAGGGAT
GAGATGCTCAGCAGGGGTCCCCATCCTATCCCACCCCACAAACAAAGGCTGGAAAAATTTGCTACCAAGGGCCAAGACCACCAGACCAAG
CCTGTTTATGAGCCACCCCTGCCCAGGCCCTCACAGACATTGCTCACGGGGCTTCCCATAGAGGAGAAGCTAAAGAGGGAGGGGGCCTCA
TCCCCAGATAGATCAGGCAAGGCTTGGAGAGCTGCTCTTTAGGATCCACATCAACTACTTCCTCATTTTAAGGTATGGCAGTTCCCTTCA
TCCCCTTTTCCTGCCTTGTACATGTACATGTATGAAATTTCCTTCTCTTACCGAACTCTCTCCACACATCACAAGGTCAAAGAACCACAC
GCTTAGAAGGGTAAGAGGGCACCCTATGAAATGAAATGGTGATTTCTTGAGTCTCTTTTTTCCACGTTTAAGGGGCCATGGCAGGACTTA
GAGTTGCGAGTTAAGACTGCAGAGGGCTAGAGAATTATTTCATACAGGCTTTGAGGCCACCCATGTCACTTATCCCGTATACCCTCTCAC
CATCCCCTTGTCTACTCTGATGCCCCCAAGATGCAACTGGGCAGCTAGTTGGCCCCATAATTCTGGGCCTTTGTTGTTTGTTTTAATTAC
TTGGGCATCCCAGGAAGCTTTCCAGTGATCTCCTACCATGGGCCCCCCTCCTGGGATCAAGCCCCTCCCAGGCCCTGTCCCCAGCCCCTC
CTGCCCCAGCCCACCCGCTTGCCTTGGTGCTCAGCCCTCCCATTGGGAGCAGGTTGGGGCGAGCTGGAGGCCCGGGCTGGAGGGGCAGTG
TTGCTGTTCATAGATTTTGTTCCATTGGCGTTGCTCTGTTGAATTTAATTTCAGTCTTCCTGATTCTTCCCTTCTGTAAAGTGTACATTA
CCAAGTTCCTTGTTTTTTTATATATATATATAAATATATATATATACAAACTGTACTCTTTTTGCCTTTGTACATTCAGGCAAGAAGAGA
AAATAAATCTTTTTAAGAGACAATCACAAATCTGTGAGGGCTGCTGGTTATTTCTCCTGGAGTTTGCTGCTGAGCTGCCTCTTCCTTCCT
CCCAATTTTCCTGTTCTCCCTCAGCTCTCCTGATCTTCCTGGCCCTGCTCCATATGCATCCTCAGCTTCACTTTCCCTGGCTGATGGCAA
GCTGTTGAATCCAGTGTCCAGACTACCTGCCTTGTAACCCTTTTCTGCCCAGCATTGTTTTCTGGCTTGGCCACTGGCTTAGCCCAGGAG
CTTTACTCTGTGCCCTGGCCTCCCCTCTCTTCACCTTTAGATTTCCATTCACCGAAGTGGCTTTGGACCCCTGGGTACTCTGGGACCTGT
TTCCTGGAGGCCCTGGCTTGGGACACTCACCTGTGAAACTATGCAGCTGGGAGCTCTCTGCCTAAGAGTTTGCACTATTTAAACCTGCCT
GGGAGTTAGGACGGATGGTTTTAGGAATGACCGGAAAACTACCCCTAAAACTCCCCCGACATTCCAGCCTCTAGAATGCTCTGATCCAGA
GCTCAGTGGATGATTCCCAGCTGGTGGACTCCTGTGGCTACCCCATCAGAACAAGGGCTAAGGGTTTATGGGTCAAGAGTATTTGATCAG
AATTTTAAAGGGTGGTATACTCTGAAACACAGCCCAACCAAACCATTGTTTGGCCGCTTTCTCTTTTCCTCTACCTTCCTCATCCCCACT
TTTTTCCCTTTCTCTCTACTTCCTCTTCTTAATTGGCTTTGGAATTGAAATATATTTTTAAATTATTTGTTGTATTTATTGAATAAAGTT

>18282_18282_2_COL2A1-ZNF609_COL2A1_chr12_48398020_ENST00000380518_ZNF609_chr15_64915026_ENST00000326648_length(amino acids)=1172AA_BP=10
MSFGVRARMSMESPVSTPAVLPIHLLVPVVNNDISSPCEQIMVRTRSVGVNTCDVALATEPECLGPCEPGTSVNLEGIVWQETEDGMLVV
NVTWRNKTYVGTLLDCTRHDWAPPRFCDSPTSDLEMRNGRGRGKRMRPNSNTPVNETATASDSKGTSNSSKTRAGANSKGRRGSQNSSEH
RPPASSTSEDVKASPSSANKRKNKPLSDMELNSSSEDSKGSKRVRTNSMGSATGPLPGTKVEPTVLDRNCPSPVLIDCPHPNCNKKYKHI
NGLKYHQAHAHTDDDSKPEADGDSEYGEEPILHADLGSCNGASVSQKGSLSPARSATPKVRLVEPHSPSPSSKFSTKGLCKKKLSGEGDT
DLGALSNDGSDDGPSVMDETSNDAFDSLERKCMEKEKCKKPSSLKPEKIPSKSLKSARPIAPAIPPQQIYTFQTATFTAASPGSSSGLTA
TVAQAMPNSPQLKPIQPKPTVMGEPFTVNPALTPAKDKKKKDKKKKESSKELESPLTPGKVCRAEEGKSPFRESSGDGMKMEGLLNGSSD
PHQSRLASIKAEADKIYSFTDNAPSPSIGGSSRLENTTPTQPLTPLHVVTQNGAEASSVKTNSPAYSDISDAGEDGEGKVDSVKSKDAEQ
LVKEGAKKTLFPPQPQSKDSPYYQGFESYYSPSYAQSSPGALNPSSQAGVESQALKTKRDEEPESIEGKVKNDICEEKKPELSSSSQQPS
VIQQRPNMYMQSLYYNQYAYVPPYGYSDQSYHTHLLSTNTAYRQQYEEQQKRQSLEQQQRGVDKKAEMGLKEREAALKEEWKQKPSIPPT
LTKAPSLTDLVKSGPGKAKEPGADPAKSVIIPKLDDSSKLPGQAPEGLKVKLSDASHLSKEASEAKTGAECGRQAEMDPILWYRQEAEPR
MWTYVYPAKYSDIKSEDERWKEERDRKLKEERSRSKDSVPKEDGKESTSSDCKLPTSEESRLGSKEPRPSVHVPVSSPLTQHQSYIPYMH
GYSYSQSYDPNHPSYRSMPAVMMQNYPGSYLPSSYSFSPYGSKVSGGEDADKARASPSVTCKSSSESKALDILQQHASHYKSKSPTISDK
TSQERDRGGCGVVGGGGSCSSVGGASGGERSVDRPRTSPSQRLMSTHHHHHHLGYSLLPAQYNLPYAAGLSSTAIVASQQGSTPSLYPPP

--------------------------------------------------------------
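
The FASTA-style headers in this section pack the fusion metadata into underscore-separated fields. The sketch below pulls those fields out with a regular expression. The field names (`record_id`, `bp_pos`, and so on) are hypothetical labels chosen for illustration, the pattern is inferred from the headers shown here, and it assumes gene symbols contain no underscores.

```python
import re

# Pattern inferred from the headers on this page; field names are illustrative.
HEADER_RE = re.compile(
    r"^>(?P<record_id>\d+_\d+_\d+)_"
    r"(?P<fusion>[^_]+)_"
    r"(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)_"
    r"length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?:nt|AA)_"
    r"BP=(?P<bp_pos>\d+)(?:nt)?$"
)


def parse_fusion_header(header: str) -> dict:
    """Split one sequence header into named fields, converting numbers to int."""
    match = HEADER_RE.match(header.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {header!r}")
    fields = match.groupdict()
    for key in ("hbp", "tbp", "length", "bp_pos"):
        fields[key] = int(fields[key])
    return fields


example = (
    ">18282_18282_1_COL2A1-ZNF609_COL2A1_chr12_48398020_ENST00000337299_"
    "ZNF609_chr15_64915026_ENST00000326648_length(transcript)=7953nt_BP=85nt"
)
print(parse_fusion_header(example))
```

Parsing the header this way makes it straightforward to check a sequence's actual length against the declared `length` field after reading the record.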


Fusion Gene PPI Analysis for COL2A1-ZNF609


Go to the ChiPPI (Chimeric Protein-Protein Interactions) page to see the chimeric PPIs.

Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene | Hgene's interactors | Tgene | Tgene's interactors

- Retained PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with

- Lost PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with

- Retained PPIs, but lost function due to frame-shift fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with



Related Drugs for COL2A1-ZNF609


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status
Hgene | COL2A1 | P02458 | DB00048 | Collagenase clostridium histolyticum | Binder | Biotech | Approved|Investigational


Related Diseases for COL2A1-ZNF609


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source
Hgene | COL2A1 | C0220685 | Achondrogenesis type 2 | 13 | CTD_human;GENOMICS_ENGLAND;ORPHANET;UNIPROT
Hgene | COL2A1 | C2745959 | Spondyloepiphyseal dysplasia, congenita | 12 | CTD_human;GENOMICS_ENGLAND;ORPHANET;UNIPROT
Hgene | COL2A1 | C0003865 | Arthritis, Adjuvant-Induced | 9 | CTD_human
Hgene | COL2A1 | C0265279 | Kniest dysplasia | 9 | CTD_human;GENOMICS_ENGLAND;ORPHANET;UNIPROT
Hgene | COL2A1 | C0971858 | Arthritis, Collagen-Induced | 9 | CTD_human
Hgene | COL2A1 | C0993582 | Arthritis, Experimental | 9 | CTD_human
Hgene | COL2A1 | C1836080 | Stickler Syndrome, Type I, Nonsyndromic Ocular | 8 | CTD_human;GENOMICS_ENGLAND;UNIPROT
Hgene | COL2A1 | C1835437 | Platyspondylic Lethal Skeletal Dysplasia, Torrance Type | 6 | CTD_human;GENOMICS_ENGLAND;ORPHANET;UNIPROT
Hgene | COL2A1 | C1836683 | Czech dysplasia, metatarsal type | 6 | CTD_human;GENOMICS_ENGLAND;ORPHANET;UNIPROT
Hgene | COL2A1 | C2020284 | Stickler syndrome, type 1 | 6 | CTD_human;GENOMICS_ENGLAND;ORPHANET;UNIPROT
Hgene | COL2A1 | C0432214 | Namaqualand hip dysplasia | 5 | CTD_human;GENOMICS_ENGLAND;ORPHANET;UNIPROT
Hgene | COL2A1 | C0700635 | Strudwick syndrome | 5 | CTD_human;GENOMICS_ENGLAND;ORPHANET;UNIPROT
Hgene | COL2A1 | C4225273 | SPONDYLOEPIPHYSEAL DYSPLASIA, STANESCU TYPE | 5 | CLINGEN;GENOMICS_ENGLAND;ORPHANET;UNIPROT
Hgene | COL2A1 | C0013604 | Edema | 4 | CTD_human
Hgene | COL2A1 | C0151603 | Anasarca | 4 | CTD_human
Hgene | COL2A1 | C0796173 | Spondyloperipheral dysplasia short ulna | 4 | CTD_human;GENOMICS_ENGLAND;ORPHANET
Hgene | COL2A1 | C4551562 | AVASCULAR NECROSIS OF FEMORAL HEAD, PRIMARY, 1 | 4 | GENOMICS_ENGLAND;ORPHANET;UNIPROT
Hgene | COL2A1 | C0003864 | Arthritis | 3 | CTD_human
Hgene | COL2A1 | C0023234 | Legg-Calve-Perthes Disease | 3 | CTD_human;GENOMICS_ENGLAND;ORPHANET;UNIPROT
Hgene | COL2A1 | C0162323 | Polyarthritis | 3 | CTD_human
Hgene | COL2A1 | C1851536 | Epiphyseal Dysplasia, Multiple, with Myopia and Conductive Deafness | 3 | CTD_human;GENOMICS_ENGLAND;ORPHANET;UNIPROT
Hgene | COL2A1 | C0003873 | Rheumatoid Arthritis | 2 | CTD_human
Hgene | COL2A1 | C0008925 | Cleft Palate | 2 | CTD_human;GENOMICS_ENGLAND
Hgene | COL2A1 | C0021368 | Inflammation | 2 | CTD_human
Hgene | COL2A1 | C0029408 | Degenerative polyarthritis | 2 | CTD_human
Hgene | COL2A1 | C0041834 | Erythema | 2 | CTD_human
Hgene | COL2A1 | C0086743 | Osteoarthrosis Deformans | 2 | CTD_human
Hgene | COL2A1 | C0376634 | Craniofacial Abnormalities | 2 | CTD_human
Hgene | COL2A1 | C1262477 | Weight decreased | 2 | CTD_human
Hgene | COL2A1 | C1852989 | Vitreoretinopathy with Phalangeal Epiphyseal Dysplasia | 2 | GENOMICS_ENGLAND
Hgene | COL2A1 | C1861481 | Stickler syndrome, type 3 | 2 | ORPHANET
Hgene | COL2A1 | C4302548 | Dysspondyloenchondromatosis | 2 | ORPHANET
Hgene | COL2A1 | C4520892 | Otospondylomegaepiphyseal dysplasia | 2 | CTD_human;GENOMICS_ENGLAND
Hgene | COL2A1 | C0005974 | Bone Resorption | 1 | CTD_human
Hgene | COL2A1 | C0007302 | Cartilage Diseases | 1 | CTD_human
Hgene | COL2A1 | C0008479 | Chondrosarcoma | 1 | CTD_human
Hgene | COL2A1 | C0013366 | Dyschondroplasias | 1 | CTD_human
Hgene | COL2A1 | C0015397 | Disorder of eye | 1 | GENOMICS_ENGLAND
Hgene | COL2A1 | C0018784 | Sensorineural Hearing Loss (disorder) | 1 | CTD_human
Hgene | COL2A1 | C0020507 | Hyperplasia | 1 | CTD_human
Hgene | COL2A1 | C0025202 | melanoma | 1 | CTD_human
Hgene | COL2A1 | C0025237 | Melnick-Needles Syndrome | 1 | CTD_human
Hgene | COL2A1 | C0026760 | Multiple Epiphyseal Dysplasia | 1 | CTD_human
Hgene | COL2A1 | C0027092 | Myopia | 1 | CTD_human
Hgene | COL2A1 | C0029422 | Osteochondrodysplasias | 1 | CTD_human
Hgene | COL2A1 | C0035305 | Retinal Detachment | 1 | CTD_human
Hgene | COL2A1 | C0036391 | Schwartz-Jampel Syndrome | 1 | CTD_human
Hgene | COL2A1 | C0038015 | Spondyloepiphyseal Dysplasia | 1 | CTD_human
Hgene | COL2A1 | C0039103 | Synovitis | 1 | CTD_human
Hgene | COL2A1 | C0085700 | Chondromalacia | 1 | CTD_human
Hgene | COL2A1 | C0086543 | Cataract | 1 | CTD_human
Hgene | COL2A1 | C0339546 | Retinal Pigment Epithelial Detachment | 1 | CTD_human
Hgene | COL2A1 | C0410480 | Avascular Necrosis of Femur Head | 1 | CTD_human;GENOMICS_ENGLAND;ORPHANET
Hgene | COL2A1 | C0410574 | Synovial Hypertrophy | 1 | CTD_human
Hgene | COL2A1 | C0432221 | Spondylometaphyseal dysplasia, 'corner fracture' type | 1 | ORPHANET
Hgene | COL2A1 | C0432272 | Van Buchem disease | 1 | CTD_human
Hgene | COL2A1 | C0524524 | Pseudoaphakia | 1 | CTD_human
Hgene | COL2A1 | C0542428 | Hypochondrogenesis | 1 | ORPHANET
Hgene | COL2A1 | C1510497 | Lens Opacities | 1 | CTD_human
Hgene | COL2A1 | C1691779 | Sensory hearing loss | 1 | CTD_human
Hgene | COL2A1 | C1836081 | RHEGMATOGENOUS RETINAL DETACHMENT, AUTOSOMAL DOMINANT | 1 | ORPHANET
Hgene | COL2A1 | C1837218 | Cleft palate, isolated | 1 | CTD_human
Hgene | COL2A1 | C1840452 | Hyaloideoretinal degeneration of Wagner | 1 | CTD_human
Hgene | COL2A1 | C1855310 | Megaepiphyseal dwarfism | 1 | CTD_human
Hgene | COL2A1 | C1866688 | Spondylometaphyseal dysplasia, Algerian type | 1 | ORPHANET
Hgene | COL2A1 | C3541456 | Spondyloepiphyseal Dysplasia Tarda, X-Linked | 1 | CTD_human
Hgene | COL2A1 | C4479260 | AVASCULAR NECROSIS OF FEMORAL HEAD, PRIMARY, 2 | 1 | ORPHANET
Hgene | COL2A1 | C4551479 | Schwartz-Jampel Syndrome, Type 1 | 1 | CTD_human