FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:COL26A1-GTF2IRD1 (FusionGDB2 ID:18258)

Fusion Gene Summary for COL26A1-GTF2IRD1

check button Fusion gene summary
Fusion gene informationFusion gene name: COL26A1-GTF2IRD1
Fusion gene ID: 18258
HgeneTgene
Gene symbol

COL26A1

GTF2IRD1

Gene ID

136227

9569

Gene namecollagen type XXVI alpha 1 chainGTF2I repeat domain containing 1
SynonymsEMI6|EMID2|EMU2|SH2BBEN|CREAM1|GTF3|MUSTRD1|RBAP2|WBS|WBSCR11|WBSCR12|hMusTRD1alpha1
Cytomap

7q22.1

7q11.23

Type of geneprotein-codingprotein-coding
Descriptioncollagen alpha-1(XXVI) chainEMI domain containing 2collagen, type XXVI, alpha 1emilin and multimerin domain-containing protein 2general transcription factor II-I repeat domain-containing protein 1USE B1-binding proteinWilliams-Beuren syndrome chromosome region 11binding factor for early enhancergeneral transcription factor 3general transcription factor IIImuscle TFII-I repea
Modification date2020031320200313
UniProtAcc

Q96A83

Q9UHL9

Ensembl transtripts involved in fusion geneENST00000313669, ENST00000397927, 
ENST00000528707, 
ENST00000489094, 
ENST00000265755, ENST00000424337, 
ENST00000455841, ENST00000476977, 
Fusion gene scores* DoF score5 X 5 X 4=10012 X 10 X 9=1080
# samples 512
** MAII scorelog2(5/100*10)=-1
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(12/1080*10)=-3.16992500144231
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: COL26A1 [Title/Abstract] AND GTF2IRD1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointCOL26A1(101091068)-GTF2IRD1(73944064), # samples:1
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID

check buttonFusion gene breakpoints across COL26A1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across GTF2IRD1 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4ESCATCGA-L5-A8NECOL26A1chr7

101091068

+GTF2IRD1chr7

73944064

+


Top

Fusion Gene ORF analysis for COL26A1-GTF2IRD1

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-3UTRENST00000313669ENST00000489094COL26A1chr7

101091068

+GTF2IRD1chr7

73944064

+
5CDS-3UTRENST00000397927ENST00000489094COL26A1chr7

101091068

+GTF2IRD1chr7

73944064

+
5CDS-3UTRENST00000528707ENST00000489094COL26A1chr7

101091068

+GTF2IRD1chr7

73944064

+
In-frameENST00000313669ENST00000265755COL26A1chr7

101091068

+GTF2IRD1chr7

73944064

+
In-frameENST00000313669ENST00000424337COL26A1chr7

101091068

+GTF2IRD1chr7

73944064

+
In-frameENST00000313669ENST00000455841COL26A1chr7

101091068

+GTF2IRD1chr7

73944064

+
In-frameENST00000313669ENST00000476977COL26A1chr7

101091068

+GTF2IRD1chr7

73944064

+
In-frameENST00000397927ENST00000265755COL26A1chr7

101091068

+GTF2IRD1chr7

73944064

+
In-frameENST00000397927ENST00000424337COL26A1chr7

101091068

+GTF2IRD1chr7

73944064

+
In-frameENST00000397927ENST00000455841COL26A1chr7

101091068

+GTF2IRD1chr7

73944064

+
In-frameENST00000397927ENST00000476977COL26A1chr7

101091068

+GTF2IRD1chr7

73944064

+
In-frameENST00000528707ENST00000265755COL26A1chr7

101091068

+GTF2IRD1chr7

73944064

+
In-frameENST00000528707ENST00000424337COL26A1chr7

101091068

+GTF2IRD1chr7

73944064

+
In-frameENST00000528707ENST00000455841COL26A1chr7

101091068

+GTF2IRD1chr7

73944064

+
In-frameENST00000528707ENST00000476977COL26A1chr7

101091068

+GTF2IRD1chr7

73944064

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000397927COL26A1chr7101091068+ENST00000265755GTF2IRD1chr773944064+25455981622387741
ENST00000397927COL26A1chr7101091068+ENST00000455841GTF2IRD1chr773944064+25145981622342726
ENST00000397927COL26A1chr7101091068+ENST00000424337GTF2IRD1chr773944064+25115981622342726
ENST00000397927COL26A1chr7101091068+ENST00000476977GTF2IRD1chr773944064+36855981622390742
ENST00000313669COL26A1chr7101091068+ENST00000265755GTF2IRD1chr773944064+25245771412366741
ENST00000313669COL26A1chr7101091068+ENST00000455841GTF2IRD1chr773944064+24935771412321726
ENST00000313669COL26A1chr7101091068+ENST00000424337GTF2IRD1chr773944064+24905771412321726
ENST00000313669COL26A1chr7101091068+ENST00000476977GTF2IRD1chr773944064+36645771412369742
ENST00000528707COL26A1chr7101091068+ENST00000265755GTF2IRD1chr773944064+24825351052324739
ENST00000528707COL26A1chr7101091068+ENST00000455841GTF2IRD1chr773944064+24515351052279724
ENST00000528707COL26A1chr7101091068+ENST00000424337GTF2IRD1chr773944064+24485351052279724
ENST00000528707COL26A1chr7101091068+ENST00000476977GTF2IRD1chr773944064+36225351052327740

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000397927ENST00000265755COL26A1chr7101091068+GTF2IRD1chr773944064+0.0083614220.9916386
ENST00000397927ENST00000455841COL26A1chr7101091068+GTF2IRD1chr773944064+0.008016050.99198395
ENST00000397927ENST00000424337COL26A1chr7101091068+GTF2IRD1chr773944064+0.0082784220.9917216
ENST00000397927ENST00000476977COL26A1chr7101091068+GTF2IRD1chr773944064+0.0077020510.99229795
ENST00000313669ENST00000265755COL26A1chr7101091068+GTF2IRD1chr773944064+0.0084438180.99155617
ENST00000313669ENST00000455841COL26A1chr7101091068+GTF2IRD1chr773944064+0.0080536670.9919463
ENST00000313669ENST00000424337COL26A1chr7101091068+GTF2IRD1chr773944064+0.0083027770.9916972
ENST00000313669ENST00000476977COL26A1chr7101091068+GTF2IRD1chr773944064+0.0077090620.9922909
ENST00000528707ENST00000265755COL26A1chr7101091068+GTF2IRD1chr773944064+0.0092061260.9907939
ENST00000528707ENST00000455841COL26A1chr7101091068+GTF2IRD1chr773944064+0.0081396510.99186033
ENST00000528707ENST00000424337COL26A1chr7101091068+GTF2IRD1chr773944064+0.0083766020.99162334
ENST00000528707ENST00000476977COL26A1chr7101091068+GTF2IRD1chr773944064+0.0074187420.99258125

Top

Fusion Genomic Features for COL26A1-GTF2IRD1


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
COL26A1chr7101091068+GTF2IRD1chr773944063+2.07E-121
COL26A1chr7101091068+GTF2IRD1chr773944063+2.07E-121

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.
genomic feature of top 1%

Top

Fusion Protein Features for COL26A1-GTF2IRD1


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr7:101091068/chr7:73944064)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
COL26A1

Q96A83

GTF2IRD1

Q9UHL9

FUNCTION: May be a transcription regulator involved in cell-cycle progression and skeletal muscle differentiation. May repress GTF2I transcriptional functions, by preventing its nuclear residency, or by inhibiting its transcriptional activation. May contribute to slow-twitch fiber type specificity during myogenesis and in regenerating muscles. Binds troponin I slow-muscle fiber enhancer (USE B1). Binds specifically and with high affinity to the EFG sequences derived from the early enhancer of HOXC8 (By similarity). {ECO:0000250, ECO:0000269|PubMed:11438732}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneCOL26A1chr7:101091068chr7:73944064ENST00000397927+31352_128128441.6666666666667DomainEMI
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000265755727906_930363960.0Compositional biasNote=Ser-rich
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000424337727906_930363945.0Compositional biasNote=Ser-rich
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000455841727906_930395977.0Compositional biasNote=Ser-rich
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000265755727898_905363960.0MotifNuclear localization signal
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000424337727898_905363945.0MotifNuclear localization signal
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000455841727898_905395977.0MotifNuclear localization signal
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000265755727556_650363960.0RepeatNote=GTF2I-like 3
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000265755727696_790363960.0RepeatNote=GTF2I-like 4
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000265755727793_887363960.0RepeatNote=GTF2I-like 5
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000424337727556_650363945.0RepeatNote=GTF2I-like 3
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000424337727696_790363945.0RepeatNote=GTF2I-like 4
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000424337727793_887363945.0RepeatNote=GTF2I-like 5
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000455841727556_650395977.0RepeatNote=GTF2I-like 3
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000455841727696_790395977.0RepeatNote=GTF2I-like 4
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000455841727793_887395977.0RepeatNote=GTF2I-like 5

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneCOL26A1chr7:101091068chr7:73944064ENST00000397927+313199_267128441.6666666666667DomainNote=Collagen-like 1
HgeneCOL26A1chr7:101091068chr7:73944064ENST00000397927+313302_355128441.6666666666667DomainNote=Collagen-like 2
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000265755727119_213363960.0RepeatNote=GTF2I-like 1
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000265755727342_436363960.0RepeatNote=GTF2I-like 2
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000424337727119_213363945.0RepeatNote=GTF2I-like 1
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000424337727342_436363945.0RepeatNote=GTF2I-like 2
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000455841727119_213395977.0RepeatNote=GTF2I-like 1
TgeneGTF2IRD1chr7:101091068chr7:73944064ENST00000455841727342_436395977.0RepeatNote=GTF2I-like 2


Top

Fusion Gene Sequence for COL26A1-GTF2IRD1


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>18258_18258_1_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000313669_GTF2IRD1_chr7_73944064_ENST00000265755_length(transcript)=2524nt_BP=577nt
GAATTTGAAAAGGCGGCCCCGGAGAGGCGTGGGCGCCCCCCACACATTTCCAGCTCGCACCCGGGCTCCGACCGCTCGCCCCGCTCCTCT
CGCTGTGCTCCCGGCCGGTGCCGCGGGTTCGGTCCGGGCGCCGGTGCGCTCCTGCCGGTCCTCGTGCCCGGGACTCCGGGTCCCCGCGGG
CTGCTGCGCACGATGAAGCTGGCCCTGCTCCTGCCCTGGGCGTGTTGCTGCCTCTGCGGGTCGGCGCTGGCCACCGGCTTCCTCTATCCC
TTCTCGGCCGCAGCTCTGCAGCAGCACGGCTACCCCGAGCCCGGCGCCGGCTCCCCTGGCAGCGGCTACGCGAGCCGCCGGCACTGGTGC
CATCACACAGTGACACGGACGGTGTCCTGCCAGGTGCAGAATGGCTCGGAGACGGTGGTCCAGCGCGTGTACCAGAGCTGCCGGTGGCCG
GGGCCCTGCGCCAACCTCGTAAGTTACAGGACTCTGATCAGACCCACCTACAGAGTGTCCTACCGCACGGTGACGGTGCTGGAGTGGAGA
TGCTGCCCTGGCTTCACCGGGAGCAACTGTGATGAGGCGGAAGCCCTGGGCCTGGACCACATGGTCCCCGTGCCCTACCGGAAGATTGCC
TGTGACCCGGAGGCTGTGGAGATCGTGGGCATCCCGGACAAGATCCCCTTCAAGCGCCCCTGCACTTATGGAGTCCCCAAGCTGAAGCGG
ATCCTGGAGGAGCGCCATAGTATCCACTTCATCATTAAGAGGATGTTTGATGAGCGAATTTTCACAGGGAACAAGTTTACCAAAGACACC
ACGAAGCTGGAGCCAGCCAGCCCGCCAGAGGACACCTCTGCAGAGGTCTCTAGGGCCACCGTCCTTGACCTTGCTGGGAATGCTCGGTCA
GACAAGGGCAGCATGTCTGAAGACTGTGGGCCAGGAACCTCCGGGGAGCTGGGCGGGCTGAGGCCGATCAAAATTGAGCCAGAGGATCTG
GACATCATTCAGGTCACCGTCCCAGACCCCTCGCCAACCTCTGAGGAAATGACAGACTCGATGCCTGGGCACCTGCCATCGGAGGATTCT
GGTTATGGGATGGAGATGCTGACAGACAAAGGTCTGAGTGAGGACGCGCGGCCCGAGGAGAGGCCCGTGGAGGACAGCCACGGTGACGTG
ATCCGGCCCCTGCGGAAGCAGGTGGAGCTGCTCTTCAACACACGATACGCCAAGGCCATTGGCATCTCGGAGCCCGTCAAGGTGCCGTAC
TCCAAGTTTCTGATGCACCCGGAGGAGCTGTTTGTGGTGGGACTGCCTGAAGGCATCTCCCTCCGCAGGCCCAACTGCTTCGGGATCGCC
AAGCTCCGGAAGATTCTGGAGGCCAGCAACAGCATCCAGTTTGTCATCAAGAGGCCCGAGCTGCTCACTGAGGGAGTCAAAGAGCCCATC
ATGGATAGTCAAGGAACTGCCTCCTCACTTGGCTTCTCTCCCCCTGCCCTGCCCCCAGAGAGGGATTCCGGGGACCCTCTGGTGGACGAG
AGCCTGAAGAGACAGGGCTTTCAAGAAAATTATGACGCGAGGCTCTCACGGATCGACATCGCCAACACACTAAGGGAGCAGGTCCAGGAC
CTTTTCAATAAGAAATACGGGGAAGCCTTGGGCATCAAGTACCCGGTCCAGGTCCCCTACAAGCGGATCAAGAGTAACCCCGGCTCCGTG
ATCATCGAGGGGCTGCCCCCAGGAATCCCGTTCCGAAAGCCCTGTACCTTCGGCTCCCAGAACCTGGAGAGGATTCTTGCTGTGGCTGAC
AAGATCAAGTTCACAGTCACCAGGCCTTTCCAAGGACTCATCCCAAAGCCTGATGAAGATGACGCCAACAGACTCGGGGAGAAGGTGATC
CTGCGGGAGCAGGTGAAGGAACTCTTCAACGAGAAATACGGTGAGGCCCTGGGCCTGAACCGGCCGGTGCTGGTCCCTTATAAACTAATC
CGGGACAGCCCAGACGCCGTGGAGGTCACGGGTCTGCCTGATGACATCCCCTTCCGGAACCCCAACACGTACGACATCCACCGGCTGGAG
AAGATCCTGAAGGCCCGAGAGCATGTCCGCATGGTCATCATTAACCAGCTCCAACCCTTTGCAGAAATCTGCAATGATGCCAAGGTGCCA
GCCAAAGACAGCAGCATTCCCAAGCGCAAGAGAAAGCGGGTCTCGGAAGGAAATTCCGTCTCCTCTTCCTCCTCGTCTTCCTCTTCCTCG
TCCTCTAACCCGGATTCAGTGGCATCGGCCAACCAGATCTCACTCGTGCAATGGCCAATGTACATGGTGGACTATGCCGGCCTGAACGTG
CAGCTCCCGGGACCTCTTAATTACTAGACCTCAGTACTGAATCAGGACCTCACTCAGAAAGACTAAAGGAAATGTAATTTATGTACAAAA
TGTATATTCGGATATGTATCGATGCCTTTTAGTTTTTCCAATGATTTTTACACTATATTCCTGCCACCAAGGCCTTTTTAAATAAGTAAA

>18258_18258_1_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000313669_GTF2IRD1_chr7_73944064_ENST00000265755_length(amino acids)=741AA_BP=145
MPVLVPGTPGPRGLLRTMKLALLLPWACCCLCGSALATGFLYPFSAAALQQHGYPEPGAGSPGSGYASRRHWCHHTVTRTVSCQVQNGSE
TVVQRVYQSCRWPGPCANLVSYRTLIRPTYRVSYRTVTVLEWRCCPGFTGSNCDEAEALGLDHMVPVPYRKIACDPEAVEIVGIPDKIPF
KRPCTYGVPKLKRILEERHSIHFIIKRMFDERIFTGNKFTKDTTKLEPASPPEDTSAEVSRATVLDLAGNARSDKGSMSEDCGPGTSGEL
GGLRPIKIEPEDLDIIQVTVPDPSPTSEEMTDSMPGHLPSEDSGYGMEMLTDKGLSEDARPEERPVEDSHGDVIRPLRKQVELLFNTRYA
KAIGISEPVKVPYSKFLMHPEELFVVGLPEGISLRRPNCFGIAKLRKILEASNSIQFVIKRPELLTEGVKEPIMDSQGTASSLGFSPPAL
PPERDSGDPLVDESLKRQGFQENYDARLSRIDIANTLREQVQDLFNKKYGEALGIKYPVQVPYKRIKSNPGSVIIEGLPPGIPFRKPCTF
GSQNLERILAVADKIKFTVTRPFQGLIPKPDEDDANRLGEKVILREQVKELFNEKYGEALGLNRPVLVPYKLIRDSPDAVEVTGLPDDIP
FRNPNTYDIHRLEKILKAREHVRMVIINQLQPFAEICNDAKVPAKDSSIPKRKRKRVSEGNSVSSSSSSSSSSSSNPDSVASANQISLVQ

--------------------------------------------------------------
>18258_18258_2_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000313669_GTF2IRD1_chr7_73944064_ENST00000424337_length(transcript)=2490nt_BP=577nt
GAATTTGAAAAGGCGGCCCCGGAGAGGCGTGGGCGCCCCCCACACATTTCCAGCTCGCACCCGGGCTCCGACCGCTCGCCCCGCTCCTCT
CGCTGTGCTCCCGGCCGGTGCCGCGGGTTCGGTCCGGGCGCCGGTGCGCTCCTGCCGGTCCTCGTGCCCGGGACTCCGGGTCCCCGCGGG
CTGCTGCGCACGATGAAGCTGGCCCTGCTCCTGCCCTGGGCGTGTTGCTGCCTCTGCGGGTCGGCGCTGGCCACCGGCTTCCTCTATCCC
TTCTCGGCCGCAGCTCTGCAGCAGCACGGCTACCCCGAGCCCGGCGCCGGCTCCCCTGGCAGCGGCTACGCGAGCCGCCGGCACTGGTGC
CATCACACAGTGACACGGACGGTGTCCTGCCAGGTGCAGAATGGCTCGGAGACGGTGGTCCAGCGCGTGTACCAGAGCTGCCGGTGGCCG
GGGCCCTGCGCCAACCTCGTAAGTTACAGGACTCTGATCAGACCCACCTACAGAGTGTCCTACCGCACGGTGACGGTGCTGGAGTGGAGA
TGCTGCCCTGGCTTCACCGGGAGCAACTGTGATGAGGCGGAAGCCCTGGGCCTGGACCACATGGTCCCCGTGCCCTACCGGAAGATTGCC
TGTGACCCGGAGGCTGTGGAGATCGTGGGCATCCCGGACAAGATCCCCTTCAAGCGCCCCTGCACTTATGGAGTCCCCAAGCTGAAGCGG
ATCCTGGAGGAGCGCCATAGTATCCACTTCATCATTAAGAGGATGTTTGATGAGCGAATTTTCACAGGGAACAAGTTTACCAAAGACACC
ACGAAGCTGGAGCCAGCCAGCCCGCCAGAGGACACCTCTGCAGAGGTCTCTAGGGCCACCGTCCTTGACCTTGCTGGGAATGCTCGGTCA
GACAAGGGCAGCATGTCTGAAGACTGTGGGCCAGGAACCTCCGGGGAGCTGGGCGGGCTGAGGCCGATCAAAATTGAGCCAGAGGATCTG
GACATCATTCAGGTCACCGTCCCAGACCCCTCGCCAACCTCTGAGGAAATGACAGACTCGATGCCTGGGCACCTGCCATCGGAGGATTCT
GGTTATGGGATGGAGATGCTGACAGACAAAGGTCTGAGTGAGGACGCGCGGCCCGAGGAGAGGCCCGTGGAGGACAGCCACGGTGACGTG
ATCCGGCCCCTGCGGAAGCAGGTGGAGCTGCTCTTCAACACACGATACGCCAAGGCCATTGGCATCTCGGAGCCCGTCAAGGTGCCGTAC
TCCAAGTTTCTGATGCACCCGGAGGAGCTGTTTGTGGTGGGACTGCCTGAAGGCATCTCCCTCCGCAGGCCCAACTGCTTCGGGATCGCC
AAGCTCCGGAAGATTCTGGAGGCCAGCAACAGCATCCAGTTTGTCATCAAGAGGCCCGAGCTGCTCACTGAGGGAGTCAAAGAGCCCATC
ATGGATAGTCAAGAGAGGGATTCCGGGGACCCTCTGGTGGACGAGAGCCTGAAGAGACAGGGCTTTCAAGAAAATTATGACGCGAGGCTC
TCACGGATCGACATCGCCAACACACTAAGGGAGCAGGTCCAGGACCTTTTCAATAAGAAATACGGGGAAGCCTTGGGCATCAAGTACCCG
GTCCAGGTCCCCTACAAGCGGATCAAGAGTAACCCCGGCTCCGTGATCATCGAGGGGCTGCCCCCAGGAATCCCGTTCCGAAAGCCCTGT
ACCTTCGGCTCCCAGAACCTGGAGAGGATTCTTGCTGTGGCTGACAAGATCAAGTTCACAGTCACCAGGCCTTTCCAAGGACTCATCCCA
AAGCCTGATGAAGATGACGCCAACAGACTCGGGGAGAAGGTGATCCTGCGGGAGCAGGTGAAGGAACTCTTCAACGAGAAATACGGTGAG
GCCCTGGGCCTGAACCGGCCGGTGCTGGTCCCTTATAAACTAATCCGGGACAGCCCAGACGCCGTGGAGGTCACGGGTCTGCCTGATGAC
ATCCCCTTCCGGAACCCCAACACGTACGACATCCACCGGCTGGAGAAGATCCTGAAGGCCCGAGAGCATGTCCGCATGGTCATCATTAAC
CAGCTCCAACCCTTTGCAGAAATCTGCAATGATGCCAAGGTGCCAGCCAAAGACAGCAGCATTCCCAAGCGCAAGAGAAAGCGGGTCTCG
GAAGGAAATTCCGTCTCCTCTTCCTCCTCGTCTTCCTCTTCCTCGTCCTCTAACCCGGATTCAGTGGCATCGGCCAACCAGATCTCACTC
GTGCAATGGCCAATGTACATGGTGGACTATGCCGGCCTGAACGTGCAGCTCCCGGGACCTCTTAATTACTAGACCTCAGTACTGAATCAG
GACCTCACTCAGAAAGACTAAAGGAAATGTAATTTATGTACAAAATGTATATTCGGATATGTATCGATGCCTTTTAGTTTTTCCAATGAT

>18258_18258_2_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000313669_GTF2IRD1_chr7_73944064_ENST00000424337_length(amino acids)=726AA_BP=145
MPVLVPGTPGPRGLLRTMKLALLLPWACCCLCGSALATGFLYPFSAAALQQHGYPEPGAGSPGSGYASRRHWCHHTVTRTVSCQVQNGSE
TVVQRVYQSCRWPGPCANLVSYRTLIRPTYRVSYRTVTVLEWRCCPGFTGSNCDEAEALGLDHMVPVPYRKIACDPEAVEIVGIPDKIPF
KRPCTYGVPKLKRILEERHSIHFIIKRMFDERIFTGNKFTKDTTKLEPASPPEDTSAEVSRATVLDLAGNARSDKGSMSEDCGPGTSGEL
GGLRPIKIEPEDLDIIQVTVPDPSPTSEEMTDSMPGHLPSEDSGYGMEMLTDKGLSEDARPEERPVEDSHGDVIRPLRKQVELLFNTRYA
KAIGISEPVKVPYSKFLMHPEELFVVGLPEGISLRRPNCFGIAKLRKILEASNSIQFVIKRPELLTEGVKEPIMDSQERDSGDPLVDESL
KRQGFQENYDARLSRIDIANTLREQVQDLFNKKYGEALGIKYPVQVPYKRIKSNPGSVIIEGLPPGIPFRKPCTFGSQNLERILAVADKI
KFTVTRPFQGLIPKPDEDDANRLGEKVILREQVKELFNEKYGEALGLNRPVLVPYKLIRDSPDAVEVTGLPDDIPFRNPNTYDIHRLEKI
LKAREHVRMVIINQLQPFAEICNDAKVPAKDSSIPKRKRKRVSEGNSVSSSSSSSSSSSSNPDSVASANQISLVQWPMYMVDYAGLNVQL

--------------------------------------------------------------
>18258_18258_3_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000313669_GTF2IRD1_chr7_73944064_ENST00000455841_length(transcript)=2493nt_BP=577nt
GAATTTGAAAAGGCGGCCCCGGAGAGGCGTGGGCGCCCCCCACACATTTCCAGCTCGCACCCGGGCTCCGACCGCTCGCCCCGCTCCTCT
CGCTGTGCTCCCGGCCGGTGCCGCGGGTTCGGTCCGGGCGCCGGTGCGCTCCTGCCGGTCCTCGTGCCCGGGACTCCGGGTCCCCGCGGG
CTGCTGCGCACGATGAAGCTGGCCCTGCTCCTGCCCTGGGCGTGTTGCTGCCTCTGCGGGTCGGCGCTGGCCACCGGCTTCCTCTATCCC
TTCTCGGCCGCAGCTCTGCAGCAGCACGGCTACCCCGAGCCCGGCGCCGGCTCCCCTGGCAGCGGCTACGCGAGCCGCCGGCACTGGTGC
CATCACACAGTGACACGGACGGTGTCCTGCCAGGTGCAGAATGGCTCGGAGACGGTGGTCCAGCGCGTGTACCAGAGCTGCCGGTGGCCG
GGGCCCTGCGCCAACCTCGTAAGTTACAGGACTCTGATCAGACCCACCTACAGAGTGTCCTACCGCACGGTGACGGTGCTGGAGTGGAGA
TGCTGCCCTGGCTTCACCGGGAGCAACTGTGATGAGGCGGAAGCCCTGGGCCTGGACCACATGGTCCCCGTGCCCTACCGGAAGATTGCC
TGTGACCCGGAGGCTGTGGAGATCGTGGGCATCCCGGACAAGATCCCCTTCAAGCGCCCCTGCACTTATGGAGTCCCCAAGCTGAAGCGG
ATCCTGGAGGAGCGCCATAGTATCCACTTCATCATTAAGAGGATGTTTGATGAGCGAATTTTCACAGGGAACAAGTTTACCAAAGACACC
ACGAAGCTGGAGCCAGCCAGCCCGCCAGAGGACACCTCTGCAGAGGTCTCTAGGGCCACCGTCCTTGACCTTGCTGGGAATGCTCGGTCA
GACAAGGGCAGCATGTCTGAAGACTGTGGGCCAGGAACCTCCGGGGAGCTGGGCGGGCTGAGGCCGATCAAAATTGAGCCAGAGGATCTG
GACATCATTCAGGTCACCGTCCCAGACCCCTCGCCAACCTCTGAGGAAATGACAGACTCGATGCCTGGGCACCTGCCATCGGAGGATTCT
GGTTATGGGATGGAGATGCTGACAGACAAAGGTCTGAGTGAGGACGCGCGGCCCGAGGAGAGGCCCGTGGAGGACAGCCACGGTGACGTG
ATCCGGCCCCTGCGGAAGCAGGTGGAGCTGCTCTTCAACACACGATACGCCAAGGCCATTGGCATCTCGGAGCCCGTCAAGGTGCCGTAC
TCCAAGTTTCTGATGCACCCGGAGGAGCTGTTTGTGGTGGGACTGCCTGAAGGCATCTCCCTCCGCAGGCCCAACTGCTTCGGGATCGCC
AAGCTCCGGAAGATTCTGGAGGCCAGCAACAGCATCCAGTTTGTCATCAAGAGGCCCGAGCTGCTCACTGAGGGAGTCAAAGAGCCCATC
ATGGATAGTCAAGAGAGGGATTCCGGGGACCCTCTGGTGGACGAGAGCCTGAAGAGACAGGGCTTTCAAGAAAATTATGACGCGAGGCTC
TCACGGATCGACATCGCCAACACACTAAGGGAGCAGGTCCAGGACCTTTTCAATAAGAAATACGGGGAAGCCTTGGGCATCAAGTACCCG
GTCCAGGTCCCCTACAAGCGGATCAAGAGTAACCCCGGCTCCGTGATCATCGAGGGGCTGCCCCCAGGAATCCCGTTCCGAAAGCCCTGT
ACCTTCGGCTCCCAGAACCTGGAGAGGATTCTTGCTGTGGCTGACAAGATCAAGTTCACAGTCACCAGGCCTTTCCAAGGACTCATCCCA
AAGCCTGATGAAGATGACGCCAACAGACTCGGGGAGAAGGTGATCCTGCGGGAGCAGGTGAAGGAACTCTTCAACGAGAAATACGGTGAG
GCCCTGGGCCTGAACCGGCCGGTGCTGGTCCCTTATAAACTAATCCGGGACAGCCCAGACGCCGTGGAGGTCACGGGTCTGCCTGATGAC
ATCCCCTTCCGGAACCCCAACACGTACGACATCCACCGGCTGGAGAAGATCCTGAAGGCCCGAGAGCATGTCCGCATGGTCATCATTAAC
CAGCTCCAACCCTTTGCAGAAATCTGCAATGATGCCAAGGTGCCAGCCAAAGACAGCAGCATTCCCAAGCGCAAGAGAAAGCGGGTCTCG
GAAGGAAATTCCGTCTCCTCTTCCTCCTCGTCTTCCTCTTCCTCGTCCTCTAACCCGGATTCAGTGGCATCGGCCAACCAGATCTCACTC
GTGCAATGGCCAATGTACATGGTGGACTATGCCGGCCTGAACGTGCAGCTCCCGGGACCTCTTAATTACTAGACCTCAGTACTGAATCAG
GACCTCACTCAGAAAGACTAAAGGAAATGTAATTTATGTACAAAATGTATATTCGGATATGTATCGATGCCTTTTAGTTTTTCCAATGAT

>18258_18258_3_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000313669_GTF2IRD1_chr7_73944064_ENST00000455841_length(amino acids)=726AA_BP=145
MPVLVPGTPGPRGLLRTMKLALLLPWACCCLCGSALATGFLYPFSAAALQQHGYPEPGAGSPGSGYASRRHWCHHTVTRTVSCQVQNGSE
TVVQRVYQSCRWPGPCANLVSYRTLIRPTYRVSYRTVTVLEWRCCPGFTGSNCDEAEALGLDHMVPVPYRKIACDPEAVEIVGIPDKIPF
KRPCTYGVPKLKRILEERHSIHFIIKRMFDERIFTGNKFTKDTTKLEPASPPEDTSAEVSRATVLDLAGNARSDKGSMSEDCGPGTSGEL
GGLRPIKIEPEDLDIIQVTVPDPSPTSEEMTDSMPGHLPSEDSGYGMEMLTDKGLSEDARPEERPVEDSHGDVIRPLRKQVELLFNTRYA
KAIGISEPVKVPYSKFLMHPEELFVVGLPEGISLRRPNCFGIAKLRKILEASNSIQFVIKRPELLTEGVKEPIMDSQERDSGDPLVDESL
KRQGFQENYDARLSRIDIANTLREQVQDLFNKKYGEALGIKYPVQVPYKRIKSNPGSVIIEGLPPGIPFRKPCTFGSQNLERILAVADKI
KFTVTRPFQGLIPKPDEDDANRLGEKVILREQVKELFNEKYGEALGLNRPVLVPYKLIRDSPDAVEVTGLPDDIPFRNPNTYDIHRLEKI
LKAREHVRMVIINQLQPFAEICNDAKVPAKDSSIPKRKRKRVSEGNSVSSSSSSSSSSSSNPDSVASANQISLVQWPMYMVDYAGLNVQL

--------------------------------------------------------------
>18258_18258_4_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000313669_GTF2IRD1_chr7_73944064_ENST00000476977_length(transcript)=3664nt_BP=577nt
GAATTTGAAAAGGCGGCCCCGGAGAGGCGTGGGCGCCCCCCACACATTTCCAGCTCGCACCCGGGCTCCGACCGCTCGCCCCGCTCCTCT
CGCTGTGCTCCCGGCCGGTGCCGCGGGTTCGGTCCGGGCGCCGGTGCGCTCCTGCCGGTCCTCGTGCCCGGGACTCCGGGTCCCCGCGGG
CTGCTGCGCACGATGAAGCTGGCCCTGCTCCTGCCCTGGGCGTGTTGCTGCCTCTGCGGGTCGGCGCTGGCCACCGGCTTCCTCTATCCC
TTCTCGGCCGCAGCTCTGCAGCAGCACGGCTACCCCGAGCCCGGCGCCGGCTCCCCTGGCAGCGGCTACGCGAGCCGCCGGCACTGGTGC
CATCACACAGTGACACGGACGGTGTCCTGCCAGGTGCAGAATGGCTCGGAGACGGTGGTCCAGCGCGTGTACCAGAGCTGCCGGTGGCCG
GGGCCCTGCGCCAACCTCGTAAGTTACAGGACTCTGATCAGACCCACCTACAGAGTGTCCTACCGCACGGTGACGGTGCTGGAGTGGAGA
TGCTGCCCTGGCTTCACCGGGAGCAACTGTGATGAGGCGGAAGCCCTGGGCCTGGACCACATGGTCCCCGTGCCCTACCGGAAGATTGCC
TGTGACCCGGAGGCTGTGGAGATCGTGGGCATCCCGGACAAGATCCCCTTCAAGCGCCCCTGCACTTATGGAGTCCCCAAGCTGAAGCGG
ATCCTGGAGGAGCGCCATAGTATCCACTTCATCATTAAGAGGATGTTTGATGAGCGAATTTTCACAGGGAACAAGTTTACCAAAGACACC
ACGAAGCTGGAGCCAGCCAGCCCGCCAGAGGACACCTCTGCAGAGGTCTCTAGGGCCACCGTCCTTGACCTTGCTGGGAATGCTCGGTCA
GACAAGGGCAGCATGTCTGAAGACTGTGGGCCAGGAACCTCCGGGGAGCTGGGCGGGCTGAGGCCGATCAAAATTGAGCCAGAGGATCTG
GACATCATTCAGGTCACCGTCCCAGACCCCTCGCCAACCTCTGAGGAAATGACAGACTCGATGCCTGGGCACCTGCCATCGGAGGATTCT
GGTTATGGGATGGAGATGCTGACAGACAAAGGTCTGAGTGAGGACGCGCGGCCCGAGGAGAGGCCCGTGGAGGACAGCCACGGTGACGTG
ATCCGGCCCCTGCGGAAGCAGGTGGAGCTGCTCTTCAACACACGATACGCCAAGGCCATTGGCATCTCGGAGCCCGTCAAGGTGCCGTAC
TCCAAGTTTCTGATGCACCCGGAGGAGCTGTTTGTGGTGGGACTGCCTGAAGGCATCTCCCTCCGCAGGCCCAACTGCTTCGGGATCGCC
AAGCTCCGGAAGATTCTGGAGGCCAGCAACAGCATCCAGTTTGTCATCAAGAGGCCCGAGCTGCTCACTGAGGGAGTCAAAGAGCCCATC
ATGGATAGTCAAGAGAGGGATTCCGGGGACCCTCTGGTGGACGAGAGCCTGAAGAGACAGGGCTTTCAAGAAAATTATGACGCGAGGCTC
TCACGGATCGACATCGCCAACACACTAAGGGAGCAGGTCCAGGACCTTTTCAATAAGAAATACGGGGAAGCCTTGGGCATCAAGTACCCG
GTCCAGGTCCCCTACAAGCGGATCAAGAGTAACCCCGGCTCCGTGATCATCGAGGGGCTGCCCCCAGGAATCCCGTTCCGAAAGCCCTGT
ACCTTCGGCTCCCAGAACCTGGAGAGGATTCTTGCTGTGGCTGACAAGATCAAGTTCACAGTCACCAGGCCTTTCCAAGGACTCATCCCA
AAGCCTGATGAAGATGACGCCAACAGACTCGGGGAGAAGGTGATCCTGCGGGAGCAGGTGAAGGAACTCTTCAACGAGAAATACGGTGAG
GCCCTGGGCCTGAACCGGCCGGTGCTGGTCCCTTATAAACTAATCCGGGACAGCCCAGACGCCGTGGAGGTCACGGGTCTGCCTGATGAC
ATCCCCTTCCGGAACCCCAACACGTACGACATCCACCGGCTGGAGAAGATCCTGAAGGCCCGAGAGCATGTCCGCATGGTCATCATTAAC
CAGCTCCAACCCTTTGCAGAAATCTGCAATGATGCCAAGGTGCCAGCCAAAGACAGCAGCATTCCCAAGCGCAAGAGAAAGCGGGTCTCG
GAAGGAAATTCCGTCTCCTCTTCCTCCTCGTCTTCCTCTTCCTCGTCCTCTAACCCGGATTCAGTGGCATCGGCCAACCAGATCTCACTC
GTGGTAAAGTTGCACCGATTTGGACTCCGGCACTCATCTCTGTGGCCCTCACCCCTCTGTCTGGCAGGGCCGTCTACTCTGGGATGTGGG
CCCAGGGGACGGGGAGGCACTGGGCTTTGAGTGGGGACCTTCCGGCCTCGGGGGTTATAGATGCATCCACCTGTCTCACCCAAGAGGTAG
CCCATCCTTCTCGTGGGGTACTCACAGGCACTCAGGCAGGAATTCACATCCTCGCTGGGCAGATGGGCCGGCTGAGGTCCACCTGCCCAC
ACCCTTCAGCCGCACCAGAGCTGGAGACATGAAAAGACATGGCTGGCGGGTGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGCAGGT
CAAGTCGGGTGGATCACCTGAGGTCAGGAGTTTGAGACCAGGCTGACCAACACGGGGAAACCCCATCTCTACTAAAAATACAAAATTAGC
CGGGCAAAGTGGGGCATAGTGGCTCATGCCTGTAATCCCAGCTACTTGGAAGGCTGAGATAGGAGAATCGCTTGAACCTGGGAGGCAGAG
GTTGCAATGAGCCGAGGTCGCGCCATTGCACTGCAGCCTGGGCAACAAGAGTGAAACACTGTCTCAGAAAAAAAAATTAGCCAGGCATGG
TGGCACGTGCCTGTGGTCGCAGCTACTTGGGAGGCTGGGGCAGGAGGATCATTTGAGCCCAAGGGGATTGAGGCTGCAGTGAGCCAAGAT
CGTCCCATTGCACTCCAGCCTGGGCAAGAGAACGAGACTCCATCTCAAAAATAAATAAATAGGCTGGGTGTGGTGGCTCACGCCTGTAAT
CCTAGCACTTTGGGAGGCCGAGGCAGGCGGATCACTTGAGGCTCAGGAGTTCAAGACCAGCCTGGCCAACATGGCAAAACCCCGTCTCTA
CTAAAAATAGAAAAATTAGCCGGGCATGGTGGCGGGCGCCTATAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGACTCGCTTGAACCCG
CGGGGCCAAGGTTGCAGTGAGCCGAGATTGCATCACTGCACTCCAGCCTGGGCAGAAGAGTGAAACTCCATCTCAAAAAAATAAAAAATA
TAAATAAATAGCCTCTGAGAAAGCTCTTCCAAAAGCAGAACTAAGCATTTTGGGTTTGTTCCGCATCACCTGGAGTCCTAATCCAGTCCC
TTTGTCCCTCTCTCTAGCAATGGCCAATGTACATGGTGGACTATGCCGGCCTGAACGTGCAGCTCCCGGGACCTCTTAATTACTAGACCT
CAGTACTGAATCAGGACCTCACTCAGAAAGACTAAAGGAAATGTAATTTATGTACAAAATGTATATTCGGATATGTATCGATGCCTTTTA

>18258_18258_4_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000313669_GTF2IRD1_chr7_73944064_ENST00000476977_length(amino acids)=742AA_BP=145
MPVLVPGTPGPRGLLRTMKLALLLPWACCCLCGSALATGFLYPFSAAALQQHGYPEPGAGSPGSGYASRRHWCHHTVTRTVSCQVQNGSE
TVVQRVYQSCRWPGPCANLVSYRTLIRPTYRVSYRTVTVLEWRCCPGFTGSNCDEAEALGLDHMVPVPYRKIACDPEAVEIVGIPDKIPF
KRPCTYGVPKLKRILEERHSIHFIIKRMFDERIFTGNKFTKDTTKLEPASPPEDTSAEVSRATVLDLAGNARSDKGSMSEDCGPGTSGEL
GGLRPIKIEPEDLDIIQVTVPDPSPTSEEMTDSMPGHLPSEDSGYGMEMLTDKGLSEDARPEERPVEDSHGDVIRPLRKQVELLFNTRYA
KAIGISEPVKVPYSKFLMHPEELFVVGLPEGISLRRPNCFGIAKLRKILEASNSIQFVIKRPELLTEGVKEPIMDSQERDSGDPLVDESL
KRQGFQENYDARLSRIDIANTLREQVQDLFNKKYGEALGIKYPVQVPYKRIKSNPGSVIIEGLPPGIPFRKPCTFGSQNLERILAVADKI
KFTVTRPFQGLIPKPDEDDANRLGEKVILREQVKELFNEKYGEALGLNRPVLVPYKLIRDSPDAVEVTGLPDDIPFRNPNTYDIHRLEKI
LKAREHVRMVIINQLQPFAEICNDAKVPAKDSSIPKRKRKRVSEGNSVSSSSSSSSSSSSNPDSVASANQISLVVKLHRFGLRHSSLWPS

--------------------------------------------------------------
>18258_18258_5_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000397927_GTF2IRD1_chr7_73944064_ENST00000265755_length(transcript)=2545nt_BP=598nt
CGGCTTAGAGCCCACCTCGCCGAATTTGAAAAGGCGGCCCCGGAGAGGCGTGGGCGCCCCCCACACATTTCCAGCTCGCACCCGGGCTCC
GACCGCTCGCCCCGCTCCTCTCGCTGTGCTCCCGGCCGGTGCCGCGGGTTCGGTCCGGGCGCCGGTGCGCTCCTGCCGGTCCTCGTGCCC
GGGACTCCGGGTCCCCGCGGGCTGCTGCGCACGATGAAGCTGGCCCTGCTCCTGCCCTGGGCGTGTTGCTGCCTCTGCGGGTCGGCGCTG
GCCACCGGCTTCCTCTATCCCTTCTCGGCCGCAGCTCTGCAGCAGCACGGCTACCCCGAGCCCGGCGCCGGCTCCCCTGGCAGCGGCTAC
GCGAGCCGCCGGCACTGGTGCCATCACACAGTGACACGGACGGTGTCCTGCCAGGTGCAGAATGGCTCGGAGACGGTGGTCCAGCGCGTG
TACCAGAGCTGCCGGTGGCCGGGGCCCTGCGCCAACCTCGTAAGTTACAGGACTCTGATCAGACCCACCTACAGAGTGTCCTACCGCACG
GTGACGGTGCTGGAGTGGAGATGCTGCCCTGGCTTCACCGGGAGCAACTGTGATGAGGCGGAAGCCCTGGGCCTGGACCACATGGTCCCC
GTGCCCTACCGGAAGATTGCCTGTGACCCGGAGGCTGTGGAGATCGTGGGCATCCCGGACAAGATCCCCTTCAAGCGCCCCTGCACTTAT
GGAGTCCCCAAGCTGAAGCGGATCCTGGAGGAGCGCCATAGTATCCACTTCATCATTAAGAGGATGTTTGATGAGCGAATTTTCACAGGG
AACAAGTTTACCAAAGACACCACGAAGCTGGAGCCAGCCAGCCCGCCAGAGGACACCTCTGCAGAGGTCTCTAGGGCCACCGTCCTTGAC
CTTGCTGGGAATGCTCGGTCAGACAAGGGCAGCATGTCTGAAGACTGTGGGCCAGGAACCTCCGGGGAGCTGGGCGGGCTGAGGCCGATC
AAAATTGAGCCAGAGGATCTGGACATCATTCAGGTCACCGTCCCAGACCCCTCGCCAACCTCTGAGGAAATGACAGACTCGATGCCTGGG
CACCTGCCATCGGAGGATTCTGGTTATGGGATGGAGATGCTGACAGACAAAGGTCTGAGTGAGGACGCGCGGCCCGAGGAGAGGCCCGTG
GAGGACAGCCACGGTGACGTGATCCGGCCCCTGCGGAAGCAGGTGGAGCTGCTCTTCAACACACGATACGCCAAGGCCATTGGCATCTCG
GAGCCCGTCAAGGTGCCGTACTCCAAGTTTCTGATGCACCCGGAGGAGCTGTTTGTGGTGGGACTGCCTGAAGGCATCTCCCTCCGCAGG
CCCAACTGCTTCGGGATCGCCAAGCTCCGGAAGATTCTGGAGGCCAGCAACAGCATCCAGTTTGTCATCAAGAGGCCCGAGCTGCTCACT
GAGGGAGTCAAAGAGCCCATCATGGATAGTCAAGGAACTGCCTCCTCACTTGGCTTCTCTCCCCCTGCCCTGCCCCCAGAGAGGGATTCC
GGGGACCCTCTGGTGGACGAGAGCCTGAAGAGACAGGGCTTTCAAGAAAATTATGACGCGAGGCTCTCACGGATCGACATCGCCAACACA
CTAAGGGAGCAGGTCCAGGACCTTTTCAATAAGAAATACGGGGAAGCCTTGGGCATCAAGTACCCGGTCCAGGTCCCCTACAAGCGGATC
AAGAGTAACCCCGGCTCCGTGATCATCGAGGGGCTGCCCCCAGGAATCCCGTTCCGAAAGCCCTGTACCTTCGGCTCCCAGAACCTGGAG
AGGATTCTTGCTGTGGCTGACAAGATCAAGTTCACAGTCACCAGGCCTTTCCAAGGACTCATCCCAAAGCCTGATGAAGATGACGCCAAC
AGACTCGGGGAGAAGGTGATCCTGCGGGAGCAGGTGAAGGAACTCTTCAACGAGAAATACGGTGAGGCCCTGGGCCTGAACCGGCCGGTG
CTGGTCCCTTATAAACTAATCCGGGACAGCCCAGACGCCGTGGAGGTCACGGGTCTGCCTGATGACATCCCCTTCCGGAACCCCAACACG
TACGACATCCACCGGCTGGAGAAGATCCTGAAGGCCCGAGAGCATGTCCGCATGGTCATCATTAACCAGCTCCAACCCTTTGCAGAAATC
TGCAATGATGCCAAGGTGCCAGCCAAAGACAGCAGCATTCCCAAGCGCAAGAGAAAGCGGGTCTCGGAAGGAAATTCCGTCTCCTCTTCC
TCCTCGTCTTCCTCTTCCTCGTCCTCTAACCCGGATTCAGTGGCATCGGCCAACCAGATCTCACTCGTGCAATGGCCAATGTACATGGTG
GACTATGCCGGCCTGAACGTGCAGCTCCCGGGACCTCTTAATTACTAGACCTCAGTACTGAATCAGGACCTCACTCAGAAAGACTAAAGG
AAATGTAATTTATGTACAAAATGTATATTCGGATATGTATCGATGCCTTTTAGTTTTTCCAATGATTTTTACACTATATTCCTGCCACCA

>18258_18258_5_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000397927_GTF2IRD1_chr7_73944064_ENST00000265755_length(amino acids)=741AA_BP=145
MPVLVPGTPGPRGLLRTMKLALLLPWACCCLCGSALATGFLYPFSAAALQQHGYPEPGAGSPGSGYASRRHWCHHTVTRTVSCQVQNGSE
TVVQRVYQSCRWPGPCANLVSYRTLIRPTYRVSYRTVTVLEWRCCPGFTGSNCDEAEALGLDHMVPVPYRKIACDPEAVEIVGIPDKIPF
KRPCTYGVPKLKRILEERHSIHFIIKRMFDERIFTGNKFTKDTTKLEPASPPEDTSAEVSRATVLDLAGNARSDKGSMSEDCGPGTSGEL
GGLRPIKIEPEDLDIIQVTVPDPSPTSEEMTDSMPGHLPSEDSGYGMEMLTDKGLSEDARPEERPVEDSHGDVIRPLRKQVELLFNTRYA
KAIGISEPVKVPYSKFLMHPEELFVVGLPEGISLRRPNCFGIAKLRKILEASNSIQFVIKRPELLTEGVKEPIMDSQGTASSLGFSPPAL
PPERDSGDPLVDESLKRQGFQENYDARLSRIDIANTLREQVQDLFNKKYGEALGIKYPVQVPYKRIKSNPGSVIIEGLPPGIPFRKPCTF
GSQNLERILAVADKIKFTVTRPFQGLIPKPDEDDANRLGEKVILREQVKELFNEKYGEALGLNRPVLVPYKLIRDSPDAVEVTGLPDDIP
FRNPNTYDIHRLEKILKAREHVRMVIINQLQPFAEICNDAKVPAKDSSIPKRKRKRVSEGNSVSSSSSSSSSSSSNPDSVASANQISLVQ

--------------------------------------------------------------
>18258_18258_6_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000397927_GTF2IRD1_chr7_73944064_ENST00000424337_length(transcript)=2511nt_BP=598nt
CGGCTTAGAGCCCACCTCGCCGAATTTGAAAAGGCGGCCCCGGAGAGGCGTGGGCGCCCCCCACACATTTCCAGCTCGCACCCGGGCTCC
GACCGCTCGCCCCGCTCCTCTCGCTGTGCTCCCGGCCGGTGCCGCGGGTTCGGTCCGGGCGCCGGTGCGCTCCTGCCGGTCCTCGTGCCC
GGGACTCCGGGTCCCCGCGGGCTGCTGCGCACGATGAAGCTGGCCCTGCTCCTGCCCTGGGCGTGTTGCTGCCTCTGCGGGTCGGCGCTG
GCCACCGGCTTCCTCTATCCCTTCTCGGCCGCAGCTCTGCAGCAGCACGGCTACCCCGAGCCCGGCGCCGGCTCCCCTGGCAGCGGCTAC
GCGAGCCGCCGGCACTGGTGCCATCACACAGTGACACGGACGGTGTCCTGCCAGGTGCAGAATGGCTCGGAGACGGTGGTCCAGCGCGTG
TACCAGAGCTGCCGGTGGCCGGGGCCCTGCGCCAACCTCGTAAGTTACAGGACTCTGATCAGACCCACCTACAGAGTGTCCTACCGCACG
GTGACGGTGCTGGAGTGGAGATGCTGCCCTGGCTTCACCGGGAGCAACTGTGATGAGGCGGAAGCCCTGGGCCTGGACCACATGGTCCCC
GTGCCCTACCGGAAGATTGCCTGTGACCCGGAGGCTGTGGAGATCGTGGGCATCCCGGACAAGATCCCCTTCAAGCGCCCCTGCACTTAT
GGAGTCCCCAAGCTGAAGCGGATCCTGGAGGAGCGCCATAGTATCCACTTCATCATTAAGAGGATGTTTGATGAGCGAATTTTCACAGGG
AACAAGTTTACCAAAGACACCACGAAGCTGGAGCCAGCCAGCCCGCCAGAGGACACCTCTGCAGAGGTCTCTAGGGCCACCGTCCTTGAC
CTTGCTGGGAATGCTCGGTCAGACAAGGGCAGCATGTCTGAAGACTGTGGGCCAGGAACCTCCGGGGAGCTGGGCGGGCTGAGGCCGATC
AAAATTGAGCCAGAGGATCTGGACATCATTCAGGTCACCGTCCCAGACCCCTCGCCAACCTCTGAGGAAATGACAGACTCGATGCCTGGG
CACCTGCCATCGGAGGATTCTGGTTATGGGATGGAGATGCTGACAGACAAAGGTCTGAGTGAGGACGCGCGGCCCGAGGAGAGGCCCGTG
GAGGACAGCCACGGTGACGTGATCCGGCCCCTGCGGAAGCAGGTGGAGCTGCTCTTCAACACACGATACGCCAAGGCCATTGGCATCTCG
GAGCCCGTCAAGGTGCCGTACTCCAAGTTTCTGATGCACCCGGAGGAGCTGTTTGTGGTGGGACTGCCTGAAGGCATCTCCCTCCGCAGG
CCCAACTGCTTCGGGATCGCCAAGCTCCGGAAGATTCTGGAGGCCAGCAACAGCATCCAGTTTGTCATCAAGAGGCCCGAGCTGCTCACT
GAGGGAGTCAAAGAGCCCATCATGGATAGTCAAGAGAGGGATTCCGGGGACCCTCTGGTGGACGAGAGCCTGAAGAGACAGGGCTTTCAA
GAAAATTATGACGCGAGGCTCTCACGGATCGACATCGCCAACACACTAAGGGAGCAGGTCCAGGACCTTTTCAATAAGAAATACGGGGAA
GCCTTGGGCATCAAGTACCCGGTCCAGGTCCCCTACAAGCGGATCAAGAGTAACCCCGGCTCCGTGATCATCGAGGGGCTGCCCCCAGGA
ATCCCGTTCCGAAAGCCCTGTACCTTCGGCTCCCAGAACCTGGAGAGGATTCTTGCTGTGGCTGACAAGATCAAGTTCACAGTCACCAGG
CCTTTCCAAGGACTCATCCCAAAGCCTGATGAAGATGACGCCAACAGACTCGGGGAGAAGGTGATCCTGCGGGAGCAGGTGAAGGAACTC
TTCAACGAGAAATACGGTGAGGCCCTGGGCCTGAACCGGCCGGTGCTGGTCCCTTATAAACTAATCCGGGACAGCCCAGACGCCGTGGAG
GTCACGGGTCTGCCTGATGACATCCCCTTCCGGAACCCCAACACGTACGACATCCACCGGCTGGAGAAGATCCTGAAGGCCCGAGAGCAT
GTCCGCATGGTCATCATTAACCAGCTCCAACCCTTTGCAGAAATCTGCAATGATGCCAAGGTGCCAGCCAAAGACAGCAGCATTCCCAAG
CGCAAGAGAAAGCGGGTCTCGGAAGGAAATTCCGTCTCCTCTTCCTCCTCGTCTTCCTCTTCCTCGTCCTCTAACCCGGATTCAGTGGCA
TCGGCCAACCAGATCTCACTCGTGCAATGGCCAATGTACATGGTGGACTATGCCGGCCTGAACGTGCAGCTCCCGGGACCTCTTAATTAC
TAGACCTCAGTACTGAATCAGGACCTCACTCAGAAAGACTAAAGGAAATGTAATTTATGTACAAAATGTATATTCGGATATGTATCGATG

>18258_18258_6_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000397927_GTF2IRD1_chr7_73944064_ENST00000424337_length(amino acids)=726AA_BP=145
MPVLVPGTPGPRGLLRTMKLALLLPWACCCLCGSALATGFLYPFSAAALQQHGYPEPGAGSPGSGYASRRHWCHHTVTRTVSCQVQNGSE
TVVQRVYQSCRWPGPCANLVSYRTLIRPTYRVSYRTVTVLEWRCCPGFTGSNCDEAEALGLDHMVPVPYRKIACDPEAVEIVGIPDKIPF
KRPCTYGVPKLKRILEERHSIHFIIKRMFDERIFTGNKFTKDTTKLEPASPPEDTSAEVSRATVLDLAGNARSDKGSMSEDCGPGTSGEL
GGLRPIKIEPEDLDIIQVTVPDPSPTSEEMTDSMPGHLPSEDSGYGMEMLTDKGLSEDARPEERPVEDSHGDVIRPLRKQVELLFNTRYA
KAIGISEPVKVPYSKFLMHPEELFVVGLPEGISLRRPNCFGIAKLRKILEASNSIQFVIKRPELLTEGVKEPIMDSQERDSGDPLVDESL
KRQGFQENYDARLSRIDIANTLREQVQDLFNKKYGEALGIKYPVQVPYKRIKSNPGSVIIEGLPPGIPFRKPCTFGSQNLERILAVADKI
KFTVTRPFQGLIPKPDEDDANRLGEKVILREQVKELFNEKYGEALGLNRPVLVPYKLIRDSPDAVEVTGLPDDIPFRNPNTYDIHRLEKI
LKAREHVRMVIINQLQPFAEICNDAKVPAKDSSIPKRKRKRVSEGNSVSSSSSSSSSSSSNPDSVASANQISLVQWPMYMVDYAGLNVQL

--------------------------------------------------------------
>18258_18258_7_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000397927_GTF2IRD1_chr7_73944064_ENST00000455841_length(transcript)=2514nt_BP=598nt
CGGCTTAGAGCCCACCTCGCCGAATTTGAAAAGGCGGCCCCGGAGAGGCGTGGGCGCCCCCCACACATTTCCAGCTCGCACCCGGGCTCC
GACCGCTCGCCCCGCTCCTCTCGCTGTGCTCCCGGCCGGTGCCGCGGGTTCGGTCCGGGCGCCGGTGCGCTCCTGCCGGTCCTCGTGCCC
GGGACTCCGGGTCCCCGCGGGCTGCTGCGCACGATGAAGCTGGCCCTGCTCCTGCCCTGGGCGTGTTGCTGCCTCTGCGGGTCGGCGCTG
GCCACCGGCTTCCTCTATCCCTTCTCGGCCGCAGCTCTGCAGCAGCACGGCTACCCCGAGCCCGGCGCCGGCTCCCCTGGCAGCGGCTAC
GCGAGCCGCCGGCACTGGTGCCATCACACAGTGACACGGACGGTGTCCTGCCAGGTGCAGAATGGCTCGGAGACGGTGGTCCAGCGCGTG
TACCAGAGCTGCCGGTGGCCGGGGCCCTGCGCCAACCTCGTAAGTTACAGGACTCTGATCAGACCCACCTACAGAGTGTCCTACCGCACG
GTGACGGTGCTGGAGTGGAGATGCTGCCCTGGCTTCACCGGGAGCAACTGTGATGAGGCGGAAGCCCTGGGCCTGGACCACATGGTCCCC
GTGCCCTACCGGAAGATTGCCTGTGACCCGGAGGCTGTGGAGATCGTGGGCATCCCGGACAAGATCCCCTTCAAGCGCCCCTGCACTTAT
GGAGTCCCCAAGCTGAAGCGGATCCTGGAGGAGCGCCATAGTATCCACTTCATCATTAAGAGGATGTTTGATGAGCGAATTTTCACAGGG
AACAAGTTTACCAAAGACACCACGAAGCTGGAGCCAGCCAGCCCGCCAGAGGACACCTCTGCAGAGGTCTCTAGGGCCACCGTCCTTGAC
CTTGCTGGGAATGCTCGGTCAGACAAGGGCAGCATGTCTGAAGACTGTGGGCCAGGAACCTCCGGGGAGCTGGGCGGGCTGAGGCCGATC
AAAATTGAGCCAGAGGATCTGGACATCATTCAGGTCACCGTCCCAGACCCCTCGCCAACCTCTGAGGAAATGACAGACTCGATGCCTGGG
CACCTGCCATCGGAGGATTCTGGTTATGGGATGGAGATGCTGACAGACAAAGGTCTGAGTGAGGACGCGCGGCCCGAGGAGAGGCCCGTG
GAGGACAGCCACGGTGACGTGATCCGGCCCCTGCGGAAGCAGGTGGAGCTGCTCTTCAACACACGATACGCCAAGGCCATTGGCATCTCG
GAGCCCGTCAAGGTGCCGTACTCCAAGTTTCTGATGCACCCGGAGGAGCTGTTTGTGGTGGGACTGCCTGAAGGCATCTCCCTCCGCAGG
CCCAACTGCTTCGGGATCGCCAAGCTCCGGAAGATTCTGGAGGCCAGCAACAGCATCCAGTTTGTCATCAAGAGGCCCGAGCTGCTCACT
GAGGGAGTCAAAGAGCCCATCATGGATAGTCAAGAGAGGGATTCCGGGGACCCTCTGGTGGACGAGAGCCTGAAGAGACAGGGCTTTCAA
GAAAATTATGACGCGAGGCTCTCACGGATCGACATCGCCAACACACTAAGGGAGCAGGTCCAGGACCTTTTCAATAAGAAATACGGGGAA
GCCTTGGGCATCAAGTACCCGGTCCAGGTCCCCTACAAGCGGATCAAGAGTAACCCCGGCTCCGTGATCATCGAGGGGCTGCCCCCAGGA
ATCCCGTTCCGAAAGCCCTGTACCTTCGGCTCCCAGAACCTGGAGAGGATTCTTGCTGTGGCTGACAAGATCAAGTTCACAGTCACCAGG
CCTTTCCAAGGACTCATCCCAAAGCCTGATGAAGATGACGCCAACAGACTCGGGGAGAAGGTGATCCTGCGGGAGCAGGTGAAGGAACTC
TTCAACGAGAAATACGGTGAGGCCCTGGGCCTGAACCGGCCGGTGCTGGTCCCTTATAAACTAATCCGGGACAGCCCAGACGCCGTGGAG
GTCACGGGTCTGCCTGATGACATCCCCTTCCGGAACCCCAACACGTACGACATCCACCGGCTGGAGAAGATCCTGAAGGCCCGAGAGCAT
GTCCGCATGGTCATCATTAACCAGCTCCAACCCTTTGCAGAAATCTGCAATGATGCCAAGGTGCCAGCCAAAGACAGCAGCATTCCCAAG
CGCAAGAGAAAGCGGGTCTCGGAAGGAAATTCCGTCTCCTCTTCCTCCTCGTCTTCCTCTTCCTCGTCCTCTAACCCGGATTCAGTGGCA
TCGGCCAACCAGATCTCACTCGTGCAATGGCCAATGTACATGGTGGACTATGCCGGCCTGAACGTGCAGCTCCCGGGACCTCTTAATTAC
TAGACCTCAGTACTGAATCAGGACCTCACTCAGAAAGACTAAAGGAAATGTAATTTATGTACAAAATGTATATTCGGATATGTATCGATG

>18258_18258_7_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000397927_GTF2IRD1_chr7_73944064_ENST00000455841_length(amino acids)=726AA_BP=145
MPVLVPGTPGPRGLLRTMKLALLLPWACCCLCGSALATGFLYPFSAAALQQHGYPEPGAGSPGSGYASRRHWCHHTVTRTVSCQVQNGSE
TVVQRVYQSCRWPGPCANLVSYRTLIRPTYRVSYRTVTVLEWRCCPGFTGSNCDEAEALGLDHMVPVPYRKIACDPEAVEIVGIPDKIPF
KRPCTYGVPKLKRILEERHSIHFIIKRMFDERIFTGNKFTKDTTKLEPASPPEDTSAEVSRATVLDLAGNARSDKGSMSEDCGPGTSGEL
GGLRPIKIEPEDLDIIQVTVPDPSPTSEEMTDSMPGHLPSEDSGYGMEMLTDKGLSEDARPEERPVEDSHGDVIRPLRKQVELLFNTRYA
KAIGISEPVKVPYSKFLMHPEELFVVGLPEGISLRRPNCFGIAKLRKILEASNSIQFVIKRPELLTEGVKEPIMDSQERDSGDPLVDESL
KRQGFQENYDARLSRIDIANTLREQVQDLFNKKYGEALGIKYPVQVPYKRIKSNPGSVIIEGLPPGIPFRKPCTFGSQNLERILAVADKI
KFTVTRPFQGLIPKPDEDDANRLGEKVILREQVKELFNEKYGEALGLNRPVLVPYKLIRDSPDAVEVTGLPDDIPFRNPNTYDIHRLEKI
LKAREHVRMVIINQLQPFAEICNDAKVPAKDSSIPKRKRKRVSEGNSVSSSSSSSSSSSSNPDSVASANQISLVQWPMYMVDYAGLNVQL

--------------------------------------------------------------
>18258_18258_8_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000397927_GTF2IRD1_chr7_73944064_ENST00000476977_length(transcript)=3685nt_BP=598nt
CGGCTTAGAGCCCACCTCGCCGAATTTGAAAAGGCGGCCCCGGAGAGGCGTGGGCGCCCCCCACACATTTCCAGCTCGCACCCGGGCTCC
GACCGCTCGCCCCGCTCCTCTCGCTGTGCTCCCGGCCGGTGCCGCGGGTTCGGTCCGGGCGCCGGTGCGCTCCTGCCGGTCCTCGTGCCC
GGGACTCCGGGTCCCCGCGGGCTGCTGCGCACGATGAAGCTGGCCCTGCTCCTGCCCTGGGCGTGTTGCTGCCTCTGCGGGTCGGCGCTG
GCCACCGGCTTCCTCTATCCCTTCTCGGCCGCAGCTCTGCAGCAGCACGGCTACCCCGAGCCCGGCGCCGGCTCCCCTGGCAGCGGCTAC
GCGAGCCGCCGGCACTGGTGCCATCACACAGTGACACGGACGGTGTCCTGCCAGGTGCAGAATGGCTCGGAGACGGTGGTCCAGCGCGTG
TACCAGAGCTGCCGGTGGCCGGGGCCCTGCGCCAACCTCGTAAGTTACAGGACTCTGATCAGACCCACCTACAGAGTGTCCTACCGCACG
GTGACGGTGCTGGAGTGGAGATGCTGCCCTGGCTTCACCGGGAGCAACTGTGATGAGGCGGAAGCCCTGGGCCTGGACCACATGGTCCCC
GTGCCCTACCGGAAGATTGCCTGTGACCCGGAGGCTGTGGAGATCGTGGGCATCCCGGACAAGATCCCCTTCAAGCGCCCCTGCACTTAT
GGAGTCCCCAAGCTGAAGCGGATCCTGGAGGAGCGCCATAGTATCCACTTCATCATTAAGAGGATGTTTGATGAGCGAATTTTCACAGGG
AACAAGTTTACCAAAGACACCACGAAGCTGGAGCCAGCCAGCCCGCCAGAGGACACCTCTGCAGAGGTCTCTAGGGCCACCGTCCTTGAC
CTTGCTGGGAATGCTCGGTCAGACAAGGGCAGCATGTCTGAAGACTGTGGGCCAGGAACCTCCGGGGAGCTGGGCGGGCTGAGGCCGATC
AAAATTGAGCCAGAGGATCTGGACATCATTCAGGTCACCGTCCCAGACCCCTCGCCAACCTCTGAGGAAATGACAGACTCGATGCCTGGG
CACCTGCCATCGGAGGATTCTGGTTATGGGATGGAGATGCTGACAGACAAAGGTCTGAGTGAGGACGCGCGGCCCGAGGAGAGGCCCGTG
GAGGACAGCCACGGTGACGTGATCCGGCCCCTGCGGAAGCAGGTGGAGCTGCTCTTCAACACACGATACGCCAAGGCCATTGGCATCTCG
GAGCCCGTCAAGGTGCCGTACTCCAAGTTTCTGATGCACCCGGAGGAGCTGTTTGTGGTGGGACTGCCTGAAGGCATCTCCCTCCGCAGG
CCCAACTGCTTCGGGATCGCCAAGCTCCGGAAGATTCTGGAGGCCAGCAACAGCATCCAGTTTGTCATCAAGAGGCCCGAGCTGCTCACT
GAGGGAGTCAAAGAGCCCATCATGGATAGTCAAGAGAGGGATTCCGGGGACCCTCTGGTGGACGAGAGCCTGAAGAGACAGGGCTTTCAA
GAAAATTATGACGCGAGGCTCTCACGGATCGACATCGCCAACACACTAAGGGAGCAGGTCCAGGACCTTTTCAATAAGAAATACGGGGAA
GCCTTGGGCATCAAGTACCCGGTCCAGGTCCCCTACAAGCGGATCAAGAGTAACCCCGGCTCCGTGATCATCGAGGGGCTGCCCCCAGGA
ATCCCGTTCCGAAAGCCCTGTACCTTCGGCTCCCAGAACCTGGAGAGGATTCTTGCTGTGGCTGACAAGATCAAGTTCACAGTCACCAGG
CCTTTCCAAGGACTCATCCCAAAGCCTGATGAAGATGACGCCAACAGACTCGGGGAGAAGGTGATCCTGCGGGAGCAGGTGAAGGAACTC
TTCAACGAGAAATACGGTGAGGCCCTGGGCCTGAACCGGCCGGTGCTGGTCCCTTATAAACTAATCCGGGACAGCCCAGACGCCGTGGAG
GTCACGGGTCTGCCTGATGACATCCCCTTCCGGAACCCCAACACGTACGACATCCACCGGCTGGAGAAGATCCTGAAGGCCCGAGAGCAT
GTCCGCATGGTCATCATTAACCAGCTCCAACCCTTTGCAGAAATCTGCAATGATGCCAAGGTGCCAGCCAAAGACAGCAGCATTCCCAAG
CGCAAGAGAAAGCGGGTCTCGGAAGGAAATTCCGTCTCCTCTTCCTCCTCGTCTTCCTCTTCCTCGTCCTCTAACCCGGATTCAGTGGCA
TCGGCCAACCAGATCTCACTCGTGGTAAAGTTGCACCGATTTGGACTCCGGCACTCATCTCTGTGGCCCTCACCCCTCTGTCTGGCAGGG
CCGTCTACTCTGGGATGTGGGCCCAGGGGACGGGGAGGCACTGGGCTTTGAGTGGGGACCTTCCGGCCTCGGGGGTTATAGATGCATCCA
CCTGTCTCACCCAAGAGGTAGCCCATCCTTCTCGTGGGGTACTCACAGGCACTCAGGCAGGAATTCACATCCTCGCTGGGCAGATGGGCC
GGCTGAGGTCCACCTGCCCACACCCTTCAGCCGCACCAGAGCTGGAGACATGAAAAGACATGGCTGGCGGGTGCAGTGGCTCACGCCTGT
AATCCCAGCACTTTGGCAGGTCAAGTCGGGTGGATCACCTGAGGTCAGGAGTTTGAGACCAGGCTGACCAACACGGGGAAACCCCATCTC
TACTAAAAATACAAAATTAGCCGGGCAAAGTGGGGCATAGTGGCTCATGCCTGTAATCCCAGCTACTTGGAAGGCTGAGATAGGAGAATC
GCTTGAACCTGGGAGGCAGAGGTTGCAATGAGCCGAGGTCGCGCCATTGCACTGCAGCCTGGGCAACAAGAGTGAAACACTGTCTCAGAA
AAAAAAATTAGCCAGGCATGGTGGCACGTGCCTGTGGTCGCAGCTACTTGGGAGGCTGGGGCAGGAGGATCATTTGAGCCCAAGGGGATT
GAGGCTGCAGTGAGCCAAGATCGTCCCATTGCACTCCAGCCTGGGCAAGAGAACGAGACTCCATCTCAAAAATAAATAAATAGGCTGGGT
GTGGTGGCTCACGCCTGTAATCCTAGCACTTTGGGAGGCCGAGGCAGGCGGATCACTTGAGGCTCAGGAGTTCAAGACCAGCCTGGCCAA
CATGGCAAAACCCCGTCTCTACTAAAAATAGAAAAATTAGCCGGGCATGGTGGCGGGCGCCTATAATCCCAGCTACTCGGGAGGCTGAGG
CAGGAGACTCGCTTGAACCCGCGGGGCCAAGGTTGCAGTGAGCCGAGATTGCATCACTGCACTCCAGCCTGGGCAGAAGAGTGAAACTCC
ATCTCAAAAAAATAAAAAATATAAATAAATAGCCTCTGAGAAAGCTCTTCCAAAAGCAGAACTAAGCATTTTGGGTTTGTTCCGCATCAC
CTGGAGTCCTAATCCAGTCCCTTTGTCCCTCTCTCTAGCAATGGCCAATGTACATGGTGGACTATGCCGGCCTGAACGTGCAGCTCCCGG
GACCTCTTAATTACTAGACCTCAGTACTGAATCAGGACCTCACTCAGAAAGACTAAAGGAAATGTAATTTATGTACAAAATGTATATTCG

>18258_18258_8_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000397927_GTF2IRD1_chr7_73944064_ENST00000476977_length(amino acids)=742AA_BP=145
MPVLVPGTPGPRGLLRTMKLALLLPWACCCLCGSALATGFLYPFSAAALQQHGYPEPGAGSPGSGYASRRHWCHHTVTRTVSCQVQNGSE
TVVQRVYQSCRWPGPCANLVSYRTLIRPTYRVSYRTVTVLEWRCCPGFTGSNCDEAEALGLDHMVPVPYRKIACDPEAVEIVGIPDKIPF
KRPCTYGVPKLKRILEERHSIHFIIKRMFDERIFTGNKFTKDTTKLEPASPPEDTSAEVSRATVLDLAGNARSDKGSMSEDCGPGTSGEL
GGLRPIKIEPEDLDIIQVTVPDPSPTSEEMTDSMPGHLPSEDSGYGMEMLTDKGLSEDARPEERPVEDSHGDVIRPLRKQVELLFNTRYA
KAIGISEPVKVPYSKFLMHPEELFVVGLPEGISLRRPNCFGIAKLRKILEASNSIQFVIKRPELLTEGVKEPIMDSQERDSGDPLVDESL
KRQGFQENYDARLSRIDIANTLREQVQDLFNKKYGEALGIKYPVQVPYKRIKSNPGSVIIEGLPPGIPFRKPCTFGSQNLERILAVADKI
KFTVTRPFQGLIPKPDEDDANRLGEKVILREQVKELFNEKYGEALGLNRPVLVPYKLIRDSPDAVEVTGLPDDIPFRNPNTYDIHRLEKI
LKAREHVRMVIINQLQPFAEICNDAKVPAKDSSIPKRKRKRVSEGNSVSSSSSSSSSSSSNPDSVASANQISLVVKLHRFGLRHSSLWPS

--------------------------------------------------------------
>18258_18258_9_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000528707_GTF2IRD1_chr7_73944064_ENST00000265755_length(transcript)=2482nt_BP=535nt
CCCCCACACATTTCCAGCTCGCACCCGGGCTCCGACCGCTCGCCCCGCTCCTCTCGCTGTGCTCCCGGCCGGTGCCGCGGGTTCGGTCCG
GGCGCCGGTGCGCTCCTGCCGGTCCTCGTGCCCGGGACTCCGGGTCCCCGCGGGCTGCTGCGCACGATGAAGCTGGCCCTGCTCCTGCCC
TGGGCGTGTTGCTGCCTCTGCGGGTCGGCGCTGGCCACCGGCTTCCTCTATCCCTTCTCGGCCGCAGCTCTGCAGCAGCACGGCTACCCC
GAGCCCGGCGCCGGCTCCCCTGGCAGCGGCTACGCGAGCCGCCGGCACTGGTGCCATCACACAGTGACACGGACGGTGTCCTGCCAGGTG
CAGAATGGCTCGGAGACGGTGGTCCAGCGCGTGTACCAGAGCTGCCGGTGGCCGGGGCCCTGCGCCAACCTCGTAAGGACTCTGATCAGA
CCCACCTACAGAGTGTCCTACCGCACGGTGACGGTGCTGGAGTGGAGATGCTGCCCTGGCTTCACCGGGAGCAACTGTGATGAGGCGGAA
GCCCTGGGCCTGGACCACATGGTCCCCGTGCCCTACCGGAAGATTGCCTGTGACCCGGAGGCTGTGGAGATCGTGGGCATCCCGGACAAG
ATCCCCTTCAAGCGCCCCTGCACTTATGGAGTCCCCAAGCTGAAGCGGATCCTGGAGGAGCGCCATAGTATCCACTTCATCATTAAGAGG
ATGTTTGATGAGCGAATTTTCACAGGGAACAAGTTTACCAAAGACACCACGAAGCTGGAGCCAGCCAGCCCGCCAGAGGACACCTCTGCA
GAGGTCTCTAGGGCCACCGTCCTTGACCTTGCTGGGAATGCTCGGTCAGACAAGGGCAGCATGTCTGAAGACTGTGGGCCAGGAACCTCC
GGGGAGCTGGGCGGGCTGAGGCCGATCAAAATTGAGCCAGAGGATCTGGACATCATTCAGGTCACCGTCCCAGACCCCTCGCCAACCTCT
GAGGAAATGACAGACTCGATGCCTGGGCACCTGCCATCGGAGGATTCTGGTTATGGGATGGAGATGCTGACAGACAAAGGTCTGAGTGAG
GACGCGCGGCCCGAGGAGAGGCCCGTGGAGGACAGCCACGGTGACGTGATCCGGCCCCTGCGGAAGCAGGTGGAGCTGCTCTTCAACACA
CGATACGCCAAGGCCATTGGCATCTCGGAGCCCGTCAAGGTGCCGTACTCCAAGTTTCTGATGCACCCGGAGGAGCTGTTTGTGGTGGGA
CTGCCTGAAGGCATCTCCCTCCGCAGGCCCAACTGCTTCGGGATCGCCAAGCTCCGGAAGATTCTGGAGGCCAGCAACAGCATCCAGTTT
GTCATCAAGAGGCCCGAGCTGCTCACTGAGGGAGTCAAAGAGCCCATCATGGATAGTCAAGGAACTGCCTCCTCACTTGGCTTCTCTCCC
CCTGCCCTGCCCCCAGAGAGGGATTCCGGGGACCCTCTGGTGGACGAGAGCCTGAAGAGACAGGGCTTTCAAGAAAATTATGACGCGAGG
CTCTCACGGATCGACATCGCCAACACACTAAGGGAGCAGGTCCAGGACCTTTTCAATAAGAAATACGGGGAAGCCTTGGGCATCAAGTAC
CCGGTCCAGGTCCCCTACAAGCGGATCAAGAGTAACCCCGGCTCCGTGATCATCGAGGGGCTGCCCCCAGGAATCCCGTTCCGAAAGCCC
TGTACCTTCGGCTCCCAGAACCTGGAGAGGATTCTTGCTGTGGCTGACAAGATCAAGTTCACAGTCACCAGGCCTTTCCAAGGACTCATC
CCAAAGCCTGATGAAGATGACGCCAACAGACTCGGGGAGAAGGTGATCCTGCGGGAGCAGGTGAAGGAACTCTTCAACGAGAAATACGGT
GAGGCCCTGGGCCTGAACCGGCCGGTGCTGGTCCCTTATAAACTAATCCGGGACAGCCCAGACGCCGTGGAGGTCACGGGTCTGCCTGAT
GACATCCCCTTCCGGAACCCCAACACGTACGACATCCACCGGCTGGAGAAGATCCTGAAGGCCCGAGAGCATGTCCGCATGGTCATCATT
AACCAGCTCCAACCCTTTGCAGAAATCTGCAATGATGCCAAGGTGCCAGCCAAAGACAGCAGCATTCCCAAGCGCAAGAGAAAGCGGGTC
TCGGAAGGAAATTCCGTCTCCTCTTCCTCCTCGTCTTCCTCTTCCTCGTCCTCTAACCCGGATTCAGTGGCATCGGCCAACCAGATCTCA
CTCGTGCAATGGCCAATGTACATGGTGGACTATGCCGGCCTGAACGTGCAGCTCCCGGGACCTCTTAATTACTAGACCTCAGTACTGAAT
CAGGACCTCACTCAGAAAGACTAAAGGAAATGTAATTTATGTACAAAATGTATATTCGGATATGTATCGATGCCTTTTAGTTTTTCCAAT

>18258_18258_9_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000528707_GTF2IRD1_chr7_73944064_ENST00000265755_length(amino acids)=739AA_BP=143
MPVLVPGTPGPRGLLRTMKLALLLPWACCCLCGSALATGFLYPFSAAALQQHGYPEPGAGSPGSGYASRRHWCHHTVTRTVSCQVQNGSE
TVVQRVYQSCRWPGPCANLVRTLIRPTYRVSYRTVTVLEWRCCPGFTGSNCDEAEALGLDHMVPVPYRKIACDPEAVEIVGIPDKIPFKR
PCTYGVPKLKRILEERHSIHFIIKRMFDERIFTGNKFTKDTTKLEPASPPEDTSAEVSRATVLDLAGNARSDKGSMSEDCGPGTSGELGG
LRPIKIEPEDLDIIQVTVPDPSPTSEEMTDSMPGHLPSEDSGYGMEMLTDKGLSEDARPEERPVEDSHGDVIRPLRKQVELLFNTRYAKA
IGISEPVKVPYSKFLMHPEELFVVGLPEGISLRRPNCFGIAKLRKILEASNSIQFVIKRPELLTEGVKEPIMDSQGTASSLGFSPPALPP
ERDSGDPLVDESLKRQGFQENYDARLSRIDIANTLREQVQDLFNKKYGEALGIKYPVQVPYKRIKSNPGSVIIEGLPPGIPFRKPCTFGS
QNLERILAVADKIKFTVTRPFQGLIPKPDEDDANRLGEKVILREQVKELFNEKYGEALGLNRPVLVPYKLIRDSPDAVEVTGLPDDIPFR
NPNTYDIHRLEKILKAREHVRMVIINQLQPFAEICNDAKVPAKDSSIPKRKRKRVSEGNSVSSSSSSSSSSSSNPDSVASANQISLVQWP

--------------------------------------------------------------
>18258_18258_10_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000528707_GTF2IRD1_chr7_73944064_ENST00000424337_length(transcript)=2448nt_BP=535nt
CCCCCACACATTTCCAGCTCGCACCCGGGCTCCGACCGCTCGCCCCGCTCCTCTCGCTGTGCTCCCGGCCGGTGCCGCGGGTTCGGTCCG
GGCGCCGGTGCGCTCCTGCCGGTCCTCGTGCCCGGGACTCCGGGTCCCCGCGGGCTGCTGCGCACGATGAAGCTGGCCCTGCTCCTGCCC
TGGGCGTGTTGCTGCCTCTGCGGGTCGGCGCTGGCCACCGGCTTCCTCTATCCCTTCTCGGCCGCAGCTCTGCAGCAGCACGGCTACCCC
GAGCCCGGCGCCGGCTCCCCTGGCAGCGGCTACGCGAGCCGCCGGCACTGGTGCCATCACACAGTGACACGGACGGTGTCCTGCCAGGTG
CAGAATGGCTCGGAGACGGTGGTCCAGCGCGTGTACCAGAGCTGCCGGTGGCCGGGGCCCTGCGCCAACCTCGTAAGGACTCTGATCAGA
CCCACCTACAGAGTGTCCTACCGCACGGTGACGGTGCTGGAGTGGAGATGCTGCCCTGGCTTCACCGGGAGCAACTGTGATGAGGCGGAA
GCCCTGGGCCTGGACCACATGGTCCCCGTGCCCTACCGGAAGATTGCCTGTGACCCGGAGGCTGTGGAGATCGTGGGCATCCCGGACAAG
ATCCCCTTCAAGCGCCCCTGCACTTATGGAGTCCCCAAGCTGAAGCGGATCCTGGAGGAGCGCCATAGTATCCACTTCATCATTAAGAGG
ATGTTTGATGAGCGAATTTTCACAGGGAACAAGTTTACCAAAGACACCACGAAGCTGGAGCCAGCCAGCCCGCCAGAGGACACCTCTGCA
GAGGTCTCTAGGGCCACCGTCCTTGACCTTGCTGGGAATGCTCGGTCAGACAAGGGCAGCATGTCTGAAGACTGTGGGCCAGGAACCTCC
GGGGAGCTGGGCGGGCTGAGGCCGATCAAAATTGAGCCAGAGGATCTGGACATCATTCAGGTCACCGTCCCAGACCCCTCGCCAACCTCT
GAGGAAATGACAGACTCGATGCCTGGGCACCTGCCATCGGAGGATTCTGGTTATGGGATGGAGATGCTGACAGACAAAGGTCTGAGTGAG
GACGCGCGGCCCGAGGAGAGGCCCGTGGAGGACAGCCACGGTGACGTGATCCGGCCCCTGCGGAAGCAGGTGGAGCTGCTCTTCAACACA
CGATACGCCAAGGCCATTGGCATCTCGGAGCCCGTCAAGGTGCCGTACTCCAAGTTTCTGATGCACCCGGAGGAGCTGTTTGTGGTGGGA
CTGCCTGAAGGCATCTCCCTCCGCAGGCCCAACTGCTTCGGGATCGCCAAGCTCCGGAAGATTCTGGAGGCCAGCAACAGCATCCAGTTT
GTCATCAAGAGGCCCGAGCTGCTCACTGAGGGAGTCAAAGAGCCCATCATGGATAGTCAAGAGAGGGATTCCGGGGACCCTCTGGTGGAC
GAGAGCCTGAAGAGACAGGGCTTTCAAGAAAATTATGACGCGAGGCTCTCACGGATCGACATCGCCAACACACTAAGGGAGCAGGTCCAG
GACCTTTTCAATAAGAAATACGGGGAAGCCTTGGGCATCAAGTACCCGGTCCAGGTCCCCTACAAGCGGATCAAGAGTAACCCCGGCTCC
GTGATCATCGAGGGGCTGCCCCCAGGAATCCCGTTCCGAAAGCCCTGTACCTTCGGCTCCCAGAACCTGGAGAGGATTCTTGCTGTGGCT
GACAAGATCAAGTTCACAGTCACCAGGCCTTTCCAAGGACTCATCCCAAAGCCTGATGAAGATGACGCCAACAGACTCGGGGAGAAGGTG
ATCCTGCGGGAGCAGGTGAAGGAACTCTTCAACGAGAAATACGGTGAGGCCCTGGGCCTGAACCGGCCGGTGCTGGTCCCTTATAAACTA
ATCCGGGACAGCCCAGACGCCGTGGAGGTCACGGGTCTGCCTGATGACATCCCCTTCCGGAACCCCAACACGTACGACATCCACCGGCTG
GAGAAGATCCTGAAGGCCCGAGAGCATGTCCGCATGGTCATCATTAACCAGCTCCAACCCTTTGCAGAAATCTGCAATGATGCCAAGGTG
CCAGCCAAAGACAGCAGCATTCCCAAGCGCAAGAGAAAGCGGGTCTCGGAAGGAAATTCCGTCTCCTCTTCCTCCTCGTCTTCCTCTTCC
TCGTCCTCTAACCCGGATTCAGTGGCATCGGCCAACCAGATCTCACTCGTGCAATGGCCAATGTACATGGTGGACTATGCCGGCCTGAAC
GTGCAGCTCCCGGGACCTCTTAATTACTAGACCTCAGTACTGAATCAGGACCTCACTCAGAAAGACTAAAGGAAATGTAATTTATGTACA
AAATGTATATTCGGATATGTATCGATGCCTTTTAGTTTTTCCAATGATTTTTACACTATATTCCTGCCACCAAGGCCTTTTTAAATAAGT

>18258_18258_10_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000528707_GTF2IRD1_chr7_73944064_ENST00000424337_length(amino acids)=724AA_BP=143
MPVLVPGTPGPRGLLRTMKLALLLPWACCCLCGSALATGFLYPFSAAALQQHGYPEPGAGSPGSGYASRRHWCHHTVTRTVSCQVQNGSE
TVVQRVYQSCRWPGPCANLVRTLIRPTYRVSYRTVTVLEWRCCPGFTGSNCDEAEALGLDHMVPVPYRKIACDPEAVEIVGIPDKIPFKR
PCTYGVPKLKRILEERHSIHFIIKRMFDERIFTGNKFTKDTTKLEPASPPEDTSAEVSRATVLDLAGNARSDKGSMSEDCGPGTSGELGG
LRPIKIEPEDLDIIQVTVPDPSPTSEEMTDSMPGHLPSEDSGYGMEMLTDKGLSEDARPEERPVEDSHGDVIRPLRKQVELLFNTRYAKA
IGISEPVKVPYSKFLMHPEELFVVGLPEGISLRRPNCFGIAKLRKILEASNSIQFVIKRPELLTEGVKEPIMDSQERDSGDPLVDESLKR
QGFQENYDARLSRIDIANTLREQVQDLFNKKYGEALGIKYPVQVPYKRIKSNPGSVIIEGLPPGIPFRKPCTFGSQNLERILAVADKIKF
TVTRPFQGLIPKPDEDDANRLGEKVILREQVKELFNEKYGEALGLNRPVLVPYKLIRDSPDAVEVTGLPDDIPFRNPNTYDIHRLEKILK
AREHVRMVIINQLQPFAEICNDAKVPAKDSSIPKRKRKRVSEGNSVSSSSSSSSSSSSNPDSVASANQISLVQWPMYMVDYAGLNVQLPG

--------------------------------------------------------------
>18258_18258_11_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000528707_GTF2IRD1_chr7_73944064_ENST00000455841_length(transcript)=2451nt_BP=535nt
CCCCCACACATTTCCAGCTCGCACCCGGGCTCCGACCGCTCGCCCCGCTCCTCTCGCTGTGCTCCCGGCCGGTGCCGCGGGTTCGGTCCG
GGCGCCGGTGCGCTCCTGCCGGTCCTCGTGCCCGGGACTCCGGGTCCCCGCGGGCTGCTGCGCACGATGAAGCTGGCCCTGCTCCTGCCC
TGGGCGTGTTGCTGCCTCTGCGGGTCGGCGCTGGCCACCGGCTTCCTCTATCCCTTCTCGGCCGCAGCTCTGCAGCAGCACGGCTACCCC
GAGCCCGGCGCCGGCTCCCCTGGCAGCGGCTACGCGAGCCGCCGGCACTGGTGCCATCACACAGTGACACGGACGGTGTCCTGCCAGGTG
CAGAATGGCTCGGAGACGGTGGTCCAGCGCGTGTACCAGAGCTGCCGGTGGCCGGGGCCCTGCGCCAACCTCGTAAGGACTCTGATCAGA
CCCACCTACAGAGTGTCCTACCGCACGGTGACGGTGCTGGAGTGGAGATGCTGCCCTGGCTTCACCGGGAGCAACTGTGATGAGGCGGAA
GCCCTGGGCCTGGACCACATGGTCCCCGTGCCCTACCGGAAGATTGCCTGTGACCCGGAGGCTGTGGAGATCGTGGGCATCCCGGACAAG
ATCCCCTTCAAGCGCCCCTGCACTTATGGAGTCCCCAAGCTGAAGCGGATCCTGGAGGAGCGCCATAGTATCCACTTCATCATTAAGAGG
ATGTTTGATGAGCGAATTTTCACAGGGAACAAGTTTACCAAAGACACCACGAAGCTGGAGCCAGCCAGCCCGCCAGAGGACACCTCTGCA
GAGGTCTCTAGGGCCACCGTCCTTGACCTTGCTGGGAATGCTCGGTCAGACAAGGGCAGCATGTCTGAAGACTGTGGGCCAGGAACCTCC
GGGGAGCTGGGCGGGCTGAGGCCGATCAAAATTGAGCCAGAGGATCTGGACATCATTCAGGTCACCGTCCCAGACCCCTCGCCAACCTCT
GAGGAAATGACAGACTCGATGCCTGGGCACCTGCCATCGGAGGATTCTGGTTATGGGATGGAGATGCTGACAGACAAAGGTCTGAGTGAG
GACGCGCGGCCCGAGGAGAGGCCCGTGGAGGACAGCCACGGTGACGTGATCCGGCCCCTGCGGAAGCAGGTGGAGCTGCTCTTCAACACA
CGATACGCCAAGGCCATTGGCATCTCGGAGCCCGTCAAGGTGCCGTACTCCAAGTTTCTGATGCACCCGGAGGAGCTGTTTGTGGTGGGA
CTGCCTGAAGGCATCTCCCTCCGCAGGCCCAACTGCTTCGGGATCGCCAAGCTCCGGAAGATTCTGGAGGCCAGCAACAGCATCCAGTTT
GTCATCAAGAGGCCCGAGCTGCTCACTGAGGGAGTCAAAGAGCCCATCATGGATAGTCAAGAGAGGGATTCCGGGGACCCTCTGGTGGAC
GAGAGCCTGAAGAGACAGGGCTTTCAAGAAAATTATGACGCGAGGCTCTCACGGATCGACATCGCCAACACACTAAGGGAGCAGGTCCAG
GACCTTTTCAATAAGAAATACGGGGAAGCCTTGGGCATCAAGTACCCGGTCCAGGTCCCCTACAAGCGGATCAAGAGTAACCCCGGCTCC
GTGATCATCGAGGGGCTGCCCCCAGGAATCCCGTTCCGAAAGCCCTGTACCTTCGGCTCCCAGAACCTGGAGAGGATTCTTGCTGTGGCT
GACAAGATCAAGTTCACAGTCACCAGGCCTTTCCAAGGACTCATCCCAAAGCCTGATGAAGATGACGCCAACAGACTCGGGGAGAAGGTG
ATCCTGCGGGAGCAGGTGAAGGAACTCTTCAACGAGAAATACGGTGAGGCCCTGGGCCTGAACCGGCCGGTGCTGGTCCCTTATAAACTA
ATCCGGGACAGCCCAGACGCCGTGGAGGTCACGGGTCTGCCTGATGACATCCCCTTCCGGAACCCCAACACGTACGACATCCACCGGCTG
GAGAAGATCCTGAAGGCCCGAGAGCATGTCCGCATGGTCATCATTAACCAGCTCCAACCCTTTGCAGAAATCTGCAATGATGCCAAGGTG
CCAGCCAAAGACAGCAGCATTCCCAAGCGCAAGAGAAAGCGGGTCTCGGAAGGAAATTCCGTCTCCTCTTCCTCCTCGTCTTCCTCTTCC
TCGTCCTCTAACCCGGATTCAGTGGCATCGGCCAACCAGATCTCACTCGTGCAATGGCCAATGTACATGGTGGACTATGCCGGCCTGAAC
GTGCAGCTCCCGGGACCTCTTAATTACTAGACCTCAGTACTGAATCAGGACCTCACTCAGAAAGACTAAAGGAAATGTAATTTATGTACA
AAATGTATATTCGGATATGTATCGATGCCTTTTAGTTTTTCCAATGATTTTTACACTATATTCCTGCCACCAAGGCCTTTTTAAATAAGT

>18258_18258_11_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000528707_GTF2IRD1_chr7_73944064_ENST00000455841_length(amino acids)=724AA_BP=143
MPVLVPGTPGPRGLLRTMKLALLLPWACCCLCGSALATGFLYPFSAAALQQHGYPEPGAGSPGSGYASRRHWCHHTVTRTVSCQVQNGSE
TVVQRVYQSCRWPGPCANLVRTLIRPTYRVSYRTVTVLEWRCCPGFTGSNCDEAEALGLDHMVPVPYRKIACDPEAVEIVGIPDKIPFKR
PCTYGVPKLKRILEERHSIHFIIKRMFDERIFTGNKFTKDTTKLEPASPPEDTSAEVSRATVLDLAGNARSDKGSMSEDCGPGTSGELGG
LRPIKIEPEDLDIIQVTVPDPSPTSEEMTDSMPGHLPSEDSGYGMEMLTDKGLSEDARPEERPVEDSHGDVIRPLRKQVELLFNTRYAKA
IGISEPVKVPYSKFLMHPEELFVVGLPEGISLRRPNCFGIAKLRKILEASNSIQFVIKRPELLTEGVKEPIMDSQERDSGDPLVDESLKR
QGFQENYDARLSRIDIANTLREQVQDLFNKKYGEALGIKYPVQVPYKRIKSNPGSVIIEGLPPGIPFRKPCTFGSQNLERILAVADKIKF
TVTRPFQGLIPKPDEDDANRLGEKVILREQVKELFNEKYGEALGLNRPVLVPYKLIRDSPDAVEVTGLPDDIPFRNPNTYDIHRLEKILK
AREHVRMVIINQLQPFAEICNDAKVPAKDSSIPKRKRKRVSEGNSVSSSSSSSSSSSSNPDSVASANQISLVQWPMYMVDYAGLNVQLPG

--------------------------------------------------------------
>18258_18258_12_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000528707_GTF2IRD1_chr7_73944064_ENST00000476977_length(transcript)=3622nt_BP=535nt
CCCCCACACATTTCCAGCTCGCACCCGGGCTCCGACCGCTCGCCCCGCTCCTCTCGCTGTGCTCCCGGCCGGTGCCGCGGGTTCGGTCCG
GGCGCCGGTGCGCTCCTGCCGGTCCTCGTGCCCGGGACTCCGGGTCCCCGCGGGCTGCTGCGCACGATGAAGCTGGCCCTGCTCCTGCCC
TGGGCGTGTTGCTGCCTCTGCGGGTCGGCGCTGGCCACCGGCTTCCTCTATCCCTTCTCGGCCGCAGCTCTGCAGCAGCACGGCTACCCC
GAGCCCGGCGCCGGCTCCCCTGGCAGCGGCTACGCGAGCCGCCGGCACTGGTGCCATCACACAGTGACACGGACGGTGTCCTGCCAGGTG
CAGAATGGCTCGGAGACGGTGGTCCAGCGCGTGTACCAGAGCTGCCGGTGGCCGGGGCCCTGCGCCAACCTCGTAAGGACTCTGATCAGA
CCCACCTACAGAGTGTCCTACCGCACGGTGACGGTGCTGGAGTGGAGATGCTGCCCTGGCTTCACCGGGAGCAACTGTGATGAGGCGGAA
GCCCTGGGCCTGGACCACATGGTCCCCGTGCCCTACCGGAAGATTGCCTGTGACCCGGAGGCTGTGGAGATCGTGGGCATCCCGGACAAG
ATCCCCTTCAAGCGCCCCTGCACTTATGGAGTCCCCAAGCTGAAGCGGATCCTGGAGGAGCGCCATAGTATCCACTTCATCATTAAGAGG
ATGTTTGATGAGCGAATTTTCACAGGGAACAAGTTTACCAAAGACACCACGAAGCTGGAGCCAGCCAGCCCGCCAGAGGACACCTCTGCA
GAGGTCTCTAGGGCCACCGTCCTTGACCTTGCTGGGAATGCTCGGTCAGACAAGGGCAGCATGTCTGAAGACTGTGGGCCAGGAACCTCC
GGGGAGCTGGGCGGGCTGAGGCCGATCAAAATTGAGCCAGAGGATCTGGACATCATTCAGGTCACCGTCCCAGACCCCTCGCCAACCTCT
GAGGAAATGACAGACTCGATGCCTGGGCACCTGCCATCGGAGGATTCTGGTTATGGGATGGAGATGCTGACAGACAAAGGTCTGAGTGAG
GACGCGCGGCCCGAGGAGAGGCCCGTGGAGGACAGCCACGGTGACGTGATCCGGCCCCTGCGGAAGCAGGTGGAGCTGCTCTTCAACACA
CGATACGCCAAGGCCATTGGCATCTCGGAGCCCGTCAAGGTGCCGTACTCCAAGTTTCTGATGCACCCGGAGGAGCTGTTTGTGGTGGGA
CTGCCTGAAGGCATCTCCCTCCGCAGGCCCAACTGCTTCGGGATCGCCAAGCTCCGGAAGATTCTGGAGGCCAGCAACAGCATCCAGTTT
GTCATCAAGAGGCCCGAGCTGCTCACTGAGGGAGTCAAAGAGCCCATCATGGATAGTCAAGAGAGGGATTCCGGGGACCCTCTGGTGGAC
GAGAGCCTGAAGAGACAGGGCTTTCAAGAAAATTATGACGCGAGGCTCTCACGGATCGACATCGCCAACACACTAAGGGAGCAGGTCCAG
GACCTTTTCAATAAGAAATACGGGGAAGCCTTGGGCATCAAGTACCCGGTCCAGGTCCCCTACAAGCGGATCAAGAGTAACCCCGGCTCC
GTGATCATCGAGGGGCTGCCCCCAGGAATCCCGTTCCGAAAGCCCTGTACCTTCGGCTCCCAGAACCTGGAGAGGATTCTTGCTGTGGCT
GACAAGATCAAGTTCACAGTCACCAGGCCTTTCCAAGGACTCATCCCAAAGCCTGATGAAGATGACGCCAACAGACTCGGGGAGAAGGTG
ATCCTGCGGGAGCAGGTGAAGGAACTCTTCAACGAGAAATACGGTGAGGCCCTGGGCCTGAACCGGCCGGTGCTGGTCCCTTATAAACTA
ATCCGGGACAGCCCAGACGCCGTGGAGGTCACGGGTCTGCCTGATGACATCCCCTTCCGGAACCCCAACACGTACGACATCCACCGGCTG
GAGAAGATCCTGAAGGCCCGAGAGCATGTCCGCATGGTCATCATTAACCAGCTCCAACCCTTTGCAGAAATCTGCAATGATGCCAAGGTG
CCAGCCAAAGACAGCAGCATTCCCAAGCGCAAGAGAAAGCGGGTCTCGGAAGGAAATTCCGTCTCCTCTTCCTCCTCGTCTTCCTCTTCC
TCGTCCTCTAACCCGGATTCAGTGGCATCGGCCAACCAGATCTCACTCGTGGTAAAGTTGCACCGATTTGGACTCCGGCACTCATCTCTG
TGGCCCTCACCCCTCTGTCTGGCAGGGCCGTCTACTCTGGGATGTGGGCCCAGGGGACGGGGAGGCACTGGGCTTTGAGTGGGGACCTTC
CGGCCTCGGGGGTTATAGATGCATCCACCTGTCTCACCCAAGAGGTAGCCCATCCTTCTCGTGGGGTACTCACAGGCACTCAGGCAGGAA
TTCACATCCTCGCTGGGCAGATGGGCCGGCTGAGGTCCACCTGCCCACACCCTTCAGCCGCACCAGAGCTGGAGACATGAAAAGACATGG
CTGGCGGGTGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGCAGGTCAAGTCGGGTGGATCACCTGAGGTCAGGAGTTTGAGACCAGG
CTGACCAACACGGGGAAACCCCATCTCTACTAAAAATACAAAATTAGCCGGGCAAAGTGGGGCATAGTGGCTCATGCCTGTAATCCCAGC
TACTTGGAAGGCTGAGATAGGAGAATCGCTTGAACCTGGGAGGCAGAGGTTGCAATGAGCCGAGGTCGCGCCATTGCACTGCAGCCTGGG
CAACAAGAGTGAAACACTGTCTCAGAAAAAAAAATTAGCCAGGCATGGTGGCACGTGCCTGTGGTCGCAGCTACTTGGGAGGCTGGGGCA
GGAGGATCATTTGAGCCCAAGGGGATTGAGGCTGCAGTGAGCCAAGATCGTCCCATTGCACTCCAGCCTGGGCAAGAGAACGAGACTCCA
TCTCAAAAATAAATAAATAGGCTGGGTGTGGTGGCTCACGCCTGTAATCCTAGCACTTTGGGAGGCCGAGGCAGGCGGATCACTTGAGGC
TCAGGAGTTCAAGACCAGCCTGGCCAACATGGCAAAACCCCGTCTCTACTAAAAATAGAAAAATTAGCCGGGCATGGTGGCGGGCGCCTA
TAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGACTCGCTTGAACCCGCGGGGCCAAGGTTGCAGTGAGCCGAGATTGCATCACTGCACT
CCAGCCTGGGCAGAAGAGTGAAACTCCATCTCAAAAAAATAAAAAATATAAATAAATAGCCTCTGAGAAAGCTCTTCCAAAAGCAGAACT
AAGCATTTTGGGTTTGTTCCGCATCACCTGGAGTCCTAATCCAGTCCCTTTGTCCCTCTCTCTAGCAATGGCCAATGTACATGGTGGACT
ATGCCGGCCTGAACGTGCAGCTCCCGGGACCTCTTAATTACTAGACCTCAGTACTGAATCAGGACCTCACTCAGAAAGACTAAAGGAAAT
GTAATTTATGTACAAAATGTATATTCGGATATGTATCGATGCCTTTTAGTTTTTCCAATGATTTTTACACTATATTCCTGCCACCAAGGC

>18258_18258_12_COL26A1-GTF2IRD1_COL26A1_chr7_101091068_ENST00000528707_GTF2IRD1_chr7_73944064_ENST00000476977_length(amino acids)=740AA_BP=143
MPVLVPGTPGPRGLLRTMKLALLLPWACCCLCGSALATGFLYPFSAAALQQHGYPEPGAGSPGSGYASRRHWCHHTVTRTVSCQVQNGSE
TVVQRVYQSCRWPGPCANLVRTLIRPTYRVSYRTVTVLEWRCCPGFTGSNCDEAEALGLDHMVPVPYRKIACDPEAVEIVGIPDKIPFKR
PCTYGVPKLKRILEERHSIHFIIKRMFDERIFTGNKFTKDTTKLEPASPPEDTSAEVSRATVLDLAGNARSDKGSMSEDCGPGTSGELGG
LRPIKIEPEDLDIIQVTVPDPSPTSEEMTDSMPGHLPSEDSGYGMEMLTDKGLSEDARPEERPVEDSHGDVIRPLRKQVELLFNTRYAKA
IGISEPVKVPYSKFLMHPEELFVVGLPEGISLRRPNCFGIAKLRKILEASNSIQFVIKRPELLTEGVKEPIMDSQERDSGDPLVDESLKR
QGFQENYDARLSRIDIANTLREQVQDLFNKKYGEALGIKYPVQVPYKRIKSNPGSVIIEGLPPGIPFRKPCTFGSQNLERILAVADKIKF
TVTRPFQGLIPKPDEDDANRLGEKVILREQVKELFNEKYGEALGLNRPVLVPYKLIRDSPDAVEVTGLPDDIPFRNPNTYDIHRLEKILK
AREHVRMVIINQLQPFAEICNDAKVPAKDSSIPKRKRKRVSEGNSVSSSSSSSSSSSSNPDSVASANQISLVVKLHRFGLRHSSLWPSPL

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for COL26A1-GTF2IRD1


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for COL26A1-GTF2IRD1


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for COL26A1-GTF2IRD1


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource