Fusion Gene Studies
in Kim Lab


Center for Computational Systems Medicine

Fusion gene: GALNT7-GAB1 (FusionGDB2 ID: HG51809TG2549)

Fusion Gene Summary for GALNT7-GAB1

Fusion gene summary
Fusion gene name: GALNT7-GAB1
Fusion gene ID: hg51809tg2549

Hgene
Gene symbol: GALNT7
Gene ID: 51809
Gene name: polypeptide N-acetylgalactosaminyltransferase 7
Synonyms: GALNAC-T7|GalNAcT7
Cytomap: 4q34.1
Type of gene: protein-coding
Description: N-acetylgalactosaminyltransferase 7; UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 7; UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 7 (GalNAc-T7); polypeptide GalNAc transferase 7; pp-GaNTase 7; protein-UDP acetylg
Modification date: 20200313
UniProtAcc: Q86SF2
DoF score*: 8 X 6 X 5 = 240
# samples: 11
MAII score**: log2(11/240*10) = -1.12553088208386
Possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs): DoF>8 and MAII<0

Tgene
Gene symbol: GAB1
Gene ID: 2549
Gene name: GRB2 associated binding protein 1
Synonyms: DFNB26
Cytomap: 4q31.21
Type of gene: protein-coding
Description: GRB2-associated-binding protein 1; GRB2-associated binder 1; deafness, autosomal recessive 26; growth factor receptor bound protein 2-associated protein 1; symbol withdrawn, see GAB1
Modification date: 20200327
UniProtAcc: Q13480
DoF score*: 8 X 7 X 5 = 280
# samples: 9
MAII score**: log2(9/280*10) = -1.63742992061529
Possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs): DoF>8 and MAII<0

Ensembl transcripts involved in fusion gene: ENST00000265000, ENST00000512285, ENST00000502407
Context

PubMed: GALNT7 [Title/Abstract] AND GAB1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint: GALNT7(174090112)-GAB1(144336630), # samples: 3
Anticipated loss of major functional domain due to fusion event:
GALNT7-GAB1 appears to have lost the major protein functional domain of the Hgene partner, which is a CGC gene, because the major functional domain is not retained in the partially deleted in-frame ORF.
GALNT7-GAB1 appears to have lost the major protein functional domain of the Hgene partner, which is an essential gene, because the major functional domain is not retained in the partially deleted in-frame ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
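
The two score definitions above can be checked with a few lines of Python. This is a minimal sketch that only re-implements the formulas as written on this page; the function names are illustrative and are not part of any FusionGDB2 tool. The inputs (8 partners, 6 or 7 breakpoints, 5 cancer types, 11 or 9 samples) are the values shown in the score expressions for GALNT7 and GAB1.

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """DoF score = # partners X # break points X # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """MAII score = log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def is_peginpcfg(dof: int, maii: float) -> bool:
    """Criterion quoted on the page: DoF>8 and MAII<0 (helper name is hypothetical)."""
    return dof > 8 and maii < 0


# Values taken from the score expressions for the two partner genes.
hgene_dof = dof_score(8, 6, 5)          # GALNT7
tgene_dof = dof_score(8, 7, 5)          # GAB1
hgene_maii = maii_score(11, hgene_dof)
tgene_maii = maii_score(9, tgene_dof)

print(hgene_dof, hgene_maii, is_peginpcfg(hgene_dof, hgene_maii))
print(tgene_dof, tgene_maii, is_peginpcfg(tgene_dof, tgene_maii))
```

Both partners satisfy the DoF>8 and MAII<0 criterion under these definitions, which matches the peGinPCFGs label given for each gene.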

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
Partner | Gene | GO ID | GO term | PubMed ID

Fusion gene breakpoints across GALNT7 (5'-gene)
[Image: all structure]
Fusion gene breakpoints across GAB1 (3'-gene)
[Image: all structure]

Fusion gene information
* All genome coordinates were lifted over to hg19.
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
ChimerDB4 | PRAD | TCGA-VN-A943-01A | GALNT7 | chr4 | 174090112 | - | GAB1 | chr4 | 144336630 | +
ChimerDB4 | PRAD | TCGA-VN-A943-01A | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | +
ChimerDB4 | PRAD | TCGA-VN-A943 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | +



Fusion Gene ORF analysis for GALNT7-GAB1

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
5CDS-5UTR | ENST00000265000 | ENST00000505913 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | +
5CDS-5UTR | ENST00000512285 | ENST00000505913 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | +
5CDS-intron | ENST00000265000 | ENST00000515388 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | +
5CDS-intron | ENST00000512285 | ENST00000515388 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | +
In-frame | ENST00000265000 | ENST00000262994 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | +
In-frame | ENST00000265000 | ENST00000262995 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | +
In-frame | ENST00000512285 | ENST00000262994 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | +
In-frame | ENST00000512285 | ENST00000262995 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | +
intron-3CDS | ENST00000502407 | ENST00000262994 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | +
intron-3CDS | ENST00000502407 | ENST00000262995 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | +
intron-5UTR | ENST00000502407 | ENST00000505913 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | +
intron-intron | ENST00000502407 | ENST00000515388 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | +

ORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
Henst | Hgene | Hchr | Hbp | Hstrand | Tenst | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids)
ENST00000265000 | GALNT7 | chr4 | 174090112 | + | ENST00000262995 | GAB1 | chr4 | 144336630 | + | 7691 | 209 | 83 | 2311 | 742
ENST00000265000 | GALNT7 | chr4 | 174090112 | + | ENST00000262994 | GAB1 | chr4 | 144336630 | + | 2483 | 209 | 83 | 2221 | 712
ENST00000512285 | GALNT7 | chr4 | 174090112 | + | ENST00000262995 | GAB1 | chr4 | 144336630 | + | 7633 | 151 | 25 | 2253 | 742
ENST00000512285 | GALNT7 | chr4 | 174090112 | + | ENST00000262994 | GAB1 | chr4 | 144336630 | + | 2425 | 151 | 25 | 2163 | 712
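
The predicted start, predicted stop, and amino acid length in the ORFfinder table can be cross-checked arithmetically. The sketch below assumes that the start and stop values are inclusive positions of the first ORF base and the last base of the stop codon in the same coordinate system, and that the reported protein length excludes the stop codon. These are assumptions inferred from the four rows, not a documented FusionGDB2 convention, and the function name is illustrative.

```python
def orf_protein_length(start: int, stop: int) -> int:
    """Amino acid count for an ORF spanning start..stop inclusive, stop codon excluded."""
    n_bases = stop - start + 1
    if n_bases % 3 != 0:
        raise ValueError("ORF span is not a multiple of three bases")
    return n_bases // 3 - 1  # subtract the stop codon


# (Henst, Tenst, predicted start, predicted stop, reported amino acid length)
orf_rows = [
    ("ENST00000265000", "ENST00000262995", 83, 2311, 742),
    ("ENST00000265000", "ENST00000262994", 83, 2221, 712),
    ("ENST00000512285", "ENST00000262995", 25, 2253, 742),
    ("ENST00000512285", "ENST00000262994", 25, 2163, 712),
]

computed = [orf_protein_length(start, stop) for _, _, start, stop, _ in orf_rows]
reported = [aa for *_, aa in orf_rows]
print(computed == reported)
```

Under these assumptions all four rows agree with the reported lengths of 742 and 712 amino acids.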

DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on a convolutional neural network, built by comparison with real Ribo-seq data. If the no-coding score is < 0.5 and the coding score is > 0.5, the in-frame fusion transcript is predicted as likely to be translated.
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score
ENST00000265000 | ENST00000262995 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | + | 8.77E-05 | 0.99991226
ENST00000265000 | ENST00000262994 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | + | 0.000863291 | 0.99913675
ENST00000512285 | ENST00000262995 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | + | 8.69E-05 | 0.9999131
ENST00000512285 | ENST00000262994 | GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336630 | + | 0.000865896 | 0.9991341
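
The decision rule stated for the DeepORF scores can be applied to the four rows as follows. This is only a sketch of the thresholding step described above (no-coding score < 0.5 and coding score > 0.5). It does not run DeepORF itself, and the function name is an illustrative choice.

```python
def likely_translated(no_coding_score: float, coding_score: float,
                      threshold: float = 0.5) -> bool:
    """Apply the rule from the page: no-coding < 0.5 and coding > 0.5."""
    return no_coding_score < threshold and coding_score > threshold


# (Henst, Tenst) -> (no-coding score, coding score), copied from the table.
deeporf_scores = {
    ("ENST00000265000", "ENST00000262995"): (8.77e-05, 0.99991226),
    ("ENST00000265000", "ENST00000262994"): (0.000863291, 0.99913675),
    ("ENST00000512285", "ENST00000262995"): (8.69e-05, 0.9999131),
    ("ENST00000512285", "ENST00000262994"): (0.000865896, 0.9991341),
}

calls = {pair: likely_translated(*scores) for pair, scores in deeporf_scores.items()}
for (henst, tenst), call in calls.items():
    print(henst, tenst, "likely translated" if call else "not predicted as translated")
```

All four in-frame transcript pairs pass the rule, since every coding score is above 0.99.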


Fusion Genomic Features for GALNT7-GAB1


FusionAI prediction of the potential fusion gene breakpoint based on the pre-mRNA sequence context (+/- 5 kb for each partner gene, 20 kb of sequence in total). FusionAI is a fusion gene breakpoint classifier based on a convolutional neural network, built by comparing the fusion-positive and fusion-negative sequence contexts of ~20K fusion genes. From this, we can estimate how likely each sequence within the 20 kb genomic context is to be used as a gene fusion breakpoint.
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint)
GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336629 | + | 4.73E-05 | 0.9999527
GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336629 | + | 4.73E-05 | 0.9999527
GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336629 | + | 4.73E-05 | 0.9999527
GALNT7 | chr4 | 174090112 | + | GAB1 | chr4 | 144336629 | + | 4.73E-05 | 0.9999527
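
To make the "+/- 5kb per partner, 20kb in total" arithmetic concrete, the sketch below derives one window per partner from the breakpoints in the table. It assumes each window is centered on the breakpoint and expressed as a half-open interval [start, end); the page does not state the exact boundary convention FusionAI uses, so treat the coordinates as illustrative. The function name is hypothetical.

```python
def breakpoint_window(chrom: str, bp: int, flank: int = 5000):
    """Return (chrom, start, end) for a half-open window of 2*flank bases around bp."""
    return (chrom, bp - flank, bp + flank)


# Breakpoints as listed in the FusionAI table.
hgene_window = breakpoint_window("chr4", 174090112)   # GALNT7
tgene_window = breakpoint_window("chr4", 144336629)   # GAB1

total_length = sum(end - start for _, start, end in (hgene_window, tgene_window))
print(hgene_window, tgene_window, total_length)
```

Two 10 kb windows give the 20 kb sequence context mentioned in the description.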

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions. We integrated loci for a total of 44 types of human genomic features across five broad categories: virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are on the help page.
[Image: genomic feature]

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions that overlap the regions with the top 1% feature importance scores. More details are on the help page.
[Image: genomic feature of top 1%]


Fusion Protein Features for GALNT7-GAB1


Four levels of functional features of fusion genes
Go to the FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr4:174090112/chr4:144336630)
- FGviewer provides online visualization of the retention of protein functional features across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the Tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to the hyperlink for the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
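
The FGviewer link above follows a visible pattern of two chromosome:position pairs. As a convenience, the sketch below builds a link of the same shape from a pair of breakpoints. It assumes the pattern seen in this single example generalizes to other fusions, which this page does not confirm, and the function name is hypothetical.

```python
FGVIEWER_BASE = "https://ccsmweb.uth.edu/FGviewer"  # base taken from the link on this page


def fgviewer_url(hchr: str, hbp: int, tchr: str, tbp: int) -> str:
    """Build a link shaped like the example: <base>/<Hchr>:<Hbp>/<Tchr>:<Tbp>."""
    return f"{FGVIEWER_BASE}/{hchr}:{hbp}/{tchr}:{tbp}"


url = fgviewer_url("chr4", 174090112, "chr4", 144336630)
print(url)
```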

Main function of each fusion partner protein (from UniProt)

Hgene: GALNT7 (Q86SF2)
FUNCTION: Glycopeptide transferase involved in O-linked oligosaccharide biosynthesis, which catalyzes the transfer of an N-acetyl-D-galactosamine residue to an already glycosylated peptide. In contrast to other proteins of the family, it does not act as a peptide transferase that transfers GalNAc onto serine or threonine residue on the protein receptor, but instead requires the prior addition of a GalNAc on a peptide before adding additional GalNAc moieties. Some peptide transferase activity is however not excluded, considering that its appropriate peptide substrate may remain unidentified. {ECO:0000269|PubMed:10544240, ECO:0000269|PubMed:11925450}.

Tgene: GAB1 (Q13480)
FUNCTION: Adapter protein that plays a role in intracellular signaling cascades triggered by activated receptor-type kinases. Plays a role in FGFR1 signaling. Probably involved in signaling by the epidermal growth factor receptor (EGFR) and the insulin receptor (INSR). Involved in the MET/HGF-signaling pathway (PubMed:29408807). {ECO:0000269|PubMed:29408807}.

Retention analysis results for each fusion partner protein across 39 UniProt protein features: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 region features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the breakpoint is located before the CDS.
- In-frame and retained protein features among the 13 region features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | GALNT7 | chr4:174090112 | chr4:144336630 | ENST00000265000 | + | 1 | 12 | 1_6 | 42 | 658.0 | Topological domain | Cytoplasmic
Hgene | GALNT7 | chr4:174090112 | chr4:144336630 | ENST00000265000 | + | 1 | 12 | 7_29 | 42 | 658.0 | Transmembrane | Helical; Signal-anchor for type II membrane protein
Tgene | GAB1 | chr4:174090112 | chr4:144336630 | ENST00000262994 |  | 0 | 10 | 449_540 | 24 | 695.0 | Compositional bias | Note=Pro-rich
Tgene | GAB1 | chr4:174090112 | chr4:144336630 | ENST00000262995 |  | 0 | 11 | 449_540 | 24 | 725.0 | Compositional bias | Note=Pro-rich

- In-frame and not-retained protein features among the 13 region features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | GALNT7 | chr4:174090112 | chr4:144336630 | ENST00000265000 | + | 1 | 12 | 532_652 | 42 | 658.0 | Domain | Ricin B-type lectin
Hgene | GALNT7 | chr4:174090112 | chr4:144336630 | ENST00000265000 | + | 1 | 12 | 206_317 | 42 | 658.0 | Region | Note=Catalytic subdomain A
Hgene | GALNT7 | chr4:174090112 | chr4:144336630 | ENST00000265000 | + | 1 | 12 | 381_443 | 42 | 658.0 | Region | Note=Catalytic subdomain B
Hgene | GALNT7 | chr4:174090112 | chr4:144336630 | ENST00000265000 | + | 1 | 12 | 30_657 | 42 | 658.0 | Topological domain | Lumenal
Tgene | GAB1 | chr4:174090112 | chr4:144336630 | ENST00000262994 |  | 0 | 10 | 5_116 | 24 | 695.0 | Domain | PH
Tgene | GAB1 | chr4:174090112 | chr4:144336630 | ENST00000262995 |  | 0 | 11 | 5_116 | 24 | 725.0 | Domain | PH
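
The retained and not-retained rows above are consistent with a simple positional rule: an Hgene feature is kept when it ends at or before the breakpoint position (BPloci), and a Tgene feature is kept when it starts at or after it. The sketch below encodes that rule and checks it against the listed rows. The rule is inferred from this page's data rather than taken from FusionGDB2 documentation, the handling of a feature that touches the breakpoint exactly is an assumption, and the function name is illustrative.

```python
def is_retained(partner: str, feature_start: int, feature_end: int, bp_loci: int) -> bool:
    """Inferred rule: the 5' partner keeps features upstream of the breakpoint,
    the 3' partner keeps features downstream of it."""
    if partner == "Hgene":
        return feature_end <= bp_loci
    if partner == "Tgene":
        return feature_start >= bp_loci
    raise ValueError(f"unknown partner: {partner}")


# (partner, feature, start, end, BPloci, listed as retained?)
feature_rows = [
    ("Hgene", "Topological domain (Cytoplasmic)", 1, 6, 42, True),
    ("Hgene", "Transmembrane", 7, 29, 42, True),
    ("Tgene", "Compositional bias (Pro-rich)", 449, 540, 24, True),
    ("Hgene", "Domain (Ricin B-type lectin)", 532, 652, 42, False),
    ("Hgene", "Region (Catalytic subdomain A)", 206, 317, 42, False),
    ("Hgene", "Region (Catalytic subdomain B)", 381, 443, 42, False),
    ("Hgene", "Topological domain (Lumenal)", 30, 657, 42, False),
    ("Tgene", "Domain (PH)", 5, 116, 24, False),
]

predicted = [is_retained(p, s, e, bp) for p, _, s, e, bp, _ in feature_rows]
listed = [row[-1] for row in feature_rows]
print(predicted == listed)
```

Features that span the breakpoint, such as the GALNT7 lumenal domain (30_657) and the GAB1 PH domain (5_116), come out as not retained under this rule, in agreement with the tables.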



Fusion Gene Sequence for GALNT7-GAB1


For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To obtain the fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among all predicted ORFs.
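
The idea of choosing the longest ORF can be illustrated with a small forward-strand scan. This is a simplified stand-in, not ORFfinder: it assumes ORFs start at ATG, end at the first in-frame stop codon, and lie on the given strand only, whereas ORFfinder supports more options. The function name and the toy sequence are illustrative.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf(seq: str):
    """Return (start, end) of the longest ATG..stop ORF as a 0-based half-open
    interval on the forward strand, or None if no complete ORF exists."""
    seq = seq.upper()
    best = None
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i                      # first ATG opens an ORF in this frame
            elif start is not None and codon in STOP_CODONS:
                end = i + 3                    # include the stop codon
                if best is None or end - start > best[1] - best[0]:
                    best = (start, end)
                start = None                   # look for the next ORF in this frame
    return best


toy = "CCATGAAATGA"                            # toy sequence, not from the database
orf = longest_orf(toy)
print(orf, toy[orf[0]:orf[1]])
```

On the toy sequence the only complete ORF is ATG AAA TGA in the third frame, so the function returns that span.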
>32304_32304_1_GALNT7-GAB1_GALNT7_chr4_174090112_ENST00000265000_GAB1_chr4_144336630_ENST00000262994_length(transcript)=2483nt_BP=209nt
AGAGCCGGAGGAGGGGGAAGGAGGGAGGGGAGAGCGGTGGCGGCGGCTGCGCCGGGCTGTGAGTCTCTCGCCGCCGGAGGAAGATGAGGC
TGAAGATTGGGTTCATCTTACGCAGTTTGCTGGTGGTGGGAAGCTTCCTGGGGCTAGTGGTCCTCTGGTCTTCCCTGACCCCGCGGCCGG
ACGACCCAAGCCCGCTGAGCAGGATGAGGGCATGGAAGAGGAGATGGTTCGTGTTACGCAGTGGCCGTTTAACTGGAGATCCAGATGTTT
TGGAATATTACAAAAATGATCATGCCAAGAAGCCTATTCGTATTATTGATTTAAATTTATGTCAACAAGTAGATGCTGGATTGACATTTA
ACAAAAAAGAGTTTGAAAACAGCTACATTTTTGATATCAACACTATTGACCGGATTTTCTACTTGGTAGCAGACAGCGAGGAGGAGATGA
ATAAGTGGGTTCGTTGTATTTGTGACATCTGTGGGTTTAATCCAACAGAAGAAGATCCTGTGAAGCCACCTGGCAGCTCTTTACAAGCAC
CAGCTGATTTACCTTTAGCTATAAATACAGCACCACCATCCACCCAGGCAGATTCATCCTCTGCTACTCTACCTCCTCCATATCAGCTAA
TCAATGTTCCACCACACCTGGAAACTCTTGGCATTCAGGAGGATCCTCAAGACTACCTGTTGCTCATCAACTGTCAAAGCAAGAAGCCCG
AACCCACCAGAACGCATGCTGATTCTGCAAAATCCACCTCTTCTGAAACAGACTGCAATGATAACGTCCCTTCTCATAAAAATCCTGCTT
CCTCCCAGAGCAAACATGGAATGAATGGCTTTTTTCAGCAGCAAATGATATACGACTCTCCACCTTCACGTGCCCCATCTGCTTCAGTTG
ACTCCAGCCTTTATAACCTGCCCAGGAGTTATTCCCATGATGTTTTACCAAAGGTGTCTCCATCAAGTACTGAAGCAGATGGAGAACTCT
ATGTTTTTAATACCCCATCTGGGACATCGAGTGTAGAGACTCAAATGAGGCATGTATCTATTAGTTATGACATTCCTCCAACACCTGGTA
ATACTTATCAGATTCCACGAACATTTCCAGAAGGAACCTTGGGACAGACATCAAAGCTAGACACTATTCCAGATATTCCTCCACCTCGGC
CACCGAAACCACATCCAGCTCATGACCGATCTCCTGTGGAAACGTGTAGTATCCCACGCACCGCCTCAGACACTGACAGTAGTTACTGTA
TCCCTACAGCAGGGATGTCGCCTTCACGTAGTAATACCATTTCCACTGTGGATTTAAACAAATTGCGAAAAGATGCTAGTTCTCAAGACT
GCTATGATATTCCACGAGCATTTCCAAGTGATAGATCTAGTTCACTTGAAGGCTTCCATAACCACTTTAAAGTCAAAAATGTGTTGACAG
TGGGAAGTGTTTCAAGTGAAGAACTGGATGAAAATTACGTCCCAATGAATCCCAATTCACCACCACGACAACATTCCAGCAGTTTTACAG
AACCAATTCAGGAAGCAAATTATGTGCCAATGACTCCAGGAACATTTGATTTTTCCTCATTTGGAATGCAAGTTCCTCCTCCTGCTCATA
TGGGCTTCAGGTCCAGCCCAAAAACCCCTCCCAGAAGGCCAGTTCCTGTTGCAGACTGTGAACCACCCCCCGTGGATAGGAACCTCAAGC
CAGACAGAAAAGTCAAGCCAGCGCCTTTAGAAATAAAACCTTTGCCAGAATGGGAAGAATTACAAGCCCCAGTTAGATCTCCCATCACTA
GGAGTTTTGCTCGAGACTCTTCCAGGTTTCCCATGTCCCCCCGACCAGATTCAGTGCATAGCACAACTTCAAGCAGTGACTCACACGACA
GTGAAGAGAATTATGTTCCCATGAACCCAAACCTGTCCAGTGAAGACCCAAATCTCTTTGGCAGTAACAGTCTTGATGGAGGAAGCAGCC
CTATGATCAAGCCCAAAGGAGACAAACAGGTGGAATACTTAGATCTCGACTTAGATTCTGGGAAATCCACACCACCACGTAAGCAAAAGA
GCAGTGGCTCAGGCAGCAGTGTAGCAGATGAGAGAGTGGATTATGTTGTTGTTGACCAACAGAAGACCTTGGCTCTAAAGAGTACCCGGG
AAGCCTGGACAGATGGGAGACAGTCCACAGAATCAGAAACGCCAGCGAAGAGTGTGAAATGAAAATATTGCCTTGCCATTTCTGAACAAA
AGAAAACTGAATTGTAAAGATAAATCCCTTTTGAAGAATGACTTGACACTTCCACTCTAGGTAGATCCTCAAATGAGTAGAGTTGAAGTC
AAAGGACCTTTCTGACATAATCAAGCAATTTAGACTTAAGTGGTGCTTTGTGGTATCTGAACAATTCATAACATGTAAATAATGTGGGAA

>32304_32304_1_GALNT7-GAB1_GALNT7_chr4_174090112_ENST00000265000_GAB1_chr4_144336630_ENST00000262994_length(amino acids)=712AA_BP=20
MRLKIGFILRSLLVVGSFLGLVVLWSSLTPRPDDPSPLSRMRAWKRRWFVLRSGRLTGDPDVLEYYKNDHAKKPIRIIDLNLCQQVDAGL
TFNKKEFENSYIFDINTIDRIFYLVADSEEEMNKWVRCICDICGFNPTEEDPVKPPGSSLQAPADLPLAINTAPPSTQADSSSATLPPPY
QLINVPPHLETLGIQEDPQDYLLLINCQSKKPEPTRTHADSAKSTSSETDCNDNVPSHKNPASSQSKHGMNGFFQQQMIYDSPPSRAPSA
SVDSSLYNLPRSYSHDVLPKVSPSSTEADGELYVFNTPSGTSSVETQMRHVSISYDIPPTPGNTYQIPRTFPEGTLGQTSKLDTIPDIPP
PRPPKPHPAHDRSPVETCSIPRTASDTDSSYCIPTAGMSPSRSNTISTVDLNKLRKDASSQDCYDIPRAFPSDRSSSLEGFHNHFKVKNV
LTVGSVSSEELDENYVPMNPNSPPRQHSSSFTEPIQEANYVPMTPGTFDFSSFGMQVPPPAHMGFRSSPKTPPRRPVPVADCEPPPVDRN
LKPDRKVKPAPLEIKPLPEWEELQAPVRSPITRSFARDSSRFPMSPRPDSVHSTTSSSDSHDSEENYVPMNPNLSSEDPNLFGSNSLDGG

--------------------------------------------------------------
>32304_32304_2_GALNT7-GAB1_GALNT7_chr4_174090112_ENST00000265000_GAB1_chr4_144336630_ENST00000262995_length(transcript)=7691nt_BP=209nt
AGAGCCGGAGGAGGGGGAAGGAGGGAGGGGAGAGCGGTGGCGGCGGCTGCGCCGGGCTGTGAGTCTCTCGCCGCCGGAGGAAGATGAGGC
TGAAGATTGGGTTCATCTTACGCAGTTTGCTGGTGGTGGGAAGCTTCCTGGGGCTAGTGGTCCTCTGGTCTTCCCTGACCCCGCGGCCGG
ACGACCCAAGCCCGCTGAGCAGGATGAGGGCATGGAAGAGGAGATGGTTCGTGTTACGCAGTGGCCGTTTAACTGGAGATCCAGATGTTT
TGGAATATTACAAAAATGATCATGCCAAGAAGCCTATTCGTATTATTGATTTAAATTTATGTCAACAAGTAGATGCTGGATTGACATTTA
ACAAAAAAGAGTTTGAAAACAGCTACATTTTTGATATCAACACTATTGACCGGATTTTCTACTTGGTAGCAGACAGCGAGGAGGAGATGA
ATAAGTGGGTTCGTTGTATTTGTGACATCTGTGGGTTTAATCCAACAGAAGAAGATCCTGTGAAGCCACCTGGCAGCTCTTTACAAGCAC
CAGCTGATTTACCTTTAGCTATAAATACAGCACCACCATCCACCCAGGCAGATTCATCCTCTGCTACTCTACCTCCTCCATATCAGCTAA
TCAATGTTCCACCACACCTGGAAACTCTTGGCATTCAGGAGGATCCTCAAGACTACCTGTTGCTCATCAACTGTCAAAGCAAGAAGCCCG
AACCCACCAGAACGCATGCTGATTCTGCAAAATCCACCTCTTCTGAAACAGACTGCAATGATAACGTCCCTTCTCATAAAAATCCTGCTT
CCTCCCAGAGCAAACATGGAATGAATGGCTTTTTTCAGCAGCAAATGATATACGACTCTCCACCTTCACGTGCCCCATCTGCTTCAGTTG
ACTCCAGCCTTTATAACCTGCCCAGGAGTTATTCCCATGATGTTTTACCAAAGGTGTCTCCATCAAGTACTGAAGCAGATGGAGAACTCT
ATGTTTTTAATACCCCATCTGGGACATCGAGTGTAGAGACTCAAATGAGGCATGTATCTATTAGTTATGACATTCCTCCAACACCTGGTA
ATACTTATCAGATTCCACGAACATTTCCAGAAGGAACCTTGGGACAGACATCAAAGCTAGACACTATTCCAGATATTCCTCCACCTCGGC
CACCGAAACCACATCCAGCTCATGACCGATCTCCTGTGGAAACGTGTAGTATCCCACGCACCGCCTCAGACACTGACAGTAGTTACTGTA
TCCCTACAGCAGGGATGTCGCCTTCACGTAGTAATACCATTTCCACTGTGGATTTAAACAAATTGCGAAAAGATGCTAGTTCTCAAGACT
GCTATGATATTCCACGAGCATTTCCAAGTGATAGATCTAGTTCACTTGAAGGCTTCCATAACCACTTTAAAGTCAAAAATGTGTTGACAG
TGGGAAGTGTTTCAAGTGAAGAACTGGATGAAAATTACGTCCCAATGAATCCCAATTCACCACCACGACAACATTCCAGCAGTTTTACAG
AACCAATTCAGGAAGCAAATTATGTGCCAATGACTCCAGGAACATTTGATTTTTCCTCATTTGGAATGCAAGTTCCTCCTCCTGCTCATA
TGGGCTTCAGGTCCAGCCCAAAAACCCCTCCCAGAAGGCCAGTTCCTGTTGCAGACTGTGAACCACCCCCCGTGGATAGGAACCTCAAGC
CAGACAGAAAAGGTCAAAGTCCTAAAATTTTAAGACTCAAACCCCATGGTTTAGAGCGAACTGATTCACAAACCATAGGTGACTTTGCTA
CAAGAAGAAAGGTCAAGCCAGCGCCTTTAGAAATAAAACCTTTGCCAGAATGGGAAGAATTACAAGCCCCAGTTAGATCTCCCATCACTA
GGAGTTTTGCTCGAGACTCTTCCAGGTTTCCCATGTCCCCCCGACCAGATTCAGTGCATAGCACAACTTCAAGCAGTGACTCACACGACA
GTGAAGAGAATTATGTTCCCATGAACCCAAACCTGTCCAGTGAAGACCCAAATCTCTTTGGCAGTAACAGTCTTGATGGAGGAAGCAGCC
CTATGATCAAGCCCAAAGGAGACAAACAGGTGGAATACTTAGATCTCGACTTAGATTCTGGGAAATCCACACCACCACGTAAGCAAAAGA
GCAGTGGCTCAGGCAGCAGTGTAGCAGATGAGAGAGTGGATTATGTTGTTGTTGACCAACAGAAGACCTTGGCTCTAAAGAGTACCCGGG
AAGCCTGGACAGATGGGAGACAGTCCACAGAATCAGAAACGCCAGCGAAGAGTGTGAAATGAAAATATTGCCTTGCCATTTCTGAACAAA
AGAAAACTGAATTGTAAAGATAAATCCCTTTTGAAGAATGACTTGACACTTCCACTCTAGGTAGATCCTCAAATGAGTAGAGTTGAAGTC
AAAGGACCTTTCTGACATAATCAAGCAATTTAGACTTAAGTGGTGCTTTGTGGTATCTGAACAATTCATAACATGTAAATAATGTGGGAA
AATAGTATTGTTTAGCTCCCAGAGAAACATTTGTTCCACAGTTAACACACTCGTAGTATTACTGTATTTATGCACTTTTTCATCTAAAAC
ATTGTTCTGGGTTTTCCCAATGTACCTTACCATAATTCCTTTGGGAGTTCTTGTTTTTTGTCACACTACTTTATATAACAATACTAAGTC
AACTAAGCTACTTTTAGATTTGGAAATTGCTGTTTACAGTCTAACAACATTAAAATGAGAGGTAGATTCACAAGTTAGCTTTCTACCTGA
AGCTTCAGGTGATAACCATTAGCTTATACTTGGACTCATCATTTGTTGCCTTCCAAAATGCTGAGGATAATGTATGTACTGGTGTCAGGA
CCTAGTTCTCTGGTTAATGTACATTTAGTTTTTAATGGTGGAACTTTGTTATATTTTGTTAATTACAGTGTTTTTGGTTCATTGAGTGAA
GATTCTGCCGGGTGGGATCTTGCACCTTTGAAAGACTGAATAATTACACTACCAAGTAAGCCTGCAAATCATTGATGGCATGCAGTGATG
ATGTGCTCTTACACTTGTTAACATGTATTAAGTGTTATTTGCAAAAGGTAGATTATGTAACCAATCAGGTACGTACCAGGCAGTGATGTG
CTAATACACTGATCAGGTTTAGACAATGAGCTTTGGTTGTGTTCTTGTTAGTCCTAATATTGGTTTTCAGTTTGGAATTAATAAAGCAGT
TGACATTCACTGTTAGTTACAGCAACATACTGTGATTTTTAATTAGATAGTAATTCAGATTTATTACTCTATGAAATTCTGTCTTTTGAC
ACCATAGTGCCCTTTCTATGATTTTTTTTACTTAATATTCTTCTTGGCCTTATATTTAATTCCCTATGCAATTAATATTTTATATCTGCA
TTTTTTTAAAAAAAATAGATGTTATATAAGTGATTCTCGTATGTAGCACCTGTTGCTTTTCCACTGAAAGAATTACGGATTTTGTACTGT
GATTTATATTCACTGCCCCAATTCAAGAAATATTGGAGCCTTGCTACAATGTGAAATGTTATAGTCATGGACTCCTTCCAACCAGATTTC
TGAAAACACCAGAGGGATGGTATAATTCTGTCTCACCTATAACATGGTCCTGTGACATAGATATTAAGACCACAAGTTGTAGTGAGGCTA
CAATTATATTCGTCTGTCTTGGCTTTGCAACATAATTTAGAAAGCACGTATAGTTGTTTTTTAACCAAGTTACATACAATCTCATGTACT
GATTTGAGACTTATAACAATTTTTGGAGGGGGCATAGAGAAAGGAGTGCCCACAGTTGAGGCATGACCCCCTCCATTCAGACCTCTAACT
GTTGCCTGAGTACACAGATGTGCCCTGATTTCTGGCCCATTGGCCATAGTACTGTGCCTAATCAATGTAATAGGTTTATTTTCCCAATCC
TCAAACTAAAAATGTTCATAACAAGATGAATTGTAGACTAGTAACATTTGATGCTTTTAAATATTTGCTTCTTTTTAAACAAAAACTAAA
ACCCAGAAGTGAATTTTTAGGTGGATTTTTAAATAAAAAAGATTGATTGAGTTTGGTGTGCAAGCTGTTTTATAATGAAACAACAAAATG
AAATCTAAAATCCTGAAATGTGCCTAAACTATCAAAACACACGATACAGCTAATGTGTAAAGATGCTAAATTCTGTTACTTGGAGGATGA
ATATATTTAAGATTTAAAACACAATAATAAATACATGATTAATTCAAAAATAAAAATCTTTACAGCTGCCTATCAAGGGTCTAAAGCACT
TAATGAATGTTTTTAGTCTAACTTATCATTAACTTTTTACAAGTCACCATATTTGAAGATCTGTAGCACTCTGATTTTCAGAAAATTTTT
CATTCTGAATAATTTAAAAATGGTGATGTATTAGAAAGGCAGTTTGCTTTAGAAAACTAAATCACATTGAACATTGTATTAGAGAATTAA
ATTAAAAGTTTCTTACAGAGCAGTATTTTCCAAACATTTTTAGCACTAGAATCTTTTTAGATGAAATTTTATGTATAACCCCAATACATA
AAGCCTGAAAACTCAATTTTATCAATATAAATGTATTTTGGGTTCACATTTATGCTTATTCATTTTGGCTCATTACTAAGCATAATAAGA
TTCTGAGTTATTTCTGAATAACACAAATGTGGAGTTATACATAGTTGATGAAACCAGCAGCCAATTTATAGCTATGCCCTGTTTTATTTG
TATACTATCAAGAAAATTTTGATTCACACAAATGTAAGCAAAAATAATAGGTTTTAAACATACATCTCAGGAAATTCTTTAATTAGAGAT
AGCTAAAGTTATTCAAGGTCTATACAAAAATAAGTTATCCTGGTAGTGGAAGTTAATACATAAGCAGTCTCCAGTGTGGTAAAGTAGGGT
ATGTAACACATCAGAATGTGCGTTTTTATTAGGTTTTAAAATATGCACGTATAAAAACTAAATTTGAATCAAACCCTTTTAACTCACCTC
CAAGAAGCTAGACTTTGGCCAGGAATGGGCTAAAAACCACTGGTTAACGATGTGACAGTTATGATCTTGGAGATTGGAAATCTTTCTTCC
ACATTAGAGTTCTTTACCTTAATTCCTTATTCTGAAAAATTGTAAGATTTTATGAAGGTTTGAATACTGAAGCACAGTTCTGCTTTCAAA
AATTAAAATTCAAACTTGAAAAAGCTGTTTAACCCATGGAAGATATCATTTAGTAAGATGTAAAAGATTTTTTAAATCTACACTTCAGTT
TATACATCTTTATCATTATCAATACTATATAAGTTACTGTGAGCATTTTAGAGAATTCCATAAAGGTACTATGAGTGTGTCTGTATGTGT
GTGTATATATAGCATTGTATTTAATCATAGACTAAATTTAATTTGATATAGAAATACTACTTTACTTGTACATTAAGGTCATAATTTCTG
CTGGACTCTTTTATATTTAATTAATGGGGATTATAGTCTTCCTTCATAAATGCATTTAAACCTGAAATTGAACACCAGTGTTTTTCTTTT
TCTACTTATGGGAAGTTGTCTGCTTCCCCCTTTAGAGAAAACAGTATTTTTATATTTTGTTAAAATATTAACTACTTTATGCCTACACAC
TATGCTGTAGATACTGATCATAATTCTTGGGTGTTCACAAACACTCCTAGTGCCTCTTTTTTGGCCCGTTGAAAGTGTTGGTATTACTAC
TTTCACTACAGAGCCTTTGGCCCTCTAATAATGCTGAGGTGGGCTGATCCTTCCCATTTCTGTCTTCGGGTCATTCTGGTAGGTCTTCTC
CTCCACTGTCAAGTAAGCAATCAGGTCCGTGACAGGGATTGGACATATGAACAAATTAAGTGGATACACACAGTGAGAAAGATACATGCA
TTCTATGGTAACAACTACTGTCAATAACATCTGATGTTACATGCACATTTATATATATATAATTTTAAAAACTGAACTATGAGAAGCCAT
GGTATAAATGAATATTGTGGACATCATGGACTTGATATGATAGAAATCAATTGTCAGCTTGAGAAAGTTGTTTTTAATCTGTCTAAATAG
TTCATGCATTACTACAGTTAAAAATAGTTTCATTTGTCTTCTATAGACTTAATTTTATTCCGGTTCAGTATAATCTCTGTTAACAGAGTT
TCAGCAAACTGATTGGTCAAGGTATTAACATAGCTTCTACTTCCTTTACTTAAAAAGATGTGGTTTTATGTAAGTTCTTGATTACTGATG
ATCATCCCAAATTTTGACAACAAAATCATATGTATAAATTTATTTCTCCCCTCTTGTTCATCATCTTTTGTAAAGGTCCCATTGTAGATC
TTTTCTGCTACCAAATAAAACTTTTCAAACAATTTGGTTTCAAGACCTTAAATAGACAAGTTGGATACTAAGATTGTGAACTGATAAGGA
CATATAAATTTATATTTCCAGCCCTTCCTTAGAGTCTTTATCTGCATCAAAAACCCAATTCTGCCATTAACTGTGCTTCCCAGTCCCACC
TCTATATGTCACTCATTTTCTGCAACAAAGATCTCACTAAATCATGTTGAAACACAAGTCATGATCCTCTCTAAGTAAATAGAAAAAGCT
CCCTGGAAAAACTCTGTTGCCACATGCACGTGCCCTGTTACTCCTCCAGCCAGCCAGTGCTGCCAGCATTTTATTGTGTAAAAGTCCAAA
TAAATAAGGGCCTGCATGCAACCTTTATCTTCAGAAACTAGGTTTTATATGTAAAATGTGACTTGGGAAATGATTCTGTTTATTAACTGG
CTGGGATTTTTCATTTCTATGAAAGTTTCAAACATCTCCAGTACTTTATAAAATCCCAACAATTGCTGTAAGTCAGCACTTTGGTCCACT
CAGCCCACCCAGCCCACTTGCAACTCTGACTCTTCACTGAATCATATTTGGGAAGTTTGGGTAGGGTGAGGCTATCTTCTTCAAGATTAT
TTTCTCATATGTCTGTCTGTCACCTTGTAAACCATGAGACTCCTGGGTATTTGCATGTAACTTCTTTGAGGAAGTTACCACCATCTCTGA
TATAGACACACTTTTTGAGTTGCAGTTTCTGTTAGAATTTTTTGGAGACTAACTTGCCAATTCTGTGAATGTTATTGAATATTTAAAAAG
CTGGGTCTGTAATGGGAGGCATTTTATTAGCTGTTGTGATTGGGTAACATGTCCCCTTAGATTTCCTGATTTAAAATTATACAAAATTAC
TATTTTTGATAAAATAAAGGAACACCTACAGAAAATTAAGTTTCTAAGATGTTTCTATACTTCATTAGAAAAGATTTTATTACTATTACT
TATGGTTATTGGTGATTAACACTTAATGCGTCTCCTCTGATTTTGTGTTCCATGAGGTGCTTGGAACATTTGGAGTGCTCTGTGCGAGGG
ACATACAGTGATATAGGAAATTTAAAAATTAAAATAATACCCAAAACCCACTTTATCAGATATGGTATTGTGATGGTTAATATTATGTGT
CAACTTGGTGAGGCTATGGCGCCCATGTGTTTGGTCAAACACTAGCCTAGATGTTGCTGTGAATATATTTTGTAGATGTGATTAACATTT

>32304_32304_2_GALNT7-GAB1_GALNT7_chr4_174090112_ENST00000265000_GAB1_chr4_144336630_ENST00000262995_length(amino acids)=742AA_BP=20
MRLKIGFILRSLLVVGSFLGLVVLWSSLTPRPDDPSPLSRMRAWKRRWFVLRSGRLTGDPDVLEYYKNDHAKKPIRIIDLNLCQQVDAGL
TFNKKEFENSYIFDINTIDRIFYLVADSEEEMNKWVRCICDICGFNPTEEDPVKPPGSSLQAPADLPLAINTAPPSTQADSSSATLPPPY
QLINVPPHLETLGIQEDPQDYLLLINCQSKKPEPTRTHADSAKSTSSETDCNDNVPSHKNPASSQSKHGMNGFFQQQMIYDSPPSRAPSA
SVDSSLYNLPRSYSHDVLPKVSPSSTEADGELYVFNTPSGTSSVETQMRHVSISYDIPPTPGNTYQIPRTFPEGTLGQTSKLDTIPDIPP
PRPPKPHPAHDRSPVETCSIPRTASDTDSSYCIPTAGMSPSRSNTISTVDLNKLRKDASSQDCYDIPRAFPSDRSSSLEGFHNHFKVKNV
LTVGSVSSEELDENYVPMNPNSPPRQHSSSFTEPIQEANYVPMTPGTFDFSSFGMQVPPPAHMGFRSSPKTPPRRPVPVADCEPPPVDRN
LKPDRKGQSPKILRLKPHGLERTDSQTIGDFATRRKVKPAPLEIKPLPEWEELQAPVRSPITRSFARDSSRFPMSPRPDSVHSTTSSSDS
HDSEENYVPMNPNLSSEDPNLFGSNSLDGGSSPMIKPKGDKQVEYLDLDLDSGKSTPPRKQKSSGSGSSVADERVDYVVVDQQKTLALKS

--------------------------------------------------------------
>32304_32304_3_GALNT7-GAB1_GALNT7_chr4_174090112_ENST00000512285_GAB1_chr4_144336630_ENST00000262994_length(transcript)=2425nt_BP=151nt
GTGAGTCTCTCGCCGCCGGAGGAAGATGAGGCTGAAGATTGGGTTCATCTTACGCAGTTTGCTGGTGGTGGGAAGCTTCCTGGGGCTAGT
GGTCCTCTGGTCTTCCCTGACCCCGCGGCCGGACGACCCAAGCCCGCTGAGCAGGATGAGGGCATGGAAGAGGAGATGGTTCGTGTTACG
CAGTGGCCGTTTAACTGGAGATCCAGATGTTTTGGAATATTACAAAAATGATCATGCCAAGAAGCCTATTCGTATTATTGATTTAAATTT
ATGTCAACAAGTAGATGCTGGATTGACATTTAACAAAAAAGAGTTTGAAAACAGCTACATTTTTGATATCAACACTATTGACCGGATTTT
CTACTTGGTAGCAGACAGCGAGGAGGAGATGAATAAGTGGGTTCGTTGTATTTGTGACATCTGTGGGTTTAATCCAACAGAAGAAGATCC
TGTGAAGCCACCTGGCAGCTCTTTACAAGCACCAGCTGATTTACCTTTAGCTATAAATACAGCACCACCATCCACCCAGGCAGATTCATC
CTCTGCTACTCTACCTCCTCCATATCAGCTAATCAATGTTCCACCACACCTGGAAACTCTTGGCATTCAGGAGGATCCTCAAGACTACCT
GTTGCTCATCAACTGTCAAAGCAAGAAGCCCGAACCCACCAGAACGCATGCTGATTCTGCAAAATCCACCTCTTCTGAAACAGACTGCAA
TGATAACGTCCCTTCTCATAAAAATCCTGCTTCCTCCCAGAGCAAACATGGAATGAATGGCTTTTTTCAGCAGCAAATGATATACGACTC
TCCACCTTCACGTGCCCCATCTGCTTCAGTTGACTCCAGCCTTTATAACCTGCCCAGGAGTTATTCCCATGATGTTTTACCAAAGGTGTC
TCCATCAAGTACTGAAGCAGATGGAGAACTCTATGTTTTTAATACCCCATCTGGGACATCGAGTGTAGAGACTCAAATGAGGCATGTATC
TATTAGTTATGACATTCCTCCAACACCTGGTAATACTTATCAGATTCCACGAACATTTCCAGAAGGAACCTTGGGACAGACATCAAAGCT
AGACACTATTCCAGATATTCCTCCACCTCGGCCACCGAAACCACATCCAGCTCATGACCGATCTCCTGTGGAAACGTGTAGTATCCCACG
CACCGCCTCAGACACTGACAGTAGTTACTGTATCCCTACAGCAGGGATGTCGCCTTCACGTAGTAATACCATTTCCACTGTGGATTTAAA
CAAATTGCGAAAAGATGCTAGTTCTCAAGACTGCTATGATATTCCACGAGCATTTCCAAGTGATAGATCTAGTTCACTTGAAGGCTTCCA
TAACCACTTTAAAGTCAAAAATGTGTTGACAGTGGGAAGTGTTTCAAGTGAAGAACTGGATGAAAATTACGTCCCAATGAATCCCAATTC
ACCACCACGACAACATTCCAGCAGTTTTACAGAACCAATTCAGGAAGCAAATTATGTGCCAATGACTCCAGGAACATTTGATTTTTCCTC
ATTTGGAATGCAAGTTCCTCCTCCTGCTCATATGGGCTTCAGGTCCAGCCCAAAAACCCCTCCCAGAAGGCCAGTTCCTGTTGCAGACTG
TGAACCACCCCCCGTGGATAGGAACCTCAAGCCAGACAGAAAAGTCAAGCCAGCGCCTTTAGAAATAAAACCTTTGCCAGAATGGGAAGA
ATTACAAGCCCCAGTTAGATCTCCCATCACTAGGAGTTTTGCTCGAGACTCTTCCAGGTTTCCCATGTCCCCCCGACCAGATTCAGTGCA
TAGCACAACTTCAAGCAGTGACTCACACGACAGTGAAGAGAATTATGTTCCCATGAACCCAAACCTGTCCAGTGAAGACCCAAATCTCTT
TGGCAGTAACAGTCTTGATGGAGGAAGCAGCCCTATGATCAAGCCCAAAGGAGACAAACAGGTGGAATACTTAGATCTCGACTTAGATTC
TGGGAAATCCACACCACCACGTAAGCAAAAGAGCAGTGGCTCAGGCAGCAGTGTAGCAGATGAGAGAGTGGATTATGTTGTTGTTGACCA
ACAGAAGACCTTGGCTCTAAAGAGTACCCGGGAAGCCTGGACAGATGGGAGACAGTCCACAGAATCAGAAACGCCAGCGAAGAGTGTGAA
ATGAAAATATTGCCTTGCCATTTCTGAACAAAAGAAAACTGAATTGTAAAGATAAATCCCTTTTGAAGAATGACTTGACACTTCCACTCT
AGGTAGATCCTCAAATGAGTAGAGTTGAAGTCAAAGGACCTTTCTGACATAATCAAGCAATTTAGACTTAAGTGGTGCTTTGTGGTATCT

>32304_32304_3_GALNT7-GAB1_GALNT7_chr4_174090112_ENST00000512285_GAB1_chr4_144336630_ENST00000262994_length(amino acids)=712AA_BP=20
MRLKIGFILRSLLVVGSFLGLVVLWSSLTPRPDDPSPLSRMRAWKRRWFVLRSGRLTGDPDVLEYYKNDHAKKPIRIIDLNLCQQVDAGL
TFNKKEFENSYIFDINTIDRIFYLVADSEEEMNKWVRCICDICGFNPTEEDPVKPPGSSLQAPADLPLAINTAPPSTQADSSSATLPPPY
QLINVPPHLETLGIQEDPQDYLLLINCQSKKPEPTRTHADSAKSTSSETDCNDNVPSHKNPASSQSKHGMNGFFQQQMIYDSPPSRAPSA
SVDSSLYNLPRSYSHDVLPKVSPSSTEADGELYVFNTPSGTSSVETQMRHVSISYDIPPTPGNTYQIPRTFPEGTLGQTSKLDTIPDIPP
PRPPKPHPAHDRSPVETCSIPRTASDTDSSYCIPTAGMSPSRSNTISTVDLNKLRKDASSQDCYDIPRAFPSDRSSSLEGFHNHFKVKNV
LTVGSVSSEELDENYVPMNPNSPPRQHSSSFTEPIQEANYVPMTPGTFDFSSFGMQVPPPAHMGFRSSPKTPPRRPVPVADCEPPPVDRN
LKPDRKVKPAPLEIKPLPEWEELQAPVRSPITRSFARDSSRFPMSPRPDSVHSTTSSSDSHDSEENYVPMNPNLSSEDPNLFGSNSLDGG

--------------------------------------------------------------
>32304_32304_4_GALNT7-GAB1_GALNT7_chr4_174090112_ENST00000512285_GAB1_chr4_144336630_ENST00000262995_length(transcript)=7633nt_BP=151nt
GTGAGTCTCTCGCCGCCGGAGGAAGATGAGGCTGAAGATTGGGTTCATCTTACGCAGTTTGCTGGTGGTGGGAAGCTTCCTGGGGCTAGT
GGTCCTCTGGTCTTCCCTGACCCCGCGGCCGGACGACCCAAGCCCGCTGAGCAGGATGAGGGCATGGAAGAGGAGATGGTTCGTGTTACG
CAGTGGCCGTTTAACTGGAGATCCAGATGTTTTGGAATATTACAAAAATGATCATGCCAAGAAGCCTATTCGTATTATTGATTTAAATTT
ATGTCAACAAGTAGATGCTGGATTGACATTTAACAAAAAAGAGTTTGAAAACAGCTACATTTTTGATATCAACACTATTGACCGGATTTT
CTACTTGGTAGCAGACAGCGAGGAGGAGATGAATAAGTGGGTTCGTTGTATTTGTGACATCTGTGGGTTTAATCCAACAGAAGAAGATCC
TGTGAAGCCACCTGGCAGCTCTTTACAAGCACCAGCTGATTTACCTTTAGCTATAAATACAGCACCACCATCCACCCAGGCAGATTCATC
CTCTGCTACTCTACCTCCTCCATATCAGCTAATCAATGTTCCACCACACCTGGAAACTCTTGGCATTCAGGAGGATCCTCAAGACTACCT
GTTGCTCATCAACTGTCAAAGCAAGAAGCCCGAACCCACCAGAACGCATGCTGATTCTGCAAAATCCACCTCTTCTGAAACAGACTGCAA
TGATAACGTCCCTTCTCATAAAAATCCTGCTTCCTCCCAGAGCAAACATGGAATGAATGGCTTTTTTCAGCAGCAAATGATATACGACTC
TCCACCTTCACGTGCCCCATCTGCTTCAGTTGACTCCAGCCTTTATAACCTGCCCAGGAGTTATTCCCATGATGTTTTACCAAAGGTGTC
TCCATCAAGTACTGAAGCAGATGGAGAACTCTATGTTTTTAATACCCCATCTGGGACATCGAGTGTAGAGACTCAAATGAGGCATGTATC
TATTAGTTATGACATTCCTCCAACACCTGGTAATACTTATCAGATTCCACGAACATTTCCAGAAGGAACCTTGGGACAGACATCAAAGCT
AGACACTATTCCAGATATTCCTCCACCTCGGCCACCGAAACCACATCCAGCTCATGACCGATCTCCTGTGGAAACGTGTAGTATCCCACG
CACCGCCTCAGACACTGACAGTAGTTACTGTATCCCTACAGCAGGGATGTCGCCTTCACGTAGTAATACCATTTCCACTGTGGATTTAAA
CAAATTGCGAAAAGATGCTAGTTCTCAAGACTGCTATGATATTCCACGAGCATTTCCAAGTGATAGATCTAGTTCACTTGAAGGCTTCCA
TAACCACTTTAAAGTCAAAAATGTGTTGACAGTGGGAAGTGTTTCAAGTGAAGAACTGGATGAAAATTACGTCCCAATGAATCCCAATTC
ACCACCACGACAACATTCCAGCAGTTTTACAGAACCAATTCAGGAAGCAAATTATGTGCCAATGACTCCAGGAACATTTGATTTTTCCTC
ATTTGGAATGCAAGTTCCTCCTCCTGCTCATATGGGCTTCAGGTCCAGCCCAAAAACCCCTCCCAGAAGGCCAGTTCCTGTTGCAGACTG
TGAACCACCCCCCGTGGATAGGAACCTCAAGCCAGACAGAAAAGGTCAAAGTCCTAAAATTTTAAGACTCAAACCCCATGGTTTAGAGCG
AACTGATTCACAAACCATAGGTGACTTTGCTACAAGAAGAAAGGTCAAGCCAGCGCCTTTAGAAATAAAACCTTTGCCAGAATGGGAAGA
ATTACAAGCCCCAGTTAGATCTCCCATCACTAGGAGTTTTGCTCGAGACTCTTCCAGGTTTCCCATGTCCCCCCGACCAGATTCAGTGCA
TAGCACAACTTCAAGCAGTGACTCACACGACAGTGAAGAGAATTATGTTCCCATGAACCCAAACCTGTCCAGTGAAGACCCAAATCTCTT
TGGCAGTAACAGTCTTGATGGAGGAAGCAGCCCTATGATCAAGCCCAAAGGAGACAAACAGGTGGAATACTTAGATCTCGACTTAGATTC
TGGGAAATCCACACCACCACGTAAGCAAAAGAGCAGTGGCTCAGGCAGCAGTGTAGCAGATGAGAGAGTGGATTATGTTGTTGTTGACCA
ACAGAAGACCTTGGCTCTAAAGAGTACCCGGGAAGCCTGGACAGATGGGAGACAGTCCACAGAATCAGAAACGCCAGCGAAGAGTGTGAA
ATGAAAATATTGCCTTGCCATTTCTGAACAAAAGAAAACTGAATTGTAAAGATAAATCCCTTTTGAAGAATGACTTGACACTTCCACTCT
AGGTAGATCCTCAAATGAGTAGAGTTGAAGTCAAAGGACCTTTCTGACATAATCAAGCAATTTAGACTTAAGTGGTGCTTTGTGGTATCT
GAACAATTCATAACATGTAAATAATGTGGGAAAATAGTATTGTTTAGCTCCCAGAGAAACATTTGTTCCACAGTTAACACACTCGTAGTA
TTACTGTATTTATGCACTTTTTCATCTAAAACATTGTTCTGGGTTTTCCCAATGTACCTTACCATAATTCCTTTGGGAGTTCTTGTTTTT
TGTCACACTACTTTATATAACAATACTAAGTCAACTAAGCTACTTTTAGATTTGGAAATTGCTGTTTACAGTCTAACAACATTAAAATGA
GAGGTAGATTCACAAGTTAGCTTTCTACCTGAAGCTTCAGGTGATAACCATTAGCTTATACTTGGACTCATCATTTGTTGCCTTCCAAAA
TGCTGAGGATAATGTATGTACTGGTGTCAGGACCTAGTTCTCTGGTTAATGTACATTTAGTTTTTAATGGTGGAACTTTGTTATATTTTG
TTAATTACAGTGTTTTTGGTTCATTGAGTGAAGATTCTGCCGGGTGGGATCTTGCACCTTTGAAAGACTGAATAATTACACTACCAAGTA
AGCCTGCAAATCATTGATGGCATGCAGTGATGATGTGCTCTTACACTTGTTAACATGTATTAAGTGTTATTTGCAAAAGGTAGATTATGT
AACCAATCAGGTACGTACCAGGCAGTGATGTGCTAATACACTGATCAGGTTTAGACAATGAGCTTTGGTTGTGTTCTTGTTAGTCCTAAT
ATTGGTTTTCAGTTTGGAATTAATAAAGCAGTTGACATTCACTGTTAGTTACAGCAACATACTGTGATTTTTAATTAGATAGTAATTCAG
ATTTATTACTCTATGAAATTCTGTCTTTTGACACCATAGTGCCCTTTCTATGATTTTTTTTACTTAATATTCTTCTTGGCCTTATATTTA
ATTCCCTATGCAATTAATATTTTATATCTGCATTTTTTTAAAAAAAATAGATGTTATATAAGTGATTCTCGTATGTAGCACCTGTTGCTT
TTCCACTGAAAGAATTACGGATTTTGTACTGTGATTTATATTCACTGCCCCAATTCAAGAAATATTGGAGCCTTGCTACAATGTGAAATG
TTATAGTCATGGACTCCTTCCAACCAGATTTCTGAAAACACCAGAGGGATGGTATAATTCTGTCTCACCTATAACATGGTCCTGTGACAT
AGATATTAAGACCACAAGTTGTAGTGAGGCTACAATTATATTCGTCTGTCTTGGCTTTGCAACATAATTTAGAAAGCACGTATAGTTGTT
TTTTAACCAAGTTACATACAATCTCATGTACTGATTTGAGACTTATAACAATTTTTGGAGGGGGCATAGAGAAAGGAGTGCCCACAGTTG
AGGCATGACCCCCTCCATTCAGACCTCTAACTGTTGCCTGAGTACACAGATGTGCCCTGATTTCTGGCCCATTGGCCATAGTACTGTGCC
TAATCAATGTAATAGGTTTATTTTCCCAATCCTCAAACTAAAAATGTTCATAACAAGATGAATTGTAGACTAGTAACATTTGATGCTTTT
AAATATTTGCTTCTTTTTAAACAAAAACTAAAACCCAGAAGTGAATTTTTAGGTGGATTTTTAAATAAAAAAGATTGATTGAGTTTGGTG
TGCAAGCTGTTTTATAATGAAACAACAAAATGAAATCTAAAATCCTGAAATGTGCCTAAACTATCAAAACACACGATACAGCTAATGTGT
AAAGATGCTAAATTCTGTTACTTGGAGGATGAATATATTTAAGATTTAAAACACAATAATAAATACATGATTAATTCAAAAATAAAAATC
TTTACAGCTGCCTATCAAGGGTCTAAAGCACTTAATGAATGTTTTTAGTCTAACTTATCATTAACTTTTTACAAGTCACCATATTTGAAG
ATCTGTAGCACTCTGATTTTCAGAAAATTTTTCATTCTGAATAATTTAAAAATGGTGATGTATTAGAAAGGCAGTTTGCTTTAGAAAACT
AAATCACATTGAACATTGTATTAGAGAATTAAATTAAAAGTTTCTTACAGAGCAGTATTTTCCAAACATTTTTAGCACTAGAATCTTTTT
AGATGAAATTTTATGTATAACCCCAATACATAAAGCCTGAAAACTCAATTTTATCAATATAAATGTATTTTGGGTTCACATTTATGCTTA
TTCATTTTGGCTCATTACTAAGCATAATAAGATTCTGAGTTATTTCTGAATAACACAAATGTGGAGTTATACATAGTTGATGAAACCAGC
AGCCAATTTATAGCTATGCCCTGTTTTATTTGTATACTATCAAGAAAATTTTGATTCACACAAATGTAAGCAAAAATAATAGGTTTTAAA
CATACATCTCAGGAAATTCTTTAATTAGAGATAGCTAAAGTTATTCAAGGTCTATACAAAAATAAGTTATCCTGGTAGTGGAAGTTAATA
CATAAGCAGTCTCCAGTGTGGTAAAGTAGGGTATGTAACACATCAGAATGTGCGTTTTTATTAGGTTTTAAAATATGCACGTATAAAAAC
TAAATTTGAATCAAACCCTTTTAACTCACCTCCAAGAAGCTAGACTTTGGCCAGGAATGGGCTAAAAACCACTGGTTAACGATGTGACAG
TTATGATCTTGGAGATTGGAAATCTTTCTTCCACATTAGAGTTCTTTACCTTAATTCCTTATTCTGAAAAATTGTAAGATTTTATGAAGG
TTTGAATACTGAAGCACAGTTCTGCTTTCAAAAATTAAAATTCAAACTTGAAAAAGCTGTTTAACCCATGGAAGATATCATTTAGTAAGA
TGTAAAAGATTTTTTAAATCTACACTTCAGTTTATACATCTTTATCATTATCAATACTATATAAGTTACTGTGAGCATTTTAGAGAATTC
CATAAAGGTACTATGAGTGTGTCTGTATGTGTGTGTATATATAGCATTGTATTTAATCATAGACTAAATTTAATTTGATATAGAAATACT
ACTTTACTTGTACATTAAGGTCATAATTTCTGCTGGACTCTTTTATATTTAATTAATGGGGATTATAGTCTTCCTTCATAAATGCATTTA
AACCTGAAATTGAACACCAGTGTTTTTCTTTTTCTACTTATGGGAAGTTGTCTGCTTCCCCCTTTAGAGAAAACAGTATTTTTATATTTT
GTTAAAATATTAACTACTTTATGCCTACACACTATGCTGTAGATACTGATCATAATTCTTGGGTGTTCACAAACACTCCTAGTGCCTCTT
TTTTGGCCCGTTGAAAGTGTTGGTATTACTACTTTCACTACAGAGCCTTTGGCCCTCTAATAATGCTGAGGTGGGCTGATCCTTCCCATT
TCTGTCTTCGGGTCATTCTGGTAGGTCTTCTCCTCCACTGTCAAGTAAGCAATCAGGTCCGTGACAGGGATTGGACATATGAACAAATTA
AGTGGATACACACAGTGAGAAAGATACATGCATTCTATGGTAACAACTACTGTCAATAACATCTGATGTTACATGCACATTTATATATAT
ATAATTTTAAAAACTGAACTATGAGAAGCCATGGTATAAATGAATATTGTGGACATCATGGACTTGATATGATAGAAATCAATTGTCAGC
TTGAGAAAGTTGTTTTTAATCTGTCTAAATAGTTCATGCATTACTACAGTTAAAAATAGTTTCATTTGTCTTCTATAGACTTAATTTTAT
TCCGGTTCAGTATAATCTCTGTTAACAGAGTTTCAGCAAACTGATTGGTCAAGGTATTAACATAGCTTCTACTTCCTTTACTTAAAAAGA
TGTGGTTTTATGTAAGTTCTTGATTACTGATGATCATCCCAAATTTTGACAACAAAATCATATGTATAAATTTATTTCTCCCCTCTTGTT
CATCATCTTTTGTAAAGGTCCCATTGTAGATCTTTTCTGCTACCAAATAAAACTTTTCAAACAATTTGGTTTCAAGACCTTAAATAGACA
AGTTGGATACTAAGATTGTGAACTGATAAGGACATATAAATTTATATTTCCAGCCCTTCCTTAGAGTCTTTATCTGCATCAAAAACCCAA
TTCTGCCATTAACTGTGCTTCCCAGTCCCACCTCTATATGTCACTCATTTTCTGCAACAAAGATCTCACTAAATCATGTTGAAACACAAG
TCATGATCCTCTCTAAGTAAATAGAAAAAGCTCCCTGGAAAAACTCTGTTGCCACATGCACGTGCCCTGTTACTCCTCCAGCCAGCCAGT
GCTGCCAGCATTTTATTGTGTAAAAGTCCAAATAAATAAGGGCCTGCATGCAACCTTTATCTTCAGAAACTAGGTTTTATATGTAAAATG
TGACTTGGGAAATGATTCTGTTTATTAACTGGCTGGGATTTTTCATTTCTATGAAAGTTTCAAACATCTCCAGTACTTTATAAAATCCCA
ACAATTGCTGTAAGTCAGCACTTTGGTCCACTCAGCCCACCCAGCCCACTTGCAACTCTGACTCTTCACTGAATCATATTTGGGAAGTTT
GGGTAGGGTGAGGCTATCTTCTTCAAGATTATTTTCTCATATGTCTGTCTGTCACCTTGTAAACCATGAGACTCCTGGGTATTTGCATGT
AACTTCTTTGAGGAAGTTACCACCATCTCTGATATAGACACACTTTTTGAGTTGCAGTTTCTGTTAGAATTTTTTGGAGACTAACTTGCC
AATTCTGTGAATGTTATTGAATATTTAAAAAGCTGGGTCTGTAATGGGAGGCATTTTATTAGCTGTTGTGATTGGGTAACATGTCCCCTT
AGATTTCCTGATTTAAAATTATACAAAATTACTATTTTTGATAAAATAAAGGAACACCTACAGAAAATTAAGTTTCTAAGATGTTTCTAT
ACTTCATTAGAAAAGATTTTATTACTATTACTTATGGTTATTGGTGATTAACACTTAATGCGTCTCCTCTGATTTTGTGTTCCATGAGGT
GCTTGGAACATTTGGAGTGCTCTGTGCGAGGGACATACAGTGATATAGGAAATTTAAAAATTAAAATAATACCCAAAACCCACTTTATCA
GATATGGTATTGTGATGGTTAATATTATGTGTCAACTTGGTGAGGCTATGGCGCCCATGTGTTTGGTCAAACACTAGCCTAGATGTTGCT

>32304_32304_4_GALNT7-GAB1_GALNT7_chr4_174090112_ENST00000512285_GAB1_chr4_144336630_ENST00000262995_length(amino acids)=742AA_BP=20
MRLKIGFILRSLLVVGSFLGLVVLWSSLTPRPDDPSPLSRMRAWKRRWFVLRSGRLTGDPDVLEYYKNDHAKKPIRIIDLNLCQQVDAGL
TFNKKEFENSYIFDINTIDRIFYLVADSEEEMNKWVRCICDICGFNPTEEDPVKPPGSSLQAPADLPLAINTAPPSTQADSSSATLPPPY
QLINVPPHLETLGIQEDPQDYLLLINCQSKKPEPTRTHADSAKSTSSETDCNDNVPSHKNPASSQSKHGMNGFFQQQMIYDSPPSRAPSA
SVDSSLYNLPRSYSHDVLPKVSPSSTEADGELYVFNTPSGTSSVETQMRHVSISYDIPPTPGNTYQIPRTFPEGTLGQTSKLDTIPDIPP
PRPPKPHPAHDRSPVETCSIPRTASDTDSSYCIPTAGMSPSRSNTISTVDLNKLRKDASSQDCYDIPRAFPSDRSSSLEGFHNHFKVKNV
LTVGSVSSEELDENYVPMNPNSPPRQHSSSFTEPIQEANYVPMTPGTFDFSSFGMQVPPPAHMGFRSSPKTPPRRPVPVADCEPPPVDRN
LKPDRKGQSPKILRLKPHGLERTDSQTIGDFATRRKVKPAPLEIKPLPEWEELQAPVRSPITRSFARDSSRFPMSPRPDSVHSTTSSSDS
HDSEENYVPMNPNLSSEDPNLFGSNSLDGGSSPMIKPKGDKQVEYLDLDLDSGKSTPPRKQKSSGSGSSVADERVDYVVVDQQKTLALKS
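
The FASTA header above packs the fusion name, both partner genes, their chromosomes, breakpoint coordinates, and Ensembl transcript IDs into one underscore-delimited string. The following is a minimal Python sketch of splitting such a header into named fields. The field layout is inferred from this single record, and the function name, regular expression, and dictionary keys are hypothetical, not part of any FusionGDB tooling. The meaning of the trailing `BP` value is not stated here, so it is kept as a plain integer.

```python
import re

# Layout inferred from the one header shown in this record (an assumption);
# headers whose gene symbols contain "_" would not match this pattern.
HEADER_RE = re.compile(
    r"^>(?P<prefix>.+?)_(?P<fusion>[^_]+-[^_]+)"
    r"_(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)"
    r"_(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)"
    r"_length\(amino acids\)=(?P<aa_len>\d+)AA_BP=(?P<bp>\d+)$"
)


def parse_fusion_header(header: str) -> dict:
    """Split a fusion-protein FASTA header into named fields."""
    match = HEADER_RE.match(header.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {header!r}")
    fields = match.groupdict()
    # Convert the numeric fields from strings to integers.
    for key in ("hbp", "tbp", "aa_len", "bp"):
        fields[key] = int(fields[key])
    return fields


header = (
    ">32304_32304_4_GALNT7-GAB1_GALNT7_chr4_174090112_ENST00000512285"
    "_GAB1_chr4_144336630_ENST00000262995_length(amino acids)=742AA_BP=20"
)
fields = parse_fusion_header(header)
print(fields["hgene"], fields["hchr"], fields["hbp"])
print(fields["tgene"], fields["tchr"], fields["tbp"])
```

A regular expression is used instead of a plain `split("_")` because the leading identifier itself contains underscores; anchoring on the hyphenated fusion name keeps the remaining fields aligned.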

--------------------------------------------------------------


Fusion Gene PPI Analysis for GALNT7-GAB1


Go to ChiPPI (Chimeric Protein-Protein Interactions) to see the chimeric PPI interaction on the ChiPPI page.


Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene | Hgene's interactors | Tgene | Tgene's interactors


- Retained PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with


- Lost PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with


- Retained PPIs, but lost function due to frame-shift fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with



Related Drugs for GALNT7-GAB1


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status


Related Diseases for GALNT7-GAB1


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source
Tgene | | C0032460 | Polycystic Ovary Syndrome | 1 | CTD_human
Tgene | | C1136382 | Sclerocystic Ovaries | 1 | CTD_human
Tgene | | C1854275 | DEAFNESS, AUTOSOMAL RECESSIVE 26 | 1 | CTD_human;UNIPROT