Fusion Gene Studies
in Kim Lab


Center for Computational Systems Medicine

Fusion gene: B4GALT5-ERBB4 (FusionGDB2 ID: HG9334TG2066)

Fusion Gene Summary for B4GALT5-ERBB4

Fusion gene summary

Fusion gene information
  Fusion gene name: B4GALT5-ERBB4
  Fusion gene ID: hg9334tg2066

Hgene
  Gene symbol: B4GALT5
  Gene ID: 9334
  Gene name: beta-1,4-galactosyltransferase 5
  Synonyms: B4Gal-T5|BETA4-GALT-IV|beta4Gal-T5|beta4GalT-V|gt-V
  Cytomap: 20q13.13
  Type of gene: protein-coding
  Description: beta-1,4-galactosyltransferase 5|UDP-Gal:beta-GlcNAc beta-1,4-galactosyltransferase 5|UDP-Gal:betaGlcNAc beta 1,4-galactosyltransferase, polypeptide 5|UDP-galactose:beta-N-acetylglucosamine beta-1,4-galactosyltransferase 5|beta-1,4-GalT II|beta-1,4-GalT I
  Modification date: 20200313
  UniProtAcc: O43286
  * DoF score: 21 X 14 X 12 = 3528
  # samples: 25
  ** MAII score: log2(25/3528*10) = -3.81885056089543
  possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0

Tgene
  Gene symbol: ERBB4
  Gene ID: 2066
  Gene name: erb-b2 receptor tyrosine kinase 4
  Synonyms: ALS19|HER4|p180erbB4
  Cytomap: 2q34
  Type of gene: protein-coding
  Description: receptor tyrosine-protein kinase erbB-4|avian erythroblastic leukemia viral (v-erb-b2) oncogene homolog 4|human epidermal growth factor receptor 4|proto-oncogene-like protein c-ErbB-4|tyrosine kinase-type cell surface receptor HER4|v-erb-a erythroblastic
  Modification date: 20200327
  UniProtAcc: Q15303
  * DoF score: 18 X 16 X 8 = 2304
  # samples: 17
  ** MAII score: log2(17/2304*10) = -3.76053406530461
  possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0

Ensembl transcripts involved in fusion gene: ENST00000371711
Context

PubMed: B4GALT5 [Title/Abstract] AND ERBB4 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2((# samples / DoF score) * 10)
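
The two score definitions can be checked against the numbers reported for this fusion. The following is a minimal sketch, not FusionGDB code: the function names are hypothetical, and the sample counts assume that the flattened "# samples" cell reads as 25 for B4GALT5 and 17 for ERBB4, which is what the MAII expressions on this page use.

```python
import math

def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency: # partners x # break points x # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types

def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index: log2((# samples / DoF score) * 10)."""
    return math.log2(n_samples / dof * 10)

def is_peg(dof: int, maii: float) -> bool:
    """Page's stated criterion for a peGinPCFG: DoF > 8 and MAII < 0."""
    return dof > 8 and maii < 0

# Values taken from the summary for B4GALT5 (Hgene) and ERBB4 (Tgene).
h_dof = dof_score(21, 14, 12)
t_dof = dof_score(18, 16, 8)
h_maii = maii_score(25, h_dof)
t_maii = maii_score(17, t_dof)

print(h_dof, round(h_maii, 6), is_peg(h_dof, h_maii))
print(t_dof, round(t_maii, 6), is_peg(t_dof, t_maii))
```

Both partners satisfy the DoF>8 and MAII<0 criterion, in agreement with the annotation given for each gene.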

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
Partner  Gene   GO ID       GO term                                                           PubMed ID
Tgene    ERBB4  GO:0007165  signal transduction                                               10572067
Tgene    ERBB4  GO:0007169  transmembrane receptor protein tyrosine kinase signaling pathway  10353604|18334220
Tgene    ERBB4  GO:0016477  cell migration                                                    9135143
Tgene    ERBB4  GO:0018108  peptidyl-tyrosine phosphorylation                                 18334220
Tgene    ERBB4  GO:0046777  protein autophosphorylation                                       18334220



Fusion gene information
* All genome coordinates were lifted over to hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
Source     Disease  Sample  Hgene    Hchr   Hbp       Hstrand  Tgene  Tchr  Tbp        Tstrand
ChimerKB3  .        .       B4GALT5  chr20  48330113  -        ERBB4  chr2  212812341  -



Fusion Gene ORF analysis for B4GALT5-ERBB4

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORF        Henst            Tenst            Hgene    Hchr   Hbp       Hstrand  Tgene  Tchr  Tbp        Tstrand
5CDS-5UTR  ENST00000371711  ENST00000484474  B4GALT5  chr20  48330113  -        ERBB4  chr2  212812341  -
In-frame   ENST00000371711  ENST00000342788  B4GALT5  chr20  48330113  -        ERBB4  chr2  212812341  -
In-frame   ENST00000371711  ENST00000402597  B4GALT5  chr20  48330113  -        ERBB4  chr2  212812341  -
In-frame   ENST00000371711  ENST00000436443  B4GALT5  chr20  48330113  -        ERBB4  chr2  212812341  -

ORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
Henst            Hgene    Hchr   Hbp       Hstrand  Tenst            Tgene  Tchr  Tbp        Tstrand  Seq length (transcript)  BP loci (transcript)  Predicted start (transcript)  Predicted stop (transcript)  Seq length (amino acids)
ENST00000371711  B4GALT5  chr20  48330113  -        ENST00000436443  ERBB4  chr2  212812341  -        11842                    303                   357                           3947                         1196
ENST00000371711  B4GALT5  chr20  48330113  -        ENST00000342788  ERBB4  chr2  212812341  -        11890                    303                   357                           3995                         1212
ENST00000371711  B4GALT5  chr20  48330113  -        ENST00000402597  ERBB4  chr2  212812341  -        3966                     303                   357                           3965                         1202
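
The amino-acid lengths in this table follow from the predicted start and stop positions. The sketch below is an illustration under stated assumptions, not ORFfinder output: it assumes that start and stop are inclusive transcript positions in the same coordinate system and that the span includes the stop codon, and the function name is hypothetical.

```python
def orf_aa_length(start: int, stop: int) -> int:
    """Amino-acid length of an ORF spanning start..stop (both inclusive).

    Assumes the span ends with a stop codon, which is not counted
    as an amino acid.
    """
    n_nt = stop - start + 1
    if n_nt % 3 != 0:
        raise ValueError("ORF span is not a multiple of three nucleotides")
    return n_nt // 3 - 1

# (Tenst, predicted start, predicted stop) from the three in-frame rows.
rows = [
    ("ENST00000436443", 357, 3947),
    ("ENST00000342788", 357, 3995),
    ("ENST00000402597", 357, 3965),
]
lengths = {tenst: orf_aa_length(start, stop) for tenst, start, stop in rows}
print(lengths)
```

Under these assumptions the computed lengths are 1196, 1212, and 1202 amino acids, matching the last column of the table.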

DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on a convolutional neural network and a comparison with real Ribo-seq data. If the non-coding score is < 0.5 and the coding score is > 0.5, the in-frame fusion transcript is predicted as likely to be translated.
Henst  Tenst  Hgene  Hchr  Hbp  Hstrand  Tgene  Tchr  Tbp  Tstrand  Non-coding score  Coding score
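
The decision rule stated above can be written as a small predicate. This is only a sketch of the thresholding step: the function name is hypothetical, the example scores are made up for illustration, and the DeepORF model that produces the scores is not reproduced here.

```python
def is_likely_translated(non_coding_score: float, coding_score: float) -> bool:
    """Apply the page's rule: non-coding score < 0.5 and coding score > 0.5."""
    return non_coding_score < 0.5 and coding_score > 0.5

# Illustrative scores only; this page lists no DeepORF rows for B4GALT5-ERBB4.
print(is_likely_translated(0.2, 0.8))
print(is_likely_translated(0.7, 0.3))
```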


Fusion Genomic Features for B4GALT5-ERBB4


FusionAI prediction of the potential fusion gene breakpoint based on the pre-mRNA sequence context (+/- 5 kb for each partner gene, 20 kb of sequence in total). FusionAI is a fusion gene breakpoint classifier based on a convolutional neural network that compares the fusion-positive and fusion-negative sequence contexts of ~20K fusion genes. It gives the relative likelihood that each position in the 20 kb genomic sequence is used as a gene fusion breakpoint.
Hgene  Hchr  Hbp  Hstrand  Tgene  Tchr  Tbp  Tstrand  1-p  p (fusion gene breakpoint)



Fusion Protein Features for B4GALT5-ERBB4


Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr20:/chr2:)
- FGviewer provides online visualization of which protein functional features are retained, across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the Tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to find the hyperlink to the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.

Main function of each fusion partner protein (from UniProt).

Hgene: B4GALT5 (O43286)
FUNCTION: Catalyzes the synthesis of lactosylceramide (LacCer) via the transfer of galactose from UDP-galactose to glucosylceramide (GlcCer) (PubMed:24498430). LacCer is the starting point in the biosynthesis of all gangliosides (membrane-bound glycosphingolipids) which play pivotal roles in the CNS including neuronal maturation and axonal and myelin formation (By similarity). Plays a role in the glycosylation of BMPR1A and regulation of its protein stability (By similarity). Essential for extraembryonic development during early embryogenesis (By similarity). {ECO:0000250|UniProtKB:Q9JMK0, ECO:0000269|PubMed:24498430}.

Tgene: ERBB4 (Q15303)
FUNCTION: Tyrosine-protein kinase that plays an essential role as cell surface receptor for neuregulins and EGF family members and regulates development of the heart, the central nervous system and the mammary gland, gene transcription, cell proliferation, differentiation, migration and apoptosis. Required for normal cardiac muscle differentiation during embryonic development, and for postnatal cardiomyocyte proliferation. Required for normal development of the embryonic central nervous system, especially for normal neural crest cell migration and normal axon guidance. Required for mammary gland differentiation, induction of milk proteins and lactation. Acts as cell-surface receptor for the neuregulins NRG1, NRG2, NRG3 and NRG4 and the EGF family members BTC, EREG and HBEGF. Ligand binding triggers receptor dimerization and autophosphorylation at specific tyrosine residues that then serve as binding sites for scaffold proteins and effectors. Ligand specificity and signaling is modulated by alternative splicing, proteolytic processing, and by the formation of heterodimers with other ERBB family members, thereby creating multiple combinations of intracellular phosphotyrosines that trigger ligand- and context-specific cellular responses. Mediates phosphorylation of SHC1 and activation of the MAP kinases MAPK1/ERK2 and MAPK3/ERK1.
Isoform JM-A CYT-1 and isoform JM-B CYT-1 phosphorylate PIK3R1, leading to the activation of phosphatidylinositol 3-kinase and AKT1 and protect cells against apoptosis. Isoform JM-A CYT-1 and isoform JM-B CYT-1 mediate reorganization of the actin cytoskeleton and promote cell migration in response to NRG1. Isoform JM-A CYT-2 and isoform JM-B CYT-2 lack the phosphotyrosine that mediates interaction with PIK3R1, and hence do not phosphorylate PIK3R1, do not protect cells against apoptosis, and do not promote reorganization of the actin cytoskeleton and cell migration. Proteolytic processing of isoform JM-A CYT-1 and isoform JM-A CYT-2 gives rise to the corresponding soluble intracellular domains (4ICD) that translocate to the nucleus, promote nuclear import of STAT5A, activation of STAT5A, mammary epithelium differentiation, cell proliferation and activation of gene expression. The ERBB4 soluble intracellular domains (4ICD) colocalize with STAT5A at the CSN2 promoter to regulate transcription of milk proteins during lactation. The ERBB4 soluble intracellular domains can also translocate to mitochondria and promote apoptosis. {ECO:0000269|PubMed:10348342, ECO:0000269|PubMed:10353604, ECO:0000269|PubMed:10358079, ECO:0000269|PubMed:10722704, ECO:0000269|PubMed:10867024, ECO:0000269|PubMed:11178955, ECO:0000269|PubMed:11390655, ECO:0000269|PubMed:12807903, ECO:0000269|PubMed:15534001, ECO:0000269|PubMed:15746097, ECO:0000269|PubMed:16251361, ECO:0000269|PubMed:16778220, ECO:0000269|PubMed:16837552, ECO:0000269|PubMed:17486069, ECO:0000269|PubMed:17638867, ECO:0000269|PubMed:19098003, ECO:0000269|PubMed:20858735, ECO:0000269|PubMed:8383326, ECO:0000269|PubMed:8617750, ECO:0000269|PubMed:9135143, ECO:0000269|PubMed:9168115, ECO:0000269|PubMed:9334263}.

Retention analysis result of each fusion partner protein across 39 protein features of UniProt: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 region features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the break point is located before the CDS.
- In-frame and retained protein features among the 13 region features.
Partner  Gene  Hbp  Tbp  ENST  Strand  BPexon  TotalExon  Protein feature loci  *BPloci  TotalLen  Protein feature  Protein feature note

- In-frame and not-retained protein features among the 13 region features.
Partner  Gene  Hbp  Tbp  ENST  Strand  BPexon  TotalExon  Protein feature loci  *BPloci  TotalLen  Protein feature  Protein feature note



Fusion Gene Sequence for B4GALT5-ERBB4


For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To obtain the fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among all predicted ones.
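
The "longest ORF" selection can be illustrated with a short forward-strand scan. This is a simplified sketch and not the NCBI ORFfinder implementation: the function name and the toy sequence are hypothetical, only the three forward frames are searched, and ATG is the only start codon by default (ORFfinder can also be run with alternative initiation codons, which the `starts` parameter can stand in for).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def longest_orf(seq, starts=("ATG",)):
    """Return (start, end, n_aa) of the longest forward-strand ORF, or None.

    start is 0-based, end is exclusive and includes the stop codon,
    n_aa counts codons from the start codon up to (not including) the stop.
    """
    seq = seq.upper()
    best = None
    for frame in range(3):
        orf_start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if orf_start is None:
                if codon in starts:
                    orf_start = i          # open a new ORF at the first start codon
            elif codon in STOP_CODONS:
                n_aa = (i - orf_start) // 3
                if best is None or n_aa > best[2]:
                    best = (orf_start, i + 3, n_aa)
                orf_start = None           # close the ORF and keep scanning
    return best

# Toy sequence with two ORFs: ATG AAA TTT TAG (3 codons) and ATG CCC TGA (2 codons).
toy = "CC" + "ATGAAATTTTAG" + "G" + "ATGCCCTGA"
print(longest_orf(toy))
```

On the toy sequence the scan selects the three-codon ORF starting at offset 2. ORFs that run off the end of the sequence without a stop codon are ignored in this sketch.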
>8762_8762_1_B4GALT5-ERBB4_B4GALT5_chr20_48330113_ENST00000371711_ERBB4_chr2_212812341_ENST00000342788_length(transcript)=11890nt_BP=303nt
ATGTGGCCCGGCCCGCGACGGCCGGCGGCTGGGAGCGGCGAGGCGGCGGCGGCGGCGAGTGGCGGCCCGCGAGGCCCGGGAGGCGGTGGC
CGAGGCCCAGGCGGTGGCGGCGGCGGCCCAGGAGGCGGCGGACGGGGAGCTGCGGGAGCAGGCCCGGCCTGGCTCTCTAGCGGCCGCCTG
GCTGCAGCATGCGCGCCCGCCGGGGGCTGCTGCGGCTGCCGCGCCGCTCGCTGCTCGCCGCGCTCTTCTTCTTTTCTCTCTCGTCCTCGC
TGCTGTACTTCGTCTATGTGGCGCCCGGCATAGTCTGTTCGAGAAGTCACAGGCTACGTGTTAGTGGCTCTTAATCAGTTTCGTTACCTG
CCTCTGGAGAATTTACGCATTATTCGTGGGACAAAACTTTATGAGGATCGATATGCCTTGGCAATATTTTTAAACTACAGAAAAGATGGA
AACTTTGGACTTCAAGAACTTGGATTAAAGAACTTGACAGAAATCCTAAATGGTGGAGTCTATGTAGACCAGAACAAATTCCTTTGTTAT
GCAGACACCATTCATTGGCAAGATATTGTTCGGAACCCATGGCCTTCCAACTTGACTCTTGTGTCAACAAATGGTAGTTCAGGATGTGGA
CGTTGCCATAAGTCCTGTACTGGCCGTTGCTGGGGACCCACAGAAAATCATTGCCAGACTTTGACAAGGACGGTGTGTGCAGAACAATGT
GACGGCAGATGCTACGGACCTTACGTCAGTGACTGCTGCCATCGAGAATGTGCTGGAGGCTGCTCAGGACCTAAGGACACAGACTGCTTT
GCCTGCATGAATTTCAATGACAGTGGAGCATGTGTTACTCAGTGTCCCCAAACCTTTGTCTACAATCCAACCACCTTTCAACTGGAGCAC
AATTTCAATGCAAAGTACACATATGGAGCATTCTGTGTCAAGAAATGTCCACATAACTTTGTGGTAGATTCCAGTTCTTGTGTGCGTGCC
TGCCCTAGTTCCAAGATGGAAGTAGAAGAAAATGGGATTAAAATGTGTAAACCTTGCACTGACATTTGCCCAAAAGCTTGTGATGGCATT
GGCACAGGATCATTGATGTCAGCTCAGACTGTGGATTCCAGTAACATTGACAAATTCATAAACTGTACCAAGATCAATGGGAATTTGATC
TTTCTAGTCACTGGTATTCATGGGGACCCTTACAATGCAATTGAAGCCATAGACCCAGAGAAACTGAACGTCTTTCGGACAGTCAGAGAG
ATAACAGGTTTCCTGAACATACAGTCATGGCCACCAAACATGACTGACTTCAGTGTTTTTTCTAACCTGGTGACCATTGGTGGAAGAGTA
CTCTATAGTGGCCTGTCCTTGCTTATCCTCAAGCAACAGGGCATCACCTCTCTACAGTTCCAGTCCCTGAAGGAAATCAGCGCAGGAAAC
ATCTATATTACTGACAACAGCAACCTGTGTTATTATCATACCATTAACTGGACAACACTCTTCAGCACAATCAACCAGAGAATAGTAATC
CGGGACAACAGAAAAGCTGAAAATTGTACTGCTGAAGGAATGGTGTGCAACCATCTGTGTTCCAGTGATGGCTGTTGGGGACCTGGGCCA
GACCAATGTCTGTCGTGTCGCCGCTTCAGTAGAGGAAGGATCTGCATAGAGTCTTGTAACCTCTATGATGGTGAATTTCGGGAGTTTGAG
AATGGCTCCATCTGTGTGGAGTGTGACCCCCAGTGTGAGAAGATGGAAGATGGCCTCCTCACATGCCATGGACCGGGTCCTGACAACTGT
ACAAAGTGCTCTCATTTTAAAGATGGCCCAAACTGTGTGGAAAAATGTCCAGATGGCTTACAGGGGGCAAACAGTTTCATTTTCAAGTAT
GCTGATCCAGATCGGGAGTGCCACCCATGCCATCCAAACTGCACCCAAGGGTGTAACGGTCCCACTAGTCATGACTGCATTTACTACCCA
TGGACGGGCCATTCCACTTTACCACAACATGCTAGAACTCCCCTGATTGCAGCTGGAGTAATTGGTGGGCTCTTCATTCTGGTCATTGTG
GGTCTGACATTTGCTGTTTATGTTAGAAGGAAGAGCATCAAAAAGAAAAGAGCCTTGAGAAGATTCTTGGAAACAGAGTTGGTGGAACCA
TTAACTCCCAGTGGCACAGCACCCAATCAAGCTCAACTTCGTATTTTGAAAGAAACTGAGCTGAAGAGGGTAAAAGTCCTTGGCTCAGGT
GCTTTTGGAACGGTTTATAAAGGTATTTGGGTACCTGAAGGAGAAACTGTGAAGATTCCTGTGGCTATTAAGATTCTTAATGAGACAACT
GGTCCCAAGGCAAATGTGGAGTTCATGGATGAAGCTCTGATCATGGCAAGTATGGATCATCCACACCTAGTCCGGTTGCTGGGTGTGTGT
CTGAGCCCAACCATCCAGCTGGTTACTCAACTTATGCCCCATGGCTGCCTGTTGGAGTATGTCCACGAGCACAAGGATAACATTGGATCA
CAACTGCTGCTTAACTGGTGTGTCCAGATAGCTAAGGGAATGATGTACCTGGAAGAAAGACGACTCGTTCATCGGGATTTGGCAGCCCGT
AATGTCTTAGTGAAATCTCCAAACCATGTGAAAATCACAGATTTTGGGCTAGCCAGACTCTTGGAAGGAGATGAAAAAGAGTACAATGCT
GATGGAGGAAAGATGCCAATTAAATGGATGGCTCTGGAGTGTATACATTACAGGAAATTCACCCATCAGAGTGACGTTTGGAGCTATGGA
GTTACTATATGGGAACTGATGACCTTTGGAGGAAAACCCTATGATGGAATTCCAACGCGAGAAATCCCTGATTTATTAGAGAAAGGAGAA
CGTTTGCCTCAGCCTCCCATCTGCACTATTGACGTTTACATGGTCATGGTCAAATGTTGGATGATTGATGCTGACAGTAGACCTAAATTT
AAGGAACTGGCTGCTGAGTTTTCAAGGATGGCTCGAGACCCTCAAAGATACCTAGTTATTCAGGGTGATGATCGTATGAAGCTTCCCAGT
CCAAATGACAGCAAGTTCTTTCAGAATCTCTTGGATGAAGAGGATTTGGAAGATATGATGGATGCTGAGGAGTACTTGGTCCCTCAGGCT
TTCAACATCCCACCTCCCATCTATACTTCCAGAGCAAGAATTGACTCGAATAGGAGTGAAATTGGACACAGCCCTCCTCCTGCCTACACC
CCCATGTCAGGAAACCAGTTTGTATACCGAGATGGAGGTTTTGCTGCTGAACAAGGAGTGTCTGTGCCCTACAGAGCCCCAACTAGCACA
ATTCCAGAAGCTCCTGTGGCACAGGGTGCTACTGCTGAGATTTTTGATGACTCCTGCTGTAATGGCACCCTACGCAAGCCAGTGGCACCC
CATGTCCAAGAGGACAGTAGCACCCAGAGGTACAGTGCTGACCCCACCGTGTTTGCCCCAGAACGGAGCCCACGAGGAGAGCTGGATGAG
GAAGGTTACATGACTCCTATGCGAGACAAACCCAAACAAGAATACCTGAATCCAGTGGAGGAGAACCCTTTTGTTTCTCGGAGAAAAAAT
GGAGACCTTCAAGCATTGGATAATCCCGAATATCACAATGCATCCAATGGTCCACCCAAGGCCGAGGATGAGTATGTGAATGAGCCACTG
TACCTCAACACCTTTGCCAACACCTTGGGAAAAGCTGAGTACCTGAAGAACAACATACTGTCAATGCCAGAGAAGGCCAAGAAAGCGTTT
GACAACCCTGACTACTGGAACCACAGCCTGCCACCTCGGAGCACCCTTCAGCACCCAGACTACCTGCAGGAGTACAGCACAAAATATTTT
TATAAACAGAATGGGCGGATCCGGCCTATTGTGGCAGAGAATCCTGAATACCTCTCTGAGTTCTCCCTGAAGCCAGGCACTGTGCTGCCG
CCTCCACCTTACAGACACCGGAATACTGTGGTGTAAGCTCAGTTGTGGTTTTTTAGGTGGAGAGACACACCTGCTCCAATTTCCCCACCC
CCCTCTCTTTCTCTGGTGGTCTTCCTTCTACCCCAAGGCCAGTAGTTTTGACACTTCCCAGTGGAAGATACAGAGATGCAATGATAGTTA
TGTGCTTACCTAACTTGAACATTAGAGGGAAAGACTGAAAGAGAAAGATAGGAGGAACCACAATGTTTCTTCATTTCTCTGCATGGGTTG
GTCAGGAGAATGAAACAGCTAGAGAAGGACCAGAAAATGTAAGGCAATGCTGCCTACTATCAAACTAGCTGTCACTTTTTTTCTTTTTCT
TTTTCTTTCTTTGTTTCTTTCTTCCTCTTCTTTTTTTTTTTTTTTTTTAAAGCAGATGGTTGAAACACCCATGCTATCTGTTCCTATCTG
CAGGAACTGATGTGTGCATATTTAGCATCCCTGGAAATCATAATAAAGTTTCCATTAGAACAAAAGAATAACATTTTCTATAACATATGA
TGGTGTCTGAAATTGAGAATCCAGTTTCTTTCCCCAGCAGTTTCTGTCCTAGCAAGTAAGAATGGCCAACTCAACTTTCATAATTTAAAA
ATCTCCATTAAAGTTATAACTAGTAATTATGTTTTCAACACTTTTTGGTTTTTTTCATTTTGTTTTGCTCTGACCGATTCCTTTATATTT
GCTCCCCTATTTTTGGCTTTAATTTCTAATTGCAAAGATGTTTACATCAAAGCTTCTTCACAGAATTTAAGCAAGAAATATTTTAATATA
GTGAAATGGCCACTACTTTAAGTATACAATCTTTAAAATAAGAAAGGGAGGCTAATATTTTTCATGCTATCAAATTATCTTCACCCTCAT
CCTTTACATTTTTCAACATTTTTTTTTCTCCATAAATGACACTACTTGATAGGCCGTTGGTTGTCTGAAGAGTAGAAGGGAAACTAAGAG
ACAGTTCTCTGTGGTTCAGGAAAACTACTGATACTTTCAGGGGTGGCCCAATGAGGGAATCCATTGAACTGGAAGAAACACACTGGATTG
GGTATGTCTACCTGGCAGATACTCAGAAATGTAGTTTGCACTTAAGCTGTAATTTTATTTGTTCTTTTTCTGAACTCCATTTTGGATTTT
GAATCAAGCAATATGGAAGCAACCAGCAAATTAACTAATTTAAGTACATTTTTAAAAAAAGAGCTAAGATAAAGACTGTGGAAATGCCAA
ACCAAGCAAATTAGGAACCTTGCAACGGTATCCAGGGACTATGATGAGAGGCCAGCACATTATCTTCATATGTCACCTTTGCTACGCAAG
GAAATTTGTTCAGTTCGTATACTTCGTAAGAAGGAATGCGAGTAAGGATTGGCTTGAATTCCATGGAATTTCTAGTATGAGACTATTTAT
ATGAAGTAGAAGGTAACTCTTTGCACATAAATTGGTATAATAAAAAGAAAAACACAAACATTCAAAGCTTAGGGATAGGTCCTTGGGTCA
AAAGTTGTAAATAAATGTGAAACATCTTCTCATGCAATTATTTTATTATCCAACACACTAATCTTTTGATACTTTATATAATTCCCTTTC
TTCATATACTGCATCCAGTACTAGAACCATCATTATTATGTATCATTTTGAAAGAATACCTGATGAGATGAAGGATGAGAACAAATGACA
GAGATGAGTCTCCAAGTAAAGGGGGCCTCACATCAATAATTAGGAAACTTAGATATAAGTCGCCCTTTTCTGAAAATTCTACCCCAAGTC
ATTTAGATTTTTAAAAAATATTTCTAATGTTAAAATATTGGGACCAAATTAGAATCAATAGTATAAGATTAATTAATTAGAGTAAAAATA
TCTATTAAGGCAGAGAAAGTTTAGAGAAAAAAATCCAAAGAAATTTGTGTTTCTTCCTATTCTGAACAAGTAAATCCATCCATCCATCCA
TCCAAACCTCCTTTATCTAACTGTGTCTACTAAAAGCACCATGTTTTGTGGGGAACACTCAGATAAATGGAATATCATCCTCAACTTCAA
AATTCTATGATCTAGGAGATTTAATTAAAATGACATTTTAATTTTTCTATGCGTTCCAACAATCAGATTGCATAGTCTCTTTTGTGAATA
GCTGTCATATAATCAGTTGTACTGTAAGATATCTCCTTTAAACTCATTTGGGATATAAGTTAAACATCCTTCAAATTGTTGATGTTGACA
AACAGGATAATTTCAATAATATTATTCAAACATAAACTGGTCTAGGAGAATATTGCATCACTGACTAATTAGCCTATCTAGAGTCTAACT
TCACCATTAAACCAAAAGCAGATGGTGGTCCTTGGCCAAGAATATTGGAGACATTGGAGTTGGTTTTTTTCTAAGCTATAAGAAGTGAGG
CGAGCTGAAAAAGTATGGTAGAGCAGGAGAAGGGTTTGTGAGATTCCTTCTAGTGAAGTTCACCCTCAAACTTTTCAGGGGTAAAGACAC
AGAGTGATTCAGGGGCCACAATCTAATAGCTCAGGGCTCTCCTATCCATTCAGAGAAGTCTCTAGGAAAAGGGATCTCATATCAGTACTT
ATGAAAAATTGAATATAAGCCTCCCTTTCTAAATAAATCTGCATCGAGTCATCACAGCCCTCTTTTTGGATACTATACCTTGATTTTTTT
TTTCTGATTTACAATATGCATATGGTTTCTACTGGGCTATAGAAAGCAGAATCACTCATTTTGGAGAAGGAAAAAATGAATAGTTAAAAC
AAACTTTTAACTGTTAAGGTAACAGAAATGTATTTAGTGAATGTCTCTTTCCTCCTAAGAACACAAGACTTCTACATGTTGGGTAATACC
TAGAGATGCATGTAGGAATAATCCAAAATGACCCAAATGCTTTATAATAGCACCACTTTATAATTCTTTTGAATGATTTCTGTAGTATAT
AATTGACTTCAGTTGTTTGAGTGTTTTTTGTTTTATTTTTGTCCCCCCTGGGAAAACATATTTCAGCATGTATAAGAGGGAGAAAAAAAG
TTTCATTCCTTCCAGAGAATAACTTATTTAGTCCAGTAGGGTAGAATTTTAAAATGTCAGTTAAAGTCTTCAAAGTGCTTGGGGGGATAT
CAGATTCCAGAGGCCAATTGTAGCAATTGAAATTTGCAGAATCAATTATGTAAATCTGAGACAAATTAGTATTAAAATTACACGGAGTAT
ATTTTTTAAATCACCCAACTTTGTAGATTATACCTATTTTGGGCAGGTATGGAAAAATTTTGCAGTTAAATGATTGCCTAAAGAAAGTGG
TAAACAGGTGAGGAAAGATGGCCTCTGATCTAGGATAGATCCAGAACCACAAAGCATCTGCACCACAAAAGGTGTTAGACTACCAAGCAG
CTCCTGGTTTTCTGCATAGTATTAGTAGCACAGCTTAGGATGAGAATCCTTTCTCCAGTAACATTCTTAAAATAGCATGAAAAACAACGC
AAAACTCAAATTTCTATTAAAACACACAAACTAAAATCAAGTGATTCTTTTTTGTAGATTAGGGAGAAGGACTGAATATCTAATTTAAGA
GAAGGAATAGTGTTTAAGTGTTATAGTGTGTGAGCTAATACCTTCTAAAGGAAAGACATGGCATGAAGATTGTGCATACTTACAATGCTA
AGGAAAAATCAAGAAAAGGACTGTGTGAGGCTCTGCTACTAGATGAAGTTGGAAGGACTATTAATGTGCTTCTTGAAGTATCAAAAATGA
AAAGAAAATTAAAATTGTTTAAGCCTGACAGGGAAGGATGTAAATACAAGTTTTTCTAGAGCTCTCTAACCTTTATTTCAAAACTGGAAT
TATTCATCCATCTGTAATTGTTGATAATTTAACTAGTATATGTAGTTCATAAGGTAATAGAAAAGGTGATCATGAAAGCATGTATATAAC
TGGACAGAACCACGATAATGCTATAAGATGTAGATTTAGTTAGGTTATCAGATGTTAAATGATTTTAATATTATTAAATAAATCAAACTA
GAAAACTAACCACAAGTATAATGTAACAAAGTTAAATGCAGGATATAAAAATGTAGGATGGATTTTGCATAGTAAAAAGATAAGTTTGCC
ATTTAAAATTGTTGTTTGTTGGGTTTAGCTGAAAGTAGGCATATATGGTTCCACTTGGGAAAACTTGCTTTAAAGCATTACAATGAACAA
TTTTTTCTCATTCTCTTATTCCTTTATCACTTTTTAAATGTAAAGAAAATTGTATTTATTTATTTTTTTAAATAAACACCACCTTGCAGA
ATTTAATAGGCAAACATGTTACATATGACTAAGTAAGGGTCTTCAAGATGAAGTAAAGAAAATGTAAATGTTCTATTACCTTATGCAGAG
ACAAAAAAAAAAAGGAGTGGTGTCATTTAGCTAGCAAACAAACAAAATACAGTTAATTGGTGATATGTCCTTTCTTTTCTCACTATGCCC
TCTTGCCTCCAAAAATGACAACAAAGAATCACAATTTTTCTGATAAATAAATGCTAAACCAAGCGTTTCAAACTATTGCATTGCCATTCT
TTTGGACTTTAGTTATTAGAATGATGATTGTTATAGGGCAAATGAGAAATCCATGTGCATCAGCTTCTAGTTGTTAAAAAAACCAGATAA
ATTAACTTCTACTGTATACTGTGGGCAGAGGATCCTAGAGCTGATCCTACAACATCAGCTTCTAGTTGTTAAAAAAAAAAAAAGAAACAG
ATAAATTAACTTCTACTGTATATACTGTGGGCAGAGGATCTTACTGTGCCTCTGTTTGTGTACATGGACTTCGGTGTGTATCAGTTTGAA
GGACAGCCTTGCCCCATGTAAACATATAAATGCAGATTGGTATCGCCTGGTTGCTATTTGCTTAAGAACAAATATTATACAGATGAGATC
AGGCATAATTTTAAAAGATCATTATCAGTGGAGACCTCATTATTACTGATATTACAATGGGGCCAGTTTTTATACTTCTGGGTAGAATTA
ATAAAATTTTTCTGATCCCAGAGATCTGAGTTCTCTCTGCAGTTGGAAACAAGAAGCTGTTGTGGGCATTGTGTCGGGCCAGGGGCCCTT
GTGTTTGTGTGGGCAAATATCTTTTAGCAGTGTGAGCTGCTTTTTTCTTTTCATTAAAAGTCTCTCTAAAATAATAGAAATTTCAGATAC
TCGGTTCAAGTCTCACTGATTTTGTAGAGGTCCAAAAATGTAGGATCTGTCACTTTTGCAGGCCCCTGCCTCACCTAATTCCTGGCCAGG
TGACATTTTGGGCAGAAGTAAATGCTTCTATAGTCACAAGCTAAAATGACTCTAAGCCCCAATTTCACGGGGGGTATTCACATGCTTCCT
CTGGAAAATACTCTTTGACAGTCAGCTTTGCAAGTAAGTGATTACCTTGTTAGGAATCAAAGAAAAATGTATTTCTCTCTGACCTTTAGA
GGAAAATAGAATCCTTCCCTTTTTTGCCCATTGACACAACTGGCACTGCTCTCTTCCCTTTCTACCACCCTGGTTCAAAGTAGTCCCCCG
ATGCTGTCCTGTTCCTTTCTTAAGCCATAGTGGATCTCTGAGATCCTACACCCCACTTTGTGAAACACTGACTTCATCTTTGCCCTCGAA
TGCCTGATTTTTTCATAAGAGATTCTAGCAATTTGGACACTGTTTAAGTGAACTATCAAACTACCGCATAGAGAATATTTAAGCTATTAA
AATTATGGTTTCCCATGAAGATCAATTCTCTGTGTCCTTCCCTATAGGAATTTGAGACGAGTTAGCCCTGTGATGAATCTTGAAACTCAC
ATATGTCCACATACACTTGGTAGAACTTCGATTTAATCTTTACATAAAAGCTGTACATATAACCAAGAAGTTATTTTTGCCAGTAAATTA
ACTTATTTGCTTTATTCATCTTATTTGGTTCCTAATCGTAAATATTTTGTAGCTGCTGTAAATTTTTTTCTCCCAAATGAGGAGTCTTAT
TATCATAAAGGTAAAGGCTATTCAGCTTTGATAACCACCTGCAATTCTTTTTTGGATCATTCATCCATCTAACAAATACATAATGAGGAC
AGTTCATGTTAATGAAAATCCATGTTGTTTAATAGAATGCCATCCTTTACCTACTTTTGCTCTTTATGGACGTTTTTCTTTTCATGCTCT
AGTGAGCTTTCCCTATATCATGAGAAGTGGTTATATTTGTGCAAATATACAAATATAGGAAAACAAAGATTCATACCTGTAGGCAATAGT
CTAACTTGTCCAAACCACTTTGCCTTTACTGCTATTTTTATCCCCAATGCGTAGATATTTCCCCCAGGCCTATAGCCTTTGTGAAGGAAA
GCAAATCATACCTCCTGTATATTGACACGAATCTGGTTTTCAAATGTCATTTCCAGATTTTTTAGTTAATTGGGGGTTGTCCTTTTCCCT
TAATGTGAGAGTCATTTTCCTGTATATTTCTGGATCTCTCAGGGGCTGGGAGGGGGGAGTGAGGGGACTACAACCATAGCACTCCAAGAA
CCCTTTTGGGATTACTCCAGTAATCAACTACGAAAGTTATTTTCTAAATGTAGATATGTAAGGTGTTCTTTTAAAGTAAGGTACTTTGAA
ATATGTAGCATAAACTGGTACTGCTGTTAAATGGGTCGATTATTAAACGGAGCAGCTGTGTGAGGGCAGCTAACTTTGAATGCCTGTCTC
CCTGGCTGGTGTGTCTCCTTCTCATGTTGAGAGCACCAGGGATTGCGTGGCTGCATGCTGAAACCGCATTTTCCCATGGTGTATGACTAG
TTCATCTCTTTCTTGAGCACCATTACAAGAAGATCAAATGAAAATGAGATCAATGTGGAAGACAATTCATAGCACAAAAAAAGTCATCTT
AAATCTACTCTCAAACATTCATCTTATACATGCATCAAAGTAATTTACTGACATCAGTTTGGGTGAGAGAGGGAGTCACTTTACTGAAAA
GGCAGAGGCTTAAGGTGTATACATTTGTACTCACTTCCTTATTTTCTTAACTTGTAAGCAGAAAACAAGCCCTCTCTCTTGTGAAGTATC
TTCAAAGGATTGGGGTGCAAAAATACCTTGCTGGTAAGCCATCAATGTTTTATTTAAATCCCTGCATTCAAAGTTAGCTGCCTTTTTGAA
ATAAACAAACAAAAAATACTACTGTATGTTTGAAAATGTGAATAGTATTTTTATAGCTTGTTAAAGACATGGCTAGTTGCATTTGTAAAT
AAGTATAATGTTGCTTTGATTTTCTTTTGTGGACATCTTTATTTGGAACATAATTGTCTTTAGGGTTGATTTGTATATAAGTAATTGGCC
TGTGATTGTTTCTTTTTTGGTTGGAAGTTATCATTTTGACATTACTTGTGATTCTGTGTTCAGCACTATTGTGATGTGTTCAACCTCTGC
ACTCGCTTACACAATAGGATATGCCAATTGTGTGTGGTGTAATGTTATTTTGATTTTTTTCCATGTTATTGATGAAGGATCATGCACCTA
ACACATACTAACTTTTTTAATGTTAGGCATATTTTTAGTATACTTTCTCTTATTCTTTCTTCTCCTCCAACCTTTTACCCATCCTCCTTC
CTTTCCCTCATTCCTGTTGTTATTTGAGAATGAGGGAGAAACAGTATTTTACATTTATGTAATTAGGCTTTTCCGTTAGTTCTCAAGGAT
CCTCTTTTGGCTCTTGGGAAAGAATTGTACCTGTACAAGGCAATTATAGAATGCGAACTGCTTTGCCTCATTCCATACTGATCATCCCAG
CTGAACAATTTGAAAACTGTTCTGCCTTTTTGTTACATGAATCTGTCAGAAATATATTTTTAATTTAATATAAATGAAATTCAATAAAAT

>8762_8762_1_B4GALT5-ERBB4_B4GALT5_chr20_48330113_ENST00000371711_ERBB4_chr2_212812341_ENST00000342788_length(amino acids)=1212AA_BP=
MPLENLRIIRGTKLYEDRYALAIFLNYRKDGNFGLQELGLKNLTEILNGGVYVDQNKFLCYADTIHWQDIVRNPWPSNLTLVSTNGSSGC
GRCHKSCTGRCWGPTENHCQTLTRTVCAEQCDGRCYGPYVSDCCHRECAGGCSGPKDTDCFACMNFNDSGACVTQCPQTFVYNPTTFQLE
HNFNAKYTYGAFCVKKCPHNFVVDSSSCVRACPSSKMEVEENGIKMCKPCTDICPKACDGIGTGSLMSAQTVDSSNIDKFINCTKINGNL
IFLVTGIHGDPYNAIEAIDPEKLNVFRTVREITGFLNIQSWPPNMTDFSVFSNLVTIGGRVLYSGLSLLILKQQGITSLQFQSLKEISAG
NIYITDNSNLCYYHTINWTTLFSTINQRIVIRDNRKAENCTAEGMVCNHLCSSDGCWGPGPDQCLSCRRFSRGRICIESCNLYDGEFREF
ENGSICVECDPQCEKMEDGLLTCHGPGPDNCTKCSHFKDGPNCVEKCPDGLQGANSFIFKYADPDRECHPCHPNCTQGCNGPTSHDCIYY
PWTGHSTLPQHARTPLIAAGVIGGLFILVIVGLTFAVYVRRKSIKKKRALRRFLETELVEPLTPSGTAPNQAQLRILKETELKRVKVLGS
GAFGTVYKGIWVPEGETVKIPVAIKILNETTGPKANVEFMDEALIMASMDHPHLVRLLGVCLSPTIQLVTQLMPHGCLLEYVHEHKDNIG
SQLLLNWCVQIAKGMMYLEERRLVHRDLAARNVLVKSPNHVKITDFGLARLLEGDEKEYNADGGKMPIKWMALECIHYRKFTHQSDVWSY
GVTIWELMTFGGKPYDGIPTREIPDLLEKGERLPQPPICTIDVYMVMVKCWMIDADSRPKFKELAAEFSRMARDPQRYLVIQGDDRMKLP
SPNDSKFFQNLLDEEDLEDMMDAEEYLVPQAFNIPPPIYTSRARIDSNRSEIGHSPPPAYTPMSGNQFVYRDGGFAAEQGVSVPYRAPTS
TIPEAPVAQGATAEIFDDSCCNGTLRKPVAPHVQEDSSTQRYSADPTVFAPERSPRGELDEEGYMTPMRDKPKQEYLNPVEENPFVSRRK
NGDLQALDNPEYHNASNGPPKAEDEYVNEPLYLNTFANTLGKAEYLKNNILSMPEKAKKAFDNPDYWNHSLPPRSTLQHPDYLQEYSTKY

--------------------------------------------------------------
>8762_8762_2_B4GALT5-ERBB4_B4GALT5_chr20_48330113_ENST00000371711_ERBB4_chr2_212812341_ENST00000402597_length(transcript)=3966nt_BP=303nt
ATGTGGCCCGGCCCGCGACGGCCGGCGGCTGGGAGCGGCGAGGCGGCGGCGGCGGCGAGTGGCGGCCCGCGAGGCCCGGGAGGCGGTGGC
CGAGGCCCAGGCGGTGGCGGCGGCGGCCCAGGAGGCGGCGGACGGGGAGCTGCGGGAGCAGGCCCGGCCTGGCTCTCTAGCGGCCGCCTG
GCTGCAGCATGCGCGCCCGCCGGGGGCTGCTGCGGCTGCCGCGCCGCTCGCTGCTCGCCGCGCTCTTCTTCTTTTCTCTCTCGTCCTCGC
TGCTGTACTTCGTCTATGTGGCGCCCGGCATAGTCTGTTCGAGAAGTCACAGGCTACGTGTTAGTGGCTCTTAATCAGTTTCGTTACCTG
CCTCTGGAGAATTTACGCATTATTCGTGGGACAAAACTTTATGAGGATCGATATGCCTTGGCAATATTTTTAAACTACAGAAAAGATGGA
AACTTTGGACTTCAAGAACTTGGATTAAAGAACTTGACAGAAATCCTAAATGGTGGAGTCTATGTAGACCAGAACAAATTCCTTTGTTAT
GCAGACACCATTCATTGGCAAGATATTGTTCGGAACCCATGGCCTTCCAACTTGACTCTTGTGTCAACAAATGGTAGTTCAGGATGTGGA
CGTTGCCATAAGTCCTGTACTGGCCGTTGCTGGGGACCCACAGAAAATCATTGCCAGACTTTGACAAGGACGGTGTGTGCAGAACAATGT
GACGGCAGATGCTACGGACCTTACGTCAGTGACTGCTGCCATCGAGAATGTGCTGGAGGCTGCTCAGGACCTAAGGACACAGACTGCTTT
GCCTGCATGAATTTCAATGACAGTGGAGCATGTGTTACTCAGTGTCCCCAAACCTTTGTCTACAATCCAACCACCTTTCAACTGGAGCAC
AATTTCAATGCAAAGTACACATATGGAGCATTCTGTGTCAAGAAATGTCCACATAACTTTGTGGTAGATTCCAGTTCTTGTGTGCGTGCC
TGCCCTAGTTCCAAGATGGAAGTAGAAGAAAATGGGATTAAAATGTGTAAACCTTGCACTGACATTTGCCCAAAAGCTTGTGATGGCATT
GGCACAGGATCATTGATGTCAGCTCAGACTGTGGATTCCAGTAACATTGACAAATTCATAAACTGTACCAAGATCAATGGGAATTTGATC
TTTCTAGTCACTGGTATTCATGGGGACCCTTACAATGCAATTGAAGCCATAGACCCAGAGAAACTGAACGTCTTTCGGACAGTCAGAGAG
ATAACAGGTTTCCTGAACATACAGTCATGGCCACCAAACATGACTGACTTCAGTGTTTTTTCTAACCTGGTGACCATTGGTGGAAGAGTA
CTCTATAGTGGCCTGTCCTTGCTTATCCTCAAGCAACAGGGCATCACCTCTCTACAGTTCCAGTCCCTGAAGGAAATCAGCGCAGGAAAC
ATCTATATTACTGACAACAGCAACCTGTGTTATTATCATACCATTAACTGGACAACACTCTTCAGCACAATCAACCAGAGAATAGTAATC
CGGGACAACAGAAAAGCTGAAAATTGTACTGCTGAAGGAATGGTGTGCAACCATCTGTGTTCCAGTGATGGCTGTTGGGGACCTGGGCCA
GACCAATGTCTGTCGTGTCGCCGCTTCAGTAGAGGAAGGATCTGCATAGAGTCTTGTAACCTCTATGATGGTGAATTTCGGGAGTTTGAG
AATGGCTCCATCTGTGTGGAGTGTGACCCCCAGTGTGAGAAGATGGAAGATGGCCTCCTCACATGCCATGGACCGGGTCCTGACAACTGT
ACAAAGTGCTCTCATTTTAAAGATGGCCCAAACTGTGTGGAAAAATGTCCAGATGGCTTACAGGGGGCAAACAGTTTCATTTTCAAGTAT
GCTGATCCAGATCGGGAGTGCCACCCATGCCATCCAAACTGCACCCAAGGGTGCATAGGCTCAAGTATTGAAGACTGCATCGGCCTGATG
GATAGAACTCCCCTGATTGCAGCTGGAGTAATTGGTGGGCTCTTCATTCTGGTCATTGTGGGTCTGACATTTGCTGTTTATGTTAGAAGG
AAGAGCATCAAAAAGAAAAGAGCCTTGAGAAGATTCTTGGAAACAGAGTTGGTGGAACCATTAACTCCCAGTGGCACAGCACCCAATCAA
GCTCAACTTCGTATTTTGAAAGAAACTGAGCTGAAGAGGGTAAAAGTCCTTGGCTCAGGTGCTTTTGGAACGGTTTATAAAGGTATTTGG
GTACCTGAAGGAGAAACTGTGAAGATTCCTGTGGCTATTAAGATTCTTAATGAGACAACTGGTCCCAAGGCAAATGTGGAGTTCATGGAT
GAAGCTCTGATCATGGCAAGTATGGATCATCCACACCTAGTCCGGTTGCTGGGTGTGTGTCTGAGCCCAACCATCCAGCTGGTTACTCAA
CTTATGCCCCATGGCTGCCTGTTGGAGTATGTCCACGAGCACAAGGATAACATTGGATCACAACTGCTGCTTAACTGGTGTGTCCAGATA
GCTAAGGGAATGATGTACCTGGAAGAAAGACGACTCGTTCATCGGGATTTGGCAGCCCGTAATGTCTTAGTGAAATCTCCAAACCATGTG
AAAATCACAGATTTTGGGCTAGCCAGACTCTTGGAAGGAGATGAAAAAGAGTACAATGCTGATGGAGGAAAGATGCCAATTAAATGGATG
GCTCTGGAGTGTATACATTACAGGAAATTCACCCATCAGAGTGACGTTTGGAGCTATGGAGTTACTATATGGGAACTGATGACCTTTGGA
GGAAAACCCTATGATGGAATTCCAACGCGAGAAATCCCTGATTTATTAGAGAAAGGAGAACGTTTGCCTCAGCCTCCCATCTGCACTATT
GACGTTTACATGGTCATGGTCAAATGTTGGATGATTGATGCTGACAGTAGACCTAAATTTAAGGAACTGGCTGCTGAGTTTTCAAGGATG
GCTCGAGACCCTCAAAGATACCTAGTTATTCAGGGTGATGATCGTATGAAGCTTCCCAGTCCAAATGACAGCAAGTTCTTTCAGAATCTC
TTGGATGAAGAGGATTTGGAAGATATGATGGATGCTGAGGAGTACTTGGTCCCTCAGGCTTTCAACATCCCACCTCCCATCTATACTTCC
AGAGCAAGAATTGACTCGAATAGGAGTGAAATTGGACACAGCCCTCCTCCTGCCTACACCCCCATGTCAGGAAACCAGTTTGTATACCGA
GATGGAGGTTTTGCTGCTGAACAAGGAGTGTCTGTGCCCTACAGAGCCCCAACTAGCACAATTCCAGAAGCTCCTGTGGCACAGGGTGCT
ACTGCTGAGATTTTTGATGACTCCTGCTGTAATGGCACCCTACGCAAGCCAGTGGCACCCCATGTCCAAGAGGACAGTAGCACCCAGAGG
TACAGTGCTGACCCCACCGTGTTTGCCCCAGAACGGAGCCCACGAGGAGAGCTGGATGAGGAAGGTTACATGACTCCTATGCGAGACAAA
CCCAAACAAGAATACCTGAATCCAGTGGAGGAGAACCCTTTTGTTTCTCGGAGAAAAAATGGAGACCTTCAAGCATTGGATAATCCCGAA
TATCACAATGCATCCAATGGTCCACCCAAGGCCGAGGATGAGTATGTGAATGAGCCACTGTACCTCAACACCTTTGCCAACACCTTGGGA
AAAGCTGAGTACCTGAAGAACAACATACTGTCAATGCCAGAGAAGGCCAAGAAAGCGTTTGACAACCCTGACTACTGGAACCACAGCCTG
CCACCTCGGAGCACCCTTCAGCACCCAGACTACCTGCAGGAGTACAGCACAAAATATTTTTATAAACAGAATGGGCGGATCCGGCCTATT
GTGGCAGAGAATCCTGAATACCTCTCTGAGTTCTCCCTGAAGCCAGGCACTGTGCTGCCGCCTCCACCTTACAGACACCGGAATACTGTG

>8762_8762_2_B4GALT5-ERBB4_B4GALT5_chr20_48330113_ENST00000371711_ERBB4_chr2_212812341_ENST00000402597_length(amino acids)=1202AA_BP=
MPLENLRIIRGTKLYEDRYALAIFLNYRKDGNFGLQELGLKNLTEILNGGVYVDQNKFLCYADTIHWQDIVRNPWPSNLTLVSTNGSSGC
GRCHKSCTGRCWGPTENHCQTLTRTVCAEQCDGRCYGPYVSDCCHRECAGGCSGPKDTDCFACMNFNDSGACVTQCPQTFVYNPTTFQLE
HNFNAKYTYGAFCVKKCPHNFVVDSSSCVRACPSSKMEVEENGIKMCKPCTDICPKACDGIGTGSLMSAQTVDSSNIDKFINCTKINGNL
IFLVTGIHGDPYNAIEAIDPEKLNVFRTVREITGFLNIQSWPPNMTDFSVFSNLVTIGGRVLYSGLSLLILKQQGITSLQFQSLKEISAG
NIYITDNSNLCYYHTINWTTLFSTINQRIVIRDNRKAENCTAEGMVCNHLCSSDGCWGPGPDQCLSCRRFSRGRICIESCNLYDGEFREF
ENGSICVECDPQCEKMEDGLLTCHGPGPDNCTKCSHFKDGPNCVEKCPDGLQGANSFIFKYADPDRECHPCHPNCTQGCIGSSIEDCIGL
MDRTPLIAAGVIGGLFILVIVGLTFAVYVRRKSIKKKRALRRFLETELVEPLTPSGTAPNQAQLRILKETELKRVKVLGSGAFGTVYKGI
WVPEGETVKIPVAIKILNETTGPKANVEFMDEALIMASMDHPHLVRLLGVCLSPTIQLVTQLMPHGCLLEYVHEHKDNIGSQLLLNWCVQ
IAKGMMYLEERRLVHRDLAARNVLVKSPNHVKITDFGLARLLEGDEKEYNADGGKMPIKWMALECIHYRKFTHQSDVWSYGVTIWELMTF
GGKPYDGIPTREIPDLLEKGERLPQPPICTIDVYMVMVKCWMIDADSRPKFKELAAEFSRMARDPQRYLVIQGDDRMKLPSPNDSKFFQN
LLDEEDLEDMMDAEEYLVPQAFNIPPPIYTSRARIDSNRSEIGHSPPPAYTPMSGNQFVYRDGGFAAEQGVSVPYRAPTSTIPEAPVAQG
ATAEIFDDSCCNGTLRKPVAPHVQEDSSTQRYSADPTVFAPERSPRGELDEEGYMTPMRDKPKQEYLNPVEENPFVSRRKNGDLQALDNP
EYHNASNGPPKAEDEYVNEPLYLNTFANTLGKAEYLKNNILSMPEKAKKAFDNPDYWNHSLPPRSTLQHPDYLQEYSTKYFYKQNGRIRP

--------------------------------------------------------------
>8762_8762_3_B4GALT5-ERBB4_B4GALT5_chr20_48330113_ENST00000371711_ERBB4_chr2_212812341_ENST00000436443_length(transcript)=11842nt_BP=303nt
ATGTGGCCCGGCCCGCGACGGCCGGCGGCTGGGAGCGGCGAGGCGGCGGCGGCGGCGAGTGGCGGCCCGCGAGGCCCGGGAGGCGGTGGC
CGAGGCCCAGGCGGTGGCGGCGGCGGCCCAGGAGGCGGCGGACGGGGAGCTGCGGGAGCAGGCCCGGCCTGGCTCTCTAGCGGCCGCCTG
GCTGCAGCATGCGCGCCCGCCGGGGGCTGCTGCGGCTGCCGCGCCGCTCGCTGCTCGCCGCGCTCTTCTTCTTTTCTCTCTCGTCCTCGC
TGCTGTACTTCGTCTATGTGGCGCCCGGCATAGTCTGTTCGAGAAGTCACAGGCTACGTGTTAGTGGCTCTTAATCAGTTTCGTTACCTG
CCTCTGGAGAATTTACGCATTATTCGTGGGACAAAACTTTATGAGGATCGATATGCCTTGGCAATATTTTTAAACTACAGAAAAGATGGA
AACTTTGGACTTCAAGAACTTGGATTAAAGAACTTGACAGAAATCCTAAATGGTGGAGTCTATGTAGACCAGAACAAATTCCTTTGTTAT
GCAGACACCATTCATTGGCAAGATATTGTTCGGAACCCATGGCCTTCCAACTTGACTCTTGTGTCAACAAATGGTAGTTCAGGATGTGGA
CGTTGCCATAAGTCCTGTACTGGCCGTTGCTGGGGACCCACAGAAAATCATTGCCAGACTTTGACAAGGACGGTGTGTGCAGAACAATGT
GACGGCAGATGCTACGGACCTTACGTCAGTGACTGCTGCCATCGAGAATGTGCTGGAGGCTGCTCAGGACCTAAGGACACAGACTGCTTT
GCCTGCATGAATTTCAATGACAGTGGAGCATGTGTTACTCAGTGTCCCCAAACCTTTGTCTACAATCCAACCACCTTTCAACTGGAGCAC
AATTTCAATGCAAAGTACACATATGGAGCATTCTGTGTCAAGAAATGTCCACATAACTTTGTGGTAGATTCCAGTTCTTGTGTGCGTGCC
TGCCCTAGTTCCAAGATGGAAGTAGAAGAAAATGGGATTAAAATGTGTAAACCTTGCACTGACATTTGCCCAAAAGCTTGTGATGGCATT
GGCACAGGATCATTGATGTCAGCTCAGACTGTGGATTCCAGTAACATTGACAAATTCATAAACTGTACCAAGATCAATGGGAATTTGATC
TTTCTAGTCACTGGTATTCATGGGGACCCTTACAATGCAATTGAAGCCATAGACCCAGAGAAACTGAACGTCTTTCGGACAGTCAGAGAG
ATAACAGGTTTCCTGAACATACAGTCATGGCCACCAAACATGACTGACTTCAGTGTTTTTTCTAACCTGGTGACCATTGGTGGAAGAGTA
CTCTATAGTGGCCTGTCCTTGCTTATCCTCAAGCAACAGGGCATCACCTCTCTACAGTTCCAGTCCCTGAAGGAAATCAGCGCAGGAAAC
ATCTATATTACTGACAACAGCAACCTGTGTTATTATCATACCATTAACTGGACAACACTCTTCAGCACAATCAACCAGAGAATAGTAATC
CGGGACAACAGAAAAGCTGAAAATTGTACTGCTGAAGGAATGGTGTGCAACCATCTGTGTTCCAGTGATGGCTGTTGGGGACCTGGGCCA
GACCAATGTCTGTCGTGTCGCCGCTTCAGTAGAGGAAGGATCTGCATAGAGTCTTGTAACCTCTATGATGGTGAATTTCGGGAGTTTGAG
AATGGCTCCATCTGTGTGGAGTGTGACCCCCAGTGTGAGAAGATGGAAGATGGCCTCCTCACATGCCATGGACCGGGTCCTGACAACTGT
ACAAAGTGCTCTCATTTTAAAGATGGCCCAAACTGTGTGGAAAAATGTCCAGATGGCTTACAGGGGGCAAACAGTTTCATTTTCAAGTAT
GCTGATCCAGATCGGGAGTGCCACCCATGCCATCCAAACTGCACCCAAGGGTGTAACGGTCCCACTAGTCATGACTGCATTTACTACCCA
TGGACGGGCCATTCCACTTTACCACAACATGCTAGAACTCCCCTGATTGCAGCTGGAGTAATTGGTGGGCTCTTCATTCTGGTCATTGTG
GGTCTGACATTTGCTGTTTATGTTAGAAGGAAGAGCATCAAAAAGAAAAGAGCCTTGAGAAGATTCTTGGAAACAGAGTTGGTGGAACCA
TTAACTCCCAGTGGCACAGCACCCAATCAAGCTCAACTTCGTATTTTGAAAGAAACTGAGCTGAAGAGGGTAAAAGTCCTTGGCTCAGGT
GCTTTTGGAACGGTTTATAAAGGTATTTGGGTACCTGAAGGAGAAACTGTGAAGATTCCTGTGGCTATTAAGATTCTTAATGAGACAACT
GGTCCCAAGGCAAATGTGGAGTTCATGGATGAAGCTCTGATCATGGCAAGTATGGATCATCCACACCTAGTCCGGTTGCTGGGTGTGTGT
CTGAGCCCAACCATCCAGCTGGTTACTCAACTTATGCCCCATGGCTGCCTGTTGGAGTATGTCCACGAGCACAAGGATAACATTGGATCA
CAACTGCTGCTTAACTGGTGTGTCCAGATAGCTAAGGGAATGATGTACCTGGAAGAAAGACGACTCGTTCATCGGGATTTGGCAGCCCGT
AATGTCTTAGTGAAATCTCCAAACCATGTGAAAATCACAGATTTTGGGCTAGCCAGACTCTTGGAAGGAGATGAAAAAGAGTACAATGCT
GATGGAGGAAAGATGCCAATTAAATGGATGGCTCTGGAGTGTATACATTACAGGAAATTCACCCATCAGAGTGACGTTTGGAGCTATGGA
GTTACTATATGGGAACTGATGACCTTTGGAGGAAAACCCTATGATGGAATTCCAACGCGAGAAATCCCTGATTTATTAGAGAAAGGAGAA
CGTTTGCCTCAGCCTCCCATCTGCACTATTGACGTTTACATGGTCATGGTCAAATGTTGGATGATTGATGCTGACAGTAGACCTAAATTT
AAGGAACTGGCTGCTGAGTTTTCAAGGATGGCTCGAGACCCTCAAAGATACCTAGTTATTCAGGGTGATGATCGTATGAAGCTTCCCAGT
CCAAATGACAGCAAGTTCTTTCAGAATCTCTTGGATGAAGAGGATTTGGAAGATATGATGGATGCTGAGGAGTACTTGGTCCCTCAGGCT
TTCAACATCCCACCTCCCATCTATACTTCCAGAGCAAGAATTGACTCGAATAGGAACCAGTTTGTATACCGAGATGGAGGTTTTGCTGCT
GAACAAGGAGTGTCTGTGCCCTACAGAGCCCCAACTAGCACAATTCCAGAAGCTCCTGTGGCACAGGGTGCTACTGCTGAGATTTTTGAT
GACTCCTGCTGTAATGGCACCCTACGCAAGCCAGTGGCACCCCATGTCCAAGAGGACAGTAGCACCCAGAGGTACAGTGCTGACCCCACC
GTGTTTGCCCCAGAACGGAGCCCACGAGGAGAGCTGGATGAGGAAGGTTACATGACTCCTATGCGAGACAAACCCAAACAAGAATACCTG
AATCCAGTGGAGGAGAACCCTTTTGTTTCTCGGAGAAAAAATGGAGACCTTCAAGCATTGGATAATCCCGAATATCACAATGCATCCAAT
GGTCCACCCAAGGCCGAGGATGAGTATGTGAATGAGCCACTGTACCTCAACACCTTTGCCAACACCTTGGGAAAAGCTGAGTACCTGAAG
AACAACATACTGTCAATGCCAGAGAAGGCCAAGAAAGCGTTTGACAACCCTGACTACTGGAACCACAGCCTGCCACCTCGGAGCACCCTT
CAGCACCCAGACTACCTGCAGGAGTACAGCACAAAATATTTTTATAAACAGAATGGGCGGATCCGGCCTATTGTGGCAGAGAATCCTGAA
TACCTCTCTGAGTTCTCCCTGAAGCCAGGCACTGTGCTGCCGCCTCCACCTTACAGACACCGGAATACTGTGGTGTAAGCTCAGTTGTGG
TTTTTTAGGTGGAGAGACACACCTGCTCCAATTTCCCCACCCCCCTCTCTTTCTCTGGTGGTCTTCCTTCTACCCCAAGGCCAGTAGTTT
TGACACTTCCCAGTGGAAGATACAGAGATGCAATGATAGTTATGTGCTTACCTAACTTGAACATTAGAGGGAAAGACTGAAAGAGAAAGA
TAGGAGGAACCACAATGTTTCTTCATTTCTCTGCATGGGTTGGTCAGGAGAATGAAACAGCTAGAGAAGGACCAGAAAATGTAAGGCAAT
GCTGCCTACTATCAAACTAGCTGTCACTTTTTTTCTTTTTCTTTTTCTTTCTTTGTTTCTTTCTTCCTCTTCTTTTTTTTTTTTTTTTTT
AAAGCAGATGGTTGAAACACCCATGCTATCTGTTCCTATCTGCAGGAACTGATGTGTGCATATTTAGCATCCCTGGAAATCATAATAAAG
TTTCCATTAGAACAAAAGAATAACATTTTCTATAACATATGATGGTGTCTGAAATTGAGAATCCAGTTTCTTTCCCCAGCAGTTTCTGTC
CTAGCAAGTAAGAATGGCCAACTCAACTTTCATAATTTAAAAATCTCCATTAAAGTTATAACTAGTAATTATGTTTTCAACACTTTTTGG
TTTTTTTCATTTTGTTTTGCTCTGACCGATTCCTTTATATTTGCTCCCCTATTTTTGGCTTTAATTTCTAATTGCAAAGATGTTTACATC
AAAGCTTCTTCACAGAATTTAAGCAAGAAATATTTTAATATAGTGAAATGGCCACTACTTTAAGTATACAATCTTTAAAATAAGAAAGGG
AGGCTAATATTTTTCATGCTATCAAATTATCTTCACCCTCATCCTTTACATTTTTCAACATTTTTTTTTCTCCATAAATGACACTACTTG
ATAGGCCGTTGGTTGTCTGAAGAGTAGAAGGGAAACTAAGAGACAGTTCTCTGTGGTTCAGGAAAACTACTGATACTTTCAGGGGTGGCC
CAATGAGGGAATCCATTGAACTGGAAGAAACACACTGGATTGGGTATGTCTACCTGGCAGATACTCAGAAATGTAGTTTGCACTTAAGCT
GTAATTTTATTTGTTCTTTTTCTGAACTCCATTTTGGATTTTGAATCAAGCAATATGGAAGCAACCAGCAAATTAACTAATTTAAGTACA
TTTTTAAAAAAAGAGCTAAGATAAAGACTGTGGAAATGCCAAACCAAGCAAATTAGGAACCTTGCAACGGTATCCAGGGACTATGATGAG
AGGCCAGCACATTATCTTCATATGTCACCTTTGCTACGCAAGGAAATTTGTTCAGTTCGTATACTTCGTAAGAAGGAATGCGAGTAAGGA
TTGGCTTGAATTCCATGGAATTTCTAGTATGAGACTATTTATATGAAGTAGAAGGTAACTCTTTGCACATAAATTGGTATAATAAAAAGA
AAAACACAAACATTCAAAGCTTAGGGATAGGTCCTTGGGTCAAAAGTTGTAAATAAATGTGAAACATCTTCTCATGCAATTATTTTATTA
TCCAACACACTAATCTTTTGATACTTTATATAATTCCCTTTCTTCATATACTGCATCCAGTACTAGAACCATCATTATTATGTATCATTT
TGAAAGAATACCTGATGAGATGAAGGATGAGAACAAATGACAGAGATGAGTCTCCAAGTAAAGGGGGCCTCACATCAATAATTAGGAAAC
TTAGATATAAGTCGCCCTTTTCTGAAAATTCTACCCCAAGTCATTTAGATTTTTAAAAAATATTTCTAATGTTAAAATATTGGGACCAAA
TTAGAATCAATAGTATAAGATTAATTAATTAGAGTAAAAATATCTATTAAGGCAGAGAAAGTTTAGAGAAAAAAATCCAAAGAAATTTGT
GTTTCTTCCTATTCTGAACAAGTAAATCCATCCATCCATCCATCCAAACCTCCTTTATCTAACTGTGTCTACTAAAAGCACCATGTTTTG
TGGGGAACACTCAGATAAATGGAATATCATCCTCAACTTCAAAATTCTATGATCTAGGAGATTTAATTAAAATGACATTTTAATTTTTCT
ATGCGTTCCAACAATCAGATTGCATAGTCTCTTTTGTGAATAGCTGTCATATAATCAGTTGTACTGTAAGATATCTCCTTTAAACTCATT
TGGGATATAAGTTAAACATCCTTCAAATTGTTGATGTTGACAAACAGGATAATTTCAATAATATTATTCAAACATAAACTGGTCTAGGAG
AATATTGCATCACTGACTAATTAGCCTATCTAGAGTCTAACTTCACCATTAAACCAAAAGCAGATGGTGGTCCTTGGCCAAGAATATTGG
AGACATTGGAGTTGGTTTTTTTCTAAGCTATAAGAAGTGAGGCGAGCTGAAAAAGTATGGTAGAGCAGGAGAAGGGTTTGTGAGATTCCT
TCTAGTGAAGTTCACCCTCAAACTTTTCAGGGGTAAAGACACAGAGTGATTCAGGGGCCACAATCTAATAGCTCAGGGCTCTCCTATCCA
TTCAGAGAAGTCTCTAGGAAAAGGGATCTCATATCAGTACTTATGAAAAATTGAATATAAGCCTCCCTTTCTAAATAAATCTGCATCGAG
TCATCACAGCCCTCTTTTTGGATACTATACCTTGATTTTTTTTTTCTGATTTACAATATGCATATGGTTTCTACTGGGCTATAGAAAGCA
GAATCACTCATTTTGGAGAAGGAAAAAATGAATAGTTAAAACAAACTTTTAACTGTTAAGGTAACAGAAATGTATTTAGTGAATGTCTCT
TTCCTCCTAAGAACACAAGACTTCTACATGTTGGGTAATACCTAGAGATGCATGTAGGAATAATCCAAAATGACCCAAATGCTTTATAAT
AGCACCACTTTATAATTCTTTTGAATGATTTCTGTAGTATATAATTGACTTCAGTTGTTTGAGTGTTTTTTGTTTTATTTTTGTCCCCCC
TGGGAAAACATATTTCAGCATGTATAAGAGGGAGAAAAAAAGTTTCATTCCTTCCAGAGAATAACTTATTTAGTCCAGTAGGGTAGAATT
TTAAAATGTCAGTTAAAGTCTTCAAAGTGCTTGGGGGGATATCAGATTCCAGAGGCCAATTGTAGCAATTGAAATTTGCAGAATCAATTA
TGTAAATCTGAGACAAATTAGTATTAAAATTACACGGAGTATATTTTTTAAATCACCCAACTTTGTAGATTATACCTATTTTGGGCAGGT
ATGGAAAAATTTTGCAGTTAAATGATTGCCTAAAGAAAGTGGTAAACAGGTGAGGAAAGATGGCCTCTGATCTAGGATAGATCCAGAACC
ACAAAGCATCTGCACCACAAAAGGTGTTAGACTACCAAGCAGCTCCTGGTTTTCTGCATAGTATTAGTAGCACAGCTTAGGATGAGAATC
CTTTCTCCAGTAACATTCTTAAAATAGCATGAAAAACAACGCAAAACTCAAATTTCTATTAAAACACACAAACTAAAATCAAGTGATTCT
TTTTTGTAGATTAGGGAGAAGGACTGAATATCTAATTTAAGAGAAGGAATAGTGTTTAAGTGTTATAGTGTGTGAGCTAATACCTTCTAA
AGGAAAGACATGGCATGAAGATTGTGCATACTTACAATGCTAAGGAAAAATCAAGAAAAGGACTGTGTGAGGCTCTGCTACTAGATGAAG
TTGGAAGGACTATTAATGTGCTTCTTGAAGTATCAAAAATGAAAAGAAAATTAAAATTGTTTAAGCCTGACAGGGAAGGATGTAAATACA
AGTTTTTCTAGAGCTCTCTAACCTTTATTTCAAAACTGGAATTATTCATCCATCTGTAATTGTTGATAATTTAACTAGTATATGTAGTTC
ATAAGGTAATAGAAAAGGTGATCATGAAAGCATGTATATAACTGGACAGAACCACGATAATGCTATAAGATGTAGATTTAGTTAGGTTAT
CAGATGTTAAATGATTTTAATATTATTAAATAAATCAAACTAGAAAACTAACCACAAGTATAATGTAACAAAGTTAAATGCAGGATATAA
AAATGTAGGATGGATTTTGCATAGTAAAAAGATAAGTTTGCCATTTAAAATTGTTGTTTGTTGGGTTTAGCTGAAAGTAGGCATATATGG
TTCCACTTGGGAAAACTTGCTTTAAAGCATTACAATGAACAATTTTTTCTCATTCTCTTATTCCTTTATCACTTTTTAAATGTAAAGAAA
ATTGTATTTATTTATTTTTTTAAATAAACACCACCTTGCAGAATTTAATAGGCAAACATGTTACATATGACTAAGTAAGGGTCTTCAAGA
TGAAGTAAAGAAAATGTAAATGTTCTATTACCTTATGCAGAGACAAAAAAAAAAAGGAGTGGTGTCATTTAGCTAGCAAACAAACAAAAT
ACAGTTAATTGGTGATATGTCCTTTCTTTTCTCACTATGCCCTCTTGCCTCCAAAAATGACAACAAAGAATCACAATTTTTCTGATAAAT
AAATGCTAAACCAAGCGTTTCAAACTATTGCATTGCCATTCTTTTGGACTTTAGTTATTAGAATGATGATTGTTATAGGGCAAATGAGAA
ATCCATGTGCATCAGCTTCTAGTTGTTAAAAAAACCAGATAAATTAACTTCTACTGTATACTGTGGGCAGAGGATCCTAGAGCTGATCCT
ACAACATCAGCTTCTAGTTGTTAAAAAAAAAAAAAGAAACAGATAAATTAACTTCTACTGTATATACTGTGGGCAGAGGATCTTACTGTG
CCTCTGTTTGTGTACATGGACTTCGGTGTGTATCAGTTTGAAGGACAGCCTTGCCCCATGTAAACATATAAATGCAGATTGGTATCGCCT
GGTTGCTATTTGCTTAAGAACAAATATTATACAGATGAGATCAGGCATAATTTTAAAAGATCATTATCAGTGGAGACCTCATTATTACTG
ATATTACAATGGGGCCAGTTTTTATACTTCTGGGTAGAATTAATAAAATTTTTCTGATCCCAGAGATCTGAGTTCTCTCTGCAGTTGGAA
ACAAGAAGCTGTTGTGGGCATTGTGTCGGGCCAGGGGCCCTTGTGTTTGTGTGGGCAAATATCTTTTAGCAGTGTGAGCTGCTTTTTTCT
TTTCATTAAAAGTCTCTCTAAAATAATAGAAATTTCAGATACTCGGTTCAAGTCTCACTGATTTTGTAGAGGTCCAAAAATGTAGGATCT
GTCACTTTTGCAGGCCCCTGCCTCACCTAATTCCTGGCCAGGTGACATTTTGGGCAGAAGTAAATGCTTCTATAGTCACAAGCTAAAATG
ACTCTAAGCCCCAATTTCACGGGGGGTATTCACATGCTTCCTCTGGAAAATACTCTTTGACAGTCAGCTTTGCAAGTAAGTGATTACCTT
GTTAGGAATCAAAGAAAAATGTATTTCTCTCTGACCTTTAGAGGAAAATAGAATCCTTCCCTTTTTTGCCCATTGACACAACTGGCACTG
CTCTCTTCCCTTTCTACCACCCTGGTTCAAAGTAGTCCCCCGATGCTGTCCTGTTCCTTTCTTAAGCCATAGTGGATCTCTGAGATCCTA
CACCCCACTTTGTGAAACACTGACTTCATCTTTGCCCTCGAATGCCTGATTTTTTCATAAGAGATTCTAGCAATTTGGACACTGTTTAAG
TGAACTATCAAACTACCGCATAGAGAATATTTAAGCTATTAAAATTATGGTTTCCCATGAAGATCAATTCTCTGTGTCCTTCCCTATAGG
AATTTGAGACGAGTTAGCCCTGTGATGAATCTTGAAACTCACATATGTCCACATACACTTGGTAGAACTTCGATTTAATCTTTACATAAA
AGCTGTACATATAACCAAGAAGTTATTTTTGCCAGTAAATTAACTTATTTGCTTTATTCATCTTATTTGGTTCCTAATCGTAAATATTTT
GTAGCTGCTGTAAATTTTTTTCTCCCAAATGAGGAGTCTTATTATCATAAAGGTAAAGGCTATTCAGCTTTGATAACCACCTGCAATTCT
TTTTTGGATCATTCATCCATCTAACAAATACATAATGAGGACAGTTCATGTTAATGAAAATCCATGTTGTTTAATAGAATGCCATCCTTT
ACCTACTTTTGCTCTTTATGGACGTTTTTCTTTTCATGCTCTAGTGAGCTTTCCCTATATCATGAGAAGTGGTTATATTTGTGCAAATAT
ACAAATATAGGAAAACAAAGATTCATACCTGTAGGCAATAGTCTAACTTGTCCAAACCACTTTGCCTTTACTGCTATTTTTATCCCCAAT
GCGTAGATATTTCCCCCAGGCCTATAGCCTTTGTGAAGGAAAGCAAATCATACCTCCTGTATATTGACACGAATCTGGTTTTCAAATGTC
ATTTCCAGATTTTTTAGTTAATTGGGGGTTGTCCTTTTCCCTTAATGTGAGAGTCATTTTCCTGTATATTTCTGGATCTCTCAGGGGCTG
GGAGGGGGGAGTGAGGGGACTACAACCATAGCACTCCAAGAACCCTTTTGGGATTACTCCAGTAATCAACTACGAAAGTTATTTTCTAAA
TGTAGATATGTAAGGTGTTCTTTTAAAGTAAGGTACTTTGAAATATGTAGCATAAACTGGTACTGCTGTTAAATGGGTCGATTATTAAAC
GGAGCAGCTGTGTGAGGGCAGCTAACTTTGAATGCCTGTCTCCCTGGCTGGTGTGTCTCCTTCTCATGTTGAGAGCACCAGGGATTGCGT
GGCTGCATGCTGAAACCGCATTTTCCCATGGTGTATGACTAGTTCATCTCTTTCTTGAGCACCATTACAAGAAGATCAAATGAAAATGAG
ATCAATGTGGAAGACAATTCATAGCACAAAAAAAGTCATCTTAAATCTACTCTCAAACATTCATCTTATACATGCATCAAAGTAATTTAC
TGACATCAGTTTGGGTGAGAGAGGGAGTCACTTTACTGAAAAGGCAGAGGCTTAAGGTGTATACATTTGTACTCACTTCCTTATTTTCTT
AACTTGTAAGCAGAAAACAAGCCCTCTCTCTTGTGAAGTATCTTCAAAGGATTGGGGTGCAAAAATACCTTGCTGGTAAGCCATCAATGT
TTTATTTAAATCCCTGCATTCAAAGTTAGCTGCCTTTTTGAAATAAACAAACAAAAAATACTACTGTATGTTTGAAAATGTGAATAGTAT
TTTTATAGCTTGTTAAAGACATGGCTAGTTGCATTTGTAAATAAGTATAATGTTGCTTTGATTTTCTTTTGTGGACATCTTTATTTGGAA
CATAATTGTCTTTAGGGTTGATTTGTATATAAGTAATTGGCCTGTGATTGTTTCTTTTTTGGTTGGAAGTTATCATTTTGACATTACTTG
TGATTCTGTGTTCAGCACTATTGTGATGTGTTCAACCTCTGCACTCGCTTACACAATAGGATATGCCAATTGTGTGTGGTGTAATGTTAT
TTTGATTTTTTTCCATGTTATTGATGAAGGATCATGCACCTAACACATACTAACTTTTTTAATGTTAGGCATATTTTTAGTATACTTTCT
CTTATTCTTTCTTCTCCTCCAACCTTTTACCCATCCTCCTTCCTTTCCCTCATTCCTGTTGTTATTTGAGAATGAGGGAGAAACAGTATT
TTACATTTATGTAATTAGGCTTTTCCGTTAGTTCTCAAGGATCCTCTTTTGGCTCTTGGGAAAGAATTGTACCTGTACAAGGCAATTATA
GAATGCGAACTGCTTTGCCTCATTCCATACTGATCATCCCAGCTGAACAATTTGAAAACTGTTCTGCCTTTTTGTTACATGAATCTGTCA

>8762_8762_3_B4GALT5-ERBB4_B4GALT5_chr20_48330113_ENST00000371711_ERBB4_chr2_212812341_ENST00000436443_length(amino acids)=1196AA_BP=
MPLENLRIIRGTKLYEDRYALAIFLNYRKDGNFGLQELGLKNLTEILNGGVYVDQNKFLCYADTIHWQDIVRNPWPSNLTLVSTNGSSGC
GRCHKSCTGRCWGPTENHCQTLTRTVCAEQCDGRCYGPYVSDCCHRECAGGCSGPKDTDCFACMNFNDSGACVTQCPQTFVYNPTTFQLE
HNFNAKYTYGAFCVKKCPHNFVVDSSSCVRACPSSKMEVEENGIKMCKPCTDICPKACDGIGTGSLMSAQTVDSSNIDKFINCTKINGNL
IFLVTGIHGDPYNAIEAIDPEKLNVFRTVREITGFLNIQSWPPNMTDFSVFSNLVTIGGRVLYSGLSLLILKQQGITSLQFQSLKEISAG
NIYITDNSNLCYYHTINWTTLFSTINQRIVIRDNRKAENCTAEGMVCNHLCSSDGCWGPGPDQCLSCRRFSRGRICIESCNLYDGEFREF
ENGSICVECDPQCEKMEDGLLTCHGPGPDNCTKCSHFKDGPNCVEKCPDGLQGANSFIFKYADPDRECHPCHPNCTQGCNGPTSHDCIYY
PWTGHSTLPQHARTPLIAAGVIGGLFILVIVGLTFAVYVRRKSIKKKRALRRFLETELVEPLTPSGTAPNQAQLRILKETELKRVKVLGS
GAFGTVYKGIWVPEGETVKIPVAIKILNETTGPKANVEFMDEALIMASMDHPHLVRLLGVCLSPTIQLVTQLMPHGCLLEYVHEHKDNIG
SQLLLNWCVQIAKGMMYLEERRLVHRDLAARNVLVKSPNHVKITDFGLARLLEGDEKEYNADGGKMPIKWMALECIHYRKFTHQSDVWSY
GVTIWELMTFGGKPYDGIPTREIPDLLEKGERLPQPPICTIDVYMVMVKCWMIDADSRPKFKELAAEFSRMARDPQRYLVIQGDDRMKLP
SPNDSKFFQNLLDEEDLEDMMDAEEYLVPQAFNIPPPIYTSRARIDSNRNQFVYRDGGFAAEQGVSVPYRAPTSTIPEAPVAQGATAEIF
DDSCCNGTLRKPVAPHVQEDSSTQRYSADPTVFAPERSPRGELDEEGYMTPMRDKPKQEYLNPVEENPFVSRRKNGDLQALDNPEYHNAS
NGPPKAEDEYVNEPLYLNTFANTLGKAEYLKNNILSMPEKAKKAFDNPDYWNHSLPPRSTLQHPDYLQEYSTKYFYKQNGRIRPIVAENP

--------------------------------------------------------------
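The FASTA-style header lines above pack several fields into one underscore-delimited string: the fusion name, each partner gene with its chromosome, breakpoint coordinate and Ensembl transcript, plus the sequence length and a `BP` value. The following is a minimal sketch of how such a header could be split into named fields. The field names (`record_id`, `hbp`, `tbp`, and so on) and the helper `parse_header` are illustrative assumptions, not part of FusionGDB2. The leading numeric prefix is kept as an opaque identifier because its meaning is not stated on this page, and the pattern assumes gene symbols contain no underscores.

```python
import re

# Assumed layout, inferred only from the header lines shown on this page:
# >ID_FUSION_HGENE_HCHR_HPOS_HENST_TGENE_TCHR_TPOS_TENST_length(KIND)=N<unit>_BP=<value>
HEADER_RE = re.compile(
    r"^>(?P<record_id>\d+_\d+_\d+)_"          # opaque numeric prefix (meaning not documented here)
    r"(?P<fusion>[^_]+)_"                      # e.g. B4GALT5-ERBB4
    r"(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)_"
    r"length\((?P<kind>[^)]+)\)=(?P<length>\d+)(?:nt|AA)"
    r"_BP=(?P<bp>\d*)(?:nt)?$"                 # BP may be empty, as in the amino-acid header
)


def parse_header(line):
    """Split one header line into a dict of named fields (hypothetical helper)."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognised header: " + line)
    fields = match.groupdict()
    # Convert the numeric fields; an empty BP value becomes None.
    for key in ("hbp", "tbp", "length"):
        fields[key] = int(fields[key])
    fields["bp"] = int(fields["bp"]) if fields["bp"] else None
    return fields


if __name__ == "__main__":
    header = (
        ">8762_8762_3_B4GALT5-ERBB4_B4GALT5_chr20_48330113_ENST00000371711_"
        "ERBB4_chr2_212812341_ENST00000436443_length(transcript)=11842nt_BP=303nt"
    )
    for name, value in parse_header(header).items():
        print(name, value, sep="\t")
```

A regular expression with named groups is used here rather than a plain `split("_")` because the numeric prefix itself contains underscores, so positional splitting would be fragile.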


Fusion Gene PPI Analysis for B4GALT5-ERBB4


Go to ChiPPI (Chimeric Protein-Protein Interactions) to see the chimeric PPIs on the ChiPPI page.


Protein-protein interactors of each fusion partner protein in wild type (BIOGRID-3.4.160)
Hgene  Hgene's interactors  Tgene  Tgene's interactors


- Retained PPIs in in-frame fusion.
Partner  Gene  Hbp  Tbp  ENST  Strand  BPexon  TotalExon  Protein feature loci  *BPloci  TotalLen  Still interaction with


- Lost PPIs in in-frame fusion.
Partner  Gene  Hbp  Tbp  ENST  Strand  BPexon  TotalExon  Protein feature loci  *BPloci  TotalLen  Interaction lost with


- Retained PPIs, but lost function due to frame-shift fusion.
Partner  Gene  Hbp  Tbp  ENST  Strand  BPexon  TotalExon  Protein feature loci  *BPloci  TotalLen  Interaction lost with



Related Drugs for B4GALT5-ERBB4


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
Partner  Gene   UniProtAcc  DrugBank ID  Drug name     Drug activity  Drug type       Drug status
Tgene    ERBB4  Q15303      DB08916      Afatinib      Inhibitor      Small molecule  Approved
Tgene    ERBB4  Q15303      DB12010      Fostamatinib  Inhibitor      Small molecule  Approved|Investigational
Tgene    ERBB4  Q15303      DB12267      Brigatinib    Inhibitor      Small molecule  Approved|Investigational
Tgene    ERBB4  Q15303      DB15035      Zanubrutinib  Inhibitor      Small molecule  Approved|Investigational


Related Diseases for B4GALT5-ERBB4


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner  Gene     Disease ID  Disease name                                         # pubmeds  Source
Hgene    B4GALT5  C0023893    Liver Cirrhosis, Experimental                        1          CTD_human
Hgene    B4GALT5  C0087031    Juvenile-Onset Still Disease                         1          CTD_human
Hgene    B4GALT5  C3495559    Juvenile arthritis                                   1          CTD_human
Hgene    B4GALT5  C3714758    Juvenile psoriatic arthritis                         1          CTD_human
Hgene    B4GALT5  C4552091    Polyarthritis, Juvenile, Rheumatoid Factor Negative  1          CTD_human
Hgene    B4GALT5  C4704862    Polyarthritis, Juvenile, Rheumatoid Factor Positive  1          CTD_human
Tgene    ERBB4    C0005586    Bipolar Disorder                                     5          PSYGENET
Tgene    ERBB4    C0036341    Schizophrenia                                        4          PSYGENET
Tgene    ERBB4    C0004238    Atrial Fibrillation                                  2          CTD_human
Tgene    ERBB4    C0235480    Paroxysmal atrial fibrillation                       2          CTD_human
Tgene    ERBB4    C2585653    Persistent atrial fibrillation                       2          CTD_human
Tgene    ERBB4    C3468561    familial atrial fibrillation                         2          CTD_human
Tgene    ERBB4    C0002736    Amyotrophic Lateral Sclerosis                        1          ORPHANET
Tgene    ERBB4    C0007114    Malignant neoplasm of skin                           1          CTD_human
Tgene    ERBB4    C0016978    gallbladder neoplasm                                 1          CTD_human
Tgene    ERBB4    C0025202    melanoma                                             1          CGI;CTD_human
Tgene    ERBB4    C0037286    Skin Neoplasms                                       1          CTD_human
Tgene    ERBB4    C0153452    Malignant neoplasm of gallbladder                    1          CTD_human
Tgene    ERBB4    C3715155    AMYOTROPHIC LATERAL SCLEROSIS 19                     1          CTD_human;GENOMICS_ENGLAND;UNIPROT