FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:BAI1-EXT1 (FusionGDB2 ID:8898)

Fusion Gene Summary for BAI1-EXT1

check button Fusion gene summary
Fusion gene informationFusion gene name: BAI1-EXT1
Fusion gene ID: 8898
HgeneTgene
Gene symbol

BAI1

EXT1

Gene ID

575

2131

Gene nameadhesion G protein-coupled receptor B1exostosin glycosyltransferase 1
SynonymsBAI1|GDAIFEXT|LGCR|LGS|TRPS2|TTV
Cytomap

8q24.3

8q24.11

Type of geneprotein-codingprotein-coding
Descriptionadhesion G protein-coupled receptor B1brain-specific angiogenesis inhibitor 1exostosin-1Glucuronosyl-N-acetylglucosaminyl-proteoglycan 4-alpha-N- acetylglucosaminyltransferaseLanger-Giedion syndrome chromosome regionN-acetylglucosaminyl-proteoglycan 4-beta-glucuronosyltransferaseexostoses (multiple) 1glucuronosyl-N-acetylgluc
Modification date2020031320200313
UniProtAcc.

Q16394

Ensembl transtripts involved in fusion geneENST00000323289, ENST00000517894, 
ENST00000378204, 
Fusion gene scores* DoF score1 X 1 X 1=115 X 10 X 7=1050
# samples 117
** MAII scorelog2(1/1*10)=3.32192809488736log2(17/1050*10)=-2.62678267641578
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: BAI1 [Title/Abstract] AND EXT1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointBAI1(143592434)-EXT1(118834836), # samples:3
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneBAI1

GO:0010596

negative regulation of endothelial cell migration

15782143|19176395

HgeneBAI1

GO:0016525

negative regulation of angiogenesis

15782143|19176395|22330140

TgeneEXT1

GO:0006024

glycosaminoglycan biosynthetic process

12907669

TgeneEXT1

GO:0015012

heparan sulfate proteoglycan biosynthetic process

9620772|10639137

TgeneEXT1

GO:0033692

cellular polysaccharide biosynthetic process

12907669


check buttonFusion gene breakpoints across BAI1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across EXT1 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4LGGTCGA-HW-A5KK-01ABAI1chr8

143592434

-EXT1chr8

118834836

-
ChimerDB4LGGTCGA-HW-A5KKBAI1chr8

143592434

+EXT1chr8

118834836

-


Top

Fusion Gene ORF analysis for BAI1-EXT1

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
In-frameENST00000323289ENST00000378204BAI1chr8

143592434

+EXT1chr8

118834836

-
In-frameENST00000517894ENST00000378204BAI1chr8

143592434

+EXT1chr8

118834836

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000517894BAI1chr8143592434+ENST00000378204EXT1chr8118834836-9890371172646671313
ENST00000323289BAI1chr8143592434+ENST00000378204EXT1chr8118834836-917930001539561313

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000517894ENST00000378204BAI1chr8143592434+EXT1chr8118834836-0.0018365680.9981634
ENST00000323289ENST00000378204BAI1chr8143592434+EXT1chr8118834836-0.0006443010.99935573

Top

Fusion Genomic Features for BAI1-EXT1


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for BAI1-EXT1


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr8:143592434/chr8:118834836)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
.EXT1

Q16394

FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.FUNCTION: Glycosyltransferase required for the biosynthesis of heparan-sulfate. The EXT1/EXT2 complex possesses substantially higher glycosyltransferase activity than EXT1 or EXT2 alone. Appears to be a tumor suppressor. Required for the exosomal release of SDCBP, CD63 and syndecan (PubMed:22660413). {ECO:0000269|PubMed:11518722, ECO:0000269|PubMed:22660413}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+1730261_3159391585.0DomainTSP type-1 1
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+1730354_4079391585.0DomainTSP type-1 2
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+1730409_4629391585.0DomainTSP type-1 3
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+1730467_5209391585.0DomainTSP type-1 4
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+1730522_5759391585.0DomainTSP type-1 5
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+1730881_9389391585.0DomainGPS
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+1831261_3159391585.0DomainTSP type-1 1
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+1831354_4079391585.0DomainTSP type-1 2
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+1831409_4629391585.0DomainTSP type-1 3
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+1831467_5209391585.0DomainTSP type-1 4
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+1831522_5759391585.0DomainTSP type-1 5
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+1831881_9389391585.0DomainGPS
TgeneEXT1chr8:143592434chr8:118834836ENST00000378204311544_549428747.0RegionSubstrate binding
TgeneEXT1chr8:143592434chr8:118834836ENST00000378204311565_567428747.0RegionSubstrate binding
TgeneEXT1chr8:143592434chr8:118834836ENST00000378204311650_654428747.0RegionSubstrate binding
TgeneEXT1chr8:143592434chr8:118834836ENST00000378204311688_701428747.0RegionSubstrate binding

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+17301411_14229391585.0Compositional biasNote=Poly-Pro
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+17301425_14309391585.0Compositional biasNote=Poly-Pro
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+18311411_14229391585.0Compositional biasNote=Poly-Pro
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+18311425_14309391585.0Compositional biasNote=Poly-Pro
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+1730927_9439391585.0RegionN-terminal stalk following vasculostatin-120 cleavage which is not required for signaling activity
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+1831927_9439391585.0RegionN-terminal stalk following vasculostatin-120 cleavage which is not required for signaling activity
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+17301002_10089391585.0Topological domainExtracellular
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+17301030_10529391585.0Topological domainCytoplasmic
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+17301074_10939391585.0Topological domainExtracellular
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+17301115_11369391585.0Topological domainCytoplasmic
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+17301158_11669391585.0Topological domainExtracellular
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+17301188_15849391585.0Topological domainCytoplasmic
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+173031_9489391585.0Topological domainExtracellular
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+1730970_9809391585.0Topological domainCytoplasmic
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+18311002_10089391585.0Topological domainExtracellular
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+18311030_10529391585.0Topological domainCytoplasmic
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+18311074_10939391585.0Topological domainExtracellular
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+18311115_11369391585.0Topological domainCytoplasmic
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+18311158_11669391585.0Topological domainExtracellular
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+18311188_15849391585.0Topological domainCytoplasmic
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+183131_9489391585.0Topological domainExtracellular
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+1831970_9809391585.0Topological domainCytoplasmic
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+17301009_10299391585.0TransmembraneHelical%3B Name%3D3
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+17301053_10739391585.0TransmembraneHelical%3B Name%3D4
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+17301094_11149391585.0TransmembraneHelical%3B Name%3D5
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+17301137_11579391585.0TransmembraneHelical%3B Name%3D6
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+17301167_11879391585.0TransmembraneHelical%3B Name%3D7
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+1730949_9699391585.0TransmembraneHelical%3B Name%3D1
HgeneBAI1chr8:143592434chr8:118834836ENST00000323289+1730981_10019391585.0TransmembraneHelical%3B Name%3D2
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+18311009_10299391585.0TransmembraneHelical%3B Name%3D3
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+18311053_10739391585.0TransmembraneHelical%3B Name%3D4
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+18311094_11149391585.0TransmembraneHelical%3B Name%3D5
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+18311137_11579391585.0TransmembraneHelical%3B Name%3D6
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+18311167_11879391585.0TransmembraneHelical%3B Name%3D7
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+1831949_9699391585.0TransmembraneHelical%3B Name%3D1
HgeneBAI1chr8:143592434chr8:118834836ENST00000517894+1831981_10019391585.0TransmembraneHelical%3B Name%3D2
TgeneEXT1chr8:143592434chr8:118834836ENST000003782043111_7428747.0Topological domainCytoplasmic
TgeneEXT1chr8:143592434chr8:118834836ENST0000037820431129_746428747.0Topological domainLumenal
TgeneEXT1chr8:143592434chr8:118834836ENST000003782043118_28428747.0TransmembraneHelical%3B Signal-anchor for type II membrane protein


Top

Fusion Gene Sequence for BAI1-EXT1


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>8898_8898_1_BAI1-EXT1_BAI1_chr8_143592434_ENST00000323289_EXT1_chr8_118834836_ENST00000378204_length(transcript)=9179nt_BP=3000nt
GGACTTTAGAAGCCGTTGCTGCCCTCTCTGTCACCTGAAGCGGGGCCCTCTCCCATCCCACCCTTGCCCCGCCTCCCTGCCCCCACCGGG
CCGGCCCTGCCCGCCGCCGGACCCTGGCATGTCAAGACCTGGTCCGCGCCTGCCTGCCCAGCCCGCGGAACCCCGGCGGCCCCGCGAGCT
AGGATGAGGGGCCAGGCCGCCGCCCCGGGCCCCGTCTGGATCCTCGCCCCGCTGCTACTGCTGCTGCTGCTGCTGGGACGCCGCGCGCGG
GCGGCCGCCGGAGCAGACGCGGGGCCCGGGCCCGAGCCGTGCGCCACGCTGGTGCAGGGAAAGTTCTTCGGCTACTTCTCCGCGGCCGCC
GTGTTCCCGGCCAACGCCTCGCGCTGCTCCTGGACGCTACGCAACCCGGACCCGCGGCGCTACACTCTCTACATGAAGGTGGCCAAGGCG
CCCGTGCCCTGCAGCGGCCCCGGCCGCGTGCGCACCTACCAGTTCGACTCCTTCCTCGAGTCCACGCGCACCTACCTGGGCGTGGAGAGC
TTCGACGAGGTGCTGCGGCTCTGCGACCCCTCCGCACCCCTGGCCTTCCTGCAGGCCAGCAAGCAGTTCCTGCAGATGCGGCGCCAGCAG
CCGCCCCAGCACGACGGGCTCCGGCCCCGGGCCGGGCCGCCGGGCCCCACCGACGACTTCTCCGTGGAGTACCTGGTGGTGGGGAACCGC
AACCCCAGCCGTGCCGCCTGCCAGATGCTGTGCCGCTGGCTGGACGCGTGTCTGGCCGGTAGTCGCAGCTCGCACCCCTGCGGGATCATG
CAGACCCCCTGCGCCTGCCTGGGCGGCGAGGCGGGCGGCCCTGCCGCGGGACCCCTGGCCCCCCGCGGGGATGTCTGCTTGAGAGATGCG
GTGGCTGGTGGCCCTGAAAACTGCCTCACCAGCCTGACCCAGGACCGGGGCGGGCACGGCGCCACAGGCGGCTGGAAGCTGTGGTCCCTG
TGGGGCGAATGCACGCGGGACTGCGGGGGAGGCCTCCAGACGCGGACGCGCACCTGCCTGCCCGCGCCGGGCGTGGAGGGCGGCGGCTGC
GAGGGGGTGCTGGAGGAGGGTCGCCAGTGCAACCGCGAGGCCTGCGGCCCCGCTGGGCGCACCAGCTCCCGGAGCCAGTCCCTGCGGTCC
ACAGATGCCCGGCGGCGCGAGGAGCTGGGGGACGAGCTGCAGCAGTTTGGGTTCCCAGCCCCCCAGACCGGTGACCCAGCAGCCGAGGAG
TGGTCCCCGTGGAGCGTGTGCTCCAGCACCTGCGGCGAGGGCTGGCAGACCCGCACGCGCTTCTGCGTGTCCTCCTCCTACAGCACGCAG
TGCAGCGGACCCCTGCGCGAGCAGCGGCTGTGCAACAACTCTGCCGTGTGCCCAGTGCATGGTGCCTGGGATGAGTGGTCGCCCTGGAGC
CTCTGCTCCAGCACCTGTGGCCGTGGCTTTCGGGATCGCACGCGCACCTGCAGGCCCCCCCAGTTTGGGGGCAACCCCTGTGAGGGCCCT
GAGAAGCAAACCAAGTTCTGCAACATTGCCCTGTGCCCTGGCCGGGCAGTGGATGGAAACTGGAATGAGTGGTCGAGCTGGAGCGCCTGC
TCCGCCAGCTGCTCCCAGGGCCGACAGCAGCGCACGCGTGAATGCAACGGGCCTTCCTACGGGGGTGCGGAGTGCCAGGGCCACTGGGTG
GAGACCCGAGACTGCTTCCTGCAGCAGTGCCCAGTGGATGGCAAGTGGCAGGCCTGGGCGTCATGGGGCAGTTGCAGCGTCACGTGTGGG
GCTGGCAGCCAGCGACGGGAGCGTGTCTGCTCTGGGCCCTTCTTCGGGGGAGCAGCCTGCCAGGGCCCCCAGGATGAGTACCGGCAGTGC
GGCACCCAGCGGTGTCCCGAGCCCCATGAGATCTGTGATGAGGACAACTTTGGTGCTGTGATCTGGAAGGAGACCCCAGCGGGAGAGGTG
GCTGCTGTCCGGTGTCCCCGCAACGCCACAGGACTCATCCTGCGACGGTGTGAGCTGGACGAGGAAGGCATCGCCTACTGGGAGCCCCCC
ACCTACATCCGCTGTGTTTCCATTGACTACAGAAACATCCAGATGATGACCCGGGAGCACCTGGCCAAGGCTCAGCGAGGGCTGCCTGGG
GAGGGGGTCTCGGAGGTCATCCAGACACTGGTGGAGATCTCTCAGGACGGGACCAGCTACAGTGGGGACCTGCTGTCCACCATCGATGTC
CTGAGGAACATGACAGAGATTTTCCGGAGAGCGTACTACAGCCCCACCCCTGGGGACGTACAGAACTTTGTCCAGATCCTTAGCAACCTG
TTGGCAGAGGAGAATCGGGACAAGTGGGAGGAGGCCCAGCTGGCGGGCCCCAACGCCAAGGAGCTGTTCCGGCTGGTGGAGGACTTTGTG
GACGTCATCGGCTTCCGCATGAAGGACCTGAGGGATGCATACCAGGTGACAGACAACCTGGTTCTCAGCATCCATAAGCTCCCAGCCAGC
GGAGCCACTGACATCAGCTTCCCCATGAAGGGCTGGCGGGCCACGGGTGACTGGGCCAAGGTGCCAGAGGACAGGGTCACTGTGTCCAAG
AGTGTCTTCTCCACGGGGCTGACAGAGGCCGATGAAGCATCCGTGTTTGTGGTGGGCACCGTGCTCTACAGGAACCTGGGCAGCTTCCTG
GCCCTGCAGAGGAACACGACCGTCCTGAATTCTAAGGTGATCTCCGTGACTGTGAAACCCCCGCCTCGCTCCCTGCGCACACCCTTGGAG
ATCGAGTTTGCCCACATGTATAATGGCACCACCAACCAGACCTGTATCCTGTGGGATGAGACGGATGTACCCTCCTCCTCCGCCCCCCCG
CAGCTCGGGCCCTGGTCGTGGCGCGGCTGCCGCACGGTGCCCCTCGACGCCCTCCGGACGCGCTGCCTCTGTGACCGGCTCTCCACCTTC
GCCATCTTAGCCCAGCTCAGCGCCGACGCGATTATTCAGGACAGAATATTCAAGCACATATCACGTAACAGTTTAATATGGAACAAACAT
CCTGGAGGATTGTTCGTACTACCACAGTATTCATCTTATCTGGGAGATTTTCCTTACTACTATGCTAATTTAGGTTTAAAGCCCCCCTCC
AAATTCACTGCAGTCATCCATGCGGTGACCCCCCTGGTCTCTCAGTCCCAGCCAGTGTTGAAGCTTCTCGTGGCTGCAGCCAAGTCCCAG
TACTGTGCCCAGATCATAGTTCTATGGAATTGTGACAAGCCCCTACCAGCCAAACACCGCTGGCCTGCCACTGCTGTGCCTGTCGTCGTC
ATTGAAGGAGAGAGCAAGGTTATGAGCAGCCGTTTTCTGCCCTACGACAACATCATCACAGACGCCGTGCTCAGCCTTGACGAGGACACG
GTGCTTTCAACAACAGAGGTGGATTTCGCCTTCACAGTGTGGCAGAGCTTCCCTGAGAGGATTGTGGGGTACCCCGCGCGCAGCCACTTC
TGGGATAACTCTAAGGAGCGGTGGGGATACACATCAAAGTGGACGAACGACTACTCCATGGTGTTGACAGGAGCTGCTATTTACCACAAA
TATTATCACTACCTATACTCCCATTACCTGCCAGCCAGCCTGAAGAACATGGTGGACCAATTGGCCAATTGTGAGGACATTCTCATGAAC
TTCCTGGTGTCTGCTGTGACAAAATTGCCTCCAATCAAAGTGACCCAGAAGAAGCAGTATAAGGAGACAATGATGGGACAGACTTCTCGG
GCTTCCCGTTGGGCTGACCCTGACCACTTTGCCCAGCGACAGAGCTGCATGAATACGTTTGCCAGCTGGTTTGGCTACATGCCGCTGATC
CACTCTCAGATGAGGCTCGACCCCGTCCTCTTTAAAGACCAGGTCTCTATTTTGAGGAAGAAATACCGAGACATTGAGCGACTTTGAGGA
ATCCGGCTGAGTGGGGGAGGGGAAGCAAGAAGGGATGGGGGTCAAGCTGCTCTCTCTTCCCAGTGCAGATCCACTCATCAGCAGAGCCAG
ATTGTGCCAACTATCCAAAAACTTAGATGAGCAGAATGACAAAAAAAAAAAGGCCAATGAGAACTCAACTCCTGGCTCCTGGGACTGCAC
CAGACTGCTCCAAACTCACCTCACTGGCTTCTGTGTCCCAAGACTAGGTTGTGTACAGTTTAATTATGGAACATTAAATAATTATTTTTG
AAATGATTGCTATGCAGGTTTAAACTTTTTTAATGATCAAAACTATTAAAAACCAGAGTTCTTTGTTTAATCAAAATTGTGTTGGTTGTG
AATATTTCAAAGCTGCTATTCCTTTTCCCACAGACATCATTGTCATGGCCATGTAGGGTGCCCTGCAGTTTCAAAAGCTCAAACTTCGTG
GAAAACACAATAAGTCACTCTACCCATTATCAAGAAATAACTGAGCATAAGTTGTAACTTCATTATTCAACTTTGCCAGTGCAAATTGTT
TTCCACTTCGAATCTTCAAATCCACTTGAACTTTTATCTCTAAAATGTCGCTGCATGAAAGAAAGTATTACGACTTCCAGGTAGGCAGTT
CTAACTGAAATCTCTATGTTTGAGATAGATATATATGATAATCGTTTTTCATTGGGGGGGTGGGGGGAATTAGTACCAAGAAAACACTAG
TATAATTAAGAAATGTTCAGTTTGCACAAAGAACTATCCAGATAACCCACCAGCATGTTAGTGAGATGGAAATACAGACCCACAACAGTA
ACCCAATACTTGCAGGGGTTGGGGGCACGGTTATAGATTCAACCATTGACCTAAGTCTGCGTAGCACTGGGAAGAGGCTTTGGTTTAGAA
GCCAAGGAATAATGAGTATATTGGGGAGAACAGATTATTTACAAGATGAACTCTTTAATGTTTGTGAGAATCTCAAGTTTCAGAGTTTCT
CTTTTGAGAAAGAAAAAGGGGTAATAAGGTAGAAATTCACACCAATGAACAAGAGGATTGCTGCAAAGTAACTGAGGAGATGTCTCGCCA
TTGGGACCCTAATGCCATTTTTGGTCAAACATTGTTTTGAGCAAGAATCTGGCAAACAAAATAATCAACAACAAATGTGAATATAGTTTC
ATTTACTTTTAATTTTTAAATCTGTGGAAAAGTTTAGTTGTGCTTCTTGTTAAAAAGAACATTTCTATCCCTGAAAATGCTATCTTGGGC
TTATGATTATTGTTAAACTCCAAGTATAAACTGAAAAAAAAAACATATCCCTAACTCTGTTATGAAAAATGGAGACTTCTGATATTAAAT
GCTTTCTTCTACTTGGAAGAGGCCAGAGAAAACAGGGAAGAGAAAGACATTATTGAGTTTGACCATGTATTATGCTGAATAAACAATAAG
CACTTTAGAGTCCCTCCCTCAAACTCTCATACATTTCATATTTCTTTTCCATTTATTTTCAGTTTTGTTTTAGAAGAAAAGTTCATCAGA
GAATTTTGTTCTGATAATTTCAAGTGGCCATCTTAGGTCAGTGGAAACCGTAAGTCCATTGGACTTTACCTCATCCTTTCTTTGTAGACA
TCTGGGAGAGGAAGAGGAGCTTGTAATGATAGCACGGGGATGGTGCTGTAAACAGGAGTGAAAGTGTTTGTGGAAGTCCAGAGAAGTGAC
TCAACAGGCTACCTAGCTGCAGAACAAGGAGTAGAGCCCACATATCCAGATGGTGTTTTGAGAGGTGCGTAGACAGTGAAATTCAATTAA
AGAGAGGATTTTCTCCTCAGCCTCATGACTCAGAACAGCTCCCTAAATACCTCTCTCATCTAATTGGACCCATCCATAGTCCATTCTCAA
ACATGTGACATTTTCCCCTAAGTAGATGTGATTATCTCTTAGGCATTTGTTGAAAAATAATTCTTAGGTCCATGGTGCTTGCACTTGTGT
CTTTTCTATAAAATGTTTGGTCTTACAGGTTTCATGTCATTTAAGCTGCACTTCTCAGCAAGTTAAAAGATTAGCAGTTAATCTTTATTC
ATTTGGCCTTTGTGAATTTGTATTTAAATTATTTTTTTCAGGATTGGCAAATTATTCTTGTTTCTCCTTTACCAAAAAATAAATGCGATA
TTTTGTTCAATGACCCCAAAACCAACTTGAAACTTAGGTGGTCATATTGGCTTGCAAAGCAATGTCCCTAATTGTACCAGTCAGTCACAC
AAGTGGATTCAAGGACCTGCTTTGCCAGATTGACCTGTCACCAAGCTCACAACACATATCCTCCACAAACAACATGTGTTATGTGAAGAA
AAATGGTAATTATAAATAAGAACAGGATATAACACACCTTTCATCCACTTTAAATCTCCACAGTTTCATTTTATGTCATTCTCTGAGCAA
ATCTCTTTGGGATGTGAGCTAGCGTGTTCTTCTCCCATTTGGAATATAAGGCTGGGAATAGAACAATGCTTAACAAATCAGTGAAGCTCG
ACAGTAATATGTAATTTTAATTCAGTTAGGAAAGAGTTGTATTGCATTGCAACAAGTTGACAAATCATAGCCATCTCTGTAGGGTGTCAG
GAATTATCTCCTGGTCAACTTTAATGATAACTAGGGGTCCCTAAGTGGGCTAACATGTGCTGCATTGGAGAGAAGCCAAGGGCTGAGAGT
ACAGTGCCAACCACGTAATGAATTGCTTGCAGAAATTCCAAAGAGGACTCAGCCACTTATGGATTTCCACACAGCATCTTTTCCCAGCTC
CACATTAAGACACAGGATCTTAAAAGTAATTTTTTAAAAGCTGGCTGTGTATTTATTTATATGAAAGTGTTATAAATATCAAAGCTTACA
AATACATTAATACATATCACCTTTGTTGAAGCTAGCAAAATGGCTCAAAATTGAGACTGTAGAGAAAAATCCATGAAATAATCACGATAG
TCATACCACAAAAGATAGCATAGCTAGGTGGGCCTACCTGGTCGTATCAACACACTTTCAGATAGATGCTTCAAAAAAAAAGGGAAAGCT
TGTTACTGGAATCTTATGTGCATTACAGTTTAATTTCTTCTTGTGACAAATGGTGCTCAAAGAAGGATCAATCCGGTTGCATCCTTCAAT
TTCCCTTTTAGAATGACTTTTGGTTTTCATTACAGTAGGCTATGTTAGCCTTTATTTTGGTGGTTCTCAAATACCTGGTGACTTGAAAGA
GTATTATGCTGGTGGCCTTTCATAAAGGTTACTGAACTTTAATCAGGGGTGAGATGAATGAATGTGAATAGGCCTATATACTATTTTTTT
TAATTAAAAAAAAAAAAAACAGGGCACTTGATTTAAGAACAAACCTGTTTAAAGGGCAGGATCAGGGGGAAGACAGCCTTACTAGGTTGG
AATCTGAATGAGATTCTTGCTGGACAAGGCTGAATTATGGATGCAAAAGCCAAGCAAACTTCCCAGACCTGGAAGTGTCTTGTGTTCCCT
TGGGTCTCTTGAGATCTCCAGTATTTAAAAGTAGCAGTTATTGCCATGAACTCTCAAATAAAAATGCTCCTGCCTCTTACTTGTGAATAT
AATCAACATGCAGCCAGTACATCAGGACCGCAGTTGCTGAATTCATTTACCTAGTTCCCCTTTACAGCGACTATGGACAAGAACCTTTCA
GTTGGTTTTGTCATTCTTGGCCAGATTTAACCAAGAAATTTCTTTCACTATCAGCCTCATTTTCTCAGGCATGCTTACAGGGACAAAGTT
GTTTTGAGTTGAGTTCTCAGCAAATAAGCATTAATATCAAACTGTGCTAAACTTGTCTCACTTGGTATAAGAACTTCAGCATAAAGAGAT
TAGTTTATCCTTTGACACCAAGTCTTTTTTTTTTCTTTGAGACAGAGTCTTGCCCTGTCACCCAGGCTGGAATGCATTGGTGTGATCTCA
GCTCAGTCACTGCAACCTCCGCCTCCCCGGTTCAAGCAATTCTCCTGCCTCACCCTGCTTAGTAGTTGGGATTACAGGCATGTGCTACCA
CACTGGGCTAATTTTTGTATATTTATTAGAGACAGGGTTTTGCTATGTTGGCCAGGCTGGTCTCAAACTCCTGACCTCAGGTGATCCACC
TGCCTCTGCCTCCCAAAGTGCTGAGATTACAGGTGTAAGCCACTGCGCCTGGCCCCCCTTTTTTTTTTCTTTAAAAAAAAAAAAAAGCTT
GGGGGATTAAGTACCCTAAGATAGTGGTCTCTCCCATCAGTTCAATAATGCTGTACAGAACCTCTGGATAAAAAAGCTGTTATTTACACC
ATAATTGTGATGATGGATTTGACTCCTCAAAACCATCTCTCCCCTTTTTGCCCTTTCCAAAAATCAAAGGCTTTTGTCTCCATGAGTTTT
ACCTAGTAGGGATTGTAGTTGTCACTGGTCTAGAGTTCCAATACATTTTATATCAGGAACCTCAGTGAGCTACCCAGAATTGGAACTTAA
CATGGCCCAGCAACTAGGAGAAGAAGGCAAACCTTGAGAAAGCTGAGCCCTCAGGATAGCACAGTCTCACCCTTGTTTAAAATCAAGTCA
GGCTAGCCTTTCAGTCACTCTGGTTTTGTAATGATTTGCCTTTTAGGCCTCAGGCCATTTGATTTCTGCAATAGTTTTCTGTTCTTCTCC
TACAGGAATACATCCCTGTCCTTTCGGTTGTAACTATTACTAAAACTAGGGCTATGCAAATGACCTGGTTACCATGTAATGAACCTTGTG
TACTTATTTTGAGAGAACAATATGTATAGGATATGTTGAGGGGCAGAAAGAAAGACATCAAAAACTGGAACTATTTTAGGTGGCAAATTG
TAACGCAAAAAACAAAAGTATACCTTATTTTGTATACGATGTACACTTGGGACAGAGTTTTCTAATATGTTGCCAATGTTTTTGTAGTGT
CACCACAGGTCTTTTCTGAAGTGTTTTTCCCATTTGTTATAGAGTATTAATGACTTAGCGTAATTAAGCCCTCAAGTATGTGTGAGAGAG
CGCGTGTGAGAAAATACAAAGCCATAGATATTGATTTACTTCACTGACCTGTGTAACTTTATGTCTGGGTTTCGCCATCCAAGATAGATT

>8898_8898_1_BAI1-EXT1_BAI1_chr8_143592434_ENST00000323289_EXT1_chr8_118834836_ENST00000378204_length(amino acids)=1313AA_BP=995
MLPSLSPEAGPSPIPPLPRLPAPTGPALPAAGPWHVKTWSAPACPARGTPAAPRARMRGQAAAPGPVWILAPLLLLLLLLGRRARAAAGA
DAGPGPEPCATLVQGKFFGYFSAAAVFPANASRCSWTLRNPDPRRYTLYMKVAKAPVPCSGPGRVRTYQFDSFLESTRTYLGVESFDEVL
RLCDPSAPLAFLQASKQFLQMRRQQPPQHDGLRPRAGPPGPTDDFSVEYLVVGNRNPSRAACQMLCRWLDACLAGSRSSHPCGIMQTPCA
CLGGEAGGPAAGPLAPRGDVCLRDAVAGGPENCLTSLTQDRGGHGATGGWKLWSLWGECTRDCGGGLQTRTRTCLPAPGVEGGGCEGVLE
EGRQCNREACGPAGRTSSRSQSLRSTDARRREELGDELQQFGFPAPQTGDPAAEEWSPWSVCSSTCGEGWQTRTRFCVSSSYSTQCSGPL
REQRLCNNSAVCPVHGAWDEWSPWSLCSSTCGRGFRDRTRTCRPPQFGGNPCEGPEKQTKFCNIALCPGRAVDGNWNEWSSWSACSASCS
QGRQQRTRECNGPSYGGAECQGHWVETRDCFLQQCPVDGKWQAWASWGSCSVTCGAGSQRRERVCSGPFFGGAACQGPQDEYRQCGTQRC
PEPHEICDEDNFGAVIWKETPAGEVAAVRCPRNATGLILRRCELDEEGIAYWEPPTYIRCVSIDYRNIQMMTREHLAKAQRGLPGEGVSE
VIQTLVEISQDGTSYSGDLLSTIDVLRNMTEIFRRAYYSPTPGDVQNFVQILSNLLAEENRDKWEEAQLAGPNAKELFRLVEDFVDVIGF
RMKDLRDAYQVTDNLVLSIHKLPASGATDISFPMKGWRATGDWAKVPEDRVTVSKSVFSTGLTEADEASVFVVGTVLYRNLGSFLALQRN
TTVLNSKVISVTVKPPPRSLRTPLEIEFAHMYNGTTNQTCILWDETDVPSSSAPPQLGPWSWRGCRTVPLDALRTRCLCDRLSTFAILAQ
LSADAIIQDRIFKHISRNSLIWNKHPGGLFVLPQYSSYLGDFPYYYANLGLKPPSKFTAVIHAVTPLVSQSQPVLKLLVAAAKSQYCAQI
IVLWNCDKPLPAKHRWPATAVPVVVIEGESKVMSSRFLPYDNIITDAVLSLDEDTVLSTTEVDFAFTVWQSFPERIVGYPARSHFWDNSK
ERWGYTSKWTNDYSMVLTGAAIYHKYYHYLYSHYLPASLKNMVDQLANCEDILMNFLVSAVTKLPPIKVTQKKQYKETMMGQTSRASRWA

--------------------------------------------------------------
>8898_8898_2_BAI1-EXT1_BAI1_chr8_143592434_ENST00000517894_EXT1_chr8_118834836_ENST00000378204_length(transcript)=9890nt_BP=3711nt
GATGTAATCCGTAATGCAGTCTCTGGCTCCCGATTCGGGATCCAGTTTCAGAGAGCGAGAGATTGGGGAGCGCCGGCAGCCGGGTGGGGG
GAAGCAGCTCTCGCCTCTCTCCTCCTCCTTCTCCTCGTCCTCGGCCTCCTCCTCCTCCCCGCGCCGCCCCGCCCCCGGCTCGGCTTGGCT
CGCCCCCCCCCCCCCCTCGCCAGGAAGGGGAAAAAAGGCGAGAAGAGCCGGGCAGGCGAGAGGAGCGGAGCGGCGGCGGCGGCCGGAGAG
GGAGCGGCGGGCGCAGGCGGCGGCGGCGGGCGCGGCGTTGGCGGCGGCCCCGGCGGAGCGAGCGCGGAGCCGGAGAGCCGGGAGCACAGG
CGGCCGCGCCGCGTCCTGGCCCGGCCCGGGCCCGCGCGCCAGCATCGTCCGCAGCGCGGGCATCCGGACCCTCCGGGCGCCCGGGGGGCT
CCAGCAGGCGCGGGGGAACGGGAGGGGGCCTGCGTGCGCCGGCGGGTGCTCTCCAGGGGGCGTCCCGGCGAGGCCCAGAGCGGGCCGGGG
GCGGCGGCGGCTGGAGGAGCCCCCCCCACCTCCGGTCGGGCGCCCGGCTCAGCCGCCGGCGACGCGAGGCGCTCGCGGGGATTTGCAACT
CGCCGGATCGAGTCCTCGCCGGCGGGGCCGCTGCTGCTGGGGAAGCTGCTGCTGGTGGCCACAGGCTGGCACCAGGGCCCTGGACTTTAG
AAGCCGTTGCTGCCCTCTCTGTCACCTGAAGCGGGGCCCTCTCCCATCCCACCCTTGCCCCGCCTCCCTGCCCCCACCGGGCCGGCCCTG
CCCGCCGCCGGACCCTGGCATGTCAAGACCTGGTCCGCGCCTGCCTGCCCAGCCCGCGGAACCCCGGCGGCCCCGCGAGCTAGGATGAGG
GGCCAGGCCGCCGCCCCGGGCCCCGTCTGGATCCTCGCCCCGCTGCTACTGCTGCTGCTGCTGCTGGGACGCCGCGCGCGGGCGGCCGCC
GGAGCAGACGCGGGGCCCGGGCCCGAGCCGTGCGCCACGCTGGTGCAGGGAAAGTTCTTCGGCTACTTCTCCGCGGCCGCCGTGTTCCCG
GCCAACGCCTCGCGCTGCTCCTGGACGCTACGCAACCCGGACCCGCGGCGCTACACTCTCTACATGAAGGTGGCCAAGGCGCCCGTGCCC
TGCAGCGGCCCCGGCCGCGTGCGCACCTACCAGTTCGACTCCTTCCTCGAGTCCACGCGCACCTACCTGGGCGTGGAGAGCTTCGACGAG
GTGCTGCGGCTCTGCGACCCCTCCGCACCCCTGGCCTTCCTGCAGGCCAGCAAGCAGTTCCTGCAGATGCGGCGCCAGCAGCCGCCCCAG
CACGACGGGCTCCGGCCCCGGGCCGGGCCGCCGGGCCCCACCGACGACTTCTCCGTGGAGTACCTGGTGGTGGGGAACCGCAACCCCAGC
CGTGCCGCCTGCCAGATGCTGTGCCGCTGGCTGGACGCGTGTCTGGCCGGTAGTCGCAGCTCGCACCCCTGCGGGATCATGCAGACCCCC
TGCGCCTGCCTGGGCGGCGAGGCGGGCGGCCCTGCCGCGGGACCCCTGGCCCCCCGCGGGGATGTCTGCTTGAGAGATGCGGTGGCTGGT
GGCCCTGAAAACTGCCTCACCAGCCTGACCCAGGACCGGGGCGGGCACGGCGCCACAGGCGGCTGGAAGCTGTGGTCCCTGTGGGGCGAA
TGCACGCGGGACTGCGGGGGAGGCCTCCAGACGCGGACGCGCACCTGCCTGCCCGCGCCGGGCGTGGAGGGCGGCGGCTGCGAGGGGGTG
CTGGAGGAGGGTCGCCAGTGCAACCGCGAGGCCTGCGGCCCCGCTGGGCGCACCAGCTCCCGGAGCCAGTCCCTGCGGTCCACAGATGCC
CGGCGGCGCGAGGAGCTGGGGGACGAGCTGCAGCAGTTTGGGTTCCCAGCCCCCCAGACCGGTGACCCAGCAGCCGAGGAGTGGTCCCCG
TGGAGCGTGTGCTCCAGCACCTGCGGCGAGGGCTGGCAGACCCGCACGCGCTTCTGCGTGTCCTCCTCCTACAGCACGCAGTGCAGCGGA
CCCCTGCGCGAGCAGCGGCTGTGCAACAACTCTGCCGTGTGCCCAGTGCATGGTGCCTGGGATGAGTGGTCGCCCTGGAGCCTCTGCTCC
AGCACCTGTGGCCGTGGCTTTCGGGATCGCACGCGCACCTGCAGGCCCCCCCAGTTTGGGGGCAACCCCTGTGAGGGCCCTGAGAAGCAA
ACCAAGTTCTGCAACATTGCCCTGTGCCCTGGCCGGGCAGTGGATGGAAACTGGAATGAGTGGTCGAGCTGGAGCGCCTGCTCCGCCAGC
TGCTCCCAGGGCCGACAGCAGCGCACGCGTGAATGCAACGGGCCTTCCTACGGGGGTGCGGAGTGCCAGGGCCACTGGGTGGAGACCCGA
GACTGCTTCCTGCAGCAGTGCCCAGTGGATGGCAAGTGGCAGGCCTGGGCGTCATGGGGCAGTTGCAGCGTCACGTGTGGGGCTGGCAGC
CAGCGACGGGAGCGTGTCTGCTCTGGGCCCTTCTTCGGGGGAGCAGCCTGCCAGGGCCCCCAGGATGAGTACCGGCAGTGCGGCACCCAG
CGGTGTCCCGAGCCCCATGAGATCTGTGATGAGGACAACTTTGGTGCTGTGATCTGGAAGGAGACCCCAGCGGGAGAGGTGGCTGCTGTC
CGGTGTCCCCGCAACGCCACAGGACTCATCCTGCGACGGTGTGAGCTGGACGAGGAAGGCATCGCCTACTGGGAGCCCCCCACCTACATC
CGCTGTGTTTCCATTGACTACAGAAACATCCAGATGATGACCCGGGAGCACCTGGCCAAGGCTCAGCGAGGGCTGCCTGGGGAGGGGGTC
TCGGAGGTCATCCAGACACTGGTGGAGATCTCTCAGGACGGGACCAGCTACAGTGGGGACCTGCTGTCCACCATCGATGTCCTGAGGAAC
ATGACAGAGATTTTCCGGAGAGCGTACTACAGCCCCACCCCTGGGGACGTACAGAACTTTGTCCAGATCCTTAGCAACCTGTTGGCAGAG
GAGAATCGGGACAAGTGGGAGGAGGCCCAGCTGGCGGGCCCCAACGCCAAGGAGCTGTTCCGGCTGGTGGAGGACTTTGTGGACGTCATC
GGCTTCCGCATGAAGGACCTGAGGGATGCATACCAGGTGACAGACAACCTGGTTCTCAGCATCCATAAGCTCCCAGCCAGCGGAGCCACT
GACATCAGCTTCCCCATGAAGGGCTGGCGGGCCACGGGTGACTGGGCCAAGGTGCCAGAGGACAGGGTCACTGTGTCCAAGAGTGTCTTC
TCCACGGGGCTGACAGAGGCCGATGAAGCATCCGTGTTTGTGGTGGGCACCGTGCTCTACAGGAACCTGGGCAGCTTCCTGGCCCTGCAG
AGGAACACGACCGTCCTGAATTCTAAGGTGATCTCCGTGACTGTGAAACCCCCGCCTCGCTCCCTGCGCACACCCTTGGAGATCGAGTTT
GCCCACATGTATAATGGCACCACCAACCAGACCTGTATCCTGTGGGATGAGACGGATGTACCCTCCTCCTCCGCCCCCCCGCAGCTCGGG
CCCTGGTCGTGGCGCGGCTGCCGCACGGTGCCCCTCGACGCCCTCCGGACGCGCTGCCTCTGTGACCGGCTCTCCACCTTCGCCATCTTA
GCCCAGCTCAGCGCCGACGCGATTATTCAGGACAGAATATTCAAGCACATATCACGTAACAGTTTAATATGGAACAAACATCCTGGAGGA
TTGTTCGTACTACCACAGTATTCATCTTATCTGGGAGATTTTCCTTACTACTATGCTAATTTAGGTTTAAAGCCCCCCTCCAAATTCACT
GCAGTCATCCATGCGGTGACCCCCCTGGTCTCTCAGTCCCAGCCAGTGTTGAAGCTTCTCGTGGCTGCAGCCAAGTCCCAGTACTGTGCC
CAGATCATAGTTCTATGGAATTGTGACAAGCCCCTACCAGCCAAACACCGCTGGCCTGCCACTGCTGTGCCTGTCGTCGTCATTGAAGGA
GAGAGCAAGGTTATGAGCAGCCGTTTTCTGCCCTACGACAACATCATCACAGACGCCGTGCTCAGCCTTGACGAGGACACGGTGCTTTCA
ACAACAGAGGTGGATTTCGCCTTCACAGTGTGGCAGAGCTTCCCTGAGAGGATTGTGGGGTACCCCGCGCGCAGCCACTTCTGGGATAAC
TCTAAGGAGCGGTGGGGATACACATCAAAGTGGACGAACGACTACTCCATGGTGTTGACAGGAGCTGCTATTTACCACAAATATTATCAC
TACCTATACTCCCATTACCTGCCAGCCAGCCTGAAGAACATGGTGGACCAATTGGCCAATTGTGAGGACATTCTCATGAACTTCCTGGTG
TCTGCTGTGACAAAATTGCCTCCAATCAAAGTGACCCAGAAGAAGCAGTATAAGGAGACAATGATGGGACAGACTTCTCGGGCTTCCCGT
TGGGCTGACCCTGACCACTTTGCCCAGCGACAGAGCTGCATGAATACGTTTGCCAGCTGGTTTGGCTACATGCCGCTGATCCACTCTCAG
ATGAGGCTCGACCCCGTCCTCTTTAAAGACCAGGTCTCTATTTTGAGGAAGAAATACCGAGACATTGAGCGACTTTGAGGAATCCGGCTG
AGTGGGGGAGGGGAAGCAAGAAGGGATGGGGGTCAAGCTGCTCTCTCTTCCCAGTGCAGATCCACTCATCAGCAGAGCCAGATTGTGCCA
ACTATCCAAAAACTTAGATGAGCAGAATGACAAAAAAAAAAAGGCCAATGAGAACTCAACTCCTGGCTCCTGGGACTGCACCAGACTGCT
CCAAACTCACCTCACTGGCTTCTGTGTCCCAAGACTAGGTTGTGTACAGTTTAATTATGGAACATTAAATAATTATTTTTGAAATGATTG
CTATGCAGGTTTAAACTTTTTTAATGATCAAAACTATTAAAAACCAGAGTTCTTTGTTTAATCAAAATTGTGTTGGTTGTGAATATTTCA
AAGCTGCTATTCCTTTTCCCACAGACATCATTGTCATGGCCATGTAGGGTGCCCTGCAGTTTCAAAAGCTCAAACTTCGTGGAAAACACA
ATAAGTCACTCTACCCATTATCAAGAAATAACTGAGCATAAGTTGTAACTTCATTATTCAACTTTGCCAGTGCAAATTGTTTTCCACTTC
GAATCTTCAAATCCACTTGAACTTTTATCTCTAAAATGTCGCTGCATGAAAGAAAGTATTACGACTTCCAGGTAGGCAGTTCTAACTGAA
ATCTCTATGTTTGAGATAGATATATATGATAATCGTTTTTCATTGGGGGGGTGGGGGGAATTAGTACCAAGAAAACACTAGTATAATTAA
GAAATGTTCAGTTTGCACAAAGAACTATCCAGATAACCCACCAGCATGTTAGTGAGATGGAAATACAGACCCACAACAGTAACCCAATAC
TTGCAGGGGTTGGGGGCACGGTTATAGATTCAACCATTGACCTAAGTCTGCGTAGCACTGGGAAGAGGCTTTGGTTTAGAAGCCAAGGAA
TAATGAGTATATTGGGGAGAACAGATTATTTACAAGATGAACTCTTTAATGTTTGTGAGAATCTCAAGTTTCAGAGTTTCTCTTTTGAGA
AAGAAAAAGGGGTAATAAGGTAGAAATTCACACCAATGAACAAGAGGATTGCTGCAAAGTAACTGAGGAGATGTCTCGCCATTGGGACCC
TAATGCCATTTTTGGTCAAACATTGTTTTGAGCAAGAATCTGGCAAACAAAATAATCAACAACAAATGTGAATATAGTTTCATTTACTTT
TAATTTTTAAATCTGTGGAAAAGTTTAGTTGTGCTTCTTGTTAAAAAGAACATTTCTATCCCTGAAAATGCTATCTTGGGCTTATGATTA
TTGTTAAACTCCAAGTATAAACTGAAAAAAAAAACATATCCCTAACTCTGTTATGAAAAATGGAGACTTCTGATATTAAATGCTTTCTTC
TACTTGGAAGAGGCCAGAGAAAACAGGGAAGAGAAAGACATTATTGAGTTTGACCATGTATTATGCTGAATAAACAATAAGCACTTTAGA
GTCCCTCCCTCAAACTCTCATACATTTCATATTTCTTTTCCATTTATTTTCAGTTTTGTTTTAGAAGAAAAGTTCATCAGAGAATTTTGT
TCTGATAATTTCAAGTGGCCATCTTAGGTCAGTGGAAACCGTAAGTCCATTGGACTTTACCTCATCCTTTCTTTGTAGACATCTGGGAGA
GGAAGAGGAGCTTGTAATGATAGCACGGGGATGGTGCTGTAAACAGGAGTGAAAGTGTTTGTGGAAGTCCAGAGAAGTGACTCAACAGGC
TACCTAGCTGCAGAACAAGGAGTAGAGCCCACATATCCAGATGGTGTTTTGAGAGGTGCGTAGACAGTGAAATTCAATTAAAGAGAGGAT
TTTCTCCTCAGCCTCATGACTCAGAACAGCTCCCTAAATACCTCTCTCATCTAATTGGACCCATCCATAGTCCATTCTCAAACATGTGAC
ATTTTCCCCTAAGTAGATGTGATTATCTCTTAGGCATTTGTTGAAAAATAATTCTTAGGTCCATGGTGCTTGCACTTGTGTCTTTTCTAT
AAAATGTTTGGTCTTACAGGTTTCATGTCATTTAAGCTGCACTTCTCAGCAAGTTAAAAGATTAGCAGTTAATCTTTATTCATTTGGCCT
TTGTGAATTTGTATTTAAATTATTTTTTTCAGGATTGGCAAATTATTCTTGTTTCTCCTTTACCAAAAAATAAATGCGATATTTTGTTCA
ATGACCCCAAAACCAACTTGAAACTTAGGTGGTCATATTGGCTTGCAAAGCAATGTCCCTAATTGTACCAGTCAGTCACACAAGTGGATT
CAAGGACCTGCTTTGCCAGATTGACCTGTCACCAAGCTCACAACACATATCCTCCACAAACAACATGTGTTATGTGAAGAAAAATGGTAA
TTATAAATAAGAACAGGATATAACACACCTTTCATCCACTTTAAATCTCCACAGTTTCATTTTATGTCATTCTCTGAGCAAATCTCTTTG
GGATGTGAGCTAGCGTGTTCTTCTCCCATTTGGAATATAAGGCTGGGAATAGAACAATGCTTAACAAATCAGTGAAGCTCGACAGTAATA
TGTAATTTTAATTCAGTTAGGAAAGAGTTGTATTGCATTGCAACAAGTTGACAAATCATAGCCATCTCTGTAGGGTGTCAGGAATTATCT
CCTGGTCAACTTTAATGATAACTAGGGGTCCCTAAGTGGGCTAACATGTGCTGCATTGGAGAGAAGCCAAGGGCTGAGAGTACAGTGCCA
ACCACGTAATGAATTGCTTGCAGAAATTCCAAAGAGGACTCAGCCACTTATGGATTTCCACACAGCATCTTTTCCCAGCTCCACATTAAG
ACACAGGATCTTAAAAGTAATTTTTTAAAAGCTGGCTGTGTATTTATTTATATGAAAGTGTTATAAATATCAAAGCTTACAAATACATTA
ATACATATCACCTTTGTTGAAGCTAGCAAAATGGCTCAAAATTGAGACTGTAGAGAAAAATCCATGAAATAATCACGATAGTCATACCAC
AAAAGATAGCATAGCTAGGTGGGCCTACCTGGTCGTATCAACACACTTTCAGATAGATGCTTCAAAAAAAAAGGGAAAGCTTGTTACTGG
AATCTTATGTGCATTACAGTTTAATTTCTTCTTGTGACAAATGGTGCTCAAAGAAGGATCAATCCGGTTGCATCCTTCAATTTCCCTTTT
AGAATGACTTTTGGTTTTCATTACAGTAGGCTATGTTAGCCTTTATTTTGGTGGTTCTCAAATACCTGGTGACTTGAAAGAGTATTATGC
TGGTGGCCTTTCATAAAGGTTACTGAACTTTAATCAGGGGTGAGATGAATGAATGTGAATAGGCCTATATACTATTTTTTTTAATTAAAA
AAAAAAAAAACAGGGCACTTGATTTAAGAACAAACCTGTTTAAAGGGCAGGATCAGGGGGAAGACAGCCTTACTAGGTTGGAATCTGAAT
GAGATTCTTGCTGGACAAGGCTGAATTATGGATGCAAAAGCCAAGCAAACTTCCCAGACCTGGAAGTGTCTTGTGTTCCCTTGGGTCTCT
TGAGATCTCCAGTATTTAAAAGTAGCAGTTATTGCCATGAACTCTCAAATAAAAATGCTCCTGCCTCTTACTTGTGAATATAATCAACAT
GCAGCCAGTACATCAGGACCGCAGTTGCTGAATTCATTTACCTAGTTCCCCTTTACAGCGACTATGGACAAGAACCTTTCAGTTGGTTTT
GTCATTCTTGGCCAGATTTAACCAAGAAATTTCTTTCACTATCAGCCTCATTTTCTCAGGCATGCTTACAGGGACAAAGTTGTTTTGAGT
TGAGTTCTCAGCAAATAAGCATTAATATCAAACTGTGCTAAACTTGTCTCACTTGGTATAAGAACTTCAGCATAAAGAGATTAGTTTATC
CTTTGACACCAAGTCTTTTTTTTTTCTTTGAGACAGAGTCTTGCCCTGTCACCCAGGCTGGAATGCATTGGTGTGATCTCAGCTCAGTCA
CTGCAACCTCCGCCTCCCCGGTTCAAGCAATTCTCCTGCCTCACCCTGCTTAGTAGTTGGGATTACAGGCATGTGCTACCACACTGGGCT
AATTTTTGTATATTTATTAGAGACAGGGTTTTGCTATGTTGGCCAGGCTGGTCTCAAACTCCTGACCTCAGGTGATCCACCTGCCTCTGC
CTCCCAAAGTGCTGAGATTACAGGTGTAAGCCACTGCGCCTGGCCCCCCTTTTTTTTTTCTTTAAAAAAAAAAAAAAGCTTGGGGGATTA
AGTACCCTAAGATAGTGGTCTCTCCCATCAGTTCAATAATGCTGTACAGAACCTCTGGATAAAAAAGCTGTTATTTACACCATAATTGTG
ATGATGGATTTGACTCCTCAAAACCATCTCTCCCCTTTTTGCCCTTTCCAAAAATCAAAGGCTTTTGTCTCCATGAGTTTTACCTAGTAG
GGATTGTAGTTGTCACTGGTCTAGAGTTCCAATACATTTTATATCAGGAACCTCAGTGAGCTACCCAGAATTGGAACTTAACATGGCCCA
GCAACTAGGAGAAGAAGGCAAACCTTGAGAAAGCTGAGCCCTCAGGATAGCACAGTCTCACCCTTGTTTAAAATCAAGTCAGGCTAGCCT
TTCAGTCACTCTGGTTTTGTAATGATTTGCCTTTTAGGCCTCAGGCCATTTGATTTCTGCAATAGTTTTCTGTTCTTCTCCTACAGGAAT
ACATCCCTGTCCTTTCGGTTGTAACTATTACTAAAACTAGGGCTATGCAAATGACCTGGTTACCATGTAATGAACCTTGTGTACTTATTT
TGAGAGAACAATATGTATAGGATATGTTGAGGGGCAGAAAGAAAGACATCAAAAACTGGAACTATTTTAGGTGGCAAATTGTAACGCAAA
AAACAAAAGTATACCTTATTTTGTATACGATGTACACTTGGGACAGAGTTTTCTAATATGTTGCCAATGTTTTTGTAGTGTCACCACAGG
TCTTTTCTGAAGTGTTTTTCCCATTTGTTATAGAGTATTAATGACTTAGCGTAATTAAGCCCTCAAGTATGTGTGAGAGAGCGCGTGTGA
GAAAATACAAAGCCATAGATATTGATTTACTTCACTGACCTGTGTAACTTTATGTCTGGGTTTCGCCATCCAAGATAGATTGTTTTATAG

>8898_8898_2_BAI1-EXT1_BAI1_chr8_143592434_ENST00000517894_EXT1_chr8_118834836_ENST00000378204_length(amino acids)=1313AA_BP=995
MLPSLSPEAGPSPIPPLPRLPAPTGPALPAAGPWHVKTWSAPACPARGTPAAPRARMRGQAAAPGPVWILAPLLLLLLLLGRRARAAAGA
DAGPGPEPCATLVQGKFFGYFSAAAVFPANASRCSWTLRNPDPRRYTLYMKVAKAPVPCSGPGRVRTYQFDSFLESTRTYLGVESFDEVL
RLCDPSAPLAFLQASKQFLQMRRQQPPQHDGLRPRAGPPGPTDDFSVEYLVVGNRNPSRAACQMLCRWLDACLAGSRSSHPCGIMQTPCA
CLGGEAGGPAAGPLAPRGDVCLRDAVAGGPENCLTSLTQDRGGHGATGGWKLWSLWGECTRDCGGGLQTRTRTCLPAPGVEGGGCEGVLE
EGRQCNREACGPAGRTSSRSQSLRSTDARRREELGDELQQFGFPAPQTGDPAAEEWSPWSVCSSTCGEGWQTRTRFCVSSSYSTQCSGPL
REQRLCNNSAVCPVHGAWDEWSPWSLCSSTCGRGFRDRTRTCRPPQFGGNPCEGPEKQTKFCNIALCPGRAVDGNWNEWSSWSACSASCS
QGRQQRTRECNGPSYGGAECQGHWVETRDCFLQQCPVDGKWQAWASWGSCSVTCGAGSQRRERVCSGPFFGGAACQGPQDEYRQCGTQRC
PEPHEICDEDNFGAVIWKETPAGEVAAVRCPRNATGLILRRCELDEEGIAYWEPPTYIRCVSIDYRNIQMMTREHLAKAQRGLPGEGVSE
VIQTLVEISQDGTSYSGDLLSTIDVLRNMTEIFRRAYYSPTPGDVQNFVQILSNLLAEENRDKWEEAQLAGPNAKELFRLVEDFVDVIGF
RMKDLRDAYQVTDNLVLSIHKLPASGATDISFPMKGWRATGDWAKVPEDRVTVSKSVFSTGLTEADEASVFVVGTVLYRNLGSFLALQRN
TTVLNSKVISVTVKPPPRSLRTPLEIEFAHMYNGTTNQTCILWDETDVPSSSAPPQLGPWSWRGCRTVPLDALRTRCLCDRLSTFAILAQ
LSADAIIQDRIFKHISRNSLIWNKHPGGLFVLPQYSSYLGDFPYYYANLGLKPPSKFTAVIHAVTPLVSQSQPVLKLLVAAAKSQYCAQI
IVLWNCDKPLPAKHRWPATAVPVVVIEGESKVMSSRFLPYDNIITDAVLSLDEDTVLSTTEVDFAFTVWQSFPERIVGYPARSHFWDNSK
ERWGYTSKWTNDYSMVLTGAAIYHKYYHYLYSHYLPASLKNMVDQLANCEDILMNFLVSAVTKLPPIKVTQKKQYKETMMGQTSRASRWA

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for BAI1-EXT1


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for BAI1-EXT1


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for BAI1-EXT1


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource