FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:ASH1L-FAM78B (FusionGDB2 ID:7178)

Fusion Gene Summary for ASH1L-FAM78B

check button Fusion gene summary
Fusion gene informationFusion gene name: ASH1L-FAM78B
Fusion gene ID: 7178
HgeneTgene
Gene symbol

ASH1L

FAM78B

Gene ID

55870

149297

Gene nameASH1 like histone lysine methyltransferasefamily with sequence similarity 78 member B
SynonymsASH1|ASH1L1|KMT2H|MRD52-
Cytomap

1q22

1q24.1

Type of geneprotein-codingprotein-coding
Descriptionhistone-lysine N-methyltransferase ASH1LASH1-like proteinabsent small and homeotic disks protein 1 homologash1 (absent, small, or homeotic)-likelysine N-methyltransferase 2Hprobable histone-lysine N-methyltransferase ASH1Lprotein FAM78B
Modification date2020031320200313
UniProtAcc

Q9NR48

Q5VT40

Ensembl transtripts involved in fusion geneENST00000368346, ENST00000392403, 
ENST00000548830, 
ENST00000354422, 
ENST00000338353, 
Fusion gene scores* DoF score36 X 20 X 14=100804 X 2 X 4=32
# samples 444
** MAII scorelog2(44/10080*10)=-4.51784830486262
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(4/32*10)=0.321928094887362
effective Gene in Pan-Cancer Fusion Genes (eGinPCFGs).
DoF>8 and MAII>0
Context

PubMed: ASH1L [Title/Abstract] AND FAM78B [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointASH1L(155365250)-FAM78B(166040000), # samples:3
Anticipated loss of major functional domain due to fusion event.ASH1L-FAM78B seems lost the major protein functional domain in Hgene partner, which is a transcription factor due to the frame-shifted ORF.
ASH1L-FAM78B seems lost the major protein functional domain in Hgene partner, which is a essential gene due to the frame-shifted ORF.
ASH1L-FAM78B seems lost the major protein functional domain in Hgene partner, which is a IUPHAR drug target due to the frame-shifted ORF.
ASH1L-FAM78B seems lost the major protein functional domain in Hgene partner, which is a epigenetic factor due to the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneASH1L

GO:0097676

histone H3-K36 dimethylation

26002201


check buttonFusion gene breakpoints across ASH1L (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.

check buttonFusion gene breakpoints across FAM78B (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4BRCATCGA-AR-A0TT-01AASH1Lchr1

155365250

-FAM78Bchr1

166040000

-
ChimerDB4BRCATCGA-AR-A0TT-01AASH1Lchr1

155365250

-FAM78Bchr1

166040000

-
ChimerDB4BRCATCGA-AR-A0TT-01AASH1Lchr1

155365250

-FAM78Bchr1

166040000

-


Top

Fusion Gene ORF analysis for ASH1L-FAM78B

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
Frame-shiftENST00000368346ENST00000354422ASH1Lchr1

155365250

-FAM78Bchr1

166040000

-
In-frameENST00000368346ENST00000338353ASH1Lchr1

155365250

-FAM78Bchr1

166040000

-
Frame-shiftENST00000392403ENST00000354422ASH1Lchr1

155365250

-FAM78Bchr1

166040000

-
In-frameENST00000392403ENST00000338353ASH1Lchr1

155365250

-FAM78Bchr1

166040000

-
intron-3CDSENST00000548830ENST00000354422ASH1Lchr1

155365250

-FAM78Bchr1

166040000

-
intron-3CDSENST00000548830ENST00000338353ASH1Lchr1

155365250

-FAM78Bchr1

166040000

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000368346ASH1Lchr1155365250-ENST00000338353FAM78Bchr1166040000-7487674364067682042
ENST00000392403ASH1Lchr1155365250-ENST00000338353FAM78Bchr1166040000-7326658247966072042

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000368346ENST00000338353ASH1Lchr1155365250-FAM78Bchr1166040000-0.0014374350.99856263
ENST00000392403ENST00000338353ASH1Lchr1155365250-FAM78Bchr1166040000-0.0012223820.99877757

Top

Fusion Genomic Features for ASH1L-FAM78B


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for ASH1L-FAM78B


check button Go to

FGviewer for the breakpoints of chr1:155365250-chr1:166040000

.
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
ASH1L

Q9NR48

FAM78B

Q5VT40

FUNCTION: Histone methyltransferase specifically trimethylating 'Lys-36' of histone H3 forming H3K36me3 (PubMed:21239497). Also monomethylates 'Lys-9' of histone H3 (H3K9me1) in vitro (By similarity). The physiological significance of the H3K9me1 activity is unclear (By similarity). {ECO:0000250|UniProtKB:Q99MY8, ECO:0000269|PubMed:21239497}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneASH1Lchr1:155365250chr1:166040000ENST00000368346-7281380_14242034.33333333333332970.0Compositional biasNote=Pro-rich
HgeneASH1Lchr1:155365250chr1:166040000ENST00000368346-7281580_17912034.33333333333332970.0Compositional biasNote=Ser-rich
HgeneASH1Lchr1:155365250chr1:166040000ENST00000392403-7281380_14242034.33333333333332965.0Compositional biasNote=Pro-rich
HgeneASH1Lchr1:155365250chr1:166040000ENST00000392403-7281580_17912034.33333333333332965.0Compositional biasNote=Ser-rich
HgeneASH1Lchr1:155365250chr1:166040000ENST00000368346-7281347_13592034.33333333333332970.0DNA bindingNote=A.T hook 2
HgeneASH1Lchr1:155365250chr1:166040000ENST00000368346-7281847_18592034.33333333333332970.0DNA bindingNote=A.T hook 3
HgeneASH1Lchr1:155365250chr1:166040000ENST00000368346-728887_8992034.33333333333332970.0DNA bindingNote=A.T hook 1
HgeneASH1Lchr1:155365250chr1:166040000ENST00000392403-7281347_13592034.33333333333332965.0DNA bindingNote=A.T hook 2
HgeneASH1Lchr1:155365250chr1:166040000ENST00000392403-7281847_18592034.33333333333332965.0DNA bindingNote=A.T hook 3
HgeneASH1Lchr1:155365250chr1:166040000ENST00000392403-728887_8992034.33333333333332965.0DNA bindingNote=A.T hook 1

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneASH1Lchr1:155365250chr1:166040000ENST00000368346-7282091_21422034.33333333333332970.0DomainAWS
HgeneASH1Lchr1:155365250chr1:166040000ENST00000368346-7282145_22612034.33333333333332970.0DomainSET
HgeneASH1Lchr1:155365250chr1:166040000ENST00000368346-7282269_22852034.33333333333332970.0DomainPost-SET
HgeneASH1Lchr1:155365250chr1:166040000ENST00000368346-7282463_25332034.33333333333332970.0DomainBromo
HgeneASH1Lchr1:155365250chr1:166040000ENST00000368346-7282661_27982034.33333333333332970.0DomainBAH
HgeneASH1Lchr1:155365250chr1:166040000ENST00000392403-7282091_21422034.33333333333332965.0DomainAWS
HgeneASH1Lchr1:155365250chr1:166040000ENST00000392403-7282145_22612034.33333333333332965.0DomainSET
HgeneASH1Lchr1:155365250chr1:166040000ENST00000392403-7282269_22852034.33333333333332965.0DomainPost-SET
HgeneASH1Lchr1:155365250chr1:166040000ENST00000392403-7282463_25332034.33333333333332965.0DomainBromo
HgeneASH1Lchr1:155365250chr1:166040000ENST00000392403-7282661_27982034.33333333333332965.0DomainBAH
HgeneASH1Lchr1:155365250chr1:166040000ENST00000368346-7282069_22882034.33333333333332970.0RegionNote=Catalytic domain
HgeneASH1Lchr1:155365250chr1:166040000ENST00000392403-7282069_22882034.33333333333332965.0RegionNote=Catalytic domain
HgeneASH1Lchr1:155365250chr1:166040000ENST00000368346-7282585_26312034.33333333333332970.0Zinc fingerNote=PHD-type
HgeneASH1Lchr1:155365250chr1:166040000ENST00000392403-7282585_26312034.33333333333332965.0Zinc fingerNote=PHD-type


Top

Fusion Gene Sequence for ASH1L-FAM78B


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>In-frame_ENST00000368346_ENST00000338353_TCGA-AR-A0TT-01A_ASH1L_chr1_155365250_-_FAM78B_chr1_166040000_length(transcript)=7487nt_BP=6743nt
CGCGTACGGGCTGGCTGGCGCGCGGGGCGGCCGAAGGTGGTGGTTGGTGGGAGCAGCCAGCGACGAGCCCGTAGACACTCGTACGCGTGC
GGGCGGGCGTGCGCGCTACGCGGACGGAGTCGGGCGGAGCGGGGGACGGTGGGAAAGAACGCAAGGAGGAAGGAGTGGAAGGTTGAGGGG
GGCGCTAGGCGCCCTTCGCTCCCTCCCTCTGGAGGAGCTGCCGCCGCCACCGCCGCCACTCTGCTGCTGCCGCCGCCGCCGCCGCCGCTC
CCGCCGCCATTTTGGGTTCGCTTTGCGGAGGGGAGACGATCCCAGTCTCGGTTGCGGGACCCGCCTCCCCTCAGTTTGCCCCCTTTAGCC
TTCCACCTTTCCCTTCTCCTCTCTCGCATTTCCGCCAGTCAGCTTACCCGCTGGCCGCCTCCTGACAAGCGGGAGGGATCCGCCGTGGAC
CCAGGGAAGCGGAGGAGCCTGGCGGCCACCCCCTCTTCCCCACTTCCCTGCACTCTCATCGCTCTCGGCCTCGGCCTCGGCCTCCGACAC
GAGAAAGATGCTGGTTTCGAGTTTTGGAGATCCTTGTTTTTTATGGAACACAGTTCTGTAAAATTTTCATAAGATTCCTTGGCAATAACA
TACGCTTGTGATGGACCCTAGAAATACTGCTATGTTAGGATTGGGTTCTGATTCCGAAGGTTTTTCAAGAAAGAGTCCTTCTGCCATCAG
TACTGGCACATTGGTCAGTAAGAGAGAAGTAGAGCTAGAAAAAAACACAAAGGAGGAAGAGGACCTTCGCAAACGGAATCGAGAAAGAAA
CATCGAAGCTGGGAAAGATGATGGTTTGACTGATGCACAGCAACAGTTTTCAGTGAAAGAAACAAACTTTTCAGAGGGAAATTTAAAATT
GAAAATTGGCCTCCAGGCTAAGAGAACTAAAAAACCTCCAAAGAACTTGGAGAACTATGTATGTCGACCTGCCATAAAAACAACTATTAA
GCACCCAAGGAAAGCACTTAAAAGTGGAAAGATGACGGATGAAAAGAATGAACACTGTCCTTCAAAACGAGACCCTTCAAAGTTGTACAA
GAAAGCAGATGATGTTGCAGCCATTGAATGCCAGTCTGAAGAAGTCATCCGTCTTCATTCACAGGGAGAAAACAATCCTTTGTCTAAGAA
GCTGTCTCCAGTACACTCAGAAATGGCAGATTATATTAATGCAACGCCATCTACTCTTCTTGGTAGCCGGGATCCTGATTTAAAGGACAG
AGCATTACTTAATGGAGGAACTAGTGTAACAGAAAAGTTGGCACAGCTGATTGCTACCTGTCCTCCTTCCAAGTCTTCCAAGACAAAACC
GAAGAAGTTAGGAACTGGCACTACAGCAGGATTGGTTAGCAAGGATTTGATCAGGAAAGCAGGTGTTGGCTCTGTAGCTGGAATAATACA
TAAGGACTTAATAAAAAAGCCAACCATCAGCACAGCAGTTGGATTGGTAACTAAAGATCCTGGGAAAAAGCCAGTGTTTAATGCAGCAGT
AGGATTGGTCAATAAGGACTCTGTGAAAAAACTGGGAACTGGCACTACAGCGGTATTCATTAATAAAAACTTAGGCAAAAAGCCAGGAAC
TATCACTACAGTAGGACTGCTAAGCAAAGATTCAGGAAAGAAGCTAGGAATTGGTATTGTTCCAGGTTTAGTGCATAAAGAGTCTGGCAA
GAAGTTAGGACTTGGCACTGTGGTTGGACTGGTTAATAAAGATTTGGGAAAGAAATTGGGTTCTACTGTTGGCCTAGTGGCCAAGGACTG
TGCAAAGAAGATTGTAGCAAGTTCAGCAATGGGATTGGTTAATAAGGACATTGGAAAGAAACTAATGAGTTGTCCTTTGGCAGGTCTGAT
CAGTAAAGATGCCATAAACCTTAAAGCCGAAGCACTGCTCCCCACTCAGGAACCGCTTAAGGCTTCTTGTAGTACAAACATCAATAATCA
GGAAAGTCAGGAACTTTCTGAATCCCTGAAAGATAGTGCCACCAGCAAAACTTTTGAAAAGAATGTTGTACGGCAGAATAAAGAAAGCAT
ATTGGAAAAGTTCTCAGTACGAAAAGAAATCATTAATTTGGAGAAAGAAATGTTTAATGAAGGAACATGCATTCAGCAAGACAGTTTCTC
ATCCAGTGAAAAGGGATCTTATGAAACCTCAAAGCATGAAAAGCAGCCTCCTGTATATTGCACTTCTCCGGACTTTAAAATGGGAGGTGC
TTCTGATGTATCTACCGCTAAATCCCCATTCAGTGCAGTAGGAGAAAGCAATCTCCCTTCCCCATCACCTACTGTATCTGTTAATCCTTT
AACCAGAAGTCCCCCTGAAACTTCTTCACAGTTGGCTCCTAATCCATTACTTTTAAGTTCTACTACAGAACTAATCGAAGAAATTTCTGA
ATCTGTTGGAAAGAACCAGTTTACTTCTGAAAGTACCCACTTGAACGTTGGTCATAGGTCAGTTGGTCATAGTATAAGTATTGAATGTAA
AGGGATTGATAAAGAGGTAAATGATTCAAAAACTACCCATATAGATATTCCAAGAATAAGCTCTTCCCTTGGAAAAAAGCCAAGTTTGAC
TTCTGAATCCAGCATTCATACTATTACTCCTTCAGTTGTTAACTTCACTAGTTTATTTAGTAATAAGCCTTTTTTAAAACTGGGTGCAGT
ATCTGCATCAGACAAACACTGCCAAGTTGCTGAAAGCCTAAGTACTAGTTTGCAGTCCAAACCATTAAAAAAAAGAAAAGGAAGAAAACC
TCGGTGGACTAAAGTGGTGGCAAGAAGCACATGCCGGTCTCCAAAAGGGCTAGAATTAGAAAGATCAGAGCTTTTTAAAAACGTTTCATG
TAGCTCACTATCAAATAGTAATTCTGAGCCAGCCAAGTTTATGAAAAACATTGGACCCCCTTCATTTGTAGATCATGACTTCCTTAAACG
CCGATTGCCAAAGTTGAGCAAATCCACAGCTCCATCTCTTGCTCTCTTAGCTGATAGTGAAAAACCATCTCATAAGTCTTTTGCTACTCA
CAAACTATCCTCCAGTATGTGTGTCTCTAGTGACCTTTTGTCTGATATTTATAAGCCCAAAAGAGGAAGGCCTAAATCTAAGGAGATGCC
TCAACTGGAAGGGCCACCTAAAAGGACTTTAAAAATCCCTGCTTCTAAAGTGTTTTCTTTACAGTCTAAGGAAGAACAAGAACCCCCAAT
TTTACAGCCAGAAATTGAAATCCCTTCCTTCAAACAAGGTCTGTCTGTGTCTCCTTTTCCAAAAAAGAGAGGCAGGCCTAAGAGGCAAAT
GAGGTCACCAGTCAAGATGAAGCCACCTGTACTGTCAGTGGCTCCATTTGTTGCCACTGAAAGTCCAAGCAAGCTAGAATCTGAAAGTGA
CAACCATAGAAGTAGCAGTGATTTCTTTGAGAGCGAGGATCAACTTCAGGATCCAGATGACCTAGATGACAGTCATAGGCCAAGTGTCTG
TAGTATGAGTGACCTTGAGATGGAACCAGATAAAAAAATTACCAAGAGAAACAATGGACAATTAATGAAAACAATTATCCGCAAAATAAA
TAAAATGAAGACTTTAAAGAGAAAGAAACTGTTGAATCAGATTCTTTCAAGTTCTGTAGAATCAAGTAATAAAGGGAAAGTGCAATCCAA
ACTCCATAATACGGTATCAAGTCTTGCTGCCACATTTGGCTCTAAATTGGGCCAACAGATAAATGTCAGCAAGAAAGGAACCATTTATAT
AGGAAAGAGAAGAGGTCGCAAACCAAAAACTGTCTTAAATGGTATTCTTTCTGGTAGTCCTACTAGCCTTGCTGTTCTTGAGCAAACAGC
TCAACAGGCAGCTGGGTCAGCATTAGGACAGATTCTTCCCCCATTACTGCCTTCATCTGCTAGTAGTTCTGAGATTCTTCCATCACCTAT
TTGCTCTCAGTCTTCTGGGACTAGTGGAGGTCAGAGCCCTGTAAGTAGTGATGCAGGTTTTGTTGAACCCAGTTCAGTGCCATATTTGCA
TTTACACTCCAGACAGGGCAGTATGATTCAGACTCTTGCAATGAAGAAGGCCTCAAAGGGGAGGAGGCGGTTATCTCCTCCTACTTTGTT
GCCAAATTCTCCTTCGCACTTGAGTGAACTCACATCTCTAAAAGAAGCTACTCCTTCCCCAATCAGTGAGTCTCATAGTGATGAGACCAT
TCCCAGTGATAGTGGAATTGGAACAGATAATAACAGCACATCAGACAGGGCAGAGAAATTTTGTGGGCAAAAAAAGAGGAGGCATTCTTT
TGAGCATGTTTCTCTGATTCCCCCTGAAACCTCTACAGTGCTAAGCAGTCTTAAAGAAAAACATAAACACAAATGTAAGCGCAGGAATCA
TGATTACCTCAGCTATGACAAGATGAAAAGGCAGAAACGAAAACGGAAAAAGAAATATCCCCAGCTTCGAAATAGACAGGATCCAGACTT
TATTGCAGAGCTGGAGGAACTAATAAGTCGCCTAAGTGAAATTCGGATCACTCATCGAAGTCATCATTTTATCCCCCGAGATCTTCTGCC
AACTATCTTTCGAATCAACTTTAATAGTTTCTATACACATCCTTCTTTCCCCTTAGACCCTTTGCACTACATTCGAAAACCTGACTTAAA
AAAGAAAAGAGGGAGACCCCCTAAGATGAGGGAGGCAATGGCTGAAATGCCTTTTATGCACAGCCTTAGTTTTCCTCTTTCTAGTACTGG
ATTCTATCCATCTTATGGTATGCCTTACTCTCCTTCACCCCTTACAGCTGCTCCCATAGGATTAGGTTACTATGGAAGGTATCCTCCCAC
TCTTTATCCACCTCCTCCATCTCCTTCTTTCACCACGCCACTTCCACCTCCTTCCTATATGCATGCTGGTCATTTACTTCTCAATCCTGC
CAAATACCATAAGAAAAAGCATAAGCTACTTCGACAGGAGGCCTTTCTTACAACCAGCAGGACTCCCCTCCTTTCCATGAGTACCTACCC
CAGTGTTCCTCCTGAGATGGCCTATGGTTGGATGGTTGAGCACAAACACAGGCACCGTCACAAACACAGAGAACACCGTTCTTCTGAACA
ACCCCAGGTTTCTATGGACACTGGCTCTTCCCGATCTGTCCTGGAATCTTTGAAGCGCTATAGATTTGGAAAGGATGCTGTTGGAGAGCG
ATATAAGCATAAGGAAAAGCACCGTTGTCACATGTCCTGCCCTCATCTCTCTCCTTCAAAAAGCTTAATAAACAGAGAGGAACAGTGGGT
CCACCGAGAGCCTTCAGAATCTAGTCCATTGGCCTTGGGATTGCAGACACCTTTACAGATTGACTGTTCAGAAAGTTCTCCAAGCTTATC
CCTTGGAGGATTCACTCCCAACTCTGAGCCAGCCAGCAGTGATGAACATACAAACCTTTTCACAAGTGCAATAGGCAGCTGCAGAGTTTC
AAACCCTAACTCCAGTGGCCGGAAGAAATTAACTGACAGCCCTGGACTCTTTTCTGCACAGGACACTTCACTAAATCGGCTTCACAGAAA
GGAGTCACTGCCTTCTAACGAAAGGGCAGTACAGACTTTGGCAGGCTCCCAGCCAACCTCTGATAAACCCTCCCAGCGGCCATCAGAGAG
CACAAATTGTAGCCCTACCCGGAAAAGGTCTTCATCTGAGAGTACTTCTTCAACAGTAAACGGAGTTCCCTCTCGAAGTCCAAGATTAGT
TGCTTCTGGGGATGACTCTGTGGATAGTCTGCTGCAGCGGATGGTACAAAATGAGGACCAAGAGCCCATGGAGAAAAGTATTGATGCTGT
GATTGCAACTGCCTCTGCACCACCTTCTTCCAGTCCAGGCCGTAGCCACAGCAAGGACCGAACCCTGGGAAAACCAGACAGCCTTTTAGT
GCCTGCAGTCACAAGTGACTCTTGCAATAATAGCATCTCACTCCTATCTGAAAAGTTGACAAGCAGCTGTTCCCCCCATCATATCAAGAG
AAGTGTAGTGGAAGCTATGCAACGCCAAGCTCGGAAAATGTGCAATTACGACAAAATCTTGGCCACAAAGAAAAACCTAGACCATGTCAA
TAAAATCTTAAAAGCCAAAAAACTTCAAAGGCAGGCCAGGACAGGGAATAACTTTGTGAAACGTAGGCCAGGTCGACCTCGGAAATGTCC
CCTTCAGGCTGTCGTATCAATGCAAGCATTCCAGGCTGCTCAGTTTGTCAACCCAGAATTGAACAGAGACGAGGAAGGAGCAGCACTGCA
CCTCAGTCCTGACACAGTTACAGATGTAATTGAGGCTGTTGTTCAGAGTGTAAATCTGAACCCAGAACATAAAAAGGGGTTGAAGAGAAA
AGGTTGGCTATTGGAAGAACAGACCAGAAAAAAGCAGAAGCCATTACCAGAGGAAGAAGAGCAAGAGAATAATAAAAGCTTTAATGAAGC
ACCAGTTGAGATTCCCAGTCCTTCTGAAACCCCAGCTAAACCTTCTGAACCTGAAAGTACCTTGCAGCCTGTGCTTTCTCTCATCCCAAG
GGAAAAGAAGCCCCCACGTCCCCCAAAGAAGAAGTATCAGAAAGCAGGGCTGTATTCTGACGTTTACAAAACTACAGACCCAAAGAGTCG
ATTGATCCAATTAAAGAAAGAGAAGCTGGAGTATACTCCAGGAGAGCATGAATATGGATTATTTCCAGCGCCCATTCATGTTGGTCAAGC
TGGGAACTGCCTGACTTGAGGGAAGGGAGAGTAAAAGCCATCAGTGACTCAGATGGGGTGAGCTACCCTTGGTACGGGAACACCACAGAA
ACTGTGACCCTGGTTGGCCCCACCAACAAGATCTCCAGGTTCTCCGTCAGCATGAATGACAACTTCTACCCCAGTGTGACATGGGCAGTG
CCTGTGAGTGACAGCAATGTGCCACTGCTCACAAGAATCAAGAGAGACCAAAGTTTCACGACCTGGCTGGTGGCCATGAACACCACCACA
AAGGAGAAGATCATTCTGCAGACCATCAAGTGGAGGATGAGGGTGGACATTGAAGTGGACCCTCTTCAGCTCTTGGGGCAGCGGGCCCGG
CTGGTGGGCAGGACTCAGCAGGAGCAGCCCCGGATCCTGAGCCGGATGGAACCCATCCCCCCTAATGCACTAGTGAAACCCAATGCCAAT
GATGCCCAGGTCCTCATGTGGAGGCCCAAGCGGGGGCCACCTCTGGTTGTGATCCCTCCTAAGTAGAAGCAGACTGGCCTGACTGTGTGT
GGATCACACGCCTCTGAGACATGCAGTGAGGGTGCCAGGGGTGGCAGGAGCCAAACAGAGTTTCTGAGCCAAAGCAGACCTCTCGGTTTG
CCAGCCTTTGCAGCCACTTTTGAAGAGTAGGGCTGCTCCTTGGGTGGTAGAACCATAATCCTTAGGAAAAATCCCTTCCTCTTAGGAATA

>In-frame_ENST00000368346_ENST00000338353_TCGA-AR-A0TT-01A_ASH1L_chr1_155365250_-_FAM78B_chr1_166040000_length(amino acids)=2042AA_start in transcript=640_stop in transcript=6768
MDPRNTAMLGLGSDSEGFSRKSPSAISTGTLVSKREVELEKNTKEEEDLRKRNRERNIEAGKDDGLTDAQQQFSVKETNFSEGNLKLKIG
LQAKRTKKPPKNLENYVCRPAIKTTIKHPRKALKSGKMTDEKNEHCPSKRDPSKLYKKADDVAAIECQSEEVIRLHSQGENNPLSKKLSP
VHSEMADYINATPSTLLGSRDPDLKDRALLNGGTSVTEKLAQLIATCPPSKSSKTKPKKLGTGTTAGLVSKDLIRKAGVGSVAGIIHKDL
IKKPTISTAVGLVTKDPGKKPVFNAAVGLVNKDSVKKLGTGTTAVFINKNLGKKPGTITTVGLLSKDSGKKLGIGIVPGLVHKESGKKLG
LGTVVGLVNKDLGKKLGSTVGLVAKDCAKKIVASSAMGLVNKDIGKKLMSCPLAGLISKDAINLKAEALLPTQEPLKASCSTNINNQESQ
ELSESLKDSATSKTFEKNVVRQNKESILEKFSVRKEIINLEKEMFNEGTCIQQDSFSSSEKGSYETSKHEKQPPVYCTSPDFKMGGASDV
STAKSPFSAVGESNLPSPSPTVSVNPLTRSPPETSSQLAPNPLLLSSTTELIEEISESVGKNQFTSESTHLNVGHRSVGHSISIECKGID
KEVNDSKTTHIDIPRISSSLGKKPSLTSESSIHTITPSVVNFTSLFSNKPFLKLGAVSASDKHCQVAESLSTSLQSKPLKKRKGRKPRWT
KVVARSTCRSPKGLELERSELFKNVSCSSLSNSNSEPAKFMKNIGPPSFVDHDFLKRRLPKLSKSTAPSLALLADSEKPSHKSFATHKLS
SSMCVSSDLLSDIYKPKRGRPKSKEMPQLEGPPKRTLKIPASKVFSLQSKEEQEPPILQPEIEIPSFKQGLSVSPFPKKRGRPKRQMRSP
VKMKPPVLSVAPFVATESPSKLESESDNHRSSSDFFESEDQLQDPDDLDDSHRPSVCSMSDLEMEPDKKITKRNNGQLMKTIIRKINKMK
TLKRKKLLNQILSSSVESSNKGKVQSKLHNTVSSLAATFGSKLGQQINVSKKGTIYIGKRRGRKPKTVLNGILSGSPTSLAVLEQTAQQA
AGSALGQILPPLLPSSASSSEILPSPICSQSSGTSGGQSPVSSDAGFVEPSSVPYLHLHSRQGSMIQTLAMKKASKGRRRLSPPTLLPNS
PSHLSELTSLKEATPSPISESHSDETIPSDSGIGTDNNSTSDRAEKFCGQKKRRHSFEHVSLIPPETSTVLSSLKEKHKHKCKRRNHDYL
SYDKMKRQKRKRKKKYPQLRNRQDPDFIAELEELISRLSEIRITHRSHHFIPRDLLPTIFRINFNSFYTHPSFPLDPLHYIRKPDLKKKR
GRPPKMREAMAEMPFMHSLSFPLSSTGFYPSYGMPYSPSPLTAAPIGLGYYGRYPPTLYPPPPSPSFTTPLPPPSYMHAGHLLLNPAKYH
KKKHKLLRQEAFLTTSRTPLLSMSTYPSVPPEMAYGWMVEHKHRHRHKHREHRSSEQPQVSMDTGSSRSVLESLKRYRFGKDAVGERYKH
KEKHRCHMSCPHLSPSKSLINREEQWVHREPSESSPLALGLQTPLQIDCSESSPSLSLGGFTPNSEPASSDEHTNLFTSAIGSCRVSNPN
SSGRKKLTDSPGLFSAQDTSLNRLHRKESLPSNERAVQTLAGSQPTSDKPSQRPSESTNCSPTRKRSSSESTSSTVNGVPSRSPRLVASG
DDSVDSLLQRMVQNEDQEPMEKSIDAVIATASAPPSSSPGRSHSKDRTLGKPDSLLVPAVTSDSCNNSISLLSEKLTSSCSPHHIKRSVV
EAMQRQARKMCNYDKILATKKNLDHVNKILKAKKLQRQARTGNNFVKRRPGRPRKCPLQAVVSMQAFQAAQFVNPELNRDEEGAALHLSP
DTVTDVIEAVVQSVNLNPEHKKGLKRKGWLLEEQTRKKQKPLPEEEEQENNKSFNEAPVEIPSPSETPAKPSEPESTLQPVLSLIPREKK

--------------------------------------------------------------
>In-frame_ENST00000392403_ENST00000338353_TCGA-AR-A0TT-01A_ASH1L_chr1_155365250_-_FAM78B_chr1_166040000_length(transcript)=7326nt_BP=6582nt
GGAGTGGAAGGTTGAGGGGGGCGCTAGGCGCCCTTCGCTCCCTCCCTCTGGAGGAGCTGCCGCCGCCACCGCCGCCACTCTGCTGCTGCC
GCCGCCGCCGCCGCCGCTCCCGCCGCCATTTTGGGTTCGCTTTGCGGAGGGGAGACGATCCCAGTCTCGGTTGCGGGACCCGCCTCCCCT
CAGTTTGCCCCCTTTAGCCTTCCACCTTTCCCTTCTCCTCTCTCGCATTTCCGCCAGTCAGCTTACCCGCTGGCCGCCTCCTGACAAGCG
GGAGGGATCCGCCGTGGACCCAGGGAAGCGGAGGAGCCTGGCGGCCACCCCCTCTTCCCCACTTCCCTGCACTCTCATCGCTCTCGGCCT
CGGCCTCGGCCTCCGACACGAGAAAGATGCTGGTTTCGAGTTTTGGAGATCCTTGTTTTTTATGGAACACAGTTCTGTAAAATTTTCATA
AGATTCCTTGGCAATAACATACGCTTGTGATGGACCCTAGAAATACTGCTATGTTAGGATTGGGTTCTGATTCCGAAGGTTTTTCAAGAA
AGAGTCCTTCTGCCATCAGTACTGGCACATTGGTCAGTAAGAGAGAAGTAGAGCTAGAAAAAAACACAAAGGAGGAAGAGGACCTTCGCA
AACGGAATCGAGAAAGAAACATCGAAGCTGGGAAAGATGATGGTTTGACTGATGCACAGCAACAGTTTTCAGTGAAAGAAACAAACTTTT
CAGAGGGAAATTTAAAATTGAAAATTGGCCTCCAGGCTAAGAGAACTAAAAAACCTCCAAAGAACTTGGAGAACTATGTATGTCGACCTG
CCATAAAAACAACTATTAAGCACCCAAGGAAAGCACTTAAAAGTGGAAAGATGACGGATGAAAAGAATGAACACTGTCCTTCAAAACGAG
ACCCTTCAAAGTTGTACAAGAAAGCAGATGATGTTGCAGCCATTGAATGCCAGTCTGAAGAAGTCATCCGTCTTCATTCACAGGGAGAAA
ACAATCCTTTGTCTAAGAAGCTGTCTCCAGTACACTCAGAAATGGCAGATTATATTAATGCAACGCCATCTACTCTTCTTGGTAGCCGGG
ATCCTGATTTAAAGGACAGAGCATTACTTAATGGAGGAACTAGTGTAACAGAAAAGTTGGCACAGCTGATTGCTACCTGTCCTCCTTCCA
AGTCTTCCAAGACAAAACCGAAGAAGTTAGGAACTGGCACTACAGCAGGATTGGTTAGCAAGGATTTGATCAGGAAAGCAGGTGTTGGCT
CTGTAGCTGGAATAATACATAAGGACTTAATAAAAAAGCCAACCATCAGCACAGCAGTTGGATTGGTAACTAAAGATCCTGGGAAAAAGC
CAGTGTTTAATGCAGCAGTAGGATTGGTCAATAAGGACTCTGTGAAAAAACTGGGAACTGGCACTACAGCGGTATTCATTAATAAAAACT
TAGGCAAAAAGCCAGGAACTATCACTACAGTAGGACTGCTAAGCAAAGATTCAGGAAAGAAGCTAGGAATTGGTATTGTTCCAGGTTTAG
TGCATAAAGAGTCTGGCAAGAAGTTAGGACTTGGCACTGTGGTTGGACTGGTTAATAAAGATTTGGGAAAGAAATTGGGTTCTACTGTTG
GCCTAGTGGCCAAGGACTGTGCAAAGAAGATTGTAGCAAGTTCAGCAATGGGATTGGTTAATAAGGACATTGGAAAGAAACTAATGAGTT
GTCCTTTGGCAGGTCTGATCAGTAAAGATGCCATAAACCTTAAAGCCGAAGCACTGCTCCCCACTCAGGAACCGCTTAAGGCTTCTTGTA
GTACAAACATCAATAATCAGGAAAGTCAGGAACTTTCTGAATCCCTGAAAGATAGTGCCACCAGCAAAACTTTTGAAAAGAATGTTGTAC
GGCAGAATAAAGAAAGCATATTGGAAAAGTTCTCAGTACGAAAAGAAATCATTAATTTGGAGAAAGAAATGTTTAATGAAGGAACATGCA
TTCAGCAAGACAGTTTCTCATCCAGTGAAAAGGGATCTTATGAAACCTCAAAGCATGAAAAGCAGCCTCCTGTATATTGCACTTCTCCGG
ACTTTAAAATGGGAGGTGCTTCTGATGTATCTACCGCTAAATCCCCATTCAGTGCAGTAGGAGAAAGCAATCTCCCTTCCCCATCACCTA
CTGTATCTGTTAATCCTTTAACCAGAAGTCCCCCTGAAACTTCTTCACAGTTGGCTCCTAATCCATTACTTTTAAGTTCTACTACAGAAC
TAATCGAAGAAATTTCTGAATCTGTTGGAAAGAACCAGTTTACTTCTGAAAGTACCCACTTGAACGTTGGTCATAGGTCAGTTGGTCATA
GTATAAGTATTGAATGTAAAGGGATTGATAAAGAGGTAAATGATTCAAAAACTACCCATATAGATATTCCAAGAATAAGCTCTTCCCTTG
GAAAAAAGCCAAGTTTGACTTCTGAATCCAGCATTCATACTATTACTCCTTCAGTTGTTAACTTCACTAGTTTATTTAGTAATAAGCCTT
TTTTAAAACTGGGTGCAGTATCTGCATCAGACAAACACTGCCAAGTTGCTGAAAGCCTAAGTACTAGTTTGCAGTCCAAACCATTAAAAA
AAAGAAAAGGAAGAAAACCTCGGTGGACTAAAGTGGTGGCAAGAAGCACATGCCGGTCTCCAAAAGGGCTAGAATTAGAAAGATCAGAGC
TTTTTAAAAACGTTTCATGTAGCTCACTATCAAATAGTAATTCTGAGCCAGCCAAGTTTATGAAAAACATTGGACCCCCTTCATTTGTAG
ATCATGACTTCCTTAAACGCCGATTGCCAAAGTTGAGCAAATCCACAGCTCCATCTCTTGCTCTCTTAGCTGATAGTGAAAAACCATCTC
ATAAGTCTTTTGCTACTCACAAACTATCCTCCAGTATGTGTGTCTCTAGTGACCTTTTGTCTGATATTTATAAGCCCAAAAGAGGAAGGC
CTAAATCTAAGGAGATGCCTCAACTGGAAGGGCCACCTAAAAGGACTTTAAAAATCCCTGCTTCTAAAGTGTTTTCTTTACAGTCTAAGG
AAGAACAAGAACCCCCAATTTTACAGCCAGAAATTGAAATCCCTTCCTTCAAACAAGGTCTGTCTGTGTCTCCTTTTCCAAAAAAGAGAG
GCAGGCCTAAGAGGCAAATGAGGTCACCAGTCAAGATGAAGCCACCTGTACTGTCAGTGGCTCCATTTGTTGCCACTGAAAGTCCAAGCA
AGCTAGAATCTGAAAGTGACAACCATAGAAGTAGCAGTGATTTCTTTGAGAGCGAGGATCAACTTCAGGATCCAGATGACCTAGATGACA
GTCATAGGCCAAGTGTCTGTAGTATGAGTGACCTTGAGATGGAACCAGATAAAAAAATTACCAAGAGAAACAATGGACAATTAATGAAAA
CAATTATCCGCAAAATAAATAAAATGAAGACTTTAAAGAGAAAGAAACTGTTGAATCAGATTCTTTCAAGTTCTGTAGAATCAAGTAATA
AAGGGAAAGTGCAATCCAAACTCCATAATACGGTATCAAGTCTTGCTGCCACATTTGGCTCTAAATTGGGCCAACAGATAAATGTCAGCA
AGAAAGGAACCATTTATATAGGAAAGAGAAGAGGTCGCAAACCAAAAACTGTCTTAAATGGTATTCTTTCTGGTAGTCCTACTAGCCTTG
CTGTTCTTGAGCAAACAGCTCAACAGGCAGCTGGGTCAGCATTAGGACAGATTCTTCCCCCATTACTGCCTTCATCTGCTAGTAGTTCTG
AGATTCTTCCATCACCTATTTGCTCTCAGTCTTCTGGGACTAGTGGAGGTCAGAGCCCTGTAAGTAGTGATGCAGGTTTTGTTGAACCCA
GTTCAGTGCCATATTTGCATTTACACTCCAGACAGGGCAGTATGATTCAGACTCTTGCAATGAAGAAGGCCTCAAAGGGGAGGAGGCGGT
TATCTCCTCCTACTTTGTTGCCAAATTCTCCTTCGCACTTGAGTGAACTCACATCTCTAAAAGAAGCTACTCCTTCCCCAATCAGTGAGT
CTCATAGTGATGAGACCATTCCCAGTGATAGTGGAATTGGAACAGATAATAACAGCACATCAGACAGGGCAGAGAAATTTTGTGGGCAAA
AAAAGAGGAGGCATTCTTTTGAGCATGTTTCTCTGATTCCCCCTGAAACCTCTACAGTGCTAAGCAGTCTTAAAGAAAAACATAAACACA
AATGTAAGCGCAGGAATCATGATTACCTCAGCTATGACAAGATGAAAAGGCAGAAACGAAAACGGAAAAAGAAATATCCCCAGCTTCGAA
ATAGACAGGATCCAGACTTTATTGCAGAGCTGGAGGAACTAATAAGTCGCCTAAGTGAAATTCGGATCACTCATCGAAGTCATCATTTTA
TCCCCCGAGATCTTCTGCCAACTATCTTTCGAATCAACTTTAATAGTTTCTATACACATCCTTCTTTCCCCTTAGACCCTTTGCACTACA
TTCGAAAACCTGACTTAAAAAAGAAAAGAGGGAGACCCCCTAAGATGAGGGAGGCAATGGCTGAAATGCCTTTTATGCACAGCCTTAGTT
TTCCTCTTTCTAGTACTGGATTCTATCCATCTTATGGTATGCCTTACTCTCCTTCACCCCTTACAGCTGCTCCCATAGGATTAGGTTACT
ATGGAAGGTATCCTCCCACTCTTTATCCACCTCCTCCATCTCCTTCTTTCACCACGCCACTTCCACCTCCTTCCTATATGCATGCTGGTC
ATTTACTTCTCAATCCTGCCAAATACCATAAGAAAAAGCATAAGCTACTTCGACAGGAGGCCTTTCTTACAACCAGCAGGACTCCCCTCC
TTTCCATGAGTACCTACCCCAGTGTTCCTCCTGAGATGGCCTATGGTTGGATGGTTGAGCACAAACACAGGCACCGTCACAAACACAGAG
AACACCGTTCTTCTGAACAACCCCAGGTTTCTATGGACACTGGCTCTTCCCGATCTGTCCTGGAATCTTTGAAGCGCTATAGATTTGGAA
AGGATGCTGTTGGAGAGCGATATAAGCATAAGGAAAAGCACCGTTGTCACATGTCCTGCCCTCATCTCTCTCCTTCAAAAAGCTTAATAA
ACAGAGAGGAACAGTGGGTCCACCGAGAGCCTTCAGAATCTAGTCCATTGGCCTTGGGATTGCAGACACCTTTACAGATTGACTGTTCAG
AAAGTTCTCCAAGCTTATCCCTTGGAGGATTCACTCCCAACTCTGAGCCAGCCAGCAGTGATGAACATACAAACCTTTTCACAAGTGCAA
TAGGCAGCTGCAGAGTTTCAAACCCTAACTCCAGTGGCCGGAAGAAATTAACTGACAGCCCTGGACTCTTTTCTGCACAGGACACTTCAC
TAAATCGGCTTCACAGAAAGGAGTCACTGCCTTCTAACGAAAGGGCAGTACAGACTTTGGCAGGCTCCCAGCCAACCTCTGATAAACCCT
CCCAGCGGCCATCAGAGAGCACAAATTGTAGCCCTACCCGGAAAAGGTCTTCATCTGAGAGTACTTCTTCAACAGTAAACGGAGTTCCCT
CTCGAAGTCCAAGATTAGTTGCTTCTGGGGATGACTCTGTGGATAGTCTGCTGCAGCGGATGGTACAAAATGAGGACCAAGAGCCCATGG
AGAAAAGTATTGATGCTGTGATTGCAACTGCCTCTGCACCACCTTCTTCCAGTCCAGGCCGTAGCCACAGCAAGGACCGAACCCTGGGAA
AACCAGACAGCCTTTTAGTGCCTGCAGTCACAAGTGACTCTTGCAATAATAGCATCTCACTCCTATCTGAAAAGTTGACAAGCAGCTGTT
CCCCCCATCATATCAAGAGAAGTGTAGTGGAAGCTATGCAACGCCAAGCTCGGAAAATGTGCAATTACGACAAAATCTTGGCCACAAAGA
AAAACCTAGACCATGTCAATAAAATCTTAAAAGCCAAAAAACTTCAAAGGCAGGCCAGGACAGGGAATAACTTTGTGAAACGTAGGCCAG
GTCGACCTCGGAAATGTCCCCTTCAGGCTGTCGTATCAATGCAAGCATTCCAGGCTGCTCAGTTTGTCAACCCAGAATTGAACAGAGACG
AGGAAGGAGCAGCACTGCACCTCAGTCCTGACACAGTTACAGATGTAATTGAGGCTGTTGTTCAGAGTGTAAATCTGAACCCAGAACATA
AAAAGGGGTTGAAGAGAAAAGGTTGGCTATTGGAAGAACAGACCAGAAAAAAGCAGAAGCCATTACCAGAGGAAGAAGAGCAAGAGAATA
ATAAAAGCTTTAATGAAGCACCAGTTGAGATTCCCAGTCCTTCTGAAACCCCAGCTAAACCTTCTGAACCTGAAAGTACCTTGCAGCCTG
TGCTTTCTCTCATCCCAAGGGAAAAGAAGCCCCCACGTCCCCCAAAGAAGAAGTATCAGAAAGCAGGGCTGTATTCTGACGTTTACAAAA
CTACAGACCCAAAGAGTCGATTGATCCAATTAAAGAAAGAGAAGCTGGAGTATACTCCAGGAGAGCATGAATATGGATTATTTCCAGCGC
CCATTCATGTTGGTCAAGCTGGGAACTGCCTGACTTGAGGGAAGGGAGAGTAAAAGCCATCAGTGACTCAGATGGGGTGAGCTACCCTTG
GTACGGGAACACCACAGAAACTGTGACCCTGGTTGGCCCCACCAACAAGATCTCCAGGTTCTCCGTCAGCATGAATGACAACTTCTACCC
CAGTGTGACATGGGCAGTGCCTGTGAGTGACAGCAATGTGCCACTGCTCACAAGAATCAAGAGAGACCAAAGTTTCACGACCTGGCTGGT
GGCCATGAACACCACCACAAAGGAGAAGATCATTCTGCAGACCATCAAGTGGAGGATGAGGGTGGACATTGAAGTGGACCCTCTTCAGCT
CTTGGGGCAGCGGGCCCGGCTGGTGGGCAGGACTCAGCAGGAGCAGCCCCGGATCCTGAGCCGGATGGAACCCATCCCCCCTAATGCACT
AGTGAAACCCAATGCCAATGATGCCCAGGTCCTCATGTGGAGGCCCAAGCGGGGGCCACCTCTGGTTGTGATCCCTCCTAAGTAGAAGCA
GACTGGCCTGACTGTGTGTGGATCACACGCCTCTGAGACATGCAGTGAGGGTGCCAGGGGTGGCAGGAGCCAAACAGAGTTTCTGAGCCA
AAGCAGACCTCTCGGTTTGCCAGCCTTTGCAGCCACTTTTGAAGAGTAGGGCTGCTCCTTGGGTGGTAGAACCATAATCCTTAGGAAAAA

>In-frame_ENST00000392403_ENST00000338353_TCGA-AR-A0TT-01A_ASH1L_chr1_155365250_-_FAM78B_chr1_166040000_length(amino acids)=2042AA_start in transcript=479_stop in transcript=6607
MDPRNTAMLGLGSDSEGFSRKSPSAISTGTLVSKREVELEKNTKEEEDLRKRNRERNIEAGKDDGLTDAQQQFSVKETNFSEGNLKLKIG
LQAKRTKKPPKNLENYVCRPAIKTTIKHPRKALKSGKMTDEKNEHCPSKRDPSKLYKKADDVAAIECQSEEVIRLHSQGENNPLSKKLSP
VHSEMADYINATPSTLLGSRDPDLKDRALLNGGTSVTEKLAQLIATCPPSKSSKTKPKKLGTGTTAGLVSKDLIRKAGVGSVAGIIHKDL
IKKPTISTAVGLVTKDPGKKPVFNAAVGLVNKDSVKKLGTGTTAVFINKNLGKKPGTITTVGLLSKDSGKKLGIGIVPGLVHKESGKKLG
LGTVVGLVNKDLGKKLGSTVGLVAKDCAKKIVASSAMGLVNKDIGKKLMSCPLAGLISKDAINLKAEALLPTQEPLKASCSTNINNQESQ
ELSESLKDSATSKTFEKNVVRQNKESILEKFSVRKEIINLEKEMFNEGTCIQQDSFSSSEKGSYETSKHEKQPPVYCTSPDFKMGGASDV
STAKSPFSAVGESNLPSPSPTVSVNPLTRSPPETSSQLAPNPLLLSSTTELIEEISESVGKNQFTSESTHLNVGHRSVGHSISIECKGID
KEVNDSKTTHIDIPRISSSLGKKPSLTSESSIHTITPSVVNFTSLFSNKPFLKLGAVSASDKHCQVAESLSTSLQSKPLKKRKGRKPRWT
KVVARSTCRSPKGLELERSELFKNVSCSSLSNSNSEPAKFMKNIGPPSFVDHDFLKRRLPKLSKSTAPSLALLADSEKPSHKSFATHKLS
SSMCVSSDLLSDIYKPKRGRPKSKEMPQLEGPPKRTLKIPASKVFSLQSKEEQEPPILQPEIEIPSFKQGLSVSPFPKKRGRPKRQMRSP
VKMKPPVLSVAPFVATESPSKLESESDNHRSSSDFFESEDQLQDPDDLDDSHRPSVCSMSDLEMEPDKKITKRNNGQLMKTIIRKINKMK
TLKRKKLLNQILSSSVESSNKGKVQSKLHNTVSSLAATFGSKLGQQINVSKKGTIYIGKRRGRKPKTVLNGILSGSPTSLAVLEQTAQQA
AGSALGQILPPLLPSSASSSEILPSPICSQSSGTSGGQSPVSSDAGFVEPSSVPYLHLHSRQGSMIQTLAMKKASKGRRRLSPPTLLPNS
PSHLSELTSLKEATPSPISESHSDETIPSDSGIGTDNNSTSDRAEKFCGQKKRRHSFEHVSLIPPETSTVLSSLKEKHKHKCKRRNHDYL
SYDKMKRQKRKRKKKYPQLRNRQDPDFIAELEELISRLSEIRITHRSHHFIPRDLLPTIFRINFNSFYTHPSFPLDPLHYIRKPDLKKKR
GRPPKMREAMAEMPFMHSLSFPLSSTGFYPSYGMPYSPSPLTAAPIGLGYYGRYPPTLYPPPPSPSFTTPLPPPSYMHAGHLLLNPAKYH
KKKHKLLRQEAFLTTSRTPLLSMSTYPSVPPEMAYGWMVEHKHRHRHKHREHRSSEQPQVSMDTGSSRSVLESLKRYRFGKDAVGERYKH
KEKHRCHMSCPHLSPSKSLINREEQWVHREPSESSPLALGLQTPLQIDCSESSPSLSLGGFTPNSEPASSDEHTNLFTSAIGSCRVSNPN
SSGRKKLTDSPGLFSAQDTSLNRLHRKESLPSNERAVQTLAGSQPTSDKPSQRPSESTNCSPTRKRSSSESTSSTVNGVPSRSPRLVASG
DDSVDSLLQRMVQNEDQEPMEKSIDAVIATASAPPSSSPGRSHSKDRTLGKPDSLLVPAVTSDSCNNSISLLSEKLTSSCSPHHIKRSVV
EAMQRQARKMCNYDKILATKKNLDHVNKILKAKKLQRQARTGNNFVKRRPGRPRKCPLQAVVSMQAFQAAQFVNPELNRDEEGAALHLSP
DTVTDVIEAVVQSVNLNPEHKKGLKRKGWLLEEQTRKKQKPLPEEEEQENNKSFNEAPVEIPSPSETPAKPSEPESTLQPVLSLIPREKK

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for ASH1L-FAM78B


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for ASH1L-FAM78B


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for ASH1L-FAM78B


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource
HgeneASH1LC4540478MENTAL RETARDATION, AUTOSOMAL DOMINANT 524GENOMICS_ENGLAND;UNIPROT
HgeneASH1LC3714756Intellectual Disability2GENOMICS_ENGLAND
HgeneASH1LC0023903Liver neoplasms1CTD_human
HgeneASH1LC0033578Prostatic Neoplasms1CTD_human
HgeneASH1LC0345904Malignant neoplasm of liver1CTD_human
HgeneASH1LC0376358Malignant neoplasm of prostate1CTD_human
HgeneASH1LC1535926Neurodevelopmental Disorders1CTD_human