Center for Computational Systems Medicine
Fusion gene:CHD8-IMPG1 (FusionGDB2 ID:16420)

Fusion Gene Summary for CHD8-IMPG1

Fusion gene summary
Fusion gene information
Fusion gene name: CHD8-IMPG1
Fusion gene ID: 16420

Field | Hgene | Tgene
Gene symbol | CHD8 | IMPG1
Gene ID | 57680 | 3617
Gene name | chromodomain helicase DNA binding protein 8 | interphotoreceptor matrix proteoglycan 1
Synonyms | AUTS18, HELSNF1 | GP147, IPM150, SPACR, VMD4
Cytomap | 14q11.2 | 6q14.1
Type of gene | protein-coding | protein-coding
Description | chromodomain-helicase-DNA-binding protein 8; ATP-dependent helicase CHD8; axis duplication inhibitor; duplin; helicase with SNF2 domain 1 | interphotoreceptor matrix proteoglycan 1; interphotoreceptor matrix proteoglycan of 150 kDa; sialoprotein associated with cones and rods
Modification date | 20200329 | 20200313
UniProtAcc | Q9HCK8 | Q17R60
Ensembl transcripts involved in fusion gene | ENST00000430710, ENST00000399982, ENST00000557364, ENST00000555962 | ENST00000369963, ENST00000369950
Fusion gene scores: * DoF score | 12 X 12 X 10 = 1440 | 3 X 4 X 3 = 36
# samples | 18 | 3
** MAII score | log2(18/1440*10) = -3 | log2(3/36*10) = -0.263034405833794
Classification | possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs); DoF>8 and MAII<0 | possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs); DoF>8 and MAII<0
Context (PubMed query): CHD8 [Title/Abstract] AND IMPG1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint: CHD8(21894287)-IMPG1(76744504), # samples: 3
Anticipated loss of major functional domain due to fusion event:
CHD8-IMPG1 appears to have lost the major protein functional domain of the Hgene partner, which is an essential gene, due to the frame-shifted ORF.
CHD8-IMPG1 appears to have lost the major protein functional domain of the Hgene partner, which is an epigenetic factor, due to the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
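
The two score definitions above can be checked against the values reported for CHD8 and IMPG1 with a short Python sketch. The function names are hypothetical, and reading the three DoF factors as partners, breakpoints, and cancer types (in that order) is an assumption taken from the order of the formula. The peGinPCFGs test simply restates the "DoF>8 and MAII<0" condition shown in the summary.

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency: # partners X # break points X # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index: log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def is_peginpcfg(dof: int, maii: float) -> bool:
    """Condition listed on the page for a peGinPCFG: DoF>8 and MAII<0."""
    return dof > 8 and maii < 0


# Values reported for the two partner genes of this fusion.
chd8_dof = dof_score(12, 12, 10)
chd8_maii = maii_score(18, chd8_dof)
impg1_dof = dof_score(3, 4, 3)
impg1_maii = maii_score(3, impg1_dof)

for gene, dof, maii in [("CHD8", chd8_dof, chd8_maii), ("IMPG1", impg1_dof, impg1_maii)]:
    print(gene, dof, round(maii, 6), is_peginpcfg(dof, maii))
```

Both genes satisfy the condition: each has a DoF score above 8 and a negative MAII score.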

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
Partner | Gene | GO ID | GO term | PubMed ID
Hgene | CHD8 | GO:0045893 | positive regulation of transcription, DNA-templated | 17938208
Hgene | CHD8 | GO:0090090 | negative regulation of canonical Wnt signaling pathway | 18378692, 22083958


Figure: Fusion gene breakpoints across CHD8 (5'-gene), shown as a UCSC genome browser custom track.

Figure: Fusion gene breakpoints across IMPG1 (3'-gene), shown as a UCSC genome browser custom track.

Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinates were lifted over to hg19.
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
ChimerDB4 | BRCA | TCGA-PE-A5DD-01A | CHD8 | chr14 | 21894287 | - | IMPG1 | chr6 | 76744504 | -
ChimerDB4 | BRCA | TCGA-PE-A5DD-01A | CHD8 | chr14 | 21894287 | - | IMPG1 | chr6 | 76744504 | -
ChimerDB4 | BRCA | TCGA-PE-A5DD-01A | CHD8 | chr14 | 21894287 | - | IMPG1 | chr6 | 76744504 | -



Fusion Gene ORF analysis for CHD8-IMPG1

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
Frame-shift | ENST00000430710 | ENST00000369963 | CHD8 | chr14 | 21894287 | - | IMPG1 | chr6 | 76744504 | -
Frame-shift | ENST00000430710 | ENST00000369950 | CHD8 | chr14 | 21894287 | - | IMPG1 | chr6 | 76744504 | -
In-frame | ENST00000399982 | ENST00000369963 | CHD8 | chr14 | 21894287 | - | IMPG1 | chr6 | 76744504 | -
Frame-shift | ENST00000399982 | ENST00000369950 | CHD8 | chr14 | 21894287 | - | IMPG1 | chr6 | 76744504 | -
Frame-shift | ENST00000557364 | ENST00000369963 | CHD8 | chr14 | 21894287 | - | IMPG1 | chr6 | 76744504 | -
In-frame | ENST00000557364 | ENST00000369950 | CHD8 | chr14 | 21894287 | - | IMPG1 | chr6 | 76744504 | -
intron-3CDS | ENST00000555962 | ENST00000369963 | CHD8 | chr14 | 21894287 | - | IMPG1 | chr6 | 76744504 | -
intron-3CDS | ENST00000555962 | ENST00000369950 | CHD8 | chr14 | 21894287 | - | IMPG1 | chr6 | 76744504 | -

ORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
Henst | Hgene | Hchr | Hbp | Hstrand | Tenst | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids)
ENST00000399982 | CHD8 | chr14 | 21894287 | - | ENST00000369963 | IMPG1 | chr6 | 76744504 | - | 7319 | 1781 | 4392 | 6344 | 650
ENST00000557364 | CHD8 | chr14 | 21894287 | - | ENST00000369950 | IMPG1 | chr6 | 76744504 | - | 5047 | 1980 | 2018 | 4072 | 684
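
The predicted start, predicted stop, and amino-acid length reported by ORFfinder are related by simple arithmetic, sketched below in Python. It assumes that both transcript coordinates are inclusive and that the span contains the stop codon. This convention is not stated on the page, but it matches both in-frame records. The function name is hypothetical.

```python
def aa_length_from_span(start: int, stop: int) -> int:
    """Amino-acid length of an ORF from inclusive start/stop transcript positions.

    Assumes the span includes the stop codon, so one codon is subtracted.
    """
    span_nt = stop - start + 1
    if span_nt % 3 != 0:
        raise ValueError("ORF span is not a whole number of codons")
    return span_nt // 3 - 1


# (Henst, Tenst, predicted start, predicted stop, reported amino-acid length)
orf_rows = [
    ("ENST00000399982", "ENST00000369963", 4392, 6344, 650),
    ("ENST00000557364", "ENST00000369950", 2018, 4072, 684),
]

computed = [aa_length_from_span(start, stop) for _, _, start, stop, _ in orf_rows]
for row, length in zip(orf_rows, computed):
    print(row[0], row[1], "computed:", length, "reported:", row[4])
```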

DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on a convolutional neural network, built by comparison with real Ribo-seq data. If the no-coding score is < 0.5 and the coding score is > 0.5, the in-frame fusion transcript is predicted as likely to be translated.
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score
ENST00000399982 | ENST00000369963 | CHD8 | chr14 | 21894287 | - | IMPG1 | chr6 | 76744504 | - | 0.000927795 | 0.9990722
ENST00000557364 | ENST00000369950 | CHD8 | chr14 | 21894287 | - | IMPG1 | chr6 | 76744504 | - | 0.002437837 | 0.99756217
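
The DeepORF decision rule quoted above can be applied to the two reported score pairs as follows. This minimal Python sketch only restates the thresholds given on the page and does not run DeepORF itself. The function and variable names are hypothetical.

```python
def likely_translated(no_coding_score: float, coding_score: float) -> bool:
    """Page rule: no-coding score < 0.5 and coding score > 0.5."""
    return no_coding_score < 0.5 and coding_score > 0.5


# (Henst, Tenst) -> (no-coding score, coding score), copied from the table.
deeporf_scores = {
    ("ENST00000399982", "ENST00000369963"): (0.000927795, 0.9990722),
    ("ENST00000557364", "ENST00000369950"): (0.002437837, 0.99756217),
}

calls = {pair: likely_translated(*scores) for pair, scores in deeporf_scores.items()}
for pair, call in calls.items():
    print(pair, "likely translated" if call else "not predicted as translated")
```

Both in-frame transcripts pass the rule, because each has a no-coding score far below 0.5 and a coding score far above it.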


Fusion Genomic Features for CHD8-IMPG1


FusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5 kb for each partner gene, 20 kb of sequence in total). FusionAI is a fusion gene breakpoint classifier based on a convolutional neural network that compares the fusion-positive and fusion-negative sequence contexts of ~20K fusion genes. From this, we obtain the relative potential of each position in the 20 kb genomic sequence to be used as a gene fusion breakpoint.
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint)

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci across five broad categories: virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are on the help page.

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions that overlap the top 1% feature importance score regions. More details are on the help page.


Fusion Protein Features for CHD8-IMPG1


Go to FGviewer for the breakpoints of chr14:21894287-chr6:76744504.
- FGviewer provides online visualization of the retention search of protein functional features across the DNA, RNA, protein, and pathological levels.

Main function of each fusion partner protein (from UniProt)

Hgene: CHD8 (Q9HCK8)
FUNCTION: DNA helicase that acts as a chromatin remodeling factor and regulates transcription. Acts as a transcription repressor by remodeling chromatin structure and recruiting histone H1 to target genes. Suppresses p53/TP53-mediated apoptosis by recruiting histone H1 and preventing p53/TP53 transactivation activity. Acts as a negative regulator of Wnt signaling pathway by regulating beta-catenin (CTNNB1) activity. Negatively regulates CTNNB1-targeted gene expression by being recruited specifically to the promoter regions of several CTNNB1 responsive genes. Involved in both enhancer blocking and epigenetic remodeling at chromatin boundary via its interaction with CTCF. Acts as a suppressor of STAT3 activity by suppressing the LIF-induced STAT3 transcriptional activity. Also acts as a transcription activator via its interaction with ZNF143 by participating in efficient U6 RNA polymerase III transcription. {ECO:0000255|HAMAP-Rule:MF_03071, ECO:0000269|PubMed:17938208, ECO:0000269|PubMed:18378692}.

Tgene: IMPG1 (Q17R60)
FUNCTION: Chondroitin sulfate-, heparin- and hyaluronan-binding protein (By similarity). May serve to form a basic macromolecular scaffold comprising the insoluble interphotoreceptor matrix (PubMed:9813076). {ECO:0000250|UniProtKB:Q8JIR8, ECO:0000269|PubMed:9813076}.

Retention analysis result of each fusion partner protein across 39 protein features of UniProt: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 regional features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the breakpoint is located before the CDS.
- In-frame and retained protein features among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000399982 | - | 4 | 37 | 292_410 | 572.0 | 2582.0 | Compositional bias | Gln-rich
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000557364 | - | 5 | 38 | 292_410 | 572.0 | 2582.0 | Compositional bias | Gln-rich
Tgene | IMPG1 | chr14:21894287 | chr6:76744504 | ENST00000369950 | | 1 | 17 | 232_354 | 100.33333333333333 | 798.0 | Domain | SEA 1
Tgene | IMPG1 | chr14:21894287 | chr6:76744504 | ENST00000369950 | | 1 | 17 | 571_684 | 100.33333333333333 | 798.0 | Domain | SEA 2
Tgene | IMPG1 | chr14:21894287 | chr6:76744504 | ENST00000369950 | | 1 | 17 | 621_629 | 100.33333333333333 | 798.0 | Motif | Heparin- and hyaluronan-binding

- In-frame and not-retained protein features among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000399982 | - | 4 | 37 | 2069_2098 | 572.0 | 2582.0 | Compositional bias | Ser-rich
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000399982 | - | 4 | 37 | 2493_2508 | 572.0 | 2582.0 | Compositional bias | His-rich
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000399982 | - | 4 | 37 | 2539_2581 | 572.0 | 2582.0 | Compositional bias | Asp-rich
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000430710 | - | 5 | 38 | 2069_2098 | 293.0 | 2303.0 | Compositional bias | Ser-rich
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000430710 | - | 5 | 38 | 2493_2508 | 293.0 | 2303.0 | Compositional bias | His-rich
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000430710 | - | 5 | 38 | 2539_2581 | 293.0 | 2303.0 | Compositional bias | Asp-rich
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000430710 | - | 5 | 38 | 292_410 | 293.0 | 2303.0 | Compositional bias | Gln-rich
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000557364 | - | 5 | 38 | 2069_2098 | 572.0 | 2582.0 | Compositional bias | Ser-rich
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000557364 | - | 5 | 38 | 2493_2508 | 572.0 | 2582.0 | Compositional bias | His-rich
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000557364 | - | 5 | 38 | 2539_2581 | 572.0 | 2582.0 | Compositional bias | Asp-rich
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000399982 | - | 4 | 37 | 1137_1288 | 572.0 | 2582.0 | Domain | Helicase C-terminal
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000399982 | - | 4 | 37 | 642_709 | 572.0 | 2582.0 | Domain | Chromo 1
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000399982 | - | 4 | 37 | 724_790 | 572.0 | 2582.0 | Domain | Chromo 2
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000399982 | - | 4 | 37 | 823_997 | 572.0 | 2582.0 | Domain | Helicase ATP-binding
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000430710 | - | 5 | 38 | 1137_1288 | 293.0 | 2303.0 | Domain | Helicase C-terminal
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000430710 | - | 5 | 38 | 642_709 | 293.0 | 2303.0 | Domain | Chromo 1
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000430710 | - | 5 | 38 | 724_790 | 293.0 | 2303.0 | Domain | Chromo 2
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000430710 | - | 5 | 38 | 823_997 | 293.0 | 2303.0 | Domain | Helicase ATP-binding
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000557364 | - | 5 | 38 | 1137_1288 | 572.0 | 2582.0 | Domain | Helicase C-terminal
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000557364 | - | 5 | 38 | 642_709 | 572.0 | 2582.0 | Domain | Chromo 1
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000557364 | - | 5 | 38 | 724_790 | 572.0 | 2582.0 | Domain | Chromo 2
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000557364 | - | 5 | 38 | 823_997 | 572.0 | 2582.0 | Domain | Helicase ATP-binding
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000399982 | - | 4 | 37 | 948_951 | 572.0 | 2582.0 | Motif | DEAH box
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000430710 | - | 5 | 38 | 948_951 | 293.0 | 2303.0 | Motif | DEAH box
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000557364 | - | 5 | 38 | 948_951 | 572.0 | 2582.0 | Motif | DEAH box
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000399982 | - | 4 | 37 | 836_843 | 572.0 | 2582.0 | Nucleotide binding | ATP
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000430710 | - | 5 | 38 | 836_843 | 293.0 | 2303.0 | Nucleotide binding | ATP
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000557364 | - | 5 | 38 | 836_843 | 572.0 | 2582.0 | Nucleotide binding | ATP
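
The retained and not-retained rows above are consistent with a simple positional rule. A feature of the 5' partner (Hgene) is kept when it ends at or before BPloci, and a feature of the 3' partner (Tgene) is kept when it starts at or after BPloci. That rule is an assumption inferred from the rows shown here, not a description of the FusionGDB pipeline, and the handling of features that touch the breakpoint exactly is a guess. The function names in this Python sketch are hypothetical.

```python
from typing import Tuple


def parse_loci(loci: str) -> Tuple[int, int]:
    """Split a 'start_end' protein feature locus such as '292_410'."""
    start, end = loci.split("_")
    return int(start), int(end)


def is_retained(partner: str, loci: str, bp_loci: float) -> bool:
    """Assumed rule: Hgene keeps residues up to BPloci, Tgene keeps residues after it."""
    start, end = parse_loci(loci)
    if partner == "Hgene":
        return end <= bp_loci  # feature lies entirely before the breakpoint
    if partner == "Tgene":
        return start >= bp_loci  # feature lies entirely after the breakpoint
    raise ValueError(f"unknown partner: {partner!r}")


# Example rows copied from the two tables: (partner, loci, BPloci, feature note)
examples = [
    ("Hgene", "292_410", 572.0, "Gln-rich, ENST00000399982"),
    ("Hgene", "292_410", 293.0, "Gln-rich, ENST00000430710"),
    ("Hgene", "642_709", 572.0, "Chromo 1, ENST00000399982"),
    ("Tgene", "232_354", 100.33333333333333, "SEA 1, ENST00000369950"),
]

for partner, loci, bp_loci, note in examples:
    print(note, "retained" if is_retained(partner, loci, bp_loci) else "not retained")
```

Under this rule the Gln-rich region is retained for the transcripts with BPloci 572.0 but not for ENST00000430710 (BPloci 293.0), which matches where those rows appear in the two tables.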



Fusion Gene Sequence for CHD8-IMPG1


For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To obtain the fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among all predicted ones.
>In-frame_ENST00000399982_ENST00000369963_TCGA-PE-A5DD-01A_CHD8_chr14_21894287_-_IMPG1_chr6_76744504_length(transcript)=7319nt_BP=1781nt
GCCAGTACCCCCCCTCCCCTTCCCTCCCTAGACCTTGGAGAAGTACCCTCCATTACTTTCCCAAGATGGCAGACCCCATCATGGATCTGT
TCGATGACCCAAATTTATTTGGCCTGGACTCTCTGACTGATGACAGCTTTAACCAGGTCACACAAGACCCCATTGAGGAAGCCCTTGGAC
TGCCAAGCTCTCTGGACTCCTTGGATCAGATGAACCAGGATGGTGGAGGTGGTGATGTGGGGAATTCATCAGCAAGTGAACTGGTCCCTC
CACCAGAGGAAACAGCTCCCACAGAACTTTCCAAAGAATCCACAGCTCCAGCTCCAGAATCCATAACCTTGCATGATTATACCACTCAGC
CTGCCAGCCAGGAGCAGCCAGCCCAACCTGTCTTACAGACATCGACGCCAACATCAGGACTTTTGCAAGTCTCCAAGAGCCAGGAGATCC
TGAGCCAAGGGAATCCTTTCATGGGTGTCTCTGCCACAGCTGTCTCCTCCAGTAGTGCTGGAGGGCAGCCACCTCAGTCAGCCCCTAAGA
TTGTTATCCTTAAGGCCCCACCAAGCTCCTCAGTCACTGGTGCCCATGTGGCACAAATTCAGGCCCAAGGTATCACCAGCACAGCTCAGC
CCCTGGTGGCAGGCACAGCCAATGGTGGAAAAGTCACTTTTACCAAAGTGCTAACCGGCACACCCCTTCGACCAGGTGTTTCCATTGTCT
CTGGTAATACAGTGTTGGCCGCCAAGGTCCCTGGGAACCAGGCTGCTGTTCAGCGCATTGTCCAGCCCAGCCGACCAGTAAAGCAGCTGG
TCCTCCAGCCAGTTAAGGGTTCAGCTCCTGCTGGAAACCCTGGGGCCACAGGGCCCCCACTGAAGCCTGCAGTTACACTGACCTCTACAC
CTACCCAGGGTGAATCGAAACGCATCACCCTGGTCCTCCAGCAGCCACAGTCTGGAGGTCCCCAAGGACATCGGCATGTTGTGCTAGGGA
GTCTACCAGGCAAGATAGTGTTACAGGGCAACCAGCTGGCAGCCCTGACTCAAGCCAAGAATGCCCAAGGGCAGCCTGCCAAGGTAGTAA
CTATCCAGCTGCAGGTGCAGCAGCCACAGCAAAAAATCCAGATTGTACCACAACCACCATCATCGCAGCCACAGCCCCAGCAGCCACCCT
CCACCCAGCCAGTGACTCTGTCCTCTGTACAGCAGGCTCAGATAATGGGACCAGGACAAAGCCCAGGACAAAGACTTTCAGTACCAGTCA
AGGTGGTACTGCAGCCACAGGCTGGCTCTTCCCAAGGGGCCTCTTCTGGGCTCTCTGTAGTTAAAGTTCTGAGTGCCAGTGAAGTGGCAG
CTTTGTCATCACCAGCAAGCTCTGCTCCTCATTCGGGGGGAAAGACAGGAATGGAGGAAAACCGCAGATTGGAACACCAGAAGAAGCAAG
AGAAAGCAAATCGGATTGTAGCAGAGGCCATTGCGAGAGCCCGTGCCCGCGGTGAGCAGAACATACCTCGAGTCTTAAATGAGGACGAGT
TGCCCAGCGTTCGGCCAGAGGAGGAAGGCGAGAAGAAACGCAGGAAGAAGAGTGCTGGGGAGAGGCTGAAAGAGGAGAAGCCAAAGAAGA
GTAAAACATCTGGTGCCTCCAAAACAAAGGGCAAGAGCAAGCTCAACACCATCACTCCTGTAGTGGGTAAGAAGAGAAAACGTAATACCT
CATCTGATAATTCAGATGTGGAAGTCATGCCTGCACAGTCACCTCGAGAAGATGAAGAAAGCAGCATTCAGTGTGTCAGGAAGCAGTATG
GGAAGCATATCGGATCTTTCTGGATCGCATCCCTGACACAGGGGAATATCAGGACTGGGTCAGCATCTGCCAGCAGGAGACCTTCTGCCT
CTTTGACATTGGAAAAAACTTCAGCAATTCCCAGGAGCACCTGGATCTTCTCCAGCAGAGAATAAAACAGAGAAGTTTCCCTGACAGAAA
AGATGAAATATCTGCAGAGAAGACATTGGGAGAGCCTGGTGAAACCATTGTCATTTCAACAGAAAAGAATAAAGGTAAAACCAAACCATT
CAATATCTTACAGTTTGGAAACAACCATCATGAACATTTACTGCCTATTTTCTGCCTTCTTTCTTCTATTATATACACATATTACTGAGA
CCATATTTACTTAGAGTTTGTATCATTTTTTCCACTTAACCTGATGTAAAGGATATTTTATAGTTCACTAAAATTTCTTCAAAGTTATCA
TTTTAATGACTTGCAACATTCTGTTGCATCCATATGGATGTCATCAACTTAACCATTGCATCATTGTATCTGACATTACAAATAGTGTTT
TAATTAAGATATTTAAACATAAATCTTTGTCAAATCACTAATTTCTCAAAATAGATTACTAAGTCAAACACTATGGATTTTAAGGCTTTT
AATAAATTTTGCCAAATTTTTTTTCCAGAAATTTTGTTACAATTTAAATATCCCACTGGCAGAAAAGACAAGAACCTGTCTCATTGCAGC
ACTTATCAATATTGAATATTACTATTTATTCTAGAGTTTGCCAAAAGGCAGAGGAAAATCATTTCTCCTGTGTTTATTAGATGTTTGAAA
TTTCTCTTTCATGTATATTTCTATTGCAGTCTTCCCATTTTCACATAAAAGAATACTTTTAAATATACACAATAAAACTATTCTCTGGGC
AATGTGTTCCCCAGTTCATTTGATTTTTGGTAATGTGAATAATGTTTTAAGAAATTAATTTTTCATGACTTAAAATGTAGAAATTGAGAA
AATAGATATTTGTAAATAAATGATATTACTTTCCCATTTCTTCCTTTACTGACATTCAGAGAAAGCCGGCTGTGCCTATATTTCAATTTT
CTCCTAAGTTCAGCTAATTTATCATTTCTGGACTGACTTGTAGCTAATAATAGGACAAGCCTAGTTGAATTTTAATTATAAAATGACAGT
AAAATGCATTTTTAACAATAAAAAGAAGAGCTACATAGTTAATTCCCCCATATAAACAGCTCAAATTTGACCTGTGACACCTAATATCAA
GTCTTATTCAAAAAGGTATTATTTAGAGGAGGAGCATATAAACATGTAAAAAGACGTTTTGTACTGACTTCATTTTAATATGGGAAAACA
AAATTAATAAGAGAATTGAAATTTGTATTTCTACTTTAAACTTGAAGGTAATGTACTTTAGCTGATAGAAACTTTTGTAATAACAAGATT
AGATTTAGTAAGTTAGAGTAAGTTAATATGACCCCTTTCTTATAAATTATTTTTCATTTTTTTAAATAAAAACTTTGAGGGATCAGTGGA
CAATTCATATATTATAATATAATATGATATATCATCACCATATATTTAAGTTCTAGCAACAAATGTTTTAGAATTATTTCCCCAGTACTT
CTGGACACCTCCAGTTGAAGTATACACACAGAACAGTTAAGATTAGATTTTGATTGGGAAATGTTACTAGGTGCCTAGGTGTGAAGGAAG
AGTGAGAAATCTCCAAAAGAGGCTGAATGTGCTCGAAGACAAACCCAGGCTACCTAATAATTGTTTTTAGGTGTAAGTATGTGAATAATT
TATCTTTTCAAGGCAATAAAATGGAAGGATAGCCAAATAAAAATTTGTTCTTCTAGGGTACCAGCTAAGTACTTTTTTATTGTAACAAAA
ATTGTAGCATAAACCGATTATTCATAGAATTACAAAGTTTGAAGGTGCCTTACATTGGTAATAGTCTGATGAAGTCCAGAGCACAGAACA
CCTCCATCCCAATTGCTCACAGCAGGATTCTTCAATGAGGGCAGAAGGGGAAGATGCCAGGGACCACATCCTTTGAGGTTCTGATATGCC
ACAGGGTATGTACTTTATTAGCCAAAAAACTGCCTTGCCTAGATGATTCTGATACCCACTCTCACGCCCCCACTAGGAAGCACAATTCCA
CACCCAGGGCAATTACTTCACAGAGCCTCACAGTGACAAGTAACCACTTGTGACCGCAGATTCCCAGGTACCACATACTCTTCAGGGAGT
AAGGAAAGATTGAATCATTTTTCAGTATATCTTGTTTTCTAAAACACAAGAATAAACTAAGGCATTCTATGTTTTTGCCTAATTAAGGTT
GAAATGGAAAGTTGTTTCTGGCTAAATATAAACCAATTAACTATAACACTATTTTCTCTGAGCATTTTTTTAAATAATAATTTATTACAA
AATAGAAATGCATTTGTATTATAATAACGCTTTTTTCTTTGCTTGAATAGAACCCAGACTTGCACTCTATGAAGATCTTTAAAAATATCA
AAACATTTGCCACATATTTGAAATGGTTTATGGTGATCTGTTCTAAAAGAAGTGAAATTCATAGCATCTTAAATGTTCTTTCATCTCTTG
GATTTTCTGATTAAAATCTTATTCCTCAATGATTGCTTAAGCAATCTACATTTTAAAGACTTGGGCAGTATTCTAAGAAAACCCTCAGAA
GAGCAAATTCAAGATGTTGCCAACGTCTCACTTGGGCCTTTCCCTCTCACTCCTGATGACACCCTCCTCAATGAAATTCTCGATAATACA
CTCAACGACACCAAGATGCCTACAACAGAAAGAGAAACAGAATTCGCTGTGTTGGAGGAGCAGAGGGTGGAGCTCAGCGTCTCTCTGGTA
AACCAGAAGTTCAAGGCAGAGCTCGCTGACTCCCAGTCCCCATATTACCAGGAGCTAGCAGGAAAGTCCCAACTTCAGATGCAAAAGATA
TTTAAGAAACTTCCAGGATTCAAAAAAATCCATGTGTTAGGATTTAGACCAAAGAAAGAAAAAGATGGCTCAAGCTCCACAGAGATGCAA
CTTACGGCCATCTTTAAGAGACACAGTGCAGAAGCAAAAAGCCCTGCAAGTGACCTCCTGTCTTTTGATTCCAACAAAATTGAAAGTGAG
GAAGTCTATCATGGAACCATGGAGGAGGACAAGCAACCAGAAATCTATCTCACAGCTACAGACCTCAAAAGGCTGATCAGCAAAGCACTA
GAGGAAGAACAATCTTTGGATGTGGGGACAATTCAGTTCACTGATGAAATTGCTGGATCACTGCCAGCCTTTGGTCCTGACACCCAATCA
GAGCTGCCCACATCTTTTGCTGTTATAACAGAGGATGCTACTTTGAGTCCAGAACTTCCTCCTGTTGAACCCCAGCTTGAGACAGTGGAC
GGAGCAGAGCATGGTCTACCTGACACTTCTTGGTCTCCACCTGCTATGGCCTCTACCTCCCTGTCAGAAGCTCCACCTTTCTTTATGGCA
TCAAGCATCTTCTCTCTGACTGATCAAGGCACCACAGATACAATGGCCACTGACCAGACAATGCTAGTACCAGGGCTCACCATCCCCACC
AGTGATTATTCTGCAATCAGCCAACTGGCTCTGGGAATTTCACATCCACCTGCATCTTCAGATGACAGCCGATCAAGTGCAGGTGGCGAA
GATATGGTCAGACACCTAGATGAAATGGATCTGTCTGACACTCCTGCCCCATCTGAGGTACCAGAGCTCAGCGAATATGTTTCTGTCCCA
GATCATTTCTTGGAGGATACCACTCCTGTCTCAGCTTTACAGTATATCACCACTAGTTCTATGACCATTGCCCCCAAGGGCCGAGAGCTG
GTAGTGTTCTTCAGTCTGCGTGTTGCTAACATGGCCTTCTCCAACGACCTGTTCAACAAGAGCTCTCTGGAGTACCGAGCTCTGGAGCAA
CAATTCACACAGCTGCTGGTTCCATATCTACGATCCAATCTTACAGGATTTAAGCAACTTGAAATACTTAACTTCAGAAACGGGAGTGTG
ATTGTGAATAGCAAAATGAAGTTTGCTAAGTCAGTGCCGTATAACCTCACCAAGGCTGTGCACGGGGTCTTGGAGGATTTTCGTTCTGCT
GCAGCCCAACAACTCCATCTGGAAATAGACAGCTACTCTCTCAACATTGAACCAGCTGATCAAGCAGATCCCTGCAAGTTCCTGGCCTGC
GGCGAATTTGCCCAATGTGTAAAGAACGAACGGACTGAGGAAGCGGAGTGTCGCTGCAAACCAGGATATGACAGCCAGGGGAGCCTGGAC
GGTCTGGAACCAGGCCTCTGTGGCCCTGGCACAAAGGAATGCGAGGTCCTCCAGGGAAAGGGAGCTCCATGCAGGTTGCCAGATCACTCT
GAAAATCAAGCATACAAAACTAGTGTTAAAAAGTTCCAAAATCAACAAAATAACAAGGTAATCAGTAAAAGAAATTCTGAATTACTGACC
GTAGAATATGAAGAATTTAACCATCAAGATTGGGAAGGAAATTAAAAACTGAAAATGTACAATTATCATTTAGGCTATCTCAAGAGAGAT
GATTTGCCTTCTCAAGGAAAATGGAGACAGGCATATTCATGGGTCATCAAAATCCAGACATACAGTCAACACTGAGAATCAGCACACACC
ATATTTCAAATATAGAAGAGTCATGTACTTGGCAACCAGTAAATTCTGAAGAAAAAGACACTTACTTATTATTAAAACCCCAAATGCAAT
CAGCGAAACATATTTTTACTATTCTTGGATGATAGTCAAAATGATCATAAGCCAGGTTTGCTTCCACCTTCCCTGAAAATTTTACTCACA
GATCATTTGCAACAAGCATAGCTTACTTATTGTTTAGGGACTGAACAATTTATTGGGAAGCAAACTCTTTATATGCTAGAAAGTACATTT
AAAAGATGACTACTTACGCAGGGAGATGCAGGTCTCTCTAAACGCATGAATGTATGTAGTGTGTAGGCACTGTAGTGAGTGTATATATGC
TCCACACTACGTCTGATAAACACAAACCTCAGTATTCAGTTATTAGGCACACTAGTTTTATACGCAACTACTGCTTACATAGTAGACTGT
TTTGTTGCCAATAATCTTTGAATTGTTCTTTAAAAGAAACTGAGGTTCAGATACACATACCATGGAAAAATCTTACTTTTCTTGTTACTA
CACAAAGCTATTTTAAAGAAGATGCTATGTTGGGAGAAGGGCGAAGTTGTACTATATGACATAATCAATTCCTGTCTTCCACCACAGATG
AACAATGTCTTCCTATAATAACTTCAGAATATTTCCTCACACCAACTTTAGTGTGTGTATACCTAGACTGGCATCAATGTAATCCAATTC
AGTCCATTTTTTATGTGCTGCTTAATGAAAATGAATCTGTCTCTATCTCTTTCAAATGTTCTAATATCAGATATATTTGCTACAAACCTT

>In-frame_ENST00000399982_ENST00000369963_TCGA-PE-A5DD-01A_CHD8_chr14_21894287_-_IMPG1_chr6_76744504_length(amino acids)=650AA_start in transcript=4392_stop in transcript=6344
MFFHLLDFLIKILFLNDCLSNLHFKDLGSILRKPSEEQIQDVANVSLGPFPLTPDDTLLNEILDNTLNDTKMPTTERETEFAVLEEQRVE
LSVSLVNQKFKAELADSQSPYYQELAGKSQLQMQKIFKKLPGFKKIHVLGFRPKKEKDGSSSTEMQLTAIFKRHSAEAKSPASDLLSFDS
NKIESEEVYHGTMEEDKQPEIYLTATDLKRLISKALEEEQSLDVGTIQFTDEIAGSLPAFGPDTQSELPTSFAVITEDATLSPELPPVEP
QLETVDGAEHGLPDTSWSPPAMASTSLSEAPPFFMASSIFSLTDQGTTDTMATDQTMLVPGLTIPTSDYSAISQLALGISHPPASSDDSR
SSAGGEDMVRHLDEMDLSDTPAPSEVPELSEYVSVPDHFLEDTTPVSALQYITTSSMTIAPKGRELVVFFSLRVANMAFSNDLFNKSSLE
YRALEQQFTQLLVPYLRSNLTGFKQLEILNFRNGSVIVNSKMKFAKSVPYNLTKAVHGVLEDFRSAAAQQLHLEIDSYSLNIEPADQADP
CKFLACGEFAQCVKNERTEEAECRCKPGYDSQGSLDGLEPGLCGPGTKECEVLQGKGAPCRLPDHSENQAYKTSVKKFQNQQNNKVISKR

--------------------------------------------------------------
>In-frame_ENST00000557364_ENST00000369950_TCGA-PE-A5DD-01A_CHD8_chr14_21894287_-_IMPG1_chr6_76744504_length(transcript)=5047nt_BP=1980nt
AGGGTGAGTTGAAACGCTGCCTGGAAAGGAAGTACCAGGACTTGCACAGGAGTTCCACCATCTTCCTCTGAAGACGTAGCCATCTTGCTC
CATGAAGGTCAGGACAATCTGACATGCACTTGTTTCTTGCCTCTATACTTGAAGAGTAGTCCTCTTACATTGTGTACATTTTTTTCTTAA
TAGGGGAGGGGAGGGGAGAGCCAGTACCCCCCCTCCCCTTCCCTCCCTAGACCTTGGAGAAGTACCCTCCATTACTTTCCCAAGATGGCA
GACCCCATCATGGATCTGTTCGATGACCCAAATTTATTTGGCCTGGACTCTCTGACTGATGACAGCTTTAACCAGGTCACACAAGACCCC
ATTGAGGAAGCCCTTGGACTGCCAAGCTCTCTGGACTCCTTGGATCAGATGAACCAGGATGGTGGAGGTGGTGATGTGGGGAATTCATCA
GCAAGTGAACTGGTCCCTCCACCAGAGGAAACAGCTCCCACAGAACTTTCCAAAGAATCCACAGCTCCAGCTCCAGAATCCATAACCTTG
CATGATTATACCACTCAGCCTGCCAGCCAGGAGCAGCCAGCCCAACCTGTCTTACAGACATCGACGCCAACATCAGGACTTTTGCAAGTC
TCCAAGAGCCAGGAGATCCTGAGCCAAGGGAATCCTTTCATGGGTGTCTCTGCCACAGCTGTCTCCTCCAGTAGTGCTGGAGGGCAGCCA
CCTCAGTCAGCCCCTAAGATTGTTATCCTTAAGGCCCCACCAAGCTCCTCAGTCACTGGTGCCCATGTGGCACAAATTCAGGCCCAAGGT
ATCACCAGCACAGCTCAGCCCCTGGTGGCAGGCACAGCCAATGGTGGAAAAGTCACTTTTACCAAAGTGCTAACCGGCACACCCCTTCGA
CCAGGTGTTTCCATTGTCTCTGGTAATACAGTGTTGGCCGCCAAGGTCCCTGGGAACCAGGCTGCTGTTCAGCGCATTGTCCAGCCCAGC
CGACCAGTAAAGCAGCTGGTCCTCCAGCCAGTTAAGGGTTCAGCTCCTGCTGGAAACCCTGGGGCCACAGGGCCCCCACTGAAGCCTGCA
GTTACACTGACCTCTACACCTACCCAGGGTGAATCGAAACGCATCACCCTGGTCCTCCAGCAGCCACAGTCTGGAGGTCCCCAAGGACAT
CGGCATGTTGTGCTAGGGAGTCTACCAGGCAAGATAGTGTTACAGGGCAACCAGCTGGCAGCCCTGACTCAAGCCAAGAATGCCCAAGGG
CAGCCTGCCAAGGTAGTAACTATCCAGCTGCAGGTGCAGCAGCCACAGCAAAAAATCCAGATTGTACCACAACCACCATCATCGCAGCCA
CAGCCCCAGCAGCCACCCTCCACCCAGCCAGTGACTCTGTCCTCTGTACAGCAGGCTCAGATAATGGGACCAGGACAAAGCCCAGGACAA
AGACTTTCAGTACCAGTCAAGGTGGTACTGCAGCCACAGGCTGGCTCTTCCCAAGGGGCCTCTTCTGGGCTCTCTGTAGTTAAAGTTCTG
AGTGCCAGTGAAGTGGCAGCTTTGTCATCACCAGCAAGCTCTGCTCCTCATTCGGGGGGAAAGACAGGAATGGAGGAAAACCGCAGATTG
GAACACCAGAAGAAGCAAGAGAAAGCAAATCGGATTGTAGCAGAGGCCATTGCGAGAGCCCGTGCCCGCGGTGAGCAGAACATACCTCGA
GTCTTAAATGAGGACGAGTTGCCCAGCGTTCGGCCAGAGGAGGAAGGCGAGAAGAAACGCAGGAAGAAGAGTGCTGGGGAGAGGCTGAAA
GAGGAGAAGCCAAAGAAGAGTAAAACATCTGGTGCCTCCAAAACAAAGGGCAAGAGCAAGCTCAACACCATCACTCCTGTAGTGGGTAAG
AAGAGAAAACGTAATACCTCATCTGATAATTCAGATGTGGAAGTCATGCCTGCACAGTCACCTCGAGAAGATGAAGAAAGCAGCATTCAG
TGTGTCAGGAAGCAGTATGGGAAGCATATCGGATCTTTCTGGATCGCATCCCTGACACAGGGGAATATCAGGACTGGGTCAGCATCTGCC
AGCAGGAGACCTTCTGCCTCTTTGACATTGGAAAAAACTTCAGCAATTCCCAGGAGCACCTGGATCTTCTCCAGCAGAGAATAAAACAGA
GAAGTTTCCCTGACAGAAAAGATGAAATATCTGCAGAGAAGACATTGGGAGAGCCTGGTGAAACCATTGTCATTTCAACAGATGTTGCCA
ACGTCTCACTTGGGCCTTTCCCTCTCACTCCTGATGACACCCTCCTCAATGAAATTCTCGATAATACACTCAACGACACCAAGATGCCTA
CAACAGAAAGAGAAACAGAATTCGCTGTGTTGGAGGAGCAGAGGGTGGAGCTCAGCGTCTCTCTGGTAAACCAGAAGTTCAAGGCAGAGC
TCGCTGACTCCCAGTCCCCATATTACCAGGAGCTAGCAGGAAAGTCCCAACTTCAGATGCAAAAGATATTTAAGAAACTTCCAGGATTCA
AAAAAATCCATGTGTTAGGATTTAGACCAAAGAAAGAAAAAGATGGCTCAAGCTCCACAGAGATGCAACTTACGGCCATCTTTAAGAGAC
ACAGTGCAGAAGCAAAAAGCCCTGCAAGTGACCTCCTGTCTTTTGATTCCAACAAAATTGAAAGTGAGGAAGTCTATCATGGAACCATGG
AGGAGGACAAGCAACCAGAAATCTATCTCACAGCTACAGACCTCAAAAGGCTGATCAGCAAAGCACTAGAGGAAGAACAATCTTTGGATG
TGGGGACAATTCAGTTCACTGATGAAATTGCTGGATCACTGCCAGCCTTTGGTCCTGACACCCAATCAGAGCTGCCCACATCTTTTGCTG
TTATAACAGAGGATGCTACTTTGAGTCCAGAACTTCCTCCTGTTGAACCCCAGCTTGAGACAGTGGACGGAGCAGAGCATGGTCTACCTG
ACACTTCTTGGTCTCCACCTGCTATGGCCTCTACCTCCCTGTCAGAAGCTCCACCTTTCTTTATGGCATCAAGCATCTTCTCTCTGACTG
ATCAAGGCACCACAGATACAATGGCCACTGACCAGACAATGCTAGTACCAGGGCTCACCATCCCCACCAGTGATTATTCTGCAATCAGCC
AACTGGCTCTGGGAATTTCACATCCACCTGCATCTTCAGATGACAGCCGATCAAGTGCAGGTGGCGAAGATATGGTCAGACACCTAGATG
AAATGGATCTGTCTGACACTCCTGCCCCATCTGAGGTACCAGAGCTCAGCGAATATGTTTCTGTCCCAGATCATTTCTTGGAGGATACCA
CTCCTGTCTCAGCTTTACAGTATATCACCACTAGTTCTATGACCATTGCCCCCAAGGGCCGAGAGCTGGTAGTGTTCTTCAGTCTGCGTG
TTGCTAACATGGCCTTCTCCAACGACCTGTTCAACAAGAGCTCTCTGGAGTACCGAGCTCTGGAGCAACAATTCACACAGCTGCTGGTTC
CATATCTACGATCCAATCTTACAGGATTTAAGCAACTTGAAATACTTAACTTCAGAAACGGGAGTGTGATTGTGAATAGCAAAATGAAGT
TTGCTAAGTCAGTGCCGTATAACCTCACCAAGGCTGTGCACGGGGTCTTGGAGGATTTTCGTTCTGCTGCAGCCCAACAACTCCATCTGG
AAATAGACAGCTACTCTCTCAACATTGAACCAGCTGATCAAGCAGATCCCTGCAAGTTCCTGGCCTGCGGCGAATTTGCCCAATGTGTAA
AGAACGAACGGACTGAGGAAGCGGAGTGTCGCTGCAAACCAGGATATGACAGCCAGGGGAGCCTGGACGGTCTGGAACCAGGCCTCTGTG
GCCCTGGCACAAAGGAATGCGAGGTCCTCCAGGGAAAGGGAGCTCCATGCAGGTTGCCAGATCACTCTGAAAATCAAGCATACAAAACTA
GTGTTAAAAAGTTCCAAAATCAACAAAATAACAAGGTAATCAGTAAAAGAAATTCTGAATTACTGACCGTAGAATATGAAGAATTTAACC
ATCAAGATTGGGAAGGAAATTAAAAACTGAAAATGTACAATTATCATTTAGGCTATCTCAAGAGAGATGATTTGCCTTCTCAAGGAAAAT
GGAGACAGGCATATTCATGGGTCATCAAAATCCAGACATACAGTCAACACTGAGAATCAGCACACACCATATTTCAAATATAGAAGAGTC
ATGTACTTGGCAACCAGTAAATTCTGAAGAAAAAGACACTTACTTATTATTAAAACCCCAAATGCAATCAGCGAAACATATTTTTACTAT
TCTTGGATGATAGTCAAAATGATCATAAGCCAGGTTTGCTTCCACCTTCCCTGAAAATTTTACTCACAGATCATTTGCAACAAGCATAGC
TTACTTATTGTTTAGGGACTGAACAATTTATTGGGAAGCAAACTCTTTATATGCTAGAAAGTACATTTAAAAGATGACTACTTACGCAGG
GAGATGCAGGTCTCTCTAAACGCATGAATGTATGTAGTGTGTAGGCACTGTAGTGAGTGTATATATGCTCCACACTACGTCTGATAAACA
CAAACCTCAGTATTCAGTTATTAGGCACACTAGTTTTATACGCAACTACTGCTTACATAGTAGACTGTTTTGTTGCCAATAATCTTTGAA
TTGTTCTTTAAAAGAAACTGAGGTTCAGATACACATACCATGGAAAAATCTTACTTTTCTTGTTACTACACAAAGCTATTTTAAAGAAGA
TGCTATGTTGGGAGAAGGGCGAAGTTGTACTATATGACATAATCAATTCCTGTCTTCCACCACAGATGAACAATGTCTTCCTATAATAAC
TTCAGAATATTTCCTCACACCAACTTTAGTGTGTGTATACCTAGACTGGCATCAATGTAATCCAATTCAGTCCATTTTTTATGTGCTGCT
TAATGAAAATGAATCTGTCTCTATCTCTTTCAAATGTTCTAATATCAGATATATTTGCTACAAACCTTAATTTCAATAAAGTAAAAGTGA

>In-frame_ENST00000557364_ENST00000369950_TCGA-PE-A5DD-01A_CHD8_chr14_21894287_-_IMPG1_chr6_76744504_length(amino acids)=684AA_start in transcript=2018_stop in transcript=4072
MDRIPDTGEYQDWVSICQQETFCLFDIGKNFSNSQEHLDLLQQRIKQRSFPDRKDEISAEKTLGEPGETIVISTDVANVSLGPFPLTPDD
TLLNEILDNTLNDTKMPTTERETEFAVLEEQRVELSVSLVNQKFKAELADSQSPYYQELAGKSQLQMQKIFKKLPGFKKIHVLGFRPKKE
KDGSSSTEMQLTAIFKRHSAEAKSPASDLLSFDSNKIESEEVYHGTMEEDKQPEIYLTATDLKRLISKALEEEQSLDVGTIQFTDEIAGS
LPAFGPDTQSELPTSFAVITEDATLSPELPPVEPQLETVDGAEHGLPDTSWSPPAMASTSLSEAPPFFMASSIFSLTDQGTTDTMATDQT
MLVPGLTIPTSDYSAISQLALGISHPPASSDDSRSSAGGEDMVRHLDEMDLSDTPAPSEVPELSEYVSVPDHFLEDTTPVSALQYITTSS
MTIAPKGRELVVFFSLRVANMAFSNDLFNKSSLEYRALEQQFTQLLVPYLRSNLTGFKQLEILNFRNGSVIVNSKMKFAKSVPYNLTKAV
HGVLEDFRSAAAQQLHLEIDSYSLNIEPADQADPCKFLACGEFAQCVKNERTEEAECRCKPGYDSQGSLDGLEPGLCGPGTKECEVLQGK

--------------------------------------------------------------


Fusion Gene PPI Analysis for CHD8-IMPG1


Go to the ChiPPI (Chimeric Protein-Protein Interactions) page to see the chimeric PPIs.


Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene | Hgene's interactors | Tgene | Tgene's interactors


- Retained PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with


- Lost PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000399982 | - | 4 | 37 | 1789_2302 | 572.0 | 2582.0 | FAM124B
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000430710 | - | 5 | 38 | 1789_2302 | 293.0 | 2303.0 | FAM124B
Hgene | CHD8 | chr14:21894287 | chr6:76744504 | ENST00000557364 | - | 5 | 38 | 1789_2302 | 572.0 | 2582.0 | FAM124B


- Retained PPIs, but lost function due to frame-shift fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with


Related Drugs for CHD8-IMPG1


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status


Related Diseases for CHD8-IMPG1


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source
Hgene | CHD8 | C1510586 | Autism Spectrum Disorders | 8 | CLINGEN;CTD_human
Hgene | CHD8 | C3554373 | AUTISM, SUSCEPTIBILITY TO, 18 | 3 | GENOMICS_ENGLAND;UNIPROT
Hgene | CHD8 | C0004352 | Autistic Disorder | 2 | CTD_human;GENOMICS_ENGLAND
Hgene | CHD8 | C0221355 | Macrocephaly | 2 | CTD_human
Hgene | CHD8 | C0017178 | Gastrointestinal Diseases | 1 | CTD_human
Hgene | CHD8 | C0020796 | Profound Mental Retardation | 1 | CTD_human
Hgene | CHD8 | C0025363 | Mental Retardation, Psychosocial | 1 | CTD_human
Hgene | CHD8 | C0282631 | Facies | 1 | CTD_human
Hgene | CHD8 | C0559031 | Functional Gastrointestinal Disorders | 1 | CTD_human
Hgene | CHD8 | C0917816 | Mental deficiency | 1 | CTD_human
Hgene | CHD8 | C1565321 | Cholera Infantum | 1 | CTD_human
Hgene | CHD8 | C3714756 | Intellectual Disability | 1 | CTD_human
Tgene | IMPG1 | C1842914 | Adult-Onset Vitelliform Macular Dystrophy | 1 | CTD_human;ORPHANET
Tgene | IMPG1 | C4015342 | MACULAR DYSTROPHY, VITELLIFORM, 4 | 1 | GENOMICS_ENGLAND;UNIPROT