|
Fusion Gene Summary | |
Fusion Gene ORF analysis | |
Fusion Genomic Features | |
Fusion Protein Features | |
Fusion Gene Sequence | |
Fusion Gene PPI analysis | |
Related Drugs | |
Related Diseases |
Fusion gene:HID1-DNAH17 (FusionGDB2 ID:36191) |
Fusion Gene Summary for HID1-DNAH17 |
Fusion gene summary |
Fusion gene information | Fusion gene name: HID1-DNAH17 | Fusion gene ID: 36191 | Hgene | Tgene | Gene symbol | HID1 | DNAH17 | Gene ID | 283987 | 8632 |
Gene name | HID1 domain containing | dynein axonemal heavy chain 17 | |
Synonyms | 17orf28|C17orf28|DMC1|HID-1 | DNAHL1|DNEL2|SPGF39 | |
Cytomap | 17q25.1 | 17q25.3 | |
Type of gene | protein-coding | protein-coding | |
Description | protein HID1HID1 domain-containing proteinUPF0663 transmembrane protein C17orf28down-regulated in multiple cancers 1downregulated in multiple cancer 1protein hid-1 homolog | dynein heavy chain 17, axonemalaxonemal beta dynein heavy chain 17axonemal dynein heavy chain-like protein 1ciliary dynein heavy chain 17ciliary dynein heavy chain-like protein 1dynein light chain 2, axonemaldynein, axonemal, heavy polypeptide 17 | |
Modification date | 20200313 | 20200320 | |
UniProtAcc | Q8IV36 | Q9UFH2 | |
Ensembl transtripts involved in fusion gene | ENST00000425042, ENST00000532900, | ENST00000389840, ENST00000585328, ENST00000586052, | |
Fusion gene scores | * DoF score | 5 X 3 X 5=75 | 15 X 16 X 11=2640 |
# samples | 5 | 19 | |
** MAII score | log2(5/75*10)=-0.584962500721156 possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0 | log2(19/2640*10)=-3.79646660591487 possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0 | |
Context | PubMed: HID1 [Title/Abstract] AND DNAH17 [Title/Abstract] AND fusion [Title/Abstract] | ||
Most frequent breakpoint | HID1(72968686)-DNAH17(76506588), # samples:2 | ||
Anticipated loss of major functional domain due to fusion event. |
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types ** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10) |
Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez |
Partner | Gene | GO ID | GO term | PubMed ID |
Fusion gene breakpoints across HID1 (5'-gene) * Click on the image to open the UCSC genome browser with custom track showing this image in a new window. |
Fusion gene breakpoints across DNAH17 (3'-gene) * Click on the image to open the UCSC genome browser with custom track showing this image in a new window. |
Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0) * All genome coordinats were lifted-over on hg19. * Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser. |
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
ChimerDB4 | LIHC | TCGA-ED-A97K-01A | HID1 | chr17 | 72968686 | - | DNAH17 | chr17 | 76506588 | - |
ChimerDB4 | LIHC | TCGA-ED-A97K-01A | HID1 | chr17 | 72968686 | - | DNAH17 | chr17 | 76506588 | - |
Top |
Fusion Gene ORF analysis for HID1-DNAH17 |
Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure. * Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser. |
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
In-frame | ENST00000425042 | ENST00000389840 | HID1 | chr17 | 72968686 | - | DNAH17 | chr17 | 76506588 | - |
In-frame | ENST00000425042 | ENST00000585328 | HID1 | chr17 | 72968686 | - | DNAH17 | chr17 | 76506588 | - |
5CDS-intron | ENST00000425042 | ENST00000586052 | HID1 | chr17 | 72968686 | - | DNAH17 | chr17 | 76506588 | - |
intron-3CDS | ENST00000532900 | ENST00000389840 | HID1 | chr17 | 72968686 | - | DNAH17 | chr17 | 76506588 | - |
intron-3CDS | ENST00000532900 | ENST00000585328 | HID1 | chr17 | 72968686 | - | DNAH17 | chr17 | 76506588 | - |
intron-intron | ENST00000532900 | ENST00000586052 | HID1 | chr17 | 72968686 | - | DNAH17 | chr17 | 76506588 | - |
ORFfinder result based on the fusion transcript sequence of in-frame fusion genes. |
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids) |
ENST00000425042 | HID1 | chr17 | 72968686 | - | ENST00000389840 | DNAH17 | chr17 | 76506588 | - | 9710 | 144 | 6 | 9500 | 3164 |
ENST00000425042 | HID1 | chr17 | 72968686 | - | ENST00000585328 | DNAH17 | chr17 | 76506588 | - | 9623 | 144 | 6 | 9413 | 3135 |
DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated. |
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score |
ENST00000425042 | ENST00000389840 | HID1 | chr17 | 72968686 | - | DNAH17 | chr17 | 76506588 | - | 0.002845535 | 0.9971545 |
ENST00000425042 | ENST00000585328 | HID1 | chr17 | 72968686 | - | DNAH17 | chr17 | 76506588 | - | 0.003116398 | 0.9968836 |
Top |
Fusion Genomic Features for HID1-DNAH17 |
FusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints. |
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint) |
Distribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page. |
Distribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page. |
Top |
Fusion Protein Features for HID1-DNAH17 |
Go to FGviewer for the breakpoints of chr17:72968686-chr17:76506588 - FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels. |
Main function of each fusion partner protein. (from UniProt) |
Hgene | Tgene |
HID1 | DNAH17 |
FUNCTION: May play an important role in the development of cancers in a broad range of tissues. {ECO:0000269|PubMed:11281419}. | FUNCTION: Force generating protein component of the outer dynein arms (ODAs) in the sperm flagellum. Produces force towards the minus ends of microtubules. Dynein has ATPase activity; the force-producing power stroke is thought to occur on release of ADP (Probable). Plays a major role in sperm motility, implicated in sperm flagellar assembly and beating (PubMed:31178125). {ECO:0000269|PubMed:31178125, ECO:0000305|PubMed:31178125}. |
Retention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at * Minus value of BPloci means that the break pointn is located before the CDS. |
- In-frame and retained protein feature among the 13 regional features. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 3027_3086 | 1367.0 | 4486.0 | Coiled coil | Ontology_term=ECO:0000255 | |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 3257_3309 | 1367.0 | 4486.0 | Coiled coil | Ontology_term=ECO:0000255 | |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 1847_1854 | 1367.0 | 4486.0 | Nucleotide binding | ATP | |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 2128_2135 | 1367.0 | 4486.0 | Nucleotide binding | ATP | |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 2455_2462 | 1367.0 | 4486.0 | Nucleotide binding | ATP | |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 2801_2808 | 1367.0 | 4486.0 | Nucleotide binding | ATP | |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 1809_2030 | 1367.0 | 4486.0 | Region | AAA 1 | |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 2090_2311 | 1367.0 | 4486.0 | Region | AAA 2 | |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 2417_2665 | 1367.0 | 4486.0 | Region | AAA 3 | |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 2763_3012 | 1367.0 | 4486.0 | Region | AAA 4 | |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 3027_3313 | 1367.0 | 4486.0 | Region | Stalk | |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 3405_3632 | 1367.0 | 4486.0 | Region | AAA 5 | |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 3842_4068 | 1367.0 | 4486.0 | Region | AAA 6 | |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 1702_1736 | 1367.0 | 4486.0 | Repeat | Note=TPR 2 | |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 4147_4182 | 1367.0 | 4486.0 | Repeat | Note=TPR 3 |
- In-frame and not-retained protein feature among the 13 regional features. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 1_1808 | 1367.0 | 4486.0 | Region | Stem | |
Tgene | DNAH17 | chr17:72968686 | chr17:76506588 | ENST00000389840 | 25 | 81 | 1019_1052 | 1367.0 | 4486.0 | Repeat | Note=TPR 1 |
Top |
Fusion Gene Sequence for HID1-DNAH17 |
For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones. |
>In-frame_ENST00000425042_ENST00000389840_TCGA-ED-A97K-01A_HID1_chr17_72968686_-_DNAH17_chr17_76506588_length(transcript)=9710nt_BP=144nt GCGGAGCTGGAGCCGGAGCTGAAGCCGGAGCCGGGTTGGAGTCTGGGCGGGGGCCGGGCCGGAGCGGGCTCCAGAGACATGGGGTCGACC GACTCCAAGCTGAACTTCCGGAAGGCGGTGATCCAGCTCACCACCAAGACGCAGGTGAAATTTAAAATGTCAGAAGAGACGACCCTGGCA GATTTACTGCAGCTGAACCTCCACAGTTACGAGGATGAGGTCCGCAACATCGTGGACAAGGCCGTGAAGGAGTCGGGCATGGAAAAGGTG CTGAAAGCCCTGGACAGTACCTGGAGCATGATGGAATTCCAGCACGAGCCGCACCCGCGGACAGGCACCATGATGCTCAAGTCCAGCGAG GTGCTGGTGGAGACGCTGGAGGACAACCAGGTGCAGCTGCAGAACCTGATGATGTCCAAGTACCTGGCCCACTTCCTGAAGGAGGTGACA AGCTGGCAGCAGAAGCTGTCCACGGCGGACTCCGTCATCTCCATCTGGTTTGAGGTCCAGCGAACCTGGAGCCACCTGGAGAGCATCTTC ATCGGCTCCGAAGACATCCGCACCCAGCTCCCGGGGGACTCCCAGCGCTTTGACGACATCAACCAGGAATTCAAGGCCTTGATGGAAGAT GCAGTGAAAACACCCAACGTGGTGGAAGCCACCAGCAAACCCGGCCTCTACAATAAACTGGAGGCCCTGAAGAAGAGCTTGGCCATCTGT GAAAAGGCTTTGGCAGAGTATTTAGAGACGAAAAGACTGGCTTTCCCCCGGTTCTATTTTGTCTCCTCGGCTGACCTCCTGGACATTCTC TCCAATGGCAATGACCCCGTGGAGGTGAGCCGCCACCTGTCCAAACTCTTCGATAGCCTGTGTAAACTGAAGTTCCGGCTCGATGCCAGT GACAAACCTCTCAAGGTGGGCCTGGGAATGTACAGCAAGGAGGACGAGTACATGGTTTTTGATCAGGAATGCGACCTCTCGGGGCAGGTG GAAGTGTGGCTGAATCGAGTGCTGGACCGAATGTGCTCTACCCTCCGGCACGAAATCCCAGAGGCCGTGGTGACCTACGAAGAGAAGCCG AGGGAGCAGTGGATCCTGGACTACCCAGCCCAGATCTGGTGGACGACCGAGGTGGGCCTGGCATTTGCCAGGCTGGAGGAAGGCTATGAA AACGCTATCAGAGATTATAACAAAAAGCAGATTAGCCAGCTGAACGTACTCATCACGCTGCTCATGGGGAACCTCAACGCTGGCGACAGG ATGAAGATCATGACCATCTGCACCATCGATGTGCACGCACGGGACGTGGTGGCCAAAATGATCGTGGAGAGTTCTCAGGCCTTCACCTGG CAGGCCCAGCTCCGGCATCGCTGGGACGAAGAGAAGCGACACTGCTTTGCCAACATCTGCGATGCCCAAATCCAGTATTCCTATGAGTAT CTGGGCAACACGCCGCGGCTGGTCATCACCCCACTCACTGACAGGTGCTATATCACCCTGACCCAGTCCCTCCATCTCATCATGGGTGGA GCCCCTGCCGGCCCCGCTGGGACCGGCAAGACTGAGACGACCAAGGACCTGGGCAGAGCCCTGGGCACCATGGTCTACGTCTTCAACTGC TCCGAGCAGATGGACTACAAGTCCTGTGGAAATATCTACAAGGGCCTGGCCCAGACGGGAGCCTGGGGCTGCTTTGACGAGTTTAATCGC ATCTCAGTGGAAGTCTTGTCTGTGATTGCCGTGCAGGTAAAATGTGTCCAGGATGCAATTCGGGCCAAGAAAAAAGCATTCAATTTCCTG GGAGAGATCATAGGCCTCATTCCCACCGTCGGTATCTTCATCACCATGAACCCTGGGTACGCCGGACGCGCGGAGCTGCCTGAGAACCTA AAAGCCTTATTCAGGCCCTGTGCCATGGTCGTCCCCGACTTCGAACTGATATGTGAGATCATGCTCATGGCCGAGGGCTTTCTGGAAGCC CGCCTTCTGGCCAGGAAGTTCATCACCCTGTACACCTTGTGCAAGGAGCTGCTCTCGAAGCAGGATCATTACGACTGGGGCCTGAGAGCC ATCAAGTCTGTGCTGGTGGTGGCCGGCTCCCTGAAGAGGGGCGACCCCAGCCGGGCAGAGGACCAGGTGCTCATGCGGGCGCTGAGAGAC TTCAACATCCCCAAGATTGTGACAGACGACCTGCCCGTATTCATGGGACTGATCGGGGACCTCTTCCCGGCTCTGGACGTGCCTCGGAAA CGGGACCTGAATTTTGAAAAGATCATCAAGCAGAGCATCGTGGAGCTCAAGCTGCAGGCGGAGGACAGCTTCGTGCTGAAGGTGGTGCAG CTGGAGGAGCTGCTGCAGGTCCGCCACTCCGTGTTCATCGTCGGGAATGCGGGCAGCGGCAAATCTCAGGTCCTCAAATCCCTCAACAAG ACCTATCAGAACCTGAAGAGGAAGCCGGTCGCCGTGGACCTGGACCCCAAGGCCGTCACCTGCGACGAGCTCTTTGGCATCATCAACCCA GTGACCAGGGAATGGAAAGATGGCCTGTTCTCCACCATCATGCGAGACCTGGCCAACATCACCCATGACGGCCCCAAGTGGATCATCCTT GACGGAGACATAGACCCCATGTGGATCGAGTCTCTCAACACAGTCATGGATGACAACAAGGTCCTCACCCTGGCCAGCAACGAGCGGATC CCCCTGAACCGCACCATGAGGCTGGTGTTCGAAATCAGCCACCTGAGGACGGCCACCCCAGCCACCGTTTCCAGAGCCGGCATCCTCTAC ATCAACCCAGCCGACCTGGGATGGAACCCGGTGGTGAGCAGCTGGATCGAGAGGCGCAAGGTGCAGTCGGAGAAGGCCAACCTGATGATC CTCTTTGACAAGTACCTGCCCACGTGCCTGGACAAGTTGCGCTTTGGGTTCAAGAAGATCACGCCAGTGCCGGAGATCACGGTGATCCAA ACGATTCTGTACCTGCTGGAGTGCCTGCTCACGGAGAAGACCGTGCCCCCCGACTCCCCCAGGGAGCTGTACGAGCTGTACTTCGTGTTC ACCTGCTTCTGGGCCTTCGGTGGCGCCATGTTCCAGGACCAGCTTGTGGATTATCGAGTGGAGTTCAGTAAATGGTGGATCAACGAATTC AAGACTATCAAGTTCCCCTCGCAGGGAACGATTTTTGACTACTACATTGATCCTGACACAAAAAAGTTCCTGCCCTGGACAGATAAAGTG CCCTCCTTTGAGCTGGATCCCGATGTCCCACTGCAGGCCTCTTTGGTCCACACCACGGAAACCATCCGCATCCGCTACTTCATGGACCTG CTCATGGAGAAGTCCTGGCCGGTGATGCTGGTGGGGAACGCGGGGACGGGCAAGTCGGTGCTGATGGGGGACAAGCTGGAAAGCCTGAAC ACGGACAACTACCTGGTGCAGGCTGTGCCCTTCAACTTCTACACGACCTCAGCCATGCTGCAGGGGGTGCTGGAGAAGCCGCTGGAGAAG AAATCGGGGAGGAACTACGGGCCGCCAGGCACTAAGAAGCTCGTCTACTTCATCGACGACATGAACATGCCCGAGGTGGACAAGTATGGG ACGGTGGCCCCGCACACCCTCATCCGGCAGCACATGGACCACCGGCACTGGTATGACAGACATAAGCTGACGTTAAAAGATATCCATAAT TGTCAGTACGTGGCCTGCATGAACCCCACTTCCGGATCCTTCACCATCGACTCCAGGCTTCAGCGCCATTTCTGCGTGTTTGCTGTGAGC TTCCCCGGCCAGGAGGCCCTCACCACCATCTACAACACAATCCTGACGCAGCACCTGGCCTTCCGCTCGGTCTCCATGGCTATCCAGAGG ATAAGCAGCCAGCTGGTGGCCGCGGCCCTGGCTTTGCATCAGAAAATCACGGCAACATTTCTTCCCACGGCCATTAAGTTTCATTATGTC TTCAACCTCAGGGACCTCTCCAATATTTTCCAGGGACTCTTATTTTCCACAGCAGAAGTTCTGAAAACCCCACTGGACCTCGTCCGCCTT TGGCTACATGAGACTGAACGAGTGTATGGTGACAAAATGGTTGACGAAAAAGACCAGGAAACATTGCATAGAGTCACCATGGCCTCCACC AAGAAGTTCTTTGATGATCTTGGTGATGAACTCTTATTTGCCAAGCCAAATATCTTCTGCCACTTTGCTCAAGGGATTGGCGATCCCAAA TATGTTCCTGTAACCGACATGGCTCCTCTGAACAAGCTCCTCGTGGACGTCCTGGACAGCTACAATGAAGTTAATGCAGTCATGAATTTG GTGCTGTTTGAGGACGCCGTGGCTCACATCTGCAGGATTAATCGCATCCTGGAGTCTCCCCGGGGGAATGCCCTGCTGGTGGGGGTGGGC GGCAGTGGCAAACAGAGCCTCTCCCGCCTGGCAGCGTACATCAGCGGGCTTGACGTGTTTCAGATCACCCTCAAGAAGGGCTACGGGATC CCCGACCTCAAGATTGACCTCGCTGCTCAGTACATAAAGGCTGCCGTGAAGAACGTTCCCTCGGTGTTCCTGATGACAGACTCCCAGGTG GCCGAGGAGCAGTTTCTGGTGCTGATCAATGACCTGCTGGCCTCAGGAGAGATCCCTGGGCTGTTTATGGAGGACGAGGTGGAGAACATC ATCTCCTCCATGCGACCCCAAGTCAAGTCCCTTGGCATGAATGACACTCGGGAAACATGTTGGAAGTTCTTCATCGAAAAAGTGCGCAGA CAGCTCAAGGTGATCCTGTGTTTCTCCCCTGTGGGCTCCGTGCTGCGGGTACGAGCCAGAAAGTTCCCAGCTGTGGTCAACTGCACGGCC ATCGACTGGTTCCACGAGTGGCCGGAAGATGCGCTGGTGTCCGTCAGCGCCCGCTTCCTGGAGGAGACTGAGGGGATTCCGTGGGAAGTC AAGGCCTCCATCAGCTTCTTCATGTCCTACGTGCACACCACCGTCAACGAGATGTCCAGGGTATACCTGGCTACTGAGAGGCGCTACAAC TACACCACACCCAAAACCTTTCTGGAGCAGATCAAACTGTACCAGAACCTGCTGGCCAAGAAGAGAACGGAACTTGTTGCCAAAATCGAG AGGCTGGAGAACGGCCTGATGAAGCTGCAGAGCACGGCTTCCCAGGTGGATGATTTGAAAGCCAAGTTGGCGATTCAGGAGGCTGAGCTC AAGCAGAAGAATGAGAGCGCAGACCAACTGATCCAGGTGGTCGGCATCGAGGCCGAGAAGGTCAGCAAAGAGAAGGCCATTGCTGACCAG GAAGAAGTCAAGGTCGAGGTCATCAATAAGAACGTCACTGAGAAGCAAAAGGCCTGTGAAACAGACCTGGCCAAAGCAGAACCGGCCCTG CTGGCAGCCCAGGAGGCTCTGGACACTCTGAATAAGAACAACCTGACAGAGCTGAAGTCCTTTGGGTCCCCGCCGGATGCTGTGGTCAAC GTCACCGCCGCCGTCATGATTCTGACCGCACCTGGGGGCAAGATCCCCAAGGACAAGAGCTGGAAGGCGGCCAAGATCATGATGGGCAAG GTGGACACCTTCCTAGACTCCCTGAAGAAGTTCGACAAGGAGCACATCCCTGAGGCCTGCCTGAAGGCCTTCAAGCCCTACCAAGGCAAC CCGACGTTCGACCCCGAGTTCATCCGCTCCAAGTCCACGGCCGCCGCCGGCCTGTGCTCCTGGTGCATCAACATCGTCCGCTTCTACGAG GTCTACTGCGACGTGGCGCCCAAGAGGCAGGCACTGGAGGAGGCTAATGCAGAGCTGGCAGAGGCACAAGAGAAGCTGTCCCGGATCAAA AACAAGATTGCCGAACTTAACGCCAACCTGAGCAACCTAACCTCAGCGTTTGAAAAAGCAACAGCTGAGAAAATCAAGTGTCAGCAAGAG GCCGATGCCACGAACAGGGTGATCTTACTGGCGAACAGGCTGGTCGGGGGATTAGCATCGGAAAACATCCGCTGGGCTGAGTCTGTGGAG AACTTCAGGAGCCAGGGGGTCACGCTGTGTGGGGACGTCCTGCTCATCTCTGCCTTCGTGTCCTACGTGGGCTACTTCACCAAGAAATAC CGGAATGAGCTGATGGAGAAATTCTGGATCCCTTACATACATAACTTAAAGGTCCCCATCCCGATCACGAATGGCCTGGATCCCTTGAGC CTGCTGACAGATGACGCGGACGTGGCCACCTGGAACAACCAGGGCCTCCCCAGCGACCGCATGTCCACCGAGAATGCCACCATCCTGGGC AACACCGAGCGGTGGCCGCTGATCGTGGACGCCCAGCTCCAAGGAATCAAGTGGATCAAAAACAAATACAGGAGTGAACTGAAAGCCATC CGCCTGGGACAGAAGAGCTACCTGGATGTCATCGAGCAGGCCATCTCGGAAGGGGACACCTTGCTCATTGAGAACATCGGCGAAACCGTG GACCCCGTGCTGGACCCTCTACTGGGCAGGAACACGATTAAAAAGGGAAAGTACATTAAGATCGGTGACAAGGAGGTGGAGTACCACCCC AAGTTCCGCCTGATCCTACACACCAAGTACTTCAACCCACACTACAAGCCAGAGATGCAGGCTCAGTGCACCCTCATCAACTTCCTGGTC ACCAGGGATGGACTCGAGGACCAACTCTTGGCCGCTGTGGTGGCCAAAGAGCGCCCAGATCTGGAACAGCTGAAGGCAAACCTCACCAAG TCTCAAAACGAATTTAAGATTGTTCTGAAAGAGCTGGAAGATTCGCTCCTGGCCCGTCTGTCGGCTGCGTCGGGGAACTTTCTGGGAGAC ACGGCCTTGGTGGAGAATCTGGAGACCACCAAGCACACAGCCAGCGAGATCGAGGAGAAGGTGGTGGAGGCAAAAATCACAGAAGTTAAA ATCAACGAAGCGAGAGAGAACTACCGCCCGGCTGCGGAGAGGGCATCTCTGCTCTACTTCATACTGAACGATCTCAACAAAATCAACCCC GTCTACCAGTTCTCCCTCAAGGCCTTCAACGTGGTGTTTGAGAAAGCCATCCAGAGGACCACCCCTGCCAACGAGGTGAAGCAGCGGGTG ATCAACCTGACGGACGAGATCACCTACTCCGTCTACATGTACACGGCCCGGGGACTCTTCGAGAGGGACAAACTCATTTTCCTGGCACAA GTTACGTTTCAGGTCCTGTCCATGAAGAAGGAGCTGAACCCAGTGGAGCTGGATTTCCTCCTGCGGTTCCCTTTTAAGGCCGGAGTGGTC TCACCAGTGGACTTCCTCCAGCATCAAGGCTGGGGCGGGATCAAGGCCCTCTCGGAGATGGATGAGTTCAAAAATCTGGACAGTGACATC GAAGGATCTGCCAAGCGCTGGAAAAAGCTGGTGGAGTCGGAAGCCCCCGAGAAGGAGATCTTCCCCAAGGAGTGGAAGAACAAGACGGCC CTGCAGAAGCTGTGCATGGTGCGCTGCCTGCGGCCAGATCGCATGACCTACGCTATCAAGAACTTCGTGGAGGAAAAGATGGGCAGCAAG TTCGTGGAAGGCCGGAGTGTTGAGTTTTCTAAGTCCTACGAGGAGAGCAGCCCCTCCACGTCAATCTTCTTCATCCTCTCCCCGGGGGTT GACCCCTTGAAAGACGTGGAAGCCCTGGGAAAAAAACTAGGGTTTACCATAGACAATGGAAAACTCCATAATGTGTCCCTGGGGCAGGGA CAAGAGGTGGTGGCTGAGAACGCCCTGGACGTGGCTGCAGAGAAAGGACACTGGGTCATTCTGCAGGTACGAGGGGGCCAGCACTGCAGG AATATCCACCTGGTGGCCCGGTGGCTGGGAACACTGGACAAGAAGCTGGAGCACTACAGCACGGGCAGCCATGAGGACTACCGGGTGTTC ATCAGCGCGGAGCCTGCCCCCAGCCCCGAGACCCACATCATCCCCCAGGGCATTCTGGAGAACGCCATCAAGATCACCAACGAGCCCCCC ACGGGCATGCACGCCAACTTGCACAAGGCCCTGGACCTGTTCACCCAGGACACCCTGGAGATGTGCACCAAGGAGATGGAGTTCAAGTGC ATGCTCTTCGCCCTGTGCTACTTCCACGCTGTGGTGGCAGAGAGGCGCAAGTTCGGCGCCCAGGGCTGGAACCGGTCGTACCCCTTCAAC AACGGGGACCTCACCATCTCCATCAACGTGCTCTACAACTACCTGGAGGCCAACCCCAAGGTGCCCTGGGACGATCTCCGCTACCTTTTT GGTGAAATCATGTATGGCGGCCACATCACAGATGACTGGGACCGTCGGCTGTGCAGGACCTACCTGGCTGAATACATCCGGACGGAGATG CTGGAGGGAGACGTCCTGCTGGCCCCCGGCTTTCAGATCCCCCCCAACCTGGACTACAAGGGTTACCACGAATACATCGATGAGAACCTG CCCCCTGAGAGTCCCTATCTGTATGGCCTGCACCCCAACGCAGAGATTGGCTTTCTGACGGTCACCTCAGAGAAGCTGTTCCGCACTGTC CTGGAAATGCAGCCAAAAGAGACGGACTCGGGGGCAGGCACGGGAGTGTCCCGCGAGGAGAAGGCAGGATCTTTGAAACTGCTCCCAAGC GAGAGGAAGGGGGAGGATCTAGAACTGAGGAGGGGGGGCTGTCCGGGGACTGGCTTCCAGGTGAAGGCCGTGCTGGACGACATCCTGGAG AAGATTCCGGAGACTTTCAACATGGCTGAGATCATGGCAAAGGCAGCGGAAAAGACCCCCTACGTGGTAGTCGCCTTTCAAGAATGTGAA AGAATGAACATCCTGACCAACGAAATGCGCCGTTCGCTCAAGGAGCTGAACCTGGGGCTGAAGGGAGAACTGACCATCACGACCGACGTG GAAGATCTGTCCACGGCTCTCTTCTATGACACCGTGCCTGATACGTGGGTGGCCCGGGCCTACCCCTCCATGATGGGCCTGGCGGCCTGG TACGCAGACCTGCTGCTCCGCATCAGGGAACTCGAGGCCTGGACGACAGACTTTGCCCTGCCCACCACCGTGTGGCTGGCCGGCTTCTTC AACCCCCAGTCGTTCCTCACGGCCATCATGCAGTCCATGGCCAGGAAGAACGAGTGGCCCCTGGACAAGATGTGTCTGTCTGTCGAGGTG ACCAAGAAAAACCGAGAGGACATGACCGCTCCTCCGCGAGAGGGCTCCTACGTGTACGGACTCTTCATGGAAGGGGCTCGCTGGGACACC CAGACTGGAGTCATCGCTGAAGCGCGGCTGAAAGAGCTGACCCCGGCCATGCCTGTCATCTTCATCAAGGCCATTCCTGTGGACCGCATG GAGACCAAGAACATCTATGAGTGTCCCGTGTACAAAACACGCATCCGCGGCCCCACCTATGTCTGGACCTTTAACTTGAAGACCAAAGAG AAGGCAGCGAAGTGGATCCTGGCAGCCGTGGCGCTGCTCCTACAGGTTTAGCTCGCTCCTGCCTCACAGCCCACACTCCCTGGGGCTGGA CCACAACTCAGCCCTTCACCTGTGCACCTGTGACTTATTCTTTACAGGAACTGGTGGTGGTTTTTCGTTCTCTTAAATAATCAGGTGCTT >In-frame_ENST00000425042_ENST00000389840_TCGA-ED-A97K-01A_HID1_chr17_72968686_-_DNAH17_chr17_76506588_length(amino acids)=3164AA_start in transcript=6_stop in transcript=9500 MEPELKPEPGWSLGGGRAGAGSRDMGSTDSKLNFRKAVIQLTTKTQVKFKMSEETTLADLLQLNLHSYEDEVRNIVDKAVKESGMEKVLK ALDSTWSMMEFQHEPHPRTGTMMLKSSEVLVETLEDNQVQLQNLMMSKYLAHFLKEVTSWQQKLSTADSVISIWFEVQRTWSHLESIFIG SEDIRTQLPGDSQRFDDINQEFKALMEDAVKTPNVVEATSKPGLYNKLEALKKSLAICEKALAEYLETKRLAFPRFYFVSSADLLDILSN GNDPVEVSRHLSKLFDSLCKLKFRLDASDKPLKVGLGMYSKEDEYMVFDQECDLSGQVEVWLNRVLDRMCSTLRHEIPEAVVTYEEKPRE QWILDYPAQIWWTTEVGLAFARLEEGYENAIRDYNKKQISQLNVLITLLMGNLNAGDRMKIMTICTIDVHARDVVAKMIVESSQAFTWQA QLRHRWDEEKRHCFANICDAQIQYSYEYLGNTPRLVITPLTDRCYITLTQSLHLIMGGAPAGPAGTGKTETTKDLGRALGTMVYVFNCSE QMDYKSCGNIYKGLAQTGAWGCFDEFNRISVEVLSVIAVQVKCVQDAIRAKKKAFNFLGEIIGLIPTVGIFITMNPGYAGRAELPENLKA LFRPCAMVVPDFELICEIMLMAEGFLEARLLARKFITLYTLCKELLSKQDHYDWGLRAIKSVLVVAGSLKRGDPSRAEDQVLMRALRDFN IPKIVTDDLPVFMGLIGDLFPALDVPRKRDLNFEKIIKQSIVELKLQAEDSFVLKVVQLEELLQVRHSVFIVGNAGSGKSQVLKSLNKTY QNLKRKPVAVDLDPKAVTCDELFGIINPVTREWKDGLFSTIMRDLANITHDGPKWIILDGDIDPMWIESLNTVMDDNKVLTLASNERIPL NRTMRLVFEISHLRTATPATVSRAGILYINPADLGWNPVVSSWIERRKVQSEKANLMILFDKYLPTCLDKLRFGFKKITPVPEITVIQTI LYLLECLLTEKTVPPDSPRELYELYFVFTCFWAFGGAMFQDQLVDYRVEFSKWWINEFKTIKFPSQGTIFDYYIDPDTKKFLPWTDKVPS FELDPDVPLQASLVHTTETIRIRYFMDLLMEKSWPVMLVGNAGTGKSVLMGDKLESLNTDNYLVQAVPFNFYTTSAMLQGVLEKPLEKKS GRNYGPPGTKKLVYFIDDMNMPEVDKYGTVAPHTLIRQHMDHRHWYDRHKLTLKDIHNCQYVACMNPTSGSFTIDSRLQRHFCVFAVSFP GQEALTTIYNTILTQHLAFRSVSMAIQRISSQLVAAALALHQKITATFLPTAIKFHYVFNLRDLSNIFQGLLFSTAEVLKTPLDLVRLWL HETERVYGDKMVDEKDQETLHRVTMASTKKFFDDLGDELLFAKPNIFCHFAQGIGDPKYVPVTDMAPLNKLLVDVLDSYNEVNAVMNLVL FEDAVAHICRINRILESPRGNALLVGVGGSGKQSLSRLAAYISGLDVFQITLKKGYGIPDLKIDLAAQYIKAAVKNVPSVFLMTDSQVAE EQFLVLINDLLASGEIPGLFMEDEVENIISSMRPQVKSLGMNDTRETCWKFFIEKVRRQLKVILCFSPVGSVLRVRARKFPAVVNCTAID WFHEWPEDALVSVSARFLEETEGIPWEVKASISFFMSYVHTTVNEMSRVYLATERRYNYTTPKTFLEQIKLYQNLLAKKRTELVAKIERL ENGLMKLQSTASQVDDLKAKLAIQEAELKQKNESADQLIQVVGIEAEKVSKEKAIADQEEVKVEVINKNVTEKQKACETDLAKAEPALLA AQEALDTLNKNNLTELKSFGSPPDAVVNVTAAVMILTAPGGKIPKDKSWKAAKIMMGKVDTFLDSLKKFDKEHIPEACLKAFKPYQGNPT FDPEFIRSKSTAAAGLCSWCINIVRFYEVYCDVAPKRQALEEANAELAEAQEKLSRIKNKIAELNANLSNLTSAFEKATAEKIKCQQEAD ATNRVILLANRLVGGLASENIRWAESVENFRSQGVTLCGDVLLISAFVSYVGYFTKKYRNELMEKFWIPYIHNLKVPIPITNGLDPLSLL TDDADVATWNNQGLPSDRMSTENATILGNTERWPLIVDAQLQGIKWIKNKYRSELKAIRLGQKSYLDVIEQAISEGDTLLIENIGETVDP VLDPLLGRNTIKKGKYIKIGDKEVEYHPKFRLILHTKYFNPHYKPEMQAQCTLINFLVTRDGLEDQLLAAVVAKERPDLEQLKANLTKSQ NEFKIVLKELEDSLLARLSAASGNFLGDTALVENLETTKHTASEIEEKVVEAKITEVKINEARENYRPAAERASLLYFILNDLNKINPVY QFSLKAFNVVFEKAIQRTTPANEVKQRVINLTDEITYSVYMYTARGLFERDKLIFLAQVTFQVLSMKKELNPVELDFLLRFPFKAGVVSP VDFLQHQGWGGIKALSEMDEFKNLDSDIEGSAKRWKKLVESEAPEKEIFPKEWKNKTALQKLCMVRCLRPDRMTYAIKNFVEEKMGSKFV EGRSVEFSKSYEESSPSTSIFFILSPGVDPLKDVEALGKKLGFTIDNGKLHNVSLGQGQEVVAENALDVAAEKGHWVILQVRGGQHCRNI HLVARWLGTLDKKLEHYSTGSHEDYRVFISAEPAPSPETHIIPQGILENAIKITNEPPTGMHANLHKALDLFTQDTLEMCTKEMEFKCML FALCYFHAVVAERRKFGAQGWNRSYPFNNGDLTISINVLYNYLEANPKVPWDDLRYLFGEIMYGGHITDDWDRRLCRTYLAEYIRTEMLE GDVLLAPGFQIPPNLDYKGYHEYIDENLPPESPYLYGLHPNAEIGFLTVTSEKLFRTVLEMQPKETDSGAGTGVSREEKAGSLKLLPSER KGEDLELRRGGCPGTGFQVKAVLDDILEKIPETFNMAEIMAKAAEKTPYVVVAFQECERMNILTNEMRRSLKELNLGLKGELTITTDVED LSTALFYDTVPDTWVARAYPSMMGLAAWYADLLLRIRELEAWTTDFALPTTVWLAGFFNPQSFLTAIMQSMARKNEWPLDKMCLSVEVTK KNREDMTAPPREGSYVYGLFMEGARWDTQTGVIAEARLKELTPAMPVIFIKAIPVDRMETKNIYECPVYKTRIRGPTYVWTFNLKTKEKA -------------------------------------------------------------- >In-frame_ENST00000425042_ENST00000585328_TCGA-ED-A97K-01A_HID1_chr17_72968686_-_DNAH17_chr17_76506588_length(transcript)=9623nt_BP=144nt GCGGAGCTGGAGCCGGAGCTGAAGCCGGAGCCGGGTTGGAGTCTGGGCGGGGGCCGGGCCGGAGCGGGCTCCAGAGACATGGGGTCGACC GACTCCAAGCTGAACTTCCGGAAGGCGGTGATCCAGCTCACCACCAAGACGCAGGTGAAATTTAAAATGTCAGAAGAGACGACCCTGGCA GATTTACTGCAGCTGAACCTCCACAGTTACGAGGATGAGGTCCGCAACATCGTGGACAAGGCCGTGAAGGAGTCGGGCATGGAAAAGGTG CTGAAAGCCCTGGACAGTACCTGGAGCATGATGGAATTCCAGCACGAGCCGCACCCGCGGACAGGCACCATGATGCTCAAGTCCAGCGAG GTGCTGGTGGAGACGCTGGAGGACAACCAGGTGCAGCTGCAGAACCTGATGATGTCCAAGTACCTGGCCCACTTCCTGAAGGAGGTGACA AGCTGGCAGCAGAAGCTGTCCACGGCGGACTCCGTCATCTCCATCTGGTTTGAGGTCCAGCGAACCTGGAGCCACCTGGAGAGCATCTTC ATCGGCTCCGAAGACATCCGCACCCAGCTCCCGGGGGACTCCCAGCGCTTTGACGACATCAACCAGGAATTCAAGGCCTTGATGGAAGAT GCAGTGAAAACACCCAACGTGGTGGAAGCCACCAGCAAACCCGGCCTCTACAATAAACTGGAGGCCCTGAAGAAGAGCTTGGCCATCTGT GAAAAGGCTTTGGCAGAGTATTTAGAGACGAAAAGACTGGCTTTCCCCCGGTTCTATTTTGTCTCCTCGGCTGACCTCCTGGACATTCTC TCCAATGGCAATGACCCCGTGGAGGTGAGCCGCCACCTGTCCAAACTCTTCGATAGCCTGTGTAAACTGAAGTTCCGGCTCGATGCCAGT GACAAACCTCTCAAGGTGGGCCTGGGAATGTACAGCAAGGAGGACGAGTACATGGTTTTTGATCAGGAATGCGACCTCTCGGGGCAGGTG GAAGTGTGGCTGAATCGAGTGCTGGACCGAATGTGCTCTACCCTCCGGCACGAAATCCCAGAGGCCGTGGTGACCTACGAAGAGAAGCCG AGGGAGCAGTGGATCCTGGACTACCCAGCCCAGGTGGCCCTGACTTGCACCCAGATCTGGTGGACGACCGAGGTGGGCCTGGCATTTGCC AGGCTGGAGGAAGGCTATGAAAACGCTATCAGAGATTATAACAAAAAGCAGATTAGCCAGCTGAACGTACTCATCACGCTGCTCATGGGG AACCTCAACGCTGGCGACAGGATGAAGATCATGACCATCTGCACCATCGATGTGCACGCACGGGACGTGGTGGCCAAAATGATCGTGGTG GAGAGTTCTCAGGCCTTCACCTGGCAGGCCCAGCTCCGGCATCGCTGGGACGAAGAGAAGCGACACTGCTTTGCCAACATCTGCGATGCC CAAATCCAGTATTCCTATGAGTATCTGGGCAACACGCCGCGGCTGGTCATCACCCCACTCACTGACAGGTGCTATATCACCCTGACCCAG TCCCTCCATCTCATCATGGGTGGAGCCCCTGCCGGCCCCGCTGGGACCGGCAAGACTGAGACGACCAAGGACCTGGGCAGAGCCCTGGGC ACCATGGTCTACGTCTTCAACTGCTCCGAGCAGATGGACTACAAGTCCTGTGGAAATATCTACAAGGGCCTGGCCCAGACGGGAGCCTGG GGCTGCTTTGACGAGTTTAATCGCATCTCAGTGGAAGTCTTGTCTGTGATTGCCGTGCAGGTAAAATGTGTCCAGGATGCAATTCGGGCC AAGAAAAAAGCATTCAATTTCCTGGGAGAGATCATAGGCCTCATTCCCACCGTCGGTATCTTCATCACCATGAACCCTGGGTACGCCGGA CGCGCGGAGCTGCCTGAGAACCTAAAAGCCTTATTCAGGCCCTGTGCCATGGTCGTCCCCGACTTCGAACTGATATGTGAGATCATGCTC ATGGCCGAGGGCTTTCTGGAAGCCCGCCTTCTGGCCAGGAAGTTCATCACCCTGTACACCTTGTGCAAGGAGCTGCTCTCGAAGCAGGAT CATTACGACTGGGGCCTGAGAGCCATCAAGTCTGTGCTGGTGGTGGCCGGCTCCCTGAAGAGGGGCGACCCCAGCCGGGCAGAGGACCAG GTGCTCATGCGGGCGCTGAGAGACTTCAACATCCCCAAGATTGTGACAGACGACCTGCCCGTATTCATGGGACTGATCGGGGACCTCTTC CCGGCTCTGGACGTGCCTCGGAAACGGGACCTGAATTTTGAAAAGATCATCAAGCAGAGCATCGTGGAGCTCAAGCTGCAGGCGGAGGAC AGCTTCGTGCTGAAGGTGGTGCAGCTGGAGGAGCTGCTGCAGGTCCGCCACTCCGTGTTCATCGTCGGGAATGCGGGCAGCGGCAAATCT CAGGTCCTCAAATCCCTCAACAAGACCTATCAGAACCTGAAGAGGAAGCCGGTCGCCGTGGACCTGGACCCCAAGGCCGTCACCTGCGAC GAGCTCTTTGGCATCATCAACCCAGTGACCAGGGAATGGAAAGATGGCCTGTTCTCCACCATCATGCGAGACCTGGCCAACATCACCCAT GACGGCCCCAAGTGGATCATCCTTGACGGAGACATAGACCCCATGTGGATCGAGTCTCTCAACACAGTCATGGATGACAACAAGGTCCTC ACCCTGGCCAGCAACGAGCGGATCCCCCTGAACCGCACCATGAGGCTGGTGTTCGAAATCAGCCACCTGAGGACGGCCACCCCAGCCACC GTTTCCAGAGCCGGCATCCTCTACATCAACCCAGCCGACCTGGGATGGAACCCGGTGGTGAGCAGCTGGATCGAGAGGCGCAAGGTGCAG TCGGAGAAGGCCAACCTGATGATCCTCTTTGACAAGTACCTGCCCACGTGCCTGGACAAGTTGCGCTTTGGGTTCAAGAAGATCACGCCA GTGCCGGAGATCACGGTGATCCAAACGATTCTGTACCTGCTGGAGTGCCTGCTCACGGAGAAGACCGTGCCCCCCGACTCCCCCAGGGAG CTGTACGAGCTGTACTTCGTGTTCACCTGCTTCTGGGCCTTCGGTGGCGCCATGTTCCAGGACCAGCTTGTGGATTATCGAGTGGAGTTC AGTAAATGGTGGATCAACGAATTCAAGACTATCAAGTTCCCCTCGCAGGGAACGATTTTTGACTACTACATTGATCCTGACACAAAAAAG TTCCTGCCCTGGACAGATAAAGTGCCCTCCTTTGAGCTGGATCCCGATGTCCCACTGCAGGCCTCTTTGGTCCACACCACGGAAACCATC CGCATCCGCTACTTCATGGACCTGCTCATGGAGAAGTCCTGGCCGGTGATGCTGGTGGGGAACGCGGGGACGGGCAAGTCGGTGCTGATG GGGGACAAGCTGGAAAGCCTGAACACGGACAACTACCTGGTGCAGGCTGTGCCCTTCAACTTCTACACGACCTCAGCCATGCTGCAGGGG GTGCTGGAGAAGCCGCTGGAGAAGAAATCGGGGAGGAACTACGGGCCGCCAGGCACTAAGAAGCTCGTCTACTTCATCGACGACATGAAC ATGCCCGAGGTGGACAAGTATGGGACGGTGGCCCCGCACACCCTCATCCGGCAGCACATGGACCACCGGCACTGGTATGACAGACATAAG CTGACGTTAAAAGATATCCATAATTGTCAGTACGTGGCCTGCATGAACCCCACTTCCGGATCCTTCACCATCGACTCCAGGCTTCAGCGC CATTTCTGCGTGTTTGCTGTGAGCTTCCCCGGCCAGGAGGCCCTCACCACCATCTACAACACAATCCTGACGCAGCACCTGGCCTTCCGC TCGGTCTCCATGGCTATCCAGAGGATAAGCAGCCAGCTGGTGGCCGCGGCCCTGGCTTTGCATCAGAAAATCACGGCAACATTTCTTCCC ACGGCCATTAAGTTTCATTATGTCTTCAACCTCAGGGACCTCTCCAATATTTTCCAGGGACTCTTATTTTCCACAGCAGAAGTTCTGAAA ACCCCACTGGACCTCGTCCGCCTTTGGCTACATGAGACTGAACGAGTGTATGGTGACAAAATGGTTGACGAAAAAGACCAGGAAACATTG CATAGAGTCACCATGGCCTCCACCAAGAAGTTCTTTGATGATCTTGGTGATGAACTCTTATTTGCCAAGCCAAATATCTTCTGCCACTTT GCTCAAGGGATTGGCGATCCCAAATATGTTCCTGTAACCGACATGGCTCCTCTGAACAAGCTCCTCGTGGACGTCCTGGACAGCTACAAT GAAGTTAATGCAGTCATGAATTTGGTGCTGTTTGAGGACGCCGTGGCTCACATCTGCAGGATTAATCGCATCCTGGAGTCTCCCCGGGGG AATGCCCTGCTGGTGGGGGTGGGCGGCAGTGGCAAACAGAGCCTCTCCCGCCTGGCAGCGTACATCAGCGGGCTTGACGTGTTTCAGATC ACCCTCAAGAAGGGCTACGGGATCCCCGACCTCAAGATTGACCTCGCTGCTCAGTACATAAAGGCTGCCGTGAAGAACGTTCCCTCGGTG TTCCTGATGACAGACTCCCAGGTGGCCGAGGAGCAGTTTCTGGTGCTGATCAATGACCTGCTGGCCTCAGGAGAGATCCCTGGGCTGTTT ATGGAGGACGAGGTGGAGAACATCATCTCCTCCATGCGACCCCAAGTCAAGTCCCTTGGCATGAATGACACTCGGGAAACATGTTGGAAG TTCTTCATCGAAAAAGTGCGCAGACAGCTCAAGGTGATCCTGTGTTTCTCCCCTGTGGGCTCCGTGCTGCGGGTACGAGCCAGAAAGTTC CCAGCTGTGGTCAACTGCACGGCCATCGACTGGTTCCACGAGTGGCCGGAAGATGCGCTGGTGTCCGTCAGCGCCCGCTTCCTGGAGGAG ACTGAGGGGATTCCGTGGGAAGTCAAGGCCTCCATCAGCTTCTTCATGTCCTACGTGCACACCACCGTCAACGAGATGTCCAGGGTATAC CTGGCTACTGAGAGGCGCTACAACTACACCACACCCAAAACCTTTCTGGAGCAGATCAAACTGTACCAGAACCTGCTGGCCAAGAAGAGA ACGGAACTTGTTGCCAAAATCGAGAGGCTGGAGAACGGCCTGATGAAGCTGCAGAGCACGGCTTCCCAGGTGGATGATTTGAAAGCCAAG TTGGCGATTCAGGAGGCTGAGCTCAAGCAGAAGAATGAGAGCGCAGACCAACTGATCCAGGTGGTCGGCATCGAGGCCGAGAAGGTCAGC AAAGAGAAGGCCATTGCTGACCAGGAAGAAGTCAAGGTCGAGGTCATCAATAAGAACGTCACTGAGAAGCAAAAGGCCTGTGAAACAGAC CTGGCCAAAGCAGAACCGGCCCTGCTGGCAGCCCAGGAGGCTCTGGACACTCTGAATAAGAACAACCTGACAGAGCTGAAGTCCTTTGGG TCCCCGCCGGATGCTGTGGTCAACGTCACCGCCGCCGTCATGATTCTGACCGCACCTGGGGGCAAGATCCCCAAGGACAAGAGCTGGAAG GCGGCCAAGATCATGATGGGCAAGGTGGACACCTTCCTAGACTCCCTGAAGAAGTTCGACAAGGAGCACATCCCTGAGGCCTGCCTGAAG GCCTTCAAGCCCTACCAAGGCAACCCGACGTTCGACCCCGAGTTCATCCGCTCCAAGTCCACGGCCGCCGCCGGCCTGTGCTCCTGGTGC ATCAACATCGTCCGCTTCTACGAGGTCTACTGCGACGTGGCGCCCAAGAGGCAGGCACTGGAGGAGGCTAATGCAGAGCTGGCAGAGGCA CAAGAGAAGCTGTCCCGGATCAAAAACAAGATTGCCGAACTTAACGCCAACCTGAGCAACCTAACCTCAGCGTTTGAAAAAGCAACAGCT GAGAAAATCAAGTGTCAGCAAGAGGCCGATGCCACGAACAGGGTGATCTTACTGGCGAACAGGCTGGTCGGGGGATTAGCATCGGAAAAC ATCCGCTGGGCTGAGTCTGTGGAGAACTTCAGGAGCCAGGGGGTCACGCTGTGTGGGGACGTCCTGCTCATCTCTGCCTTCGTGTCCTAC GTGGGCTACTTCACCAAGAAATACCGGAATGAGCTGATGGAGAAATTCTGGATCCCTTACATACATAACTTAAAGGTCCCCATCCCGATC ACGAATGGCCTGGATCCCTTGAGCCTGCTGACAGATGACGCGGACGTGGCCACCTGGAACAACCAGGGCCTCCCCAGCGACCGCATGTCC ACCGAGAATGCCACCATCCTGGGCAACACCGAGCGGTGGCCGCTGATCGTGGACGCCCAGCTCCAAGGAATCAAGTGGATCAAAAACAAA TACAGGAGTGAACTGAAAGCCATCCGCCTGGGACAGAAGAGCTACCTGGATGTCATCGAGCAGGCCATCTCGGAAGGGGACACCTTGCTC ATTGAGAACATCGGCGAAACCGTGGACCCCGTGCTGGACCCTCTACTGGGCAGGAACACGATTAAAAAGGGAAAGTACATTAAGATCGGT GACAAGGAGGTGGAGTACCACCCCAAGTTCCGCCTGATCCTACACACCAAGTACTTCAACCCACACTACAAGCCAGAGATGCAGGCTCAG TGCACCCTCATCAACTTCCTGGTCACCAGGGATGGACTCGAGGACCAACTCTTGGCCGCTGTGGTGGCCAAAGAGCGCCCAGATCTGGAA CAGCTGAAGGCAAACCTCACCAAGTCTCAAAACGAATTTAAGATTGTTCTGAAAGAGCTGGAAGATTCGCTCCTGGCCCGTCTGTCGGCT GCGTCGGGGAACTTTCTGGGAGACACGGCCTTGGTGGAGAATCTGGAGACCACCAAGCACACAGCCAGCGAGATCGAGGAGAAGGTGGTG GAGGCAAAAATCACAGAAGTTAAAATCAACGAAGCGAGAGAGAACTACCGCCCGGCTGCGGAGAGGGCATCTCTGCTCTACTTCATACTG AACGATCTCAACAAAATCAACCCCGTCTACCAGTTCTCCCTCAAGGCCTTCAACGTGGTGTTTGAGAAAGCCATCCAGAGGACCACCCCT GCCAACGAGGTGAAGCAGCGGGTGATCAACCTGACGGACGAGATCACCTACTCCGTCTACATGTACACGGCCCGGGGACTCTTCGAGAGG GACAAACTCATTTTCCTGGCACAAGTTACGTTTCAGGTCCTGTCCATGAAGAAGGAGCTGAACCCAGTGGAGCTGGATTTCCTCCTGCGG TTCCCTTTTAAGGCCGGAGTGGTCTCACCAGTGGACTTCCTCCAGCATCAAGGCTGGGGCGGGATCAAGGCCCTCTCGGAGATGGATGAG TTCAAAAATCTGGACAGTGACATCGAAGGATCTGCCAAGCGCTGGAAAAAGCTGGTGGAGTCGGAAGCCCCCGAGAAGGAGATCTTCCCC AAGGAGTGGAAGAACAAGACGGCCCTGCAGAAGCTGTGCATGGTGCGCTGCCTGCGGCCAGATCGCATGACCTACGCTATCAAGAACTTC GTGGAGGAAAAGATGGGCAGCAAGTTCGTGGAAGGCCGGAGTGTTGAGTTTTCTAAGTCCTACGAGGAGAGCAGCCCCTCCACGTCAATC TTCTTCATCCTCTCCCCGGGGGTTGACCCCTTGAAAGACGTGGAAGCCCTGGGAAAAAAACTAGGGTTTACCATAGACAATGGAAAACTC CATAATGTGTCCCTGGGGCAGGGACAAGAGGTGGTGGCTGAGAACGCCCTGGACGTGGCTGCAGAGAAAGGACACTGGGTCATTCTGCAG AATATCCACCTGGTGGCCCGGTGGCTGGGAACACTGGACAAGAAGCTGGAGCACTACAGCACGGGCAGCCATGAGGACTACCGGGTGTTC ATCAGCGCGGAGCCTGCCCCCAGCCCCGAGACCCACATCATCCCCCAGGGCATTCTGGAGAACGCCATCAAGATCACCAACGAGCCCCCC ACGGGCATGCACGCCAACTTGCACAAGGCCCTGGACCTGTTCACCCAGGACACCCTGGAGATGTGCACCAAGGAGATGGAGTTCAAGTGC ATGCTCTTCGCCCTGTGCTACTTCCACGCTGTGGTGGCAGAGAGGCGCAAGTTCGGCGCCCAGGGCTGGAACCGGTCGTACCCCTTCAAC AACGGGGACCTCACCATCTCCATCAACGTGCTCTACAACTACCTGGAGGCCAACCCCAAGGTGCCCTGGGACGATCTCCGCTACCTTTTT GGTGAAATCATGTATGGCGGCCACATCACAGATGACTGGGACCGTCGGCTGTGCAGGACCTACCTGGCTGAATACATCCGGACGGAGATG CTGGAGGGAGACGTCCTGCTGGCCCCCGGCTTTCAGATCCCCCCCAACCTGGACTACAAGGGTTACCACGAATACATCGATGAGAACCTG CCCCCTGAGAGTCCCTATCTGTATGGCCTGCACCCCAACGCAGAGATTGGCTTTCTGACGGTCACCTCAGAGAAGCTGTTCCGCACTGTC CTGGAAATGCAGCCAAAAGAGACGGACTCGGGGGCAGGCACGGGAGTGTCCCGCGAGGAGAAGGTGAAGGCCGTGCTGGACGACATCCTG GAGAAGATTCCGGAGACTTTCAACATGGCTGAGATCATGGCAAAGGCAGCGGAAAAGACCCCCTACGTGGTAGTCGCCTTTCAAGAATGT GAAAGAATGAACATCCTGACCAACGAAATGCGCCGTTCGCTCAAGGAGCTGAACCTGGGGCTGAAGGGAGAACTGACCATCACGACCGAC GTGGAAGATCTGTCCACGGCTCTCTTCTATGACACCGTGCCTGATACGTGGGTGGCCCGGGCCTACCCCTCCATGATGGGCCTGGCGGCC TGGTACGCAGACCTGCTGCTCCGCATCAGGGAACTCGAGGCCTGGACGACAGACTTTGCCCTGCCCACCACCGTGTGGCTGGCCGGCTTC TTCAACCCCCAGTCGTTCCTCACGGCCATCATGCAGTCCATGGCCAGGAAGAACGAGTGGCCCCTGGACAAGATGTGTCTGTCTGTCGAG GTGACCAAGAAAAACCGAGAGGACATGACCGCTCCTCCGCGAGAGGGCTCCTACGTGTACGGACTCTTCATGGAAGGGGCTCGCTGGGAC ACCCAGACTGGAGTCATCGCTGAAGCGCGGCTGAAAGAGCTGACCCCGGCCATGCCTGTCATCTTCATCAAGGCCATTCCTGTGGACCGC ATGGAGACCAAGAACATCTATGAGTGTCCCGTGTACAAAACACGCATCCGCGGCCCCACCTATGTCTGGACCTTTAACTTGAAGACCAAA GAGAAGGCAGCGAAGTGGATCCTGGCAGCCGTGGCGCTGCTCCTACAGGTTTAGCTCGCTCCTGCCTCACAGCCCACACTCCCTGGGGCT GGACCACAACTCAGCCCTTCACCTGTGCACCTGTGACTTATTCTTTACAGGAACTGGTGGTGGTTTTTCGTTCTCTTAAATAATCAGGTG >In-frame_ENST00000425042_ENST00000585328_TCGA-ED-A97K-01A_HID1_chr17_72968686_-_DNAH17_chr17_76506588_length(amino acids)=3135AA_start in transcript=6_stop in transcript=9413 MEPELKPEPGWSLGGGRAGAGSRDMGSTDSKLNFRKAVIQLTTKTQVKFKMSEETTLADLLQLNLHSYEDEVRNIVDKAVKESGMEKVLK ALDSTWSMMEFQHEPHPRTGTMMLKSSEVLVETLEDNQVQLQNLMMSKYLAHFLKEVTSWQQKLSTADSVISIWFEVQRTWSHLESIFIG SEDIRTQLPGDSQRFDDINQEFKALMEDAVKTPNVVEATSKPGLYNKLEALKKSLAICEKALAEYLETKRLAFPRFYFVSSADLLDILSN GNDPVEVSRHLSKLFDSLCKLKFRLDASDKPLKVGLGMYSKEDEYMVFDQECDLSGQVEVWLNRVLDRMCSTLRHEIPEAVVTYEEKPRE QWILDYPAQVALTCTQIWWTTEVGLAFARLEEGYENAIRDYNKKQISQLNVLITLLMGNLNAGDRMKIMTICTIDVHARDVVAKMIVVES SQAFTWQAQLRHRWDEEKRHCFANICDAQIQYSYEYLGNTPRLVITPLTDRCYITLTQSLHLIMGGAPAGPAGTGKTETTKDLGRALGTM VYVFNCSEQMDYKSCGNIYKGLAQTGAWGCFDEFNRISVEVLSVIAVQVKCVQDAIRAKKKAFNFLGEIIGLIPTVGIFITMNPGYAGRA ELPENLKALFRPCAMVVPDFELICEIMLMAEGFLEARLLARKFITLYTLCKELLSKQDHYDWGLRAIKSVLVVAGSLKRGDPSRAEDQVL MRALRDFNIPKIVTDDLPVFMGLIGDLFPALDVPRKRDLNFEKIIKQSIVELKLQAEDSFVLKVVQLEELLQVRHSVFIVGNAGSGKSQV LKSLNKTYQNLKRKPVAVDLDPKAVTCDELFGIINPVTREWKDGLFSTIMRDLANITHDGPKWIILDGDIDPMWIESLNTVMDDNKVLTL ASNERIPLNRTMRLVFEISHLRTATPATVSRAGILYINPADLGWNPVVSSWIERRKVQSEKANLMILFDKYLPTCLDKLRFGFKKITPVP EITVIQTILYLLECLLTEKTVPPDSPRELYELYFVFTCFWAFGGAMFQDQLVDYRVEFSKWWINEFKTIKFPSQGTIFDYYIDPDTKKFL PWTDKVPSFELDPDVPLQASLVHTTETIRIRYFMDLLMEKSWPVMLVGNAGTGKSVLMGDKLESLNTDNYLVQAVPFNFYTTSAMLQGVL EKPLEKKSGRNYGPPGTKKLVYFIDDMNMPEVDKYGTVAPHTLIRQHMDHRHWYDRHKLTLKDIHNCQYVACMNPTSGSFTIDSRLQRHF CVFAVSFPGQEALTTIYNTILTQHLAFRSVSMAIQRISSQLVAAALALHQKITATFLPTAIKFHYVFNLRDLSNIFQGLLFSTAEVLKTP LDLVRLWLHETERVYGDKMVDEKDQETLHRVTMASTKKFFDDLGDELLFAKPNIFCHFAQGIGDPKYVPVTDMAPLNKLLVDVLDSYNEV NAVMNLVLFEDAVAHICRINRILESPRGNALLVGVGGSGKQSLSRLAAYISGLDVFQITLKKGYGIPDLKIDLAAQYIKAAVKNVPSVFL MTDSQVAEEQFLVLINDLLASGEIPGLFMEDEVENIISSMRPQVKSLGMNDTRETCWKFFIEKVRRQLKVILCFSPVGSVLRVRARKFPA VVNCTAIDWFHEWPEDALVSVSARFLEETEGIPWEVKASISFFMSYVHTTVNEMSRVYLATERRYNYTTPKTFLEQIKLYQNLLAKKRTE LVAKIERLENGLMKLQSTASQVDDLKAKLAIQEAELKQKNESADQLIQVVGIEAEKVSKEKAIADQEEVKVEVINKNVTEKQKACETDLA KAEPALLAAQEALDTLNKNNLTELKSFGSPPDAVVNVTAAVMILTAPGGKIPKDKSWKAAKIMMGKVDTFLDSLKKFDKEHIPEACLKAF KPYQGNPTFDPEFIRSKSTAAAGLCSWCINIVRFYEVYCDVAPKRQALEEANAELAEAQEKLSRIKNKIAELNANLSNLTSAFEKATAEK IKCQQEADATNRVILLANRLVGGLASENIRWAESVENFRSQGVTLCGDVLLISAFVSYVGYFTKKYRNELMEKFWIPYIHNLKVPIPITN GLDPLSLLTDDADVATWNNQGLPSDRMSTENATILGNTERWPLIVDAQLQGIKWIKNKYRSELKAIRLGQKSYLDVIEQAISEGDTLLIE NIGETVDPVLDPLLGRNTIKKGKYIKIGDKEVEYHPKFRLILHTKYFNPHYKPEMQAQCTLINFLVTRDGLEDQLLAAVVAKERPDLEQL KANLTKSQNEFKIVLKELEDSLLARLSAASGNFLGDTALVENLETTKHTASEIEEKVVEAKITEVKINEARENYRPAAERASLLYFILND LNKINPVYQFSLKAFNVVFEKAIQRTTPANEVKQRVINLTDEITYSVYMYTARGLFERDKLIFLAQVTFQVLSMKKELNPVELDFLLRFP FKAGVVSPVDFLQHQGWGGIKALSEMDEFKNLDSDIEGSAKRWKKLVESEAPEKEIFPKEWKNKTALQKLCMVRCLRPDRMTYAIKNFVE EKMGSKFVEGRSVEFSKSYEESSPSTSIFFILSPGVDPLKDVEALGKKLGFTIDNGKLHNVSLGQGQEVVAENALDVAAEKGHWVILQNI HLVARWLGTLDKKLEHYSTGSHEDYRVFISAEPAPSPETHIIPQGILENAIKITNEPPTGMHANLHKALDLFTQDTLEMCTKEMEFKCML FALCYFHAVVAERRKFGAQGWNRSYPFNNGDLTISINVLYNYLEANPKVPWDDLRYLFGEIMYGGHITDDWDRRLCRTYLAEYIRTEMLE GDVLLAPGFQIPPNLDYKGYHEYIDENLPPESPYLYGLHPNAEIGFLTVTSEKLFRTVLEMQPKETDSGAGTGVSREEKVKAVLDDILEK IPETFNMAEIMAKAAEKTPYVVVAFQECERMNILTNEMRRSLKELNLGLKGELTITTDVEDLSTALFYDTVPDTWVARAYPSMMGLAAWY ADLLLRIRELEAWTTDFALPTTVWLAGFFNPQSFLTAIMQSMARKNEWPLDKMCLSVEVTKKNREDMTAPPREGSYVYGLFMEGARWDTQ -------------------------------------------------------------- |
Top |
Fusion Gene PPI Analysis for HID1-DNAH17 |
Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in |
Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160) |
Hgene | Hgene's interactors | Tgene | Tgene's interactors |
- Retained PPIs in in-frame fusion. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with |
- Lost PPIs in in-frame fusion. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with |
- Retained PPIs, but lost function due to frame-shift fusion. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with |
Top |
Related Drugs for HID1-DNAH17 |
Drugs targeting genes involved in this fusion gene. (DrugBank Version 5.1.8 2021-05-08) |
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status |
Top |
Related Diseases for HID1-DNAH17 |
Diseases associated with fusion partners. (DisGeNet 4.0) |
Partner | Gene | Disease ID | Disease name | # pubmeds | Source |