FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:HID1-DNAH17 (FusionGDB2 ID:36191)

Fusion Gene Summary for HID1-DNAH17

check button Fusion gene summary
Fusion gene informationFusion gene name: HID1-DNAH17
Fusion gene ID: 36191
HgeneTgene
Gene symbol

HID1

DNAH17

Gene ID

283987

8632

Gene nameHID1 domain containingdynein axonemal heavy chain 17
Synonyms17orf28|C17orf28|DMC1|HID-1DNAHL1|DNEL2|SPGF39
Cytomap

17q25.1

17q25.3

Type of geneprotein-codingprotein-coding
Descriptionprotein HID1HID1 domain-containing proteinUPF0663 transmembrane protein C17orf28down-regulated in multiple cancers 1downregulated in multiple cancer 1protein hid-1 homologdynein heavy chain 17, axonemalaxonemal beta dynein heavy chain 17axonemal dynein heavy chain-like protein 1ciliary dynein heavy chain 17ciliary dynein heavy chain-like protein 1dynein light chain 2, axonemaldynein, axonemal, heavy polypeptide 17
Modification date2020031320200320
UniProtAcc

Q8IV36

Q9UFH2

Ensembl transtripts involved in fusion geneENST00000425042, ENST00000532900, 
ENST00000389840, ENST00000585328, 
ENST00000586052, 
Fusion gene scores* DoF score5 X 3 X 5=7515 X 16 X 11=2640
# samples 519
** MAII scorelog2(5/75*10)=-0.584962500721156
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(19/2640*10)=-3.79646660591487
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: HID1 [Title/Abstract] AND DNAH17 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointHID1(72968686)-DNAH17(76506588), # samples:2
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID

check buttonFusion gene breakpoints across HID1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.

check buttonFusion gene breakpoints across DNAH17 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4LIHCTCGA-ED-A97K-01AHID1chr17

72968686

-DNAH17chr17

76506588

-
ChimerDB4LIHCTCGA-ED-A97K-01AHID1chr17

72968686

-DNAH17chr17

76506588

-


Top

Fusion Gene ORF analysis for HID1-DNAH17

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
In-frameENST00000425042ENST00000389840HID1chr17

72968686

-DNAH17chr17

76506588

-
In-frameENST00000425042ENST00000585328HID1chr17

72968686

-DNAH17chr17

76506588

-
5CDS-intronENST00000425042ENST00000586052HID1chr17

72968686

-DNAH17chr17

76506588

-
intron-3CDSENST00000532900ENST00000389840HID1chr17

72968686

-DNAH17chr17

76506588

-
intron-3CDSENST00000532900ENST00000585328HID1chr17

72968686

-DNAH17chr17

76506588

-
intron-intronENST00000532900ENST00000586052HID1chr17

72968686

-DNAH17chr17

76506588

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000425042HID1chr1772968686-ENST00000389840DNAH17chr1776506588-9710144695003164
ENST00000425042HID1chr1772968686-ENST00000585328DNAH17chr1776506588-9623144694133135

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000425042ENST00000389840HID1chr1772968686-DNAH17chr1776506588-0.0028455350.9971545
ENST00000425042ENST00000585328HID1chr1772968686-DNAH17chr1776506588-0.0031163980.9968836

Top

Fusion Genomic Features for HID1-DNAH17


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for HID1-DNAH17


check button Go to

FGviewer for the breakpoints of chr17:72968686-chr17:76506588

.
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
HID1

Q8IV36

DNAH17

Q9UFH2

FUNCTION: May play an important role in the development of cancers in a broad range of tissues. {ECO:0000269|PubMed:11281419}.FUNCTION: Force generating protein component of the outer dynein arms (ODAs) in the sperm flagellum. Produces force towards the minus ends of microtubules. Dynein has ATPase activity; the force-producing power stroke is thought to occur on release of ADP (Probable). Plays a major role in sperm motility, implicated in sperm flagellar assembly and beating (PubMed:31178125). {ECO:0000269|PubMed:31178125, ECO:0000305|PubMed:31178125}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025813027_30861367.04486.0Coiled coilOntology_term=ECO:0000255
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025813257_33091367.04486.0Coiled coilOntology_term=ECO:0000255
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025811847_18541367.04486.0Nucleotide bindingATP
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025812128_21351367.04486.0Nucleotide bindingATP
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025812455_24621367.04486.0Nucleotide bindingATP
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025812801_28081367.04486.0Nucleotide bindingATP
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025811809_20301367.04486.0RegionAAA 1
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025812090_23111367.04486.0RegionAAA 2
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025812417_26651367.04486.0RegionAAA 3
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025812763_30121367.04486.0RegionAAA 4
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025813027_33131367.04486.0RegionStalk
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025813405_36321367.04486.0RegionAAA 5
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025813842_40681367.04486.0RegionAAA 6
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025811702_17361367.04486.0RepeatNote=TPR 2
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025814147_41821367.04486.0RepeatNote=TPR 3

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025811_18081367.04486.0RegionStem
TgeneDNAH17chr17:72968686chr17:76506588ENST0000038984025811019_10521367.04486.0RepeatNote=TPR 1


Top

Fusion Gene Sequence for HID1-DNAH17


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>In-frame_ENST00000425042_ENST00000389840_TCGA-ED-A97K-01A_HID1_chr17_72968686_-_DNAH17_chr17_76506588_length(transcript)=9710nt_BP=144nt
GCGGAGCTGGAGCCGGAGCTGAAGCCGGAGCCGGGTTGGAGTCTGGGCGGGGGCCGGGCCGGAGCGGGCTCCAGAGACATGGGGTCGACC
GACTCCAAGCTGAACTTCCGGAAGGCGGTGATCCAGCTCACCACCAAGACGCAGGTGAAATTTAAAATGTCAGAAGAGACGACCCTGGCA
GATTTACTGCAGCTGAACCTCCACAGTTACGAGGATGAGGTCCGCAACATCGTGGACAAGGCCGTGAAGGAGTCGGGCATGGAAAAGGTG
CTGAAAGCCCTGGACAGTACCTGGAGCATGATGGAATTCCAGCACGAGCCGCACCCGCGGACAGGCACCATGATGCTCAAGTCCAGCGAG
GTGCTGGTGGAGACGCTGGAGGACAACCAGGTGCAGCTGCAGAACCTGATGATGTCCAAGTACCTGGCCCACTTCCTGAAGGAGGTGACA
AGCTGGCAGCAGAAGCTGTCCACGGCGGACTCCGTCATCTCCATCTGGTTTGAGGTCCAGCGAACCTGGAGCCACCTGGAGAGCATCTTC
ATCGGCTCCGAAGACATCCGCACCCAGCTCCCGGGGGACTCCCAGCGCTTTGACGACATCAACCAGGAATTCAAGGCCTTGATGGAAGAT
GCAGTGAAAACACCCAACGTGGTGGAAGCCACCAGCAAACCCGGCCTCTACAATAAACTGGAGGCCCTGAAGAAGAGCTTGGCCATCTGT
GAAAAGGCTTTGGCAGAGTATTTAGAGACGAAAAGACTGGCTTTCCCCCGGTTCTATTTTGTCTCCTCGGCTGACCTCCTGGACATTCTC
TCCAATGGCAATGACCCCGTGGAGGTGAGCCGCCACCTGTCCAAACTCTTCGATAGCCTGTGTAAACTGAAGTTCCGGCTCGATGCCAGT
GACAAACCTCTCAAGGTGGGCCTGGGAATGTACAGCAAGGAGGACGAGTACATGGTTTTTGATCAGGAATGCGACCTCTCGGGGCAGGTG
GAAGTGTGGCTGAATCGAGTGCTGGACCGAATGTGCTCTACCCTCCGGCACGAAATCCCAGAGGCCGTGGTGACCTACGAAGAGAAGCCG
AGGGAGCAGTGGATCCTGGACTACCCAGCCCAGATCTGGTGGACGACCGAGGTGGGCCTGGCATTTGCCAGGCTGGAGGAAGGCTATGAA
AACGCTATCAGAGATTATAACAAAAAGCAGATTAGCCAGCTGAACGTACTCATCACGCTGCTCATGGGGAACCTCAACGCTGGCGACAGG
ATGAAGATCATGACCATCTGCACCATCGATGTGCACGCACGGGACGTGGTGGCCAAAATGATCGTGGAGAGTTCTCAGGCCTTCACCTGG
CAGGCCCAGCTCCGGCATCGCTGGGACGAAGAGAAGCGACACTGCTTTGCCAACATCTGCGATGCCCAAATCCAGTATTCCTATGAGTAT
CTGGGCAACACGCCGCGGCTGGTCATCACCCCACTCACTGACAGGTGCTATATCACCCTGACCCAGTCCCTCCATCTCATCATGGGTGGA
GCCCCTGCCGGCCCCGCTGGGACCGGCAAGACTGAGACGACCAAGGACCTGGGCAGAGCCCTGGGCACCATGGTCTACGTCTTCAACTGC
TCCGAGCAGATGGACTACAAGTCCTGTGGAAATATCTACAAGGGCCTGGCCCAGACGGGAGCCTGGGGCTGCTTTGACGAGTTTAATCGC
ATCTCAGTGGAAGTCTTGTCTGTGATTGCCGTGCAGGTAAAATGTGTCCAGGATGCAATTCGGGCCAAGAAAAAAGCATTCAATTTCCTG
GGAGAGATCATAGGCCTCATTCCCACCGTCGGTATCTTCATCACCATGAACCCTGGGTACGCCGGACGCGCGGAGCTGCCTGAGAACCTA
AAAGCCTTATTCAGGCCCTGTGCCATGGTCGTCCCCGACTTCGAACTGATATGTGAGATCATGCTCATGGCCGAGGGCTTTCTGGAAGCC
CGCCTTCTGGCCAGGAAGTTCATCACCCTGTACACCTTGTGCAAGGAGCTGCTCTCGAAGCAGGATCATTACGACTGGGGCCTGAGAGCC
ATCAAGTCTGTGCTGGTGGTGGCCGGCTCCCTGAAGAGGGGCGACCCCAGCCGGGCAGAGGACCAGGTGCTCATGCGGGCGCTGAGAGAC
TTCAACATCCCCAAGATTGTGACAGACGACCTGCCCGTATTCATGGGACTGATCGGGGACCTCTTCCCGGCTCTGGACGTGCCTCGGAAA
CGGGACCTGAATTTTGAAAAGATCATCAAGCAGAGCATCGTGGAGCTCAAGCTGCAGGCGGAGGACAGCTTCGTGCTGAAGGTGGTGCAG
CTGGAGGAGCTGCTGCAGGTCCGCCACTCCGTGTTCATCGTCGGGAATGCGGGCAGCGGCAAATCTCAGGTCCTCAAATCCCTCAACAAG
ACCTATCAGAACCTGAAGAGGAAGCCGGTCGCCGTGGACCTGGACCCCAAGGCCGTCACCTGCGACGAGCTCTTTGGCATCATCAACCCA
GTGACCAGGGAATGGAAAGATGGCCTGTTCTCCACCATCATGCGAGACCTGGCCAACATCACCCATGACGGCCCCAAGTGGATCATCCTT
GACGGAGACATAGACCCCATGTGGATCGAGTCTCTCAACACAGTCATGGATGACAACAAGGTCCTCACCCTGGCCAGCAACGAGCGGATC
CCCCTGAACCGCACCATGAGGCTGGTGTTCGAAATCAGCCACCTGAGGACGGCCACCCCAGCCACCGTTTCCAGAGCCGGCATCCTCTAC
ATCAACCCAGCCGACCTGGGATGGAACCCGGTGGTGAGCAGCTGGATCGAGAGGCGCAAGGTGCAGTCGGAGAAGGCCAACCTGATGATC
CTCTTTGACAAGTACCTGCCCACGTGCCTGGACAAGTTGCGCTTTGGGTTCAAGAAGATCACGCCAGTGCCGGAGATCACGGTGATCCAA
ACGATTCTGTACCTGCTGGAGTGCCTGCTCACGGAGAAGACCGTGCCCCCCGACTCCCCCAGGGAGCTGTACGAGCTGTACTTCGTGTTC
ACCTGCTTCTGGGCCTTCGGTGGCGCCATGTTCCAGGACCAGCTTGTGGATTATCGAGTGGAGTTCAGTAAATGGTGGATCAACGAATTC
AAGACTATCAAGTTCCCCTCGCAGGGAACGATTTTTGACTACTACATTGATCCTGACACAAAAAAGTTCCTGCCCTGGACAGATAAAGTG
CCCTCCTTTGAGCTGGATCCCGATGTCCCACTGCAGGCCTCTTTGGTCCACACCACGGAAACCATCCGCATCCGCTACTTCATGGACCTG
CTCATGGAGAAGTCCTGGCCGGTGATGCTGGTGGGGAACGCGGGGACGGGCAAGTCGGTGCTGATGGGGGACAAGCTGGAAAGCCTGAAC
ACGGACAACTACCTGGTGCAGGCTGTGCCCTTCAACTTCTACACGACCTCAGCCATGCTGCAGGGGGTGCTGGAGAAGCCGCTGGAGAAG
AAATCGGGGAGGAACTACGGGCCGCCAGGCACTAAGAAGCTCGTCTACTTCATCGACGACATGAACATGCCCGAGGTGGACAAGTATGGG
ACGGTGGCCCCGCACACCCTCATCCGGCAGCACATGGACCACCGGCACTGGTATGACAGACATAAGCTGACGTTAAAAGATATCCATAAT
TGTCAGTACGTGGCCTGCATGAACCCCACTTCCGGATCCTTCACCATCGACTCCAGGCTTCAGCGCCATTTCTGCGTGTTTGCTGTGAGC
TTCCCCGGCCAGGAGGCCCTCACCACCATCTACAACACAATCCTGACGCAGCACCTGGCCTTCCGCTCGGTCTCCATGGCTATCCAGAGG
ATAAGCAGCCAGCTGGTGGCCGCGGCCCTGGCTTTGCATCAGAAAATCACGGCAACATTTCTTCCCACGGCCATTAAGTTTCATTATGTC
TTCAACCTCAGGGACCTCTCCAATATTTTCCAGGGACTCTTATTTTCCACAGCAGAAGTTCTGAAAACCCCACTGGACCTCGTCCGCCTT
TGGCTACATGAGACTGAACGAGTGTATGGTGACAAAATGGTTGACGAAAAAGACCAGGAAACATTGCATAGAGTCACCATGGCCTCCACC
AAGAAGTTCTTTGATGATCTTGGTGATGAACTCTTATTTGCCAAGCCAAATATCTTCTGCCACTTTGCTCAAGGGATTGGCGATCCCAAA
TATGTTCCTGTAACCGACATGGCTCCTCTGAACAAGCTCCTCGTGGACGTCCTGGACAGCTACAATGAAGTTAATGCAGTCATGAATTTG
GTGCTGTTTGAGGACGCCGTGGCTCACATCTGCAGGATTAATCGCATCCTGGAGTCTCCCCGGGGGAATGCCCTGCTGGTGGGGGTGGGC
GGCAGTGGCAAACAGAGCCTCTCCCGCCTGGCAGCGTACATCAGCGGGCTTGACGTGTTTCAGATCACCCTCAAGAAGGGCTACGGGATC
CCCGACCTCAAGATTGACCTCGCTGCTCAGTACATAAAGGCTGCCGTGAAGAACGTTCCCTCGGTGTTCCTGATGACAGACTCCCAGGTG
GCCGAGGAGCAGTTTCTGGTGCTGATCAATGACCTGCTGGCCTCAGGAGAGATCCCTGGGCTGTTTATGGAGGACGAGGTGGAGAACATC
ATCTCCTCCATGCGACCCCAAGTCAAGTCCCTTGGCATGAATGACACTCGGGAAACATGTTGGAAGTTCTTCATCGAAAAAGTGCGCAGA
CAGCTCAAGGTGATCCTGTGTTTCTCCCCTGTGGGCTCCGTGCTGCGGGTACGAGCCAGAAAGTTCCCAGCTGTGGTCAACTGCACGGCC
ATCGACTGGTTCCACGAGTGGCCGGAAGATGCGCTGGTGTCCGTCAGCGCCCGCTTCCTGGAGGAGACTGAGGGGATTCCGTGGGAAGTC
AAGGCCTCCATCAGCTTCTTCATGTCCTACGTGCACACCACCGTCAACGAGATGTCCAGGGTATACCTGGCTACTGAGAGGCGCTACAAC
TACACCACACCCAAAACCTTTCTGGAGCAGATCAAACTGTACCAGAACCTGCTGGCCAAGAAGAGAACGGAACTTGTTGCCAAAATCGAG
AGGCTGGAGAACGGCCTGATGAAGCTGCAGAGCACGGCTTCCCAGGTGGATGATTTGAAAGCCAAGTTGGCGATTCAGGAGGCTGAGCTC
AAGCAGAAGAATGAGAGCGCAGACCAACTGATCCAGGTGGTCGGCATCGAGGCCGAGAAGGTCAGCAAAGAGAAGGCCATTGCTGACCAG
GAAGAAGTCAAGGTCGAGGTCATCAATAAGAACGTCACTGAGAAGCAAAAGGCCTGTGAAACAGACCTGGCCAAAGCAGAACCGGCCCTG
CTGGCAGCCCAGGAGGCTCTGGACACTCTGAATAAGAACAACCTGACAGAGCTGAAGTCCTTTGGGTCCCCGCCGGATGCTGTGGTCAAC
GTCACCGCCGCCGTCATGATTCTGACCGCACCTGGGGGCAAGATCCCCAAGGACAAGAGCTGGAAGGCGGCCAAGATCATGATGGGCAAG
GTGGACACCTTCCTAGACTCCCTGAAGAAGTTCGACAAGGAGCACATCCCTGAGGCCTGCCTGAAGGCCTTCAAGCCCTACCAAGGCAAC
CCGACGTTCGACCCCGAGTTCATCCGCTCCAAGTCCACGGCCGCCGCCGGCCTGTGCTCCTGGTGCATCAACATCGTCCGCTTCTACGAG
GTCTACTGCGACGTGGCGCCCAAGAGGCAGGCACTGGAGGAGGCTAATGCAGAGCTGGCAGAGGCACAAGAGAAGCTGTCCCGGATCAAA
AACAAGATTGCCGAACTTAACGCCAACCTGAGCAACCTAACCTCAGCGTTTGAAAAAGCAACAGCTGAGAAAATCAAGTGTCAGCAAGAG
GCCGATGCCACGAACAGGGTGATCTTACTGGCGAACAGGCTGGTCGGGGGATTAGCATCGGAAAACATCCGCTGGGCTGAGTCTGTGGAG
AACTTCAGGAGCCAGGGGGTCACGCTGTGTGGGGACGTCCTGCTCATCTCTGCCTTCGTGTCCTACGTGGGCTACTTCACCAAGAAATAC
CGGAATGAGCTGATGGAGAAATTCTGGATCCCTTACATACATAACTTAAAGGTCCCCATCCCGATCACGAATGGCCTGGATCCCTTGAGC
CTGCTGACAGATGACGCGGACGTGGCCACCTGGAACAACCAGGGCCTCCCCAGCGACCGCATGTCCACCGAGAATGCCACCATCCTGGGC
AACACCGAGCGGTGGCCGCTGATCGTGGACGCCCAGCTCCAAGGAATCAAGTGGATCAAAAACAAATACAGGAGTGAACTGAAAGCCATC
CGCCTGGGACAGAAGAGCTACCTGGATGTCATCGAGCAGGCCATCTCGGAAGGGGACACCTTGCTCATTGAGAACATCGGCGAAACCGTG
GACCCCGTGCTGGACCCTCTACTGGGCAGGAACACGATTAAAAAGGGAAAGTACATTAAGATCGGTGACAAGGAGGTGGAGTACCACCCC
AAGTTCCGCCTGATCCTACACACCAAGTACTTCAACCCACACTACAAGCCAGAGATGCAGGCTCAGTGCACCCTCATCAACTTCCTGGTC
ACCAGGGATGGACTCGAGGACCAACTCTTGGCCGCTGTGGTGGCCAAAGAGCGCCCAGATCTGGAACAGCTGAAGGCAAACCTCACCAAG
TCTCAAAACGAATTTAAGATTGTTCTGAAAGAGCTGGAAGATTCGCTCCTGGCCCGTCTGTCGGCTGCGTCGGGGAACTTTCTGGGAGAC
ACGGCCTTGGTGGAGAATCTGGAGACCACCAAGCACACAGCCAGCGAGATCGAGGAGAAGGTGGTGGAGGCAAAAATCACAGAAGTTAAA
ATCAACGAAGCGAGAGAGAACTACCGCCCGGCTGCGGAGAGGGCATCTCTGCTCTACTTCATACTGAACGATCTCAACAAAATCAACCCC
GTCTACCAGTTCTCCCTCAAGGCCTTCAACGTGGTGTTTGAGAAAGCCATCCAGAGGACCACCCCTGCCAACGAGGTGAAGCAGCGGGTG
ATCAACCTGACGGACGAGATCACCTACTCCGTCTACATGTACACGGCCCGGGGACTCTTCGAGAGGGACAAACTCATTTTCCTGGCACAA
GTTACGTTTCAGGTCCTGTCCATGAAGAAGGAGCTGAACCCAGTGGAGCTGGATTTCCTCCTGCGGTTCCCTTTTAAGGCCGGAGTGGTC
TCACCAGTGGACTTCCTCCAGCATCAAGGCTGGGGCGGGATCAAGGCCCTCTCGGAGATGGATGAGTTCAAAAATCTGGACAGTGACATC
GAAGGATCTGCCAAGCGCTGGAAAAAGCTGGTGGAGTCGGAAGCCCCCGAGAAGGAGATCTTCCCCAAGGAGTGGAAGAACAAGACGGCC
CTGCAGAAGCTGTGCATGGTGCGCTGCCTGCGGCCAGATCGCATGACCTACGCTATCAAGAACTTCGTGGAGGAAAAGATGGGCAGCAAG
TTCGTGGAAGGCCGGAGTGTTGAGTTTTCTAAGTCCTACGAGGAGAGCAGCCCCTCCACGTCAATCTTCTTCATCCTCTCCCCGGGGGTT
GACCCCTTGAAAGACGTGGAAGCCCTGGGAAAAAAACTAGGGTTTACCATAGACAATGGAAAACTCCATAATGTGTCCCTGGGGCAGGGA
CAAGAGGTGGTGGCTGAGAACGCCCTGGACGTGGCTGCAGAGAAAGGACACTGGGTCATTCTGCAGGTACGAGGGGGCCAGCACTGCAGG
AATATCCACCTGGTGGCCCGGTGGCTGGGAACACTGGACAAGAAGCTGGAGCACTACAGCACGGGCAGCCATGAGGACTACCGGGTGTTC
ATCAGCGCGGAGCCTGCCCCCAGCCCCGAGACCCACATCATCCCCCAGGGCATTCTGGAGAACGCCATCAAGATCACCAACGAGCCCCCC
ACGGGCATGCACGCCAACTTGCACAAGGCCCTGGACCTGTTCACCCAGGACACCCTGGAGATGTGCACCAAGGAGATGGAGTTCAAGTGC
ATGCTCTTCGCCCTGTGCTACTTCCACGCTGTGGTGGCAGAGAGGCGCAAGTTCGGCGCCCAGGGCTGGAACCGGTCGTACCCCTTCAAC
AACGGGGACCTCACCATCTCCATCAACGTGCTCTACAACTACCTGGAGGCCAACCCCAAGGTGCCCTGGGACGATCTCCGCTACCTTTTT
GGTGAAATCATGTATGGCGGCCACATCACAGATGACTGGGACCGTCGGCTGTGCAGGACCTACCTGGCTGAATACATCCGGACGGAGATG
CTGGAGGGAGACGTCCTGCTGGCCCCCGGCTTTCAGATCCCCCCCAACCTGGACTACAAGGGTTACCACGAATACATCGATGAGAACCTG
CCCCCTGAGAGTCCCTATCTGTATGGCCTGCACCCCAACGCAGAGATTGGCTTTCTGACGGTCACCTCAGAGAAGCTGTTCCGCACTGTC
CTGGAAATGCAGCCAAAAGAGACGGACTCGGGGGCAGGCACGGGAGTGTCCCGCGAGGAGAAGGCAGGATCTTTGAAACTGCTCCCAAGC
GAGAGGAAGGGGGAGGATCTAGAACTGAGGAGGGGGGGCTGTCCGGGGACTGGCTTCCAGGTGAAGGCCGTGCTGGACGACATCCTGGAG
AAGATTCCGGAGACTTTCAACATGGCTGAGATCATGGCAAAGGCAGCGGAAAAGACCCCCTACGTGGTAGTCGCCTTTCAAGAATGTGAA
AGAATGAACATCCTGACCAACGAAATGCGCCGTTCGCTCAAGGAGCTGAACCTGGGGCTGAAGGGAGAACTGACCATCACGACCGACGTG
GAAGATCTGTCCACGGCTCTCTTCTATGACACCGTGCCTGATACGTGGGTGGCCCGGGCCTACCCCTCCATGATGGGCCTGGCGGCCTGG
TACGCAGACCTGCTGCTCCGCATCAGGGAACTCGAGGCCTGGACGACAGACTTTGCCCTGCCCACCACCGTGTGGCTGGCCGGCTTCTTC
AACCCCCAGTCGTTCCTCACGGCCATCATGCAGTCCATGGCCAGGAAGAACGAGTGGCCCCTGGACAAGATGTGTCTGTCTGTCGAGGTG
ACCAAGAAAAACCGAGAGGACATGACCGCTCCTCCGCGAGAGGGCTCCTACGTGTACGGACTCTTCATGGAAGGGGCTCGCTGGGACACC
CAGACTGGAGTCATCGCTGAAGCGCGGCTGAAAGAGCTGACCCCGGCCATGCCTGTCATCTTCATCAAGGCCATTCCTGTGGACCGCATG
GAGACCAAGAACATCTATGAGTGTCCCGTGTACAAAACACGCATCCGCGGCCCCACCTATGTCTGGACCTTTAACTTGAAGACCAAAGAG
AAGGCAGCGAAGTGGATCCTGGCAGCCGTGGCGCTGCTCCTACAGGTTTAGCTCGCTCCTGCCTCACAGCCCACACTCCCTGGGGCTGGA
CCACAACTCAGCCCTTCACCTGTGCACCTGTGACTTATTCTTTACAGGAACTGGTGGTGGTTTTTCGTTCTCTTAAATAATCAGGTGCTT

>In-frame_ENST00000425042_ENST00000389840_TCGA-ED-A97K-01A_HID1_chr17_72968686_-_DNAH17_chr17_76506588_length(amino acids)=3164AA_start in transcript=6_stop in transcript=9500
MEPELKPEPGWSLGGGRAGAGSRDMGSTDSKLNFRKAVIQLTTKTQVKFKMSEETTLADLLQLNLHSYEDEVRNIVDKAVKESGMEKVLK
ALDSTWSMMEFQHEPHPRTGTMMLKSSEVLVETLEDNQVQLQNLMMSKYLAHFLKEVTSWQQKLSTADSVISIWFEVQRTWSHLESIFIG
SEDIRTQLPGDSQRFDDINQEFKALMEDAVKTPNVVEATSKPGLYNKLEALKKSLAICEKALAEYLETKRLAFPRFYFVSSADLLDILSN
GNDPVEVSRHLSKLFDSLCKLKFRLDASDKPLKVGLGMYSKEDEYMVFDQECDLSGQVEVWLNRVLDRMCSTLRHEIPEAVVTYEEKPRE
QWILDYPAQIWWTTEVGLAFARLEEGYENAIRDYNKKQISQLNVLITLLMGNLNAGDRMKIMTICTIDVHARDVVAKMIVESSQAFTWQA
QLRHRWDEEKRHCFANICDAQIQYSYEYLGNTPRLVITPLTDRCYITLTQSLHLIMGGAPAGPAGTGKTETTKDLGRALGTMVYVFNCSE
QMDYKSCGNIYKGLAQTGAWGCFDEFNRISVEVLSVIAVQVKCVQDAIRAKKKAFNFLGEIIGLIPTVGIFITMNPGYAGRAELPENLKA
LFRPCAMVVPDFELICEIMLMAEGFLEARLLARKFITLYTLCKELLSKQDHYDWGLRAIKSVLVVAGSLKRGDPSRAEDQVLMRALRDFN
IPKIVTDDLPVFMGLIGDLFPALDVPRKRDLNFEKIIKQSIVELKLQAEDSFVLKVVQLEELLQVRHSVFIVGNAGSGKSQVLKSLNKTY
QNLKRKPVAVDLDPKAVTCDELFGIINPVTREWKDGLFSTIMRDLANITHDGPKWIILDGDIDPMWIESLNTVMDDNKVLTLASNERIPL
NRTMRLVFEISHLRTATPATVSRAGILYINPADLGWNPVVSSWIERRKVQSEKANLMILFDKYLPTCLDKLRFGFKKITPVPEITVIQTI
LYLLECLLTEKTVPPDSPRELYELYFVFTCFWAFGGAMFQDQLVDYRVEFSKWWINEFKTIKFPSQGTIFDYYIDPDTKKFLPWTDKVPS
FELDPDVPLQASLVHTTETIRIRYFMDLLMEKSWPVMLVGNAGTGKSVLMGDKLESLNTDNYLVQAVPFNFYTTSAMLQGVLEKPLEKKS
GRNYGPPGTKKLVYFIDDMNMPEVDKYGTVAPHTLIRQHMDHRHWYDRHKLTLKDIHNCQYVACMNPTSGSFTIDSRLQRHFCVFAVSFP
GQEALTTIYNTILTQHLAFRSVSMAIQRISSQLVAAALALHQKITATFLPTAIKFHYVFNLRDLSNIFQGLLFSTAEVLKTPLDLVRLWL
HETERVYGDKMVDEKDQETLHRVTMASTKKFFDDLGDELLFAKPNIFCHFAQGIGDPKYVPVTDMAPLNKLLVDVLDSYNEVNAVMNLVL
FEDAVAHICRINRILESPRGNALLVGVGGSGKQSLSRLAAYISGLDVFQITLKKGYGIPDLKIDLAAQYIKAAVKNVPSVFLMTDSQVAE
EQFLVLINDLLASGEIPGLFMEDEVENIISSMRPQVKSLGMNDTRETCWKFFIEKVRRQLKVILCFSPVGSVLRVRARKFPAVVNCTAID
WFHEWPEDALVSVSARFLEETEGIPWEVKASISFFMSYVHTTVNEMSRVYLATERRYNYTTPKTFLEQIKLYQNLLAKKRTELVAKIERL
ENGLMKLQSTASQVDDLKAKLAIQEAELKQKNESADQLIQVVGIEAEKVSKEKAIADQEEVKVEVINKNVTEKQKACETDLAKAEPALLA
AQEALDTLNKNNLTELKSFGSPPDAVVNVTAAVMILTAPGGKIPKDKSWKAAKIMMGKVDTFLDSLKKFDKEHIPEACLKAFKPYQGNPT
FDPEFIRSKSTAAAGLCSWCINIVRFYEVYCDVAPKRQALEEANAELAEAQEKLSRIKNKIAELNANLSNLTSAFEKATAEKIKCQQEAD
ATNRVILLANRLVGGLASENIRWAESVENFRSQGVTLCGDVLLISAFVSYVGYFTKKYRNELMEKFWIPYIHNLKVPIPITNGLDPLSLL
TDDADVATWNNQGLPSDRMSTENATILGNTERWPLIVDAQLQGIKWIKNKYRSELKAIRLGQKSYLDVIEQAISEGDTLLIENIGETVDP
VLDPLLGRNTIKKGKYIKIGDKEVEYHPKFRLILHTKYFNPHYKPEMQAQCTLINFLVTRDGLEDQLLAAVVAKERPDLEQLKANLTKSQ
NEFKIVLKELEDSLLARLSAASGNFLGDTALVENLETTKHTASEIEEKVVEAKITEVKINEARENYRPAAERASLLYFILNDLNKINPVY
QFSLKAFNVVFEKAIQRTTPANEVKQRVINLTDEITYSVYMYTARGLFERDKLIFLAQVTFQVLSMKKELNPVELDFLLRFPFKAGVVSP
VDFLQHQGWGGIKALSEMDEFKNLDSDIEGSAKRWKKLVESEAPEKEIFPKEWKNKTALQKLCMVRCLRPDRMTYAIKNFVEEKMGSKFV
EGRSVEFSKSYEESSPSTSIFFILSPGVDPLKDVEALGKKLGFTIDNGKLHNVSLGQGQEVVAENALDVAAEKGHWVILQVRGGQHCRNI
HLVARWLGTLDKKLEHYSTGSHEDYRVFISAEPAPSPETHIIPQGILENAIKITNEPPTGMHANLHKALDLFTQDTLEMCTKEMEFKCML
FALCYFHAVVAERRKFGAQGWNRSYPFNNGDLTISINVLYNYLEANPKVPWDDLRYLFGEIMYGGHITDDWDRRLCRTYLAEYIRTEMLE
GDVLLAPGFQIPPNLDYKGYHEYIDENLPPESPYLYGLHPNAEIGFLTVTSEKLFRTVLEMQPKETDSGAGTGVSREEKAGSLKLLPSER
KGEDLELRRGGCPGTGFQVKAVLDDILEKIPETFNMAEIMAKAAEKTPYVVVAFQECERMNILTNEMRRSLKELNLGLKGELTITTDVED
LSTALFYDTVPDTWVARAYPSMMGLAAWYADLLLRIRELEAWTTDFALPTTVWLAGFFNPQSFLTAIMQSMARKNEWPLDKMCLSVEVTK
KNREDMTAPPREGSYVYGLFMEGARWDTQTGVIAEARLKELTPAMPVIFIKAIPVDRMETKNIYECPVYKTRIRGPTYVWTFNLKTKEKA

--------------------------------------------------------------
>In-frame_ENST00000425042_ENST00000585328_TCGA-ED-A97K-01A_HID1_chr17_72968686_-_DNAH17_chr17_76506588_length(transcript)=9623nt_BP=144nt
GCGGAGCTGGAGCCGGAGCTGAAGCCGGAGCCGGGTTGGAGTCTGGGCGGGGGCCGGGCCGGAGCGGGCTCCAGAGACATGGGGTCGACC
GACTCCAAGCTGAACTTCCGGAAGGCGGTGATCCAGCTCACCACCAAGACGCAGGTGAAATTTAAAATGTCAGAAGAGACGACCCTGGCA
GATTTACTGCAGCTGAACCTCCACAGTTACGAGGATGAGGTCCGCAACATCGTGGACAAGGCCGTGAAGGAGTCGGGCATGGAAAAGGTG
CTGAAAGCCCTGGACAGTACCTGGAGCATGATGGAATTCCAGCACGAGCCGCACCCGCGGACAGGCACCATGATGCTCAAGTCCAGCGAG
GTGCTGGTGGAGACGCTGGAGGACAACCAGGTGCAGCTGCAGAACCTGATGATGTCCAAGTACCTGGCCCACTTCCTGAAGGAGGTGACA
AGCTGGCAGCAGAAGCTGTCCACGGCGGACTCCGTCATCTCCATCTGGTTTGAGGTCCAGCGAACCTGGAGCCACCTGGAGAGCATCTTC
ATCGGCTCCGAAGACATCCGCACCCAGCTCCCGGGGGACTCCCAGCGCTTTGACGACATCAACCAGGAATTCAAGGCCTTGATGGAAGAT
GCAGTGAAAACACCCAACGTGGTGGAAGCCACCAGCAAACCCGGCCTCTACAATAAACTGGAGGCCCTGAAGAAGAGCTTGGCCATCTGT
GAAAAGGCTTTGGCAGAGTATTTAGAGACGAAAAGACTGGCTTTCCCCCGGTTCTATTTTGTCTCCTCGGCTGACCTCCTGGACATTCTC
TCCAATGGCAATGACCCCGTGGAGGTGAGCCGCCACCTGTCCAAACTCTTCGATAGCCTGTGTAAACTGAAGTTCCGGCTCGATGCCAGT
GACAAACCTCTCAAGGTGGGCCTGGGAATGTACAGCAAGGAGGACGAGTACATGGTTTTTGATCAGGAATGCGACCTCTCGGGGCAGGTG
GAAGTGTGGCTGAATCGAGTGCTGGACCGAATGTGCTCTACCCTCCGGCACGAAATCCCAGAGGCCGTGGTGACCTACGAAGAGAAGCCG
AGGGAGCAGTGGATCCTGGACTACCCAGCCCAGGTGGCCCTGACTTGCACCCAGATCTGGTGGACGACCGAGGTGGGCCTGGCATTTGCC
AGGCTGGAGGAAGGCTATGAAAACGCTATCAGAGATTATAACAAAAAGCAGATTAGCCAGCTGAACGTACTCATCACGCTGCTCATGGGG
AACCTCAACGCTGGCGACAGGATGAAGATCATGACCATCTGCACCATCGATGTGCACGCACGGGACGTGGTGGCCAAAATGATCGTGGTG
GAGAGTTCTCAGGCCTTCACCTGGCAGGCCCAGCTCCGGCATCGCTGGGACGAAGAGAAGCGACACTGCTTTGCCAACATCTGCGATGCC
CAAATCCAGTATTCCTATGAGTATCTGGGCAACACGCCGCGGCTGGTCATCACCCCACTCACTGACAGGTGCTATATCACCCTGACCCAG
TCCCTCCATCTCATCATGGGTGGAGCCCCTGCCGGCCCCGCTGGGACCGGCAAGACTGAGACGACCAAGGACCTGGGCAGAGCCCTGGGC
ACCATGGTCTACGTCTTCAACTGCTCCGAGCAGATGGACTACAAGTCCTGTGGAAATATCTACAAGGGCCTGGCCCAGACGGGAGCCTGG
GGCTGCTTTGACGAGTTTAATCGCATCTCAGTGGAAGTCTTGTCTGTGATTGCCGTGCAGGTAAAATGTGTCCAGGATGCAATTCGGGCC
AAGAAAAAAGCATTCAATTTCCTGGGAGAGATCATAGGCCTCATTCCCACCGTCGGTATCTTCATCACCATGAACCCTGGGTACGCCGGA
CGCGCGGAGCTGCCTGAGAACCTAAAAGCCTTATTCAGGCCCTGTGCCATGGTCGTCCCCGACTTCGAACTGATATGTGAGATCATGCTC
ATGGCCGAGGGCTTTCTGGAAGCCCGCCTTCTGGCCAGGAAGTTCATCACCCTGTACACCTTGTGCAAGGAGCTGCTCTCGAAGCAGGAT
CATTACGACTGGGGCCTGAGAGCCATCAAGTCTGTGCTGGTGGTGGCCGGCTCCCTGAAGAGGGGCGACCCCAGCCGGGCAGAGGACCAG
GTGCTCATGCGGGCGCTGAGAGACTTCAACATCCCCAAGATTGTGACAGACGACCTGCCCGTATTCATGGGACTGATCGGGGACCTCTTC
CCGGCTCTGGACGTGCCTCGGAAACGGGACCTGAATTTTGAAAAGATCATCAAGCAGAGCATCGTGGAGCTCAAGCTGCAGGCGGAGGAC
AGCTTCGTGCTGAAGGTGGTGCAGCTGGAGGAGCTGCTGCAGGTCCGCCACTCCGTGTTCATCGTCGGGAATGCGGGCAGCGGCAAATCT
CAGGTCCTCAAATCCCTCAACAAGACCTATCAGAACCTGAAGAGGAAGCCGGTCGCCGTGGACCTGGACCCCAAGGCCGTCACCTGCGAC
GAGCTCTTTGGCATCATCAACCCAGTGACCAGGGAATGGAAAGATGGCCTGTTCTCCACCATCATGCGAGACCTGGCCAACATCACCCAT
GACGGCCCCAAGTGGATCATCCTTGACGGAGACATAGACCCCATGTGGATCGAGTCTCTCAACACAGTCATGGATGACAACAAGGTCCTC
ACCCTGGCCAGCAACGAGCGGATCCCCCTGAACCGCACCATGAGGCTGGTGTTCGAAATCAGCCACCTGAGGACGGCCACCCCAGCCACC
GTTTCCAGAGCCGGCATCCTCTACATCAACCCAGCCGACCTGGGATGGAACCCGGTGGTGAGCAGCTGGATCGAGAGGCGCAAGGTGCAG
TCGGAGAAGGCCAACCTGATGATCCTCTTTGACAAGTACCTGCCCACGTGCCTGGACAAGTTGCGCTTTGGGTTCAAGAAGATCACGCCA
GTGCCGGAGATCACGGTGATCCAAACGATTCTGTACCTGCTGGAGTGCCTGCTCACGGAGAAGACCGTGCCCCCCGACTCCCCCAGGGAG
CTGTACGAGCTGTACTTCGTGTTCACCTGCTTCTGGGCCTTCGGTGGCGCCATGTTCCAGGACCAGCTTGTGGATTATCGAGTGGAGTTC
AGTAAATGGTGGATCAACGAATTCAAGACTATCAAGTTCCCCTCGCAGGGAACGATTTTTGACTACTACATTGATCCTGACACAAAAAAG
TTCCTGCCCTGGACAGATAAAGTGCCCTCCTTTGAGCTGGATCCCGATGTCCCACTGCAGGCCTCTTTGGTCCACACCACGGAAACCATC
CGCATCCGCTACTTCATGGACCTGCTCATGGAGAAGTCCTGGCCGGTGATGCTGGTGGGGAACGCGGGGACGGGCAAGTCGGTGCTGATG
GGGGACAAGCTGGAAAGCCTGAACACGGACAACTACCTGGTGCAGGCTGTGCCCTTCAACTTCTACACGACCTCAGCCATGCTGCAGGGG
GTGCTGGAGAAGCCGCTGGAGAAGAAATCGGGGAGGAACTACGGGCCGCCAGGCACTAAGAAGCTCGTCTACTTCATCGACGACATGAAC
ATGCCCGAGGTGGACAAGTATGGGACGGTGGCCCCGCACACCCTCATCCGGCAGCACATGGACCACCGGCACTGGTATGACAGACATAAG
CTGACGTTAAAAGATATCCATAATTGTCAGTACGTGGCCTGCATGAACCCCACTTCCGGATCCTTCACCATCGACTCCAGGCTTCAGCGC
CATTTCTGCGTGTTTGCTGTGAGCTTCCCCGGCCAGGAGGCCCTCACCACCATCTACAACACAATCCTGACGCAGCACCTGGCCTTCCGC
TCGGTCTCCATGGCTATCCAGAGGATAAGCAGCCAGCTGGTGGCCGCGGCCCTGGCTTTGCATCAGAAAATCACGGCAACATTTCTTCCC
ACGGCCATTAAGTTTCATTATGTCTTCAACCTCAGGGACCTCTCCAATATTTTCCAGGGACTCTTATTTTCCACAGCAGAAGTTCTGAAA
ACCCCACTGGACCTCGTCCGCCTTTGGCTACATGAGACTGAACGAGTGTATGGTGACAAAATGGTTGACGAAAAAGACCAGGAAACATTG
CATAGAGTCACCATGGCCTCCACCAAGAAGTTCTTTGATGATCTTGGTGATGAACTCTTATTTGCCAAGCCAAATATCTTCTGCCACTTT
GCTCAAGGGATTGGCGATCCCAAATATGTTCCTGTAACCGACATGGCTCCTCTGAACAAGCTCCTCGTGGACGTCCTGGACAGCTACAAT
GAAGTTAATGCAGTCATGAATTTGGTGCTGTTTGAGGACGCCGTGGCTCACATCTGCAGGATTAATCGCATCCTGGAGTCTCCCCGGGGG
AATGCCCTGCTGGTGGGGGTGGGCGGCAGTGGCAAACAGAGCCTCTCCCGCCTGGCAGCGTACATCAGCGGGCTTGACGTGTTTCAGATC
ACCCTCAAGAAGGGCTACGGGATCCCCGACCTCAAGATTGACCTCGCTGCTCAGTACATAAAGGCTGCCGTGAAGAACGTTCCCTCGGTG
TTCCTGATGACAGACTCCCAGGTGGCCGAGGAGCAGTTTCTGGTGCTGATCAATGACCTGCTGGCCTCAGGAGAGATCCCTGGGCTGTTT
ATGGAGGACGAGGTGGAGAACATCATCTCCTCCATGCGACCCCAAGTCAAGTCCCTTGGCATGAATGACACTCGGGAAACATGTTGGAAG
TTCTTCATCGAAAAAGTGCGCAGACAGCTCAAGGTGATCCTGTGTTTCTCCCCTGTGGGCTCCGTGCTGCGGGTACGAGCCAGAAAGTTC
CCAGCTGTGGTCAACTGCACGGCCATCGACTGGTTCCACGAGTGGCCGGAAGATGCGCTGGTGTCCGTCAGCGCCCGCTTCCTGGAGGAG
ACTGAGGGGATTCCGTGGGAAGTCAAGGCCTCCATCAGCTTCTTCATGTCCTACGTGCACACCACCGTCAACGAGATGTCCAGGGTATAC
CTGGCTACTGAGAGGCGCTACAACTACACCACACCCAAAACCTTTCTGGAGCAGATCAAACTGTACCAGAACCTGCTGGCCAAGAAGAGA
ACGGAACTTGTTGCCAAAATCGAGAGGCTGGAGAACGGCCTGATGAAGCTGCAGAGCACGGCTTCCCAGGTGGATGATTTGAAAGCCAAG
TTGGCGATTCAGGAGGCTGAGCTCAAGCAGAAGAATGAGAGCGCAGACCAACTGATCCAGGTGGTCGGCATCGAGGCCGAGAAGGTCAGC
AAAGAGAAGGCCATTGCTGACCAGGAAGAAGTCAAGGTCGAGGTCATCAATAAGAACGTCACTGAGAAGCAAAAGGCCTGTGAAACAGAC
CTGGCCAAAGCAGAACCGGCCCTGCTGGCAGCCCAGGAGGCTCTGGACACTCTGAATAAGAACAACCTGACAGAGCTGAAGTCCTTTGGG
TCCCCGCCGGATGCTGTGGTCAACGTCACCGCCGCCGTCATGATTCTGACCGCACCTGGGGGCAAGATCCCCAAGGACAAGAGCTGGAAG
GCGGCCAAGATCATGATGGGCAAGGTGGACACCTTCCTAGACTCCCTGAAGAAGTTCGACAAGGAGCACATCCCTGAGGCCTGCCTGAAG
GCCTTCAAGCCCTACCAAGGCAACCCGACGTTCGACCCCGAGTTCATCCGCTCCAAGTCCACGGCCGCCGCCGGCCTGTGCTCCTGGTGC
ATCAACATCGTCCGCTTCTACGAGGTCTACTGCGACGTGGCGCCCAAGAGGCAGGCACTGGAGGAGGCTAATGCAGAGCTGGCAGAGGCA
CAAGAGAAGCTGTCCCGGATCAAAAACAAGATTGCCGAACTTAACGCCAACCTGAGCAACCTAACCTCAGCGTTTGAAAAAGCAACAGCT
GAGAAAATCAAGTGTCAGCAAGAGGCCGATGCCACGAACAGGGTGATCTTACTGGCGAACAGGCTGGTCGGGGGATTAGCATCGGAAAAC
ATCCGCTGGGCTGAGTCTGTGGAGAACTTCAGGAGCCAGGGGGTCACGCTGTGTGGGGACGTCCTGCTCATCTCTGCCTTCGTGTCCTAC
GTGGGCTACTTCACCAAGAAATACCGGAATGAGCTGATGGAGAAATTCTGGATCCCTTACATACATAACTTAAAGGTCCCCATCCCGATC
ACGAATGGCCTGGATCCCTTGAGCCTGCTGACAGATGACGCGGACGTGGCCACCTGGAACAACCAGGGCCTCCCCAGCGACCGCATGTCC
ACCGAGAATGCCACCATCCTGGGCAACACCGAGCGGTGGCCGCTGATCGTGGACGCCCAGCTCCAAGGAATCAAGTGGATCAAAAACAAA
TACAGGAGTGAACTGAAAGCCATCCGCCTGGGACAGAAGAGCTACCTGGATGTCATCGAGCAGGCCATCTCGGAAGGGGACACCTTGCTC
ATTGAGAACATCGGCGAAACCGTGGACCCCGTGCTGGACCCTCTACTGGGCAGGAACACGATTAAAAAGGGAAAGTACATTAAGATCGGT
GACAAGGAGGTGGAGTACCACCCCAAGTTCCGCCTGATCCTACACACCAAGTACTTCAACCCACACTACAAGCCAGAGATGCAGGCTCAG
TGCACCCTCATCAACTTCCTGGTCACCAGGGATGGACTCGAGGACCAACTCTTGGCCGCTGTGGTGGCCAAAGAGCGCCCAGATCTGGAA
CAGCTGAAGGCAAACCTCACCAAGTCTCAAAACGAATTTAAGATTGTTCTGAAAGAGCTGGAAGATTCGCTCCTGGCCCGTCTGTCGGCT
GCGTCGGGGAACTTTCTGGGAGACACGGCCTTGGTGGAGAATCTGGAGACCACCAAGCACACAGCCAGCGAGATCGAGGAGAAGGTGGTG
GAGGCAAAAATCACAGAAGTTAAAATCAACGAAGCGAGAGAGAACTACCGCCCGGCTGCGGAGAGGGCATCTCTGCTCTACTTCATACTG
AACGATCTCAACAAAATCAACCCCGTCTACCAGTTCTCCCTCAAGGCCTTCAACGTGGTGTTTGAGAAAGCCATCCAGAGGACCACCCCT
GCCAACGAGGTGAAGCAGCGGGTGATCAACCTGACGGACGAGATCACCTACTCCGTCTACATGTACACGGCCCGGGGACTCTTCGAGAGG
GACAAACTCATTTTCCTGGCACAAGTTACGTTTCAGGTCCTGTCCATGAAGAAGGAGCTGAACCCAGTGGAGCTGGATTTCCTCCTGCGG
TTCCCTTTTAAGGCCGGAGTGGTCTCACCAGTGGACTTCCTCCAGCATCAAGGCTGGGGCGGGATCAAGGCCCTCTCGGAGATGGATGAG
TTCAAAAATCTGGACAGTGACATCGAAGGATCTGCCAAGCGCTGGAAAAAGCTGGTGGAGTCGGAAGCCCCCGAGAAGGAGATCTTCCCC
AAGGAGTGGAAGAACAAGACGGCCCTGCAGAAGCTGTGCATGGTGCGCTGCCTGCGGCCAGATCGCATGACCTACGCTATCAAGAACTTC
GTGGAGGAAAAGATGGGCAGCAAGTTCGTGGAAGGCCGGAGTGTTGAGTTTTCTAAGTCCTACGAGGAGAGCAGCCCCTCCACGTCAATC
TTCTTCATCCTCTCCCCGGGGGTTGACCCCTTGAAAGACGTGGAAGCCCTGGGAAAAAAACTAGGGTTTACCATAGACAATGGAAAACTC
CATAATGTGTCCCTGGGGCAGGGACAAGAGGTGGTGGCTGAGAACGCCCTGGACGTGGCTGCAGAGAAAGGACACTGGGTCATTCTGCAG
AATATCCACCTGGTGGCCCGGTGGCTGGGAACACTGGACAAGAAGCTGGAGCACTACAGCACGGGCAGCCATGAGGACTACCGGGTGTTC
ATCAGCGCGGAGCCTGCCCCCAGCCCCGAGACCCACATCATCCCCCAGGGCATTCTGGAGAACGCCATCAAGATCACCAACGAGCCCCCC
ACGGGCATGCACGCCAACTTGCACAAGGCCCTGGACCTGTTCACCCAGGACACCCTGGAGATGTGCACCAAGGAGATGGAGTTCAAGTGC
ATGCTCTTCGCCCTGTGCTACTTCCACGCTGTGGTGGCAGAGAGGCGCAAGTTCGGCGCCCAGGGCTGGAACCGGTCGTACCCCTTCAAC
AACGGGGACCTCACCATCTCCATCAACGTGCTCTACAACTACCTGGAGGCCAACCCCAAGGTGCCCTGGGACGATCTCCGCTACCTTTTT
GGTGAAATCATGTATGGCGGCCACATCACAGATGACTGGGACCGTCGGCTGTGCAGGACCTACCTGGCTGAATACATCCGGACGGAGATG
CTGGAGGGAGACGTCCTGCTGGCCCCCGGCTTTCAGATCCCCCCCAACCTGGACTACAAGGGTTACCACGAATACATCGATGAGAACCTG
CCCCCTGAGAGTCCCTATCTGTATGGCCTGCACCCCAACGCAGAGATTGGCTTTCTGACGGTCACCTCAGAGAAGCTGTTCCGCACTGTC
CTGGAAATGCAGCCAAAAGAGACGGACTCGGGGGCAGGCACGGGAGTGTCCCGCGAGGAGAAGGTGAAGGCCGTGCTGGACGACATCCTG
GAGAAGATTCCGGAGACTTTCAACATGGCTGAGATCATGGCAAAGGCAGCGGAAAAGACCCCCTACGTGGTAGTCGCCTTTCAAGAATGT
GAAAGAATGAACATCCTGACCAACGAAATGCGCCGTTCGCTCAAGGAGCTGAACCTGGGGCTGAAGGGAGAACTGACCATCACGACCGAC
GTGGAAGATCTGTCCACGGCTCTCTTCTATGACACCGTGCCTGATACGTGGGTGGCCCGGGCCTACCCCTCCATGATGGGCCTGGCGGCC
TGGTACGCAGACCTGCTGCTCCGCATCAGGGAACTCGAGGCCTGGACGACAGACTTTGCCCTGCCCACCACCGTGTGGCTGGCCGGCTTC
TTCAACCCCCAGTCGTTCCTCACGGCCATCATGCAGTCCATGGCCAGGAAGAACGAGTGGCCCCTGGACAAGATGTGTCTGTCTGTCGAG
GTGACCAAGAAAAACCGAGAGGACATGACCGCTCCTCCGCGAGAGGGCTCCTACGTGTACGGACTCTTCATGGAAGGGGCTCGCTGGGAC
ACCCAGACTGGAGTCATCGCTGAAGCGCGGCTGAAAGAGCTGACCCCGGCCATGCCTGTCATCTTCATCAAGGCCATTCCTGTGGACCGC
ATGGAGACCAAGAACATCTATGAGTGTCCCGTGTACAAAACACGCATCCGCGGCCCCACCTATGTCTGGACCTTTAACTTGAAGACCAAA
GAGAAGGCAGCGAAGTGGATCCTGGCAGCCGTGGCGCTGCTCCTACAGGTTTAGCTCGCTCCTGCCTCACAGCCCACACTCCCTGGGGCT
GGACCACAACTCAGCCCTTCACCTGTGCACCTGTGACTTATTCTTTACAGGAACTGGTGGTGGTTTTTCGTTCTCTTAAATAATCAGGTG

>In-frame_ENST00000425042_ENST00000585328_TCGA-ED-A97K-01A_HID1_chr17_72968686_-_DNAH17_chr17_76506588_length(amino acids)=3135AA_start in transcript=6_stop in transcript=9413
MEPELKPEPGWSLGGGRAGAGSRDMGSTDSKLNFRKAVIQLTTKTQVKFKMSEETTLADLLQLNLHSYEDEVRNIVDKAVKESGMEKVLK
ALDSTWSMMEFQHEPHPRTGTMMLKSSEVLVETLEDNQVQLQNLMMSKYLAHFLKEVTSWQQKLSTADSVISIWFEVQRTWSHLESIFIG
SEDIRTQLPGDSQRFDDINQEFKALMEDAVKTPNVVEATSKPGLYNKLEALKKSLAICEKALAEYLETKRLAFPRFYFVSSADLLDILSN
GNDPVEVSRHLSKLFDSLCKLKFRLDASDKPLKVGLGMYSKEDEYMVFDQECDLSGQVEVWLNRVLDRMCSTLRHEIPEAVVTYEEKPRE
QWILDYPAQVALTCTQIWWTTEVGLAFARLEEGYENAIRDYNKKQISQLNVLITLLMGNLNAGDRMKIMTICTIDVHARDVVAKMIVVES
SQAFTWQAQLRHRWDEEKRHCFANICDAQIQYSYEYLGNTPRLVITPLTDRCYITLTQSLHLIMGGAPAGPAGTGKTETTKDLGRALGTM
VYVFNCSEQMDYKSCGNIYKGLAQTGAWGCFDEFNRISVEVLSVIAVQVKCVQDAIRAKKKAFNFLGEIIGLIPTVGIFITMNPGYAGRA
ELPENLKALFRPCAMVVPDFELICEIMLMAEGFLEARLLARKFITLYTLCKELLSKQDHYDWGLRAIKSVLVVAGSLKRGDPSRAEDQVL
MRALRDFNIPKIVTDDLPVFMGLIGDLFPALDVPRKRDLNFEKIIKQSIVELKLQAEDSFVLKVVQLEELLQVRHSVFIVGNAGSGKSQV
LKSLNKTYQNLKRKPVAVDLDPKAVTCDELFGIINPVTREWKDGLFSTIMRDLANITHDGPKWIILDGDIDPMWIESLNTVMDDNKVLTL
ASNERIPLNRTMRLVFEISHLRTATPATVSRAGILYINPADLGWNPVVSSWIERRKVQSEKANLMILFDKYLPTCLDKLRFGFKKITPVP
EITVIQTILYLLECLLTEKTVPPDSPRELYELYFVFTCFWAFGGAMFQDQLVDYRVEFSKWWINEFKTIKFPSQGTIFDYYIDPDTKKFL
PWTDKVPSFELDPDVPLQASLVHTTETIRIRYFMDLLMEKSWPVMLVGNAGTGKSVLMGDKLESLNTDNYLVQAVPFNFYTTSAMLQGVL
EKPLEKKSGRNYGPPGTKKLVYFIDDMNMPEVDKYGTVAPHTLIRQHMDHRHWYDRHKLTLKDIHNCQYVACMNPTSGSFTIDSRLQRHF
CVFAVSFPGQEALTTIYNTILTQHLAFRSVSMAIQRISSQLVAAALALHQKITATFLPTAIKFHYVFNLRDLSNIFQGLLFSTAEVLKTP
LDLVRLWLHETERVYGDKMVDEKDQETLHRVTMASTKKFFDDLGDELLFAKPNIFCHFAQGIGDPKYVPVTDMAPLNKLLVDVLDSYNEV
NAVMNLVLFEDAVAHICRINRILESPRGNALLVGVGGSGKQSLSRLAAYISGLDVFQITLKKGYGIPDLKIDLAAQYIKAAVKNVPSVFL
MTDSQVAEEQFLVLINDLLASGEIPGLFMEDEVENIISSMRPQVKSLGMNDTRETCWKFFIEKVRRQLKVILCFSPVGSVLRVRARKFPA
VVNCTAIDWFHEWPEDALVSVSARFLEETEGIPWEVKASISFFMSYVHTTVNEMSRVYLATERRYNYTTPKTFLEQIKLYQNLLAKKRTE
LVAKIERLENGLMKLQSTASQVDDLKAKLAIQEAELKQKNESADQLIQVVGIEAEKVSKEKAIADQEEVKVEVINKNVTEKQKACETDLA
KAEPALLAAQEALDTLNKNNLTELKSFGSPPDAVVNVTAAVMILTAPGGKIPKDKSWKAAKIMMGKVDTFLDSLKKFDKEHIPEACLKAF
KPYQGNPTFDPEFIRSKSTAAAGLCSWCINIVRFYEVYCDVAPKRQALEEANAELAEAQEKLSRIKNKIAELNANLSNLTSAFEKATAEK
IKCQQEADATNRVILLANRLVGGLASENIRWAESVENFRSQGVTLCGDVLLISAFVSYVGYFTKKYRNELMEKFWIPYIHNLKVPIPITN
GLDPLSLLTDDADVATWNNQGLPSDRMSTENATILGNTERWPLIVDAQLQGIKWIKNKYRSELKAIRLGQKSYLDVIEQAISEGDTLLIE
NIGETVDPVLDPLLGRNTIKKGKYIKIGDKEVEYHPKFRLILHTKYFNPHYKPEMQAQCTLINFLVTRDGLEDQLLAAVVAKERPDLEQL
KANLTKSQNEFKIVLKELEDSLLARLSAASGNFLGDTALVENLETTKHTASEIEEKVVEAKITEVKINEARENYRPAAERASLLYFILND
LNKINPVYQFSLKAFNVVFEKAIQRTTPANEVKQRVINLTDEITYSVYMYTARGLFERDKLIFLAQVTFQVLSMKKELNPVELDFLLRFP
FKAGVVSPVDFLQHQGWGGIKALSEMDEFKNLDSDIEGSAKRWKKLVESEAPEKEIFPKEWKNKTALQKLCMVRCLRPDRMTYAIKNFVE
EKMGSKFVEGRSVEFSKSYEESSPSTSIFFILSPGVDPLKDVEALGKKLGFTIDNGKLHNVSLGQGQEVVAENALDVAAEKGHWVILQNI
HLVARWLGTLDKKLEHYSTGSHEDYRVFISAEPAPSPETHIIPQGILENAIKITNEPPTGMHANLHKALDLFTQDTLEMCTKEMEFKCML
FALCYFHAVVAERRKFGAQGWNRSYPFNNGDLTISINVLYNYLEANPKVPWDDLRYLFGEIMYGGHITDDWDRRLCRTYLAEYIRTEMLE
GDVLLAPGFQIPPNLDYKGYHEYIDENLPPESPYLYGLHPNAEIGFLTVTSEKLFRTVLEMQPKETDSGAGTGVSREEKVKAVLDDILEK
IPETFNMAEIMAKAAEKTPYVVVAFQECERMNILTNEMRRSLKELNLGLKGELTITTDVEDLSTALFYDTVPDTWVARAYPSMMGLAAWY
ADLLLRIRELEAWTTDFALPTTVWLAGFFNPQSFLTAIMQSMARKNEWPLDKMCLSVEVTKKNREDMTAPPREGSYVYGLFMEGARWDTQ

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for HID1-DNAH17


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for HID1-DNAH17


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for HID1-DNAH17


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource