FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:ARHGAP44-DNAH2 (FusionGDB2 ID:6122)

Fusion Gene Summary for ARHGAP44-DNAH2

check button Fusion gene summary
Fusion gene informationFusion gene name: ARHGAP44-DNAH2
Fusion gene ID: 6122
HgeneTgene
Gene symbol

ARHGAP44

DNAH2

Gene ID

9912

146754

Gene nameRho GTPase activating protein 44dynein axonemal heavy chain 2
SynonymsNPC-A-10|RICH2DNAHC2|DNHD3
Cytomap

17p12

17p13.1

Type of geneprotein-codingprotein-coding
Descriptionrho GTPase-activating protein 44Rho-type GTPase-activating protein RICH2RhoGAP interacting with CIP4 homologs protein 2rho GTPase-activating protein RICH2dynein heavy chain 2, axonemalaxonemal beta dynein heavy chain 2ciliary dynein heavy chain 2dynein heavy chain domain-containing protein 3dynein, axonemal, heavy polypeptide 2
Modification date2020031320200327
UniProtAcc

Q17R89

Q9P225

Ensembl transtripts involved in fusion geneENST00000262444, ENST00000340825, 
ENST00000379672, ENST00000578087, 
ENST00000082259, ENST00000570791, 
ENST00000389173, ENST00000572933, 
Fusion gene scores* DoF score9 X 8 X 6=4327 X 9 X 5=315
# samples 99
** MAII scorelog2(9/432*10)=-2.26303440583379
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(9/315*10)=-1.8073549220576
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: ARHGAP44 [Title/Abstract] AND DNAH2 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointARHGAP44(12693208)-DNAH2(7704896), # samples:3
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID

check buttonFusion gene breakpoints across ARHGAP44 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across DNAH2 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4BRCATCGA-A2-A0CT-01AARHGAP44chr17

12693208

-DNAH2chr17

7704896

+
ChimerDB4BRCATCGA-A2-A0CT-01AARHGAP44chr17

12693208

+DNAH2chr17

7704896

+


Top

Fusion Gene ORF analysis for ARHGAP44-DNAH2

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000262444ENST00000082259ARHGAP44chr17

12693208

+DNAH2chr17

7704896

+
5CDS-intronENST00000262444ENST00000570791ARHGAP44chr17

12693208

+DNAH2chr17

7704896

+
5CDS-intronENST00000340825ENST00000082259ARHGAP44chr17

12693208

+DNAH2chr17

7704896

+
5CDS-intronENST00000340825ENST00000570791ARHGAP44chr17

12693208

+DNAH2chr17

7704896

+
5CDS-intronENST00000379672ENST00000082259ARHGAP44chr17

12693208

+DNAH2chr17

7704896

+
5CDS-intronENST00000379672ENST00000570791ARHGAP44chr17

12693208

+DNAH2chr17

7704896

+
In-frameENST00000262444ENST00000389173ARHGAP44chr17

12693208

+DNAH2chr17

7704896

+
In-frameENST00000262444ENST00000572933ARHGAP44chr17

12693208

+DNAH2chr17

7704896

+
In-frameENST00000340825ENST00000389173ARHGAP44chr17

12693208

+DNAH2chr17

7704896

+
In-frameENST00000340825ENST00000572933ARHGAP44chr17

12693208

+DNAH2chr17

7704896

+
In-frameENST00000379672ENST00000389173ARHGAP44chr17

12693208

+DNAH2chr17

7704896

+
In-frameENST00000379672ENST00000572933ARHGAP44chr17

12693208

+DNAH2chr17

7704896

+
intron-3CDSENST00000578087ENST00000389173ARHGAP44chr17

12693208

+DNAH2chr17

7704896

+
intron-3CDSENST00000578087ENST00000572933ARHGAP44chr17

12693208

+DNAH2chr17

7704896

+
intron-intronENST00000578087ENST00000082259ARHGAP44chr17

12693208

+DNAH2chr17

7704896

+
intron-intronENST00000578087ENST00000570791ARHGAP44chr17

12693208

+DNAH2chr17

7704896

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000379672ARHGAP44chr1712693208+ENST00000572933DNAH2chr177704896+514935330049371545
ENST00000379672ARHGAP44chr1712693208+ENST00000389173DNAH2chr177704896+514535330049371545
ENST00000340825ARHGAP44chr1712693208+ENST00000572933DNAH2chr177704896+514935330049371545
ENST00000340825ARHGAP44chr1712693208+ENST00000389173DNAH2chr177704896+514535330049371545
ENST00000262444ARHGAP44chr1712693208+ENST00000572933DNAH2chr177704896+49281327947161545
ENST00000262444ARHGAP44chr1712693208+ENST00000389173DNAH2chr177704896+49241327947161545

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000379672ENST00000572933ARHGAP44chr1712693208+DNAH2chr177704896+0.0030095290.99699044
ENST00000379672ENST00000389173ARHGAP44chr1712693208+DNAH2chr177704896+0.0029681950.9970318
ENST00000340825ENST00000572933ARHGAP44chr1712693208+DNAH2chr177704896+0.0030095290.99699044
ENST00000340825ENST00000389173ARHGAP44chr1712693208+DNAH2chr177704896+0.0029681950.9970318
ENST00000262444ENST00000572933ARHGAP44chr1712693208+DNAH2chr177704896+0.0021159560.997884
ENST00000262444ENST00000389173ARHGAP44chr1712693208+DNAH2chr177704896+0.0020828340.9979171

Top

Fusion Genomic Features for ARHGAP44-DNAH2


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
ARHGAP44chr1712693208+DNAH2chr177704895+4.96E-081
ARHGAP44chr1712693208+DNAH2chr177704895+4.96E-081

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.
genomic feature of top 1%

Top

Fusion Protein Features for ARHGAP44-DNAH2


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr17:12693208/chr17:7704896)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
ARHGAP44

Q17R89

DNAH2

Q9P225

FUNCTION: GTPase-activating protein (GAP) that stimulates the GTPase activity of Rho-type GTPases. Thereby, controls Rho-type GTPases cycling between their active GTP-bound and inactive GDP-bound states. Acts as a GAP at least for CDC42 and RAC1 (PubMed:11431473). In neurons, is involved in dendritic spine formation and synaptic plasticity in a specific RAC1-GAP activity (By similarity). Limits the initiation of exploratory dendritic filopodia. Recruited to actin-patches that seed filopodia, binds specifically to plasma membrane sections that are deformed inward by acto-myosin mediated contractile forces. Acts through GAP activity on RAC1 to reduce actin polymerization necessary for filopodia formation (By similarity). In association with SHANK3, promotes GRIA1 exocytosis from recycling endosomes and spine morphological changes associated to long-term potentiation (By similarity). {ECO:0000250|UniProtKB:F1LQX4, ECO:0000250|UniProtKB:Q5SSM3, ECO:0000269|PubMed:11431473}.FUNCTION: Force generating protein of respiratory cilia. Produces force towards the minus ends of microtubules. Dynein has ATPase activity; the force-producing power stroke is thought to occur on release of ADP. Involved in sperm motility; implicated in sperm flagellar assembly (By similarity). {ECO:0000250}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590133012_30490873.0Coiled coilOntology_term=ECO:0000255
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590133216_33040873.0Coiled coilOntology_term=ECO:0000255
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590133523_35670873.0Coiled coilOntology_term=ECO:0000255
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354853012_304928994428.0Coiled coilOntology_term=ECO:0000255
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354853216_330428994428.0Coiled coilOntology_term=ECO:0000255
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354853523_356728994428.0Coiled coilOntology_term=ECO:0000255
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910143012_30490873.0Coiled coilOntology_term=ECO:0000255
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910143216_33040873.0Coiled coilOntology_term=ECO:0000255
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910143523_35670873.0Coiled coilOntology_term=ECO:0000255
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355863012_304928994428.0Coiled coilOntology_term=ECO:0000255
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355863216_330428994428.0Coiled coilOntology_term=ECO:0000255
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355863523_356728994428.0Coiled coilOntology_term=ECO:0000255
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590131803_18100873.0Nucleotide bindingATP
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590132084_20910873.0Nucleotide bindingATP
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590132416_24230873.0Nucleotide bindingATP
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590132762_27690873.0Nucleotide bindingATP
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910141803_18100873.0Nucleotide bindingATP
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910142084_20910873.0Nucleotide bindingATP
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910142416_24230873.0Nucleotide bindingATP
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910142762_27690873.0Nucleotide bindingATP
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590131765_19860873.0RegionAAA 1
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590131_17640873.0RegionStem
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590132046_22730873.0RegionAAA 2
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590132378_26250873.0RegionAAA 3
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590132722_29740873.0RegionAAA 4
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590132989_32720873.0RegionStalk
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590133358_35880873.0RegionAAA 5
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590133804_40230873.0RegionAAA 6
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354852989_327228994428.0RegionStalk
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354853358_358828994428.0RegionAAA 5
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354853804_402328994428.0RegionAAA 6
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910141765_19860873.0RegionAAA 1
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910141_17640873.0RegionStem
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910142046_22730873.0RegionAAA 2
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910142378_26250873.0RegionAAA 3
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910142722_29740873.0RegionAAA 4
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910142989_32720873.0RegionStalk
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910143358_35880873.0RegionAAA 5
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910143804_40230873.0RegionAAA 6
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355862989_327228994428.0RegionStalk
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355863358_358828994428.0RegionAAA 5
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355863804_402328994428.0RegionAAA 6
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590131404_14390873.0RepeatNote=TPR 1
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590132721_27540873.0RepeatNote=TPR 2
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590133072_31050873.0RepeatNote=TPR 3
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590134072_41040873.0RepeatNote=TPR 4
TgeneDNAH2chr17:12693208chr17:7704896ENST000000822590134105_41400873.0RepeatNote=TPR 5
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354853072_310528994428.0RepeatNote=TPR 3
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354854072_410428994428.0RepeatNote=TPR 4
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354854105_414028994428.0RepeatNote=TPR 5
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910141404_14390873.0RepeatNote=TPR 1
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910142721_27540873.0RepeatNote=TPR 2
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910143072_31050873.0RepeatNote=TPR 3
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910144072_41040873.0RepeatNote=TPR 4
TgeneDNAH2chr17:12693208chr17:7704896ENST000005707910144105_41400873.0RepeatNote=TPR 5
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355863072_310528994428.0RepeatNote=TPR 3
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355864072_410428994428.0RepeatNote=TPR 4
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355864105_414028994428.0RepeatNote=TPR 5

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneARHGAP44chr17:12693208chr17:7704896ENST00000379672+12114_24917819.0DomainBAR
HgeneARHGAP44chr17:12693208chr17:7704896ENST00000379672+121255_44517819.0DomainRho-GAP
HgeneARHGAP44chr17:12693208chr17:7704896ENST00000379672+121815_81817819.0MotifPDZ-binding
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354851803_181028994428.0Nucleotide bindingATP
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354852084_209128994428.0Nucleotide bindingATP
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354852416_242328994428.0Nucleotide bindingATP
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354852762_276928994428.0Nucleotide bindingATP
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355861803_181028994428.0Nucleotide bindingATP
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355862084_209128994428.0Nucleotide bindingATP
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355862416_242328994428.0Nucleotide bindingATP
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355862762_276928994428.0Nucleotide bindingATP
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354851765_198628994428.0RegionAAA 1
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354851_176428994428.0RegionStem
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354852046_227328994428.0RegionAAA 2
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354852378_262528994428.0RegionAAA 3
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354852722_297428994428.0RegionAAA 4
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355861765_198628994428.0RegionAAA 1
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355861_176428994428.0RegionStem
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355862046_227328994428.0RegionAAA 2
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355862378_262528994428.0RegionAAA 3
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355862722_297428994428.0RegionAAA 4
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354851404_143928994428.0RepeatNote=TPR 1
TgeneDNAH2chr17:12693208chr17:7704896ENST0000038917354852721_275428994428.0RepeatNote=TPR 2
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355861404_143928994428.0RepeatNote=TPR 1
TgeneDNAH2chr17:12693208chr17:7704896ENST0000057293355862721_275428994428.0RepeatNote=TPR 2


Top

Fusion Gene Sequence for ARHGAP44-DNAH2


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>6122_6122_1_ARHGAP44-DNAH2_ARHGAP44_chr17_12693208_ENST00000262444_DNAH2_chr17_7704896_ENST00000389173_length(transcript)=4924nt_BP=132nt
GAGCCATGTAACCCTGCGGCGGGCTCCGGGCTGCTCCGTCCTTCCCCAGCTCCCGGGCTAGCGCGGCAGCGGGGCCACGATGAAGAAGCA
GTTCAATCGCATGCGCCAGCTGGCCAACCAGACGGTGGGCAGGAACTGGATCCGCCAGTACCCAGCCTTGGTGAACTGCACAACCATCAA
CTGGTTCTCAGAGTGGCCCCAAGAGGCCCTGCTCGAGGTGGCTGAGAAGTGCCTCATAGGAGTAGACCTGGGAACTCAGGAGAATATCCA
CAGGAAGGTGGCCCAGATCTTTGTCACTATGCACTGGTCAGTAGCTCAGTATTCCCAGAAGATGCTGTTGGAACTGCGGAGACACAACTA
TGTCACACCCACCAAATACCTGGAACTCCTGTCTGGATATAAGAAGTTGCTGGGAGAAAAACGGCAGGAGCTGCTGGCCCAAGCCAATAA
ACTGCGGACAGGCTTGTTCAAGATCGACGAAACTAGGGAAAAGGTGCAAGTGATGTCGTTGGAGCTGGAGGATGCCAAGAAGAAGGTGGC
TGAGTTCCAGAAGCAGTGTGAGGAGTACCTGGTCATCATTGTGCAGCAGAAGCGGGAGGCAGATGAGCAGCAGAAGGCCGTAACAGCCAA
CAGTGAAAAGATTGCAGTTGAGGAAATCAAGTGTCAGGCACTGGCTGACAATGCCCAGAAAGATCTAGAAGAGGCACTGCCCGCCCTGGA
AGAGGCCATGCGGGCCCTGGAGTCTCTGAACAAGAAGGATATAGGAGAGATCAAGTCTTATGGACGGCCCCCAGCCCAAGTGGAGATAGT
GATGCAGGCAGTTATGATTCTTCGAGGCAACGAGCCCACATGGGCAGAGGCCAAGAGGCAGCTAGGGGAACAGAACTTCATCAAGTCACT
GATCAACTTTGATAAAGACAATATCTCAGATAAGGTTCTGAAGAAGATTGGGGCCTACTGCGCCCAGCCTGACTTCCAGCCTGATATCAT
CGGCCGCGTCTCCCTGGCTGCCAAGTCCCTCTGCATGTGGGTGCGGGCCATGGAGCTGTATGGGCGGCTATATCGGGTGGTGGAGCCCAA
GCGAATCCGAATGAACGCTGCCTTGGCTCAGCTTCGGGAGAAGCAAGCCGCGCTCGCTGAGGCCCAGGAGAAGCTGCGGGAGGTAGCTGA
GAAACTGGAGATGCTAAAGAAACAGTATGATGAGAAGCTGGCACAGAAGGAGGAGCTTCGCAAGAAGTCTGAAGAGATGGAGCTGAAGCT
GGAGCGAGCTGGGATGCTCGTGTCGGGGTTGGCTGGCGAGAAGGCCAGATGGGAGGAGACAGTCCAGGGCCTGGAGGAGGACCTGGGCTA
CCTGGTGGGGGACTGTCTCCTGGCAGCTGCCTTCCTGTCCTACATGGGACCCTTCCTGACCAACTACCGGGATGAGATTGTCAACCAAAT
CTGGATCGGGAAGATCTGGGAGCTTCAGGTTCCTTGCTCCCCTTCTTTCGCCATCGATAACTTCCTGTGCAATCCTACCAAAGTCCGGGA
CTGGAACATCCAAGGGTTGCCCTCAGACGCCTTCTCCACTGAGAATGGCATCATCGTCACCCGAGGCAACAGGTGGGCACTGATGATCGA
CCCTCAGGCCCAGGCCCTGAAATGGATTAAGAACATGGAAGGAGGCCAGGGCCTGAAGATCATCGACCTGCAGATGAGCGATTACCTGCG
AATCCTAGAACACGCCATTCACTTTGGATACCCGGTGCTACTTCAGAACGTGCAGGAATATCTGGACCCCACACTGAACCCCATGCTCAA
CAAATCTGTAGCCCGAATCGGTGGTCGGCTGTTGATGCGCATTGGCGATAAGGAGGTGGAATATAATACCAATTTCCGTTTCTACATCAC
CACCAAGCTCTCCAACCCCCACTACAGCCCAGAGACCTCAGCCAAGACCACCATCGTCAACTTTGCTGTTAAAGAACAGGGCCTGGAGGC
CCAGCTGCTGGGCATTGTGGTGCGGAAGGAGCGGCCTGAGCTGGAGGAGCAGAAGGACTCACTGGTCATCAACATCGCGGCTGGTAAAAG
GAAGCTCAAGGAGCTGGAGGATGAGATCCTGCGGCTGCTGAATGAGGCCACCGGCTCCCTGCTGGATGATGTGCAGCTGGTGAACACGCT
GCATACCTCCAAGATCACAGCCACAGAGGTGACTGAGCAGCTGGAGACCAGTGAGACCACAGAGATCAACACTGACTTGGCGCGGGAGGC
TTACCGCCCATGCGCCCAGCGGGCATCAATCCTGTTCTTCGTGCTCAATGATATGGGCTGCATCGACCCCATGTACCAGTTCTCACTGGA
TGCCTACATCAGCCTCTTTATTCTCAGCATTGACAAAAGCCACCGCAGCAATAAGCTGGAGGACCGCATTGACTACCTGAATGACTACCA
CACCTACGCTGTCTACAGGTACACCTGCCGTACCCTTTTCGAACGCCACAAACTACTATTCAGTTTTCATATGTGTGCCAAAATCTTGGA
GACTTCTGGCAAGCTCAACATGGATGAATACAACTTCTTTCTACGTGGGGGTGTGGTCTTGGATCGGGAGGGCCAAATGGACAATCCATG
TAGTAGCTGGCTTGCAGATGCCTACTGGGATAACATCACAGAGCTAGACAAACTGACCAACTTCCACGGACTCATGAACTCCTTTGAGCA
GTACCCTCGTGACTGGCACCTGTGGTATACCAATGCTGCCCCGGAGAAGGCGATGCTGCCAGGTGAGTGGGAAAATGCCTGCAATGAAAT
GCAACGGATGCTGATCGTTCGCTCCCTGCGCCAGGACCGCGTGGCCTTCTGCGTGACCTCCTTCATCATCACCAACCTTGGCTCCCGCTT
CATCGAGCCGCCTGTGCTGAATATGAAGTCGGTGCTGGAGGATTCAACCCCACGATCCCCACTCGTGTTCATCCTGTCCCCTGGTGTGGA
CCCCACCAGTGCCCTGCTGCAGCTGGCAGAGCACATGGGCATGGCCCAGCGCTTCCACGCCCTGTCCCTGGGCCAGGGCCAGGCCCCCAT
CGCTGCTCGGCTCCTCCGAGAGGGTGTGACTCAGGGACACTGGGTGTTCCTGGCAAACTGCCACCTGTCACTGTCTTGGATGCCTAATCT
GGACAAGCTGGTGGAGCAGCTGCAGGTGGAGGATCCTCATCCATCCTTCCGCCTCTGGCTCAGCTCCATCCCCCACCCAGACTTCCCTAT
CTCAATCTTGCAGGTCAGCATCAAGATGACCACAGAGCCACCAAAGGGCCTAAAGGCCAACATGACACGTCTTTACCAACTGATGTCAGA
ACCACAGTTTTCCCGCTGCTCCAAACCTGCCAAATATAAGAAGCTGCTGTTTTCACTCTGTTTCTTCCACTCTGTGTTACTTGAACGCAA
AAAGTTCCTGCAGCTTGGCTGGAACATCATCTATGGCTTCAATGACTCCGACTTTGAGGTGTCAGAAAACTTGCTGAGCCTCTATCTCGA
TGAGTACGAGGAGACACCTTGGGACGCACTTAAGTACCTCATTGCCGGCATCAACTATGGTGGACATGTCACAGATGACTGGGACCGGCG
CCTGCTGACCACCTACATCAATGATTATTTCTGTGACCAGTCTCTATCAACTCCCTTCCACCGGTTGTCAGCACTGGAGACTTATTTCAT
CCCCAAGGATGGCAGCCTCGCTTCTTACAAGGAATACATCAGCTTATTGCCTGGCATGGACCCCCCTGAGGCCTTTGGCCAGCACCCCAA
TGCTGATGTGGCCTCTCAGATCACTGAGGCACAAACCCTCTTTGATACTTTGCTTTCCTTGCAACCTCAGATTACACCCACCAGGGCTGG
AGGCCAGACCCGGGAAGAGAAGGTCCTTGAGTTGGCCGCTGATGTGAAGCAGAAGATCCCTGAAATGATCGACTATGAGGGGACTCAAAA
ACTGCTAGCTCTCGACCCCTCCCCCCTCAATGTGGTCCTTCTGCAGGAGATCCAGAGATACAACACACTGATGCAGACCATCCTGTTCTC
ACTGACAGACCTAGAGAAAGGCATCCAGGGTCTCATCGTCATGTCTACAAGCCTGGAAGAGATTTTCAATTGCATCTTTGATGCCCATGT
TCCTCCGCTCTGGGGAAAGGCATACCCCTCACAAAAGCCATTGGCTGCCTGGACCCGGGACTTGGCCATGCGTGTGGAGCAGTTTGAGCT
GTGGGCCAGCCGGGCCCGGCCTCCTGTGATCTTCTGGTTGTCTGGTTTCACCTTTCCCACTGGCTTCCTCACTGCTGTGCTGCAGTCTTC
AGCTCGCCAAAACAACGTTTCAGTGGACAGCCTCTCCTGGGAGTTTATCGTTTCCACTGTGGATGACAGCAACCTAGTGTATCCCCCCAA
GGATGGTGTCTGGGTCCGGGGCCTGTACCTGGAAGGTGCTGGCTGGGACCGGAAGAACTCCTGCTTGGTGGAGGCAGAGCCCATGCAGCT
TGTCTGCCTCATGCCCACGATCCACTTCCGGCCTGCAGAGAGCCGCAAGAAGAGCGCCAAGGGCATGTACTCCTGCCCCTGCTATTACTA
TCCCAACCGGGCAGGCAGCTCAGACCGAGCCTCCTTTGTCATCGGCATTGACCTGCGGTCTGGGGCCATGACACCTGATCATTGGATCAA
GAGGGGCACTGCTCTACTCATGAGCCTGGACAGCTGAGACCTCCTCCTCTTCTCCGCTTGAGAGAGAGGGTCAGGGACTCCAGGAGCTAA
GACAGATGTTGCACCTAGGACTGAGGCCGGACCTCACTCAGACTTTGACCTTGGCCGAATTTGTGTGATGTGGCCCTGGAGATACCTAGT

>6122_6122_1_ARHGAP44-DNAH2_ARHGAP44_chr17_12693208_ENST00000262444_DNAH2_chr17_7704896_ENST00000389173_length(amino acids)=1545AA_BP=17
MKKQFNRMRQLANQTVGRNWIRQYPALVNCTTINWFSEWPQEALLEVAEKCLIGVDLGTQENIHRKVAQIFVTMHWSVAQYSQKMLLELR
RHNYVTPTKYLELLSGYKKLLGEKRQELLAQANKLRTGLFKIDETREKVQVMSLELEDAKKKVAEFQKQCEEYLVIIVQQKREADEQQKA
VTANSEKIAVEEIKCQALADNAQKDLEEALPALEEAMRALESLNKKDIGEIKSYGRPPAQVEIVMQAVMILRGNEPTWAEAKRQLGEQNF
IKSLINFDKDNISDKVLKKIGAYCAQPDFQPDIIGRVSLAAKSLCMWVRAMELYGRLYRVVEPKRIRMNAALAQLREKQAALAEAQEKLR
EVAEKLEMLKKQYDEKLAQKEELRKKSEEMELKLERAGMLVSGLAGEKARWEETVQGLEEDLGYLVGDCLLAAAFLSYMGPFLTNYRDEI
VNQIWIGKIWELQVPCSPSFAIDNFLCNPTKVRDWNIQGLPSDAFSTENGIIVTRGNRWALMIDPQAQALKWIKNMEGGQGLKIIDLQMS
DYLRILEHAIHFGYPVLLQNVQEYLDPTLNPMLNKSVARIGGRLLMRIGDKEVEYNTNFRFYITTKLSNPHYSPETSAKTTIVNFAVKEQ
GLEAQLLGIVVRKERPELEEQKDSLVINIAAGKRKLKELEDEILRLLNEATGSLLDDVQLVNTLHTSKITATEVTEQLETSETTEINTDL
AREAYRPCAQRASILFFVLNDMGCIDPMYQFSLDAYISLFILSIDKSHRSNKLEDRIDYLNDYHTYAVYRYTCRTLFERHKLLFSFHMCA
KILETSGKLNMDEYNFFLRGGVVLDREGQMDNPCSSWLADAYWDNITELDKLTNFHGLMNSFEQYPRDWHLWYTNAAPEKAMLPGEWENA
CNEMQRMLIVRSLRQDRVAFCVTSFIITNLGSRFIEPPVLNMKSVLEDSTPRSPLVFILSPGVDPTSALLQLAEHMGMAQRFHALSLGQG
QAPIAARLLREGVTQGHWVFLANCHLSLSWMPNLDKLVEQLQVEDPHPSFRLWLSSIPHPDFPISILQVSIKMTTEPPKGLKANMTRLYQ
LMSEPQFSRCSKPAKYKKLLFSLCFFHSVLLERKKFLQLGWNIIYGFNDSDFEVSENLLSLYLDEYEETPWDALKYLIAGINYGGHVTDD
WDRRLLTTYINDYFCDQSLSTPFHRLSALETYFIPKDGSLASYKEYISLLPGMDPPEAFGQHPNADVASQITEAQTLFDTLLSLQPQITP
TRAGGQTREEKVLELAADVKQKIPEMIDYEGTQKLLALDPSPLNVVLLQEIQRYNTLMQTILFSLTDLEKGIQGLIVMSTSLEEIFNCIF
DAHVPPLWGKAYPSQKPLAAWTRDLAMRVEQFELWASRARPPVIFWLSGFTFPTGFLTAVLQSSARQNNVSVDSLSWEFIVSTVDDSNLV
YPPKDGVWVRGLYLEGAGWDRKNSCLVEAEPMQLVCLMPTIHFRPAESRKKSAKGMYSCPCYYYPNRAGSSDRASFVIGIDLRSGAMTPD

--------------------------------------------------------------
>6122_6122_2_ARHGAP44-DNAH2_ARHGAP44_chr17_12693208_ENST00000262444_DNAH2_chr17_7704896_ENST00000572933_length(transcript)=4928nt_BP=132nt
GAGCCATGTAACCCTGCGGCGGGCTCCGGGCTGCTCCGTCCTTCCCCAGCTCCCGGGCTAGCGCGGCAGCGGGGCCACGATGAAGAAGCA
GTTCAATCGCATGCGCCAGCTGGCCAACCAGACGGTGGGCAGGAACTGGATCCGCCAGTACCCAGCCTTGGTGAACTGCACAACCATCAA
CTGGTTCTCAGAGTGGCCCCAAGAGGCCCTGCTCGAGGTGGCTGAGAAGTGCCTCATAGGAGTAGACCTGGGAACTCAGGAGAATATCCA
CAGGAAGGTGGCCCAGATCTTTGTCACTATGCACTGGTCAGTAGCTCAGTATTCCCAGAAGATGCTGTTGGAACTGCGGAGACACAACTA
TGTCACACCCACCAAATACCTGGAACTCCTGTCTGGATATAAGAAGTTGCTGGGAGAAAAACGGCAGGAGCTGCTGGCCCAAGCCAATAA
ACTGCGGACAGGCTTGTTCAAGATCGACGAAACTAGGGAAAAGGTGCAAGTGATGTCGTTGGAGCTGGAGGATGCCAAGAAGAAGGTGGC
TGAGTTCCAGAAGCAGTGTGAGGAGTACCTGGTCATCATTGTGCAGCAGAAGCGGGAGGCAGATGAGCAGCAGAAGGCCGTAACAGCCAA
CAGTGAAAAGATTGCAGTTGAGGAAATCAAGTGTCAGGCACTGGCTGACAATGCCCAGAAAGATCTAGAAGAGGCACTGCCCGCCCTGGA
AGAGGCCATGCGGGCCCTGGAGTCTCTGAACAAGAAGGATATAGGAGAGATCAAGTCTTATGGACGGCCCCCAGCCCAAGTGGAGATAGT
GATGCAGGCAGTTATGATTCTTCGAGGCAACGAGCCCACATGGGCAGAGGCCAAGAGGCAGCTAGGGGAACAGAACTTCATCAAGTCACT
GATCAACTTTGATAAAGACAATATCTCAGATAAGGTTCTGAAGAAGATTGGGGCCTACTGCGCCCAGCCTGACTTCCAGCCTGATATCAT
CGGCCGCGTCTCCCTGGCTGCCAAGTCCCTCTGCATGTGGGTGCGGGCCATGGAGCTGTATGGGCGGCTATATCGGGTGGTGGAGCCCAA
GCGAATCCGAATGAACGCTGCCTTGGCTCAGCTTCGGGAGAAGCAAGCCGCGCTCGCTGAGGCCCAGGAGAAGCTGCGGGAGGTAGCTGA
GAAACTGGAGATGCTAAAGAAACAGTATGATGAGAAGCTGGCACAGAAGGAGGAGCTTCGCAAGAAGTCTGAAGAGATGGAGCTGAAGCT
GGAGCGAGCTGGGATGCTCGTGTCGGGGTTGGCTGGCGAGAAGGCCAGATGGGAGGAGACAGTCCAGGGCCTGGAGGAGGACCTGGGCTA
CCTGGTGGGGGACTGTCTCCTGGCAGCTGCCTTCCTGTCCTACATGGGACCCTTCCTGACCAACTACCGGGATGAGATTGTCAACCAAAT
CTGGATCGGGAAGATCTGGGAGCTTCAGGTTCCTTGCTCCCCTTCTTTCGCCATCGATAACTTCCTGTGCAATCCTACCAAAGTCCGGGA
CTGGAACATCCAAGGGTTGCCCTCAGACGCCTTCTCCACTGAGAATGGCATCATCGTCACCCGAGGCAACAGGTGGGCACTGATGATCGA
CCCTCAGGCCCAGGCCCTGAAATGGATTAAGAACATGGAAGGAGGCCAGGGCCTGAAGATCATCGACCTGCAGATGAGCGATTACCTGCG
AATCCTAGAACACGCCATTCACTTTGGATACCCGGTGCTACTTCAGAACGTGCAGGAATATCTGGACCCCACACTGAACCCCATGCTCAA
CAAATCTGTAGCCCGAATCGGTGGTCGGCTGTTGATGCGCATTGGCGATAAGGAGGTGGAATATAATACCAATTTCCGTTTCTACATCAC
CACCAAGCTCTCCAACCCCCACTACAGCCCAGAGACCTCAGCCAAGACCACCATCGTCAACTTTGCTGTTAAAGAACAGGGCCTGGAGGC
CCAGCTGCTGGGCATTGTGGTGCGGAAGGAGCGGCCTGAGCTGGAGGAGCAGAAGGACTCACTGGTCATCAACATCGCGGCTGGTAAAAG
GAAGCTCAAGGAGCTGGAGGATGAGATCCTGCGGCTGCTGAATGAGGCCACCGGCTCCCTGCTGGATGATGTGCAGCTGGTGAACACGCT
GCATACCTCCAAGATCACAGCCACAGAGGTGACTGAGCAGCTGGAGACCAGTGAGACCACAGAGATCAACACTGACTTGGCGCGGGAGGC
TTACCGCCCATGCGCCCAGCGGGCATCAATCCTGTTCTTCGTGCTCAATGATATGGGCTGCATCGACCCCATGTACCAGTTCTCACTGGA
TGCCTACATCAGCCTCTTTATTCTCAGCATTGACAAAAGCCACCGCAGCAATAAGCTGGAGGACCGCATTGACTACCTGAATGACTACCA
CACCTACGCTGTCTACAGGTACACCTGCCGTACCCTTTTCGAACGCCACAAACTACTATTCAGTTTTCATATGTGTGCCAAAATCTTGGA
GACTTCTGGCAAGCTCAACATGGATGAATACAACTTCTTTCTACGTGGGGGTGTGGTCTTGGATCGGGAGGGCCAAATGGACAATCCATG
TAGTAGCTGGCTTGCAGATGCCTACTGGGATAACATCACAGAGCTAGACAAACTGACCAACTTCCACGGACTCATGAACTCCTTTGAGCA
GTACCCTCGTGACTGGCACCTGTGGTATACCAATGCTGCCCCGGAGAAGGCGATGCTGCCAGGTGAGTGGGAAAATGCCTGCAATGAAAT
GCAACGGATGCTGATCGTTCGCTCCCTGCGCCAGGACCGCGTGGCCTTCTGCGTGACCTCCTTCATCATCACCAACCTTGGCTCCCGCTT
CATCGAGCCGCCTGTGCTGAATATGAAGTCGGTGCTGGAGGATTCAACCCCACGATCCCCACTCGTGTTCATCCTGTCCCCTGGTGTGGA
CCCCACCAGTGCCCTGCTGCAGCTGGCAGAGCACATGGGCATGGCCCAGCGCTTCCACGCCCTGTCCCTGGGCCAGGGCCAGGCCCCCAT
CGCTGCTCGGCTCCTCCGAGAGGGTGTGACTCAGGGACACTGGGTGTTCCTGGCAAACTGCCACCTGTCACTGTCTTGGATGCCTAATCT
GGACAAGCTGGTGGAGCAGCTGCAGGTGGAGGATCCTCATCCATCCTTCCGCCTCTGGCTCAGCTCCATCCCCCACCCAGACTTCCCTAT
CTCAATCTTGCAGGTCAGCATCAAGATGACCACAGAGCCACCAAAGGGCCTAAAGGCCAACATGACACGTCTTTACCAACTGATGTCAGA
ACCACAGTTTTCCCGCTGCTCCAAACCTGCCAAATATAAGAAGCTGCTGTTTTCACTCTGTTTCTTCCACTCTGTGTTACTTGAACGCAA
AAAGTTCCTGCAGCTTGGCTGGAACATCATCTATGGCTTCAATGACTCCGACTTTGAGGTGTCAGAAAACTTGCTGAGCCTCTATCTCGA
TGAGTACGAGGAGACACCTTGGGACGCACTTAAGTACCTCATTGCCGGCATCAACTATGGTGGACATGTCACAGATGACTGGGACCGGCG
CCTGCTGACCACCTACATCAATGATTATTTCTGTGACCAGTCTCTATCAACTCCCTTCCACCGGTTGTCAGCACTGGAGACTTATTTCAT
CCCCAAGGATGGCAGCCTCGCTTCTTACAAGGAATACATCAGCTTATTGCCTGGCATGGACCCCCCTGAGGCCTTTGGCCAGCACCCCAA
TGCTGATGTGGCCTCTCAGATCACTGAGGCACAAACCCTCTTTGATACTTTGCTTTCCTTGCAACCTCAGATTACACCCACCAGGGCTGG
AGGCCAGACCCGGGAAGAGAAGGTCCTTGAGTTGGCCGCTGATGTGAAGCAGAAGATCCCTGAAATGATCGACTATGAGGGGACTCAAAA
ACTGCTAGCTCTCGACCCCTCCCCCCTCAATGTGGTCCTTCTGCAGGAGATCCAGAGATACAACACACTGATGCAGACCATCCTGTTCTC
ACTGACAGACCTAGAGAAAGGCATCCAGGGTCTCATCGTCATGTCTACAAGCCTGGAAGAGATTTTCAATTGCATCTTTGATGCCCATGT
TCCTCCGCTCTGGGGAAAGGCATACCCCTCACAAAAGCCATTGGCTGCCTGGACCCGGGACTTGGCCATGCGTGTGGAGCAGTTTGAGCT
GTGGGCCAGCCGGGCCCGGCCTCCTGTGATCTTCTGGTTGTCTGGTTTCACCTTTCCCACTGGCTTCCTCACTGCTGTGCTGCAGTCTTC
AGCTCGCCAAAACAACGTTTCAGTGGACAGCCTCTCCTGGGAGTTTATCGTTTCCACTGTGGATGACAGCAACCTAGTGTATCCCCCCAA
GGATGGTGTCTGGGTCCGGGGCCTGTACCTGGAAGGTGCTGGCTGGGACCGGAAGAACTCCTGCTTGGTGGAGGCAGAGCCCATGCAGCT
TGTCTGCCTCATGCCCACGATCCACTTCCGGCCTGCAGAGAGCCGCAAGAAGAGCGCCAAGGGCATGTACTCCTGCCCCTGCTATTACTA
TCCCAACCGGGCAGGCAGCTCAGACCGAGCCTCCTTTGTCATCGGCATTGACCTGCGGTCTGGGGCCATGACACCTGATCATTGGATCAA
GAGGGGCACTGCTCTACTCATGAGCCTGGACAGCTGAGACCTCCTCCTCTTCTCCGCTTGAGAGAGAGGGTCAGGGACTCCAGGAGCTAA
GACAGATGTTGCACCTAGGACTGAGGCCGGACCTCACTCAGACTTTGACCTTGGCCGAATTTGTGTGATGTGGCCCTGGAGATACCTAGT

>6122_6122_2_ARHGAP44-DNAH2_ARHGAP44_chr17_12693208_ENST00000262444_DNAH2_chr17_7704896_ENST00000572933_length(amino acids)=1545AA_BP=17
MKKQFNRMRQLANQTVGRNWIRQYPALVNCTTINWFSEWPQEALLEVAEKCLIGVDLGTQENIHRKVAQIFVTMHWSVAQYSQKMLLELR
RHNYVTPTKYLELLSGYKKLLGEKRQELLAQANKLRTGLFKIDETREKVQVMSLELEDAKKKVAEFQKQCEEYLVIIVQQKREADEQQKA
VTANSEKIAVEEIKCQALADNAQKDLEEALPALEEAMRALESLNKKDIGEIKSYGRPPAQVEIVMQAVMILRGNEPTWAEAKRQLGEQNF
IKSLINFDKDNISDKVLKKIGAYCAQPDFQPDIIGRVSLAAKSLCMWVRAMELYGRLYRVVEPKRIRMNAALAQLREKQAALAEAQEKLR
EVAEKLEMLKKQYDEKLAQKEELRKKSEEMELKLERAGMLVSGLAGEKARWEETVQGLEEDLGYLVGDCLLAAAFLSYMGPFLTNYRDEI
VNQIWIGKIWELQVPCSPSFAIDNFLCNPTKVRDWNIQGLPSDAFSTENGIIVTRGNRWALMIDPQAQALKWIKNMEGGQGLKIIDLQMS
DYLRILEHAIHFGYPVLLQNVQEYLDPTLNPMLNKSVARIGGRLLMRIGDKEVEYNTNFRFYITTKLSNPHYSPETSAKTTIVNFAVKEQ
GLEAQLLGIVVRKERPELEEQKDSLVINIAAGKRKLKELEDEILRLLNEATGSLLDDVQLVNTLHTSKITATEVTEQLETSETTEINTDL
AREAYRPCAQRASILFFVLNDMGCIDPMYQFSLDAYISLFILSIDKSHRSNKLEDRIDYLNDYHTYAVYRYTCRTLFERHKLLFSFHMCA
KILETSGKLNMDEYNFFLRGGVVLDREGQMDNPCSSWLADAYWDNITELDKLTNFHGLMNSFEQYPRDWHLWYTNAAPEKAMLPGEWENA
CNEMQRMLIVRSLRQDRVAFCVTSFIITNLGSRFIEPPVLNMKSVLEDSTPRSPLVFILSPGVDPTSALLQLAEHMGMAQRFHALSLGQG
QAPIAARLLREGVTQGHWVFLANCHLSLSWMPNLDKLVEQLQVEDPHPSFRLWLSSIPHPDFPISILQVSIKMTTEPPKGLKANMTRLYQ
LMSEPQFSRCSKPAKYKKLLFSLCFFHSVLLERKKFLQLGWNIIYGFNDSDFEVSENLLSLYLDEYEETPWDALKYLIAGINYGGHVTDD
WDRRLLTTYINDYFCDQSLSTPFHRLSALETYFIPKDGSLASYKEYISLLPGMDPPEAFGQHPNADVASQITEAQTLFDTLLSLQPQITP
TRAGGQTREEKVLELAADVKQKIPEMIDYEGTQKLLALDPSPLNVVLLQEIQRYNTLMQTILFSLTDLEKGIQGLIVMSTSLEEIFNCIF
DAHVPPLWGKAYPSQKPLAAWTRDLAMRVEQFELWASRARPPVIFWLSGFTFPTGFLTAVLQSSARQNNVSVDSLSWEFIVSTVDDSNLV
YPPKDGVWVRGLYLEGAGWDRKNSCLVEAEPMQLVCLMPTIHFRPAESRKKSAKGMYSCPCYYYPNRAGSSDRASFVIGIDLRSGAMTPD

--------------------------------------------------------------
>6122_6122_3_ARHGAP44-DNAH2_ARHGAP44_chr17_12693208_ENST00000340825_DNAH2_chr17_7704896_ENST00000389173_length(transcript)=5145nt_BP=353nt
GACTGGGAGCAGGCAGCCCGGGCGGAGCGGGCCGGTGCCGAGGACGGCCCCAGGCATTGCTCTGCCCCGGGCATTGCGCGGCGCGCGTGA
GGGGGATGCGGCAGGAGGCGGCGCGGCGGGAGGAGTAGGCGGCGGCGCCCTCGGGAGGGAGCTGCGCGCGGGCCAGACGGCGCCCGGAGG
CTCCGCAGTGCCGCCGCCGTCGCCCGGGAGGCTCCGCGCGGGAGCCATGTAACCCTGCGGCGGGCTCCGGGCTGCTCCGTCCTTCCCCAG
CTCCCGGGCTAGCGCGGCAGCGGGGCCACGATGAAGAAGCAGTTCAATCGCATGCGCCAGCTGGCCAACCAGACGGTGGGCAGGAACTGG
ATCCGCCAGTACCCAGCCTTGGTGAACTGCACAACCATCAACTGGTTCTCAGAGTGGCCCCAAGAGGCCCTGCTCGAGGTGGCTGAGAAG
TGCCTCATAGGAGTAGACCTGGGAACTCAGGAGAATATCCACAGGAAGGTGGCCCAGATCTTTGTCACTATGCACTGGTCAGTAGCTCAG
TATTCCCAGAAGATGCTGTTGGAACTGCGGAGACACAACTATGTCACACCCACCAAATACCTGGAACTCCTGTCTGGATATAAGAAGTTG
CTGGGAGAAAAACGGCAGGAGCTGCTGGCCCAAGCCAATAAACTGCGGACAGGCTTGTTCAAGATCGACGAAACTAGGGAAAAGGTGCAA
GTGATGTCGTTGGAGCTGGAGGATGCCAAGAAGAAGGTGGCTGAGTTCCAGAAGCAGTGTGAGGAGTACCTGGTCATCATTGTGCAGCAG
AAGCGGGAGGCAGATGAGCAGCAGAAGGCCGTAACAGCCAACAGTGAAAAGATTGCAGTTGAGGAAATCAAGTGTCAGGCACTGGCTGAC
AATGCCCAGAAAGATCTAGAAGAGGCACTGCCCGCCCTGGAAGAGGCCATGCGGGCCCTGGAGTCTCTGAACAAGAAGGATATAGGAGAG
ATCAAGTCTTATGGACGGCCCCCAGCCCAAGTGGAGATAGTGATGCAGGCAGTTATGATTCTTCGAGGCAACGAGCCCACATGGGCAGAG
GCCAAGAGGCAGCTAGGGGAACAGAACTTCATCAAGTCACTGATCAACTTTGATAAAGACAATATCTCAGATAAGGTTCTGAAGAAGATT
GGGGCCTACTGCGCCCAGCCTGACTTCCAGCCTGATATCATCGGCCGCGTCTCCCTGGCTGCCAAGTCCCTCTGCATGTGGGTGCGGGCC
ATGGAGCTGTATGGGCGGCTATATCGGGTGGTGGAGCCCAAGCGAATCCGAATGAACGCTGCCTTGGCTCAGCTTCGGGAGAAGCAAGCC
GCGCTCGCTGAGGCCCAGGAGAAGCTGCGGGAGGTAGCTGAGAAACTGGAGATGCTAAAGAAACAGTATGATGAGAAGCTGGCACAGAAG
GAGGAGCTTCGCAAGAAGTCTGAAGAGATGGAGCTGAAGCTGGAGCGAGCTGGGATGCTCGTGTCGGGGTTGGCTGGCGAGAAGGCCAGA
TGGGAGGAGACAGTCCAGGGCCTGGAGGAGGACCTGGGCTACCTGGTGGGGGACTGTCTCCTGGCAGCTGCCTTCCTGTCCTACATGGGA
CCCTTCCTGACCAACTACCGGGATGAGATTGTCAACCAAATCTGGATCGGGAAGATCTGGGAGCTTCAGGTTCCTTGCTCCCCTTCTTTC
GCCATCGATAACTTCCTGTGCAATCCTACCAAAGTCCGGGACTGGAACATCCAAGGGTTGCCCTCAGACGCCTTCTCCACTGAGAATGGC
ATCATCGTCACCCGAGGCAACAGGTGGGCACTGATGATCGACCCTCAGGCCCAGGCCCTGAAATGGATTAAGAACATGGAAGGAGGCCAG
GGCCTGAAGATCATCGACCTGCAGATGAGCGATTACCTGCGAATCCTAGAACACGCCATTCACTTTGGATACCCGGTGCTACTTCAGAAC
GTGCAGGAATATCTGGACCCCACACTGAACCCCATGCTCAACAAATCTGTAGCCCGAATCGGTGGTCGGCTGTTGATGCGCATTGGCGAT
AAGGAGGTGGAATATAATACCAATTTCCGTTTCTACATCACCACCAAGCTCTCCAACCCCCACTACAGCCCAGAGACCTCAGCCAAGACC
ACCATCGTCAACTTTGCTGTTAAAGAACAGGGCCTGGAGGCCCAGCTGCTGGGCATTGTGGTGCGGAAGGAGCGGCCTGAGCTGGAGGAG
CAGAAGGACTCACTGGTCATCAACATCGCGGCTGGTAAAAGGAAGCTCAAGGAGCTGGAGGATGAGATCCTGCGGCTGCTGAATGAGGCC
ACCGGCTCCCTGCTGGATGATGTGCAGCTGGTGAACACGCTGCATACCTCCAAGATCACAGCCACAGAGGTGACTGAGCAGCTGGAGACC
AGTGAGACCACAGAGATCAACACTGACTTGGCGCGGGAGGCTTACCGCCCATGCGCCCAGCGGGCATCAATCCTGTTCTTCGTGCTCAAT
GATATGGGCTGCATCGACCCCATGTACCAGTTCTCACTGGATGCCTACATCAGCCTCTTTATTCTCAGCATTGACAAAAGCCACCGCAGC
AATAAGCTGGAGGACCGCATTGACTACCTGAATGACTACCACACCTACGCTGTCTACAGGTACACCTGCCGTACCCTTTTCGAACGCCAC
AAACTACTATTCAGTTTTCATATGTGTGCCAAAATCTTGGAGACTTCTGGCAAGCTCAACATGGATGAATACAACTTCTTTCTACGTGGG
GGTGTGGTCTTGGATCGGGAGGGCCAAATGGACAATCCATGTAGTAGCTGGCTTGCAGATGCCTACTGGGATAACATCACAGAGCTAGAC
AAACTGACCAACTTCCACGGACTCATGAACTCCTTTGAGCAGTACCCTCGTGACTGGCACCTGTGGTATACCAATGCTGCCCCGGAGAAG
GCGATGCTGCCAGGTGAGTGGGAAAATGCCTGCAATGAAATGCAACGGATGCTGATCGTTCGCTCCCTGCGCCAGGACCGCGTGGCCTTC
TGCGTGACCTCCTTCATCATCACCAACCTTGGCTCCCGCTTCATCGAGCCGCCTGTGCTGAATATGAAGTCGGTGCTGGAGGATTCAACC
CCACGATCCCCACTCGTGTTCATCCTGTCCCCTGGTGTGGACCCCACCAGTGCCCTGCTGCAGCTGGCAGAGCACATGGGCATGGCCCAG
CGCTTCCACGCCCTGTCCCTGGGCCAGGGCCAGGCCCCCATCGCTGCTCGGCTCCTCCGAGAGGGTGTGACTCAGGGACACTGGGTGTTC
CTGGCAAACTGCCACCTGTCACTGTCTTGGATGCCTAATCTGGACAAGCTGGTGGAGCAGCTGCAGGTGGAGGATCCTCATCCATCCTTC
CGCCTCTGGCTCAGCTCCATCCCCCACCCAGACTTCCCTATCTCAATCTTGCAGGTCAGCATCAAGATGACCACAGAGCCACCAAAGGGC
CTAAAGGCCAACATGACACGTCTTTACCAACTGATGTCAGAACCACAGTTTTCCCGCTGCTCCAAACCTGCCAAATATAAGAAGCTGCTG
TTTTCACTCTGTTTCTTCCACTCTGTGTTACTTGAACGCAAAAAGTTCCTGCAGCTTGGCTGGAACATCATCTATGGCTTCAATGACTCC
GACTTTGAGGTGTCAGAAAACTTGCTGAGCCTCTATCTCGATGAGTACGAGGAGACACCTTGGGACGCACTTAAGTACCTCATTGCCGGC
ATCAACTATGGTGGACATGTCACAGATGACTGGGACCGGCGCCTGCTGACCACCTACATCAATGATTATTTCTGTGACCAGTCTCTATCA
ACTCCCTTCCACCGGTTGTCAGCACTGGAGACTTATTTCATCCCCAAGGATGGCAGCCTCGCTTCTTACAAGGAATACATCAGCTTATTG
CCTGGCATGGACCCCCCTGAGGCCTTTGGCCAGCACCCCAATGCTGATGTGGCCTCTCAGATCACTGAGGCACAAACCCTCTTTGATACT
TTGCTTTCCTTGCAACCTCAGATTACACCCACCAGGGCTGGAGGCCAGACCCGGGAAGAGAAGGTCCTTGAGTTGGCCGCTGATGTGAAG
CAGAAGATCCCTGAAATGATCGACTATGAGGGGACTCAAAAACTGCTAGCTCTCGACCCCTCCCCCCTCAATGTGGTCCTTCTGCAGGAG
ATCCAGAGATACAACACACTGATGCAGACCATCCTGTTCTCACTGACAGACCTAGAGAAAGGCATCCAGGGTCTCATCGTCATGTCTACA
AGCCTGGAAGAGATTTTCAATTGCATCTTTGATGCCCATGTTCCTCCGCTCTGGGGAAAGGCATACCCCTCACAAAAGCCATTGGCTGCC
TGGACCCGGGACTTGGCCATGCGTGTGGAGCAGTTTGAGCTGTGGGCCAGCCGGGCCCGGCCTCCTGTGATCTTCTGGTTGTCTGGTTTC
ACCTTTCCCACTGGCTTCCTCACTGCTGTGCTGCAGTCTTCAGCTCGCCAAAACAACGTTTCAGTGGACAGCCTCTCCTGGGAGTTTATC
GTTTCCACTGTGGATGACAGCAACCTAGTGTATCCCCCCAAGGATGGTGTCTGGGTCCGGGGCCTGTACCTGGAAGGTGCTGGCTGGGAC
CGGAAGAACTCCTGCTTGGTGGAGGCAGAGCCCATGCAGCTTGTCTGCCTCATGCCCACGATCCACTTCCGGCCTGCAGAGAGCCGCAAG
AAGAGCGCCAAGGGCATGTACTCCTGCCCCTGCTATTACTATCCCAACCGGGCAGGCAGCTCAGACCGAGCCTCCTTTGTCATCGGCATT
GACCTGCGGTCTGGGGCCATGACACCTGATCATTGGATCAAGAGGGGCACTGCTCTACTCATGAGCCTGGACAGCTGAGACCTCCTCCTC
TTCTCCGCTTGAGAGAGAGGGTCAGGGACTCCAGGAGCTAAGACAGATGTTGCACCTAGGACTGAGGCCGGACCTCACTCAGACTTTGAC
CTTGGCCGAATTTGTGTGATGTGGCCCTGGAGATACCTAGTTGTGTTAGCCATAAAAGTGAAAGAGTTGTATTGGAGCTCAGTGCTGTAA

>6122_6122_3_ARHGAP44-DNAH2_ARHGAP44_chr17_12693208_ENST00000340825_DNAH2_chr17_7704896_ENST00000389173_length(amino acids)=1545AA_BP=17
MKKQFNRMRQLANQTVGRNWIRQYPALVNCTTINWFSEWPQEALLEVAEKCLIGVDLGTQENIHRKVAQIFVTMHWSVAQYSQKMLLELR
RHNYVTPTKYLELLSGYKKLLGEKRQELLAQANKLRTGLFKIDETREKVQVMSLELEDAKKKVAEFQKQCEEYLVIIVQQKREADEQQKA
VTANSEKIAVEEIKCQALADNAQKDLEEALPALEEAMRALESLNKKDIGEIKSYGRPPAQVEIVMQAVMILRGNEPTWAEAKRQLGEQNF
IKSLINFDKDNISDKVLKKIGAYCAQPDFQPDIIGRVSLAAKSLCMWVRAMELYGRLYRVVEPKRIRMNAALAQLREKQAALAEAQEKLR
EVAEKLEMLKKQYDEKLAQKEELRKKSEEMELKLERAGMLVSGLAGEKARWEETVQGLEEDLGYLVGDCLLAAAFLSYMGPFLTNYRDEI
VNQIWIGKIWELQVPCSPSFAIDNFLCNPTKVRDWNIQGLPSDAFSTENGIIVTRGNRWALMIDPQAQALKWIKNMEGGQGLKIIDLQMS
DYLRILEHAIHFGYPVLLQNVQEYLDPTLNPMLNKSVARIGGRLLMRIGDKEVEYNTNFRFYITTKLSNPHYSPETSAKTTIVNFAVKEQ
GLEAQLLGIVVRKERPELEEQKDSLVINIAAGKRKLKELEDEILRLLNEATGSLLDDVQLVNTLHTSKITATEVTEQLETSETTEINTDL
AREAYRPCAQRASILFFVLNDMGCIDPMYQFSLDAYISLFILSIDKSHRSNKLEDRIDYLNDYHTYAVYRYTCRTLFERHKLLFSFHMCA
KILETSGKLNMDEYNFFLRGGVVLDREGQMDNPCSSWLADAYWDNITELDKLTNFHGLMNSFEQYPRDWHLWYTNAAPEKAMLPGEWENA
CNEMQRMLIVRSLRQDRVAFCVTSFIITNLGSRFIEPPVLNMKSVLEDSTPRSPLVFILSPGVDPTSALLQLAEHMGMAQRFHALSLGQG
QAPIAARLLREGVTQGHWVFLANCHLSLSWMPNLDKLVEQLQVEDPHPSFRLWLSSIPHPDFPISILQVSIKMTTEPPKGLKANMTRLYQ
LMSEPQFSRCSKPAKYKKLLFSLCFFHSVLLERKKFLQLGWNIIYGFNDSDFEVSENLLSLYLDEYEETPWDALKYLIAGINYGGHVTDD
WDRRLLTTYINDYFCDQSLSTPFHRLSALETYFIPKDGSLASYKEYISLLPGMDPPEAFGQHPNADVASQITEAQTLFDTLLSLQPQITP
TRAGGQTREEKVLELAADVKQKIPEMIDYEGTQKLLALDPSPLNVVLLQEIQRYNTLMQTILFSLTDLEKGIQGLIVMSTSLEEIFNCIF
DAHVPPLWGKAYPSQKPLAAWTRDLAMRVEQFELWASRARPPVIFWLSGFTFPTGFLTAVLQSSARQNNVSVDSLSWEFIVSTVDDSNLV
YPPKDGVWVRGLYLEGAGWDRKNSCLVEAEPMQLVCLMPTIHFRPAESRKKSAKGMYSCPCYYYPNRAGSSDRASFVIGIDLRSGAMTPD

--------------------------------------------------------------
>6122_6122_4_ARHGAP44-DNAH2_ARHGAP44_chr17_12693208_ENST00000340825_DNAH2_chr17_7704896_ENST00000572933_length(transcript)=5149nt_BP=353nt
GACTGGGAGCAGGCAGCCCGGGCGGAGCGGGCCGGTGCCGAGGACGGCCCCAGGCATTGCTCTGCCCCGGGCATTGCGCGGCGCGCGTGA
GGGGGATGCGGCAGGAGGCGGCGCGGCGGGAGGAGTAGGCGGCGGCGCCCTCGGGAGGGAGCTGCGCGCGGGCCAGACGGCGCCCGGAGG
CTCCGCAGTGCCGCCGCCGTCGCCCGGGAGGCTCCGCGCGGGAGCCATGTAACCCTGCGGCGGGCTCCGGGCTGCTCCGTCCTTCCCCAG
CTCCCGGGCTAGCGCGGCAGCGGGGCCACGATGAAGAAGCAGTTCAATCGCATGCGCCAGCTGGCCAACCAGACGGTGGGCAGGAACTGG
ATCCGCCAGTACCCAGCCTTGGTGAACTGCACAACCATCAACTGGTTCTCAGAGTGGCCCCAAGAGGCCCTGCTCGAGGTGGCTGAGAAG
TGCCTCATAGGAGTAGACCTGGGAACTCAGGAGAATATCCACAGGAAGGTGGCCCAGATCTTTGTCACTATGCACTGGTCAGTAGCTCAG
TATTCCCAGAAGATGCTGTTGGAACTGCGGAGACACAACTATGTCACACCCACCAAATACCTGGAACTCCTGTCTGGATATAAGAAGTTG
CTGGGAGAAAAACGGCAGGAGCTGCTGGCCCAAGCCAATAAACTGCGGACAGGCTTGTTCAAGATCGACGAAACTAGGGAAAAGGTGCAA
GTGATGTCGTTGGAGCTGGAGGATGCCAAGAAGAAGGTGGCTGAGTTCCAGAAGCAGTGTGAGGAGTACCTGGTCATCATTGTGCAGCAG
AAGCGGGAGGCAGATGAGCAGCAGAAGGCCGTAACAGCCAACAGTGAAAAGATTGCAGTTGAGGAAATCAAGTGTCAGGCACTGGCTGAC
AATGCCCAGAAAGATCTAGAAGAGGCACTGCCCGCCCTGGAAGAGGCCATGCGGGCCCTGGAGTCTCTGAACAAGAAGGATATAGGAGAG
ATCAAGTCTTATGGACGGCCCCCAGCCCAAGTGGAGATAGTGATGCAGGCAGTTATGATTCTTCGAGGCAACGAGCCCACATGGGCAGAG
GCCAAGAGGCAGCTAGGGGAACAGAACTTCATCAAGTCACTGATCAACTTTGATAAAGACAATATCTCAGATAAGGTTCTGAAGAAGATT
GGGGCCTACTGCGCCCAGCCTGACTTCCAGCCTGATATCATCGGCCGCGTCTCCCTGGCTGCCAAGTCCCTCTGCATGTGGGTGCGGGCC
ATGGAGCTGTATGGGCGGCTATATCGGGTGGTGGAGCCCAAGCGAATCCGAATGAACGCTGCCTTGGCTCAGCTTCGGGAGAAGCAAGCC
GCGCTCGCTGAGGCCCAGGAGAAGCTGCGGGAGGTAGCTGAGAAACTGGAGATGCTAAAGAAACAGTATGATGAGAAGCTGGCACAGAAG
GAGGAGCTTCGCAAGAAGTCTGAAGAGATGGAGCTGAAGCTGGAGCGAGCTGGGATGCTCGTGTCGGGGTTGGCTGGCGAGAAGGCCAGA
TGGGAGGAGACAGTCCAGGGCCTGGAGGAGGACCTGGGCTACCTGGTGGGGGACTGTCTCCTGGCAGCTGCCTTCCTGTCCTACATGGGA
CCCTTCCTGACCAACTACCGGGATGAGATTGTCAACCAAATCTGGATCGGGAAGATCTGGGAGCTTCAGGTTCCTTGCTCCCCTTCTTTC
GCCATCGATAACTTCCTGTGCAATCCTACCAAAGTCCGGGACTGGAACATCCAAGGGTTGCCCTCAGACGCCTTCTCCACTGAGAATGGC
ATCATCGTCACCCGAGGCAACAGGTGGGCACTGATGATCGACCCTCAGGCCCAGGCCCTGAAATGGATTAAGAACATGGAAGGAGGCCAG
GGCCTGAAGATCATCGACCTGCAGATGAGCGATTACCTGCGAATCCTAGAACACGCCATTCACTTTGGATACCCGGTGCTACTTCAGAAC
GTGCAGGAATATCTGGACCCCACACTGAACCCCATGCTCAACAAATCTGTAGCCCGAATCGGTGGTCGGCTGTTGATGCGCATTGGCGAT
AAGGAGGTGGAATATAATACCAATTTCCGTTTCTACATCACCACCAAGCTCTCCAACCCCCACTACAGCCCAGAGACCTCAGCCAAGACC
ACCATCGTCAACTTTGCTGTTAAAGAACAGGGCCTGGAGGCCCAGCTGCTGGGCATTGTGGTGCGGAAGGAGCGGCCTGAGCTGGAGGAG
CAGAAGGACTCACTGGTCATCAACATCGCGGCTGGTAAAAGGAAGCTCAAGGAGCTGGAGGATGAGATCCTGCGGCTGCTGAATGAGGCC
ACCGGCTCCCTGCTGGATGATGTGCAGCTGGTGAACACGCTGCATACCTCCAAGATCACAGCCACAGAGGTGACTGAGCAGCTGGAGACC
AGTGAGACCACAGAGATCAACACTGACTTGGCGCGGGAGGCTTACCGCCCATGCGCCCAGCGGGCATCAATCCTGTTCTTCGTGCTCAAT
GATATGGGCTGCATCGACCCCATGTACCAGTTCTCACTGGATGCCTACATCAGCCTCTTTATTCTCAGCATTGACAAAAGCCACCGCAGC
AATAAGCTGGAGGACCGCATTGACTACCTGAATGACTACCACACCTACGCTGTCTACAGGTACACCTGCCGTACCCTTTTCGAACGCCAC
AAACTACTATTCAGTTTTCATATGTGTGCCAAAATCTTGGAGACTTCTGGCAAGCTCAACATGGATGAATACAACTTCTTTCTACGTGGG
GGTGTGGTCTTGGATCGGGAGGGCCAAATGGACAATCCATGTAGTAGCTGGCTTGCAGATGCCTACTGGGATAACATCACAGAGCTAGAC
AAACTGACCAACTTCCACGGACTCATGAACTCCTTTGAGCAGTACCCTCGTGACTGGCACCTGTGGTATACCAATGCTGCCCCGGAGAAG
GCGATGCTGCCAGGTGAGTGGGAAAATGCCTGCAATGAAATGCAACGGATGCTGATCGTTCGCTCCCTGCGCCAGGACCGCGTGGCCTTC
TGCGTGACCTCCTTCATCATCACCAACCTTGGCTCCCGCTTCATCGAGCCGCCTGTGCTGAATATGAAGTCGGTGCTGGAGGATTCAACC
CCACGATCCCCACTCGTGTTCATCCTGTCCCCTGGTGTGGACCCCACCAGTGCCCTGCTGCAGCTGGCAGAGCACATGGGCATGGCCCAG
CGCTTCCACGCCCTGTCCCTGGGCCAGGGCCAGGCCCCCATCGCTGCTCGGCTCCTCCGAGAGGGTGTGACTCAGGGACACTGGGTGTTC
CTGGCAAACTGCCACCTGTCACTGTCTTGGATGCCTAATCTGGACAAGCTGGTGGAGCAGCTGCAGGTGGAGGATCCTCATCCATCCTTC
CGCCTCTGGCTCAGCTCCATCCCCCACCCAGACTTCCCTATCTCAATCTTGCAGGTCAGCATCAAGATGACCACAGAGCCACCAAAGGGC
CTAAAGGCCAACATGACACGTCTTTACCAACTGATGTCAGAACCACAGTTTTCCCGCTGCTCCAAACCTGCCAAATATAAGAAGCTGCTG
TTTTCACTCTGTTTCTTCCACTCTGTGTTACTTGAACGCAAAAAGTTCCTGCAGCTTGGCTGGAACATCATCTATGGCTTCAATGACTCC
GACTTTGAGGTGTCAGAAAACTTGCTGAGCCTCTATCTCGATGAGTACGAGGAGACACCTTGGGACGCACTTAAGTACCTCATTGCCGGC
ATCAACTATGGTGGACATGTCACAGATGACTGGGACCGGCGCCTGCTGACCACCTACATCAATGATTATTTCTGTGACCAGTCTCTATCA
ACTCCCTTCCACCGGTTGTCAGCACTGGAGACTTATTTCATCCCCAAGGATGGCAGCCTCGCTTCTTACAAGGAATACATCAGCTTATTG
CCTGGCATGGACCCCCCTGAGGCCTTTGGCCAGCACCCCAATGCTGATGTGGCCTCTCAGATCACTGAGGCACAAACCCTCTTTGATACT
TTGCTTTCCTTGCAACCTCAGATTACACCCACCAGGGCTGGAGGCCAGACCCGGGAAGAGAAGGTCCTTGAGTTGGCCGCTGATGTGAAG
CAGAAGATCCCTGAAATGATCGACTATGAGGGGACTCAAAAACTGCTAGCTCTCGACCCCTCCCCCCTCAATGTGGTCCTTCTGCAGGAG
ATCCAGAGATACAACACACTGATGCAGACCATCCTGTTCTCACTGACAGACCTAGAGAAAGGCATCCAGGGTCTCATCGTCATGTCTACA
AGCCTGGAAGAGATTTTCAATTGCATCTTTGATGCCCATGTTCCTCCGCTCTGGGGAAAGGCATACCCCTCACAAAAGCCATTGGCTGCC
TGGACCCGGGACTTGGCCATGCGTGTGGAGCAGTTTGAGCTGTGGGCCAGCCGGGCCCGGCCTCCTGTGATCTTCTGGTTGTCTGGTTTC
ACCTTTCCCACTGGCTTCCTCACTGCTGTGCTGCAGTCTTCAGCTCGCCAAAACAACGTTTCAGTGGACAGCCTCTCCTGGGAGTTTATC
GTTTCCACTGTGGATGACAGCAACCTAGTGTATCCCCCCAAGGATGGTGTCTGGGTCCGGGGCCTGTACCTGGAAGGTGCTGGCTGGGAC
CGGAAGAACTCCTGCTTGGTGGAGGCAGAGCCCATGCAGCTTGTCTGCCTCATGCCCACGATCCACTTCCGGCCTGCAGAGAGCCGCAAG
AAGAGCGCCAAGGGCATGTACTCCTGCCCCTGCTATTACTATCCCAACCGGGCAGGCAGCTCAGACCGAGCCTCCTTTGTCATCGGCATT
GACCTGCGGTCTGGGGCCATGACACCTGATCATTGGATCAAGAGGGGCACTGCTCTACTCATGAGCCTGGACAGCTGAGACCTCCTCCTC
TTCTCCGCTTGAGAGAGAGGGTCAGGGACTCCAGGAGCTAAGACAGATGTTGCACCTAGGACTGAGGCCGGACCTCACTCAGACTTTGAC
CTTGGCCGAATTTGTGTGATGTGGCCCTGGAGATACCTAGTTGTGTTAGCCATAAAAGTGAAAGAGTTGTATTGGAGCTCAGTGCTGTAA

>6122_6122_4_ARHGAP44-DNAH2_ARHGAP44_chr17_12693208_ENST00000340825_DNAH2_chr17_7704896_ENST00000572933_length(amino acids)=1545AA_BP=17
MKKQFNRMRQLANQTVGRNWIRQYPALVNCTTINWFSEWPQEALLEVAEKCLIGVDLGTQENIHRKVAQIFVTMHWSVAQYSQKMLLELR
RHNYVTPTKYLELLSGYKKLLGEKRQELLAQANKLRTGLFKIDETREKVQVMSLELEDAKKKVAEFQKQCEEYLVIIVQQKREADEQQKA
VTANSEKIAVEEIKCQALADNAQKDLEEALPALEEAMRALESLNKKDIGEIKSYGRPPAQVEIVMQAVMILRGNEPTWAEAKRQLGEQNF
IKSLINFDKDNISDKVLKKIGAYCAQPDFQPDIIGRVSLAAKSLCMWVRAMELYGRLYRVVEPKRIRMNAALAQLREKQAALAEAQEKLR
EVAEKLEMLKKQYDEKLAQKEELRKKSEEMELKLERAGMLVSGLAGEKARWEETVQGLEEDLGYLVGDCLLAAAFLSYMGPFLTNYRDEI
VNQIWIGKIWELQVPCSPSFAIDNFLCNPTKVRDWNIQGLPSDAFSTENGIIVTRGNRWALMIDPQAQALKWIKNMEGGQGLKIIDLQMS
DYLRILEHAIHFGYPVLLQNVQEYLDPTLNPMLNKSVARIGGRLLMRIGDKEVEYNTNFRFYITTKLSNPHYSPETSAKTTIVNFAVKEQ
GLEAQLLGIVVRKERPELEEQKDSLVINIAAGKRKLKELEDEILRLLNEATGSLLDDVQLVNTLHTSKITATEVTEQLETSETTEINTDL
AREAYRPCAQRASILFFVLNDMGCIDPMYQFSLDAYISLFILSIDKSHRSNKLEDRIDYLNDYHTYAVYRYTCRTLFERHKLLFSFHMCA
KILETSGKLNMDEYNFFLRGGVVLDREGQMDNPCSSWLADAYWDNITELDKLTNFHGLMNSFEQYPRDWHLWYTNAAPEKAMLPGEWENA
CNEMQRMLIVRSLRQDRVAFCVTSFIITNLGSRFIEPPVLNMKSVLEDSTPRSPLVFILSPGVDPTSALLQLAEHMGMAQRFHALSLGQG
QAPIAARLLREGVTQGHWVFLANCHLSLSWMPNLDKLVEQLQVEDPHPSFRLWLSSIPHPDFPISILQVSIKMTTEPPKGLKANMTRLYQ
LMSEPQFSRCSKPAKYKKLLFSLCFFHSVLLERKKFLQLGWNIIYGFNDSDFEVSENLLSLYLDEYEETPWDALKYLIAGINYGGHVTDD
WDRRLLTTYINDYFCDQSLSTPFHRLSALETYFIPKDGSLASYKEYISLLPGMDPPEAFGQHPNADVASQITEAQTLFDTLLSLQPQITP
TRAGGQTREEKVLELAADVKQKIPEMIDYEGTQKLLALDPSPLNVVLLQEIQRYNTLMQTILFSLTDLEKGIQGLIVMSTSLEEIFNCIF
DAHVPPLWGKAYPSQKPLAAWTRDLAMRVEQFELWASRARPPVIFWLSGFTFPTGFLTAVLQSSARQNNVSVDSLSWEFIVSTVDDSNLV
YPPKDGVWVRGLYLEGAGWDRKNSCLVEAEPMQLVCLMPTIHFRPAESRKKSAKGMYSCPCYYYPNRAGSSDRASFVIGIDLRSGAMTPD

--------------------------------------------------------------
>6122_6122_5_ARHGAP44-DNAH2_ARHGAP44_chr17_12693208_ENST00000379672_DNAH2_chr17_7704896_ENST00000389173_length(transcript)=5145nt_BP=353nt
GACTGGGAGCAGGCAGCCCGGGCGGAGCGGGCCGGTGCCGAGGACGGCCCCAGGCATTGCTCTGCCCCGGGCATTGCGCGGCGCGCGTGA
GGGGGATGCGGCAGGAGGCGGCGCGGCGGGAGGAGTAGGCGGCGGCGCCCTCGGGAGGGAGCTGCGCGCGGGCCAGACGGCGCCCGGAGG
CTCCGCAGTGCCGCCGCCGTCGCCCGGGAGGCTCCGCGCGGGAGCCATGTAACCCTGCGGCGGGCTCCGGGCTGCTCCGTCCTTCCCCAG
CTCCCGGGCTAGCGCGGCAGCGGGGCCACGATGAAGAAGCAGTTCAATCGCATGCGCCAGCTGGCCAACCAGACGGTGGGCAGGAACTGG
ATCCGCCAGTACCCAGCCTTGGTGAACTGCACAACCATCAACTGGTTCTCAGAGTGGCCCCAAGAGGCCCTGCTCGAGGTGGCTGAGAAG
TGCCTCATAGGAGTAGACCTGGGAACTCAGGAGAATATCCACAGGAAGGTGGCCCAGATCTTTGTCACTATGCACTGGTCAGTAGCTCAG
TATTCCCAGAAGATGCTGTTGGAACTGCGGAGACACAACTATGTCACACCCACCAAATACCTGGAACTCCTGTCTGGATATAAGAAGTTG
CTGGGAGAAAAACGGCAGGAGCTGCTGGCCCAAGCCAATAAACTGCGGACAGGCTTGTTCAAGATCGACGAAACTAGGGAAAAGGTGCAA
GTGATGTCGTTGGAGCTGGAGGATGCCAAGAAGAAGGTGGCTGAGTTCCAGAAGCAGTGTGAGGAGTACCTGGTCATCATTGTGCAGCAG
AAGCGGGAGGCAGATGAGCAGCAGAAGGCCGTAACAGCCAACAGTGAAAAGATTGCAGTTGAGGAAATCAAGTGTCAGGCACTGGCTGAC
AATGCCCAGAAAGATCTAGAAGAGGCACTGCCCGCCCTGGAAGAGGCCATGCGGGCCCTGGAGTCTCTGAACAAGAAGGATATAGGAGAG
ATCAAGTCTTATGGACGGCCCCCAGCCCAAGTGGAGATAGTGATGCAGGCAGTTATGATTCTTCGAGGCAACGAGCCCACATGGGCAGAG
GCCAAGAGGCAGCTAGGGGAACAGAACTTCATCAAGTCACTGATCAACTTTGATAAAGACAATATCTCAGATAAGGTTCTGAAGAAGATT
GGGGCCTACTGCGCCCAGCCTGACTTCCAGCCTGATATCATCGGCCGCGTCTCCCTGGCTGCCAAGTCCCTCTGCATGTGGGTGCGGGCC
ATGGAGCTGTATGGGCGGCTATATCGGGTGGTGGAGCCCAAGCGAATCCGAATGAACGCTGCCTTGGCTCAGCTTCGGGAGAAGCAAGCC
GCGCTCGCTGAGGCCCAGGAGAAGCTGCGGGAGGTAGCTGAGAAACTGGAGATGCTAAAGAAACAGTATGATGAGAAGCTGGCACAGAAG
GAGGAGCTTCGCAAGAAGTCTGAAGAGATGGAGCTGAAGCTGGAGCGAGCTGGGATGCTCGTGTCGGGGTTGGCTGGCGAGAAGGCCAGA
TGGGAGGAGACAGTCCAGGGCCTGGAGGAGGACCTGGGCTACCTGGTGGGGGACTGTCTCCTGGCAGCTGCCTTCCTGTCCTACATGGGA
CCCTTCCTGACCAACTACCGGGATGAGATTGTCAACCAAATCTGGATCGGGAAGATCTGGGAGCTTCAGGTTCCTTGCTCCCCTTCTTTC
GCCATCGATAACTTCCTGTGCAATCCTACCAAAGTCCGGGACTGGAACATCCAAGGGTTGCCCTCAGACGCCTTCTCCACTGAGAATGGC
ATCATCGTCACCCGAGGCAACAGGTGGGCACTGATGATCGACCCTCAGGCCCAGGCCCTGAAATGGATTAAGAACATGGAAGGAGGCCAG
GGCCTGAAGATCATCGACCTGCAGATGAGCGATTACCTGCGAATCCTAGAACACGCCATTCACTTTGGATACCCGGTGCTACTTCAGAAC
GTGCAGGAATATCTGGACCCCACACTGAACCCCATGCTCAACAAATCTGTAGCCCGAATCGGTGGTCGGCTGTTGATGCGCATTGGCGAT
AAGGAGGTGGAATATAATACCAATTTCCGTTTCTACATCACCACCAAGCTCTCCAACCCCCACTACAGCCCAGAGACCTCAGCCAAGACC
ACCATCGTCAACTTTGCTGTTAAAGAACAGGGCCTGGAGGCCCAGCTGCTGGGCATTGTGGTGCGGAAGGAGCGGCCTGAGCTGGAGGAG
CAGAAGGACTCACTGGTCATCAACATCGCGGCTGGTAAAAGGAAGCTCAAGGAGCTGGAGGATGAGATCCTGCGGCTGCTGAATGAGGCC
ACCGGCTCCCTGCTGGATGATGTGCAGCTGGTGAACACGCTGCATACCTCCAAGATCACAGCCACAGAGGTGACTGAGCAGCTGGAGACC
AGTGAGACCACAGAGATCAACACTGACTTGGCGCGGGAGGCTTACCGCCCATGCGCCCAGCGGGCATCAATCCTGTTCTTCGTGCTCAAT
GATATGGGCTGCATCGACCCCATGTACCAGTTCTCACTGGATGCCTACATCAGCCTCTTTATTCTCAGCATTGACAAAAGCCACCGCAGC
AATAAGCTGGAGGACCGCATTGACTACCTGAATGACTACCACACCTACGCTGTCTACAGGTACACCTGCCGTACCCTTTTCGAACGCCAC
AAACTACTATTCAGTTTTCATATGTGTGCCAAAATCTTGGAGACTTCTGGCAAGCTCAACATGGATGAATACAACTTCTTTCTACGTGGG
GGTGTGGTCTTGGATCGGGAGGGCCAAATGGACAATCCATGTAGTAGCTGGCTTGCAGATGCCTACTGGGATAACATCACAGAGCTAGAC
AAACTGACCAACTTCCACGGACTCATGAACTCCTTTGAGCAGTACCCTCGTGACTGGCACCTGTGGTATACCAATGCTGCCCCGGAGAAG
GCGATGCTGCCAGGTGAGTGGGAAAATGCCTGCAATGAAATGCAACGGATGCTGATCGTTCGCTCCCTGCGCCAGGACCGCGTGGCCTTC
TGCGTGACCTCCTTCATCATCACCAACCTTGGCTCCCGCTTCATCGAGCCGCCTGTGCTGAATATGAAGTCGGTGCTGGAGGATTCAACC
CCACGATCCCCACTCGTGTTCATCCTGTCCCCTGGTGTGGACCCCACCAGTGCCCTGCTGCAGCTGGCAGAGCACATGGGCATGGCCCAG
CGCTTCCACGCCCTGTCCCTGGGCCAGGGCCAGGCCCCCATCGCTGCTCGGCTCCTCCGAGAGGGTGTGACTCAGGGACACTGGGTGTTC
CTGGCAAACTGCCACCTGTCACTGTCTTGGATGCCTAATCTGGACAAGCTGGTGGAGCAGCTGCAGGTGGAGGATCCTCATCCATCCTTC
CGCCTCTGGCTCAGCTCCATCCCCCACCCAGACTTCCCTATCTCAATCTTGCAGGTCAGCATCAAGATGACCACAGAGCCACCAAAGGGC
CTAAAGGCCAACATGACACGTCTTTACCAACTGATGTCAGAACCACAGTTTTCCCGCTGCTCCAAACCTGCCAAATATAAGAAGCTGCTG
TTTTCACTCTGTTTCTTCCACTCTGTGTTACTTGAACGCAAAAAGTTCCTGCAGCTTGGCTGGAACATCATCTATGGCTTCAATGACTCC
GACTTTGAGGTGTCAGAAAACTTGCTGAGCCTCTATCTCGATGAGTACGAGGAGACACCTTGGGACGCACTTAAGTACCTCATTGCCGGC
ATCAACTATGGTGGACATGTCACAGATGACTGGGACCGGCGCCTGCTGACCACCTACATCAATGATTATTTCTGTGACCAGTCTCTATCA
ACTCCCTTCCACCGGTTGTCAGCACTGGAGACTTATTTCATCCCCAAGGATGGCAGCCTCGCTTCTTACAAGGAATACATCAGCTTATTG
CCTGGCATGGACCCCCCTGAGGCCTTTGGCCAGCACCCCAATGCTGATGTGGCCTCTCAGATCACTGAGGCACAAACCCTCTTTGATACT
TTGCTTTCCTTGCAACCTCAGATTACACCCACCAGGGCTGGAGGCCAGACCCGGGAAGAGAAGGTCCTTGAGTTGGCCGCTGATGTGAAG
CAGAAGATCCCTGAAATGATCGACTATGAGGGGACTCAAAAACTGCTAGCTCTCGACCCCTCCCCCCTCAATGTGGTCCTTCTGCAGGAG
ATCCAGAGATACAACACACTGATGCAGACCATCCTGTTCTCACTGACAGACCTAGAGAAAGGCATCCAGGGTCTCATCGTCATGTCTACA
AGCCTGGAAGAGATTTTCAATTGCATCTTTGATGCCCATGTTCCTCCGCTCTGGGGAAAGGCATACCCCTCACAAAAGCCATTGGCTGCC
TGGACCCGGGACTTGGCCATGCGTGTGGAGCAGTTTGAGCTGTGGGCCAGCCGGGCCCGGCCTCCTGTGATCTTCTGGTTGTCTGGTTTC
ACCTTTCCCACTGGCTTCCTCACTGCTGTGCTGCAGTCTTCAGCTCGCCAAAACAACGTTTCAGTGGACAGCCTCTCCTGGGAGTTTATC
GTTTCCACTGTGGATGACAGCAACCTAGTGTATCCCCCCAAGGATGGTGTCTGGGTCCGGGGCCTGTACCTGGAAGGTGCTGGCTGGGAC
CGGAAGAACTCCTGCTTGGTGGAGGCAGAGCCCATGCAGCTTGTCTGCCTCATGCCCACGATCCACTTCCGGCCTGCAGAGAGCCGCAAG
AAGAGCGCCAAGGGCATGTACTCCTGCCCCTGCTATTACTATCCCAACCGGGCAGGCAGCTCAGACCGAGCCTCCTTTGTCATCGGCATT
GACCTGCGGTCTGGGGCCATGACACCTGATCATTGGATCAAGAGGGGCACTGCTCTACTCATGAGCCTGGACAGCTGAGACCTCCTCCTC
TTCTCCGCTTGAGAGAGAGGGTCAGGGACTCCAGGAGCTAAGACAGATGTTGCACCTAGGACTGAGGCCGGACCTCACTCAGACTTTGAC
CTTGGCCGAATTTGTGTGATGTGGCCCTGGAGATACCTAGTTGTGTTAGCCATAAAAGTGAAAGAGTTGTATTGGAGCTCAGTGCTGTAA

>6122_6122_5_ARHGAP44-DNAH2_ARHGAP44_chr17_12693208_ENST00000379672_DNAH2_chr17_7704896_ENST00000389173_length(amino acids)=1545AA_BP=17
MKKQFNRMRQLANQTVGRNWIRQYPALVNCTTINWFSEWPQEALLEVAEKCLIGVDLGTQENIHRKVAQIFVTMHWSVAQYSQKMLLELR
RHNYVTPTKYLELLSGYKKLLGEKRQELLAQANKLRTGLFKIDETREKVQVMSLELEDAKKKVAEFQKQCEEYLVIIVQQKREADEQQKA
VTANSEKIAVEEIKCQALADNAQKDLEEALPALEEAMRALESLNKKDIGEIKSYGRPPAQVEIVMQAVMILRGNEPTWAEAKRQLGEQNF
IKSLINFDKDNISDKVLKKIGAYCAQPDFQPDIIGRVSLAAKSLCMWVRAMELYGRLYRVVEPKRIRMNAALAQLREKQAALAEAQEKLR
EVAEKLEMLKKQYDEKLAQKEELRKKSEEMELKLERAGMLVSGLAGEKARWEETVQGLEEDLGYLVGDCLLAAAFLSYMGPFLTNYRDEI
VNQIWIGKIWELQVPCSPSFAIDNFLCNPTKVRDWNIQGLPSDAFSTENGIIVTRGNRWALMIDPQAQALKWIKNMEGGQGLKIIDLQMS
DYLRILEHAIHFGYPVLLQNVQEYLDPTLNPMLNKSVARIGGRLLMRIGDKEVEYNTNFRFYITTKLSNPHYSPETSAKTTIVNFAVKEQ
GLEAQLLGIVVRKERPELEEQKDSLVINIAAGKRKLKELEDEILRLLNEATGSLLDDVQLVNTLHTSKITATEVTEQLETSETTEINTDL
AREAYRPCAQRASILFFVLNDMGCIDPMYQFSLDAYISLFILSIDKSHRSNKLEDRIDYLNDYHTYAVYRYTCRTLFERHKLLFSFHMCA
KILETSGKLNMDEYNFFLRGGVVLDREGQMDNPCSSWLADAYWDNITELDKLTNFHGLMNSFEQYPRDWHLWYTNAAPEKAMLPGEWENA
CNEMQRMLIVRSLRQDRVAFCVTSFIITNLGSRFIEPPVLNMKSVLEDSTPRSPLVFILSPGVDPTSALLQLAEHMGMAQRFHALSLGQG
QAPIAARLLREGVTQGHWVFLANCHLSLSWMPNLDKLVEQLQVEDPHPSFRLWLSSIPHPDFPISILQVSIKMTTEPPKGLKANMTRLYQ
LMSEPQFSRCSKPAKYKKLLFSLCFFHSVLLERKKFLQLGWNIIYGFNDSDFEVSENLLSLYLDEYEETPWDALKYLIAGINYGGHVTDD
WDRRLLTTYINDYFCDQSLSTPFHRLSALETYFIPKDGSLASYKEYISLLPGMDPPEAFGQHPNADVASQITEAQTLFDTLLSLQPQITP
TRAGGQTREEKVLELAADVKQKIPEMIDYEGTQKLLALDPSPLNVVLLQEIQRYNTLMQTILFSLTDLEKGIQGLIVMSTSLEEIFNCIF
DAHVPPLWGKAYPSQKPLAAWTRDLAMRVEQFELWASRARPPVIFWLSGFTFPTGFLTAVLQSSARQNNVSVDSLSWEFIVSTVDDSNLV
YPPKDGVWVRGLYLEGAGWDRKNSCLVEAEPMQLVCLMPTIHFRPAESRKKSAKGMYSCPCYYYPNRAGSSDRASFVIGIDLRSGAMTPD

--------------------------------------------------------------
>6122_6122_6_ARHGAP44-DNAH2_ARHGAP44_chr17_12693208_ENST00000379672_DNAH2_chr17_7704896_ENST00000572933_length(transcript)=5149nt_BP=353nt
GACTGGGAGCAGGCAGCCCGGGCGGAGCGGGCCGGTGCCGAGGACGGCCCCAGGCATTGCTCTGCCCCGGGCATTGCGCGGCGCGCGTGA
GGGGGATGCGGCAGGAGGCGGCGCGGCGGGAGGAGTAGGCGGCGGCGCCCTCGGGAGGGAGCTGCGCGCGGGCCAGACGGCGCCCGGAGG
CTCCGCAGTGCCGCCGCCGTCGCCCGGGAGGCTCCGCGCGGGAGCCATGTAACCCTGCGGCGGGCTCCGGGCTGCTCCGTCCTTCCCCAG
CTCCCGGGCTAGCGCGGCAGCGGGGCCACGATGAAGAAGCAGTTCAATCGCATGCGCCAGCTGGCCAACCAGACGGTGGGCAGGAACTGG
ATCCGCCAGTACCCAGCCTTGGTGAACTGCACAACCATCAACTGGTTCTCAGAGTGGCCCCAAGAGGCCCTGCTCGAGGTGGCTGAGAAG
TGCCTCATAGGAGTAGACCTGGGAACTCAGGAGAATATCCACAGGAAGGTGGCCCAGATCTTTGTCACTATGCACTGGTCAGTAGCTCAG
TATTCCCAGAAGATGCTGTTGGAACTGCGGAGACACAACTATGTCACACCCACCAAATACCTGGAACTCCTGTCTGGATATAAGAAGTTG
CTGGGAGAAAAACGGCAGGAGCTGCTGGCCCAAGCCAATAAACTGCGGACAGGCTTGTTCAAGATCGACGAAACTAGGGAAAAGGTGCAA
GTGATGTCGTTGGAGCTGGAGGATGCCAAGAAGAAGGTGGCTGAGTTCCAGAAGCAGTGTGAGGAGTACCTGGTCATCATTGTGCAGCAG
AAGCGGGAGGCAGATGAGCAGCAGAAGGCCGTAACAGCCAACAGTGAAAAGATTGCAGTTGAGGAAATCAAGTGTCAGGCACTGGCTGAC
AATGCCCAGAAAGATCTAGAAGAGGCACTGCCCGCCCTGGAAGAGGCCATGCGGGCCCTGGAGTCTCTGAACAAGAAGGATATAGGAGAG
ATCAAGTCTTATGGACGGCCCCCAGCCCAAGTGGAGATAGTGATGCAGGCAGTTATGATTCTTCGAGGCAACGAGCCCACATGGGCAGAG
GCCAAGAGGCAGCTAGGGGAACAGAACTTCATCAAGTCACTGATCAACTTTGATAAAGACAATATCTCAGATAAGGTTCTGAAGAAGATT
GGGGCCTACTGCGCCCAGCCTGACTTCCAGCCTGATATCATCGGCCGCGTCTCCCTGGCTGCCAAGTCCCTCTGCATGTGGGTGCGGGCC
ATGGAGCTGTATGGGCGGCTATATCGGGTGGTGGAGCCCAAGCGAATCCGAATGAACGCTGCCTTGGCTCAGCTTCGGGAGAAGCAAGCC
GCGCTCGCTGAGGCCCAGGAGAAGCTGCGGGAGGTAGCTGAGAAACTGGAGATGCTAAAGAAACAGTATGATGAGAAGCTGGCACAGAAG
GAGGAGCTTCGCAAGAAGTCTGAAGAGATGGAGCTGAAGCTGGAGCGAGCTGGGATGCTCGTGTCGGGGTTGGCTGGCGAGAAGGCCAGA
TGGGAGGAGACAGTCCAGGGCCTGGAGGAGGACCTGGGCTACCTGGTGGGGGACTGTCTCCTGGCAGCTGCCTTCCTGTCCTACATGGGA
CCCTTCCTGACCAACTACCGGGATGAGATTGTCAACCAAATCTGGATCGGGAAGATCTGGGAGCTTCAGGTTCCTTGCTCCCCTTCTTTC
GCCATCGATAACTTCCTGTGCAATCCTACCAAAGTCCGGGACTGGAACATCCAAGGGTTGCCCTCAGACGCCTTCTCCACTGAGAATGGC
ATCATCGTCACCCGAGGCAACAGGTGGGCACTGATGATCGACCCTCAGGCCCAGGCCCTGAAATGGATTAAGAACATGGAAGGAGGCCAG
GGCCTGAAGATCATCGACCTGCAGATGAGCGATTACCTGCGAATCCTAGAACACGCCATTCACTTTGGATACCCGGTGCTACTTCAGAAC
GTGCAGGAATATCTGGACCCCACACTGAACCCCATGCTCAACAAATCTGTAGCCCGAATCGGTGGTCGGCTGTTGATGCGCATTGGCGAT
AAGGAGGTGGAATATAATACCAATTTCCGTTTCTACATCACCACCAAGCTCTCCAACCCCCACTACAGCCCAGAGACCTCAGCCAAGACC
ACCATCGTCAACTTTGCTGTTAAAGAACAGGGCCTGGAGGCCCAGCTGCTGGGCATTGTGGTGCGGAAGGAGCGGCCTGAGCTGGAGGAG
CAGAAGGACTCACTGGTCATCAACATCGCGGCTGGTAAAAGGAAGCTCAAGGAGCTGGAGGATGAGATCCTGCGGCTGCTGAATGAGGCC
ACCGGCTCCCTGCTGGATGATGTGCAGCTGGTGAACACGCTGCATACCTCCAAGATCACAGCCACAGAGGTGACTGAGCAGCTGGAGACC
AGTGAGACCACAGAGATCAACACTGACTTGGCGCGGGAGGCTTACCGCCCATGCGCCCAGCGGGCATCAATCCTGTTCTTCGTGCTCAAT
GATATGGGCTGCATCGACCCCATGTACCAGTTCTCACTGGATGCCTACATCAGCCTCTTTATTCTCAGCATTGACAAAAGCCACCGCAGC
AATAAGCTGGAGGACCGCATTGACTACCTGAATGACTACCACACCTACGCTGTCTACAGGTACACCTGCCGTACCCTTTTCGAACGCCAC
AAACTACTATTCAGTTTTCATATGTGTGCCAAAATCTTGGAGACTTCTGGCAAGCTCAACATGGATGAATACAACTTCTTTCTACGTGGG
GGTGTGGTCTTGGATCGGGAGGGCCAAATGGACAATCCATGTAGTAGCTGGCTTGCAGATGCCTACTGGGATAACATCACAGAGCTAGAC
AAACTGACCAACTTCCACGGACTCATGAACTCCTTTGAGCAGTACCCTCGTGACTGGCACCTGTGGTATACCAATGCTGCCCCGGAGAAG
GCGATGCTGCCAGGTGAGTGGGAAAATGCCTGCAATGAAATGCAACGGATGCTGATCGTTCGCTCCCTGCGCCAGGACCGCGTGGCCTTC
TGCGTGACCTCCTTCATCATCACCAACCTTGGCTCCCGCTTCATCGAGCCGCCTGTGCTGAATATGAAGTCGGTGCTGGAGGATTCAACC
CCACGATCCCCACTCGTGTTCATCCTGTCCCCTGGTGTGGACCCCACCAGTGCCCTGCTGCAGCTGGCAGAGCACATGGGCATGGCCCAG
CGCTTCCACGCCCTGTCCCTGGGCCAGGGCCAGGCCCCCATCGCTGCTCGGCTCCTCCGAGAGGGTGTGACTCAGGGACACTGGGTGTTC
CTGGCAAACTGCCACCTGTCACTGTCTTGGATGCCTAATCTGGACAAGCTGGTGGAGCAGCTGCAGGTGGAGGATCCTCATCCATCCTTC
CGCCTCTGGCTCAGCTCCATCCCCCACCCAGACTTCCCTATCTCAATCTTGCAGGTCAGCATCAAGATGACCACAGAGCCACCAAAGGGC
CTAAAGGCCAACATGACACGTCTTTACCAACTGATGTCAGAACCACAGTTTTCCCGCTGCTCCAAACCTGCCAAATATAAGAAGCTGCTG
TTTTCACTCTGTTTCTTCCACTCTGTGTTACTTGAACGCAAAAAGTTCCTGCAGCTTGGCTGGAACATCATCTATGGCTTCAATGACTCC
GACTTTGAGGTGTCAGAAAACTTGCTGAGCCTCTATCTCGATGAGTACGAGGAGACACCTTGGGACGCACTTAAGTACCTCATTGCCGGC
ATCAACTATGGTGGACATGTCACAGATGACTGGGACCGGCGCCTGCTGACCACCTACATCAATGATTATTTCTGTGACCAGTCTCTATCA
ACTCCCTTCCACCGGTTGTCAGCACTGGAGACTTATTTCATCCCCAAGGATGGCAGCCTCGCTTCTTACAAGGAATACATCAGCTTATTG
CCTGGCATGGACCCCCCTGAGGCCTTTGGCCAGCACCCCAATGCTGATGTGGCCTCTCAGATCACTGAGGCACAAACCCTCTTTGATACT
TTGCTTTCCTTGCAACCTCAGATTACACCCACCAGGGCTGGAGGCCAGACCCGGGAAGAGAAGGTCCTTGAGTTGGCCGCTGATGTGAAG
CAGAAGATCCCTGAAATGATCGACTATGAGGGGACTCAAAAACTGCTAGCTCTCGACCCCTCCCCCCTCAATGTGGTCCTTCTGCAGGAG
ATCCAGAGATACAACACACTGATGCAGACCATCCTGTTCTCACTGACAGACCTAGAGAAAGGCATCCAGGGTCTCATCGTCATGTCTACA
AGCCTGGAAGAGATTTTCAATTGCATCTTTGATGCCCATGTTCCTCCGCTCTGGGGAAAGGCATACCCCTCACAAAAGCCATTGGCTGCC
TGGACCCGGGACTTGGCCATGCGTGTGGAGCAGTTTGAGCTGTGGGCCAGCCGGGCCCGGCCTCCTGTGATCTTCTGGTTGTCTGGTTTC
ACCTTTCCCACTGGCTTCCTCACTGCTGTGCTGCAGTCTTCAGCTCGCCAAAACAACGTTTCAGTGGACAGCCTCTCCTGGGAGTTTATC
GTTTCCACTGTGGATGACAGCAACCTAGTGTATCCCCCCAAGGATGGTGTCTGGGTCCGGGGCCTGTACCTGGAAGGTGCTGGCTGGGAC
CGGAAGAACTCCTGCTTGGTGGAGGCAGAGCCCATGCAGCTTGTCTGCCTCATGCCCACGATCCACTTCCGGCCTGCAGAGAGCCGCAAG
AAGAGCGCCAAGGGCATGTACTCCTGCCCCTGCTATTACTATCCCAACCGGGCAGGCAGCTCAGACCGAGCCTCCTTTGTCATCGGCATT
GACCTGCGGTCTGGGGCCATGACACCTGATCATTGGATCAAGAGGGGCACTGCTCTACTCATGAGCCTGGACAGCTGAGACCTCCTCCTC
TTCTCCGCTTGAGAGAGAGGGTCAGGGACTCCAGGAGCTAAGACAGATGTTGCACCTAGGACTGAGGCCGGACCTCACTCAGACTTTGAC
CTTGGCCGAATTTGTGTGATGTGGCCCTGGAGATACCTAGTTGTGTTAGCCATAAAAGTGAAAGAGTTGTATTGGAGCTCAGTGCTGTAA

>6122_6122_6_ARHGAP44-DNAH2_ARHGAP44_chr17_12693208_ENST00000379672_DNAH2_chr17_7704896_ENST00000572933_length(amino acids)=1545AA_BP=17
MKKQFNRMRQLANQTVGRNWIRQYPALVNCTTINWFSEWPQEALLEVAEKCLIGVDLGTQENIHRKVAQIFVTMHWSVAQYSQKMLLELR
RHNYVTPTKYLELLSGYKKLLGEKRQELLAQANKLRTGLFKIDETREKVQVMSLELEDAKKKVAEFQKQCEEYLVIIVQQKREADEQQKA
VTANSEKIAVEEIKCQALADNAQKDLEEALPALEEAMRALESLNKKDIGEIKSYGRPPAQVEIVMQAVMILRGNEPTWAEAKRQLGEQNF
IKSLINFDKDNISDKVLKKIGAYCAQPDFQPDIIGRVSLAAKSLCMWVRAMELYGRLYRVVEPKRIRMNAALAQLREKQAALAEAQEKLR
EVAEKLEMLKKQYDEKLAQKEELRKKSEEMELKLERAGMLVSGLAGEKARWEETVQGLEEDLGYLVGDCLLAAAFLSYMGPFLTNYRDEI
VNQIWIGKIWELQVPCSPSFAIDNFLCNPTKVRDWNIQGLPSDAFSTENGIIVTRGNRWALMIDPQAQALKWIKNMEGGQGLKIIDLQMS
DYLRILEHAIHFGYPVLLQNVQEYLDPTLNPMLNKSVARIGGRLLMRIGDKEVEYNTNFRFYITTKLSNPHYSPETSAKTTIVNFAVKEQ
GLEAQLLGIVVRKERPELEEQKDSLVINIAAGKRKLKELEDEILRLLNEATGSLLDDVQLVNTLHTSKITATEVTEQLETSETTEINTDL
AREAYRPCAQRASILFFVLNDMGCIDPMYQFSLDAYISLFILSIDKSHRSNKLEDRIDYLNDYHTYAVYRYTCRTLFERHKLLFSFHMCA
KILETSGKLNMDEYNFFLRGGVVLDREGQMDNPCSSWLADAYWDNITELDKLTNFHGLMNSFEQYPRDWHLWYTNAAPEKAMLPGEWENA
CNEMQRMLIVRSLRQDRVAFCVTSFIITNLGSRFIEPPVLNMKSVLEDSTPRSPLVFILSPGVDPTSALLQLAEHMGMAQRFHALSLGQG
QAPIAARLLREGVTQGHWVFLANCHLSLSWMPNLDKLVEQLQVEDPHPSFRLWLSSIPHPDFPISILQVSIKMTTEPPKGLKANMTRLYQ
LMSEPQFSRCSKPAKYKKLLFSLCFFHSVLLERKKFLQLGWNIIYGFNDSDFEVSENLLSLYLDEYEETPWDALKYLIAGINYGGHVTDD
WDRRLLTTYINDYFCDQSLSTPFHRLSALETYFIPKDGSLASYKEYISLLPGMDPPEAFGQHPNADVASQITEAQTLFDTLLSLQPQITP
TRAGGQTREEKVLELAADVKQKIPEMIDYEGTQKLLALDPSPLNVVLLQEIQRYNTLMQTILFSLTDLEKGIQGLIVMSTSLEEIFNCIF
DAHVPPLWGKAYPSQKPLAAWTRDLAMRVEQFELWASRARPPVIFWLSGFTFPTGFLTAVLQSSARQNNVSVDSLSWEFIVSTVDDSNLV
YPPKDGVWVRGLYLEGAGWDRKNSCLVEAEPMQLVCLMPTIHFRPAESRKKSAKGMYSCPCYYYPNRAGSSDRASFVIGIDLRSGAMTPD

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for ARHGAP44-DNAH2


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with
HgeneARHGAP44chr17:12693208chr17:7704896ENST00000379672+121731_81817.666666666666668819.0BST2


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for ARHGAP44-DNAH2


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for ARHGAP44-DNAH2


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource