![]() |
||||||
|
![]() | Fusion Gene Summary |
![]() | Fusion Gene ORF analysis |
![]() | Fusion Genomic Features |
![]() | Fusion Protein Features |
![]() | Fusion Gene Sequence |
![]() | Fusion Gene PPI analysis |
![]() | Related Drugs |
![]() | Related Diseases |
Fusion gene:ARSG-CDK12 (FusionGDB2 ID:HG22901TG51755) |
Fusion Gene Summary for ARSG-CDK12 |
![]() |
Fusion gene information | Fusion gene name: ARSG-CDK12 | Fusion gene ID: hg22901tg51755 | Hgene | Tgene | Gene symbol | ARSG | CDK12 | Gene ID | 22901 | 51755 |
Gene name | arylsulfatase G | cyclin dependent kinase 12 | |
Synonyms | USH4 | CRK7|CRKR|CRKRS | |
Cytomap | ('ARSG')('CDK12') 17q24.2 | 17q12 | |
Type of gene | protein-coding | protein-coding | |
Description | arylsulfatase GASG | cyclin-dependent kinase 12CDC2-related protein kinase 7Cdc2-related kinase, arginine/serine-richcell division cycle 2-related protein kinase 7cell division protein kinase 12 | |
Modification date | 20200313 | 20200313 | |
UniProtAcc | . | . | |
Ensembl transtripts involved in fusion gene | ENST00000582154, ENST00000448504, ENST00000452479, | ||
Fusion gene scores | * DoF score | 18 X 13 X 7=1638 | 36 X 30 X 14=15120 |
# samples | 18 | 55 | |
** MAII score | log2(18/1638*10)=-3.18586654531133 possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0 | log2(55/15120*10)=-4.78088271069641 possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0 | |
Context | PubMed: ARSG [Title/Abstract] AND CDK12 [Title/Abstract] AND fusion [Title/Abstract] | ||
Most frequent breakpoint | ARSG(66352945)-CDK12(37667782), # samples:3 | ||
Anticipated loss of major functional domain due to fusion event. | ARSG-CDK12 seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF. ARSG-CDK12 seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF. ARSG-CDK12 seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF. ARSG-CDK12 seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF. |
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types ** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10) |
![]() |
Partner | Gene | GO ID | GO term | PubMed ID |
Hgene | ARSG | GO:0006790 | sulfur compound metabolic process | 18283100 |
Tgene | CDK12 | GO:0046777 | protein autophosphorylation | 11683387 |
![]() * Click on the image to open the UCSC genome browser with custom track showing this image in a new window. |
![]() |
![]() * Click on the image to open the UCSC genome browser with custom track showing this image in a new window. |
![]() |
![]() * All genome coordinats were lifted-over on hg19. * Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser. |
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
ChimerDB4 | BRCA | TCGA-3C-AALI-01A | ARSG | chr17 | 66352945 | - | CDK12 | chr17 | 37667782 | + |
ChimerDB4 | BRCA | TCGA-3C-AALI-01A | ARSG | chr17 | 66352945 | + | CDK12 | chr17 | 37667782 | + |
Top |
Fusion Gene ORF analysis for ARSG-CDK12 |
![]() * Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser. |
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
3UTR-3CDS | ENST00000582154 | ENST00000430627 | ARSG | chr17 | 66352945 | + | CDK12 | chr17 | 37667782 | + |
3UTR-3CDS | ENST00000582154 | ENST00000447079 | ARSG | chr17 | 66352945 | + | CDK12 | chr17 | 37667782 | + |
3UTR-intron | ENST00000582154 | ENST00000559545 | ARSG | chr17 | 66352945 | + | CDK12 | chr17 | 37667782 | + |
5CDS-intron | ENST00000448504 | ENST00000559545 | ARSG | chr17 | 66352945 | + | CDK12 | chr17 | 37667782 | + |
5CDS-intron | ENST00000452479 | ENST00000559545 | ARSG | chr17 | 66352945 | + | CDK12 | chr17 | 37667782 | + |
In-frame | ENST00000448504 | ENST00000430627 | ARSG | chr17 | 66352945 | + | CDK12 | chr17 | 37667782 | + |
In-frame | ENST00000448504 | ENST00000447079 | ARSG | chr17 | 66352945 | + | CDK12 | chr17 | 37667782 | + |
In-frame | ENST00000452479 | ENST00000430627 | ARSG | chr17 | 66352945 | + | CDK12 | chr17 | 37667782 | + |
In-frame | ENST00000452479 | ENST00000447079 | ARSG | chr17 | 66352945 | + | CDK12 | chr17 | 37667782 | + |
![]() |
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids) |
ENST00000448504 | ARSG | chr17 | 66352945 | + | ENST00000430627 | CDK12 | chr17 | 37667782 | + | 4441 | 1500 | 796 | 3279 | 827 |
ENST00000448504 | ARSG | chr17 | 66352945 | + | ENST00000447079 | CDK12 | chr17 | 37667782 | + | 7137 | 1500 | 796 | 3306 | 836 |
ENST00000452479 | ARSG | chr17 | 66352945 | + | ENST00000430627 | CDK12 | chr17 | 37667782 | + | 3478 | 537 | 103 | 2316 | 737 |
ENST00000452479 | ARSG | chr17 | 66352945 | + | ENST00000447079 | CDK12 | chr17 | 37667782 | + | 6174 | 537 | 103 | 2343 | 746 |
![]() |
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score |
ENST00000448504 | ENST00000430627 | ARSG | chr17 | 66352945 | + | CDK12 | chr17 | 37667782 | + | 0.00445838 | 0.99554163 |
ENST00000448504 | ENST00000447079 | ARSG | chr17 | 66352945 | + | CDK12 | chr17 | 37667782 | + | 0.000573609 | 0.99942636 |
ENST00000452479 | ENST00000430627 | ARSG | chr17 | 66352945 | + | CDK12 | chr17 | 37667782 | + | 0.011608442 | 0.9883916 |
ENST00000452479 | ENST00000447079 | ARSG | chr17 | 66352945 | + | CDK12 | chr17 | 37667782 | + | 0.002388437 | 0.9976115 |
Top |
Fusion Genomic Features for ARSG-CDK12 |
![]() |
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint) |
ARSG | chr17 | 66352945 | + | CDK12 | chr17 | 37667781 | + | 2.18E-08 | 1 |
ARSG | chr17 | 66352945 | + | CDK12 | chr17 | 37667781 | + | 2.18E-08 | 1 |
![]() |
![]() |
![]() |
![]() |
Top |
Fusion Protein Features for ARSG-CDK12 |
![]() Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr17:66352945/chr17:37667782) - FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels. - How to search 1. Put your fusion gene symbol. 2. Press the tab key until there will be shown the breakpoint information filled. 4. Go down and press 'Search' tab twice. 4. Go down to have the hyperlink of the search result. 5. Click the hyperlink. 6. See the FGviewer result for your fusion gene. |
![]() |
![]() |
Hgene | Tgene |
. | . |
FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}. | FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}. |
![]() * Minus value of BPloci means that the break pointn is located before the CDS. |
- In-frame and retained protein feature among the 13 regional features. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
Tgene | CDK12 | chr17:66352945 | chr17:37667782 | ENST00000430627 | 6 | 14 | 1266_1280 | 888 | 1482.0 | Compositional bias | Note=Poly-Pro | |
Tgene | CDK12 | chr17:66352945 | chr17:37667782 | ENST00000447079 | 6 | 14 | 1266_1280 | 888 | 1491.0 | Compositional bias | Note=Poly-Pro |
- In-frame and not-retained protein feature among the 13 regional features. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
Tgene | CDK12 | chr17:66352945 | chr17:37667782 | ENST00000430627 | 6 | 14 | 407_413 | 888 | 1482.0 | Compositional bias | Note=Poly-Ala | |
Tgene | CDK12 | chr17:66352945 | chr17:37667782 | ENST00000430627 | 6 | 14 | 535_540 | 888 | 1482.0 | Compositional bias | Note=Poly-Pro | |
Tgene | CDK12 | chr17:66352945 | chr17:37667782 | ENST00000447079 | 6 | 14 | 407_413 | 888 | 1491.0 | Compositional bias | Note=Poly-Ala | |
Tgene | CDK12 | chr17:66352945 | chr17:37667782 | ENST00000447079 | 6 | 14 | 535_540 | 888 | 1491.0 | Compositional bias | Note=Poly-Pro | |
Tgene | CDK12 | chr17:66352945 | chr17:37667782 | ENST00000430627 | 6 | 14 | 727_1020 | 888 | 1482.0 | Domain | Protein kinase | |
Tgene | CDK12 | chr17:66352945 | chr17:37667782 | ENST00000447079 | 6 | 14 | 727_1020 | 888 | 1491.0 | Domain | Protein kinase | |
Tgene | CDK12 | chr17:66352945 | chr17:37667782 | ENST00000430627 | 6 | 14 | 733_741 | 888 | 1482.0 | Nucleotide binding | Note=ATP | |
Tgene | CDK12 | chr17:66352945 | chr17:37667782 | ENST00000430627 | 6 | 14 | 814_819 | 888 | 1482.0 | Nucleotide binding | Note=ATP | |
Tgene | CDK12 | chr17:66352945 | chr17:37667782 | ENST00000447079 | 6 | 14 | 733_741 | 888 | 1491.0 | Nucleotide binding | Note=ATP | |
Tgene | CDK12 | chr17:66352945 | chr17:37667782 | ENST00000447079 | 6 | 14 | 814_819 | 888 | 1491.0 | Nucleotide binding | Note=ATP |
Top |
Fusion Gene Sequence for ARSG-CDK12 |
![]() |
>6894_6894_1_ARSG-CDK12_ARSG_chr17_66352945_ENST00000448504_CDK12_chr17_37667782_ENST00000430627_length(transcript)=4441nt_BP=1500nt GGCTGCGCCCAGGCCGGCGGGCCCAGCAGCTGCGAACCGCCGGCGCACCACCTGTTTCCGCGCCCGGGGACTTCCCCGGCGGGGCTCAGA AGTGTGGGATCGGTCGCTTGGCTTCCCCTGGCGTCAGCGACCCAGGGTAACCTCCTCCACTGCTGCGTGCCGTGCAGGCCTGCCTGTGTG AGAGCCACGTGTGCCGCGCTCTGGGCACAGCCTTGGAAAGTCAGGACCGCGACGGCAGCAGAGCAGAAACCTTACAGAAACATGAAGCCC TCAACCATCTGCTACTCAGTTATTCGGGGCTGACGGCGGCTTCTAGAACATCCAGGTGTTCTGCAGATGCGAGAACTCATCCTGTAGTCA CCAGATGGAGTCCCAAACAGCCAAGCAGATGTAAGGCCTGTGCTGTGGCTCTGAGGCCCTGAATACAGAAGGGTCACTTTCTTAGTGGCC AAAGAGCAGTTGTTGACATTGATGTCTAATTATTGAACACGACCAGTCATTTTACTGAGCTGCGGTGAGGAAACACTGACCATAGAAGAT CAAGCCAAATGAGGGATTGCAAATTTCCTGATTCTTTTGAATTAGGATTCCAGATGGGGGCCTCATTTCTACAGCCCCCAACATTCCTAT AGCCGTTATCACTGCCATCACCACTGCCACCAGCATCTTCTTGCAGATTCCACCCCTGCTCCCCAGAGACTTCCTGCTTTGAAAGTGAGC AGAAAGGAAGCTCTCAGAAAAATCTCTAGTGGTGGCTGCCGTCGCTCCAGACAATCGGAATCCTGCCTTCACCACCATGGGCTGGCTTTT TCTAAAGGTTTTGTTGGCGGGAGTGAGTTTCTCAGGATTTCTTTATCCTCTTGTGGATTTTTGCATCAGTGGGAAAACAAGAGGACAGAA GCCAAACTTTGTGATTATTTTGGCCGATGACATGGGGTGGGGTGACCTGGGAGCAAACTGGGCAGAAACAAAGGACACTGCCAACCTTGA TAAGATGGCTTCGGAGGGAATGAGGTTTGTGGATTTCCATGCAGCTGCCTCCACCTGCTCACCCTCCCGGGCTTCCTTGCTCACCGGCCG GCTTGGCCTTCGCAATGGAGTCACACGCAACTTTGCAGTCACTTCTGTGGGAGGCCTTCCGCTCAACGAGACCACCTTGGCAGAGGTGCT GCAGCAGGCGGGTTACGTCACTGGGATAATAGGCAAATGGCATCTTGGACACCACGGCTCTTATCACCCCAACTTCCGTGGTTTTGATTA CTACTTTGGAATCCCATATAGCCATGATATGGGCTGTACTGATACTCCAGGCTACAACCACCCTCCTTGTCCAGCGTGTCCACAGGGTGA TGGACCATCAAGGAACCTTCAAAGAGACTGTTACACTGACGTGGCCCTCCCTCTTTATGAAAACCTCAACATTGTGGAGCAGCCGGTGAA CTTGAGCAGCCTTGCCCAGAAGTATGCTGAGAAAGCAACCCAGTTCATCCAGCGTGCAAGTCGCCCTTACACAAACAAAGTCATTACTTT GTGGTACCGACCTCCAGAACTACTGCTAGGAGAGGAACGTTACACACCAGCCATAGATGTTTGGAGCTGTGGATGTATTCTTGGGGAACT ATTCACAAAGAAGCCTATTTTTCAAGCCAATCTGGAACTGGCTCAGCTAGAACTGATCAGCCGACTTTGTGGTAGCCCTTGTCCAGCTGT GTGGCCTGATGTTATCAAACTGCCCTACTTCAACACCATGAAACCGAAGAAGCAATATCGAAGGCGTCTACGAGAAGAATTCTCTTTCAT TCCTTCTGCAGCACTTGATTTATTGGACCACATGCTGACACTAGATCCTAGTAAGCGGTGCACAGCTGAACAGACCCTACAGAGCGACTT CCTTAAAGATGTCGAACTCAGCAAAATGGCTCCTCCAGACCTCCCCCACTGGCAGGATTGCCATGAGTTGTGGAGTAAGAAACGGCGACG TCAGCGACAAAGTGGTGTTGTAGTCGAAGAGCCACCTCCATCCAAAACTTCTCGAAAAGAAACTACCTCAGGGACAAGTACTGAGCCTGT GAAGAACAGCAGCCCAGCACCACCTCAGCCTGCTCCTGGCAAGGTGGAGTCTGGGGCTGGGGATGCAATAGGCCTTGCTGACATCACACA ACAGCTGAATCAAAGTGAATTGGCAGTGTTATTAAACCTGCTGCAGAGCCAAACCGACCTGAGCATCCCTCAAATGGCACAGCTGCTTAA CATCCACTCCAACCCAGAGATGCAGCAGCAGCTGGAAGCCCTGAACCAATCCATCAGTGCCCTGACGGAAGCTACTTCCCAGCAGCAGGA CTCAGAGACCATGGCCCCAGAGGAGTCTTTGAAGGAAGCACCCTCTGCCCCAGTGATCCTGCCTTCAGCAGAACAGACGACCCTTGAAGC TTCAAGCACACCAGCTGACATGCAGAATATATTGGCAGTTCTCTTGAGTCAGCTGATGAAAACCCAAGAGCCAGCAGGCAGTCTGGAGGA AAACAACAGTGACAAGAACAGTGGGCCACAGGGGCCCCGAAGAACTCCCACAATGCCACAGGAGGAGGCAGCAGAGAAGAGGCCCCCTGA GCCCCCCGGACCTCCACCGCCGCCACCTCCACCCCCTCTGGTTGAAGGCGATCTTTCCAGCGCCCCCCAGGAGTTGAACCCAGCCGTGAC AGCCGCCTTGCTGCAACTTTTATCCCAGCCTGAAGCAGAGCCTCCTGGCCACCTGCCACATGAGCACCAGGCCTTGAGACCAATGGAGTA CTCCACCCGACCCCGTCCAAACAGGACTTATGGAAACACTGATGGGCCTGAAACAGGGTTCAGTGCCATTGACACTGATGAACGAAACTC TGGTCCAGCCTTGACAGAATCCTTGGTCCAGACCCTGGTGAAGAACAGGACCTTCTCAGGCTCTCTGAGCCACCTTGGGGAGTCCAGCAG TTACCAGGGCACAGGGTCAGTGCAGTTTCCAGGGGACCAGGACCTCCGTTTTGCCAGGGTCCCCTTAGCGTTACACCCGGTGGTCGGGCA ACCATTCCTGAAGGCTGAGGGAAGCAGCAATTCTGTGGTACATGCAGAGACCAAATTGCAAAACTATGGGGAGCTGGGGCCAGGAACCAC TGGGGCCAGCAGCTCAGGAGCAGGCCTTCACTGGGGGGGCCCAACTCAGTCTTCTGCTTATGGAAAACTCTATCGGGGGCCTACAAGAGT CCCACCAAGAGGGGGAAGAGGGAGAGGAGTTCCTTACTAACCCAGAGACTTCAGTGTCCTGAAAGATTCCTTTCCTATCCATCCTTCCAT CCAGTTCTCTGAATCTTTAATGAAATCATTTGCCAGAGCGAGGTAATCATCTGCATTTGGCTACTGCAAAGCTGTCCGTTGTATTCCTTG CTCACTTGCTACTAGCAGGCGACTTACGAAATAATGATGTTGGCACCAGTTCCCCCTGGATGGGCTATAGCCAGAACATTTACTTCAACT CTACCTTAGTAGATACAAGTAGAGAATATGGAGAGGATCATTACATTGAAAAGTAAATGTTTTATTAGTTCATTGCCTGCACTTACTGAT CGGAAGAGAGAAAGAACAGTTTCAGTATTGAGATGGCTCAGGAGAGGCTCTTTGATTTTTAAAGTTTTGGGGTGGGGGATTGTGTGTGGT TTCTTTCTTTTGAATTTTAATTTAGGTGTTTTGGGTTTTTTTCCTTTAAAGAGAATAGTGTTCACAAAATTTGAGCTGCTCTTTGGCTTT TGCTATAAGGGAAACAGAGTGGCCTGGCTGATTTGAATAAATGTTTCTTTCCTCTCCACCATCTCACATTTTGCTTTTAAGTGAACACTT TTTCCCCATTGAGCATCTTGAACATACTTTTTTTCCAAATAAATTACTCATCCTTAAAGTTTACTCCACTTTGACAAAAGATACGCCCTT CTCCCTGCACATAAAGCAGGTTGTAGAACGTGGCATTCTTGGGCAAGTAGGTAGACTTTACCCAGTCTCTTTCCTTTTTTGCTGATGTGT GCTCTCTCTCTCTCTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGTCTCGCTTGCTCGCTCTCGCTGTTTCTCTCTC TTTGAGGCATTTGTTTGGAAAAAATCGTTGAGATGCCCAAGAACCTGGGATAATTCTTTACTTTTTTTGAAATAAAGGAAAGGAAATTCA GACTCTTACATTGTTCTCTGTAACTCTTCAATTCTAAAATGTTTTGTTTTTTAAACCATGTTCTGATGGGGAAGTTGATTTGTAAGTGTG GACAGCTTGGACATTGCTGCTGAGCTGTGGTTAGAGATGATGCCTCCATTCCTAGAGGGCTAATAACAGCATTTAGCATATTGTTTACAC ATATATTTTTATGTCAAAAAAAAAACAAAAA >6894_6894_1_ARSG-CDK12_ARSG_chr17_66352945_ENST00000448504_CDK12_chr17_37667782_ENST00000430627_length(amino acids)=827AA_BP=234 MGWLFLKVLLAGVSFSGFLYPLVDFCISGKTRGQKPNFVIILADDMGWGDLGANWAETKDTANLDKMASEGMRFVDFHAAASTCSPSRAS LLTGRLGLRNGVTRNFAVTSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLGHHGSYHPNFRGFDYYFGIPYSHDMGCTDTPGYNHPPCPA CPQGDGPSRNLQRDCYTDVALPLYENLNIVEQPVNLSSLAQKYAEKATQFIQRASRPYTNKVITLWYRPPELLLGEERYTPAIDVWSCGC ILGELFTKKPIFQANLELAQLELISRLCGSPCPAVWPDVIKLPYFNTMKPKKQYRRRLREEFSFIPSAALDLLDHMLTLDPSKRCTAEQT LQSDFLKDVELSKMAPPDLPHWQDCHELWSKKRRRQRQSGVVVEEPPPSKTSRKETTSGTSTEPVKNSSPAPPQPAPGKVESGAGDAIGL ADITQQLNQSELAVLLNLLQSQTDLSIPQMAQLLNIHSNPEMQQQLEALNQSISALTEATSQQQDSETMAPEESLKEAPSAPVILPSAEQ TTLEASSTPADMQNILAVLLSQLMKTQEPAGSLEENNSDKNSGPQGPRRTPTMPQEEAAEKRPPEPPGPPPPPPPPPLVEGDLSSAPQEL NPAVTAALLQLLSQPEAEPPGHLPHEHQALRPMEYSTRPRPNRTYGNTDGPETGFSAIDTDERNSGPALTESLVQTLVKNRTFSGSLSHL GESSSYQGTGSVQFPGDQDLRFARVPLALHPVVGQPFLKAEGSSNSVVHAETKLQNYGELGPGTTGASSSGAGLHWGGPTQSSAYGKLYR GPTRVPPRGGRGRGVPY -------------------------------------------------------------- >6894_6894_2_ARSG-CDK12_ARSG_chr17_66352945_ENST00000448504_CDK12_chr17_37667782_ENST00000447079_length(transcript)=7137nt_BP=1500nt GGCTGCGCCCAGGCCGGCGGGCCCAGCAGCTGCGAACCGCCGGCGCACCACCTGTTTCCGCGCCCGGGGACTTCCCCGGCGGGGCTCAGA AGTGTGGGATCGGTCGCTTGGCTTCCCCTGGCGTCAGCGACCCAGGGTAACCTCCTCCACTGCTGCGTGCCGTGCAGGCCTGCCTGTGTG AGAGCCACGTGTGCCGCGCTCTGGGCACAGCCTTGGAAAGTCAGGACCGCGACGGCAGCAGAGCAGAAACCTTACAGAAACATGAAGCCC TCAACCATCTGCTACTCAGTTATTCGGGGCTGACGGCGGCTTCTAGAACATCCAGGTGTTCTGCAGATGCGAGAACTCATCCTGTAGTCA CCAGATGGAGTCCCAAACAGCCAAGCAGATGTAAGGCCTGTGCTGTGGCTCTGAGGCCCTGAATACAGAAGGGTCACTTTCTTAGTGGCC AAAGAGCAGTTGTTGACATTGATGTCTAATTATTGAACACGACCAGTCATTTTACTGAGCTGCGGTGAGGAAACACTGACCATAGAAGAT CAAGCCAAATGAGGGATTGCAAATTTCCTGATTCTTTTGAATTAGGATTCCAGATGGGGGCCTCATTTCTACAGCCCCCAACATTCCTAT AGCCGTTATCACTGCCATCACCACTGCCACCAGCATCTTCTTGCAGATTCCACCCCTGCTCCCCAGAGACTTCCTGCTTTGAAAGTGAGC AGAAAGGAAGCTCTCAGAAAAATCTCTAGTGGTGGCTGCCGTCGCTCCAGACAATCGGAATCCTGCCTTCACCACCATGGGCTGGCTTTT TCTAAAGGTTTTGTTGGCGGGAGTGAGTTTCTCAGGATTTCTTTATCCTCTTGTGGATTTTTGCATCAGTGGGAAAACAAGAGGACAGAA GCCAAACTTTGTGATTATTTTGGCCGATGACATGGGGTGGGGTGACCTGGGAGCAAACTGGGCAGAAACAAAGGACACTGCCAACCTTGA TAAGATGGCTTCGGAGGGAATGAGGTTTGTGGATTTCCATGCAGCTGCCTCCACCTGCTCACCCTCCCGGGCTTCCTTGCTCACCGGCCG GCTTGGCCTTCGCAATGGAGTCACACGCAACTTTGCAGTCACTTCTGTGGGAGGCCTTCCGCTCAACGAGACCACCTTGGCAGAGGTGCT GCAGCAGGCGGGTTACGTCACTGGGATAATAGGCAAATGGCATCTTGGACACCACGGCTCTTATCACCCCAACTTCCGTGGTTTTGATTA CTACTTTGGAATCCCATATAGCCATGATATGGGCTGTACTGATACTCCAGGCTACAACCACCCTCCTTGTCCAGCGTGTCCACAGGGTGA TGGACCATCAAGGAACCTTCAAAGAGACTGTTACACTGACGTGGCCCTCCCTCTTTATGAAAACCTCAACATTGTGGAGCAGCCGGTGAA CTTGAGCAGCCTTGCCCAGAAGTATGCTGAGAAAGCAACCCAGTTCATCCAGCGTGCAAGTCGCCCTTACACAAACAAAGTCATTACTTT GTGGTACCGACCTCCAGAACTACTGCTAGGAGAGGAACGTTACACACCAGCCATAGATGTTTGGAGCTGTGGATGTATTCTTGGGGAACT ATTCACAAAGAAGCCTATTTTTCAAGCCAATCTGGAACTGGCTCAGCTAGAACTGATCAGCCGACTTTGTGGTAGCCCTTGTCCAGCTGT GTGGCCTGATGTTATCAAACTGCCCTACTTCAACACCATGAAACCGAAGAAGCAATATCGAAGGCGTCTACGAGAAGAATTCTCTTTCAT TCCTTCTGCAGCACTTGATTTATTGGACCACATGCTGACACTAGATCCTAGTAAGCGGTGCACAGCTGAACAGACCCTACAGAGCGACTT CCTTAAAGATGTCGAACTCAGCAAAATGGCTCCTCCAGACCTCCCCCACTGGCAGGATTGCCATGAGTTGTGGAGTAAGAAACGGCGACG TCAGCGACAAAGTGGTGTTGTAGTCGAAGAGCCACCTCCATCCAAAACTTCTCGAAAAGAAACTACCTCAGGGACAAGTACTGAGCCTGT GAAGAACAGCAGCCCAGCACCACCTCAGCCTGCTCCTGGCAAGGTGGAGTCTGGGGCTGGGGATGCAATAGGCCTTGCTGACATCACACA ACAGCTGAATCAAAGTGAATTGGCAGTGTTATTAAACCTGCTGCAGAGCCAAACCGACCTGAGCATCCCTCAAATGGCACAGCTGCTTAA CATCCACTCCAACCCAGAGATGCAGCAGCAGCTGGAAGCCCTGAACCAATCCATCAGTGCCCTGACGGAAGCTACTTCCCAGCAGCAGGA CTCAGAGACCATGGCCCCAGAGGAGTCTTTGAAGGAAGCACCCTCTGCCCCAGTGATCCTGCCTTCAGCAGAACAGACGACCCTTGAAGC TTCAAGCACACCAGCTGACATGCAGAATATATTGGCAGTTCTCTTGAGTCAGCTGATGAAAACCCAAGAGCCAGCAGGCAGTCTGGAGGA AAACAACAGTGACAAGAACAGTGGGCCACAGGGGCCCCGAAGAACTCCCACAATGCCACAGGAGGAGGCAGCAGCATGTCCTCCTCACAT TCTTCCACCAGAGAAGAGGCCCCCTGAGCCCCCCGGACCTCCACCGCCGCCACCTCCACCCCCTCTGGTTGAAGGCGATCTTTCCAGCGC CCCCCAGGAGTTGAACCCAGCCGTGACAGCCGCCTTGCTGCAACTTTTATCCCAGCCTGAAGCAGAGCCTCCTGGCCACCTGCCACATGA GCACCAGGCCTTGAGACCAATGGAGTACTCCACCCGACCCCGTCCAAACAGGACTTATGGAAACACTGATGGGCCTGAAACAGGGTTCAG TGCCATTGACACTGATGAACGAAACTCTGGTCCAGCCTTGACAGAATCCTTGGTCCAGACCCTGGTGAAGAACAGGACCTTCTCAGGCTC TCTGAGCCACCTTGGGGAGTCCAGCAGTTACCAGGGCACAGGGTCAGTGCAGTTTCCAGGGGACCAGGACCTCCGTTTTGCCAGGGTCCC CTTAGCGTTACACCCGGTGGTCGGGCAACCATTCCTGAAGGCTGAGGGAAGCAGCAATTCTGTGGTACATGCAGAGACCAAATTGCAAAA CTATGGGGAGCTGGGGCCAGGAACCACTGGGGCCAGCAGCTCAGGAGCAGGCCTTCACTGGGGGGGCCCAACTCAGTCTTCTGCTTATGG AAAACTCTATCGGGGGCCTACAAGAGTCCCACCAAGAGGGGGAAGAGGGAGAGGAGTTCCTTACTAACCCAGAGACTTCAGTGTCCTGAA AGATTCCTTTCCTATCCATCCTTCCATCCAGTTCTCTGAATCTTTAATGAAATCATTTGCCAGAGCGAGGTAATCATCTGCATTTGGCTA CTGCAAAGCTGTCCGTTGTATTCCTTGCTCACTTGCTACTAGCAGGCGACTTACGAAATAATGATGTTGGCACCAGTTCCCCCTGGATGG GCTATAGCCAGAACATTTACTTCAACTCTACCTTAGTAGATACAAGTAGAGAATATGGAGAGGATCATTACATTGAAAAGTAAATGTTTT ATTAGTTCATTGCCTGCACTTACTGATCGGAAGAGAGAAAGAACAGTTTCAGTATTGAGATGGCTCAGGAGAGGCTCTTTGATTTTTAAA GTTTTGGGGTGGGGGATTGTGTGTGGTTTCTTTCTTTTGAATTTTAATTTAGGTGTTTTGGGTTTTTTTCCTTTAAAGAGAATAGTGTTC ACAAAATTTGAGCTGCTCTTTGGCTTTTGCTATAAGGGAAACAGAGTGGCCTGGCTGATTTGAATAAATGTTTCTTTCCTCTCCACCATC TCACATTTTGCTTTTAAGTGAACACTTTTTCCCCATTGAGCATCTTGAACATACTTTTTTTCCAAATAAATTACTCATCCTTAAAGTTTA CTCCACTTTGACAAAAGATACGCCCTTCTCCCTGCACATAAAGCAGGTTGTAGAACGTGGCATTCTTGGGCAAGTAGGTAGACTTTACCC AGTCTCTTTCCTTTTTTGCTGATGTGTGCTCTCTCTCTCTCTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGTCTCG CTTGCTCGCTCTCGCTGTTTCTCTCTCTTTGAGGCATTTGTTTGGAAAAAATCGTTGAGATGCCCAAGAACCTGGGATAATTCTTTACTT TTTTTGAAATAAAGGAAAGGAAATTCAGACTCTTACATTGTTCTCTGTAACTCTTCAATTCTAAAATGTTTTGTTTTTTAAACCATGTTC TGATGGGGAAGTTGATTTGTAAGTGTGGACAGCTTGGACATTGCTGCTGAGCTGTGGTTAGAGATGATGCCTCCATTCCTAGAGGGCTAA TAACAGCATTTAGCATATTGTTTACACATATATTTTTATGTCAAAAAAAAAACAAAAACCTTTCAAACAGAGCATTGTGATATTGTCAAA GAGAAAAACAAATCCTGAAGATACATGGAAATGTAACCTAGTTTAGGGTGGGTATTTTTCTGAAGATACATCAATACCTGACCTTTTTTA AAAAAATAATTTTAAAACAGCATACTGTGAGGAAGAACAGTATTGACATACCCACATCCCAGCATGTGTACCCTGCCAGTTCTTTTAGGG ATTTTTCCTCCAAAGAGATTTGGATTTGGTTTTGGTAAAAGGGGTTAAATTGTGCTTCCAGGCAAGAACTTTGCCTTATCATAAACAGGA AATGAAAAAGGGAAGGGCTGTCAGGATGGGATAATTTGGGAGGCTTCTCATTCTGGCTTCTATTTCTATGTGAGTACCAGCATATAGAGT GTTTTAAAAACAGATACATGTCATATAATTTATCTGCACAGACTTAGACCTTCAGGAAACATAGGTTAAGCCCCCTTTTACAAAGAAAAA GTAAACATACTTCAGCATCTTGGAGGGTAGTTTTCAAAACTCAAGTTTCATGTTTCAATGCCAAGTTCTTATTTTAAAAAATAAAATCTA CTTATAAGAGAAAGGTGCATTACTTAAAAAAAAAAAACTTTAAAGAAATGAAAGAAGAACCCTCTTCAGATACTTACTTGAAGACTGTTT TCCCCTGTTAATGAGATATAGCTAGATATCGGTGTGTGTATTTCTTTATTATTCTCTGGTTTTTGATCTGGCCTTGCCTCCAGGGCCAAA CACTGATTTAGAAAGAGAGCCTTCTAGCTATTTTGGCATTGATGGCTTTTTATACCAGTGTGTCCAGTTAGATTTACTAGGCTTACTGAC ATGCTATTGGTAAATCGCATTAAAGTTCATCTGAACCTTCTGTCTGTTGACTTCTTAGTCCTCAGACATGGGCCTTTGTGTTTTAGAATA TTTGAATTTGAGTTATTGGGCCCCACTCCCTGTTTTTTATTAAAGAACGTGAGCCTGGGATACTTTCAGAAGTATCTGTTCAATGAAAAA AAGTTGGTTTCCCATCAAATATGAATAAAATTCTCTATATATTTCATTGTATTTTGGTTATCAGCAGTCATCAATAATGTTTTTCCCTCC CCTCTCCCACCTCTTATTTTTAATTATGCCAAATATCCTAAATAATATACTTAAGCCTCCATTCCCTCATCCCTACTAGGGAAGGGGGTG AGTGTATGTGTGAGTGTATGTGTATGTATGATCCCATCTCACCCCCACCCCCATTTTGGGAGTCTTTTAAAATGAAAACAAAGTTTGGTA GTTTTGACTATTTCTAAAAGCAGAGGAGAAAAAAAAACTTATTTAAATATCCTGGAATCTGTATGGAGGAAGAAAAGGTATTTGTTAATT TTTCAGTTACGTTATCTATAAACATGATGGAAGTAAAGGTTTGGCAGAATTTCACCTTGACTATTTGAAAATTACAGACCCAATTAATTC CATTCAAAAGTGGTTTTCGTTTTGTTTTAATTATTGTACAATGAGAGATATTGTCTATTAAATACATTATTTTGAACAGATGAGAAATCT GATTCTGTTCATGAGTGGGAGGCAAAACTGGTTTGACCGTGATCATTTTTGTGGTTTTGAAAACAAATATACTTGACCCAGTTTCCTTAG TTTTTTCTTCAACTGTCCATAGGAACGATAAGTATTTGAAAGCAACATCAAATCTATACGTTTAAAGCAGGGCAGTTAGCACAAATTTGC AAGTAGAACTTCTATTAGCTTATGCCATAGACATCACCCAACCACTTGTATGTGTGTGTGTATATATAATATGCATATATAGTTACCGTG CTAAAATGGTTACCAGCAGGTTTTGAGAGAGAATGCTGCATCAGAAAAGTGTCAGTTGCCACCTCATTCTCCCTGATTTAGGTTCCTGAC ACTGATTCCTTTCTCTCTCGTTTTTGACCCCCATTGGGTGTATCTTGTCTATGTACAGATATTTTGTAATATATTAAATTTTTTTCTTTC AGTTTATAAAAATGGAAAGTGGAGATTGGAAAATTAAATATTTCCTGTTACTATACCACTTTTGCTCCATTGCATTTACTTCTTAATCTG TACCCCCTGAGCATATCTAATCATGTATAAAGGACGTTTTTCCTCCACTTTATCTTAGGGGTTCTCTGTCTCAGAATCATTATAGACTCA TTAACTCCCCCTCCCAGCAAAAGGTTATCAGGATTTGAAGAGGTGCTTGAAAACGCTAGACTAGGAACTAGAGAATAAATGAGTTGGGAA AAACCATGAAATGTGATTTTTTTAAAGTAGAAAAGTTATACAAATAATGGTACCAAACCATCAAAAGAGTTGAGCTTCATGTACCCTGAC TCCTCCTGACAGGAGAGGTAAGTGGGTTTGAGCTCAACTGTCATCAAGGGAAGTTGGTAAGAGGCTGTTTAGACCCAAAGGATAGTCTTA AACCAGACTTCACCACCCACCCTACCTCAGTTCCCATGTTATTACATGCAGAGTCAGCATGGGGATTAGTGTACCTACCTTTGCTGAGAT TTCCCGATGCGTTGCCAATCCAGAAAGTGAATCAAAAAGTTGTTTAAAAGTTAAAATCTCTATTGTTTCCAAAATCTTTCCCATCTCCAC CTGAAGACAGAATTGCTTCCCCTTCTC >6894_6894_2_ARSG-CDK12_ARSG_chr17_66352945_ENST00000448504_CDK12_chr17_37667782_ENST00000447079_length(amino acids)=836AA_BP=234 MGWLFLKVLLAGVSFSGFLYPLVDFCISGKTRGQKPNFVIILADDMGWGDLGANWAETKDTANLDKMASEGMRFVDFHAAASTCSPSRAS LLTGRLGLRNGVTRNFAVTSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLGHHGSYHPNFRGFDYYFGIPYSHDMGCTDTPGYNHPPCPA CPQGDGPSRNLQRDCYTDVALPLYENLNIVEQPVNLSSLAQKYAEKATQFIQRASRPYTNKVITLWYRPPELLLGEERYTPAIDVWSCGC ILGELFTKKPIFQANLELAQLELISRLCGSPCPAVWPDVIKLPYFNTMKPKKQYRRRLREEFSFIPSAALDLLDHMLTLDPSKRCTAEQT LQSDFLKDVELSKMAPPDLPHWQDCHELWSKKRRRQRQSGVVVEEPPPSKTSRKETTSGTSTEPVKNSSPAPPQPAPGKVESGAGDAIGL ADITQQLNQSELAVLLNLLQSQTDLSIPQMAQLLNIHSNPEMQQQLEALNQSISALTEATSQQQDSETMAPEESLKEAPSAPVILPSAEQ TTLEASSTPADMQNILAVLLSQLMKTQEPAGSLEENNSDKNSGPQGPRRTPTMPQEEAAACPPHILPPEKRPPEPPGPPPPPPPPPLVEG DLSSAPQELNPAVTAALLQLLSQPEAEPPGHLPHEHQALRPMEYSTRPRPNRTYGNTDGPETGFSAIDTDERNSGPALTESLVQTLVKNR TFSGSLSHLGESSSYQGTGSVQFPGDQDLRFARVPLALHPVVGQPFLKAEGSSNSVVHAETKLQNYGELGPGTTGASSSGAGLHWGGPTQ SSAYGKLYRGPTRVPPRGGRGRGVPY -------------------------------------------------------------- >6894_6894_3_ARSG-CDK12_ARSG_chr17_66352945_ENST00000452479_CDK12_chr17_37667782_ENST00000430627_length(transcript)=3478nt_BP=537nt GCGCCGGTCGCGCGCCCGCCAGCCTGCCGCCTGGGCTGGGGGTCACGAAAGGTTTGTGGATTTCCATGCAGCTGCCTCCACCTGCTCACC CTCCCGGGCTTCCTTGCTCACCGGCCGGCTTGGCCTTCGCAATGGAGTCACACGCAACTTTGCAGTCACTTCTGTGGGAGGCCTTCCGCT CAACGAGACCACCTTGGCAGAGGTGCTGCAGCAGGCGGGTTACGTCACTGGGATAATAGGCAAATGGCATCTTGGACACCACGGCTCTTA TCACCCCAACTTCCGTGGTTTTGATTACTACTTTGGAATCCCATATAGCCATGATATGGGCTGTACTGATACTCCAGGCTACAACCACCC TCCTTGTCCAGCGTGTCCACAGGGTGATGGACCATCAAGGAACCTTCAAAGAGACTGTTACACTGACGTGGCCCTCCCTCTTTATGAAAA CCTCAACATTGTGGAGCAGCCGGTGAACTTGAGCAGCCTTGCCCAGAAGTATGCTGAGAAAGCAACCCAGTTCATCCAGCGTGCAAGTCG CCCTTACACAAACAAAGTCATTACTTTGTGGTACCGACCTCCAGAACTACTGCTAGGAGAGGAACGTTACACACCAGCCATAGATGTTTG GAGCTGTGGATGTATTCTTGGGGAACTATTCACAAAGAAGCCTATTTTTCAAGCCAATCTGGAACTGGCTCAGCTAGAACTGATCAGCCG ACTTTGTGGTAGCCCTTGTCCAGCTGTGTGGCCTGATGTTATCAAACTGCCCTACTTCAACACCATGAAACCGAAGAAGCAATATCGAAG GCGTCTACGAGAAGAATTCTCTTTCATTCCTTCTGCAGCACTTGATTTATTGGACCACATGCTGACACTAGATCCTAGTAAGCGGTGCAC AGCTGAACAGACCCTACAGAGCGACTTCCTTAAAGATGTCGAACTCAGCAAAATGGCTCCTCCAGACCTCCCCCACTGGCAGGATTGCCA TGAGTTGTGGAGTAAGAAACGGCGACGTCAGCGACAAAGTGGTGTTGTAGTCGAAGAGCCACCTCCATCCAAAACTTCTCGAAAAGAAAC TACCTCAGGGACAAGTACTGAGCCTGTGAAGAACAGCAGCCCAGCACCACCTCAGCCTGCTCCTGGCAAGGTGGAGTCTGGGGCTGGGGA TGCAATAGGCCTTGCTGACATCACACAACAGCTGAATCAAAGTGAATTGGCAGTGTTATTAAACCTGCTGCAGAGCCAAACCGACCTGAG CATCCCTCAAATGGCACAGCTGCTTAACATCCACTCCAACCCAGAGATGCAGCAGCAGCTGGAAGCCCTGAACCAATCCATCAGTGCCCT GACGGAAGCTACTTCCCAGCAGCAGGACTCAGAGACCATGGCCCCAGAGGAGTCTTTGAAGGAAGCACCCTCTGCCCCAGTGATCCTGCC TTCAGCAGAACAGACGACCCTTGAAGCTTCAAGCACACCAGCTGACATGCAGAATATATTGGCAGTTCTCTTGAGTCAGCTGATGAAAAC CCAAGAGCCAGCAGGCAGTCTGGAGGAAAACAACAGTGACAAGAACAGTGGGCCACAGGGGCCCCGAAGAACTCCCACAATGCCACAGGA GGAGGCAGCAGAGAAGAGGCCCCCTGAGCCCCCCGGACCTCCACCGCCGCCACCTCCACCCCCTCTGGTTGAAGGCGATCTTTCCAGCGC CCCCCAGGAGTTGAACCCAGCCGTGACAGCCGCCTTGCTGCAACTTTTATCCCAGCCTGAAGCAGAGCCTCCTGGCCACCTGCCACATGA GCACCAGGCCTTGAGACCAATGGAGTACTCCACCCGACCCCGTCCAAACAGGACTTATGGAAACACTGATGGGCCTGAAACAGGGTTCAG TGCCATTGACACTGATGAACGAAACTCTGGTCCAGCCTTGACAGAATCCTTGGTCCAGACCCTGGTGAAGAACAGGACCTTCTCAGGCTC TCTGAGCCACCTTGGGGAGTCCAGCAGTTACCAGGGCACAGGGTCAGTGCAGTTTCCAGGGGACCAGGACCTCCGTTTTGCCAGGGTCCC CTTAGCGTTACACCCGGTGGTCGGGCAACCATTCCTGAAGGCTGAGGGAAGCAGCAATTCTGTGGTACATGCAGAGACCAAATTGCAAAA CTATGGGGAGCTGGGGCCAGGAACCACTGGGGCCAGCAGCTCAGGAGCAGGCCTTCACTGGGGGGGCCCAACTCAGTCTTCTGCTTATGG AAAACTCTATCGGGGGCCTACAAGAGTCCCACCAAGAGGGGGAAGAGGGAGAGGAGTTCCTTACTAACCCAGAGACTTCAGTGTCCTGAA AGATTCCTTTCCTATCCATCCTTCCATCCAGTTCTCTGAATCTTTAATGAAATCATTTGCCAGAGCGAGGTAATCATCTGCATTTGGCTA CTGCAAAGCTGTCCGTTGTATTCCTTGCTCACTTGCTACTAGCAGGCGACTTACGAAATAATGATGTTGGCACCAGTTCCCCCTGGATGG GCTATAGCCAGAACATTTACTTCAACTCTACCTTAGTAGATACAAGTAGAGAATATGGAGAGGATCATTACATTGAAAAGTAAATGTTTT ATTAGTTCATTGCCTGCACTTACTGATCGGAAGAGAGAAAGAACAGTTTCAGTATTGAGATGGCTCAGGAGAGGCTCTTTGATTTTTAAA GTTTTGGGGTGGGGGATTGTGTGTGGTTTCTTTCTTTTGAATTTTAATTTAGGTGTTTTGGGTTTTTTTCCTTTAAAGAGAATAGTGTTC ACAAAATTTGAGCTGCTCTTTGGCTTTTGCTATAAGGGAAACAGAGTGGCCTGGCTGATTTGAATAAATGTTTCTTTCCTCTCCACCATC TCACATTTTGCTTTTAAGTGAACACTTTTTCCCCATTGAGCATCTTGAACATACTTTTTTTCCAAATAAATTACTCATCCTTAAAGTTTA CTCCACTTTGACAAAAGATACGCCCTTCTCCCTGCACATAAAGCAGGTTGTAGAACGTGGCATTCTTGGGCAAGTAGGTAGACTTTACCC AGTCTCTTTCCTTTTTTGCTGATGTGTGCTCTCTCTCTCTCTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGTCTCG CTTGCTCGCTCTCGCTGTTTCTCTCTCTTTGAGGCATTTGTTTGGAAAAAATCGTTGAGATGCCCAAGAACCTGGGATAATTCTTTACTT TTTTTGAAATAAAGGAAAGGAAATTCAGACTCTTACATTGTTCTCTGTAACTCTTCAATTCTAAAATGTTTTGTTTTTTAAACCATGTTC TGATGGGGAAGTTGATTTGTAAGTGTGGACAGCTTGGACATTGCTGCTGAGCTGTGGTTAGAGATGATGCCTCCATTCCTAGAGGGCTAA TAACAGCATTTAGCATATTGTTTACACATATATTTTTATGTCAAAAAAAAAACAAAAA >6894_6894_3_ARSG-CDK12_ARSG_chr17_66352945_ENST00000452479_CDK12_chr17_37667782_ENST00000430627_length(amino acids)=737AA_BP=144 MLTGRLGLRNGVTRNFAVTSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLGHHGSYHPNFRGFDYYFGIPYSHDMGCTDTPGYNHPPCPA CPQGDGPSRNLQRDCYTDVALPLYENLNIVEQPVNLSSLAQKYAEKATQFIQRASRPYTNKVITLWYRPPELLLGEERYTPAIDVWSCGC ILGELFTKKPIFQANLELAQLELISRLCGSPCPAVWPDVIKLPYFNTMKPKKQYRRRLREEFSFIPSAALDLLDHMLTLDPSKRCTAEQT LQSDFLKDVELSKMAPPDLPHWQDCHELWSKKRRRQRQSGVVVEEPPPSKTSRKETTSGTSTEPVKNSSPAPPQPAPGKVESGAGDAIGL ADITQQLNQSELAVLLNLLQSQTDLSIPQMAQLLNIHSNPEMQQQLEALNQSISALTEATSQQQDSETMAPEESLKEAPSAPVILPSAEQ TTLEASSTPADMQNILAVLLSQLMKTQEPAGSLEENNSDKNSGPQGPRRTPTMPQEEAAEKRPPEPPGPPPPPPPPPLVEGDLSSAPQEL NPAVTAALLQLLSQPEAEPPGHLPHEHQALRPMEYSTRPRPNRTYGNTDGPETGFSAIDTDERNSGPALTESLVQTLVKNRTFSGSLSHL GESSSYQGTGSVQFPGDQDLRFARVPLALHPVVGQPFLKAEGSSNSVVHAETKLQNYGELGPGTTGASSSGAGLHWGGPTQSSAYGKLYR GPTRVPPRGGRGRGVPY -------------------------------------------------------------- >6894_6894_4_ARSG-CDK12_ARSG_chr17_66352945_ENST00000452479_CDK12_chr17_37667782_ENST00000447079_length(transcript)=6174nt_BP=537nt GCGCCGGTCGCGCGCCCGCCAGCCTGCCGCCTGGGCTGGGGGTCACGAAAGGTTTGTGGATTTCCATGCAGCTGCCTCCACCTGCTCACC CTCCCGGGCTTCCTTGCTCACCGGCCGGCTTGGCCTTCGCAATGGAGTCACACGCAACTTTGCAGTCACTTCTGTGGGAGGCCTTCCGCT CAACGAGACCACCTTGGCAGAGGTGCTGCAGCAGGCGGGTTACGTCACTGGGATAATAGGCAAATGGCATCTTGGACACCACGGCTCTTA TCACCCCAACTTCCGTGGTTTTGATTACTACTTTGGAATCCCATATAGCCATGATATGGGCTGTACTGATACTCCAGGCTACAACCACCC TCCTTGTCCAGCGTGTCCACAGGGTGATGGACCATCAAGGAACCTTCAAAGAGACTGTTACACTGACGTGGCCCTCCCTCTTTATGAAAA CCTCAACATTGTGGAGCAGCCGGTGAACTTGAGCAGCCTTGCCCAGAAGTATGCTGAGAAAGCAACCCAGTTCATCCAGCGTGCAAGTCG CCCTTACACAAACAAAGTCATTACTTTGTGGTACCGACCTCCAGAACTACTGCTAGGAGAGGAACGTTACACACCAGCCATAGATGTTTG GAGCTGTGGATGTATTCTTGGGGAACTATTCACAAAGAAGCCTATTTTTCAAGCCAATCTGGAACTGGCTCAGCTAGAACTGATCAGCCG ACTTTGTGGTAGCCCTTGTCCAGCTGTGTGGCCTGATGTTATCAAACTGCCCTACTTCAACACCATGAAACCGAAGAAGCAATATCGAAG GCGTCTACGAGAAGAATTCTCTTTCATTCCTTCTGCAGCACTTGATTTATTGGACCACATGCTGACACTAGATCCTAGTAAGCGGTGCAC AGCTGAACAGACCCTACAGAGCGACTTCCTTAAAGATGTCGAACTCAGCAAAATGGCTCCTCCAGACCTCCCCCACTGGCAGGATTGCCA TGAGTTGTGGAGTAAGAAACGGCGACGTCAGCGACAAAGTGGTGTTGTAGTCGAAGAGCCACCTCCATCCAAAACTTCTCGAAAAGAAAC TACCTCAGGGACAAGTACTGAGCCTGTGAAGAACAGCAGCCCAGCACCACCTCAGCCTGCTCCTGGCAAGGTGGAGTCTGGGGCTGGGGA TGCAATAGGCCTTGCTGACATCACACAACAGCTGAATCAAAGTGAATTGGCAGTGTTATTAAACCTGCTGCAGAGCCAAACCGACCTGAG CATCCCTCAAATGGCACAGCTGCTTAACATCCACTCCAACCCAGAGATGCAGCAGCAGCTGGAAGCCCTGAACCAATCCATCAGTGCCCT GACGGAAGCTACTTCCCAGCAGCAGGACTCAGAGACCATGGCCCCAGAGGAGTCTTTGAAGGAAGCACCCTCTGCCCCAGTGATCCTGCC TTCAGCAGAACAGACGACCCTTGAAGCTTCAAGCACACCAGCTGACATGCAGAATATATTGGCAGTTCTCTTGAGTCAGCTGATGAAAAC CCAAGAGCCAGCAGGCAGTCTGGAGGAAAACAACAGTGACAAGAACAGTGGGCCACAGGGGCCCCGAAGAACTCCCACAATGCCACAGGA GGAGGCAGCAGCATGTCCTCCTCACATTCTTCCACCAGAGAAGAGGCCCCCTGAGCCCCCCGGACCTCCACCGCCGCCACCTCCACCCCC TCTGGTTGAAGGCGATCTTTCCAGCGCCCCCCAGGAGTTGAACCCAGCCGTGACAGCCGCCTTGCTGCAACTTTTATCCCAGCCTGAAGC AGAGCCTCCTGGCCACCTGCCACATGAGCACCAGGCCTTGAGACCAATGGAGTACTCCACCCGACCCCGTCCAAACAGGACTTATGGAAA CACTGATGGGCCTGAAACAGGGTTCAGTGCCATTGACACTGATGAACGAAACTCTGGTCCAGCCTTGACAGAATCCTTGGTCCAGACCCT GGTGAAGAACAGGACCTTCTCAGGCTCTCTGAGCCACCTTGGGGAGTCCAGCAGTTACCAGGGCACAGGGTCAGTGCAGTTTCCAGGGGA CCAGGACCTCCGTTTTGCCAGGGTCCCCTTAGCGTTACACCCGGTGGTCGGGCAACCATTCCTGAAGGCTGAGGGAAGCAGCAATTCTGT GGTACATGCAGAGACCAAATTGCAAAACTATGGGGAGCTGGGGCCAGGAACCACTGGGGCCAGCAGCTCAGGAGCAGGCCTTCACTGGGG GGGCCCAACTCAGTCTTCTGCTTATGGAAAACTCTATCGGGGGCCTACAAGAGTCCCACCAAGAGGGGGAAGAGGGAGAGGAGTTCCTTA CTAACCCAGAGACTTCAGTGTCCTGAAAGATTCCTTTCCTATCCATCCTTCCATCCAGTTCTCTGAATCTTTAATGAAATCATTTGCCAG AGCGAGGTAATCATCTGCATTTGGCTACTGCAAAGCTGTCCGTTGTATTCCTTGCTCACTTGCTACTAGCAGGCGACTTACGAAATAATG ATGTTGGCACCAGTTCCCCCTGGATGGGCTATAGCCAGAACATTTACTTCAACTCTACCTTAGTAGATACAAGTAGAGAATATGGAGAGG ATCATTACATTGAAAAGTAAATGTTTTATTAGTTCATTGCCTGCACTTACTGATCGGAAGAGAGAAAGAACAGTTTCAGTATTGAGATGG CTCAGGAGAGGCTCTTTGATTTTTAAAGTTTTGGGGTGGGGGATTGTGTGTGGTTTCTTTCTTTTGAATTTTAATTTAGGTGTTTTGGGT TTTTTTCCTTTAAAGAGAATAGTGTTCACAAAATTTGAGCTGCTCTTTGGCTTTTGCTATAAGGGAAACAGAGTGGCCTGGCTGATTTGA ATAAATGTTTCTTTCCTCTCCACCATCTCACATTTTGCTTTTAAGTGAACACTTTTTCCCCATTGAGCATCTTGAACATACTTTTTTTCC AAATAAATTACTCATCCTTAAAGTTTACTCCACTTTGACAAAAGATACGCCCTTCTCCCTGCACATAAAGCAGGTTGTAGAACGTGGCAT TCTTGGGCAAGTAGGTAGACTTTACCCAGTCTCTTTCCTTTTTTGCTGATGTGTGCTCTCTCTCTCTCTTTCTCTCTCTCTCTCTCTCTC TCTCTCTCTCTCTCTCTCTCTGTCTCGCTTGCTCGCTCTCGCTGTTTCTCTCTCTTTGAGGCATTTGTTTGGAAAAAATCGTTGAGATGC CCAAGAACCTGGGATAATTCTTTACTTTTTTTGAAATAAAGGAAAGGAAATTCAGACTCTTACATTGTTCTCTGTAACTCTTCAATTCTA AAATGTTTTGTTTTTTAAACCATGTTCTGATGGGGAAGTTGATTTGTAAGTGTGGACAGCTTGGACATTGCTGCTGAGCTGTGGTTAGAG ATGATGCCTCCATTCCTAGAGGGCTAATAACAGCATTTAGCATATTGTTTACACATATATTTTTATGTCAAAAAAAAAACAAAAACCTTT CAAACAGAGCATTGTGATATTGTCAAAGAGAAAAACAAATCCTGAAGATACATGGAAATGTAACCTAGTTTAGGGTGGGTATTTTTCTGA AGATACATCAATACCTGACCTTTTTTAAAAAAATAATTTTAAAACAGCATACTGTGAGGAAGAACAGTATTGACATACCCACATCCCAGC ATGTGTACCCTGCCAGTTCTTTTAGGGATTTTTCCTCCAAAGAGATTTGGATTTGGTTTTGGTAAAAGGGGTTAAATTGTGCTTCCAGGC AAGAACTTTGCCTTATCATAAACAGGAAATGAAAAAGGGAAGGGCTGTCAGGATGGGATAATTTGGGAGGCTTCTCATTCTGGCTTCTAT TTCTATGTGAGTACCAGCATATAGAGTGTTTTAAAAACAGATACATGTCATATAATTTATCTGCACAGACTTAGACCTTCAGGAAACATA GGTTAAGCCCCCTTTTACAAAGAAAAAGTAAACATACTTCAGCATCTTGGAGGGTAGTTTTCAAAACTCAAGTTTCATGTTTCAATGCCA AGTTCTTATTTTAAAAAATAAAATCTACTTATAAGAGAAAGGTGCATTACTTAAAAAAAAAAAACTTTAAAGAAATGAAAGAAGAACCCT CTTCAGATACTTACTTGAAGACTGTTTTCCCCTGTTAATGAGATATAGCTAGATATCGGTGTGTGTATTTCTTTATTATTCTCTGGTTTT TGATCTGGCCTTGCCTCCAGGGCCAAACACTGATTTAGAAAGAGAGCCTTCTAGCTATTTTGGCATTGATGGCTTTTTATACCAGTGTGT CCAGTTAGATTTACTAGGCTTACTGACATGCTATTGGTAAATCGCATTAAAGTTCATCTGAACCTTCTGTCTGTTGACTTCTTAGTCCTC AGACATGGGCCTTTGTGTTTTAGAATATTTGAATTTGAGTTATTGGGCCCCACTCCCTGTTTTTTATTAAAGAACGTGAGCCTGGGATAC TTTCAGAAGTATCTGTTCAATGAAAAAAAGTTGGTTTCCCATCAAATATGAATAAAATTCTCTATATATTTCATTGTATTTTGGTTATCA GCAGTCATCAATAATGTTTTTCCCTCCCCTCTCCCACCTCTTATTTTTAATTATGCCAAATATCCTAAATAATATACTTAAGCCTCCATT CCCTCATCCCTACTAGGGAAGGGGGTGAGTGTATGTGTGAGTGTATGTGTATGTATGATCCCATCTCACCCCCACCCCCATTTTGGGAGT CTTTTAAAATGAAAACAAAGTTTGGTAGTTTTGACTATTTCTAAAAGCAGAGGAGAAAAAAAAACTTATTTAAATATCCTGGAATCTGTA TGGAGGAAGAAAAGGTATTTGTTAATTTTTCAGTTACGTTATCTATAAACATGATGGAAGTAAAGGTTTGGCAGAATTTCACCTTGACTA TTTGAAAATTACAGACCCAATTAATTCCATTCAAAAGTGGTTTTCGTTTTGTTTTAATTATTGTACAATGAGAGATATTGTCTATTAAAT ACATTATTTTGAACAGATGAGAAATCTGATTCTGTTCATGAGTGGGAGGCAAAACTGGTTTGACCGTGATCATTTTTGTGGTTTTGAAAA CAAATATACTTGACCCAGTTTCCTTAGTTTTTTCTTCAACTGTCCATAGGAACGATAAGTATTTGAAAGCAACATCAAATCTATACGTTT AAAGCAGGGCAGTTAGCACAAATTTGCAAGTAGAACTTCTATTAGCTTATGCCATAGACATCACCCAACCACTTGTATGTGTGTGTGTAT ATATAATATGCATATATAGTTACCGTGCTAAAATGGTTACCAGCAGGTTTTGAGAGAGAATGCTGCATCAGAAAAGTGTCAGTTGCCACC TCATTCTCCCTGATTTAGGTTCCTGACACTGATTCCTTTCTCTCTCGTTTTTGACCCCCATTGGGTGTATCTTGTCTATGTACAGATATT TTGTAATATATTAAATTTTTTTCTTTCAGTTTATAAAAATGGAAAGTGGAGATTGGAAAATTAAATATTTCCTGTTACTATACCACTTTT GCTCCATTGCATTTACTTCTTAATCTGTACCCCCTGAGCATATCTAATCATGTATAAAGGACGTTTTTCCTCCACTTTATCTTAGGGGTT CTCTGTCTCAGAATCATTATAGACTCATTAACTCCCCCTCCCAGCAAAAGGTTATCAGGATTTGAAGAGGTGCTTGAAAACGCTAGACTA GGAACTAGAGAATAAATGAGTTGGGAAAAACCATGAAATGTGATTTTTTTAAAGTAGAAAAGTTATACAAATAATGGTACCAAACCATCA AAAGAGTTGAGCTTCATGTACCCTGACTCCTCCTGACAGGAGAGGTAAGTGGGTTTGAGCTCAACTGTCATCAAGGGAAGTTGGTAAGAG GCTGTTTAGACCCAAAGGATAGTCTTAAACCAGACTTCACCACCCACCCTACCTCAGTTCCCATGTTATTACATGCAGAGTCAGCATGGG GATTAGTGTACCTACCTTTGCTGAGATTTCCCGATGCGTTGCCAATCCAGAAAGTGAATCAAAAAGTTGTTTAAAAGTTAAAATCTCTAT TGTTTCCAAAATCTTTCCCATCTCCACCTGAAGACAGAATTGCTTCCCCTTCTC >6894_6894_4_ARSG-CDK12_ARSG_chr17_66352945_ENST00000452479_CDK12_chr17_37667782_ENST00000447079_length(amino acids)=746AA_BP=144 MLTGRLGLRNGVTRNFAVTSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLGHHGSYHPNFRGFDYYFGIPYSHDMGCTDTPGYNHPPCPA CPQGDGPSRNLQRDCYTDVALPLYENLNIVEQPVNLSSLAQKYAEKATQFIQRASRPYTNKVITLWYRPPELLLGEERYTPAIDVWSCGC ILGELFTKKPIFQANLELAQLELISRLCGSPCPAVWPDVIKLPYFNTMKPKKQYRRRLREEFSFIPSAALDLLDHMLTLDPSKRCTAEQT LQSDFLKDVELSKMAPPDLPHWQDCHELWSKKRRRQRQSGVVVEEPPPSKTSRKETTSGTSTEPVKNSSPAPPQPAPGKVESGAGDAIGL ADITQQLNQSELAVLLNLLQSQTDLSIPQMAQLLNIHSNPEMQQQLEALNQSISALTEATSQQQDSETMAPEESLKEAPSAPVILPSAEQ TTLEASSTPADMQNILAVLLSQLMKTQEPAGSLEENNSDKNSGPQGPRRTPTMPQEEAAACPPHILPPEKRPPEPPGPPPPPPPPPLVEG DLSSAPQELNPAVTAALLQLLSQPEAEPPGHLPHEHQALRPMEYSTRPRPNRTYGNTDGPETGFSAIDTDERNSGPALTESLVQTLVKNR TFSGSLSHLGESSSYQGTGSVQFPGDQDLRFARVPLALHPVVGQPFLKAEGSSNSVVHAETKLQNYGELGPGTTGASSSGAGLHWGGPTQ SSAYGKLYRGPTRVPPRGGRGRGVPY -------------------------------------------------------------- |
Top |
Fusion Gene PPI Analysis for ARSG-CDK12 |
![]() |
![]() |
Hgene | Hgene's interactors | Tgene | Tgene's interactors |
![]() |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with |
![]() |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with |
![]() |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with |
Top |
Related Drugs for ARSG-CDK12 |
![]() (DrugBank Version 5.1.8 2021-05-08) |
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status |
Top |
Related Diseases for ARSG-CDK12 |
![]() (DisGeNet 4.0) |
Partner | Gene | Disease ID | Disease name | # pubmeds | Source |
Hgene | ARSG | C4748364 | USHER SYNDROME, TYPE IV | 2 | GENOMICS_ENGLAND;UNIPROT |
Hgene | ARSG | C0027877 | Neuronal Ceroid-Lipofuscinoses | 1 | GENOMICS_ENGLAND |
Hgene | ARSG | C1568248 | Usher Syndrome, Type III | 1 | ORPHANET |
Tgene | C0033578 | Prostatic Neoplasms | 1 | CTD_human | |
Tgene | C0376358 | Malignant neoplasm of prostate | 1 | CTD_human |