FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:NSRP1-CDK12 (FusionGDB2 ID:60179)

Fusion Gene Summary for NSRP1-CDK12

check button Fusion gene summary
Fusion gene informationFusion gene name: NSRP1-CDK12
Fusion gene ID: 60179
HgeneTgene
Gene symbol

NSRP1

CDK12

Gene ID

84081

51755

Gene namenuclear speckle splicing regulatory protein 1cyclin dependent kinase 12
SynonymsCCDC55|HSPC095|NSrp70CRK7|CRKR|CRKRS
Cytomap

17q11.2

17q12

Type of geneprotein-codingprotein-coding
Descriptionnuclear speckle splicing regulatory protein 1coiled-coil domain containing 55coiled-coil domain-containing protein 55nuclear speckle-related protein 70cyclin-dependent kinase 12CDC2-related protein kinase 7Cdc2-related kinase, arginine/serine-richcell division cycle 2-related protein kinase 7cell division protein kinase 12
Modification date2020031320200313
UniProtAcc.

Q9NYV4

Ensembl transtripts involved in fusion geneENST00000540900, ENST00000584423, 
ENST00000247026, ENST00000479218, 
ENST00000430627, ENST00000447079, 
ENST00000559545, 
Fusion gene scores* DoF score15 X 10 X 8=120036 X 30 X 14=15120
# samples 1755
** MAII scorelog2(17/1200*10)=-2.81942775435818
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(55/15120*10)=-4.78088271069641
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: NSRP1 [Title/Abstract] AND CDK12 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointNSRP1(28443881)-CDK12(37646810), # samples:2
NSRP1(28445191)-CDK12(37646810), # samples:2
Anticipated loss of major functional domain due to fusion event.NSRP1-CDK12 seems lost the major protein functional domain in Hgene partner, which is a essential gene due to the frame-shifted ORF.
NSRP1-CDK12 seems lost the major protein functional domain in Tgene partner, which is a kinase due to the frame-shifted ORF.
NSRP1-CDK12 seems lost the major protein functional domain in Tgene partner, which is a CGC due to the frame-shifted ORF.
NSRP1-CDK12 seems lost the major protein functional domain in Tgene partner, which is a essential gene due to the frame-shifted ORF.
NSRP1-CDK12 seems lost the major protein functional domain in Tgene partner, which is a IUPHAR drug target due to the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneNSRP1

GO:0000381

regulation of alternative mRNA splicing, via spliceosome

21296756

TgeneCDK12

GO:0046777

protein autophosphorylation

11683387


check buttonFusion gene breakpoints across NSRP1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.

check buttonFusion gene breakpoints across CDK12 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4PRADTCGA-YJ-A8SW-01ANSRP1chr17

28443881

+CDK12chr17

37646810

+
ChimerDB4PRADTCGA-YJ-A8SWNSRP1chr17

28445191

+CDK12chr17

37646810

+
ChimerDB4PRADTCGA-YJ-A8SWNSRP1chr17

28443881

+CDK12chr17

37646810

+
ChimerDB4PRADTCGA-YJ-A8SW-01ANSRP1chr17

28445191

-CDK12chr17

37646810

+


Top

Fusion Gene ORF analysis for NSRP1-CDK12

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
3UTR-3CDSENST00000540900ENST00000430627NSRP1chr17

28443881

+CDK12chr17

37646810

+
3UTR-3CDSENST00000540900ENST00000447079NSRP1chr17

28443881

+CDK12chr17

37646810

+
3UTR-intronENST00000540900ENST00000559545NSRP1chr17

28443881

+CDK12chr17

37646810

+
In-frameENST00000584423ENST00000430627NSRP1chr17

28443881

+CDK12chr17

37646810

+
In-frameENST00000584423ENST00000447079NSRP1chr17

28443881

+CDK12chr17

37646810

+
5CDS-intronENST00000584423ENST00000559545NSRP1chr17

28443881

+CDK12chr17

37646810

+
In-frameENST00000247026ENST00000430627NSRP1chr17

28443881

+CDK12chr17

37646810

+
In-frameENST00000247026ENST00000447079NSRP1chr17

28443881

+CDK12chr17

37646810

+
5CDS-intronENST00000247026ENST00000559545NSRP1chr17

28443881

+CDK12chr17

37646810

+
In-frameENST00000479218ENST00000430627NSRP1chr17

28443881

+CDK12chr17

37646810

+
In-frameENST00000479218ENST00000447079NSRP1chr17

28443881

+CDK12chr17

37646810

+
5CDS-intronENST00000479218ENST00000559545NSRP1chr17

28443881

+CDK12chr17

37646810

+
intron-3CDSENST00000540900ENST00000430627NSRP1chr17

28445191

+CDK12chr17

37646810

+
intron-3CDSENST00000540900ENST00000447079NSRP1chr17

28445191

+CDK12chr17

37646810

+
intron-intronENST00000540900ENST00000559545NSRP1chr17

28445191

+CDK12chr17

37646810

+
Frame-shiftENST00000584423ENST00000430627NSRP1chr17

28445191

+CDK12chr17

37646810

+
Frame-shiftENST00000584423ENST00000447079NSRP1chr17

28445191

+CDK12chr17

37646810

+
5CDS-intronENST00000584423ENST00000559545NSRP1chr17

28445191

+CDK12chr17

37646810

+
Frame-shiftENST00000247026ENST00000430627NSRP1chr17

28445191

+CDK12chr17

37646810

+
Frame-shiftENST00000247026ENST00000447079NSRP1chr17

28445191

+CDK12chr17

37646810

+
5CDS-intronENST00000247026ENST00000559545NSRP1chr17

28445191

+CDK12chr17

37646810

+
3UTR-3CDSENST00000479218ENST00000430627NSRP1chr17

28445191

+CDK12chr17

37646810

+
3UTR-3CDSENST00000479218ENST00000447079NSRP1chr17

28445191

+CDK12chr17

37646810

+
3UTR-intronENST00000479218ENST00000559545NSRP1chr17

28445191

+CDK12chr17

37646810

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000584423NSRP1chr1728443881+ENST00000430627CDK12chr1737646810+375983632597844
ENST00000584423NSRP1chr1728443881+ENST00000447079CDK12chr1737646810+645583632624853
ENST00000247026NSRP1chr1728443881+ENST00000430627CDK12chr1737646810+375983632597844
ENST00000247026NSRP1chr1728443881+ENST00000447079CDK12chr1737646810+645583632624853
ENST00000479218NSRP1chr1728443881+ENST00000430627CDK12chr1737646810+371337172551844
ENST00000479218NSRP1chr1728443881+ENST00000447079CDK12chr1737646810+640937172578853

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000584423ENST00000430627NSRP1chr1728443881+CDK12chr1737646810+0.0015149440.998485
ENST00000584423ENST00000447079NSRP1chr1728443881+CDK12chr1737646810+0.0001882810.9998117
ENST00000247026ENST00000430627NSRP1chr1728443881+CDK12chr1737646810+0.0015149440.998485
ENST00000247026ENST00000447079NSRP1chr1728443881+CDK12chr1737646810+0.0001882810.9998117
ENST00000479218ENST00000430627NSRP1chr1728443881+CDK12chr1737646810+0.0014809190.9985191
ENST00000479218ENST00000447079NSRP1chr1728443881+CDK12chr1737646810+0.0001828360.9998172

Top

Fusion Genomic Features for NSRP1-CDK12


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
NSRP1chr1728445191+CDK12chr1737646809+3.73E-060.9999963
NSRP1chr1728443881+CDK12chr1737646809+3.82E-101
NSRP1chr1728443881+CDK12chr1737646809+3.82E-101
NSRP1chr1728445191+CDK12chr1737646809+3.73E-060.9999963
NSRP1chr1728443881+CDK12chr1737646809+3.82E-101
NSRP1chr1728443881+CDK12chr1737646809+3.82E-101

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for NSRP1-CDK12


check button Go to

FGviewer for the breakpoints of chr17:28443881-chr17:37646810

.
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
.CDK12

Q9NYV4

FUNCTION: Might normally function as a transcriptional repressor. EWS-fusion-proteins (EFPS) may play a role in the tumorigenic process. They may disturb gene expression by mimicking, or interfering with the normal function of CTD-POLII within the transcription initiation complex. They may also contribute to an aberrant activation of the fusion protein target genes.FUNCTION: Cyclin-dependent kinase that phosphorylates the C-terminal domain (CTD) of the large subunit of RNA polymerase II (POLR2A), thereby acting as a key regulator of transcription elongation. Regulates the expression of genes involved in DNA repair and is required for the maintenance of genomic stability. Preferentially phosphorylates 'Ser-5' in CTD repeats that are already phosphorylated at 'Ser-7', but can also phosphorylate 'Ser-2'. Required for RNA splicing, possibly by phosphorylating SRSF1/SF2. Involved in regulation of MAP kinase activity, possibly leading to affect the response to estrogen inhibitors. {ECO:0000269|PubMed:11683387, ECO:0000269|PubMed:19651820, ECO:0000269|PubMed:20952539, ECO:0000269|PubMed:22012619, ECO:0000269|PubMed:24662513}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneCDK12chr17:28443881chr17:37646810ENST000004306271141266_1280643.66666666666661482.0Compositional biasNote=Poly-Pro
TgeneCDK12chr17:28443881chr17:37646810ENST000004470791141266_1280643.66666666666661491.0Compositional biasNote=Poly-Pro
TgeneCDK12chr17:28443881chr17:37646810ENST00000430627114727_1020643.66666666666661482.0DomainProtein kinase
TgeneCDK12chr17:28443881chr17:37646810ENST00000447079114727_1020643.66666666666661491.0DomainProtein kinase
TgeneCDK12chr17:28443881chr17:37646810ENST00000430627114733_741643.66666666666661482.0Nucleotide bindingNote=ATP
TgeneCDK12chr17:28443881chr17:37646810ENST00000430627114814_819643.66666666666661482.0Nucleotide bindingNote=ATP
TgeneCDK12chr17:28443881chr17:37646810ENST00000447079114733_741643.66666666666661491.0Nucleotide bindingNote=ATP
TgeneCDK12chr17:28443881chr17:37646810ENST00000447079114814_819643.66666666666661491.0Nucleotide bindingNote=ATP

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneNSRP1chr17:28443881chr17:37646810ENST00000247026+17104_1706.666666666666667559.0Coiled coilOntology_term=ECO:0000255
HgeneNSRP1chr17:28443881chr17:37646810ENST00000247026+17379_4276.666666666666667559.0Coiled coilOntology_term=ECO:0000255
HgeneNSRP1chr17:28443881chr17:37646810ENST00000247026+17282_3596.666666666666667559.0Compositional biasNote=His-rich
HgeneNSRP1chr17:28443881chr17:37646810ENST00000247026+1732_376.666666666666667559.0Compositional biasNote=Poly-Asp
HgeneNSRP1chr17:28443881chr17:37646810ENST00000247026+17106_1706.666666666666667559.0RegionNote=Necessary for alternative splicing activity
TgeneCDK12chr17:28443881chr17:37646810ENST00000430627114407_413643.66666666666661482.0Compositional biasNote=Poly-Ala
TgeneCDK12chr17:28443881chr17:37646810ENST00000430627114535_540643.66666666666661482.0Compositional biasNote=Poly-Pro
TgeneCDK12chr17:28443881chr17:37646810ENST00000447079114407_413643.66666666666661491.0Compositional biasNote=Poly-Ala
TgeneCDK12chr17:28443881chr17:37646810ENST00000447079114535_540643.66666666666661491.0Compositional biasNote=Poly-Pro


Top

Fusion Gene Sequence for NSRP1-CDK12


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>In-frame_ENST00000584423_ENST00000430627_TCGA-YJ-A8SW-01A_NSRP1_chr17_28443881_+_CDK12_chr17_37646810_length(transcript)=3759nt_BP=83nt
CGTGACGACATCGGCGTCAGCGTCACGGAGGCGTCGGCCACGTTCAGCGGACACGGGAGCAAGATGGCGATTCCGGGCAGGCATCCAAAA
GAAACTCTTCCTTCAAAACCTGTGAAGAAAGAGAAGGAACAGAGGACACGTCACTTACTCACAGACCTTCCTCTCCCTCCAGAGCTCCCT
GGTGGAGATCTGTCTCCCCCAGACTCTCCAGAACCAAAGGCAATCACACCACCTCAGCAACCATATAAAAAGAGACCAAAAATTTGTTGT
CCTCGTTATGGAGAAAGAAGACAAACAGAAAGCGACTGGGGGAAACGCTGTGTGGACAAGTTTGACATTATTGGGATTATTGGAGAAGGA
ACCTATGGCCAAGTATATAAAGCCAAGGACAAAGACACAGGAGAACTAGTGGCTCTGAAGAAGGTGAGACTAGACAATGAGAAAGAGGGC
TTCCCAATCACAGCCATTCGTGAAATCAAAATCCTTCGTCAGTTAATCCACCGAAGTGTTGTTAACATGAAGGAAATTGTCACAGATAAA
CAAGATGCACTGGATTTCAAGAAGGACAAAGGTGCCTTTTACCTTGTATTTGAGTATATGGACCATGACTTAATGGGACTGCTAGAATCT
GGTTTGGTGCACTTTTCTGAGGACCATATCAAGTCGTTCATGAAACAGCTAATGGAAGGATTGGAATACTGTCACAAAAAGAATTTCCTG
CATCGGGATATTAAGTGTTCTAACATTTTGCTGAATAACAGTGGGCAAATCAAACTAGCAGATTTTGGACTTGCTCGGCTCTATAACTCT
GAAGAGAGTCGCCCTTACACAAACAAAGTCATTACTTTGTGGTACCGACCTCCAGAACTACTGCTAGGAGAGGAACGTTACACACCAGCC
ATAGATGTTTGGAGCTGTGGATGTATTCTTGGGGAACTATTCACAAAGAAGCCTATTTTTCAAGCCAATCTGGAACTGGCTCAGCTAGAA
CTGATCAGCCGACTTTGTGGTAGCCCTTGTCCAGCTGTGTGGCCTGATGTTATCAAACTGCCCTACTTCAACACCATGAAACCGAAGAAG
CAATATCGAAGGCGTCTACGAGAAGAATTCTCTTTCATTCCTTCTGCAGCACTTGATTTATTGGACCACATGCTGACACTAGATCCTAGT
AAGCGGTGCACAGCTGAACAGACCCTACAGAGCGACTTCCTTAAAGATGTCGAACTCAGCAAAATGGCTCCTCCAGACCTCCCCCACTGG
CAGGATTGCCATGAGTTGTGGAGTAAGAAACGGCGACGTCAGCGACAAAGTGGTGTTGTAGTCGAAGAGCCACCTCCATCCAAAACTTCT
CGAAAAGAAACTACCTCAGGGACAAGTACTGAGCCTGTGAAGAACAGCAGCCCAGCACCACCTCAGCCTGCTCCTGGCAAGGTGGAGTCT
GGGGCTGGGGATGCAATAGGCCTTGCTGACATCACACAACAGCTGAATCAAAGTGAATTGGCAGTGTTATTAAACCTGCTGCAGAGCCAA
ACCGACCTGAGCATCCCTCAAATGGCACAGCTGCTTAACATCCACTCCAACCCAGAGATGCAGCAGCAGCTGGAAGCCCTGAACCAATCC
ATCAGTGCCCTGACGGAAGCTACTTCCCAGCAGCAGGACTCAGAGACCATGGCCCCAGAGGAGTCTTTGAAGGAAGCACCCTCTGCCCCA
GTGATCCTGCCTTCAGCAGAACAGACGACCCTTGAAGCTTCAAGCACACCAGCTGACATGCAGAATATATTGGCAGTTCTCTTGAGTCAG
CTGATGAAAACCCAAGAGCCAGCAGGCAGTCTGGAGGAAAACAACAGTGACAAGAACAGTGGGCCACAGGGGCCCCGAAGAACTCCCACA
ATGCCACAGGAGGAGGCAGCAGAGAAGAGGCCCCCTGAGCCCCCCGGACCTCCACCGCCGCCACCTCCACCCCCTCTGGTTGAAGGCGAT
CTTTCCAGCGCCCCCCAGGAGTTGAACCCAGCCGTGACAGCCGCCTTGCTGCAACTTTTATCCCAGCCTGAAGCAGAGCCTCCTGGCCAC
CTGCCACATGAGCACCAGGCCTTGAGACCAATGGAGTACTCCACCCGACCCCGTCCAAACAGGACTTATGGAAACACTGATGGGCCTGAA
ACAGGGTTCAGTGCCATTGACACTGATGAACGAAACTCTGGTCCAGCCTTGACAGAATCCTTGGTCCAGACCCTGGTGAAGAACAGGACC
TTCTCAGGCTCTCTGAGCCACCTTGGGGAGTCCAGCAGTTACCAGGGCACAGGGTCAGTGCAGTTTCCAGGGGACCAGGACCTCCGTTTT
GCCAGGGTCCCCTTAGCGTTACACCCGGTGGTCGGGCAACCATTCCTGAAGGCTGAGGGAAGCAGCAATTCTGTGGTACATGCAGAGACC
AAATTGCAAAACTATGGGGAGCTGGGGCCAGGAACCACTGGGGCCAGCAGCTCAGGAGCAGGCCTTCACTGGGGGGGCCCAACTCAGTCT
TCTGCTTATGGAAAACTCTATCGGGGGCCTACAAGAGTCCCACCAAGAGGGGGAAGAGGGAGAGGAGTTCCTTACTAACCCAGAGACTTC
AGTGTCCTGAAAGATTCCTTTCCTATCCATCCTTCCATCCAGTTCTCTGAATCTTTAATGAAATCATTTGCCAGAGCGAGGTAATCATCT
GCATTTGGCTACTGCAAAGCTGTCCGTTGTATTCCTTGCTCACTTGCTACTAGCAGGCGACTTACGAAATAATGATGTTGGCACCAGTTC
CCCCTGGATGGGCTATAGCCAGAACATTTACTTCAACTCTACCTTAGTAGATACAAGTAGAGAATATGGAGAGGATCATTACATTGAAAA
GTAAATGTTTTATTAGTTCATTGCCTGCACTTACTGATCGGAAGAGAGAAAGAACAGTTTCAGTATTGAGATGGCTCAGGAGAGGCTCTT
TGATTTTTAAAGTTTTGGGGTGGGGGATTGTGTGTGGTTTCTTTCTTTTGAATTTTAATTTAGGTGTTTTGGGTTTTTTTCCTTTAAAGA
GAATAGTGTTCACAAAATTTGAGCTGCTCTTTGGCTTTTGCTATAAGGGAAACAGAGTGGCCTGGCTGATTTGAATAAATGTTTCTTTCC
TCTCCACCATCTCACATTTTGCTTTTAAGTGAACACTTTTTCCCCATTGAGCATCTTGAACATACTTTTTTTCCAAATAAATTACTCATC
CTTAAAGTTTACTCCACTTTGACAAAAGATACGCCCTTCTCCCTGCACATAAAGCAGGTTGTAGAACGTGGCATTCTTGGGCAAGTAGGT
AGACTTTACCCAGTCTCTTTCCTTTTTTGCTGATGTGTGCTCTCTCTCTCTCTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTC
TCTCTGTCTCGCTTGCTCGCTCTCGCTGTTTCTCTCTCTTTGAGGCATTTGTTTGGAAAAAATCGTTGAGATGCCCAAGAACCTGGGATA
ATTCTTTACTTTTTTTGAAATAAAGGAAAGGAAATTCAGACTCTTACATTGTTCTCTGTAACTCTTCAATTCTAAAATGTTTTGTTTTTT
AAACCATGTTCTGATGGGGAAGTTGATTTGTAAGTGTGGACAGCTTGGACATTGCTGCTGAGCTGTGGTTAGAGATGATGCCTCCATTCC

>In-frame_ENST00000584423_ENST00000430627_TCGA-YJ-A8SW-01A_NSRP1_chr17_28443881_+_CDK12_chr17_37646810_length(amino acids)=844AA_start in transcript=63_stop in transcript=2597
MAIPGRHPKETLPSKPVKKEKEQRTRHLLTDLPLPPELPGGDLSPPDSPEPKAITPPQQPYKKRPKICCPRYGERRQTESDWGKRCVDKF
DIIGIIGEGTYGQVYKAKDKDTGELVALKKVRLDNEKEGFPITAIREIKILRQLIHRSVVNMKEIVTDKQDALDFKKDKGAFYLVFEYMD
HDLMGLLESGLVHFSEDHIKSFMKQLMEGLEYCHKKNFLHRDIKCSNILLNNSGQIKLADFGLARLYNSEESRPYTNKVITLWYRPPELL
LGEERYTPAIDVWSCGCILGELFTKKPIFQANLELAQLELISRLCGSPCPAVWPDVIKLPYFNTMKPKKQYRRRLREEFSFIPSAALDLL
DHMLTLDPSKRCTAEQTLQSDFLKDVELSKMAPPDLPHWQDCHELWSKKRRRQRQSGVVVEEPPPSKTSRKETTSGTSTEPVKNSSPAPP
QPAPGKVESGAGDAIGLADITQQLNQSELAVLLNLLQSQTDLSIPQMAQLLNIHSNPEMQQQLEALNQSISALTEATSQQQDSETMAPEE
SLKEAPSAPVILPSAEQTTLEASSTPADMQNILAVLLSQLMKTQEPAGSLEENNSDKNSGPQGPRRTPTMPQEEAAEKRPPEPPGPPPPP
PPPPLVEGDLSSAPQELNPAVTAALLQLLSQPEAEPPGHLPHEHQALRPMEYSTRPRPNRTYGNTDGPETGFSAIDTDERNSGPALTESL
VQTLVKNRTFSGSLSHLGESSSYQGTGSVQFPGDQDLRFARVPLALHPVVGQPFLKAEGSSNSVVHAETKLQNYGELGPGTTGASSSGAG

--------------------------------------------------------------
>In-frame_ENST00000584423_ENST00000447079_TCGA-YJ-A8SW-01A_NSRP1_chr17_28443881_+_CDK12_chr17_37646810_length(transcript)=6455nt_BP=83nt
CGTGACGACATCGGCGTCAGCGTCACGGAGGCGTCGGCCACGTTCAGCGGACACGGGAGCAAGATGGCGATTCCGGGCAGGCATCCAAAA
GAAACTCTTCCTTCAAAACCTGTGAAGAAAGAGAAGGAACAGAGGACACGTCACTTACTCACAGACCTTCCTCTCCCTCCAGAGCTCCCT
GGTGGAGATCTGTCTCCCCCAGACTCTCCAGAACCAAAGGCAATCACACCACCTCAGCAACCATATAAAAAGAGACCAAAAATTTGTTGT
CCTCGTTATGGAGAAAGAAGACAAACAGAAAGCGACTGGGGGAAACGCTGTGTGGACAAGTTTGACATTATTGGGATTATTGGAGAAGGA
ACCTATGGCCAAGTATATAAAGCCAAGGACAAAGACACAGGAGAACTAGTGGCTCTGAAGAAGGTGAGACTAGACAATGAGAAAGAGGGC
TTCCCAATCACAGCCATTCGTGAAATCAAAATCCTTCGTCAGTTAATCCACCGAAGTGTTGTTAACATGAAGGAAATTGTCACAGATAAA
CAAGATGCACTGGATTTCAAGAAGGACAAAGGTGCCTTTTACCTTGTATTTGAGTATATGGACCATGACTTAATGGGACTGCTAGAATCT
GGTTTGGTGCACTTTTCTGAGGACCATATCAAGTCGTTCATGAAACAGCTAATGGAAGGATTGGAATACTGTCACAAAAAGAATTTCCTG
CATCGGGATATTAAGTGTTCTAACATTTTGCTGAATAACAGTGGGCAAATCAAACTAGCAGATTTTGGACTTGCTCGGCTCTATAACTCT
GAAGAGAGTCGCCCTTACACAAACAAAGTCATTACTTTGTGGTACCGACCTCCAGAACTACTGCTAGGAGAGGAACGTTACACACCAGCC
ATAGATGTTTGGAGCTGTGGATGTATTCTTGGGGAACTATTCACAAAGAAGCCTATTTTTCAAGCCAATCTGGAACTGGCTCAGCTAGAA
CTGATCAGCCGACTTTGTGGTAGCCCTTGTCCAGCTGTGTGGCCTGATGTTATCAAACTGCCCTACTTCAACACCATGAAACCGAAGAAG
CAATATCGAAGGCGTCTACGAGAAGAATTCTCTTTCATTCCTTCTGCAGCACTTGATTTATTGGACCACATGCTGACACTAGATCCTAGT
AAGCGGTGCACAGCTGAACAGACCCTACAGAGCGACTTCCTTAAAGATGTCGAACTCAGCAAAATGGCTCCTCCAGACCTCCCCCACTGG
CAGGATTGCCATGAGTTGTGGAGTAAGAAACGGCGACGTCAGCGACAAAGTGGTGTTGTAGTCGAAGAGCCACCTCCATCCAAAACTTCT
CGAAAAGAAACTACCTCAGGGACAAGTACTGAGCCTGTGAAGAACAGCAGCCCAGCACCACCTCAGCCTGCTCCTGGCAAGGTGGAGTCT
GGGGCTGGGGATGCAATAGGCCTTGCTGACATCACACAACAGCTGAATCAAAGTGAATTGGCAGTGTTATTAAACCTGCTGCAGAGCCAA
ACCGACCTGAGCATCCCTCAAATGGCACAGCTGCTTAACATCCACTCCAACCCAGAGATGCAGCAGCAGCTGGAAGCCCTGAACCAATCC
ATCAGTGCCCTGACGGAAGCTACTTCCCAGCAGCAGGACTCAGAGACCATGGCCCCAGAGGAGTCTTTGAAGGAAGCACCCTCTGCCCCA
GTGATCCTGCCTTCAGCAGAACAGACGACCCTTGAAGCTTCAAGCACACCAGCTGACATGCAGAATATATTGGCAGTTCTCTTGAGTCAG
CTGATGAAAACCCAAGAGCCAGCAGGCAGTCTGGAGGAAAACAACAGTGACAAGAACAGTGGGCCACAGGGGCCCCGAAGAACTCCCACA
ATGCCACAGGAGGAGGCAGCAGCATGTCCTCCTCACATTCTTCCACCAGAGAAGAGGCCCCCTGAGCCCCCCGGACCTCCACCGCCGCCA
CCTCCACCCCCTCTGGTTGAAGGCGATCTTTCCAGCGCCCCCCAGGAGTTGAACCCAGCCGTGACAGCCGCCTTGCTGCAACTTTTATCC
CAGCCTGAAGCAGAGCCTCCTGGCCACCTGCCACATGAGCACCAGGCCTTGAGACCAATGGAGTACTCCACCCGACCCCGTCCAAACAGG
ACTTATGGAAACACTGATGGGCCTGAAACAGGGTTCAGTGCCATTGACACTGATGAACGAAACTCTGGTCCAGCCTTGACAGAATCCTTG
GTCCAGACCCTGGTGAAGAACAGGACCTTCTCAGGCTCTCTGAGCCACCTTGGGGAGTCCAGCAGTTACCAGGGCACAGGGTCAGTGCAG
TTTCCAGGGGACCAGGACCTCCGTTTTGCCAGGGTCCCCTTAGCGTTACACCCGGTGGTCGGGCAACCATTCCTGAAGGCTGAGGGAAGC
AGCAATTCTGTGGTACATGCAGAGACCAAATTGCAAAACTATGGGGAGCTGGGGCCAGGAACCACTGGGGCCAGCAGCTCAGGAGCAGGC
CTTCACTGGGGGGGCCCAACTCAGTCTTCTGCTTATGGAAAACTCTATCGGGGGCCTACAAGAGTCCCACCAAGAGGGGGAAGAGGGAGA
GGAGTTCCTTACTAACCCAGAGACTTCAGTGTCCTGAAAGATTCCTTTCCTATCCATCCTTCCATCCAGTTCTCTGAATCTTTAATGAAA
TCATTTGCCAGAGCGAGGTAATCATCTGCATTTGGCTACTGCAAAGCTGTCCGTTGTATTCCTTGCTCACTTGCTACTAGCAGGCGACTT
ACGAAATAATGATGTTGGCACCAGTTCCCCCTGGATGGGCTATAGCCAGAACATTTACTTCAACTCTACCTTAGTAGATACAAGTAGAGA
ATATGGAGAGGATCATTACATTGAAAAGTAAATGTTTTATTAGTTCATTGCCTGCACTTACTGATCGGAAGAGAGAAAGAACAGTTTCAG
TATTGAGATGGCTCAGGAGAGGCTCTTTGATTTTTAAAGTTTTGGGGTGGGGGATTGTGTGTGGTTTCTTTCTTTTGAATTTTAATTTAG
GTGTTTTGGGTTTTTTTCCTTTAAAGAGAATAGTGTTCACAAAATTTGAGCTGCTCTTTGGCTTTTGCTATAAGGGAAACAGAGTGGCCT
GGCTGATTTGAATAAATGTTTCTTTCCTCTCCACCATCTCACATTTTGCTTTTAAGTGAACACTTTTTCCCCATTGAGCATCTTGAACAT
ACTTTTTTTCCAAATAAATTACTCATCCTTAAAGTTTACTCCACTTTGACAAAAGATACGCCCTTCTCCCTGCACATAAAGCAGGTTGTA
GAACGTGGCATTCTTGGGCAAGTAGGTAGACTTTACCCAGTCTCTTTCCTTTTTTGCTGATGTGTGCTCTCTCTCTCTCTTTCTCTCTCT
CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGTCTCGCTTGCTCGCTCTCGCTGTTTCTCTCTCTTTGAGGCATTTGTTTGGAAAAAAT
CGTTGAGATGCCCAAGAACCTGGGATAATTCTTTACTTTTTTTGAAATAAAGGAAAGGAAATTCAGACTCTTACATTGTTCTCTGTAACT
CTTCAATTCTAAAATGTTTTGTTTTTTAAACCATGTTCTGATGGGGAAGTTGATTTGTAAGTGTGGACAGCTTGGACATTGCTGCTGAGC
TGTGGTTAGAGATGATGCCTCCATTCCTAGAGGGCTAATAACAGCATTTAGCATATTGTTTACACATATATTTTTATGTCAAAAAAAAAA
CAAAAACCTTTCAAACAGAGCATTGTGATATTGTCAAAGAGAAAAACAAATCCTGAAGATACATGGAAATGTAACCTAGTTTAGGGTGGG
TATTTTTCTGAAGATACATCAATACCTGACCTTTTTTAAAAAAATAATTTTAAAACAGCATACTGTGAGGAAGAACAGTATTGACATACC
CACATCCCAGCATGTGTACCCTGCCAGTTCTTTTAGGGATTTTTCCTCCAAAGAGATTTGGATTTGGTTTTGGTAAAAGGGGTTAAATTG
TGCTTCCAGGCAAGAACTTTGCCTTATCATAAACAGGAAATGAAAAAGGGAAGGGCTGTCAGGATGGGATAATTTGGGAGGCTTCTCATT
CTGGCTTCTATTTCTATGTGAGTACCAGCATATAGAGTGTTTTAAAAACAGATACATGTCATATAATTTATCTGCACAGACTTAGACCTT
CAGGAAACATAGGTTAAGCCCCCTTTTACAAAGAAAAAGTAAACATACTTCAGCATCTTGGAGGGTAGTTTTCAAAACTCAAGTTTCATG
TTTCAATGCCAAGTTCTTATTTTAAAAAATAAAATCTACTTATAAGAGAAAGGTGCATTACTTAAAAAAAAAAAACTTTAAAGAAATGAA
AGAAGAACCCTCTTCAGATACTTACTTGAAGACTGTTTTCCCCTGTTAATGAGATATAGCTAGATATCGGTGTGTGTATTTCTTTATTAT
TCTCTGGTTTTTGATCTGGCCTTGCCTCCAGGGCCAAACACTGATTTAGAAAGAGAGCCTTCTAGCTATTTTGGCATTGATGGCTTTTTA
TACCAGTGTGTCCAGTTAGATTTACTAGGCTTACTGACATGCTATTGGTAAATCGCATTAAAGTTCATCTGAACCTTCTGTCTGTTGACT
TCTTAGTCCTCAGACATGGGCCTTTGTGTTTTAGAATATTTGAATTTGAGTTATTGGGCCCCACTCCCTGTTTTTTATTAAAGAACGTGA
GCCTGGGATACTTTCAGAAGTATCTGTTCAATGAAAAAAAGTTGGTTTCCCATCAAATATGAATAAAATTCTCTATATATTTCATTGTAT
TTTGGTTATCAGCAGTCATCAATAATGTTTTTCCCTCCCCTCTCCCACCTCTTATTTTTAATTATGCCAAATATCCTAAATAATATACTT
AAGCCTCCATTCCCTCATCCCTACTAGGGAAGGGGGTGAGTGTATGTGTGAGTGTATGTGTATGTATGATCCCATCTCACCCCCACCCCC
ATTTTGGGAGTCTTTTAAAATGAAAACAAAGTTTGGTAGTTTTGACTATTTCTAAAAGCAGAGGAGAAAAAAAAACTTATTTAAATATCC
TGGAATCTGTATGGAGGAAGAAAAGGTATTTGTTAATTTTTCAGTTACGTTATCTATAAACATGATGGAAGTAAAGGTTTGGCAGAATTT
CACCTTGACTATTTGAAAATTACAGACCCAATTAATTCCATTCAAAAGTGGTTTTCGTTTTGTTTTAATTATTGTACAATGAGAGATATT
GTCTATTAAATACATTATTTTGAACAGATGAGAAATCTGATTCTGTTCATGAGTGGGAGGCAAAACTGGTTTGACCGTGATCATTTTTGT
GGTTTTGAAAACAAATATACTTGACCCAGTTTCCTTAGTTTTTTCTTCAACTGTCCATAGGAACGATAAGTATTTGAAAGCAACATCAAA
TCTATACGTTTAAAGCAGGGCAGTTAGCACAAATTTGCAAGTAGAACTTCTATTAGCTTATGCCATAGACATCACCCAACCACTTGTATG
TGTGTGTGTATATATAATATGCATATATAGTTACCGTGCTAAAATGGTTACCAGCAGGTTTTGAGAGAGAATGCTGCATCAGAAAAGTGT
CAGTTGCCACCTCATTCTCCCTGATTTAGGTTCCTGACACTGATTCCTTTCTCTCTCGTTTTTGACCCCCATTGGGTGTATCTTGTCTAT
GTACAGATATTTTGTAATATATTAAATTTTTTTCTTTCAGTTTATAAAAATGGAAAGTGGAGATTGGAAAATTAAATATTTCCTGTTACT
ATACCACTTTTGCTCCATTGCATTTACTTCTTAATCTGTACCCCCTGAGCATATCTAATCATGTATAAAGGACGTTTTTCCTCCACTTTA
TCTTAGGGGTTCTCTGTCTCAGAATCATTATAGACTCATTAACTCCCCCTCCCAGCAAAAGGTTATCAGGATTTGAAGAGGTGCTTGAAA
ACGCTAGACTAGGAACTAGAGAATAAATGAGTTGGGAAAAACCATGAAATGTGATTTTTTTAAAGTAGAAAAGTTATACAAATAATGGTA
CCAAACCATCAAAAGAGTTGAGCTTCATGTACCCTGACTCCTCCTGACAGGAGAGGTAAGTGGGTTTGAGCTCAACTGTCATCAAGGGAA
GTTGGTAAGAGGCTGTTTAGACCCAAAGGATAGTCTTAAACCAGACTTCACCACCCACCCTACCTCAGTTCCCATGTTATTACATGCAGA
GTCAGCATGGGGATTAGTGTACCTACCTTTGCTGAGATTTCCCGATGCGTTGCCAATCCAGAAAGTGAATCAAAAAGTTGTTTAAAAGTT

>In-frame_ENST00000584423_ENST00000447079_TCGA-YJ-A8SW-01A_NSRP1_chr17_28443881_+_CDK12_chr17_37646810_length(amino acids)=853AA_start in transcript=63_stop in transcript=2624
MAIPGRHPKETLPSKPVKKEKEQRTRHLLTDLPLPPELPGGDLSPPDSPEPKAITPPQQPYKKRPKICCPRYGERRQTESDWGKRCVDKF
DIIGIIGEGTYGQVYKAKDKDTGELVALKKVRLDNEKEGFPITAIREIKILRQLIHRSVVNMKEIVTDKQDALDFKKDKGAFYLVFEYMD
HDLMGLLESGLVHFSEDHIKSFMKQLMEGLEYCHKKNFLHRDIKCSNILLNNSGQIKLADFGLARLYNSEESRPYTNKVITLWYRPPELL
LGEERYTPAIDVWSCGCILGELFTKKPIFQANLELAQLELISRLCGSPCPAVWPDVIKLPYFNTMKPKKQYRRRLREEFSFIPSAALDLL
DHMLTLDPSKRCTAEQTLQSDFLKDVELSKMAPPDLPHWQDCHELWSKKRRRQRQSGVVVEEPPPSKTSRKETTSGTSTEPVKNSSPAPP
QPAPGKVESGAGDAIGLADITQQLNQSELAVLLNLLQSQTDLSIPQMAQLLNIHSNPEMQQQLEALNQSISALTEATSQQQDSETMAPEE
SLKEAPSAPVILPSAEQTTLEASSTPADMQNILAVLLSQLMKTQEPAGSLEENNSDKNSGPQGPRRTPTMPQEEAAACPPHILPPEKRPP
EPPGPPPPPPPPPLVEGDLSSAPQELNPAVTAALLQLLSQPEAEPPGHLPHEHQALRPMEYSTRPRPNRTYGNTDGPETGFSAIDTDERN
SGPALTESLVQTLVKNRTFSGSLSHLGESSSYQGTGSVQFPGDQDLRFARVPLALHPVVGQPFLKAEGSSNSVVHAETKLQNYGELGPGT

--------------------------------------------------------------
>In-frame_ENST00000247026_ENST00000430627_TCGA-YJ-A8SW-01A_NSRP1_chr17_28443881_+_CDK12_chr17_37646810_length(transcript)=3759nt_BP=83nt
CGTGACGACATCGGCGTCAGCGTCACGGAGGCGTCGGCCACGTTCAGCGGACACGGGAGCAAGATGGCGATTCCGGGCAGGCATCCAAAA
GAAACTCTTCCTTCAAAACCTGTGAAGAAAGAGAAGGAACAGAGGACACGTCACTTACTCACAGACCTTCCTCTCCCTCCAGAGCTCCCT
GGTGGAGATCTGTCTCCCCCAGACTCTCCAGAACCAAAGGCAATCACACCACCTCAGCAACCATATAAAAAGAGACCAAAAATTTGTTGT
CCTCGTTATGGAGAAAGAAGACAAACAGAAAGCGACTGGGGGAAACGCTGTGTGGACAAGTTTGACATTATTGGGATTATTGGAGAAGGA
ACCTATGGCCAAGTATATAAAGCCAAGGACAAAGACACAGGAGAACTAGTGGCTCTGAAGAAGGTGAGACTAGACAATGAGAAAGAGGGC
TTCCCAATCACAGCCATTCGTGAAATCAAAATCCTTCGTCAGTTAATCCACCGAAGTGTTGTTAACATGAAGGAAATTGTCACAGATAAA
CAAGATGCACTGGATTTCAAGAAGGACAAAGGTGCCTTTTACCTTGTATTTGAGTATATGGACCATGACTTAATGGGACTGCTAGAATCT
GGTTTGGTGCACTTTTCTGAGGACCATATCAAGTCGTTCATGAAACAGCTAATGGAAGGATTGGAATACTGTCACAAAAAGAATTTCCTG
CATCGGGATATTAAGTGTTCTAACATTTTGCTGAATAACAGTGGGCAAATCAAACTAGCAGATTTTGGACTTGCTCGGCTCTATAACTCT
GAAGAGAGTCGCCCTTACACAAACAAAGTCATTACTTTGTGGTACCGACCTCCAGAACTACTGCTAGGAGAGGAACGTTACACACCAGCC
ATAGATGTTTGGAGCTGTGGATGTATTCTTGGGGAACTATTCACAAAGAAGCCTATTTTTCAAGCCAATCTGGAACTGGCTCAGCTAGAA
CTGATCAGCCGACTTTGTGGTAGCCCTTGTCCAGCTGTGTGGCCTGATGTTATCAAACTGCCCTACTTCAACACCATGAAACCGAAGAAG
CAATATCGAAGGCGTCTACGAGAAGAATTCTCTTTCATTCCTTCTGCAGCACTTGATTTATTGGACCACATGCTGACACTAGATCCTAGT
AAGCGGTGCACAGCTGAACAGACCCTACAGAGCGACTTCCTTAAAGATGTCGAACTCAGCAAAATGGCTCCTCCAGACCTCCCCCACTGG
CAGGATTGCCATGAGTTGTGGAGTAAGAAACGGCGACGTCAGCGACAAAGTGGTGTTGTAGTCGAAGAGCCACCTCCATCCAAAACTTCT
CGAAAAGAAACTACCTCAGGGACAAGTACTGAGCCTGTGAAGAACAGCAGCCCAGCACCACCTCAGCCTGCTCCTGGCAAGGTGGAGTCT
GGGGCTGGGGATGCAATAGGCCTTGCTGACATCACACAACAGCTGAATCAAAGTGAATTGGCAGTGTTATTAAACCTGCTGCAGAGCCAA
ACCGACCTGAGCATCCCTCAAATGGCACAGCTGCTTAACATCCACTCCAACCCAGAGATGCAGCAGCAGCTGGAAGCCCTGAACCAATCC
ATCAGTGCCCTGACGGAAGCTACTTCCCAGCAGCAGGACTCAGAGACCATGGCCCCAGAGGAGTCTTTGAAGGAAGCACCCTCTGCCCCA
GTGATCCTGCCTTCAGCAGAACAGACGACCCTTGAAGCTTCAAGCACACCAGCTGACATGCAGAATATATTGGCAGTTCTCTTGAGTCAG
CTGATGAAAACCCAAGAGCCAGCAGGCAGTCTGGAGGAAAACAACAGTGACAAGAACAGTGGGCCACAGGGGCCCCGAAGAACTCCCACA
ATGCCACAGGAGGAGGCAGCAGAGAAGAGGCCCCCTGAGCCCCCCGGACCTCCACCGCCGCCACCTCCACCCCCTCTGGTTGAAGGCGAT
CTTTCCAGCGCCCCCCAGGAGTTGAACCCAGCCGTGACAGCCGCCTTGCTGCAACTTTTATCCCAGCCTGAAGCAGAGCCTCCTGGCCAC
CTGCCACATGAGCACCAGGCCTTGAGACCAATGGAGTACTCCACCCGACCCCGTCCAAACAGGACTTATGGAAACACTGATGGGCCTGAA
ACAGGGTTCAGTGCCATTGACACTGATGAACGAAACTCTGGTCCAGCCTTGACAGAATCCTTGGTCCAGACCCTGGTGAAGAACAGGACC
TTCTCAGGCTCTCTGAGCCACCTTGGGGAGTCCAGCAGTTACCAGGGCACAGGGTCAGTGCAGTTTCCAGGGGACCAGGACCTCCGTTTT
GCCAGGGTCCCCTTAGCGTTACACCCGGTGGTCGGGCAACCATTCCTGAAGGCTGAGGGAAGCAGCAATTCTGTGGTACATGCAGAGACC
AAATTGCAAAACTATGGGGAGCTGGGGCCAGGAACCACTGGGGCCAGCAGCTCAGGAGCAGGCCTTCACTGGGGGGGCCCAACTCAGTCT
TCTGCTTATGGAAAACTCTATCGGGGGCCTACAAGAGTCCCACCAAGAGGGGGAAGAGGGAGAGGAGTTCCTTACTAACCCAGAGACTTC
AGTGTCCTGAAAGATTCCTTTCCTATCCATCCTTCCATCCAGTTCTCTGAATCTTTAATGAAATCATTTGCCAGAGCGAGGTAATCATCT
GCATTTGGCTACTGCAAAGCTGTCCGTTGTATTCCTTGCTCACTTGCTACTAGCAGGCGACTTACGAAATAATGATGTTGGCACCAGTTC
CCCCTGGATGGGCTATAGCCAGAACATTTACTTCAACTCTACCTTAGTAGATACAAGTAGAGAATATGGAGAGGATCATTACATTGAAAA
GTAAATGTTTTATTAGTTCATTGCCTGCACTTACTGATCGGAAGAGAGAAAGAACAGTTTCAGTATTGAGATGGCTCAGGAGAGGCTCTT
TGATTTTTAAAGTTTTGGGGTGGGGGATTGTGTGTGGTTTCTTTCTTTTGAATTTTAATTTAGGTGTTTTGGGTTTTTTTCCTTTAAAGA
GAATAGTGTTCACAAAATTTGAGCTGCTCTTTGGCTTTTGCTATAAGGGAAACAGAGTGGCCTGGCTGATTTGAATAAATGTTTCTTTCC
TCTCCACCATCTCACATTTTGCTTTTAAGTGAACACTTTTTCCCCATTGAGCATCTTGAACATACTTTTTTTCCAAATAAATTACTCATC
CTTAAAGTTTACTCCACTTTGACAAAAGATACGCCCTTCTCCCTGCACATAAAGCAGGTTGTAGAACGTGGCATTCTTGGGCAAGTAGGT
AGACTTTACCCAGTCTCTTTCCTTTTTTGCTGATGTGTGCTCTCTCTCTCTCTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTC
TCTCTGTCTCGCTTGCTCGCTCTCGCTGTTTCTCTCTCTTTGAGGCATTTGTTTGGAAAAAATCGTTGAGATGCCCAAGAACCTGGGATA
ATTCTTTACTTTTTTTGAAATAAAGGAAAGGAAATTCAGACTCTTACATTGTTCTCTGTAACTCTTCAATTCTAAAATGTTTTGTTTTTT
AAACCATGTTCTGATGGGGAAGTTGATTTGTAAGTGTGGACAGCTTGGACATTGCTGCTGAGCTGTGGTTAGAGATGATGCCTCCATTCC

>In-frame_ENST00000247026_ENST00000430627_TCGA-YJ-A8SW-01A_NSRP1_chr17_28443881_+_CDK12_chr17_37646810_length(amino acids)=844AA_start in transcript=63_stop in transcript=2597
MAIPGRHPKETLPSKPVKKEKEQRTRHLLTDLPLPPELPGGDLSPPDSPEPKAITPPQQPYKKRPKICCPRYGERRQTESDWGKRCVDKF
DIIGIIGEGTYGQVYKAKDKDTGELVALKKVRLDNEKEGFPITAIREIKILRQLIHRSVVNMKEIVTDKQDALDFKKDKGAFYLVFEYMD
HDLMGLLESGLVHFSEDHIKSFMKQLMEGLEYCHKKNFLHRDIKCSNILLNNSGQIKLADFGLARLYNSEESRPYTNKVITLWYRPPELL
LGEERYTPAIDVWSCGCILGELFTKKPIFQANLELAQLELISRLCGSPCPAVWPDVIKLPYFNTMKPKKQYRRRLREEFSFIPSAALDLL
DHMLTLDPSKRCTAEQTLQSDFLKDVELSKMAPPDLPHWQDCHELWSKKRRRQRQSGVVVEEPPPSKTSRKETTSGTSTEPVKNSSPAPP
QPAPGKVESGAGDAIGLADITQQLNQSELAVLLNLLQSQTDLSIPQMAQLLNIHSNPEMQQQLEALNQSISALTEATSQQQDSETMAPEE
SLKEAPSAPVILPSAEQTTLEASSTPADMQNILAVLLSQLMKTQEPAGSLEENNSDKNSGPQGPRRTPTMPQEEAAEKRPPEPPGPPPPP
PPPPLVEGDLSSAPQELNPAVTAALLQLLSQPEAEPPGHLPHEHQALRPMEYSTRPRPNRTYGNTDGPETGFSAIDTDERNSGPALTESL
VQTLVKNRTFSGSLSHLGESSSYQGTGSVQFPGDQDLRFARVPLALHPVVGQPFLKAEGSSNSVVHAETKLQNYGELGPGTTGASSSGAG

--------------------------------------------------------------
>In-frame_ENST00000247026_ENST00000447079_TCGA-YJ-A8SW-01A_NSRP1_chr17_28443881_+_CDK12_chr17_37646810_length(transcript)=6455nt_BP=83nt
CGTGACGACATCGGCGTCAGCGTCACGGAGGCGTCGGCCACGTTCAGCGGACACGGGAGCAAGATGGCGATTCCGGGCAGGCATCCAAAA
GAAACTCTTCCTTCAAAACCTGTGAAGAAAGAGAAGGAACAGAGGACACGTCACTTACTCACAGACCTTCCTCTCCCTCCAGAGCTCCCT
GGTGGAGATCTGTCTCCCCCAGACTCTCCAGAACCAAAGGCAATCACACCACCTCAGCAACCATATAAAAAGAGACCAAAAATTTGTTGT
CCTCGTTATGGAGAAAGAAGACAAACAGAAAGCGACTGGGGGAAACGCTGTGTGGACAAGTTTGACATTATTGGGATTATTGGAGAAGGA
ACCTATGGCCAAGTATATAAAGCCAAGGACAAAGACACAGGAGAACTAGTGGCTCTGAAGAAGGTGAGACTAGACAATGAGAAAGAGGGC
TTCCCAATCACAGCCATTCGTGAAATCAAAATCCTTCGTCAGTTAATCCACCGAAGTGTTGTTAACATGAAGGAAATTGTCACAGATAAA
CAAGATGCACTGGATTTCAAGAAGGACAAAGGTGCCTTTTACCTTGTATTTGAGTATATGGACCATGACTTAATGGGACTGCTAGAATCT
GGTTTGGTGCACTTTTCTGAGGACCATATCAAGTCGTTCATGAAACAGCTAATGGAAGGATTGGAATACTGTCACAAAAAGAATTTCCTG
CATCGGGATATTAAGTGTTCTAACATTTTGCTGAATAACAGTGGGCAAATCAAACTAGCAGATTTTGGACTTGCTCGGCTCTATAACTCT
GAAGAGAGTCGCCCTTACACAAACAAAGTCATTACTTTGTGGTACCGACCTCCAGAACTACTGCTAGGAGAGGAACGTTACACACCAGCC
ATAGATGTTTGGAGCTGTGGATGTATTCTTGGGGAACTATTCACAAAGAAGCCTATTTTTCAAGCCAATCTGGAACTGGCTCAGCTAGAA
CTGATCAGCCGACTTTGTGGTAGCCCTTGTCCAGCTGTGTGGCCTGATGTTATCAAACTGCCCTACTTCAACACCATGAAACCGAAGAAG
CAATATCGAAGGCGTCTACGAGAAGAATTCTCTTTCATTCCTTCTGCAGCACTTGATTTATTGGACCACATGCTGACACTAGATCCTAGT
AAGCGGTGCACAGCTGAACAGACCCTACAGAGCGACTTCCTTAAAGATGTCGAACTCAGCAAAATGGCTCCTCCAGACCTCCCCCACTGG
CAGGATTGCCATGAGTTGTGGAGTAAGAAACGGCGACGTCAGCGACAAAGTGGTGTTGTAGTCGAAGAGCCACCTCCATCCAAAACTTCT
CGAAAAGAAACTACCTCAGGGACAAGTACTGAGCCTGTGAAGAACAGCAGCCCAGCACCACCTCAGCCTGCTCCTGGCAAGGTGGAGTCT
GGGGCTGGGGATGCAATAGGCCTTGCTGACATCACACAACAGCTGAATCAAAGTGAATTGGCAGTGTTATTAAACCTGCTGCAGAGCCAA
ACCGACCTGAGCATCCCTCAAATGGCACAGCTGCTTAACATCCACTCCAACCCAGAGATGCAGCAGCAGCTGGAAGCCCTGAACCAATCC
ATCAGTGCCCTGACGGAAGCTACTTCCCAGCAGCAGGACTCAGAGACCATGGCCCCAGAGGAGTCTTTGAAGGAAGCACCCTCTGCCCCA
GTGATCCTGCCTTCAGCAGAACAGACGACCCTTGAAGCTTCAAGCACACCAGCTGACATGCAGAATATATTGGCAGTTCTCTTGAGTCAG
CTGATGAAAACCCAAGAGCCAGCAGGCAGTCTGGAGGAAAACAACAGTGACAAGAACAGTGGGCCACAGGGGCCCCGAAGAACTCCCACA
ATGCCACAGGAGGAGGCAGCAGCATGTCCTCCTCACATTCTTCCACCAGAGAAGAGGCCCCCTGAGCCCCCCGGACCTCCACCGCCGCCA
CCTCCACCCCCTCTGGTTGAAGGCGATCTTTCCAGCGCCCCCCAGGAGTTGAACCCAGCCGTGACAGCCGCCTTGCTGCAACTTTTATCC
CAGCCTGAAGCAGAGCCTCCTGGCCACCTGCCACATGAGCACCAGGCCTTGAGACCAATGGAGTACTCCACCCGACCCCGTCCAAACAGG
ACTTATGGAAACACTGATGGGCCTGAAACAGGGTTCAGTGCCATTGACACTGATGAACGAAACTCTGGTCCAGCCTTGACAGAATCCTTG
GTCCAGACCCTGGTGAAGAACAGGACCTTCTCAGGCTCTCTGAGCCACCTTGGGGAGTCCAGCAGTTACCAGGGCACAGGGTCAGTGCAG
TTTCCAGGGGACCAGGACCTCCGTTTTGCCAGGGTCCCCTTAGCGTTACACCCGGTGGTCGGGCAACCATTCCTGAAGGCTGAGGGAAGC
AGCAATTCTGTGGTACATGCAGAGACCAAATTGCAAAACTATGGGGAGCTGGGGCCAGGAACCACTGGGGCCAGCAGCTCAGGAGCAGGC
CTTCACTGGGGGGGCCCAACTCAGTCTTCTGCTTATGGAAAACTCTATCGGGGGCCTACAAGAGTCCCACCAAGAGGGGGAAGAGGGAGA
GGAGTTCCTTACTAACCCAGAGACTTCAGTGTCCTGAAAGATTCCTTTCCTATCCATCCTTCCATCCAGTTCTCTGAATCTTTAATGAAA
TCATTTGCCAGAGCGAGGTAATCATCTGCATTTGGCTACTGCAAAGCTGTCCGTTGTATTCCTTGCTCACTTGCTACTAGCAGGCGACTT
ACGAAATAATGATGTTGGCACCAGTTCCCCCTGGATGGGCTATAGCCAGAACATTTACTTCAACTCTACCTTAGTAGATACAAGTAGAGA
ATATGGAGAGGATCATTACATTGAAAAGTAAATGTTTTATTAGTTCATTGCCTGCACTTACTGATCGGAAGAGAGAAAGAACAGTTTCAG
TATTGAGATGGCTCAGGAGAGGCTCTTTGATTTTTAAAGTTTTGGGGTGGGGGATTGTGTGTGGTTTCTTTCTTTTGAATTTTAATTTAG
GTGTTTTGGGTTTTTTTCCTTTAAAGAGAATAGTGTTCACAAAATTTGAGCTGCTCTTTGGCTTTTGCTATAAGGGAAACAGAGTGGCCT
GGCTGATTTGAATAAATGTTTCTTTCCTCTCCACCATCTCACATTTTGCTTTTAAGTGAACACTTTTTCCCCATTGAGCATCTTGAACAT
ACTTTTTTTCCAAATAAATTACTCATCCTTAAAGTTTACTCCACTTTGACAAAAGATACGCCCTTCTCCCTGCACATAAAGCAGGTTGTA
GAACGTGGCATTCTTGGGCAAGTAGGTAGACTTTACCCAGTCTCTTTCCTTTTTTGCTGATGTGTGCTCTCTCTCTCTCTTTCTCTCTCT
CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGTCTCGCTTGCTCGCTCTCGCTGTTTCTCTCTCTTTGAGGCATTTGTTTGGAAAAAAT
CGTTGAGATGCCCAAGAACCTGGGATAATTCTTTACTTTTTTTGAAATAAAGGAAAGGAAATTCAGACTCTTACATTGTTCTCTGTAACT
CTTCAATTCTAAAATGTTTTGTTTTTTAAACCATGTTCTGATGGGGAAGTTGATTTGTAAGTGTGGACAGCTTGGACATTGCTGCTGAGC
TGTGGTTAGAGATGATGCCTCCATTCCTAGAGGGCTAATAACAGCATTTAGCATATTGTTTACACATATATTTTTATGTCAAAAAAAAAA
CAAAAACCTTTCAAACAGAGCATTGTGATATTGTCAAAGAGAAAAACAAATCCTGAAGATACATGGAAATGTAACCTAGTTTAGGGTGGG
TATTTTTCTGAAGATACATCAATACCTGACCTTTTTTAAAAAAATAATTTTAAAACAGCATACTGTGAGGAAGAACAGTATTGACATACC
CACATCCCAGCATGTGTACCCTGCCAGTTCTTTTAGGGATTTTTCCTCCAAAGAGATTTGGATTTGGTTTTGGTAAAAGGGGTTAAATTG
TGCTTCCAGGCAAGAACTTTGCCTTATCATAAACAGGAAATGAAAAAGGGAAGGGCTGTCAGGATGGGATAATTTGGGAGGCTTCTCATT
CTGGCTTCTATTTCTATGTGAGTACCAGCATATAGAGTGTTTTAAAAACAGATACATGTCATATAATTTATCTGCACAGACTTAGACCTT
CAGGAAACATAGGTTAAGCCCCCTTTTACAAAGAAAAAGTAAACATACTTCAGCATCTTGGAGGGTAGTTTTCAAAACTCAAGTTTCATG
TTTCAATGCCAAGTTCTTATTTTAAAAAATAAAATCTACTTATAAGAGAAAGGTGCATTACTTAAAAAAAAAAAACTTTAAAGAAATGAA
AGAAGAACCCTCTTCAGATACTTACTTGAAGACTGTTTTCCCCTGTTAATGAGATATAGCTAGATATCGGTGTGTGTATTTCTTTATTAT
TCTCTGGTTTTTGATCTGGCCTTGCCTCCAGGGCCAAACACTGATTTAGAAAGAGAGCCTTCTAGCTATTTTGGCATTGATGGCTTTTTA
TACCAGTGTGTCCAGTTAGATTTACTAGGCTTACTGACATGCTATTGGTAAATCGCATTAAAGTTCATCTGAACCTTCTGTCTGTTGACT
TCTTAGTCCTCAGACATGGGCCTTTGTGTTTTAGAATATTTGAATTTGAGTTATTGGGCCCCACTCCCTGTTTTTTATTAAAGAACGTGA
GCCTGGGATACTTTCAGAAGTATCTGTTCAATGAAAAAAAGTTGGTTTCCCATCAAATATGAATAAAATTCTCTATATATTTCATTGTAT
TTTGGTTATCAGCAGTCATCAATAATGTTTTTCCCTCCCCTCTCCCACCTCTTATTTTTAATTATGCCAAATATCCTAAATAATATACTT
AAGCCTCCATTCCCTCATCCCTACTAGGGAAGGGGGTGAGTGTATGTGTGAGTGTATGTGTATGTATGATCCCATCTCACCCCCACCCCC
ATTTTGGGAGTCTTTTAAAATGAAAACAAAGTTTGGTAGTTTTGACTATTTCTAAAAGCAGAGGAGAAAAAAAAACTTATTTAAATATCC
TGGAATCTGTATGGAGGAAGAAAAGGTATTTGTTAATTTTTCAGTTACGTTATCTATAAACATGATGGAAGTAAAGGTTTGGCAGAATTT
CACCTTGACTATTTGAAAATTACAGACCCAATTAATTCCATTCAAAAGTGGTTTTCGTTTTGTTTTAATTATTGTACAATGAGAGATATT
GTCTATTAAATACATTATTTTGAACAGATGAGAAATCTGATTCTGTTCATGAGTGGGAGGCAAAACTGGTTTGACCGTGATCATTTTTGT
GGTTTTGAAAACAAATATACTTGACCCAGTTTCCTTAGTTTTTTCTTCAACTGTCCATAGGAACGATAAGTATTTGAAAGCAACATCAAA
TCTATACGTTTAAAGCAGGGCAGTTAGCACAAATTTGCAAGTAGAACTTCTATTAGCTTATGCCATAGACATCACCCAACCACTTGTATG
TGTGTGTGTATATATAATATGCATATATAGTTACCGTGCTAAAATGGTTACCAGCAGGTTTTGAGAGAGAATGCTGCATCAGAAAAGTGT
CAGTTGCCACCTCATTCTCCCTGATTTAGGTTCCTGACACTGATTCCTTTCTCTCTCGTTTTTGACCCCCATTGGGTGTATCTTGTCTAT
GTACAGATATTTTGTAATATATTAAATTTTTTTCTTTCAGTTTATAAAAATGGAAAGTGGAGATTGGAAAATTAAATATTTCCTGTTACT
ATACCACTTTTGCTCCATTGCATTTACTTCTTAATCTGTACCCCCTGAGCATATCTAATCATGTATAAAGGACGTTTTTCCTCCACTTTA
TCTTAGGGGTTCTCTGTCTCAGAATCATTATAGACTCATTAACTCCCCCTCCCAGCAAAAGGTTATCAGGATTTGAAGAGGTGCTTGAAA
ACGCTAGACTAGGAACTAGAGAATAAATGAGTTGGGAAAAACCATGAAATGTGATTTTTTTAAAGTAGAAAAGTTATACAAATAATGGTA
CCAAACCATCAAAAGAGTTGAGCTTCATGTACCCTGACTCCTCCTGACAGGAGAGGTAAGTGGGTTTGAGCTCAACTGTCATCAAGGGAA
GTTGGTAAGAGGCTGTTTAGACCCAAAGGATAGTCTTAAACCAGACTTCACCACCCACCCTACCTCAGTTCCCATGTTATTACATGCAGA
GTCAGCATGGGGATTAGTGTACCTACCTTTGCTGAGATTTCCCGATGCGTTGCCAATCCAGAAAGTGAATCAAAAAGTTGTTTAAAAGTT

>In-frame_ENST00000247026_ENST00000447079_TCGA-YJ-A8SW-01A_NSRP1_chr17_28443881_+_CDK12_chr17_37646810_length(amino acids)=853AA_start in transcript=63_stop in transcript=2624
MAIPGRHPKETLPSKPVKKEKEQRTRHLLTDLPLPPELPGGDLSPPDSPEPKAITPPQQPYKKRPKICCPRYGERRQTESDWGKRCVDKF
DIIGIIGEGTYGQVYKAKDKDTGELVALKKVRLDNEKEGFPITAIREIKILRQLIHRSVVNMKEIVTDKQDALDFKKDKGAFYLVFEYMD
HDLMGLLESGLVHFSEDHIKSFMKQLMEGLEYCHKKNFLHRDIKCSNILLNNSGQIKLADFGLARLYNSEESRPYTNKVITLWYRPPELL
LGEERYTPAIDVWSCGCILGELFTKKPIFQANLELAQLELISRLCGSPCPAVWPDVIKLPYFNTMKPKKQYRRRLREEFSFIPSAALDLL
DHMLTLDPSKRCTAEQTLQSDFLKDVELSKMAPPDLPHWQDCHELWSKKRRRQRQSGVVVEEPPPSKTSRKETTSGTSTEPVKNSSPAPP
QPAPGKVESGAGDAIGLADITQQLNQSELAVLLNLLQSQTDLSIPQMAQLLNIHSNPEMQQQLEALNQSISALTEATSQQQDSETMAPEE
SLKEAPSAPVILPSAEQTTLEASSTPADMQNILAVLLSQLMKTQEPAGSLEENNSDKNSGPQGPRRTPTMPQEEAAACPPHILPPEKRPP
EPPGPPPPPPPPPLVEGDLSSAPQELNPAVTAALLQLLSQPEAEPPGHLPHEHQALRPMEYSTRPRPNRTYGNTDGPETGFSAIDTDERN
SGPALTESLVQTLVKNRTFSGSLSHLGESSSYQGTGSVQFPGDQDLRFARVPLALHPVVGQPFLKAEGSSNSVVHAETKLQNYGELGPGT

--------------------------------------------------------------
>In-frame_ENST00000479218_ENST00000430627_TCGA-YJ-A8SW-01A_NSRP1_chr17_28443881_+_CDK12_chr17_37646810_length(transcript)=3713nt_BP=37nt
GCGGACACGGGAGCAAGATGGCGATTCCGGGCAGGCATCCAAAAGAAACTCTTCCTTCAAAACCTGTGAAGAAAGAGAAGGAACAGAGGA
CACGTCACTTACTCACAGACCTTCCTCTCCCTCCAGAGCTCCCTGGTGGAGATCTGTCTCCCCCAGACTCTCCAGAACCAAAGGCAATCA
CACCACCTCAGCAACCATATAAAAAGAGACCAAAAATTTGTTGTCCTCGTTATGGAGAAAGAAGACAAACAGAAAGCGACTGGGGGAAAC
GCTGTGTGGACAAGTTTGACATTATTGGGATTATTGGAGAAGGAACCTATGGCCAAGTATATAAAGCCAAGGACAAAGACACAGGAGAAC
TAGTGGCTCTGAAGAAGGTGAGACTAGACAATGAGAAAGAGGGCTTCCCAATCACAGCCATTCGTGAAATCAAAATCCTTCGTCAGTTAA
TCCACCGAAGTGTTGTTAACATGAAGGAAATTGTCACAGATAAACAAGATGCACTGGATTTCAAGAAGGACAAAGGTGCCTTTTACCTTG
TATTTGAGTATATGGACCATGACTTAATGGGACTGCTAGAATCTGGTTTGGTGCACTTTTCTGAGGACCATATCAAGTCGTTCATGAAAC
AGCTAATGGAAGGATTGGAATACTGTCACAAAAAGAATTTCCTGCATCGGGATATTAAGTGTTCTAACATTTTGCTGAATAACAGTGGGC
AAATCAAACTAGCAGATTTTGGACTTGCTCGGCTCTATAACTCTGAAGAGAGTCGCCCTTACACAAACAAAGTCATTACTTTGTGGTACC
GACCTCCAGAACTACTGCTAGGAGAGGAACGTTACACACCAGCCATAGATGTTTGGAGCTGTGGATGTATTCTTGGGGAACTATTCACAA
AGAAGCCTATTTTTCAAGCCAATCTGGAACTGGCTCAGCTAGAACTGATCAGCCGACTTTGTGGTAGCCCTTGTCCAGCTGTGTGGCCTG
ATGTTATCAAACTGCCCTACTTCAACACCATGAAACCGAAGAAGCAATATCGAAGGCGTCTACGAGAAGAATTCTCTTTCATTCCTTCTG
CAGCACTTGATTTATTGGACCACATGCTGACACTAGATCCTAGTAAGCGGTGCACAGCTGAACAGACCCTACAGAGCGACTTCCTTAAAG
ATGTCGAACTCAGCAAAATGGCTCCTCCAGACCTCCCCCACTGGCAGGATTGCCATGAGTTGTGGAGTAAGAAACGGCGACGTCAGCGAC
AAAGTGGTGTTGTAGTCGAAGAGCCACCTCCATCCAAAACTTCTCGAAAAGAAACTACCTCAGGGACAAGTACTGAGCCTGTGAAGAACA
GCAGCCCAGCACCACCTCAGCCTGCTCCTGGCAAGGTGGAGTCTGGGGCTGGGGATGCAATAGGCCTTGCTGACATCACACAACAGCTGA
ATCAAAGTGAATTGGCAGTGTTATTAAACCTGCTGCAGAGCCAAACCGACCTGAGCATCCCTCAAATGGCACAGCTGCTTAACATCCACT
CCAACCCAGAGATGCAGCAGCAGCTGGAAGCCCTGAACCAATCCATCAGTGCCCTGACGGAAGCTACTTCCCAGCAGCAGGACTCAGAGA
CCATGGCCCCAGAGGAGTCTTTGAAGGAAGCACCCTCTGCCCCAGTGATCCTGCCTTCAGCAGAACAGACGACCCTTGAAGCTTCAAGCA
CACCAGCTGACATGCAGAATATATTGGCAGTTCTCTTGAGTCAGCTGATGAAAACCCAAGAGCCAGCAGGCAGTCTGGAGGAAAACAACA
GTGACAAGAACAGTGGGCCACAGGGGCCCCGAAGAACTCCCACAATGCCACAGGAGGAGGCAGCAGAGAAGAGGCCCCCTGAGCCCCCCG
GACCTCCACCGCCGCCACCTCCACCCCCTCTGGTTGAAGGCGATCTTTCCAGCGCCCCCCAGGAGTTGAACCCAGCCGTGACAGCCGCCT
TGCTGCAACTTTTATCCCAGCCTGAAGCAGAGCCTCCTGGCCACCTGCCACATGAGCACCAGGCCTTGAGACCAATGGAGTACTCCACCC
GACCCCGTCCAAACAGGACTTATGGAAACACTGATGGGCCTGAAACAGGGTTCAGTGCCATTGACACTGATGAACGAAACTCTGGTCCAG
CCTTGACAGAATCCTTGGTCCAGACCCTGGTGAAGAACAGGACCTTCTCAGGCTCTCTGAGCCACCTTGGGGAGTCCAGCAGTTACCAGG
GCACAGGGTCAGTGCAGTTTCCAGGGGACCAGGACCTCCGTTTTGCCAGGGTCCCCTTAGCGTTACACCCGGTGGTCGGGCAACCATTCC
TGAAGGCTGAGGGAAGCAGCAATTCTGTGGTACATGCAGAGACCAAATTGCAAAACTATGGGGAGCTGGGGCCAGGAACCACTGGGGCCA
GCAGCTCAGGAGCAGGCCTTCACTGGGGGGGCCCAACTCAGTCTTCTGCTTATGGAAAACTCTATCGGGGGCCTACAAGAGTCCCACCAA
GAGGGGGAAGAGGGAGAGGAGTTCCTTACTAACCCAGAGACTTCAGTGTCCTGAAAGATTCCTTTCCTATCCATCCTTCCATCCAGTTCT
CTGAATCTTTAATGAAATCATTTGCCAGAGCGAGGTAATCATCTGCATTTGGCTACTGCAAAGCTGTCCGTTGTATTCCTTGCTCACTTG
CTACTAGCAGGCGACTTACGAAATAATGATGTTGGCACCAGTTCCCCCTGGATGGGCTATAGCCAGAACATTTACTTCAACTCTACCTTA
GTAGATACAAGTAGAGAATATGGAGAGGATCATTACATTGAAAAGTAAATGTTTTATTAGTTCATTGCCTGCACTTACTGATCGGAAGAG
AGAAAGAACAGTTTCAGTATTGAGATGGCTCAGGAGAGGCTCTTTGATTTTTAAAGTTTTGGGGTGGGGGATTGTGTGTGGTTTCTTTCT
TTTGAATTTTAATTTAGGTGTTTTGGGTTTTTTTCCTTTAAAGAGAATAGTGTTCACAAAATTTGAGCTGCTCTTTGGCTTTTGCTATAA
GGGAAACAGAGTGGCCTGGCTGATTTGAATAAATGTTTCTTTCCTCTCCACCATCTCACATTTTGCTTTTAAGTGAACACTTTTTCCCCA
TTGAGCATCTTGAACATACTTTTTTTCCAAATAAATTACTCATCCTTAAAGTTTACTCCACTTTGACAAAAGATACGCCCTTCTCCCTGC
ACATAAAGCAGGTTGTAGAACGTGGCATTCTTGGGCAAGTAGGTAGACTTTACCCAGTCTCTTTCCTTTTTTGCTGATGTGTGCTCTCTC
TCTCTCTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGTCTCGCTTGCTCGCTCTCGCTGTTTCTCTCTCTTTGAGGC
ATTTGTTTGGAAAAAATCGTTGAGATGCCCAAGAACCTGGGATAATTCTTTACTTTTTTTGAAATAAAGGAAAGGAAATTCAGACTCTTA
CATTGTTCTCTGTAACTCTTCAATTCTAAAATGTTTTGTTTTTTAAACCATGTTCTGATGGGGAAGTTGATTTGTAAGTGTGGACAGCTT
GGACATTGCTGCTGAGCTGTGGTTAGAGATGATGCCTCCATTCCTAGAGGGCTAATAACAGCATTTAGCATATTGTTTACACATATATTT

>In-frame_ENST00000479218_ENST00000430627_TCGA-YJ-A8SW-01A_NSRP1_chr17_28443881_+_CDK12_chr17_37646810_length(amino acids)=844AA_start in transcript=17_stop in transcript=2551
MAIPGRHPKETLPSKPVKKEKEQRTRHLLTDLPLPPELPGGDLSPPDSPEPKAITPPQQPYKKRPKICCPRYGERRQTESDWGKRCVDKF
DIIGIIGEGTYGQVYKAKDKDTGELVALKKVRLDNEKEGFPITAIREIKILRQLIHRSVVNMKEIVTDKQDALDFKKDKGAFYLVFEYMD
HDLMGLLESGLVHFSEDHIKSFMKQLMEGLEYCHKKNFLHRDIKCSNILLNNSGQIKLADFGLARLYNSEESRPYTNKVITLWYRPPELL
LGEERYTPAIDVWSCGCILGELFTKKPIFQANLELAQLELISRLCGSPCPAVWPDVIKLPYFNTMKPKKQYRRRLREEFSFIPSAALDLL
DHMLTLDPSKRCTAEQTLQSDFLKDVELSKMAPPDLPHWQDCHELWSKKRRRQRQSGVVVEEPPPSKTSRKETTSGTSTEPVKNSSPAPP
QPAPGKVESGAGDAIGLADITQQLNQSELAVLLNLLQSQTDLSIPQMAQLLNIHSNPEMQQQLEALNQSISALTEATSQQQDSETMAPEE
SLKEAPSAPVILPSAEQTTLEASSTPADMQNILAVLLSQLMKTQEPAGSLEENNSDKNSGPQGPRRTPTMPQEEAAEKRPPEPPGPPPPP
PPPPLVEGDLSSAPQELNPAVTAALLQLLSQPEAEPPGHLPHEHQALRPMEYSTRPRPNRTYGNTDGPETGFSAIDTDERNSGPALTESL
VQTLVKNRTFSGSLSHLGESSSYQGTGSVQFPGDQDLRFARVPLALHPVVGQPFLKAEGSSNSVVHAETKLQNYGELGPGTTGASSSGAG

--------------------------------------------------------------
>In-frame_ENST00000479218_ENST00000447079_TCGA-YJ-A8SW-01A_NSRP1_chr17_28443881_+_CDK12_chr17_37646810_length(transcript)=6409nt_BP=37nt
GCGGACACGGGAGCAAGATGGCGATTCCGGGCAGGCATCCAAAAGAAACTCTTCCTTCAAAACCTGTGAAGAAAGAGAAGGAACAGAGGA
CACGTCACTTACTCACAGACCTTCCTCTCCCTCCAGAGCTCCCTGGTGGAGATCTGTCTCCCCCAGACTCTCCAGAACCAAAGGCAATCA
CACCACCTCAGCAACCATATAAAAAGAGACCAAAAATTTGTTGTCCTCGTTATGGAGAAAGAAGACAAACAGAAAGCGACTGGGGGAAAC
GCTGTGTGGACAAGTTTGACATTATTGGGATTATTGGAGAAGGAACCTATGGCCAAGTATATAAAGCCAAGGACAAAGACACAGGAGAAC
TAGTGGCTCTGAAGAAGGTGAGACTAGACAATGAGAAAGAGGGCTTCCCAATCACAGCCATTCGTGAAATCAAAATCCTTCGTCAGTTAA
TCCACCGAAGTGTTGTTAACATGAAGGAAATTGTCACAGATAAACAAGATGCACTGGATTTCAAGAAGGACAAAGGTGCCTTTTACCTTG
TATTTGAGTATATGGACCATGACTTAATGGGACTGCTAGAATCTGGTTTGGTGCACTTTTCTGAGGACCATATCAAGTCGTTCATGAAAC
AGCTAATGGAAGGATTGGAATACTGTCACAAAAAGAATTTCCTGCATCGGGATATTAAGTGTTCTAACATTTTGCTGAATAACAGTGGGC
AAATCAAACTAGCAGATTTTGGACTTGCTCGGCTCTATAACTCTGAAGAGAGTCGCCCTTACACAAACAAAGTCATTACTTTGTGGTACC
GACCTCCAGAACTACTGCTAGGAGAGGAACGTTACACACCAGCCATAGATGTTTGGAGCTGTGGATGTATTCTTGGGGAACTATTCACAA
AGAAGCCTATTTTTCAAGCCAATCTGGAACTGGCTCAGCTAGAACTGATCAGCCGACTTTGTGGTAGCCCTTGTCCAGCTGTGTGGCCTG
ATGTTATCAAACTGCCCTACTTCAACACCATGAAACCGAAGAAGCAATATCGAAGGCGTCTACGAGAAGAATTCTCTTTCATTCCTTCTG
CAGCACTTGATTTATTGGACCACATGCTGACACTAGATCCTAGTAAGCGGTGCACAGCTGAACAGACCCTACAGAGCGACTTCCTTAAAG
ATGTCGAACTCAGCAAAATGGCTCCTCCAGACCTCCCCCACTGGCAGGATTGCCATGAGTTGTGGAGTAAGAAACGGCGACGTCAGCGAC
AAAGTGGTGTTGTAGTCGAAGAGCCACCTCCATCCAAAACTTCTCGAAAAGAAACTACCTCAGGGACAAGTACTGAGCCTGTGAAGAACA
GCAGCCCAGCACCACCTCAGCCTGCTCCTGGCAAGGTGGAGTCTGGGGCTGGGGATGCAATAGGCCTTGCTGACATCACACAACAGCTGA
ATCAAAGTGAATTGGCAGTGTTATTAAACCTGCTGCAGAGCCAAACCGACCTGAGCATCCCTCAAATGGCACAGCTGCTTAACATCCACT
CCAACCCAGAGATGCAGCAGCAGCTGGAAGCCCTGAACCAATCCATCAGTGCCCTGACGGAAGCTACTTCCCAGCAGCAGGACTCAGAGA
CCATGGCCCCAGAGGAGTCTTTGAAGGAAGCACCCTCTGCCCCAGTGATCCTGCCTTCAGCAGAACAGACGACCCTTGAAGCTTCAAGCA
CACCAGCTGACATGCAGAATATATTGGCAGTTCTCTTGAGTCAGCTGATGAAAACCCAAGAGCCAGCAGGCAGTCTGGAGGAAAACAACA
GTGACAAGAACAGTGGGCCACAGGGGCCCCGAAGAACTCCCACAATGCCACAGGAGGAGGCAGCAGCATGTCCTCCTCACATTCTTCCAC
CAGAGAAGAGGCCCCCTGAGCCCCCCGGACCTCCACCGCCGCCACCTCCACCCCCTCTGGTTGAAGGCGATCTTTCCAGCGCCCCCCAGG
AGTTGAACCCAGCCGTGACAGCCGCCTTGCTGCAACTTTTATCCCAGCCTGAAGCAGAGCCTCCTGGCCACCTGCCACATGAGCACCAGG
CCTTGAGACCAATGGAGTACTCCACCCGACCCCGTCCAAACAGGACTTATGGAAACACTGATGGGCCTGAAACAGGGTTCAGTGCCATTG
ACACTGATGAACGAAACTCTGGTCCAGCCTTGACAGAATCCTTGGTCCAGACCCTGGTGAAGAACAGGACCTTCTCAGGCTCTCTGAGCC
ACCTTGGGGAGTCCAGCAGTTACCAGGGCACAGGGTCAGTGCAGTTTCCAGGGGACCAGGACCTCCGTTTTGCCAGGGTCCCCTTAGCGT
TACACCCGGTGGTCGGGCAACCATTCCTGAAGGCTGAGGGAAGCAGCAATTCTGTGGTACATGCAGAGACCAAATTGCAAAACTATGGGG
AGCTGGGGCCAGGAACCACTGGGGCCAGCAGCTCAGGAGCAGGCCTTCACTGGGGGGGCCCAACTCAGTCTTCTGCTTATGGAAAACTCT
ATCGGGGGCCTACAAGAGTCCCACCAAGAGGGGGAAGAGGGAGAGGAGTTCCTTACTAACCCAGAGACTTCAGTGTCCTGAAAGATTCCT
TTCCTATCCATCCTTCCATCCAGTTCTCTGAATCTTTAATGAAATCATTTGCCAGAGCGAGGTAATCATCTGCATTTGGCTACTGCAAAG
CTGTCCGTTGTATTCCTTGCTCACTTGCTACTAGCAGGCGACTTACGAAATAATGATGTTGGCACCAGTTCCCCCTGGATGGGCTATAGC
CAGAACATTTACTTCAACTCTACCTTAGTAGATACAAGTAGAGAATATGGAGAGGATCATTACATTGAAAAGTAAATGTTTTATTAGTTC
ATTGCCTGCACTTACTGATCGGAAGAGAGAAAGAACAGTTTCAGTATTGAGATGGCTCAGGAGAGGCTCTTTGATTTTTAAAGTTTTGGG
GTGGGGGATTGTGTGTGGTTTCTTTCTTTTGAATTTTAATTTAGGTGTTTTGGGTTTTTTTCCTTTAAAGAGAATAGTGTTCACAAAATT
TGAGCTGCTCTTTGGCTTTTGCTATAAGGGAAACAGAGTGGCCTGGCTGATTTGAATAAATGTTTCTTTCCTCTCCACCATCTCACATTT
TGCTTTTAAGTGAACACTTTTTCCCCATTGAGCATCTTGAACATACTTTTTTTCCAAATAAATTACTCATCCTTAAAGTTTACTCCACTT
TGACAAAAGATACGCCCTTCTCCCTGCACATAAAGCAGGTTGTAGAACGTGGCATTCTTGGGCAAGTAGGTAGACTTTACCCAGTCTCTT
TCCTTTTTTGCTGATGTGTGCTCTCTCTCTCTCTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGTCTCGCTTGCTCG
CTCTCGCTGTTTCTCTCTCTTTGAGGCATTTGTTTGGAAAAAATCGTTGAGATGCCCAAGAACCTGGGATAATTCTTTACTTTTTTTGAA
ATAAAGGAAAGGAAATTCAGACTCTTACATTGTTCTCTGTAACTCTTCAATTCTAAAATGTTTTGTTTTTTAAACCATGTTCTGATGGGG
AAGTTGATTTGTAAGTGTGGACAGCTTGGACATTGCTGCTGAGCTGTGGTTAGAGATGATGCCTCCATTCCTAGAGGGCTAATAACAGCA
TTTAGCATATTGTTTACACATATATTTTTATGTCAAAAAAAAAACAAAAACCTTTCAAACAGAGCATTGTGATATTGTCAAAGAGAAAAA
CAAATCCTGAAGATACATGGAAATGTAACCTAGTTTAGGGTGGGTATTTTTCTGAAGATACATCAATACCTGACCTTTTTTAAAAAAATA
ATTTTAAAACAGCATACTGTGAGGAAGAACAGTATTGACATACCCACATCCCAGCATGTGTACCCTGCCAGTTCTTTTAGGGATTTTTCC
TCCAAAGAGATTTGGATTTGGTTTTGGTAAAAGGGGTTAAATTGTGCTTCCAGGCAAGAACTTTGCCTTATCATAAACAGGAAATGAAAA
AGGGAAGGGCTGTCAGGATGGGATAATTTGGGAGGCTTCTCATTCTGGCTTCTATTTCTATGTGAGTACCAGCATATAGAGTGTTTTAAA
AACAGATACATGTCATATAATTTATCTGCACAGACTTAGACCTTCAGGAAACATAGGTTAAGCCCCCTTTTACAAAGAAAAAGTAAACAT
ACTTCAGCATCTTGGAGGGTAGTTTTCAAAACTCAAGTTTCATGTTTCAATGCCAAGTTCTTATTTTAAAAAATAAAATCTACTTATAAG
AGAAAGGTGCATTACTTAAAAAAAAAAAACTTTAAAGAAATGAAAGAAGAACCCTCTTCAGATACTTACTTGAAGACTGTTTTCCCCTGT
TAATGAGATATAGCTAGATATCGGTGTGTGTATTTCTTTATTATTCTCTGGTTTTTGATCTGGCCTTGCCTCCAGGGCCAAACACTGATT
TAGAAAGAGAGCCTTCTAGCTATTTTGGCATTGATGGCTTTTTATACCAGTGTGTCCAGTTAGATTTACTAGGCTTACTGACATGCTATT
GGTAAATCGCATTAAAGTTCATCTGAACCTTCTGTCTGTTGACTTCTTAGTCCTCAGACATGGGCCTTTGTGTTTTAGAATATTTGAATT
TGAGTTATTGGGCCCCACTCCCTGTTTTTTATTAAAGAACGTGAGCCTGGGATACTTTCAGAAGTATCTGTTCAATGAAAAAAAGTTGGT
TTCCCATCAAATATGAATAAAATTCTCTATATATTTCATTGTATTTTGGTTATCAGCAGTCATCAATAATGTTTTTCCCTCCCCTCTCCC
ACCTCTTATTTTTAATTATGCCAAATATCCTAAATAATATACTTAAGCCTCCATTCCCTCATCCCTACTAGGGAAGGGGGTGAGTGTATG
TGTGAGTGTATGTGTATGTATGATCCCATCTCACCCCCACCCCCATTTTGGGAGTCTTTTAAAATGAAAACAAAGTTTGGTAGTTTTGAC
TATTTCTAAAAGCAGAGGAGAAAAAAAAACTTATTTAAATATCCTGGAATCTGTATGGAGGAAGAAAAGGTATTTGTTAATTTTTCAGTT
ACGTTATCTATAAACATGATGGAAGTAAAGGTTTGGCAGAATTTCACCTTGACTATTTGAAAATTACAGACCCAATTAATTCCATTCAAA
AGTGGTTTTCGTTTTGTTTTAATTATTGTACAATGAGAGATATTGTCTATTAAATACATTATTTTGAACAGATGAGAAATCTGATTCTGT
TCATGAGTGGGAGGCAAAACTGGTTTGACCGTGATCATTTTTGTGGTTTTGAAAACAAATATACTTGACCCAGTTTCCTTAGTTTTTTCT
TCAACTGTCCATAGGAACGATAAGTATTTGAAAGCAACATCAAATCTATACGTTTAAAGCAGGGCAGTTAGCACAAATTTGCAAGTAGAA
CTTCTATTAGCTTATGCCATAGACATCACCCAACCACTTGTATGTGTGTGTGTATATATAATATGCATATATAGTTACCGTGCTAAAATG
GTTACCAGCAGGTTTTGAGAGAGAATGCTGCATCAGAAAAGTGTCAGTTGCCACCTCATTCTCCCTGATTTAGGTTCCTGACACTGATTC
CTTTCTCTCTCGTTTTTGACCCCCATTGGGTGTATCTTGTCTATGTACAGATATTTTGTAATATATTAAATTTTTTTCTTTCAGTTTATA
AAAATGGAAAGTGGAGATTGGAAAATTAAATATTTCCTGTTACTATACCACTTTTGCTCCATTGCATTTACTTCTTAATCTGTACCCCCT
GAGCATATCTAATCATGTATAAAGGACGTTTTTCCTCCACTTTATCTTAGGGGTTCTCTGTCTCAGAATCATTATAGACTCATTAACTCC
CCCTCCCAGCAAAAGGTTATCAGGATTTGAAGAGGTGCTTGAAAACGCTAGACTAGGAACTAGAGAATAAATGAGTTGGGAAAAACCATG
AAATGTGATTTTTTTAAAGTAGAAAAGTTATACAAATAATGGTACCAAACCATCAAAAGAGTTGAGCTTCATGTACCCTGACTCCTCCTG
ACAGGAGAGGTAAGTGGGTTTGAGCTCAACTGTCATCAAGGGAAGTTGGTAAGAGGCTGTTTAGACCCAAAGGATAGTCTTAAACCAGAC
TTCACCACCCACCCTACCTCAGTTCCCATGTTATTACATGCAGAGTCAGCATGGGGATTAGTGTACCTACCTTTGCTGAGATTTCCCGAT
GCGTTGCCAATCCAGAAAGTGAATCAAAAAGTTGTTTAAAAGTTAAAATCTCTATTGTTTCCAAAATCTTTCCCATCTCCACCTGAAGAC

>In-frame_ENST00000479218_ENST00000447079_TCGA-YJ-A8SW-01A_NSRP1_chr17_28443881_+_CDK12_chr17_37646810_length(amino acids)=853AA_start in transcript=17_stop in transcript=2578
MAIPGRHPKETLPSKPVKKEKEQRTRHLLTDLPLPPELPGGDLSPPDSPEPKAITPPQQPYKKRPKICCPRYGERRQTESDWGKRCVDKF
DIIGIIGEGTYGQVYKAKDKDTGELVALKKVRLDNEKEGFPITAIREIKILRQLIHRSVVNMKEIVTDKQDALDFKKDKGAFYLVFEYMD
HDLMGLLESGLVHFSEDHIKSFMKQLMEGLEYCHKKNFLHRDIKCSNILLNNSGQIKLADFGLARLYNSEESRPYTNKVITLWYRPPELL
LGEERYTPAIDVWSCGCILGELFTKKPIFQANLELAQLELISRLCGSPCPAVWPDVIKLPYFNTMKPKKQYRRRLREEFSFIPSAALDLL
DHMLTLDPSKRCTAEQTLQSDFLKDVELSKMAPPDLPHWQDCHELWSKKRRRQRQSGVVVEEPPPSKTSRKETTSGTSTEPVKNSSPAPP
QPAPGKVESGAGDAIGLADITQQLNQSELAVLLNLLQSQTDLSIPQMAQLLNIHSNPEMQQQLEALNQSISALTEATSQQQDSETMAPEE
SLKEAPSAPVILPSAEQTTLEASSTPADMQNILAVLLSQLMKTQEPAGSLEENNSDKNSGPQGPRRTPTMPQEEAAACPPHILPPEKRPP
EPPGPPPPPPPPPLVEGDLSSAPQELNPAVTAALLQLLSQPEAEPPGHLPHEHQALRPMEYSTRPRPNRTYGNTDGPETGFSAIDTDERN
SGPALTESLVQTLVKNRTFSGSLSHLGESSSYQGTGSVQFPGDQDLRFARVPLALHPVVGQPFLKAEGSSNSVVHAETKLQNYGELGPGT

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for NSRP1-CDK12


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for NSRP1-CDK12


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for NSRP1-CDK12


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource
TgeneCDK12C0033578Prostatic Neoplasms1CTD_human
TgeneCDK12C0376358Malignant neoplasm of prostate1CTD_human