FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:NUP210-XPC (FusionGDB2 ID:61034)

Fusion Gene Summary for NUP210-XPC

check button Fusion gene summary
Fusion gene informationFusion gene name: NUP210-XPC
Fusion gene ID: 61034
HgeneTgene
Gene symbol

NUP210

XPC

Gene ID

23225

7508

Gene namenucleoporin 210XPC complex subunit, DNA damage recognition and repair factor
SynonymsGP210|POM210RAD4|XP3|XPCC|p125
Cytomap

3p25.1

3p25.1

Type of geneprotein-codingprotein-coding
Descriptionnuclear pore membrane glycoprotein 210nuclear envelope pore membrane protein POM 210nuclear pore protein gp210nucleoporin 210kDanucleoporin Nup210pore membrane protein of 210 kDaDNA repair protein complementing XP-C cellsmutant xeroderma pigmentosum group Cxeroderma pigmentosum, complementation group C
Modification date2020032020200313
UniProtAcc..
Ensembl transtripts involved in fusion geneENST00000254508, ENST00000485755, 
ENST00000285021, ENST00000449060, 
Fusion gene scores* DoF score9 X 7 X 5=3155 X 4 X 4=80
# samples 115
** MAII scorelog2(11/315*10)=-1.51784830486262
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(5/80*10)=-0.678071905112638
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: NUP210 [Title/Abstract] AND XPC [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointNUP210(13364809)-XPC(14214562), # samples:1
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
TgeneXPC

GO:0000715

nucleotide-excision repair, DNA damage recognition

10873465|19941824

TgeneXPC

GO:0006289

nucleotide-excision repair

8168482|9734359|11259578

TgeneXPC

GO:0045893

positive regulation of transcription, DNA-templated

29973595|31527837

TgeneXPC

GO:0070914

UV-damage excision repair

8077226


check buttonFusion gene breakpoints across NUP210 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across XPC (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4UCSTCGA-N6-A4VGNUP210chr3

13364809

-XPCchr3

14214562

-


Top

Fusion Gene ORF analysis for NUP210-XPC

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
In-frameENST00000254508ENST00000285021NUP210chr3

13364809

-XPCchr3

14214562

-
In-frameENST00000254508ENST00000449060NUP210chr3

13364809

-XPCchr3

14214562

-
intron-3CDSENST00000485755ENST00000285021NUP210chr3

13364809

-XPCchr3

14214562

-
intron-3CDSENST00000485755ENST00000449060NUP210chr3

13364809

-XPCchr3

14214562

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000254508NUP210chr313364809-ENST00000285021XPCchr314214562-836548518375702495
ENST00000254508NUP210chr313364809-ENST00000449060XPCchr314214562-746048518374592459

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000254508ENST00000285021NUP210chr313364809-XPCchr314214562-0.0020334620.9979665
ENST00000254508ENST00000449060NUP210chr313364809-XPCchr314214562-0.0024889350.997511

Top

Fusion Genomic Features for NUP210-XPC


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for NUP210-XPC


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr3:13364809/chr3:14214562)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
..
FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneNUP210chr3:13364809chr3:14214562ENST00000254508-3440486_49115891888.0Compositional biasNote=Poly-Ser
HgeneNUP210chr3:13364809chr3:14214562ENST00000254508-34401078_115115891888.0DomainBIG2
TgeneXPCchr3:13364809chr3:14214562ENST00000285021016124_13034941.0Compositional biasNote=Poly-Glu
TgeneXPCchr3:13364809chr3:14214562ENST00000285021016359_39534941.0Compositional biasNote=Lys-rich (basic)
TgeneXPCchr3:13364809chr3:14214562ENST00000285021016408_43134941.0Compositional biasNote=Arg/Lys-rich (basic)
TgeneXPCchr3:13364809chr3:14214562ENST00000285021016432_46134941.0Compositional biasNote=Asp/Glu-rich (acidic)
TgeneXPCchr3:13364809chr3:14214562ENST00000285021016466_49334941.0Compositional biasNote=Arg/Lys-rich (basic)
TgeneXPCchr3:13364809chr3:14214562ENST00000285021016501_50734941.0Compositional biasNote=Poly-Ser
TgeneXPCchr3:13364809chr3:14214562ENST00000449060016124_13034904.0Compositional biasNote=Poly-Glu
TgeneXPCchr3:13364809chr3:14214562ENST00000449060016359_39534904.0Compositional biasNote=Lys-rich (basic)
TgeneXPCchr3:13364809chr3:14214562ENST00000449060016408_43134904.0Compositional biasNote=Arg/Lys-rich (basic)
TgeneXPCchr3:13364809chr3:14214562ENST00000449060016432_46134904.0Compositional biasNote=Asp/Glu-rich (acidic)
TgeneXPCchr3:13364809chr3:14214562ENST00000449060016466_49334904.0Compositional biasNote=Arg/Lys-rich (basic)
TgeneXPCchr3:13364809chr3:14214562ENST00000449060016501_50734904.0Compositional biasNote=Poly-Ser
TgeneXPCchr3:13364809chr3:14214562ENST00000285021016390_39534941.0MotifNuclear localization signal
TgeneXPCchr3:13364809chr3:14214562ENST00000449060016390_39534904.0MotifNuclear localization signal
TgeneXPCchr3:13364809chr3:14214562ENST00000285021016607_74134941.0RegionNote=DNA-binding%3B preference for heteroduplex DNA
TgeneXPCchr3:13364809chr3:14214562ENST00000285021016607_76634941.0RegionNote=Minimal sensor domain involved in damage recognition
TgeneXPCchr3:13364809chr3:14214562ENST00000285021016767_83134941.0RegionNote=DNA-binding%3B preference for single stranded DNA%3B required for formation of stable nucleoprotein complex
TgeneXPCchr3:13364809chr3:14214562ENST00000449060016607_74134904.0RegionNote=DNA-binding%3B preference for heteroduplex DNA
TgeneXPCchr3:13364809chr3:14214562ENST00000449060016607_76634904.0RegionNote=Minimal sensor domain involved in damage recognition
TgeneXPCchr3:13364809chr3:14214562ENST00000449060016767_83134904.0RegionNote=DNA-binding%3B preference for single stranded DNA%3B required for formation of stable nucleoprotein complex

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneNUP210chr3:13364809chr3:14214562ENST00000254508-34401830_188715891888.0Topological domainCytoplasmic
HgeneNUP210chr3:13364809chr3:14214562ENST00000254508-344027_180815891888.0Topological domainPerinuclear space
HgeneNUP210chr3:13364809chr3:14214562ENST00000254508-34401809_182915891888.0TransmembraneHelical
TgeneXPCchr3:13364809chr3:14214562ENST0000028502101630_17734941.0Compositional biasNote=Glu-rich (acidic)
TgeneXPCchr3:13364809chr3:14214562ENST0000028502101630_3434941.0Compositional biasNote=Poly-Glu
TgeneXPCchr3:13364809chr3:14214562ENST0000044906001630_17734904.0Compositional biasNote=Glu-rich (acidic)
TgeneXPCchr3:13364809chr3:14214562ENST0000044906001630_3434904.0Compositional biasNote=Poly-Glu


Top

Fusion Gene Sequence for NUP210-XPC


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>61034_61034_1_NUP210-XPC_NUP210_chr3_13364809_ENST00000254508_XPC_chr3_14214562_ENST00000285021_length(transcript)=8365nt_BP=4851nt
GCGCGCGCGCGGGCGGGAGCGGAGGGCAACGGGGCGGCGCGGGCGGCCGGGCGCAGGGTCGCGGGAGGTGACGCGCGGCGAGGATGGCGG
CGCGGGGCCGGGGGCTGCTGCTGCTGACGCTGTCGGTGCTGTTGGCGGCGGGCCCCTCCGCCGCTGCGGCCAAGCTCAACATCCCCAAAG
TGCTGCTGCCCTTCACGCGGGCCACGCGCGTTAACTTCACGCTGGAGGCCTCGGAGGGCTGCTACCGCTGGTTGTCCACCCGGCCGGAGG
TGGCCAGCATCGAGCCGCTGGGCCTGGACGAGCAGCAGTGCTCCCAGAAGGCAGTGGTGCAGGCCCGCCTGACCCAGCCTGCCCGCCTCA
CCAGCATCATCTTCGCAGAGGACATCACCACAGGCCAGGTCCTGCGCTGTGATGCCATTGTGGACCTCATCCATGACATCCAGATCGTCT
CCACCACCCGCGAGCTCTACCTGGAGGACTCCCCCCTGGAGCTGAAGATCCAGGCCCTGGACTCCGAAGGGAACACCTTCAGCACTCTGG
CTGGACTGGTCTTCGAGTGGACGATTGTGAAGGACTCCGAGGCGGACAGGTTCTCAGACTCCCACAATGCGCTGCGAATCCTCACTTTCT
TGGAGTCTACGTACATCCCTCCTTCTTACATCTCAGAGATGGAGAAGGCTGCCAAGCAAGGGGACACCATCCTGGTGTCTGGGATGAAGA
CCGGGAGCTCCAAGCTCAAGGCTCGCATCCAGGAGGCTGTCTACAAGAATGTACGCCCTGCAGAAGTCAGGCTGCTGATTTTGGAAAACA
TCCTTCTGAACCCGGCCTATGACGTCTACCTGATGGTGGGAACCTCCATTCACTACAAGGTGCAGAAGATCAGGCAAGGGAAAATTACAG
AACTCTCCATGCCTTCCGATCAGTACGAGTTGCAGCTTCAGAACAGCATCCCGGGCCCCGAAGGAGACCCAGCCCGGCCGGTGGCTGTCT
TGGCCCAGGACACGTCGATGGTCACTGCACTGCAGCTGGGACAGAGCAGCCTCGTCCTTGGCCACAGGAGTATTCGCATGCAAGGTGCTT
CTAGGTTACCCAACAGCACTATCTACGTGGTCGAACCTGGATACCTAGGGTTCACTGTTCACCCTGGTGACAGGTGGGTGCTGGAGACCG
GCCGCCTGTATGAAATCACCATCGAAGTTTTTGACAAGTTCAGCAACAAGGTCTATGTATCTGACAACATCCGAATTGAAACTGTGCTTC
CTGCTGAGTTCTTCGAGGTGCTCTCGTCCTCCCAGAATGGGTCATACCATCGCATCAGGGCACTAAAGAGGGGACAGACGGCCATTGACG
CGGCCCTCACCTCTGTGGTGGACCAGGATGGAGGGGTCCACATACTACAGGTGCCTGTGTGGAACCAGCAGGAGGTGGAAATTCACATCC
CGATCACCCTGTATCCCAGCATCTTGACATTTCCGTGGCAACCAAAGACGGGCGCCTATCAGTACACAATAAGGGCCCACGGTGGCAGTG
GGAACTTCAGCTGGTCTTCGTCAAGCCACCTGGTTGCCACAGTTACTGTCAAGGGCGTGATGACCACAGGCAGTGACATCGGGTTCAGTG
TGATCCAGGCACATGATGTGCAGAACCCACTCCATTTCGGTGAGATGAAGGTGTATGTGATCGAGCCCCACAGCATGGAGTTTGCCCCGT
GCCAGGTGGAGGCACGTGTGGGCCAGGCCCTGGAGCTGCCCCTGAGGATCAGTGGCCTCATGCCCGGCGGGGCCAGTGAGGTGGTCACCT
TGAGCGACTGCTCCCACTTTGACTTGGCTGTCGAGGTGGAGAACCAGGGTGTGTTCCAGCCACTCCCAGGGAGGCTGCCGCCAGGCTCTG
AGCACTGCAGCGGCATCCGGGTAAAGGCCGAGGCCCAGGGCTCTACCACGCTTCTTGTGAGCTACAGACACGGCCACGTCCACCTGAGTG
CCAAGATCACCATTGCTGCCTACCTGCCCCTCAAGGCTGTGGATCCCTCCTCTGTTGCCTTGGTAACCCTGGGCTCCTCAAAGGAGATGC
TGTTTGAAGGAGGTCCCAGACCTTGGATCCTCGAGCCGTCCAAATTCTTCCAGAACGTCACCGCTGAGGACACTGACAGCATCGGCCTGG
CTCTCTTTGCCCCCCATTCCTCCCGGAATTATCAGCAACACTGGATCCTTGTGACCTGTCAGGCCTTGGGTGAGCAGGTCATCGCCCTGT
CGGTGGGGAACAAGCCCAGCCTCACCAACCCCTTTCCTGCGGTGGAGCCTGCCGTGGTGAAGTTCGTCTGCGCCCCACCGTCCAGGCTCA
CCCTCGCGCCTGTCTACACCAGCCCCCAGCTGGACATGTCCTGTCCGCTGCTGCAGCAGAACAAGCAGGTGGTCCCAGTGTCCAGCCACC
GCAACCCCCGGCTGGACCTGGCTGCTTACGACCAGGAGGGCCGCCGGTTCGACAACTTCAGCTCTCTGAGCATCCAGTGGGAGTCCACCA
GGCCAGTGTTGGCCAGCATCGAGCCTGAGCTGCCCATGCAGCTGGTGTCCCAGGACGATGAGAGTGGCCAAAAGAAGCTGCACGGTTTGC
AGGCCATTTTGGTTCACGAGGCATCAGGAACCACAGCCATCACTGCCACTGCCACTGGCTACCAGGAGTCCCACCTCAGCTCTGCCAGAA
CAAAGCAGCCGCATGACCCTCTGGTGCCTCTGTCGGCCTCCATAGAGCTCATCCTGGTGGAGGACGTGAGGGTGAGCCCAGAAGAGGTGA
CCATCTACAACCACCCTGGCATCCAGGCAGAGCTCCGCATCAGGGAAGGCTCAGGTTACTTCTTCCTCAACACCAGCACCGCAGATGTTG
TCAAGGTGGCCTACCAGGAGGCCAGGGGTGTCGCCATGGTGCACCCTTTGCTCCCGGGCTCATCCACCATCATGATCCATGACTTGTGCC
TCGTCTTCCCGGCCCCAGCCAAGGCTGTCGTTTACGTGTCGGACATTCAGGAGCTGTACATCCGTGTGGTTGACAAGGTGGAGATTGGGA
AGACAGTGAAGGCATACGTCCGCGTGCTGGACTTGCACAAGAAGCCCTTCCTTGCCAAATACTTCCCCTTTATGGACCTGAAGCTCCGAG
CAGCCTCCCCGATCATTACATTGGTGGCCCTTGATGAAGCCCTTGACAACTACACCATCACATTCCTCATCCGCGGTGTGGCCATCGGCC
AGACCAGTCTAACTGCAAGTGTGACCAATAAAGCTGGACAGAGAATCAACTCAGCCCCACAACAGATTGAAGTCTTTCCCCCGTTCAGGC
TGATGCCCAGGAAGGTGACACTGCTTATCGGGGCCACGATGCAGGTCACCTCCGAGGGCGGCCCCCAGCCTCAGTCCAACATCCTTTTCT
CCATCAGCAATGAGAGCGTTGCGCTGGTGAGCGCTGCTGGGCTGGTACAGGGCCTCGCCATCGGGAACGGCACTGTGTCTGGGCTCGTGC
AGGCAGTGGATGCAGAGACCGGCAAGGTGGTCATCATCTCTCAGGACCTCGTGCAGGTGGAGGTGCTGCTGCTAAGGGCCGTGAGGATCC
GCGCCCCCATCATGCGGATGAGGACGGGCACCCAGATGCCCATCTATGTCACCGGCATCACCAACCACCAGAACCCTTTCTCCTTTGGCA
ATGCCGTGCCAGGCCTGACCTTCCACTGGTCTGTCACCAAGCGGGACGTCCTGGACCTCCGAGGGCGGCACCACGAGGCGTCGATCCGAC
TCCCGTCACAGTACAACTTTGCCATGAACGTGCTCGGCCGGGTAAAAGGCCGGACCGGGCTGAGGGTGGTGGTCAAGGCTGTGGACCCCA
CATCGGGGCAGCTGTATGGCCTGGCCAGAGAACTCTCGGATGAGATCCAAGTCCAGGTGTTTGAGAAGCTGCAGCTGCTCAACCCTGAAA
TAGAAGCAGAACAAATATTAATGTCGCCCAACTCATATATAAAGCTGCAGACAAACAGGGATGGTGCAGCCTCTCTGAGCTACCGCGTCC
TGGATGGACCCGAAAAGGTTCCAGTTGTGCATGTTGATGAGAAAGGCTTTCTAGCATCAGGGTCTATGATCGGGACATCCACCATCGAAG
TGATTGCACAAGAGCCCTTTGGGGCCAACCAAACCATCATTGTTGCTGTAAAGGTATCCCCTGTTTCCTACCTGAGGGTTTCCATGAGCC
CTGTCCTGCACACCCAGAACAAGGAGGCCCTGGTGGCCGTGCCTTTGGGAATGACCGTGACCTTCACTGTCCACTTCCACGACAACTCTG
GAGATGTCTTCCATGCTCACAGTTCGGTCCTCAACTTTGCCACTAACAGAGACGACTTTGTGCAGATCGGGAAGGGCCCCACCAACAACA
CCTGCGTTGTCCGCACAGTCAGCGTGGGCCTGACACTGCTCCGTGTGTGGGACGCAGAGCACCCGGGCCTCTCGGACTTCATGCCCCTGC
CTGTCCTACAGGCCATCTCCCCAGAGCTGTCTGGGGCCATGGTGGTGGGGGACGTGCTCTGTCTGGCCACTGTTCTGACCAGCCTGGAAG
GCCTCTCAGGAACCTGGAGCTCCTCGGCCAACAGCATCCTCCACATCGACCCCAAGACGGGTGTGGCTGTGGCCCGGGCCGTGGGATCCG
TGACGGTTTACTATGAGGTCGCTGGGCACCTGAGGACCTACAAGGAGGTGGTGGTCAGCGTCCCTCAGAGGATCATGGCCCGTCACCTCC
ACCCCATCCAGACCAGCTTCCAGGAGGCTACAGCCTCCAAAGTGATTGTTGCCGTGGGAGACAGAAGCTCTAACCTGAGAGATGCCTTTG
AAGATGAGAAACCCCCAAAGAAGAGCCTTCTCTCCAAAGTTTCACAAGGAAAGAGGAAAAGAGGCTGCAGTCATCCTGGGGGTTCAGCAG
ATGGTCCAGCAAAAAAGAAAGTGGCCAAGGTGACTGTTAAATCTGAAAACCTCAAGGTTATAAAGGATGAAGCCCTCAGCGATGGGGATG
ACCTCAGGGACTTTCCAAGTGACCTCAAGAAGGCACACCATCTGAAGAGAGGGGCTACCATGAATGAAGACAGCAATGAAGAAGAGGAAG
AAAGTGAAAATGATTGGGAAGAGGTTGAAGAACTTAGTGAGCCTGTGCTGGGTGACGTGAGAGAAAGTACAGCCTTCTCTCGATCTCTTC
TGCCTGTGAAGCCAGTGGAGATAGAGATTGAAACGCCAGAGCAGGCGAAGACAAGAGAAAGAAGTGAAAAGATAAAACTGGAGTTTGAGA
CATATCTTCGGAGGGCGATGAAACGTTTCAATAAAGGGGTCCATGAGGACACACACAAGGTTCACCTTCTCTGCCTGCTAGCAAATGGCT
TCTATCGAAATAACATCTGCAGCCAGCCAGATCTGCATGCTATTGGCCTGTCCATCATCCCAGCCCGCTTTACCAGAGTGCTGCCTCGAG
ATGTGGACACCTACTACCTCTCAAACCTGGTGAAGTGGTTCATTGGAACATTTACAGTTAATGCAGAACTTTCAGCCAGTGAACAAGATA
ACCTGCAGACTACATTGGAAAGGAGATTTGCTATTTACTCTGCTCGAGATGATGAGGAATTGGTCCATATATTCTTACTGATTCTCCGGG
CTCTGCAGCTCTTGACCCGGCTGGTATTGTCTCTACAGCCAATTCCTCTGAAGTCAGCAACAGCAAAGGGAAAGAAACCTTCCAAGGAAA
GATTGACTGCGGATCCAGGAGGCTCCTCAGAAACTTCCAGCCAAGTTCTAGAAAACCACACCAAACCAAAGACCAGCAAAGGAACCAAAC
AAGAGGAAACCTTTGCTAAGGGCACCTGCAGGCCAAGTGCCAAAGGGAAGAGGAACAAGGGAGGCAGAAAGAAACGGAGCAAGCCCTCCT
CCAGCGAGGAAGATGAGGGCCCAGGAGACAAGCAGGAGAAGGCAACCCAGCGACGTCCGCATGGCCGGGAGCGGCGGGTGGCCTCCAGGG
TGTCTTATAAAGAGGAGAGTGGGAGTGATGAGGCTGGCAGCGGCTCTGATTTTGAGCTCTCCAGTGGAGAAGCCTCTGATCCCTCTGATG
AGGATTCCGAACCTGGCCCTCCAAAGCAGAGGAAAGCCCCCGCTCCTCAGAGGACAAAGGCTGGGTCCAAGAGTGCCTCCAGGACCCATC
GTGGGAGCCATCGTAAGGACCCAAGCTTGCCAGCGGCATCCTCAAGCTCTTCAAGCAGTAAAAGAGGCAAGAAAATGTGCAGCGATGGTG
AGAAGGCAGAAAAAAGAAGCATAGCTGGTATAGACCAGTGGCTAGAGGTGTTCTGTGAGCAGGAGGAAAAGTGGGTATGTGTAGACTGTG
TGCACGGTGTGGTGGGCCAGCCTCTGACCTGTTACAAGTACGCCACCAAGCCCATGACCTATGTGGTGGGCATTGACAGTGACGGCTGGG
TCCGAGATGTCACACAGAGGTACGACCCAGTCTGGATGACAGTGACCCGCAAGTGCCGGGTTGATGCTGAGTGGTGGGCCGAGACCTTGA
GACCATACCAGAGCCCATTTATGGACAGGGAGAAGAAAGAAGACTTGGAGTTTCAGGCTAAACACATGGACCAGCCTTTGCCCACTGCCA
TTGGCTTATATAAGAACCACCCTCTGTATGCCCTGAAGCGGCATCTCCTGAAATATGAGGCCATCTATCCCGAGACAGCTGCCATCCTTG
GGTATTGTCGTGGAGAAGCGGTCTACTCCAGGGATTGTGTGCACACTCTGCATTCCAGGGACACGTGGCTGAAGAAAGCAAGAGTGGTGA
GGCTTGGAGAAGTACCCTACAAGATGGTGAAAGGCTTTTCTAACCGTGCTCGGAAAGCCCGACTTGCTGAGCCCCAGCTGCGGGAAGAAA
ATGACCTGGGCCTGTTTGGCTACTGGCAGACAGAGGAGTATCAGCCCCCAGTGGCCGTGGACGGGAAGGTGCCCCGGAACGAGTTTGGGA
ATGTGTACCTCTTCCTGCCCAGCATGATGCCTATTGGCTGTGTCCAGCTGAACCTGCCCAATCTACACCGCGTGGCCCGCAAGCTGGACA
TCGACTGTGTCCAGGCCATCACTGGCTTTGATTTCCATGGCGGCTACTCCCATCCCGTGACTGATGGATACATCGTCTGCGAGGAATTCA
AAGACGTGCTCCTGACTGCCTGGGAAAATGAGCAGGCAGTCATTGAAAGGAAGGAGAAGGAGAAAAAGGAGAAGCGGGCTCTAGGGAACT
GGAAGTTGCTGGCCAAAGGTCTGCTCATCAGGGAGAGGCTGAAGCGTCGCTACGGGCCCAAGAGTGAGGCAGCAGCTCCCCACACAGATG
CAGGAGGTGGACTCTCTTCTGATGAAGAGGAGGGGACCAGCTCTCAAGCAGAAGCGGCCAGGATACTGGCTGCCTCCTGGCCTCAAAACC
GAGAAGATGAAGAAAAGCAGAAGCTGAAGGGTGGGCCCAAGAAGACCAAAAGGGAAAAGAAAGCAGCAGCTTCCCACCTGTTCCCATTTG
AGCAGCTGTGAGCTGAGCGCCCACTAGAGGGGCACCCACCAGTTGCTGCTGCCCCACTACAGGCCCCACACCTGCCCTGGGCATGCCCAG
CCCCTGGTGGTGGGGGCTTCTCTGCTGAGAAGGCAAACTGAGGCAGCATGCACGGAGGCGGGGTCAGGGGAGACGAGGCCAAGCTGAGGA
GGTGCTGCAGGTCCCGTCTGGCTCCAGCCCTTGTCAGATTCACCCAGGGTGAAGCCTTCAAAGCTTTTTGCTACCAAAGCCCACTCACCC
TTTGAGCTACAGAACACTTTGCTAGGAGATACTCTTCTGCCTCCTAGACCTGTTCTTTCCATCTTTAGAAACATCAGTTTTTGTATGGAA
GCCACCGGGAGATTTCTGGATGGTGGTGCATCCGTGAATGCGCTGATCGTTTCTTCCAGTTAGAGTCTTCATCTGTCCGACAAGTTCACT
CGCCTCGGTTGCGGACCTAGGACCATTTCTCTGCAGGCCACTTACCTTCCCCTGAGTCAGGCTTACTAATGCTGCCCTCACTGCCTCTTT
GCAGTAGGGGAGAGAGCAGAGAAGTACAGGTCATCTGCTGGGATCTAGTTTTCCAAGTAACATTTTGTGGTGACAGAAGCCTAAAAAAAG
CTAAAATCAGGAAAGAAAAGGAAAAATACGAATTGAAAATTAAGGAAATGTTAGTAAAATAGATGAGTGTTAAACTAGATTGTATTCATT

>61034_61034_1_NUP210-XPC_NUP210_chr3_13364809_ENST00000254508_XPC_chr3_14214562_ENST00000285021_length(amino acids)=2495AA_BP=1589
MAARGRGLLLLTLSVLLAAGPSAAAAKLNIPKVLLPFTRATRVNFTLEASEGCYRWLSTRPEVASIEPLGLDEQQCSQKAVVQARLTQPA
RLTSIIFAEDITTGQVLRCDAIVDLIHDIQIVSTTRELYLEDSPLELKIQALDSEGNTFSTLAGLVFEWTIVKDSEADRFSDSHNALRIL
TFLESTYIPPSYISEMEKAAKQGDTILVSGMKTGSSKLKARIQEAVYKNVRPAEVRLLILENILLNPAYDVYLMVGTSIHYKVQKIRQGK
ITELSMPSDQYELQLQNSIPGPEGDPARPVAVLAQDTSMVTALQLGQSSLVLGHRSIRMQGASRLPNSTIYVVEPGYLGFTVHPGDRWVL
ETGRLYEITIEVFDKFSNKVYVSDNIRIETVLPAEFFEVLSSSQNGSYHRIRALKRGQTAIDAALTSVVDQDGGVHILQVPVWNQQEVEI
HIPITLYPSILTFPWQPKTGAYQYTIRAHGGSGNFSWSSSSHLVATVTVKGVMTTGSDIGFSVIQAHDVQNPLHFGEMKVYVIEPHSMEF
APCQVEARVGQALELPLRISGLMPGGASEVVTLSDCSHFDLAVEVENQGVFQPLPGRLPPGSEHCSGIRVKAEAQGSTTLLVSYRHGHVH
LSAKITIAAYLPLKAVDPSSVALVTLGSSKEMLFEGGPRPWILEPSKFFQNVTAEDTDSIGLALFAPHSSRNYQQHWILVTCQALGEQVI
ALSVGNKPSLTNPFPAVEPAVVKFVCAPPSRLTLAPVYTSPQLDMSCPLLQQNKQVVPVSSHRNPRLDLAAYDQEGRRFDNFSSLSIQWE
STRPVLASIEPELPMQLVSQDDESGQKKLHGLQAILVHEASGTTAITATATGYQESHLSSARTKQPHDPLVPLSASIELILVEDVRVSPE
EVTIYNHPGIQAELRIREGSGYFFLNTSTADVVKVAYQEARGVAMVHPLLPGSSTIMIHDLCLVFPAPAKAVVYVSDIQELYIRVVDKVE
IGKTVKAYVRVLDLHKKPFLAKYFPFMDLKLRAASPIITLVALDEALDNYTITFLIRGVAIGQTSLTASVTNKAGQRINSAPQQIEVFPP
FRLMPRKVTLLIGATMQVTSEGGPQPQSNILFSISNESVALVSAAGLVQGLAIGNGTVSGLVQAVDAETGKVVIISQDLVQVEVLLLRAV
RIRAPIMRMRTGTQMPIYVTGITNHQNPFSFGNAVPGLTFHWSVTKRDVLDLRGRHHEASIRLPSQYNFAMNVLGRVKGRTGLRVVVKAV
DPTSGQLYGLARELSDEIQVQVFEKLQLLNPEIEAEQILMSPNSYIKLQTNRDGAASLSYRVLDGPEKVPVVHVDEKGFLASGSMIGTST
IEVIAQEPFGANQTIIVAVKVSPVSYLRVSMSPVLHTQNKEALVAVPLGMTVTFTVHFHDNSGDVFHAHSSVLNFATNRDDFVQIGKGPT
NNTCVVRTVSVGLTLLRVWDAEHPGLSDFMPLPVLQAISPELSGAMVVGDVLCLATVLTSLEGLSGTWSSSANSILHIDPKTGVAVARAV
GSVTVYYEVAGHLRTYKEVVVSVPQRIMARHLHPIQTSFQEATASKVIVAVGDRSSNLRDAFEDEKPPKKSLLSKVSQGKRKRGCSHPGG
SADGPAKKKVAKVTVKSENLKVIKDEALSDGDDLRDFPSDLKKAHHLKRGATMNEDSNEEEEESENDWEEVEELSEPVLGDVRESTAFSR
SLLPVKPVEIEIETPEQAKTRERSEKIKLEFETYLRRAMKRFNKGVHEDTHKVHLLCLLANGFYRNNICSQPDLHAIGLSIIPARFTRVL
PRDVDTYYLSNLVKWFIGTFTVNAELSASEQDNLQTTLERRFAIYSARDDEELVHIFLLILRALQLLTRLVLSLQPIPLKSATAKGKKPS
KERLTADPGGSSETSSQVLENHTKPKTSKGTKQEETFAKGTCRPSAKGKRNKGGRKKRSKPSSSEEDEGPGDKQEKATQRRPHGRERRVA
SRVSYKEESGSDEAGSGSDFELSSGEASDPSDEDSEPGPPKQRKAPAPQRTKAGSKSASRTHRGSHRKDPSLPAASSSSSSSKRGKKMCS
DGEKAEKRSIAGIDQWLEVFCEQEEKWVCVDCVHGVVGQPLTCYKYATKPMTYVVGIDSDGWVRDVTQRYDPVWMTVTRKCRVDAEWWAE
TLRPYQSPFMDREKKEDLEFQAKHMDQPLPTAIGLYKNHPLYALKRHLLKYEAIYPETAAILGYCRGEAVYSRDCVHTLHSRDTWLKKAR
VVRLGEVPYKMVKGFSNRARKARLAEPQLREENDLGLFGYWQTEEYQPPVAVDGKVPRNEFGNVYLFLPSMMPIGCVQLNLPNLHRVARK
LDIDCVQAITGFDFHGGYSHPVTDGYIVCEEFKDVLLTAWENEQAVIERKEKEKKEKRALGNWKLLAKGLLIRERLKRRYGPKSEAAAPH

--------------------------------------------------------------
>61034_61034_2_NUP210-XPC_NUP210_chr3_13364809_ENST00000254508_XPC_chr3_14214562_ENST00000449060_length(transcript)=7460nt_BP=4851nt
GCGCGCGCGCGGGCGGGAGCGGAGGGCAACGGGGCGGCGCGGGCGGCCGGGCGCAGGGTCGCGGGAGGTGACGCGCGGCGAGGATGGCGG
CGCGGGGCCGGGGGCTGCTGCTGCTGACGCTGTCGGTGCTGTTGGCGGCGGGCCCCTCCGCCGCTGCGGCCAAGCTCAACATCCCCAAAG
TGCTGCTGCCCTTCACGCGGGCCACGCGCGTTAACTTCACGCTGGAGGCCTCGGAGGGCTGCTACCGCTGGTTGTCCACCCGGCCGGAGG
TGGCCAGCATCGAGCCGCTGGGCCTGGACGAGCAGCAGTGCTCCCAGAAGGCAGTGGTGCAGGCCCGCCTGACCCAGCCTGCCCGCCTCA
CCAGCATCATCTTCGCAGAGGACATCACCACAGGCCAGGTCCTGCGCTGTGATGCCATTGTGGACCTCATCCATGACATCCAGATCGTCT
CCACCACCCGCGAGCTCTACCTGGAGGACTCCCCCCTGGAGCTGAAGATCCAGGCCCTGGACTCCGAAGGGAACACCTTCAGCACTCTGG
CTGGACTGGTCTTCGAGTGGACGATTGTGAAGGACTCCGAGGCGGACAGGTTCTCAGACTCCCACAATGCGCTGCGAATCCTCACTTTCT
TGGAGTCTACGTACATCCCTCCTTCTTACATCTCAGAGATGGAGAAGGCTGCCAAGCAAGGGGACACCATCCTGGTGTCTGGGATGAAGA
CCGGGAGCTCCAAGCTCAAGGCTCGCATCCAGGAGGCTGTCTACAAGAATGTACGCCCTGCAGAAGTCAGGCTGCTGATTTTGGAAAACA
TCCTTCTGAACCCGGCCTATGACGTCTACCTGATGGTGGGAACCTCCATTCACTACAAGGTGCAGAAGATCAGGCAAGGGAAAATTACAG
AACTCTCCATGCCTTCCGATCAGTACGAGTTGCAGCTTCAGAACAGCATCCCGGGCCCCGAAGGAGACCCAGCCCGGCCGGTGGCTGTCT
TGGCCCAGGACACGTCGATGGTCACTGCACTGCAGCTGGGACAGAGCAGCCTCGTCCTTGGCCACAGGAGTATTCGCATGCAAGGTGCTT
CTAGGTTACCCAACAGCACTATCTACGTGGTCGAACCTGGATACCTAGGGTTCACTGTTCACCCTGGTGACAGGTGGGTGCTGGAGACCG
GCCGCCTGTATGAAATCACCATCGAAGTTTTTGACAAGTTCAGCAACAAGGTCTATGTATCTGACAACATCCGAATTGAAACTGTGCTTC
CTGCTGAGTTCTTCGAGGTGCTCTCGTCCTCCCAGAATGGGTCATACCATCGCATCAGGGCACTAAAGAGGGGACAGACGGCCATTGACG
CGGCCCTCACCTCTGTGGTGGACCAGGATGGAGGGGTCCACATACTACAGGTGCCTGTGTGGAACCAGCAGGAGGTGGAAATTCACATCC
CGATCACCCTGTATCCCAGCATCTTGACATTTCCGTGGCAACCAAAGACGGGCGCCTATCAGTACACAATAAGGGCCCACGGTGGCAGTG
GGAACTTCAGCTGGTCTTCGTCAAGCCACCTGGTTGCCACAGTTACTGTCAAGGGCGTGATGACCACAGGCAGTGACATCGGGTTCAGTG
TGATCCAGGCACATGATGTGCAGAACCCACTCCATTTCGGTGAGATGAAGGTGTATGTGATCGAGCCCCACAGCATGGAGTTTGCCCCGT
GCCAGGTGGAGGCACGTGTGGGCCAGGCCCTGGAGCTGCCCCTGAGGATCAGTGGCCTCATGCCCGGCGGGGCCAGTGAGGTGGTCACCT
TGAGCGACTGCTCCCACTTTGACTTGGCTGTCGAGGTGGAGAACCAGGGTGTGTTCCAGCCACTCCCAGGGAGGCTGCCGCCAGGCTCTG
AGCACTGCAGCGGCATCCGGGTAAAGGCCGAGGCCCAGGGCTCTACCACGCTTCTTGTGAGCTACAGACACGGCCACGTCCACCTGAGTG
CCAAGATCACCATTGCTGCCTACCTGCCCCTCAAGGCTGTGGATCCCTCCTCTGTTGCCTTGGTAACCCTGGGCTCCTCAAAGGAGATGC
TGTTTGAAGGAGGTCCCAGACCTTGGATCCTCGAGCCGTCCAAATTCTTCCAGAACGTCACCGCTGAGGACACTGACAGCATCGGCCTGG
CTCTCTTTGCCCCCCATTCCTCCCGGAATTATCAGCAACACTGGATCCTTGTGACCTGTCAGGCCTTGGGTGAGCAGGTCATCGCCCTGT
CGGTGGGGAACAAGCCCAGCCTCACCAACCCCTTTCCTGCGGTGGAGCCTGCCGTGGTGAAGTTCGTCTGCGCCCCACCGTCCAGGCTCA
CCCTCGCGCCTGTCTACACCAGCCCCCAGCTGGACATGTCCTGTCCGCTGCTGCAGCAGAACAAGCAGGTGGTCCCAGTGTCCAGCCACC
GCAACCCCCGGCTGGACCTGGCTGCTTACGACCAGGAGGGCCGCCGGTTCGACAACTTCAGCTCTCTGAGCATCCAGTGGGAGTCCACCA
GGCCAGTGTTGGCCAGCATCGAGCCTGAGCTGCCCATGCAGCTGGTGTCCCAGGACGATGAGAGTGGCCAAAAGAAGCTGCACGGTTTGC
AGGCCATTTTGGTTCACGAGGCATCAGGAACCACAGCCATCACTGCCACTGCCACTGGCTACCAGGAGTCCCACCTCAGCTCTGCCAGAA
CAAAGCAGCCGCATGACCCTCTGGTGCCTCTGTCGGCCTCCATAGAGCTCATCCTGGTGGAGGACGTGAGGGTGAGCCCAGAAGAGGTGA
CCATCTACAACCACCCTGGCATCCAGGCAGAGCTCCGCATCAGGGAAGGCTCAGGTTACTTCTTCCTCAACACCAGCACCGCAGATGTTG
TCAAGGTGGCCTACCAGGAGGCCAGGGGTGTCGCCATGGTGCACCCTTTGCTCCCGGGCTCATCCACCATCATGATCCATGACTTGTGCC
TCGTCTTCCCGGCCCCAGCCAAGGCTGTCGTTTACGTGTCGGACATTCAGGAGCTGTACATCCGTGTGGTTGACAAGGTGGAGATTGGGA
AGACAGTGAAGGCATACGTCCGCGTGCTGGACTTGCACAAGAAGCCCTTCCTTGCCAAATACTTCCCCTTTATGGACCTGAAGCTCCGAG
CAGCCTCCCCGATCATTACATTGGTGGCCCTTGATGAAGCCCTTGACAACTACACCATCACATTCCTCATCCGCGGTGTGGCCATCGGCC
AGACCAGTCTAACTGCAAGTGTGACCAATAAAGCTGGACAGAGAATCAACTCAGCCCCACAACAGATTGAAGTCTTTCCCCCGTTCAGGC
TGATGCCCAGGAAGGTGACACTGCTTATCGGGGCCACGATGCAGGTCACCTCCGAGGGCGGCCCCCAGCCTCAGTCCAACATCCTTTTCT
CCATCAGCAATGAGAGCGTTGCGCTGGTGAGCGCTGCTGGGCTGGTACAGGGCCTCGCCATCGGGAACGGCACTGTGTCTGGGCTCGTGC
AGGCAGTGGATGCAGAGACCGGCAAGGTGGTCATCATCTCTCAGGACCTCGTGCAGGTGGAGGTGCTGCTGCTAAGGGCCGTGAGGATCC
GCGCCCCCATCATGCGGATGAGGACGGGCACCCAGATGCCCATCTATGTCACCGGCATCACCAACCACCAGAACCCTTTCTCCTTTGGCA
ATGCCGTGCCAGGCCTGACCTTCCACTGGTCTGTCACCAAGCGGGACGTCCTGGACCTCCGAGGGCGGCACCACGAGGCGTCGATCCGAC
TCCCGTCACAGTACAACTTTGCCATGAACGTGCTCGGCCGGGTAAAAGGCCGGACCGGGCTGAGGGTGGTGGTCAAGGCTGTGGACCCCA
CATCGGGGCAGCTGTATGGCCTGGCCAGAGAACTCTCGGATGAGATCCAAGTCCAGGTGTTTGAGAAGCTGCAGCTGCTCAACCCTGAAA
TAGAAGCAGAACAAATATTAATGTCGCCCAACTCATATATAAAGCTGCAGACAAACAGGGATGGTGCAGCCTCTCTGAGCTACCGCGTCC
TGGATGGACCCGAAAAGGTTCCAGTTGTGCATGTTGATGAGAAAGGCTTTCTAGCATCAGGGTCTATGATCGGGACATCCACCATCGAAG
TGATTGCACAAGAGCCCTTTGGGGCCAACCAAACCATCATTGTTGCTGTAAAGGTATCCCCTGTTTCCTACCTGAGGGTTTCCATGAGCC
CTGTCCTGCACACCCAGAACAAGGAGGCCCTGGTGGCCGTGCCTTTGGGAATGACCGTGACCTTCACTGTCCACTTCCACGACAACTCTG
GAGATGTCTTCCATGCTCACAGTTCGGTCCTCAACTTTGCCACTAACAGAGACGACTTTGTGCAGATCGGGAAGGGCCCCACCAACAACA
CCTGCGTTGTCCGCACAGTCAGCGTGGGCCTGACACTGCTCCGTGTGTGGGACGCAGAGCACCCGGGCCTCTCGGACTTCATGCCCCTGC
CTGTCCTACAGGCCATCTCCCCAGAGCTGTCTGGGGCCATGGTGGTGGGGGACGTGCTCTGTCTGGCCACTGTTCTGACCAGCCTGGAAG
GCCTCTCAGGAACCTGGAGCTCCTCGGCCAACAGCATCCTCCACATCGACCCCAAGACGGGTGTGGCTGTGGCCCGGGCCGTGGGATCCG
TGACGGTTTACTATGAGGTCGCTGGGCACCTGAGGACCTACAAGGAGGTGGTGGTCAGCGTCCCTCAGAGGATCATGGCCCGTCACCTCC
ACCCCATCCAGACCAGCTTCCAGGAGGCTACAGCCTCCAAAGTGATTGTTGCCGTGGGAGACAGAAGCTCTAACCTGAGAGATGCCTTTG
AAGATGAGAAACCCCCAAAGAAGAGCCTTCTCTCCAAAGTTTCACAAGGAAAGAGGAAAAGAGGCTGCAGTCATCCTGGGGGTTCAGCAG
ATGGTCCAGCAAAAAAGAAAGTGGCCAAGGTGACTGTTAAATCTGAAAACCTCAAGGTTATAAAGGATGAAGCCCTCAGCGATGGGGATG
ACCTCAGGGACTTTCCAAGTGACCTCAAGAAGGCACACCATCTGAAGAGAGGGGCTACCATGAATGAAGACAGCAATGAAGAAGAGGAAG
AAAGTGAAAATGATTGGGAAGAGGCGAAGACAAGAGAAAGAAGTGAAAAGATAAAACTGGAGTTTGAGACATATCTTCGGAGGGCGATGA
AACGTTTCAATAAAGGGGTCCATGAGGACACACACAAGGTTCACCTTCTCTGCCTGCTAGCAAATGGCTTCTATCGAAATAACATCTGCA
GCCAGCCAGATCTGCATGCTATTGGCCTGTCCATCATCCCAGCCCGCTTTACCAGAGTGCTGCCTCGAGATGTGGACACCTACTACCTCT
CAAACCTGGTGAAGTGGTTCATTGGAACATTTACAGTTAATGCAGAACTTTCAGCCAGTGAACAAGATAACCTGCAGACTACATTGGAAA
GGAGATTTGCTATTTACTCTGCTCGAGATGATGAGGAATTGGTCCATATATTCTTACTGATTCTCCGGGCTCTGCAGCTCTTGACCCGGC
TGGTATTGTCTCTACAGCCAATTCCTCTGAAGTCAGCAACAGCAAAGGGAAAGAAACCTTCCAAGGAAAGATTGACTGCGGATCCAGGAG
GCTCCTCAGAAACTTCCAGCCAAGTTCTAGAAAACCACACCAAACCAAAGACCAGCAAAGGAACCAAACAAGAGGAAACCTTTGCTAAGG
GCACCTGCAGGCCAAGTGCCAAAGGGAAGAGGAACAAGGGAGGCAGAAAGAAACGGAGCAAGCCCTCCTCCAGCGAGGAAGATGAGGGCC
CAGGAGACAAGCAGGAGAAGGCAACCCAGCGACGTCCGCATGGCCGGGAGCGGCGGGTGGCCTCCAGGGTGTCTTATAAAGAGGAGAGTG
GGAGTGATGAGGCTGGCAGCGGCTCTGATTTTGAGCTCTCCAGTGGAGAAGCCTCTGATCCCTCTGATGAGGATTCCGAACCTGGCCCTC
CAAAGCAGAGGAAAGCCCCCGCTCCTCAGAGGACAAAGGCTGGGTCCAAGAGTGCCTCCAGGACCCATCGTGGGAGCCATCGTAAGGACC
CAAGCTTGCCAGCGGCATCCTCAAGCTCTTCAAGCAGTAAAAGAGGCAAGAAAATGTGCAGCGATGGTGAGAAGGCAGAAAAAAGAAGCA
TAGCTGGTATAGACCAGTGGCTAGAGGTGTTCTGTGAGCAGGAGGAAAAGTGGGTATGTGTAGACTGTGTGCACGGTGTGGTGGGCCAGC
CTCTGACCTGTTACAAGTACGCCACCAAGCCCATGACCTATGTGGTGGGCATTGACAGTGACGGCTGGGTCCGAGATGTCACACAGAGGT
ACGACCCAGTCTGGATGACAGTGACCCGCAAGTGCCGGGTTGATGCTGAGTGGTGGGCCGAGACCTTGAGACCATACCAGAGCCCATTTA
TGGACAGGGAGAAGAAAGAAGACTTGGAGTTTCAGGCTAAACACATGGACCAGCCTTTGCCCACTGCCATTGGCTTATATAAGAACCACC
CTCTGTATGCCCTGAAGCGGCATCTCCTGAAATATGAGGCCATCTATCCCGAGACAGCTGCCATCCTTGGGTATTGTCGTGGAGAAGCGG
TCTACTCCAGGGATTGTGTGCACACTCTGCATTCCAGGGACACGTGGCTGAAGAAAGCAAGAGTGGTGAGGCTTGGAGAAGTACCCTACA
AGATGGTGAAAGGCTTTTCTAACCGTGCTCGGAAAGCCCGACTTGCTGAGCCCCAGCTGCGGGAAGAAAATGACCTGGGCCTGTTTGGCT
ACTGGCAGACAGAGGAGTATCAGCCCCCAGTGGCCGTGGACGGGAAGGTGCCCCGGAACGAGTTTGGGAATGTGTACCTCTTCCTGCCCA
GCATGATGCCTATTGGCTGTGTCCAGCTGAACCTGCCCAATCTACACCGCGTGGCCCGCAAGCTGGACATCGACTGTGTCCAGGCCATCA
CTGGCTTTGATTTCCATGGCGGCTACTCCCATCCCGTGACTGATGGATACATCGTCTGCGAGGAATTCAAAGACGTGCTCCTGACTGCCT
GGGAAAATGAGCAGGCAGTCATTGAAAGGAAGGAGAAGGAGAAAAAGGAGAAGCGGGCTCTAGGGAACTGGAAGTTGCTGGCCAAAGGTC
TGCTCATCAGGGAGAGGCTGAAGCGTCGCTACGGGCCCAAGAGTGAGGCAGCAGCTCCCCACACAGATGCAGGAGGTGGACTCTCTTCTG
ATGAAGAGGAGGGGACCAGCTCTCAAGCAGAAGCGGCCAGGATACTGGCTGCCTCCTGGCCTCAAAACCGAGAAGATGAAGAAAAGCAGA

>61034_61034_2_NUP210-XPC_NUP210_chr3_13364809_ENST00000254508_XPC_chr3_14214562_ENST00000449060_length(amino acids)=2459AA_BP=1589
MAARGRGLLLLTLSVLLAAGPSAAAAKLNIPKVLLPFTRATRVNFTLEASEGCYRWLSTRPEVASIEPLGLDEQQCSQKAVVQARLTQPA
RLTSIIFAEDITTGQVLRCDAIVDLIHDIQIVSTTRELYLEDSPLELKIQALDSEGNTFSTLAGLVFEWTIVKDSEADRFSDSHNALRIL
TFLESTYIPPSYISEMEKAAKQGDTILVSGMKTGSSKLKARIQEAVYKNVRPAEVRLLILENILLNPAYDVYLMVGTSIHYKVQKIRQGK
ITELSMPSDQYELQLQNSIPGPEGDPARPVAVLAQDTSMVTALQLGQSSLVLGHRSIRMQGASRLPNSTIYVVEPGYLGFTVHPGDRWVL
ETGRLYEITIEVFDKFSNKVYVSDNIRIETVLPAEFFEVLSSSQNGSYHRIRALKRGQTAIDAALTSVVDQDGGVHILQVPVWNQQEVEI
HIPITLYPSILTFPWQPKTGAYQYTIRAHGGSGNFSWSSSSHLVATVTVKGVMTTGSDIGFSVIQAHDVQNPLHFGEMKVYVIEPHSMEF
APCQVEARVGQALELPLRISGLMPGGASEVVTLSDCSHFDLAVEVENQGVFQPLPGRLPPGSEHCSGIRVKAEAQGSTTLLVSYRHGHVH
LSAKITIAAYLPLKAVDPSSVALVTLGSSKEMLFEGGPRPWILEPSKFFQNVTAEDTDSIGLALFAPHSSRNYQQHWILVTCQALGEQVI
ALSVGNKPSLTNPFPAVEPAVVKFVCAPPSRLTLAPVYTSPQLDMSCPLLQQNKQVVPVSSHRNPRLDLAAYDQEGRRFDNFSSLSIQWE
STRPVLASIEPELPMQLVSQDDESGQKKLHGLQAILVHEASGTTAITATATGYQESHLSSARTKQPHDPLVPLSASIELILVEDVRVSPE
EVTIYNHPGIQAELRIREGSGYFFLNTSTADVVKVAYQEARGVAMVHPLLPGSSTIMIHDLCLVFPAPAKAVVYVSDIQELYIRVVDKVE
IGKTVKAYVRVLDLHKKPFLAKYFPFMDLKLRAASPIITLVALDEALDNYTITFLIRGVAIGQTSLTASVTNKAGQRINSAPQQIEVFPP
FRLMPRKVTLLIGATMQVTSEGGPQPQSNILFSISNESVALVSAAGLVQGLAIGNGTVSGLVQAVDAETGKVVIISQDLVQVEVLLLRAV
RIRAPIMRMRTGTQMPIYVTGITNHQNPFSFGNAVPGLTFHWSVTKRDVLDLRGRHHEASIRLPSQYNFAMNVLGRVKGRTGLRVVVKAV
DPTSGQLYGLARELSDEIQVQVFEKLQLLNPEIEAEQILMSPNSYIKLQTNRDGAASLSYRVLDGPEKVPVVHVDEKGFLASGSMIGTST
IEVIAQEPFGANQTIIVAVKVSPVSYLRVSMSPVLHTQNKEALVAVPLGMTVTFTVHFHDNSGDVFHAHSSVLNFATNRDDFVQIGKGPT
NNTCVVRTVSVGLTLLRVWDAEHPGLSDFMPLPVLQAISPELSGAMVVGDVLCLATVLTSLEGLSGTWSSSANSILHIDPKTGVAVARAV
GSVTVYYEVAGHLRTYKEVVVSVPQRIMARHLHPIQTSFQEATASKVIVAVGDRSSNLRDAFEDEKPPKKSLLSKVSQGKRKRGCSHPGG
SADGPAKKKVAKVTVKSENLKVIKDEALSDGDDLRDFPSDLKKAHHLKRGATMNEDSNEEEEESENDWEEAKTRERSEKIKLEFETYLRR
AMKRFNKGVHEDTHKVHLLCLLANGFYRNNICSQPDLHAIGLSIIPARFTRVLPRDVDTYYLSNLVKWFIGTFTVNAELSASEQDNLQTT
LERRFAIYSARDDEELVHIFLLILRALQLLTRLVLSLQPIPLKSATAKGKKPSKERLTADPGGSSETSSQVLENHTKPKTSKGTKQEETF
AKGTCRPSAKGKRNKGGRKKRSKPSSSEEDEGPGDKQEKATQRRPHGRERRVASRVSYKEESGSDEAGSGSDFELSSGEASDPSDEDSEP
GPPKQRKAPAPQRTKAGSKSASRTHRGSHRKDPSLPAASSSSSSSKRGKKMCSDGEKAEKRSIAGIDQWLEVFCEQEEKWVCVDCVHGVV
GQPLTCYKYATKPMTYVVGIDSDGWVRDVTQRYDPVWMTVTRKCRVDAEWWAETLRPYQSPFMDREKKEDLEFQAKHMDQPLPTAIGLYK
NHPLYALKRHLLKYEAIYPETAAILGYCRGEAVYSRDCVHTLHSRDTWLKKARVVRLGEVPYKMVKGFSNRARKARLAEPQLREENDLGL
FGYWQTEEYQPPVAVDGKVPRNEFGNVYLFLPSMMPIGCVQLNLPNLHRVARKLDIDCVQAITGFDFHGGYSHPVTDGYIVCEEFKDVLL
TAWENEQAVIERKEKEKKEKRALGNWKLLAKGLLIRERLKRRYGPKSEAAAPHTDAGGGLSSDEEEGTSSQAEAARILAASWPQNREDEE

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for NUP210-XPC


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with
TgeneXPCchr3:13364809chr3:14214562ENST00000285021016847_86634.333333333333336941.0CETN2
TgeneXPCchr3:13364809chr3:14214562ENST00000449060016847_86634.333333333333336904.0CETN2
TgeneXPCchr3:13364809chr3:14214562ENST00000285021016816_94034.333333333333336941.0ERCC2 and GTF2H1
TgeneXPCchr3:13364809chr3:14214562ENST00000449060016816_94034.333333333333336904.0ERCC2 and GTF2H1
TgeneXPCchr3:13364809chr3:14214562ENST00000285021016496_73434.333333333333336941.0RAD23B
TgeneXPCchr3:13364809chr3:14214562ENST00000449060016496_73434.333333333333336904.0RAD23B


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for NUP210-XPC


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for NUP210-XPC


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource