FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:UBP1-TFCP2L1 (FusionGDB2 ID:96393)

Fusion Gene Summary for UBP1-TFCP2L1

check button Fusion gene summary
Fusion gene informationFusion gene name: UBP1-TFCP2L1
Fusion gene ID: 96393
HgeneTgene
Gene symbol

UBP1

TFCP2L1

Gene ID

7342

29842

Gene nameupstream binding protein 1transcription factor CP2 like 1
SynonymsLBP-1B|LBP-1a|LBP1A|LBP1BCRTR1|LBP-9|LBP9
Cytomap

3p22.3

2q14.2

Type of geneprotein-codingprotein-coding
Descriptionupstream-binding protein 1transcription factor LBP-1transcription factor CP2-like protein 1CP2-related transcriptional repressor 1CRTR-1transcription factor LBP-9
Modification date2020031320200313
UniProtAcc..
Ensembl transtripts involved in fusion geneENST00000283628, ENST00000283629, 
ENST00000447368, ENST00000486388, 
ENST00000263707, 
Fusion gene scores* DoF score8 X 8 X 6=3844 X 4 X 3=48
# samples 94
** MAII scorelog2(9/384*10)=-2.09310940439148
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(4/48*10)=-0.263034405833794
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: UBP1 [Title/Abstract] AND TFCP2L1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointUBP1(33453072)-TFCP2L1(122004546), # samples:17
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID

check buttonFusion gene breakpoints across UBP1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across TFCP2L1 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4ESCATCGA-IG-A3I8UBP1chr3

33453072

-TFCP2L1chr2

122004546

-
ChimerDB4ESCATCGA-L5-A4OEUBP1chr3

33453072

-TFCP2L1chr2

122004546

-
ChimerDB4ESCATCGA-L5-A4OMUBP1chr3

33453072

-TFCP2L1chr2

122004546

-
ChimerDB4ESCATCGA-L5-A4OSUBP1chr3

33453072

-TFCP2L1chr2

122004546

-
ChimerDB4ESCATCGA-L5-A4OXUBP1chr3

33453072

-TFCP2L1chr2

122004546

-
ChimerDB4ESCATCGA-L5-A88TUBP1chr3

33453072

-TFCP2L1chr2

122004546

-
ChimerDB4ESCATCGA-L5-A893UBP1chr3

33453072

-TFCP2L1chr2

122004546

-
ChimerDB4ESCATCGA-L7-A6VZUBP1chr3

33453072

-TFCP2L1chr2

122004546

-
ChimerDB4ESCATCGA-LN-A4A4UBP1chr3

33453072

-TFCP2L1chr2

122004546

-
ChimerDB4ESCATCGA-LN-A9FQUBP1chr3

33453072

-TFCP2L1chr2

122004546

-
ChimerDB4ESCATCGA-R6-A6DNUBP1chr3

33453072

-TFCP2L1chr2

122004546

-
ChimerDB4ESCATCGA-R6-A6DQUBP1chr3

33453072

-TFCP2L1chr2

122004546

-
ChimerDB4ESCATCGA-R6-A6Y0UBP1chr3

33453072

-TFCP2L1chr2

122004546

-
ChimerDB4ESCATCGA-R6-A8WCUBP1chr3

33453072

-TFCP2L1chr2

122004546

-
ChimerDB4ESCATCGA-VR-A8ERUBP1chr3

33453072

-TFCP2L1chr2

122004546

-
ChimerDB4ESCATCGA-Z6-A8JDUBP1chr3

33453072

-TFCP2L1chr2

122004546

-
ChimerDB4OVTCGA-3P-A9WAUBP1chr3

33453072

-TFCP2L1chr2

122004546

-


Top

Fusion Gene ORF analysis for UBP1-TFCP2L1

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
In-frameENST00000283628ENST00000263707UBP1chr3

33453072

-TFCP2L1chr2

122004546

-
In-frameENST00000283629ENST00000263707UBP1chr3

33453072

-TFCP2L1chr2

122004546

-
In-frameENST00000447368ENST00000263707UBP1chr3

33453072

-TFCP2L1chr2

122004546

-
intron-3CDSENST00000486388ENST00000263707UBP1chr3

33453072

-TFCP2L1chr2

122004546

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000283629UBP1chr333453072-ENST00000263707TFCP2L1chr2122004546-977510853472020557
ENST00000447368UBP1chr333453072-ENST00000263707TFCP2L1chr2122004546-94077171621652496
ENST00000283628UBP1chr333453072-ENST00000263707TFCP2L1chr2122004546-94998092541744496

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000283629ENST00000263707UBP1chr333453072-TFCP2L1chr2122004546-0.0004523020.99954766
ENST00000447368ENST00000263707UBP1chr333453072-TFCP2L1chr2122004546-0.0004202210.9995797
ENST00000283628ENST00000263707UBP1chr333453072-TFCP2L1chr2122004546-0.0004109810.999589

Top

Fusion Genomic Features for UBP1-TFCP2L1


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for UBP1-TFCP2L1


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr3:33453072/chr2:122004546)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
..
FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneTFCP2L1chr3:33453072chr2:122004546ENST00000263707415261_365168480.0RegionSAM2-like domain

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneTFCP2L1chr3:33453072chr2:122004546ENST0000026370741521_260168480.0RegionCP2-like domain


Top

Fusion Gene Sequence for UBP1-TFCP2L1


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>96393_96393_1_UBP1-TFCP2L1_UBP1_chr3_33453072_ENST00000283628_TFCP2L1_chr2_122004546_ENST00000263707_length(transcript)=9499nt_BP=809nt
CTGGGCCGCGAGGCGCGGGGAGTGGCCCTAAGCGCGGGGCTGTACGGGGAGGGCCGAGGCGTGGGGCGAACGCCGGGGCGGGGTCGGGGC
GGGGCCGGGGGAGGCGGAAGCGGCGCGTGAACCTAAGTCGGAGTCGCGTTGAAGCCAGCGTCCGGCCGGCAGAGCGGCGTCGGCGGGGAC
GCGCCAGCGGAGCTCCGGCTCCTTCGCCCTGGACGCGGAGGCCGCGGTGTGCGGGGCGACGGCGAGGCCGGAAGATGGCCTGGGTGCTCA
AGATGGACGAGGTGATCGAGTCCGGGCTGGTGCACGACTTCGACGCCAGCCTCTCGGGCATCGGGCAGGAACTGGGCGCCGGCGCTTACA
GCATGAGTGATGTCTTGGCATTGCCCATTTTCAAGCAGGAAGATTCCAGCCTTCCATTGGATGGTGAAACAGAGCACCCACCCTTTCAGT
ATGTGATGTGTGCTGCAACGTCACCAGCAGTAAAACTGCATGATGAAACGCTTACTTATTTGAACCAAGGTCAGTCATATGAAATTCGGA
TGCTGGATAATCGGAAAATGGGTGATATGCCTGAGATCAATGGAAAATTAGTAAAGAGCATCATAAGGGTTGTATTCCATGACAGACGGC
TACAATACACAGAGCATCAGCAACTTGAAGGATGGAAGTGGAATCGCCCAGGAGACAGACTTCTTGATTTAGATATTCCAATGTCTGTGG
GAATAATTGACACAAGGACGAATCCAAGCCAGTTAAATGCGGTTGAATTTCTGTGGGACCCAGCAAAACGCACCTCTGCTTTCATTCAGG
TACACTGCATCAGCACAGAATTCACCCCCAGGAAGCACGGGGGCGAGAAGGGAGTGCCCTTTCGAGTCCAGATTGACACGTTTAAGCAGA
ACGAGAATGGGGAGTACACGGAGCACCTGCACTCAGCCAGCTGCCAGATCAAGGTGTTCAAGCCGAAGGGAGCCGATCGGAAACAGAAGA
CTGACCGGGAGAAGATGGAGAAAAGAACTGCCCAAGAGAAGGAGAAATACCAGCCGTCCTATGAAACCACCATCCTCACAGAGTGCTCTC
CATGGCCCGACGTGGCCTACCAGGTGAACAGCGCCCCGTCCCCAAGCTACAATGGTTCTCCAAACAGCTTTGGCCTCGGCGAAGGCAACG
CCTCTCCGACCCACCCGGTGGAGGCCCTGCCCGTGGGCAGTGACCACCTGCTCCCATCAGCTTCGATCCAGGATGCCCAGCAGTGGCTTC
ACCGCAACAGGTTCTCGCAGTTCTGCCGGCTCTTTGCCAGCTTCTCAGGTGCTGACTTGCTGAAGATGTCCCGAGATGATTTGGTCCAGA
TCTGTGGTCCCGCAGATGGGATCCGGCTCTTCAACGCCATCAAAGGCCGGAATGTGAGGCCAAAGATGACCATTTATGTCTGTCAGGAGC
TGGAGCAGAATCGAGTGCCCCTGCAGCAGAAGCGGGACGGCAGTGGAGACAGCAACCTGTCTGTGTACCACGCCATCTTCCTGGAAGAGC
TGACCACCTTGGAGCTGATTGAGAAGATCGCCAACCTGTACAGCATCTCCCCCCAGCACATCCACCGAGTCTACCGGCAGGGCCCCACGG
GCATCCATGTGGTGGTGAGCAACGAGATGGTGCAGAACTTCCAAGATGAATCCTGTTTTGTCCTCAGCACAATTAAAGCTGAGAGCAATG
ATGGCTACCACATCATCCTGAAATGTGGACTCTGAGCAGCAGTGGACCTCATACCTGTCTCCAGCTCCCAGCCCTGTGGATCCCCGTGGA
TGTAGACATTGCCCCACTGTAAGCTGTGGCCTCACCAGGCAAGCTGAGGCCAGGAGGGACCCTGCCCAGTCTGTGAAAGCTACAGAGCAC
CAACCAGCAGAAGCCTGTGGACACCAAGTACGGTGTACAGAAAGCCAGTGGCTCCTTTCTCCCTTCCTCTTGGCCTCCAGATTTTGAATG
GTTCCTTGTTCTTTTCTATTGGTCCAACCCTGACGTTCTAAAAGGGCAAACAGTGGAGACGTCTGCTCTGAAATCCCTCATCCCTTAGTT
GGAAGCTGATTGGGTATCTTGGTGCTGCCTGTATTGGTCCCTTCTGACCACTCTCCTGCCTCCAGAGAAAGCTCTGCTTCACCCTGGAAG
CTGGTACCTTTACCTCCTCCTCTGGGAGTTGGCTGCATGGCCAGCACTGCCGACTTGATGGGAGCAGTTTGCCCTCATTCTCCTGTTTCA
GGTTTGCTTCCCTTCTCAGTGACCCTGGTGAGCATCCGCCTTTCCTGTTCTTGGATGAATTGATGGGAGTGGGGCTATTCTGTGCCTTCT
ACCTCTTTCTTCTCTACGTTGTTTCTAAGGATCTGCTGCTGCGGAACCCAAAGATGTGCTCCTGTCTCTGCACTGGCGCATTGGCATGGT
AGATGCCACAATGTATGTGCACGGCCTTTCTCAGAGACATTAGTTCTGAGGCCCTTTGTGGGGAGGTTAGGGGGATGGTAATAGAAAAAG
ACTATTTTATTTCCTGGCAATCACGGGTAAGGAGGATTAGGAATGAGTATTCCATTCCTAGGTGTCATCAGATGACCTTGACCACCACAA
TACCAGGCCCTCTTGGATGGACTTATAGAAAGTTAGAGAAGACCTTGTTGAACCGCTGCTAAACTTGCCACAGGAGCGATGTGTTTTCTC
TGAGTGCCCCTCACTTACATGTTTATCTTTGTTTGTAGAGGCTATGTTTAGGATATTTTGCCTGCATCAGAATGGGTGCATCATCTTTCT
TAATGGCCTAACTATCGGGAAATTTGAGTGTCAGTAACTGTGGTAGACTCAGAAATTCGTCTTTGTCTTGCCTCTGGTTCCTGGGATCCA
GTGATCTCTACTGGCCCAGGGCTTCAGCTCTTGGTTAATTTAGGTTCATGGGGAACCCTCTGACCACCTGAATGGGATGTCATAGCTTCT
AAATGGAGCTTCTGTGGAATGAAGTGCTAGACTGAAGGACTACCAGAATAAAACAGGGTCTACAATGGGGAGAACTTGTTTTATAGATGA
GGAAACCAAGGCTCAGAGGGGCAAAGTCACCTGCATGGTAGCACATAGTGATAGGGTAGCGATATAAATTTATCATATAAACCAGGACAT
CTCGGAATAAAAGGGGCTCTGTTAGTCATTATGTTGGGTAATAGCCGTGGCATTCCTACAGAACAGAGTGAGGACAGGCTCCTGATTCCT
CTTCCTTCTTTAGAGGAGAAGCGGGGAGTGGGTTAACTAACAGCTTTATTGAGATGTCATTCACATGCCATTCAGTTTACCCATTGCTAG
TGTCCAATTGTATTCACAGAACCACCATCAATTCACAGAATTACAGTCAACGTTGGTACATTTTCATCACCCCCAGTAAAACCCCGTACC
CTTGGTCTGTCACTCCTGCTTTCCTAACTCCTGCAGTCCAAGGCAGCCATGAATCTACTTTCTATGTAAGATTAACCTACTCTGGACATT
TCATATATCTGGAATCATGTGATATCTCTTTTGTGACTGGCTTCTTCCACTGAATGTTTTCTAGGGCCGTCCAAGTTGAGGATGTATCAG
TACTTCATTCTTTTGTATTGCTGAATAATACTTCATTGTATAGATAGACCACATTTGTTTATTGATTCATCAGTTGATGGACATTTGTGT
GTTTTTACTTTTTGGCTACTCTGAATGATGCTGCTATGAACATATTTCTACAAGATTTTGTGTGGACATATGTTTTCATTTCTTTTAGCA
ATATACATAGGAGTGGAATTGCTAGGTCTTACAGTAACTCCGTGTTTTAACTTTTTGAGAAACTGCCAGACTGTTTTCTATAGCAGCTGT
ACCATTTTACATTCCCACCAGCAATGTATCCAGGTTTCAATTTGTCTACATCCTCATCAACACTTGCTATTATCTGTCTTTTTGCTTTTA
GCATCCTAATGAGTATGAAATGCTATCTTGTGGTTTTGATTTGCATTCCCCTGATGGCAACTGATGCTGAGTGTCTTTTCCTGTGCTTAC
GGGCCATGCGTATTTCTTTGGAGAAAGGTCTATCCAGGTCCTTTGCCTATTTTTAATTGAGTTGTCTTTTTTTTTTTAAGTTTTCTGTTT
TCCTAACCACTAGACTACCAGGGATGAGCCTTCTTTTTATTATTGAGTTGGGTGAGCTATTTGTATATTCTAGACGCCAGTCTTTTATCA
GGTATATGACTGGTAAAAATGTTCTCCCCTTCTGTGGATTGTTTTCAGTTTCTTGTTGGTGTCCTTTGAGACACAAAACTTTTTAACTTT
GATGATTTCCAAGATACGTATTTTTTTTCTATTGTCACTTGTGCTTTTGGTGCCATATCTAGAAAACCATTGCCTAATCCAAGGTCAAGA
AGATTAATGCCTGTGTTTTCTTCTAAGAACTATACTTTTAGTTCTCACAATGGTCTTTGATCCATTTCGAGTATATTTTTATATATGATG
TGATGTAGGGGTCCAGCTTCATTCTTTTGCTTGTGGATCTCCACTTGTCCCACTGCTGATTATTGAGAAAAATATCCTTTCTCCACGGAA
TTGTCTTGGCATCCTTGCTAAAGGCCTCTGCTTCTTACTGGATCTTCTTTCCTGGGACATGGTGTCGTTGGGAAGCTTACCTTTTTTTTT
TTTTTACTTAGTCTGTGTTTGGTTCCACCAGTTTTATGCTGCCTTTCTACTCTGTTCTTGCTGTCTCCCTCTTTACCTGAGTCAACGGTA
CTGAGTCCTATCTCTCTCTGATGTTCCCCAGTCTTCCTTGGTGCATGTTCTAGCTCCACACACTAGTCCTTGGAGGAAGGTTGAGACCAA
TGATTTCCTGTTATGAGTCATGAGGAAACTGAATCACCTAGAAGTGGAATAATGTGCTCAGGGTCACCATAGCCCATTAGTGGAAGGACC
AGGACTAGACCTTTAGTCTTCTGAGGTCCAGCCCCTTAGGCTGTCTGTCATCACTGTACCCAAGTGATGTCACTACCAAGGCCAAATGAT
GGTGGGCTAAATTTTAATTCTCAAAAGTGTAGGAGGCTAATATTGTCTTCTAAGTTCCAAAAGAAGATGTAATAAAAGTCTGTTACCTTA
AGTGTGCTATTAGTAGAGTCTTCCATTTTTCTGGCATGCCCCTGGCATCTGCTCTTCTTACCTTCTCGTGGTTGTAGTTAAAGCTTATAG
CTTATGAAAGAATAGAAAATAATAAATACCAAAAAAAAGTACACATGGTAATTTGGTACCAAAATATCTCAGCTGCCTAATTTAGCAGCT
CATCCCTTCCACAGGGGTCAGATGAGCTAAAGCTCCAGGTTTTATTTTTCATTTGATTGACATACAGAAAAGCCATAGCCCTTCCCACAG
CTGTCCAGGGTCTTTCCTGTGAGTCCGGAGGTGCTGGCCTATTGAGCAGGACAGCTCTTCCCAGGGCATTCCCACCAACCTGTGGCTTCT
GAACTGTAGCTTCTTTTTACAGTGAACCCCAGAGGGAAATAAGACAGACACATGTGCTCAGGCCACCATCTTGAACTGGAAGCCCAAAGC
TGAGTTCCTTACTCTTAGGTCGTCACGGTTTTTGCGGGGTATCTGCAAGGTTGAGATAAACCCTTTCCTGTTTACCAGGTTGTCCTTTCT
GGATGAAGGGACAGAGGCTGTTGAATGGAGGAATAATAGGTTTGCTGGAGGAGGGGCATGGTATGCCTGTGGAAAGGACAGGATGGGGTG
GGGAGGTCGAGGCTTTGACTTGGGGTCCTAAACAAAGGTCAGGTGTTGCCCTAGTGACCTCTTGCCCAGACAGCCCAGAGCCCCTTACAC
AGAGCTATTAACCTAGGGAAGGCTTTACCAGCAGTGGACTGGAGCCAGCCAGGGTCACAAGTTTCCAAGTCCAGCATTGCTTCAGGGGCT
GGCCTGAGTAACTGAAGATCTGAAAATCATTAACAAGTCGATGAAATAAACGGAAAAGCCTCTTAGGCTGTTGTCAGTGGAGCAGAGGGA
GAAAGTCCCTAGGCGCTCAGAGGGGGTGAGAAAGCAGTGGATGATTGGGCGGGGGTGGGGGATTAGATGTTGACACTGCCTGGGGTGTAG
GAAGAGGAACAGAGAACCCAGAGTCAGGGTCCTAGATCCCAGACCCTCGCTCAGTATGAGTCTCTTTGCCTCTCTGGGTCTCTATCTCCT
CCTCTTACAAATACAGGCTTGGTGATCTCTGAAGATGGCACCAACCTGCCATGAAATGAATCTGAGGGGTTTTCCCATTTTTCCCTCCAT
CAAAATCGTACAAAAAGCTGGACGTGGTGGCCCATGCCTCTAATCCTAGCATTTTGGGAGGCCGAGGTGGGAGAATCACTTGACGCCAAG
AGTTCGAGACCAGCCTGGGCATCGTAGTGAGACTCCATCTCTGTCTTTTTGAAAATAAAAAATCTTTGAAAATTGCACAACAGGCAGGAG
ACCTTTACGTGTGCCCATCCTGGTTGTACACAGTGCCACCAGTGCTCCTGCAGTGCAAGGCGGCATGCTTCTTGACATGGGTCAGATTGT
GTCCATCGTGTCTTTGGGAATCAGCCCTAGCTCCTAACTGGGCTGACTACTTCCTCCGCAAACTTATGGGGGCTCCCAGATATTCCTTGC
CAGCCAGGGGCCAGACACAGTGCAGGCACAGTCTGTGTCATTGGTGCACATGTGCGTGTTTACATGTGTACCTGGGTTCCTTCCCTTGCC
CATGAATTTGCCATGAGCACAGCCAGAAGCAGCCTCAGCTTGGCAAGGTGTGGAGATGACTGCTGTTCCCTTCGCATTTGGGGAAAACAG
GCTCCCTCGGTAGCTCGATGATCCTCTTTTGATCTTGTGTGACCTCCTGGAGAGTGGATGAAGCTGGTGGCCTTAGCTTTTCTAGACAGT
GTAAGTGGCACTGGGCAAGGCCCCCAGAGCAGGGCAAGGTCTCTAGAGCGGGTCTCCCACATGACTGGCTTCACACAGGCACTTCCGCTC
GGGTTGCATGCTCTGTGTCATCTTACCGGTCCAGGGTTGCAGGTAGGAAATGTTTGTACCCTCTTCTGATTGCCACCTCCTTCCCATCGC
CCCTTAGGGACAGGGCTTGAGGGCCAGTGAGGCGCTGGTCAGGCACCCCAGGCCTCCTTGGGACCTGCCCAGGGGCACCCTGAGAGCTCC
TGAAACCCCCACTTAGCTTCCAGACCTTTCTGCAAAAGCTCCTCCTGGCTTTCCTCCCTCCCCCAATCTATGGGTCACAGCTAACAGATC
TGAGGGCAACTGCTGTGCTAGTGGCCAGGGCTGCACCTGCCATCCCCGGCTCTGCCACTTTAGGGCCTTCTAGAGGCAGTGTCCTTAGGA
AGTAGCTCTGAGGCATGGGTTTTCTGCTCCTGTGCAGGGCAGCTGATGGGATAAGGTGGGGAAGGACGGTCAGTGCTTGGGCCCCAGCTG
GCCAGCCTGGCGATGGGGAAACCAAACCATGTCCCCCAGCGAAGGGCCAGAGTGGGAACCTGTCCTCATGCCCTTCGTCCTGAGGAGCCC
TGAGGTGGGCAGCAGGGGCCAGGGGAAGTTTTCAGGCCTTCATCAAAGAGAACAACATCCTCAGCTCCGCACCCCTCATCCTGTATCAGC
ACTTACCGGTGTGTGACTGCCCTTGTCAGCTAGCATACGGTGGGCCCACCTGGCCCACTGGCTGTTTATGCCACTGATTTATGATAGGGA
ATATTATCTTTGAACCCAATGAAGTGTTTTCTCCCCCATCACAAAAAAAAAAATTCTTATTTTTAGTAGACATGTATTTACCAAAAATAT
GTACTCAATTATTGTATTTTGGATTTTATCAATTTAAAAATTGTGGAAATTTGTTTGCTCTTACGCCAACATAATATTGATTTTGCCTCT
TGGCTCTGAAAGCCCAAAATATTTACCGTCTAGCCCGTTACAGAAAAAGTCTGCTGACTACTGAGCCAGACCTCCATTACCTCCATCCCT
GTTGGATTATTTAAAGAAAGCCTCAGACAGTAAGGGCTTTTTTAAAAGAATAAAATGACTTGGTTTGCGCTTGGAAGCAGGGGAAGCATT
CAGATGAGCGGTTTCTGCATTAACCCTGCCTATCACGCATCTCGTGTCCTGTGTGGCTGGCGAGCCCCCCTTGGAAGGTTCTGGTGCTTC
AGCTGGCTCCTGCAGAGTCCACCCCGCCTCGTGGTGGGAATGCAGAGCCCTTTGCTTTCCTTCTTGCCGCCTGCTTCCTGTTCCTGGGGA
CCCGCTGGGCCTTTGGTCTGCATCCCCTGGCCAGGTCCCTCAGGGTTGATGCGTGGAGAAGGACTTTGAGCAGTGGTGGGCAGCAGTGGC
CTCCTGGCCAGCTCACACTCTTGTCCTGGGAGGGGCAGCCTGATCTCACCTCCACCTAGTACCTTGGGGACTGAGGACCTTTTGGCTTCT
CTGGAGCCTGCAAGCCTCTTCCCATGTGTCCAGCTGCTCTTCCTGCTACAAAGGGGACTGCTCACAGTGGCCTCAGCTTGGTGGTTTTGA
GGGGCCGCCCCCCGGCCCTCCATAAGGGTATCCTGGGCCTGAGAATTCTGCATCTGCCATTGGAGGATGGACAGCCTCAAATGGAAGGAG
TCCCACGGGAGATGGGTCCGAGGTCCGGCTGTGGCCATCCAGCCCCCTGTGGCTTGTCCAGCCTCTGTGCACCCCTGGTGTCTTCACTCC
AGGGGCAGACAGCAGCCACTGCAGTTCCTTTCTTCGTGAGTAACAGTAGTGATAGCAGCTGGGGCTAACAGGCTAGGCTTTGTGTTCTGC
GCATTTGGTCAGCTTCTCACTCGATCCTCCCTAAAGCAATGGGGAGGCCCCCACTAGCCCAGTTTTCAGGAAGTCAACTGGGAGGTTAGA
TGGGGGCCAGGGTCCCACAGCTACTGATGGCCCGAGCCAGGTTGAGCTTCCTGGTGTCCAGTCCGGATCCCACTTGCAGATCTCATGCTC
TCAGATAGGTGGGACAAGTTCTTTTGTCACAGTGCTGGCTCTGTCCTGAGGCCTCATTGCTGGCTGGGTGTGCTCTGCTGGGAAAAGCTT
TGCGGGGCTTGCTTGGTTAACCACAGAAGAGAAGGGGACTGTTTGGGGTGCCTCTCTGCAGCCTCCCCGTGCTGGGTGGAAGCACGGTTA
CTGTGTTCTCTAATGTTCATGTATTTAAAATGATTTCTTTCTAAAGATGTAACCTCCACACCTTTCTCCAGATTGGGTGACTCTTTTCTA
AAGGTGGTGGGAGTATCTGTCGGGGTGGTGTGGCCCTTGGATGGGTCAGGTGGGTGTGAGAGGTCCTGGGGAGGTGGGCGTTGAGCTCAA
AGTTGTCCTACTGCCATGTTTTTGTACCTGAAATAAAGCATATTTTGCACTTGTTACTGTACCATAGTGCGGACGAGAAGTCTGTATGTG

>96393_96393_1_UBP1-TFCP2L1_UBP1_chr3_33453072_ENST00000283628_TFCP2L1_chr2_122004546_ENST00000263707_length(amino acids)=496AA_BP=37
MAWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETEHPPFQYVMCAATSPAVKLHDETLTYLNQGQ
SYEIRMLDNRKMGDMPEINGKLVKSIIRVVFHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRT
SAFIQVHCISTEFTPRKHGGEKGVPFRVQIDTFKQNENGEYTEHLHSASCQIKVFKPKGADRKQKTDREKMEKRTAQEKEKYQPSYETTI
LTECSPWPDVAYQVNSAPSPSYNGSPNSFGLGEGNASPTHPVEALPVGSDHLLPSASIQDAQQWLHRNRFSQFCRLFASFSGADLLKMSR
DDLVQICGPADGIRLFNAIKGRNVRPKMTIYVCQELEQNRVPLQQKRDGSGDSNLSVYHAIFLEELTTLELIEKIANLYSISPQHIHRVY

--------------------------------------------------------------
>96393_96393_2_UBP1-TFCP2L1_UBP1_chr3_33453072_ENST00000283629_TFCP2L1_chr2_122004546_ENST00000263707_length(transcript)=9775nt_BP=1085nt
GTCGCGTTGAAGCCAGCGTCCGGCCGGCAGAGCGGCGTCGGCGGGGTTGGCCGGGGCGGTGTGGCGCTAGAGCCTGGGCGGGCCGGGGGC
AAGGAGTCAAGGCTTGGGGGCGGGTGACGGGCGGGCGGAGGGCTTGCCCCCGTGTGGTCCTCGGAGCGGCCCGAGGTCCAGCCTCCCAGC
GGTCCCGGAGGAGGATGCGACCGGGGGGCCCCGGGCGGGGCGGTCGGCGGCGGCTGCAGCGCTAGCGGCGCGGGGGCTGGGCGGCGGCGC
CGGCATGAGGCGCAGGGCCGGTGTCCGCGGGAGGGTGGCGCTGCCGGCGGCCCGGCCGCTCCCGCTGCAATTGCTCGCTGGGCTCGTCGT
GCCCTGCCAGTGGGGCCCCCGCAGCTCTCGTCCCGGCCGCCGCTGGTGACCACTCGCCGCCCCTCCGGAGGCTTCACCCGCGCCCTCCCC
CAGGACGCGCCAGCGGAGCTCCGGCTCCTTCGCCCTGGACGCGGAGGCCGCGGTGTGCGGGGCGACGGCGAGGCCGGAAGATGGCCTGGG
TGCTCAAGATGGACGAGGTGATCGAGTCCGGGCTGGTGCACGACTTCGACGCCAGCCTCTCGGGCATCGGGCAGGAACTGGGCGCCGGCG
CTTACAGCATGAGTGATGTCTTGGCATTGCCCATTTTCAAGCAGGAAGATTCCAGCCTTCCATTGGATGGTGAAACAGAGCACCCACCCT
TTCAGTATGTGATGTGTGCTGCAACGTCACCAGCAGTAAAACTGCATGATGAAACGCTTACTTATTTGAACCAAGGTCAGTCATATGAAA
TTCGGATGCTGGATAATCGGAAAATGGGTGATATGCCTGAGATCAATGGAAAATTAGTAAAGAGCATCATAAGGGTTGTATTCCATGACA
GACGGCTACAATACACAGAGCATCAGCAACTTGAAGGATGGAAGTGGAATCGCCCAGGAGACAGACTTCTTGATTTAGATATTCCAATGT
CTGTGGGAATAATTGACACAAGGACGAATCCAAGCCAGTTAAATGCGGTTGAATTTCTGTGGGACCCAGCAAAACGCACCTCTGCTTTCA
TTCAGGTACACTGCATCAGCACAGAATTCACCCCCAGGAAGCACGGGGGCGAGAAGGGAGTGCCCTTTCGAGTCCAGATTGACACGTTTA
AGCAGAACGAGAATGGGGAGTACACGGAGCACCTGCACTCAGCCAGCTGCCAGATCAAGGTGTTCAAGCCGAAGGGAGCCGATCGGAAAC
AGAAGACTGACCGGGAGAAGATGGAGAAAAGAACTGCCCAAGAGAAGGAGAAATACCAGCCGTCCTATGAAACCACCATCCTCACAGAGT
GCTCTCCATGGCCCGACGTGGCCTACCAGGTGAACAGCGCCCCGTCCCCAAGCTACAATGGTTCTCCAAACAGCTTTGGCCTCGGCGAAG
GCAACGCCTCTCCGACCCACCCGGTGGAGGCCCTGCCCGTGGGCAGTGACCACCTGCTCCCATCAGCTTCGATCCAGGATGCCCAGCAGT
GGCTTCACCGCAACAGGTTCTCGCAGTTCTGCCGGCTCTTTGCCAGCTTCTCAGGTGCTGACTTGCTGAAGATGTCCCGAGATGATTTGG
TCCAGATCTGTGGTCCCGCAGATGGGATCCGGCTCTTCAACGCCATCAAAGGCCGGAATGTGAGGCCAAAGATGACCATTTATGTCTGTC
AGGAGCTGGAGCAGAATCGAGTGCCCCTGCAGCAGAAGCGGGACGGCAGTGGAGACAGCAACCTGTCTGTGTACCACGCCATCTTCCTGG
AAGAGCTGACCACCTTGGAGCTGATTGAGAAGATCGCCAACCTGTACAGCATCTCCCCCCAGCACATCCACCGAGTCTACCGGCAGGGCC
CCACGGGCATCCATGTGGTGGTGAGCAACGAGATGGTGCAGAACTTCCAAGATGAATCCTGTTTTGTCCTCAGCACAATTAAAGCTGAGA
GCAATGATGGCTACCACATCATCCTGAAATGTGGACTCTGAGCAGCAGTGGACCTCATACCTGTCTCCAGCTCCCAGCCCTGTGGATCCC
CGTGGATGTAGACATTGCCCCACTGTAAGCTGTGGCCTCACCAGGCAAGCTGAGGCCAGGAGGGACCCTGCCCAGTCTGTGAAAGCTACA
GAGCACCAACCAGCAGAAGCCTGTGGACACCAAGTACGGTGTACAGAAAGCCAGTGGCTCCTTTCTCCCTTCCTCTTGGCCTCCAGATTT
TGAATGGTTCCTTGTTCTTTTCTATTGGTCCAACCCTGACGTTCTAAAAGGGCAAACAGTGGAGACGTCTGCTCTGAAATCCCTCATCCC
TTAGTTGGAAGCTGATTGGGTATCTTGGTGCTGCCTGTATTGGTCCCTTCTGACCACTCTCCTGCCTCCAGAGAAAGCTCTGCTTCACCC
TGGAAGCTGGTACCTTTACCTCCTCCTCTGGGAGTTGGCTGCATGGCCAGCACTGCCGACTTGATGGGAGCAGTTTGCCCTCATTCTCCT
GTTTCAGGTTTGCTTCCCTTCTCAGTGACCCTGGTGAGCATCCGCCTTTCCTGTTCTTGGATGAATTGATGGGAGTGGGGCTATTCTGTG
CCTTCTACCTCTTTCTTCTCTACGTTGTTTCTAAGGATCTGCTGCTGCGGAACCCAAAGATGTGCTCCTGTCTCTGCACTGGCGCATTGG
CATGGTAGATGCCACAATGTATGTGCACGGCCTTTCTCAGAGACATTAGTTCTGAGGCCCTTTGTGGGGAGGTTAGGGGGATGGTAATAG
AAAAAGACTATTTTATTTCCTGGCAATCACGGGTAAGGAGGATTAGGAATGAGTATTCCATTCCTAGGTGTCATCAGATGACCTTGACCA
CCACAATACCAGGCCCTCTTGGATGGACTTATAGAAAGTTAGAGAAGACCTTGTTGAACCGCTGCTAAACTTGCCACAGGAGCGATGTGT
TTTCTCTGAGTGCCCCTCACTTACATGTTTATCTTTGTTTGTAGAGGCTATGTTTAGGATATTTTGCCTGCATCAGAATGGGTGCATCAT
CTTTCTTAATGGCCTAACTATCGGGAAATTTGAGTGTCAGTAACTGTGGTAGACTCAGAAATTCGTCTTTGTCTTGCCTCTGGTTCCTGG
GATCCAGTGATCTCTACTGGCCCAGGGCTTCAGCTCTTGGTTAATTTAGGTTCATGGGGAACCCTCTGACCACCTGAATGGGATGTCATA
GCTTCTAAATGGAGCTTCTGTGGAATGAAGTGCTAGACTGAAGGACTACCAGAATAAAACAGGGTCTACAATGGGGAGAACTTGTTTTAT
AGATGAGGAAACCAAGGCTCAGAGGGGCAAAGTCACCTGCATGGTAGCACATAGTGATAGGGTAGCGATATAAATTTATCATATAAACCA
GGACATCTCGGAATAAAAGGGGCTCTGTTAGTCATTATGTTGGGTAATAGCCGTGGCATTCCTACAGAACAGAGTGAGGACAGGCTCCTG
ATTCCTCTTCCTTCTTTAGAGGAGAAGCGGGGAGTGGGTTAACTAACAGCTTTATTGAGATGTCATTCACATGCCATTCAGTTTACCCAT
TGCTAGTGTCCAATTGTATTCACAGAACCACCATCAATTCACAGAATTACAGTCAACGTTGGTACATTTTCATCACCCCCAGTAAAACCC
CGTACCCTTGGTCTGTCACTCCTGCTTTCCTAACTCCTGCAGTCCAAGGCAGCCATGAATCTACTTTCTATGTAAGATTAACCTACTCTG
GACATTTCATATATCTGGAATCATGTGATATCTCTTTTGTGACTGGCTTCTTCCACTGAATGTTTTCTAGGGCCGTCCAAGTTGAGGATG
TATCAGTACTTCATTCTTTTGTATTGCTGAATAATACTTCATTGTATAGATAGACCACATTTGTTTATTGATTCATCAGTTGATGGACAT
TTGTGTGTTTTTACTTTTTGGCTACTCTGAATGATGCTGCTATGAACATATTTCTACAAGATTTTGTGTGGACATATGTTTTCATTTCTT
TTAGCAATATACATAGGAGTGGAATTGCTAGGTCTTACAGTAACTCCGTGTTTTAACTTTTTGAGAAACTGCCAGACTGTTTTCTATAGC
AGCTGTACCATTTTACATTCCCACCAGCAATGTATCCAGGTTTCAATTTGTCTACATCCTCATCAACACTTGCTATTATCTGTCTTTTTG
CTTTTAGCATCCTAATGAGTATGAAATGCTATCTTGTGGTTTTGATTTGCATTCCCCTGATGGCAACTGATGCTGAGTGTCTTTTCCTGT
GCTTACGGGCCATGCGTATTTCTTTGGAGAAAGGTCTATCCAGGTCCTTTGCCTATTTTTAATTGAGTTGTCTTTTTTTTTTTAAGTTTT
CTGTTTTCCTAACCACTAGACTACCAGGGATGAGCCTTCTTTTTATTATTGAGTTGGGTGAGCTATTTGTATATTCTAGACGCCAGTCTT
TTATCAGGTATATGACTGGTAAAAATGTTCTCCCCTTCTGTGGATTGTTTTCAGTTTCTTGTTGGTGTCCTTTGAGACACAAAACTTTTT
AACTTTGATGATTTCCAAGATACGTATTTTTTTTCTATTGTCACTTGTGCTTTTGGTGCCATATCTAGAAAACCATTGCCTAATCCAAGG
TCAAGAAGATTAATGCCTGTGTTTTCTTCTAAGAACTATACTTTTAGTTCTCACAATGGTCTTTGATCCATTTCGAGTATATTTTTATAT
ATGATGTGATGTAGGGGTCCAGCTTCATTCTTTTGCTTGTGGATCTCCACTTGTCCCACTGCTGATTATTGAGAAAAATATCCTTTCTCC
ACGGAATTGTCTTGGCATCCTTGCTAAAGGCCTCTGCTTCTTACTGGATCTTCTTTCCTGGGACATGGTGTCGTTGGGAAGCTTACCTTT
TTTTTTTTTTTACTTAGTCTGTGTTTGGTTCCACCAGTTTTATGCTGCCTTTCTACTCTGTTCTTGCTGTCTCCCTCTTTACCTGAGTCA
ACGGTACTGAGTCCTATCTCTCTCTGATGTTCCCCAGTCTTCCTTGGTGCATGTTCTAGCTCCACACACTAGTCCTTGGAGGAAGGTTGA
GACCAATGATTTCCTGTTATGAGTCATGAGGAAACTGAATCACCTAGAAGTGGAATAATGTGCTCAGGGTCACCATAGCCCATTAGTGGA
AGGACCAGGACTAGACCTTTAGTCTTCTGAGGTCCAGCCCCTTAGGCTGTCTGTCATCACTGTACCCAAGTGATGTCACTACCAAGGCCA
AATGATGGTGGGCTAAATTTTAATTCTCAAAAGTGTAGGAGGCTAATATTGTCTTCTAAGTTCCAAAAGAAGATGTAATAAAAGTCTGTT
ACCTTAAGTGTGCTATTAGTAGAGTCTTCCATTTTTCTGGCATGCCCCTGGCATCTGCTCTTCTTACCTTCTCGTGGTTGTAGTTAAAGC
TTATAGCTTATGAAAGAATAGAAAATAATAAATACCAAAAAAAAGTACACATGGTAATTTGGTACCAAAATATCTCAGCTGCCTAATTTA
GCAGCTCATCCCTTCCACAGGGGTCAGATGAGCTAAAGCTCCAGGTTTTATTTTTCATTTGATTGACATACAGAAAAGCCATAGCCCTTC
CCACAGCTGTCCAGGGTCTTTCCTGTGAGTCCGGAGGTGCTGGCCTATTGAGCAGGACAGCTCTTCCCAGGGCATTCCCACCAACCTGTG
GCTTCTGAACTGTAGCTTCTTTTTACAGTGAACCCCAGAGGGAAATAAGACAGACACATGTGCTCAGGCCACCATCTTGAACTGGAAGCC
CAAAGCTGAGTTCCTTACTCTTAGGTCGTCACGGTTTTTGCGGGGTATCTGCAAGGTTGAGATAAACCCTTTCCTGTTTACCAGGTTGTC
CTTTCTGGATGAAGGGACAGAGGCTGTTGAATGGAGGAATAATAGGTTTGCTGGAGGAGGGGCATGGTATGCCTGTGGAAAGGACAGGAT
GGGGTGGGGAGGTCGAGGCTTTGACTTGGGGTCCTAAACAAAGGTCAGGTGTTGCCCTAGTGACCTCTTGCCCAGACAGCCCAGAGCCCC
TTACACAGAGCTATTAACCTAGGGAAGGCTTTACCAGCAGTGGACTGGAGCCAGCCAGGGTCACAAGTTTCCAAGTCCAGCATTGCTTCA
GGGGCTGGCCTGAGTAACTGAAGATCTGAAAATCATTAACAAGTCGATGAAATAAACGGAAAAGCCTCTTAGGCTGTTGTCAGTGGAGCA
GAGGGAGAAAGTCCCTAGGCGCTCAGAGGGGGTGAGAAAGCAGTGGATGATTGGGCGGGGGTGGGGGATTAGATGTTGACACTGCCTGGG
GTGTAGGAAGAGGAACAGAGAACCCAGAGTCAGGGTCCTAGATCCCAGACCCTCGCTCAGTATGAGTCTCTTTGCCTCTCTGGGTCTCTA
TCTCCTCCTCTTACAAATACAGGCTTGGTGATCTCTGAAGATGGCACCAACCTGCCATGAAATGAATCTGAGGGGTTTTCCCATTTTTCC
CTCCATCAAAATCGTACAAAAAGCTGGACGTGGTGGCCCATGCCTCTAATCCTAGCATTTTGGGAGGCCGAGGTGGGAGAATCACTTGAC
GCCAAGAGTTCGAGACCAGCCTGGGCATCGTAGTGAGACTCCATCTCTGTCTTTTTGAAAATAAAAAATCTTTGAAAATTGCACAACAGG
CAGGAGACCTTTACGTGTGCCCATCCTGGTTGTACACAGTGCCACCAGTGCTCCTGCAGTGCAAGGCGGCATGCTTCTTGACATGGGTCA
GATTGTGTCCATCGTGTCTTTGGGAATCAGCCCTAGCTCCTAACTGGGCTGACTACTTCCTCCGCAAACTTATGGGGGCTCCCAGATATT
CCTTGCCAGCCAGGGGCCAGACACAGTGCAGGCACAGTCTGTGTCATTGGTGCACATGTGCGTGTTTACATGTGTACCTGGGTTCCTTCC
CTTGCCCATGAATTTGCCATGAGCACAGCCAGAAGCAGCCTCAGCTTGGCAAGGTGTGGAGATGACTGCTGTTCCCTTCGCATTTGGGGA
AAACAGGCTCCCTCGGTAGCTCGATGATCCTCTTTTGATCTTGTGTGACCTCCTGGAGAGTGGATGAAGCTGGTGGCCTTAGCTTTTCTA
GACAGTGTAAGTGGCACTGGGCAAGGCCCCCAGAGCAGGGCAAGGTCTCTAGAGCGGGTCTCCCACATGACTGGCTTCACACAGGCACTT
CCGCTCGGGTTGCATGCTCTGTGTCATCTTACCGGTCCAGGGTTGCAGGTAGGAAATGTTTGTACCCTCTTCTGATTGCCACCTCCTTCC
CATCGCCCCTTAGGGACAGGGCTTGAGGGCCAGTGAGGCGCTGGTCAGGCACCCCAGGCCTCCTTGGGACCTGCCCAGGGGCACCCTGAG
AGCTCCTGAAACCCCCACTTAGCTTCCAGACCTTTCTGCAAAAGCTCCTCCTGGCTTTCCTCCCTCCCCCAATCTATGGGTCACAGCTAA
CAGATCTGAGGGCAACTGCTGTGCTAGTGGCCAGGGCTGCACCTGCCATCCCCGGCTCTGCCACTTTAGGGCCTTCTAGAGGCAGTGTCC
TTAGGAAGTAGCTCTGAGGCATGGGTTTTCTGCTCCTGTGCAGGGCAGCTGATGGGATAAGGTGGGGAAGGACGGTCAGTGCTTGGGCCC
CAGCTGGCCAGCCTGGCGATGGGGAAACCAAACCATGTCCCCCAGCGAAGGGCCAGAGTGGGAACCTGTCCTCATGCCCTTCGTCCTGAG
GAGCCCTGAGGTGGGCAGCAGGGGCCAGGGGAAGTTTTCAGGCCTTCATCAAAGAGAACAACATCCTCAGCTCCGCACCCCTCATCCTGT
ATCAGCACTTACCGGTGTGTGACTGCCCTTGTCAGCTAGCATACGGTGGGCCCACCTGGCCCACTGGCTGTTTATGCCACTGATTTATGA
TAGGGAATATTATCTTTGAACCCAATGAAGTGTTTTCTCCCCCATCACAAAAAAAAAAATTCTTATTTTTAGTAGACATGTATTTACCAA
AAATATGTACTCAATTATTGTATTTTGGATTTTATCAATTTAAAAATTGTGGAAATTTGTTTGCTCTTACGCCAACATAATATTGATTTT
GCCTCTTGGCTCTGAAAGCCCAAAATATTTACCGTCTAGCCCGTTACAGAAAAAGTCTGCTGACTACTGAGCCAGACCTCCATTACCTCC
ATCCCTGTTGGATTATTTAAAGAAAGCCTCAGACAGTAAGGGCTTTTTTAAAAGAATAAAATGACTTGGTTTGCGCTTGGAAGCAGGGGA
AGCATTCAGATGAGCGGTTTCTGCATTAACCCTGCCTATCACGCATCTCGTGTCCTGTGTGGCTGGCGAGCCCCCCTTGGAAGGTTCTGG
TGCTTCAGCTGGCTCCTGCAGAGTCCACCCCGCCTCGTGGTGGGAATGCAGAGCCCTTTGCTTTCCTTCTTGCCGCCTGCTTCCTGTTCC
TGGGGACCCGCTGGGCCTTTGGTCTGCATCCCCTGGCCAGGTCCCTCAGGGTTGATGCGTGGAGAAGGACTTTGAGCAGTGGTGGGCAGC
AGTGGCCTCCTGGCCAGCTCACACTCTTGTCCTGGGAGGGGCAGCCTGATCTCACCTCCACCTAGTACCTTGGGGACTGAGGACCTTTTG
GCTTCTCTGGAGCCTGCAAGCCTCTTCCCATGTGTCCAGCTGCTCTTCCTGCTACAAAGGGGACTGCTCACAGTGGCCTCAGCTTGGTGG
TTTTGAGGGGCCGCCCCCCGGCCCTCCATAAGGGTATCCTGGGCCTGAGAATTCTGCATCTGCCATTGGAGGATGGACAGCCTCAAATGG
AAGGAGTCCCACGGGAGATGGGTCCGAGGTCCGGCTGTGGCCATCCAGCCCCCTGTGGCTTGTCCAGCCTCTGTGCACCCCTGGTGTCTT
CACTCCAGGGGCAGACAGCAGCCACTGCAGTTCCTTTCTTCGTGAGTAACAGTAGTGATAGCAGCTGGGGCTAACAGGCTAGGCTTTGTG
TTCTGCGCATTTGGTCAGCTTCTCACTCGATCCTCCCTAAAGCAATGGGGAGGCCCCCACTAGCCCAGTTTTCAGGAAGTCAACTGGGAG
GTTAGATGGGGGCCAGGGTCCCACAGCTACTGATGGCCCGAGCCAGGTTGAGCTTCCTGGTGTCCAGTCCGGATCCCACTTGCAGATCTC
ATGCTCTCAGATAGGTGGGACAAGTTCTTTTGTCACAGTGCTGGCTCTGTCCTGAGGCCTCATTGCTGGCTGGGTGTGCTCTGCTGGGAA
AAGCTTTGCGGGGCTTGCTTGGTTAACCACAGAAGAGAAGGGGACTGTTTGGGGTGCCTCTCTGCAGCCTCCCCGTGCTGGGTGGAAGCA
CGGTTACTGTGTTCTCTAATGTTCATGTATTTAAAATGATTTCTTTCTAAAGATGTAACCTCCACACCTTTCTCCAGATTGGGTGACTCT
TTTCTAAAGGTGGTGGGAGTATCTGTCGGGGTGGTGTGGCCCTTGGATGGGTCAGGTGGGTGTGAGAGGTCCTGGGGAGGTGGGCGTTGA
GCTCAAAGTTGTCCTACTGCCATGTTTTTGTACCTGAAATAAAGCATATTTTGCACTTGTTACTGTACCATAGTGCGGACGAGAAGTCTG

>96393_96393_2_UBP1-TFCP2L1_UBP1_chr3_33453072_ENST00000283629_TFCP2L1_chr2_122004546_ENST00000263707_length(amino acids)=557AA_BP=98
MGSSCPASGAPAALVPAAAGDHSPPLRRLHPRPPPGRASGAPAPSPWTRRPRCAGRRRGRKMAWVLKMDEVIESGLVHDFDASLSGIGQE
LGAGAYSMSDVLALPIFKQEDSSLPLDGETEHPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNRKMGDMPEINGKLVKSIIRV
VFHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRTSAFIQVHCISTEFTPRKHGGEKGVPFRVQ
IDTFKQNENGEYTEHLHSASCQIKVFKPKGADRKQKTDREKMEKRTAQEKEKYQPSYETTILTECSPWPDVAYQVNSAPSPSYNGSPNSF
GLGEGNASPTHPVEALPVGSDHLLPSASIQDAQQWLHRNRFSQFCRLFASFSGADLLKMSRDDLVQICGPADGIRLFNAIKGRNVRPKMT
IYVCQELEQNRVPLQQKRDGSGDSNLSVYHAIFLEELTTLELIEKIANLYSISPQHIHRVYRQGPTGIHVVVSNEMVQNFQDESCFVLST

--------------------------------------------------------------
>96393_96393_3_UBP1-TFCP2L1_UBP1_chr3_33453072_ENST00000447368_TFCP2L1_chr2_122004546_ENST00000263707_length(transcript)=9407nt_BP=717nt
AGTGGGGCCCCCGCAGCTCTCGTCCCGGCCGCCGCTGGTGACCACTCGCCGCCCCTCCGGAGGCTTCACCCGCGCCCTCCCCCAGGACGC
GCCAGCGGAGCTCCGGCTCCTTCGCCCTGGACGCGGAGGCCGCGGTGTGCGGGGCGACGGCGAGGCCGGAAGATGGCCTGGGTGCTCAAG
ATGGACGAGGTGATCGAGTCCGGGCTGGTGCACGACTTCGACGCCAGCCTCTCGGGCATCGGGCAGGAACTGGGCGCCGGCGCTTACAGC
ATGAGTGATGTCTTGGCATTGCCCATTTTCAAGCAGGAAGATTCCAGCCTTCCATTGGATGGTGAAACAGAGCACCCACCCTTTCAGTAT
GTGATGTGTGCTGCAACGTCACCAGCAGTAAAACTGCATGATGAAACGCTTACTTATTTGAACCAAGGTCAGTCATATGAAATTCGGATG
CTGGATAATCGGAAAATGGGTGATATGCCTGAGATCAATGGAAAATTAGTAAAGAGCATCATAAGGGTTGTATTCCATGACAGACGGCTA
CAATACACAGAGCATCAGCAACTTGAAGGATGGAAGTGGAATCGCCCAGGAGACAGACTTCTTGATTTAGATATTCCAATGTCTGTGGGA
ATAATTGACACAAGGACGAATCCAAGCCAGTTAAATGCGGTTGAATTTCTGTGGGACCCAGCAAAACGCACCTCTGCTTTCATTCAGGTA
CACTGCATCAGCACAGAATTCACCCCCAGGAAGCACGGGGGCGAGAAGGGAGTGCCCTTTCGAGTCCAGATTGACACGTTTAAGCAGAAC
GAGAATGGGGAGTACACGGAGCACCTGCACTCAGCCAGCTGCCAGATCAAGGTGTTCAAGCCGAAGGGAGCCGATCGGAAACAGAAGACT
GACCGGGAGAAGATGGAGAAAAGAACTGCCCAAGAGAAGGAGAAATACCAGCCGTCCTATGAAACCACCATCCTCACAGAGTGCTCTCCA
TGGCCCGACGTGGCCTACCAGGTGAACAGCGCCCCGTCCCCAAGCTACAATGGTTCTCCAAACAGCTTTGGCCTCGGCGAAGGCAACGCC
TCTCCGACCCACCCGGTGGAGGCCCTGCCCGTGGGCAGTGACCACCTGCTCCCATCAGCTTCGATCCAGGATGCCCAGCAGTGGCTTCAC
CGCAACAGGTTCTCGCAGTTCTGCCGGCTCTTTGCCAGCTTCTCAGGTGCTGACTTGCTGAAGATGTCCCGAGATGATTTGGTCCAGATC
TGTGGTCCCGCAGATGGGATCCGGCTCTTCAACGCCATCAAAGGCCGGAATGTGAGGCCAAAGATGACCATTTATGTCTGTCAGGAGCTG
GAGCAGAATCGAGTGCCCCTGCAGCAGAAGCGGGACGGCAGTGGAGACAGCAACCTGTCTGTGTACCACGCCATCTTCCTGGAAGAGCTG
ACCACCTTGGAGCTGATTGAGAAGATCGCCAACCTGTACAGCATCTCCCCCCAGCACATCCACCGAGTCTACCGGCAGGGCCCCACGGGC
ATCCATGTGGTGGTGAGCAACGAGATGGTGCAGAACTTCCAAGATGAATCCTGTTTTGTCCTCAGCACAATTAAAGCTGAGAGCAATGAT
GGCTACCACATCATCCTGAAATGTGGACTCTGAGCAGCAGTGGACCTCATACCTGTCTCCAGCTCCCAGCCCTGTGGATCCCCGTGGATG
TAGACATTGCCCCACTGTAAGCTGTGGCCTCACCAGGCAAGCTGAGGCCAGGAGGGACCCTGCCCAGTCTGTGAAAGCTACAGAGCACCA
ACCAGCAGAAGCCTGTGGACACCAAGTACGGTGTACAGAAAGCCAGTGGCTCCTTTCTCCCTTCCTCTTGGCCTCCAGATTTTGAATGGT
TCCTTGTTCTTTTCTATTGGTCCAACCCTGACGTTCTAAAAGGGCAAACAGTGGAGACGTCTGCTCTGAAATCCCTCATCCCTTAGTTGG
AAGCTGATTGGGTATCTTGGTGCTGCCTGTATTGGTCCCTTCTGACCACTCTCCTGCCTCCAGAGAAAGCTCTGCTTCACCCTGGAAGCT
GGTACCTTTACCTCCTCCTCTGGGAGTTGGCTGCATGGCCAGCACTGCCGACTTGATGGGAGCAGTTTGCCCTCATTCTCCTGTTTCAGG
TTTGCTTCCCTTCTCAGTGACCCTGGTGAGCATCCGCCTTTCCTGTTCTTGGATGAATTGATGGGAGTGGGGCTATTCTGTGCCTTCTAC
CTCTTTCTTCTCTACGTTGTTTCTAAGGATCTGCTGCTGCGGAACCCAAAGATGTGCTCCTGTCTCTGCACTGGCGCATTGGCATGGTAG
ATGCCACAATGTATGTGCACGGCCTTTCTCAGAGACATTAGTTCTGAGGCCCTTTGTGGGGAGGTTAGGGGGATGGTAATAGAAAAAGAC
TATTTTATTTCCTGGCAATCACGGGTAAGGAGGATTAGGAATGAGTATTCCATTCCTAGGTGTCATCAGATGACCTTGACCACCACAATA
CCAGGCCCTCTTGGATGGACTTATAGAAAGTTAGAGAAGACCTTGTTGAACCGCTGCTAAACTTGCCACAGGAGCGATGTGTTTTCTCTG
AGTGCCCCTCACTTACATGTTTATCTTTGTTTGTAGAGGCTATGTTTAGGATATTTTGCCTGCATCAGAATGGGTGCATCATCTTTCTTA
ATGGCCTAACTATCGGGAAATTTGAGTGTCAGTAACTGTGGTAGACTCAGAAATTCGTCTTTGTCTTGCCTCTGGTTCCTGGGATCCAGT
GATCTCTACTGGCCCAGGGCTTCAGCTCTTGGTTAATTTAGGTTCATGGGGAACCCTCTGACCACCTGAATGGGATGTCATAGCTTCTAA
ATGGAGCTTCTGTGGAATGAAGTGCTAGACTGAAGGACTACCAGAATAAAACAGGGTCTACAATGGGGAGAACTTGTTTTATAGATGAGG
AAACCAAGGCTCAGAGGGGCAAAGTCACCTGCATGGTAGCACATAGTGATAGGGTAGCGATATAAATTTATCATATAAACCAGGACATCT
CGGAATAAAAGGGGCTCTGTTAGTCATTATGTTGGGTAATAGCCGTGGCATTCCTACAGAACAGAGTGAGGACAGGCTCCTGATTCCTCT
TCCTTCTTTAGAGGAGAAGCGGGGAGTGGGTTAACTAACAGCTTTATTGAGATGTCATTCACATGCCATTCAGTTTACCCATTGCTAGTG
TCCAATTGTATTCACAGAACCACCATCAATTCACAGAATTACAGTCAACGTTGGTACATTTTCATCACCCCCAGTAAAACCCCGTACCCT
TGGTCTGTCACTCCTGCTTTCCTAACTCCTGCAGTCCAAGGCAGCCATGAATCTACTTTCTATGTAAGATTAACCTACTCTGGACATTTC
ATATATCTGGAATCATGTGATATCTCTTTTGTGACTGGCTTCTTCCACTGAATGTTTTCTAGGGCCGTCCAAGTTGAGGATGTATCAGTA
CTTCATTCTTTTGTATTGCTGAATAATACTTCATTGTATAGATAGACCACATTTGTTTATTGATTCATCAGTTGATGGACATTTGTGTGT
TTTTACTTTTTGGCTACTCTGAATGATGCTGCTATGAACATATTTCTACAAGATTTTGTGTGGACATATGTTTTCATTTCTTTTAGCAAT
ATACATAGGAGTGGAATTGCTAGGTCTTACAGTAACTCCGTGTTTTAACTTTTTGAGAAACTGCCAGACTGTTTTCTATAGCAGCTGTAC
CATTTTACATTCCCACCAGCAATGTATCCAGGTTTCAATTTGTCTACATCCTCATCAACACTTGCTATTATCTGTCTTTTTGCTTTTAGC
ATCCTAATGAGTATGAAATGCTATCTTGTGGTTTTGATTTGCATTCCCCTGATGGCAACTGATGCTGAGTGTCTTTTCCTGTGCTTACGG
GCCATGCGTATTTCTTTGGAGAAAGGTCTATCCAGGTCCTTTGCCTATTTTTAATTGAGTTGTCTTTTTTTTTTTAAGTTTTCTGTTTTC
CTAACCACTAGACTACCAGGGATGAGCCTTCTTTTTATTATTGAGTTGGGTGAGCTATTTGTATATTCTAGACGCCAGTCTTTTATCAGG
TATATGACTGGTAAAAATGTTCTCCCCTTCTGTGGATTGTTTTCAGTTTCTTGTTGGTGTCCTTTGAGACACAAAACTTTTTAACTTTGA
TGATTTCCAAGATACGTATTTTTTTTCTATTGTCACTTGTGCTTTTGGTGCCATATCTAGAAAACCATTGCCTAATCCAAGGTCAAGAAG
ATTAATGCCTGTGTTTTCTTCTAAGAACTATACTTTTAGTTCTCACAATGGTCTTTGATCCATTTCGAGTATATTTTTATATATGATGTG
ATGTAGGGGTCCAGCTTCATTCTTTTGCTTGTGGATCTCCACTTGTCCCACTGCTGATTATTGAGAAAAATATCCTTTCTCCACGGAATT
GTCTTGGCATCCTTGCTAAAGGCCTCTGCTTCTTACTGGATCTTCTTTCCTGGGACATGGTGTCGTTGGGAAGCTTACCTTTTTTTTTTT
TTTACTTAGTCTGTGTTTGGTTCCACCAGTTTTATGCTGCCTTTCTACTCTGTTCTTGCTGTCTCCCTCTTTACCTGAGTCAACGGTACT
GAGTCCTATCTCTCTCTGATGTTCCCCAGTCTTCCTTGGTGCATGTTCTAGCTCCACACACTAGTCCTTGGAGGAAGGTTGAGACCAATG
ATTTCCTGTTATGAGTCATGAGGAAACTGAATCACCTAGAAGTGGAATAATGTGCTCAGGGTCACCATAGCCCATTAGTGGAAGGACCAG
GACTAGACCTTTAGTCTTCTGAGGTCCAGCCCCTTAGGCTGTCTGTCATCACTGTACCCAAGTGATGTCACTACCAAGGCCAAATGATGG
TGGGCTAAATTTTAATTCTCAAAAGTGTAGGAGGCTAATATTGTCTTCTAAGTTCCAAAAGAAGATGTAATAAAAGTCTGTTACCTTAAG
TGTGCTATTAGTAGAGTCTTCCATTTTTCTGGCATGCCCCTGGCATCTGCTCTTCTTACCTTCTCGTGGTTGTAGTTAAAGCTTATAGCT
TATGAAAGAATAGAAAATAATAAATACCAAAAAAAAGTACACATGGTAATTTGGTACCAAAATATCTCAGCTGCCTAATTTAGCAGCTCA
TCCCTTCCACAGGGGTCAGATGAGCTAAAGCTCCAGGTTTTATTTTTCATTTGATTGACATACAGAAAAGCCATAGCCCTTCCCACAGCT
GTCCAGGGTCTTTCCTGTGAGTCCGGAGGTGCTGGCCTATTGAGCAGGACAGCTCTTCCCAGGGCATTCCCACCAACCTGTGGCTTCTGA
ACTGTAGCTTCTTTTTACAGTGAACCCCAGAGGGAAATAAGACAGACACATGTGCTCAGGCCACCATCTTGAACTGGAAGCCCAAAGCTG
AGTTCCTTACTCTTAGGTCGTCACGGTTTTTGCGGGGTATCTGCAAGGTTGAGATAAACCCTTTCCTGTTTACCAGGTTGTCCTTTCTGG
ATGAAGGGACAGAGGCTGTTGAATGGAGGAATAATAGGTTTGCTGGAGGAGGGGCATGGTATGCCTGTGGAAAGGACAGGATGGGGTGGG
GAGGTCGAGGCTTTGACTTGGGGTCCTAAACAAAGGTCAGGTGTTGCCCTAGTGACCTCTTGCCCAGACAGCCCAGAGCCCCTTACACAG
AGCTATTAACCTAGGGAAGGCTTTACCAGCAGTGGACTGGAGCCAGCCAGGGTCACAAGTTTCCAAGTCCAGCATTGCTTCAGGGGCTGG
CCTGAGTAACTGAAGATCTGAAAATCATTAACAAGTCGATGAAATAAACGGAAAAGCCTCTTAGGCTGTTGTCAGTGGAGCAGAGGGAGA
AAGTCCCTAGGCGCTCAGAGGGGGTGAGAAAGCAGTGGATGATTGGGCGGGGGTGGGGGATTAGATGTTGACACTGCCTGGGGTGTAGGA
AGAGGAACAGAGAACCCAGAGTCAGGGTCCTAGATCCCAGACCCTCGCTCAGTATGAGTCTCTTTGCCTCTCTGGGTCTCTATCTCCTCC
TCTTACAAATACAGGCTTGGTGATCTCTGAAGATGGCACCAACCTGCCATGAAATGAATCTGAGGGGTTTTCCCATTTTTCCCTCCATCA
AAATCGTACAAAAAGCTGGACGTGGTGGCCCATGCCTCTAATCCTAGCATTTTGGGAGGCCGAGGTGGGAGAATCACTTGACGCCAAGAG
TTCGAGACCAGCCTGGGCATCGTAGTGAGACTCCATCTCTGTCTTTTTGAAAATAAAAAATCTTTGAAAATTGCACAACAGGCAGGAGAC
CTTTACGTGTGCCCATCCTGGTTGTACACAGTGCCACCAGTGCTCCTGCAGTGCAAGGCGGCATGCTTCTTGACATGGGTCAGATTGTGT
CCATCGTGTCTTTGGGAATCAGCCCTAGCTCCTAACTGGGCTGACTACTTCCTCCGCAAACTTATGGGGGCTCCCAGATATTCCTTGCCA
GCCAGGGGCCAGACACAGTGCAGGCACAGTCTGTGTCATTGGTGCACATGTGCGTGTTTACATGTGTACCTGGGTTCCTTCCCTTGCCCA
TGAATTTGCCATGAGCACAGCCAGAAGCAGCCTCAGCTTGGCAAGGTGTGGAGATGACTGCTGTTCCCTTCGCATTTGGGGAAAACAGGC
TCCCTCGGTAGCTCGATGATCCTCTTTTGATCTTGTGTGACCTCCTGGAGAGTGGATGAAGCTGGTGGCCTTAGCTTTTCTAGACAGTGT
AAGTGGCACTGGGCAAGGCCCCCAGAGCAGGGCAAGGTCTCTAGAGCGGGTCTCCCACATGACTGGCTTCACACAGGCACTTCCGCTCGG
GTTGCATGCTCTGTGTCATCTTACCGGTCCAGGGTTGCAGGTAGGAAATGTTTGTACCCTCTTCTGATTGCCACCTCCTTCCCATCGCCC
CTTAGGGACAGGGCTTGAGGGCCAGTGAGGCGCTGGTCAGGCACCCCAGGCCTCCTTGGGACCTGCCCAGGGGCACCCTGAGAGCTCCTG
AAACCCCCACTTAGCTTCCAGACCTTTCTGCAAAAGCTCCTCCTGGCTTTCCTCCCTCCCCCAATCTATGGGTCACAGCTAACAGATCTG
AGGGCAACTGCTGTGCTAGTGGCCAGGGCTGCACCTGCCATCCCCGGCTCTGCCACTTTAGGGCCTTCTAGAGGCAGTGTCCTTAGGAAG
TAGCTCTGAGGCATGGGTTTTCTGCTCCTGTGCAGGGCAGCTGATGGGATAAGGTGGGGAAGGACGGTCAGTGCTTGGGCCCCAGCTGGC
CAGCCTGGCGATGGGGAAACCAAACCATGTCCCCCAGCGAAGGGCCAGAGTGGGAACCTGTCCTCATGCCCTTCGTCCTGAGGAGCCCTG
AGGTGGGCAGCAGGGGCCAGGGGAAGTTTTCAGGCCTTCATCAAAGAGAACAACATCCTCAGCTCCGCACCCCTCATCCTGTATCAGCAC
TTACCGGTGTGTGACTGCCCTTGTCAGCTAGCATACGGTGGGCCCACCTGGCCCACTGGCTGTTTATGCCACTGATTTATGATAGGGAAT
ATTATCTTTGAACCCAATGAAGTGTTTTCTCCCCCATCACAAAAAAAAAAATTCTTATTTTTAGTAGACATGTATTTACCAAAAATATGT
ACTCAATTATTGTATTTTGGATTTTATCAATTTAAAAATTGTGGAAATTTGTTTGCTCTTACGCCAACATAATATTGATTTTGCCTCTTG
GCTCTGAAAGCCCAAAATATTTACCGTCTAGCCCGTTACAGAAAAAGTCTGCTGACTACTGAGCCAGACCTCCATTACCTCCATCCCTGT
TGGATTATTTAAAGAAAGCCTCAGACAGTAAGGGCTTTTTTAAAAGAATAAAATGACTTGGTTTGCGCTTGGAAGCAGGGGAAGCATTCA
GATGAGCGGTTTCTGCATTAACCCTGCCTATCACGCATCTCGTGTCCTGTGTGGCTGGCGAGCCCCCCTTGGAAGGTTCTGGTGCTTCAG
CTGGCTCCTGCAGAGTCCACCCCGCCTCGTGGTGGGAATGCAGAGCCCTTTGCTTTCCTTCTTGCCGCCTGCTTCCTGTTCCTGGGGACC
CGCTGGGCCTTTGGTCTGCATCCCCTGGCCAGGTCCCTCAGGGTTGATGCGTGGAGAAGGACTTTGAGCAGTGGTGGGCAGCAGTGGCCT
CCTGGCCAGCTCACACTCTTGTCCTGGGAGGGGCAGCCTGATCTCACCTCCACCTAGTACCTTGGGGACTGAGGACCTTTTGGCTTCTCT
GGAGCCTGCAAGCCTCTTCCCATGTGTCCAGCTGCTCTTCCTGCTACAAAGGGGACTGCTCACAGTGGCCTCAGCTTGGTGGTTTTGAGG
GGCCGCCCCCCGGCCCTCCATAAGGGTATCCTGGGCCTGAGAATTCTGCATCTGCCATTGGAGGATGGACAGCCTCAAATGGAAGGAGTC
CCACGGGAGATGGGTCCGAGGTCCGGCTGTGGCCATCCAGCCCCCTGTGGCTTGTCCAGCCTCTGTGCACCCCTGGTGTCTTCACTCCAG
GGGCAGACAGCAGCCACTGCAGTTCCTTTCTTCGTGAGTAACAGTAGTGATAGCAGCTGGGGCTAACAGGCTAGGCTTTGTGTTCTGCGC
ATTTGGTCAGCTTCTCACTCGATCCTCCCTAAAGCAATGGGGAGGCCCCCACTAGCCCAGTTTTCAGGAAGTCAACTGGGAGGTTAGATG
GGGGCCAGGGTCCCACAGCTACTGATGGCCCGAGCCAGGTTGAGCTTCCTGGTGTCCAGTCCGGATCCCACTTGCAGATCTCATGCTCTC
AGATAGGTGGGACAAGTTCTTTTGTCACAGTGCTGGCTCTGTCCTGAGGCCTCATTGCTGGCTGGGTGTGCTCTGCTGGGAAAAGCTTTG
CGGGGCTTGCTTGGTTAACCACAGAAGAGAAGGGGACTGTTTGGGGTGCCTCTCTGCAGCCTCCCCGTGCTGGGTGGAAGCACGGTTACT
GTGTTCTCTAATGTTCATGTATTTAAAATGATTTCTTTCTAAAGATGTAACCTCCACACCTTTCTCCAGATTGGGTGACTCTTTTCTAAA
GGTGGTGGGAGTATCTGTCGGGGTGGTGTGGCCCTTGGATGGGTCAGGTGGGTGTGAGAGGTCCTGGGGAGGTGGGCGTTGAGCTCAAAG
TTGTCCTACTGCCATGTTTTTGTACCTGAAATAAAGCATATTTTGCACTTGTTACTGTACCATAGTGCGGACGAGAAGTCTGTATGTGGG

>96393_96393_3_UBP1-TFCP2L1_UBP1_chr3_33453072_ENST00000447368_TFCP2L1_chr2_122004546_ENST00000263707_length(amino acids)=496AA_BP=37
MAWVLKMDEVIESGLVHDFDASLSGIGQELGAGAYSMSDVLALPIFKQEDSSLPLDGETEHPPFQYVMCAATSPAVKLHDETLTYLNQGQ
SYEIRMLDNRKMGDMPEINGKLVKSIIRVVFHDRRLQYTEHQQLEGWKWNRPGDRLLDLDIPMSVGIIDTRTNPSQLNAVEFLWDPAKRT
SAFIQVHCISTEFTPRKHGGEKGVPFRVQIDTFKQNENGEYTEHLHSASCQIKVFKPKGADRKQKTDREKMEKRTAQEKEKYQPSYETTI
LTECSPWPDVAYQVNSAPSPSYNGSPNSFGLGEGNASPTHPVEALPVGSDHLLPSASIQDAQQWLHRNRFSQFCRLFASFSGADLLKMSR
DDLVQICGPADGIRLFNAIKGRNVRPKMTIYVCQELEQNRVPLQQKRDGSGDSNLSVYHAIFLEELTTLELIEKIANLYSISPQHIHRVY

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for UBP1-TFCP2L1


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for UBP1-TFCP2L1


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for UBP1-TFCP2L1


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource