FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:HPS4-UBE2L3 (FusionGDB2 ID:37540)

Fusion Gene Summary for HPS4-UBE2L3

check button Fusion gene summary
Fusion gene informationFusion gene name: HPS4-UBE2L3
Fusion gene ID: 37540
HgeneTgene
Gene symbol

HPS4

UBE2L3

Gene ID

89781

7332

Gene nameHPS4 biogenesis of lysosomal organelles complex 3 subunit 2ubiquitin conjugating enzyme E2 L3
SynonymsBLOC3S2|LEE2-F1|L-UBC|UBCH7|UbcM4
Cytomap

22q12.1

22q11.21

Type of geneprotein-codingprotein-coding
DescriptionHermansky-Pudlak syndrome 4 proteinlight-ear protein homologubiquitin-conjugating enzyme E2 L3E2 ubiquitin-conjugating enzyme L3ubiquitin carrier protein L3ubiquitin conjugating enzyme E2L 3ubiquitin-conjugating enzyme E2-F1ubiquitin-conjugating enzyme UBCH7ubiquitin-protein ligase L3
Modification date2020032020200313
UniProtAcc

Q9NQG7

.
Ensembl transtripts involved in fusion geneENST00000336873, ENST00000398141, 
ENST00000398145, ENST00000402105, 
ENST00000493455, 
ENST00000545681, 
ENST00000342192, ENST00000458578, 
Fusion gene scores* DoF score3 X 3 X 3=2720 X 14 X 7=1960
# samples 322
** MAII scorelog2(3/27*10)=0.15200309344505
effective Gene in Pan-Cancer Fusion Genes (eGinPCFGs).
DoF>8 and MAII>0
log2(22/1960*10)=-3.15527822547791
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: HPS4 [Title/Abstract] AND UBE2L3 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointHPS4(26859883)-UBE2L3(21947150), # samples:3
Anticipated loss of major functional domain due to fusion event.HPS4-UBE2L3 seems lost the major protein functional domain in Tgene partner, which is a essential gene due to the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneHPS4

GO:0006605

protein targeting

12663659

HgeneHPS4

GO:0007040

lysosome organization

12663659

HgeneHPS4

GO:1903232

melanosome assembly

23084991

TgeneUBE2L3

GO:0000209

protein polyubiquitination

10888878|14765125

TgeneUBE2L3

GO:0006355

regulation of transcription, DNA-templated

17003263

TgeneUBE2L3

GO:0016567

protein ubiquitination

9990509|21532592

TgeneUBE2L3

GO:0070979

protein K11-linked ubiquitination

20061386

TgeneUBE2L3

GO:0071385

cellular response to glucocorticoid stimulus

17003263


check buttonFusion gene breakpoints across HPS4 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across UBE2L3 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4SKCMTCGA-XV-A9W2-01AHPS4chr22

26859883

-UBE2L3chr22

21947150

+


Top

Fusion Gene ORF analysis for HPS4-UBE2L3

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000336873ENST00000545681HPS4chr22

26859883

-UBE2L3chr22

21947150

+
5CDS-intronENST00000398141ENST00000545681HPS4chr22

26859883

-UBE2L3chr22

21947150

+
5CDS-intronENST00000398145ENST00000545681HPS4chr22

26859883

-UBE2L3chr22

21947150

+
5CDS-intronENST00000402105ENST00000545681HPS4chr22

26859883

-UBE2L3chr22

21947150

+
5UTR-3CDSENST00000493455ENST00000342192HPS4chr22

26859883

-UBE2L3chr22

21947150

+
5UTR-3CDSENST00000493455ENST00000458578HPS4chr22

26859883

-UBE2L3chr22

21947150

+
5UTR-intronENST00000493455ENST00000545681HPS4chr22

26859883

-UBE2L3chr22

21947150

+
Frame-shiftENST00000398145ENST00000342192HPS4chr22

26859883

-UBE2L3chr22

21947150

+
Frame-shiftENST00000398145ENST00000458578HPS4chr22

26859883

-UBE2L3chr22

21947150

+
In-frameENST00000336873ENST00000342192HPS4chr22

26859883

-UBE2L3chr22

21947150

+
In-frameENST00000336873ENST00000458578HPS4chr22

26859883

-UBE2L3chr22

21947150

+
In-frameENST00000398141ENST00000342192HPS4chr22

26859883

-UBE2L3chr22

21947150

+
In-frameENST00000398141ENST00000458578HPS4chr22

26859883

-UBE2L3chr22

21947150

+
In-frameENST00000402105ENST00000342192HPS4chr22

26859883

-UBE2L3chr22

21947150

+
In-frameENST00000402105ENST00000458578HPS4chr22

26859883

-UBE2L3chr22

21947150

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000398141HPS4chr2226859883-ENST00000458578UBE2L3chr2221947150+2773175202189729
ENST00000398141HPS4chr2226859883-ENST00000342192UBE2L3chr2221947150+4555175202189729
ENST00000402105HPS4chr2226859883-ENST00000458578UBE2L3chr2221947150+301919982462435729
ENST00000402105HPS4chr2226859883-ENST00000342192UBE2L3chr2221947150+480119982462435729
ENST00000336873HPS4chr2226859883-ENST00000458578UBE2L3chr2221947150+324322225092659716
ENST00000336873HPS4chr2226859883-ENST00000342192UBE2L3chr2221947150+502522225092659716

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000398141ENST00000458578HPS4chr2226859883-UBE2L3chr2221947150+0.0029150760.9970849
ENST00000398141ENST00000342192HPS4chr2226859883-UBE2L3chr2221947150+0.0023648640.9976351
ENST00000402105ENST00000458578HPS4chr2226859883-UBE2L3chr2221947150+0.002446120.9975539
ENST00000402105ENST00000342192HPS4chr2226859883-UBE2L3chr2221947150+0.0022277660.99777216
ENST00000336873ENST00000458578HPS4chr2226859883-UBE2L3chr2221947150+0.0029789640.997021
ENST00000336873ENST00000342192HPS4chr2226859883-UBE2L3chr2221947150+0.0025638180.9974362

Top

Fusion Genomic Features for HPS4-UBE2L3


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
HPS4chr2226859882-UBE2L3chr2221947149+4.09E-060.99999595
HPS4chr2226859882-UBE2L3chr2221947149+4.09E-060.99999595

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.
genomic feature of top 1%

Top

Fusion Protein Features for HPS4-UBE2L3


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr22:26859883/chr22:21947150)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
HPS4

Q9NQG7

.
FUNCTION: Component of the BLOC-3 complex, a complex that acts as a guanine exchange factor (GEF) for RAB32 and RAB38, promotes the exchange of GDP to GTP, converting them from an inactive GDP-bound form into an active GTP-bound form. The BLOC-3 complex plays an important role in the control of melanin production and melanosome biogenesis and promotes the membrane localization of RAB32 and RAB38 (PubMed:23084991). {ECO:0000269|PubMed:23084991}.FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneUBE2L3chr22:26859883chr22:21947150ENST00000545681032_1490123.0DomainUBC core

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneUBE2L3chr22:26859883chr22:21947150ENST00000342192042_1499155.0DomainUBC core
TgeneUBE2L3chr22:26859883chr22:21947150ENST00000458578042_14967213.0DomainUBC core


Top

Fusion Gene Sequence for HPS4-UBE2L3


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>37540_37540_1_HPS4-UBE2L3_HPS4_chr22_26859883_ENST00000336873_UBE2L3_chr22_21947150_ENST00000342192_length(transcript)=5025nt_BP=2222nt
GTGACCTAAGGCCTCTCTGCCGCGCGCGCAGAGCCAAGGCACTGATGTTTGAACTGGAAACTTCAAAACGTTTAATAAGAGTCTTCAGGA
TGGGTTTGAACTAGACAAGCTAGAAATTTCTTTAGAACACCAGCTCTAGCATGCATCTCCCACTTTTGGCTTTCCTGGAGAGGAGCTTGA
AGAGGTGGTTCTGCAGACAGCCACAGTGATACTTAGGAAACCAGAGGAATGGATTTGACTTTTCTGCTAGGATTCTCTGTTATAGTTTCT
CCCTGAGTTGTAAGAGGCATGGAAATATACATGAAACTGAAGAACCTGCAAGGAAGGGAAGTGGAACTTTCCATGCTGAGTGAAAACTAA
CCAAGTGGCAGTTGTGACTGAAAACACTGAAACCTACCACGTCCAGATTCACTGGATTGGGGGATAGAGGAACGGTCACAGCTAGGGAGA
AAGAAGTGATACCGGAAAAGAAAACCTAAATGAAGAGAATGAGGATGACTGCACAGTAGATGGCCACCTCTACCTCCACAGAGGCAAAGT
CAGCCTCGTGGTGGAATTATTTTTTTCTTTATGATGGTTCCAAGGTAAAGGAAGAAGGCGATCCAACAAGAGCTGGCATTTGTTACTTTT
ATCCTTCCCAGACCCTGCTAGACCAACAGGAGTTGCTTTGTGGACAGATTGCTGGAGTTGTCCGCTGTGTTTCTGACATTTCTGACTCTC
CTCCTACTCTTGTTCGTCTGAGAAAACTGAAGTTTGCCATAAAAGTTGATGGAGATTACCTTTGGGTGCTGGGCTGTGCTGTGGAGCTCC
CTGATGTCAGCTGCAAGCGGTTTCTGGATCAGCTAGTTGGATTCTTTAATTTTTACAATGGACCTGTTTCCCTAGCTTATGAGAACTGTT
CTCAGGAAGAACTGAGCACGGAGTGGGACACCTTCATCGAGCAAATTCTGAAAAACACCAGTGATCTGCATAAGATTTTCAATTCCCTCT
GGAACTTGGACCAAACTAAAGTGGAGCCCCTGTTGTTGCTGAAGGCAGCCCGCATTCTGCAGACCTGCCAGCGCTCGCCTCACATTCTCG
CTGGCTGCATCCTCTATAAAGGACTGATTGTCAGCACCCAACTCCCGCCCTCCCTCACCGCCAAGGTCCTGCTTCACCGAACAGCACCTC
AGGAGCAGAGACTCCCTACGGGAGAGGATGCCCCGCAGGAACATGGAGCGGCATTGCCCCCGAATGTCCAGATTATCCCTGTTTTTGTGA
CCAAAGAGGAAGCCATTAGTCTCCACGAGTTCCCGGTGGAACAGATGACAAGGTCTCTAGCATCTCCAGCAGGACTCCAGGATGGTTCAG
CCCAGCACCATCCAAAGGGTGGGAGCACATCTGCCCTGAAAGAAAACGCCACTGGCCATGTGGAATCCATGGCCTGGACCACCCCAGATC
CCACATCCCCTGACGAAGCTTGTCCAGATGGCAGGAAGGAGAACGGATGCTTGTCTGGCCATGATCTGGAGAGCATCAGGCCCGCAGGAC
TGCACAACTCTGCCAGGGGTGAGGTTCTTGGCCTCAGCTCCTCCCTGGGGAAGGAACTAGTCTTTCTCCAAGAAGAACTCGACTTGTCTG
AAATCCACATTCCAGAGGCTCAGGAAGTGGAAATGGCCTCAGGTCATTTTGCCTTCCTACATGTGCCTGTTCCAGATGGCAGGGCTCCTT
ACTGCAAGGCATCTCTCAGCGCCTCCAGCAGCCTGGAACCCACGCCTCCTGAGGACACAGCCATCAGCAGCTTGCGCCCTCCCTCTGCTC
CTGAGATGCTGACCCAGCATGGAGCCCAAGAGCAGCTCGAAGACCATCCTGGCCATAGCAGCCAAGCCCCCATTCCCAGAGCAGACCCTC
TCCCCAGAAGGACCCGCAGGCCCTTGTTATTGCCTCGCTTAGATCCAGGACAGAGAGGAAACAAGCTTCCCACGGGGGAACAAGGCCTGG
ATGAGGATGTTGATGGGGTCTGTGAAAGCCACGCAGCCCCTGGTCTGGAATGCAGTTCAGGCTCAGCAAACTGTCAGGGTGCTGGCCCCT
CTGCAGATGGAATCAGCTCCAGGCTGACACCAGCAGAGTCCTGCATGGGGCTCGTGAGGATGAATCTCTACACTCACTGCGTCAAAGGGC
TGGTGCTGTCCCTGCTGGCTGAGGAGCCGCTGCTGGGAGACAGCGCAGCCATAGAGGAAGTGGAGCTTGAAGAAATCCGCAAATGTGGGA
TGAAAAACTTCCGTAACATCCAGGTTGATGAAGCTAATTTATTGACTTGGCAAGGGCTTATTGTTCCTGACAACCCTCCATATGATAAGG
GAGCCTTCAGAATCGAAATCAACTTTCCAGCAGAGTACCCATTCAAACCACCGAAGATCACATTTAAAACAAAGATCTATCACCCAAACA
TCGACGAAAAGGGGCAGGTCTGTCTGCCAGTAATTAGTGCCGAAAACTGGAAGCCAGCAACCAAAACCGACCAAGTAATCCAGTCCCTCA
TAGCACTGGTGAATGACCCCCAGCCTGAGCACCCGCTTCGGGCTGACCTAGCTGAAGAATACTCTAAGGACCGTAAAAAATTCTGTAAGA
ATGCTGAAGAGTTTACAAAGAAATATGGGGAAAAGCGACCTGTGGACTAAAATCTGCCACGATTGGTTCCAGCAAGTGTGAGCAGAGACC
CCGTGCAGTGCATTCAGACACCCCGCAAAGCAGGACTCTGTGGAAATTGACACGTGCCACCGCCTGGCGTTCGCTTGTGGCAGTTACTAA
CTTTCTACAGTTTTCTTAATCAAAAGTGGTCTAGGTAACCTGTAAAGAAAGGATTAAAAATTTAAGATGTTCTAGTTCTGCTCTCTTTGT
TTTAAAAATCACTGCTTCAATCTACTTCAAAAGAATGGTGTTTCTTTTCTTGTCCAATTTTATCCAAAATCTTCAAGTTACATTTAACCC
ATAAGGTTTAAAAAAAAGGAAAAAAAACGGTTGTGGTTCCCTTTCTTCCCTACCCTTGCCACTCCCACTTTCTGGCACCGAGTTTATTTT
TCACTTACTTACTTCCCCAGACCCCGGGCTCGCCTCCACAAAGGAGAAAAGACTGCCCTGGCGGTCCTGGTGGCTTTTCTTAGCATGTGT
GGCACTGTTGCCCAGTGTGGGAGTTGGTTTAAATTCTCCTGACTCCAGTTTATAACATCCTTTTAAAAAATTTAAAAACAAACAGCCACA
CCCCTCCTCCAGTCCTTCTCCTCAGTTCTTGTGTGAAACTCCAGCTGATGTTACCACAGTAACATCAGTTAATTGGGCAAGCCCTGATGT
CAGTGTGTGTAACTGACCTCTGGCCTGGCCTGCACAGAGAAGCCCTATAATCACAGGTCTGTGGTGGCCCCGAAATGGGGGGCCTGCTAG
TCAGGAGGATGCTGTGCACACTGTGTGTGATGAATCTCGCCAGAAAGGCTCCTGAGGTCCCAGGTTGGCACTTCTCCCTGCAGCCATTGT
AGAAGATCTGCTGGTCCTTGCAGGCAAAGCTACAGCCAGAATGTCCGTTTGAAACTCCTAGCTCATCTGTCACCGAGCTTCATCCGAATG
TGCCACGGAGCTTGCTCTCCACTTCCTCCGTGCAGTGGCCCTGCCACAGCCCTCCCTCGGCACACTTTGACCCTTTGTAGGATTGGAATT
AGCAGGACTCGGCTATTTAAAGCACCAGTCTGGGGTCGCCTGGGCCCCTGCTGACCCCCTCCTCCAGAGCAGCCAGCCCAGCCCGGGAAC
AAGACGGACTTCCTCTCCCTTCGGACTCACAGCCTTTGCAGAGTCAAGCTCCACTTGAAGCTCACTCAGTAATATCCTTTCAATGTGTTT
TATATTGTTTTGACTGCCTTTTTTTGTAGAAATAAAAATTGACCTTAGAATTTATCGTCAGATAAACTTGTAAAGATTTGAATATTAATG
TCTTTTCAAGGCAAATGGGATTGTCCCCGCACTAGTAGAGAATCCATGTCGCTCTGACACCCCAAGGAAGCCGACGATCCAAATGCCGTG
TGTCACCAACCCCGCTTCTGCCACTGGCGGCTTCCCTTCTTGGCTCTTGGGGGGGACTAGATCCTGTGGAGAAGATGACTTAAACTTTGC
TTTTTGTTTTAATTTTAATTCTATAACTTGAGATCTTTCCGGGGCCTACAGGCGTGTAAGACAGCTTGGTCTGGTCTGTGCAGAAGTGGG
GAGTGATGGGCAGGTTCGGCAGCCTAACATTGTTCAGGCGCATGGCCCCTGCGGTGTGTACACGAACTCGGCTTCTTTTGTCCTAGGTAC
GCCAAGGGCAGGTTTCTGGAGACTCCCTTGTGCCCGGGATGGCAAGGGCACCGGGCTGGCGTTTCCACATCTGTCTTCATTAGCAGAAAA
GTGATGATGGATTTTATTTCACTCACACTCCAGTTTGTAATAAAATGCCAAATTCTGTCAGCTATCCAAACAAGCCACCATTTGTTCTTG
TTGCTTCTCTGGATCCAGAAATGTTGCCATTCTTGGAAACTGTCCCATTGCTTCGTATTTCTGCCAACGTAGCTCTGCCTGCCTGTCAAC
CCCTCACTGCACTCTGCTCATCACGGGAGGATACCTGTGTGCCGGCAGCCCCTCAGGGACTCTCAGCCCTGGCACTGGCACCCCAGGGTT
GGCCCCGTCAGCAGAGGCTTGGCTTTCGAGCCAGTGGGTGTCTCTCCTTTGGGCCTGGGCGGCTTGCTCCTGCCAGCCATGCCTTCAGGG
TAGGCTCTGAGCAAGCTGGCGAACAGCCCTGGCTGCTCCAAAACCAAAAAGCTGGGTCCTCTGGAGGAGGGGCGAGCTGTGGAGCAGCCA
CCCACTGCTGCCCCAAGCTCACTCAGGAATTCACACCCGCCTGGTTTCTTGAAGTGTGCTGGGTCCTTCCCTCTGCTCCCTACTCCCCAC

>37540_37540_1_HPS4-UBE2L3_HPS4_chr22_26859883_ENST00000336873_UBE2L3_chr22_21947150_ENST00000342192_length(amino acids)=716AA_BP=571
MATSTSTEAKSASWWNYFFLYDGSKVKEEGDPTRAGICYFYPSQTLLDQQELLCGQIAGVVRCVSDISDSPPTLVRLRKLKFAIKVDGDY
LWVLGCAVELPDVSCKRFLDQLVGFFNFYNGPVSLAYENCSQEELSTEWDTFIEQILKNTSDLHKIFNSLWNLDQTKVEPLLLLKAARIL
QTCQRSPHILAGCILYKGLIVSTQLPPSLTAKVLLHRTAPQEQRLPTGEDAPQEHGAALPPNVQIIPVFVTKEEAISLHEFPVEQMTRSL
ASPAGLQDGSAQHHPKGGSTSALKENATGHVESMAWTTPDPTSPDEACPDGRKENGCLSGHDLESIRPAGLHNSARGEVLGLSSSLGKEL
VFLQEELDLSEIHIPEAQEVEMASGHFAFLHVPVPDGRAPYCKASLSASSSLEPTPPEDTAISSLRPPSAPEMLTQHGAQEQLEDHPGHS
SQAPIPRADPLPRRTRRPLLLPRLDPGQRGNKLPTGEQGLDEDVDGVCESHAAPGLECSSGSANCQGAGPSADGISSRLTPAESCMGLVR
MNLYTHCVKGLVLSLLAEEPLLGDSAAIEEVELEEIRKCGMKNFRNIQVDEANLLTWQGLIVPDNPPYDKGAFRIEINFPAEYPFKPPKI

--------------------------------------------------------------
>37540_37540_2_HPS4-UBE2L3_HPS4_chr22_26859883_ENST00000336873_UBE2L3_chr22_21947150_ENST00000458578_length(transcript)=3243nt_BP=2222nt
GTGACCTAAGGCCTCTCTGCCGCGCGCGCAGAGCCAAGGCACTGATGTTTGAACTGGAAACTTCAAAACGTTTAATAAGAGTCTTCAGGA
TGGGTTTGAACTAGACAAGCTAGAAATTTCTTTAGAACACCAGCTCTAGCATGCATCTCCCACTTTTGGCTTTCCTGGAGAGGAGCTTGA
AGAGGTGGTTCTGCAGACAGCCACAGTGATACTTAGGAAACCAGAGGAATGGATTTGACTTTTCTGCTAGGATTCTCTGTTATAGTTTCT
CCCTGAGTTGTAAGAGGCATGGAAATATACATGAAACTGAAGAACCTGCAAGGAAGGGAAGTGGAACTTTCCATGCTGAGTGAAAACTAA
CCAAGTGGCAGTTGTGACTGAAAACACTGAAACCTACCACGTCCAGATTCACTGGATTGGGGGATAGAGGAACGGTCACAGCTAGGGAGA
AAGAAGTGATACCGGAAAAGAAAACCTAAATGAAGAGAATGAGGATGACTGCACAGTAGATGGCCACCTCTACCTCCACAGAGGCAAAGT
CAGCCTCGTGGTGGAATTATTTTTTTCTTTATGATGGTTCCAAGGTAAAGGAAGAAGGCGATCCAACAAGAGCTGGCATTTGTTACTTTT
ATCCTTCCCAGACCCTGCTAGACCAACAGGAGTTGCTTTGTGGACAGATTGCTGGAGTTGTCCGCTGTGTTTCTGACATTTCTGACTCTC
CTCCTACTCTTGTTCGTCTGAGAAAACTGAAGTTTGCCATAAAAGTTGATGGAGATTACCTTTGGGTGCTGGGCTGTGCTGTGGAGCTCC
CTGATGTCAGCTGCAAGCGGTTTCTGGATCAGCTAGTTGGATTCTTTAATTTTTACAATGGACCTGTTTCCCTAGCTTATGAGAACTGTT
CTCAGGAAGAACTGAGCACGGAGTGGGACACCTTCATCGAGCAAATTCTGAAAAACACCAGTGATCTGCATAAGATTTTCAATTCCCTCT
GGAACTTGGACCAAACTAAAGTGGAGCCCCTGTTGTTGCTGAAGGCAGCCCGCATTCTGCAGACCTGCCAGCGCTCGCCTCACATTCTCG
CTGGCTGCATCCTCTATAAAGGACTGATTGTCAGCACCCAACTCCCGCCCTCCCTCACCGCCAAGGTCCTGCTTCACCGAACAGCACCTC
AGGAGCAGAGACTCCCTACGGGAGAGGATGCCCCGCAGGAACATGGAGCGGCATTGCCCCCGAATGTCCAGATTATCCCTGTTTTTGTGA
CCAAAGAGGAAGCCATTAGTCTCCACGAGTTCCCGGTGGAACAGATGACAAGGTCTCTAGCATCTCCAGCAGGACTCCAGGATGGTTCAG
CCCAGCACCATCCAAAGGGTGGGAGCACATCTGCCCTGAAAGAAAACGCCACTGGCCATGTGGAATCCATGGCCTGGACCACCCCAGATC
CCACATCCCCTGACGAAGCTTGTCCAGATGGCAGGAAGGAGAACGGATGCTTGTCTGGCCATGATCTGGAGAGCATCAGGCCCGCAGGAC
TGCACAACTCTGCCAGGGGTGAGGTTCTTGGCCTCAGCTCCTCCCTGGGGAAGGAACTAGTCTTTCTCCAAGAAGAACTCGACTTGTCTG
AAATCCACATTCCAGAGGCTCAGGAAGTGGAAATGGCCTCAGGTCATTTTGCCTTCCTACATGTGCCTGTTCCAGATGGCAGGGCTCCTT
ACTGCAAGGCATCTCTCAGCGCCTCCAGCAGCCTGGAACCCACGCCTCCTGAGGACACAGCCATCAGCAGCTTGCGCCCTCCCTCTGCTC
CTGAGATGCTGACCCAGCATGGAGCCCAAGAGCAGCTCGAAGACCATCCTGGCCATAGCAGCCAAGCCCCCATTCCCAGAGCAGACCCTC
TCCCCAGAAGGACCCGCAGGCCCTTGTTATTGCCTCGCTTAGATCCAGGACAGAGAGGAAACAAGCTTCCCACGGGGGAACAAGGCCTGG
ATGAGGATGTTGATGGGGTCTGTGAAAGCCACGCAGCCCCTGGTCTGGAATGCAGTTCAGGCTCAGCAAACTGTCAGGGTGCTGGCCCCT
CTGCAGATGGAATCAGCTCCAGGCTGACACCAGCAGAGTCCTGCATGGGGCTCGTGAGGATGAATCTCTACACTCACTGCGTCAAAGGGC
TGGTGCTGTCCCTGCTGGCTGAGGAGCCGCTGCTGGGAGACAGCGCAGCCATAGAGGAAGTGGAGCTTGAAGAAATCCGCAAATGTGGGA
TGAAAAACTTCCGTAACATCCAGGTTGATGAAGCTAATTTATTGACTTGGCAAGGGCTTATTGTTCCTGACAACCCTCCATATGATAAGG
GAGCCTTCAGAATCGAAATCAACTTTCCAGCAGAGTACCCATTCAAACCACCGAAGATCACATTTAAAACAAAGATCTATCACCCAAACA
TCGACGAAAAGGGGCAGGTCTGTCTGCCAGTAATTAGTGCCGAAAACTGGAAGCCAGCAACCAAAACCGACCAAGTAATCCAGTCCCTCA
TAGCACTGGTGAATGACCCCCAGCCTGAGCACCCGCTTCGGGCTGACCTAGCTGAAGAATACTCTAAGGACCGTAAAAAATTCTGTAAGA
ATGCTGAAGAGTTTACAAAGAAATATGGGGAAAAGCGACCTGTGGACTAAAATCTGCCACGATTGGTTCCAGCAAGTGTGAGCAGAGACC
CCGTGCAGTGCATTCAGACACCCCGCAAAGCAGGACTCTGTGGAAATTGACACGTGCCACCGCCTGGCGTTCGCTTGTGGCAGTTACTAA
CTTTCTACAGTTTTCTTAATCAAAAGTGGTCTAGGTAACCTGTAAAGAAAGGATTAAAAATTTAAGATGTTCTAGTTCTGCTCTCTTTGT
TTTAAAAATCACTGCTTCAATCTACTTCAAAAGAATGGTGTTTCTTTTCTTGTCCAATTTTATCCAAAATCTTCAAGTTACATTTAACCC
ATAAGGTTTAAAAAAAAGGAAAAAAAACGGTTGTGGTTCCCTTTCTTCCCTACCCTTGCCACTCCCACTTTCTGGCACCGAGTTTATTTT
TCACTTACTTACTTCCCCAGACCCCGGGCTCGCCTCCACAAAGGAGAAAAGACTGCCCTGGCGGTCCTGGTGGCTTTTCTTAGCATGTGT
GGCACTGTTGCCCAGTGTGGGAGTTGGTTTAAATTCTCCTGACTCCAGTTTATAACATCCTTTTAAAAAATTTAAAAACAAACAGCCACA

>37540_37540_2_HPS4-UBE2L3_HPS4_chr22_26859883_ENST00000336873_UBE2L3_chr22_21947150_ENST00000458578_length(amino acids)=716AA_BP=571
MATSTSTEAKSASWWNYFFLYDGSKVKEEGDPTRAGICYFYPSQTLLDQQELLCGQIAGVVRCVSDISDSPPTLVRLRKLKFAIKVDGDY
LWVLGCAVELPDVSCKRFLDQLVGFFNFYNGPVSLAYENCSQEELSTEWDTFIEQILKNTSDLHKIFNSLWNLDQTKVEPLLLLKAARIL
QTCQRSPHILAGCILYKGLIVSTQLPPSLTAKVLLHRTAPQEQRLPTGEDAPQEHGAALPPNVQIIPVFVTKEEAISLHEFPVEQMTRSL
ASPAGLQDGSAQHHPKGGSTSALKENATGHVESMAWTTPDPTSPDEACPDGRKENGCLSGHDLESIRPAGLHNSARGEVLGLSSSLGKEL
VFLQEELDLSEIHIPEAQEVEMASGHFAFLHVPVPDGRAPYCKASLSASSSLEPTPPEDTAISSLRPPSAPEMLTQHGAQEQLEDHPGHS
SQAPIPRADPLPRRTRRPLLLPRLDPGQRGNKLPTGEQGLDEDVDGVCESHAAPGLECSSGSANCQGAGPSADGISSRLTPAESCMGLVR
MNLYTHCVKGLVLSLLAEEPLLGDSAAIEEVELEEIRKCGMKNFRNIQVDEANLLTWQGLIVPDNPPYDKGAFRIEINFPAEYPFKPPKI

--------------------------------------------------------------
>37540_37540_3_HPS4-UBE2L3_HPS4_chr22_26859883_ENST00000398141_UBE2L3_chr22_21947150_ENST00000342192_length(transcript)=4555nt_BP=1752nt
ATGGCTCCACTCTGTTCTCTTGCCAGGTGGAATTATTTTTTTCTTTATGATGGTTCCAAGGTAAAGGAAGAAGGCGATCCAACAAGAGCT
GGCATTTGTTACTTTTATCCTTCCCAGACCCTGCTAGACCAACAGGAGTTGCTTTGTGGACAGATTGCTGGAGTTGTCCGCTGTGTTTCT
GACATTTCTGACTCTCCTCCTACTCTTGTTCGTCTGAGAAAACTGAAGTTTGCCATAAAAGTTGATGGAGATTACCTTTGGGTGCTGGGC
TGTGCTGTGGAGCTCCCTGATGTCAGCTGCAAGCGGTTTCTGGATCAGCTAGTTGGATTCTTTAATTTTTACAATGGACCTGTTTCCCTA
GCTTATGAGAACTGTTCTCAGGAAGAACTGAGCACGGAGTGGGACACCTTCATCGAGCAAATTCTGAAAAACACCAGTGATCTGCATAAG
ATTTTCAATTCCCTCTGGAACTTGGACCAAACTAAAGTGGAGCCCCTGTTGTTGCTGAAGGCAGCCCGCATTCTGCAGACCTGCCAGCGC
TCGCCTCACATTCTCGCTGGCTGCATCCTCTATAAAGGACTGATTGTCAGCACCCAACTCCCGCCCTCCCTCACCGCCAAGGTCCTGCTT
CACCGAACAGCACCTCAGGAGCAGAGACTCCCTACGGGAGAGGATGCCCCGCAGGAACATGGAAAATGGATGTTATGGAGCTTCAAGAAT
CGAGTTACCCACCAGAACCCTAATGGAGCGGCATTGCCCCCGAATGTCCAGATTATCCCTGTTTTTGTGACCAAAGAGGAAGCCATTAGT
CTCCACGAGTTCCCGGTGGAACAGATGACAAGGTCTCTAGCATCTCCAGCAGGACTCCAGGATGGTTCAGCCCAGCACCATCCAAAGGGT
GGGAGCACATCTGCCCTGAAAGAAAACGCCACTGGCCATGTGGAATCCATGGCCTGGACCACCCCAGATCCCACATCCCCTGACGAAGCT
TGTCCAGATGGCAGGAAGGAGAACGGATGCTTGTCTGGCCATGATCTGGAGAGCATCAGGCCCGCAGGACTGCACAACTCTGCCAGGGGT
GAGGTTCTTGGCCTCAGCTCCTCCCTGGGGAAGGAACTAGTCTTTCTCCAAGAAGAACTCGACTTGTCTGAAATCCACATTCCAGAGGCT
CAGGAAGTGGAAATGGCCTCAGGTCATTTTGCCTTCCTACATGTGCCTGTTCCAGATGGCAGGGCTCCTTACTGCAAGGCATCTCTCAGC
GCCTCCAGCAGCCTGGAACCCACGCCTCCTGAGGACACAGCCATCAGCAGCTTGCGCCCTCCCTCTGCTCCTGAGATGCTGACCCAGCAT
GGAGCCCAAGAGCAGCTCGAAGACCATCCTGGCCATAGCAGCCAAGCCCCCATTCCCAGAGCAGACCCTCTCCCCAGAAGGACCCGCAGG
CCCTTGTTATTGCCTCGCTTAGATCCAGGACAGAGAGGAAACAAGCTTCCCACGGGGGAACAAGGCCTGGATGAGGATGTTGATGGGGTC
TGTGAAAGCCACGCAGCCCCTGGTCTGGAATGCAGTTCAGGCTCAGCAAACTGTCAGGGTGCTGGCCCCTCTGCAGATGGAATCAGCTCC
AGGCTGACACCAGCAGAGTCCTGCATGGGGCTCGTGAGGATGAATCTCTACACTCACTGCGTCAAAGGGCTGGTGCTGTCCCTGCTGGCT
GAGGAGCCGCTGCTGGGAGACAGCGCAGCCATAGAGGAAGTGGAGCTTGAAGAAATCCGCAAATGTGGGATGAAAAACTTCCGTAACATC
CAGGTTGATGAAGCTAATTTATTGACTTGGCAAGGGCTTATTGTTCCTGACAACCCTCCATATGATAAGGGAGCCTTCAGAATCGAAATC
AACTTTCCAGCAGAGTACCCATTCAAACCACCGAAGATCACATTTAAAACAAAGATCTATCACCCAAACATCGACGAAAAGGGGCAGGTC
TGTCTGCCAGTAATTAGTGCCGAAAACTGGAAGCCAGCAACCAAAACCGACCAAGTAATCCAGTCCCTCATAGCACTGGTGAATGACCCC
CAGCCTGAGCACCCGCTTCGGGCTGACCTAGCTGAAGAATACTCTAAGGACCGTAAAAAATTCTGTAAGAATGCTGAAGAGTTTACAAAG
AAATATGGGGAAAAGCGACCTGTGGACTAAAATCTGCCACGATTGGTTCCAGCAAGTGTGAGCAGAGACCCCGTGCAGTGCATTCAGACA
CCCCGCAAAGCAGGACTCTGTGGAAATTGACACGTGCCACCGCCTGGCGTTCGCTTGTGGCAGTTACTAACTTTCTACAGTTTTCTTAAT
CAAAAGTGGTCTAGGTAACCTGTAAAGAAAGGATTAAAAATTTAAGATGTTCTAGTTCTGCTCTCTTTGTTTTAAAAATCACTGCTTCAA
TCTACTTCAAAAGAATGGTGTTTCTTTTCTTGTCCAATTTTATCCAAAATCTTCAAGTTACATTTAACCCATAAGGTTTAAAAAAAAGGA
AAAAAAACGGTTGTGGTTCCCTTTCTTCCCTACCCTTGCCACTCCCACTTTCTGGCACCGAGTTTATTTTTCACTTACTTACTTCCCCAG
ACCCCGGGCTCGCCTCCACAAAGGAGAAAAGACTGCCCTGGCGGTCCTGGTGGCTTTTCTTAGCATGTGTGGCACTGTTGCCCAGTGTGG
GAGTTGGTTTAAATTCTCCTGACTCCAGTTTATAACATCCTTTTAAAAAATTTAAAAACAAACAGCCACACCCCTCCTCCAGTCCTTCTC
CTCAGTTCTTGTGTGAAACTCCAGCTGATGTTACCACAGTAACATCAGTTAATTGGGCAAGCCCTGATGTCAGTGTGTGTAACTGACCTC
TGGCCTGGCCTGCACAGAGAAGCCCTATAATCACAGGTCTGTGGTGGCCCCGAAATGGGGGGCCTGCTAGTCAGGAGGATGCTGTGCACA
CTGTGTGTGATGAATCTCGCCAGAAAGGCTCCTGAGGTCCCAGGTTGGCACTTCTCCCTGCAGCCATTGTAGAAGATCTGCTGGTCCTTG
CAGGCAAAGCTACAGCCAGAATGTCCGTTTGAAACTCCTAGCTCATCTGTCACCGAGCTTCATCCGAATGTGCCACGGAGCTTGCTCTCC
ACTTCCTCCGTGCAGTGGCCCTGCCACAGCCCTCCCTCGGCACACTTTGACCCTTTGTAGGATTGGAATTAGCAGGACTCGGCTATTTAA
AGCACCAGTCTGGGGTCGCCTGGGCCCCTGCTGACCCCCTCCTCCAGAGCAGCCAGCCCAGCCCGGGAACAAGACGGACTTCCTCTCCCT
TCGGACTCACAGCCTTTGCAGAGTCAAGCTCCACTTGAAGCTCACTCAGTAATATCCTTTCAATGTGTTTTATATTGTTTTGACTGCCTT
TTTTTGTAGAAATAAAAATTGACCTTAGAATTTATCGTCAGATAAACTTGTAAAGATTTGAATATTAATGTCTTTTCAAGGCAAATGGGA
TTGTCCCCGCACTAGTAGAGAATCCATGTCGCTCTGACACCCCAAGGAAGCCGACGATCCAAATGCCGTGTGTCACCAACCCCGCTTCTG
CCACTGGCGGCTTCCCTTCTTGGCTCTTGGGGGGGACTAGATCCTGTGGAGAAGATGACTTAAACTTTGCTTTTTGTTTTAATTTTAATT
CTATAACTTGAGATCTTTCCGGGGCCTACAGGCGTGTAAGACAGCTTGGTCTGGTCTGTGCAGAAGTGGGGAGTGATGGGCAGGTTCGGC
AGCCTAACATTGTTCAGGCGCATGGCCCCTGCGGTGTGTACACGAACTCGGCTTCTTTTGTCCTAGGTACGCCAAGGGCAGGTTTCTGGA
GACTCCCTTGTGCCCGGGATGGCAAGGGCACCGGGCTGGCGTTTCCACATCTGTCTTCATTAGCAGAAAAGTGATGATGGATTTTATTTC
ACTCACACTCCAGTTTGTAATAAAATGCCAAATTCTGTCAGCTATCCAAACAAGCCACCATTTGTTCTTGTTGCTTCTCTGGATCCAGAA
ATGTTGCCATTCTTGGAAACTGTCCCATTGCTTCGTATTTCTGCCAACGTAGCTCTGCCTGCCTGTCAACCCCTCACTGCACTCTGCTCA
TCACGGGAGGATACCTGTGTGCCGGCAGCCCCTCAGGGACTCTCAGCCCTGGCACTGGCACCCCAGGGTTGGCCCCGTCAGCAGAGGCTT
GGCTTTCGAGCCAGTGGGTGTCTCTCCTTTGGGCCTGGGCGGCTTGCTCCTGCCAGCCATGCCTTCAGGGTAGGCTCTGAGCAAGCTGGC
GAACAGCCCTGGCTGCTCCAAAACCAAAAAGCTGGGTCCTCTGGAGGAGGGGCGAGCTGTGGAGCAGCCACCCACTGCTGCCCCAAGCTC
ACTCAGGAATTCACACCCGCCTGGTTTCTTGAAGTGTGCTGGGTCCTTCCCTCTGCTCCCTACTCCCCACCACGGCAGAGAATAGGCTTT

>37540_37540_3_HPS4-UBE2L3_HPS4_chr22_26859883_ENST00000398141_UBE2L3_chr22_21947150_ENST00000342192_length(amino acids)=729AA_BP=584
MAPLCSLARWNYFFLYDGSKVKEEGDPTRAGICYFYPSQTLLDQQELLCGQIAGVVRCVSDISDSPPTLVRLRKLKFAIKVDGDYLWVLG
CAVELPDVSCKRFLDQLVGFFNFYNGPVSLAYENCSQEELSTEWDTFIEQILKNTSDLHKIFNSLWNLDQTKVEPLLLLKAARILQTCQR
SPHILAGCILYKGLIVSTQLPPSLTAKVLLHRTAPQEQRLPTGEDAPQEHGKWMLWSFKNRVTHQNPNGAALPPNVQIIPVFVTKEEAIS
LHEFPVEQMTRSLASPAGLQDGSAQHHPKGGSTSALKENATGHVESMAWTTPDPTSPDEACPDGRKENGCLSGHDLESIRPAGLHNSARG
EVLGLSSSLGKELVFLQEELDLSEIHIPEAQEVEMASGHFAFLHVPVPDGRAPYCKASLSASSSLEPTPPEDTAISSLRPPSAPEMLTQH
GAQEQLEDHPGHSSQAPIPRADPLPRRTRRPLLLPRLDPGQRGNKLPTGEQGLDEDVDGVCESHAAPGLECSSGSANCQGAGPSADGISS
RLTPAESCMGLVRMNLYTHCVKGLVLSLLAEEPLLGDSAAIEEVELEEIRKCGMKNFRNIQVDEANLLTWQGLIVPDNPPYDKGAFRIEI
NFPAEYPFKPPKITFKTKIYHPNIDEKGQVCLPVISAENWKPATKTDQVIQSLIALVNDPQPEHPLRADLAEEYSKDRKKFCKNAEEFTK

--------------------------------------------------------------
>37540_37540_4_HPS4-UBE2L3_HPS4_chr22_26859883_ENST00000398141_UBE2L3_chr22_21947150_ENST00000458578_length(transcript)=2773nt_BP=1752nt
ATGGCTCCACTCTGTTCTCTTGCCAGGTGGAATTATTTTTTTCTTTATGATGGTTCCAAGGTAAAGGAAGAAGGCGATCCAACAAGAGCT
GGCATTTGTTACTTTTATCCTTCCCAGACCCTGCTAGACCAACAGGAGTTGCTTTGTGGACAGATTGCTGGAGTTGTCCGCTGTGTTTCT
GACATTTCTGACTCTCCTCCTACTCTTGTTCGTCTGAGAAAACTGAAGTTTGCCATAAAAGTTGATGGAGATTACCTTTGGGTGCTGGGC
TGTGCTGTGGAGCTCCCTGATGTCAGCTGCAAGCGGTTTCTGGATCAGCTAGTTGGATTCTTTAATTTTTACAATGGACCTGTTTCCCTA
GCTTATGAGAACTGTTCTCAGGAAGAACTGAGCACGGAGTGGGACACCTTCATCGAGCAAATTCTGAAAAACACCAGTGATCTGCATAAG
ATTTTCAATTCCCTCTGGAACTTGGACCAAACTAAAGTGGAGCCCCTGTTGTTGCTGAAGGCAGCCCGCATTCTGCAGACCTGCCAGCGC
TCGCCTCACATTCTCGCTGGCTGCATCCTCTATAAAGGACTGATTGTCAGCACCCAACTCCCGCCCTCCCTCACCGCCAAGGTCCTGCTT
CACCGAACAGCACCTCAGGAGCAGAGACTCCCTACGGGAGAGGATGCCCCGCAGGAACATGGAAAATGGATGTTATGGAGCTTCAAGAAT
CGAGTTACCCACCAGAACCCTAATGGAGCGGCATTGCCCCCGAATGTCCAGATTATCCCTGTTTTTGTGACCAAAGAGGAAGCCATTAGT
CTCCACGAGTTCCCGGTGGAACAGATGACAAGGTCTCTAGCATCTCCAGCAGGACTCCAGGATGGTTCAGCCCAGCACCATCCAAAGGGT
GGGAGCACATCTGCCCTGAAAGAAAACGCCACTGGCCATGTGGAATCCATGGCCTGGACCACCCCAGATCCCACATCCCCTGACGAAGCT
TGTCCAGATGGCAGGAAGGAGAACGGATGCTTGTCTGGCCATGATCTGGAGAGCATCAGGCCCGCAGGACTGCACAACTCTGCCAGGGGT
GAGGTTCTTGGCCTCAGCTCCTCCCTGGGGAAGGAACTAGTCTTTCTCCAAGAAGAACTCGACTTGTCTGAAATCCACATTCCAGAGGCT
CAGGAAGTGGAAATGGCCTCAGGTCATTTTGCCTTCCTACATGTGCCTGTTCCAGATGGCAGGGCTCCTTACTGCAAGGCATCTCTCAGC
GCCTCCAGCAGCCTGGAACCCACGCCTCCTGAGGACACAGCCATCAGCAGCTTGCGCCCTCCCTCTGCTCCTGAGATGCTGACCCAGCAT
GGAGCCCAAGAGCAGCTCGAAGACCATCCTGGCCATAGCAGCCAAGCCCCCATTCCCAGAGCAGACCCTCTCCCCAGAAGGACCCGCAGG
CCCTTGTTATTGCCTCGCTTAGATCCAGGACAGAGAGGAAACAAGCTTCCCACGGGGGAACAAGGCCTGGATGAGGATGTTGATGGGGTC
TGTGAAAGCCACGCAGCCCCTGGTCTGGAATGCAGTTCAGGCTCAGCAAACTGTCAGGGTGCTGGCCCCTCTGCAGATGGAATCAGCTCC
AGGCTGACACCAGCAGAGTCCTGCATGGGGCTCGTGAGGATGAATCTCTACACTCACTGCGTCAAAGGGCTGGTGCTGTCCCTGCTGGCT
GAGGAGCCGCTGCTGGGAGACAGCGCAGCCATAGAGGAAGTGGAGCTTGAAGAAATCCGCAAATGTGGGATGAAAAACTTCCGTAACATC
CAGGTTGATGAAGCTAATTTATTGACTTGGCAAGGGCTTATTGTTCCTGACAACCCTCCATATGATAAGGGAGCCTTCAGAATCGAAATC
AACTTTCCAGCAGAGTACCCATTCAAACCACCGAAGATCACATTTAAAACAAAGATCTATCACCCAAACATCGACGAAAAGGGGCAGGTC
TGTCTGCCAGTAATTAGTGCCGAAAACTGGAAGCCAGCAACCAAAACCGACCAAGTAATCCAGTCCCTCATAGCACTGGTGAATGACCCC
CAGCCTGAGCACCCGCTTCGGGCTGACCTAGCTGAAGAATACTCTAAGGACCGTAAAAAATTCTGTAAGAATGCTGAAGAGTTTACAAAG
AAATATGGGGAAAAGCGACCTGTGGACTAAAATCTGCCACGATTGGTTCCAGCAAGTGTGAGCAGAGACCCCGTGCAGTGCATTCAGACA
CCCCGCAAAGCAGGACTCTGTGGAAATTGACACGTGCCACCGCCTGGCGTTCGCTTGTGGCAGTTACTAACTTTCTACAGTTTTCTTAAT
CAAAAGTGGTCTAGGTAACCTGTAAAGAAAGGATTAAAAATTTAAGATGTTCTAGTTCTGCTCTCTTTGTTTTAAAAATCACTGCTTCAA
TCTACTTCAAAAGAATGGTGTTTCTTTTCTTGTCCAATTTTATCCAAAATCTTCAAGTTACATTTAACCCATAAGGTTTAAAAAAAAGGA
AAAAAAACGGTTGTGGTTCCCTTTCTTCCCTACCCTTGCCACTCCCACTTTCTGGCACCGAGTTTATTTTTCACTTACTTACTTCCCCAG
ACCCCGGGCTCGCCTCCACAAAGGAGAAAAGACTGCCCTGGCGGTCCTGGTGGCTTTTCTTAGCATGTGTGGCACTGTTGCCCAGTGTGG

>37540_37540_4_HPS4-UBE2L3_HPS4_chr22_26859883_ENST00000398141_UBE2L3_chr22_21947150_ENST00000458578_length(amino acids)=729AA_BP=584
MAPLCSLARWNYFFLYDGSKVKEEGDPTRAGICYFYPSQTLLDQQELLCGQIAGVVRCVSDISDSPPTLVRLRKLKFAIKVDGDYLWVLG
CAVELPDVSCKRFLDQLVGFFNFYNGPVSLAYENCSQEELSTEWDTFIEQILKNTSDLHKIFNSLWNLDQTKVEPLLLLKAARILQTCQR
SPHILAGCILYKGLIVSTQLPPSLTAKVLLHRTAPQEQRLPTGEDAPQEHGKWMLWSFKNRVTHQNPNGAALPPNVQIIPVFVTKEEAIS
LHEFPVEQMTRSLASPAGLQDGSAQHHPKGGSTSALKENATGHVESMAWTTPDPTSPDEACPDGRKENGCLSGHDLESIRPAGLHNSARG
EVLGLSSSLGKELVFLQEELDLSEIHIPEAQEVEMASGHFAFLHVPVPDGRAPYCKASLSASSSLEPTPPEDTAISSLRPPSAPEMLTQH
GAQEQLEDHPGHSSQAPIPRADPLPRRTRRPLLLPRLDPGQRGNKLPTGEQGLDEDVDGVCESHAAPGLECSSGSANCQGAGPSADGISS
RLTPAESCMGLVRMNLYTHCVKGLVLSLLAEEPLLGDSAAIEEVELEEIRKCGMKNFRNIQVDEANLLTWQGLIVPDNPPYDKGAFRIEI
NFPAEYPFKPPKITFKTKIYHPNIDEKGQVCLPVISAENWKPATKTDQVIQSLIALVNDPQPEHPLRADLAEEYSKDRKKFCKNAEEFTK

--------------------------------------------------------------
>37540_37540_5_HPS4-UBE2L3_HPS4_chr22_26859883_ENST00000402105_UBE2L3_chr22_21947150_ENST00000342192_length(transcript)=4801nt_BP=1998nt
AGACCATGTGGTTGTATAGTTAAGGTTCAGTGTTTCAAATAAGAGCAGTCATTGTGGGCGCCTCGTTTTTTTTTCTAGCTACTCGTGTCT
TCTCCGGCAGATGGTCTAACGGCTGGATAGAGGCATATTTTTAGTTTGAATTTCCTGCGCTTACAGAATGGTTTTATCTAATTTGTAGCT
ACTGGGGTTCTTTATTCTCTGGGGCCACCTGAAGGCTTTCAAACACAGTAATAAGCCCTCCTCGGGCTGGGAAACAGGAAGCTGTGTTTT
AAATGCCGTGATCACTGGATTAAAGGAAACATGGCTCCACTCTGTTCTCTTGCCAGGTGGAATTATTTTTTTCTTTATGATGGTTCCAAG
GTAAAGGAAGAAGGCGATCCAACAAGAGCTGGCATTTGTTACTTTTATCCTTCCCAGACCCTGCTAGACCAACAGGAGTTGCTTTGTGGA
CAGATTGCTGGAGTTGTCCGCTGTGTTTCTGACATTTCTGACTCTCCTCCTACTCTTGTTCGTCTGAGAAAACTGAAGTTTGCCATAAAA
GTTGATGGAGATTACCTTTGGGTGCTGGGCTGTGCTGTGGAGCTCCCTGATGTCAGCTGCAAGCGGTTTCTGGATCAGCTAGTTGGATTC
TTTAATTTTTACAATGGACCTGTTTCCCTAGCTTATGAGAACTGTTCTCAGGAAGAACTGAGCACGGAGTGGGACACCTTCATCGAGCAA
ATTCTGAAAAACACCAGTGATCTGCATAAGATTTTCAATTCCCTCTGGAACTTGGACCAAACTAAAGTGGAGCCCCTGTTGTTGCTGAAG
GCAGCCCGCATTCTGCAGACCTGCCAGCGCTCGCCTCACATTCTCGCTGGCTGCATCCTCTATAAAGGACTGATTGTCAGCACCCAACTC
CCGCCCTCCCTCACCGCCAAGGTCCTGCTTCACCGAACAGCACCTCAGGAGCAGAGACTCCCTACGGGAGAGGATGCCCCGCAGGAACAT
GGAGCGGCATTGCCCCCGAATGTCCAGATTATCCCTGTTTTTGTGACCAAAGAGGAAGCCATTAGTCTCCACGAGTTCCCGGTGGAACAG
ATGACAAGGTCTCTAGCATCTCCAGCAGGACTCCAGGATGGTTCAGCCCAGCACCATCCAAAGGGTGGGAGCACATCTGCCCTGAAAGAA
AACGCCACTGGCCATGTGGAATCCATGGCCTGGACCACCCCAGATCCCACATCCCCTGACGAAGCTTGTCCAGATGGCAGGAAGGAGAAC
GGATGCTTGTCTGGCCATGATCTGGAGAGCATCAGGCCCGCAGGACTGCACAACTCTGCCAGGGGTGAGGTTCTTGGCCTCAGCTCCTCC
CTGGGGAAGGAACTAGTCTTTCTCCAAGAAGAACTCGACTTGTCTGAAATCCACATTCCAGAGGCTCAGGAAGTGGAAATGGCCTCAGGT
CATTTTGCCTTCCTACATGTGCCTGTTCCAGATGGCAGGGCTCCTTACTGCAAGGCATCTCTCAGCGCCTCCAGCAGCCTGGAACCCACG
CCTCCTGAGGACACAGCCATCAGCAGCTTGCGCCCTCCCTCTGCTCCTGAGATGCTGACCCAGCATGGAGCCCAAGAGCAGCTCGAAGAC
CATCCTGGCCATAGCAGCCAAGCCCCCATTCCCAGAGCAGACCCTCTCCCCAGAAGGACCCGCAGGCCCTTGTTATTGCCTCGCTTAGAT
CCAGGACAGAGAGGAAACAAGCTTCCCACGGGGGAACAAGGCCTGGATGAGGATGTTGATGGGGTCTGTGAAAGCCACGCAGCCCCTGGT
CTGGAATGCAGTTCAGGCTCAGCAAACTGTCAGGGTGCTGGCCCCTCTGCAGATGGAATCAGCTCCAGGCTGACACCAGCAGAGTCCTGC
ATGGGGCTCGTGAGGATGAATCTCTACACTCACTGCGTCAAAGGGCTGGTGCTGTCCCTGCTGGCTGAGGAGCCGCTGCTGGGAGACAGC
GCAGCCATAGAGGAAGTGGAGCTTGAAGAAATCCGCAAATGTGGGATGAAAAACTTCCGTAACATCCAGGTTGATGAAGCTAATTTATTG
ACTTGGCAAGGGCTTATTGTTCCTGACAACCCTCCATATGATAAGGGAGCCTTCAGAATCGAAATCAACTTTCCAGCAGAGTACCCATTC
AAACCACCGAAGATCACATTTAAAACAAAGATCTATCACCCAAACATCGACGAAAAGGGGCAGGTCTGTCTGCCAGTAATTAGTGCCGAA
AACTGGAAGCCAGCAACCAAAACCGACCAAGTAATCCAGTCCCTCATAGCACTGGTGAATGACCCCCAGCCTGAGCACCCGCTTCGGGCT
GACCTAGCTGAAGAATACTCTAAGGACCGTAAAAAATTCTGTAAGAATGCTGAAGAGTTTACAAAGAAATATGGGGAAAAGCGACCTGTG
GACTAAAATCTGCCACGATTGGTTCCAGCAAGTGTGAGCAGAGACCCCGTGCAGTGCATTCAGACACCCCGCAAAGCAGGACTCTGTGGA
AATTGACACGTGCCACCGCCTGGCGTTCGCTTGTGGCAGTTACTAACTTTCTACAGTTTTCTTAATCAAAAGTGGTCTAGGTAACCTGTA
AAGAAAGGATTAAAAATTTAAGATGTTCTAGTTCTGCTCTCTTTGTTTTAAAAATCACTGCTTCAATCTACTTCAAAAGAATGGTGTTTC
TTTTCTTGTCCAATTTTATCCAAAATCTTCAAGTTACATTTAACCCATAAGGTTTAAAAAAAAGGAAAAAAAACGGTTGTGGTTCCCTTT
CTTCCCTACCCTTGCCACTCCCACTTTCTGGCACCGAGTTTATTTTTCACTTACTTACTTCCCCAGACCCCGGGCTCGCCTCCACAAAGG
AGAAAAGACTGCCCTGGCGGTCCTGGTGGCTTTTCTTAGCATGTGTGGCACTGTTGCCCAGTGTGGGAGTTGGTTTAAATTCTCCTGACT
CCAGTTTATAACATCCTTTTAAAAAATTTAAAAACAAACAGCCACACCCCTCCTCCAGTCCTTCTCCTCAGTTCTTGTGTGAAACTCCAG
CTGATGTTACCACAGTAACATCAGTTAATTGGGCAAGCCCTGATGTCAGTGTGTGTAACTGACCTCTGGCCTGGCCTGCACAGAGAAGCC
CTATAATCACAGGTCTGTGGTGGCCCCGAAATGGGGGGCCTGCTAGTCAGGAGGATGCTGTGCACACTGTGTGTGATGAATCTCGCCAGA
AAGGCTCCTGAGGTCCCAGGTTGGCACTTCTCCCTGCAGCCATTGTAGAAGATCTGCTGGTCCTTGCAGGCAAAGCTACAGCCAGAATGT
CCGTTTGAAACTCCTAGCTCATCTGTCACCGAGCTTCATCCGAATGTGCCACGGAGCTTGCTCTCCACTTCCTCCGTGCAGTGGCCCTGC
CACAGCCCTCCCTCGGCACACTTTGACCCTTTGTAGGATTGGAATTAGCAGGACTCGGCTATTTAAAGCACCAGTCTGGGGTCGCCTGGG
CCCCTGCTGACCCCCTCCTCCAGAGCAGCCAGCCCAGCCCGGGAACAAGACGGACTTCCTCTCCCTTCGGACTCACAGCCTTTGCAGAGT
CAAGCTCCACTTGAAGCTCACTCAGTAATATCCTTTCAATGTGTTTTATATTGTTTTGACTGCCTTTTTTTGTAGAAATAAAAATTGACC
TTAGAATTTATCGTCAGATAAACTTGTAAAGATTTGAATATTAATGTCTTTTCAAGGCAAATGGGATTGTCCCCGCACTAGTAGAGAATC
CATGTCGCTCTGACACCCCAAGGAAGCCGACGATCCAAATGCCGTGTGTCACCAACCCCGCTTCTGCCACTGGCGGCTTCCCTTCTTGGC
TCTTGGGGGGGACTAGATCCTGTGGAGAAGATGACTTAAACTTTGCTTTTTGTTTTAATTTTAATTCTATAACTTGAGATCTTTCCGGGG
CCTACAGGCGTGTAAGACAGCTTGGTCTGGTCTGTGCAGAAGTGGGGAGTGATGGGCAGGTTCGGCAGCCTAACATTGTTCAGGCGCATG
GCCCCTGCGGTGTGTACACGAACTCGGCTTCTTTTGTCCTAGGTACGCCAAGGGCAGGTTTCTGGAGACTCCCTTGTGCCCGGGATGGCA
AGGGCACCGGGCTGGCGTTTCCACATCTGTCTTCATTAGCAGAAAAGTGATGATGGATTTTATTTCACTCACACTCCAGTTTGTAATAAA
ATGCCAAATTCTGTCAGCTATCCAAACAAGCCACCATTTGTTCTTGTTGCTTCTCTGGATCCAGAAATGTTGCCATTCTTGGAAACTGTC
CCATTGCTTCGTATTTCTGCCAACGTAGCTCTGCCTGCCTGTCAACCCCTCACTGCACTCTGCTCATCACGGGAGGATACCTGTGTGCCG
GCAGCCCCTCAGGGACTCTCAGCCCTGGCACTGGCACCCCAGGGTTGGCCCCGTCAGCAGAGGCTTGGCTTTCGAGCCAGTGGGTGTCTC
TCCTTTGGGCCTGGGCGGCTTGCTCCTGCCAGCCATGCCTTCAGGGTAGGCTCTGAGCAAGCTGGCGAACAGCCCTGGCTGCTCCAAAAC
CAAAAAGCTGGGTCCTCTGGAGGAGGGGCGAGCTGTGGAGCAGCCACCCACTGCTGCCCCAAGCTCACTCAGGAATTCACACCCGCCTGG
TTTCTTGAAGTGTGCTGGGTCCTTCCCTCTGCTCCCTACTCCCCACCACGGCAGAGAATAGGCTTTCTAAGATGCTGCGATCCCGTTCTG

>37540_37540_5_HPS4-UBE2L3_HPS4_chr22_26859883_ENST00000402105_UBE2L3_chr22_21947150_ENST00000342192_length(amino acids)=729AA_BP=584
MGNRKLCFKCRDHWIKGNMAPLCSLARWNYFFLYDGSKVKEEGDPTRAGICYFYPSQTLLDQQELLCGQIAGVVRCVSDISDSPPTLVRL
RKLKFAIKVDGDYLWVLGCAVELPDVSCKRFLDQLVGFFNFYNGPVSLAYENCSQEELSTEWDTFIEQILKNTSDLHKIFNSLWNLDQTK
VEPLLLLKAARILQTCQRSPHILAGCILYKGLIVSTQLPPSLTAKVLLHRTAPQEQRLPTGEDAPQEHGAALPPNVQIIPVFVTKEEAIS
LHEFPVEQMTRSLASPAGLQDGSAQHHPKGGSTSALKENATGHVESMAWTTPDPTSPDEACPDGRKENGCLSGHDLESIRPAGLHNSARG
EVLGLSSSLGKELVFLQEELDLSEIHIPEAQEVEMASGHFAFLHVPVPDGRAPYCKASLSASSSLEPTPPEDTAISSLRPPSAPEMLTQH
GAQEQLEDHPGHSSQAPIPRADPLPRRTRRPLLLPRLDPGQRGNKLPTGEQGLDEDVDGVCESHAAPGLECSSGSANCQGAGPSADGISS
RLTPAESCMGLVRMNLYTHCVKGLVLSLLAEEPLLGDSAAIEEVELEEIRKCGMKNFRNIQVDEANLLTWQGLIVPDNPPYDKGAFRIEI
NFPAEYPFKPPKITFKTKIYHPNIDEKGQVCLPVISAENWKPATKTDQVIQSLIALVNDPQPEHPLRADLAEEYSKDRKKFCKNAEEFTK

--------------------------------------------------------------
>37540_37540_6_HPS4-UBE2L3_HPS4_chr22_26859883_ENST00000402105_UBE2L3_chr22_21947150_ENST00000458578_length(transcript)=3019nt_BP=1998nt
AGACCATGTGGTTGTATAGTTAAGGTTCAGTGTTTCAAATAAGAGCAGTCATTGTGGGCGCCTCGTTTTTTTTTCTAGCTACTCGTGTCT
TCTCCGGCAGATGGTCTAACGGCTGGATAGAGGCATATTTTTAGTTTGAATTTCCTGCGCTTACAGAATGGTTTTATCTAATTTGTAGCT
ACTGGGGTTCTTTATTCTCTGGGGCCACCTGAAGGCTTTCAAACACAGTAATAAGCCCTCCTCGGGCTGGGAAACAGGAAGCTGTGTTTT
AAATGCCGTGATCACTGGATTAAAGGAAACATGGCTCCACTCTGTTCTCTTGCCAGGTGGAATTATTTTTTTCTTTATGATGGTTCCAAG
GTAAAGGAAGAAGGCGATCCAACAAGAGCTGGCATTTGTTACTTTTATCCTTCCCAGACCCTGCTAGACCAACAGGAGTTGCTTTGTGGA
CAGATTGCTGGAGTTGTCCGCTGTGTTTCTGACATTTCTGACTCTCCTCCTACTCTTGTTCGTCTGAGAAAACTGAAGTTTGCCATAAAA
GTTGATGGAGATTACCTTTGGGTGCTGGGCTGTGCTGTGGAGCTCCCTGATGTCAGCTGCAAGCGGTTTCTGGATCAGCTAGTTGGATTC
TTTAATTTTTACAATGGACCTGTTTCCCTAGCTTATGAGAACTGTTCTCAGGAAGAACTGAGCACGGAGTGGGACACCTTCATCGAGCAA
ATTCTGAAAAACACCAGTGATCTGCATAAGATTTTCAATTCCCTCTGGAACTTGGACCAAACTAAAGTGGAGCCCCTGTTGTTGCTGAAG
GCAGCCCGCATTCTGCAGACCTGCCAGCGCTCGCCTCACATTCTCGCTGGCTGCATCCTCTATAAAGGACTGATTGTCAGCACCCAACTC
CCGCCCTCCCTCACCGCCAAGGTCCTGCTTCACCGAACAGCACCTCAGGAGCAGAGACTCCCTACGGGAGAGGATGCCCCGCAGGAACAT
GGAGCGGCATTGCCCCCGAATGTCCAGATTATCCCTGTTTTTGTGACCAAAGAGGAAGCCATTAGTCTCCACGAGTTCCCGGTGGAACAG
ATGACAAGGTCTCTAGCATCTCCAGCAGGACTCCAGGATGGTTCAGCCCAGCACCATCCAAAGGGTGGGAGCACATCTGCCCTGAAAGAA
AACGCCACTGGCCATGTGGAATCCATGGCCTGGACCACCCCAGATCCCACATCCCCTGACGAAGCTTGTCCAGATGGCAGGAAGGAGAAC
GGATGCTTGTCTGGCCATGATCTGGAGAGCATCAGGCCCGCAGGACTGCACAACTCTGCCAGGGGTGAGGTTCTTGGCCTCAGCTCCTCC
CTGGGGAAGGAACTAGTCTTTCTCCAAGAAGAACTCGACTTGTCTGAAATCCACATTCCAGAGGCTCAGGAAGTGGAAATGGCCTCAGGT
CATTTTGCCTTCCTACATGTGCCTGTTCCAGATGGCAGGGCTCCTTACTGCAAGGCATCTCTCAGCGCCTCCAGCAGCCTGGAACCCACG
CCTCCTGAGGACACAGCCATCAGCAGCTTGCGCCCTCCCTCTGCTCCTGAGATGCTGACCCAGCATGGAGCCCAAGAGCAGCTCGAAGAC
CATCCTGGCCATAGCAGCCAAGCCCCCATTCCCAGAGCAGACCCTCTCCCCAGAAGGACCCGCAGGCCCTTGTTATTGCCTCGCTTAGAT
CCAGGACAGAGAGGAAACAAGCTTCCCACGGGGGAACAAGGCCTGGATGAGGATGTTGATGGGGTCTGTGAAAGCCACGCAGCCCCTGGT
CTGGAATGCAGTTCAGGCTCAGCAAACTGTCAGGGTGCTGGCCCCTCTGCAGATGGAATCAGCTCCAGGCTGACACCAGCAGAGTCCTGC
ATGGGGCTCGTGAGGATGAATCTCTACACTCACTGCGTCAAAGGGCTGGTGCTGTCCCTGCTGGCTGAGGAGCCGCTGCTGGGAGACAGC
GCAGCCATAGAGGAAGTGGAGCTTGAAGAAATCCGCAAATGTGGGATGAAAAACTTCCGTAACATCCAGGTTGATGAAGCTAATTTATTG
ACTTGGCAAGGGCTTATTGTTCCTGACAACCCTCCATATGATAAGGGAGCCTTCAGAATCGAAATCAACTTTCCAGCAGAGTACCCATTC
AAACCACCGAAGATCACATTTAAAACAAAGATCTATCACCCAAACATCGACGAAAAGGGGCAGGTCTGTCTGCCAGTAATTAGTGCCGAA
AACTGGAAGCCAGCAACCAAAACCGACCAAGTAATCCAGTCCCTCATAGCACTGGTGAATGACCCCCAGCCTGAGCACCCGCTTCGGGCT
GACCTAGCTGAAGAATACTCTAAGGACCGTAAAAAATTCTGTAAGAATGCTGAAGAGTTTACAAAGAAATATGGGGAAAAGCGACCTGTG
GACTAAAATCTGCCACGATTGGTTCCAGCAAGTGTGAGCAGAGACCCCGTGCAGTGCATTCAGACACCCCGCAAAGCAGGACTCTGTGGA
AATTGACACGTGCCACCGCCTGGCGTTCGCTTGTGGCAGTTACTAACTTTCTACAGTTTTCTTAATCAAAAGTGGTCTAGGTAACCTGTA
AAGAAAGGATTAAAAATTTAAGATGTTCTAGTTCTGCTCTCTTTGTTTTAAAAATCACTGCTTCAATCTACTTCAAAAGAATGGTGTTTC
TTTTCTTGTCCAATTTTATCCAAAATCTTCAAGTTACATTTAACCCATAAGGTTTAAAAAAAAGGAAAAAAAACGGTTGTGGTTCCCTTT
CTTCCCTACCCTTGCCACTCCCACTTTCTGGCACCGAGTTTATTTTTCACTTACTTACTTCCCCAGACCCCGGGCTCGCCTCCACAAAGG
AGAAAAGACTGCCCTGGCGGTCCTGGTGGCTTTTCTTAGCATGTGTGGCACTGTTGCCCAGTGTGGGAGTTGGTTTAAATTCTCCTGACT

>37540_37540_6_HPS4-UBE2L3_HPS4_chr22_26859883_ENST00000402105_UBE2L3_chr22_21947150_ENST00000458578_length(amino acids)=729AA_BP=584
MGNRKLCFKCRDHWIKGNMAPLCSLARWNYFFLYDGSKVKEEGDPTRAGICYFYPSQTLLDQQELLCGQIAGVVRCVSDISDSPPTLVRL
RKLKFAIKVDGDYLWVLGCAVELPDVSCKRFLDQLVGFFNFYNGPVSLAYENCSQEELSTEWDTFIEQILKNTSDLHKIFNSLWNLDQTK
VEPLLLLKAARILQTCQRSPHILAGCILYKGLIVSTQLPPSLTAKVLLHRTAPQEQRLPTGEDAPQEHGAALPPNVQIIPVFVTKEEAIS
LHEFPVEQMTRSLASPAGLQDGSAQHHPKGGSTSALKENATGHVESMAWTTPDPTSPDEACPDGRKENGCLSGHDLESIRPAGLHNSARG
EVLGLSSSLGKELVFLQEELDLSEIHIPEAQEVEMASGHFAFLHVPVPDGRAPYCKASLSASSSLEPTPPEDTAISSLRPPSAPEMLTQH
GAQEQLEDHPGHSSQAPIPRADPLPRRTRRPLLLPRLDPGQRGNKLPTGEQGLDEDVDGVCESHAAPGLECSSGSANCQGAGPSADGISS
RLTPAESCMGLVRMNLYTHCVKGLVLSLLAEEPLLGDSAAIEEVELEEIRKCGMKNFRNIQVDEANLLTWQGLIVPDNPPYDKGAFRIEI
NFPAEYPFKPPKITFKTKIYHPNIDEKGQVCLPVISAENWKPATKTDQVIQSLIALVNDPQPEHPLRADLAEEYSKDRKKFCKNAEEFTK

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for HPS4-UBE2L3


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for HPS4-UBE2L3


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for HPS4-UBE2L3


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource