Fusion Gene Studies
in Kim Lab


Center for Computational Systems Medicine

Fusion gene: ARHGEF1-UCKL1 (FusionGDB2 ID: HG9138TG54963)

Fusion Gene Summary for ARHGEF1-UCKL1

Fusion gene summary
Fusion gene name: ARHGEF1-UCKL1
Fusion gene ID: hg9138tg54963

Field | Hgene | Tgene
Gene symbol | ARHGEF1 | UCKL1
Gene ID | 9138 | 54963
Gene name | Rho guanine nucleotide exchange factor 1 | uridine-cytidine kinase 1 like 1
Synonyms | GEF1, IMD62, LBCL2, LSC, P115-RHOGEF, SUB1.5 | UCK1L, URKL1
Cytomap | 19q13.2 | 20q13.33
Type of gene | protein-coding | protein-coding
Description | rho guanine nucleotide exchange factor 1; 115 kDa guanine nucleotide exchange factor; 115-kD protein; Lsc homolog; Rho guanine nucleotide exchange factor (GEF) 1; p115RhoGEF | uridine-cytidine kinase-like 1; UCK1-LIKE
Modification date | 20200320 | 20200313
UniProtAcc | Q92888 | .
* DoF score | 11 X 9 X 6 = 594 | 6 X 3 X 3 = 54
# samples | 12 | 6
** MAII score | log2(12/594*10) = -2.30742852519225 | log2(6/54*10) = 0.15200309344505
Fusion gene class | possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs): DoF>8 and MAII<0 | effective Gene in Pan-Cancer Fusion Genes (eGinPCFGs): DoF>8 and MAII>0

Ensembl transcripts involved in fusion gene: ENST00000337665, ENST00000347545, ENST00000354532, ENST00000378152, ENST00000599846, ENST00000596957
Context

PubMed: ARHGEF1 [Title/Abstract] AND UCKL1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint: ARHGEF1(42400555)-UCKL1(62572561), # samples: 4
Anticipated loss of major functional domain due to fusion event:
- ARHGEF1-UCKL1 appears to have lost the major protein functional domain of the Hgene partner, which is a CGC gene, because the major functional domain is not retained in the partially deleted in-frame ORF.
- ARHGEF1-UCKL1 appears to have lost the major protein functional domain of the Hgene partner, which is an essential gene, because the major functional domain is not retained in the partially deleted in-frame ORF.
- ARHGEF1-UCKL1 appears to have lost the major protein functional domain of the Hgene partner, which is an essential gene, due to the frame-shifted ORF.
- ARHGEF1-UCKL1 appears to have lost the major protein functional domain of the Tgene partner, which is a cell metabolism gene, due to the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
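
The two score definitions above can be checked with a few lines of Python. This is a minimal sketch, not FusionGDB2 code: the function names `dof_score`, `maii_score`, and `classify` are hypothetical, and the input counts are the ones reported for ARHGEF1 and UCKL1 on this page.

```python
import math

def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency = # partners X # break points X # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types

def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index = log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)

def classify(dof: int, maii: float) -> str:
    """Apply the DoF/MAII thresholds quoted on this page (hypothetical helper)."""
    if dof > 8 and maii > 0:
        return "eGinPCFGs"
    if dof > 8 and maii < 0:
        return "peGinPCFGs"
    return "unclassified"

# Counts reported on this page for each partner gene.
hgene_dof = dof_score(11, 9, 6)
tgene_dof = dof_score(6, 3, 3)
hgene_maii = maii_score(12, hgene_dof)
tgene_maii = maii_score(6, tgene_dof)

print("ARHGEF1", hgene_dof, hgene_maii, classify(hgene_dof, hgene_maii))
print("UCKL1", tgene_dof, tgene_maii, classify(tgene_dof, tgene_maii))
```

With these inputs the sketch reproduces the page's values: DoF 594 and MAII of about -2.307 for ARHGEF1, DoF 54 and MAII of about 0.152 for UCKL1.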

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
Partner | Gene | GO ID | GO term | PubMed ID

Fusion gene breakpoints across ARHGEF1 (5'-gene)
[Figure: gene structure of ARHGEF1 with fusion breakpoints]
Fusion gene breakpoints across UCKL1 (3'-gene)
[Figure: gene structure of UCKL1 with fusion breakpoints]

Fusion gene information
* All genome coordinates were lifted over to hg19.
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
ChimerDB4 | COAD | TCGA-D5-6529-01A | ARHGEF1 | chr19 | 42400555 | - | UCKL1 | chr20 | 62572561 | -
ChimerDB4 | READ | TCGA-G5-6572-01A | ARHGEF1 | chr19 | 42400555 | - | UCKL1 | chr20 | 62572561 | -
ChimerDB4 | READ | TCGA-G5-6572 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -



Fusion Gene ORF analysis for ARHGEF1-UCKL1

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
5CDS-intron | ENST00000337665 | ENST00000492660 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
5CDS-intron | ENST00000347545 | ENST00000492660 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
5CDS-intron | ENST00000354532 | ENST00000492660 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
5CDS-intron | ENST00000378152 | ENST00000492660 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
5CDS-intron | ENST00000599846 | ENST00000492660 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
Frame-shift | ENST00000337665 | ENST00000358711 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
Frame-shift | ENST00000347545 | ENST00000358711 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
Frame-shift | ENST00000354532 | ENST00000358711 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
Frame-shift | ENST00000378152 | ENST00000358711 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
Frame-shift | ENST00000599846 | ENST00000358711 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
In-frame | ENST00000337665 | ENST00000354216 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
In-frame | ENST00000337665 | ENST00000369892 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
In-frame | ENST00000337665 | ENST00000369908 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
In-frame | ENST00000347545 | ENST00000354216 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
In-frame | ENST00000347545 | ENST00000369892 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
In-frame | ENST00000347545 | ENST00000369908 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
In-frame | ENST00000354532 | ENST00000354216 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
In-frame | ENST00000354532 | ENST00000369892 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
In-frame | ENST00000354532 | ENST00000369908 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
In-frame | ENST00000378152 | ENST00000354216 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
In-frame | ENST00000378152 | ENST00000369892 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
In-frame | ENST00000378152 | ENST00000369908 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
In-frame | ENST00000599846 | ENST00000354216 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
In-frame | ENST00000599846 | ENST00000369892 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
In-frame | ENST00000599846 | ENST00000369908 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
intron-3CDS | ENST00000596957 | ENST00000354216 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
intron-3CDS | ENST00000596957 | ENST00000358711 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
intron-3CDS | ENST00000596957 | ENST00000369892 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
intron-3CDS | ENST00000596957 | ENST00000369908 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -
intron-intron | ENST00000596957 | ENST00000492660 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | -

ORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
Henst | Hgene | Hchr | Hbp | Hstrand | Tenst | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids)
ENST00000354532 | ARHGEF1 | chr19 | 42400555 | + | ENST00000354216 | UCKL1 | chr20 | 62572561 | - | 2136 | 1269 | 148 | 1992 | 614
ENST00000354532 | ARHGEF1 | chr19 | 42400555 | + | ENST00000369892 | UCKL1 | chr20 | 62572561 | - | 2129 | 1269 | 148 | 1320 | 390
ENST00000354532 | ARHGEF1 | chr19 | 42400555 | + | ENST00000369908 | UCKL1 | chr20 | 62572561 | - | 2045 | 1269 | 148 | 1992 | 614
ENST00000599846 | ARHGEF1 | chr19 | 42400555 | + | ENST00000354216 | UCKL1 | chr20 | 62572561 | - | 2113 | 1246 | 125 | 1969 | 614
ENST00000599846 | ARHGEF1 | chr19 | 42400555 | + | ENST00000369892 | UCKL1 | chr20 | 62572561 | - | 2106 | 1246 | 125 | 1297 | 390
ENST00000599846 | ARHGEF1 | chr19 | 42400555 | + | ENST00000369908 | UCKL1 | chr20 | 62572561 | - | 2022 | 1246 | 125 | 1969 | 614
ENST00000347545 | ARHGEF1 | chr19 | 42400555 | + | ENST00000354216 | UCKL1 | chr20 | 62572561 | - | 1973 | 1106 | 84 | 1829 | 581
ENST00000347545 | ARHGEF1 | chr19 | 42400555 | + | ENST00000369892 | UCKL1 | chr20 | 62572561 | - | 1966 | 1106 | 84 | 1157 | 357
ENST00000347545 | ARHGEF1 | chr19 | 42400555 | + | ENST00000369908 | UCKL1 | chr20 | 62572561 | - | 1882 | 1106 | 84 | 1829 | 581
ENST00000378152 | ARHGEF1 | chr19 | 42400555 | + | ENST00000354216 | UCKL1 | chr20 | 62572561 | - | 2032 | 1165 | 98 | 1888 | 596
ENST00000378152 | ARHGEF1 | chr19 | 42400555 | + | ENST00000369892 | UCKL1 | chr20 | 62572561 | - | 2025 | 1165 | 98 | 1216 | 372
ENST00000378152 | ARHGEF1 | chr19 | 42400555 | + | ENST00000369908 | UCKL1 | chr20 | 62572561 | - | 1941 | 1165 | 98 | 1888 | 596
ENST00000337665 | ARHGEF1 | chr19 | 42400555 | + | ENST00000354216 | UCKL1 | chr20 | 62572561 | - | 2053 | 1186 | 20 | 1909 | 629
ENST00000337665 | ARHGEF1 | chr19 | 42400555 | + | ENST00000369892 | UCKL1 | chr20 | 62572561 | - | 2046 | 1186 | 20 | 1237 | 405
ENST00000337665 | ARHGEF1 | chr19 | 42400555 | + | ENST00000369908 | UCKL1 | chr20 | 62572561 | - | 1962 | 1186 | 20 | 1909 | 629

DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on a convolutional neural network, built by comparison with real Ribo-seq data. If the no-coding score < 0.5 and the coding score > 0.5, the in-frame fusion transcript is predicted as likely to be translated.
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score
ENST00000354532 | ENST00000354216 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | - | 0.018050747 | 0.98194927
ENST00000354532 | ENST00000369892 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | - | 0.027259419 | 0.97274053
ENST00000354532 | ENST00000369908 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | - | 0.02276395 | 0.97723603
ENST00000599846 | ENST00000354216 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | - | 0.018708847 | 0.9812912
ENST00000599846 | ENST00000369892 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | - | 0.027456539 | 0.9725434
ENST00000599846 | ENST00000369908 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | - | 0.023519734 | 0.97648025
ENST00000347545 | ENST00000354216 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | - | 0.016907787 | 0.98309225
ENST00000347545 | ENST00000369892 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | - | 0.012936478 | 0.9870635
ENST00000347545 | ENST00000369908 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | - | 0.022009129 | 0.97799087
ENST00000378152 | ENST00000354216 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | - | 0.013811802 | 0.98618823
ENST00000378152 | ENST00000369892 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | - | 0.017713718 | 0.9822863
ENST00000378152 | ENST00000369908 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | - | 0.018811123 | 0.98118895
ENST00000337665 | ENST00000354216 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | - | 0.017054869 | 0.98294514
ENST00000337665 | ENST00000369892 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | - | 0.02810594 | 0.971894
ENST00000337665 | ENST00000369908 | ARHGEF1 | chr19 | 42400555 | + | UCKL1 | chr20 | 62572561 | - | 0.023210846 | 0.9767892
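
The DeepORF decision rule stated above (no-coding score < 0.5 and coding score > 0.5) can be applied to the tabulated scores as follows. This is only an illustrative sketch: `likely_translated` is a hypothetical helper, not part of DeepORF, and the three score pairs are copied from the table for a subset of transcript pairs.

```python
def likely_translated(no_coding: float, coding: float, cutoff: float = 0.5) -> bool:
    """Return True when the in-frame fusion transcript passes the stated rule."""
    return no_coding < cutoff and coding > cutoff

# (Henst, Tenst) -> (no-coding score, coding score), taken from the table above.
deeporf_scores = {
    ("ENST00000354532", "ENST00000354216"): (0.018050747, 0.98194927),
    ("ENST00000347545", "ENST00000369892"): (0.012936478, 0.9870635),
    ("ENST00000337665", "ENST00000369892"): (0.02810594, 0.971894),
}

predictions = {
    pair: likely_translated(no_coding, coding)
    for pair, (no_coding, coding) in deeporf_scores.items()
}

for (henst, tenst), translated in predictions.items():
    print(henst, tenst, "likely translated" if translated else "not predicted as translated")
```

All fifteen in-frame transcript pairs listed for ARHGEF1-UCKL1 have no-coding scores below 0.03, so each passes this rule.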


Fusion Genomic Features for ARHGEF1-UCKL1


FusionAI prediction of the potential fusion gene breakpoint based on the pre-mRNA sequence context (+/- 5 kb for each partner gene, 20 kb in total). FusionAI is a fusion gene breakpoint classifier based on a convolutional neural network that compares the fusion-positive and fusion-negative sequence contexts of ~20K fusion genes. It gives the relative potential of each position in the 20 kb genomic sequence to be used as a gene fusion breakpoint.
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint)

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions. We integrated 44 types of human genomic feature loci in five broad categories: virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are on the help page.


Fusion Protein Features for ARHGEF1-UCKL1


Four levels of functional features of fusion genes
Go to the FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr19:42400555/chr20:62572561)
- FGviewer provides online visualization of the retention of protein functional features across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the Tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to find the hyperlink to the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.

Main function of each fusion partner protein (from UniProt)
Hgene: ARHGEF1 (UniProtAcc: Q92888)
Tgene: UniProtAcc: .
FUNCTION: Seems to play a role in the regulation of RhoA GTPase by guanine nucleotide-binding alpha-12 (GNA12) and alpha-13 (GNA13) subunits (PubMed:9641915, PubMed:9641916). Acts as GTPase-activating protein (GAP) for GNA12 and GNA13, and as guanine nucleotide exchange factor (GEF) for RhoA GTPase (PubMed:9641915, PubMed:9641916, PubMed:8810315, PubMed:30521495). Activated G alpha 13/GNA13 stimulates the RhoGEF activity through interaction with the RGS-like domain (PubMed:9641916). This GEF activity is inhibited by binding to activated GNA12 (PubMed:9641916). Mediates angiotensin-2-induced RhoA activation (PubMed:20098430). {ECO:0000269|PubMed:20098430, ECO:0000269|PubMed:30521495, ECO:0000269|PubMed:8810315, ECO:0000269|PubMed:9641915, ECO:0000269|PubMed:9641916}.
FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

Retention analysis result of each fusion partner protein across 39 protein features of UniProt: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 regional features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the breakpoint is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | ARHGEF1 | chr19:42400555 | chr20:62572561 | ENST00000337665 | + | 13 | 29 | 41_232 | 388 | 1029.3333333333333 | Domain | Note=RGSL
Hgene | ARHGEF1 | chr19:42400555 | chr20:62572561 | ENST00000347545 | + | 12 | 28 | 41_232 | 340 | 975.0 | Domain | Note=RGSL
Hgene | ARHGEF1 | chr19:42400555 | chr20:62572561 | ENST00000354532 | + | 13 | 29 | 41_232 | 373 | 983.0 | Domain | Note=RGSL

- In-frame and not-retained protein feature among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | ARHGEF1 | chr19:42400555 | chr20:62572561 | ENST00000337665 | + | 13 | 29 | 865_896 | 388 | 1029.3333333333333 | Coiled coil | Ontology_term=ECO:0000255
Hgene | ARHGEF1 | chr19:42400555 | chr20:62572561 | ENST00000347545 | + | 12 | 28 | 865_896 | 340 | 975.0 | Coiled coil | Ontology_term=ECO:0000255
Hgene | ARHGEF1 | chr19:42400555 | chr20:62572561 | ENST00000354532 | + | 13 | 29 | 865_896 | 373 | 983.0 | Coiled coil | Ontology_term=ECO:0000255
Hgene | ARHGEF1 | chr19:42400555 | chr20:62572561 | ENST00000337665 | + | 13 | 29 | 416_605 | 388 | 1029.3333333333333 | Domain | DH
Hgene | ARHGEF1 | chr19:42400555 | chr20:62572561 | ENST00000337665 | + | 13 | 29 | 647_760 | 388 | 1029.3333333333333 | Domain | PH
Hgene | ARHGEF1 | chr19:42400555 | chr20:62572561 | ENST00000347545 | + | 12 | 28 | 416_605 | 340 | 975.0 | Domain | DH
Hgene | ARHGEF1 | chr19:42400555 | chr20:62572561 | ENST00000347545 | + | 12 | 28 | 647_760 | 340 | 975.0 | Domain | PH
Hgene | ARHGEF1 | chr19:42400555 | chr20:62572561 | ENST00000354532 | + | 13 | 29 | 416_605 | 373 | 983.0 | Domain | DH
Hgene | ARHGEF1 | chr19:42400555 | chr20:62572561 | ENST00000354532 | + | 13 | 29 | 647_760 | 373 | 983.0 | Domain | PH
Tgene | UCKL1 | chr19:42400555 | chr20:62572561 | ENST00000354216 | | 7 | 15 | 105_112 | 307 | 549.0 | Nucleotide binding | ATP
Tgene | UCKL1 | chr19:42400555 | chr20:62572561 | ENST00000358711 | | 6 | 13 | 105_112 | 302 | 412.6666666666667 | Nucleotide binding | ATP
Tgene | UCKL1 | chr19:42400555 | chr20:62572561 | ENST00000369908 | | 7 | 15 | 105_112 | 292 | 534.0 | Nucleotide binding | ATP
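
The retained and not-retained rows above are consistent with a simple positional rule, sketched below. The rule itself is an assumption inferred from the tabulated values, not a documented FusionGDB2 algorithm: a feature of the 5' partner (Hgene) is treated as retained when it ends at or before BPloci, and a feature of the 3' partner (Tgene) when it starts at or after BPloci. The names `parse_loci` and `is_retained` are hypothetical.

```python
def parse_loci(loci: str) -> tuple[int, int]:
    """Split a 'start_end' protein feature locus such as '41_232'."""
    start, end = loci.split("_")
    return int(start), int(end)

def is_retained(partner: str, loci: str, bp_loci: int) -> bool:
    """Assumed rule: Hgene keeps features before the breakpoint,
    Tgene keeps features after it (amino acid coordinates)."""
    start, end = parse_loci(loci)
    if partner == "Hgene":
        return end <= bp_loci
    if partner == "Tgene":
        return start >= bp_loci
    raise ValueError(f"unknown partner: {partner}")

# Rows taken from the tables above: (partner, ENST, feature loci, BPloci, note).
rows = [
    ("Hgene", "ENST00000337665", "41_232", 388, "RGSL"),
    ("Hgene", "ENST00000337665", "416_605", 388, "DH"),
    ("Hgene", "ENST00000337665", "647_760", 388, "PH"),
    ("Tgene", "ENST00000354216", "105_112", 307, "ATP"),
]

results = {(enst, note): is_retained(partner, loci, bp)
           for partner, enst, loci, bp, note in rows}

for (enst, note), kept in results.items():
    print(enst, note, "retained" if kept else "not retained")
```

Under this assumption the RGSL domain of ARHGEF1 is kept, while the DH and PH domains and the UCKL1 ATP-binding site fall on the lost side of the breakpoint, matching the two tables.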



Fusion Gene Sequence for ARHGEF1-UCKL1


For in-frame fusion transcripts, we provide the fusion transcript sequences and the fusion amino acid sequences. To obtain the fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among all predicted ones.
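
The idea of choosing the longest ORF can be illustrated with a small forward-strand scan. This is a simplified sketch, not NCBI ORFfinder: it only considers the three forward reading frames, requires an ATG start and a TAA/TAG/TGA stop, and the function name `longest_orf` and the toy sequence are made up for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def longest_orf(seq: str) -> tuple[int, int]:
    """Return (start, end) of the longest ATG-to-stop ORF on the forward strand.

    Coordinates are 0-based; end is exclusive and includes the stop codon.
    Returns (0, 0) when no complete ORF is found.
    """
    seq = seq.upper()
    best = (0, 0)
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first start codon of a new candidate ORF
            elif start is not None and codon in STOP_CODONS:
                if i + 3 - start > best[1] - best[0]:
                    best = (start, i + 3)
                start = None  # look for the next ORF in this frame
    return best

# Toy transcript (hypothetical): one ORF, ATG AAA TTT TAG, starting at index 2.
toy = "CCATGAAATTTTAGGG"
orf_start, orf_end = longest_orf(toy)
print(orf_start, orf_end, toy[orf_start:orf_end])
```

The returned coordinates could then be used to slice the transcript and translate the ORF into the fusion amino acid sequence.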
>6261_6261_1_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000337665_UCKL1_chr20_62572561_ENST00000354216_length(transcript)=2053nt_BP=1186nt
CTTCGGTTCCGGTGGCGGCGATGGCTTCTCTTTCCACCTGGAGCAGCCCTGCAGAGCCCAGGGAGATGGAAGACTTCGCCCGAGGGGCGG
CCTCCCCAGGCCCCTCCCGGCCTGGCCTGGTTCCCGTCAGCATCATCGGGGCTGAGGATGAGGATTTTGAGAACGAGCTGGAGACAAACT
CAGAAGAGCAAAACAGCCAGTTCCAGAGCCTGGAGCAGGTGAAGCGGCGCCCAGCCCACCTCATGGCCCTCCTGCAGCACGTGGCCCTGC
AGTTTGAGCCAGGACCCCTGCTTTGCTGTCTGCATGCCGACATGCTGGGCTCACTGGGCCCCAAGGAGGCCAAGAAGGCCTTCCTGGACT
TCTACCACAGCTTCCTGGAGAAGACAGCGGTTCTCCGGGTGCCGGTCCCTCCCAACGTCGCCTTTGAACTTGACCGCACTAGGGCTGACC
TCATCTCCGAGGATGTCCAGCGGCGGTTCGTGCAGGAGGTGGTGCAAAGCCAGCAGGTAGCCGTGGGCCGGCAGCTGGAGGACTTCCGTT
CCAAGCGGCTCATGGGCATGACGCCCTGGGAGCAGGAGCTGGCCCAGCTGGAGGCTTGGGTTGGGCGGGACCGAGCCAGCTACGAGGCCC
GGGAGCGGCACGTGGCGGAGCGGCTGCTCATGCACCTGGAGGAGATGCAACATACCATCTCTACCGACGAAGAAAAGAGTGCTGCCGTGG
TCAACGCCATTGGCCTGTACATGCGCCACCTTGGGGTGCGGACCAAGAGTGGAGACAAGAAGTCGGGGAGGAACTTCTTCCGGAAAAAGG
TGATGGGGAACCGGCGGTCGGACGAGCCTGCCAAGACCAAGAAGGGGCTGAGCAGCATCCTGGATGCCGCCCGCTGGAACCGGGGAGAGC
CCCAGGTTCCAGATTTTCGACACCTCAAAGCAGAGGTTGATGCCGAGAAGCCAGGTGCTACAGACCGGAAGGGAGGCGTGGGGATGCCCT
CTCGGGACCGGAATATCGGGGCTCCTGGGCAGGACACCCCTGGAGTCTCTCTGCACCCTCTGTCCCTGGACAGCCCAGACCGGGAACCAG
GTGCTGACGCCCCCCTGGAGCTGGGGGACTCATCCCCGCAGGGCCCAATGAGCCTGGAGTCCTTGGCGCCCCCAGAGAGTACCGACGAGG
GGGCCGAAACCGAGAGGGCTGCGCTGGCCTCGGCACACCAGTGCCACCCGCTGCCCCGGACGCTGAGCGTCCTGAAGAGCACGCCGCAGG
TACGGGGCATGCACACCATCATCAGGGACAAGGAGACCAGTCGCGACGAGTTCATCTTCTACTCCAAGAGACTGATGCGGCTGCTCATCG
AGCACGCGCTCTCCTTCCTGCCCTTTCAGGACTGCGTCGTACAGACCCCGCAGGGGCAGGACTATGCGGGCAAGTGCTATGCGGGGAAGC
AGATCACCGGTGTGTCCATTCTGCGCGCCGGTGAAACCATGGAGCCCGCGCTGCGCGCTGTGTGCAAAGACGTGCGCATCGGCACCATCC
TCATCCAGACCAACCAGCTTACCGGGGAGCCCGAGCTCCACTACCTGAGGCTGCCCAAGGACATCAGCGATGACCACGTGATCCTCATGG
ACTGCACCGTGTCCACGGGCGCGGCGGCCATGATGGCAGTGCGCGTGCTCCTGGACCACGACGTGCCTGAGGACAAGATCTTTTTGCTGT
CGCTGCTCATGGCAGAGATGGGCGTGCACTCAGTGGCCTATGCATTTCCGCGAGTGAGAATCATCACCACGGCGGTGGACAAGCGGGTCA
ATGACCTTTTCCGCATCATCCCAGGCATTGGGAACTTTGGCGACCGCTACTTTGGGACAGACGCGGTCCCCGATGGCAGTGACGAGGAGG
AAGTGGCCTACACGGGTTAGCTGCCCAGTGAGCCATCCCGTCCCCACCACCCTCCTCCTGCCTCCTGACCCAGGACTGCTGAATACAAAG

>6261_6261_1_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000337665_UCKL1_chr20_62572561_ENST00000354216_length(amino acids)=629AA_BP=388
MASLSTWSSPAEPREMEDFARGAASPGPSRPGLVPVSIIGAEDEDFENELETNSEEQNSQFQSLEQVKRRPAHLMALLQHVALQFEPGPL
LCCLHADMLGSLGPKEAKKAFLDFYHSFLEKTAVLRVPVPPNVAFELDRTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLMGM
TPWEQELAQLEAWVGRDRASYEARERHVAERLLMHLEEMQHTISTDEEKSAAVVNAIGLYMRHLGVRTKSGDKKSGRNFFRKKVMGNRRS
DEPAKTKKGLSSILDAARWNRGEPQVPDFRHLKAEVDAEKPGATDRKGGVGMPSRDRNIGAPGQDTPGVSLHPLSLDSPDREPGADAPLE
LGDSSPQGPMSLESLAPPESTDEGAETERAALASAHQCHPLPRTLSVLKSTPQVRGMHTIIRDKETSRDEFIFYSKRLMRLLIEHALSFL
PFQDCVVQTPQGQDYAGKCYAGKQITGVSILRAGETMEPALRAVCKDVRIGTILIQTNQLTGEPELHYLRLPKDISDDHVILMDCTVSTG

--------------------------------------------------------------
>6261_6261_2_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000337665_UCKL1_chr20_62572561_ENST00000369892_length(transcript)=2046nt_BP=1186nt
CTTCGGTTCCGGTGGCGGCGATGGCTTCTCTTTCCACCTGGAGCAGCCCTGCAGAGCCCAGGGAGATGGAAGACTTCGCCCGAGGGGCGG
CCTCCCCAGGCCCCTCCCGGCCTGGCCTGGTTCCCGTCAGCATCATCGGGGCTGAGGATGAGGATTTTGAGAACGAGCTGGAGACAAACT
CAGAAGAGCAAAACAGCCAGTTCCAGAGCCTGGAGCAGGTGAAGCGGCGCCCAGCCCACCTCATGGCCCTCCTGCAGCACGTGGCCCTGC
AGTTTGAGCCAGGACCCCTGCTTTGCTGTCTGCATGCCGACATGCTGGGCTCACTGGGCCCCAAGGAGGCCAAGAAGGCCTTCCTGGACT
TCTACCACAGCTTCCTGGAGAAGACAGCGGTTCTCCGGGTGCCGGTCCCTCCCAACGTCGCCTTTGAACTTGACCGCACTAGGGCTGACC
TCATCTCCGAGGATGTCCAGCGGCGGTTCGTGCAGGAGGTGGTGCAAAGCCAGCAGGTAGCCGTGGGCCGGCAGCTGGAGGACTTCCGTT
CCAAGCGGCTCATGGGCATGACGCCCTGGGAGCAGGAGCTGGCCCAGCTGGAGGCTTGGGTTGGGCGGGACCGAGCCAGCTACGAGGCCC
GGGAGCGGCACGTGGCGGAGCGGCTGCTCATGCACCTGGAGGAGATGCAACATACCATCTCTACCGACGAAGAAAAGAGTGCTGCCGTGG
TCAACGCCATTGGCCTGTACATGCGCCACCTTGGGGTGCGGACCAAGAGTGGAGACAAGAAGTCGGGGAGGAACTTCTTCCGGAAAAAGG
TGATGGGGAACCGGCGGTCGGACGAGCCTGCCAAGACCAAGAAGGGGCTGAGCAGCATCCTGGATGCCGCCCGCTGGAACCGGGGAGAGC
CCCAGGTTCCAGATTTTCGACACCTCAAAGCAGAGGTTGATGCCGAGAAGCCAGGTGCTACAGACCGGAAGGGAGGCGTGGGGATGCCCT
CTCGGGACCGGAATATCGGGGCTCCTGGGCAGGACACCCCTGGAGTCTCTCTGCACCCTCTGTCCCTGGACAGCCCAGACCGGGAACCAG
GTGCTGACGCCCCCCTGGAGCTGGGGGACTCATCCCCGCAGGGCCCAATGAGCCTGGAGTCCTTGGCGCCCCCAGAGAGTACCGACGAGG
GGGCCGAAACCGAGAGGGAACTTTGGCGACCGCTACTTTGGGACAGACGCGGTCCCCGATGGCAGTGACGAGGAGGAAGTGGCCTACACG
GGTTAGCTGCCCAGTGAGCCATCCCGTCCCCACCACCCTCCTCCTGCCTCCTGACCCAGGACTGCTGAATACAAAGATGTTAATTTTTAA
AATGTTACTAGTATAATTTATTCTATGCATTTTATAAAATAAATAAAGCTTTGACCACGACGTGCCTGAGGACAAGATCTTTTTGCTGTC
GCTGCTCATGGCAGAGATGGGCGTGCACTCAGTGGCCTATGCATTTCCGCGAGTGAGAATCATCACCACGGCGGTGGACAAGCGGGTCAA
TGACCTTTTCCGCATCATCCCAGGCATTGCTCCACTACCTGAGGCTGCCCAAGGACATCAGCGATGACCACGTGATCCTCATGGACTGCA
CCGTGTCCACGGGCGCGGCGGCCATGATGGCAGTGCGCGTGCTCCTGATCACCGGTGTGTCCATTCTGCGCGCCGGTGAAACCATGGAGC
CCGCGCTGCGCGCTGTGTGCAAAGACGTGCGCATCGGCACCATCCTCATCCAGACCAACCAGCTTACCGGGGAGCCCGAGGACTGCGTCG
TACAGACCCCGCAGGGGCAGGACTATGCGGGCAAGTGCTATGCGGGGAAGCAGGGACAAGGAGACCAGTCGCGACGAGTTCATCTTCTAC
TCCAAGAGACTGATGCGGCTGCTCATCGAGCACGCGCTCTCCTTCCTGCCCTTTCAGGGCTGCGCTGGCCTCGGCACACCAGTGCCACCC

>6261_6261_2_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000337665_UCKL1_chr20_62572561_ENST00000369892_length(amino acids)=405AA_BP=
MASLSTWSSPAEPREMEDFARGAASPGPSRPGLVPVSIIGAEDEDFENELETNSEEQNSQFQSLEQVKRRPAHLMALLQHVALQFEPGPL
LCCLHADMLGSLGPKEAKKAFLDFYHSFLEKTAVLRVPVPPNVAFELDRTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLMGM
TPWEQELAQLEAWVGRDRASYEARERHVAERLLMHLEEMQHTISTDEEKSAAVVNAIGLYMRHLGVRTKSGDKKSGRNFFRKKVMGNRRS
DEPAKTKKGLSSILDAARWNRGEPQVPDFRHLKAEVDAEKPGATDRKGGVGMPSRDRNIGAPGQDTPGVSLHPLSLDSPDREPGADAPLE

--------------------------------------------------------------
>6261_6261_3_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000337665_UCKL1_chr20_62572561_ENST00000369908_length(transcript)=1962nt_BP=1186nt
CTTCGGTTCCGGTGGCGGCGATGGCTTCTCTTTCCACCTGGAGCAGCCCTGCAGAGCCCAGGGAGATGGAAGACTTCGCCCGAGGGGCGG
CCTCCCCAGGCCCCTCCCGGCCTGGCCTGGTTCCCGTCAGCATCATCGGGGCTGAGGATGAGGATTTTGAGAACGAGCTGGAGACAAACT
CAGAAGAGCAAAACAGCCAGTTCCAGAGCCTGGAGCAGGTGAAGCGGCGCCCAGCCCACCTCATGGCCCTCCTGCAGCACGTGGCCCTGC
AGTTTGAGCCAGGACCCCTGCTTTGCTGTCTGCATGCCGACATGCTGGGCTCACTGGGCCCCAAGGAGGCCAAGAAGGCCTTCCTGGACT
TCTACCACAGCTTCCTGGAGAAGACAGCGGTTCTCCGGGTGCCGGTCCCTCCCAACGTCGCCTTTGAACTTGACCGCACTAGGGCTGACC
TCATCTCCGAGGATGTCCAGCGGCGGTTCGTGCAGGAGGTGGTGCAAAGCCAGCAGGTAGCCGTGGGCCGGCAGCTGGAGGACTTCCGTT
CCAAGCGGCTCATGGGCATGACGCCCTGGGAGCAGGAGCTGGCCCAGCTGGAGGCTTGGGTTGGGCGGGACCGAGCCAGCTACGAGGCCC
GGGAGCGGCACGTGGCGGAGCGGCTGCTCATGCACCTGGAGGAGATGCAACATACCATCTCTACCGACGAAGAAAAGAGTGCTGCCGTGG
TCAACGCCATTGGCCTGTACATGCGCCACCTTGGGGTGCGGACCAAGAGTGGAGACAAGAAGTCGGGGAGGAACTTCTTCCGGAAAAAGG
TGATGGGGAACCGGCGGTCGGACGAGCCTGCCAAGACCAAGAAGGGGCTGAGCAGCATCCTGGATGCCGCCCGCTGGAACCGGGGAGAGC
CCCAGGTTCCAGATTTTCGACACCTCAAAGCAGAGGTTGATGCCGAGAAGCCAGGTGCTACAGACCGGAAGGGAGGCGTGGGGATGCCCT
CTCGGGACCGGAATATCGGGGCTCCTGGGCAGGACACCCCTGGAGTCTCTCTGCACCCTCTGTCCCTGGACAGCCCAGACCGGGAACCAG
GTGCTGACGCCCCCCTGGAGCTGGGGGACTCATCCCCGCAGGGCCCAATGAGCCTGGAGTCCTTGGCGCCCCCAGAGAGTACCGACGAGG
GGGCCGAAACCGAGAGGGCTGCGCTGGCCTCGGCACACCAGTGCCACCCGCTGCCCCGGACGCTGAGCGTCCTGAAGAGCACGCCGCAGG
TACGGGGCATGCACACCATCATCAGGGACAAGGAGACCAGTCGCGACGAGTTCATCTTCTACTCCAAGAGACTGATGCGGCTGCTCATCG
AGCACGCGCTCTCCTTCCTGCCCTTTCAGGACTGCGTCGTACAGACCCCGCAGGGGCAGGACTATGCGGGCAAGTGCTATGCGGGGAAGC
AGATCACCGGTGTGTCCATTCTGCGCGCCGGTGAAACCATGGAGCCCGCGCTGCGCGCTGTGTGCAAAGACGTGCGCATCGGCACCATCC
TCATCCAGACCAACCAGCTTACCGGGGAGCCCGAGCTCCACTACCTGAGGCTGCCCAAGGACATCAGCGATGACCACGTGATCCTCATGG
ACTGCACCGTGTCCACGGGCGCGGCGGCCATGATGGCAGTGCGCGTGCTCCTGGACCACGACGTGCCTGAGGACAAGATCTTTTTGCTGT
CGCTGCTCATGGCAGAGATGGGCGTGCACTCAGTGGCCTATGCATTTCCGCGAGTGAGAATCATCACCACGGCGGTGGACAAGCGGGTCA
ATGACCTTTTCCGCATCATCCCAGGCATTGGGAACTTTGGCGACCGCTACTTTGGGACAGACGCGGTCCCCGATGGCAGTGACGAGGAGG

>6261_6261_3_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000337665_UCKL1_chr20_62572561_ENST00000369908_length(amino acids)=629AA_BP=388
MASLSTWSSPAEPREMEDFARGAASPGPSRPGLVPVSIIGAEDEDFENELETNSEEQNSQFQSLEQVKRRPAHLMALLQHVALQFEPGPL
LCCLHADMLGSLGPKEAKKAFLDFYHSFLEKTAVLRVPVPPNVAFELDRTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLMGM
TPWEQELAQLEAWVGRDRASYEARERHVAERLLMHLEEMQHTISTDEEKSAAVVNAIGLYMRHLGVRTKSGDKKSGRNFFRKKVMGNRRS
DEPAKTKKGLSSILDAARWNRGEPQVPDFRHLKAEVDAEKPGATDRKGGVGMPSRDRNIGAPGQDTPGVSLHPLSLDSPDREPGADAPLE
LGDSSPQGPMSLESLAPPESTDEGAETERAALASAHQCHPLPRTLSVLKSTPQVRGMHTIIRDKETSRDEFIFYSKRLMRLLIEHALSFL
PFQDCVVQTPQGQDYAGKCYAGKQITGVSILRAGETMEPALRAVCKDVRIGTILIQTNQLTGEPELHYLRLPKDISDDHVILMDCTVSTG

--------------------------------------------------------------
>6261_6261_4_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000347545_UCKL1_chr20_62572561_ENST00000354216_length(transcript)=1973nt_BP=1106nt
CCAGGGCCCGGGATCGCCGAGCCCGACCTCGGGCGCCCCGCCGGTCACCTCCGCGCGGACACCAGCCCTGCAGAGCCCAGGGAGATGGAA
GACTTCGCCCGAGGGGCGGCCTCCCCAGGCCCCTCCCGGCCTGGCCTGGTTCCCGTCAGCATCATCGGGGCTGAGGATGAGGATTTTGAG
AACGAGCTGGAGACAAACTCAGAAGAGCAAAACAGCCAGTTCCAGAGCCTGGAGCAGGTGAAGCGGCGCCCAGCCCACCTCATGGCCCTC
CTGCAGCACGTGGCCCTGCAGTTTGAGCCAGGACCCCTGGTTCTCCGGGTGCCGGTCCCTCCCAACGTCGCCTTTGAACTTGACCGCACT
AGGGCTGACCTCATCTCCGAGGATGTCCAGCGGCGGTTCGTGCAGGAGGTGGTGCAAAGCCAGCAGGTAGCCGTGGGCCGGCAGCTGGAG
GACTTCCGTTCCAAGCGGCTCATGGGCATGACGCCCTGGGAGCAGGAGCTGGCCCAGCTGGAGGCTTGGGTTGGGCGGGACCGAGCCAGC
TACGAGGCCCGGGAGCGGCACGTGGCGGAGCGGCTGCTCATGCACCTGGAGGAGATGCAACATACCATCTCTACCGACGAAGAAAAGAGT
GCTGCCGTGGTCAACGCCATTGGCCTGTACATGCGCCACCTTGGGGTGCGGACCAAGAGTGGAGACAAGAAGTCGGGGAGGAACTTCTTC
CGGAAAAAGGTGATGGGGAACCGGCGGTCGGACGAGCCTGCCAAGACCAAGAAGGGGCTGAGCAGCATCCTGGATGCCGCCCGCTGGAAC
CGGGGAGAGCCCCAGGTTCCAGATTTTCGACACCTCAAAGCAGAGGTTGATGCCGAGAAGCCAGGTGCTACAGACCGGAAGGGAGGCGTG
GGGATGCCCTCTCGGGACCGGAATATCGGGGCTCCTGGGCAGGACACCCCTGGAGTCTCTCTGCACCCTCTGTCCCTGGACAGCCCAGAC
CGGGAACCAGGTGCTGACGCCCCCCTGGAGCTGGGGGACTCATCCCCGCAGGGCCCAATGAGCCTGGAGTCCTTGGCGCCCCCAGAGAGT
ACCGACGAGGGGGCCGAAACCGAGAGGGCTGCGCTGGCCTCGGCACACCAGTGCCACCCGCTGCCCCGGACGCTGAGCGTCCTGAAGAGC
ACGCCGCAGGTACGGGGCATGCACACCATCATCAGGGACAAGGAGACCAGTCGCGACGAGTTCATCTTCTACTCCAAGAGACTGATGCGG
CTGCTCATCGAGCACGCGCTCTCCTTCCTGCCCTTTCAGGACTGCGTCGTACAGACCCCGCAGGGGCAGGACTATGCGGGCAAGTGCTAT
GCGGGGAAGCAGATCACCGGTGTGTCCATTCTGCGCGCCGGTGAAACCATGGAGCCCGCGCTGCGCGCTGTGTGCAAAGACGTGCGCATC
GGCACCATCCTCATCCAGACCAACCAGCTTACCGGGGAGCCCGAGCTCCACTACCTGAGGCTGCCCAAGGACATCAGCGATGACCACGTG
ATCCTCATGGACTGCACCGTGTCCACGGGCGCGGCGGCCATGATGGCAGTGCGCGTGCTCCTGGACCACGACGTGCCTGAGGACAAGATC
TTTTTGCTGTCGCTGCTCATGGCAGAGATGGGCGTGCACTCAGTGGCCTATGCATTTCCGCGAGTGAGAATCATCACCACGGCGGTGGAC
AAGCGGGTCAATGACCTTTTCCGCATCATCCCAGGCATTGGGAACTTTGGCGACCGCTACTTTGGGACAGACGCGGTCCCCGATGGCAGT
GACGAGGAGGAAGTGGCCTACACGGGTTAGCTGCCCAGTGAGCCATCCCGTCCCCACCACCCTCCTCCTGCCTCCTGACCCAGGACTGCT

>6261_6261_4_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000347545_UCKL1_chr20_62572561_ENST00000354216_length(amino acids)=581AA_BP=340
MEDFARGAASPGPSRPGLVPVSIIGAEDEDFENELETNSEEQNSQFQSLEQVKRRPAHLMALLQHVALQFEPGPLVLRVPVPPNVAFELD
RTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLMGMTPWEQELAQLEAWVGRDRASYEARERHVAERLLMHLEEMQHTISTDEE
KSAAVVNAIGLYMRHLGVRTKSGDKKSGRNFFRKKVMGNRRSDEPAKTKKGLSSILDAARWNRGEPQVPDFRHLKAEVDAEKPGATDRKG
GVGMPSRDRNIGAPGQDTPGVSLHPLSLDSPDREPGADAPLELGDSSPQGPMSLESLAPPESTDEGAETERAALASAHQCHPLPRTLSVL
KSTPQVRGMHTIIRDKETSRDEFIFYSKRLMRLLIEHALSFLPFQDCVVQTPQGQDYAGKCYAGKQITGVSILRAGETMEPALRAVCKDV
RIGTILIQTNQLTGEPELHYLRLPKDISDDHVILMDCTVSTGAAAMMAVRVLLDHDVPEDKIFLLSLLMAEMGVHSVAYAFPRVRIITTA

--------------------------------------------------------------
>6261_6261_5_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000347545_UCKL1_chr20_62572561_ENST00000369892_length(transcript)=1966nt_BP=1106nt
CCAGGGCCCGGGATCGCCGAGCCCGACCTCGGGCGCCCCGCCGGTCACCTCCGCGCGGACACCAGCCCTGCAGAGCCCAGGGAGATGGAA
GACTTCGCCCGAGGGGCGGCCTCCCCAGGCCCCTCCCGGCCTGGCCTGGTTCCCGTCAGCATCATCGGGGCTGAGGATGAGGATTTTGAG
AACGAGCTGGAGACAAACTCAGAAGAGCAAAACAGCCAGTTCCAGAGCCTGGAGCAGGTGAAGCGGCGCCCAGCCCACCTCATGGCCCTC
CTGCAGCACGTGGCCCTGCAGTTTGAGCCAGGACCCCTGGTTCTCCGGGTGCCGGTCCCTCCCAACGTCGCCTTTGAACTTGACCGCACT
AGGGCTGACCTCATCTCCGAGGATGTCCAGCGGCGGTTCGTGCAGGAGGTGGTGCAAAGCCAGCAGGTAGCCGTGGGCCGGCAGCTGGAG
GACTTCCGTTCCAAGCGGCTCATGGGCATGACGCCCTGGGAGCAGGAGCTGGCCCAGCTGGAGGCTTGGGTTGGGCGGGACCGAGCCAGC
TACGAGGCCCGGGAGCGGCACGTGGCGGAGCGGCTGCTCATGCACCTGGAGGAGATGCAACATACCATCTCTACCGACGAAGAAAAGAGT
GCTGCCGTGGTCAACGCCATTGGCCTGTACATGCGCCACCTTGGGGTGCGGACCAAGAGTGGAGACAAGAAGTCGGGGAGGAACTTCTTC
CGGAAAAAGGTGATGGGGAACCGGCGGTCGGACGAGCCTGCCAAGACCAAGAAGGGGCTGAGCAGCATCCTGGATGCCGCCCGCTGGAAC
CGGGGAGAGCCCCAGGTTCCAGATTTTCGACACCTCAAAGCAGAGGTTGATGCCGAGAAGCCAGGTGCTACAGACCGGAAGGGAGGCGTG
GGGATGCCCTCTCGGGACCGGAATATCGGGGCTCCTGGGCAGGACACCCCTGGAGTCTCTCTGCACCCTCTGTCCCTGGACAGCCCAGAC
CGGGAACCAGGTGCTGACGCCCCCCTGGAGCTGGGGGACTCATCCCCGCAGGGCCCAATGAGCCTGGAGTCCTTGGCGCCCCCAGAGAGT
ACCGACGAGGGGGCCGAAACCGAGAGGGAACTTTGGCGACCGCTACTTTGGGACAGACGCGGTCCCCGATGGCAGTGACGAGGAGGAAGT
GGCCTACACGGGTTAGCTGCCCAGTGAGCCATCCCGTCCCCACCACCCTCCTCCTGCCTCCTGACCCAGGACTGCTGAATACAAAGATGT
TAATTTTTAAAATGTTACTAGTATAATTTATTCTATGCATTTTATAAAATAAATAAAGCTTTGACCACGACGTGCCTGAGGACAAGATCT
TTTTGCTGTCGCTGCTCATGGCAGAGATGGGCGTGCACTCAGTGGCCTATGCATTTCCGCGAGTGAGAATCATCACCACGGCGGTGGACA
AGCGGGTCAATGACCTTTTCCGCATCATCCCAGGCATTGCTCCACTACCTGAGGCTGCCCAAGGACATCAGCGATGACCACGTGATCCTC
ATGGACTGCACCGTGTCCACGGGCGCGGCGGCCATGATGGCAGTGCGCGTGCTCCTGATCACCGGTGTGTCCATTCTGCGCGCCGGTGAA
ACCATGGAGCCCGCGCTGCGCGCTGTGTGCAAAGACGTGCGCATCGGCACCATCCTCATCCAGACCAACCAGCTTACCGGGGAGCCCGAG
GACTGCGTCGTACAGACCCCGCAGGGGCAGGACTATGCGGGCAAGTGCTATGCGGGGAAGCAGGGACAAGGAGACCAGTCGCGACGAGTT
CATCTTCTACTCCAAGAGACTGATGCGGCTGCTCATCGAGCACGCGCTCTCCTTCCTGCCCTTTCAGGGCTGCGCTGGCCTCGGCACACC

>6261_6261_5_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000347545_UCKL1_chr20_62572561_ENST00000369892_length(amino acids)=357AA_BP=
MEDFARGAASPGPSRPGLVPVSIIGAEDEDFENELETNSEEQNSQFQSLEQVKRRPAHLMALLQHVALQFEPGPLVLRVPVPPNVAFELD
RTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLMGMTPWEQELAQLEAWVGRDRASYEARERHVAERLLMHLEEMQHTISTDEE
KSAAVVNAIGLYMRHLGVRTKSGDKKSGRNFFRKKVMGNRRSDEPAKTKKGLSSILDAARWNRGEPQVPDFRHLKAEVDAEKPGATDRKG

--------------------------------------------------------------
>6261_6261_6_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000347545_UCKL1_chr20_62572561_ENST00000369908_length(transcript)=1882nt_BP=1106nt
CCAGGGCCCGGGATCGCCGAGCCCGACCTCGGGCGCCCCGCCGGTCACCTCCGCGCGGACACCAGCCCTGCAGAGCCCAGGGAGATGGAA
GACTTCGCCCGAGGGGCGGCCTCCCCAGGCCCCTCCCGGCCTGGCCTGGTTCCCGTCAGCATCATCGGGGCTGAGGATGAGGATTTTGAG
AACGAGCTGGAGACAAACTCAGAAGAGCAAAACAGCCAGTTCCAGAGCCTGGAGCAGGTGAAGCGGCGCCCAGCCCACCTCATGGCCCTC
CTGCAGCACGTGGCCCTGCAGTTTGAGCCAGGACCCCTGGTTCTCCGGGTGCCGGTCCCTCCCAACGTCGCCTTTGAACTTGACCGCACT
AGGGCTGACCTCATCTCCGAGGATGTCCAGCGGCGGTTCGTGCAGGAGGTGGTGCAAAGCCAGCAGGTAGCCGTGGGCCGGCAGCTGGAG
GACTTCCGTTCCAAGCGGCTCATGGGCATGACGCCCTGGGAGCAGGAGCTGGCCCAGCTGGAGGCTTGGGTTGGGCGGGACCGAGCCAGC
TACGAGGCCCGGGAGCGGCACGTGGCGGAGCGGCTGCTCATGCACCTGGAGGAGATGCAACATACCATCTCTACCGACGAAGAAAAGAGT
GCTGCCGTGGTCAACGCCATTGGCCTGTACATGCGCCACCTTGGGGTGCGGACCAAGAGTGGAGACAAGAAGTCGGGGAGGAACTTCTTC
CGGAAAAAGGTGATGGGGAACCGGCGGTCGGACGAGCCTGCCAAGACCAAGAAGGGGCTGAGCAGCATCCTGGATGCCGCCCGCTGGAAC
CGGGGAGAGCCCCAGGTTCCAGATTTTCGACACCTCAAAGCAGAGGTTGATGCCGAGAAGCCAGGTGCTACAGACCGGAAGGGAGGCGTG
GGGATGCCCTCTCGGGACCGGAATATCGGGGCTCCTGGGCAGGACACCCCTGGAGTCTCTCTGCACCCTCTGTCCCTGGACAGCCCAGAC
CGGGAACCAGGTGCTGACGCCCCCCTGGAGCTGGGGGACTCATCCCCGCAGGGCCCAATGAGCCTGGAGTCCTTGGCGCCCCCAGAGAGT
ACCGACGAGGGGGCCGAAACCGAGAGGGCTGCGCTGGCCTCGGCACACCAGTGCCACCCGCTGCCCCGGACGCTGAGCGTCCTGAAGAGC
ACGCCGCAGGTACGGGGCATGCACACCATCATCAGGGACAAGGAGACCAGTCGCGACGAGTTCATCTTCTACTCCAAGAGACTGATGCGG
CTGCTCATCGAGCACGCGCTCTCCTTCCTGCCCTTTCAGGACTGCGTCGTACAGACCCCGCAGGGGCAGGACTATGCGGGCAAGTGCTAT
GCGGGGAAGCAGATCACCGGTGTGTCCATTCTGCGCGCCGGTGAAACCATGGAGCCCGCGCTGCGCGCTGTGTGCAAAGACGTGCGCATC
GGCACCATCCTCATCCAGACCAACCAGCTTACCGGGGAGCCCGAGCTCCACTACCTGAGGCTGCCCAAGGACATCAGCGATGACCACGTG
ATCCTCATGGACTGCACCGTGTCCACGGGCGCGGCGGCCATGATGGCAGTGCGCGTGCTCCTGGACCACGACGTGCCTGAGGACAAGATC
TTTTTGCTGTCGCTGCTCATGGCAGAGATGGGCGTGCACTCAGTGGCCTATGCATTTCCGCGAGTGAGAATCATCACCACGGCGGTGGAC
AAGCGGGTCAATGACCTTTTCCGCATCATCCCAGGCATTGGGAACTTTGGCGACCGCTACTTTGGGACAGACGCGGTCCCCGATGGCAGT

>6261_6261_6_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000347545_UCKL1_chr20_62572561_ENST00000369908_length(amino acids)=581AA_BP=340
MEDFARGAASPGPSRPGLVPVSIIGAEDEDFENELETNSEEQNSQFQSLEQVKRRPAHLMALLQHVALQFEPGPLVLRVPVPPNVAFELD
RTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLMGMTPWEQELAQLEAWVGRDRASYEARERHVAERLLMHLEEMQHTISTDEE
KSAAVVNAIGLYMRHLGVRTKSGDKKSGRNFFRKKVMGNRRSDEPAKTKKGLSSILDAARWNRGEPQVPDFRHLKAEVDAEKPGATDRKG
GVGMPSRDRNIGAPGQDTPGVSLHPLSLDSPDREPGADAPLELGDSSPQGPMSLESLAPPESTDEGAETERAALASAHQCHPLPRTLSVL
KSTPQVRGMHTIIRDKETSRDEFIFYSKRLMRLLIEHALSFLPFQDCVVQTPQGQDYAGKCYAGKQITGVSILRAGETMEPALRAVCKDV
RIGTILIQTNQLTGEPELHYLRLPKDISDDHVILMDCTVSTGAAAMMAVRVLLDHDVPEDKIFLLSLLMAEMGVHSVAYAFPRVRIITTA

--------------------------------------------------------------
>6261_6261_7_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000354532_UCKL1_chr20_62572561_ENST00000354216_length(transcript)=2136nt_BP=1269nt
GGGGGCCTCCGGAAAAACGCCCCGACTTCCTGCCCCGCCAGAGCCAGGAAGCGGGAGCCGGGACCCAGGGCCCGGGATCGCCGAGCCCGA
CCTCGGGCGCCCCGCCGGTCACCTCCGCGCGGACACCAGCCCTGCAGAGCCCAGGGAGATGGAAGACTTCGCCCGAGGGGCGGCCTCCCC
AGGCCCCTCCCGGCCTGGCCTGGTTCCCGTCAGCATCATCGGGGCTGAGGATGAGGATTTTGAGAACGAGCTGGAGACAAACTCAGAAGA
GCAAAACAGCCAGTTCCAGAGCCTGGAGCAGGTGAAGCGGCGCCCAGCCCACCTCATGGCCCTCCTGCAGCACGTGGCCCTGCAGTTTGA
GCCAGGACCCCTGCTTTGCTGTCTGCATGCCGACATGCTGGGCTCACTGGGCCCCAAGGAGGCCAAGAAGGCCTTCCTGGACTTCTACCA
CAGCTTCCTGGAGAAGACAGCGGTTCTCCGGGTGCCGGTCCCTCCCAACGTCGCCTTTGAACTTGACCGCACTAGGGCTGACCTCATCTC
CGAGGATGTCCAGCGGCGGTTCGTGCAGGAGGTGGTGCAAAGCCAGCAGGTAGCCGTGGGCCGGCAGCTGGAGGACTTCCGTTCCAAGCG
GCTCATGGGCATGACGCCCTGGGAGCAGGAGCTGGCCCAGCTGGAGGCTTGGGTTGGGCGGGACCGAGCCAGCTACGAGGCCCGGGAGCG
GCACGTGGCGGAGCGGCTGCTCATGCACCTGGAGGAGATGCAACATACCATCTCTACCGACGAAGAAAAGAGTGCTGCCGTGGTCAACGC
CATTGGCCTGTACATGCGCCACCTTGGGGTGCGGACCAAGAGTGGAGACAAGAAGTCGGGGAGGAACTTCTTCCGGAAAAAGGTGATGGG
GAACCGGCGGTCGGACGAGCCTGCCAAGACCAAGAAGGGGCTGAGCAGCATCCTGGATGCCGCCCGCTGGAACCGGGGAGAGCCCCAGGT
TCCAGATTTTCGACACCTCAAAGCAGAGGTTGATGCCGAGAAGCCAGGTGCTACAGACCGGAAGGGAGGCGTGGGGATGCCCTCTCGGGA
CCGGAATATCGGGGCTCCTGGGCAGGACACCCCTGGAGTCTCTCTGCACCCTCTGTCCCTGGACAGCCCAGACCGGGAACCAGGTGCTGA
CGCCCCCCTGGAGCTGGGGGACTCATCCCCGCAGGGCCCAATGAGCCTGGAGTCCTTGGCGCCCCCAGAGAGTACCGACGAGGGGGCCGA
AACCGAGAGGGCTGCGCTGGCCTCGGCACACCAGTGCCACCCGCTGCCCCGGACGCTGAGCGTCCTGAAGAGCACGCCGCAGGTACGGGG
CATGCACACCATCATCAGGGACAAGGAGACCAGTCGCGACGAGTTCATCTTCTACTCCAAGAGACTGATGCGGCTGCTCATCGAGCACGC
GCTCTCCTTCCTGCCCTTTCAGGACTGCGTCGTACAGACCCCGCAGGGGCAGGACTATGCGGGCAAGTGCTATGCGGGGAAGCAGATCAC
CGGTGTGTCCATTCTGCGCGCCGGTGAAACCATGGAGCCCGCGCTGCGCGCTGTGTGCAAAGACGTGCGCATCGGCACCATCCTCATCCA
GACCAACCAGCTTACCGGGGAGCCCGAGCTCCACTACCTGAGGCTGCCCAAGGACATCAGCGATGACCACGTGATCCTCATGGACTGCAC
CGTGTCCACGGGCGCGGCGGCCATGATGGCAGTGCGCGTGCTCCTGGACCACGACGTGCCTGAGGACAAGATCTTTTTGCTGTCGCTGCT
CATGGCAGAGATGGGCGTGCACTCAGTGGCCTATGCATTTCCGCGAGTGAGAATCATCACCACGGCGGTGGACAAGCGGGTCAATGACCT
TTTCCGCATCATCCCAGGCATTGGGAACTTTGGCGACCGCTACTTTGGGACAGACGCGGTCCCCGATGGCAGTGACGAGGAGGAAGTGGC
CTACACGGGTTAGCTGCCCAGTGAGCCATCCCGTCCCCACCACCCTCCTCCTGCCTCCTGACCCAGGACTGCTGAATACAAAGATGTTAA

>6261_6261_7_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000354532_UCKL1_chr20_62572561_ENST00000354216_length(amino acids)=614AA_BP=373
MEDFARGAASPGPSRPGLVPVSIIGAEDEDFENELETNSEEQNSQFQSLEQVKRRPAHLMALLQHVALQFEPGPLLCCLHADMLGSLGPK
EAKKAFLDFYHSFLEKTAVLRVPVPPNVAFELDRTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLMGMTPWEQELAQLEAWVG
RDRASYEARERHVAERLLMHLEEMQHTISTDEEKSAAVVNAIGLYMRHLGVRTKSGDKKSGRNFFRKKVMGNRRSDEPAKTKKGLSSILD
AARWNRGEPQVPDFRHLKAEVDAEKPGATDRKGGVGMPSRDRNIGAPGQDTPGVSLHPLSLDSPDREPGADAPLELGDSSPQGPMSLESL
APPESTDEGAETERAALASAHQCHPLPRTLSVLKSTPQVRGMHTIIRDKETSRDEFIFYSKRLMRLLIEHALSFLPFQDCVVQTPQGQDY
AGKCYAGKQITGVSILRAGETMEPALRAVCKDVRIGTILIQTNQLTGEPELHYLRLPKDISDDHVILMDCTVSTGAAAMMAVRVLLDHDV

--------------------------------------------------------------
>6261_6261_8_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000354532_UCKL1_chr20_62572561_ENST00000369892_length(transcript)=2129nt_BP=1269nt
GGGGGCCTCCGGAAAAACGCCCCGACTTCCTGCCCCGCCAGAGCCAGGAAGCGGGAGCCGGGACCCAGGGCCCGGGATCGCCGAGCCCGA
CCTCGGGCGCCCCGCCGGTCACCTCCGCGCGGACACCAGCCCTGCAGAGCCCAGGGAGATGGAAGACTTCGCCCGAGGGGCGGCCTCCCC
AGGCCCCTCCCGGCCTGGCCTGGTTCCCGTCAGCATCATCGGGGCTGAGGATGAGGATTTTGAGAACGAGCTGGAGACAAACTCAGAAGA
GCAAAACAGCCAGTTCCAGAGCCTGGAGCAGGTGAAGCGGCGCCCAGCCCACCTCATGGCCCTCCTGCAGCACGTGGCCCTGCAGTTTGA
GCCAGGACCCCTGCTTTGCTGTCTGCATGCCGACATGCTGGGCTCACTGGGCCCCAAGGAGGCCAAGAAGGCCTTCCTGGACTTCTACCA
CAGCTTCCTGGAGAAGACAGCGGTTCTCCGGGTGCCGGTCCCTCCCAACGTCGCCTTTGAACTTGACCGCACTAGGGCTGACCTCATCTC
CGAGGATGTCCAGCGGCGGTTCGTGCAGGAGGTGGTGCAAAGCCAGCAGGTAGCCGTGGGCCGGCAGCTGGAGGACTTCCGTTCCAAGCG
GCTCATGGGCATGACGCCCTGGGAGCAGGAGCTGGCCCAGCTGGAGGCTTGGGTTGGGCGGGACCGAGCCAGCTACGAGGCCCGGGAGCG
GCACGTGGCGGAGCGGCTGCTCATGCACCTGGAGGAGATGCAACATACCATCTCTACCGACGAAGAAAAGAGTGCTGCCGTGGTCAACGC
CATTGGCCTGTACATGCGCCACCTTGGGGTGCGGACCAAGAGTGGAGACAAGAAGTCGGGGAGGAACTTCTTCCGGAAAAAGGTGATGGG
GAACCGGCGGTCGGACGAGCCTGCCAAGACCAAGAAGGGGCTGAGCAGCATCCTGGATGCCGCCCGCTGGAACCGGGGAGAGCCCCAGGT
TCCAGATTTTCGACACCTCAAAGCAGAGGTTGATGCCGAGAAGCCAGGTGCTACAGACCGGAAGGGAGGCGTGGGGATGCCCTCTCGGGA
CCGGAATATCGGGGCTCCTGGGCAGGACACCCCTGGAGTCTCTCTGCACCCTCTGTCCCTGGACAGCCCAGACCGGGAACCAGGTGCTGA
CGCCCCCCTGGAGCTGGGGGACTCATCCCCGCAGGGCCCAATGAGCCTGGAGTCCTTGGCGCCCCCAGAGAGTACCGACGAGGGGGCCGA
AACCGAGAGGGAACTTTGGCGACCGCTACTTTGGGACAGACGCGGTCCCCGATGGCAGTGACGAGGAGGAAGTGGCCTACACGGGTTAGC
TGCCCAGTGAGCCATCCCGTCCCCACCACCCTCCTCCTGCCTCCTGACCCAGGACTGCTGAATACAAAGATGTTAATTTTTAAAATGTTA
CTAGTATAATTTATTCTATGCATTTTATAAAATAAATAAAGCTTTGACCACGACGTGCCTGAGGACAAGATCTTTTTGCTGTCGCTGCTC
ATGGCAGAGATGGGCGTGCACTCAGTGGCCTATGCATTTCCGCGAGTGAGAATCATCACCACGGCGGTGGACAAGCGGGTCAATGACCTT
TTCCGCATCATCCCAGGCATTGCTCCACTACCTGAGGCTGCCCAAGGACATCAGCGATGACCACGTGATCCTCATGGACTGCACCGTGTC
CACGGGCGCGGCGGCCATGATGGCAGTGCGCGTGCTCCTGATCACCGGTGTGTCCATTCTGCGCGCCGGTGAAACCATGGAGCCCGCGCT
GCGCGCTGTGTGCAAAGACGTGCGCATCGGCACCATCCTCATCCAGACCAACCAGCTTACCGGGGAGCCCGAGGACTGCGTCGTACAGAC
CCCGCAGGGGCAGGACTATGCGGGCAAGTGCTATGCGGGGAAGCAGGGACAAGGAGACCAGTCGCGACGAGTTCATCTTCTACTCCAAGA
GACTGATGCGGCTGCTCATCGAGCACGCGCTCTCCTTCCTGCCCTTTCAGGGCTGCGCTGGCCTCGGCACACCAGTGCCACCCGCTGCCC

>6261_6261_8_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000354532_UCKL1_chr20_62572561_ENST00000369892_length(amino acids)=390AA_BP=
MEDFARGAASPGPSRPGLVPVSIIGAEDEDFENELETNSEEQNSQFQSLEQVKRRPAHLMALLQHVALQFEPGPLLCCLHADMLGSLGPK
EAKKAFLDFYHSFLEKTAVLRVPVPPNVAFELDRTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLMGMTPWEQELAQLEAWVG
RDRASYEARERHVAERLLMHLEEMQHTISTDEEKSAAVVNAIGLYMRHLGVRTKSGDKKSGRNFFRKKVMGNRRSDEPAKTKKGLSSILD
AARWNRGEPQVPDFRHLKAEVDAEKPGATDRKGGVGMPSRDRNIGAPGQDTPGVSLHPLSLDSPDREPGADAPLELGDSSPQGPMSLESL

--------------------------------------------------------------
>6261_6261_9_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000354532_UCKL1_chr20_62572561_ENST00000369908_length(transcript)=2045nt_BP=1269nt
GGGGGCCTCCGGAAAAACGCCCCGACTTCCTGCCCCGCCAGAGCCAGGAAGCGGGAGCCGGGACCCAGGGCCCGGGATCGCCGAGCCCGA
CCTCGGGCGCCCCGCCGGTCACCTCCGCGCGGACACCAGCCCTGCAGAGCCCAGGGAGATGGAAGACTTCGCCCGAGGGGCGGCCTCCCC
AGGCCCCTCCCGGCCTGGCCTGGTTCCCGTCAGCATCATCGGGGCTGAGGATGAGGATTTTGAGAACGAGCTGGAGACAAACTCAGAAGA
GCAAAACAGCCAGTTCCAGAGCCTGGAGCAGGTGAAGCGGCGCCCAGCCCACCTCATGGCCCTCCTGCAGCACGTGGCCCTGCAGTTTGA
GCCAGGACCCCTGCTTTGCTGTCTGCATGCCGACATGCTGGGCTCACTGGGCCCCAAGGAGGCCAAGAAGGCCTTCCTGGACTTCTACCA
CAGCTTCCTGGAGAAGACAGCGGTTCTCCGGGTGCCGGTCCCTCCCAACGTCGCCTTTGAACTTGACCGCACTAGGGCTGACCTCATCTC
CGAGGATGTCCAGCGGCGGTTCGTGCAGGAGGTGGTGCAAAGCCAGCAGGTAGCCGTGGGCCGGCAGCTGGAGGACTTCCGTTCCAAGCG
GCTCATGGGCATGACGCCCTGGGAGCAGGAGCTGGCCCAGCTGGAGGCTTGGGTTGGGCGGGACCGAGCCAGCTACGAGGCCCGGGAGCG
GCACGTGGCGGAGCGGCTGCTCATGCACCTGGAGGAGATGCAACATACCATCTCTACCGACGAAGAAAAGAGTGCTGCCGTGGTCAACGC
CATTGGCCTGTACATGCGCCACCTTGGGGTGCGGACCAAGAGTGGAGACAAGAAGTCGGGGAGGAACTTCTTCCGGAAAAAGGTGATGGG
GAACCGGCGGTCGGACGAGCCTGCCAAGACCAAGAAGGGGCTGAGCAGCATCCTGGATGCCGCCCGCTGGAACCGGGGAGAGCCCCAGGT
TCCAGATTTTCGACACCTCAAAGCAGAGGTTGATGCCGAGAAGCCAGGTGCTACAGACCGGAAGGGAGGCGTGGGGATGCCCTCTCGGGA
CCGGAATATCGGGGCTCCTGGGCAGGACACCCCTGGAGTCTCTCTGCACCCTCTGTCCCTGGACAGCCCAGACCGGGAACCAGGTGCTGA
CGCCCCCCTGGAGCTGGGGGACTCATCCCCGCAGGGCCCAATGAGCCTGGAGTCCTTGGCGCCCCCAGAGAGTACCGACGAGGGGGCCGA
AACCGAGAGGGCTGCGCTGGCCTCGGCACACCAGTGCCACCCGCTGCCCCGGACGCTGAGCGTCCTGAAGAGCACGCCGCAGGTACGGGG
CATGCACACCATCATCAGGGACAAGGAGACCAGTCGCGACGAGTTCATCTTCTACTCCAAGAGACTGATGCGGCTGCTCATCGAGCACGC
GCTCTCCTTCCTGCCCTTTCAGGACTGCGTCGTACAGACCCCGCAGGGGCAGGACTATGCGGGCAAGTGCTATGCGGGGAAGCAGATCAC
CGGTGTGTCCATTCTGCGCGCCGGTGAAACCATGGAGCCCGCGCTGCGCGCTGTGTGCAAAGACGTGCGCATCGGCACCATCCTCATCCA
GACCAACCAGCTTACCGGGGAGCCCGAGCTCCACTACCTGAGGCTGCCCAAGGACATCAGCGATGACCACGTGATCCTCATGGACTGCAC
CGTGTCCACGGGCGCGGCGGCCATGATGGCAGTGCGCGTGCTCCTGGACCACGACGTGCCTGAGGACAAGATCTTTTTGCTGTCGCTGCT
CATGGCAGAGATGGGCGTGCACTCAGTGGCCTATGCATTTCCGCGAGTGAGAATCATCACCACGGCGGTGGACAAGCGGGTCAATGACCT
TTTCCGCATCATCCCAGGCATTGGGAACTTTGGCGACCGCTACTTTGGGACAGACGCGGTCCCCGATGGCAGTGACGAGGAGGAAGTGGC

>6261_6261_9_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000354532_UCKL1_chr20_62572561_ENST00000369908_length(amino acids)=614AA_BP=373
MEDFARGAASPGPSRPGLVPVSIIGAEDEDFENELETNSEEQNSQFQSLEQVKRRPAHLMALLQHVALQFEPGPLLCCLHADMLGSLGPK
EAKKAFLDFYHSFLEKTAVLRVPVPPNVAFELDRTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLMGMTPWEQELAQLEAWVG
RDRASYEARERHVAERLLMHLEEMQHTISTDEEKSAAVVNAIGLYMRHLGVRTKSGDKKSGRNFFRKKVMGNRRSDEPAKTKKGLSSILD
AARWNRGEPQVPDFRHLKAEVDAEKPGATDRKGGVGMPSRDRNIGAPGQDTPGVSLHPLSLDSPDREPGADAPLELGDSSPQGPMSLESL
APPESTDEGAETERAALASAHQCHPLPRTLSVLKSTPQVRGMHTIIRDKETSRDEFIFYSKRLMRLLIEHALSFLPFQDCVVQTPQGQDY
AGKCYAGKQITGVSILRAGETMEPALRAVCKDVRIGTILIQTNQLTGEPELHYLRLPKDISDDHVILMDCTVSTGAAAMMAVRVLLDHDV

--------------------------------------------------------------
>6261_6261_10_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000378152_UCKL1_chr20_62572561_ENST00000354216_length(transcript)=2032nt_BP=1165nt
GGGCGCAAAGGTGGACTCAGGGCGGCTAGAGCGACGCGGCGGCAGGGGTGGGGAGAGTGCGGAGCCCGAGCGCGGAGGCTTCGGTTCCGG
TGGCGGCGATGGCTTCTCTTTCCACCTGGAGCAGCCCTGCAGAGCCCAGGGAGATGGAAGACTTCGCCCGAGGGGCGGCCTCCCCAGGCC
CCTCCCGGCCTGGCCTGGTTCCCGTCAGCATCATCGGGGCTGAGGATGAGGATTTTGAGAACGAGCTGGAGACAAACTCAGAAGAGCAAA
ACAGCCAGTTCCAGAGCCTGGAGCAGGTGAAGCGGCGCCCAGCCCACCTCATGGCCCTCCTGCAGCACGTGGCCCTGCAGTTTGAGCCAG
GACCCCTGGTTCTCCGGGTGCCGGTCCCTCCCAACGTCGCCTTTGAACTTGACCGCACTAGGGCTGACCTCATCTCCGAGGATGTCCAGC
GGCGGTTCGTGCAGGAGGTGGTGCAAAGCCAGCAGGTAGCCGTGGGCCGGCAGCTGGAGGACTTCCGTTCCAAGCGGCTCATGGGCATGA
CGCCCTGGGAGCAGGAGCTGGCCCAGCTGGAGGCTTGGGTTGGGCGGGACCGAGCCAGCTACGAGGCCCGGGAGCGGCACGTGGCGGAGC
GGCTGCTCATGCACCTGGAGGAGATGCAACATACCATCTCTACCGACGAAGAAAAGAGTGCTGCCGTGGTCAACGCCATTGGCCTGTACA
TGCGCCACCTTGGGGTGCGGACCAAGAGTGGAGACAAGAAGTCGGGGAGGAACTTCTTCCGGAAAAAGGTGATGGGGAACCGGCGGTCGG
ACGAGCCTGCCAAGACCAAGAAGGGGCTGAGCAGCATCCTGGATGCCGCCCGCTGGAACCGGGGAGAGCCCCAGGTTCCAGATTTTCGAC
ACCTCAAAGCAGAGGTTGATGCCGAGAAGCCAGGTGCTACAGACCGGAAGGGAGGCGTGGGGATGCCCTCTCGGGACCGGAATATCGGGG
CTCCTGGGCAGGACACCCCTGGAGTCTCTCTGCACCCTCTGTCCCTGGACAGCCCAGACCGGGAACCAGGTGCTGACGCCCCCCTGGAGC
TGGGGGACTCATCCCCGCAGGGCCCAATGAGCCTGGAGTCCTTGGCGCCCCCAGAGAGTACCGACGAGGGGGCCGAAACCGAGAGGGCTG
CGCTGGCCTCGGCACACCAGTGCCACCCGCTGCCCCGGACGCTGAGCGTCCTGAAGAGCACGCCGCAGGTACGGGGCATGCACACCATCA
TCAGGGACAAGGAGACCAGTCGCGACGAGTTCATCTTCTACTCCAAGAGACTGATGCGGCTGCTCATCGAGCACGCGCTCTCCTTCCTGC
CCTTTCAGGACTGCGTCGTACAGACCCCGCAGGGGCAGGACTATGCGGGCAAGTGCTATGCGGGGAAGCAGATCACCGGTGTGTCCATTC
TGCGCGCCGGTGAAACCATGGAGCCCGCGCTGCGCGCTGTGTGCAAAGACGTGCGCATCGGCACCATCCTCATCCAGACCAACCAGCTTA
CCGGGGAGCCCGAGCTCCACTACCTGAGGCTGCCCAAGGACATCAGCGATGACCACGTGATCCTCATGGACTGCACCGTGTCCACGGGCG
CGGCGGCCATGATGGCAGTGCGCGTGCTCCTGGACCACGACGTGCCTGAGGACAAGATCTTTTTGCTGTCGCTGCTCATGGCAGAGATGG
GCGTGCACTCAGTGGCCTATGCATTTCCGCGAGTGAGAATCATCACCACGGCGGTGGACAAGCGGGTCAATGACCTTTTCCGCATCATCC
CAGGCATTGGGAACTTTGGCGACCGCTACTTTGGGACAGACGCGGTCCCCGATGGCAGTGACGAGGAGGAAGTGGCCTACACGGGTTAGC
TGCCCAGTGAGCCATCCCGTCCCCACCACCCTCCTCCTGCCTCCTGACCCAGGACTGCTGAATACAAAGATGTTAATTTTTAAAATGTTA

>6261_6261_10_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000378152_UCKL1_chr20_62572561_ENST00000354216_length(amino acids)=596AA_BP=355
MASLSTWSSPAEPREMEDFARGAASPGPSRPGLVPVSIIGAEDEDFENELETNSEEQNSQFQSLEQVKRRPAHLMALLQHVALQFEPGPL
VLRVPVPPNVAFELDRTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLMGMTPWEQELAQLEAWVGRDRASYEARERHVAERLL
MHLEEMQHTISTDEEKSAAVVNAIGLYMRHLGVRTKSGDKKSGRNFFRKKVMGNRRSDEPAKTKKGLSSILDAARWNRGEPQVPDFRHLK
AEVDAEKPGATDRKGGVGMPSRDRNIGAPGQDTPGVSLHPLSLDSPDREPGADAPLELGDSSPQGPMSLESLAPPESTDEGAETERAALA
SAHQCHPLPRTLSVLKSTPQVRGMHTIIRDKETSRDEFIFYSKRLMRLLIEHALSFLPFQDCVVQTPQGQDYAGKCYAGKQITGVSILRA
GETMEPALRAVCKDVRIGTILIQTNQLTGEPELHYLRLPKDISDDHVILMDCTVSTGAAAMMAVRVLLDHDVPEDKIFLLSLLMAEMGVH

--------------------------------------------------------------
>6261_6261_11_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000378152_UCKL1_chr20_62572561_ENST00000369892_length(transcript)=2025nt_BP=1165nt
GGGCGCAAAGGTGGACTCAGGGCGGCTAGAGCGACGCGGCGGCAGGGGTGGGGAGAGTGCGGAGCCCGAGCGCGGAGGCTTCGGTTCCGG
TGGCGGCGATGGCTTCTCTTTCCACCTGGAGCAGCCCTGCAGAGCCCAGGGAGATGGAAGACTTCGCCCGAGGGGCGGCCTCCCCAGGCC
CCTCCCGGCCTGGCCTGGTTCCCGTCAGCATCATCGGGGCTGAGGATGAGGATTTTGAGAACGAGCTGGAGACAAACTCAGAAGAGCAAA
ACAGCCAGTTCCAGAGCCTGGAGCAGGTGAAGCGGCGCCCAGCCCACCTCATGGCCCTCCTGCAGCACGTGGCCCTGCAGTTTGAGCCAG
GACCCCTGGTTCTCCGGGTGCCGGTCCCTCCCAACGTCGCCTTTGAACTTGACCGCACTAGGGCTGACCTCATCTCCGAGGATGTCCAGC
GGCGGTTCGTGCAGGAGGTGGTGCAAAGCCAGCAGGTAGCCGTGGGCCGGCAGCTGGAGGACTTCCGTTCCAAGCGGCTCATGGGCATGA
CGCCCTGGGAGCAGGAGCTGGCCCAGCTGGAGGCTTGGGTTGGGCGGGACCGAGCCAGCTACGAGGCCCGGGAGCGGCACGTGGCGGAGC
GGCTGCTCATGCACCTGGAGGAGATGCAACATACCATCTCTACCGACGAAGAAAAGAGTGCTGCCGTGGTCAACGCCATTGGCCTGTACA
TGCGCCACCTTGGGGTGCGGACCAAGAGTGGAGACAAGAAGTCGGGGAGGAACTTCTTCCGGAAAAAGGTGATGGGGAACCGGCGGTCGG
ACGAGCCTGCCAAGACCAAGAAGGGGCTGAGCAGCATCCTGGATGCCGCCCGCTGGAACCGGGGAGAGCCCCAGGTTCCAGATTTTCGAC
ACCTCAAAGCAGAGGTTGATGCCGAGAAGCCAGGTGCTACAGACCGGAAGGGAGGCGTGGGGATGCCCTCTCGGGACCGGAATATCGGGG
CTCCTGGGCAGGACACCCCTGGAGTCTCTCTGCACCCTCTGTCCCTGGACAGCCCAGACCGGGAACCAGGTGCTGACGCCCCCCTGGAGC
TGGGGGACTCATCCCCGCAGGGCCCAATGAGCCTGGAGTCCTTGGCGCCCCCAGAGAGTACCGACGAGGGGGCCGAAACCGAGAGGGAAC
TTTGGCGACCGCTACTTTGGGACAGACGCGGTCCCCGATGGCAGTGACGAGGAGGAAGTGGCCTACACGGGTTAGCTGCCCAGTGAGCCA
TCCCGTCCCCACCACCCTCCTCCTGCCTCCTGACCCAGGACTGCTGAATACAAAGATGTTAATTTTTAAAATGTTACTAGTATAATTTAT
TCTATGCATTTTATAAAATAAATAAAGCTTTGACCACGACGTGCCTGAGGACAAGATCTTTTTGCTGTCGCTGCTCATGGCAGAGATGGG
CGTGCACTCAGTGGCCTATGCATTTCCGCGAGTGAGAATCATCACCACGGCGGTGGACAAGCGGGTCAATGACCTTTTCCGCATCATCCC
AGGCATTGCTCCACTACCTGAGGCTGCCCAAGGACATCAGCGATGACCACGTGATCCTCATGGACTGCACCGTGTCCACGGGCGCGGCGG
CCATGATGGCAGTGCGCGTGCTCCTGATCACCGGTGTGTCCATTCTGCGCGCCGGTGAAACCATGGAGCCCGCGCTGCGCGCTGTGTGCA
AAGACGTGCGCATCGGCACCATCCTCATCCAGACCAACCAGCTTACCGGGGAGCCCGAGGACTGCGTCGTACAGACCCCGCAGGGGCAGG
ACTATGCGGGCAAGTGCTATGCGGGGAAGCAGGGACAAGGAGACCAGTCGCGACGAGTTCATCTTCTACTCCAAGAGACTGATGCGGCTG
CTCATCGAGCACGCGCTCTCCTTCCTGCCCTTTCAGGGCTGCGCTGGCCTCGGCACACCAGTGCCACCCGCTGCCCCGGACGCTGAGCGT

>6261_6261_11_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000378152_UCKL1_chr20_62572561_ENST00000369892_length(amino acids)=372AA_BP=
MASLSTWSSPAEPREMEDFARGAASPGPSRPGLVPVSIIGAEDEDFENELETNSEEQNSQFQSLEQVKRRPAHLMALLQHVALQFEPGPL
VLRVPVPPNVAFELDRTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLMGMTPWEQELAQLEAWVGRDRASYEARERHVAERLL
MHLEEMQHTISTDEEKSAAVVNAIGLYMRHLGVRTKSGDKKSGRNFFRKKVMGNRRSDEPAKTKKGLSSILDAARWNRGEPQVPDFRHLK
AEVDAEKPGATDRKGGVGMPSRDRNIGAPGQDTPGVSLHPLSLDSPDREPGADAPLELGDSSPQGPMSLESLAPPESTDEGAETERELWR

--------------------------------------------------------------
>6261_6261_12_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000378152_UCKL1_chr20_62572561_ENST00000369908_length(transcript)=1941nt_BP=1165nt
GGGCGCAAAGGTGGACTCAGGGCGGCTAGAGCGACGCGGCGGCAGGGGTGGGGAGAGTGCGGAGCCCGAGCGCGGAGGCTTCGGTTCCGG
TGGCGGCGATGGCTTCTCTTTCCACCTGGAGCAGCCCTGCAGAGCCCAGGGAGATGGAAGACTTCGCCCGAGGGGCGGCCTCCCCAGGCC
CCTCCCGGCCTGGCCTGGTTCCCGTCAGCATCATCGGGGCTGAGGATGAGGATTTTGAGAACGAGCTGGAGACAAACTCAGAAGAGCAAA
ACAGCCAGTTCCAGAGCCTGGAGCAGGTGAAGCGGCGCCCAGCCCACCTCATGGCCCTCCTGCAGCACGTGGCCCTGCAGTTTGAGCCAG
GACCCCTGGTTCTCCGGGTGCCGGTCCCTCCCAACGTCGCCTTTGAACTTGACCGCACTAGGGCTGACCTCATCTCCGAGGATGTCCAGC
GGCGGTTCGTGCAGGAGGTGGTGCAAAGCCAGCAGGTAGCCGTGGGCCGGCAGCTGGAGGACTTCCGTTCCAAGCGGCTCATGGGCATGA
CGCCCTGGGAGCAGGAGCTGGCCCAGCTGGAGGCTTGGGTTGGGCGGGACCGAGCCAGCTACGAGGCCCGGGAGCGGCACGTGGCGGAGC
GGCTGCTCATGCACCTGGAGGAGATGCAACATACCATCTCTACCGACGAAGAAAAGAGTGCTGCCGTGGTCAACGCCATTGGCCTGTACA
TGCGCCACCTTGGGGTGCGGACCAAGAGTGGAGACAAGAAGTCGGGGAGGAACTTCTTCCGGAAAAAGGTGATGGGGAACCGGCGGTCGG
ACGAGCCTGCCAAGACCAAGAAGGGGCTGAGCAGCATCCTGGATGCCGCCCGCTGGAACCGGGGAGAGCCCCAGGTTCCAGATTTTCGAC
ACCTCAAAGCAGAGGTTGATGCCGAGAAGCCAGGTGCTACAGACCGGAAGGGAGGCGTGGGGATGCCCTCTCGGGACCGGAATATCGGGG
CTCCTGGGCAGGACACCCCTGGAGTCTCTCTGCACCCTCTGTCCCTGGACAGCCCAGACCGGGAACCAGGTGCTGACGCCCCCCTGGAGC
TGGGGGACTCATCCCCGCAGGGCCCAATGAGCCTGGAGTCCTTGGCGCCCCCAGAGAGTACCGACGAGGGGGCCGAAACCGAGAGGGCTG
CGCTGGCCTCGGCACACCAGTGCCACCCGCTGCCCCGGACGCTGAGCGTCCTGAAGAGCACGCCGCAGGTACGGGGCATGCACACCATCA
TCAGGGACAAGGAGACCAGTCGCGACGAGTTCATCTTCTACTCCAAGAGACTGATGCGGCTGCTCATCGAGCACGCGCTCTCCTTCCTGC
CCTTTCAGGACTGCGTCGTACAGACCCCGCAGGGGCAGGACTATGCGGGCAAGTGCTATGCGGGGAAGCAGATCACCGGTGTGTCCATTC
TGCGCGCCGGTGAAACCATGGAGCCCGCGCTGCGCGCTGTGTGCAAAGACGTGCGCATCGGCACCATCCTCATCCAGACCAACCAGCTTA
CCGGGGAGCCCGAGCTCCACTACCTGAGGCTGCCCAAGGACATCAGCGATGACCACGTGATCCTCATGGACTGCACCGTGTCCACGGGCG
CGGCGGCCATGATGGCAGTGCGCGTGCTCCTGGACCACGACGTGCCTGAGGACAAGATCTTTTTGCTGTCGCTGCTCATGGCAGAGATGG
GCGTGCACTCAGTGGCCTATGCATTTCCGCGAGTGAGAATCATCACCACGGCGGTGGACAAGCGGGTCAATGACCTTTTCCGCATCATCC
CAGGCATTGGGAACTTTGGCGACCGCTACTTTGGGACAGACGCGGTCCCCGATGGCAGTGACGAGGAGGAAGTGGCCTACACGGGTTAGC

>6261_6261_12_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000378152_UCKL1_chr20_62572561_ENST00000369908_length(amino acids)=596AA_BP=355
MASLSTWSSPAEPREMEDFARGAASPGPSRPGLVPVSIIGAEDEDFENELETNSEEQNSQFQSLEQVKRRPAHLMALLQHVALQFEPGPL
VLRVPVPPNVAFELDRTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLMGMTPWEQELAQLEAWVGRDRASYEARERHVAERLL
MHLEEMQHTISTDEEKSAAVVNAIGLYMRHLGVRTKSGDKKSGRNFFRKKVMGNRRSDEPAKTKKGLSSILDAARWNRGEPQVPDFRHLK
AEVDAEKPGATDRKGGVGMPSRDRNIGAPGQDTPGVSLHPLSLDSPDREPGADAPLELGDSSPQGPMSLESLAPPESTDEGAETERAALA
SAHQCHPLPRTLSVLKSTPQVRGMHTIIRDKETSRDEFIFYSKRLMRLLIEHALSFLPFQDCVVQTPQGQDYAGKCYAGKQITGVSILRA
GETMEPALRAVCKDVRIGTILIQTNQLTGEPELHYLRLPKDISDDHVILMDCTVSTGAAAMMAVRVLLDHDVPEDKIFLLSLLMAEMGVH

--------------------------------------------------------------
>6261_6261_13_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000599846_UCKL1_chr20_62572561_ENST00000354216_length(transcript)=2113nt_BP=1246nt
GACTTCCTGCCCCGCCAGAGCCAGGAAGCGGGAGCCGGGACCCAGGGCCCGGGATCGCCGAGCCCGACCTCGGGCGCCCCGCCGGTCACC
TCCGCGCGGACACCAGCCCTGCAGAGCCCAGGGAGATGGAAGACTTCGCCCGAGGGGCGGCCTCCCCAGGCCCCTCCCGGCCTGGCCTGG
TTCCCGTCAGCATCATCGGGGCTGAGGATGAGGATTTTGAGAACGAGCTGGAGACAAACTCAGAAGAGCAAAACAGCCAGTTCCAGAGCC
TGGAGCAGGTGAAGCGGCGCCCAGCCCACCTCATGGCCCTCCTGCAGCACGTGGCCCTGCAGTTTGAGCCAGGACCCCTGCTTTGCTGTC
TGCATGCCGACATGCTGGGCTCACTGGGCCCCAAGGAGGCCAAGAAGGCCTTCCTGGACTTCTACCACAGCTTCCTGGAGAAGACAGCGG
TTCTCCGGGTGCCGGTCCCTCCCAACGTCGCCTTTGAACTTGACCGCACTAGGGCTGACCTCATCTCCGAGGATGTCCAGCGGCGGTTCG
TGCAGGAGGTGGTGCAAAGCCAGCAGGTAGCCGTGGGCCGGCAGCTGGAGGACTTCCGTTCCAAGCGGCTCATGGGCATGACGCCCTGGG
AGCAGGAGCTGGCCCAGCTGGAGGCTTGGGTTGGGCGGGACCGAGCCAGCTACGAGGCCCGGGAGCGGCACGTGGCGGAGCGGCTGCTCA
TGCACCTGGAGGAGATGCAACATACCATCTCTACCGACGAAGAAAAGAGTGCTGCCGTGGTCAACGCCATTGGCCTGTACATGCGCCACC
TTGGGGTGCGGACCAAGAGTGGAGACAAGAAGTCGGGGAGGAACTTCTTCCGGAAAAAGGTGATGGGGAACCGGCGGTCGGACGAGCCTG
CCAAGACCAAGAAGGGGCTGAGCAGCATCCTGGATGCCGCCCGCTGGAACCGGGGAGAGCCCCAGGTTCCAGATTTTCGACACCTCAAAG
CAGAGGTTGATGCCGAGAAGCCAGGTGCTACAGACCGGAAGGGAGGCGTGGGGATGCCCTCTCGGGACCGGAATATCGGGGCTCCTGGGC
AGGACACCCCTGGAGTCTCTCTGCACCCTCTGTCCCTGGACAGCCCAGACCGGGAACCAGGTGCTGACGCCCCCCTGGAGCTGGGGGACT
CATCCCCGCAGGGCCCAATGAGCCTGGAGTCCTTGGCGCCCCCAGAGAGTACCGACGAGGGGGCCGAAACCGAGAGGGCTGCGCTGGCCT
CGGCACACCAGTGCCACCCGCTGCCCCGGACGCTGAGCGTCCTGAAGAGCACGCCGCAGGTACGGGGCATGCACACCATCATCAGGGACA
AGGAGACCAGTCGCGACGAGTTCATCTTCTACTCCAAGAGACTGATGCGGCTGCTCATCGAGCACGCGCTCTCCTTCCTGCCCTTTCAGG
ACTGCGTCGTACAGACCCCGCAGGGGCAGGACTATGCGGGCAAGTGCTATGCGGGGAAGCAGATCACCGGTGTGTCCATTCTGCGCGCCG
GTGAAACCATGGAGCCCGCGCTGCGCGCTGTGTGCAAAGACGTGCGCATCGGCACCATCCTCATCCAGACCAACCAGCTTACCGGGGAGC
CCGAGCTCCACTACCTGAGGCTGCCCAAGGACATCAGCGATGACCACGTGATCCTCATGGACTGCACCGTGTCCACGGGCGCGGCGGCCA
TGATGGCAGTGCGCGTGCTCCTGGACCACGACGTGCCTGAGGACAAGATCTTTTTGCTGTCGCTGCTCATGGCAGAGATGGGCGTGCACT
CAGTGGCCTATGCATTTCCGCGAGTGAGAATCATCACCACGGCGGTGGACAAGCGGGTCAATGACCTTTTCCGCATCATCCCAGGCATTG
GGAACTTTGGCGACCGCTACTTTGGGACAGACGCGGTCCCCGATGGCAGTGACGAGGAGGAAGTGGCCTACACGGGTTAGCTGCCCAGTG
AGCCATCCCGTCCCCACCACCCTCCTCCTGCCTCCTGACCCAGGACTGCTGAATACAAAGATGTTAATTTTTAAAATGTTACTAGTATAA

>6261_6261_13_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000599846_UCKL1_chr20_62572561_ENST00000354216_length(amino acids)=614AA_BP=373
MEDFARGAASPGPSRPGLVPVSIIGAEDEDFENELETNSEEQNSQFQSLEQVKRRPAHLMALLQHVALQFEPGPLLCCLHADMLGSLGPK
EAKKAFLDFYHSFLEKTAVLRVPVPPNVAFELDRTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLMGMTPWEQELAQLEAWVG
RDRASYEARERHVAERLLMHLEEMQHTISTDEEKSAAVVNAIGLYMRHLGVRTKSGDKKSGRNFFRKKVMGNRRSDEPAKTKKGLSSILD
AARWNRGEPQVPDFRHLKAEVDAEKPGATDRKGGVGMPSRDRNIGAPGQDTPGVSLHPLSLDSPDREPGADAPLELGDSSPQGPMSLESL
APPESTDEGAETERAALASAHQCHPLPRTLSVLKSTPQVRGMHTIIRDKETSRDEFIFYSKRLMRLLIEHALSFLPFQDCVVQTPQGQDY
AGKCYAGKQITGVSILRAGETMEPALRAVCKDVRIGTILIQTNQLTGEPELHYLRLPKDISDDHVILMDCTVSTGAAAMMAVRVLLDHDV

--------------------------------------------------------------
>6261_6261_14_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000599846_UCKL1_chr20_62572561_ENST00000369892_length(transcript)=2106nt_BP=1246nt
GACTTCCTGCCCCGCCAGAGCCAGGAAGCGGGAGCCGGGACCCAGGGCCCGGGATCGCCGAGCCCGACCTCGGGCGCCCCGCCGGTCACC
TCCGCGCGGACACCAGCCCTGCAGAGCCCAGGGAGATGGAAGACTTCGCCCGAGGGGCGGCCTCCCCAGGCCCCTCCCGGCCTGGCCTGG
TTCCCGTCAGCATCATCGGGGCTGAGGATGAGGATTTTGAGAACGAGCTGGAGACAAACTCAGAAGAGCAAAACAGCCAGTTCCAGAGCC
TGGAGCAGGTGAAGCGGCGCCCAGCCCACCTCATGGCCCTCCTGCAGCACGTGGCCCTGCAGTTTGAGCCAGGACCCCTGCTTTGCTGTC
TGCATGCCGACATGCTGGGCTCACTGGGCCCCAAGGAGGCCAAGAAGGCCTTCCTGGACTTCTACCACAGCTTCCTGGAGAAGACAGCGG
TTCTCCGGGTGCCGGTCCCTCCCAACGTCGCCTTTGAACTTGACCGCACTAGGGCTGACCTCATCTCCGAGGATGTCCAGCGGCGGTTCG
TGCAGGAGGTGGTGCAAAGCCAGCAGGTAGCCGTGGGCCGGCAGCTGGAGGACTTCCGTTCCAAGCGGCTCATGGGCATGACGCCCTGGG
AGCAGGAGCTGGCCCAGCTGGAGGCTTGGGTTGGGCGGGACCGAGCCAGCTACGAGGCCCGGGAGCGGCACGTGGCGGAGCGGCTGCTCA
TGCACCTGGAGGAGATGCAACATACCATCTCTACCGACGAAGAAAAGAGTGCTGCCGTGGTCAACGCCATTGGCCTGTACATGCGCCACC
TTGGGGTGCGGACCAAGAGTGGAGACAAGAAGTCGGGGAGGAACTTCTTCCGGAAAAAGGTGATGGGGAACCGGCGGTCGGACGAGCCTG
CCAAGACCAAGAAGGGGCTGAGCAGCATCCTGGATGCCGCCCGCTGGAACCGGGGAGAGCCCCAGGTTCCAGATTTTCGACACCTCAAAG
CAGAGGTTGATGCCGAGAAGCCAGGTGCTACAGACCGGAAGGGAGGCGTGGGGATGCCCTCTCGGGACCGGAATATCGGGGCTCCTGGGC
AGGACACCCCTGGAGTCTCTCTGCACCCTCTGTCCCTGGACAGCCCAGACCGGGAACCAGGTGCTGACGCCCCCCTGGAGCTGGGGGACT
CATCCCCGCAGGGCCCAATGAGCCTGGAGTCCTTGGCGCCCCCAGAGAGTACCGACGAGGGGGCCGAAACCGAGAGGGAACTTTGGCGAC
CGCTACTTTGGGACAGACGCGGTCCCCGATGGCAGTGACGAGGAGGAAGTGGCCTACACGGGTTAGCTGCCCAGTGAGCCATCCCGTCCC
CACCACCCTCCTCCTGCCTCCTGACCCAGGACTGCTGAATACAAAGATGTTAATTTTTAAAATGTTACTAGTATAATTTATTCTATGCAT
TTTATAAAATAAATAAAGCTTTGACCACGACGTGCCTGAGGACAAGATCTTTTTGCTGTCGCTGCTCATGGCAGAGATGGGCGTGCACTC
AGTGGCCTATGCATTTCCGCGAGTGAGAATCATCACCACGGCGGTGGACAAGCGGGTCAATGACCTTTTCCGCATCATCCCAGGCATTGC
TCCACTACCTGAGGCTGCCCAAGGACATCAGCGATGACCACGTGATCCTCATGGACTGCACCGTGTCCACGGGCGCGGCGGCCATGATGG
CAGTGCGCGTGCTCCTGATCACCGGTGTGTCCATTCTGCGCGCCGGTGAAACCATGGAGCCCGCGCTGCGCGCTGTGTGCAAAGACGTGC
GCATCGGCACCATCCTCATCCAGACCAACCAGCTTACCGGGGAGCCCGAGGACTGCGTCGTACAGACCCCGCAGGGGCAGGACTATGCGG
GCAAGTGCTATGCGGGGAAGCAGGGACAAGGAGACCAGTCGCGACGAGTTCATCTTCTACTCCAAGAGACTGATGCGGCTGCTCATCGAG
CACGCGCTCTCCTTCCTGCCCTTTCAGGGCTGCGCTGGCCTCGGCACACCAGTGCCACCCGCTGCCCCGGACGCTGAGCGTCCTGAAGAG

>6261_6261_14_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000599846_UCKL1_chr20_62572561_ENST00000369892_length(amino acids)=390AA_BP=
MEDFARGAASPGPSRPGLVPVSIIGAEDEDFENELETNSEEQNSQFQSLEQVKRRPAHLMALLQHVALQFEPGPLLCCLHADMLGSLGPK
EAKKAFLDFYHSFLEKTAVLRVPVPPNVAFELDRTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLMGMTPWEQELAQLEAWVG
RDRASYEARERHVAERLLMHLEEMQHTISTDEEKSAAVVNAIGLYMRHLGVRTKSGDKKSGRNFFRKKVMGNRRSDEPAKTKKGLSSILD
AARWNRGEPQVPDFRHLKAEVDAEKPGATDRKGGVGMPSRDRNIGAPGQDTPGVSLHPLSLDSPDREPGADAPLELGDSSPQGPMSLESL

--------------------------------------------------------------
>6261_6261_15_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000599846_UCKL1_chr20_62572561_ENST00000369908_length(transcript)=2022nt_BP=1246nt
GACTTCCTGCCCCGCCAGAGCCAGGAAGCGGGAGCCGGGACCCAGGGCCCGGGATCGCCGAGCCCGACCTCGGGCGCCCCGCCGGTCACC
TCCGCGCGGACACCAGCCCTGCAGAGCCCAGGGAGATGGAAGACTTCGCCCGAGGGGCGGCCTCCCCAGGCCCCTCCCGGCCTGGCCTGG
TTCCCGTCAGCATCATCGGGGCTGAGGATGAGGATTTTGAGAACGAGCTGGAGACAAACTCAGAAGAGCAAAACAGCCAGTTCCAGAGCC
TGGAGCAGGTGAAGCGGCGCCCAGCCCACCTCATGGCCCTCCTGCAGCACGTGGCCCTGCAGTTTGAGCCAGGACCCCTGCTTTGCTGTC
TGCATGCCGACATGCTGGGCTCACTGGGCCCCAAGGAGGCCAAGAAGGCCTTCCTGGACTTCTACCACAGCTTCCTGGAGAAGACAGCGG
TTCTCCGGGTGCCGGTCCCTCCCAACGTCGCCTTTGAACTTGACCGCACTAGGGCTGACCTCATCTCCGAGGATGTCCAGCGGCGGTTCG
TGCAGGAGGTGGTGCAAAGCCAGCAGGTAGCCGTGGGCCGGCAGCTGGAGGACTTCCGTTCCAAGCGGCTCATGGGCATGACGCCCTGGG
AGCAGGAGCTGGCCCAGCTGGAGGCTTGGGTTGGGCGGGACCGAGCCAGCTACGAGGCCCGGGAGCGGCACGTGGCGGAGCGGCTGCTCA
TGCACCTGGAGGAGATGCAACATACCATCTCTACCGACGAAGAAAAGAGTGCTGCCGTGGTCAACGCCATTGGCCTGTACATGCGCCACC
TTGGGGTGCGGACCAAGAGTGGAGACAAGAAGTCGGGGAGGAACTTCTTCCGGAAAAAGGTGATGGGGAACCGGCGGTCGGACGAGCCTG
CCAAGACCAAGAAGGGGCTGAGCAGCATCCTGGATGCCGCCCGCTGGAACCGGGGAGAGCCCCAGGTTCCAGATTTTCGACACCTCAAAG
CAGAGGTTGATGCCGAGAAGCCAGGTGCTACAGACCGGAAGGGAGGCGTGGGGATGCCCTCTCGGGACCGGAATATCGGGGCTCCTGGGC
AGGACACCCCTGGAGTCTCTCTGCACCCTCTGTCCCTGGACAGCCCAGACCGGGAACCAGGTGCTGACGCCCCCCTGGAGCTGGGGGACT
CATCCCCGCAGGGCCCAATGAGCCTGGAGTCCTTGGCGCCCCCAGAGAGTACCGACGAGGGGGCCGAAACCGAGAGGGCTGCGCTGGCCT
CGGCACACCAGTGCCACCCGCTGCCCCGGACGCTGAGCGTCCTGAAGAGCACGCCGCAGGTACGGGGCATGCACACCATCATCAGGGACA
AGGAGACCAGTCGCGACGAGTTCATCTTCTACTCCAAGAGACTGATGCGGCTGCTCATCGAGCACGCGCTCTCCTTCCTGCCCTTTCAGG
ACTGCGTCGTACAGACCCCGCAGGGGCAGGACTATGCGGGCAAGTGCTATGCGGGGAAGCAGATCACCGGTGTGTCCATTCTGCGCGCCG
GTGAAACCATGGAGCCCGCGCTGCGCGCTGTGTGCAAAGACGTGCGCATCGGCACCATCCTCATCCAGACCAACCAGCTTACCGGGGAGC
CCGAGCTCCACTACCTGAGGCTGCCCAAGGACATCAGCGATGACCACGTGATCCTCATGGACTGCACCGTGTCCACGGGCGCGGCGGCCA
TGATGGCAGTGCGCGTGCTCCTGGACCACGACGTGCCTGAGGACAAGATCTTTTTGCTGTCGCTGCTCATGGCAGAGATGGGCGTGCACT
CAGTGGCCTATGCATTTCCGCGAGTGAGAATCATCACCACGGCGGTGGACAAGCGGGTCAATGACCTTTTCCGCATCATCCCAGGCATTG
GGAACTTTGGCGACCGCTACTTTGGGACAGACGCGGTCCCCGATGGCAGTGACGAGGAGGAAGTGGCCTACACGGGTTAGCTGCCCAGTG

>6261_6261_15_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000599846_UCKL1_chr20_62572561_ENST00000369908_length(amino acids)=614AA_BP=373
MEDFARGAASPGPSRPGLVPVSIIGAEDEDFENELETNSEEQNSQFQSLEQVKRRPAHLMALLQHVALQFEPGPLLCCLHADMLGSLGPK
EAKKAFLDFYHSFLEKTAVLRVPVPPNVAFELDRTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLMGMTPWEQELAQLEAWVG
RDRASYEARERHVAERLLMHLEEMQHTISTDEEKSAAVVNAIGLYMRHLGVRTKSGDKKSGRNFFRKKVMGNRRSDEPAKTKKGLSSILD
AARWNRGEPQVPDFRHLKAEVDAEKPGATDRKGGVGMPSRDRNIGAPGQDTPGVSLHPLSLDSPDREPGADAPLELGDSSPQGPMSLESL
APPESTDEGAETERAALASAHQCHPLPRTLSVLKSTPQVRGMHTIIRDKETSRDEFIFYSKRLMRLLIEHALSFLPFQDCVVQTPQGQDY
AGKCYAGKQITGVSILRAGETMEPALRAVCKDVRIGTILIQTNQLTGEPELHYLRLPKDISDDHVILMDCTVSTGAAAMMAVRVLLDHDV

--------------------------------------------------------------
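
The sequence headers above follow a regular underscore-delimited pattern: a numeric prefix, a record index, the fusion name, then gene, chromosome, breakpoint and Ensembl transcript for each partner, followed by the sequence length and a `BP=` field. The `BP=` field is empty on some amino-acid records. The following is a minimal Python sketch for splitting such a header into fields. The pattern is inferred only from the headers on this page, and the function name `parse_header` and the field names are hypothetical, not part of FusionGDB2. It assumes gene symbols contain no underscores.

```python
import re

# Pattern inferred from the headers shown on this page (an assumption,
# not a documented FusionGDB2 format). Gene symbols are assumed to
# contain no underscores.
HEADER_RE = re.compile(
    r"^>(?P<prefix>\d+_\d+)_(?P<index>\d+)_"
    r"(?P<fusion>[^_]+)_"
    r"(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)_"
    r"length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?:nt|AA)_"
    r"BP=(?P<bp>\d*)(?:nt)?$"
)


def parse_header(line):
    """Split one header line into a dict (hypothetical helper).

    Numeric fields are converted to int. An empty BP= field is
    returned as None rather than guessed.
    """
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognized header: " + line)
    record = match.groupdict()
    for key in ("index", "hbp", "tbp", "length"):
        record[key] = int(record[key])
    record["bp"] = int(record["bp"]) if record["bp"] else None
    return record


if __name__ == "__main__":
    example = (
        ">6261_6261_6_ARHGEF1-UCKL1_ARHGEF1_chr19_42400555_ENST00000347545_"
        "UCKL1_chr20_62572561_ENST00000369908_length(transcript)=1882nt_BP=1106nt"
    )
    print(parse_header(example))
```

Returning `None` for an empty `BP=` field keeps the missing value visible instead of silently substituting a number.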


Fusion Gene PPI Analysis for ARHGEF1-UCKL1


Go to the ChiPPI (Chimeric Protein-Protein Interactions) page to see the chimeric protein-protein interactions for this fusion.


Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene | Hgene's interactors | Tgene | Tgene's interactors


- Retained PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with


- Lost PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with


- Retained PPIs, but lost function due to frame-shift fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with



Related Drugs for ARHGEF1-UCKL1


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status


Related Diseases for ARHGEF1-UCKL1


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source