FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:ARHGAP35-ESR1 (FusionGDB2 ID:6088)

Fusion Gene Summary for ARHGAP35-ESR1

check button Fusion gene summary
Fusion gene informationFusion gene name: ARHGAP35-ESR1
Fusion gene ID: 6088
HgeneTgene
Gene symbol

ARHGAP35

ESR1

Gene ID

2909

2099

Gene nameRho GTPase activating protein 35estrogen receptor 1
SynonymsGRF-1|GRLF1|P190-A|P190A|p190ARhoGAP|p190RhoGAPER|ESR|ESRA|ESTRR|Era|NR3A1
Cytomap

19q13.32

6q25.1-q25.2

Type of geneprotein-codingprotein-coding
Descriptionrho GTPase-activating protein 35glucocorticoid receptor DNA-binding factor 1glucocorticoid receptor repression factor 1rho GAP p190Aestrogen receptorE2 receptor alphaER-alphaestradiol receptorestrogen nuclear receptor alphaestrogen receptor alpha E1-E2-1-2estrogen receptor alpha E1-N2-E2-1-2nuclear receptor subfamily 3 group A member 1oestrogen receptor alpha
Modification date2020031320200329
UniProtAcc

Q9NRY4

P03372

Ensembl transtripts involved in fusion geneENST00000404338, ENST00000598548, 
ENST00000406599, ENST00000427531, 
ENST00000482101, ENST00000206249, 
ENST00000338799, ENST00000440973, 
ENST00000443427, ENST00000456483, 
Fusion gene scores* DoF score19 X 11 X 9=188110 X 10 X 6=600
# samples 1912
** MAII scorelog2(19/1881*10)=-3.30742852519225
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(12/600*10)=-2.32192809488736
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: ARHGAP35 [Title/Abstract] AND ESR1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointARHGAP35(47440665)-ESR1(152201789), # samples:1
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
TgeneESR1

GO:0006366

transcription by RNA polymerase II

15831516

TgeneESR1

GO:0010629

negative regulation of gene expression

21695196

TgeneESR1

GO:0030520

intracellular estrogen receptor signaling pathway

9841876

TgeneESR1

GO:0032355

response to estradiol

15304487

TgeneESR1

GO:0043124

negative regulation of I-kappaB kinase/NF-kappaB signaling

7651415|16043358

TgeneESR1

GO:0043433

negative regulation of DNA-binding transcription factor activity

10816575

TgeneESR1

GO:0043627

response to estrogen

11581164

TgeneESR1

GO:0045893

positive regulation of transcription, DNA-templated

9841876|20074560

TgeneESR1

GO:0045899

positive regulation of RNA polymerase II transcriptional preinitiation complex assembly

9841876

TgeneESR1

GO:0045944

positive regulation of transcription by RNA polymerase II

11544182|12047722|15345745|15831516|18563714

TgeneESR1

GO:0051091

positive regulation of DNA-binding transcription factor activity

9328340|10681512

TgeneESR1

GO:0071392

cellular response to estradiol stimulus

15831516


check buttonFusion gene breakpoints across ARHGAP35 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across ESR1 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4BRCATCGA-A8-A06OARHGAP35chr19

47440665

+ESR1chr6

152201789

+


Top

Fusion Gene ORF analysis for ARHGAP35-ESR1

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000404338ENST00000406599ARHGAP35chr19

47440665

+ESR1chr6

152201789

+
5CDS-intronENST00000404338ENST00000427531ARHGAP35chr19

47440665

+ESR1chr6

152201789

+
5CDS-intronENST00000404338ENST00000482101ARHGAP35chr19

47440665

+ESR1chr6

152201789

+
In-frameENST00000404338ENST00000206249ARHGAP35chr19

47440665

+ESR1chr6

152201789

+
In-frameENST00000404338ENST00000338799ARHGAP35chr19

47440665

+ESR1chr6

152201789

+
In-frameENST00000404338ENST00000440973ARHGAP35chr19

47440665

+ESR1chr6

152201789

+
In-frameENST00000404338ENST00000443427ARHGAP35chr19

47440665

+ESR1chr6

152201789

+
In-frameENST00000404338ENST00000456483ARHGAP35chr19

47440665

+ESR1chr6

152201789

+
intron-3CDSENST00000598548ENST00000206249ARHGAP35chr19

47440665

+ESR1chr6

152201789

+
intron-3CDSENST00000598548ENST00000338799ARHGAP35chr19

47440665

+ESR1chr6

152201789

+
intron-3CDSENST00000598548ENST00000440973ARHGAP35chr19

47440665

+ESR1chr6

152201789

+
intron-3CDSENST00000598548ENST00000443427ARHGAP35chr19

47440665

+ESR1chr6

152201789

+
intron-3CDSENST00000598548ENST00000456483ARHGAP35chr19

47440665

+ESR1chr6

152201789

+
intron-intronENST00000598548ENST00000406599ARHGAP35chr19

47440665

+ESR1chr6

152201789

+
intron-intronENST00000598548ENST00000427531ARHGAP35chr19

47440665

+ESR1chr6

152201789

+
intron-intronENST00000598548ENST00000482101ARHGAP35chr19

47440665

+ESR1chr6

152201789

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000404338ARHGAP35chr1947440665+ENST00000440973ESR1chr6152201789+92793826049701656
ENST00000404338ARHGAP35chr1947440665+ENST00000338799ESR1chr6152201789+50583826049701656
ENST00000404338ARHGAP35chr1947440665+ENST00000456483ESR1chr6152201789+89433826046341544
ENST00000404338ARHGAP35chr1947440665+ENST00000443427ESR1chr6152201789+92793826049701656
ENST00000404338ARHGAP35chr1947440665+ENST00000206249ESR1chr6152201789+92763826049701656

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000404338ENST00000440973ARHGAP35chr1947440665+ESR1chr6152201789+0.0002419560.99975806
ENST00000404338ENST00000338799ARHGAP35chr1947440665+ESR1chr6152201789+0.0008355080.99916446
ENST00000404338ENST00000456483ARHGAP35chr1947440665+ESR1chr6152201789+0.0001387920.99986124
ENST00000404338ENST00000443427ARHGAP35chr1947440665+ESR1chr6152201789+0.0002419560.99975806
ENST00000404338ENST00000206249ARHGAP35chr1947440665+ESR1chr6152201789+0.0002429330.9997571

Top

Fusion Genomic Features for ARHGAP35-ESR1


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
ARHGAP35chr1947440665+ESR1chr6152201789+9.20E-080.9999999
ARHGAP35chr1947440665+ESR1chr6152201789+9.20E-080.9999999

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.
genomic feature of top 1%

Top

Fusion Protein Features for ARHGAP35-ESR1


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr19:47440665/chr6:152201789)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
ARHGAP35

Q9NRY4

ESR1

P03372

FUNCTION: Rho GTPase-activating protein (GAP) (PubMed:19673492, PubMed:28894085). Binds several acidic phospholipids which inhibits the Rho GAP activity to promote the Rac GAP activity (PubMed:19673492). This binding is inhibited by phosphorylation by PRKCA (PubMed:19673492). Involved in cell differentiation as well as cell adhesion and migration, plays an important role in retinal tissue morphogenesis, neural tube fusion, midline fusion of the cerebral hemispheres and mammary gland branching morphogenesis (By similarity). Transduces signals from p21-ras to the nucleus, acting via the ras GTPase-activating protein (GAP) (By similarity). Transduces SRC-dependent signals from cell-surface adhesion molecules, such as laminin, to promote neurite outgrowth. Regulates axon outgrowth, guidance and fasciculation (By similarity). Modulates Rho GTPase-dependent F-actin polymerization, organization and assembly, is involved in polarized cell migration and in the positive regulation of ciliogenesis and cilia elongation (By similarity). During mammary gland development, is required in both the epithelial and stromal compartments for ductal outgrowth (By similarity). Represses transcription of the glucocorticoid receptor by binding to the cis-acting regulatory sequence 5'-GAGAAAAGAAACTGGAGAAACTC-3'; this function is however unclear and would need additional experimental evidences (PubMed:1894621). {ECO:0000250|UniProtKB:P81128, ECO:0000250|UniProtKB:Q91YM2, ECO:0000269|PubMed:1894621, ECO:0000269|PubMed:19673492, ECO:0000269|PubMed:28894085}.FUNCTION: Nuclear hormone receptor. The steroid hormones and their receptors are involved in the regulation of eukaryotic gene expression and affect cellular proliferation and differentiation in target tissues. Ligand-dependent nuclear transactivation involves either direct homodimer binding to a palindromic estrogen response element (ERE) sequence or association with other DNA-binding transcription factors, such as AP-1/c-Jun, c-Fos, ATF-2, Sp1 and Sp3, to mediate ERE-independent signaling. Ligand binding induces a conformational change allowing subsequent or combinatorial association with multiprotein coactivator complexes through LXXLL motifs of their respective components. Mutual transrepression occurs between the estrogen receptor (ER) and NF-kappa-B in a cell-type specific manner. Decreases NF-kappa-B DNA-binding activity and inhibits NF-kappa-B-mediated transcription from the IL6 promoter and displace RELA/p65 and associated coregulators from the promoter. Recruited to the NF-kappa-B response element of the CCL2 and IL8 promoters and can displace CREBBP. Present with NF-kappa-B components RELA/p65 and NFKB1/p50 on ERE sequences. Can also act synergistically with NF-kappa-B to activate transcription involving respective recruitment adjacent response elements; the function involves CREBBP. Can activate the transcriptional activity of TFF1. Also mediates membrane-initiated estrogen signaling involving various kinase cascades. Isoform 3 is involved in activation of NOS3 and endothelial nitric oxide production. Isoforms lacking one or several functional domains are thought to modulate transcriptional activity by competitive ligand or DNA binding and/or heterodimerization with the full-length receptor. Essential for MTA1-mediated transcriptional regulation of BRCA1 and BCAS3. Isoform 3 can bind to ERE and inhibit isoform 1. {ECO:0000269|PubMed:10681512, ECO:0000269|PubMed:10816575, ECO:0000269|PubMed:11477071, ECO:0000269|PubMed:11682626, ECO:0000269|PubMed:14764652, ECO:0000269|PubMed:15078875, ECO:0000269|PubMed:15891768, ECO:0000269|PubMed:16043358, ECO:0000269|PubMed:16617102, ECO:0000269|PubMed:16684779, ECO:0000269|PubMed:17922032, ECO:0000269|PubMed:17932106, ECO:0000269|PubMed:18247370, ECO:0000269|PubMed:19350539, ECO:0000269|PubMed:20074560, ECO:0000269|PubMed:20705611, ECO:0000269|PubMed:21330404, ECO:0000269|PubMed:22083956, ECO:0000269|PubMed:7651415, ECO:0000269|PubMed:9328340}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneARHGAP35chr19:47440665chr6:152201789ENST00000404338+26270_32712751500.0DomainNote=FF 1
HgeneARHGAP35chr19:47440665chr6:152201789ENST00000404338+26368_42212751500.0DomainNote=FF 2
HgeneARHGAP35chr19:47440665chr6:152201789ENST00000404338+26429_48312751500.0DomainNote=FF 3
HgeneARHGAP35chr19:47440665chr6:152201789ENST00000404338+26485_55012751500.0DomainNote=FF 4
HgeneARHGAP35chr19:47440665chr6:152201789ENST00000404338+26592_76712751500.0DomainpG1 pseudoGTPase
HgeneARHGAP35chr19:47440665chr6:152201789ENST00000404338+26783_94712751500.0DomainpG2 pseudoGTPase
HgeneARHGAP35chr19:47440665chr6:152201789ENST00000404338+26201_20312751500.0Nucleotide bindingGTP
HgeneARHGAP35chr19:47440665chr6:152201789ENST00000404338+26229_23112751500.0Nucleotide bindingGTP
HgeneARHGAP35chr19:47440665chr6:152201789ENST00000404338+2633_3712751500.0Nucleotide bindingGTP
HgeneARHGAP35chr19:47440665chr6:152201789ENST00000404338+2695_9712751500.0Nucleotide bindingGTP
HgeneARHGAP35chr19:47440665chr6:152201789ENST00000404338+261213_123612751500.0RegionRequired for phospholipid binding and regulation of the substrate preference
HgeneARHGAP35chr19:47440665chr6:152201789ENST00000404338+261_26612751500.0RegionHas GTPase activity%2C required for proper localization
TgeneESR1chr19:47440665chr6:152201789ENST0000020624918311_547214596.0DomainNR LBD
TgeneESR1chr19:47440665chr6:152201789ENST0000033879929311_547214596.0DomainNR LBD
TgeneESR1chr19:47440665chr6:152201789ENST00000440973310311_547214596.0DomainNR LBD
TgeneESR1chr19:47440665chr6:152201789ENST0000044342729311_547214596.0DomainNR LBD
TgeneESR1chr19:47440665chr6:152201789ENST0000045648328311_547214484.0DomainNR LBD
TgeneESR1chr19:47440665chr6:152201789ENST0000020624918251_310214596.0RegionNote=Hinge
TgeneESR1chr19:47440665chr6:152201789ENST0000020624918264_595214596.0RegionNote=Self-association
TgeneESR1chr19:47440665chr6:152201789ENST0000020624918311_595214596.0RegionNote=Transactivation AF-2
TgeneESR1chr19:47440665chr6:152201789ENST0000033879929251_310214596.0RegionNote=Hinge
TgeneESR1chr19:47440665chr6:152201789ENST0000033879929264_595214596.0RegionNote=Self-association
TgeneESR1chr19:47440665chr6:152201789ENST0000033879929311_595214596.0RegionNote=Transactivation AF-2
TgeneESR1chr19:47440665chr6:152201789ENST00000440973310251_310214596.0RegionNote=Hinge
TgeneESR1chr19:47440665chr6:152201789ENST00000440973310264_595214596.0RegionNote=Self-association
TgeneESR1chr19:47440665chr6:152201789ENST00000440973310311_595214596.0RegionNote=Transactivation AF-2
TgeneESR1chr19:47440665chr6:152201789ENST0000044342729251_310214596.0RegionNote=Hinge
TgeneESR1chr19:47440665chr6:152201789ENST0000044342729264_595214596.0RegionNote=Self-association
TgeneESR1chr19:47440665chr6:152201789ENST0000044342729311_595214596.0RegionNote=Transactivation AF-2
TgeneESR1chr19:47440665chr6:152201789ENST0000045648328251_310214484.0RegionNote=Hinge
TgeneESR1chr19:47440665chr6:152201789ENST0000045648328264_595214484.0RegionNote=Self-association
TgeneESR1chr19:47440665chr6:152201789ENST0000045648328311_595214484.0RegionNote=Transactivation AF-2
TgeneESR1chr19:47440665chr6:152201789ENST0000020624918221_245214596.0Zinc fingerNR C4-type
TgeneESR1chr19:47440665chr6:152201789ENST0000033879929221_245214596.0Zinc fingerNR C4-type
TgeneESR1chr19:47440665chr6:152201789ENST00000440973310221_245214596.0Zinc fingerNR C4-type
TgeneESR1chr19:47440665chr6:152201789ENST0000044342729221_245214596.0Zinc fingerNR C4-type
TgeneESR1chr19:47440665chr6:152201789ENST0000045648328221_245214484.0Zinc fingerNR C4-type

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneARHGAP35chr19:47440665chr6:152201789ENST00000404338+261440_149012751500.0Compositional biasNote=Pro-rich
HgeneARHGAP35chr19:47440665chr6:152201789ENST00000404338+261249_143612751500.0DomainRho-GAP
TgeneESR1chr19:47440665chr6:152201789ENST0000020624918185_250214596.0DNA bindingNuclear receptor
TgeneESR1chr19:47440665chr6:152201789ENST0000033879929185_250214596.0DNA bindingNuclear receptor
TgeneESR1chr19:47440665chr6:152201789ENST00000440973310185_250214596.0DNA bindingNuclear receptor
TgeneESR1chr19:47440665chr6:152201789ENST0000044342729185_250214596.0DNA bindingNuclear receptor
TgeneESR1chr19:47440665chr6:152201789ENST0000045648328185_250214484.0DNA bindingNuclear receptor
TgeneESR1chr19:47440665chr6:152201789ENST0000020624918185_205214596.0Zinc fingerNR C4-type
TgeneESR1chr19:47440665chr6:152201789ENST0000033879929185_205214596.0Zinc fingerNR C4-type
TgeneESR1chr19:47440665chr6:152201789ENST00000440973310185_205214596.0Zinc fingerNR C4-type
TgeneESR1chr19:47440665chr6:152201789ENST0000044342729185_205214596.0Zinc fingerNR C4-type
TgeneESR1chr19:47440665chr6:152201789ENST0000045648328185_205214484.0Zinc fingerNR C4-type


Top

Fusion Gene Sequence for ARHGAP35-ESR1


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>6088_6088_1_ARHGAP35-ESR1_ARHGAP35_chr19_47440665_ENST00000404338_ESR1_chr6_152201789_ENST00000206249_length(transcript)=9276nt_BP=3826nt
ATGATGATGGCAAGAAAGCAAGATGTCCGAATTCCCACCTACAACATCAGTGTGGTGGGATTATCTGGGACCGAGAAGGAAAAGGGCCAG
TGTGGGATTGGAAAGTCTTGTTTGTGCAACCGCTTCGTGCGCCCGAGTGCTGACGAGTTTCACTTGGACCATACCTCCGTCCTCAGCACC
AGTGACTTTGGAGGGCGAGTGGTCAATAATGACCACTTTCTCTACTGGGGAGAAGTTAGCCGCTCCCTGGAGGATTGTGTGGAATGTAAG
ATGCACATTGTGGAGCAGACTGAATTTATTGATGATCAGACTTTTCAACCTCATCGAAGCACGGCCCTGCAGCCCTATATCAAGAGAGCT
GCTGCGACCAAGCTTGCATCAGCTGAAAAACTCATGTACTTTTGCACTGACCAGCTGGGGCTGGAGCAGGACTTTGAGCAGAAACAAATG
CCAGACGGAAAGCTGCTGGTTGATGGTTTTCTTCTTGGTATTGATGTTAGCAGGGGCATGAATAGGAACTTTGATGACCAGCTCAAGTTT
GTCTCCAATCTCTACAATCAGCTTGCAAAAACAAAAAAGCCCATAGTGGTGGTCCTGACTAAGTGTGACGAAGGTGTTGAGCGGTACATT
AGAGATGCACATACTTTTGCCTTAAGCAAAAAGAACCTCCAGGTTGTGGAGACCTCAGCGAGATCCAATGTAAACGTGGACTTGGCTTTC
AGCACCTTAGTGCAACTCATTGATAAAAGTCGGGGAAAGACAAAAATCATTCCTTATTTTGAAGCTCTCAAGCAGCAGAGTCAGCAGATA
GCTACAGCAAAAGACAAGTATGAGTGGCTGGTGAGTCGCATTGTGAAAAACCACAATGAGAACTGGCTGAGTGTCAGCCGAAAGATGCAG
GCCTCTCCAGAATACCAGGACTATGTCTACCTGGAAGGGACTCAGAAAGCCAAGAAGCTGTTTCTACAGCACATCCACCGCCTCAAGCAT
GAGCATATCGAGCGTAGGAGAAAGCTGTACCTGGCAGCCCTGCCATTAGCTTTTGAAGCTCTTATACCTAATCTAGATGAAATAGACCAC
CTAAGCTGCATAAAAGCCAAAAAGCTCTTAGAAACCAAGCCAGAATTCTTGAAGTGGTTTGTTGTGCTTGAAGAGACCCCATGGGATGCC
ACCAGTCACATTGACAACATGGAAAACGAACGGATTCCCTTTGATTTAATGGATACCGTCCCTGCAGAGCAGCTATACGAGGCCCACTTA
GAGAAGCTGAGGAACGAAAGGAAAAGAGTTGAGATGCGAAGGGCGTTTAAAGAAAACCTGGAGACTTCTCCTTTCATAACTCCCGGAAAG
CCTTGGGAAGAGGCCCGTAGTTTTATTATGAATGAGGATTTCTACCAGTGGCTGGAGGAATCTGTATACATGGATATTTATGGCAAACAC
CAAAAGCAAATTATAGATAAAGCAAAGGAAGAATTTCAGGAGTTGCTTTTGGAATATTCAGAATTGTTTTATGAACTGGAGCTGGATGCT
AAGCCCAGCAAGGAGAAGATGGGTGTTATTCAGGATGTTCTGGGAGAGGAACAGCGATTTAAAGCATTACAAAAGCTCCAAGCAGAGCGT
GATGCCCTTATTCTGAAACACATTCATTTTGTGTACCACCCAACAAAGGAGACATGCCCCAGCTGCCCAGCTTGTGTGGACGCTAAGATT
GAGCACTTGATTAGTTCTCGGTTTATCCGGCCGTCTGACCGGAATCAGAAAAATTCACTCTCTGACCCTAACATTGATAGAATCAACTTG
GTTATATTGGGCAAAGACGGCCTTGCCCGAGAGTTGGCCAATGAGATTCGAGCTCTTTGTACAAATGATGACAAGTATGTGATAGATGGT
AAAATGTATGAGCTTTCCCTGAGGCCAATAGAGGGGAATGTCAGGCTTCCTGTGAACTCTTTCCAGACGCCAACATTTCAGCCCCACGGC
TGTCTCTGCCTTTACAATTCAAAGGAATCGCTATCCTATGTAGTGGAAAGTATAGAGAAGAGTAGAGAGTCCACGCTGGGCCGGCGGGAT
AATCATTTAGTCCATCTCCCCCTTACATTAATTTTGGTTAACAAGAGAGGAGACACCAGTGGAGAGACTCTGCATAGCTTAATACAGCAA
GGTCAACAAATTGCTAGCAAACTTCAGTGTGTCTTTCTCGACCCTGCTTCTGCTGGCATTGGTTACGGACGCAACATTAATGAAAAGCAA
ATCAGTCAAGTTTTGAAGGGACTCCTGGACTCTAAGCGTAACTTAAACCTGGTCAGTTCTACTGCTAGCATCAAAGATTTGGCTGATGTT
GATCTGCGAATTGTTATGTGTCTGATGTGTGGAGATCCTTTTAGTGCAGATGACATACTTTTTCCTGTCCTTCAGTCCCAAACCTGTAAA
TCTTCCCATTGTGGAAGCAACAACTCTGTTTTACTTGAACTACCAATCGGACTGCACAAGAAGCGGATTGAACTGTCTGTTCTTTCATAC
CATTCCTCCTTTAGCATCAGAAAGAGCCGGTTGGTTCATGGGTACATTGTTTTTTATTCAGCCAAACGTAAGGCCTCTTTGGCTATGTTA
CGTGCCTTTCTTTGTGAAGTGCAGGATATTATCCCTATTCAGCTTGTAGCACTCACTGATGGCGCTGTAGATGTCCTGGACAATGACTTA
AGTAGGGAACAGCTAACTGAGGGGGAGGAGATTGCTCAAGAAATTGACGGAAGGTTCACAAGCATCCCCTGTAGCCAACCCCAGCATAAA
CTTGAGATCTTTCACCCATTTTTTAAAGATGTGGTGGAAAAAAAGAACATAATCGAGGCTACTCATATGTACGATAATGCTGCCGAGGCC
TGTAGCACCACCGAAGAGGTGTTTAACTCCCCCCGGGCAGGATCACCGCTCTGCAACTCAAACCTGCAGGATTCAGAAGAAGATATCGAG
CCATCTTACAGCCTGTTTCGAGAAGACACATCACTGCCTTCTCTGTCCAAAGACCATTCTAAGCTCTCTATGGAACTGGAGGGAAATGAT
GGGCTGTCTTTCATTATGAGCAATTTTGAGAGTAAACTGAACAACAAAGTACCTCCGCCAGTCAAACCAAAGCCTCCTGTCCATTTTGAA
ATTACAAAGGGGGATCTATCTTATTTAGACCAAGGCCATAGGGATGGACAGAGGAAGTCTGTGTCTTCTAGCCCCTGGCTGCCTCAGGAT
GGGTTTGATCCTTCTGACTATGCTGAACCCATGGATGCTGTGGTGAAGCCAAGGAATGAAGAAGAAAACATATACTCCGTGCCCCATGAC
AGCACCCAAGGCAAAATCATCACCATTCGGAATATCAACAAAGCCCAGTCCAACGGCAGCGGGAATGGTTCTGACAGTGAAATGGACACC
AGCTCTCTAGAGCGAGGGCGCAAGGTTTCCATCGTGAGCAAGCCAGTGCTGTACAGGACGAGATGCACCCGGCTGGGGCGGTTTGCTAGT
TACCGGACCAGCTTCAGCGTGGGGAGTGATGATGAGCTGGGGCCCATCCGGAAGAAAGAGGAGGATCAGGCATCCCAGGGTTATAAAGGG
GACAATGCTGTCATTCCATACGAAACAGACGAAGACCCGCGGAGGAGGAATATTCTTCGCAGCCTAAGGAGGAACACTAAGAAACCAAAG
CCCAAACCCCGGCCATCCATCACAAAGGCAACCTGGGAGAGTAACTATTTTGGGGTGCCCTTAACAACTGTCGTGACTCCAGAGAAGCCG
ATCCCCATTTTTATTGAAAGATGTATTGAGTACATTGAAGCCACAGGACATAACGACTATATGTGTCCAGCCACCAACCAGTGCACCATT
GATAAAAACAGGAGGAAGAGCTGCCAGGCCTGCCGGCTCCGTAAATGCTACGAAGTGGGAATGATGAAAGGTGGGATACGAAAAGACCGA
AGAGGAGGGAGAATGTTGAAACACAAGCGCCAGAGAGATGATGGGGAGGGCAGGGGTGAAGTGGGGTCTGCTGGAGACATGAGAGCTGCC
AACCTTTGGCCAAGCCCGCTCATGATCAAACGCTCTAAGAAGAACAGCCTGGCCTTGTCCCTGACGGCCGACCAGATGGTCAGTGCCTTG
TTGGATGCTGAGCCCCCGATACTCTATTCCGAGTATGATCCTACCAGACCCTTCAGTGAAGCTTCGATGATGGGCTTACTGACCAACCTG
GCAGACAGGGAGCTGGTTCACATGATCAACTGGGCGAAGAGGGTGCCAGGCTTTGTGGATTTGACCCTCCATGATCAGGTCCACCTTCTA
GAATGTGCCTGGCTAGAGATCCTGATGATTGGTCTCGTCTGGCGCTCCATGGAGCACCCAGGGAAGCTACTGTTTGCTCCTAACTTGCTC
TTGGACAGGAACCAGGGAAAATGTGTAGAGGGCATGGTGGAGATCTTCGACATGCTGCTGGCTACATCATCTCGGTTCCGCATGATGAAT
CTGCAGGGAGAGGAGTTTGTGTGCCTCAAATCTATTATTTTGCTTAATTCTGGAGTGTACACATTTCTGTCCAGCACCCTGAAGTCTCTG
GAAGAGAAGGACCATATCCACCGAGTCCTGGACAAGATCACAGACACTTTGATCCACCTGATGGCCAAGGCAGGCCTGACCCTGCAGCAG
CAGCACCAGCGGCTGGCCCAGCTCCTCCTCATCCTCTCCCACATCAGGCACATGAGTAACAAAGGCATGGAGCATCTGTACAGCATGAAG
TGCAAGAACGTGGTGCCCCTCTATGACCTGCTGCTGGAGATGCTGGACGCCCACCGCCTACATGCGCCCACTAGCCGTGGAGGGGCATCC
GTGGAGGAGACGGACCAAAGCCACTTGGCCACTGCGGGCTCTACTTCATCGCATTCCTTGCAAAAGTATTACATCACGGGGGAGGCAGAG
GGTTTCCCTGCCACGGTCTGAGAGCTCCCTGGCTCCCACACGGTTCAGATAATCCCTGCTGCATTTTACCCTCATCATGCACCACTTTAG
CCAAATTCTGTCTCCTGCATACACTCCGGCATGCATCCAACACCAATGGCTTTCTAGATGAGTGGCCATTCATTTGCTTGCTCAGTTCTT
AGTGGCACATCTTCTGTCTTCTGTTGGGAACAGCCAAAGGGATTCCAAGGCTAAATCTTTGTAACAGCTCTCTTTCCCCCTTGCTATGTT
ACTAAGCGTGAGGATTCCCGTAGCTCTTCACAGCTGAACTCAGTCTATGGGTTGGGGCTCAGATAACTCTGTGCATTTAAGCTACTTGTA
GAGACCCAGGCCTGGAGAGTAGACATTTTGCCTCTGATAAGCACTTTTTAAATGGCTCTAAGAATAAGCCACAGCAAAGAATTTAAAGTG
GCTCCTTTAATTGGTGACTTGGAGAAAGCTAGGTCAAGGGTTTATTATAGCACCCTCTTGTATTCCTATGGCAATGCATCCTTTTATGAA
AGTGGTACACCTTAAAGCTTTTATATGACTGTAGCAGAGTATCTGGTGATTGTCAATTCATTCCCCCTATAGGAATACAAGGGGCACACA
GGGAAGGCAGATCCCCTAGTTGGCAAGACTATTTTAACTTGATACACTGCAGATTCAGATGTGCTGAAAGCTCTGCCTCTGGCTTTCCGG
TCATGGGTTCCAGTTAATTCATGCCTCCCATGGACCTATGGAGAGCAGCAAGTTGATCTTAGTTAAGTCTCCCTATATGAGGGATAAGTT
CCTGATTTTTGTTTTTATTTTTGTGTTACAAAAGAAAGCCCTCCCTCCCTGAACTTGCAGTAAGGTCAGCTTCAGGACCTGTTCCAGTGG
GCACTGTACTTGGATCTTCCCGGCGTGTGTGTGCCTTACACAGGGGTGAACTGTTCACTGTGGTGATGCATGATGAGGGTAAATGGTAGT
TGAAAGGAGCAGGGGCCCTGGTGTTGCATTTAGCCCTGGGGCATGGAGCTGAACAGTACTTGTGCAGGATTGTTGTGGCTACTAGAGAAC
AAGAGGGAAAGTAGGGCAGAAACTGGATACAGTTCTGAGGCACAGCCAGACTTGCTCAGGGTGGCCCTGCCACAGGCTGCAGCTACCTAG
GAACATTCCTTGCAGACCCCGCATTGCCCTTTGGGGGTGCCCTGGGATCCCTGGGGTAGTCCAGCTCTTCTTCATTTCCCAGCGTGGCCC
TGGTTGGAAGAAGCAGCTGTCACAGCTGCTGTAGACAGCTGTGTTCCTACAATTGGCCCAGCACCCTGGGGCACGGGAGAAGGGTGGGGA
CCGTTGCTGTCACTACTCAGGCTGACTGGGGCCTGGTCAGATTACGTATGCCCTTGGTGGTTTAGAGATAATCCAAAATCAGGGTTTGGT
TTGGGGAAGAAAATCCTCCCCCTTCCTCCCCCGCCCCGTTCCCTACCGCCTCCACTCCTGCCAGCTCATTTCCTTCAATTTCCTTTGACC
TATAGGCTAAAAAAGAAAGGCTCATTCCAGCCACAGGGCAGCCTTCCCTGGGCCTTTGCTTCTCTAGCACAATTATGGGTTACTTCCTTT
TTCTTAACAAAAAAGAATGTTTGATTTCCTCTGGGTGACCTTATTGTCTGTAATTGAAACCCTATTGAGAGGTGATGTCTGTGTTAGCCA
ATGACCCAGGTGAGCTGCTCGGGCTTCTCTTGGTATGTCTTGTTTGGAAAAGTGGATTTCATTCATTTCTGATTGTCCAGTTAAGTGATC
ACCAAAGGACTGAGAATCTGGGAGGGCAAAAAAAAAAAAAAAGTTTTTATGTGCACTTAAATTTGGGGACAATTTTATGTATCTGTGTTA
AGGATATGTTTAAGAACATAATTCTTTTGTTGCTGTTTGTTTAAGAAGCACCTTAGTTTGTTTAAGAAGCACCTTATATAGTATAATATA
TATTTTTTTGAAATTACATTGCTTGTTTATCAGACAATTGAATGTAGTAATTCTGTTCTGGATTTAATTTGACTGGGTTAACATGCAAAA
ACCAAGGAAAAATATTTAGTTTTTTTTTTTTTTTTTGTATACTTTTCAAGCTACCTTGTCATGTATACAGTCATTTATGCCTAAAGCCTG
GTGATTATTCATTTAAATGAAGATCACATTTCATATCAACTTTTGTATCCACAGTAGACAAAATAGCACTAATCCAGATGCCTATTGTTG
GATACTGAATGACAGACAATCTTATGTAGCAAAGATTATGCCTGAAAAGGAAAATTATTCAGGGCAGCTAATTTTGCTTTTACCAAAATA
TCAGTAGTAATATTTTTGGACAGTAGCTAATGGGTCAGTGGGTTCTTTTTAATGTTTATACTTAGATTTTCTTTTAAAAAAATTAAAATA
AAACAAAAAAAAATTTCTAGGACTAGACGATGTAATACCAGCTAAAGCCAAACAATTATACAGTGGAAGGTTTTACATTATTCATCCAAT
GTGTTTCTATTCATGTTAAGATACTACTACATTTGAAGTGGGCAGAGAACATCAGATGATTGAAATGTTCGCCCAGGGGTCTCCAGCAAC
TTTGGAAATCTCTTTGTATTTTTACTTGAAGTGCCACTAATGGACAGCAGATATTTTCTGGCTGATGTTGGTATTGGGTGTAGGAACATG
ATTTAAAAAAAAACTCTTGCCTCTGCTTTCCCCCACTCTGAGGCAAGTTAAAATGTAAAAGATGTGATTTATCTGGGGGGCTCAGGTATG
GTGGGGAAGTGGATTCAGGAATCTGGGGAATGGCAAATATATTAAGAAGAGTATTGAAAGTATTTGGAGGAAAATGGTTAATTCTGGGTG
TGCACCAGGGTTCAGTAGAGTCCACTTCTGCCCTGGAGACCACAAATCAACTAGCTCCATTTACAGCCATTTCTAAAATGGCAGCTTCAG
TTCTAGAGAAGAAAGAACAACATCAGCAGTAAAGTCCATGGAATAGCTAGTGGTCTGTGTTTCTTTTCGCCATTGCCTAGCTTGCCGTAA
TGATTCTATAATGCCATCATGCAGCAATTATGAGAGGCTAGGTCATCCAAAGAGAAGACCCTATCAATGTAGGTTGCAAAATCTAACCCC
TAAGGAAGTGCAGTCTTTGATTTGATTTCCCTAGTAACCTTGCAGATATGTTTAACCAAGCCATAGCCCATGCCTTTTGAGGGCTGAACA
AATAAGGGACTTACTGATAATTTACTTTTGATCACATTAAGGTGTTCTCACCTTGAAATCTTATACACTGAAATGGCCATTGATTTAGGC
CACTGGCTTAGAGTACTCCTTCCCCTGCATGACACTGATTACAAATACTTTCCTATTCATACTTTCCAATTATGAGATGGACTGTGGGTA
CTGGGAGTGATCACTAACACCATAGTAATGTCTAATATTCACAGGCAGATCTGCTTGGGGAAGCTAGTTATGTGAAAGGCAAATAGAGTC
ATACAGTAGCTCAAAAGGCAACCATAATTCTCTTTGGTGCAGGTCTTGGGAGCGTGATCTAGATTACACTGCACCATTCCCAAGTTAATC
CCCTGAAAACTTACTCTCAACTGGAGCAAATGAACTTTGGTCCCAAATATCCATCTTTTCAGTAGCGTTAATTATGCTCTGTTTCCAACT
GCATTTCCTTTCCAATTGAATTAAAGTGTGGCCTCGTTTTTAGTCATTTAAAATTGTTTTCTAAGTAATTGCTGCCTCTATTATGGCACT
TCAATTTTGCACTGTCTTTTGAGATTCAAGAAAAATTTCTATTCTTTTTTTTGCATCCAATTGTGCCTGAACTTTTAAAATATGTAAATG
CTGCCATGTTCCAAACCCATCGTCAGTGTGTGTGTTTAGAGCTGTGCACCCTAGAAACAACATATTGTCCCATGAGCAGGTGCCTGAGAC
ACAGACCCCTTTGCATTCACAGAGAGGTCATTGGTTATAGAGACTTGAATTAATAAGTGACATTATGCCAGTTTCTGTTCTCTCACAGGT
GATAAACAATGCTTTTTGTGCACTACATACTCTTCAGTGTAGAGCTCTTGTTTTATGGGAAAAGGCTCAAATGCCAAATTGTGTTTGATG
GATTAATATGCCCTTTTGCCGATGCATACTATTACTGATGTGACTCGGTTTTGTCGCAGCTTTGCTTTGTTTAATGAAACACACTTGTAA
ACCTCTTTTGCACTTTGAAAAAGAATCCAGCGGGATGCTCGAGCACCTGTAAACAATTTTCTCAACCTATTTGATGTTCAAATAAAGAAT

>6088_6088_1_ARHGAP35-ESR1_ARHGAP35_chr19_47440665_ENST00000404338_ESR1_chr6_152201789_ENST00000206249_length(amino acids)=1656AA_BP=1275
MMMARKQDVRIPTYNISVVGLSGTEKEKGQCGIGKSCLCNRFVRPSADEFHLDHTSVLSTSDFGGRVVNNDHFLYWGEVSRSLEDCVECK
MHIVEQTEFIDDQTFQPHRSTALQPYIKRAAATKLASAEKLMYFCTDQLGLEQDFEQKQMPDGKLLVDGFLLGIDVSRGMNRNFDDQLKF
VSNLYNQLAKTKKPIVVVLTKCDEGVERYIRDAHTFALSKKNLQVVETSARSNVNVDLAFSTLVQLIDKSRGKTKIIPYFEALKQQSQQI
ATAKDKYEWLVSRIVKNHNENWLSVSRKMQASPEYQDYVYLEGTQKAKKLFLQHIHRLKHEHIERRRKLYLAALPLAFEALIPNLDEIDH
LSCIKAKKLLETKPEFLKWFVVLEETPWDATSHIDNMENERIPFDLMDTVPAEQLYEAHLEKLRNERKRVEMRRAFKENLETSPFITPGK
PWEEARSFIMNEDFYQWLEESVYMDIYGKHQKQIIDKAKEEFQELLLEYSELFYELELDAKPSKEKMGVIQDVLGEEQRFKALQKLQAER
DALILKHIHFVYHPTKETCPSCPACVDAKIEHLISSRFIRPSDRNQKNSLSDPNIDRINLVILGKDGLARELANEIRALCTNDDKYVIDG
KMYELSLRPIEGNVRLPVNSFQTPTFQPHGCLCLYNSKESLSYVVESIEKSRESTLGRRDNHLVHLPLTLILVNKRGDTSGETLHSLIQQ
GQQIASKLQCVFLDPASAGIGYGRNINEKQISQVLKGLLDSKRNLNLVSSTASIKDLADVDLRIVMCLMCGDPFSADDILFPVLQSQTCK
SSHCGSNNSVLLELPIGLHKKRIELSVLSYHSSFSIRKSRLVHGYIVFYSAKRKASLAMLRAFLCEVQDIIPIQLVALTDGAVDVLDNDL
SREQLTEGEEIAQEIDGRFTSIPCSQPQHKLEIFHPFFKDVVEKKNIIEATHMYDNAAEACSTTEEVFNSPRAGSPLCNSNLQDSEEDIE
PSYSLFREDTSLPSLSKDHSKLSMELEGNDGLSFIMSNFESKLNNKVPPPVKPKPPVHFEITKGDLSYLDQGHRDGQRKSVSSSPWLPQD
GFDPSDYAEPMDAVVKPRNEEENIYSVPHDSTQGKIITIRNINKAQSNGSGNGSDSEMDTSSLERGRKVSIVSKPVLYRTRCTRLGRFAS
YRTSFSVGSDDELGPIRKKEEDQASQGYKGDNAVIPYETDEDPRRRNILRSLRRNTKKPKPKPRPSITKATWESNYFGVPLTTVVTPEKP
IPIFIERCIEYIEATGHNDYMCPATNQCTIDKNRRKSCQACRLRKCYEVGMMKGGIRKDRRGGRMLKHKRQRDDGEGRGEVGSAGDMRAA
NLWPSPLMIKRSKKNSLALSLTADQMVSALLDAEPPILYSEYDPTRPFSEASMMGLLTNLADRELVHMINWAKRVPGFVDLTLHDQVHLL
ECAWLEILMIGLVWRSMEHPGKLLFAPNLLLDRNQGKCVEGMVEIFDMLLATSSRFRMMNLQGEEFVCLKSIILLNSGVYTFLSSTLKSL
EEKDHIHRVLDKITDTLIHLMAKAGLTLQQQHQRLAQLLLILSHIRHMSNKGMEHLYSMKCKNVVPLYDLLLEMLDAHRLHAPTSRGGAS

--------------------------------------------------------------
>6088_6088_2_ARHGAP35-ESR1_ARHGAP35_chr19_47440665_ENST00000404338_ESR1_chr6_152201789_ENST00000338799_length(transcript)=5058nt_BP=3826nt
ATGATGATGGCAAGAAAGCAAGATGTCCGAATTCCCACCTACAACATCAGTGTGGTGGGATTATCTGGGACCGAGAAGGAAAAGGGCCAG
TGTGGGATTGGAAAGTCTTGTTTGTGCAACCGCTTCGTGCGCCCGAGTGCTGACGAGTTTCACTTGGACCATACCTCCGTCCTCAGCACC
AGTGACTTTGGAGGGCGAGTGGTCAATAATGACCACTTTCTCTACTGGGGAGAAGTTAGCCGCTCCCTGGAGGATTGTGTGGAATGTAAG
ATGCACATTGTGGAGCAGACTGAATTTATTGATGATCAGACTTTTCAACCTCATCGAAGCACGGCCCTGCAGCCCTATATCAAGAGAGCT
GCTGCGACCAAGCTTGCATCAGCTGAAAAACTCATGTACTTTTGCACTGACCAGCTGGGGCTGGAGCAGGACTTTGAGCAGAAACAAATG
CCAGACGGAAAGCTGCTGGTTGATGGTTTTCTTCTTGGTATTGATGTTAGCAGGGGCATGAATAGGAACTTTGATGACCAGCTCAAGTTT
GTCTCCAATCTCTACAATCAGCTTGCAAAAACAAAAAAGCCCATAGTGGTGGTCCTGACTAAGTGTGACGAAGGTGTTGAGCGGTACATT
AGAGATGCACATACTTTTGCCTTAAGCAAAAAGAACCTCCAGGTTGTGGAGACCTCAGCGAGATCCAATGTAAACGTGGACTTGGCTTTC
AGCACCTTAGTGCAACTCATTGATAAAAGTCGGGGAAAGACAAAAATCATTCCTTATTTTGAAGCTCTCAAGCAGCAGAGTCAGCAGATA
GCTACAGCAAAAGACAAGTATGAGTGGCTGGTGAGTCGCATTGTGAAAAACCACAATGAGAACTGGCTGAGTGTCAGCCGAAAGATGCAG
GCCTCTCCAGAATACCAGGACTATGTCTACCTGGAAGGGACTCAGAAAGCCAAGAAGCTGTTTCTACAGCACATCCACCGCCTCAAGCAT
GAGCATATCGAGCGTAGGAGAAAGCTGTACCTGGCAGCCCTGCCATTAGCTTTTGAAGCTCTTATACCTAATCTAGATGAAATAGACCAC
CTAAGCTGCATAAAAGCCAAAAAGCTCTTAGAAACCAAGCCAGAATTCTTGAAGTGGTTTGTTGTGCTTGAAGAGACCCCATGGGATGCC
ACCAGTCACATTGACAACATGGAAAACGAACGGATTCCCTTTGATTTAATGGATACCGTCCCTGCAGAGCAGCTATACGAGGCCCACTTA
GAGAAGCTGAGGAACGAAAGGAAAAGAGTTGAGATGCGAAGGGCGTTTAAAGAAAACCTGGAGACTTCTCCTTTCATAACTCCCGGAAAG
CCTTGGGAAGAGGCCCGTAGTTTTATTATGAATGAGGATTTCTACCAGTGGCTGGAGGAATCTGTATACATGGATATTTATGGCAAACAC
CAAAAGCAAATTATAGATAAAGCAAAGGAAGAATTTCAGGAGTTGCTTTTGGAATATTCAGAATTGTTTTATGAACTGGAGCTGGATGCT
AAGCCCAGCAAGGAGAAGATGGGTGTTATTCAGGATGTTCTGGGAGAGGAACAGCGATTTAAAGCATTACAAAAGCTCCAAGCAGAGCGT
GATGCCCTTATTCTGAAACACATTCATTTTGTGTACCACCCAACAAAGGAGACATGCCCCAGCTGCCCAGCTTGTGTGGACGCTAAGATT
GAGCACTTGATTAGTTCTCGGTTTATCCGGCCGTCTGACCGGAATCAGAAAAATTCACTCTCTGACCCTAACATTGATAGAATCAACTTG
GTTATATTGGGCAAAGACGGCCTTGCCCGAGAGTTGGCCAATGAGATTCGAGCTCTTTGTACAAATGATGACAAGTATGTGATAGATGGT
AAAATGTATGAGCTTTCCCTGAGGCCAATAGAGGGGAATGTCAGGCTTCCTGTGAACTCTTTCCAGACGCCAACATTTCAGCCCCACGGC
TGTCTCTGCCTTTACAATTCAAAGGAATCGCTATCCTATGTAGTGGAAAGTATAGAGAAGAGTAGAGAGTCCACGCTGGGCCGGCGGGAT
AATCATTTAGTCCATCTCCCCCTTACATTAATTTTGGTTAACAAGAGAGGAGACACCAGTGGAGAGACTCTGCATAGCTTAATACAGCAA
GGTCAACAAATTGCTAGCAAACTTCAGTGTGTCTTTCTCGACCCTGCTTCTGCTGGCATTGGTTACGGACGCAACATTAATGAAAAGCAA
ATCAGTCAAGTTTTGAAGGGACTCCTGGACTCTAAGCGTAACTTAAACCTGGTCAGTTCTACTGCTAGCATCAAAGATTTGGCTGATGTT
GATCTGCGAATTGTTATGTGTCTGATGTGTGGAGATCCTTTTAGTGCAGATGACATACTTTTTCCTGTCCTTCAGTCCCAAACCTGTAAA
TCTTCCCATTGTGGAAGCAACAACTCTGTTTTACTTGAACTACCAATCGGACTGCACAAGAAGCGGATTGAACTGTCTGTTCTTTCATAC
CATTCCTCCTTTAGCATCAGAAAGAGCCGGTTGGTTCATGGGTACATTGTTTTTTATTCAGCCAAACGTAAGGCCTCTTTGGCTATGTTA
CGTGCCTTTCTTTGTGAAGTGCAGGATATTATCCCTATTCAGCTTGTAGCACTCACTGATGGCGCTGTAGATGTCCTGGACAATGACTTA
AGTAGGGAACAGCTAACTGAGGGGGAGGAGATTGCTCAAGAAATTGACGGAAGGTTCACAAGCATCCCCTGTAGCCAACCCCAGCATAAA
CTTGAGATCTTTCACCCATTTTTTAAAGATGTGGTGGAAAAAAAGAACATAATCGAGGCTACTCATATGTACGATAATGCTGCCGAGGCC
TGTAGCACCACCGAAGAGGTGTTTAACTCCCCCCGGGCAGGATCACCGCTCTGCAACTCAAACCTGCAGGATTCAGAAGAAGATATCGAG
CCATCTTACAGCCTGTTTCGAGAAGACACATCACTGCCTTCTCTGTCCAAAGACCATTCTAAGCTCTCTATGGAACTGGAGGGAAATGAT
GGGCTGTCTTTCATTATGAGCAATTTTGAGAGTAAACTGAACAACAAAGTACCTCCGCCAGTCAAACCAAAGCCTCCTGTCCATTTTGAA
ATTACAAAGGGGGATCTATCTTATTTAGACCAAGGCCATAGGGATGGACAGAGGAAGTCTGTGTCTTCTAGCCCCTGGCTGCCTCAGGAT
GGGTTTGATCCTTCTGACTATGCTGAACCCATGGATGCTGTGGTGAAGCCAAGGAATGAAGAAGAAAACATATACTCCGTGCCCCATGAC
AGCACCCAAGGCAAAATCATCACCATTCGGAATATCAACAAAGCCCAGTCCAACGGCAGCGGGAATGGTTCTGACAGTGAAATGGACACC
AGCTCTCTAGAGCGAGGGCGCAAGGTTTCCATCGTGAGCAAGCCAGTGCTGTACAGGACGAGATGCACCCGGCTGGGGCGGTTTGCTAGT
TACCGGACCAGCTTCAGCGTGGGGAGTGATGATGAGCTGGGGCCCATCCGGAAGAAAGAGGAGGATCAGGCATCCCAGGGTTATAAAGGG
GACAATGCTGTCATTCCATACGAAACAGACGAAGACCCGCGGAGGAGGAATATTCTTCGCAGCCTAAGGAGGAACACTAAGAAACCAAAG
CCCAAACCCCGGCCATCCATCACAAAGGCAACCTGGGAGAGTAACTATTTTGGGGTGCCCTTAACAACTGTCGTGACTCCAGAGAAGCCG
ATCCCCATTTTTATTGAAAGATGTATTGAGTACATTGAAGCCACAGGACATAACGACTATATGTGTCCAGCCACCAACCAGTGCACCATT
GATAAAAACAGGAGGAAGAGCTGCCAGGCCTGCCGGCTCCGTAAATGCTACGAAGTGGGAATGATGAAAGGTGGGATACGAAAAGACCGA
AGAGGAGGGAGAATGTTGAAACACAAGCGCCAGAGAGATGATGGGGAGGGCAGGGGTGAAGTGGGGTCTGCTGGAGACATGAGAGCTGCC
AACCTTTGGCCAAGCCCGCTCATGATCAAACGCTCTAAGAAGAACAGCCTGGCCTTGTCCCTGACGGCCGACCAGATGGTCAGTGCCTTG
TTGGATGCTGAGCCCCCGATACTCTATTCCGAGTATGATCCTACCAGACCCTTCAGTGAAGCTTCGATGATGGGCTTACTGACCAACCTG
GCAGACAGGGAGCTGGTTCACATGATCAACTGGGCGAAGAGGGTGCCAGGCTTTGTGGATTTGACCCTCCATGATCAGGTCCACCTTCTA
GAATGTGCCTGGCTAGAGATCCTGATGATTGGTCTCGTCTGGCGCTCCATGGAGCACCCAGGGAAGCTACTGTTTGCTCCTAACTTGCTC
TTGGACAGGAACCAGGGAAAATGTGTAGAGGGCATGGTGGAGATCTTCGACATGCTGCTGGCTACATCATCTCGGTTCCGCATGATGAAT
CTGCAGGGAGAGGAGTTTGTGTGCCTCAAATCTATTATTTTGCTTAATTCTGGAGTGTACACATTTCTGTCCAGCACCCTGAAGTCTCTG
GAAGAGAAGGACCATATCCACCGAGTCCTGGACAAGATCACAGACACTTTGATCCACCTGATGGCCAAGGCAGGCCTGACCCTGCAGCAG
CAGCACCAGCGGCTGGCCCAGCTCCTCCTCATCCTCTCCCACATCAGGCACATGAGTAACAAAGGCATGGAGCATCTGTACAGCATGAAG
TGCAAGAACGTGGTGCCCCTCTATGACCTGCTGCTGGAGATGCTGGACGCCCACCGCCTACATGCGCCCACTAGCCGTGGAGGGGCATCC
GTGGAGGAGACGGACCAAAGCCACTTGGCCACTGCGGGCTCTACTTCATCGCATTCCTTGCAAAAGTATTACATCACGGGGGAGGCAGAG
GGTTTCCCTGCCACGGTCTGAGAGCTCCCTGGCTCCCACACGGTTCAGATAATCCCTGCTGCATTTTACCCTCATCATGCACCACTTTAG

>6088_6088_2_ARHGAP35-ESR1_ARHGAP35_chr19_47440665_ENST00000404338_ESR1_chr6_152201789_ENST00000338799_length(amino acids)=1656AA_BP=1275
MMMARKQDVRIPTYNISVVGLSGTEKEKGQCGIGKSCLCNRFVRPSADEFHLDHTSVLSTSDFGGRVVNNDHFLYWGEVSRSLEDCVECK
MHIVEQTEFIDDQTFQPHRSTALQPYIKRAAATKLASAEKLMYFCTDQLGLEQDFEQKQMPDGKLLVDGFLLGIDVSRGMNRNFDDQLKF
VSNLYNQLAKTKKPIVVVLTKCDEGVERYIRDAHTFALSKKNLQVVETSARSNVNVDLAFSTLVQLIDKSRGKTKIIPYFEALKQQSQQI
ATAKDKYEWLVSRIVKNHNENWLSVSRKMQASPEYQDYVYLEGTQKAKKLFLQHIHRLKHEHIERRRKLYLAALPLAFEALIPNLDEIDH
LSCIKAKKLLETKPEFLKWFVVLEETPWDATSHIDNMENERIPFDLMDTVPAEQLYEAHLEKLRNERKRVEMRRAFKENLETSPFITPGK
PWEEARSFIMNEDFYQWLEESVYMDIYGKHQKQIIDKAKEEFQELLLEYSELFYELELDAKPSKEKMGVIQDVLGEEQRFKALQKLQAER
DALILKHIHFVYHPTKETCPSCPACVDAKIEHLISSRFIRPSDRNQKNSLSDPNIDRINLVILGKDGLARELANEIRALCTNDDKYVIDG
KMYELSLRPIEGNVRLPVNSFQTPTFQPHGCLCLYNSKESLSYVVESIEKSRESTLGRRDNHLVHLPLTLILVNKRGDTSGETLHSLIQQ
GQQIASKLQCVFLDPASAGIGYGRNINEKQISQVLKGLLDSKRNLNLVSSTASIKDLADVDLRIVMCLMCGDPFSADDILFPVLQSQTCK
SSHCGSNNSVLLELPIGLHKKRIELSVLSYHSSFSIRKSRLVHGYIVFYSAKRKASLAMLRAFLCEVQDIIPIQLVALTDGAVDVLDNDL
SREQLTEGEEIAQEIDGRFTSIPCSQPQHKLEIFHPFFKDVVEKKNIIEATHMYDNAAEACSTTEEVFNSPRAGSPLCNSNLQDSEEDIE
PSYSLFREDTSLPSLSKDHSKLSMELEGNDGLSFIMSNFESKLNNKVPPPVKPKPPVHFEITKGDLSYLDQGHRDGQRKSVSSSPWLPQD
GFDPSDYAEPMDAVVKPRNEEENIYSVPHDSTQGKIITIRNINKAQSNGSGNGSDSEMDTSSLERGRKVSIVSKPVLYRTRCTRLGRFAS
YRTSFSVGSDDELGPIRKKEEDQASQGYKGDNAVIPYETDEDPRRRNILRSLRRNTKKPKPKPRPSITKATWESNYFGVPLTTVVTPEKP
IPIFIERCIEYIEATGHNDYMCPATNQCTIDKNRRKSCQACRLRKCYEVGMMKGGIRKDRRGGRMLKHKRQRDDGEGRGEVGSAGDMRAA
NLWPSPLMIKRSKKNSLALSLTADQMVSALLDAEPPILYSEYDPTRPFSEASMMGLLTNLADRELVHMINWAKRVPGFVDLTLHDQVHLL
ECAWLEILMIGLVWRSMEHPGKLLFAPNLLLDRNQGKCVEGMVEIFDMLLATSSRFRMMNLQGEEFVCLKSIILLNSGVYTFLSSTLKSL
EEKDHIHRVLDKITDTLIHLMAKAGLTLQQQHQRLAQLLLILSHIRHMSNKGMEHLYSMKCKNVVPLYDLLLEMLDAHRLHAPTSRGGAS

--------------------------------------------------------------
>6088_6088_3_ARHGAP35-ESR1_ARHGAP35_chr19_47440665_ENST00000404338_ESR1_chr6_152201789_ENST00000440973_length(transcript)=9279nt_BP=3826nt
ATGATGATGGCAAGAAAGCAAGATGTCCGAATTCCCACCTACAACATCAGTGTGGTGGGATTATCTGGGACCGAGAAGGAAAAGGGCCAG
TGTGGGATTGGAAAGTCTTGTTTGTGCAACCGCTTCGTGCGCCCGAGTGCTGACGAGTTTCACTTGGACCATACCTCCGTCCTCAGCACC
AGTGACTTTGGAGGGCGAGTGGTCAATAATGACCACTTTCTCTACTGGGGAGAAGTTAGCCGCTCCCTGGAGGATTGTGTGGAATGTAAG
ATGCACATTGTGGAGCAGACTGAATTTATTGATGATCAGACTTTTCAACCTCATCGAAGCACGGCCCTGCAGCCCTATATCAAGAGAGCT
GCTGCGACCAAGCTTGCATCAGCTGAAAAACTCATGTACTTTTGCACTGACCAGCTGGGGCTGGAGCAGGACTTTGAGCAGAAACAAATG
CCAGACGGAAAGCTGCTGGTTGATGGTTTTCTTCTTGGTATTGATGTTAGCAGGGGCATGAATAGGAACTTTGATGACCAGCTCAAGTTT
GTCTCCAATCTCTACAATCAGCTTGCAAAAACAAAAAAGCCCATAGTGGTGGTCCTGACTAAGTGTGACGAAGGTGTTGAGCGGTACATT
AGAGATGCACATACTTTTGCCTTAAGCAAAAAGAACCTCCAGGTTGTGGAGACCTCAGCGAGATCCAATGTAAACGTGGACTTGGCTTTC
AGCACCTTAGTGCAACTCATTGATAAAAGTCGGGGAAAGACAAAAATCATTCCTTATTTTGAAGCTCTCAAGCAGCAGAGTCAGCAGATA
GCTACAGCAAAAGACAAGTATGAGTGGCTGGTGAGTCGCATTGTGAAAAACCACAATGAGAACTGGCTGAGTGTCAGCCGAAAGATGCAG
GCCTCTCCAGAATACCAGGACTATGTCTACCTGGAAGGGACTCAGAAAGCCAAGAAGCTGTTTCTACAGCACATCCACCGCCTCAAGCAT
GAGCATATCGAGCGTAGGAGAAAGCTGTACCTGGCAGCCCTGCCATTAGCTTTTGAAGCTCTTATACCTAATCTAGATGAAATAGACCAC
CTAAGCTGCATAAAAGCCAAAAAGCTCTTAGAAACCAAGCCAGAATTCTTGAAGTGGTTTGTTGTGCTTGAAGAGACCCCATGGGATGCC
ACCAGTCACATTGACAACATGGAAAACGAACGGATTCCCTTTGATTTAATGGATACCGTCCCTGCAGAGCAGCTATACGAGGCCCACTTA
GAGAAGCTGAGGAACGAAAGGAAAAGAGTTGAGATGCGAAGGGCGTTTAAAGAAAACCTGGAGACTTCTCCTTTCATAACTCCCGGAAAG
CCTTGGGAAGAGGCCCGTAGTTTTATTATGAATGAGGATTTCTACCAGTGGCTGGAGGAATCTGTATACATGGATATTTATGGCAAACAC
CAAAAGCAAATTATAGATAAAGCAAAGGAAGAATTTCAGGAGTTGCTTTTGGAATATTCAGAATTGTTTTATGAACTGGAGCTGGATGCT
AAGCCCAGCAAGGAGAAGATGGGTGTTATTCAGGATGTTCTGGGAGAGGAACAGCGATTTAAAGCATTACAAAAGCTCCAAGCAGAGCGT
GATGCCCTTATTCTGAAACACATTCATTTTGTGTACCACCCAACAAAGGAGACATGCCCCAGCTGCCCAGCTTGTGTGGACGCTAAGATT
GAGCACTTGATTAGTTCTCGGTTTATCCGGCCGTCTGACCGGAATCAGAAAAATTCACTCTCTGACCCTAACATTGATAGAATCAACTTG
GTTATATTGGGCAAAGACGGCCTTGCCCGAGAGTTGGCCAATGAGATTCGAGCTCTTTGTACAAATGATGACAAGTATGTGATAGATGGT
AAAATGTATGAGCTTTCCCTGAGGCCAATAGAGGGGAATGTCAGGCTTCCTGTGAACTCTTTCCAGACGCCAACATTTCAGCCCCACGGC
TGTCTCTGCCTTTACAATTCAAAGGAATCGCTATCCTATGTAGTGGAAAGTATAGAGAAGAGTAGAGAGTCCACGCTGGGCCGGCGGGAT
AATCATTTAGTCCATCTCCCCCTTACATTAATTTTGGTTAACAAGAGAGGAGACACCAGTGGAGAGACTCTGCATAGCTTAATACAGCAA
GGTCAACAAATTGCTAGCAAACTTCAGTGTGTCTTTCTCGACCCTGCTTCTGCTGGCATTGGTTACGGACGCAACATTAATGAAAAGCAA
ATCAGTCAAGTTTTGAAGGGACTCCTGGACTCTAAGCGTAACTTAAACCTGGTCAGTTCTACTGCTAGCATCAAAGATTTGGCTGATGTT
GATCTGCGAATTGTTATGTGTCTGATGTGTGGAGATCCTTTTAGTGCAGATGACATACTTTTTCCTGTCCTTCAGTCCCAAACCTGTAAA
TCTTCCCATTGTGGAAGCAACAACTCTGTTTTACTTGAACTACCAATCGGACTGCACAAGAAGCGGATTGAACTGTCTGTTCTTTCATAC
CATTCCTCCTTTAGCATCAGAAAGAGCCGGTTGGTTCATGGGTACATTGTTTTTTATTCAGCCAAACGTAAGGCCTCTTTGGCTATGTTA
CGTGCCTTTCTTTGTGAAGTGCAGGATATTATCCCTATTCAGCTTGTAGCACTCACTGATGGCGCTGTAGATGTCCTGGACAATGACTTA
AGTAGGGAACAGCTAACTGAGGGGGAGGAGATTGCTCAAGAAATTGACGGAAGGTTCACAAGCATCCCCTGTAGCCAACCCCAGCATAAA
CTTGAGATCTTTCACCCATTTTTTAAAGATGTGGTGGAAAAAAAGAACATAATCGAGGCTACTCATATGTACGATAATGCTGCCGAGGCC
TGTAGCACCACCGAAGAGGTGTTTAACTCCCCCCGGGCAGGATCACCGCTCTGCAACTCAAACCTGCAGGATTCAGAAGAAGATATCGAG
CCATCTTACAGCCTGTTTCGAGAAGACACATCACTGCCTTCTCTGTCCAAAGACCATTCTAAGCTCTCTATGGAACTGGAGGGAAATGAT
GGGCTGTCTTTCATTATGAGCAATTTTGAGAGTAAACTGAACAACAAAGTACCTCCGCCAGTCAAACCAAAGCCTCCTGTCCATTTTGAA
ATTACAAAGGGGGATCTATCTTATTTAGACCAAGGCCATAGGGATGGACAGAGGAAGTCTGTGTCTTCTAGCCCCTGGCTGCCTCAGGAT
GGGTTTGATCCTTCTGACTATGCTGAACCCATGGATGCTGTGGTGAAGCCAAGGAATGAAGAAGAAAACATATACTCCGTGCCCCATGAC
AGCACCCAAGGCAAAATCATCACCATTCGGAATATCAACAAAGCCCAGTCCAACGGCAGCGGGAATGGTTCTGACAGTGAAATGGACACC
AGCTCTCTAGAGCGAGGGCGCAAGGTTTCCATCGTGAGCAAGCCAGTGCTGTACAGGACGAGATGCACCCGGCTGGGGCGGTTTGCTAGT
TACCGGACCAGCTTCAGCGTGGGGAGTGATGATGAGCTGGGGCCCATCCGGAAGAAAGAGGAGGATCAGGCATCCCAGGGTTATAAAGGG
GACAATGCTGTCATTCCATACGAAACAGACGAAGACCCGCGGAGGAGGAATATTCTTCGCAGCCTAAGGAGGAACACTAAGAAACCAAAG
CCCAAACCCCGGCCATCCATCACAAAGGCAACCTGGGAGAGTAACTATTTTGGGGTGCCCTTAACAACTGTCGTGACTCCAGAGAAGCCG
ATCCCCATTTTTATTGAAAGATGTATTGAGTACATTGAAGCCACAGGACATAACGACTATATGTGTCCAGCCACCAACCAGTGCACCATT
GATAAAAACAGGAGGAAGAGCTGCCAGGCCTGCCGGCTCCGTAAATGCTACGAAGTGGGAATGATGAAAGGTGGGATACGAAAAGACCGA
AGAGGAGGGAGAATGTTGAAACACAAGCGCCAGAGAGATGATGGGGAGGGCAGGGGTGAAGTGGGGTCTGCTGGAGACATGAGAGCTGCC
AACCTTTGGCCAAGCCCGCTCATGATCAAACGCTCTAAGAAGAACAGCCTGGCCTTGTCCCTGACGGCCGACCAGATGGTCAGTGCCTTG
TTGGATGCTGAGCCCCCGATACTCTATTCCGAGTATGATCCTACCAGACCCTTCAGTGAAGCTTCGATGATGGGCTTACTGACCAACCTG
GCAGACAGGGAGCTGGTTCACATGATCAACTGGGCGAAGAGGGTGCCAGGCTTTGTGGATTTGACCCTCCATGATCAGGTCCACCTTCTA
GAATGTGCCTGGCTAGAGATCCTGATGATTGGTCTCGTCTGGCGCTCCATGGAGCACCCAGGGAAGCTACTGTTTGCTCCTAACTTGCTC
TTGGACAGGAACCAGGGAAAATGTGTAGAGGGCATGGTGGAGATCTTCGACATGCTGCTGGCTACATCATCTCGGTTCCGCATGATGAAT
CTGCAGGGAGAGGAGTTTGTGTGCCTCAAATCTATTATTTTGCTTAATTCTGGAGTGTACACATTTCTGTCCAGCACCCTGAAGTCTCTG
GAAGAGAAGGACCATATCCACCGAGTCCTGGACAAGATCACAGACACTTTGATCCACCTGATGGCCAAGGCAGGCCTGACCCTGCAGCAG
CAGCACCAGCGGCTGGCCCAGCTCCTCCTCATCCTCTCCCACATCAGGCACATGAGTAACAAAGGCATGGAGCATCTGTACAGCATGAAG
TGCAAGAACGTGGTGCCCCTCTATGACCTGCTGCTGGAGATGCTGGACGCCCACCGCCTACATGCGCCCACTAGCCGTGGAGGGGCATCC
GTGGAGGAGACGGACCAAAGCCACTTGGCCACTGCGGGCTCTACTTCATCGCATTCCTTGCAAAAGTATTACATCACGGGGGAGGCAGAG
GGTTTCCCTGCCACGGTCTGAGAGCTCCCTGGCTCCCACACGGTTCAGATAATCCCTGCTGCATTTTACCCTCATCATGCACCACTTTAG
CCAAATTCTGTCTCCTGCATACACTCCGGCATGCATCCAACACCAATGGCTTTCTAGATGAGTGGCCATTCATTTGCTTGCTCAGTTCTT
AGTGGCACATCTTCTGTCTTCTGTTGGGAACAGCCAAAGGGATTCCAAGGCTAAATCTTTGTAACAGCTCTCTTTCCCCCTTGCTATGTT
ACTAAGCGTGAGGATTCCCGTAGCTCTTCACAGCTGAACTCAGTCTATGGGTTGGGGCTCAGATAACTCTGTGCATTTAAGCTACTTGTA
GAGACCCAGGCCTGGAGAGTAGACATTTTGCCTCTGATAAGCACTTTTTAAATGGCTCTAAGAATAAGCCACAGCAAAGAATTTAAAGTG
GCTCCTTTAATTGGTGACTTGGAGAAAGCTAGGTCAAGGGTTTATTATAGCACCCTCTTGTATTCCTATGGCAATGCATCCTTTTATGAA
AGTGGTACACCTTAAAGCTTTTATATGACTGTAGCAGAGTATCTGGTGATTGTCAATTCATTCCCCCTATAGGAATACAAGGGGCACACA
GGGAAGGCAGATCCCCTAGTTGGCAAGACTATTTTAACTTGATACACTGCAGATTCAGATGTGCTGAAAGCTCTGCCTCTGGCTTTCCGG
TCATGGGTTCCAGTTAATTCATGCCTCCCATGGACCTATGGAGAGCAGCAAGTTGATCTTAGTTAAGTCTCCCTATATGAGGGATAAGTT
CCTGATTTTTGTTTTTATTTTTGTGTTACAAAAGAAAGCCCTCCCTCCCTGAACTTGCAGTAAGGTCAGCTTCAGGACCTGTTCCAGTGG
GCACTGTACTTGGATCTTCCCGGCGTGTGTGTGCCTTACACAGGGGTGAACTGTTCACTGTGGTGATGCATGATGAGGGTAAATGGTAGT
TGAAAGGAGCAGGGGCCCTGGTGTTGCATTTAGCCCTGGGGCATGGAGCTGAACAGTACTTGTGCAGGATTGTTGTGGCTACTAGAGAAC
AAGAGGGAAAGTAGGGCAGAAACTGGATACAGTTCTGAGGCACAGCCAGACTTGCTCAGGGTGGCCCTGCCACAGGCTGCAGCTACCTAG
GAACATTCCTTGCAGACCCCGCATTGCCCTTTGGGGGTGCCCTGGGATCCCTGGGGTAGTCCAGCTCTTCTTCATTTCCCAGCGTGGCCC
TGGTTGGAAGAAGCAGCTGTCACAGCTGCTGTAGACAGCTGTGTTCCTACAATTGGCCCAGCACCCTGGGGCACGGGAGAAGGGTGGGGA
CCGTTGCTGTCACTACTCAGGCTGACTGGGGCCTGGTCAGATTACGTATGCCCTTGGTGGTTTAGAGATAATCCAAAATCAGGGTTTGGT
TTGGGGAAGAAAATCCTCCCCCTTCCTCCCCCGCCCCGTTCCCTACCGCCTCCACTCCTGCCAGCTCATTTCCTTCAATTTCCTTTGACC
TATAGGCTAAAAAAGAAAGGCTCATTCCAGCCACAGGGCAGCCTTCCCTGGGCCTTTGCTTCTCTAGCACAATTATGGGTTACTTCCTTT
TTCTTAACAAAAAAGAATGTTTGATTTCCTCTGGGTGACCTTATTGTCTGTAATTGAAACCCTATTGAGAGGTGATGTCTGTGTTAGCCA
ATGACCCAGGTGAGCTGCTCGGGCTTCTCTTGGTATGTCTTGTTTGGAAAAGTGGATTTCATTCATTTCTGATTGTCCAGTTAAGTGATC
ACCAAAGGACTGAGAATCTGGGAGGGCAAAAAAAAAAAAAAAGTTTTTATGTGCACTTAAATTTGGGGACAATTTTATGTATCTGTGTTA
AGGATATGTTTAAGAACATAATTCTTTTGTTGCTGTTTGTTTAAGAAGCACCTTAGTTTGTTTAAGAAGCACCTTATATAGTATAATATA
TATTTTTTTGAAATTACATTGCTTGTTTATCAGACAATTGAATGTAGTAATTCTGTTCTGGATTTAATTTGACTGGGTTAACATGCAAAA
ACCAAGGAAAAATATTTAGTTTTTTTTTTTTTTTTTGTATACTTTTCAAGCTACCTTGTCATGTATACAGTCATTTATGCCTAAAGCCTG
GTGATTATTCATTTAAATGAAGATCACATTTCATATCAACTTTTGTATCCACAGTAGACAAAATAGCACTAATCCAGATGCCTATTGTTG
GATACTGAATGACAGACAATCTTATGTAGCAAAGATTATGCCTGAAAAGGAAAATTATTCAGGGCAGCTAATTTTGCTTTTACCAAAATA
TCAGTAGTAATATTTTTGGACAGTAGCTAATGGGTCAGTGGGTTCTTTTTAATGTTTATACTTAGATTTTCTTTTAAAAAAATTAAAATA
AAACAAAAAAAAATTTCTAGGACTAGACGATGTAATACCAGCTAAAGCCAAACAATTATACAGTGGAAGGTTTTACATTATTCATCCAAT
GTGTTTCTATTCATGTTAAGATACTACTACATTTGAAGTGGGCAGAGAACATCAGATGATTGAAATGTTCGCCCAGGGGTCTCCAGCAAC
TTTGGAAATCTCTTTGTATTTTTACTTGAAGTGCCACTAATGGACAGCAGATATTTTCTGGCTGATGTTGGTATTGGGTGTAGGAACATG
ATTTAAAAAAAAACTCTTGCCTCTGCTTTCCCCCACTCTGAGGCAAGTTAAAATGTAAAAGATGTGATTTATCTGGGGGGCTCAGGTATG
GTGGGGAAGTGGATTCAGGAATCTGGGGAATGGCAAATATATTAAGAAGAGTATTGAAAGTATTTGGAGGAAAATGGTTAATTCTGGGTG
TGCACCAGGGTTCAGTAGAGTCCACTTCTGCCCTGGAGACCACAAATCAACTAGCTCCATTTACAGCCATTTCTAAAATGGCAGCTTCAG
TTCTAGAGAAGAAAGAACAACATCAGCAGTAAAGTCCATGGAATAGCTAGTGGTCTGTGTTTCTTTTCGCCATTGCCTAGCTTGCCGTAA
TGATTCTATAATGCCATCATGCAGCAATTATGAGAGGCTAGGTCATCCAAAGAGAAGACCCTATCAATGTAGGTTGCAAAATCTAACCCC
TAAGGAAGTGCAGTCTTTGATTTGATTTCCCTAGTAACCTTGCAGATATGTTTAACCAAGCCATAGCCCATGCCTTTTGAGGGCTGAACA
AATAAGGGACTTACTGATAATTTACTTTTGATCACATTAAGGTGTTCTCACCTTGAAATCTTATACACTGAAATGGCCATTGATTTAGGC
CACTGGCTTAGAGTACTCCTTCCCCTGCATGACACTGATTACAAATACTTTCCTATTCATACTTTCCAATTATGAGATGGACTGTGGGTA
CTGGGAGTGATCACTAACACCATAGTAATGTCTAATATTCACAGGCAGATCTGCTTGGGGAAGCTAGTTATGTGAAAGGCAAATAGAGTC
ATACAGTAGCTCAAAAGGCAACCATAATTCTCTTTGGTGCAGGTCTTGGGAGCGTGATCTAGATTACACTGCACCATTCCCAAGTTAATC
CCCTGAAAACTTACTCTCAACTGGAGCAAATGAACTTTGGTCCCAAATATCCATCTTTTCAGTAGCGTTAATTATGCTCTGTTTCCAACT
GCATTTCCTTTCCAATTGAATTAAAGTGTGGCCTCGTTTTTAGTCATTTAAAATTGTTTTCTAAGTAATTGCTGCCTCTATTATGGCACT
TCAATTTTGCACTGTCTTTTGAGATTCAAGAAAAATTTCTATTCTTTTTTTTGCATCCAATTGTGCCTGAACTTTTAAAATATGTAAATG
CTGCCATGTTCCAAACCCATCGTCAGTGTGTGTGTTTAGAGCTGTGCACCCTAGAAACAACATATTGTCCCATGAGCAGGTGCCTGAGAC
ACAGACCCCTTTGCATTCACAGAGAGGTCATTGGTTATAGAGACTTGAATTAATAAGTGACATTATGCCAGTTTCTGTTCTCTCACAGGT
GATAAACAATGCTTTTTGTGCACTACATACTCTTCAGTGTAGAGCTCTTGTTTTATGGGAAAAGGCTCAAATGCCAAATTGTGTTTGATG
GATTAATATGCCCTTTTGCCGATGCATACTATTACTGATGTGACTCGGTTTTGTCGCAGCTTTGCTTTGTTTAATGAAACACACTTGTAA
ACCTCTTTTGCACTTTGAAAAAGAATCCAGCGGGATGCTCGAGCACCTGTAAACAATTTTCTCAACCTATTTGATGTTCAAATAAAGAAT

>6088_6088_3_ARHGAP35-ESR1_ARHGAP35_chr19_47440665_ENST00000404338_ESR1_chr6_152201789_ENST00000440973_length(amino acids)=1656AA_BP=1275
MMMARKQDVRIPTYNISVVGLSGTEKEKGQCGIGKSCLCNRFVRPSADEFHLDHTSVLSTSDFGGRVVNNDHFLYWGEVSRSLEDCVECK
MHIVEQTEFIDDQTFQPHRSTALQPYIKRAAATKLASAEKLMYFCTDQLGLEQDFEQKQMPDGKLLVDGFLLGIDVSRGMNRNFDDQLKF
VSNLYNQLAKTKKPIVVVLTKCDEGVERYIRDAHTFALSKKNLQVVETSARSNVNVDLAFSTLVQLIDKSRGKTKIIPYFEALKQQSQQI
ATAKDKYEWLVSRIVKNHNENWLSVSRKMQASPEYQDYVYLEGTQKAKKLFLQHIHRLKHEHIERRRKLYLAALPLAFEALIPNLDEIDH
LSCIKAKKLLETKPEFLKWFVVLEETPWDATSHIDNMENERIPFDLMDTVPAEQLYEAHLEKLRNERKRVEMRRAFKENLETSPFITPGK
PWEEARSFIMNEDFYQWLEESVYMDIYGKHQKQIIDKAKEEFQELLLEYSELFYELELDAKPSKEKMGVIQDVLGEEQRFKALQKLQAER
DALILKHIHFVYHPTKETCPSCPACVDAKIEHLISSRFIRPSDRNQKNSLSDPNIDRINLVILGKDGLARELANEIRALCTNDDKYVIDG
KMYELSLRPIEGNVRLPVNSFQTPTFQPHGCLCLYNSKESLSYVVESIEKSRESTLGRRDNHLVHLPLTLILVNKRGDTSGETLHSLIQQ
GQQIASKLQCVFLDPASAGIGYGRNINEKQISQVLKGLLDSKRNLNLVSSTASIKDLADVDLRIVMCLMCGDPFSADDILFPVLQSQTCK
SSHCGSNNSVLLELPIGLHKKRIELSVLSYHSSFSIRKSRLVHGYIVFYSAKRKASLAMLRAFLCEVQDIIPIQLVALTDGAVDVLDNDL
SREQLTEGEEIAQEIDGRFTSIPCSQPQHKLEIFHPFFKDVVEKKNIIEATHMYDNAAEACSTTEEVFNSPRAGSPLCNSNLQDSEEDIE
PSYSLFREDTSLPSLSKDHSKLSMELEGNDGLSFIMSNFESKLNNKVPPPVKPKPPVHFEITKGDLSYLDQGHRDGQRKSVSSSPWLPQD
GFDPSDYAEPMDAVVKPRNEEENIYSVPHDSTQGKIITIRNINKAQSNGSGNGSDSEMDTSSLERGRKVSIVSKPVLYRTRCTRLGRFAS
YRTSFSVGSDDELGPIRKKEEDQASQGYKGDNAVIPYETDEDPRRRNILRSLRRNTKKPKPKPRPSITKATWESNYFGVPLTTVVTPEKP
IPIFIERCIEYIEATGHNDYMCPATNQCTIDKNRRKSCQACRLRKCYEVGMMKGGIRKDRRGGRMLKHKRQRDDGEGRGEVGSAGDMRAA
NLWPSPLMIKRSKKNSLALSLTADQMVSALLDAEPPILYSEYDPTRPFSEASMMGLLTNLADRELVHMINWAKRVPGFVDLTLHDQVHLL
ECAWLEILMIGLVWRSMEHPGKLLFAPNLLLDRNQGKCVEGMVEIFDMLLATSSRFRMMNLQGEEFVCLKSIILLNSGVYTFLSSTLKSL
EEKDHIHRVLDKITDTLIHLMAKAGLTLQQQHQRLAQLLLILSHIRHMSNKGMEHLYSMKCKNVVPLYDLLLEMLDAHRLHAPTSRGGAS

--------------------------------------------------------------
>6088_6088_4_ARHGAP35-ESR1_ARHGAP35_chr19_47440665_ENST00000404338_ESR1_chr6_152201789_ENST00000443427_length(transcript)=9279nt_BP=3826nt
ATGATGATGGCAAGAAAGCAAGATGTCCGAATTCCCACCTACAACATCAGTGTGGTGGGATTATCTGGGACCGAGAAGGAAAAGGGCCAG
TGTGGGATTGGAAAGTCTTGTTTGTGCAACCGCTTCGTGCGCCCGAGTGCTGACGAGTTTCACTTGGACCATACCTCCGTCCTCAGCACC
AGTGACTTTGGAGGGCGAGTGGTCAATAATGACCACTTTCTCTACTGGGGAGAAGTTAGCCGCTCCCTGGAGGATTGTGTGGAATGTAAG
ATGCACATTGTGGAGCAGACTGAATTTATTGATGATCAGACTTTTCAACCTCATCGAAGCACGGCCCTGCAGCCCTATATCAAGAGAGCT
GCTGCGACCAAGCTTGCATCAGCTGAAAAACTCATGTACTTTTGCACTGACCAGCTGGGGCTGGAGCAGGACTTTGAGCAGAAACAAATG
CCAGACGGAAAGCTGCTGGTTGATGGTTTTCTTCTTGGTATTGATGTTAGCAGGGGCATGAATAGGAACTTTGATGACCAGCTCAAGTTT
GTCTCCAATCTCTACAATCAGCTTGCAAAAACAAAAAAGCCCATAGTGGTGGTCCTGACTAAGTGTGACGAAGGTGTTGAGCGGTACATT
AGAGATGCACATACTTTTGCCTTAAGCAAAAAGAACCTCCAGGTTGTGGAGACCTCAGCGAGATCCAATGTAAACGTGGACTTGGCTTTC
AGCACCTTAGTGCAACTCATTGATAAAAGTCGGGGAAAGACAAAAATCATTCCTTATTTTGAAGCTCTCAAGCAGCAGAGTCAGCAGATA
GCTACAGCAAAAGACAAGTATGAGTGGCTGGTGAGTCGCATTGTGAAAAACCACAATGAGAACTGGCTGAGTGTCAGCCGAAAGATGCAG
GCCTCTCCAGAATACCAGGACTATGTCTACCTGGAAGGGACTCAGAAAGCCAAGAAGCTGTTTCTACAGCACATCCACCGCCTCAAGCAT
GAGCATATCGAGCGTAGGAGAAAGCTGTACCTGGCAGCCCTGCCATTAGCTTTTGAAGCTCTTATACCTAATCTAGATGAAATAGACCAC
CTAAGCTGCATAAAAGCCAAAAAGCTCTTAGAAACCAAGCCAGAATTCTTGAAGTGGTTTGTTGTGCTTGAAGAGACCCCATGGGATGCC
ACCAGTCACATTGACAACATGGAAAACGAACGGATTCCCTTTGATTTAATGGATACCGTCCCTGCAGAGCAGCTATACGAGGCCCACTTA
GAGAAGCTGAGGAACGAAAGGAAAAGAGTTGAGATGCGAAGGGCGTTTAAAGAAAACCTGGAGACTTCTCCTTTCATAACTCCCGGAAAG
CCTTGGGAAGAGGCCCGTAGTTTTATTATGAATGAGGATTTCTACCAGTGGCTGGAGGAATCTGTATACATGGATATTTATGGCAAACAC
CAAAAGCAAATTATAGATAAAGCAAAGGAAGAATTTCAGGAGTTGCTTTTGGAATATTCAGAATTGTTTTATGAACTGGAGCTGGATGCT
AAGCCCAGCAAGGAGAAGATGGGTGTTATTCAGGATGTTCTGGGAGAGGAACAGCGATTTAAAGCATTACAAAAGCTCCAAGCAGAGCGT
GATGCCCTTATTCTGAAACACATTCATTTTGTGTACCACCCAACAAAGGAGACATGCCCCAGCTGCCCAGCTTGTGTGGACGCTAAGATT
GAGCACTTGATTAGTTCTCGGTTTATCCGGCCGTCTGACCGGAATCAGAAAAATTCACTCTCTGACCCTAACATTGATAGAATCAACTTG
GTTATATTGGGCAAAGACGGCCTTGCCCGAGAGTTGGCCAATGAGATTCGAGCTCTTTGTACAAATGATGACAAGTATGTGATAGATGGT
AAAATGTATGAGCTTTCCCTGAGGCCAATAGAGGGGAATGTCAGGCTTCCTGTGAACTCTTTCCAGACGCCAACATTTCAGCCCCACGGC
TGTCTCTGCCTTTACAATTCAAAGGAATCGCTATCCTATGTAGTGGAAAGTATAGAGAAGAGTAGAGAGTCCACGCTGGGCCGGCGGGAT
AATCATTTAGTCCATCTCCCCCTTACATTAATTTTGGTTAACAAGAGAGGAGACACCAGTGGAGAGACTCTGCATAGCTTAATACAGCAA
GGTCAACAAATTGCTAGCAAACTTCAGTGTGTCTTTCTCGACCCTGCTTCTGCTGGCATTGGTTACGGACGCAACATTAATGAAAAGCAA
ATCAGTCAAGTTTTGAAGGGACTCCTGGACTCTAAGCGTAACTTAAACCTGGTCAGTTCTACTGCTAGCATCAAAGATTTGGCTGATGTT
GATCTGCGAATTGTTATGTGTCTGATGTGTGGAGATCCTTTTAGTGCAGATGACATACTTTTTCCTGTCCTTCAGTCCCAAACCTGTAAA
TCTTCCCATTGTGGAAGCAACAACTCTGTTTTACTTGAACTACCAATCGGACTGCACAAGAAGCGGATTGAACTGTCTGTTCTTTCATAC
CATTCCTCCTTTAGCATCAGAAAGAGCCGGTTGGTTCATGGGTACATTGTTTTTTATTCAGCCAAACGTAAGGCCTCTTTGGCTATGTTA
CGTGCCTTTCTTTGTGAAGTGCAGGATATTATCCCTATTCAGCTTGTAGCACTCACTGATGGCGCTGTAGATGTCCTGGACAATGACTTA
AGTAGGGAACAGCTAACTGAGGGGGAGGAGATTGCTCAAGAAATTGACGGAAGGTTCACAAGCATCCCCTGTAGCCAACCCCAGCATAAA
CTTGAGATCTTTCACCCATTTTTTAAAGATGTGGTGGAAAAAAAGAACATAATCGAGGCTACTCATATGTACGATAATGCTGCCGAGGCC
TGTAGCACCACCGAAGAGGTGTTTAACTCCCCCCGGGCAGGATCACCGCTCTGCAACTCAAACCTGCAGGATTCAGAAGAAGATATCGAG
CCATCTTACAGCCTGTTTCGAGAAGACACATCACTGCCTTCTCTGTCCAAAGACCATTCTAAGCTCTCTATGGAACTGGAGGGAAATGAT
GGGCTGTCTTTCATTATGAGCAATTTTGAGAGTAAACTGAACAACAAAGTACCTCCGCCAGTCAAACCAAAGCCTCCTGTCCATTTTGAA
ATTACAAAGGGGGATCTATCTTATTTAGACCAAGGCCATAGGGATGGACAGAGGAAGTCTGTGTCTTCTAGCCCCTGGCTGCCTCAGGAT
GGGTTTGATCCTTCTGACTATGCTGAACCCATGGATGCTGTGGTGAAGCCAAGGAATGAAGAAGAAAACATATACTCCGTGCCCCATGAC
AGCACCCAAGGCAAAATCATCACCATTCGGAATATCAACAAAGCCCAGTCCAACGGCAGCGGGAATGGTTCTGACAGTGAAATGGACACC
AGCTCTCTAGAGCGAGGGCGCAAGGTTTCCATCGTGAGCAAGCCAGTGCTGTACAGGACGAGATGCACCCGGCTGGGGCGGTTTGCTAGT
TACCGGACCAGCTTCAGCGTGGGGAGTGATGATGAGCTGGGGCCCATCCGGAAGAAAGAGGAGGATCAGGCATCCCAGGGTTATAAAGGG
GACAATGCTGTCATTCCATACGAAACAGACGAAGACCCGCGGAGGAGGAATATTCTTCGCAGCCTAAGGAGGAACACTAAGAAACCAAAG
CCCAAACCCCGGCCATCCATCACAAAGGCAACCTGGGAGAGTAACTATTTTGGGGTGCCCTTAACAACTGTCGTGACTCCAGAGAAGCCG
ATCCCCATTTTTATTGAAAGATGTATTGAGTACATTGAAGCCACAGGACATAACGACTATATGTGTCCAGCCACCAACCAGTGCACCATT
GATAAAAACAGGAGGAAGAGCTGCCAGGCCTGCCGGCTCCGTAAATGCTACGAAGTGGGAATGATGAAAGGTGGGATACGAAAAGACCGA
AGAGGAGGGAGAATGTTGAAACACAAGCGCCAGAGAGATGATGGGGAGGGCAGGGGTGAAGTGGGGTCTGCTGGAGACATGAGAGCTGCC
AACCTTTGGCCAAGCCCGCTCATGATCAAACGCTCTAAGAAGAACAGCCTGGCCTTGTCCCTGACGGCCGACCAGATGGTCAGTGCCTTG
TTGGATGCTGAGCCCCCGATACTCTATTCCGAGTATGATCCTACCAGACCCTTCAGTGAAGCTTCGATGATGGGCTTACTGACCAACCTG
GCAGACAGGGAGCTGGTTCACATGATCAACTGGGCGAAGAGGGTGCCAGGCTTTGTGGATTTGACCCTCCATGATCAGGTCCACCTTCTA
GAATGTGCCTGGCTAGAGATCCTGATGATTGGTCTCGTCTGGCGCTCCATGGAGCACCCAGGGAAGCTACTGTTTGCTCCTAACTTGCTC
TTGGACAGGAACCAGGGAAAATGTGTAGAGGGCATGGTGGAGATCTTCGACATGCTGCTGGCTACATCATCTCGGTTCCGCATGATGAAT
CTGCAGGGAGAGGAGTTTGTGTGCCTCAAATCTATTATTTTGCTTAATTCTGGAGTGTACACATTTCTGTCCAGCACCCTGAAGTCTCTG
GAAGAGAAGGACCATATCCACCGAGTCCTGGACAAGATCACAGACACTTTGATCCACCTGATGGCCAAGGCAGGCCTGACCCTGCAGCAG
CAGCACCAGCGGCTGGCCCAGCTCCTCCTCATCCTCTCCCACATCAGGCACATGAGTAACAAAGGCATGGAGCATCTGTACAGCATGAAG
TGCAAGAACGTGGTGCCCCTCTATGACCTGCTGCTGGAGATGCTGGACGCCCACCGCCTACATGCGCCCACTAGCCGTGGAGGGGCATCC
GTGGAGGAGACGGACCAAAGCCACTTGGCCACTGCGGGCTCTACTTCATCGCATTCCTTGCAAAAGTATTACATCACGGGGGAGGCAGAG
GGTTTCCCTGCCACGGTCTGAGAGCTCCCTGGCTCCCACACGGTTCAGATAATCCCTGCTGCATTTTACCCTCATCATGCACCACTTTAG
CCAAATTCTGTCTCCTGCATACACTCCGGCATGCATCCAACACCAATGGCTTTCTAGATGAGTGGCCATTCATTTGCTTGCTCAGTTCTT
AGTGGCACATCTTCTGTCTTCTGTTGGGAACAGCCAAAGGGATTCCAAGGCTAAATCTTTGTAACAGCTCTCTTTCCCCCTTGCTATGTT
ACTAAGCGTGAGGATTCCCGTAGCTCTTCACAGCTGAACTCAGTCTATGGGTTGGGGCTCAGATAACTCTGTGCATTTAAGCTACTTGTA
GAGACCCAGGCCTGGAGAGTAGACATTTTGCCTCTGATAAGCACTTTTTAAATGGCTCTAAGAATAAGCCACAGCAAAGAATTTAAAGTG
GCTCCTTTAATTGGTGACTTGGAGAAAGCTAGGTCAAGGGTTTATTATAGCACCCTCTTGTATTCCTATGGCAATGCATCCTTTTATGAA
AGTGGTACACCTTAAAGCTTTTATATGACTGTAGCAGAGTATCTGGTGATTGTCAATTCATTCCCCCTATAGGAATACAAGGGGCACACA
GGGAAGGCAGATCCCCTAGTTGGCAAGACTATTTTAACTTGATACACTGCAGATTCAGATGTGCTGAAAGCTCTGCCTCTGGCTTTCCGG
TCATGGGTTCCAGTTAATTCATGCCTCCCATGGACCTATGGAGAGCAGCAAGTTGATCTTAGTTAAGTCTCCCTATATGAGGGATAAGTT
CCTGATTTTTGTTTTTATTTTTGTGTTACAAAAGAAAGCCCTCCCTCCCTGAACTTGCAGTAAGGTCAGCTTCAGGACCTGTTCCAGTGG
GCACTGTACTTGGATCTTCCCGGCGTGTGTGTGCCTTACACAGGGGTGAACTGTTCACTGTGGTGATGCATGATGAGGGTAAATGGTAGT
TGAAAGGAGCAGGGGCCCTGGTGTTGCATTTAGCCCTGGGGCATGGAGCTGAACAGTACTTGTGCAGGATTGTTGTGGCTACTAGAGAAC
AAGAGGGAAAGTAGGGCAGAAACTGGATACAGTTCTGAGGCACAGCCAGACTTGCTCAGGGTGGCCCTGCCACAGGCTGCAGCTACCTAG
GAACATTCCTTGCAGACCCCGCATTGCCCTTTGGGGGTGCCCTGGGATCCCTGGGGTAGTCCAGCTCTTCTTCATTTCCCAGCGTGGCCC
TGGTTGGAAGAAGCAGCTGTCACAGCTGCTGTAGACAGCTGTGTTCCTACAATTGGCCCAGCACCCTGGGGCACGGGAGAAGGGTGGGGA
CCGTTGCTGTCACTACTCAGGCTGACTGGGGCCTGGTCAGATTACGTATGCCCTTGGTGGTTTAGAGATAATCCAAAATCAGGGTTTGGT
TTGGGGAAGAAAATCCTCCCCCTTCCTCCCCCGCCCCGTTCCCTACCGCCTCCACTCCTGCCAGCTCATTTCCTTCAATTTCCTTTGACC
TATAGGCTAAAAAAGAAAGGCTCATTCCAGCCACAGGGCAGCCTTCCCTGGGCCTTTGCTTCTCTAGCACAATTATGGGTTACTTCCTTT
TTCTTAACAAAAAAGAATGTTTGATTTCCTCTGGGTGACCTTATTGTCTGTAATTGAAACCCTATTGAGAGGTGATGTCTGTGTTAGCCA
ATGACCCAGGTGAGCTGCTCGGGCTTCTCTTGGTATGTCTTGTTTGGAAAAGTGGATTTCATTCATTTCTGATTGTCCAGTTAAGTGATC
ACCAAAGGACTGAGAATCTGGGAGGGCAAAAAAAAAAAAAAAGTTTTTATGTGCACTTAAATTTGGGGACAATTTTATGTATCTGTGTTA
AGGATATGTTTAAGAACATAATTCTTTTGTTGCTGTTTGTTTAAGAAGCACCTTAGTTTGTTTAAGAAGCACCTTATATAGTATAATATA
TATTTTTTTGAAATTACATTGCTTGTTTATCAGACAATTGAATGTAGTAATTCTGTTCTGGATTTAATTTGACTGGGTTAACATGCAAAA
ACCAAGGAAAAATATTTAGTTTTTTTTTTTTTTTTTGTATACTTTTCAAGCTACCTTGTCATGTATACAGTCATTTATGCCTAAAGCCTG
GTGATTATTCATTTAAATGAAGATCACATTTCATATCAACTTTTGTATCCACAGTAGACAAAATAGCACTAATCCAGATGCCTATTGTTG
GATACTGAATGACAGACAATCTTATGTAGCAAAGATTATGCCTGAAAAGGAAAATTATTCAGGGCAGCTAATTTTGCTTTTACCAAAATA
TCAGTAGTAATATTTTTGGACAGTAGCTAATGGGTCAGTGGGTTCTTTTTAATGTTTATACTTAGATTTTCTTTTAAAAAAATTAAAATA
AAACAAAAAAAAATTTCTAGGACTAGACGATGTAATACCAGCTAAAGCCAAACAATTATACAGTGGAAGGTTTTACATTATTCATCCAAT
GTGTTTCTATTCATGTTAAGATACTACTACATTTGAAGTGGGCAGAGAACATCAGATGATTGAAATGTTCGCCCAGGGGTCTCCAGCAAC
TTTGGAAATCTCTTTGTATTTTTACTTGAAGTGCCACTAATGGACAGCAGATATTTTCTGGCTGATGTTGGTATTGGGTGTAGGAACATG
ATTTAAAAAAAAACTCTTGCCTCTGCTTTCCCCCACTCTGAGGCAAGTTAAAATGTAAAAGATGTGATTTATCTGGGGGGCTCAGGTATG
GTGGGGAAGTGGATTCAGGAATCTGGGGAATGGCAAATATATTAAGAAGAGTATTGAAAGTATTTGGAGGAAAATGGTTAATTCTGGGTG
TGCACCAGGGTTCAGTAGAGTCCACTTCTGCCCTGGAGACCACAAATCAACTAGCTCCATTTACAGCCATTTCTAAAATGGCAGCTTCAG
TTCTAGAGAAGAAAGAACAACATCAGCAGTAAAGTCCATGGAATAGCTAGTGGTCTGTGTTTCTTTTCGCCATTGCCTAGCTTGCCGTAA
TGATTCTATAATGCCATCATGCAGCAATTATGAGAGGCTAGGTCATCCAAAGAGAAGACCCTATCAATGTAGGTTGCAAAATCTAACCCC
TAAGGAAGTGCAGTCTTTGATTTGATTTCCCTAGTAACCTTGCAGATATGTTTAACCAAGCCATAGCCCATGCCTTTTGAGGGCTGAACA
AATAAGGGACTTACTGATAATTTACTTTTGATCACATTAAGGTGTTCTCACCTTGAAATCTTATACACTGAAATGGCCATTGATTTAGGC
CACTGGCTTAGAGTACTCCTTCCCCTGCATGACACTGATTACAAATACTTTCCTATTCATACTTTCCAATTATGAGATGGACTGTGGGTA
CTGGGAGTGATCACTAACACCATAGTAATGTCTAATATTCACAGGCAGATCTGCTTGGGGAAGCTAGTTATGTGAAAGGCAAATAGAGTC
ATACAGTAGCTCAAAAGGCAACCATAATTCTCTTTGGTGCAGGTCTTGGGAGCGTGATCTAGATTACACTGCACCATTCCCAAGTTAATC
CCCTGAAAACTTACTCTCAACTGGAGCAAATGAACTTTGGTCCCAAATATCCATCTTTTCAGTAGCGTTAATTATGCTCTGTTTCCAACT
GCATTTCCTTTCCAATTGAATTAAAGTGTGGCCTCGTTTTTAGTCATTTAAAATTGTTTTCTAAGTAATTGCTGCCTCTATTATGGCACT
TCAATTTTGCACTGTCTTTTGAGATTCAAGAAAAATTTCTATTCTTTTTTTTGCATCCAATTGTGCCTGAACTTTTAAAATATGTAAATG
CTGCCATGTTCCAAACCCATCGTCAGTGTGTGTGTTTAGAGCTGTGCACCCTAGAAACAACATATTGTCCCATGAGCAGGTGCCTGAGAC
ACAGACCCCTTTGCATTCACAGAGAGGTCATTGGTTATAGAGACTTGAATTAATAAGTGACATTATGCCAGTTTCTGTTCTCTCACAGGT
GATAAACAATGCTTTTTGTGCACTACATACTCTTCAGTGTAGAGCTCTTGTTTTATGGGAAAAGGCTCAAATGCCAAATTGTGTTTGATG
GATTAATATGCCCTTTTGCCGATGCATACTATTACTGATGTGACTCGGTTTTGTCGCAGCTTTGCTTTGTTTAATGAAACACACTTGTAA
ACCTCTTTTGCACTTTGAAAAAGAATCCAGCGGGATGCTCGAGCACCTGTAAACAATTTTCTCAACCTATTTGATGTTCAAATAAAGAAT

>6088_6088_4_ARHGAP35-ESR1_ARHGAP35_chr19_47440665_ENST00000404338_ESR1_chr6_152201789_ENST00000443427_length(amino acids)=1656AA_BP=1275
MMMARKQDVRIPTYNISVVGLSGTEKEKGQCGIGKSCLCNRFVRPSADEFHLDHTSVLSTSDFGGRVVNNDHFLYWGEVSRSLEDCVECK
MHIVEQTEFIDDQTFQPHRSTALQPYIKRAAATKLASAEKLMYFCTDQLGLEQDFEQKQMPDGKLLVDGFLLGIDVSRGMNRNFDDQLKF
VSNLYNQLAKTKKPIVVVLTKCDEGVERYIRDAHTFALSKKNLQVVETSARSNVNVDLAFSTLVQLIDKSRGKTKIIPYFEALKQQSQQI
ATAKDKYEWLVSRIVKNHNENWLSVSRKMQASPEYQDYVYLEGTQKAKKLFLQHIHRLKHEHIERRRKLYLAALPLAFEALIPNLDEIDH
LSCIKAKKLLETKPEFLKWFVVLEETPWDATSHIDNMENERIPFDLMDTVPAEQLYEAHLEKLRNERKRVEMRRAFKENLETSPFITPGK
PWEEARSFIMNEDFYQWLEESVYMDIYGKHQKQIIDKAKEEFQELLLEYSELFYELELDAKPSKEKMGVIQDVLGEEQRFKALQKLQAER
DALILKHIHFVYHPTKETCPSCPACVDAKIEHLISSRFIRPSDRNQKNSLSDPNIDRINLVILGKDGLARELANEIRALCTNDDKYVIDG
KMYELSLRPIEGNVRLPVNSFQTPTFQPHGCLCLYNSKESLSYVVESIEKSRESTLGRRDNHLVHLPLTLILVNKRGDTSGETLHSLIQQ
GQQIASKLQCVFLDPASAGIGYGRNINEKQISQVLKGLLDSKRNLNLVSSTASIKDLADVDLRIVMCLMCGDPFSADDILFPVLQSQTCK
SSHCGSNNSVLLELPIGLHKKRIELSVLSYHSSFSIRKSRLVHGYIVFYSAKRKASLAMLRAFLCEVQDIIPIQLVALTDGAVDVLDNDL
SREQLTEGEEIAQEIDGRFTSIPCSQPQHKLEIFHPFFKDVVEKKNIIEATHMYDNAAEACSTTEEVFNSPRAGSPLCNSNLQDSEEDIE
PSYSLFREDTSLPSLSKDHSKLSMELEGNDGLSFIMSNFESKLNNKVPPPVKPKPPVHFEITKGDLSYLDQGHRDGQRKSVSSSPWLPQD
GFDPSDYAEPMDAVVKPRNEEENIYSVPHDSTQGKIITIRNINKAQSNGSGNGSDSEMDTSSLERGRKVSIVSKPVLYRTRCTRLGRFAS
YRTSFSVGSDDELGPIRKKEEDQASQGYKGDNAVIPYETDEDPRRRNILRSLRRNTKKPKPKPRPSITKATWESNYFGVPLTTVVTPEKP
IPIFIERCIEYIEATGHNDYMCPATNQCTIDKNRRKSCQACRLRKCYEVGMMKGGIRKDRRGGRMLKHKRQRDDGEGRGEVGSAGDMRAA
NLWPSPLMIKRSKKNSLALSLTADQMVSALLDAEPPILYSEYDPTRPFSEASMMGLLTNLADRELVHMINWAKRVPGFVDLTLHDQVHLL
ECAWLEILMIGLVWRSMEHPGKLLFAPNLLLDRNQGKCVEGMVEIFDMLLATSSRFRMMNLQGEEFVCLKSIILLNSGVYTFLSSTLKSL
EEKDHIHRVLDKITDTLIHLMAKAGLTLQQQHQRLAQLLLILSHIRHMSNKGMEHLYSMKCKNVVPLYDLLLEMLDAHRLHAPTSRGGAS

--------------------------------------------------------------
>6088_6088_5_ARHGAP35-ESR1_ARHGAP35_chr19_47440665_ENST00000404338_ESR1_chr6_152201789_ENST00000456483_length(transcript)=8943nt_BP=3826nt
ATGATGATGGCAAGAAAGCAAGATGTCCGAATTCCCACCTACAACATCAGTGTGGTGGGATTATCTGGGACCGAGAAGGAAAAGGGCCAG
TGTGGGATTGGAAAGTCTTGTTTGTGCAACCGCTTCGTGCGCCCGAGTGCTGACGAGTTTCACTTGGACCATACCTCCGTCCTCAGCACC
AGTGACTTTGGAGGGCGAGTGGTCAATAATGACCACTTTCTCTACTGGGGAGAAGTTAGCCGCTCCCTGGAGGATTGTGTGGAATGTAAG
ATGCACATTGTGGAGCAGACTGAATTTATTGATGATCAGACTTTTCAACCTCATCGAAGCACGGCCCTGCAGCCCTATATCAAGAGAGCT
GCTGCGACCAAGCTTGCATCAGCTGAAAAACTCATGTACTTTTGCACTGACCAGCTGGGGCTGGAGCAGGACTTTGAGCAGAAACAAATG
CCAGACGGAAAGCTGCTGGTTGATGGTTTTCTTCTTGGTATTGATGTTAGCAGGGGCATGAATAGGAACTTTGATGACCAGCTCAAGTTT
GTCTCCAATCTCTACAATCAGCTTGCAAAAACAAAAAAGCCCATAGTGGTGGTCCTGACTAAGTGTGACGAAGGTGTTGAGCGGTACATT
AGAGATGCACATACTTTTGCCTTAAGCAAAAAGAACCTCCAGGTTGTGGAGACCTCAGCGAGATCCAATGTAAACGTGGACTTGGCTTTC
AGCACCTTAGTGCAACTCATTGATAAAAGTCGGGGAAAGACAAAAATCATTCCTTATTTTGAAGCTCTCAAGCAGCAGAGTCAGCAGATA
GCTACAGCAAAAGACAAGTATGAGTGGCTGGTGAGTCGCATTGTGAAAAACCACAATGAGAACTGGCTGAGTGTCAGCCGAAAGATGCAG
GCCTCTCCAGAATACCAGGACTATGTCTACCTGGAAGGGACTCAGAAAGCCAAGAAGCTGTTTCTACAGCACATCCACCGCCTCAAGCAT
GAGCATATCGAGCGTAGGAGAAAGCTGTACCTGGCAGCCCTGCCATTAGCTTTTGAAGCTCTTATACCTAATCTAGATGAAATAGACCAC
CTAAGCTGCATAAAAGCCAAAAAGCTCTTAGAAACCAAGCCAGAATTCTTGAAGTGGTTTGTTGTGCTTGAAGAGACCCCATGGGATGCC
ACCAGTCACATTGACAACATGGAAAACGAACGGATTCCCTTTGATTTAATGGATACCGTCCCTGCAGAGCAGCTATACGAGGCCCACTTA
GAGAAGCTGAGGAACGAAAGGAAAAGAGTTGAGATGCGAAGGGCGTTTAAAGAAAACCTGGAGACTTCTCCTTTCATAACTCCCGGAAAG
CCTTGGGAAGAGGCCCGTAGTTTTATTATGAATGAGGATTTCTACCAGTGGCTGGAGGAATCTGTATACATGGATATTTATGGCAAACAC
CAAAAGCAAATTATAGATAAAGCAAAGGAAGAATTTCAGGAGTTGCTTTTGGAATATTCAGAATTGTTTTATGAACTGGAGCTGGATGCT
AAGCCCAGCAAGGAGAAGATGGGTGTTATTCAGGATGTTCTGGGAGAGGAACAGCGATTTAAAGCATTACAAAAGCTCCAAGCAGAGCGT
GATGCCCTTATTCTGAAACACATTCATTTTGTGTACCACCCAACAAAGGAGACATGCCCCAGCTGCCCAGCTTGTGTGGACGCTAAGATT
GAGCACTTGATTAGTTCTCGGTTTATCCGGCCGTCTGACCGGAATCAGAAAAATTCACTCTCTGACCCTAACATTGATAGAATCAACTTG
GTTATATTGGGCAAAGACGGCCTTGCCCGAGAGTTGGCCAATGAGATTCGAGCTCTTTGTACAAATGATGACAAGTATGTGATAGATGGT
AAAATGTATGAGCTTTCCCTGAGGCCAATAGAGGGGAATGTCAGGCTTCCTGTGAACTCTTTCCAGACGCCAACATTTCAGCCCCACGGC
TGTCTCTGCCTTTACAATTCAAAGGAATCGCTATCCTATGTAGTGGAAAGTATAGAGAAGAGTAGAGAGTCCACGCTGGGCCGGCGGGAT
AATCATTTAGTCCATCTCCCCCTTACATTAATTTTGGTTAACAAGAGAGGAGACACCAGTGGAGAGACTCTGCATAGCTTAATACAGCAA
GGTCAACAAATTGCTAGCAAACTTCAGTGTGTCTTTCTCGACCCTGCTTCTGCTGGCATTGGTTACGGACGCAACATTAATGAAAAGCAA
ATCAGTCAAGTTTTGAAGGGACTCCTGGACTCTAAGCGTAACTTAAACCTGGTCAGTTCTACTGCTAGCATCAAAGATTTGGCTGATGTT
GATCTGCGAATTGTTATGTGTCTGATGTGTGGAGATCCTTTTAGTGCAGATGACATACTTTTTCCTGTCCTTCAGTCCCAAACCTGTAAA
TCTTCCCATTGTGGAAGCAACAACTCTGTTTTACTTGAACTACCAATCGGACTGCACAAGAAGCGGATTGAACTGTCTGTTCTTTCATAC
CATTCCTCCTTTAGCATCAGAAAGAGCCGGTTGGTTCATGGGTACATTGTTTTTTATTCAGCCAAACGTAAGGCCTCTTTGGCTATGTTA
CGTGCCTTTCTTTGTGAAGTGCAGGATATTATCCCTATTCAGCTTGTAGCACTCACTGATGGCGCTGTAGATGTCCTGGACAATGACTTA
AGTAGGGAACAGCTAACTGAGGGGGAGGAGATTGCTCAAGAAATTGACGGAAGGTTCACAAGCATCCCCTGTAGCCAACCCCAGCATAAA
CTTGAGATCTTTCACCCATTTTTTAAAGATGTGGTGGAAAAAAAGAACATAATCGAGGCTACTCATATGTACGATAATGCTGCCGAGGCC
TGTAGCACCACCGAAGAGGTGTTTAACTCCCCCCGGGCAGGATCACCGCTCTGCAACTCAAACCTGCAGGATTCAGAAGAAGATATCGAG
CCATCTTACAGCCTGTTTCGAGAAGACACATCACTGCCTTCTCTGTCCAAAGACCATTCTAAGCTCTCTATGGAACTGGAGGGAAATGAT
GGGCTGTCTTTCATTATGAGCAATTTTGAGAGTAAACTGAACAACAAAGTACCTCCGCCAGTCAAACCAAAGCCTCCTGTCCATTTTGAA
ATTACAAAGGGGGATCTATCTTATTTAGACCAAGGCCATAGGGATGGACAGAGGAAGTCTGTGTCTTCTAGCCCCTGGCTGCCTCAGGAT
GGGTTTGATCCTTCTGACTATGCTGAACCCATGGATGCTGTGGTGAAGCCAAGGAATGAAGAAGAAAACATATACTCCGTGCCCCATGAC
AGCACCCAAGGCAAAATCATCACCATTCGGAATATCAACAAAGCCCAGTCCAACGGCAGCGGGAATGGTTCTGACAGTGAAATGGACACC
AGCTCTCTAGAGCGAGGGCGCAAGGTTTCCATCGTGAGCAAGCCAGTGCTGTACAGGACGAGATGCACCCGGCTGGGGCGGTTTGCTAGT
TACCGGACCAGCTTCAGCGTGGGGAGTGATGATGAGCTGGGGCCCATCCGGAAGAAAGAGGAGGATCAGGCATCCCAGGGTTATAAAGGG
GACAATGCTGTCATTCCATACGAAACAGACGAAGACCCGCGGAGGAGGAATATTCTTCGCAGCCTAAGGAGGAACACTAAGAAACCAAAG
CCCAAACCCCGGCCATCCATCACAAAGGCAACCTGGGAGAGTAACTATTTTGGGGTGCCCTTAACAACTGTCGTGACTCCAGAGAAGCCG
ATCCCCATTTTTATTGAAAGATGTATTGAGTACATTGAAGCCACAGGACATAACGACTATATGTGTCCAGCCACCAACCAGTGCACCATT
GATAAAAACAGGAGGAAGAGCTGCCAGGCCTGCCGGCTCCGTAAATGCTACGAAGTGGGAATGATGAAAGGTGGCTTTGTGGATTTGACC
CTCCATGATCAGGTCCACCTTCTAGAATGTGCCTGGCTAGAGATCCTGATGATTGGTCTCGTCTGGCGCTCCATGGAGCACCCAGGGAAG
CTACTGTTTGCTCCTAACTTGCTCTTGGACAGGAACCAGGGAAAATGTGTAGAGGGCATGGTGGAGATCTTCGACATGCTGCTGGCTACA
TCATCTCGGTTCCGCATGATGAATCTGCAGGGAGAGGAGTTTGTGTGCCTCAAATCTATTATTTTGCTTAATTCTGGAGTGTACACATTT
CTGTCCAGCACCCTGAAGTCTCTGGAAGAGAAGGACCATATCCACCGAGTCCTGGACAAGATCACAGACACTTTGATCCACCTGATGGCC
AAGGCAGGCCTGACCCTGCAGCAGCAGCACCAGCGGCTGGCCCAGCTCCTCCTCATCCTCTCCCACATCAGGCACATGAGTAACAAAGGC
ATGGAGCATCTGTACAGCATGAAGTGCAAGAACGTGGTGCCCCTCTATGACCTGCTGCTGGAGATGCTGGACGCCCACCGCCTACATGCG
CCCACTAGCCGTGGAGGGGCATCCGTGGAGGAGACGGACCAAAGCCACTTGGCCACTGCGGGCTCTACTTCATCGCATTCCTTGCAAAAG
TATTACATCACGGGGGAGGCAGAGGGTTTCCCTGCCACGGTCTGAGAGCTCCCTGGCTCCCACACGGTTCAGATAATCCCTGCTGCATTT
TACCCTCATCATGCACCACTTTAGCCAAATTCTGTCTCCTGCATACACTCCGGCATGCATCCAACACCAATGGCTTTCTAGATGAGTGGC
CATTCATTTGCTTGCTCAGTTCTTAGTGGCACATCTTCTGTCTTCTGTTGGGAACAGCCAAAGGGATTCCAAGGCTAAATCTTTGTAACA
GCTCTCTTTCCCCCTTGCTATGTTACTAAGCGTGAGGATTCCCGTAGCTCTTCACAGCTGAACTCAGTCTATGGGTTGGGGCTCAGATAA
CTCTGTGCATTTAAGCTACTTGTAGAGACCCAGGCCTGGAGAGTAGACATTTTGCCTCTGATAAGCACTTTTTAAATGGCTCTAAGAATA
AGCCACAGCAAAGAATTTAAAGTGGCTCCTTTAATTGGTGACTTGGAGAAAGCTAGGTCAAGGGTTTATTATAGCACCCTCTTGTATTCC
TATGGCAATGCATCCTTTTATGAAAGTGGTACACCTTAAAGCTTTTATATGACTGTAGCAGAGTATCTGGTGATTGTCAATTCATTCCCC
CTATAGGAATACAAGGGGCACACAGGGAAGGCAGATCCCCTAGTTGGCAAGACTATTTTAACTTGATACACTGCAGATTCAGATGTGCTG
AAAGCTCTGCCTCTGGCTTTCCGGTCATGGGTTCCAGTTAATTCATGCCTCCCATGGACCTATGGAGAGCAGCAAGTTGATCTTAGTTAA
GTCTCCCTATATGAGGGATAAGTTCCTGATTTTTGTTTTTATTTTTGTGTTACAAAAGAAAGCCCTCCCTCCCTGAACTTGCAGTAAGGT
CAGCTTCAGGACCTGTTCCAGTGGGCACTGTACTTGGATCTTCCCGGCGTGTGTGTGCCTTACACAGGGGTGAACTGTTCACTGTGGTGA
TGCATGATGAGGGTAAATGGTAGTTGAAAGGAGCAGGGGCCCTGGTGTTGCATTTAGCCCTGGGGCATGGAGCTGAACAGTACTTGTGCA
GGATTGTTGTGGCTACTAGAGAACAAGAGGGAAAGTAGGGCAGAAACTGGATACAGTTCTGAGGCACAGCCAGACTTGCTCAGGGTGGCC
CTGCCACAGGCTGCAGCTACCTAGGAACATTCCTTGCAGACCCCGCATTGCCCTTTGGGGGTGCCCTGGGATCCCTGGGGTAGTCCAGCT
CTTCTTCATTTCCCAGCGTGGCCCTGGTTGGAAGAAGCAGCTGTCACAGCTGCTGTAGACAGCTGTGTTCCTACAATTGGCCCAGCACCC
TGGGGCACGGGAGAAGGGTGGGGACCGTTGCTGTCACTACTCAGGCTGACTGGGGCCTGGTCAGATTACGTATGCCCTTGGTGGTTTAGA
GATAATCCAAAATCAGGGTTTGGTTTGGGGAAGAAAATCCTCCCCCTTCCTCCCCCGCCCCGTTCCCTACCGCCTCCACTCCTGCCAGCT
CATTTCCTTCAATTTCCTTTGACCTATAGGCTAAAAAAGAAAGGCTCATTCCAGCCACAGGGCAGCCTTCCCTGGGCCTTTGCTTCTCTA
GCACAATTATGGGTTACTTCCTTTTTCTTAACAAAAAAGAATGTTTGATTTCCTCTGGGTGACCTTATTGTCTGTAATTGAAACCCTATT
GAGAGGTGATGTCTGTGTTAGCCAATGACCCAGGTGAGCTGCTCGGGCTTCTCTTGGTATGTCTTGTTTGGAAAAGTGGATTTCATTCAT
TTCTGATTGTCCAGTTAAGTGATCACCAAAGGACTGAGAATCTGGGAGGGCAAAAAAAAAAAAAAAGTTTTTATGTGCACTTAAATTTGG
GGACAATTTTATGTATCTGTGTTAAGGATATGTTTAAGAACATAATTCTTTTGTTGCTGTTTGTTTAAGAAGCACCTTAGTTTGTTTAAG
AAGCACCTTATATAGTATAATATATATTTTTTTGAAATTACATTGCTTGTTTATCAGACAATTGAATGTAGTAATTCTGTTCTGGATTTA
ATTTGACTGGGTTAACATGCAAAAACCAAGGAAAAATATTTAGTTTTTTTTTTTTTTTTTGTATACTTTTCAAGCTACCTTGTCATGTAT
ACAGTCATTTATGCCTAAAGCCTGGTGATTATTCATTTAAATGAAGATCACATTTCATATCAACTTTTGTATCCACAGTAGACAAAATAG
CACTAATCCAGATGCCTATTGTTGGATACTGAATGACAGACAATCTTATGTAGCAAAGATTATGCCTGAAAAGGAAAATTATTCAGGGCA
GCTAATTTTGCTTTTACCAAAATATCAGTAGTAATATTTTTGGACAGTAGCTAATGGGTCAGTGGGTTCTTTTTAATGTTTATACTTAGA
TTTTCTTTTAAAAAAATTAAAATAAAACAAAAAAAAATTTCTAGGACTAGACGATGTAATACCAGCTAAAGCCAAACAATTATACAGTGG
AAGGTTTTACATTATTCATCCAATGTGTTTCTATTCATGTTAAGATACTACTACATTTGAAGTGGGCAGAGAACATCAGATGATTGAAAT
GTTCGCCCAGGGGTCTCCAGCAACTTTGGAAATCTCTTTGTATTTTTACTTGAAGTGCCACTAATGGACAGCAGATATTTTCTGGCTGAT
GTTGGTATTGGGTGTAGGAACATGATTTAAAAAAAAACTCTTGCCTCTGCTTTCCCCCACTCTGAGGCAAGTTAAAATGTAAAAGATGTG
ATTTATCTGGGGGGCTCAGGTATGGTGGGGAAGTGGATTCAGGAATCTGGGGAATGGCAAATATATTAAGAAGAGTATTGAAAGTATTTG
GAGGAAAATGGTTAATTCTGGGTGTGCACCAGGGTTCAGTAGAGTCCACTTCTGCCCTGGAGACCACAAATCAACTAGCTCCATTTACAG
CCATTTCTAAAATGGCAGCTTCAGTTCTAGAGAAGAAAGAACAACATCAGCAGTAAAGTCCATGGAATAGCTAGTGGTCTGTGTTTCTTT
TCGCCATTGCCTAGCTTGCCGTAATGATTCTATAATGCCATCATGCAGCAATTATGAGAGGCTAGGTCATCCAAAGAGAAGACCCTATCA
ATGTAGGTTGCAAAATCTAACCCCTAAGGAAGTGCAGTCTTTGATTTGATTTCCCTAGTAACCTTGCAGATATGTTTAACCAAGCCATAG
CCCATGCCTTTTGAGGGCTGAACAAATAAGGGACTTACTGATAATTTACTTTTGATCACATTAAGGTGTTCTCACCTTGAAATCTTATAC
ACTGAAATGGCCATTGATTTAGGCCACTGGCTTAGAGTACTCCTTCCCCTGCATGACACTGATTACAAATACTTTCCTATTCATACTTTC
CAATTATGAGATGGACTGTGGGTACTGGGAGTGATCACTAACACCATAGTAATGTCTAATATTCACAGGCAGATCTGCTTGGGGAAGCTA
GTTATGTGAAAGGCAAATAGAGTCATACAGTAGCTCAAAAGGCAACCATAATTCTCTTTGGTGCAGGTCTTGGGAGCGTGATCTAGATTA
CACTGCACCATTCCCAAGTTAATCCCCTGAAAACTTACTCTCAACTGGAGCAAATGAACTTTGGTCCCAAATATCCATCTTTTCAGTAGC
GTTAATTATGCTCTGTTTCCAACTGCATTTCCTTTCCAATTGAATTAAAGTGTGGCCTCGTTTTTAGTCATTTAAAATTGTTTTCTAAGT
AATTGCTGCCTCTATTATGGCACTTCAATTTTGCACTGTCTTTTGAGATTCAAGAAAAATTTCTATTCTTTTTTTTGCATCCAATTGTGC
CTGAACTTTTAAAATATGTAAATGCTGCCATGTTCCAAACCCATCGTCAGTGTGTGTGTTTAGAGCTGTGCACCCTAGAAACAACATATT
GTCCCATGAGCAGGTGCCTGAGACACAGACCCCTTTGCATTCACAGAGAGGTCATTGGTTATAGAGACTTGAATTAATAAGTGACATTAT
GCCAGTTTCTGTTCTCTCACAGGTGATAAACAATGCTTTTTGTGCACTACATACTCTTCAGTGTAGAGCTCTTGTTTTATGGGAAAAGGC
TCAAATGCCAAATTGTGTTTGATGGATTAATATGCCCTTTTGCCGATGCATACTATTACTGATGTGACTCGGTTTTGTCGCAGCTTTGCT
TTGTTTAATGAAACACACTTGTAAACCTCTTTTGCACTTTGAAAAAGAATCCAGCGGGATGCTCGAGCACCTGTAAACAATTTTCTCAAC

>6088_6088_5_ARHGAP35-ESR1_ARHGAP35_chr19_47440665_ENST00000404338_ESR1_chr6_152201789_ENST00000456483_length(amino acids)=1544AA_BP=1275
MMMARKQDVRIPTYNISVVGLSGTEKEKGQCGIGKSCLCNRFVRPSADEFHLDHTSVLSTSDFGGRVVNNDHFLYWGEVSRSLEDCVECK
MHIVEQTEFIDDQTFQPHRSTALQPYIKRAAATKLASAEKLMYFCTDQLGLEQDFEQKQMPDGKLLVDGFLLGIDVSRGMNRNFDDQLKF
VSNLYNQLAKTKKPIVVVLTKCDEGVERYIRDAHTFALSKKNLQVVETSARSNVNVDLAFSTLVQLIDKSRGKTKIIPYFEALKQQSQQI
ATAKDKYEWLVSRIVKNHNENWLSVSRKMQASPEYQDYVYLEGTQKAKKLFLQHIHRLKHEHIERRRKLYLAALPLAFEALIPNLDEIDH
LSCIKAKKLLETKPEFLKWFVVLEETPWDATSHIDNMENERIPFDLMDTVPAEQLYEAHLEKLRNERKRVEMRRAFKENLETSPFITPGK
PWEEARSFIMNEDFYQWLEESVYMDIYGKHQKQIIDKAKEEFQELLLEYSELFYELELDAKPSKEKMGVIQDVLGEEQRFKALQKLQAER
DALILKHIHFVYHPTKETCPSCPACVDAKIEHLISSRFIRPSDRNQKNSLSDPNIDRINLVILGKDGLARELANEIRALCTNDDKYVIDG
KMYELSLRPIEGNVRLPVNSFQTPTFQPHGCLCLYNSKESLSYVVESIEKSRESTLGRRDNHLVHLPLTLILVNKRGDTSGETLHSLIQQ
GQQIASKLQCVFLDPASAGIGYGRNINEKQISQVLKGLLDSKRNLNLVSSTASIKDLADVDLRIVMCLMCGDPFSADDILFPVLQSQTCK
SSHCGSNNSVLLELPIGLHKKRIELSVLSYHSSFSIRKSRLVHGYIVFYSAKRKASLAMLRAFLCEVQDIIPIQLVALTDGAVDVLDNDL
SREQLTEGEEIAQEIDGRFTSIPCSQPQHKLEIFHPFFKDVVEKKNIIEATHMYDNAAEACSTTEEVFNSPRAGSPLCNSNLQDSEEDIE
PSYSLFREDTSLPSLSKDHSKLSMELEGNDGLSFIMSNFESKLNNKVPPPVKPKPPVHFEITKGDLSYLDQGHRDGQRKSVSSSPWLPQD
GFDPSDYAEPMDAVVKPRNEEENIYSVPHDSTQGKIITIRNINKAQSNGSGNGSDSEMDTSSLERGRKVSIVSKPVLYRTRCTRLGRFAS
YRTSFSVGSDDELGPIRKKEEDQASQGYKGDNAVIPYETDEDPRRRNILRSLRRNTKKPKPKPRPSITKATWESNYFGVPLTTVVTPEKP
IPIFIERCIEYIEATGHNDYMCPATNQCTIDKNRRKSCQACRLRKCYEVGMMKGGFVDLTLHDQVHLLECAWLEILMIGLVWRSMEHPGK
LLFAPNLLLDRNQGKCVEGMVEIFDMLLATSSRFRMMNLQGEEFVCLKSIILLNSGVYTFLSSTLKSLEEKDHIHRVLDKITDTLIHLMA
KAGLTLQQQHQRLAQLLLILSHIRHMSNKGMEHLYSMKCKNVVPLYDLLLEMLDAHRLHAPTSRGGASVEETDQSHLATAGSTSSHSLQK

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for ARHGAP35-ESR1


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with
TgeneESR1chr19:47440665chr6:152201789ENST0000020624918262_595214.33333333333334596.0AKAP13
TgeneESR1chr19:47440665chr6:152201789ENST0000033879929262_595214.33333333333334596.0AKAP13
TgeneESR1chr19:47440665chr6:152201789ENST00000440973310262_595214.33333333333334596.0AKAP13
TgeneESR1chr19:47440665chr6:152201789ENST0000044342729262_595214.33333333333334596.0AKAP13
TgeneESR1chr19:47440665chr6:152201789ENST0000045648328262_595214.33333333333334484.0AKAP13


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with
TgeneESR1chr19:47440665chr6:152201789ENST000002062491835_174214.33333333333334596.0DDX5%3B self-association
TgeneESR1chr19:47440665chr6:152201789ENST000003387992935_174214.33333333333334596.0DDX5%3B self-association
TgeneESR1chr19:47440665chr6:152201789ENST0000044097331035_174214.33333333333334596.0DDX5%3B self-association
TgeneESR1chr19:47440665chr6:152201789ENST000004434272935_174214.33333333333334596.0DDX5%3B self-association
TgeneESR1chr19:47440665chr6:152201789ENST000004564832835_174214.33333333333334484.0DDX5%3B self-association


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for ARHGAP35-ESR1


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for ARHGAP35-ESR1


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource