FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:EPHA4-COL4A3 (FusionGDB2 ID:26960)

Fusion Gene Summary for EPHA4-COL4A3

check button Fusion gene summary
Fusion gene informationFusion gene name: EPHA4-COL4A3
Fusion gene ID: 26960
HgeneTgene
Gene symbol

EPHA4

COL4A3

Gene ID

2043

1285

Gene nameEPH receptor A4collagen type IV alpha 3 chain
SynonymsEK8|HEK8|SEK|TYRO1ATS2|ATS3
Cytomap

2q36.1

2q36.3

Type of geneprotein-codingprotein-coding
Descriptionephrin type-A receptor 4EPH-like kinase 8TYRO1 protein tyrosine kinasereceptor protein-tyrosine kinase HEK8tyrosine-protein kinase TYRO1tyrosine-protein kinase receptor SEKcollagen alpha-3(IV) chaincollagen IV, alpha-3 polypeptidecollagen, type IV, alpha 3 (Goodpasture antigen)tumstatin
Modification date2020031320200313
UniProtAcc

P54764

.
Ensembl transtripts involved in fusion geneENST00000281821, ENST00000392071, 
ENST00000409854, ENST00000409938, 
ENST00000469354, 
ENST00000468753, 
ENST00000396578, 
Fusion gene scores* DoF score4 X 3 X 3=363 X 4 X 2=24
# samples 44
** MAII scorelog2(4/36*10)=0.15200309344505
effective Gene in Pan-Cancer Fusion Genes (eGinPCFGs).
DoF>8 and MAII>0
log2(4/24*10)=0.736965594166206
effective Gene in Pan-Cancer Fusion Genes (eGinPCFGs).
DoF>8 and MAII>0
Context

PubMed: EPHA4 [Title/Abstract] AND COL4A3 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointEPHA4(222428451)-COL4A3(228102684), # samples:2
Anticipated loss of major functional domain due to fusion event.EPHA4-COL4A3 seems lost the major protein functional domain in Hgene partner, which is a essential gene due to the frame-shifted ORF.
EPHA4-COL4A3 seems lost the major protein functional domain in Hgene partner, which is a IUPHAR drug target due to the frame-shifted ORF.
EPHA4-COL4A3 seems lost the major protein functional domain in Hgene partner, which is a kinase due to the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneEPHA4

GO:0018108

peptidyl-tyrosine phosphorylation

12775584

HgeneEPHA4

GO:0046777

protein autophosphorylation

12775584

HgeneEPHA4

GO:2001108

positive regulation of Rho guanyl-nucleotide exchange factor activity

12775584

TgeneCOL4A3

GO:0006919

activation of cysteine-type endopeptidase activity involved in apoptotic process

10766752

TgeneCOL4A3

GO:0016525

negative regulation of angiogenesis

10766752|12682293

TgeneCOL4A3

GO:0072577

endothelial cell apoptotic process

10766752

TgeneCOL4A3

GO:1905563

negative regulation of vascular endothelial cell proliferation

10766752


check buttonFusion gene breakpoints across EPHA4 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across COL4A3 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4LUSCTCGA-34-5236-01AEPHA4chr2

222428451

-COL4A3chr2

228102684

+


Top

Fusion Gene ORF analysis for EPHA4-COL4A3

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000281821ENST00000468753EPHA4chr2

222428451

-COL4A3chr2

228102684

+
5CDS-intronENST00000392071ENST00000468753EPHA4chr2

222428451

-COL4A3chr2

228102684

+
5CDS-intronENST00000409854ENST00000468753EPHA4chr2

222428451

-COL4A3chr2

228102684

+
5CDS-intronENST00000409938ENST00000468753EPHA4chr2

222428451

-COL4A3chr2

228102684

+
Frame-shiftENST00000392071ENST00000396578EPHA4chr2

222428451

-COL4A3chr2

228102684

+
Frame-shiftENST00000409938ENST00000396578EPHA4chr2

222428451

-COL4A3chr2

228102684

+
In-frameENST00000281821ENST00000396578EPHA4chr2

222428451

-COL4A3chr2

228102684

+
In-frameENST00000409854ENST00000396578EPHA4chr2

222428451

-COL4A3chr2

228102684

+
intron-3CDSENST00000469354ENST00000396578EPHA4chr2

222428451

-COL4A3chr2

228102684

+
intron-intronENST00000469354ENST00000468753EPHA4chr2

222428451

-COL4A3chr2

228102684

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000281821EPHA4chr2222428451-ENST00000396578COL4A3chr2228102684+871386560157901729
ENST00000409854EPHA4chr2222428451-ENST00000396578COL4A3chr2228102684+870685859457831729

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000281821ENST00000396578EPHA4chr2222428451-COL4A3chr2228102684+0.0012883690.9987116
ENST00000409854ENST00000396578EPHA4chr2222428451-COL4A3chr2228102684+0.0012756970.9987243

Top

Fusion Genomic Features for EPHA4-COL4A3


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
EPHA4chr2222428450-COL4A3chr2228102683+7.43E-050.9999256
EPHA4chr2222428450-COL4A3chr2228102683+7.43E-050.9999256

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.
genomic feature of top 1%

Top

Fusion Protein Features for EPHA4-COL4A3


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr2:222428451/chr2:228102684)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
EPHA4

P54764

.
FUNCTION: Receptor tyrosine kinase which binds membrane-bound ephrin family ligands residing on adjacent cells, leading to contact-dependent bidirectional signaling into neighboring cells. The signaling pathway downstream of the receptor is referred to as forward signaling while the signaling pathway downstream of the ephrin ligand is referred to as reverse signaling. Highly promiscuous, it has the unique property among Eph receptors to bind and to be physiologically activated by both GPI-anchored ephrin-A and transmembrane ephrin-B ligands including EFNA1 and EFNB3. Upon activation by ephrin ligands, modulates cell morphology and integrin-dependent cell adhesion through regulation of the Rac, Rap and Rho GTPases activity. Plays an important role in the development of the nervous system controlling different steps of axonal guidance including the establishment of the corticospinal projections. May also control the segregation of motor and sensory axons during neuromuscular circuit development. In addition to its role in axonal guidance plays a role in synaptic plasticity. Activated by EFNA1 phosphorylates CDK5 at 'Tyr-15' which in turn phosphorylates NGEF regulating RHOA and dendritic spine morphogenesis. In the nervous system, plays also a role in repair after injury preventing axonal regeneration and in angiogenesis playing a role in central nervous system vascular formation. Additionally, its promiscuity makes it available to participate in a variety of cell-cell signaling regulating for instance the development of the thymic epithelium. During development of the cochlear organ of Corti, regulates pillar cell separation by forming a ternary complex with ADAM10 and CADH1 which facilitates the cleavage of CADH1 by ADAM10 and disruption of adherens junctions (By similarity). {ECO:0000250|UniProtKB:Q03137, ECO:0000269|PubMed:17143272}.FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneEPHA4chr2:222428451chr2:228102684ENST00000281821-31830_2092742072.6666666666665DomainEph LBD
HgeneEPHA4chr2:222428451chr2:228102684ENST00000409938-41830_209274987.0DomainEph LBD
TgeneCOL4A3chr2:222428451chr2:228102684ENST000003965780521445_1669291671.0DomainCollagen IV NC1
TgeneCOL4A3chr2:222428451chr2:228102684ENST000003965780521154_1156291671.0MotifCell attachment site
TgeneCOL4A3chr2:222428451chr2:228102684ENST000003965780521306_1308291671.0MotifCell attachment site
TgeneCOL4A3chr2:222428451chr2:228102684ENST000003965780521345_1347291671.0MotifCell attachment site
TgeneCOL4A3chr2:222428451chr2:228102684ENST000003965780521432_1434291671.0MotifCell attachment site
TgeneCOL4A3chr2:222428451chr2:228102684ENST00000396578052791_793291671.0MotifCell attachment site
TgeneCOL4A3chr2:222428451chr2:228102684ENST00000396578052996_998291671.0MotifCell attachment site
TgeneCOL4A3chr2:222428451chr2:228102684ENST000003965780521427_1444291671.0RegionNote=Epitope recognized by Goodpasture antibodies
TgeneCOL4A3chr2:222428451chr2:228102684ENST000003965780521479_1557291671.0RegionNote=Required for the anti-angiogenic activity of tumstatin
TgeneCOL4A3chr2:222428451chr2:228102684ENST000003965780521610_1628291671.0RegionNote=Required for the anti-tumor cell activity of tumstatin
TgeneCOL4A3chr2:222428451chr2:228102684ENST0000039657805229_42291671.0RegionNote=7S domain
TgeneCOL4A3chr2:222428451chr2:228102684ENST0000039657805243_1438291671.0RegionNote=Triple-helical region

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneEPHA4chr2:222428451chr2:228102684ENST00000281821-318191_3252742072.6666666666665Compositional biasNote=Cys-rich
HgeneEPHA4chr2:222428451chr2:228102684ENST00000409938-418191_325274987.0Compositional biasNote=Cys-rich
HgeneEPHA4chr2:222428451chr2:228102684ENST00000281821-318328_4392742072.6666666666665DomainFibronectin type-III 1
HgeneEPHA4chr2:222428451chr2:228102684ENST00000281821-318440_5372742072.6666666666665DomainFibronectin type-III 2
HgeneEPHA4chr2:222428451chr2:228102684ENST00000281821-318621_8822742072.6666666666665DomainProtein kinase
HgeneEPHA4chr2:222428451chr2:228102684ENST00000281821-318911_9752742072.6666666666665DomainSAM
HgeneEPHA4chr2:222428451chr2:228102684ENST00000409938-418328_439274987.0DomainFibronectin type-III 1
HgeneEPHA4chr2:222428451chr2:228102684ENST00000409938-418440_537274987.0DomainFibronectin type-III 2
HgeneEPHA4chr2:222428451chr2:228102684ENST00000409938-418621_882274987.0DomainProtein kinase
HgeneEPHA4chr2:222428451chr2:228102684ENST00000409938-418911_975274987.0DomainSAM
HgeneEPHA4chr2:222428451chr2:228102684ENST00000281821-318984_9862742072.6666666666665MotifPDZ-binding
HgeneEPHA4chr2:222428451chr2:228102684ENST00000409938-418984_986274987.0MotifPDZ-binding
HgeneEPHA4chr2:222428451chr2:228102684ENST00000281821-318627_6352742072.6666666666665Nucleotide bindingATP
HgeneEPHA4chr2:222428451chr2:228102684ENST00000409938-418627_635274987.0Nucleotide bindingATP
HgeneEPHA4chr2:222428451chr2:228102684ENST00000281821-31820_5472742072.6666666666665Topological domainExtracellular
HgeneEPHA4chr2:222428451chr2:228102684ENST00000281821-318570_9862742072.6666666666665Topological domainCytoplasmic
HgeneEPHA4chr2:222428451chr2:228102684ENST00000409938-41820_547274987.0Topological domainExtracellular
HgeneEPHA4chr2:222428451chr2:228102684ENST00000409938-418570_986274987.0Topological domainCytoplasmic
HgeneEPHA4chr2:222428451chr2:228102684ENST00000281821-318548_5692742072.6666666666665TransmembraneHelical
HgeneEPHA4chr2:222428451chr2:228102684ENST00000409938-418548_569274987.0TransmembraneHelical


Top

Fusion Gene Sequence for EPHA4-COL4A3


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>26960_26960_1_EPHA4-COL4A3_EPHA4_chr2_222428451_ENST00000281821_COL4A3_chr2_228102684_ENST00000396578_length(transcript)=8713nt_BP=865nt
CTGGGATAGAAGCGGCAGGAGCAGCGTTGGCACCGGCGAACCATGGCTGGGATTTTCTATTTCGCCCTATTTTCGTGTCTCTTCGGGATT
TGCGACGCTGTCACAGGTTCCAGGGTATACCCCGCGAATGAAGTTACCTTATTGGATTCCAGATCTGTTCAGGGAGAACTTGGGTGGATA
GCAAGCCCTCTGGAAGGAGGGTGGGAGGAAGTGAGTATCATGGATGAAAAAAATACACCAATCCGAACCTACCAAGTGTGCAATGTGATG
GAACCCAGCCAGAATAACTGGCTACGAACTGATTGGATCACCCGAGAAGGGGCTCAGAGGGTGTATATTGAGATTAAATTCACCTTGAGG
GACTGCAATAGTCTTCCGGGCGTCATGGGGACTTGCAAGGAGACGTTTAACCTGTACTACTATGAATCAGACAACGACAAAGAGCGTTTC
ATCAGAGAGAACCAGTTTGTCAAAATTGACACCATTGCTGCTGATGAGAGCTTCACCCAAGTGGACATTGGTGACAGAATCATGAAGCTG
AACACCGAGATCCGGGATGTAGGGCCATTAAGCAAAAAGGGGTTTTACCTGGCTTTTCAGGATGTGGGGGCCTGCATCGCCCTGGTATCA
GTCCGTGTGTTCTATAAAAAGTGTCCACTCACAGTCCGCAATCTGGCCCAGTTTCCTGACACCATCACAGGGGCTGATACGTCTTCCCTG
GTGGAAGTTCGAGGCTCCTGTGTCAACAACTCAGAAGAGAAAGATGTGCCAAAAATGTACTGTGGGGCAGATGGTGAATGGCTGGTACCC
ATTGGCAACTGCCTATGCAACGCTGGGCATGAGGAGCGGAGCGGAGAATGCCAAGGGTTGTGTCTGTAAAGACAAAGGCCAGTGCTTCTG
TGACGGGGCCAAAGGGGAGAAGGGGGAGAAGGGCTTTCCTGGACCCCCCGGTTCTCCTGGCCAGAAAGGATTCACAGGTCCTGAAGGCTT
GCCTGGACCGCAGGGACCCAAGGGCTTTCCAGGACTTCCAGGACTCACGGGTTCCAAAGGTGTAAGGGGAATAAGTGGATTGCCAGGATT
TTCTGGTTCTCCTGGACTTCCAGGCACCCCAGGCAATACCGGGCCTTACGGACTTGTCGGTGTACCAGGATGCAGTGGTTCTAAGGGTGA
GCAGGGGTTTCCAGGACTCCCAGGGACACTGGGCTACCCAGGGATCCCGGGTGCTGCTGGTTTGAAAGGACAAAAGGGTGCTCCTGCTAA
AGAAGAAGATATAGAACTTGATGCAAAAGGCGACCCCGGGTTGCCAGGGGCTCCAGGACCCCAGGGTTTGCCAGGCCCTCCAGGTTTTCC
TGGGCCTGTTGGCCCACCTGGTCCTCCGGGATTCTTTGGCTTTCCAGGAGCCATGGGACCTAGAGGACCTAAGGGTCACATGGGTGAAAG
AGTGATAGGACATAAAGGAGAGCGGGGTGTGAAAGGGTTAACAGGACCCCCGGGACCACCAGGAACAGTTATTGTGACCCTAACTGGCCC
AGATAACAGAACGGACCTCAAGGGGGAAAAGGGAGACAAGGGAGCAATGGGCGAGCCTGGACCTCCTGGACCCTCAGGACTGCCTGGAGA
ATCATATGGATCTGAAAAGGGTGCTCCTGGAGACCCTGGCCTGCAGGGAAAACCCGGAAAAGATGGTGTTCCTGGCTTCCCTGGAAGTGA
GGGAGTCAAGGGCAACAGGGGTTTCCCTGGGTTAATGGGTGAAGATGGCATTAAGGGACAGAAAGGGGACATTGGCCCTCCAGGATTTCG
TGGTCCAACAGAATATTATGACACATACCAGGAAAAGGGAGATGAAGGCACTCCAGGCCCACCAGGGCCCAGAGGAGCTCGTGGCCCACA
AGGTCCCAGTGGTCCCCCCGGAGTTCCTGGAAGTCCTGGATCATCAAGGCCTGGCCTCAGAGGAGCCCCTGGATGGCCAGGCCTGAAAGG
AAGTAAAGGGGAACGAGGCCGCCCAGGAAAGGATGCCATGGGGACTCCTGGGTCCCCAGGTTGTGCTGGTTCACCAGGTCTTCCAGGATC
ACCGGGACCTCCAGGACCGCCAGGTGACATCGTTTTTCGCAAGGGTCCACCTGGAGATCACGGACTGCCAGGCTATCTAGGGTCTCCAGG
AATCCCAGGAGTTGATGGGCCCAAAGGAGAACCAGGCCTCCTGTGTACACAGTGCCCTTATATCCCAGGGCCTCCCGGTCTCCCAGGATT
GCCAGGGTTACATGGTGTAAAAGGAATCCCAGGAAGACAAGGCGCAGCTGGCTTGAAAGGAAGCCCAGGGTCCCCAGGAAATACAGGTCT
TCCAGGATTTCCAGGTTTCCCAGGTGCCCAGGGTGACCCAGGACTTAAAGGAGAAAAAGGTGAAACACTTCAGCCTGAGGGGCAAGTGGG
TGTCCCAGGTGACCCGGGGCTCAGAGGCCAACCTGGGAGAAAGGGCTTGGATGGAATTCCTGGAACTCCGGGAGTGAAAGGATTACCAGG
ACCTAAAGGCGAACTGGCTCTGAGTGGTGAGAAAGGGGACCAAGGTCCTCCAGGGGATCCTGGCTCCCCTGGGTCCCCAGGACCTGCAGG
ACCAGCTGGACCACCTGGCTACGGACCCCAAGGAGAACCTGGTCTCCAGGGCACGCAAGGAGTTCCTGGAGCCCCCGGACCACCCGGAGA
AGCCGGCCCTAGGGGAGAGCTCAGTGTTTCAACACCAGTTCCAGGCCCACCAGGACCTCCAGGGCCCCCTGGCCATCCTGGCCCCCAAGG
TCCACCTGGTATCCCTGGATCCCTGGGGAAATGTGGAGATCCTGGTCTTCCAGGGCCTGATGGTGAACCAGGAATTCCAGGAATTGGATT
TCCTGGGCCTCCTGGACCTAAGGGAGACCAAGGTTTTCCAGGTACAAAAGGATCACTGGGTTGTCCTGGAAAAATGGGAGAGCCTGGGTT
ACCTGGAAAGCCAGGCCTCCCAGGAGCCAAGGGAGAACCAGCAGTAGCCATGCCTGGAGGACCAGGAACACCAGGTTTTCCAGGAGAAAG
AGGCAATTCTGGGGAACATGGAGAAATTGGACTCCCTGGACTTCCAGGTCTCCCTGGAACTCCAGGAAATGAAGGGCTTGATGGACCACG
AGGAGATCCAGGGCAGCCTGGACCACCTGGAGAACAAGGACCCCCAGGAAGGTGCATAGAGGGTCCCAGGGGAGCCCAAGGACTTCCAGG
CTTAAATGGATTGAAAGGGCAACAAGGCAGAAGAGGTAAAACGGGGCCAAAGGGAGACCCAGGAATTCCAGGCTTGGATAGATCAGGATT
TCCTGGAGAAACTGGATCACCAGGAATTCCAGGTCATCAAGGTGAAATGGGACCACTGGGTCAAAGAGGATATCCAGGAAATCCGGGAAT
TTTAGGGCCACCAGGTGAAGATGGAGTGATTGGGATGATGGGCTTTCCTGGAGCCATTGGCCCTCCAGGGCCCCCTGGGAACCCAGGCAC
ACCAGGGCAGAGGGGGAGCCCTGGAATTCCAGGAGTAAAGGGCCAGAGAGGAACCCCAGGAGCCAAGGGGGAACAAGGAGATAAAGGAAA
TCCCGGGCCTTCAGAGATATCCCACGTAATAGGGGACAAAGGAGAACCAGGTCTCAAAGGATTCGCAGGAAATCCAGGTGAGAAAGGAAA
CAGAGGCGTTCCAGGGATGCCAGGTTTAAAGGGCCTCAAAGGACTACCCGGACCAGCAGGACCACCAGGCCCCAGAGGAGATTTGGGCAG
CACTGGGAATCCTGGAGAACCAGGACTGCGTGGTATACCAGGAAGCATGGGGAACATGGGCATGCCAGGTTCTAAAGGAAAAAGGGGAAC
TTTGGGATTCCCAGGTCGAGCAGGAAGACCAGGCCTCCCAGGTATTCATGGTCTCCAGGGAGATAAGGGAGAGCCAGGTTATTCAGAAGG
TACAAGGCCAGGACCACCGGGACCAACGGGGGATCCAGGACTGCCGGGTGATATGGGAAAGAAAGGAGAAATGGGGCAACCTGGCCCACC
TGGACATTTGGGGCCTGCTGGACCTGAGGGAGCCCCTGGAAGTCCTGGAAGTCCTGGCCTCCCAGGAAAGCCAGGTCCTCATGGTGATTT
GGGTTTTAAAGGAATCAAAGGCCTCCTGGGCCCTCCAGGAATCAGAGGCCCTCCAGGTCTTCCAGGATTTCCAGGATCTCCTGGACCAAT
GGGTATAAGAGGTGACCAAGGACGTGATGGAATTCCTGGTCCAGCCGGAGAAAAGGGAGAAACGGGTTTATTGAGGGCCCCTCCAGGCCC
AAGAGGGAACCCTGGTGCTCAAGGAGCCAAAGGAGACAGGGGAGCCCCAGGTTTTCCTGGCCTCCCGGGCAGAAAAGGGGCCATGGGAGA
TGCTGGACCTCGAGGACCCACAGGCATAGAAGGATTCCCAGGGCCACCAGGTCTGCCCGGTGCAATTATCCCTGGCCAGACAGGAAATCG
TGGTCCACCAGGCTCAAGAGGAAGCCCAGGTGCGCCTGGTCCCCCTGGACCTCCAGGGAGTCATGTAATAGGCATAAAAGGAGACAAAGG
GTCTATGGGCCACCCTGGCCCAAAAGGTCCACCTGGAACTGCAGGAGACATGGGACCACCAGGTCGTCTGGGAGCACCAGGTACTCCAGG
TCTTCCAGGACCCAGAGGTGATCCTGGATTCCAGGGGTTTCCAGGCGTGAAAGGAGAAAAGGGTAATCCTGGATTTCTAGGATCCATTGG
ACCTCCAGGACCAATTGGGCCAAAAGGACCACCTGGTGTACGTGGAGACCCTGGCACACTTAAGATTATCTCCCTTCCAGGAAGCCCAGG
GCCACCTGGCACACCTGGAGAACCAGGGATGCAGGGAGAACCTGGGCCACCAGGGCCACCTGGAAACCTAGGACCCTGTGGGCCAAGAGG
TAAGCCAGGCAAGGATGGAAAACCAGGAACTCCTGGACCAGCTGGAGAAAAAGGCAACAAAGGTTCTAAAGGAGAGCCAGGACCAGCTGG
ATCAGATGGATTGCCAGGTTTGAAAGGAAAACGTGGAGACAGTGGATCACCTGCAACCTGGACAACGAGAGGCTTTGTCTTCACCCGACA
CAGTCAAACCACAGCAATTCCTTCATGTCCAGAGGGGACAGTGCCACTCTACAGTGGGTTTTCTTTTCTTTTTGTACAAGGAAATCAACG
AGCCCACGGACAAGACCTTGGAACTCTTGGCAGCTGCCTGCAGCGATTTACCACAATGCCATTCTTATTCTGCAATGTCAATGATGTATG
TAATTTTGCATCTCGAAATGATTATTCATACTGGCTGTCAACACCAGCTCTGATGCCAATGAACATGGCTCCCATTACTGGCAGAGCCCT
TGAGCCTTATATAAGCAGATGCACTGTTTGTGAAGGTCCTGCGATCGCCATAGCCGTTCACAGCCAAACCACTGACATTCCTCCATGTCC
TCACGGCTGGATTTCTCTCTGGAAAGGATTTTCATTCATCATGTTCACAAGTGCAGGTTCTGAGGGCACCGGGCAAGCACTGGCCTCCCC
TGGCTCCTGCCTGGAAGAATTCCGAGCCAGCCCATTTCTAGAATGTCATGGAAGAGGAACGTGCAACTACTATTCAAATTCCTACAGTTT
CTGGCTGGCTTCATTAAACCCAGAAAGAATGTTCAGAAAGCCTATTCCATCAACTGTGAAAGCTGGGGAATTAGAAAAAATAATAAGTCG
CTGTCAGGTGTGCATGAAGAAAAGACACTGAAGCTAAAAAAGACAGCAGAACTGCTATTTTTCATCCTAAAGAACAAAGTAATGACAGAA
CATGCTGTTATTTAGGTATTTTTCTTTAACCAAACAATATTGCTCCATGATGACTTAGTACAAAGTTTCAATTTGTTTCCCCACAAAACA
AAGCAATTCTTTCAAGTCAGTTCTGTGATCTGGGTCTCTAATCTGTGCTGTTTCAAAGTTCTCTGTGGCAAAGCAGCAACTATTCACAAA
ATATCACCAAAAACCTATTCCACTTACATCCAAGGCACTGTCACTACGGTGATTGTATGAAGTTTGAATGCTGCAAGTTATGAAATATTT
GGCCCGCTGGATTCCCACATTTGTCTTCTTTCTGTCTTTAAGACTCAGGGAGGCTAAATCAGTGTTTGATTGCCCCGCCAACCCTTCCTG
AAACTTCAGACCCTGGGTAGGGGAAGAGAAGGGGGCATGTGGTATCCTGGAGCATTGTGTATAGAACTGGATTTTCAGACCTGCTGAGGA
CCGTAAGGCCTGATGGAACACAGAACTGAACTGAGGTTCATGGATTTTCCAGGACTGTTTCAAACATGCCCATTACTAACGGCAAAAGGG
GGATTCCCTGATGGAACCATAATACCCTTGGAAATACTGTATGGTTTTGTTTTGTTTTGTTGGTTTTTAAAGATTTTTGTTTGTTTATTG
AATTCATTTCACTGTAGCTCTAAAATCTGCTTGTATTCCAAGCATATAAAATTTTCCCCCTTAGTGAATTAGTTTTAAAATGATATTGTT
ATATACATACTATGAAATATGTATAACTTTAACTTCTGTTTTACCAGCATACCCACACAAATAACAAGAATACTACTTATGAAATGTGCA
CTTTATCCTCATTCCATAAATGTCGGTGCATACCTTATGTAAGGGAGCAGTTCAATAATCCATGAAAGAACTTAAGGCATTTGTTGGTTT
ATCAGACTCGGAATCTATTTTCTCATTGCTCTGAATATGTCATCACTCTAGGTTTTACAGATTTATTCCTTTGTTACTTCTCTAATTCTT
CCTTTGTAAAAAAAAAAAAAAGCAACACTTTTTATGTTATATGTTGTTCTTACAAACCATACTGAAAGAGTCCATTGTTTAAAAATCTTA
ATGTATCAAACTGTATAACTTGGCCGCTGTATGTCTTAAAACCTGCTTTTCAATGTGTTGATACATTCCCAAGGTTACTTAATTCAACTT
AACTATCATCTTATTCAGCACCAAGCATGTCCCAGGCACTGTACTAACCTACAGAGATGCTAAGAGAAAAAAAAGACTTGTTTCTGATCT
AATATCCCAGAAAAAGTAACTCATTGCTCTGTTAATAATCTCACATATACAAGTAGCTTCCCTCCCCTCTAGTTTTTTCTTCCTTTTCAC
TGCTGTTATATTTCATCATGATAATTCAGCAGGCCCAAGTAAAGGTTAAAAATAAGGTCTATGCCTAGGGAAACTCAGGGCTTCTAGTTT
CTCTTAGAAAAGCTAAGAGAAGATAAGGTCTGAATAATAGCAGAAAAACCAACATCTACAAAACATTAAACTAGTGTTATACTTGATGAT
AACACTATTTGATGAGTCTTAGAGTCCAGACACAAAGAGACAAAGCTTTGAAGATGCTTTTTGATCTACCTAGGTGGAGTTGGTGGTGCT
GATATTTAAATTCAGGCTACTGCTTCAATCTCAATTGCTTTGTAAGTGAAAAACATGACCCAGAGGACAGCACAGACTATGGCCATGGCT
CACATGGTTTACATCCTTCACTGCTCACGTGTTTGCTGTCAAGCCATTTTTACATCTAAACTAAGATGTGCAGCATTTCACTTATTTAGA
TTCACTTAACAAACAAATTTTTCTGCTTTAAAAATGTCTTATTGTCCCAAGTGTACTATAGCGGCATATAGAGCTAGCTAATCTCTACAA
ACCCTCTGTAGGCCAGTAGTTCTCAAAGTGTGGTCTCTGGAAGAGCAGTATCAGCATCATCTGGGAACTTGTCACAGATGCAGATTCTAG
GGACCACTCCAGACCTACACAATCAGAAACTCTTGGGGGAGGGCCCGAAATATCTATGTTTTACCAAGCCCACCACATGATTCTGATGTA
CTCTAAATACTGAGAAAACCTGTTCTAGACAAATACCCAAGCAACAACTCCGCAGGCAGTTACCAAGTACGGCTGGCTACAACTGCTCCA
TCCGTGCCTCTTTTTAAAGTTCAAACTCACAGGTGACTCTAAGGTTATCTACTTTTACTCATAAGTAAAAGCCCTAGACTGGTGCTAATG
TCAAACCACTGGCCTCCACTCAGGCCTCCATCTTCTCATGCCCTCTTACCAGTATTTAACTTCTGAGGAAGACAAGTGATGCTAAAACCT
GAAATTCCAATGAAGCCATATGAACAGCTGTTCAGTTGCACTTCTAAGACTTTACTTAGCAGTAAATTATAGCTCATGTGCATTATTTTC
CAGATAACTTAGCTTATGAGTAGCTTATACAATTATGAAGATTTAATATTACAGATAAAATGTAAACTGTTTCTTTAAAATTGGGGCTTC
AACTTTGGAATTTCACAGCGTGCTAAAATAACAGATTTCTCAGAAGTCTTTCAGCAAGATAAACATTATTAAGTAACTTATTTATGAAAG
TATTAAAATGCTTACATTTGAACTTGATGGCTAACTTACAAAGATTCTCTATGTATCAAATGTAACTTACTGCGACTAAACTTAATTTAA
TATTTACTCTATAACCAAATGAAATATATTTAAAATATATTGAATATTTTATATTGTTATATCCTGACAAGATTATAATATTTTAATGTA

>26960_26960_1_EPHA4-COL4A3_EPHA4_chr2_222428451_ENST00000281821_COL4A3_chr2_228102684_ENST00000396578_length(amino acids)=1729AA_BP=1161
MWGPASPWYQSVCSIKSVHSQSAIWPSFLTPSQGLIRLPWWKFEAPVSTTQKRKMCQKCTVGQMVNGWYPLATAYATLGMRSGAENAKGC
VCKDKGQCFCDGAKGEKGEKGFPGPPGSPGQKGFTGPEGLPGPQGPKGFPGLPGLTGSKGVRGISGLPGFSGSPGLPGTPGNTGPYGLVG
VPGCSGSKGEQGFPGLPGTLGYPGIPGAAGLKGQKGAPAKEEDIELDAKGDPGLPGAPGPQGLPGPPGFPGPVGPPGPPGFFGFPGAMGP
RGPKGHMGERVIGHKGERGVKGLTGPPGPPGTVIVTLTGPDNRTDLKGEKGDKGAMGEPGPPGPSGLPGESYGSEKGAPGDPGLQGKPGK
DGVPGFPGSEGVKGNRGFPGLMGEDGIKGQKGDIGPPGFRGPTEYYDTYQEKGDEGTPGPPGPRGARGPQGPSGPPGVPGSPGSSRPGLR
GAPGWPGLKGSKGERGRPGKDAMGTPGSPGCAGSPGLPGSPGPPGPPGDIVFRKGPPGDHGLPGYLGSPGIPGVDGPKGEPGLLCTQCPY
IPGPPGLPGLPGLHGVKGIPGRQGAAGLKGSPGSPGNTGLPGFPGFPGAQGDPGLKGEKGETLQPEGQVGVPGDPGLRGQPGRKGLDGIP
GTPGVKGLPGPKGELALSGEKGDQGPPGDPGSPGSPGPAGPAGPPGYGPQGEPGLQGTQGVPGAPGPPGEAGPRGELSVSTPVPGPPGPP
GPPGHPGPQGPPGIPGSLGKCGDPGLPGPDGEPGIPGIGFPGPPGPKGDQGFPGTKGSLGCPGKMGEPGLPGKPGLPGAKGEPAVAMPGG
PGTPGFPGERGNSGEHGEIGLPGLPGLPGTPGNEGLDGPRGDPGQPGPPGEQGPPGRCIEGPRGAQGLPGLNGLKGQQGRRGKTGPKGDP
GIPGLDRSGFPGETGSPGIPGHQGEMGPLGQRGYPGNPGILGPPGEDGVIGMMGFPGAIGPPGPPGNPGTPGQRGSPGIPGVKGQRGTPG
AKGEQGDKGNPGPSEISHVIGDKGEPGLKGFAGNPGEKGNRGVPGMPGLKGLKGLPGPAGPPGPRGDLGSTGNPGEPGLRGIPGSMGNMG
MPGSKGKRGTLGFPGRAGRPGLPGIHGLQGDKGEPGYSEGTRPGPPGPTGDPGLPGDMGKKGEMGQPGPPGHLGPAGPEGAPGSPGSPGL
PGKPGPHGDLGFKGIKGLLGPPGIRGPPGLPGFPGSPGPMGIRGDQGRDGIPGPAGEKGETGLLRAPPGPRGNPGAQGAKGDRGAPGFPG
LPGRKGAMGDAGPRGPTGIEGFPGPPGLPGAIIPGQTGNRGPPGSRGSPGAPGPPGPPGSHVIGIKGDKGSMGHPGPKGPPGTAGDMGPP
GRLGAPGTPGLPGPRGDPGFQGFPGVKGEKGNPGFLGSIGPPGPIGPKGPPGVRGDPGTLKIISLPGSPGPPGTPGEPGMQGEPGPPGPP
GNLGPCGPRGKPGKDGKPGTPGPAGEKGNKGSKGEPGPAGSDGLPGLKGKRGDSGSPATWTTRGFVFTRHSQTTAIPSCPEGTVPLYSGF
SFLFVQGNQRAHGQDLGTLGSCLQRFTTMPFLFCNVNDVCNFASRNDYSYWLSTPALMPMNMAPITGRALEPYISRCTVCEGPAIAIAVH
SQTTDIPPCPHGWISLWKGFSFIMFTSAGSEGTGQALASPGSCLEEFRASPFLECHGRGTCNYYSNSYSFWLASLNPERMFRKPIPSTVK

--------------------------------------------------------------
>26960_26960_2_EPHA4-COL4A3_EPHA4_chr2_222428451_ENST00000409854_COL4A3_chr2_228102684_ENST00000396578_length(transcript)=8706nt_BP=858nt
AGAAGCGGCAGGAGCAGCGTTGGCACCGGCGAACCATGGCTGGGATTTTCTATTTCGCCCTATTTTCGTGTCTCTTCGGGATTTGCGACG
CTGTCACAGGTTCCAGGGTATACCCCGCGAATGAAGTTACCTTATTGGATTCCAGATCTGTTCAGGGAGAACTTGGGTGGATAGCAAGCC
CTCTGGAAGGAGGGTGGGAGGAAGTGAGTATCATGGATGAAAAAAATACACCAATCCGAACCTACCAAGTGTGCAATGTGATGGAACCCA
GCCAGAATAACTGGCTACGAACTGATTGGATCACCCGAGAAGGGGCTCAGAGGGTGTATATTGAGATTAAATTCACCTTGAGGGACTGCA
ATAGTCTTCCGGGCGTCATGGGGACTTGCAAGGAGACGTTTAACCTGTACTACTATGAATCAGACAACGACAAAGAGCGTTTCATCAGAG
AGAACCAGTTTGTCAAAATTGACACCATTGCTGCTGATGAGAGCTTCACCCAAGTGGACATTGGTGACAGAATCATGAAGCTGAACACCG
AGATCCGGGATGTAGGGCCATTAAGCAAAAAGGGGTTTTACCTGGCTTTTCAGGATGTGGGGGCCTGCATCGCCCTGGTATCAGTCCGTG
TGTTCTATAAAAAGTGTCCACTCACAGTCCGCAATCTGGCCCAGTTTCCTGACACCATCACAGGGGCTGATACGTCTTCCCTGGTGGAAG
TTCGAGGCTCCTGTGTCAACAACTCAGAAGAGAAAGATGTGCCAAAAATGTACTGTGGGGCAGATGGTGAATGGCTGGTACCCATTGGCA
ACTGCCTATGCAACGCTGGGCATGAGGAGCGGAGCGGAGAATGCCAAGGGTTGTGTCTGTAAAGACAAAGGCCAGTGCTTCTGTGACGGG
GCCAAAGGGGAGAAGGGGGAGAAGGGCTTTCCTGGACCCCCCGGTTCTCCTGGCCAGAAAGGATTCACAGGTCCTGAAGGCTTGCCTGGA
CCGCAGGGACCCAAGGGCTTTCCAGGACTTCCAGGACTCACGGGTTCCAAAGGTGTAAGGGGAATAAGTGGATTGCCAGGATTTTCTGGT
TCTCCTGGACTTCCAGGCACCCCAGGCAATACCGGGCCTTACGGACTTGTCGGTGTACCAGGATGCAGTGGTTCTAAGGGTGAGCAGGGG
TTTCCAGGACTCCCAGGGACACTGGGCTACCCAGGGATCCCGGGTGCTGCTGGTTTGAAAGGACAAAAGGGTGCTCCTGCTAAAGAAGAA
GATATAGAACTTGATGCAAAAGGCGACCCCGGGTTGCCAGGGGCTCCAGGACCCCAGGGTTTGCCAGGCCCTCCAGGTTTTCCTGGGCCT
GTTGGCCCACCTGGTCCTCCGGGATTCTTTGGCTTTCCAGGAGCCATGGGACCTAGAGGACCTAAGGGTCACATGGGTGAAAGAGTGATA
GGACATAAAGGAGAGCGGGGTGTGAAAGGGTTAACAGGACCCCCGGGACCACCAGGAACAGTTATTGTGACCCTAACTGGCCCAGATAAC
AGAACGGACCTCAAGGGGGAAAAGGGAGACAAGGGAGCAATGGGCGAGCCTGGACCTCCTGGACCCTCAGGACTGCCTGGAGAATCATAT
GGATCTGAAAAGGGTGCTCCTGGAGACCCTGGCCTGCAGGGAAAACCCGGAAAAGATGGTGTTCCTGGCTTCCCTGGAAGTGAGGGAGTC
AAGGGCAACAGGGGTTTCCCTGGGTTAATGGGTGAAGATGGCATTAAGGGACAGAAAGGGGACATTGGCCCTCCAGGATTTCGTGGTCCA
ACAGAATATTATGACACATACCAGGAAAAGGGAGATGAAGGCACTCCAGGCCCACCAGGGCCCAGAGGAGCTCGTGGCCCACAAGGTCCC
AGTGGTCCCCCCGGAGTTCCTGGAAGTCCTGGATCATCAAGGCCTGGCCTCAGAGGAGCCCCTGGATGGCCAGGCCTGAAAGGAAGTAAA
GGGGAACGAGGCCGCCCAGGAAAGGATGCCATGGGGACTCCTGGGTCCCCAGGTTGTGCTGGTTCACCAGGTCTTCCAGGATCACCGGGA
CCTCCAGGACCGCCAGGTGACATCGTTTTTCGCAAGGGTCCACCTGGAGATCACGGACTGCCAGGCTATCTAGGGTCTCCAGGAATCCCA
GGAGTTGATGGGCCCAAAGGAGAACCAGGCCTCCTGTGTACACAGTGCCCTTATATCCCAGGGCCTCCCGGTCTCCCAGGATTGCCAGGG
TTACATGGTGTAAAAGGAATCCCAGGAAGACAAGGCGCAGCTGGCTTGAAAGGAAGCCCAGGGTCCCCAGGAAATACAGGTCTTCCAGGA
TTTCCAGGTTTCCCAGGTGCCCAGGGTGACCCAGGACTTAAAGGAGAAAAAGGTGAAACACTTCAGCCTGAGGGGCAAGTGGGTGTCCCA
GGTGACCCGGGGCTCAGAGGCCAACCTGGGAGAAAGGGCTTGGATGGAATTCCTGGAACTCCGGGAGTGAAAGGATTACCAGGACCTAAA
GGCGAACTGGCTCTGAGTGGTGAGAAAGGGGACCAAGGTCCTCCAGGGGATCCTGGCTCCCCTGGGTCCCCAGGACCTGCAGGACCAGCT
GGACCACCTGGCTACGGACCCCAAGGAGAACCTGGTCTCCAGGGCACGCAAGGAGTTCCTGGAGCCCCCGGACCACCCGGAGAAGCCGGC
CCTAGGGGAGAGCTCAGTGTTTCAACACCAGTTCCAGGCCCACCAGGACCTCCAGGGCCCCCTGGCCATCCTGGCCCCCAAGGTCCACCT
GGTATCCCTGGATCCCTGGGGAAATGTGGAGATCCTGGTCTTCCAGGGCCTGATGGTGAACCAGGAATTCCAGGAATTGGATTTCCTGGG
CCTCCTGGACCTAAGGGAGACCAAGGTTTTCCAGGTACAAAAGGATCACTGGGTTGTCCTGGAAAAATGGGAGAGCCTGGGTTACCTGGA
AAGCCAGGCCTCCCAGGAGCCAAGGGAGAACCAGCAGTAGCCATGCCTGGAGGACCAGGAACACCAGGTTTTCCAGGAGAAAGAGGCAAT
TCTGGGGAACATGGAGAAATTGGACTCCCTGGACTTCCAGGTCTCCCTGGAACTCCAGGAAATGAAGGGCTTGATGGACCACGAGGAGAT
CCAGGGCAGCCTGGACCACCTGGAGAACAAGGACCCCCAGGAAGGTGCATAGAGGGTCCCAGGGGAGCCCAAGGACTTCCAGGCTTAAAT
GGATTGAAAGGGCAACAAGGCAGAAGAGGTAAAACGGGGCCAAAGGGAGACCCAGGAATTCCAGGCTTGGATAGATCAGGATTTCCTGGA
GAAACTGGATCACCAGGAATTCCAGGTCATCAAGGTGAAATGGGACCACTGGGTCAAAGAGGATATCCAGGAAATCCGGGAATTTTAGGG
CCACCAGGTGAAGATGGAGTGATTGGGATGATGGGCTTTCCTGGAGCCATTGGCCCTCCAGGGCCCCCTGGGAACCCAGGCACACCAGGG
CAGAGGGGGAGCCCTGGAATTCCAGGAGTAAAGGGCCAGAGAGGAACCCCAGGAGCCAAGGGGGAACAAGGAGATAAAGGAAATCCCGGG
CCTTCAGAGATATCCCACGTAATAGGGGACAAAGGAGAACCAGGTCTCAAAGGATTCGCAGGAAATCCAGGTGAGAAAGGAAACAGAGGC
GTTCCAGGGATGCCAGGTTTAAAGGGCCTCAAAGGACTACCCGGACCAGCAGGACCACCAGGCCCCAGAGGAGATTTGGGCAGCACTGGG
AATCCTGGAGAACCAGGACTGCGTGGTATACCAGGAAGCATGGGGAACATGGGCATGCCAGGTTCTAAAGGAAAAAGGGGAACTTTGGGA
TTCCCAGGTCGAGCAGGAAGACCAGGCCTCCCAGGTATTCATGGTCTCCAGGGAGATAAGGGAGAGCCAGGTTATTCAGAAGGTACAAGG
CCAGGACCACCGGGACCAACGGGGGATCCAGGACTGCCGGGTGATATGGGAAAGAAAGGAGAAATGGGGCAACCTGGCCCACCTGGACAT
TTGGGGCCTGCTGGACCTGAGGGAGCCCCTGGAAGTCCTGGAAGTCCTGGCCTCCCAGGAAAGCCAGGTCCTCATGGTGATTTGGGTTTT
AAAGGAATCAAAGGCCTCCTGGGCCCTCCAGGAATCAGAGGCCCTCCAGGTCTTCCAGGATTTCCAGGATCTCCTGGACCAATGGGTATA
AGAGGTGACCAAGGACGTGATGGAATTCCTGGTCCAGCCGGAGAAAAGGGAGAAACGGGTTTATTGAGGGCCCCTCCAGGCCCAAGAGGG
AACCCTGGTGCTCAAGGAGCCAAAGGAGACAGGGGAGCCCCAGGTTTTCCTGGCCTCCCGGGCAGAAAAGGGGCCATGGGAGATGCTGGA
CCTCGAGGACCCACAGGCATAGAAGGATTCCCAGGGCCACCAGGTCTGCCCGGTGCAATTATCCCTGGCCAGACAGGAAATCGTGGTCCA
CCAGGCTCAAGAGGAAGCCCAGGTGCGCCTGGTCCCCCTGGACCTCCAGGGAGTCATGTAATAGGCATAAAAGGAGACAAAGGGTCTATG
GGCCACCCTGGCCCAAAAGGTCCACCTGGAACTGCAGGAGACATGGGACCACCAGGTCGTCTGGGAGCACCAGGTACTCCAGGTCTTCCA
GGACCCAGAGGTGATCCTGGATTCCAGGGGTTTCCAGGCGTGAAAGGAGAAAAGGGTAATCCTGGATTTCTAGGATCCATTGGACCTCCA
GGACCAATTGGGCCAAAAGGACCACCTGGTGTACGTGGAGACCCTGGCACACTTAAGATTATCTCCCTTCCAGGAAGCCCAGGGCCACCT
GGCACACCTGGAGAACCAGGGATGCAGGGAGAACCTGGGCCACCAGGGCCACCTGGAAACCTAGGACCCTGTGGGCCAAGAGGTAAGCCA
GGCAAGGATGGAAAACCAGGAACTCCTGGACCAGCTGGAGAAAAAGGCAACAAAGGTTCTAAAGGAGAGCCAGGACCAGCTGGATCAGAT
GGATTGCCAGGTTTGAAAGGAAAACGTGGAGACAGTGGATCACCTGCAACCTGGACAACGAGAGGCTTTGTCTTCACCCGACACAGTCAA
ACCACAGCAATTCCTTCATGTCCAGAGGGGACAGTGCCACTCTACAGTGGGTTTTCTTTTCTTTTTGTACAAGGAAATCAACGAGCCCAC
GGACAAGACCTTGGAACTCTTGGCAGCTGCCTGCAGCGATTTACCACAATGCCATTCTTATTCTGCAATGTCAATGATGTATGTAATTTT
GCATCTCGAAATGATTATTCATACTGGCTGTCAACACCAGCTCTGATGCCAATGAACATGGCTCCCATTACTGGCAGAGCCCTTGAGCCT
TATATAAGCAGATGCACTGTTTGTGAAGGTCCTGCGATCGCCATAGCCGTTCACAGCCAAACCACTGACATTCCTCCATGTCCTCACGGC
TGGATTTCTCTCTGGAAAGGATTTTCATTCATCATGTTCACAAGTGCAGGTTCTGAGGGCACCGGGCAAGCACTGGCCTCCCCTGGCTCC
TGCCTGGAAGAATTCCGAGCCAGCCCATTTCTAGAATGTCATGGAAGAGGAACGTGCAACTACTATTCAAATTCCTACAGTTTCTGGCTG
GCTTCATTAAACCCAGAAAGAATGTTCAGAAAGCCTATTCCATCAACTGTGAAAGCTGGGGAATTAGAAAAAATAATAAGTCGCTGTCAG
GTGTGCATGAAGAAAAGACACTGAAGCTAAAAAAGACAGCAGAACTGCTATTTTTCATCCTAAAGAACAAAGTAATGACAGAACATGCTG
TTATTTAGGTATTTTTCTTTAACCAAACAATATTGCTCCATGATGACTTAGTACAAAGTTTCAATTTGTTTCCCCACAAAACAAAGCAAT
TCTTTCAAGTCAGTTCTGTGATCTGGGTCTCTAATCTGTGCTGTTTCAAAGTTCTCTGTGGCAAAGCAGCAACTATTCACAAAATATCAC
CAAAAACCTATTCCACTTACATCCAAGGCACTGTCACTACGGTGATTGTATGAAGTTTGAATGCTGCAAGTTATGAAATATTTGGCCCGC
TGGATTCCCACATTTGTCTTCTTTCTGTCTTTAAGACTCAGGGAGGCTAAATCAGTGTTTGATTGCCCCGCCAACCCTTCCTGAAACTTC
AGACCCTGGGTAGGGGAAGAGAAGGGGGCATGTGGTATCCTGGAGCATTGTGTATAGAACTGGATTTTCAGACCTGCTGAGGACCGTAAG
GCCTGATGGAACACAGAACTGAACTGAGGTTCATGGATTTTCCAGGACTGTTTCAAACATGCCCATTACTAACGGCAAAAGGGGGATTCC
CTGATGGAACCATAATACCCTTGGAAATACTGTATGGTTTTGTTTTGTTTTGTTGGTTTTTAAAGATTTTTGTTTGTTTATTGAATTCAT
TTCACTGTAGCTCTAAAATCTGCTTGTATTCCAAGCATATAAAATTTTCCCCCTTAGTGAATTAGTTTTAAAATGATATTGTTATATACA
TACTATGAAATATGTATAACTTTAACTTCTGTTTTACCAGCATACCCACACAAATAACAAGAATACTACTTATGAAATGTGCACTTTATC
CTCATTCCATAAATGTCGGTGCATACCTTATGTAAGGGAGCAGTTCAATAATCCATGAAAGAACTTAAGGCATTTGTTGGTTTATCAGAC
TCGGAATCTATTTTCTCATTGCTCTGAATATGTCATCACTCTAGGTTTTACAGATTTATTCCTTTGTTACTTCTCTAATTCTTCCTTTGT
AAAAAAAAAAAAAAGCAACACTTTTTATGTTATATGTTGTTCTTACAAACCATACTGAAAGAGTCCATTGTTTAAAAATCTTAATGTATC
AAACTGTATAACTTGGCCGCTGTATGTCTTAAAACCTGCTTTTCAATGTGTTGATACATTCCCAAGGTTACTTAATTCAACTTAACTATC
ATCTTATTCAGCACCAAGCATGTCCCAGGCACTGTACTAACCTACAGAGATGCTAAGAGAAAAAAAAGACTTGTTTCTGATCTAATATCC
CAGAAAAAGTAACTCATTGCTCTGTTAATAATCTCACATATACAAGTAGCTTCCCTCCCCTCTAGTTTTTTCTTCCTTTTCACTGCTGTT
ATATTTCATCATGATAATTCAGCAGGCCCAAGTAAAGGTTAAAAATAAGGTCTATGCCTAGGGAAACTCAGGGCTTCTAGTTTCTCTTAG
AAAAGCTAAGAGAAGATAAGGTCTGAATAATAGCAGAAAAACCAACATCTACAAAACATTAAACTAGTGTTATACTTGATGATAACACTA
TTTGATGAGTCTTAGAGTCCAGACACAAAGAGACAAAGCTTTGAAGATGCTTTTTGATCTACCTAGGTGGAGTTGGTGGTGCTGATATTT
AAATTCAGGCTACTGCTTCAATCTCAATTGCTTTGTAAGTGAAAAACATGACCCAGAGGACAGCACAGACTATGGCCATGGCTCACATGG
TTTACATCCTTCACTGCTCACGTGTTTGCTGTCAAGCCATTTTTACATCTAAACTAAGATGTGCAGCATTTCACTTATTTAGATTCACTT
AACAAACAAATTTTTCTGCTTTAAAAATGTCTTATTGTCCCAAGTGTACTATAGCGGCATATAGAGCTAGCTAATCTCTACAAACCCTCT
GTAGGCCAGTAGTTCTCAAAGTGTGGTCTCTGGAAGAGCAGTATCAGCATCATCTGGGAACTTGTCACAGATGCAGATTCTAGGGACCAC
TCCAGACCTACACAATCAGAAACTCTTGGGGGAGGGCCCGAAATATCTATGTTTTACCAAGCCCACCACATGATTCTGATGTACTCTAAA
TACTGAGAAAACCTGTTCTAGACAAATACCCAAGCAACAACTCCGCAGGCAGTTACCAAGTACGGCTGGCTACAACTGCTCCATCCGTGC
CTCTTTTTAAAGTTCAAACTCACAGGTGACTCTAAGGTTATCTACTTTTACTCATAAGTAAAAGCCCTAGACTGGTGCTAATGTCAAACC
ACTGGCCTCCACTCAGGCCTCCATCTTCTCATGCCCTCTTACCAGTATTTAACTTCTGAGGAAGACAAGTGATGCTAAAACCTGAAATTC
CAATGAAGCCATATGAACAGCTGTTCAGTTGCACTTCTAAGACTTTACTTAGCAGTAAATTATAGCTCATGTGCATTATTTTCCAGATAA
CTTAGCTTATGAGTAGCTTATACAATTATGAAGATTTAATATTACAGATAAAATGTAAACTGTTTCTTTAAAATTGGGGCTTCAACTTTG
GAATTTCACAGCGTGCTAAAATAACAGATTTCTCAGAAGTCTTTCAGCAAGATAAACATTATTAAGTAACTTATTTATGAAAGTATTAAA
ATGCTTACATTTGAACTTGATGGCTAACTTACAAAGATTCTCTATGTATCAAATGTAACTTACTGCGACTAAACTTAATTTAATATTTAC
TCTATAACCAAATGAAATATATTTAAAATATATTGAATATTTTATATTGTTATATCCTGACAAGATTATAATATTTTAATGTACTAATAT

>26960_26960_2_EPHA4-COL4A3_EPHA4_chr2_222428451_ENST00000409854_COL4A3_chr2_228102684_ENST00000396578_length(amino acids)=1729AA_BP=1161
MWGPASPWYQSVCSIKSVHSQSAIWPSFLTPSQGLIRLPWWKFEAPVSTTQKRKMCQKCTVGQMVNGWYPLATAYATLGMRSGAENAKGC
VCKDKGQCFCDGAKGEKGEKGFPGPPGSPGQKGFTGPEGLPGPQGPKGFPGLPGLTGSKGVRGISGLPGFSGSPGLPGTPGNTGPYGLVG
VPGCSGSKGEQGFPGLPGTLGYPGIPGAAGLKGQKGAPAKEEDIELDAKGDPGLPGAPGPQGLPGPPGFPGPVGPPGPPGFFGFPGAMGP
RGPKGHMGERVIGHKGERGVKGLTGPPGPPGTVIVTLTGPDNRTDLKGEKGDKGAMGEPGPPGPSGLPGESYGSEKGAPGDPGLQGKPGK
DGVPGFPGSEGVKGNRGFPGLMGEDGIKGQKGDIGPPGFRGPTEYYDTYQEKGDEGTPGPPGPRGARGPQGPSGPPGVPGSPGSSRPGLR
GAPGWPGLKGSKGERGRPGKDAMGTPGSPGCAGSPGLPGSPGPPGPPGDIVFRKGPPGDHGLPGYLGSPGIPGVDGPKGEPGLLCTQCPY
IPGPPGLPGLPGLHGVKGIPGRQGAAGLKGSPGSPGNTGLPGFPGFPGAQGDPGLKGEKGETLQPEGQVGVPGDPGLRGQPGRKGLDGIP
GTPGVKGLPGPKGELALSGEKGDQGPPGDPGSPGSPGPAGPAGPPGYGPQGEPGLQGTQGVPGAPGPPGEAGPRGELSVSTPVPGPPGPP
GPPGHPGPQGPPGIPGSLGKCGDPGLPGPDGEPGIPGIGFPGPPGPKGDQGFPGTKGSLGCPGKMGEPGLPGKPGLPGAKGEPAVAMPGG
PGTPGFPGERGNSGEHGEIGLPGLPGLPGTPGNEGLDGPRGDPGQPGPPGEQGPPGRCIEGPRGAQGLPGLNGLKGQQGRRGKTGPKGDP
GIPGLDRSGFPGETGSPGIPGHQGEMGPLGQRGYPGNPGILGPPGEDGVIGMMGFPGAIGPPGPPGNPGTPGQRGSPGIPGVKGQRGTPG
AKGEQGDKGNPGPSEISHVIGDKGEPGLKGFAGNPGEKGNRGVPGMPGLKGLKGLPGPAGPPGPRGDLGSTGNPGEPGLRGIPGSMGNMG
MPGSKGKRGTLGFPGRAGRPGLPGIHGLQGDKGEPGYSEGTRPGPPGPTGDPGLPGDMGKKGEMGQPGPPGHLGPAGPEGAPGSPGSPGL
PGKPGPHGDLGFKGIKGLLGPPGIRGPPGLPGFPGSPGPMGIRGDQGRDGIPGPAGEKGETGLLRAPPGPRGNPGAQGAKGDRGAPGFPG
LPGRKGAMGDAGPRGPTGIEGFPGPPGLPGAIIPGQTGNRGPPGSRGSPGAPGPPGPPGSHVIGIKGDKGSMGHPGPKGPPGTAGDMGPP
GRLGAPGTPGLPGPRGDPGFQGFPGVKGEKGNPGFLGSIGPPGPIGPKGPPGVRGDPGTLKIISLPGSPGPPGTPGEPGMQGEPGPPGPP
GNLGPCGPRGKPGKDGKPGTPGPAGEKGNKGSKGEPGPAGSDGLPGLKGKRGDSGSPATWTTRGFVFTRHSQTTAIPSCPEGTVPLYSGF
SFLFVQGNQRAHGQDLGTLGSCLQRFTTMPFLFCNVNDVCNFASRNDYSYWLSTPALMPMNMAPITGRALEPYISRCTVCEGPAIAIAVH
SQTTDIPPCPHGWISLWKGFSFIMFTSAGSEGTGQALASPGSCLEEFRASPFLECHGRGTCNYYSNSYSFWLASLNPERMFRKPIPSTVK

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for EPHA4-COL4A3


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for EPHA4-COL4A3


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for EPHA4-COL4A3


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource