Center for Computational Systems Medicine
Fusion Gene Summary
Fusion Gene ORF analysis
Fusion Genomic Features
Fusion Protein Features
Fusion Gene Sequence
Fusion Gene PPI analysis
Related Drugs
Related Diseases
Fusion gene: ITGB4-KRT18 (FusionGDB2 ID: 40535)

Fusion Gene Summary for ITGB4-KRT18

Fusion gene summary

Fusion gene information
Fusion gene name: ITGB4-KRT18
Fusion gene ID: 40535

| | Hgene | Tgene |
| --- | --- | --- |
| Gene symbol | ITGB4 | KRT18 |
| Gene ID | 3691 | 3875 |
| Gene name | integrin subunit beta 4 | keratin 18 |
| Synonyms | CD104, GP150 | CK-18, CYK18, K18 |
| Cytomap | 17q25.1 | 12q13.13 |
| Type of gene | protein-coding | protein-coding |
| Description | integrin beta-4; CD104 antigen | keratin, type I cytoskeletal 18; cell proliferation-inducing gene 46 protein; cytokeratin 18; keratin 18, type I |
| Modification date | 20200329 | 20200327 |
| UniProtAcc | . | P05783 |
| Ensembl transcripts involved in fusion gene | ENST00000584558, ENST00000200181, ENST00000339591, ENST00000449880, ENST00000450894, ENST00000579662 | ENST00000388835, ENST00000388837, ENST00000550600 |
| Fusion gene scores: * DoF score | 23 X 19 X 10 = 4370 | 4 X 2 X 2 = 16 |
| # samples | 29 | 5 |
| ** MAII score | log2(29/4370*10) = -3.91350847437303; possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs): DoF>8 and MAII<0 | log2(5/16*10) = 1.64385618977472; effective Gene in Pan-Cancer Fusion Genes (eGinPCFGs): DoF>8 and MAII>0 |
Context

PubMed: ITGB4 [Title/Abstract] AND KRT18 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint: ITGB4(73732235)-KRT18(53345903), # samples: 1
Anticipated loss of major functional domain due to fusion event:
- ITGB4-KRT18 appears to have lost the major protein functional domain in the Hgene partner, which is an essential gene, due to the frame-shifted ORF.
- ITGB4-KRT18 appears to have lost the major protein functional domain in the Hgene partner, which is an IUPHAR drug target, due to the frame-shifted ORF.
- ITGB4-KRT18 appears to have lost the major protein functional domain in the Tgene partner, which is an essential gene, due to the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
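
The two score definitions above can be checked with a few lines of Python. This is a minimal sketch, not FusionGDB's own code: the function names are hypothetical, and the handling of cases outside the two stated rules (DoF <= 8 or MAII exactly 0) is an assumption, since the page does not define them.

```python
import math


def dof_score(n_partners, n_breakpoints, n_cancer_types):
    """Degree of Frequency = # partners X # break points X # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples, dof):
    """Major Active Isofusion Index = log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def classify(dof, maii):
    """Apply the two thresholds stated on the page.

    Returning None for any other combination is an assumption of this sketch.
    """
    if dof > 8 and maii > 0:
        return "eGinPCFGs"   # effective Gene in Pan-Cancer Fusion Genes
    if dof > 8 and maii < 0:
        return "peGinPCFGs"  # possibly effective Gene in Pan-Cancer Fusion Genes
    return None


# Values taken from the summary table: ITGB4 (Hgene) and KRT18 (Tgene).
itgb4_dof = dof_score(23, 19, 10)
krt18_dof = dof_score(4, 2, 2)
itgb4_maii = maii_score(29, itgb4_dof)
krt18_maii = maii_score(5, krt18_dof)
print(itgb4_dof, itgb4_maii, classify(itgb4_dof, itgb4_maii))
print(krt18_dof, krt18_maii, classify(krt18_dof, krt18_maii))
```

With the table values, ITGB4 falls in the peGinPCFGs group and KRT18 in the eGinPCFGs group, matching the labels shown in the summary.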

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez

| Partner | Gene | GO ID | GO term | PubMed ID |
| --- | --- | --- | --- | --- |
| Hgene | ITGB4 | GO:0009611 | response to wounding | 19403692 |
| Hgene | ITGB4 | GO:0031581 | hemidesmosome assembly | 12482924 |
| Tgene | KRT18 | GO:0043000 | Golgi to plasma membrane CFTR protein transport | 15529338 |
| Tgene | KRT18 | GO:0043066 | negative regulation of apoptotic process | 11684708 |
| Tgene | KRT18 | GO:0045104 | intermediate filament cytoskeleton organization | 20346438 |


Fusion gene breakpoints across ITGB4 (5'-gene)
[Figure: all structure]

Fusion gene breakpoints across KRT18 (3'-gene)
[Figure: all structure]

Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinates were lifted over to hg19.

| Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| ChimerDB4 | COAD | TCGA-G4-6293-01A | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |



Fusion Gene ORF analysis for ITGB4-KRT18

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.

| ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 3UTR-3CDS | ENST00000584558 | ENST00000388835 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| 3UTR-3CDS | ENST00000584558 | ENST00000388837 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| 3UTR-3CDS | ENST00000584558 | ENST00000550600 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| Frame-shift | ENST00000200181 | ENST00000550600 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| Frame-shift | ENST00000339591 | ENST00000550600 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| Frame-shift | ENST00000449880 | ENST00000550600 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| Frame-shift | ENST00000450894 | ENST00000550600 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| Frame-shift | ENST00000579662 | ENST00000550600 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| In-frame | ENST00000200181 | ENST00000388835 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| In-frame | ENST00000200181 | ENST00000388837 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| In-frame | ENST00000339591 | ENST00000388835 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| In-frame | ENST00000339591 | ENST00000388837 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| In-frame | ENST00000449880 | ENST00000388835 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| In-frame | ENST00000449880 | ENST00000388837 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| In-frame | ENST00000450894 | ENST00000388835 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| In-frame | ENST00000450894 | ENST00000388837 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| In-frame | ENST00000579662 | ENST00000388835 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |
| In-frame | ENST00000579662 | ENST00000388837 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + |

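ORFfinder result based on the fusion transcript sequence of in-frame fusion genes.

| Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| ENST00000579662 | ENST00000388837 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 2410 | 2007 | 246 | 2351 | 701 |
| ENST00000579662 | ENST00000388835 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 2411 | 2007 | 246 | 2351 | 701 |
| ENST00000339591 | ENST00000388837 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 2351 | 1948 | 163 | 2292 | 709 |
| ENST00000339591 | ENST00000388835 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 2352 | 1948 | 163 | 2292 | 709 |
| ENST00000200181 | ENST00000388837 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 2351 | 1948 | 163 | 2292 | 709 |
| ENST00000200181 | ENST00000388835 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 2352 | 1948 | 163 | 2292 | 709 |
| ENST00000450894 | ENST00000388837 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 2290 | 1887 | 102 | 2231 | 709 |
| ENST00000450894 | ENST00000388835 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 2291 | 1887 | 102 | 2231 | 709 |
| ENST00000449880 | ENST00000388837 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 2172 | 1769 | 8 | 2113 | 701 |
| ENST00000449880 | ENST00000388835 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 2173 | 1769 | 8 | 2113 | 701 |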
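
The predicted start, predicted stop, and amino-acid length columns above appear to be related by simple arithmetic. The sketch below assumes (this is inferred from the numbers, not stated on the page) that start and stop are 1-based inclusive transcript positions and that the stop position includes the stop codon. The function name is hypothetical.

```python
def orf_aa_length(start, stop):
    """Amino-acid length of an ORF given 1-based inclusive transcript coordinates.

    Assumes the span from start to stop includes the terminal stop codon,
    so one codon is subtracted from the codon count.
    """
    span = stop - start + 1
    if span % 3 != 0:
        raise ValueError("ORF span is not a multiple of three nucleotides")
    return span // 3 - 1


# (predicted start, predicted stop, reported amino-acid length) from the table.
rows = [
    (246, 2351, 701),
    (163, 2292, 709),
    (102, 2231, 709),
    (8, 2113, 701),
]
for start, stop, reported in rows:
    print(start, stop, orf_aa_length(start, stop), reported)
```

Under these assumptions, every distinct start/stop pair in the table reproduces the reported amino-acid length.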

DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on a convolutional neural network, built by comparison with real Ribo-seq data. If the no-coding score is < 0.5 and the coding score is > 0.5, the in-frame fusion transcript is predicted as likely to be translated.

| Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| ENST00000579662 | ENST00000388837 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 0.010230832 | 0.9897691 |
| ENST00000579662 | ENST00000388835 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 0.010353083 | 0.98964685 |
| ENST00000339591 | ENST00000388837 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 0.01187161 | 0.98812836 |
| ENST00000339591 | ENST00000388835 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 0.01200354 | 0.9879965 |
| ENST00000200181 | ENST00000388837 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 0.01187161 | 0.98812836 |
| ENST00000200181 | ENST00000388835 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 0.01200354 | 0.9879965 |
| ENST00000450894 | ENST00000388837 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 0.012389291 | 0.9876107 |
| ENST00000450894 | ENST00000388835 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 0.012511262 | 0.9874887 |
| ENST00000449880 | ENST00000388837 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 0.01221029 | 0.98778975 |
| ENST00000449880 | ENST00000388835 | ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345903 | + | 0.012356482 | 0.9876436 |
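
The DeepORF decision rule stated above can be written as a one-line predicate. This sketch only applies the published thresholds to scores already listed in the table; it does not run DeepORF, and the function name is hypothetical.

```python
def likely_translated(no_coding_score, coding_score):
    """Rule from the page: no-coding score < 0.5 and coding score > 0.5."""
    return no_coding_score < 0.5 and coding_score > 0.5


# (Henst, Tenst, no-coding score, coding score) for a few rows of the table.
scores = [
    ("ENST00000579662", "ENST00000388837", 0.010230832, 0.9897691),
    ("ENST00000339591", "ENST00000388835", 0.01200354, 0.9879965),
    ("ENST00000449880", "ENST00000388835", 0.012356482, 0.9876436),
]
for henst, tenst, no_coding, coding in scores:
    print(henst, tenst, likely_translated(no_coding, coding))
```

All listed in-frame transcripts satisfy the rule, since every no-coding score is near 0.01 and every coding score is near 0.99.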


Fusion Genomic Features for ITGB4-KRT18


FusionAI prediction of the potential fusion gene breakpoint based on the pre-mRNA sequence context (+/- 5 kb for each partner gene, 20 kb total sequence length). FusionAI is a fusion gene breakpoint classifier based on a convolutional neural network that compares the fusion-positive and fusion-negative sequence contexts of ~20K fusion genes. The scores indicate how likely each sequence within the 20 kb genomic region is to be used as a gene fusion breakpoint.

| Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345902 | + | 0.48434186 | 0.51565814 |
| ITGB4 | chr17 | 73732235 | + | KRT18 | chr12 | 53345902 | + | 0.48434186 | 0.51565814 |

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci across five broad categories: virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are on the help page.
[Figure: genomic feature]

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions that overlap with the top 1% feature importance score regions. More details are on the help page.
[Figure: genomic feature of top 1%]


Fusion Protein Features for ITGB4-KRT18


Four levels of functional features of fusion genes
Go to the FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr17:73732235/chr12:53345903)
- FGviewer provides online visualization of the retention of protein functional features across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the Tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to find the hyperlink to the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.

Main function of each fusion partner protein (from UniProt)

| | Hgene | Tgene |
| --- | --- | --- |
| Gene | . | KRT18 |
| UniProtAcc | . | P05783 |
| Function | . | FUNCTION: Involved in the uptake of thrombin-antithrombin complexes by hepatic cells (By similarity). When phosphorylated, plays a role in filament reorganization. Involved in the delivery of mutated CFTR to the plasma membrane. Together with KRT8, is involved in interleukin-6 (IL-6)-mediated barrier protection. {ECO:0000250, ECO:0000269\|PubMed:15529338, ECO:0000269\|PubMed:16424149, ECO:0000269\|PubMed:17213200, ECO:0000269\|PubMed:7523419, ECO:0000269\|PubMed:8522591, ECO:0000269\|PubMed:9298992, ECO:0000269\|PubMed:9524113}. |

Retention analysis result of each fusion partner protein across 39 protein features of UniProt: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 regional features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the breakpoint is located before the CDS.
- In-frame and retained protein features among the 13 regional features.

| Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 131_329 | 587 | 1823.0 | Domain | Note=VWFA |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 29_73 | 587 | 1823.0 | Domain | Note=PSI |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 131_329 | 587 | 1806.0 | Domain | Note=VWFA |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 29_73 | 587 | 1806.0 | Domain | Note=PSI |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 131_329 | 587 | 1806.0 | Domain | Note=VWFA |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 29_73 | 587 | 1806.0 | Domain | Note=PSI |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 131_329 | 587 | 1753.0 | Domain | Note=VWFA |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 29_73 | 587 | 1753.0 | Domain | Note=PSI |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 131_329 | 587 | 1753.0 | Domain | Note=VWFA |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 29_73 | 587 | 1753.0 | Domain | Note=PSI |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 194_199 | 587 | 1823.0 | Region | Involved in NRG1- and IGF1-binding |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 194_199 | 587 | 1806.0 | Region | Involved in NRG1- and IGF1-binding |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 194_199 | 587 | 1806.0 | Region | Involved in NRG1- and IGF1-binding |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 194_199 | 587 | 1753.0 | Region | Involved in NRG1- and IGF1-binding |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 194_199 | 587 | 1753.0 | Region | Involved in NRG1- and IGF1-binding |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 456_502 | 587 | 1823.0 | Repeat | Note=I |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 503_542 | 587 | 1823.0 | Repeat | Note=II |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 543_581 | 587 | 1823.0 | Repeat | Note=III |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 456_502 | 587 | 1806.0 | Repeat | Note=I |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 503_542 | 587 | 1806.0 | Repeat | Note=II |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 543_581 | 587 | 1806.0 | Repeat | Note=III |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 456_502 | 587 | 1806.0 | Repeat | Note=I |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 503_542 | 587 | 1806.0 | Repeat | Note=II |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 543_581 | 587 | 1806.0 | Repeat | Note=III |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 456_502 | 587 | 1753.0 | Repeat | Note=I |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 503_542 | 587 | 1753.0 | Repeat | Note=II |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 543_581 | 587 | 1753.0 | Repeat | Note=III |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 456_502 | 587 | 1753.0 | Repeat | Note=I |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 503_542 | 587 | 1753.0 | Repeat | Note=II |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 543_581 | 587 | 1753.0 | Repeat | Note=III |
| Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388835 | | 4 | 7 | 388_430 | 316 | 431.0 | Region | Note=Tail |
| Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388837 | | 5 | 8 | 388_430 | 316 | 431.0 | Region | Note=Tail |

- In-frame and not-retained protein features among the 13 regional features.

| Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 1129_1218 | 587 | 1823.0 | Domain | Fibronectin type-III 1 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 1222_1321 | 587 | 1823.0 | Domain | Fibronectin type-III 2 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 1530_1625 | 587 | 1823.0 | Domain | Fibronectin type-III 3 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 1643_1739 | 587 | 1823.0 | Domain | Fibronectin type-III 4 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 979_1084 | 587 | 1823.0 | Domain | Note=Calx-beta |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 1129_1218 | 587 | 1806.0 | Domain | Fibronectin type-III 1 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 1222_1321 | 587 | 1806.0 | Domain | Fibronectin type-III 2 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 1530_1625 | 587 | 1806.0 | Domain | Fibronectin type-III 3 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 1643_1739 | 587 | 1806.0 | Domain | Fibronectin type-III 4 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 979_1084 | 587 | 1806.0 | Domain | Note=Calx-beta |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 1129_1218 | 587 | 1806.0 | Domain | Fibronectin type-III 1 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 1222_1321 | 587 | 1806.0 | Domain | Fibronectin type-III 2 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 1530_1625 | 587 | 1806.0 | Domain | Fibronectin type-III 3 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 1643_1739 | 587 | 1806.0 | Domain | Fibronectin type-III 4 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 979_1084 | 587 | 1806.0 | Domain | Note=Calx-beta |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 1129_1218 | 587 | 1753.0 | Domain | Fibronectin type-III 1 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 1222_1321 | 587 | 1753.0 | Domain | Fibronectin type-III 2 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 1530_1625 | 587 | 1753.0 | Domain | Fibronectin type-III 3 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 1643_1739 | 587 | 1753.0 | Domain | Fibronectin type-III 4 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 979_1084 | 587 | 1753.0 | Domain | Note=Calx-beta |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 1129_1218 | 587 | 1753.0 | Domain | Fibronectin type-III 1 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 1222_1321 | 587 | 1753.0 | Domain | Fibronectin type-III 2 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 1530_1625 | 587 | 1753.0 | Domain | Fibronectin type-III 3 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 1643_1739 | 587 | 1753.0 | Domain | Fibronectin type-III 4 |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 979_1084 | 587 | 1753.0 | Domain | Note=Calx-beta |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 456_619 | 587 | 1823.0 | Region | Note=Cysteine-rich tandem repeats |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 732_749 | 587 | 1823.0 | Region | Note=Palmitoylated on several cysteines |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 456_619 | 587 | 1806.0 | Region | Note=Cysteine-rich tandem repeats |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 732_749 | 587 | 1806.0 | Region | Note=Palmitoylated on several cysteines |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 456_619 | 587 | 1806.0 | Region | Note=Cysteine-rich tandem repeats |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 732_749 | 587 | 1806.0 | Region | Note=Palmitoylated on several cysteines |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 456_619 | 587 | 1753.0 | Region | Note=Cysteine-rich tandem repeats |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 732_749 | 587 | 1753.0 | Region | Note=Palmitoylated on several cysteines |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 456_619 | 587 | 1753.0 | Region | Note=Cysteine-rich tandem repeats |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 732_749 | 587 | 1753.0 | Region | Note=Palmitoylated on several cysteines |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 582_619 | 587 | 1823.0 | Repeat | Note=IV |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 582_619 | 587 | 1806.0 | Repeat | Note=IV |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 582_619 | 587 | 1806.0 | Repeat | Note=IV |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 582_619 | 587 | 1753.0 | Repeat | Note=IV |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 582_619 | 587 | 1753.0 | Repeat | Note=IV |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 28_710 | 587 | 1823.0 | Topological domain | Extracellular |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 734_1822 | 587 | 1823.0 | Topological domain | Cytoplasmic |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 28_710 | 587 | 1806.0 | Topological domain | Extracellular |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 734_1822 | 587 | 1806.0 | Topological domain | Cytoplasmic |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 28_710 | 587 | 1806.0 | Topological domain | Extracellular |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 734_1822 | 587 | 1806.0 | Topological domain | Cytoplasmic |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 28_710 | 587 | 1753.0 | Topological domain | Extracellular |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 734_1822 | 587 | 1753.0 | Topological domain | Cytoplasmic |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 28_710 | 587 | 1753.0 | Topological domain | Extracellular |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 734_1822 | 587 | 1753.0 | Topological domain | Cytoplasmic |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000200181 | + | 14 | 40 | 711_733 | 587 | 1823.0 | Transmembrane | Helical |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000339591 | + | 14 | 40 | 711_733 | 587 | 1806.0 | Transmembrane | Helical |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000449880 | + | 13 | 39 | 711_733 | 587 | 1806.0 | Transmembrane | Helical |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000450894 | + | 14 | 39 | 711_733 | 587 | 1753.0 | Transmembrane | Helical |
| Hgene | ITGB4 | chr17:73732235 | chr12:53345903 | ENST00000579662 | + | 14 | 39 | 711_733 | 587 | 1753.0 | Transmembrane | Helical |
| Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388835 | | 4 | 7 | 80_391 | 316 | 431.0 | Domain | IF rod |
| Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388837 | | 5 | 8 | 80_391 | 316 | 431.0 | Domain | IF rod |
| Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388835 | | 4 | 7 | 116_132 | 316 | 431.0 | Region | Note=Linker 1 |
| Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388835 | | 4 | 7 | 133_224 | 316 | 431.0 | Region | Note=Coil 1B |
| Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388835 | | 4 | 7 | 225_248 | 316 | 431.0 | Region | Note=Linker 12 |
| Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388835 | | 4 | 7 | 249_387 | 316 | 431.0 | Region | Note=Coil 2 |
| Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388835 | | 4 | 7 | 2_79 | 316 | 431.0 | Region | Note=Head |
| Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388835 | | 4 | 7 | 80_115 | 316 | 431.0 | Region | Note=Coil 1A |
| Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388837 | | 5 | 8 | 116_132 | 316 | 431.0 | Region | Note=Linker 1 |
| Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388837 | | 5 | 8 | 133_224 | 316 | 431.0 | Region | Note=Coil 1B |
| Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388837 | | 5 | 8 | 225_248 | 316 | 431.0 | Region | Note=Linker 12 |
| Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388837 | | 5 | 8 | 249_387 | 316 | 431.0 | Region | Note=Coil 2 |
| Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388837 | | 5 | 8 | 2_79 | 316 | 431.0 | Region | Note=Head |
| Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388837 | | 5 | 8 | 80_115 | 316 | 431.0 | Region | Note=Coil 1A |
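
The retained and not-retained tables above are consistent with a simple positional rule: a 5'-partner (Hgene) feature is retained when it lies entirely before the breakpoint locus, and a 3'-partner (Tgene) feature is retained when it lies entirely after it. The sketch below illustrates that rule. It is inferred from the listed rows rather than taken from FusionGDB's documentation; the function names are hypothetical, and the treatment of a feature that ends or starts exactly at BPloci is an assumption.

```python
def parse_loci(loci):
    """Split a 'start_end' feature locus such as '131_329' into two integers."""
    start, end = loci.split("_")
    return int(start), int(end)


def is_retained(partner, loci, bp_locus):
    """Return True if the feature lies wholly on the retained side of the breakpoint.

    Hgene (5' partner): the feature must end at or before BPloci.
    Tgene (3' partner): the feature must start at or after BPloci.
    Counting a feature that touches BPloci exactly as retained is an assumption.
    """
    start, end = parse_loci(loci)
    if partner == "Hgene":
        return end <= bp_locus
    if partner == "Tgene":
        return start >= bp_locus
    raise ValueError("partner must be 'Hgene' or 'Tgene'")


# Examples taken from the two tables (ITGB4 BPloci = 587, KRT18 BPloci = 316).
print(is_retained("Hgene", "543_581", 587))  # Repeat III
print(is_retained("Hgene", "582_619", 587))  # Repeat IV spans the breakpoint
print(is_retained("Tgene", "388_430", 316))  # Tail
print(is_retained("Tgene", "249_387", 316))  # Coil 2 spans the breakpoint
```

Under this rule a feature that spans the breakpoint, such as the ITGB4 extracellular topological domain (28_710) or the KRT18 IF rod domain (80_391), is reported as not retained even though part of it is present in the fusion protein.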



Fusion Gene Sequence for ITGB4-KRT18


For in-frame fusion transcripts, we provide the fusion transcript sequences and the fusion amino acid sequences. To obtain the fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among all predicted ones.
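
The idea of choosing the longest ORF can be illustrated with a short forward-strand scan. This is a simplified sketch and not ORFfinder itself: it assumes ATG as the only start codon, the standard stop codons, and forward frames only, and the function name is hypothetical. The toy sequences are invented for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_forward_orf(seq):
    """Find the longest ATG-to-stop ORF in the three forward reading frames.

    Returns (start, end, n_codons), where start and end are 1-based inclusive
    nucleotide positions, end includes the stop codon, and n_codons excludes it.
    Returns None if no complete ORF is found.
    """
    seq = seq.upper()
    best = None
    for frame in range(3):
        start = None  # 0-based index of the current open ATG, if any
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                n_codons = (i - start) // 3
                if best is None or n_codons > best[2]:
                    best = (start + 1, i + 3, n_codons)
                start = None  # look for the next ORF in this frame
    return best


# Toy sequences (not from the fusion transcript).
print(longest_forward_orf("CCATGAAATAGG"))
print(longest_forward_orf("ATGAAACCCGGGTTTTAACATGTAG"))
```

The returned coordinates follow the same convention that the ORFfinder table on this page appears to use: for example, a start of 163 and a stop of 2292 would correspond to 709 codons before the stop codon.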
>40535_40535_1_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000200181_KRT18_chr12_53345903_ENST00000388835_length(transcript)=2352nt_BP=1948nt
GCGCTGCCCGCCTCGTCCCCACCCCCCCAACCCCCGCGCCCGCCCTCGGACAGTCCCTGCTCGCCCGCGCGCTGCAGCCCCATCTCCTAG
CGGCAGCCCAGGCGCGGAGGGAGCGAGTCCGCCCCGAGGTAGGTCCAGGACGGGCGCACAGCAGCAGCCGAGGCTGGCCGGGAGAGGGAG
GAAGAGGATGGCAGGGCCACGCCCCAGCCCATGGGCCAGGCTGCTCCTGGCAGCCTTGATCAGCGTCAGCCTCTCTGGGACCTTGGCAAA
CCGCTGCAAGAAGGCCCCAGTGAAGAGCTGCACGGAGTGTGTCCGTGTGGATAAGGACTGCGCCTACTGCACAGACGAGATGTTCAGGGA
CCGGCGCTGCAACACCCAGGCGGAGCTGCTGGCCGCGGGCTGCCAGCGGGAGAGCATCGTGGTCATGGAGAGCAGCTTCCAAATCACAGA
GGAGACCCAGATTGACACCACCCTGCGGCGCAGCCAGATGTCCCCCCAAGGCCTGCGGGTCCGTCTGCGGCCCGGTGAGGAGCGGCATTT
TGAGCTGGAGGTGTTTGAGCCACTGGAGAGCCCCGTGGACCTGTACATCCTCATGGACTTCTCCAACTCCATGTCCGATGATCTGGACAA
CCTCAAGAAGATGGGGCAGAACCTGGCTCGGGTCCTGAGCCAGCTCACCAGCGACTACACTATTGGATTTGGCAAGTTTGTGGACAAAGT
CAGCGTCCCGCAGACGGACATGAGGCCTGAGAAGCTGAAGGAGCCCTGGCCCAACAGTGACCCCCCCTTCTCCTTCAAGAACGTCATCAG
CCTGACAGAAGATGTGGATGAGTTCCGGAATAAACTGCAGGGAGAGCGGATCTCAGGCAACCTGGATGCTCCTGAGGGCGGCTTCGATGC
CATCCTGCAGACAGCTGTGTGCACGAGGGACATTGGCTGGCGCCCGGACAGCACCCACCTGCTGGTCTTCTCCACCGAGTCAGCCTTCCA
CTATGAGGCTGATGGCGCCAACGTGCTGGCTGGCATCATGAGCCGCAACGATGAACGGTGCCACCTGGACACCACGGGCACCTACACCCA
GTACAGGACACAGGACTACCCGTCGGTGCCCACCCTGGTGCGCCTGCTCGCCAAGCACAACATCATCCCCATCTTTGCTGTCACCAACTA
CTCCTATAGCTACTACGAGAAGCTTCACACCTATTTCCCTGTCTCCTCACTGGGGGTGCTGCAGGAGGACTCGTCCAACATCGTGGAGCT
GCTGGAGGAGGCCTTCAATCGGATCCGCTCCAACCTGGACATCCGGGCCCTAGACAGCCCCCGAGGCCTTCGGACAGAGGTCACCTCCAA
GATGTTCCAGAAGACGAGGACTGGGTCCTTTCACATCCGGCGGGGGGAAGTGGGTATATACCAGGTGCAGCTGCGGGCCCTTGAGCACGT
GGATGGGACGCACGTGTGCCAGCTGCCGGAGGACCAGAAGGGCAACATCCATCTGAAACCTTCCTTCTCCGACGGCCTCAAGATGGACGC
GGGCATCATCTGTGATGTGTGCACCTGCGAGCTGCAAAAAGAGGTGCGGTCAGCTCGCTGCAGCTTCAACGGAGACTTCGTGTGCGGACA
GTGTGTGTGCAGCGAGGGCTGGAGTGGCCAGACCTGCAACTGCTCCACCGGCTCTCTGAGTGACATTCAGCCCTGCCTGCGGGAGGGCGA
GGACAAGCCGTGCTCCGGCCGTGGGGAGTGCCAGTGCGGGCACTGTGTGTGCTACGGCGAAGGCCGCTACGAGGGTCAGTTCTGCGAGTA
TGACAACTTCCAGTGTCCCCGCACTTCCGGGTTCCTCTGCAATGACCGAGGACGCTGCTCCATGGGCCAGTGTGTGTGTGAGCCTGGTTG
GACAGGCCCAAGCTGTGACTGTCCCCTCAGCAATGCCACCTGCATCGACAGCAATGGGAAGGCCAGCTTGGAGAACAGCCTGAGGGAGGT
GGAGGCCCGCTACGCCCTACAGATGGAGCAGCTCAACGGGATCCTGCTGCACCTTGAGTCAGAGCTGGCACAGACCCGGGCAGAGGGACA
GCGCCAGGCCCAGGAGTATGAGGCCCTGCTGAACATCAAGGTCAAGCTGGAGGCTGAGATCGCCACCTACCGCCGCCTGCTGGAAGATGG
CGAGGACTTTAATCTTGGTGATGCCTTGGACAGCAGCAACTCCATGCAAACCATCCAAAAGACCACCACCCGCCGGATAGTGGATGGCAA
AGTGGTGTCTGAGACCAATGACACCAAAGTTCTGAGGCATTAAGCCAGCAGAAGCAGGGTACCCTTTGGGGAGCAGGAGGCCAATAAAAA

>40535_40535_1_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000200181_KRT18_chr12_53345903_ENST00000388835_length(amino acids)=709AA_BP=110
MAGRGRKRMAGPRPSPWARLLLAALISVSLSGTLANRCKKAPVKSCTECVRVDKDCAYCTDEMFRDRRCNTQAELLAAGCQRESIVVMES
SFQITEETQIDTTLRRSQMSPQGLRVRLRPGEERHFELEVFEPLESPVDLYILMDFSNSMSDDLDNLKKMGQNLARVLSQLTSDYTIGFG
KFVDKVSVPQTDMRPEKLKEPWPNSDPPFSFKNVISLTEDVDEFRNKLQGERISGNLDAPEGGFDAILQTAVCTRDIGWRPDSTHLLVFS
TESAFHYEADGANVLAGIMSRNDERCHLDTTGTYTQYRTQDYPSVPTLVRLLAKHNIIPIFAVTNYSYSYYEKLHTYFPVSSLGVLQEDS
SNIVELLEEAFNRIRSNLDIRALDSPRGLRTEVTSKMFQKTRTGSFHIRRGEVGIYQVQLRALEHVDGTHVCQLPEDQKGNIHLKPSFSD
GLKMDAGIICDVCTCELQKEVRSARCSFNGDFVCGQCVCSEGWSGQTCNCSTGSLSDIQPCLREGEDKPCSGRGECQCGHCVCYGEGRYE
GQFCEYDNFQCPRTSGFLCNDRGRCSMGQCVCEPGWTGPSCDCPLSNATCIDSNGKASLENSLREVEARYALQMEQLNGILLHLESELAQ

--------------------------------------------------------------
>40535_40535_2_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000200181_KRT18_chr12_53345903_ENST00000388837_length(transcript)=2351nt_BP=1948nt
GCGCTGCCCGCCTCGTCCCCACCCCCCCAACCCCCGCGCCCGCCCTCGGACAGTCCCTGCTCGCCCGCGCGCTGCAGCCCCATCTCCTAG
CGGCAGCCCAGGCGCGGAGGGAGCGAGTCCGCCCCGAGGTAGGTCCAGGACGGGCGCACAGCAGCAGCCGAGGCTGGCCGGGAGAGGGAG
GAAGAGGATGGCAGGGCCACGCCCCAGCCCATGGGCCAGGCTGCTCCTGGCAGCCTTGATCAGCGTCAGCCTCTCTGGGACCTTGGCAAA
CCGCTGCAAGAAGGCCCCAGTGAAGAGCTGCACGGAGTGTGTCCGTGTGGATAAGGACTGCGCCTACTGCACAGACGAGATGTTCAGGGA
CCGGCGCTGCAACACCCAGGCGGAGCTGCTGGCCGCGGGCTGCCAGCGGGAGAGCATCGTGGTCATGGAGAGCAGCTTCCAAATCACAGA
GGAGACCCAGATTGACACCACCCTGCGGCGCAGCCAGATGTCCCCCCAAGGCCTGCGGGTCCGTCTGCGGCCCGGTGAGGAGCGGCATTT
TGAGCTGGAGGTGTTTGAGCCACTGGAGAGCCCCGTGGACCTGTACATCCTCATGGACTTCTCCAACTCCATGTCCGATGATCTGGACAA
CCTCAAGAAGATGGGGCAGAACCTGGCTCGGGTCCTGAGCCAGCTCACCAGCGACTACACTATTGGATTTGGCAAGTTTGTGGACAAAGT
CAGCGTCCCGCAGACGGACATGAGGCCTGAGAAGCTGAAGGAGCCCTGGCCCAACAGTGACCCCCCCTTCTCCTTCAAGAACGTCATCAG
CCTGACAGAAGATGTGGATGAGTTCCGGAATAAACTGCAGGGAGAGCGGATCTCAGGCAACCTGGATGCTCCTGAGGGCGGCTTCGATGC
CATCCTGCAGACAGCTGTGTGCACGAGGGACATTGGCTGGCGCCCGGACAGCACCCACCTGCTGGTCTTCTCCACCGAGTCAGCCTTCCA
CTATGAGGCTGATGGCGCCAACGTGCTGGCTGGCATCATGAGCCGCAACGATGAACGGTGCCACCTGGACACCACGGGCACCTACACCCA
GTACAGGACACAGGACTACCCGTCGGTGCCCACCCTGGTGCGCCTGCTCGCCAAGCACAACATCATCCCCATCTTTGCTGTCACCAACTA
CTCCTATAGCTACTACGAGAAGCTTCACACCTATTTCCCTGTCTCCTCACTGGGGGTGCTGCAGGAGGACTCGTCCAACATCGTGGAGCT
GCTGGAGGAGGCCTTCAATCGGATCCGCTCCAACCTGGACATCCGGGCCCTAGACAGCCCCCGAGGCCTTCGGACAGAGGTCACCTCCAA
GATGTTCCAGAAGACGAGGACTGGGTCCTTTCACATCCGGCGGGGGGAAGTGGGTATATACCAGGTGCAGCTGCGGGCCCTTGAGCACGT
GGATGGGACGCACGTGTGCCAGCTGCCGGAGGACCAGAAGGGCAACATCCATCTGAAACCTTCCTTCTCCGACGGCCTCAAGATGGACGC
GGGCATCATCTGTGATGTGTGCACCTGCGAGCTGCAAAAAGAGGTGCGGTCAGCTCGCTGCAGCTTCAACGGAGACTTCGTGTGCGGACA
GTGTGTGTGCAGCGAGGGCTGGAGTGGCCAGACCTGCAACTGCTCCACCGGCTCTCTGAGTGACATTCAGCCCTGCCTGCGGGAGGGCGA
GGACAAGCCGTGCTCCGGCCGTGGGGAGTGCCAGTGCGGGCACTGTGTGTGCTACGGCGAAGGCCGCTACGAGGGTCAGTTCTGCGAGTA
TGACAACTTCCAGTGTCCCCGCACTTCCGGGTTCCTCTGCAATGACCGAGGACGCTGCTCCATGGGCCAGTGTGTGTGTGAGCCTGGTTG
GACAGGCCCAAGCTGTGACTGTCCCCTCAGCAATGCCACCTGCATCGACAGCAATGGGAAGGCCAGCTTGGAGAACAGCCTGAGGGAGGT
GGAGGCCCGCTACGCCCTACAGATGGAGCAGCTCAACGGGATCCTGCTGCACCTTGAGTCAGAGCTGGCACAGACCCGGGCAGAGGGACA
GCGCCAGGCCCAGGAGTATGAGGCCCTGCTGAACATCAAGGTCAAGCTGGAGGCTGAGATCGCCACCTACCGCCGCCTGCTGGAAGATGG
CGAGGACTTTAATCTTGGTGATGCCTTGGACAGCAGCAACTCCATGCAAACCATCCAAAAGACCACCACCCGCCGGATAGTGGATGGCAA
AGTGGTGTCTGAGACCAATGACACCAAAGTTCTGAGGCATTAAGCCAGCAGAAGCAGGGTACCCTTTGGGGAGCAGGAGGCCAATAAAAA

>40535_40535_2_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000200181_KRT18_chr12_53345903_ENST00000388837_length(amino acids)=709AA_BP=110
MAGRGRKRMAGPRPSPWARLLLAALISVSLSGTLANRCKKAPVKSCTECVRVDKDCAYCTDEMFRDRRCNTQAELLAAGCQRESIVVMES
SFQITEETQIDTTLRRSQMSPQGLRVRLRPGEERHFELEVFEPLESPVDLYILMDFSNSMSDDLDNLKKMGQNLARVLSQLTSDYTIGFG
KFVDKVSVPQTDMRPEKLKEPWPNSDPPFSFKNVISLTEDVDEFRNKLQGERISGNLDAPEGGFDAILQTAVCTRDIGWRPDSTHLLVFS
TESAFHYEADGANVLAGIMSRNDERCHLDTTGTYTQYRTQDYPSVPTLVRLLAKHNIIPIFAVTNYSYSYYEKLHTYFPVSSLGVLQEDS
SNIVELLEEAFNRIRSNLDIRALDSPRGLRTEVTSKMFQKTRTGSFHIRRGEVGIYQVQLRALEHVDGTHVCQLPEDQKGNIHLKPSFSD
GLKMDAGIICDVCTCELQKEVRSARCSFNGDFVCGQCVCSEGWSGQTCNCSTGSLSDIQPCLREGEDKPCSGRGECQCGHCVCYGEGRYE
GQFCEYDNFQCPRTSGFLCNDRGRCSMGQCVCEPGWTGPSCDCPLSNATCIDSNGKASLENSLREVEARYALQMEQLNGILLHLESELAQ

--------------------------------------------------------------
>40535_40535_3_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000339591_KRT18_chr12_53345903_ENST00000388835_length(transcript)=2352nt_BP=1948nt
GCGCTGCCCGCCTCGTCCCCACCCCCCCAACCCCCGCGCCCGCCCTCGGACAGTCCCTGCTCGCCCGCGCGCTGCAGCCCCATCTCCTAG
CGGCAGCCCAGGCGCGGAGGGAGCGAGTCCGCCCCGAGGTAGGTCCAGGACGGGCGCACAGCAGCAGCCGAGGCTGGCCGGGAGAGGGAG
GAAGAGGATGGCAGGGCCACGCCCCAGCCCATGGGCCAGGCTGCTCCTGGCAGCCTTGATCAGCGTCAGCCTCTCTGGGACCTTGGCAAA
CCGCTGCAAGAAGGCCCCAGTGAAGAGCTGCACGGAGTGTGTCCGTGTGGATAAGGACTGCGCCTACTGCACAGACGAGATGTTCAGGGA
CCGGCGCTGCAACACCCAGGCGGAGCTGCTGGCCGCGGGCTGCCAGCGGGAGAGCATCGTGGTCATGGAGAGCAGCTTCCAAATCACAGA
GGAGACCCAGATTGACACCACCCTGCGGCGCAGCCAGATGTCCCCCCAAGGCCTGCGGGTCCGTCTGCGGCCCGGTGAGGAGCGGCATTT
TGAGCTGGAGGTGTTTGAGCCACTGGAGAGCCCCGTGGACCTGTACATCCTCATGGACTTCTCCAACTCCATGTCCGATGATCTGGACAA
CCTCAAGAAGATGGGGCAGAACCTGGCTCGGGTCCTGAGCCAGCTCACCAGCGACTACACTATTGGATTTGGCAAGTTTGTGGACAAAGT
CAGCGTCCCGCAGACGGACATGAGGCCTGAGAAGCTGAAGGAGCCCTGGCCCAACAGTGACCCCCCCTTCTCCTTCAAGAACGTCATCAG
CCTGACAGAAGATGTGGATGAGTTCCGGAATAAACTGCAGGGAGAGCGGATCTCAGGCAACCTGGATGCTCCTGAGGGCGGCTTCGATGC
CATCCTGCAGACAGCTGTGTGCACGAGGGACATTGGCTGGCGCCCGGACAGCACCCACCTGCTGGTCTTCTCCACCGAGTCAGCCTTCCA
CTATGAGGCTGATGGCGCCAACGTGCTGGCTGGCATCATGAGCCGCAACGATGAACGGTGCCACCTGGACACCACGGGCACCTACACCCA
GTACAGGACACAGGACTACCCGTCGGTGCCCACCCTGGTGCGCCTGCTCGCCAAGCACAACATCATCCCCATCTTTGCTGTCACCAACTA
CTCCTATAGCTACTACGAGAAGCTTCACACCTATTTCCCTGTCTCCTCACTGGGGGTGCTGCAGGAGGACTCGTCCAACATCGTGGAGCT
GCTGGAGGAGGCCTTCAATCGGATCCGCTCCAACCTGGACATCCGGGCCCTAGACAGCCCCCGAGGCCTTCGGACAGAGGTCACCTCCAA
GATGTTCCAGAAGACGAGGACTGGGTCCTTTCACATCCGGCGGGGGGAAGTGGGTATATACCAGGTGCAGCTGCGGGCCCTTGAGCACGT
GGATGGGACGCACGTGTGCCAGCTGCCGGAGGACCAGAAGGGCAACATCCATCTGAAACCTTCCTTCTCCGACGGCCTCAAGATGGACGC
GGGCATCATCTGTGATGTGTGCACCTGCGAGCTGCAAAAAGAGGTGCGGTCAGCTCGCTGCAGCTTCAACGGAGACTTCGTGTGCGGACA
GTGTGTGTGCAGCGAGGGCTGGAGTGGCCAGACCTGCAACTGCTCCACCGGCTCTCTGAGTGACATTCAGCCCTGCCTGCGGGAGGGCGA
GGACAAGCCGTGCTCCGGCCGTGGGGAGTGCCAGTGCGGGCACTGTGTGTGCTACGGCGAAGGCCGCTACGAGGGTCAGTTCTGCGAGTA
TGACAACTTCCAGTGTCCCCGCACTTCCGGGTTCCTCTGCAATGACCGAGGACGCTGCTCCATGGGCCAGTGTGTGTGTGAGCCTGGTTG
GACAGGCCCAAGCTGTGACTGTCCCCTCAGCAATGCCACCTGCATCGACAGCAATGGGAAGGCCAGCTTGGAGAACAGCCTGAGGGAGGT
GGAGGCCCGCTACGCCCTACAGATGGAGCAGCTCAACGGGATCCTGCTGCACCTTGAGTCAGAGCTGGCACAGACCCGGGCAGAGGGACA
GCGCCAGGCCCAGGAGTATGAGGCCCTGCTGAACATCAAGGTCAAGCTGGAGGCTGAGATCGCCACCTACCGCCGCCTGCTGGAAGATGG
CGAGGACTTTAATCTTGGTGATGCCTTGGACAGCAGCAACTCCATGCAAACCATCCAAAAGACCACCACCCGCCGGATAGTGGATGGCAA
AGTGGTGTCTGAGACCAATGACACCAAAGTTCTGAGGCATTAAGCCAGCAGAAGCAGGGTACCCTTTGGGGAGCAGGAGGCCAATAAAAA

>40535_40535_3_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000339591_KRT18_chr12_53345903_ENST00000388835_length(amino acids)=709AA_BP=110
MAGRGRKRMAGPRPSPWARLLLAALISVSLSGTLANRCKKAPVKSCTECVRVDKDCAYCTDEMFRDRRCNTQAELLAAGCQRESIVVMES
SFQITEETQIDTTLRRSQMSPQGLRVRLRPGEERHFELEVFEPLESPVDLYILMDFSNSMSDDLDNLKKMGQNLARVLSQLTSDYTIGFG
KFVDKVSVPQTDMRPEKLKEPWPNSDPPFSFKNVISLTEDVDEFRNKLQGERISGNLDAPEGGFDAILQTAVCTRDIGWRPDSTHLLVFS
TESAFHYEADGANVLAGIMSRNDERCHLDTTGTYTQYRTQDYPSVPTLVRLLAKHNIIPIFAVTNYSYSYYEKLHTYFPVSSLGVLQEDS
SNIVELLEEAFNRIRSNLDIRALDSPRGLRTEVTSKMFQKTRTGSFHIRRGEVGIYQVQLRALEHVDGTHVCQLPEDQKGNIHLKPSFSD
GLKMDAGIICDVCTCELQKEVRSARCSFNGDFVCGQCVCSEGWSGQTCNCSTGSLSDIQPCLREGEDKPCSGRGECQCGHCVCYGEGRYE
GQFCEYDNFQCPRTSGFLCNDRGRCSMGQCVCEPGWTGPSCDCPLSNATCIDSNGKASLENSLREVEARYALQMEQLNGILLHLESELAQ

--------------------------------------------------------------
>40535_40535_4_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000339591_KRT18_chr12_53345903_ENST00000388837_length(transcript)=2351nt_BP=1948nt
GCGCTGCCCGCCTCGTCCCCACCCCCCCAACCCCCGCGCCCGCCCTCGGACAGTCCCTGCTCGCCCGCGCGCTGCAGCCCCATCTCCTAG
CGGCAGCCCAGGCGCGGAGGGAGCGAGTCCGCCCCGAGGTAGGTCCAGGACGGGCGCACAGCAGCAGCCGAGGCTGGCCGGGAGAGGGAG
GAAGAGGATGGCAGGGCCACGCCCCAGCCCATGGGCCAGGCTGCTCCTGGCAGCCTTGATCAGCGTCAGCCTCTCTGGGACCTTGGCAAA
CCGCTGCAAGAAGGCCCCAGTGAAGAGCTGCACGGAGTGTGTCCGTGTGGATAAGGACTGCGCCTACTGCACAGACGAGATGTTCAGGGA
CCGGCGCTGCAACACCCAGGCGGAGCTGCTGGCCGCGGGCTGCCAGCGGGAGAGCATCGTGGTCATGGAGAGCAGCTTCCAAATCACAGA
GGAGACCCAGATTGACACCACCCTGCGGCGCAGCCAGATGTCCCCCCAAGGCCTGCGGGTCCGTCTGCGGCCCGGTGAGGAGCGGCATTT
TGAGCTGGAGGTGTTTGAGCCACTGGAGAGCCCCGTGGACCTGTACATCCTCATGGACTTCTCCAACTCCATGTCCGATGATCTGGACAA
CCTCAAGAAGATGGGGCAGAACCTGGCTCGGGTCCTGAGCCAGCTCACCAGCGACTACACTATTGGATTTGGCAAGTTTGTGGACAAAGT
CAGCGTCCCGCAGACGGACATGAGGCCTGAGAAGCTGAAGGAGCCCTGGCCCAACAGTGACCCCCCCTTCTCCTTCAAGAACGTCATCAG
CCTGACAGAAGATGTGGATGAGTTCCGGAATAAACTGCAGGGAGAGCGGATCTCAGGCAACCTGGATGCTCCTGAGGGCGGCTTCGATGC
CATCCTGCAGACAGCTGTGTGCACGAGGGACATTGGCTGGCGCCCGGACAGCACCCACCTGCTGGTCTTCTCCACCGAGTCAGCCTTCCA
CTATGAGGCTGATGGCGCCAACGTGCTGGCTGGCATCATGAGCCGCAACGATGAACGGTGCCACCTGGACACCACGGGCACCTACACCCA
GTACAGGACACAGGACTACCCGTCGGTGCCCACCCTGGTGCGCCTGCTCGCCAAGCACAACATCATCCCCATCTTTGCTGTCACCAACTA
CTCCTATAGCTACTACGAGAAGCTTCACACCTATTTCCCTGTCTCCTCACTGGGGGTGCTGCAGGAGGACTCGTCCAACATCGTGGAGCT
GCTGGAGGAGGCCTTCAATCGGATCCGCTCCAACCTGGACATCCGGGCCCTAGACAGCCCCCGAGGCCTTCGGACAGAGGTCACCTCCAA
GATGTTCCAGAAGACGAGGACTGGGTCCTTTCACATCCGGCGGGGGGAAGTGGGTATATACCAGGTGCAGCTGCGGGCCCTTGAGCACGT
GGATGGGACGCACGTGTGCCAGCTGCCGGAGGACCAGAAGGGCAACATCCATCTGAAACCTTCCTTCTCCGACGGCCTCAAGATGGACGC
GGGCATCATCTGTGATGTGTGCACCTGCGAGCTGCAAAAAGAGGTGCGGTCAGCTCGCTGCAGCTTCAACGGAGACTTCGTGTGCGGACA
GTGTGTGTGCAGCGAGGGCTGGAGTGGCCAGACCTGCAACTGCTCCACCGGCTCTCTGAGTGACATTCAGCCCTGCCTGCGGGAGGGCGA
GGACAAGCCGTGCTCCGGCCGTGGGGAGTGCCAGTGCGGGCACTGTGTGTGCTACGGCGAAGGCCGCTACGAGGGTCAGTTCTGCGAGTA
TGACAACTTCCAGTGTCCCCGCACTTCCGGGTTCCTCTGCAATGACCGAGGACGCTGCTCCATGGGCCAGTGTGTGTGTGAGCCTGGTTG
GACAGGCCCAAGCTGTGACTGTCCCCTCAGCAATGCCACCTGCATCGACAGCAATGGGAAGGCCAGCTTGGAGAACAGCCTGAGGGAGGT
GGAGGCCCGCTACGCCCTACAGATGGAGCAGCTCAACGGGATCCTGCTGCACCTTGAGTCAGAGCTGGCACAGACCCGGGCAGAGGGACA
GCGCCAGGCCCAGGAGTATGAGGCCCTGCTGAACATCAAGGTCAAGCTGGAGGCTGAGATCGCCACCTACCGCCGCCTGCTGGAAGATGG
CGAGGACTTTAATCTTGGTGATGCCTTGGACAGCAGCAACTCCATGCAAACCATCCAAAAGACCACCACCCGCCGGATAGTGGATGGCAA
AGTGGTGTCTGAGACCAATGACACCAAAGTTCTGAGGCATTAAGCCAGCAGAAGCAGGGTACCCTTTGGGGAGCAGGAGGCCAATAAAAA

>40535_40535_4_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000339591_KRT18_chr12_53345903_ENST00000388837_length(amino acids)=709AA_BP=110
MAGRGRKRMAGPRPSPWARLLLAALISVSLSGTLANRCKKAPVKSCTECVRVDKDCAYCTDEMFRDRRCNTQAELLAAGCQRESIVVMES
SFQITEETQIDTTLRRSQMSPQGLRVRLRPGEERHFELEVFEPLESPVDLYILMDFSNSMSDDLDNLKKMGQNLARVLSQLTSDYTIGFG
KFVDKVSVPQTDMRPEKLKEPWPNSDPPFSFKNVISLTEDVDEFRNKLQGERISGNLDAPEGGFDAILQTAVCTRDIGWRPDSTHLLVFS
TESAFHYEADGANVLAGIMSRNDERCHLDTTGTYTQYRTQDYPSVPTLVRLLAKHNIIPIFAVTNYSYSYYEKLHTYFPVSSLGVLQEDS
SNIVELLEEAFNRIRSNLDIRALDSPRGLRTEVTSKMFQKTRTGSFHIRRGEVGIYQVQLRALEHVDGTHVCQLPEDQKGNIHLKPSFSD
GLKMDAGIICDVCTCELQKEVRSARCSFNGDFVCGQCVCSEGWSGQTCNCSTGSLSDIQPCLREGEDKPCSGRGECQCGHCVCYGEGRYE
GQFCEYDNFQCPRTSGFLCNDRGRCSMGQCVCEPGWTGPSCDCPLSNATCIDSNGKASLENSLREVEARYALQMEQLNGILLHLESELAQ

--------------------------------------------------------------
>40535_40535_5_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000449880_KRT18_chr12_53345903_ENST00000388835_length(transcript)=2173nt_BP=1769nt
GGAAGAGGATGGCAGGGCCACGCCCCAGCCCATGGGCCAGGCTGCTCCTGGCAGCCTTGATCAGCGTCAGCCTCTCTGGGACCTTGGCAA
ACCGCTGCAAGAAGGCCCCAGTGAAGAGCTGCACGGAGTGTGTCCGTGTGGATAAGGACTGCGCCTACTGCACAGACGAGATGTTCAGGG
ACCGGCGCTGCAACACCCAGGCGGAGCTGCTGGCCGCGGGCTGCCAGCGGGAGAGCATCGTGGTCATGGAGAGCAGCTTCCAAATCACAG
AGGAGACCCAGATTGACACCACCCTGCGGCGCAGCCAGATGTCCCCCCAAGGCCTGCGGGTCCGTCTGCGGCCCGGTGAGGAGCGGCATT
TTGAGCTGGAGGTGTTTGAGCCACTGGAGAGCCCCGTGGACCTGTACATCCTCATGGACTTCTCCAACTCCATGTCCGATGATCTGGACA
ACCTCAAGAAGATGGGGCAGAACCTGGCTCGGGTCCTGAGCCAGCTCACCAGCGACTACACTATTGGATTTGGCAAGTTTGTGGACAAAG
TCAGCGTCCCGCAGACGGACATGAGGCCTGAGAAGCTGAAGGAGCCCTGGCCCAACAGTGACCCCCCCTTCTCCTTCAAGAACGTCATCA
GCCTGACAGAAGATGTGGATGAGTTCCGGAATAAACTGCAGGGAGAGCGGATCTCAGGCAACCTGGATGCTCCTGAGGGCGGCTTCGATG
CCATCCTGCAGACAGCTGTGTGCACGAGGGACATTGGCTGGCGCCCGGACAGCACCCACCTGCTGGTCTTCTCCACCGAGTCAGCCTTCC
ACTATGAGGCTGATGGCGCCAACGTGCTGGCTGGCATCATGAGCCGCAACGATGAACGGTGCCACCTGGACACCACGGGCACCTACACCC
AGTACAGGACACAGGACTACCCGTCGGTGCCCACCCTGGTGCGCCTGCTCGCCAAGCACAACATCATCCCCATCTTTGCTGTCACCAACT
ACTCCTATAGCTACTACGAGAAGCTTCACACCTATTTCCCTGTCTCCTCACTGGGGGTGCTGCAGGAGGACTCGTCCAACATCGTGGAGC
TGCTGGAGGAGGCCTTCAATCGGATCCGCTCCAACCTGGACATCCGGGCCCTAGACAGCCCCCGAGGCCTTCGGACAGAGGTCACCTCCA
AGATGTTCCAGAAGACGAGGACTGGGTCCTTTCACATCCGGCGGGGGGAAGTGGGTATATACCAGGTGCAGCTGCGGGCCCTTGAGCACG
TGGATGGGACGCACGTGTGCCAGCTGCCGGAGGACCAGAAGGGCAACATCCATCTGAAACCTTCCTTCTCCGACGGCCTCAAGATGGACG
CGGGCATCATCTGTGATGTGTGCACCTGCGAGCTGCAAAAAGAGGTGCGGTCAGCTCGCTGCAGCTTCAACGGAGACTTCGTGTGCGGAC
AGTGTGTGTGCAGCGAGGGCTGGAGTGGCCAGACCTGCAACTGCTCCACCGGCTCTCTGAGTGACATTCAGCCCTGCCTGCGGGAGGGCG
AGGACAAGCCGTGCTCCGGCCGTGGGGAGTGCCAGTGCGGGCACTGTGTGTGCTACGGCGAAGGCCGCTACGAGGGTCAGTTCTGCGAGT
ATGACAACTTCCAGTGTCCCCGCACTTCCGGGTTCCTCTGCAATGACCGAGGACGCTGCTCCATGGGCCAGTGTGTGTGTGAGCCTGGTT
GGACAGGCCCAAGCTGTGACTGTCCCCTCAGCAATGCCACCTGCATCGACAGCAATGGGAAGGCCAGCTTGGAGAACAGCCTGAGGGAGG
TGGAGGCCCGCTACGCCCTACAGATGGAGCAGCTCAACGGGATCCTGCTGCACCTTGAGTCAGAGCTGGCACAGACCCGGGCAGAGGGAC
AGCGCCAGGCCCAGGAGTATGAGGCCCTGCTGAACATCAAGGTCAAGCTGGAGGCTGAGATCGCCACCTACCGCCGCCTGCTGGAAGATG
GCGAGGACTTTAATCTTGGTGATGCCTTGGACAGCAGCAACTCCATGCAAACCATCCAAAAGACCACCACCCGCCGGATAGTGGATGGCA
AAGTGGTGTCTGAGACCAATGACACCAAAGTTCTGAGGCATTAAGCCAGCAGAAGCAGGGTACCCTTTGGGGAGCAGGAGGCCAATAAAA

>40535_40535_5_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000449880_KRT18_chr12_53345903_ENST00000388835_length(amino acids)=701AA_BP=102
MAGPRPSPWARLLLAALISVSLSGTLANRCKKAPVKSCTECVRVDKDCAYCTDEMFRDRRCNTQAELLAAGCQRESIVVMESSFQITEET
QIDTTLRRSQMSPQGLRVRLRPGEERHFELEVFEPLESPVDLYILMDFSNSMSDDLDNLKKMGQNLARVLSQLTSDYTIGFGKFVDKVSV
PQTDMRPEKLKEPWPNSDPPFSFKNVISLTEDVDEFRNKLQGERISGNLDAPEGGFDAILQTAVCTRDIGWRPDSTHLLVFSTESAFHYE
ADGANVLAGIMSRNDERCHLDTTGTYTQYRTQDYPSVPTLVRLLAKHNIIPIFAVTNYSYSYYEKLHTYFPVSSLGVLQEDSSNIVELLE
EAFNRIRSNLDIRALDSPRGLRTEVTSKMFQKTRTGSFHIRRGEVGIYQVQLRALEHVDGTHVCQLPEDQKGNIHLKPSFSDGLKMDAGI
ICDVCTCELQKEVRSARCSFNGDFVCGQCVCSEGWSGQTCNCSTGSLSDIQPCLREGEDKPCSGRGECQCGHCVCYGEGRYEGQFCEYDN
FQCPRTSGFLCNDRGRCSMGQCVCEPGWTGPSCDCPLSNATCIDSNGKASLENSLREVEARYALQMEQLNGILLHLESELAQTRAEGQRQ

--------------------------------------------------------------
>40535_40535_6_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000449880_KRT18_chr12_53345903_ENST00000388837_length(transcript)=2172nt_BP=1769nt
GGAAGAGGATGGCAGGGCCACGCCCCAGCCCATGGGCCAGGCTGCTCCTGGCAGCCTTGATCAGCGTCAGCCTCTCTGGGACCTTGGCAA
ACCGCTGCAAGAAGGCCCCAGTGAAGAGCTGCACGGAGTGTGTCCGTGTGGATAAGGACTGCGCCTACTGCACAGACGAGATGTTCAGGG
ACCGGCGCTGCAACACCCAGGCGGAGCTGCTGGCCGCGGGCTGCCAGCGGGAGAGCATCGTGGTCATGGAGAGCAGCTTCCAAATCACAG
AGGAGACCCAGATTGACACCACCCTGCGGCGCAGCCAGATGTCCCCCCAAGGCCTGCGGGTCCGTCTGCGGCCCGGTGAGGAGCGGCATT
TTGAGCTGGAGGTGTTTGAGCCACTGGAGAGCCCCGTGGACCTGTACATCCTCATGGACTTCTCCAACTCCATGTCCGATGATCTGGACA
ACCTCAAGAAGATGGGGCAGAACCTGGCTCGGGTCCTGAGCCAGCTCACCAGCGACTACACTATTGGATTTGGCAAGTTTGTGGACAAAG
TCAGCGTCCCGCAGACGGACATGAGGCCTGAGAAGCTGAAGGAGCCCTGGCCCAACAGTGACCCCCCCTTCTCCTTCAAGAACGTCATCA
GCCTGACAGAAGATGTGGATGAGTTCCGGAATAAACTGCAGGGAGAGCGGATCTCAGGCAACCTGGATGCTCCTGAGGGCGGCTTCGATG
CCATCCTGCAGACAGCTGTGTGCACGAGGGACATTGGCTGGCGCCCGGACAGCACCCACCTGCTGGTCTTCTCCACCGAGTCAGCCTTCC
ACTATGAGGCTGATGGCGCCAACGTGCTGGCTGGCATCATGAGCCGCAACGATGAACGGTGCCACCTGGACACCACGGGCACCTACACCC
AGTACAGGACACAGGACTACCCGTCGGTGCCCACCCTGGTGCGCCTGCTCGCCAAGCACAACATCATCCCCATCTTTGCTGTCACCAACT
ACTCCTATAGCTACTACGAGAAGCTTCACACCTATTTCCCTGTCTCCTCACTGGGGGTGCTGCAGGAGGACTCGTCCAACATCGTGGAGC
TGCTGGAGGAGGCCTTCAATCGGATCCGCTCCAACCTGGACATCCGGGCCCTAGACAGCCCCCGAGGCCTTCGGACAGAGGTCACCTCCA
AGATGTTCCAGAAGACGAGGACTGGGTCCTTTCACATCCGGCGGGGGGAAGTGGGTATATACCAGGTGCAGCTGCGGGCCCTTGAGCACG
TGGATGGGACGCACGTGTGCCAGCTGCCGGAGGACCAGAAGGGCAACATCCATCTGAAACCTTCCTTCTCCGACGGCCTCAAGATGGACG
CGGGCATCATCTGTGATGTGTGCACCTGCGAGCTGCAAAAAGAGGTGCGGTCAGCTCGCTGCAGCTTCAACGGAGACTTCGTGTGCGGAC
AGTGTGTGTGCAGCGAGGGCTGGAGTGGCCAGACCTGCAACTGCTCCACCGGCTCTCTGAGTGACATTCAGCCCTGCCTGCGGGAGGGCG
AGGACAAGCCGTGCTCCGGCCGTGGGGAGTGCCAGTGCGGGCACTGTGTGTGCTACGGCGAAGGCCGCTACGAGGGTCAGTTCTGCGAGT
ATGACAACTTCCAGTGTCCCCGCACTTCCGGGTTCCTCTGCAATGACCGAGGACGCTGCTCCATGGGCCAGTGTGTGTGTGAGCCTGGTT
GGACAGGCCCAAGCTGTGACTGTCCCCTCAGCAATGCCACCTGCATCGACAGCAATGGGAAGGCCAGCTTGGAGAACAGCCTGAGGGAGG
TGGAGGCCCGCTACGCCCTACAGATGGAGCAGCTCAACGGGATCCTGCTGCACCTTGAGTCAGAGCTGGCACAGACCCGGGCAGAGGGAC
AGCGCCAGGCCCAGGAGTATGAGGCCCTGCTGAACATCAAGGTCAAGCTGGAGGCTGAGATCGCCACCTACCGCCGCCTGCTGGAAGATG
GCGAGGACTTTAATCTTGGTGATGCCTTGGACAGCAGCAACTCCATGCAAACCATCCAAAAGACCACCACCCGCCGGATAGTGGATGGCA
AAGTGGTGTCTGAGACCAATGACACCAAAGTTCTGAGGCATTAAGCCAGCAGAAGCAGGGTACCCTTTGGGGAGCAGGAGGCCAATAAAA

>40535_40535_6_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000449880_KRT18_chr12_53345903_ENST00000388837_length(amino acids)=701AA_BP=102
MAGPRPSPWARLLLAALISVSLSGTLANRCKKAPVKSCTECVRVDKDCAYCTDEMFRDRRCNTQAELLAAGCQRESIVVMESSFQITEET
QIDTTLRRSQMSPQGLRVRLRPGEERHFELEVFEPLESPVDLYILMDFSNSMSDDLDNLKKMGQNLARVLSQLTSDYTIGFGKFVDKVSV
PQTDMRPEKLKEPWPNSDPPFSFKNVISLTEDVDEFRNKLQGERISGNLDAPEGGFDAILQTAVCTRDIGWRPDSTHLLVFSTESAFHYE
ADGANVLAGIMSRNDERCHLDTTGTYTQYRTQDYPSVPTLVRLLAKHNIIPIFAVTNYSYSYYEKLHTYFPVSSLGVLQEDSSNIVELLE
EAFNRIRSNLDIRALDSPRGLRTEVTSKMFQKTRTGSFHIRRGEVGIYQVQLRALEHVDGTHVCQLPEDQKGNIHLKPSFSDGLKMDAGI
ICDVCTCELQKEVRSARCSFNGDFVCGQCVCSEGWSGQTCNCSTGSLSDIQPCLREGEDKPCSGRGECQCGHCVCYGEGRYEGQFCEYDN
FQCPRTSGFLCNDRGRCSMGQCVCEPGWTGPSCDCPLSNATCIDSNGKASLENSLREVEARYALQMEQLNGILLHLESELAQTRAEGQRQ

--------------------------------------------------------------
>40535_40535_7_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000450894_KRT18_chr12_53345903_ENST00000388835_length(transcript)=2291nt_BP=1887nt
CGCCCGCGCGCTGCAGCCCCATCTCCTAGCGGCAGCCCAGGCGCGGAGGGAGCGAGTCCGCCCCGAGGTAGGTCCAGGACGGGCGCACAG
CAGCAGCCGAGGCTGGCCGGGAGAGGGAGGAAGAGGATGGCAGGGCCACGCCCCAGCCCATGGGCCAGGCTGCTCCTGGCAGCCTTGATC
AGCGTCAGCCTCTCTGGGACCTTGGCAAACCGCTGCAAGAAGGCCCCAGTGAAGAGCTGCACGGAGTGTGTCCGTGTGGATAAGGACTGC
GCCTACTGCACAGACGAGATGTTCAGGGACCGGCGCTGCAACACCCAGGCGGAGCTGCTGGCCGCGGGCTGCCAGCGGGAGAGCATCGTG
GTCATGGAGAGCAGCTTCCAAATCACAGAGGAGACCCAGATTGACACCACCCTGCGGCGCAGCCAGATGTCCCCCCAAGGCCTGCGGGTC
CGTCTGCGGCCCGGTGAGGAGCGGCATTTTGAGCTGGAGGTGTTTGAGCCACTGGAGAGCCCCGTGGACCTGTACATCCTCATGGACTTC
TCCAACTCCATGTCCGATGATCTGGACAACCTCAAGAAGATGGGGCAGAACCTGGCTCGGGTCCTGAGCCAGCTCACCAGCGACTACACT
ATTGGATTTGGCAAGTTTGTGGACAAAGTCAGCGTCCCGCAGACGGACATGAGGCCTGAGAAGCTGAAGGAGCCCTGGCCCAACAGTGAC
CCCCCCTTCTCCTTCAAGAACGTCATCAGCCTGACAGAAGATGTGGATGAGTTCCGGAATAAACTGCAGGGAGAGCGGATCTCAGGCAAC
CTGGATGCTCCTGAGGGCGGCTTCGATGCCATCCTGCAGACAGCTGTGTGCACGAGGGACATTGGCTGGCGCCCGGACAGCACCCACCTG
CTGGTCTTCTCCACCGAGTCAGCCTTCCACTATGAGGCTGATGGCGCCAACGTGCTGGCTGGCATCATGAGCCGCAACGATGAACGGTGC
CACCTGGACACCACGGGCACCTACACCCAGTACAGGACACAGGACTACCCGTCGGTGCCCACCCTGGTGCGCCTGCTCGCCAAGCACAAC
ATCATCCCCATCTTTGCTGTCACCAACTACTCCTATAGCTACTACGAGAAGCTTCACACCTATTTCCCTGTCTCCTCACTGGGGGTGCTG
CAGGAGGACTCGTCCAACATCGTGGAGCTGCTGGAGGAGGCCTTCAATCGGATCCGCTCCAACCTGGACATCCGGGCCCTAGACAGCCCC
CGAGGCCTTCGGACAGAGGTCACCTCCAAGATGTTCCAGAAGACGAGGACTGGGTCCTTTCACATCCGGCGGGGGGAAGTGGGTATATAC
CAGGTGCAGCTGCGGGCCCTTGAGCACGTGGATGGGACGCACGTGTGCCAGCTGCCGGAGGACCAGAAGGGCAACATCCATCTGAAACCT
TCCTTCTCCGACGGCCTCAAGATGGACGCGGGCATCATCTGTGATGTGTGCACCTGCGAGCTGCAAAAAGAGGTGCGGTCAGCTCGCTGC
AGCTTCAACGGAGACTTCGTGTGCGGACAGTGTGTGTGCAGCGAGGGCTGGAGTGGCCAGACCTGCAACTGCTCCACCGGCTCTCTGAGT
GACATTCAGCCCTGCCTGCGGGAGGGCGAGGACAAGCCGTGCTCCGGCCGTGGGGAGTGCCAGTGCGGGCACTGTGTGTGCTACGGCGAA
GGCCGCTACGAGGGTCAGTTCTGCGAGTATGACAACTTCCAGTGTCCCCGCACTTCCGGGTTCCTCTGCAATGACCGAGGACGCTGCTCC
ATGGGCCAGTGTGTGTGTGAGCCTGGTTGGACAGGCCCAAGCTGTGACTGTCCCCTCAGCAATGCCACCTGCATCGACAGCAATGGGAAG
GCCAGCTTGGAGAACAGCCTGAGGGAGGTGGAGGCCCGCTACGCCCTACAGATGGAGCAGCTCAACGGGATCCTGCTGCACCTTGAGTCA
GAGCTGGCACAGACCCGGGCAGAGGGACAGCGCCAGGCCCAGGAGTATGAGGCCCTGCTGAACATCAAGGTCAAGCTGGAGGCTGAGATC
GCCACCTACCGCCGCCTGCTGGAAGATGGCGAGGACTTTAATCTTGGTGATGCCTTGGACAGCAGCAACTCCATGCAAACCATCCAAAAG
ACCACCACCCGCCGGATAGTGGATGGCAAAGTGGTGTCTGAGACCAATGACACCAAAGTTCTGAGGCATTAAGCCAGCAGAAGCAGGGTA

>40535_40535_7_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000450894_KRT18_chr12_53345903_ENST00000388835_length(amino acids)=709AA_BP=110
MAGRGRKRMAGPRPSPWARLLLAALISVSLSGTLANRCKKAPVKSCTECVRVDKDCAYCTDEMFRDRRCNTQAELLAAGCQRESIVVMES
SFQITEETQIDTTLRRSQMSPQGLRVRLRPGEERHFELEVFEPLESPVDLYILMDFSNSMSDDLDNLKKMGQNLARVLSQLTSDYTIGFG
KFVDKVSVPQTDMRPEKLKEPWPNSDPPFSFKNVISLTEDVDEFRNKLQGERISGNLDAPEGGFDAILQTAVCTRDIGWRPDSTHLLVFS
TESAFHYEADGANVLAGIMSRNDERCHLDTTGTYTQYRTQDYPSVPTLVRLLAKHNIIPIFAVTNYSYSYYEKLHTYFPVSSLGVLQEDS
SNIVELLEEAFNRIRSNLDIRALDSPRGLRTEVTSKMFQKTRTGSFHIRRGEVGIYQVQLRALEHVDGTHVCQLPEDQKGNIHLKPSFSD
GLKMDAGIICDVCTCELQKEVRSARCSFNGDFVCGQCVCSEGWSGQTCNCSTGSLSDIQPCLREGEDKPCSGRGECQCGHCVCYGEGRYE
GQFCEYDNFQCPRTSGFLCNDRGRCSMGQCVCEPGWTGPSCDCPLSNATCIDSNGKASLENSLREVEARYALQMEQLNGILLHLESELAQ

--------------------------------------------------------------
>40535_40535_8_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000450894_KRT18_chr12_53345903_ENST00000388837_length(transcript)=2290nt_BP=1887nt
CGCCCGCGCGCTGCAGCCCCATCTCCTAGCGGCAGCCCAGGCGCGGAGGGAGCGAGTCCGCCCCGAGGTAGGTCCAGGACGGGCGCACAG
CAGCAGCCGAGGCTGGCCGGGAGAGGGAGGAAGAGGATGGCAGGGCCACGCCCCAGCCCATGGGCCAGGCTGCTCCTGGCAGCCTTGATC
AGCGTCAGCCTCTCTGGGACCTTGGCAAACCGCTGCAAGAAGGCCCCAGTGAAGAGCTGCACGGAGTGTGTCCGTGTGGATAAGGACTGC
GCCTACTGCACAGACGAGATGTTCAGGGACCGGCGCTGCAACACCCAGGCGGAGCTGCTGGCCGCGGGCTGCCAGCGGGAGAGCATCGTG
GTCATGGAGAGCAGCTTCCAAATCACAGAGGAGACCCAGATTGACACCACCCTGCGGCGCAGCCAGATGTCCCCCCAAGGCCTGCGGGTC
CGTCTGCGGCCCGGTGAGGAGCGGCATTTTGAGCTGGAGGTGTTTGAGCCACTGGAGAGCCCCGTGGACCTGTACATCCTCATGGACTTC
TCCAACTCCATGTCCGATGATCTGGACAACCTCAAGAAGATGGGGCAGAACCTGGCTCGGGTCCTGAGCCAGCTCACCAGCGACTACACT
ATTGGATTTGGCAAGTTTGTGGACAAAGTCAGCGTCCCGCAGACGGACATGAGGCCTGAGAAGCTGAAGGAGCCCTGGCCCAACAGTGAC
CCCCCCTTCTCCTTCAAGAACGTCATCAGCCTGACAGAAGATGTGGATGAGTTCCGGAATAAACTGCAGGGAGAGCGGATCTCAGGCAAC
CTGGATGCTCCTGAGGGCGGCTTCGATGCCATCCTGCAGACAGCTGTGTGCACGAGGGACATTGGCTGGCGCCCGGACAGCACCCACCTG
CTGGTCTTCTCCACCGAGTCAGCCTTCCACTATGAGGCTGATGGCGCCAACGTGCTGGCTGGCATCATGAGCCGCAACGATGAACGGTGC
CACCTGGACACCACGGGCACCTACACCCAGTACAGGACACAGGACTACCCGTCGGTGCCCACCCTGGTGCGCCTGCTCGCCAAGCACAAC
ATCATCCCCATCTTTGCTGTCACCAACTACTCCTATAGCTACTACGAGAAGCTTCACACCTATTTCCCTGTCTCCTCACTGGGGGTGCTG
CAGGAGGACTCGTCCAACATCGTGGAGCTGCTGGAGGAGGCCTTCAATCGGATCCGCTCCAACCTGGACATCCGGGCCCTAGACAGCCCC
CGAGGCCTTCGGACAGAGGTCACCTCCAAGATGTTCCAGAAGACGAGGACTGGGTCCTTTCACATCCGGCGGGGGGAAGTGGGTATATAC
CAGGTGCAGCTGCGGGCCCTTGAGCACGTGGATGGGACGCACGTGTGCCAGCTGCCGGAGGACCAGAAGGGCAACATCCATCTGAAACCT
TCCTTCTCCGACGGCCTCAAGATGGACGCGGGCATCATCTGTGATGTGTGCACCTGCGAGCTGCAAAAAGAGGTGCGGTCAGCTCGCTGC
AGCTTCAACGGAGACTTCGTGTGCGGACAGTGTGTGTGCAGCGAGGGCTGGAGTGGCCAGACCTGCAACTGCTCCACCGGCTCTCTGAGT
GACATTCAGCCCTGCCTGCGGGAGGGCGAGGACAAGCCGTGCTCCGGCCGTGGGGAGTGCCAGTGCGGGCACTGTGTGTGCTACGGCGAA
GGCCGCTACGAGGGTCAGTTCTGCGAGTATGACAACTTCCAGTGTCCCCGCACTTCCGGGTTCCTCTGCAATGACCGAGGACGCTGCTCC
ATGGGCCAGTGTGTGTGTGAGCCTGGTTGGACAGGCCCAAGCTGTGACTGTCCCCTCAGCAATGCCACCTGCATCGACAGCAATGGGAAG
GCCAGCTTGGAGAACAGCCTGAGGGAGGTGGAGGCCCGCTACGCCCTACAGATGGAGCAGCTCAACGGGATCCTGCTGCACCTTGAGTCA
GAGCTGGCACAGACCCGGGCAGAGGGACAGCGCCAGGCCCAGGAGTATGAGGCCCTGCTGAACATCAAGGTCAAGCTGGAGGCTGAGATC
GCCACCTACCGCCGCCTGCTGGAAGATGGCGAGGACTTTAATCTTGGTGATGCCTTGGACAGCAGCAACTCCATGCAAACCATCCAAAAG
ACCACCACCCGCCGGATAGTGGATGGCAAAGTGGTGTCTGAGACCAATGACACCAAAGTTCTGAGGCATTAAGCCAGCAGAAGCAGGGTA

>40535_40535_8_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000450894_KRT18_chr12_53345903_ENST00000388837_length(amino acids)=709AA_BP=110
MAGRGRKRMAGPRPSPWARLLLAALISVSLSGTLANRCKKAPVKSCTECVRVDKDCAYCTDEMFRDRRCNTQAELLAAGCQRESIVVMES
SFQITEETQIDTTLRRSQMSPQGLRVRLRPGEERHFELEVFEPLESPVDLYILMDFSNSMSDDLDNLKKMGQNLARVLSQLTSDYTIGFG
KFVDKVSVPQTDMRPEKLKEPWPNSDPPFSFKNVISLTEDVDEFRNKLQGERISGNLDAPEGGFDAILQTAVCTRDIGWRPDSTHLLVFS
TESAFHYEADGANVLAGIMSRNDERCHLDTTGTYTQYRTQDYPSVPTLVRLLAKHNIIPIFAVTNYSYSYYEKLHTYFPVSSLGVLQEDS
SNIVELLEEAFNRIRSNLDIRALDSPRGLRTEVTSKMFQKTRTGSFHIRRGEVGIYQVQLRALEHVDGTHVCQLPEDQKGNIHLKPSFSD
GLKMDAGIICDVCTCELQKEVRSARCSFNGDFVCGQCVCSEGWSGQTCNCSTGSLSDIQPCLREGEDKPCSGRGECQCGHCVCYGEGRYE
GQFCEYDNFQCPRTSGFLCNDRGRCSMGQCVCEPGWTGPSCDCPLSNATCIDSNGKASLENSLREVEARYALQMEQLNGILLHLESELAQ

--------------------------------------------------------------
>40535_40535_9_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000579662_KRT18_chr12_53345903_ENST00000388835_length(transcript)=2411nt_BP=2007nt
CCGGCGGCGGCACCCAGCTCCTGCCCCGACAGGTGCGCGCCGCGCGAAGGAATGCAGCCGGTCTGACTCACCAGCGCCTCCTTCCTACCT
GCGCGCCCGCCCCATAAAGCGCTGCCCGCCTCGTCCCCACCCCCCCAACCCCCGCGCCCGCCCTCGGACAGTCCCTGCTCGCCCGCGCGC
TGCAGCCCCATCTCCTAGCGGCAGCCCAGGCGCGGAGGGAGCGAGTCCGCCCCGAGGAGGAAGAGGATGGCAGGGCCACGCCCCAGCCCA
TGGGCCAGGCTGCTCCTGGCAGCCTTGATCAGCGTCAGCCTCTCTGGGACCTTGGCAAACCGCTGCAAGAAGGCCCCAGTGAAGAGCTGC
ACGGAGTGTGTCCGTGTGGATAAGGACTGCGCCTACTGCACAGACGAGATGTTCAGGGACCGGCGCTGCAACACCCAGGCGGAGCTGCTG
GCCGCGGGCTGCCAGCGGGAGAGCATCGTGGTCATGGAGAGCAGCTTCCAAATCACAGAGGAGACCCAGATTGACACCACCCTGCGGCGC
AGCCAGATGTCCCCCCAAGGCCTGCGGGTCCGTCTGCGGCCCGGTGAGGAGCGGCATTTTGAGCTGGAGGTGTTTGAGCCACTGGAGAGC
CCCGTGGACCTGTACATCCTCATGGACTTCTCCAACTCCATGTCCGATGATCTGGACAACCTCAAGAAGATGGGGCAGAACCTGGCTCGG
GTCCTGAGCCAGCTCACCAGCGACTACACTATTGGATTTGGCAAGTTTGTGGACAAAGTCAGCGTCCCGCAGACGGACATGAGGCCTGAG
AAGCTGAAGGAGCCCTGGCCCAACAGTGACCCCCCCTTCTCCTTCAAGAACGTCATCAGCCTGACAGAAGATGTGGATGAGTTCCGGAAT
AAACTGCAGGGAGAGCGGATCTCAGGCAACCTGGATGCTCCTGAGGGCGGCTTCGATGCCATCCTGCAGACAGCTGTGTGCACGAGGGAC
ATTGGCTGGCGCCCGGACAGCACCCACCTGCTGGTCTTCTCCACCGAGTCAGCCTTCCACTATGAGGCTGATGGCGCCAACGTGCTGGCT
GGCATCATGAGCCGCAACGATGAACGGTGCCACCTGGACACCACGGGCACCTACACCCAGTACAGGACACAGGACTACCCGTCGGTGCCC
ACCCTGGTGCGCCTGCTCGCCAAGCACAACATCATCCCCATCTTTGCTGTCACCAACTACTCCTATAGCTACTACGAGAAGCTTCACACC
TATTTCCCTGTCTCCTCACTGGGGGTGCTGCAGGAGGACTCGTCCAACATCGTGGAGCTGCTGGAGGAGGCCTTCAATCGGATCCGCTCC
AACCTGGACATCCGGGCCCTAGACAGCCCCCGAGGCCTTCGGACAGAGGTCACCTCCAAGATGTTCCAGAAGACGAGGACTGGGTCCTTT
CACATCCGGCGGGGGGAAGTGGGTATATACCAGGTGCAGCTGCGGGCCCTTGAGCACGTGGATGGGACGCACGTGTGCCAGCTGCCGGAG
GACCAGAAGGGCAACATCCATCTGAAACCTTCCTTCTCCGACGGCCTCAAGATGGACGCGGGCATCATCTGTGATGTGTGCACCTGCGAG
CTGCAAAAAGAGGTGCGGTCAGCTCGCTGCAGCTTCAACGGAGACTTCGTGTGCGGACAGTGTGTGTGCAGCGAGGGCTGGAGTGGCCAG
ACCTGCAACTGCTCCACCGGCTCTCTGAGTGACATTCAGCCCTGCCTGCGGGAGGGCGAGGACAAGCCGTGCTCCGGCCGTGGGGAGTGC
CAGTGCGGGCACTGTGTGTGCTACGGCGAAGGCCGCTACGAGGGTCAGTTCTGCGAGTATGACAACTTCCAGTGTCCCCGCACTTCCGGG
TTCCTCTGCAATGACCGAGGACGCTGCTCCATGGGCCAGTGTGTGTGTGAGCCTGGTTGGACAGGCCCAAGCTGTGACTGTCCCCTCAGC
AATGCCACCTGCATCGACAGCAATGGGAAGGCCAGCTTGGAGAACAGCCTGAGGGAGGTGGAGGCCCGCTACGCCCTACAGATGGAGCAG
CTCAACGGGATCCTGCTGCACCTTGAGTCAGAGCTGGCACAGACCCGGGCAGAGGGACAGCGCCAGGCCCAGGAGTATGAGGCCCTGCTG
AACATCAAGGTCAAGCTGGAGGCTGAGATCGCCACCTACCGCCGCCTGCTGGAAGATGGCGAGGACTTTAATCTTGGTGATGCCTTGGAC
AGCAGCAACTCCATGCAAACCATCCAAAAGACCACCACCCGCCGGATAGTGGATGGCAAAGTGGTGTCTGAGACCAATGACACCAAAGTT

>40535_40535_9_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000579662_KRT18_chr12_53345903_ENST00000388835_length(amino acids)=701AA_BP=102
MAGPRPSPWARLLLAALISVSLSGTLANRCKKAPVKSCTECVRVDKDCAYCTDEMFRDRRCNTQAELLAAGCQRESIVVMESSFQITEET
QIDTTLRRSQMSPQGLRVRLRPGEERHFELEVFEPLESPVDLYILMDFSNSMSDDLDNLKKMGQNLARVLSQLTSDYTIGFGKFVDKVSV
PQTDMRPEKLKEPWPNSDPPFSFKNVISLTEDVDEFRNKLQGERISGNLDAPEGGFDAILQTAVCTRDIGWRPDSTHLLVFSTESAFHYE
ADGANVLAGIMSRNDERCHLDTTGTYTQYRTQDYPSVPTLVRLLAKHNIIPIFAVTNYSYSYYEKLHTYFPVSSLGVLQEDSSNIVELLE
EAFNRIRSNLDIRALDSPRGLRTEVTSKMFQKTRTGSFHIRRGEVGIYQVQLRALEHVDGTHVCQLPEDQKGNIHLKPSFSDGLKMDAGI
ICDVCTCELQKEVRSARCSFNGDFVCGQCVCSEGWSGQTCNCSTGSLSDIQPCLREGEDKPCSGRGECQCGHCVCYGEGRYEGQFCEYDN
FQCPRTSGFLCNDRGRCSMGQCVCEPGWTGPSCDCPLSNATCIDSNGKASLENSLREVEARYALQMEQLNGILLHLESELAQTRAEGQRQ

--------------------------------------------------------------
>40535_40535_10_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000579662_KRT18_chr12_53345903_ENST00000388837_length(transcript)=2410nt_BP=2007nt
CCGGCGGCGGCACCCAGCTCCTGCCCCGACAGGTGCGCGCCGCGCGAAGGAATGCAGCCGGTCTGACTCACCAGCGCCTCCTTCCTACCT
GCGCGCCCGCCCCATAAAGCGCTGCCCGCCTCGTCCCCACCCCCCCAACCCCCGCGCCCGCCCTCGGACAGTCCCTGCTCGCCCGCGCGC
TGCAGCCCCATCTCCTAGCGGCAGCCCAGGCGCGGAGGGAGCGAGTCCGCCCCGAGGAGGAAGAGGATGGCAGGGCCACGCCCCAGCCCA
TGGGCCAGGCTGCTCCTGGCAGCCTTGATCAGCGTCAGCCTCTCTGGGACCTTGGCAAACCGCTGCAAGAAGGCCCCAGTGAAGAGCTGC
ACGGAGTGTGTCCGTGTGGATAAGGACTGCGCCTACTGCACAGACGAGATGTTCAGGGACCGGCGCTGCAACACCCAGGCGGAGCTGCTG
GCCGCGGGCTGCCAGCGGGAGAGCATCGTGGTCATGGAGAGCAGCTTCCAAATCACAGAGGAGACCCAGATTGACACCACCCTGCGGCGC
AGCCAGATGTCCCCCCAAGGCCTGCGGGTCCGTCTGCGGCCCGGTGAGGAGCGGCATTTTGAGCTGGAGGTGTTTGAGCCACTGGAGAGC
CCCGTGGACCTGTACATCCTCATGGACTTCTCCAACTCCATGTCCGATGATCTGGACAACCTCAAGAAGATGGGGCAGAACCTGGCTCGG
GTCCTGAGCCAGCTCACCAGCGACTACACTATTGGATTTGGCAAGTTTGTGGACAAAGTCAGCGTCCCGCAGACGGACATGAGGCCTGAG
AAGCTGAAGGAGCCCTGGCCCAACAGTGACCCCCCCTTCTCCTTCAAGAACGTCATCAGCCTGACAGAAGATGTGGATGAGTTCCGGAAT
AAACTGCAGGGAGAGCGGATCTCAGGCAACCTGGATGCTCCTGAGGGCGGCTTCGATGCCATCCTGCAGACAGCTGTGTGCACGAGGGAC
ATTGGCTGGCGCCCGGACAGCACCCACCTGCTGGTCTTCTCCACCGAGTCAGCCTTCCACTATGAGGCTGATGGCGCCAACGTGCTGGCT
GGCATCATGAGCCGCAACGATGAACGGTGCCACCTGGACACCACGGGCACCTACACCCAGTACAGGACACAGGACTACCCGTCGGTGCCC
ACCCTGGTGCGCCTGCTCGCCAAGCACAACATCATCCCCATCTTTGCTGTCACCAACTACTCCTATAGCTACTACGAGAAGCTTCACACC
TATTTCCCTGTCTCCTCACTGGGGGTGCTGCAGGAGGACTCGTCCAACATCGTGGAGCTGCTGGAGGAGGCCTTCAATCGGATCCGCTCC
AACCTGGACATCCGGGCCCTAGACAGCCCCCGAGGCCTTCGGACAGAGGTCACCTCCAAGATGTTCCAGAAGACGAGGACTGGGTCCTTT
CACATCCGGCGGGGGGAAGTGGGTATATACCAGGTGCAGCTGCGGGCCCTTGAGCACGTGGATGGGACGCACGTGTGCCAGCTGCCGGAG
GACCAGAAGGGCAACATCCATCTGAAACCTTCCTTCTCCGACGGCCTCAAGATGGACGCGGGCATCATCTGTGATGTGTGCACCTGCGAG
CTGCAAAAAGAGGTGCGGTCAGCTCGCTGCAGCTTCAACGGAGACTTCGTGTGCGGACAGTGTGTGTGCAGCGAGGGCTGGAGTGGCCAG
ACCTGCAACTGCTCCACCGGCTCTCTGAGTGACATTCAGCCCTGCCTGCGGGAGGGCGAGGACAAGCCGTGCTCCGGCCGTGGGGAGTGC
CAGTGCGGGCACTGTGTGTGCTACGGCGAAGGCCGCTACGAGGGTCAGTTCTGCGAGTATGACAACTTCCAGTGTCCCCGCACTTCCGGG
TTCCTCTGCAATGACCGAGGACGCTGCTCCATGGGCCAGTGTGTGTGTGAGCCTGGTTGGACAGGCCCAAGCTGTGACTGTCCCCTCAGC
AATGCCACCTGCATCGACAGCAATGGGAAGGCCAGCTTGGAGAACAGCCTGAGGGAGGTGGAGGCCCGCTACGCCCTACAGATGGAGCAG
CTCAACGGGATCCTGCTGCACCTTGAGTCAGAGCTGGCACAGACCCGGGCAGAGGGACAGCGCCAGGCCCAGGAGTATGAGGCCCTGCTG
AACATCAAGGTCAAGCTGGAGGCTGAGATCGCCACCTACCGCCGCCTGCTGGAAGATGGCGAGGACTTTAATCTTGGTGATGCCTTGGAC
AGCAGCAACTCCATGCAAACCATCCAAAAGACCACCACCCGCCGGATAGTGGATGGCAAAGTGGTGTCTGAGACCAATGACACCAAAGTT

>40535_40535_10_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000579662_KRT18_chr12_53345903_ENST00000388837_length(amino acids)=701AA_BP=102
MAGPRPSPWARLLLAALISVSLSGTLANRCKKAPVKSCTECVRVDKDCAYCTDEMFRDRRCNTQAELLAAGCQRESIVVMESSFQITEET
QIDTTLRRSQMSPQGLRVRLRPGEERHFELEVFEPLESPVDLYILMDFSNSMSDDLDNLKKMGQNLARVLSQLTSDYTIGFGKFVDKVSV
PQTDMRPEKLKEPWPNSDPPFSFKNVISLTEDVDEFRNKLQGERISGNLDAPEGGFDAILQTAVCTRDIGWRPDSTHLLVFSTESAFHYE
ADGANVLAGIMSRNDERCHLDTTGTYTQYRTQDYPSVPTLVRLLAKHNIIPIFAVTNYSYSYYEKLHTYFPVSSLGVLQEDSSNIVELLE
EAFNRIRSNLDIRALDSPRGLRTEVTSKMFQKTRTGSFHIRRGEVGIYQVQLRALEHVDGTHVCQLPEDQKGNIHLKPSFSDGLKMDAGI
ICDVCTCELQKEVRSARCSFNGDFVCGQCVCSEGWSGQTCNCSTGSLSDIQPCLREGEDKPCSGRGECQCGHCVCYGEGRYEGQFCEYDN
FQCPRTSGFLCNDRGRCSMGQCVCEPGWTGPSCDCPLSNATCIDSNGKASLENSLREVEARYALQMEQLNGILLHLESELAQTRAEGQRQ

--------------------------------------------------------------
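
The sequence records above all use the same underscore-delimited header layout (fusion ID, record index, fusion name, then gene, chromosome, breakpoint coordinate and Ensembl transcript for each partner, followed by a length and a BP value). A minimal Python sketch of how such a header might be split into fields is shown here. The function name `parse_header`, the field names, and the regular expression are illustrative assumptions, not part of FusionGDB, and the pattern assumes that gene symbols contain no underscores.

```python
import re

# Hypothetical pattern inferred from the header lines in this record;
# it is not an official FusionGDB format specification.
HEADER_RE = re.compile(
    r"^>?(?P<fusion_id>\d+)_(?P<fusion_id_repeat>\d+)_(?P<index>\d+)_"
    r"(?P<fusion>[^_]+)_"
    r"(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)_"
    r"length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?:nt|AA)_"
    r"BP=(?P<bp>\d+)(?:nt)?$"
)


def parse_header(line):
    """Split one header line into a dict of named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    record = match.groupdict()
    # Convert the purely numeric fields to integers.
    for key in ("index", "hbp", "tbp", "length", "bp"):
        record[key] = int(record[key])
    return record


example = (
    ">40535_40535_3_ITGB4-KRT18_ITGB4_chr17_73732235_ENST00000339591_"
    "KRT18_chr12_53345903_ENST00000388835_length(amino acids)=709AA_BP=110"
)
record = parse_header(example)
print(record["hgene"], record["tgene"], record["kind"], record["length"], record["bp"])
```

Keeping the header pattern in one compiled expression makes it easy to reject lines that do not follow the layout instead of silently mis-splitting them.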


Fusion Gene PPI Analysis for ITGB4-KRT18


Go to ChiPPI (Chimeric Protein-Protein Interactions) to see the chimeric PPI interactions on the ChiPPI page.


Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene | Hgene's interactors | Tgene | Tgene's interactors


- Retained PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with


- Lost PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with
Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388835 |  | 4 | 7 | 243_391 | 316.0 | 431.0 | DNAJB6
Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388837 |  | 5 | 8 | 243_391 | 316.0 | 431.0 | DNAJB6
Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388835 |  | 4 | 7 | 77_128 | 316.0 | 431.0 | TRADD
Tgene | KRT18 | chr17:73732235 | chr12:53345903 | ENST00000388837 |  | 5 | 8 | 77_128 | 316.0 | 431.0 | TRADD
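
One plausible reading of the rows above is that an interaction is reported as lost when the protein feature mediating it does not lie entirely within the part of the partner protein kept in the fusion. The Python sketch below illustrates that reading only. The rule itself (the Hgene keeps residues up to the breakpoint locus, the Tgene keeps residues after it) is an assumption about how the table might be derived, and `feature_retained` is a hypothetical helper, not a FusionGDB function.

```python
def feature_retained(partner, feat_start, feat_end, bp_locus):
    """Return True if the feature lies wholly in the retained region.

    Assumption: the 5' partner (Hgene) keeps residues up to bp_locus,
    and the 3' partner (Tgene) keeps residues after bp_locus.
    """
    if partner == "Hgene":
        return feat_end <= bp_locus
    if partner == "Tgene":
        return feat_start > bp_locus
    raise ValueError(f"unknown partner: {partner!r}")


# (partner, feature start, feature end, BPloci, interactor) from the table rows
rows = [
    ("Tgene", 243, 391, 316.0, "DNAJB6"),
    ("Tgene", 77, 128, 316.0, "TRADD"),
]

lost = [name for partner, start, end, bp, name in rows
        if not feature_retained(partner, start, end, bp)]
print(lost)
```

Under this assumed rule, the 77_128 feature falls wholly before locus 316 and the 243_391 feature straddles it, so neither counts as retained in the Tgene portion.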


- Retained PPIs, but lost function due to frame-shift fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with



Related Drugs for ITGB4-KRT18


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status


Related Diseases for ITGB4-KRT18


Diseases associated with fusion partners.
(DisGeNET 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source