FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:KRTCAP3-BAZ1A (FusionGDB2 ID:43713)

Fusion Gene Summary for KRTCAP3-BAZ1A

check button Fusion gene summary
Fusion gene informationFusion gene name: KRTCAP3-BAZ1A
Fusion gene ID: 43713
HgeneTgene
Gene symbol

KRTCAP3

BAZ1A

Gene ID

200634

11177

Gene namekeratinocyte associated protein 3bromodomain adjacent to zinc finger domain 1A
SynonymsKCP3ACF1|WALp1|WCRF180|hACF1
Cytomap

2p23.3

14q13.1-q13.2

Type of geneprotein-codingprotein-coding
Descriptionkeratinocyte-associated protein 3keratinocytes associated protein 3bromodomain adjacent to zinc finger domain protein 1AATP-dependent chromatin remodeling proteinATP-utilizing chromatin assembly and remodeling factor 1CHRAC subunit ACF1hWALp1williams syndrome transcription factor-related chromatin-remodeling factor
Modification date2020031320200313
UniProtAcc

Q53RY4

Q9NRL2

Ensembl transtripts involved in fusion geneENST00000288873, ENST00000407293, 
ENST00000543753, 
ENST00000553853, 
ENST00000358716, ENST00000360310, 
ENST00000382422, 
Fusion gene scores* DoF score2 X 1 X 2=48 X 7 X 5=280
# samples 29
** MAII scorelog2(2/4*10)=2.32192809488736log2(9/280*10)=-1.63742992061529
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: KRTCAP3 [Title/Abstract] AND BAZ1A [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointKRTCAP3(27666399)-BAZ1A(35272194), # samples:1
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
TgeneBAZ1A

GO:0006261

DNA-dependent DNA replication

12434153


check buttonFusion gene breakpoints across KRTCAP3 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across BAZ1A (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4STADTCGA-BR-A4QI-01AKRTCAP3chr2

27666399

+BAZ1Achr14

35272194

-


Top

Fusion Gene ORF analysis for KRTCAP3-BAZ1A

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000288873ENST00000553853KRTCAP3chr2

27666399

+BAZ1Achr14

35272194

-
5CDS-intronENST00000407293ENST00000553853KRTCAP3chr2

27666399

+BAZ1Achr14

35272194

-
5CDS-intronENST00000543753ENST00000553853KRTCAP3chr2

27666399

+BAZ1Achr14

35272194

-
In-frameENST00000288873ENST00000358716KRTCAP3chr2

27666399

+BAZ1Achr14

35272194

-
In-frameENST00000288873ENST00000360310KRTCAP3chr2

27666399

+BAZ1Achr14

35272194

-
In-frameENST00000288873ENST00000382422KRTCAP3chr2

27666399

+BAZ1Achr14

35272194

-
In-frameENST00000407293ENST00000358716KRTCAP3chr2

27666399

+BAZ1Achr14

35272194

-
In-frameENST00000407293ENST00000360310KRTCAP3chr2

27666399

+BAZ1Achr14

35272194

-
In-frameENST00000407293ENST00000382422KRTCAP3chr2

27666399

+BAZ1Achr14

35272194

-
In-frameENST00000543753ENST00000358716KRTCAP3chr2

27666399

+BAZ1Achr14

35272194

-
In-frameENST00000543753ENST00000360310KRTCAP3chr2

27666399

+BAZ1Achr14

35272194

-
In-frameENST00000543753ENST00000382422KRTCAP3chr2

27666399

+BAZ1Achr14

35272194

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000543753KRTCAP3chr227666399+ENST00000358716BAZ1Achr1435272194-52886624745101487
ENST00000543753KRTCAP3chr227666399+ENST00000382422BAZ1Achr1435272194-53836624746061519
ENST00000543753KRTCAP3chr227666399+ENST00000360310BAZ1Achr1435272194-53786624746061519
ENST00000288873KRTCAP3chr227666399+ENST00000358716BAZ1Achr1435272194-52736473244951487
ENST00000288873KRTCAP3chr227666399+ENST00000382422BAZ1Achr1435272194-53686473245911519
ENST00000288873KRTCAP3chr227666399+ENST00000360310BAZ1Achr1435272194-53636473245911519
ENST00000407293KRTCAP3chr227666399+ENST00000358716BAZ1Achr1435272194-534672015645681470
ENST00000407293KRTCAP3chr227666399+ENST00000382422BAZ1Achr1435272194-544172015646641502
ENST00000407293KRTCAP3chr227666399+ENST00000360310BAZ1Achr1435272194-543672015646641502

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000543753ENST00000358716KRTCAP3chr227666399+BAZ1Achr1435272194-0.0002660220.999734
ENST00000543753ENST00000382422KRTCAP3chr227666399+BAZ1Achr1435272194-0.0003044070.99969566
ENST00000543753ENST00000360310KRTCAP3chr227666399+BAZ1Achr1435272194-0.0003070880.9996929
ENST00000288873ENST00000358716KRTCAP3chr227666399+BAZ1Achr1435272194-0.0002679080.9997321
ENST00000288873ENST00000382422KRTCAP3chr227666399+BAZ1Achr1435272194-0.0003069830.99969304
ENST00000288873ENST00000360310KRTCAP3chr227666399+BAZ1Achr1435272194-0.0003106220.99968946
ENST00000407293ENST00000358716KRTCAP3chr227666399+BAZ1Achr1435272194-0.0003065580.9996935
ENST00000407293ENST00000382422KRTCAP3chr227666399+BAZ1Achr1435272194-0.0003420010.999658
ENST00000407293ENST00000360310KRTCAP3chr227666399+BAZ1Achr1435272194-0.0003470580.999653

Top

Fusion Genomic Features for KRTCAP3-BAZ1A


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for KRTCAP3-BAZ1A


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr2:27666399/chr14:35272194)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
KRTCAP3

Q53RY4

BAZ1A

Q9NRL2

FUNCTION: Component of the ACF complex, an ATP-dependent chromatin remodeling complex, that regulates spacing of nucleosomes using ATP to generate evenly spaced nucleosomes along the chromatin. The ATPase activity of the complex is regulated by the length of flanking DNA. Also involved in facilitating the DNA replication process. BAZ1A is the accessory, non-catalytic subunit of the complex which can enhance and direct the process provided by the ATPase subunit, SMARCA5, probably through targeting pericentromeric heterochromatin in late S phase. Moves end-positioned nucleosomes to a predominantly central position. May have a role in nuclear receptor-mediated transcription repression.; FUNCTION: Component of the histone-fold protein complex CHRAC complex which facilitates nucleosome sliding by the ACF complex and enhances ACF-mediated chromatin assembly. The C-terminal regions of both CHRAC1 and POLE1 are required for these functions.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneKRTCAP3chr2:27666399chr14:35272194ENST00000288873+57163_183205258.3333333333333TransmembraneHelical
HgeneKRTCAP3chr2:27666399chr14:35272194ENST00000288873+5721_41205258.3333333333333TransmembraneHelical
HgeneKRTCAP3chr2:27666399chr14:35272194ENST00000288873+5763_83205258.3333333333333TransmembraneHelical
HgeneKRTCAP3chr2:27666399chr14:35272194ENST00000288873+5794_114205258.3333333333333TransmembraneHelical
HgeneKRTCAP3chr2:27666399chr14:35272194ENST00000407293+46163_183187153.33333333333334TransmembraneHelical
HgeneKRTCAP3chr2:27666399chr14:35272194ENST00000407293+4621_41187153.33333333333334TransmembraneHelical
HgeneKRTCAP3chr2:27666399chr14:35272194ENST00000407293+4663_83187153.33333333333334TransmembraneHelical
HgeneKRTCAP3chr2:27666399chr14:35272194ENST00000407293+4694_114187153.33333333333334TransmembraneHelical
HgeneKRTCAP3chr2:27666399chr14:35272194ENST00000543753+57163_183205276.3333333333333TransmembraneHelical
HgeneKRTCAP3chr2:27666399chr14:35272194ENST00000543753+5721_41205276.3333333333333TransmembraneHelical
HgeneKRTCAP3chr2:27666399chr14:35272194ENST00000543753+5763_83205276.3333333333333TransmembraneHelical
HgeneKRTCAP3chr2:27666399chr14:35272194ENST00000543753+5794_114205276.3333333333333TransmembraneHelical
TgeneBAZ1Achr2:27666399chr14:35272194ENST00000358716526306_3972421525.0Coiled coilOntology_term=ECO:0000255
TgeneBAZ1Achr2:27666399chr14:35272194ENST00000358716526634_7092421525.0Coiled coilOntology_term=ECO:0000255
TgeneBAZ1Achr2:27666399chr14:35272194ENST00000360310527306_3972421557.0Coiled coilOntology_term=ECO:0000255
TgeneBAZ1Achr2:27666399chr14:35272194ENST00000360310527634_7092421557.0Coiled coilOntology_term=ECO:0000255
TgeneBAZ1Achr2:27666399chr14:35272194ENST00000382422426306_3972421557.0Coiled coilOntology_term=ECO:0000255
TgeneBAZ1Achr2:27666399chr14:35272194ENST00000382422426634_7092421557.0Coiled coilOntology_term=ECO:0000255
TgeneBAZ1Achr2:27666399chr14:35272194ENST000003587165261239_12572421525.0Compositional biasNote=Glu-rich
TgeneBAZ1Achr2:27666399chr14:35272194ENST00000358716526487_4912421525.0Compositional biasNote=Poly-Glu
TgeneBAZ1Achr2:27666399chr14:35272194ENST000003603105271239_12572421557.0Compositional biasNote=Glu-rich
TgeneBAZ1Achr2:27666399chr14:35272194ENST00000360310527487_4912421557.0Compositional biasNote=Poly-Glu
TgeneBAZ1Achr2:27666399chr14:35272194ENST000003824224261239_12572421557.0Compositional biasNote=Glu-rich
TgeneBAZ1Achr2:27666399chr14:35272194ENST00000382422426487_4912421557.0Compositional biasNote=Poly-Glu
TgeneBAZ1Achr2:27666399chr14:35272194ENST000003587165261446_15162421525.0DomainBromo
TgeneBAZ1Achr2:27666399chr14:35272194ENST00000358716526422_4872421525.0DomainDDT
TgeneBAZ1Achr2:27666399chr14:35272194ENST000003603105271446_15162421557.0DomainBromo
TgeneBAZ1Achr2:27666399chr14:35272194ENST00000360310527422_4872421557.0DomainDDT
TgeneBAZ1Achr2:27666399chr14:35272194ENST000003824224261446_15162421557.0DomainBromo
TgeneBAZ1Achr2:27666399chr14:35272194ENST00000382422426422_4872421557.0DomainDDT
TgeneBAZ1Achr2:27666399chr14:35272194ENST000003587165261148_11982421525.0Zinc fingerPHD-type
TgeneBAZ1Achr2:27666399chr14:35272194ENST000003603105271148_11982421557.0Zinc fingerPHD-type
TgeneBAZ1Achr2:27666399chr14:35272194ENST000003824224261148_11982421557.0Zinc fingerPHD-type

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneBAZ1Achr2:27666399chr14:35272194ENST0000035871652622_1282421525.0DomainWAC
TgeneBAZ1Achr2:27666399chr14:35272194ENST0000036031052722_1282421557.0DomainWAC
TgeneBAZ1Achr2:27666399chr14:35272194ENST0000038242242622_1282421557.0DomainWAC
TgeneBAZ1Achr2:27666399chr14:35272194ENST000003587165261_1282421525.0RegionNote=Required for association with the CHRAC1/POLE3 complex
TgeneBAZ1Achr2:27666399chr14:35272194ENST000003603105271_1282421557.0RegionNote=Required for association with the CHRAC1/POLE3 complex
TgeneBAZ1Achr2:27666399chr14:35272194ENST000003824224261_1282421557.0RegionNote=Required for association with the CHRAC1/POLE3 complex


Top

Fusion Gene Sequence for KRTCAP3-BAZ1A


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>43713_43713_1_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000288873_BAZ1A_chr14_35272194_ENST00000358716_length(transcript)=5273nt_BP=647nt
ACAGCGGCCCTGCGGCTGGCGCGGCGGACGGGATGAGGCGCTGCAGTCTCTGCGCTTTCGACGCCGCCCGGGGGCCCAGGCGGCTGATGC
GTGTGGGCCTCGCGCTGATCTTGGTGGGCCACGTGAACCTGCTGCTGGGGGCCGTGCTGCATGGCACCGTCCTGCGGCACGTGGCCAATC
CCCGCGGCGCTGTCACGCCGGAGTACACCGTAGCCAATGTCATCTCTGTCGGCTCGGGGCTGCTGAGCGTTTCCGTGGGACTTGTGGCCC
TCCTGGCGTCCAGGAACCTTCTTCGCCCTCCACTGCACTGGGTCCTGCTGGCACTAGCTCTGGTGAACCTGCTCTTGTCCGTTGCCTGCT
CCCTGGGCCTCCTTCTTGCTGTGTCACTCACTGTGGCCAACGGTGGCCGCCGCCTTATTGCTGACTGCCACCCAGGACTGCTGGATCCTC
TGGTACCACTGGATGAGGGGCCGGGACATACTGACTGCCCCTTTGACCCCACAAGAATCTATGATACAGCCTTGGCTCTCTGGATCCCTT
CTTTGCTCATGTCTGCAGGGGAGGCTGCTCTATCTGGTTACTGCTGTGTGGCTGCACTCACTCTACGTGGAGTTGGGCCCTGCAGGAAGG
ACGGACTTCAGGGGCAGGCATCATCTCTTTCAACGTATAAAATAGCAGAACAAGATTTTTCTTATTTCTTCCCTGATGATCCACCCACAT
TTATCTTCAGTCCTGCTAACAGACGAAGAGGGAGACCTCCCAAACGAATACATATTAGTCAAGAGGACAATGTTGCTAATAAACAGACTC
TTGCAAGTTATAGGAGCAAAGCTACTAAAGAAAGAGATAAACTTTTGAAACAAGAAGAAATGAAGTCACTGGCTTTTGAAAAGGCTAAAT
TAAAAAGAGAAAAAGCAGATGCCCTAGAAGCGAAGAAAAAAGAAAAAGAAGATAAAGAGAAAAAGAGGGAAGAATTGAAAAAAATTGTTG
AAGAAGAGAGACTAAAGAAAAAAGAAGAAAAAGAGAGGCTTAAAGTAGAAAGAGAAAAGGAAAGAGAGAAGTTACGTGAAGAAAAGCGAA
AGTATGTGGAATACTTAAAACAGTGGAGTAAACCTAGAGAAGATATGGAATGTGATGACCTTAAGGAACTTCCAGAACCAACACCAGTGA
AAACTAGACTACCTCCTGAAATCTTTGGTGATGCTCTGATGGTTTTGGAGTTCCTTAATGCATTTGGGGAACTTTTTGATCTTCAAGATG
AGTTTCCTGATGGAGTAACCCTAGAAGTATTAGAGGAAGCTCTTGTAGGAAATGACAGTGAAGGCCCACTGTGTGAATTGCTTTTTTTCT
TCCTGACTGCAATCTTCCAGGCAATAGCTGAAGAAGAAGAGGAAGTAGCCAAAGAGCAACTAACTGATGCTGACACCAAAGGCTGCAGTT
TGAAAAGTTTGGATCTTGATAGCTGCACTCTTTCAGAAATCCTCAGACTGCACATCTTAGCTTCAGGTGCTGATGTAACATCAGCAAATG
CAAAGTATAGATATCAAAAACGAGGAGGATTTGATGCTACAGATGATGCTTGTATGGAGCTTCGTTTGAGCAATCCCAGTCTAGTGAAGA
AACTGTCAAGCACCTCAGTGTATGATTTGACACCAGGAGAAAAAATGAAGATACTCCATGCTCTCTGTGGAAAGCTACTGACCCTAGTTT
CAACTAGGGATTTTATTGAAGATTATGTTGATATATTACGACAGGCAAAGCAGGAGTTCCGGGAATTAAAAGCAGAACAACATCGAAAAG
AGAGGGAAGAAGCAGCTGCCAGAATTCGTAAAAGGAAGGAAGAAAAACTTAAGGAGCAAGAACAAAAAATGAAAGAGAAACAAGAAAAAC
TGAAAGAAGATGAGCAAAGAAATTCAACGGCAGATATATCTATTGGGGAGGAAGAAAGGGAAGATTTTGATACTAGCATTGAGAGCAAAG
ACACAGAGCAAAAGGAATTAGATCAAGATATGGTCACTGAAGATGAAGATGACCCAGGATCACATAAAAGAGGCAGAAGGGGGAAAAGAG
GACAAAATGGATTTAAAGAATTTACAAGGCAAGAACAGATCAACTGTGTAACAAGAGAGCCTCTTACTGCTGATGAGGAAGAAGCATTAA
AACAGGAACACCAACGAAAAGAGAAAGAGCTCTTAGAAAAAATCCAAAGTGCCATAGCCTGTACCAATATCTTTCCCTTGGGTCGCGACC
GCATGTATAGACGATACTGGATTTTCCCTTCTATTCCTGGACTCTTTATTGAAGAGGATTATTCTGGTCTTACTGAAGACATGCTGTTGC
CTAGACCTTCATCATTTCAGAATAATGTACAGTCTCAAGATCCTCAGGTATCCACTAAAACTGGAGAGCCTTTGATGTCTGAATCTACCT
CCAACATTGACCAAGGTCCACGTGACCATTCTGTGCAGCTGCCAAAACCAGTGCATAAGCCAAATCGGTGGTGCTTTTACAGTTCTTGTG
AACAGCTAGACCAGCTTATTGAAGCTCTTAATTCTAGAGGACATAGAGAAAGTGCCTTAAAAGAAACTTTGTTACAAGAGAAAAGCAGAA
TATGTGCACAGCTAGCCCGTTTTTCTGAAGAGAAATTTCATTTTTCAGACAAACCTCAGCCTGATAGCAAACCAACATATAGTCGGGGAA
GATCTTCCAATGCATATGATCCATCTCAGATGTGTGCAGAAAAGCAACTTGAACTAAGGCTGAGAGATTTTCTTTTAGATATTGAAGATA
GAATCTACCAAGGAACATTAGGAGCCATCAAGGTTACAGATCGACATATCTGGAGATCAGCATTAGAAAGTGGACGGTATGAGCTGTTAA
GTGAGGAAAACAAGGAAAATGGGATAATTAAAACTGTGAATGAAGACGTAGAAGAGATGGAAATTGATGAACAAACAAAGGTCATAGTAA
AAGACAGACTTTTGGGGATAAAAACAGAAACTCCAAGTACTGTATCAACAAATGCAAGTACACCACAATCAGTGAGCAGTGTGGTTCATT
ATCTGGCAATGGCACTCTTTCAAATAGAGCAGGGCATTGAGCGGCGTTTTCTGAAAGCTCCACTTGATGCCAGTGACAGTGGGCGTTCTT
ATAAAACAGTTCTGGACCGTTGGAGAGAGTCTCTCCTTTCTTCTGCTAGTCTATCCCAAGTTTTTCTTCACCTATCCACCTTGGATCGTA
GCGTGATATGGTCTAAATCTATACTGAATGCGCGTTGCAAGATATGTCGAAAGAAAGGCGATGCTGAAAACATGGTTCTTTGTGATGGCT
GTGATAGGGGTCATCATACCTACTGTGTTCGACCAAAGCTCAAGACTGTGCCTGAAGGAGACTGGTTTTGTCCAGAATGTCGACCAAAGC
AACGTTCTAGAAGACTCTCCTCTAGACAGAGACCATCCTTGGAAAGTGATGAAGATGTGGAAGACAGTATGGGAGGTGAGGATGATGAAG
TTGATGGCGATGAAGAAGAAGGTCAAAGTGAGGAGGAAGAGTATGAGGTAGAACAAGATGAAGATGACTCTCAAGAAGAGGAAGAAGTCA
GCCTACCCAAACGAGGAAGACCACAAGTTAGATTGCCAGTTAAAACAAGAGGGAAACTTAGCTCTTCTTTCTCAAGTCGTGGCCAACAAC
AAGAACCTGGAAGATACCCTTCAAGGAGTCAGCAGAGCACACCCAAAACAACTGTTTCTTCTAAAACTGGTAGAAGCCTAAGAAAGATAA
ACTCTGCTCCTCCTACAGAAACAAAATCTTTAAGAATTGCCAGTCGTTCTACTCGCCACAGTCATGGCCCACTGCAAGCAGATGTATTTG
TGGAATTGCTTAGTCCTCGTAGAAAACGCAGAGGCAGGAAAAGTGCTAATAATACACCAGAAAATAGTCCCAACTTCCCTAACTTCAGAG
TCATTGCCACAAAGTCAAGTGAACAGTCAAGATCTGTAAATATTGCTTCAAAACTTTCTCTCCAAGAGAGTGAATCCAAAAGAAGATGCA
GAAAAAGACAATCTCCAGAGCCATCGCCTGTGACACTGGGTCGAAGGAGTTCTGGCCGACAGGGAGGAGTTCATGAATTGTCTGCTTTTG
AACAACTTGTTGTAGAATTGGTACGACATGATGACAGCTGGCCTTTTTTGAAACTTGTTTCTAAAATCCAGGTCCCAGACTACTATGACA
TCATCAAAAAGCCCATTGCCTTAAATATAATTCGTGAAAAAGTGAATAAGTGTGAATATAAATTAGCATCTGAGTTTATTGATGACATTG
AGTTAATGTTTTCGAACTGCTTTGAATACAACCCTCGTAACACAAGTGAAGCAAAAGCTGGAACTAGGCTTCAAGCATTTTTTCATATTC
AGGCTCAAAAGCTTGGACTCCACGTCACACCCAGTAATGTGGACCAAGTTAGCACACCACCGGCTGCGAAAAAGTCACGAATCTGACTTT
GTCCTTCTAAAGGATATATTTGAAGAAAAACAAATTGTTCATGAAAATGGAACATTAAATCATGCTGTATAAAGCAATAACAATTGATTG
ACCACATGAAAGTGTGGCCTGCACTATATTCTCAATTTTAATATTAAGCACTCAGGAGAATGTAGGAAAGATATCCTTTGCTACAGTTTT
GTTCAGTATCTAATAAGTTTGATAGATGTATTGGATACAGTACTGGTTTACAGAGGTTTTTGTACATTTTTGAGATCATTCATGTGTCCA
GAGATCTTGGAAAATATTTTTTCACCCACGATTTATTTTGTTATTGATGATTTTTTTTTAAAGTGGTGGTATTAAGGGAGAGTTATCTAC
ATGGATGAGTCTTCCGCTATAGCACAGTTTAGAAAAGGTGTTTATGTCTTAATTAATTGTTTGAGTACATTCTTTCAACACTACACATGA
ATGAATCCAATCTTATAACCTTGAAGTGCTGTACCAGTGCTGGCTGCAGGTATTAAGTCCAAGTTTATTAACTAGATATTTATTTAGTAT
TGAGAGTAATTTGTGAATTTGTTTTGTATTTATAAAATTTATACCTGAAAAATGTTCCTTAATGTTTTAAACCTTTTACTGTGTTTTTAT
TCCTCTAACTTCCTTAATGATCAATCAAAAAAAGTAACACCCTCCCTTTTTCCTGACAGTTCTTTCAGCTTTACAGAACTGTATTATAAG

>43713_43713_1_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000288873_BAZ1A_chr14_35272194_ENST00000358716_length(amino acids)=1487AA_BP=205
MRRCSLCAFDAARGPRRLMRVGLALILVGHVNLLLGAVLHGTVLRHVANPRGAVTPEYTVANVISVGSGLLSVSVGLVALLASRNLLRPP
LHWVLLALALVNLLLSVACSLGLLLAVSLTVANGGRRLIADCHPGLLDPLVPLDEGPGHTDCPFDPTRIYDTALALWIPSLLMSAGEAAL
SGYCCVAALTLRGVGPCRKDGLQGQASSLSTYKIAEQDFSYFFPDDPPTFIFSPANRRRGRPPKRIHISQEDNVANKQTLASYRSKATKE
RDKLLKQEEMKSLAFEKAKLKREKADALEAKKKEKEDKEKKREELKKIVEEERLKKKEEKERLKVEREKEREKLREEKRKYVEYLKQWSK
PREDMECDDLKELPEPTPVKTRLPPEIFGDALMVLEFLNAFGELFDLQDEFPDGVTLEVLEEALVGNDSEGPLCELLFFFLTAIFQAIAE
EEEEVAKEQLTDADTKGCSLKSLDLDSCTLSEILRLHILASGADVTSANAKYRYQKRGGFDATDDACMELRLSNPSLVKKLSSTSVYDLT
PGEKMKILHALCGKLLTLVSTRDFIEDYVDILRQAKQEFRELKAEQHRKEREEAAARIRKRKEEKLKEQEQKMKEKQEKLKEDEQRNSTA
DISIGEEEREDFDTSIESKDTEQKELDQDMVTEDEDDPGSHKRGRRGKRGQNGFKEFTRQEQINCVTREPLTADEEEALKQEHQRKEKEL
LEKIQSAIACTNIFPLGRDRMYRRYWIFPSIPGLFIEEDYSGLTEDMLLPRPSSFQNNVQSQDPQVSTKTGEPLMSESTSNIDQGPRDHS
VQLPKPVHKPNRWCFYSSCEQLDQLIEALNSRGHRESALKETLLQEKSRICAQLARFSEEKFHFSDKPQPDSKPTYSRGRSSNAYDPSQM
CAEKQLELRLRDFLLDIEDRIYQGTLGAIKVTDRHIWRSALESGRYELLSEENKENGIIKTVNEDVEEMEIDEQTKVIVKDRLLGIKTET
PSTVSTNASTPQSVSSVVHYLAMALFQIEQGIERRFLKAPLDASDSGRSYKTVLDRWRESLLSSASLSQVFLHLSTLDRSVIWSKSILNA
RCKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECRPKQRSRRLSSRQRPSLESDEDVEDSMGGEDDEVDGDEEEGQSE
EEEYEVEQDEDDSQEEEEVSLPKRGRPQVRLPVKTRGKLSSSFSSRGQQQEPGRYPSRSQQSTPKTTVSSKTGRSLRKINSAPPTETKSL
RIASRSTRHSHGPLQADVFVELLSPRRKRRGRKSANNTPENSPNFPNFRVIATKSSEQSRSVNIASKLSLQESESKRRCRKRQSPEPSPV
TLGRRSSGRQGGVHELSAFEQLVVELVRHDDSWPFLKLVSKIQVPDYYDIIKKPIALNIIREKVNKCEYKLASEFIDDIELMFSNCFEYN

--------------------------------------------------------------
>43713_43713_2_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000288873_BAZ1A_chr14_35272194_ENST00000360310_length(transcript)=5363nt_BP=647nt
ACAGCGGCCCTGCGGCTGGCGCGGCGGACGGGATGAGGCGCTGCAGTCTCTGCGCTTTCGACGCCGCCCGGGGGCCCAGGCGGCTGATGC
GTGTGGGCCTCGCGCTGATCTTGGTGGGCCACGTGAACCTGCTGCTGGGGGCCGTGCTGCATGGCACCGTCCTGCGGCACGTGGCCAATC
CCCGCGGCGCTGTCACGCCGGAGTACACCGTAGCCAATGTCATCTCTGTCGGCTCGGGGCTGCTGAGCGTTTCCGTGGGACTTGTGGCCC
TCCTGGCGTCCAGGAACCTTCTTCGCCCTCCACTGCACTGGGTCCTGCTGGCACTAGCTCTGGTGAACCTGCTCTTGTCCGTTGCCTGCT
CCCTGGGCCTCCTTCTTGCTGTGTCACTCACTGTGGCCAACGGTGGCCGCCGCCTTATTGCTGACTGCCACCCAGGACTGCTGGATCCTC
TGGTACCACTGGATGAGGGGCCGGGACATACTGACTGCCCCTTTGACCCCACAAGAATCTATGATACAGCCTTGGCTCTCTGGATCCCTT
CTTTGCTCATGTCTGCAGGGGAGGCTGCTCTATCTGGTTACTGCTGTGTGGCTGCACTCACTCTACGTGGAGTTGGGCCCTGCAGGAAGG
ACGGACTTCAGGGGCAGGCATCATCTCTTTCAACGTATAAAATAGCAGAACAAGATTTTTCTTATTTCTTCCCTGATGATCCACCCACAT
TTATCTTCAGTCCTGCTAACAGACGAAGAGGGAGACCTCCCAAACGAATACATATTAGTCAAGAGGACAATGTTGCTAATAAACAGACTC
TTGCAAGTTATAGGAGCAAAGCTACTAAAGAAAGAGATAAACTTTTGAAACAAGAAGAAATGAAGTCACTGGCTTTTGAAAAGGCTAAAT
TAAAAAGAGAAAAAGCAGATGCCCTAGAAGCGAAGAAAAAAGAAAAAGAAGATAAAGAGAAAAAGAGGGAAGAATTGAAAAAAATTGTTG
AAGAAGAGAGACTAAAGAAAAAAGAAGAAAAAGAGAGGCTTAAAGTAGAAAGAGAAAAGGAAAGAGAGAAGTTACGTGAAGAAAAGCGAA
AGTATGTGGAATACTTAAAACAGTGGAGTAAACCTAGAGAAGATATGGAATGTGATGACCTTAAGGAACTTCCAGAACCAACACCAGTGA
AAACTAGACTACCTCCTGAAATCTTTGGTGATGCTCTGATGGTTTTGGAGTTCCTTAATGCATTTGGGGAACTTTTTGATCTTCAAGATG
AGTTTCCTGATGGAGTAACCCTAGAAGTATTAGAGGAAGCTCTTGTAGGAAATGACAGTGAAGGCCCACTGTGTGAATTGCTTTTTTTCT
TCCTGACTGCAATCTTCCAGGCAATAGCTGAAGAAGAAGAGGAAGTAGCCAAAGAGCAACTAACTGATGCTGACACCAAAGATTTAACAG
AGGCTTTGGATGAAGATGCAGACCCCACAAAATCTGCACTGTCTGCAGTTGCATCTTTGGCAGCTGCATGGCCACAGTTACACCAGGGCT
GCAGTTTGAAAAGTTTGGATCTTGATAGCTGCACTCTTTCAGAAATCCTCAGACTGCACATCTTAGCTTCAGGTGCTGATGTAACATCAG
CAAATGCAAAGTATAGATATCAAAAACGAGGAGGATTTGATGCTACAGATGATGCTTGTATGGAGCTTCGTTTGAGCAATCCCAGTCTAG
TGAAGAAACTGTCAAGCACCTCAGTGTATGATTTGACACCAGGAGAAAAAATGAAGATACTCCATGCTCTCTGTGGAAAGCTACTGACCC
TAGTTTCAACTAGGGATTTTATTGAAGATTATGTTGATATATTACGACAGGCAAAGCAGGAGTTCCGGGAATTAAAAGCAGAACAACATC
GAAAAGAGAGGGAAGAAGCAGCTGCCAGAATTCGTAAAAGGAAGGAAGAAAAACTTAAGGAGCAAGAACAAAAAATGAAAGAGAAACAAG
AAAAACTGAAAGAAGATGAGCAAAGAAATTCAACGGCAGATATATCTATTGGGGAGGAAGAAAGGGAAGATTTTGATACTAGCATTGAGA
GCAAAGACACAGAGCAAAAGGAATTAGATCAAGATATGGTCACTGAAGATGAAGATGACCCAGGATCACATAAAAGAGGCAGAAGGGGGA
AAAGAGGACAAAATGGATTTAAAGAATTTACAAGGCAAGAACAGATCAACTGTGTAACAAGAGAGCCTCTTACTGCTGATGAGGAAGAAG
CATTAAAACAGGAACACCAACGAAAAGAGAAAGAGCTCTTAGAAAAAATCCAAAGTGCCATAGCCTGTACCAATATCTTTCCCTTGGGTC
GCGACCGCATGTATAGACGATACTGGATTTTCCCTTCTATTCCTGGACTCTTTATTGAAGAGGATTATTCTGGTCTTACTGAAGACATGC
TGTTGCCTAGACCTTCATCATTTCAGAATAATGTACAGTCTCAAGATCCTCAGGTATCCACTAAAACTGGAGAGCCTTTGATGTCTGAAT
CTACCTCCAACATTGACCAAGGTCCACGTGACCATTCTGTGCAGCTGCCAAAACCAGTGCATAAGCCAAATCGGTGGTGCTTTTACAGTT
CTTGTGAACAGCTAGACCAGCTTATTGAAGCTCTTAATTCTAGAGGACATAGAGAAAGTGCCTTAAAAGAAACTTTGTTACAAGAGAAAA
GCAGAATATGTGCACAGCTAGCCCGTTTTTCTGAAGAGAAATTTCATTTTTCAGACAAACCTCAGCCTGATAGCAAACCAACATATAGTC
GGGGAAGATCTTCCAATGCATATGATCCATCTCAGATGTGTGCAGAAAAGCAACTTGAACTAAGGCTGAGAGATTTTCTTTTAGATATTG
AAGATAGAATCTACCAAGGAACATTAGGAGCCATCAAGGTTACAGATCGACATATCTGGAGATCAGCATTAGAAAGTGGACGGTATGAGC
TGTTAAGTGAGGAAAACAAGGAAAATGGGATAATTAAAACTGTGAATGAAGACGTAGAAGAGATGGAAATTGATGAACAAACAAAGGTCA
TAGTAAAAGACAGACTTTTGGGGATAAAAACAGAAACTCCAAGTACTGTATCAACAAATGCAAGTACACCACAATCAGTGAGCAGTGTGG
TTCATTATCTGGCAATGGCACTCTTTCAAATAGAGCAGGGCATTGAGCGGCGTTTTCTGAAAGCTCCACTTGATGCCAGTGACAGTGGGC
GTTCTTATAAAACAGTTCTGGACCGTTGGAGAGAGTCTCTCCTTTCTTCTGCTAGTCTATCCCAAGTTTTTCTTCACCTATCCACCTTGG
ATCGTAGCGTGATATGGTCTAAATCTATACTGAATGCGCGTTGCAAGATATGTCGAAAGAAAGGCGATGCTGAAAACATGGTTCTTTGTG
ATGGCTGTGATAGGGGTCATCATACCTACTGTGTTCGACCAAAGCTCAAGACTGTGCCTGAAGGAGACTGGTTTTGTCCAGAATGTCGAC
CAAAGCAACGTTCTAGAAGACTCTCCTCTAGACAGAGACCATCCTTGGAAAGTGATGAAGATGTGGAAGACAGTATGGGAGGTGAGGATG
ATGAAGTTGATGGCGATGAAGAAGAAGGTCAAAGTGAGGAGGAAGAGTATGAGGTAGAACAAGATGAAGATGACTCTCAAGAAGAGGAAG
AAGTCAGCCTACCCAAACGAGGAAGACCACAAGTTAGATTGCCAGTTAAAACAAGAGGGAAACTTAGCTCTTCTTTCTCAAGTCGTGGCC
AACAACAAGAACCTGGAAGATACCCTTCAAGGAGTCAGCAGAGCACACCCAAAACAACTGTTTCTTCTAAAACTGGTAGAAGCCTAAGAA
AGATAAACTCTGCTCCTCCTACAGAAACAAAATCTTTAAGAATTGCCAGTCGTTCTACTCGCCACAGTCATGGCCCACTGCAAGCAGATG
TATTTGTGGAATTGCTTAGTCCTCGTAGAAAACGCAGAGGCAGGAAAAGTGCTAATAATACACCAGAAAATAGTCCCAACTTCCCTAACT
TCAGAGTCATTGCCACAAAGTCAAGTGAACAGTCAAGATCTGTAAATATTGCTTCAAAACTTTCTCTCCAAGAGAGTGAATCCAAAAGAA
GATGCAGAAAAAGACAATCTCCAGAGCCATCGCCTGTGACACTGGGTCGAAGGAGTTCTGGCCGACAGGGAGGAGTTCATGAATTGTCTG
CTTTTGAACAACTTGTTGTAGAATTGGTACGACATGATGACAGCTGGCCTTTTTTGAAACTTGTTTCTAAAATCCAGGTCCCAGACTACT
ATGACATCATCAAAAAGCCCATTGCCTTAAATATAATTCGTGAAAAAGTGAATAAGTGTGAATATAAATTAGCATCTGAGTTTATTGATG
ACATTGAGTTAATGTTTTCGAACTGCTTTGAATACAACCCTCGTAACACAAGTGAAGCAAAAGCTGGAACTAGGCTTCAAGCATTTTTTC
ATATTCAGGCTCAAAAGCTTGGACTCCACGTCACACCCAGTAATGTGGACCAAGTTAGCACACCACCGGCTGCGAAAAAGTCACGAATCT
GACTTTGTCCTTCTAAAGGATATATTTGAAGAAAAACAAATTGTTCATGAAAATGGAACATTAAATCATGCTGTATAAAGCAATAACAAT
TGATTGACCACATGAAAGTGTGGCCTGCACTATATTCTCAATTTTAATATTAAGCACTCAGGAGAATGTAGGAAAGATATCCTTTGCTAC
AGTTTTGTTCAGTATCTAATAAGTTTGATAGATGTATTGGATACAGTACTGGTTTACAGAGGTTTTTGTACATTTTTGAGATCATTCATG
TGTCCAGAGATCTTGGAAAATATTTTTTCACCCACGATTTATTTTGTTATTGATGATTTTTTTTTAAAGTGGTGGTATTAAGGGAGAGTT
ATCTACATGGATGAGTCTTCCGCTATAGCACAGTTTAGAAAAGGTGTTTATGTCTTAATTAATTGTTTGAGTACATTCTTTCAACACTAC
ACATGAATGAATCCAATCTTATAACCTTGAAGTGCTGTACCAGTGCTGGCTGCAGGTATTAAGTCCAAGTTTATTAACTAGATATTTATT
TAGTATTGAGAGTAATTTGTGAATTTGTTTTGTATTTATAAAATTTATACCTGAAAAATGTTCCTTAATGTTTTAAACCTTTTACTGTGT
TTTTATTCCTCTAACTTCCTTAATGATCAATCAAAAAAAGTAACACCCTCCCTTTTTCCTGACAGTTCTTTCAGCTTTACAGAACTGTAT

>43713_43713_2_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000288873_BAZ1A_chr14_35272194_ENST00000360310_length(amino acids)=1519AA_BP=205
MRRCSLCAFDAARGPRRLMRVGLALILVGHVNLLLGAVLHGTVLRHVANPRGAVTPEYTVANVISVGSGLLSVSVGLVALLASRNLLRPP
LHWVLLALALVNLLLSVACSLGLLLAVSLTVANGGRRLIADCHPGLLDPLVPLDEGPGHTDCPFDPTRIYDTALALWIPSLLMSAGEAAL
SGYCCVAALTLRGVGPCRKDGLQGQASSLSTYKIAEQDFSYFFPDDPPTFIFSPANRRRGRPPKRIHISQEDNVANKQTLASYRSKATKE
RDKLLKQEEMKSLAFEKAKLKREKADALEAKKKEKEDKEKKREELKKIVEEERLKKKEEKERLKVEREKEREKLREEKRKYVEYLKQWSK
PREDMECDDLKELPEPTPVKTRLPPEIFGDALMVLEFLNAFGELFDLQDEFPDGVTLEVLEEALVGNDSEGPLCELLFFFLTAIFQAIAE
EEEEVAKEQLTDADTKDLTEALDEDADPTKSALSAVASLAAAWPQLHQGCSLKSLDLDSCTLSEILRLHILASGADVTSANAKYRYQKRG
GFDATDDACMELRLSNPSLVKKLSSTSVYDLTPGEKMKILHALCGKLLTLVSTRDFIEDYVDILRQAKQEFRELKAEQHRKEREEAAARI
RKRKEEKLKEQEQKMKEKQEKLKEDEQRNSTADISIGEEEREDFDTSIESKDTEQKELDQDMVTEDEDDPGSHKRGRRGKRGQNGFKEFT
RQEQINCVTREPLTADEEEALKQEHQRKEKELLEKIQSAIACTNIFPLGRDRMYRRYWIFPSIPGLFIEEDYSGLTEDMLLPRPSSFQNN
VQSQDPQVSTKTGEPLMSESTSNIDQGPRDHSVQLPKPVHKPNRWCFYSSCEQLDQLIEALNSRGHRESALKETLLQEKSRICAQLARFS
EEKFHFSDKPQPDSKPTYSRGRSSNAYDPSQMCAEKQLELRLRDFLLDIEDRIYQGTLGAIKVTDRHIWRSALESGRYELLSEENKENGI
IKTVNEDVEEMEIDEQTKVIVKDRLLGIKTETPSTVSTNASTPQSVSSVVHYLAMALFQIEQGIERRFLKAPLDASDSGRSYKTVLDRWR
ESLLSSASLSQVFLHLSTLDRSVIWSKSILNARCKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECRPKQRSRRLSSR
QRPSLESDEDVEDSMGGEDDEVDGDEEEGQSEEEEYEVEQDEDDSQEEEEVSLPKRGRPQVRLPVKTRGKLSSSFSSRGQQQEPGRYPSR
SQQSTPKTTVSSKTGRSLRKINSAPPTETKSLRIASRSTRHSHGPLQADVFVELLSPRRKRRGRKSANNTPENSPNFPNFRVIATKSSEQ
SRSVNIASKLSLQESESKRRCRKRQSPEPSPVTLGRRSSGRQGGVHELSAFEQLVVELVRHDDSWPFLKLVSKIQVPDYYDIIKKPIALN

--------------------------------------------------------------
>43713_43713_3_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000288873_BAZ1A_chr14_35272194_ENST00000382422_length(transcript)=5368nt_BP=647nt
ACAGCGGCCCTGCGGCTGGCGCGGCGGACGGGATGAGGCGCTGCAGTCTCTGCGCTTTCGACGCCGCCCGGGGGCCCAGGCGGCTGATGC
GTGTGGGCCTCGCGCTGATCTTGGTGGGCCACGTGAACCTGCTGCTGGGGGCCGTGCTGCATGGCACCGTCCTGCGGCACGTGGCCAATC
CCCGCGGCGCTGTCACGCCGGAGTACACCGTAGCCAATGTCATCTCTGTCGGCTCGGGGCTGCTGAGCGTTTCCGTGGGACTTGTGGCCC
TCCTGGCGTCCAGGAACCTTCTTCGCCCTCCACTGCACTGGGTCCTGCTGGCACTAGCTCTGGTGAACCTGCTCTTGTCCGTTGCCTGCT
CCCTGGGCCTCCTTCTTGCTGTGTCACTCACTGTGGCCAACGGTGGCCGCCGCCTTATTGCTGACTGCCACCCAGGACTGCTGGATCCTC
TGGTACCACTGGATGAGGGGCCGGGACATACTGACTGCCCCTTTGACCCCACAAGAATCTATGATACAGCCTTGGCTCTCTGGATCCCTT
CTTTGCTCATGTCTGCAGGGGAGGCTGCTCTATCTGGTTACTGCTGTGTGGCTGCACTCACTCTACGTGGAGTTGGGCCCTGCAGGAAGG
ACGGACTTCAGGGGCAGGCATCATCTCTTTCAACGTATAAAATAGCAGAACAAGATTTTTCTTATTTCTTCCCTGATGATCCACCCACAT
TTATCTTCAGTCCTGCTAACAGACGAAGAGGGAGACCTCCCAAACGAATACATATTAGTCAAGAGGACAATGTTGCTAATAAACAGACTC
TTGCAAGTTATAGGAGCAAAGCTACTAAAGAAAGAGATAAACTTTTGAAACAAGAAGAAATGAAGTCACTGGCTTTTGAAAAGGCTAAAT
TAAAAAGAGAAAAAGCAGATGCCCTAGAAGCGAAGAAAAAAGAAAAAGAAGATAAAGAGAAAAAGAGGGAAGAATTGAAAAAAATTGTTG
AAGAAGAGAGACTAAAGAAAAAAGAAGAAAAAGAGAGGCTTAAAGTAGAAAGAGAAAAGGAAAGAGAGAAGTTACGTGAAGAAAAGCGAA
AGTATGTGGAATACTTAAAACAGTGGAGTAAACCTAGAGAAGATATGGAATGTGATGACCTTAAGGAACTTCCAGAACCAACACCAGTGA
AAACTAGACTACCTCCTGAAATCTTTGGTGATGCTCTGATGGTTTTGGAGTTCCTTAATGCATTTGGGGAACTTTTTGATCTTCAAGATG
AGTTTCCTGATGGAGTAACCCTAGAAGTATTAGAGGAAGCTCTTGTAGGAAATGACAGTGAAGGCCCACTGTGTGAATTGCTTTTTTTCT
TCCTGACTGCAATCTTCCAGGCAATAGCTGAAGAAGAAGAGGAAGTAGCCAAAGAGCAACTAACTGATGCTGACACCAAAGATTTAACAG
AGGCTTTGGATGAAGATGCAGACCCCACAAAATCTGCACTGTCTGCAGTTGCATCTTTGGCAGCTGCATGGCCACAGTTACACCAGGGCT
GCAGTTTGAAAAGTTTGGATCTTGATAGCTGCACTCTTTCAGAAATCCTCAGACTGCACATCTTAGCTTCAGGTGCTGATGTAACATCAG
CAAATGCAAAGTATAGATATCAAAAACGAGGAGGATTTGATGCTACAGATGATGCTTGTATGGAGCTTCGTTTGAGCAATCCCAGTCTAG
TGAAGAAACTGTCAAGCACCTCAGTGTATGATTTGACACCAGGAGAAAAAATGAAGATACTCCATGCTCTCTGTGGAAAGCTACTGACCC
TAGTTTCAACTAGGGATTTTATTGAAGATTATGTTGATATATTACGACAGGCAAAGCAGGAGTTCCGGGAATTAAAAGCAGAACAACATC
GAAAAGAGAGGGAAGAAGCAGCTGCCAGAATTCGTAAAAGGAAGGAAGAAAAACTTAAGGAGCAAGAACAAAAAATGAAAGAGAAACAAG
AAAAACTGAAAGAAGATGAGCAAAGAAATTCAACGGCAGATATATCTATTGGGGAGGAAGAAAGGGAAGATTTTGATACTAGCATTGAGA
GCAAAGACACAGAGCAAAAGGAATTAGATCAAGATATGGTCACTGAAGATGAAGATGACCCAGGATCACATAAAAGAGGCAGAAGGGGGA
AAAGAGGACAAAATGGATTTAAAGAATTTACAAGGCAAGAACAGATCAACTGTGTAACAAGAGAGCCTCTTACTGCTGATGAGGAAGAAG
CATTAAAACAGGAACACCAACGAAAAGAGAAAGAGCTCTTAGAAAAAATCCAAAGTGCCATAGCCTGTACCAATATCTTTCCCTTGGGTC
GCGACCGCATGTATAGACGATACTGGATTTTCCCTTCTATTCCTGGACTCTTTATTGAAGAGGATTATTCTGGTCTTACTGAAGACATGC
TGTTGCCTAGACCTTCATCATTTCAGAATAATGTACAGTCTCAAGATCCTCAGGTATCCACTAAAACTGGAGAGCCTTTGATGTCTGAAT
CTACCTCCAACATTGACCAAGGTCCACGTGACCATTCTGTGCAGCTGCCAAAACCAGTGCATAAGCCAAATCGGTGGTGCTTTTACAGTT
CTTGTGAACAGCTAGACCAGCTTATTGAAGCTCTTAATTCTAGAGGACATAGAGAAAGTGCCTTAAAAGAAACTTTGTTACAAGAGAAAA
GCAGAATATGTGCACAGCTAGCCCGTTTTTCTGAAGAGAAATTTCATTTTTCAGACAAACCTCAGCCTGATAGCAAACCAACATATAGTC
GGGGAAGATCTTCCAATGCATATGATCCATCTCAGATGTGTGCAGAAAAGCAACTTGAACTAAGGCTGAGAGATTTTCTTTTAGATATTG
AAGATAGAATCTACCAAGGAACATTAGGAGCCATCAAGGTTACAGATCGACATATCTGGAGATCAGCATTAGAAAGTGGACGGTATGAGC
TGTTAAGTGAGGAAAACAAGGAAAATGGGATAATTAAAACTGTGAATGAAGACGTAGAAGAGATGGAAATTGATGAACAAACAAAGGTCA
TAGTAAAAGACAGACTTTTGGGGATAAAAACAGAAACTCCAAGTACTGTATCAACAAATGCAAGTACACCACAATCAGTGAGCAGTGTGG
TTCATTATCTGGCAATGGCACTCTTTCAAATAGAGCAGGGCATTGAGCGGCGTTTTCTGAAAGCTCCACTTGATGCCAGTGACAGTGGGC
GTTCTTATAAAACAGTTCTGGACCGTTGGAGAGAGTCTCTCCTTTCTTCTGCTAGTCTATCCCAAGTTTTTCTTCACCTATCCACCTTGG
ATCGTAGCGTGATATGGTCTAAATCTATACTGAATGCGCGTTGCAAGATATGTCGAAAGAAAGGCGATGCTGAAAACATGGTTCTTTGTG
ATGGCTGTGATAGGGGTCATCATACCTACTGTGTTCGACCAAAGCTCAAGACTGTGCCTGAAGGAGACTGGTTTTGTCCAGAATGTCGAC
CAAAGCAACGTTCTAGAAGACTCTCCTCTAGACAGAGACCATCCTTGGAAAGTGATGAAGATGTGGAAGACAGTATGGGAGGTGAGGATG
ATGAAGTTGATGGCGATGAAGAAGAAGGTCAAAGTGAGGAGGAAGAGTATGAGGTAGAACAAGATGAAGATGACTCTCAAGAAGAGGAAG
AAGTCAGCCTACCCAAACGAGGAAGACCACAAGTTAGATTGCCAGTTAAAACAAGAGGGAAACTTAGCTCTTCTTTCTCAAGTCGTGGCC
AACAACAAGAACCTGGAAGATACCCTTCAAGGAGTCAGCAGAGCACACCCAAAACAACTGTTTCTTCTAAAACTGGTAGAAGCCTAAGAA
AGATAAACTCTGCTCCTCCTACAGAAACAAAATCTTTAAGAATTGCCAGTCGTTCTACTCGCCACAGTCATGGCCCACTGCAAGCAGATG
TATTTGTGGAATTGCTTAGTCCTCGTAGAAAACGCAGAGGCAGGAAAAGTGCTAATAATACACCAGAAAATAGTCCCAACTTCCCTAACT
TCAGAGTCATTGCCACAAAGTCAAGTGAACAGTCAAGATCTGTAAATATTGCTTCAAAACTTTCTCTCCAAGAGAGTGAATCCAAAAGAA
GATGCAGAAAAAGACAATCTCCAGAGCCATCGCCTGTGACACTGGGTCGAAGGAGTTCTGGCCGACAGGGAGGAGTTCATGAATTGTCTG
CTTTTGAACAACTTGTTGTAGAATTGGTACGACATGATGACAGCTGGCCTTTTTTGAAACTTGTTTCTAAAATCCAGGTCCCAGACTACT
ATGACATCATCAAAAAGCCCATTGCCTTAAATATAATTCGTGAAAAAGTGAATAAGTGTGAATATAAATTAGCATCTGAGTTTATTGATG
ACATTGAGTTAATGTTTTCGAACTGCTTTGAATACAACCCTCGTAACACAAGTGAAGCAAAAGCTGGAACTAGGCTTCAAGCATTTTTTC
ATATTCAGGCTCAAAAGCTTGGACTCCACGTCACACCCAGTAATGTGGACCAAGTTAGCACACCACCGGCTGCGAAAAAGTCACGAATCT
GACTTTGTCCTTCTAAAGGATATATTTGAAGAAAAACAAATTGTTCATGAAAATGGAACATTAAATCATGCTGTATAAAGCAATAACAAT
TGATTGACCACATGAAAGTGTGGCCTGCACTATATTCTCAATTTTAATATTAAGCACTCAGGAGAATGTAGGAAAGATATCCTTTGCTAC
AGTTTTGTTCAGTATCTAATAAGTTTGATAGATGTATTGGATACAGTACTGGTTTACAGAGGTTTTTGTACATTTTTGAGATCATTCATG
TGTCCAGAGATCTTGGAAAATATTTTTTCACCCACGATTTATTTTGTTATTGATGATTTTTTTTTAAAGTGGTGGTATTAAGGGAGAGTT
ATCTACATGGATGAGTCTTCCGCTATAGCACAGTTTAGAAAAGGTGTTTATGTCTTAATTAATTGTTTGAGTACATTCTTTCAACACTAC
ACATGAATGAATCCAATCTTATAACCTTGAAGTGCTGTACCAGTGCTGGCTGCAGGTATTAAGTCCAAGTTTATTAACTAGATATTTATT
TAGTATTGAGAGTAATTTGTGAATTTGTTTTGTATTTATAAAATTTATACCTGAAAAATGTTCCTTAATGTTTTAAACCTTTTACTGTGT
TTTTATTCCTCTAACTTCCTTAATGATCAATCAAAAAAAGTAACACCCTCCCTTTTTCCTGACAGTTCTTTCAGCTTTACAGAACTGTAT

>43713_43713_3_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000288873_BAZ1A_chr14_35272194_ENST00000382422_length(amino acids)=1519AA_BP=205
MRRCSLCAFDAARGPRRLMRVGLALILVGHVNLLLGAVLHGTVLRHVANPRGAVTPEYTVANVISVGSGLLSVSVGLVALLASRNLLRPP
LHWVLLALALVNLLLSVACSLGLLLAVSLTVANGGRRLIADCHPGLLDPLVPLDEGPGHTDCPFDPTRIYDTALALWIPSLLMSAGEAAL
SGYCCVAALTLRGVGPCRKDGLQGQASSLSTYKIAEQDFSYFFPDDPPTFIFSPANRRRGRPPKRIHISQEDNVANKQTLASYRSKATKE
RDKLLKQEEMKSLAFEKAKLKREKADALEAKKKEKEDKEKKREELKKIVEEERLKKKEEKERLKVEREKEREKLREEKRKYVEYLKQWSK
PREDMECDDLKELPEPTPVKTRLPPEIFGDALMVLEFLNAFGELFDLQDEFPDGVTLEVLEEALVGNDSEGPLCELLFFFLTAIFQAIAE
EEEEVAKEQLTDADTKDLTEALDEDADPTKSALSAVASLAAAWPQLHQGCSLKSLDLDSCTLSEILRLHILASGADVTSANAKYRYQKRG
GFDATDDACMELRLSNPSLVKKLSSTSVYDLTPGEKMKILHALCGKLLTLVSTRDFIEDYVDILRQAKQEFRELKAEQHRKEREEAAARI
RKRKEEKLKEQEQKMKEKQEKLKEDEQRNSTADISIGEEEREDFDTSIESKDTEQKELDQDMVTEDEDDPGSHKRGRRGKRGQNGFKEFT
RQEQINCVTREPLTADEEEALKQEHQRKEKELLEKIQSAIACTNIFPLGRDRMYRRYWIFPSIPGLFIEEDYSGLTEDMLLPRPSSFQNN
VQSQDPQVSTKTGEPLMSESTSNIDQGPRDHSVQLPKPVHKPNRWCFYSSCEQLDQLIEALNSRGHRESALKETLLQEKSRICAQLARFS
EEKFHFSDKPQPDSKPTYSRGRSSNAYDPSQMCAEKQLELRLRDFLLDIEDRIYQGTLGAIKVTDRHIWRSALESGRYELLSEENKENGI
IKTVNEDVEEMEIDEQTKVIVKDRLLGIKTETPSTVSTNASTPQSVSSVVHYLAMALFQIEQGIERRFLKAPLDASDSGRSYKTVLDRWR
ESLLSSASLSQVFLHLSTLDRSVIWSKSILNARCKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECRPKQRSRRLSSR
QRPSLESDEDVEDSMGGEDDEVDGDEEEGQSEEEEYEVEQDEDDSQEEEEVSLPKRGRPQVRLPVKTRGKLSSSFSSRGQQQEPGRYPSR
SQQSTPKTTVSSKTGRSLRKINSAPPTETKSLRIASRSTRHSHGPLQADVFVELLSPRRKRRGRKSANNTPENSPNFPNFRVIATKSSEQ
SRSVNIASKLSLQESESKRRCRKRQSPEPSPVTLGRRSSGRQGGVHELSAFEQLVVELVRHDDSWPFLKLVSKIQVPDYYDIIKKPIALN

--------------------------------------------------------------
>43713_43713_4_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000407293_BAZ1A_chr14_35272194_ENST00000358716_length(transcript)=5346nt_BP=720nt
TTCCGGGCCCTGGCGTCTCGTCTCCTTACCCTGGGGCTACCCTTGCCCCGTCCTACTGCCCGCGGTTAACCCGCCGCGAGCCGCCTCTCC
CCTCCCCGCCCGACTCAACCCTGCCCTCCCCCGTGCTTTGCAGACGCCGCCCGGGGGCCCAGGCGGCTGATGCGTGTGGGCCTCGCGCTG
ATCTTGGTGGGCCACGTGAACCTGCTGCTGGGGGCCGTGCTGCATGGCACCGTCCTGCGGCACGTGGCCAATCCCCGCGGCGCTGTCACG
CCGGAGTACACCGTAGCCAATGTCATCTCTGTCGGCTCGGGGCTGCTGAGCGTTTCCGTGGGACTTGTGGCCCTCCTGGCGTCCAGGAAC
CTTCTTCGCCCTCCACTGCACTGGGTCCTGCTGGCACTAGCTCTGGTGAACCTGCTCTTGTCCGTTGCCTGCTCCCTGGGCCTCCTTCTT
GCTGTGTCACTCACTGTGGCCAACGGTGGCCGCCGCCTTATTGCTGACTGCCACCCAGGACTGCTGGATCCTCTGGTACCACTGGATGAG
GGGCCGGGACATACTGACTGCCCCTTTGACCCCACAAGAATCTATGATACAGCCTTGGCTCTCTGGATCCCTTCTTTGCTCATGTCTGCA
GGGGAGGCTGCTCTATCTGGTTACTGCTGTGTGGCTGCACTCACTCTACGTGGAGTTGGGCCCTGCAGGAAGGACGGACTTCAGGGGCAG
GCATCATCTCTTTCAACGTATAAAATAGCAGAACAAGATTTTTCTTATTTCTTCCCTGATGATCCACCCACATTTATCTTCAGTCCTGCT
AACAGACGAAGAGGGAGACCTCCCAAACGAATACATATTAGTCAAGAGGACAATGTTGCTAATAAACAGACTCTTGCAAGTTATAGGAGC
AAAGCTACTAAAGAAAGAGATAAACTTTTGAAACAAGAAGAAATGAAGTCACTGGCTTTTGAAAAGGCTAAATTAAAAAGAGAAAAAGCA
GATGCCCTAGAAGCGAAGAAAAAAGAAAAAGAAGATAAAGAGAAAAAGAGGGAAGAATTGAAAAAAATTGTTGAAGAAGAGAGACTAAAG
AAAAAAGAAGAAAAAGAGAGGCTTAAAGTAGAAAGAGAAAAGGAAAGAGAGAAGTTACGTGAAGAAAAGCGAAAGTATGTGGAATACTTA
AAACAGTGGAGTAAACCTAGAGAAGATATGGAATGTGATGACCTTAAGGAACTTCCAGAACCAACACCAGTGAAAACTAGACTACCTCCT
GAAATCTTTGGTGATGCTCTGATGGTTTTGGAGTTCCTTAATGCATTTGGGGAACTTTTTGATCTTCAAGATGAGTTTCCTGATGGAGTA
ACCCTAGAAGTATTAGAGGAAGCTCTTGTAGGAAATGACAGTGAAGGCCCACTGTGTGAATTGCTTTTTTTCTTCCTGACTGCAATCTTC
CAGGCAATAGCTGAAGAAGAAGAGGAAGTAGCCAAAGAGCAACTAACTGATGCTGACACCAAAGGCTGCAGTTTGAAAAGTTTGGATCTT
GATAGCTGCACTCTTTCAGAAATCCTCAGACTGCACATCTTAGCTTCAGGTGCTGATGTAACATCAGCAAATGCAAAGTATAGATATCAA
AAACGAGGAGGATTTGATGCTACAGATGATGCTTGTATGGAGCTTCGTTTGAGCAATCCCAGTCTAGTGAAGAAACTGTCAAGCACCTCA
GTGTATGATTTGACACCAGGAGAAAAAATGAAGATACTCCATGCTCTCTGTGGAAAGCTACTGACCCTAGTTTCAACTAGGGATTTTATT
GAAGATTATGTTGATATATTACGACAGGCAAAGCAGGAGTTCCGGGAATTAAAAGCAGAACAACATCGAAAAGAGAGGGAAGAAGCAGCT
GCCAGAATTCGTAAAAGGAAGGAAGAAAAACTTAAGGAGCAAGAACAAAAAATGAAAGAGAAACAAGAAAAACTGAAAGAAGATGAGCAA
AGAAATTCAACGGCAGATATATCTATTGGGGAGGAAGAAAGGGAAGATTTTGATACTAGCATTGAGAGCAAAGACACAGAGCAAAAGGAA
TTAGATCAAGATATGGTCACTGAAGATGAAGATGACCCAGGATCACATAAAAGAGGCAGAAGGGGGAAAAGAGGACAAAATGGATTTAAA
GAATTTACAAGGCAAGAACAGATCAACTGTGTAACAAGAGAGCCTCTTACTGCTGATGAGGAAGAAGCATTAAAACAGGAACACCAACGA
AAAGAGAAAGAGCTCTTAGAAAAAATCCAAAGTGCCATAGCCTGTACCAATATCTTTCCCTTGGGTCGCGACCGCATGTATAGACGATAC
TGGATTTTCCCTTCTATTCCTGGACTCTTTATTGAAGAGGATTATTCTGGTCTTACTGAAGACATGCTGTTGCCTAGACCTTCATCATTT
CAGAATAATGTACAGTCTCAAGATCCTCAGGTATCCACTAAAACTGGAGAGCCTTTGATGTCTGAATCTACCTCCAACATTGACCAAGGT
CCACGTGACCATTCTGTGCAGCTGCCAAAACCAGTGCATAAGCCAAATCGGTGGTGCTTTTACAGTTCTTGTGAACAGCTAGACCAGCTT
ATTGAAGCTCTTAATTCTAGAGGACATAGAGAAAGTGCCTTAAAAGAAACTTTGTTACAAGAGAAAAGCAGAATATGTGCACAGCTAGCC
CGTTTTTCTGAAGAGAAATTTCATTTTTCAGACAAACCTCAGCCTGATAGCAAACCAACATATAGTCGGGGAAGATCTTCCAATGCATAT
GATCCATCTCAGATGTGTGCAGAAAAGCAACTTGAACTAAGGCTGAGAGATTTTCTTTTAGATATTGAAGATAGAATCTACCAAGGAACA
TTAGGAGCCATCAAGGTTACAGATCGACATATCTGGAGATCAGCATTAGAAAGTGGACGGTATGAGCTGTTAAGTGAGGAAAACAAGGAA
AATGGGATAATTAAAACTGTGAATGAAGACGTAGAAGAGATGGAAATTGATGAACAAACAAAGGTCATAGTAAAAGACAGACTTTTGGGG
ATAAAAACAGAAACTCCAAGTACTGTATCAACAAATGCAAGTACACCACAATCAGTGAGCAGTGTGGTTCATTATCTGGCAATGGCACTC
TTTCAAATAGAGCAGGGCATTGAGCGGCGTTTTCTGAAAGCTCCACTTGATGCCAGTGACAGTGGGCGTTCTTATAAAACAGTTCTGGAC
CGTTGGAGAGAGTCTCTCCTTTCTTCTGCTAGTCTATCCCAAGTTTTTCTTCACCTATCCACCTTGGATCGTAGCGTGATATGGTCTAAA
TCTATACTGAATGCGCGTTGCAAGATATGTCGAAAGAAAGGCGATGCTGAAAACATGGTTCTTTGTGATGGCTGTGATAGGGGTCATCAT
ACCTACTGTGTTCGACCAAAGCTCAAGACTGTGCCTGAAGGAGACTGGTTTTGTCCAGAATGTCGACCAAAGCAACGTTCTAGAAGACTC
TCCTCTAGACAGAGACCATCCTTGGAAAGTGATGAAGATGTGGAAGACAGTATGGGAGGTGAGGATGATGAAGTTGATGGCGATGAAGAA
GAAGGTCAAAGTGAGGAGGAAGAGTATGAGGTAGAACAAGATGAAGATGACTCTCAAGAAGAGGAAGAAGTCAGCCTACCCAAACGAGGA
AGACCACAAGTTAGATTGCCAGTTAAAACAAGAGGGAAACTTAGCTCTTCTTTCTCAAGTCGTGGCCAACAACAAGAACCTGGAAGATAC
CCTTCAAGGAGTCAGCAGAGCACACCCAAAACAACTGTTTCTTCTAAAACTGGTAGAAGCCTAAGAAAGATAAACTCTGCTCCTCCTACA
GAAACAAAATCTTTAAGAATTGCCAGTCGTTCTACTCGCCACAGTCATGGCCCACTGCAAGCAGATGTATTTGTGGAATTGCTTAGTCCT
CGTAGAAAACGCAGAGGCAGGAAAAGTGCTAATAATACACCAGAAAATAGTCCCAACTTCCCTAACTTCAGAGTCATTGCCACAAAGTCA
AGTGAACAGTCAAGATCTGTAAATATTGCTTCAAAACTTTCTCTCCAAGAGAGTGAATCCAAAAGAAGATGCAGAAAAAGACAATCTCCA
GAGCCATCGCCTGTGACACTGGGTCGAAGGAGTTCTGGCCGACAGGGAGGAGTTCATGAATTGTCTGCTTTTGAACAACTTGTTGTAGAA
TTGGTACGACATGATGACAGCTGGCCTTTTTTGAAACTTGTTTCTAAAATCCAGGTCCCAGACTACTATGACATCATCAAAAAGCCCATT
GCCTTAAATATAATTCGTGAAAAAGTGAATAAGTGTGAATATAAATTAGCATCTGAGTTTATTGATGACATTGAGTTAATGTTTTCGAAC
TGCTTTGAATACAACCCTCGTAACACAAGTGAAGCAAAAGCTGGAACTAGGCTTCAAGCATTTTTTCATATTCAGGCTCAAAAGCTTGGA
CTCCACGTCACACCCAGTAATGTGGACCAAGTTAGCACACCACCGGCTGCGAAAAAGTCACGAATCTGACTTTGTCCTTCTAAAGGATAT
ATTTGAAGAAAAACAAATTGTTCATGAAAATGGAACATTAAATCATGCTGTATAAAGCAATAACAATTGATTGACCACATGAAAGTGTGG
CCTGCACTATATTCTCAATTTTAATATTAAGCACTCAGGAGAATGTAGGAAAGATATCCTTTGCTACAGTTTTGTTCAGTATCTAATAAG
TTTGATAGATGTATTGGATACAGTACTGGTTTACAGAGGTTTTTGTACATTTTTGAGATCATTCATGTGTCCAGAGATCTTGGAAAATAT
TTTTTCACCCACGATTTATTTTGTTATTGATGATTTTTTTTTAAAGTGGTGGTATTAAGGGAGAGTTATCTACATGGATGAGTCTTCCGC
TATAGCACAGTTTAGAAAAGGTGTTTATGTCTTAATTAATTGTTTGAGTACATTCTTTCAACACTACACATGAATGAATCCAATCTTATA
ACCTTGAAGTGCTGTACCAGTGCTGGCTGCAGGTATTAAGTCCAAGTTTATTAACTAGATATTTATTTAGTATTGAGAGTAATTTGTGAA
TTTGTTTTGTATTTATAAAATTTATACCTGAAAAATGTTCCTTAATGTTTTAAACCTTTTACTGTGTTTTTATTCCTCTAACTTCCTTAA
TGATCAATCAAAAAAAGTAACACCCTCCCTTTTTCCTGACAGTTCTTTCAGCTTTACAGAACTGTATTATAAGTTTCTATGTATAACTTT

>43713_43713_4_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000407293_BAZ1A_chr14_35272194_ENST00000358716_length(amino acids)=1470AA_BP=188
MMRVGLALILVGHVNLLLGAVLHGTVLRHVANPRGAVTPEYTVANVISVGSGLLSVSVGLVALLASRNLLRPPLHWVLLALALVNLLLSV
ACSLGLLLAVSLTVANGGRRLIADCHPGLLDPLVPLDEGPGHTDCPFDPTRIYDTALALWIPSLLMSAGEAALSGYCCVAALTLRGVGPC
RKDGLQGQASSLSTYKIAEQDFSYFFPDDPPTFIFSPANRRRGRPPKRIHISQEDNVANKQTLASYRSKATKERDKLLKQEEMKSLAFEK
AKLKREKADALEAKKKEKEDKEKKREELKKIVEEERLKKKEEKERLKVEREKEREKLREEKRKYVEYLKQWSKPREDMECDDLKELPEPT
PVKTRLPPEIFGDALMVLEFLNAFGELFDLQDEFPDGVTLEVLEEALVGNDSEGPLCELLFFFLTAIFQAIAEEEEEVAKEQLTDADTKG
CSLKSLDLDSCTLSEILRLHILASGADVTSANAKYRYQKRGGFDATDDACMELRLSNPSLVKKLSSTSVYDLTPGEKMKILHALCGKLLT
LVSTRDFIEDYVDILRQAKQEFRELKAEQHRKEREEAAARIRKRKEEKLKEQEQKMKEKQEKLKEDEQRNSTADISIGEEEREDFDTSIE
SKDTEQKELDQDMVTEDEDDPGSHKRGRRGKRGQNGFKEFTRQEQINCVTREPLTADEEEALKQEHQRKEKELLEKIQSAIACTNIFPLG
RDRMYRRYWIFPSIPGLFIEEDYSGLTEDMLLPRPSSFQNNVQSQDPQVSTKTGEPLMSESTSNIDQGPRDHSVQLPKPVHKPNRWCFYS
SCEQLDQLIEALNSRGHRESALKETLLQEKSRICAQLARFSEEKFHFSDKPQPDSKPTYSRGRSSNAYDPSQMCAEKQLELRLRDFLLDI
EDRIYQGTLGAIKVTDRHIWRSALESGRYELLSEENKENGIIKTVNEDVEEMEIDEQTKVIVKDRLLGIKTETPSTVSTNASTPQSVSSV
VHYLAMALFQIEQGIERRFLKAPLDASDSGRSYKTVLDRWRESLLSSASLSQVFLHLSTLDRSVIWSKSILNARCKICRKKGDAENMVLC
DGCDRGHHTYCVRPKLKTVPEGDWFCPECRPKQRSRRLSSRQRPSLESDEDVEDSMGGEDDEVDGDEEEGQSEEEEYEVEQDEDDSQEEE
EVSLPKRGRPQVRLPVKTRGKLSSSFSSRGQQQEPGRYPSRSQQSTPKTTVSSKTGRSLRKINSAPPTETKSLRIASRSTRHSHGPLQAD
VFVELLSPRRKRRGRKSANNTPENSPNFPNFRVIATKSSEQSRSVNIASKLSLQESESKRRCRKRQSPEPSPVTLGRRSSGRQGGVHELS
AFEQLVVELVRHDDSWPFLKLVSKIQVPDYYDIIKKPIALNIIREKVNKCEYKLASEFIDDIELMFSNCFEYNPRNTSEAKAGTRLQAFF

--------------------------------------------------------------
>43713_43713_5_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000407293_BAZ1A_chr14_35272194_ENST00000360310_length(transcript)=5436nt_BP=720nt
TTCCGGGCCCTGGCGTCTCGTCTCCTTACCCTGGGGCTACCCTTGCCCCGTCCTACTGCCCGCGGTTAACCCGCCGCGAGCCGCCTCTCC
CCTCCCCGCCCGACTCAACCCTGCCCTCCCCCGTGCTTTGCAGACGCCGCCCGGGGGCCCAGGCGGCTGATGCGTGTGGGCCTCGCGCTG
ATCTTGGTGGGCCACGTGAACCTGCTGCTGGGGGCCGTGCTGCATGGCACCGTCCTGCGGCACGTGGCCAATCCCCGCGGCGCTGTCACG
CCGGAGTACACCGTAGCCAATGTCATCTCTGTCGGCTCGGGGCTGCTGAGCGTTTCCGTGGGACTTGTGGCCCTCCTGGCGTCCAGGAAC
CTTCTTCGCCCTCCACTGCACTGGGTCCTGCTGGCACTAGCTCTGGTGAACCTGCTCTTGTCCGTTGCCTGCTCCCTGGGCCTCCTTCTT
GCTGTGTCACTCACTGTGGCCAACGGTGGCCGCCGCCTTATTGCTGACTGCCACCCAGGACTGCTGGATCCTCTGGTACCACTGGATGAG
GGGCCGGGACATACTGACTGCCCCTTTGACCCCACAAGAATCTATGATACAGCCTTGGCTCTCTGGATCCCTTCTTTGCTCATGTCTGCA
GGGGAGGCTGCTCTATCTGGTTACTGCTGTGTGGCTGCACTCACTCTACGTGGAGTTGGGCCCTGCAGGAAGGACGGACTTCAGGGGCAG
GCATCATCTCTTTCAACGTATAAAATAGCAGAACAAGATTTTTCTTATTTCTTCCCTGATGATCCACCCACATTTATCTTCAGTCCTGCT
AACAGACGAAGAGGGAGACCTCCCAAACGAATACATATTAGTCAAGAGGACAATGTTGCTAATAAACAGACTCTTGCAAGTTATAGGAGC
AAAGCTACTAAAGAAAGAGATAAACTTTTGAAACAAGAAGAAATGAAGTCACTGGCTTTTGAAAAGGCTAAATTAAAAAGAGAAAAAGCA
GATGCCCTAGAAGCGAAGAAAAAAGAAAAAGAAGATAAAGAGAAAAAGAGGGAAGAATTGAAAAAAATTGTTGAAGAAGAGAGACTAAAG
AAAAAAGAAGAAAAAGAGAGGCTTAAAGTAGAAAGAGAAAAGGAAAGAGAGAAGTTACGTGAAGAAAAGCGAAAGTATGTGGAATACTTA
AAACAGTGGAGTAAACCTAGAGAAGATATGGAATGTGATGACCTTAAGGAACTTCCAGAACCAACACCAGTGAAAACTAGACTACCTCCT
GAAATCTTTGGTGATGCTCTGATGGTTTTGGAGTTCCTTAATGCATTTGGGGAACTTTTTGATCTTCAAGATGAGTTTCCTGATGGAGTA
ACCCTAGAAGTATTAGAGGAAGCTCTTGTAGGAAATGACAGTGAAGGCCCACTGTGTGAATTGCTTTTTTTCTTCCTGACTGCAATCTTC
CAGGCAATAGCTGAAGAAGAAGAGGAAGTAGCCAAAGAGCAACTAACTGATGCTGACACCAAAGATTTAACAGAGGCTTTGGATGAAGAT
GCAGACCCCACAAAATCTGCACTGTCTGCAGTTGCATCTTTGGCAGCTGCATGGCCACAGTTACACCAGGGCTGCAGTTTGAAAAGTTTG
GATCTTGATAGCTGCACTCTTTCAGAAATCCTCAGACTGCACATCTTAGCTTCAGGTGCTGATGTAACATCAGCAAATGCAAAGTATAGA
TATCAAAAACGAGGAGGATTTGATGCTACAGATGATGCTTGTATGGAGCTTCGTTTGAGCAATCCCAGTCTAGTGAAGAAACTGTCAAGC
ACCTCAGTGTATGATTTGACACCAGGAGAAAAAATGAAGATACTCCATGCTCTCTGTGGAAAGCTACTGACCCTAGTTTCAACTAGGGAT
TTTATTGAAGATTATGTTGATATATTACGACAGGCAAAGCAGGAGTTCCGGGAATTAAAAGCAGAACAACATCGAAAAGAGAGGGAAGAA
GCAGCTGCCAGAATTCGTAAAAGGAAGGAAGAAAAACTTAAGGAGCAAGAACAAAAAATGAAAGAGAAACAAGAAAAACTGAAAGAAGAT
GAGCAAAGAAATTCAACGGCAGATATATCTATTGGGGAGGAAGAAAGGGAAGATTTTGATACTAGCATTGAGAGCAAAGACACAGAGCAA
AAGGAATTAGATCAAGATATGGTCACTGAAGATGAAGATGACCCAGGATCACATAAAAGAGGCAGAAGGGGGAAAAGAGGACAAAATGGA
TTTAAAGAATTTACAAGGCAAGAACAGATCAACTGTGTAACAAGAGAGCCTCTTACTGCTGATGAGGAAGAAGCATTAAAACAGGAACAC
CAACGAAAAGAGAAAGAGCTCTTAGAAAAAATCCAAAGTGCCATAGCCTGTACCAATATCTTTCCCTTGGGTCGCGACCGCATGTATAGA
CGATACTGGATTTTCCCTTCTATTCCTGGACTCTTTATTGAAGAGGATTATTCTGGTCTTACTGAAGACATGCTGTTGCCTAGACCTTCA
TCATTTCAGAATAATGTACAGTCTCAAGATCCTCAGGTATCCACTAAAACTGGAGAGCCTTTGATGTCTGAATCTACCTCCAACATTGAC
CAAGGTCCACGTGACCATTCTGTGCAGCTGCCAAAACCAGTGCATAAGCCAAATCGGTGGTGCTTTTACAGTTCTTGTGAACAGCTAGAC
CAGCTTATTGAAGCTCTTAATTCTAGAGGACATAGAGAAAGTGCCTTAAAAGAAACTTTGTTACAAGAGAAAAGCAGAATATGTGCACAG
CTAGCCCGTTTTTCTGAAGAGAAATTTCATTTTTCAGACAAACCTCAGCCTGATAGCAAACCAACATATAGTCGGGGAAGATCTTCCAAT
GCATATGATCCATCTCAGATGTGTGCAGAAAAGCAACTTGAACTAAGGCTGAGAGATTTTCTTTTAGATATTGAAGATAGAATCTACCAA
GGAACATTAGGAGCCATCAAGGTTACAGATCGACATATCTGGAGATCAGCATTAGAAAGTGGACGGTATGAGCTGTTAAGTGAGGAAAAC
AAGGAAAATGGGATAATTAAAACTGTGAATGAAGACGTAGAAGAGATGGAAATTGATGAACAAACAAAGGTCATAGTAAAAGACAGACTT
TTGGGGATAAAAACAGAAACTCCAAGTACTGTATCAACAAATGCAAGTACACCACAATCAGTGAGCAGTGTGGTTCATTATCTGGCAATG
GCACTCTTTCAAATAGAGCAGGGCATTGAGCGGCGTTTTCTGAAAGCTCCACTTGATGCCAGTGACAGTGGGCGTTCTTATAAAACAGTT
CTGGACCGTTGGAGAGAGTCTCTCCTTTCTTCTGCTAGTCTATCCCAAGTTTTTCTTCACCTATCCACCTTGGATCGTAGCGTGATATGG
TCTAAATCTATACTGAATGCGCGTTGCAAGATATGTCGAAAGAAAGGCGATGCTGAAAACATGGTTCTTTGTGATGGCTGTGATAGGGGT
CATCATACCTACTGTGTTCGACCAAAGCTCAAGACTGTGCCTGAAGGAGACTGGTTTTGTCCAGAATGTCGACCAAAGCAACGTTCTAGA
AGACTCTCCTCTAGACAGAGACCATCCTTGGAAAGTGATGAAGATGTGGAAGACAGTATGGGAGGTGAGGATGATGAAGTTGATGGCGAT
GAAGAAGAAGGTCAAAGTGAGGAGGAAGAGTATGAGGTAGAACAAGATGAAGATGACTCTCAAGAAGAGGAAGAAGTCAGCCTACCCAAA
CGAGGAAGACCACAAGTTAGATTGCCAGTTAAAACAAGAGGGAAACTTAGCTCTTCTTTCTCAAGTCGTGGCCAACAACAAGAACCTGGA
AGATACCCTTCAAGGAGTCAGCAGAGCACACCCAAAACAACTGTTTCTTCTAAAACTGGTAGAAGCCTAAGAAAGATAAACTCTGCTCCT
CCTACAGAAACAAAATCTTTAAGAATTGCCAGTCGTTCTACTCGCCACAGTCATGGCCCACTGCAAGCAGATGTATTTGTGGAATTGCTT
AGTCCTCGTAGAAAACGCAGAGGCAGGAAAAGTGCTAATAATACACCAGAAAATAGTCCCAACTTCCCTAACTTCAGAGTCATTGCCACA
AAGTCAAGTGAACAGTCAAGATCTGTAAATATTGCTTCAAAACTTTCTCTCCAAGAGAGTGAATCCAAAAGAAGATGCAGAAAAAGACAA
TCTCCAGAGCCATCGCCTGTGACACTGGGTCGAAGGAGTTCTGGCCGACAGGGAGGAGTTCATGAATTGTCTGCTTTTGAACAACTTGTT
GTAGAATTGGTACGACATGATGACAGCTGGCCTTTTTTGAAACTTGTTTCTAAAATCCAGGTCCCAGACTACTATGACATCATCAAAAAG
CCCATTGCCTTAAATATAATTCGTGAAAAAGTGAATAAGTGTGAATATAAATTAGCATCTGAGTTTATTGATGACATTGAGTTAATGTTT
TCGAACTGCTTTGAATACAACCCTCGTAACACAAGTGAAGCAAAAGCTGGAACTAGGCTTCAAGCATTTTTTCATATTCAGGCTCAAAAG
CTTGGACTCCACGTCACACCCAGTAATGTGGACCAAGTTAGCACACCACCGGCTGCGAAAAAGTCACGAATCTGACTTTGTCCTTCTAAA
GGATATATTTGAAGAAAAACAAATTGTTCATGAAAATGGAACATTAAATCATGCTGTATAAAGCAATAACAATTGATTGACCACATGAAA
GTGTGGCCTGCACTATATTCTCAATTTTAATATTAAGCACTCAGGAGAATGTAGGAAAGATATCCTTTGCTACAGTTTTGTTCAGTATCT
AATAAGTTTGATAGATGTATTGGATACAGTACTGGTTTACAGAGGTTTTTGTACATTTTTGAGATCATTCATGTGTCCAGAGATCTTGGA
AAATATTTTTTCACCCACGATTTATTTTGTTATTGATGATTTTTTTTTAAAGTGGTGGTATTAAGGGAGAGTTATCTACATGGATGAGTC
TTCCGCTATAGCACAGTTTAGAAAAGGTGTTTATGTCTTAATTAATTGTTTGAGTACATTCTTTCAACACTACACATGAATGAATCCAAT
CTTATAACCTTGAAGTGCTGTACCAGTGCTGGCTGCAGGTATTAAGTCCAAGTTTATTAACTAGATATTTATTTAGTATTGAGAGTAATT
TGTGAATTTGTTTTGTATTTATAAAATTTATACCTGAAAAATGTTCCTTAATGTTTTAAACCTTTTACTGTGTTTTTATTCCTCTAACTT
CCTTAATGATCAATCAAAAAAAGTAACACCCTCCCTTTTTCCTGACAGTTCTTTCAGCTTTACAGAACTGTATTATAAGTTTCTATGTAT

>43713_43713_5_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000407293_BAZ1A_chr14_35272194_ENST00000360310_length(amino acids)=1502AA_BP=188
MMRVGLALILVGHVNLLLGAVLHGTVLRHVANPRGAVTPEYTVANVISVGSGLLSVSVGLVALLASRNLLRPPLHWVLLALALVNLLLSV
ACSLGLLLAVSLTVANGGRRLIADCHPGLLDPLVPLDEGPGHTDCPFDPTRIYDTALALWIPSLLMSAGEAALSGYCCVAALTLRGVGPC
RKDGLQGQASSLSTYKIAEQDFSYFFPDDPPTFIFSPANRRRGRPPKRIHISQEDNVANKQTLASYRSKATKERDKLLKQEEMKSLAFEK
AKLKREKADALEAKKKEKEDKEKKREELKKIVEEERLKKKEEKERLKVEREKEREKLREEKRKYVEYLKQWSKPREDMECDDLKELPEPT
PVKTRLPPEIFGDALMVLEFLNAFGELFDLQDEFPDGVTLEVLEEALVGNDSEGPLCELLFFFLTAIFQAIAEEEEEVAKEQLTDADTKD
LTEALDEDADPTKSALSAVASLAAAWPQLHQGCSLKSLDLDSCTLSEILRLHILASGADVTSANAKYRYQKRGGFDATDDACMELRLSNP
SLVKKLSSTSVYDLTPGEKMKILHALCGKLLTLVSTRDFIEDYVDILRQAKQEFRELKAEQHRKEREEAAARIRKRKEEKLKEQEQKMKE
KQEKLKEDEQRNSTADISIGEEEREDFDTSIESKDTEQKELDQDMVTEDEDDPGSHKRGRRGKRGQNGFKEFTRQEQINCVTREPLTADE
EEALKQEHQRKEKELLEKIQSAIACTNIFPLGRDRMYRRYWIFPSIPGLFIEEDYSGLTEDMLLPRPSSFQNNVQSQDPQVSTKTGEPLM
SESTSNIDQGPRDHSVQLPKPVHKPNRWCFYSSCEQLDQLIEALNSRGHRESALKETLLQEKSRICAQLARFSEEKFHFSDKPQPDSKPT
YSRGRSSNAYDPSQMCAEKQLELRLRDFLLDIEDRIYQGTLGAIKVTDRHIWRSALESGRYELLSEENKENGIIKTVNEDVEEMEIDEQT
KVIVKDRLLGIKTETPSTVSTNASTPQSVSSVVHYLAMALFQIEQGIERRFLKAPLDASDSGRSYKTVLDRWRESLLSSASLSQVFLHLS
TLDRSVIWSKSILNARCKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECRPKQRSRRLSSRQRPSLESDEDVEDSMGG
EDDEVDGDEEEGQSEEEEYEVEQDEDDSQEEEEVSLPKRGRPQVRLPVKTRGKLSSSFSSRGQQQEPGRYPSRSQQSTPKTTVSSKTGRS
LRKINSAPPTETKSLRIASRSTRHSHGPLQADVFVELLSPRRKRRGRKSANNTPENSPNFPNFRVIATKSSEQSRSVNIASKLSLQESES
KRRCRKRQSPEPSPVTLGRRSSGRQGGVHELSAFEQLVVELVRHDDSWPFLKLVSKIQVPDYYDIIKKPIALNIIREKVNKCEYKLASEF

--------------------------------------------------------------
>43713_43713_6_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000407293_BAZ1A_chr14_35272194_ENST00000382422_length(transcript)=5441nt_BP=720nt
TTCCGGGCCCTGGCGTCTCGTCTCCTTACCCTGGGGCTACCCTTGCCCCGTCCTACTGCCCGCGGTTAACCCGCCGCGAGCCGCCTCTCC
CCTCCCCGCCCGACTCAACCCTGCCCTCCCCCGTGCTTTGCAGACGCCGCCCGGGGGCCCAGGCGGCTGATGCGTGTGGGCCTCGCGCTG
ATCTTGGTGGGCCACGTGAACCTGCTGCTGGGGGCCGTGCTGCATGGCACCGTCCTGCGGCACGTGGCCAATCCCCGCGGCGCTGTCACG
CCGGAGTACACCGTAGCCAATGTCATCTCTGTCGGCTCGGGGCTGCTGAGCGTTTCCGTGGGACTTGTGGCCCTCCTGGCGTCCAGGAAC
CTTCTTCGCCCTCCACTGCACTGGGTCCTGCTGGCACTAGCTCTGGTGAACCTGCTCTTGTCCGTTGCCTGCTCCCTGGGCCTCCTTCTT
GCTGTGTCACTCACTGTGGCCAACGGTGGCCGCCGCCTTATTGCTGACTGCCACCCAGGACTGCTGGATCCTCTGGTACCACTGGATGAG
GGGCCGGGACATACTGACTGCCCCTTTGACCCCACAAGAATCTATGATACAGCCTTGGCTCTCTGGATCCCTTCTTTGCTCATGTCTGCA
GGGGAGGCTGCTCTATCTGGTTACTGCTGTGTGGCTGCACTCACTCTACGTGGAGTTGGGCCCTGCAGGAAGGACGGACTTCAGGGGCAG
GCATCATCTCTTTCAACGTATAAAATAGCAGAACAAGATTTTTCTTATTTCTTCCCTGATGATCCACCCACATTTATCTTCAGTCCTGCT
AACAGACGAAGAGGGAGACCTCCCAAACGAATACATATTAGTCAAGAGGACAATGTTGCTAATAAACAGACTCTTGCAAGTTATAGGAGC
AAAGCTACTAAAGAAAGAGATAAACTTTTGAAACAAGAAGAAATGAAGTCACTGGCTTTTGAAAAGGCTAAATTAAAAAGAGAAAAAGCA
GATGCCCTAGAAGCGAAGAAAAAAGAAAAAGAAGATAAAGAGAAAAAGAGGGAAGAATTGAAAAAAATTGTTGAAGAAGAGAGACTAAAG
AAAAAAGAAGAAAAAGAGAGGCTTAAAGTAGAAAGAGAAAAGGAAAGAGAGAAGTTACGTGAAGAAAAGCGAAAGTATGTGGAATACTTA
AAACAGTGGAGTAAACCTAGAGAAGATATGGAATGTGATGACCTTAAGGAACTTCCAGAACCAACACCAGTGAAAACTAGACTACCTCCT
GAAATCTTTGGTGATGCTCTGATGGTTTTGGAGTTCCTTAATGCATTTGGGGAACTTTTTGATCTTCAAGATGAGTTTCCTGATGGAGTA
ACCCTAGAAGTATTAGAGGAAGCTCTTGTAGGAAATGACAGTGAAGGCCCACTGTGTGAATTGCTTTTTTTCTTCCTGACTGCAATCTTC
CAGGCAATAGCTGAAGAAGAAGAGGAAGTAGCCAAAGAGCAACTAACTGATGCTGACACCAAAGATTTAACAGAGGCTTTGGATGAAGAT
GCAGACCCCACAAAATCTGCACTGTCTGCAGTTGCATCTTTGGCAGCTGCATGGCCACAGTTACACCAGGGCTGCAGTTTGAAAAGTTTG
GATCTTGATAGCTGCACTCTTTCAGAAATCCTCAGACTGCACATCTTAGCTTCAGGTGCTGATGTAACATCAGCAAATGCAAAGTATAGA
TATCAAAAACGAGGAGGATTTGATGCTACAGATGATGCTTGTATGGAGCTTCGTTTGAGCAATCCCAGTCTAGTGAAGAAACTGTCAAGC
ACCTCAGTGTATGATTTGACACCAGGAGAAAAAATGAAGATACTCCATGCTCTCTGTGGAAAGCTACTGACCCTAGTTTCAACTAGGGAT
TTTATTGAAGATTATGTTGATATATTACGACAGGCAAAGCAGGAGTTCCGGGAATTAAAAGCAGAACAACATCGAAAAGAGAGGGAAGAA
GCAGCTGCCAGAATTCGTAAAAGGAAGGAAGAAAAACTTAAGGAGCAAGAACAAAAAATGAAAGAGAAACAAGAAAAACTGAAAGAAGAT
GAGCAAAGAAATTCAACGGCAGATATATCTATTGGGGAGGAAGAAAGGGAAGATTTTGATACTAGCATTGAGAGCAAAGACACAGAGCAA
AAGGAATTAGATCAAGATATGGTCACTGAAGATGAAGATGACCCAGGATCACATAAAAGAGGCAGAAGGGGGAAAAGAGGACAAAATGGA
TTTAAAGAATTTACAAGGCAAGAACAGATCAACTGTGTAACAAGAGAGCCTCTTACTGCTGATGAGGAAGAAGCATTAAAACAGGAACAC
CAACGAAAAGAGAAAGAGCTCTTAGAAAAAATCCAAAGTGCCATAGCCTGTACCAATATCTTTCCCTTGGGTCGCGACCGCATGTATAGA
CGATACTGGATTTTCCCTTCTATTCCTGGACTCTTTATTGAAGAGGATTATTCTGGTCTTACTGAAGACATGCTGTTGCCTAGACCTTCA
TCATTTCAGAATAATGTACAGTCTCAAGATCCTCAGGTATCCACTAAAACTGGAGAGCCTTTGATGTCTGAATCTACCTCCAACATTGAC
CAAGGTCCACGTGACCATTCTGTGCAGCTGCCAAAACCAGTGCATAAGCCAAATCGGTGGTGCTTTTACAGTTCTTGTGAACAGCTAGAC
CAGCTTATTGAAGCTCTTAATTCTAGAGGACATAGAGAAAGTGCCTTAAAAGAAACTTTGTTACAAGAGAAAAGCAGAATATGTGCACAG
CTAGCCCGTTTTTCTGAAGAGAAATTTCATTTTTCAGACAAACCTCAGCCTGATAGCAAACCAACATATAGTCGGGGAAGATCTTCCAAT
GCATATGATCCATCTCAGATGTGTGCAGAAAAGCAACTTGAACTAAGGCTGAGAGATTTTCTTTTAGATATTGAAGATAGAATCTACCAA
GGAACATTAGGAGCCATCAAGGTTACAGATCGACATATCTGGAGATCAGCATTAGAAAGTGGACGGTATGAGCTGTTAAGTGAGGAAAAC
AAGGAAAATGGGATAATTAAAACTGTGAATGAAGACGTAGAAGAGATGGAAATTGATGAACAAACAAAGGTCATAGTAAAAGACAGACTT
TTGGGGATAAAAACAGAAACTCCAAGTACTGTATCAACAAATGCAAGTACACCACAATCAGTGAGCAGTGTGGTTCATTATCTGGCAATG
GCACTCTTTCAAATAGAGCAGGGCATTGAGCGGCGTTTTCTGAAAGCTCCACTTGATGCCAGTGACAGTGGGCGTTCTTATAAAACAGTT
CTGGACCGTTGGAGAGAGTCTCTCCTTTCTTCTGCTAGTCTATCCCAAGTTTTTCTTCACCTATCCACCTTGGATCGTAGCGTGATATGG
TCTAAATCTATACTGAATGCGCGTTGCAAGATATGTCGAAAGAAAGGCGATGCTGAAAACATGGTTCTTTGTGATGGCTGTGATAGGGGT
CATCATACCTACTGTGTTCGACCAAAGCTCAAGACTGTGCCTGAAGGAGACTGGTTTTGTCCAGAATGTCGACCAAAGCAACGTTCTAGA
AGACTCTCCTCTAGACAGAGACCATCCTTGGAAAGTGATGAAGATGTGGAAGACAGTATGGGAGGTGAGGATGATGAAGTTGATGGCGAT
GAAGAAGAAGGTCAAAGTGAGGAGGAAGAGTATGAGGTAGAACAAGATGAAGATGACTCTCAAGAAGAGGAAGAAGTCAGCCTACCCAAA
CGAGGAAGACCACAAGTTAGATTGCCAGTTAAAACAAGAGGGAAACTTAGCTCTTCTTTCTCAAGTCGTGGCCAACAACAAGAACCTGGA
AGATACCCTTCAAGGAGTCAGCAGAGCACACCCAAAACAACTGTTTCTTCTAAAACTGGTAGAAGCCTAAGAAAGATAAACTCTGCTCCT
CCTACAGAAACAAAATCTTTAAGAATTGCCAGTCGTTCTACTCGCCACAGTCATGGCCCACTGCAAGCAGATGTATTTGTGGAATTGCTT
AGTCCTCGTAGAAAACGCAGAGGCAGGAAAAGTGCTAATAATACACCAGAAAATAGTCCCAACTTCCCTAACTTCAGAGTCATTGCCACA
AAGTCAAGTGAACAGTCAAGATCTGTAAATATTGCTTCAAAACTTTCTCTCCAAGAGAGTGAATCCAAAAGAAGATGCAGAAAAAGACAA
TCTCCAGAGCCATCGCCTGTGACACTGGGTCGAAGGAGTTCTGGCCGACAGGGAGGAGTTCATGAATTGTCTGCTTTTGAACAACTTGTT
GTAGAATTGGTACGACATGATGACAGCTGGCCTTTTTTGAAACTTGTTTCTAAAATCCAGGTCCCAGACTACTATGACATCATCAAAAAG
CCCATTGCCTTAAATATAATTCGTGAAAAAGTGAATAAGTGTGAATATAAATTAGCATCTGAGTTTATTGATGACATTGAGTTAATGTTT
TCGAACTGCTTTGAATACAACCCTCGTAACACAAGTGAAGCAAAAGCTGGAACTAGGCTTCAAGCATTTTTTCATATTCAGGCTCAAAAG
CTTGGACTCCACGTCACACCCAGTAATGTGGACCAAGTTAGCACACCACCGGCTGCGAAAAAGTCACGAATCTGACTTTGTCCTTCTAAA
GGATATATTTGAAGAAAAACAAATTGTTCATGAAAATGGAACATTAAATCATGCTGTATAAAGCAATAACAATTGATTGACCACATGAAA
GTGTGGCCTGCACTATATTCTCAATTTTAATATTAAGCACTCAGGAGAATGTAGGAAAGATATCCTTTGCTACAGTTTTGTTCAGTATCT
AATAAGTTTGATAGATGTATTGGATACAGTACTGGTTTACAGAGGTTTTTGTACATTTTTGAGATCATTCATGTGTCCAGAGATCTTGGA
AAATATTTTTTCACCCACGATTTATTTTGTTATTGATGATTTTTTTTTAAAGTGGTGGTATTAAGGGAGAGTTATCTACATGGATGAGTC
TTCCGCTATAGCACAGTTTAGAAAAGGTGTTTATGTCTTAATTAATTGTTTGAGTACATTCTTTCAACACTACACATGAATGAATCCAAT
CTTATAACCTTGAAGTGCTGTACCAGTGCTGGCTGCAGGTATTAAGTCCAAGTTTATTAACTAGATATTTATTTAGTATTGAGAGTAATT
TGTGAATTTGTTTTGTATTTATAAAATTTATACCTGAAAAATGTTCCTTAATGTTTTAAACCTTTTACTGTGTTTTTATTCCTCTAACTT
CCTTAATGATCAATCAAAAAAAGTAACACCCTCCCTTTTTCCTGACAGTTCTTTCAGCTTTACAGAACTGTATTATAAGTTTCTATGTAT

>43713_43713_6_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000407293_BAZ1A_chr14_35272194_ENST00000382422_length(amino acids)=1502AA_BP=188
MMRVGLALILVGHVNLLLGAVLHGTVLRHVANPRGAVTPEYTVANVISVGSGLLSVSVGLVALLASRNLLRPPLHWVLLALALVNLLLSV
ACSLGLLLAVSLTVANGGRRLIADCHPGLLDPLVPLDEGPGHTDCPFDPTRIYDTALALWIPSLLMSAGEAALSGYCCVAALTLRGVGPC
RKDGLQGQASSLSTYKIAEQDFSYFFPDDPPTFIFSPANRRRGRPPKRIHISQEDNVANKQTLASYRSKATKERDKLLKQEEMKSLAFEK
AKLKREKADALEAKKKEKEDKEKKREELKKIVEEERLKKKEEKERLKVEREKEREKLREEKRKYVEYLKQWSKPREDMECDDLKELPEPT
PVKTRLPPEIFGDALMVLEFLNAFGELFDLQDEFPDGVTLEVLEEALVGNDSEGPLCELLFFFLTAIFQAIAEEEEEVAKEQLTDADTKD
LTEALDEDADPTKSALSAVASLAAAWPQLHQGCSLKSLDLDSCTLSEILRLHILASGADVTSANAKYRYQKRGGFDATDDACMELRLSNP
SLVKKLSSTSVYDLTPGEKMKILHALCGKLLTLVSTRDFIEDYVDILRQAKQEFRELKAEQHRKEREEAAARIRKRKEEKLKEQEQKMKE
KQEKLKEDEQRNSTADISIGEEEREDFDTSIESKDTEQKELDQDMVTEDEDDPGSHKRGRRGKRGQNGFKEFTRQEQINCVTREPLTADE
EEALKQEHQRKEKELLEKIQSAIACTNIFPLGRDRMYRRYWIFPSIPGLFIEEDYSGLTEDMLLPRPSSFQNNVQSQDPQVSTKTGEPLM
SESTSNIDQGPRDHSVQLPKPVHKPNRWCFYSSCEQLDQLIEALNSRGHRESALKETLLQEKSRICAQLARFSEEKFHFSDKPQPDSKPT
YSRGRSSNAYDPSQMCAEKQLELRLRDFLLDIEDRIYQGTLGAIKVTDRHIWRSALESGRYELLSEENKENGIIKTVNEDVEEMEIDEQT
KVIVKDRLLGIKTETPSTVSTNASTPQSVSSVVHYLAMALFQIEQGIERRFLKAPLDASDSGRSYKTVLDRWRESLLSSASLSQVFLHLS
TLDRSVIWSKSILNARCKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECRPKQRSRRLSSRQRPSLESDEDVEDSMGG
EDDEVDGDEEEGQSEEEEYEVEQDEDDSQEEEEVSLPKRGRPQVRLPVKTRGKLSSSFSSRGQQQEPGRYPSRSQQSTPKTTVSSKTGRS
LRKINSAPPTETKSLRIASRSTRHSHGPLQADVFVELLSPRRKRRGRKSANNTPENSPNFPNFRVIATKSSEQSRSVNIASKLSLQESES
KRRCRKRQSPEPSPVTLGRRSSGRQGGVHELSAFEQLVVELVRHDDSWPFLKLVSKIQVPDYYDIIKKPIALNIIREKVNKCEYKLASEF

--------------------------------------------------------------
>43713_43713_7_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000543753_BAZ1A_chr14_35272194_ENST00000358716_length(transcript)=5288nt_BP=662nt
GGGCCGGGCCCAGGTACAGCGGCCCTGCGGCTGGCGCGGCGGACGGGATGAGGCGCTGCAGTCTCTGCGCTTTCGACGCCGCCCGGGGGC
CCAGGCGGCTGATGCGTGTGGGCCTCGCGCTGATCTTGGTGGGCCACGTGAACCTGCTGCTGGGGGCCGTGCTGCATGGCACCGTCCTGC
GGCACGTGGCCAATCCCCGCGGCGCTGTCACGCCGGAGTACACCGTAGCCAATGTCATCTCTGTCGGCTCGGGGCTGCTGAGCGTTTCCG
TGGGACTTGTGGCCCTCCTGGCGTCCAGGAACCTTCTTCGCCCTCCACTGCACTGGGTCCTGCTGGCACTAGCTCTGGTGAACCTGCTCT
TGTCCGTTGCCTGCTCCCTGGGCCTCCTTCTTGCTGTGTCACTCACTGTGGCCAACGGTGGCCGCCGCCTTATTGCTGACTGCCACCCAG
GACTGCTGGATCCTCTGGTACCACTGGATGAGGGGCCGGGACATACTGACTGCCCCTTTGACCCCACAAGAATCTATGATACAGCCTTGG
CTCTCTGGATCCCTTCTTTGCTCATGTCTGCAGGGGAGGCTGCTCTATCTGGTTACTGCTGTGTGGCTGCACTCACTCTACGTGGAGTTG
GGCCCTGCAGGAAGGACGGACTTCAGGGGCAGGCATCATCTCTTTCAACGTATAAAATAGCAGAACAAGATTTTTCTTATTTCTTCCCTG
ATGATCCACCCACATTTATCTTCAGTCCTGCTAACAGACGAAGAGGGAGACCTCCCAAACGAATACATATTAGTCAAGAGGACAATGTTG
CTAATAAACAGACTCTTGCAAGTTATAGGAGCAAAGCTACTAAAGAAAGAGATAAACTTTTGAAACAAGAAGAAATGAAGTCACTGGCTT
TTGAAAAGGCTAAATTAAAAAGAGAAAAAGCAGATGCCCTAGAAGCGAAGAAAAAAGAAAAAGAAGATAAAGAGAAAAAGAGGGAAGAAT
TGAAAAAAATTGTTGAAGAAGAGAGACTAAAGAAAAAAGAAGAAAAAGAGAGGCTTAAAGTAGAAAGAGAAAAGGAAAGAGAGAAGTTAC
GTGAAGAAAAGCGAAAGTATGTGGAATACTTAAAACAGTGGAGTAAACCTAGAGAAGATATGGAATGTGATGACCTTAAGGAACTTCCAG
AACCAACACCAGTGAAAACTAGACTACCTCCTGAAATCTTTGGTGATGCTCTGATGGTTTTGGAGTTCCTTAATGCATTTGGGGAACTTT
TTGATCTTCAAGATGAGTTTCCTGATGGAGTAACCCTAGAAGTATTAGAGGAAGCTCTTGTAGGAAATGACAGTGAAGGCCCACTGTGTG
AATTGCTTTTTTTCTTCCTGACTGCAATCTTCCAGGCAATAGCTGAAGAAGAAGAGGAAGTAGCCAAAGAGCAACTAACTGATGCTGACA
CCAAAGGCTGCAGTTTGAAAAGTTTGGATCTTGATAGCTGCACTCTTTCAGAAATCCTCAGACTGCACATCTTAGCTTCAGGTGCTGATG
TAACATCAGCAAATGCAAAGTATAGATATCAAAAACGAGGAGGATTTGATGCTACAGATGATGCTTGTATGGAGCTTCGTTTGAGCAATC
CCAGTCTAGTGAAGAAACTGTCAAGCACCTCAGTGTATGATTTGACACCAGGAGAAAAAATGAAGATACTCCATGCTCTCTGTGGAAAGC
TACTGACCCTAGTTTCAACTAGGGATTTTATTGAAGATTATGTTGATATATTACGACAGGCAAAGCAGGAGTTCCGGGAATTAAAAGCAG
AACAACATCGAAAAGAGAGGGAAGAAGCAGCTGCCAGAATTCGTAAAAGGAAGGAAGAAAAACTTAAGGAGCAAGAACAAAAAATGAAAG
AGAAACAAGAAAAACTGAAAGAAGATGAGCAAAGAAATTCAACGGCAGATATATCTATTGGGGAGGAAGAAAGGGAAGATTTTGATACTA
GCATTGAGAGCAAAGACACAGAGCAAAAGGAATTAGATCAAGATATGGTCACTGAAGATGAAGATGACCCAGGATCACATAAAAGAGGCA
GAAGGGGGAAAAGAGGACAAAATGGATTTAAAGAATTTACAAGGCAAGAACAGATCAACTGTGTAACAAGAGAGCCTCTTACTGCTGATG
AGGAAGAAGCATTAAAACAGGAACACCAACGAAAAGAGAAAGAGCTCTTAGAAAAAATCCAAAGTGCCATAGCCTGTACCAATATCTTTC
CCTTGGGTCGCGACCGCATGTATAGACGATACTGGATTTTCCCTTCTATTCCTGGACTCTTTATTGAAGAGGATTATTCTGGTCTTACTG
AAGACATGCTGTTGCCTAGACCTTCATCATTTCAGAATAATGTACAGTCTCAAGATCCTCAGGTATCCACTAAAACTGGAGAGCCTTTGA
TGTCTGAATCTACCTCCAACATTGACCAAGGTCCACGTGACCATTCTGTGCAGCTGCCAAAACCAGTGCATAAGCCAAATCGGTGGTGCT
TTTACAGTTCTTGTGAACAGCTAGACCAGCTTATTGAAGCTCTTAATTCTAGAGGACATAGAGAAAGTGCCTTAAAAGAAACTTTGTTAC
AAGAGAAAAGCAGAATATGTGCACAGCTAGCCCGTTTTTCTGAAGAGAAATTTCATTTTTCAGACAAACCTCAGCCTGATAGCAAACCAA
CATATAGTCGGGGAAGATCTTCCAATGCATATGATCCATCTCAGATGTGTGCAGAAAAGCAACTTGAACTAAGGCTGAGAGATTTTCTTT
TAGATATTGAAGATAGAATCTACCAAGGAACATTAGGAGCCATCAAGGTTACAGATCGACATATCTGGAGATCAGCATTAGAAAGTGGAC
GGTATGAGCTGTTAAGTGAGGAAAACAAGGAAAATGGGATAATTAAAACTGTGAATGAAGACGTAGAAGAGATGGAAATTGATGAACAAA
CAAAGGTCATAGTAAAAGACAGACTTTTGGGGATAAAAACAGAAACTCCAAGTACTGTATCAACAAATGCAAGTACACCACAATCAGTGA
GCAGTGTGGTTCATTATCTGGCAATGGCACTCTTTCAAATAGAGCAGGGCATTGAGCGGCGTTTTCTGAAAGCTCCACTTGATGCCAGTG
ACAGTGGGCGTTCTTATAAAACAGTTCTGGACCGTTGGAGAGAGTCTCTCCTTTCTTCTGCTAGTCTATCCCAAGTTTTTCTTCACCTAT
CCACCTTGGATCGTAGCGTGATATGGTCTAAATCTATACTGAATGCGCGTTGCAAGATATGTCGAAAGAAAGGCGATGCTGAAAACATGG
TTCTTTGTGATGGCTGTGATAGGGGTCATCATACCTACTGTGTTCGACCAAAGCTCAAGACTGTGCCTGAAGGAGACTGGTTTTGTCCAG
AATGTCGACCAAAGCAACGTTCTAGAAGACTCTCCTCTAGACAGAGACCATCCTTGGAAAGTGATGAAGATGTGGAAGACAGTATGGGAG
GTGAGGATGATGAAGTTGATGGCGATGAAGAAGAAGGTCAAAGTGAGGAGGAAGAGTATGAGGTAGAACAAGATGAAGATGACTCTCAAG
AAGAGGAAGAAGTCAGCCTACCCAAACGAGGAAGACCACAAGTTAGATTGCCAGTTAAAACAAGAGGGAAACTTAGCTCTTCTTTCTCAA
GTCGTGGCCAACAACAAGAACCTGGAAGATACCCTTCAAGGAGTCAGCAGAGCACACCCAAAACAACTGTTTCTTCTAAAACTGGTAGAA
GCCTAAGAAAGATAAACTCTGCTCCTCCTACAGAAACAAAATCTTTAAGAATTGCCAGTCGTTCTACTCGCCACAGTCATGGCCCACTGC
AAGCAGATGTATTTGTGGAATTGCTTAGTCCTCGTAGAAAACGCAGAGGCAGGAAAAGTGCTAATAATACACCAGAAAATAGTCCCAACT
TCCCTAACTTCAGAGTCATTGCCACAAAGTCAAGTGAACAGTCAAGATCTGTAAATATTGCTTCAAAACTTTCTCTCCAAGAGAGTGAAT
CCAAAAGAAGATGCAGAAAAAGACAATCTCCAGAGCCATCGCCTGTGACACTGGGTCGAAGGAGTTCTGGCCGACAGGGAGGAGTTCATG
AATTGTCTGCTTTTGAACAACTTGTTGTAGAATTGGTACGACATGATGACAGCTGGCCTTTTTTGAAACTTGTTTCTAAAATCCAGGTCC
CAGACTACTATGACATCATCAAAAAGCCCATTGCCTTAAATATAATTCGTGAAAAAGTGAATAAGTGTGAATATAAATTAGCATCTGAGT
TTATTGATGACATTGAGTTAATGTTTTCGAACTGCTTTGAATACAACCCTCGTAACACAAGTGAAGCAAAAGCTGGAACTAGGCTTCAAG
CATTTTTTCATATTCAGGCTCAAAAGCTTGGACTCCACGTCACACCCAGTAATGTGGACCAAGTTAGCACACCACCGGCTGCGAAAAAGT
CACGAATCTGACTTTGTCCTTCTAAAGGATATATTTGAAGAAAAACAAATTGTTCATGAAAATGGAACATTAAATCATGCTGTATAAAGC
AATAACAATTGATTGACCACATGAAAGTGTGGCCTGCACTATATTCTCAATTTTAATATTAAGCACTCAGGAGAATGTAGGAAAGATATC
CTTTGCTACAGTTTTGTTCAGTATCTAATAAGTTTGATAGATGTATTGGATACAGTACTGGTTTACAGAGGTTTTTGTACATTTTTGAGA
TCATTCATGTGTCCAGAGATCTTGGAAAATATTTTTTCACCCACGATTTATTTTGTTATTGATGATTTTTTTTTAAAGTGGTGGTATTAA
GGGAGAGTTATCTACATGGATGAGTCTTCCGCTATAGCACAGTTTAGAAAAGGTGTTTATGTCTTAATTAATTGTTTGAGTACATTCTTT
CAACACTACACATGAATGAATCCAATCTTATAACCTTGAAGTGCTGTACCAGTGCTGGCTGCAGGTATTAAGTCCAAGTTTATTAACTAG
ATATTTATTTAGTATTGAGAGTAATTTGTGAATTTGTTTTGTATTTATAAAATTTATACCTGAAAAATGTTCCTTAATGTTTTAAACCTT
TTACTGTGTTTTTATTCCTCTAACTTCCTTAATGATCAATCAAAAAAAGTAACACCCTCCCTTTTTCCTGACAGTTCTTTCAGCTTTACA

>43713_43713_7_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000543753_BAZ1A_chr14_35272194_ENST00000358716_length(amino acids)=1487AA_BP=205
MRRCSLCAFDAARGPRRLMRVGLALILVGHVNLLLGAVLHGTVLRHVANPRGAVTPEYTVANVISVGSGLLSVSVGLVALLASRNLLRPP
LHWVLLALALVNLLLSVACSLGLLLAVSLTVANGGRRLIADCHPGLLDPLVPLDEGPGHTDCPFDPTRIYDTALALWIPSLLMSAGEAAL
SGYCCVAALTLRGVGPCRKDGLQGQASSLSTYKIAEQDFSYFFPDDPPTFIFSPANRRRGRPPKRIHISQEDNVANKQTLASYRSKATKE
RDKLLKQEEMKSLAFEKAKLKREKADALEAKKKEKEDKEKKREELKKIVEEERLKKKEEKERLKVEREKEREKLREEKRKYVEYLKQWSK
PREDMECDDLKELPEPTPVKTRLPPEIFGDALMVLEFLNAFGELFDLQDEFPDGVTLEVLEEALVGNDSEGPLCELLFFFLTAIFQAIAE
EEEEVAKEQLTDADTKGCSLKSLDLDSCTLSEILRLHILASGADVTSANAKYRYQKRGGFDATDDACMELRLSNPSLVKKLSSTSVYDLT
PGEKMKILHALCGKLLTLVSTRDFIEDYVDILRQAKQEFRELKAEQHRKEREEAAARIRKRKEEKLKEQEQKMKEKQEKLKEDEQRNSTA
DISIGEEEREDFDTSIESKDTEQKELDQDMVTEDEDDPGSHKRGRRGKRGQNGFKEFTRQEQINCVTREPLTADEEEALKQEHQRKEKEL
LEKIQSAIACTNIFPLGRDRMYRRYWIFPSIPGLFIEEDYSGLTEDMLLPRPSSFQNNVQSQDPQVSTKTGEPLMSESTSNIDQGPRDHS
VQLPKPVHKPNRWCFYSSCEQLDQLIEALNSRGHRESALKETLLQEKSRICAQLARFSEEKFHFSDKPQPDSKPTYSRGRSSNAYDPSQM
CAEKQLELRLRDFLLDIEDRIYQGTLGAIKVTDRHIWRSALESGRYELLSEENKENGIIKTVNEDVEEMEIDEQTKVIVKDRLLGIKTET
PSTVSTNASTPQSVSSVVHYLAMALFQIEQGIERRFLKAPLDASDSGRSYKTVLDRWRESLLSSASLSQVFLHLSTLDRSVIWSKSILNA
RCKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECRPKQRSRRLSSRQRPSLESDEDVEDSMGGEDDEVDGDEEEGQSE
EEEYEVEQDEDDSQEEEEVSLPKRGRPQVRLPVKTRGKLSSSFSSRGQQQEPGRYPSRSQQSTPKTTVSSKTGRSLRKINSAPPTETKSL
RIASRSTRHSHGPLQADVFVELLSPRRKRRGRKSANNTPENSPNFPNFRVIATKSSEQSRSVNIASKLSLQESESKRRCRKRQSPEPSPV
TLGRRSSGRQGGVHELSAFEQLVVELVRHDDSWPFLKLVSKIQVPDYYDIIKKPIALNIIREKVNKCEYKLASEFIDDIELMFSNCFEYN

--------------------------------------------------------------
>43713_43713_8_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000543753_BAZ1A_chr14_35272194_ENST00000360310_length(transcript)=5378nt_BP=662nt
GGGCCGGGCCCAGGTACAGCGGCCCTGCGGCTGGCGCGGCGGACGGGATGAGGCGCTGCAGTCTCTGCGCTTTCGACGCCGCCCGGGGGC
CCAGGCGGCTGATGCGTGTGGGCCTCGCGCTGATCTTGGTGGGCCACGTGAACCTGCTGCTGGGGGCCGTGCTGCATGGCACCGTCCTGC
GGCACGTGGCCAATCCCCGCGGCGCTGTCACGCCGGAGTACACCGTAGCCAATGTCATCTCTGTCGGCTCGGGGCTGCTGAGCGTTTCCG
TGGGACTTGTGGCCCTCCTGGCGTCCAGGAACCTTCTTCGCCCTCCACTGCACTGGGTCCTGCTGGCACTAGCTCTGGTGAACCTGCTCT
TGTCCGTTGCCTGCTCCCTGGGCCTCCTTCTTGCTGTGTCACTCACTGTGGCCAACGGTGGCCGCCGCCTTATTGCTGACTGCCACCCAG
GACTGCTGGATCCTCTGGTACCACTGGATGAGGGGCCGGGACATACTGACTGCCCCTTTGACCCCACAAGAATCTATGATACAGCCTTGG
CTCTCTGGATCCCTTCTTTGCTCATGTCTGCAGGGGAGGCTGCTCTATCTGGTTACTGCTGTGTGGCTGCACTCACTCTACGTGGAGTTG
GGCCCTGCAGGAAGGACGGACTTCAGGGGCAGGCATCATCTCTTTCAACGTATAAAATAGCAGAACAAGATTTTTCTTATTTCTTCCCTG
ATGATCCACCCACATTTATCTTCAGTCCTGCTAACAGACGAAGAGGGAGACCTCCCAAACGAATACATATTAGTCAAGAGGACAATGTTG
CTAATAAACAGACTCTTGCAAGTTATAGGAGCAAAGCTACTAAAGAAAGAGATAAACTTTTGAAACAAGAAGAAATGAAGTCACTGGCTT
TTGAAAAGGCTAAATTAAAAAGAGAAAAAGCAGATGCCCTAGAAGCGAAGAAAAAAGAAAAAGAAGATAAAGAGAAAAAGAGGGAAGAAT
TGAAAAAAATTGTTGAAGAAGAGAGACTAAAGAAAAAAGAAGAAAAAGAGAGGCTTAAAGTAGAAAGAGAAAAGGAAAGAGAGAAGTTAC
GTGAAGAAAAGCGAAAGTATGTGGAATACTTAAAACAGTGGAGTAAACCTAGAGAAGATATGGAATGTGATGACCTTAAGGAACTTCCAG
AACCAACACCAGTGAAAACTAGACTACCTCCTGAAATCTTTGGTGATGCTCTGATGGTTTTGGAGTTCCTTAATGCATTTGGGGAACTTT
TTGATCTTCAAGATGAGTTTCCTGATGGAGTAACCCTAGAAGTATTAGAGGAAGCTCTTGTAGGAAATGACAGTGAAGGCCCACTGTGTG
AATTGCTTTTTTTCTTCCTGACTGCAATCTTCCAGGCAATAGCTGAAGAAGAAGAGGAAGTAGCCAAAGAGCAACTAACTGATGCTGACA
CCAAAGATTTAACAGAGGCTTTGGATGAAGATGCAGACCCCACAAAATCTGCACTGTCTGCAGTTGCATCTTTGGCAGCTGCATGGCCAC
AGTTACACCAGGGCTGCAGTTTGAAAAGTTTGGATCTTGATAGCTGCACTCTTTCAGAAATCCTCAGACTGCACATCTTAGCTTCAGGTG
CTGATGTAACATCAGCAAATGCAAAGTATAGATATCAAAAACGAGGAGGATTTGATGCTACAGATGATGCTTGTATGGAGCTTCGTTTGA
GCAATCCCAGTCTAGTGAAGAAACTGTCAAGCACCTCAGTGTATGATTTGACACCAGGAGAAAAAATGAAGATACTCCATGCTCTCTGTG
GAAAGCTACTGACCCTAGTTTCAACTAGGGATTTTATTGAAGATTATGTTGATATATTACGACAGGCAAAGCAGGAGTTCCGGGAATTAA
AAGCAGAACAACATCGAAAAGAGAGGGAAGAAGCAGCTGCCAGAATTCGTAAAAGGAAGGAAGAAAAACTTAAGGAGCAAGAACAAAAAA
TGAAAGAGAAACAAGAAAAACTGAAAGAAGATGAGCAAAGAAATTCAACGGCAGATATATCTATTGGGGAGGAAGAAAGGGAAGATTTTG
ATACTAGCATTGAGAGCAAAGACACAGAGCAAAAGGAATTAGATCAAGATATGGTCACTGAAGATGAAGATGACCCAGGATCACATAAAA
GAGGCAGAAGGGGGAAAAGAGGACAAAATGGATTTAAAGAATTTACAAGGCAAGAACAGATCAACTGTGTAACAAGAGAGCCTCTTACTG
CTGATGAGGAAGAAGCATTAAAACAGGAACACCAACGAAAAGAGAAAGAGCTCTTAGAAAAAATCCAAAGTGCCATAGCCTGTACCAATA
TCTTTCCCTTGGGTCGCGACCGCATGTATAGACGATACTGGATTTTCCCTTCTATTCCTGGACTCTTTATTGAAGAGGATTATTCTGGTC
TTACTGAAGACATGCTGTTGCCTAGACCTTCATCATTTCAGAATAATGTACAGTCTCAAGATCCTCAGGTATCCACTAAAACTGGAGAGC
CTTTGATGTCTGAATCTACCTCCAACATTGACCAAGGTCCACGTGACCATTCTGTGCAGCTGCCAAAACCAGTGCATAAGCCAAATCGGT
GGTGCTTTTACAGTTCTTGTGAACAGCTAGACCAGCTTATTGAAGCTCTTAATTCTAGAGGACATAGAGAAAGTGCCTTAAAAGAAACTT
TGTTACAAGAGAAAAGCAGAATATGTGCACAGCTAGCCCGTTTTTCTGAAGAGAAATTTCATTTTTCAGACAAACCTCAGCCTGATAGCA
AACCAACATATAGTCGGGGAAGATCTTCCAATGCATATGATCCATCTCAGATGTGTGCAGAAAAGCAACTTGAACTAAGGCTGAGAGATT
TTCTTTTAGATATTGAAGATAGAATCTACCAAGGAACATTAGGAGCCATCAAGGTTACAGATCGACATATCTGGAGATCAGCATTAGAAA
GTGGACGGTATGAGCTGTTAAGTGAGGAAAACAAGGAAAATGGGATAATTAAAACTGTGAATGAAGACGTAGAAGAGATGGAAATTGATG
AACAAACAAAGGTCATAGTAAAAGACAGACTTTTGGGGATAAAAACAGAAACTCCAAGTACTGTATCAACAAATGCAAGTACACCACAAT
CAGTGAGCAGTGTGGTTCATTATCTGGCAATGGCACTCTTTCAAATAGAGCAGGGCATTGAGCGGCGTTTTCTGAAAGCTCCACTTGATG
CCAGTGACAGTGGGCGTTCTTATAAAACAGTTCTGGACCGTTGGAGAGAGTCTCTCCTTTCTTCTGCTAGTCTATCCCAAGTTTTTCTTC
ACCTATCCACCTTGGATCGTAGCGTGATATGGTCTAAATCTATACTGAATGCGCGTTGCAAGATATGTCGAAAGAAAGGCGATGCTGAAA
ACATGGTTCTTTGTGATGGCTGTGATAGGGGTCATCATACCTACTGTGTTCGACCAAAGCTCAAGACTGTGCCTGAAGGAGACTGGTTTT
GTCCAGAATGTCGACCAAAGCAACGTTCTAGAAGACTCTCCTCTAGACAGAGACCATCCTTGGAAAGTGATGAAGATGTGGAAGACAGTA
TGGGAGGTGAGGATGATGAAGTTGATGGCGATGAAGAAGAAGGTCAAAGTGAGGAGGAAGAGTATGAGGTAGAACAAGATGAAGATGACT
CTCAAGAAGAGGAAGAAGTCAGCCTACCCAAACGAGGAAGACCACAAGTTAGATTGCCAGTTAAAACAAGAGGGAAACTTAGCTCTTCTT
TCTCAAGTCGTGGCCAACAACAAGAACCTGGAAGATACCCTTCAAGGAGTCAGCAGAGCACACCCAAAACAACTGTTTCTTCTAAAACTG
GTAGAAGCCTAAGAAAGATAAACTCTGCTCCTCCTACAGAAACAAAATCTTTAAGAATTGCCAGTCGTTCTACTCGCCACAGTCATGGCC
CACTGCAAGCAGATGTATTTGTGGAATTGCTTAGTCCTCGTAGAAAACGCAGAGGCAGGAAAAGTGCTAATAATACACCAGAAAATAGTC
CCAACTTCCCTAACTTCAGAGTCATTGCCACAAAGTCAAGTGAACAGTCAAGATCTGTAAATATTGCTTCAAAACTTTCTCTCCAAGAGA
GTGAATCCAAAAGAAGATGCAGAAAAAGACAATCTCCAGAGCCATCGCCTGTGACACTGGGTCGAAGGAGTTCTGGCCGACAGGGAGGAG
TTCATGAATTGTCTGCTTTTGAACAACTTGTTGTAGAATTGGTACGACATGATGACAGCTGGCCTTTTTTGAAACTTGTTTCTAAAATCC
AGGTCCCAGACTACTATGACATCATCAAAAAGCCCATTGCCTTAAATATAATTCGTGAAAAAGTGAATAAGTGTGAATATAAATTAGCAT
CTGAGTTTATTGATGACATTGAGTTAATGTTTTCGAACTGCTTTGAATACAACCCTCGTAACACAAGTGAAGCAAAAGCTGGAACTAGGC
TTCAAGCATTTTTTCATATTCAGGCTCAAAAGCTTGGACTCCACGTCACACCCAGTAATGTGGACCAAGTTAGCACACCACCGGCTGCGA
AAAAGTCACGAATCTGACTTTGTCCTTCTAAAGGATATATTTGAAGAAAAACAAATTGTTCATGAAAATGGAACATTAAATCATGCTGTA
TAAAGCAATAACAATTGATTGACCACATGAAAGTGTGGCCTGCACTATATTCTCAATTTTAATATTAAGCACTCAGGAGAATGTAGGAAA
GATATCCTTTGCTACAGTTTTGTTCAGTATCTAATAAGTTTGATAGATGTATTGGATACAGTACTGGTTTACAGAGGTTTTTGTACATTT
TTGAGATCATTCATGTGTCCAGAGATCTTGGAAAATATTTTTTCACCCACGATTTATTTTGTTATTGATGATTTTTTTTTAAAGTGGTGG
TATTAAGGGAGAGTTATCTACATGGATGAGTCTTCCGCTATAGCACAGTTTAGAAAAGGTGTTTATGTCTTAATTAATTGTTTGAGTACA
TTCTTTCAACACTACACATGAATGAATCCAATCTTATAACCTTGAAGTGCTGTACCAGTGCTGGCTGCAGGTATTAAGTCCAAGTTTATT
AACTAGATATTTATTTAGTATTGAGAGTAATTTGTGAATTTGTTTTGTATTTATAAAATTTATACCTGAAAAATGTTCCTTAATGTTTTA
AACCTTTTACTGTGTTTTTATTCCTCTAACTTCCTTAATGATCAATCAAAAAAAGTAACACCCTCCCTTTTTCCTGACAGTTCTTTCAGC

>43713_43713_8_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000543753_BAZ1A_chr14_35272194_ENST00000360310_length(amino acids)=1519AA_BP=205
MRRCSLCAFDAARGPRRLMRVGLALILVGHVNLLLGAVLHGTVLRHVANPRGAVTPEYTVANVISVGSGLLSVSVGLVALLASRNLLRPP
LHWVLLALALVNLLLSVACSLGLLLAVSLTVANGGRRLIADCHPGLLDPLVPLDEGPGHTDCPFDPTRIYDTALALWIPSLLMSAGEAAL
SGYCCVAALTLRGVGPCRKDGLQGQASSLSTYKIAEQDFSYFFPDDPPTFIFSPANRRRGRPPKRIHISQEDNVANKQTLASYRSKATKE
RDKLLKQEEMKSLAFEKAKLKREKADALEAKKKEKEDKEKKREELKKIVEEERLKKKEEKERLKVEREKEREKLREEKRKYVEYLKQWSK
PREDMECDDLKELPEPTPVKTRLPPEIFGDALMVLEFLNAFGELFDLQDEFPDGVTLEVLEEALVGNDSEGPLCELLFFFLTAIFQAIAE
EEEEVAKEQLTDADTKDLTEALDEDADPTKSALSAVASLAAAWPQLHQGCSLKSLDLDSCTLSEILRLHILASGADVTSANAKYRYQKRG
GFDATDDACMELRLSNPSLVKKLSSTSVYDLTPGEKMKILHALCGKLLTLVSTRDFIEDYVDILRQAKQEFRELKAEQHRKEREEAAARI
RKRKEEKLKEQEQKMKEKQEKLKEDEQRNSTADISIGEEEREDFDTSIESKDTEQKELDQDMVTEDEDDPGSHKRGRRGKRGQNGFKEFT
RQEQINCVTREPLTADEEEALKQEHQRKEKELLEKIQSAIACTNIFPLGRDRMYRRYWIFPSIPGLFIEEDYSGLTEDMLLPRPSSFQNN
VQSQDPQVSTKTGEPLMSESTSNIDQGPRDHSVQLPKPVHKPNRWCFYSSCEQLDQLIEALNSRGHRESALKETLLQEKSRICAQLARFS
EEKFHFSDKPQPDSKPTYSRGRSSNAYDPSQMCAEKQLELRLRDFLLDIEDRIYQGTLGAIKVTDRHIWRSALESGRYELLSEENKENGI
IKTVNEDVEEMEIDEQTKVIVKDRLLGIKTETPSTVSTNASTPQSVSSVVHYLAMALFQIEQGIERRFLKAPLDASDSGRSYKTVLDRWR
ESLLSSASLSQVFLHLSTLDRSVIWSKSILNARCKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECRPKQRSRRLSSR
QRPSLESDEDVEDSMGGEDDEVDGDEEEGQSEEEEYEVEQDEDDSQEEEEVSLPKRGRPQVRLPVKTRGKLSSSFSSRGQQQEPGRYPSR
SQQSTPKTTVSSKTGRSLRKINSAPPTETKSLRIASRSTRHSHGPLQADVFVELLSPRRKRRGRKSANNTPENSPNFPNFRVIATKSSEQ
SRSVNIASKLSLQESESKRRCRKRQSPEPSPVTLGRRSSGRQGGVHELSAFEQLVVELVRHDDSWPFLKLVSKIQVPDYYDIIKKPIALN

--------------------------------------------------------------
>43713_43713_9_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000543753_BAZ1A_chr14_35272194_ENST00000382422_length(transcript)=5383nt_BP=662nt
GGGCCGGGCCCAGGTACAGCGGCCCTGCGGCTGGCGCGGCGGACGGGATGAGGCGCTGCAGTCTCTGCGCTTTCGACGCCGCCCGGGGGC
CCAGGCGGCTGATGCGTGTGGGCCTCGCGCTGATCTTGGTGGGCCACGTGAACCTGCTGCTGGGGGCCGTGCTGCATGGCACCGTCCTGC
GGCACGTGGCCAATCCCCGCGGCGCTGTCACGCCGGAGTACACCGTAGCCAATGTCATCTCTGTCGGCTCGGGGCTGCTGAGCGTTTCCG
TGGGACTTGTGGCCCTCCTGGCGTCCAGGAACCTTCTTCGCCCTCCACTGCACTGGGTCCTGCTGGCACTAGCTCTGGTGAACCTGCTCT
TGTCCGTTGCCTGCTCCCTGGGCCTCCTTCTTGCTGTGTCACTCACTGTGGCCAACGGTGGCCGCCGCCTTATTGCTGACTGCCACCCAG
GACTGCTGGATCCTCTGGTACCACTGGATGAGGGGCCGGGACATACTGACTGCCCCTTTGACCCCACAAGAATCTATGATACAGCCTTGG
CTCTCTGGATCCCTTCTTTGCTCATGTCTGCAGGGGAGGCTGCTCTATCTGGTTACTGCTGTGTGGCTGCACTCACTCTACGTGGAGTTG
GGCCCTGCAGGAAGGACGGACTTCAGGGGCAGGCATCATCTCTTTCAACGTATAAAATAGCAGAACAAGATTTTTCTTATTTCTTCCCTG
ATGATCCACCCACATTTATCTTCAGTCCTGCTAACAGACGAAGAGGGAGACCTCCCAAACGAATACATATTAGTCAAGAGGACAATGTTG
CTAATAAACAGACTCTTGCAAGTTATAGGAGCAAAGCTACTAAAGAAAGAGATAAACTTTTGAAACAAGAAGAAATGAAGTCACTGGCTT
TTGAAAAGGCTAAATTAAAAAGAGAAAAAGCAGATGCCCTAGAAGCGAAGAAAAAAGAAAAAGAAGATAAAGAGAAAAAGAGGGAAGAAT
TGAAAAAAATTGTTGAAGAAGAGAGACTAAAGAAAAAAGAAGAAAAAGAGAGGCTTAAAGTAGAAAGAGAAAAGGAAAGAGAGAAGTTAC
GTGAAGAAAAGCGAAAGTATGTGGAATACTTAAAACAGTGGAGTAAACCTAGAGAAGATATGGAATGTGATGACCTTAAGGAACTTCCAG
AACCAACACCAGTGAAAACTAGACTACCTCCTGAAATCTTTGGTGATGCTCTGATGGTTTTGGAGTTCCTTAATGCATTTGGGGAACTTT
TTGATCTTCAAGATGAGTTTCCTGATGGAGTAACCCTAGAAGTATTAGAGGAAGCTCTTGTAGGAAATGACAGTGAAGGCCCACTGTGTG
AATTGCTTTTTTTCTTCCTGACTGCAATCTTCCAGGCAATAGCTGAAGAAGAAGAGGAAGTAGCCAAAGAGCAACTAACTGATGCTGACA
CCAAAGATTTAACAGAGGCTTTGGATGAAGATGCAGACCCCACAAAATCTGCACTGTCTGCAGTTGCATCTTTGGCAGCTGCATGGCCAC
AGTTACACCAGGGCTGCAGTTTGAAAAGTTTGGATCTTGATAGCTGCACTCTTTCAGAAATCCTCAGACTGCACATCTTAGCTTCAGGTG
CTGATGTAACATCAGCAAATGCAAAGTATAGATATCAAAAACGAGGAGGATTTGATGCTACAGATGATGCTTGTATGGAGCTTCGTTTGA
GCAATCCCAGTCTAGTGAAGAAACTGTCAAGCACCTCAGTGTATGATTTGACACCAGGAGAAAAAATGAAGATACTCCATGCTCTCTGTG
GAAAGCTACTGACCCTAGTTTCAACTAGGGATTTTATTGAAGATTATGTTGATATATTACGACAGGCAAAGCAGGAGTTCCGGGAATTAA
AAGCAGAACAACATCGAAAAGAGAGGGAAGAAGCAGCTGCCAGAATTCGTAAAAGGAAGGAAGAAAAACTTAAGGAGCAAGAACAAAAAA
TGAAAGAGAAACAAGAAAAACTGAAAGAAGATGAGCAAAGAAATTCAACGGCAGATATATCTATTGGGGAGGAAGAAAGGGAAGATTTTG
ATACTAGCATTGAGAGCAAAGACACAGAGCAAAAGGAATTAGATCAAGATATGGTCACTGAAGATGAAGATGACCCAGGATCACATAAAA
GAGGCAGAAGGGGGAAAAGAGGACAAAATGGATTTAAAGAATTTACAAGGCAAGAACAGATCAACTGTGTAACAAGAGAGCCTCTTACTG
CTGATGAGGAAGAAGCATTAAAACAGGAACACCAACGAAAAGAGAAAGAGCTCTTAGAAAAAATCCAAAGTGCCATAGCCTGTACCAATA
TCTTTCCCTTGGGTCGCGACCGCATGTATAGACGATACTGGATTTTCCCTTCTATTCCTGGACTCTTTATTGAAGAGGATTATTCTGGTC
TTACTGAAGACATGCTGTTGCCTAGACCTTCATCATTTCAGAATAATGTACAGTCTCAAGATCCTCAGGTATCCACTAAAACTGGAGAGC
CTTTGATGTCTGAATCTACCTCCAACATTGACCAAGGTCCACGTGACCATTCTGTGCAGCTGCCAAAACCAGTGCATAAGCCAAATCGGT
GGTGCTTTTACAGTTCTTGTGAACAGCTAGACCAGCTTATTGAAGCTCTTAATTCTAGAGGACATAGAGAAAGTGCCTTAAAAGAAACTT
TGTTACAAGAGAAAAGCAGAATATGTGCACAGCTAGCCCGTTTTTCTGAAGAGAAATTTCATTTTTCAGACAAACCTCAGCCTGATAGCA
AACCAACATATAGTCGGGGAAGATCTTCCAATGCATATGATCCATCTCAGATGTGTGCAGAAAAGCAACTTGAACTAAGGCTGAGAGATT
TTCTTTTAGATATTGAAGATAGAATCTACCAAGGAACATTAGGAGCCATCAAGGTTACAGATCGACATATCTGGAGATCAGCATTAGAAA
GTGGACGGTATGAGCTGTTAAGTGAGGAAAACAAGGAAAATGGGATAATTAAAACTGTGAATGAAGACGTAGAAGAGATGGAAATTGATG
AACAAACAAAGGTCATAGTAAAAGACAGACTTTTGGGGATAAAAACAGAAACTCCAAGTACTGTATCAACAAATGCAAGTACACCACAAT
CAGTGAGCAGTGTGGTTCATTATCTGGCAATGGCACTCTTTCAAATAGAGCAGGGCATTGAGCGGCGTTTTCTGAAAGCTCCACTTGATG
CCAGTGACAGTGGGCGTTCTTATAAAACAGTTCTGGACCGTTGGAGAGAGTCTCTCCTTTCTTCTGCTAGTCTATCCCAAGTTTTTCTTC
ACCTATCCACCTTGGATCGTAGCGTGATATGGTCTAAATCTATACTGAATGCGCGTTGCAAGATATGTCGAAAGAAAGGCGATGCTGAAA
ACATGGTTCTTTGTGATGGCTGTGATAGGGGTCATCATACCTACTGTGTTCGACCAAAGCTCAAGACTGTGCCTGAAGGAGACTGGTTTT
GTCCAGAATGTCGACCAAAGCAACGTTCTAGAAGACTCTCCTCTAGACAGAGACCATCCTTGGAAAGTGATGAAGATGTGGAAGACAGTA
TGGGAGGTGAGGATGATGAAGTTGATGGCGATGAAGAAGAAGGTCAAAGTGAGGAGGAAGAGTATGAGGTAGAACAAGATGAAGATGACT
CTCAAGAAGAGGAAGAAGTCAGCCTACCCAAACGAGGAAGACCACAAGTTAGATTGCCAGTTAAAACAAGAGGGAAACTTAGCTCTTCTT
TCTCAAGTCGTGGCCAACAACAAGAACCTGGAAGATACCCTTCAAGGAGTCAGCAGAGCACACCCAAAACAACTGTTTCTTCTAAAACTG
GTAGAAGCCTAAGAAAGATAAACTCTGCTCCTCCTACAGAAACAAAATCTTTAAGAATTGCCAGTCGTTCTACTCGCCACAGTCATGGCC
CACTGCAAGCAGATGTATTTGTGGAATTGCTTAGTCCTCGTAGAAAACGCAGAGGCAGGAAAAGTGCTAATAATACACCAGAAAATAGTC
CCAACTTCCCTAACTTCAGAGTCATTGCCACAAAGTCAAGTGAACAGTCAAGATCTGTAAATATTGCTTCAAAACTTTCTCTCCAAGAGA
GTGAATCCAAAAGAAGATGCAGAAAAAGACAATCTCCAGAGCCATCGCCTGTGACACTGGGTCGAAGGAGTTCTGGCCGACAGGGAGGAG
TTCATGAATTGTCTGCTTTTGAACAACTTGTTGTAGAATTGGTACGACATGATGACAGCTGGCCTTTTTTGAAACTTGTTTCTAAAATCC
AGGTCCCAGACTACTATGACATCATCAAAAAGCCCATTGCCTTAAATATAATTCGTGAAAAAGTGAATAAGTGTGAATATAAATTAGCAT
CTGAGTTTATTGATGACATTGAGTTAATGTTTTCGAACTGCTTTGAATACAACCCTCGTAACACAAGTGAAGCAAAAGCTGGAACTAGGC
TTCAAGCATTTTTTCATATTCAGGCTCAAAAGCTTGGACTCCACGTCACACCCAGTAATGTGGACCAAGTTAGCACACCACCGGCTGCGA
AAAAGTCACGAATCTGACTTTGTCCTTCTAAAGGATATATTTGAAGAAAAACAAATTGTTCATGAAAATGGAACATTAAATCATGCTGTA
TAAAGCAATAACAATTGATTGACCACATGAAAGTGTGGCCTGCACTATATTCTCAATTTTAATATTAAGCACTCAGGAGAATGTAGGAAA
GATATCCTTTGCTACAGTTTTGTTCAGTATCTAATAAGTTTGATAGATGTATTGGATACAGTACTGGTTTACAGAGGTTTTTGTACATTT
TTGAGATCATTCATGTGTCCAGAGATCTTGGAAAATATTTTTTCACCCACGATTTATTTTGTTATTGATGATTTTTTTTTAAAGTGGTGG
TATTAAGGGAGAGTTATCTACATGGATGAGTCTTCCGCTATAGCACAGTTTAGAAAAGGTGTTTATGTCTTAATTAATTGTTTGAGTACA
TTCTTTCAACACTACACATGAATGAATCCAATCTTATAACCTTGAAGTGCTGTACCAGTGCTGGCTGCAGGTATTAAGTCCAAGTTTATT
AACTAGATATTTATTTAGTATTGAGAGTAATTTGTGAATTTGTTTTGTATTTATAAAATTTATACCTGAAAAATGTTCCTTAATGTTTTA
AACCTTTTACTGTGTTTTTATTCCTCTAACTTCCTTAATGATCAATCAAAAAAAGTAACACCCTCCCTTTTTCCTGACAGTTCTTTCAGC

>43713_43713_9_KRTCAP3-BAZ1A_KRTCAP3_chr2_27666399_ENST00000543753_BAZ1A_chr14_35272194_ENST00000382422_length(amino acids)=1519AA_BP=205
MRRCSLCAFDAARGPRRLMRVGLALILVGHVNLLLGAVLHGTVLRHVANPRGAVTPEYTVANVISVGSGLLSVSVGLVALLASRNLLRPP
LHWVLLALALVNLLLSVACSLGLLLAVSLTVANGGRRLIADCHPGLLDPLVPLDEGPGHTDCPFDPTRIYDTALALWIPSLLMSAGEAAL
SGYCCVAALTLRGVGPCRKDGLQGQASSLSTYKIAEQDFSYFFPDDPPTFIFSPANRRRGRPPKRIHISQEDNVANKQTLASYRSKATKE
RDKLLKQEEMKSLAFEKAKLKREKADALEAKKKEKEDKEKKREELKKIVEEERLKKKEEKERLKVEREKEREKLREEKRKYVEYLKQWSK
PREDMECDDLKELPEPTPVKTRLPPEIFGDALMVLEFLNAFGELFDLQDEFPDGVTLEVLEEALVGNDSEGPLCELLFFFLTAIFQAIAE
EEEEVAKEQLTDADTKDLTEALDEDADPTKSALSAVASLAAAWPQLHQGCSLKSLDLDSCTLSEILRLHILASGADVTSANAKYRYQKRG
GFDATDDACMELRLSNPSLVKKLSSTSVYDLTPGEKMKILHALCGKLLTLVSTRDFIEDYVDILRQAKQEFRELKAEQHRKEREEAAARI
RKRKEEKLKEQEQKMKEKQEKLKEDEQRNSTADISIGEEEREDFDTSIESKDTEQKELDQDMVTEDEDDPGSHKRGRRGKRGQNGFKEFT
RQEQINCVTREPLTADEEEALKQEHQRKEKELLEKIQSAIACTNIFPLGRDRMYRRYWIFPSIPGLFIEEDYSGLTEDMLLPRPSSFQNN
VQSQDPQVSTKTGEPLMSESTSNIDQGPRDHSVQLPKPVHKPNRWCFYSSCEQLDQLIEALNSRGHRESALKETLLQEKSRICAQLARFS
EEKFHFSDKPQPDSKPTYSRGRSSNAYDPSQMCAEKQLELRLRDFLLDIEDRIYQGTLGAIKVTDRHIWRSALESGRYELLSEENKENGI
IKTVNEDVEEMEIDEQTKVIVKDRLLGIKTETPSTVSTNASTPQSVSSVVHYLAMALFQIEQGIERRFLKAPLDASDSGRSYKTVLDRWR
ESLLSSASLSQVFLHLSTLDRSVIWSKSILNARCKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECRPKQRSRRLSSR
QRPSLESDEDVEDSMGGEDDEVDGDEEEGQSEEEEYEVEQDEDDSQEEEEVSLPKRGRPQVRLPVKTRGKLSSSFSSRGQQQEPGRYPSR
SQQSTPKTTVSSKTGRSLRKINSAPPTETKSLRIASRSTRHSHGPLQADVFVELLSPRRKRRGRKSANNTPENSPNFPNFRVIATKSSEQ
SRSVNIASKLSLQESESKRRCRKRQSPEPSPVTLGRRSSGRQGGVHELSAFEQLVVELVRHDDSWPFLKLVSKIQVPDYYDIIKKPIALN

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for KRTCAP3-BAZ1A


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with
TgeneBAZ1Achr2:27666399chr14:35272194ENST00000358716526667_933242.01525.0SMARCA5
TgeneBAZ1Achr2:27666399chr14:35272194ENST00000360310527667_933242.01557.0SMARCA5
TgeneBAZ1Achr2:27666399chr14:35272194ENST00000382422426667_933242.01557.0SMARCA5


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for KRTCAP3-BAZ1A


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for KRTCAP3-BAZ1A


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource