Fusion Gene Studies
in Kim Lab

FusionBase FusionGDB FusionGDB2 FusionPDB FusionNeoAntigen FusionAI FusionNW FGviewer Publication Contact
FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:C3-HP (FusionGDB2 ID:HG718TG3240)

Fusion Gene Summary for C3-HP

check button Fusion gene summary
Fusion gene informationFusion gene name: C3-HP
Fusion gene ID: hg718tg3240
HgeneTgene
Gene symbol

C3

HP

Gene ID

718

3240

Gene namecomplement C3haptoglobin
SynonymsAHUS5|ARMD9|ASP|C3a|C3b|CPAMD1|HEL-S-62pBP|HP2ALPHA2|HPA1S
Cytomap('C3')('HP')

19p13.3

16q22.2

Type of geneprotein-codingprotein-coding
Descriptioncomplement C3C3 and PZP-like alpha-2-macroglobulin domain-containing protein 1C3a anaphylatoxinacylation-stimulating protein cleavage productcomplement component 3complement component C3acomplement component C3bepididymis secretory sperm binding prhaptoglobinbinding peptidehaptoglobin alpha(1S)-betahaptoglobin alpha(2FS)-betahaptoglobin, alpha polypeptidehaptoglobin, beta polypeptidezonulin
Modification date2020032720200313
UniProtAcc.

P00738

Ensembl transtripts involved in fusion geneENST00000245907, ENST00000599668, 
ENST00000245907, ENST00000599668, 
Fusion gene scores* DoF score28 X 24 X 14=940814 X 11 X 3=462
# samples 3313
** MAII scorelog2(33/9408*10)=-4.83335013059055
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(13/462*10)=-1.8293812283876
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: C3 [Title/Abstract] AND HP [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointC3(6694442)-HP(72093016), # samples:1
Anticipated loss of major functional domain due to fusion event.C3-HP seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF.
C3-HP seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF.
C3-HP seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF.
C3-HP seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF.
HP-C3 seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF.
HP-C3 seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF.
C3-HP seems lost the major protein functional domain in Hgene partner, which is a essential gene due to the frame-shifted ORF.
HP-C3 seems lost the major protein functional domain in Tgene partner, which is a essential gene due to the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneC3

GO:0001934

positive regulation of protein phosphorylation

15833747

HgeneC3

GO:0010575

positive regulation of vascular endothelial growth factor production

16452172

HgeneC3

GO:0010828

positive regulation of glucose transmembrane transport

9059512|15833747

HgeneC3

GO:0010866

regulation of triglyceride biosynthetic process

10432298

HgeneC3

GO:0010884

positive regulation of lipid storage

9555951

HgeneC3

GO:0045745

positive regulation of G protein-coupled receptor signaling pathway

15833747

TgeneHP

GO:0010942

positive regulation of cell death

19740759

TgeneHP

GO:0042542

response to hydrogen peroxide

19740759

TgeneHP

GO:0051354

negative regulation of oxidoreductase activity

19740759

TgeneHP

GO:2000296

negative regulation of hydrogen peroxide catabolic process

19740759


check buttonFusion gene breakpoints across C3 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure
check buttonFusion gene breakpoints across HP (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4LIHCTCGA-EP-A2KC-01AC3chr19

6694442

-HPchr16

72093016

+


Top

Fusion Gene ORF analysis for C3-HP

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000245907ENST00000565574C3chr19

6694442

-HPchr16

72093016

+
Frame-shiftENST00000245907ENST00000569639C3chr19

6694442

-HPchr16

72093016

+
In-frameENST00000245907ENST00000355906C3chr19

6694442

-HPchr16

72093016

+
In-frameENST00000245907ENST00000357763C3chr19

6694442

-HPchr16

72093016

+
In-frameENST00000245907ENST00000398131C3chr19

6694442

-HPchr16

72093016

+
In-frameENST00000245907ENST00000562526C3chr19

6694442

-HPchr16

72093016

+
In-frameENST00000245907ENST00000570083C3chr19

6694442

-HPchr16

72093016

+
intron-3CDSENST00000599668ENST00000355906C3chr19

6694442

-HPchr16

72093016

+
intron-3CDSENST00000599668ENST00000357763C3chr19

6694442

-HPchr16

72093016

+
intron-3CDSENST00000599668ENST00000398131C3chr19

6694442

-HPchr16

72093016

+
intron-3CDSENST00000599668ENST00000562526C3chr19

6694442

-HPchr16

72093016

+
intron-3CDSENST00000599668ENST00000569639C3chr19

6694442

-HPchr16

72093016

+
intron-3CDSENST00000599668ENST00000570083C3chr19

6694442

-HPchr16

72093016

+
intron-intronENST00000599668ENST00000565574C3chr19

6694442

-HPchr16

72093016

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000245907C3chr196694442-ENST00000570083HPchr1672093016+426632476941001343
ENST00000245907C3chr196694442-ENST00000355906HPchr1672093016+426532476941001343
ENST00000245907C3chr196694442-ENST00000398131HPchr1672093016+426532476941001343
ENST00000245907C3chr196694442-ENST00000357763HPchr1672093016+422732476941001343
ENST00000245907C3chr196694442-ENST00000562526HPchr1672093016+371532476933351088

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000245907ENST00000570083C3chr196694442-HPchr1672093016+0.0012709720.99872905
ENST00000245907ENST00000355906C3chr196694442-HPchr1672093016+0.0012843620.99871564
ENST00000245907ENST00000398131C3chr196694442-HPchr1672093016+0.0012843620.99871564
ENST00000245907ENST00000357763C3chr196694442-HPchr1672093016+0.0013033940.9986966
ENST00000245907ENST00000562526C3chr196694442-HPchr1672093016+0.0022165740.9977835

Top

Fusion Genomic Features for C3-HP


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
C3chr196694441-HPchr1672093012+1.80E-081
C3chr196694441-HPchr1672093012+1.80E-081

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.
genomic feature of top 1%

Top

Fusion Protein Features for C3-HP


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr19:6694442/chr16:72093016)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
.HP

P00738

FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.FUNCTION: As a result of hemolysis, hemoglobin is found to accumulate in the kidney and is secreted in the urine. Haptoglobin captures, and combines with free plasma hemoglobin to allow hepatic recycling of heme iron and to prevent kidney damage. Haptoglobin also acts as an antioxidant, has antibacterial activity, and plays a role in modulating many aspects of the acute phase response. Hemoglobin/haptoglobin complexes are rapidly cleared by the macrophage CD163 scavenger receptor expressed on the surface of liver Kupfer cells through an endocytic lysosomal degradation pathway. {ECO:0000269|PubMed:21248165}.; FUNCTION: The uncleaved form of allele alpha-2 (2-2), known as zonulin, plays a role in intestinal permeability, allowing intercellular tight junction disassembly, and controlling the equilibrium between tolerance and immunity to non-self antigens. {ECO:0000269|PubMed:21248165}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneC3chr19:6694442chr16:72093016ENST00000245907-2441693_72810511664.0DomainAnaphylatoxin-like
TgeneHPchr19:6694442chr16:72093016ENST0000035590647162_404122407.0DomainPeptidase S1
TgeneHPchr19:6694442chr16:72093016ENST0000039813125162_40463348.0DomainPeptidase S1
TgeneHPchr19:6694442chr16:72093016ENST000003981312590_14763348.0DomainSushi 2
TgeneHPchr19:6694442chr16:72093016ENST0000056557405162_4040348.0DomainPeptidase S1
TgeneHPchr19:6694442chr16:72093016ENST000005655740531_880348.0DomainSushi 1
TgeneHPchr19:6694442chr16:72093016ENST000005655740590_1470348.0DomainSushi 2
TgeneHPchr19:6694442chr16:72093016ENST0000057008325162_40463348.0DomainPeptidase S1
TgeneHPchr19:6694442chr16:72093016ENST000005700832590_14763348.0DomainSushi 2

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneC3chr19:6694442chr16:72093016ENST00000245907-24411518_166110511664.0DomainNTR
TgeneHPchr19:6694442chr16:72093016ENST000003559064731_88122407.0DomainSushi 1
TgeneHPchr19:6694442chr16:72093016ENST000003559064790_147122407.0DomainSushi 2
TgeneHPchr19:6694442chr16:72093016ENST000003981312531_8863348.0DomainSushi 1
TgeneHPchr19:6694442chr16:72093016ENST000005700832531_8863348.0DomainSushi 1


Top

Fusion Gene Sequence for C3-HP


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>11694_11694_1_C3-HP_C3_chr19_6694442_ENST00000245907_HP_chr16_72093016_ENST00000355906_length(transcript)=4265nt_BP=3247nt
AGATAAAAAGCCAGCTCCAGCAGGCGCTGCTCACTCCTCCCCATCCTCTCCCTCTGTCCCTCTGTCCCTCTGACCCTGCACTGTCCCAGC
ACCATGGGACCCACCTCAGGTCCCAGCCTGCTGCTCCTGCTACTAACCCACCTCCCCCTGGCTCTGGGGAGTCCCATGTACTCTATCATC
ACCCCCAACATCTTGCGGCTGGAGAGCGAGGAGACCATGGTGCTGGAGGCCCACGACGCGCAAGGGGATGTTCCAGTCACTGTTACTGTC
CACGACTTCCCAGGCAAAAAACTAGTGCTGTCCAGTGAGAAGACTGTGCTGACCCCTGCCACCAACCACATGGGCAACGTCACCTTCACG
ATCCCAGCCAACAGGGAGTTCAAGTCAGAAAAGGGGCGCAACAAGTTCGTGACCGTGCAGGCCACCTTCGGGACCCAAGTGGTGGAGAAG
GTGGTGCTGGTCAGCCTGCAGAGCGGGTACCTCTTCATCCAGACAGACAAGACCATCTACACCCCTGGCTCCACAGTTCTCTATCGGATC
TTCACCGTCAACCACAAGCTGCTACCCGTGGGCCGGACGGTCATGGTCAACATTGAGAACCCGGAAGGCATCCCGGTCAAGCAGGACTCC
TTGTCTTCTCAGAACCAGCTTGGCGTCTTGCCCTTGTCTTGGGACATTCCGGAACTCGTCAACATGGGCCAGTGGAAGATCCGAGCCTAC
TATGAAAACTCACCACAGCAGGTCTTCTCCACTGAGTTTGAGGTGAAGGAGTACGTGCTGCCCAGTTTCGAGGTCATAGTGGAGCCTACA
GAGAAATTCTACTACATCTATAACGAGAAGGGCCTGGAGGTCACCATCACCGCCAGGTTCCTCTACGGGAAGAAAGTGGAGGGAACTGCC
TTTGTCATCTTCGGGATCCAGGATGGCGAACAGAGGATTTCCCTGCCTGAATCCCTCAAGCGCATTCCGATTGAGGATGGCTCGGGGGAG
GTTGTGCTGAGCCGGAAGGTACTGCTGGACGGGGTGCAGAACCCCCGAGCAGAAGACCTGGTGGGGAAGTCTTTGTACGTGTCTGCCACC
GTCATCTTGCACTCAGGCAGTGACATGGTGCAGGCAGAGCGCAGCGGGATCCCCATCGTGACCTCTCCCTACCAGATCCACTTCACCAAG
ACACCCAAGTACTTCAAACCAGGAATGCCCTTTGACCTCATGGTGTTCGTGACGAACCCTGATGGCTCTCCAGCCTACCGAGTCCCCGTG
GCAGTCCAGGGCGAGGACACTGTGCAGTCTCTAACCCAGGGAGATGGCGTGGCCAAACTCAGCATCAACACACACCCCAGCCAGAAGCCC
TTGAGCATCACGGTGCGCACGAAGAAGCAGGAGCTCTCGGAGGCAGAGCAGGCTACCAGGACCATGCAGGCTCTGCCCTACAGCACCGTG
GGCAACTCCAACAATTACCTGCATCTCTCAGTGCTACGTACAGAGCTCAGACCCGGGGAGACCCTCAACGTCAACTTCCTCCTGCGAATG
GACCGCGCCCACGAGGCCAAGATCCGCTACTACACCTACCTGATCATGAACAAGGGCAGGCTGTTGAAGGCGGGACGCCAGGTGCGAGAG
CCCGGCCAGGACCTGGTGGTGCTGCCCCTGTCCATCACCACCGACTTCATCCCTTCCTTCCGCCTGGTGGCGTACTACACGCTGATCGGT
GCCAGCGGCCAGAGGGAGGTGGTGGCCGACTCCGTGTGGGTGGACGTCAAGGACTCCTGCGTGGGCTCGCTGGTGGTAAAAAGCGGCCAG
TCAGAAGACCGGCAGCCTGTACCTGGGCAGCAGATGACCCTGAAGATAGAGGGTGACCACGGGGCCCGGGTGGTACTGGTGGCCGTGGAC
AAGGGCGTGTTCGTGCTGAATAAGAAGAACAAACTGACGCAGAGTAAGATCTGGGACGTGGTGGAGAAGGCAGACATCGGCTGCACCCCG
GGCAGTGGGAAGGATTACGCCGGTGTCTTCTCCGACGCAGGGCTGACCTTCACGAGCAGCAGTGGCCAGCAGACCGCCCAGAGGGCAGAA
CTTCAGTGCCCGCAGCCAGCCGCCCGCCGACGCCGTTCCGTGCAGCTCACGGAGAAGCGAATGGACAAAGTCGGCAAGTACCCCAAGGAG
CTGCGCAAGTGCTGCGAGGACGGCATGCGGGAGAACCCCATGAGGTTCTCGTGCCAGCGCCGGACCCGTTTCATCTCCCTGGGCGAGGCG
TGCAAGAAGGTCTTCCTGGACTGCTGCAACTACATCACAGAGCTGCGGCGGCAGCACGCGCGGGCCAGCCACCTGGGCCTGGCCAGGAGT
AACCTGGATGAGGACATCATTGCAGAAGAGAACATCGTTTCCCGAAGTGAGTTCCCAGAGAGCTGGCTGTGGAACGTTGAGGACTTGAAA
GAGCCACCGAAAAATGGAATCTCTACGAAGCTCATGAATATATTTTTGAAAGACTCCATCACCACGTGGGAGATTCTGGCTGTGAGCATG
TCGGACAAGAAAGGGATCTGTGTGGCAGACCCCTTCGAGGTCACAGTAATGCAGGACTTCTTCATCGACCTGCGGCTACCCTACTCTGTT
GTTCGAAACGAGCAGGTGGAAATCCGAGCCGTTCTCTACAATTACCGGCAGAACCAAGAGCTCAAGGTGAGGGTGGAACTACTCCACAAT
CCAGCCTTCTGCAGCCTGGCCACCACCAAGAGGCGTCACCAGCAGACCGTAACCATCCCCCCCAAGTCCTCGTTGTCCGTTCCATATGTC
ATCGTGCCGCTAAAGACCGGCCTGCAGGAAGTGGAAGTCAAGGCTGCTGTCTACCATCATTTCATCAGTGACGGTGTCAGGAAGTCCCTG
AAGGTCGTGCCGGAAGGAATCAGAATGAACAAAACTGTGGCTGTTCGCACCCTGGATCCAGAACGCCTGGGCCGTGAAGGAGTGCAGAAA
GAGGACATCCCACCTGCAGACCTCAGTGACCAAGTCCCGGACACCGAGTCTGAGACCAGAATTCTCCTGCAAGGGACCCCAGTGGCCCAG
ATGACAGAGGATGCCGTCGACGCGGAACGGCTGAAGCACCTCATTGTGACCCCCTCGGGCTGCGGGGAACAGAACATGATCGGCATGACG
CCCACGGTCATCGCTGTGCATTACCTGGATGAAACGGAGCAGTGGGAGAAGTTCGGCCTAGAGAAGCGGCAGGGGGCCTTGGAGCTCATC
AAGAAGGGAGTGTACACCTTAAACAATGAGAAGCAGTGGATAAATAAGGCTGTTGGAGATAAACTTCCTGAATGTGAAGCAGTATGTGGG
AAGCCCAAGAATCCGGCAAACCCAGTGCAGCGGATCCTGGGTGGACACCTGGATGCCAAAGGCAGCTTTCCCTGGCAGGCTAAGATGGTT
TCCCACCATAATCTCACCACAGGTGCCACGCTGATCAATGAACAATGGCTGCTGACCACGGCTAAAAATCTCTTCCTGAACCATTCAGAA
AATGCAACAGCGAAAGACATTGCCCCTACTTTAACACTCTATGTGGGGAAAAAGCAGCTTGTAGAGATTGAGAAGGTTGTTCTACACCCT
AACTACTCCCAGGTAGATATTGGGCTCATCAAACTCAAACAGAAGGTGTCTGTTAATGAGAGAGTGATGCCCATCTGCCTACCTTCAAAG
GATTATGCAGAAGTAGGGCGTGTGGGTTATGTTTCTGGCTGGGGGCGAAATGCCAATTTTAAATTTACTGACCATCTGAAGTATGTCATG
CTGCCTGTGGCTGACCAAGACCAATGCATAAGGCATTATGAAGGCAGCACAGTCCCCGAAAAGAAGACACCGAAGAGCCCTGTAGGGGTG
CAGCCCATACTGAATGAACACACCTTCTGTGCTGGCATGTCTAAGTACCAAGAAGACACCTGCTATGGCGATGCGGGCAGTGCCTTTGCC
GTTCACGACCTGGAGGAGGACACCTGGTATGCGACTGGGATCTTAAGCTTTGATAAGAGCTGTGCTGTGGCTGAGTATGGTGTGTATGTG
AAGGTGACTTCCATCCAGGACTGGGTTCAGAAGACCATAGCTGAGAACTAATGCAAGGCTGGCCGGAAGCCCTTGCCTGAAAGCAAGATT
TCAGCCTGGAAGAGGGCAAAGTGGACGGGAGTGGACAGGAGTGGATGCGATAAGATGTGGTTTGAAGCTGATGGGTGCCAGCCCTGCATT

>11694_11694_1_C3-HP_C3_chr19_6694442_ENST00000245907_HP_chr16_72093016_ENST00000355906_length(amino acids)=1343AA_BP=1059
MTLHCPSTMGPTSGPSLLLLLLTHLPLALGSPMYSIITPNILRLESEETMVLEAHDAQGDVPVTVTVHDFPGKKLVLSSEKTVLTPATNH
MGNVTFTIPANREFKSEKGRNKFVTVQATFGTQVVEKVVLVSLQSGYLFIQTDKTIYTPGSTVLYRIFTVNHKLLPVGRTVMVNIENPEG
IPVKQDSLSSQNQLGVLPLSWDIPELVNMGQWKIRAYYENSPQQVFSTEFEVKEYVLPSFEVIVEPTEKFYYIYNEKGLEVTITARFLYG
KKVEGTAFVIFGIQDGEQRISLPESLKRIPIEDGSGEVVLSRKVLLDGVQNPRAEDLVGKSLYVSATVILHSGSDMVQAERSGIPIVTSP
YQIHFTKTPKYFKPGMPFDLMVFVTNPDGSPAYRVPVAVQGEDTVQSLTQGDGVAKLSINTHPSQKPLSITVRTKKQELSEAEQATRTMQ
ALPYSTVGNSNNYLHLSVLRTELRPGETLNVNFLLRMDRAHEAKIRYYTYLIMNKGRLLKAGRQVREPGQDLVVLPLSITTDFIPSFRLV
AYYTLIGASGQREVVADSVWVDVKDSCVGSLVVKSGQSEDRQPVPGQQMTLKIEGDHGARVVLVAVDKGVFVLNKKNKLTQSKIWDVVEK
ADIGCTPGSGKDYAGVFSDAGLTFTSSSGQQTAQRAELQCPQPAARRRRSVQLTEKRMDKVGKYPKELRKCCEDGMRENPMRFSCQRRTR
FISLGEACKKVFLDCCNYITELRRQHARASHLGLARSNLDEDIIAEENIVSRSEFPESWLWNVEDLKEPPKNGISTKLMNIFLKDSITTW
EILAVSMSDKKGICVADPFEVTVMQDFFIDLRLPYSVVRNEQVEIRAVLYNYRQNQELKVRVELLHNPAFCSLATTKRRHQQTVTIPPKS
SLSVPYVIVPLKTGLQEVEVKAAVYHHFISDGVRKSLKVVPEGIRMNKTVAVRTLDPERLGREGVQKEDIPPADLSDQVPDTESETRILL
QGTPVAQMTEDAVDAERLKHLIVTPSGCGEQNMIGMTPTVIAVHYLDETEQWEKFGLEKRQGALELIKKGVYTLNNEKQWINKAVGDKLP
ECEAVCGKPKNPANPVQRILGGHLDAKGSFPWQAKMVSHHNLTTGATLINEQWLLTTAKNLFLNHSENATAKDIAPTLTLYVGKKQLVEI
EKVVLHPNYSQVDIGLIKLKQKVSVNERVMPICLPSKDYAEVGRVGYVSGWGRNANFKFTDHLKYVMLPVADQDQCIRHYEGSTVPEKKT

--------------------------------------------------------------
>11694_11694_2_C3-HP_C3_chr19_6694442_ENST00000245907_HP_chr16_72093016_ENST00000357763_length(transcript)=4227nt_BP=3247nt
AGATAAAAAGCCAGCTCCAGCAGGCGCTGCTCACTCCTCCCCATCCTCTCCCTCTGTCCCTCTGTCCCTCTGACCCTGCACTGTCCCAGC
ACCATGGGACCCACCTCAGGTCCCAGCCTGCTGCTCCTGCTACTAACCCACCTCCCCCTGGCTCTGGGGAGTCCCATGTACTCTATCATC
ACCCCCAACATCTTGCGGCTGGAGAGCGAGGAGACCATGGTGCTGGAGGCCCACGACGCGCAAGGGGATGTTCCAGTCACTGTTACTGTC
CACGACTTCCCAGGCAAAAAACTAGTGCTGTCCAGTGAGAAGACTGTGCTGACCCCTGCCACCAACCACATGGGCAACGTCACCTTCACG
ATCCCAGCCAACAGGGAGTTCAAGTCAGAAAAGGGGCGCAACAAGTTCGTGACCGTGCAGGCCACCTTCGGGACCCAAGTGGTGGAGAAG
GTGGTGCTGGTCAGCCTGCAGAGCGGGTACCTCTTCATCCAGACAGACAAGACCATCTACACCCCTGGCTCCACAGTTCTCTATCGGATC
TTCACCGTCAACCACAAGCTGCTACCCGTGGGCCGGACGGTCATGGTCAACATTGAGAACCCGGAAGGCATCCCGGTCAAGCAGGACTCC
TTGTCTTCTCAGAACCAGCTTGGCGTCTTGCCCTTGTCTTGGGACATTCCGGAACTCGTCAACATGGGCCAGTGGAAGATCCGAGCCTAC
TATGAAAACTCACCACAGCAGGTCTTCTCCACTGAGTTTGAGGTGAAGGAGTACGTGCTGCCCAGTTTCGAGGTCATAGTGGAGCCTACA
GAGAAATTCTACTACATCTATAACGAGAAGGGCCTGGAGGTCACCATCACCGCCAGGTTCCTCTACGGGAAGAAAGTGGAGGGAACTGCC
TTTGTCATCTTCGGGATCCAGGATGGCGAACAGAGGATTTCCCTGCCTGAATCCCTCAAGCGCATTCCGATTGAGGATGGCTCGGGGGAG
GTTGTGCTGAGCCGGAAGGTACTGCTGGACGGGGTGCAGAACCCCCGAGCAGAAGACCTGGTGGGGAAGTCTTTGTACGTGTCTGCCACC
GTCATCTTGCACTCAGGCAGTGACATGGTGCAGGCAGAGCGCAGCGGGATCCCCATCGTGACCTCTCCCTACCAGATCCACTTCACCAAG
ACACCCAAGTACTTCAAACCAGGAATGCCCTTTGACCTCATGGTGTTCGTGACGAACCCTGATGGCTCTCCAGCCTACCGAGTCCCCGTG
GCAGTCCAGGGCGAGGACACTGTGCAGTCTCTAACCCAGGGAGATGGCGTGGCCAAACTCAGCATCAACACACACCCCAGCCAGAAGCCC
TTGAGCATCACGGTGCGCACGAAGAAGCAGGAGCTCTCGGAGGCAGAGCAGGCTACCAGGACCATGCAGGCTCTGCCCTACAGCACCGTG
GGCAACTCCAACAATTACCTGCATCTCTCAGTGCTACGTACAGAGCTCAGACCCGGGGAGACCCTCAACGTCAACTTCCTCCTGCGAATG
GACCGCGCCCACGAGGCCAAGATCCGCTACTACACCTACCTGATCATGAACAAGGGCAGGCTGTTGAAGGCGGGACGCCAGGTGCGAGAG
CCCGGCCAGGACCTGGTGGTGCTGCCCCTGTCCATCACCACCGACTTCATCCCTTCCTTCCGCCTGGTGGCGTACTACACGCTGATCGGT
GCCAGCGGCCAGAGGGAGGTGGTGGCCGACTCCGTGTGGGTGGACGTCAAGGACTCCTGCGTGGGCTCGCTGGTGGTAAAAAGCGGCCAG
TCAGAAGACCGGCAGCCTGTACCTGGGCAGCAGATGACCCTGAAGATAGAGGGTGACCACGGGGCCCGGGTGGTACTGGTGGCCGTGGAC
AAGGGCGTGTTCGTGCTGAATAAGAAGAACAAACTGACGCAGAGTAAGATCTGGGACGTGGTGGAGAAGGCAGACATCGGCTGCACCCCG
GGCAGTGGGAAGGATTACGCCGGTGTCTTCTCCGACGCAGGGCTGACCTTCACGAGCAGCAGTGGCCAGCAGACCGCCCAGAGGGCAGAA
CTTCAGTGCCCGCAGCCAGCCGCCCGCCGACGCCGTTCCGTGCAGCTCACGGAGAAGCGAATGGACAAAGTCGGCAAGTACCCCAAGGAG
CTGCGCAAGTGCTGCGAGGACGGCATGCGGGAGAACCCCATGAGGTTCTCGTGCCAGCGCCGGACCCGTTTCATCTCCCTGGGCGAGGCG
TGCAAGAAGGTCTTCCTGGACTGCTGCAACTACATCACAGAGCTGCGGCGGCAGCACGCGCGGGCCAGCCACCTGGGCCTGGCCAGGAGT
AACCTGGATGAGGACATCATTGCAGAAGAGAACATCGTTTCCCGAAGTGAGTTCCCAGAGAGCTGGCTGTGGAACGTTGAGGACTTGAAA
GAGCCACCGAAAAATGGAATCTCTACGAAGCTCATGAATATATTTTTGAAAGACTCCATCACCACGTGGGAGATTCTGGCTGTGAGCATG
TCGGACAAGAAAGGGATCTGTGTGGCAGACCCCTTCGAGGTCACAGTAATGCAGGACTTCTTCATCGACCTGCGGCTACCCTACTCTGTT
GTTCGAAACGAGCAGGTGGAAATCCGAGCCGTTCTCTACAATTACCGGCAGAACCAAGAGCTCAAGGTGAGGGTGGAACTACTCCACAAT
CCAGCCTTCTGCAGCCTGGCCACCACCAAGAGGCGTCACCAGCAGACCGTAACCATCCCCCCCAAGTCCTCGTTGTCCGTTCCATATGTC
ATCGTGCCGCTAAAGACCGGCCTGCAGGAAGTGGAAGTCAAGGCTGCTGTCTACCATCATTTCATCAGTGACGGTGTCAGGAAGTCCCTG
AAGGTCGTGCCGGAAGGAATCAGAATGAACAAAACTGTGGCTGTTCGCACCCTGGATCCAGAACGCCTGGGCCGTGAAGGAGTGCAGAAA
GAGGACATCCCACCTGCAGACCTCAGTGACCAAGTCCCGGACACCGAGTCTGAGACCAGAATTCTCCTGCAAGGGACCCCAGTGGCCCAG
ATGACAGAGGATGCCGTCGACGCGGAACGGCTGAAGCACCTCATTGTGACCCCCTCGGGCTGCGGGGAACAGAACATGATCGGCATGACG
CCCACGGTCATCGCTGTGCATTACCTGGATGAAACGGAGCAGTGGGAGAAGTTCGGCCTAGAGAAGCGGCAGGGGGCCTTGGAGCTCATC
AAGAAGGGAGTGTACACCTTAAACAATGAGAAGCAGTGGATAAATAAGGCTGTTGGAGATAAACTTCCTGAATGTGAAGCAGTATGTGGG
AAGCCCAAGAATCCGGCAAACCCAGTGCAGCGGATCCTGGGTGGACACCTGGATGCCAAAGGCAGCTTTCCCTGGCAGGCTAAGATGGTT
TCCCACCATAATCTCACCACAGGTGCCACGCTGATCAATGAACAATGGCTGCTGACCACGGCTAAAAATCTCTTCCTGAACCATTCAGAA
AATGCAACAGCGAAAGACATTGCCCCTACTTTAACACTCTATGTGGGGAAAAAGCAGCTTGTAGAGATTGAGAAGGTTGTTCTACACCCT
AACTACTCCCAGGTAGATATTGGGCTCATCAAACTCAAACAGAAGGTGTCTGTTAATGAGAGAGTGATGCCCATCTGCCTACCTTCAAAG
GATTATGCAGAAGTAGGGCGTGTGGGTTATGTTTCTGGCTGGGGGCGAAATGCCAATTTTAAATTTACTGACCATCTGAAGTATGTCATG
CTGCCTGTGGCTGACCAAGACCAATGCATAAGGCATTATGAAGGCAGCACAGTCCCCGAAAAGAAGACACCGAAGAGCCCTGTAGGGGTG
CAGCCCATACTGAATGAACACACCTTCTGTGCTGGCATGTCTAAGTACCAAGAAGACACCTGCTATGGCGATGCGGGCAGTGCCTTTGCC
GTTCACGACCTGGAGGAGGACACCTGGTATGCGACTGGGATCTTAAGCTTTGATAAGAGCTGTGCTGTGGCTGAGTATGGTGTGTATGTG
AAGGTGACTTCCATCCAGGACTGGGTTCAGAAGACCATAGCTGAGAACTAATGCAAGGCTGGCCGGAAGCCCTTGCCTGAAAGCAAGATT

>11694_11694_2_C3-HP_C3_chr19_6694442_ENST00000245907_HP_chr16_72093016_ENST00000357763_length(amino acids)=1343AA_BP=1059
MTLHCPSTMGPTSGPSLLLLLLTHLPLALGSPMYSIITPNILRLESEETMVLEAHDAQGDVPVTVTVHDFPGKKLVLSSEKTVLTPATNH
MGNVTFTIPANREFKSEKGRNKFVTVQATFGTQVVEKVVLVSLQSGYLFIQTDKTIYTPGSTVLYRIFTVNHKLLPVGRTVMVNIENPEG
IPVKQDSLSSQNQLGVLPLSWDIPELVNMGQWKIRAYYENSPQQVFSTEFEVKEYVLPSFEVIVEPTEKFYYIYNEKGLEVTITARFLYG
KKVEGTAFVIFGIQDGEQRISLPESLKRIPIEDGSGEVVLSRKVLLDGVQNPRAEDLVGKSLYVSATVILHSGSDMVQAERSGIPIVTSP
YQIHFTKTPKYFKPGMPFDLMVFVTNPDGSPAYRVPVAVQGEDTVQSLTQGDGVAKLSINTHPSQKPLSITVRTKKQELSEAEQATRTMQ
ALPYSTVGNSNNYLHLSVLRTELRPGETLNVNFLLRMDRAHEAKIRYYTYLIMNKGRLLKAGRQVREPGQDLVVLPLSITTDFIPSFRLV
AYYTLIGASGQREVVADSVWVDVKDSCVGSLVVKSGQSEDRQPVPGQQMTLKIEGDHGARVVLVAVDKGVFVLNKKNKLTQSKIWDVVEK
ADIGCTPGSGKDYAGVFSDAGLTFTSSSGQQTAQRAELQCPQPAARRRRSVQLTEKRMDKVGKYPKELRKCCEDGMRENPMRFSCQRRTR
FISLGEACKKVFLDCCNYITELRRQHARASHLGLARSNLDEDIIAEENIVSRSEFPESWLWNVEDLKEPPKNGISTKLMNIFLKDSITTW
EILAVSMSDKKGICVADPFEVTVMQDFFIDLRLPYSVVRNEQVEIRAVLYNYRQNQELKVRVELLHNPAFCSLATTKRRHQQTVTIPPKS
SLSVPYVIVPLKTGLQEVEVKAAVYHHFISDGVRKSLKVVPEGIRMNKTVAVRTLDPERLGREGVQKEDIPPADLSDQVPDTESETRILL
QGTPVAQMTEDAVDAERLKHLIVTPSGCGEQNMIGMTPTVIAVHYLDETEQWEKFGLEKRQGALELIKKGVYTLNNEKQWINKAVGDKLP
ECEAVCGKPKNPANPVQRILGGHLDAKGSFPWQAKMVSHHNLTTGATLINEQWLLTTAKNLFLNHSENATAKDIAPTLTLYVGKKQLVEI
EKVVLHPNYSQVDIGLIKLKQKVSVNERVMPICLPSKDYAEVGRVGYVSGWGRNANFKFTDHLKYVMLPVADQDQCIRHYEGSTVPEKKT

--------------------------------------------------------------
>11694_11694_3_C3-HP_C3_chr19_6694442_ENST00000245907_HP_chr16_72093016_ENST00000398131_length(transcript)=4265nt_BP=3247nt
AGATAAAAAGCCAGCTCCAGCAGGCGCTGCTCACTCCTCCCCATCCTCTCCCTCTGTCCCTCTGTCCCTCTGACCCTGCACTGTCCCAGC
ACCATGGGACCCACCTCAGGTCCCAGCCTGCTGCTCCTGCTACTAACCCACCTCCCCCTGGCTCTGGGGAGTCCCATGTACTCTATCATC
ACCCCCAACATCTTGCGGCTGGAGAGCGAGGAGACCATGGTGCTGGAGGCCCACGACGCGCAAGGGGATGTTCCAGTCACTGTTACTGTC
CACGACTTCCCAGGCAAAAAACTAGTGCTGTCCAGTGAGAAGACTGTGCTGACCCCTGCCACCAACCACATGGGCAACGTCACCTTCACG
ATCCCAGCCAACAGGGAGTTCAAGTCAGAAAAGGGGCGCAACAAGTTCGTGACCGTGCAGGCCACCTTCGGGACCCAAGTGGTGGAGAAG
GTGGTGCTGGTCAGCCTGCAGAGCGGGTACCTCTTCATCCAGACAGACAAGACCATCTACACCCCTGGCTCCACAGTTCTCTATCGGATC
TTCACCGTCAACCACAAGCTGCTACCCGTGGGCCGGACGGTCATGGTCAACATTGAGAACCCGGAAGGCATCCCGGTCAAGCAGGACTCC
TTGTCTTCTCAGAACCAGCTTGGCGTCTTGCCCTTGTCTTGGGACATTCCGGAACTCGTCAACATGGGCCAGTGGAAGATCCGAGCCTAC
TATGAAAACTCACCACAGCAGGTCTTCTCCACTGAGTTTGAGGTGAAGGAGTACGTGCTGCCCAGTTTCGAGGTCATAGTGGAGCCTACA
GAGAAATTCTACTACATCTATAACGAGAAGGGCCTGGAGGTCACCATCACCGCCAGGTTCCTCTACGGGAAGAAAGTGGAGGGAACTGCC
TTTGTCATCTTCGGGATCCAGGATGGCGAACAGAGGATTTCCCTGCCTGAATCCCTCAAGCGCATTCCGATTGAGGATGGCTCGGGGGAG
GTTGTGCTGAGCCGGAAGGTACTGCTGGACGGGGTGCAGAACCCCCGAGCAGAAGACCTGGTGGGGAAGTCTTTGTACGTGTCTGCCACC
GTCATCTTGCACTCAGGCAGTGACATGGTGCAGGCAGAGCGCAGCGGGATCCCCATCGTGACCTCTCCCTACCAGATCCACTTCACCAAG
ACACCCAAGTACTTCAAACCAGGAATGCCCTTTGACCTCATGGTGTTCGTGACGAACCCTGATGGCTCTCCAGCCTACCGAGTCCCCGTG
GCAGTCCAGGGCGAGGACACTGTGCAGTCTCTAACCCAGGGAGATGGCGTGGCCAAACTCAGCATCAACACACACCCCAGCCAGAAGCCC
TTGAGCATCACGGTGCGCACGAAGAAGCAGGAGCTCTCGGAGGCAGAGCAGGCTACCAGGACCATGCAGGCTCTGCCCTACAGCACCGTG
GGCAACTCCAACAATTACCTGCATCTCTCAGTGCTACGTACAGAGCTCAGACCCGGGGAGACCCTCAACGTCAACTTCCTCCTGCGAATG
GACCGCGCCCACGAGGCCAAGATCCGCTACTACACCTACCTGATCATGAACAAGGGCAGGCTGTTGAAGGCGGGACGCCAGGTGCGAGAG
CCCGGCCAGGACCTGGTGGTGCTGCCCCTGTCCATCACCACCGACTTCATCCCTTCCTTCCGCCTGGTGGCGTACTACACGCTGATCGGT
GCCAGCGGCCAGAGGGAGGTGGTGGCCGACTCCGTGTGGGTGGACGTCAAGGACTCCTGCGTGGGCTCGCTGGTGGTAAAAAGCGGCCAG
TCAGAAGACCGGCAGCCTGTACCTGGGCAGCAGATGACCCTGAAGATAGAGGGTGACCACGGGGCCCGGGTGGTACTGGTGGCCGTGGAC
AAGGGCGTGTTCGTGCTGAATAAGAAGAACAAACTGACGCAGAGTAAGATCTGGGACGTGGTGGAGAAGGCAGACATCGGCTGCACCCCG
GGCAGTGGGAAGGATTACGCCGGTGTCTTCTCCGACGCAGGGCTGACCTTCACGAGCAGCAGTGGCCAGCAGACCGCCCAGAGGGCAGAA
CTTCAGTGCCCGCAGCCAGCCGCCCGCCGACGCCGTTCCGTGCAGCTCACGGAGAAGCGAATGGACAAAGTCGGCAAGTACCCCAAGGAG
CTGCGCAAGTGCTGCGAGGACGGCATGCGGGAGAACCCCATGAGGTTCTCGTGCCAGCGCCGGACCCGTTTCATCTCCCTGGGCGAGGCG
TGCAAGAAGGTCTTCCTGGACTGCTGCAACTACATCACAGAGCTGCGGCGGCAGCACGCGCGGGCCAGCCACCTGGGCCTGGCCAGGAGT
AACCTGGATGAGGACATCATTGCAGAAGAGAACATCGTTTCCCGAAGTGAGTTCCCAGAGAGCTGGCTGTGGAACGTTGAGGACTTGAAA
GAGCCACCGAAAAATGGAATCTCTACGAAGCTCATGAATATATTTTTGAAAGACTCCATCACCACGTGGGAGATTCTGGCTGTGAGCATG
TCGGACAAGAAAGGGATCTGTGTGGCAGACCCCTTCGAGGTCACAGTAATGCAGGACTTCTTCATCGACCTGCGGCTACCCTACTCTGTT
GTTCGAAACGAGCAGGTGGAAATCCGAGCCGTTCTCTACAATTACCGGCAGAACCAAGAGCTCAAGGTGAGGGTGGAACTACTCCACAAT
CCAGCCTTCTGCAGCCTGGCCACCACCAAGAGGCGTCACCAGCAGACCGTAACCATCCCCCCCAAGTCCTCGTTGTCCGTTCCATATGTC
ATCGTGCCGCTAAAGACCGGCCTGCAGGAAGTGGAAGTCAAGGCTGCTGTCTACCATCATTTCATCAGTGACGGTGTCAGGAAGTCCCTG
AAGGTCGTGCCGGAAGGAATCAGAATGAACAAAACTGTGGCTGTTCGCACCCTGGATCCAGAACGCCTGGGCCGTGAAGGAGTGCAGAAA
GAGGACATCCCACCTGCAGACCTCAGTGACCAAGTCCCGGACACCGAGTCTGAGACCAGAATTCTCCTGCAAGGGACCCCAGTGGCCCAG
ATGACAGAGGATGCCGTCGACGCGGAACGGCTGAAGCACCTCATTGTGACCCCCTCGGGCTGCGGGGAACAGAACATGATCGGCATGACG
CCCACGGTCATCGCTGTGCATTACCTGGATGAAACGGAGCAGTGGGAGAAGTTCGGCCTAGAGAAGCGGCAGGGGGCCTTGGAGCTCATC
AAGAAGGGAGTGTACACCTTAAACAATGAGAAGCAGTGGATAAATAAGGCTGTTGGAGATAAACTTCCTGAATGTGAAGCAGTATGTGGG
AAGCCCAAGAATCCGGCAAACCCAGTGCAGCGGATCCTGGGTGGACACCTGGATGCCAAAGGCAGCTTTCCCTGGCAGGCTAAGATGGTT
TCCCACCATAATCTCACCACAGGTGCCACGCTGATCAATGAACAATGGCTGCTGACCACGGCTAAAAATCTCTTCCTGAACCATTCAGAA
AATGCAACAGCGAAAGACATTGCCCCTACTTTAACACTCTATGTGGGGAAAAAGCAGCTTGTAGAGATTGAGAAGGTTGTTCTACACCCT
AACTACTCCCAGGTAGATATTGGGCTCATCAAACTCAAACAGAAGGTGTCTGTTAATGAGAGAGTGATGCCCATCTGCCTACCTTCAAAG
GATTATGCAGAAGTAGGGCGTGTGGGTTATGTTTCTGGCTGGGGGCGAAATGCCAATTTTAAATTTACTGACCATCTGAAGTATGTCATG
CTGCCTGTGGCTGACCAAGACCAATGCATAAGGCATTATGAAGGCAGCACAGTCCCCGAAAAGAAGACACCGAAGAGCCCTGTAGGGGTG
CAGCCCATACTGAATGAACACACCTTCTGTGCTGGCATGTCTAAGTACCAAGAAGACACCTGCTATGGCGATGCGGGCAGTGCCTTTGCC
GTTCACGACCTGGAGGAGGACACCTGGTATGCGACTGGGATCTTAAGCTTTGATAAGAGCTGTGCTGTGGCTGAGTATGGTGTGTATGTG
AAGGTGACTTCCATCCAGGACTGGGTTCAGAAGACCATAGCTGAGAACTAATGCAAGGCTGGCCGGAAGCCCTTGCCTGAAAGCAAGATT
TCAGCCTGGAAGAGGGCAAAGTGGACGGGAGTGGACAGGAGTGGATGCGATAAGATGTGGTTTGAAGCTGATGGGTGCCAGCCCTGCATT

>11694_11694_3_C3-HP_C3_chr19_6694442_ENST00000245907_HP_chr16_72093016_ENST00000398131_length(amino acids)=1343AA_BP=1059
MTLHCPSTMGPTSGPSLLLLLLTHLPLALGSPMYSIITPNILRLESEETMVLEAHDAQGDVPVTVTVHDFPGKKLVLSSEKTVLTPATNH
MGNVTFTIPANREFKSEKGRNKFVTVQATFGTQVVEKVVLVSLQSGYLFIQTDKTIYTPGSTVLYRIFTVNHKLLPVGRTVMVNIENPEG
IPVKQDSLSSQNQLGVLPLSWDIPELVNMGQWKIRAYYENSPQQVFSTEFEVKEYVLPSFEVIVEPTEKFYYIYNEKGLEVTITARFLYG
KKVEGTAFVIFGIQDGEQRISLPESLKRIPIEDGSGEVVLSRKVLLDGVQNPRAEDLVGKSLYVSATVILHSGSDMVQAERSGIPIVTSP
YQIHFTKTPKYFKPGMPFDLMVFVTNPDGSPAYRVPVAVQGEDTVQSLTQGDGVAKLSINTHPSQKPLSITVRTKKQELSEAEQATRTMQ
ALPYSTVGNSNNYLHLSVLRTELRPGETLNVNFLLRMDRAHEAKIRYYTYLIMNKGRLLKAGRQVREPGQDLVVLPLSITTDFIPSFRLV
AYYTLIGASGQREVVADSVWVDVKDSCVGSLVVKSGQSEDRQPVPGQQMTLKIEGDHGARVVLVAVDKGVFVLNKKNKLTQSKIWDVVEK
ADIGCTPGSGKDYAGVFSDAGLTFTSSSGQQTAQRAELQCPQPAARRRRSVQLTEKRMDKVGKYPKELRKCCEDGMRENPMRFSCQRRTR
FISLGEACKKVFLDCCNYITELRRQHARASHLGLARSNLDEDIIAEENIVSRSEFPESWLWNVEDLKEPPKNGISTKLMNIFLKDSITTW
EILAVSMSDKKGICVADPFEVTVMQDFFIDLRLPYSVVRNEQVEIRAVLYNYRQNQELKVRVELLHNPAFCSLATTKRRHQQTVTIPPKS
SLSVPYVIVPLKTGLQEVEVKAAVYHHFISDGVRKSLKVVPEGIRMNKTVAVRTLDPERLGREGVQKEDIPPADLSDQVPDTESETRILL
QGTPVAQMTEDAVDAERLKHLIVTPSGCGEQNMIGMTPTVIAVHYLDETEQWEKFGLEKRQGALELIKKGVYTLNNEKQWINKAVGDKLP
ECEAVCGKPKNPANPVQRILGGHLDAKGSFPWQAKMVSHHNLTTGATLINEQWLLTTAKNLFLNHSENATAKDIAPTLTLYVGKKQLVEI
EKVVLHPNYSQVDIGLIKLKQKVSVNERVMPICLPSKDYAEVGRVGYVSGWGRNANFKFTDHLKYVMLPVADQDQCIRHYEGSTVPEKKT

--------------------------------------------------------------
>11694_11694_4_C3-HP_C3_chr19_6694442_ENST00000245907_HP_chr16_72093016_ENST00000562526_length(transcript)=3715nt_BP=3247nt
AGATAAAAAGCCAGCTCCAGCAGGCGCTGCTCACTCCTCCCCATCCTCTCCCTCTGTCCCTCTGTCCCTCTGACCCTGCACTGTCCCAGC
ACCATGGGACCCACCTCAGGTCCCAGCCTGCTGCTCCTGCTACTAACCCACCTCCCCCTGGCTCTGGGGAGTCCCATGTACTCTATCATC
ACCCCCAACATCTTGCGGCTGGAGAGCGAGGAGACCATGGTGCTGGAGGCCCACGACGCGCAAGGGGATGTTCCAGTCACTGTTACTGTC
CACGACTTCCCAGGCAAAAAACTAGTGCTGTCCAGTGAGAAGACTGTGCTGACCCCTGCCACCAACCACATGGGCAACGTCACCTTCACG
ATCCCAGCCAACAGGGAGTTCAAGTCAGAAAAGGGGCGCAACAAGTTCGTGACCGTGCAGGCCACCTTCGGGACCCAAGTGGTGGAGAAG
GTGGTGCTGGTCAGCCTGCAGAGCGGGTACCTCTTCATCCAGACAGACAAGACCATCTACACCCCTGGCTCCACAGTTCTCTATCGGATC
TTCACCGTCAACCACAAGCTGCTACCCGTGGGCCGGACGGTCATGGTCAACATTGAGAACCCGGAAGGCATCCCGGTCAAGCAGGACTCC
TTGTCTTCTCAGAACCAGCTTGGCGTCTTGCCCTTGTCTTGGGACATTCCGGAACTCGTCAACATGGGCCAGTGGAAGATCCGAGCCTAC
TATGAAAACTCACCACAGCAGGTCTTCTCCACTGAGTTTGAGGTGAAGGAGTACGTGCTGCCCAGTTTCGAGGTCATAGTGGAGCCTACA
GAGAAATTCTACTACATCTATAACGAGAAGGGCCTGGAGGTCACCATCACCGCCAGGTTCCTCTACGGGAAGAAAGTGGAGGGAACTGCC
TTTGTCATCTTCGGGATCCAGGATGGCGAACAGAGGATTTCCCTGCCTGAATCCCTCAAGCGCATTCCGATTGAGGATGGCTCGGGGGAG
GTTGTGCTGAGCCGGAAGGTACTGCTGGACGGGGTGCAGAACCCCCGAGCAGAAGACCTGGTGGGGAAGTCTTTGTACGTGTCTGCCACC
GTCATCTTGCACTCAGGCAGTGACATGGTGCAGGCAGAGCGCAGCGGGATCCCCATCGTGACCTCTCCCTACCAGATCCACTTCACCAAG
ACACCCAAGTACTTCAAACCAGGAATGCCCTTTGACCTCATGGTGTTCGTGACGAACCCTGATGGCTCTCCAGCCTACCGAGTCCCCGTG
GCAGTCCAGGGCGAGGACACTGTGCAGTCTCTAACCCAGGGAGATGGCGTGGCCAAACTCAGCATCAACACACACCCCAGCCAGAAGCCC
TTGAGCATCACGGTGCGCACGAAGAAGCAGGAGCTCTCGGAGGCAGAGCAGGCTACCAGGACCATGCAGGCTCTGCCCTACAGCACCGTG
GGCAACTCCAACAATTACCTGCATCTCTCAGTGCTACGTACAGAGCTCAGACCCGGGGAGACCCTCAACGTCAACTTCCTCCTGCGAATG
GACCGCGCCCACGAGGCCAAGATCCGCTACTACACCTACCTGATCATGAACAAGGGCAGGCTGTTGAAGGCGGGACGCCAGGTGCGAGAG
CCCGGCCAGGACCTGGTGGTGCTGCCCCTGTCCATCACCACCGACTTCATCCCTTCCTTCCGCCTGGTGGCGTACTACACGCTGATCGGT
GCCAGCGGCCAGAGGGAGGTGGTGGCCGACTCCGTGTGGGTGGACGTCAAGGACTCCTGCGTGGGCTCGCTGGTGGTAAAAAGCGGCCAG
TCAGAAGACCGGCAGCCTGTACCTGGGCAGCAGATGACCCTGAAGATAGAGGGTGACCACGGGGCCCGGGTGGTACTGGTGGCCGTGGAC
AAGGGCGTGTTCGTGCTGAATAAGAAGAACAAACTGACGCAGAGTAAGATCTGGGACGTGGTGGAGAAGGCAGACATCGGCTGCACCCCG
GGCAGTGGGAAGGATTACGCCGGTGTCTTCTCCGACGCAGGGCTGACCTTCACGAGCAGCAGTGGCCAGCAGACCGCCCAGAGGGCAGAA
CTTCAGTGCCCGCAGCCAGCCGCCCGCCGACGCCGTTCCGTGCAGCTCACGGAGAAGCGAATGGACAAAGTCGGCAAGTACCCCAAGGAG
CTGCGCAAGTGCTGCGAGGACGGCATGCGGGAGAACCCCATGAGGTTCTCGTGCCAGCGCCGGACCCGTTTCATCTCCCTGGGCGAGGCG
TGCAAGAAGGTCTTCCTGGACTGCTGCAACTACATCACAGAGCTGCGGCGGCAGCACGCGCGGGCCAGCCACCTGGGCCTGGCCAGGAGT
AACCTGGATGAGGACATCATTGCAGAAGAGAACATCGTTTCCCGAAGTGAGTTCCCAGAGAGCTGGCTGTGGAACGTTGAGGACTTGAAA
GAGCCACCGAAAAATGGAATCTCTACGAAGCTCATGAATATATTTTTGAAAGACTCCATCACCACGTGGGAGATTCTGGCTGTGAGCATG
TCGGACAAGAAAGGGATCTGTGTGGCAGACCCCTTCGAGGTCACAGTAATGCAGGACTTCTTCATCGACCTGCGGCTACCCTACTCTGTT
GTTCGAAACGAGCAGGTGGAAATCCGAGCCGTTCTCTACAATTACCGGCAGAACCAAGAGCTCAAGGTGAGGGTGGAACTACTCCACAAT
CCAGCCTTCTGCAGCCTGGCCACCACCAAGAGGCGTCACCAGCAGACCGTAACCATCCCCCCCAAGTCCTCGTTGTCCGTTCCATATGTC
ATCGTGCCGCTAAAGACCGGCCTGCAGGAAGTGGAAGTCAAGGCTGCTGTCTACCATCATTTCATCAGTGACGGTGTCAGGAAGTCCCTG
AAGGTCGTGCCGGAAGGAATCAGAATGAACAAAACTGTGGCTGTTCGCACCCTGGATCCAGAACGCCTGGGCCGTGAAGGAGTGCAGAAA
GAGGACATCCCACCTGCAGACCTCAGTGACCAAGTCCCGGACACCGAGTCTGAGACCAGAATTCTCCTGCAAGGGACCCCAGTGGCCCAG
ATGACAGAGGATGCCGTCGACGCGGAACGGCTGAAGCACCTCATTGTGACCCCCTCGGGCTGCGGGGAACAGAACATGATCGGCATGACG
CCCACGGTCATCGCTGTGCATTACCTGGATGAAACGGAGCAGTGGGAGAAGTTCGGCCTAGAGAAGCGGCAGGGGGCCTTGGAGCTCATC
AAGAAGGGAGTGTACACCTTAAACAATGAGAAGCAGTGGATAAATAAGGCTGTTGGAGATAAACTTCCTGAATGTGAAGCAGCCCATACT
GAATGAACACACCTTCTGTGCTGGCATGTCTAAGTACCAAGAAGACACCTGCTATGGCGATGCGGGCAGTGCCTTTGCCGTTCACGACCT
GGAGGAGGACACCTGGTATGCGACTGGGATCTTAAGCTTTGATAAGAGCTGTGCTGTGGCTGAGTATGGTGTGTATGTGAAGGTGACTTC
CATCCAGGACTGGGTTCAGAAGACCATAGCTGAGAACTAATGCAAGGCTGGCCGGAAGCCCTTGCCTGAAAGCAAGATTTCAGCCTGGAA
GAGGGCAAAGTGGACGGGAGTGGACAGGAGTGGATGCGATAAGATGTGGTTTGAAGCTGATGGGTGCCAGCCCTGCATTGCTGAGTCAAT

>11694_11694_4_C3-HP_C3_chr19_6694442_ENST00000245907_HP_chr16_72093016_ENST00000562526_length(amino acids)=1088AA_BP=1059
MTLHCPSTMGPTSGPSLLLLLLTHLPLALGSPMYSIITPNILRLESEETMVLEAHDAQGDVPVTVTVHDFPGKKLVLSSEKTVLTPATNH
MGNVTFTIPANREFKSEKGRNKFVTVQATFGTQVVEKVVLVSLQSGYLFIQTDKTIYTPGSTVLYRIFTVNHKLLPVGRTVMVNIENPEG
IPVKQDSLSSQNQLGVLPLSWDIPELVNMGQWKIRAYYENSPQQVFSTEFEVKEYVLPSFEVIVEPTEKFYYIYNEKGLEVTITARFLYG
KKVEGTAFVIFGIQDGEQRISLPESLKRIPIEDGSGEVVLSRKVLLDGVQNPRAEDLVGKSLYVSATVILHSGSDMVQAERSGIPIVTSP
YQIHFTKTPKYFKPGMPFDLMVFVTNPDGSPAYRVPVAVQGEDTVQSLTQGDGVAKLSINTHPSQKPLSITVRTKKQELSEAEQATRTMQ
ALPYSTVGNSNNYLHLSVLRTELRPGETLNVNFLLRMDRAHEAKIRYYTYLIMNKGRLLKAGRQVREPGQDLVVLPLSITTDFIPSFRLV
AYYTLIGASGQREVVADSVWVDVKDSCVGSLVVKSGQSEDRQPVPGQQMTLKIEGDHGARVVLVAVDKGVFVLNKKNKLTQSKIWDVVEK
ADIGCTPGSGKDYAGVFSDAGLTFTSSSGQQTAQRAELQCPQPAARRRRSVQLTEKRMDKVGKYPKELRKCCEDGMRENPMRFSCQRRTR
FISLGEACKKVFLDCCNYITELRRQHARASHLGLARSNLDEDIIAEENIVSRSEFPESWLWNVEDLKEPPKNGISTKLMNIFLKDSITTW
EILAVSMSDKKGICVADPFEVTVMQDFFIDLRLPYSVVRNEQVEIRAVLYNYRQNQELKVRVELLHNPAFCSLATTKRRHQQTVTIPPKS
SLSVPYVIVPLKTGLQEVEVKAAVYHHFISDGVRKSLKVVPEGIRMNKTVAVRTLDPERLGREGVQKEDIPPADLSDQVPDTESETRILL
QGTPVAQMTEDAVDAERLKHLIVTPSGCGEQNMIGMTPTVIAVHYLDETEQWEKFGLEKRQGALELIKKGVYTLNNEKQWINKAVGDKLP

--------------------------------------------------------------
>11694_11694_5_C3-HP_C3_chr19_6694442_ENST00000245907_HP_chr16_72093016_ENST00000570083_length(transcript)=4266nt_BP=3247nt
AGATAAAAAGCCAGCTCCAGCAGGCGCTGCTCACTCCTCCCCATCCTCTCCCTCTGTCCCTCTGTCCCTCTGACCCTGCACTGTCCCAGC
ACCATGGGACCCACCTCAGGTCCCAGCCTGCTGCTCCTGCTACTAACCCACCTCCCCCTGGCTCTGGGGAGTCCCATGTACTCTATCATC
ACCCCCAACATCTTGCGGCTGGAGAGCGAGGAGACCATGGTGCTGGAGGCCCACGACGCGCAAGGGGATGTTCCAGTCACTGTTACTGTC
CACGACTTCCCAGGCAAAAAACTAGTGCTGTCCAGTGAGAAGACTGTGCTGACCCCTGCCACCAACCACATGGGCAACGTCACCTTCACG
ATCCCAGCCAACAGGGAGTTCAAGTCAGAAAAGGGGCGCAACAAGTTCGTGACCGTGCAGGCCACCTTCGGGACCCAAGTGGTGGAGAAG
GTGGTGCTGGTCAGCCTGCAGAGCGGGTACCTCTTCATCCAGACAGACAAGACCATCTACACCCCTGGCTCCACAGTTCTCTATCGGATC
TTCACCGTCAACCACAAGCTGCTACCCGTGGGCCGGACGGTCATGGTCAACATTGAGAACCCGGAAGGCATCCCGGTCAAGCAGGACTCC
TTGTCTTCTCAGAACCAGCTTGGCGTCTTGCCCTTGTCTTGGGACATTCCGGAACTCGTCAACATGGGCCAGTGGAAGATCCGAGCCTAC
TATGAAAACTCACCACAGCAGGTCTTCTCCACTGAGTTTGAGGTGAAGGAGTACGTGCTGCCCAGTTTCGAGGTCATAGTGGAGCCTACA
GAGAAATTCTACTACATCTATAACGAGAAGGGCCTGGAGGTCACCATCACCGCCAGGTTCCTCTACGGGAAGAAAGTGGAGGGAACTGCC
TTTGTCATCTTCGGGATCCAGGATGGCGAACAGAGGATTTCCCTGCCTGAATCCCTCAAGCGCATTCCGATTGAGGATGGCTCGGGGGAG
GTTGTGCTGAGCCGGAAGGTACTGCTGGACGGGGTGCAGAACCCCCGAGCAGAAGACCTGGTGGGGAAGTCTTTGTACGTGTCTGCCACC
GTCATCTTGCACTCAGGCAGTGACATGGTGCAGGCAGAGCGCAGCGGGATCCCCATCGTGACCTCTCCCTACCAGATCCACTTCACCAAG
ACACCCAAGTACTTCAAACCAGGAATGCCCTTTGACCTCATGGTGTTCGTGACGAACCCTGATGGCTCTCCAGCCTACCGAGTCCCCGTG
GCAGTCCAGGGCGAGGACACTGTGCAGTCTCTAACCCAGGGAGATGGCGTGGCCAAACTCAGCATCAACACACACCCCAGCCAGAAGCCC
TTGAGCATCACGGTGCGCACGAAGAAGCAGGAGCTCTCGGAGGCAGAGCAGGCTACCAGGACCATGCAGGCTCTGCCCTACAGCACCGTG
GGCAACTCCAACAATTACCTGCATCTCTCAGTGCTACGTACAGAGCTCAGACCCGGGGAGACCCTCAACGTCAACTTCCTCCTGCGAATG
GACCGCGCCCACGAGGCCAAGATCCGCTACTACACCTACCTGATCATGAACAAGGGCAGGCTGTTGAAGGCGGGACGCCAGGTGCGAGAG
CCCGGCCAGGACCTGGTGGTGCTGCCCCTGTCCATCACCACCGACTTCATCCCTTCCTTCCGCCTGGTGGCGTACTACACGCTGATCGGT
GCCAGCGGCCAGAGGGAGGTGGTGGCCGACTCCGTGTGGGTGGACGTCAAGGACTCCTGCGTGGGCTCGCTGGTGGTAAAAAGCGGCCAG
TCAGAAGACCGGCAGCCTGTACCTGGGCAGCAGATGACCCTGAAGATAGAGGGTGACCACGGGGCCCGGGTGGTACTGGTGGCCGTGGAC
AAGGGCGTGTTCGTGCTGAATAAGAAGAACAAACTGACGCAGAGTAAGATCTGGGACGTGGTGGAGAAGGCAGACATCGGCTGCACCCCG
GGCAGTGGGAAGGATTACGCCGGTGTCTTCTCCGACGCAGGGCTGACCTTCACGAGCAGCAGTGGCCAGCAGACCGCCCAGAGGGCAGAA
CTTCAGTGCCCGCAGCCAGCCGCCCGCCGACGCCGTTCCGTGCAGCTCACGGAGAAGCGAATGGACAAAGTCGGCAAGTACCCCAAGGAG
CTGCGCAAGTGCTGCGAGGACGGCATGCGGGAGAACCCCATGAGGTTCTCGTGCCAGCGCCGGACCCGTTTCATCTCCCTGGGCGAGGCG
TGCAAGAAGGTCTTCCTGGACTGCTGCAACTACATCACAGAGCTGCGGCGGCAGCACGCGCGGGCCAGCCACCTGGGCCTGGCCAGGAGT
AACCTGGATGAGGACATCATTGCAGAAGAGAACATCGTTTCCCGAAGTGAGTTCCCAGAGAGCTGGCTGTGGAACGTTGAGGACTTGAAA
GAGCCACCGAAAAATGGAATCTCTACGAAGCTCATGAATATATTTTTGAAAGACTCCATCACCACGTGGGAGATTCTGGCTGTGAGCATG
TCGGACAAGAAAGGGATCTGTGTGGCAGACCCCTTCGAGGTCACAGTAATGCAGGACTTCTTCATCGACCTGCGGCTACCCTACTCTGTT
GTTCGAAACGAGCAGGTGGAAATCCGAGCCGTTCTCTACAATTACCGGCAGAACCAAGAGCTCAAGGTGAGGGTGGAACTACTCCACAAT
CCAGCCTTCTGCAGCCTGGCCACCACCAAGAGGCGTCACCAGCAGACCGTAACCATCCCCCCCAAGTCCTCGTTGTCCGTTCCATATGTC
ATCGTGCCGCTAAAGACCGGCCTGCAGGAAGTGGAAGTCAAGGCTGCTGTCTACCATCATTTCATCAGTGACGGTGTCAGGAAGTCCCTG
AAGGTCGTGCCGGAAGGAATCAGAATGAACAAAACTGTGGCTGTTCGCACCCTGGATCCAGAACGCCTGGGCCGTGAAGGAGTGCAGAAA
GAGGACATCCCACCTGCAGACCTCAGTGACCAAGTCCCGGACACCGAGTCTGAGACCAGAATTCTCCTGCAAGGGACCCCAGTGGCCCAG
ATGACAGAGGATGCCGTCGACGCGGAACGGCTGAAGCACCTCATTGTGACCCCCTCGGGCTGCGGGGAACAGAACATGATCGGCATGACG
CCCACGGTCATCGCTGTGCATTACCTGGATGAAACGGAGCAGTGGGAGAAGTTCGGCCTAGAGAAGCGGCAGGGGGCCTTGGAGCTCATC
AAGAAGGGAGTGTACACCTTAAACAATGAGAAGCAGTGGATAAATAAGGCTGTTGGAGATAAACTTCCTGAATGTGAAGCAGTATGTGGG
AAGCCCAAGAATCCGGCAAACCCAGTGCAGCGGATCCTGGGTGGACACCTGGATGCCAAAGGCAGCTTTCCCTGGCAGGCTAAGATGGTT
TCCCACCATAATCTCACCACAGGTGCCACGCTGATCAATGAACAATGGCTGCTGACCACGGCTAAAAATCTCTTCCTGAACCATTCAGAA
AATGCAACAGCGAAAGACATTGCCCCTACTTTAACACTCTATGTGGGGAAAAAGCAGCTTGTAGAGATTGAGAAGGTTGTTCTACACCCT
AACTACTCCCAGGTAGATATTGGGCTCATCAAACTCAAACAGAAGGTGTCTGTTAATGAGAGAGTGATGCCCATCTGCCTACCTTCAAAG
GATTATGCAGAAGTAGGGCGTGTGGGTTATGTTTCTGGCTGGGGGCGAAATGCCAATTTTAAATTTACTGACCATCTGAAGTATGTCATG
CTGCCTGTGGCTGACCAAGACCAATGCATAAGGCATTATGAAGGCAGCACAGTCCCCGAAAAGAAGACACCGAAGAGCCCTGTAGGGGTG
CAGCCCATACTGAATGAACACACCTTCTGTGCTGGCATGTCTAAGTACCAAGAAGACACCTGCTATGGCGATGCGGGCAGTGCCTTTGCC
GTTCACGACCTGGAGGAGGACACCTGGTATGCGACTGGGATCTTAAGCTTTGATAAGAGCTGTGCTGTGGCTGAGTATGGTGTGTATGTG
AAGGTGACTTCCATCCAGGACTGGGTTCAGAAGACCATAGCTGAGAACTAATGCAAGGCTGGCCGGAAGCCCTTGCCTGAAAGCAAGATT
TCAGCCTGGAAGAGGGCAAAGTGGACGGGAGTGGACAGGAGTGGATGCGATAAGATGTGGTTTGAAGCTGATGGGTGCCAGCCCTGCATT

>11694_11694_5_C3-HP_C3_chr19_6694442_ENST00000245907_HP_chr16_72093016_ENST00000570083_length(amino acids)=1343AA_BP=1059
MTLHCPSTMGPTSGPSLLLLLLTHLPLALGSPMYSIITPNILRLESEETMVLEAHDAQGDVPVTVTVHDFPGKKLVLSSEKTVLTPATNH
MGNVTFTIPANREFKSEKGRNKFVTVQATFGTQVVEKVVLVSLQSGYLFIQTDKTIYTPGSTVLYRIFTVNHKLLPVGRTVMVNIENPEG
IPVKQDSLSSQNQLGVLPLSWDIPELVNMGQWKIRAYYENSPQQVFSTEFEVKEYVLPSFEVIVEPTEKFYYIYNEKGLEVTITARFLYG
KKVEGTAFVIFGIQDGEQRISLPESLKRIPIEDGSGEVVLSRKVLLDGVQNPRAEDLVGKSLYVSATVILHSGSDMVQAERSGIPIVTSP
YQIHFTKTPKYFKPGMPFDLMVFVTNPDGSPAYRVPVAVQGEDTVQSLTQGDGVAKLSINTHPSQKPLSITVRTKKQELSEAEQATRTMQ
ALPYSTVGNSNNYLHLSVLRTELRPGETLNVNFLLRMDRAHEAKIRYYTYLIMNKGRLLKAGRQVREPGQDLVVLPLSITTDFIPSFRLV
AYYTLIGASGQREVVADSVWVDVKDSCVGSLVVKSGQSEDRQPVPGQQMTLKIEGDHGARVVLVAVDKGVFVLNKKNKLTQSKIWDVVEK
ADIGCTPGSGKDYAGVFSDAGLTFTSSSGQQTAQRAELQCPQPAARRRRSVQLTEKRMDKVGKYPKELRKCCEDGMRENPMRFSCQRRTR
FISLGEACKKVFLDCCNYITELRRQHARASHLGLARSNLDEDIIAEENIVSRSEFPESWLWNVEDLKEPPKNGISTKLMNIFLKDSITTW
EILAVSMSDKKGICVADPFEVTVMQDFFIDLRLPYSVVRNEQVEIRAVLYNYRQNQELKVRVELLHNPAFCSLATTKRRHQQTVTIPPKS
SLSVPYVIVPLKTGLQEVEVKAAVYHHFISDGVRKSLKVVPEGIRMNKTVAVRTLDPERLGREGVQKEDIPPADLSDQVPDTESETRILL
QGTPVAQMTEDAVDAERLKHLIVTPSGCGEQNMIGMTPTVIAVHYLDETEQWEKFGLEKRQGALELIKKGVYTLNNEKQWINKAVGDKLP
ECEAVCGKPKNPANPVQRILGGHLDAKGSFPWQAKMVSHHNLTTGATLINEQWLLTTAKNLFLNHSENATAKDIAPTLTLYVGKKQLVEI
EKVVLHPNYSQVDIGLIKLKQKVSVNERVMPICLPSKDYAEVGRVGYVSGWGRNANFKFTDHLKYVMLPVADQDQCIRHYEGSTVPEKKT

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for C3-HP


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with
TgeneHPchr19:6694442chr16:72093016ENST0000035590647318_323122.33333333333333407.0CD163
TgeneHPchr19:6694442chr16:72093016ENST0000039813125318_32363.333333333333336348.0CD163
TgeneHPchr19:6694442chr16:72093016ENST0000056557405318_3230348.0CD163
TgeneHPchr19:6694442chr16:72093016ENST0000057008325318_32363.333333333333336348.0CD163


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with
HgeneC3chr19:6694442chr16:72093016ENST00000245907-24411634_16591051.33333333333331664.0CFP/properdin


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for C3-HP


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status
TgeneHPP00738DB14548Zinc sulfate, unspecified formBinderSmall moleculeApproved|Experimental
TgeneHPP00738DB14533Zinc chlorideBinderSmall moleculeApproved|Investigational

Top

Related Diseases for C3-HP


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource
HgeneC3C3151071COMPLEMENT COMPONENT 3 DEFICIENCY, AUTOSOMAL RECESSIVE5CTD_human;GENOMICS_ENGLAND;ORPHANET;UNIPROT
HgeneC3C0242383Age related macular degeneration4CTD_human;GENOMICS_ENGLAND
HgeneC3C2752037HEMOLYTIC UREMIC SYNDROME, ATYPICAL, SUSCEPTIBILITY TO, 54GENOMICS_ENGLAND;UNIPROT
HgeneC3C1969651Macular Degeneration, Age-Related, 92CTD_human;UNIPROT
HgeneC3C0003257Antibody Deficiency Syndrome1CTD_human
HgeneC3C0007787Transient Ischemic Attack1CTD_human
HgeneC3C0011860Diabetes Mellitus, Non-Insulin-Dependent1CTD_human
HgeneC3C0013221Drug toxicity1CTD_human
HgeneC3C0017665Membranous glomerulonephritis1CTD_human
HgeneC3C0019061Hemolytic-Uremic Syndrome1GENOMICS_ENGLAND
HgeneC3C0019193Hepatitis, Toxic1CTD_human
HgeneC3C0021051Immunologic Deficiency Syndromes1CTD_human
HgeneC3C0021655Insulin Resistance1CTD_human
HgeneC3C0022660Kidney Failure, Acute1CTD_human
HgeneC3C0030524Paratuberculosis1CTD_human
HgeneC3C0030807Pemphigus1CTD_human
HgeneC3C0030809Pemphigus Vulgaris1CTD_human
HgeneC3C0034152Henoch-Schoenlein Purpura1CTD_human
HgeneC3C0041755Adverse reaction to drug1CTD_human
HgeneC3C0042386Vasculitis, Hemorrhagic1CTD_human
HgeneC3C0086445Idiopathic Membranous Glomerulonephritis1CTD_human
HgeneC3C0086922Rheumatoid Purpura1CTD_human
HgeneC3C0238281Middle Cerebral Artery Syndrome1CTD_human
HgeneC3C0242461Purpura, Nonthrombocytopenic1CTD_human
HgeneC3C0263313Pemphigus Foliaceus1CTD_human
HgeneC3C0272242Complement deficiency disease1GENOMICS_ENGLAND
HgeneC3C0376362Purpura Hemorrhagica1CTD_human
HgeneC3C0472381Posterior Circulation Transient Ischemic Attack1CTD_human
HgeneC3C0740376Middle Cerebral Artery Thrombosis1CTD_human
HgeneC3C0740391Middle Cerebral Artery Occlusion1CTD_human
HgeneC3C0740392Infarction, Middle Cerebral Artery1CTD_human
HgeneC3C0751019Carotid Circulation Transient Ischemic Attack1CTD_human
HgeneC3C0751020Transient Ischemic Attack, Vertebrobasilar Circulation1CTD_human
HgeneC3C0751021Crescendo Transient Ischemic Attacks1CTD_human
HgeneC3C0751022Brain Stem Ischemia, Transient1CTD_human
HgeneC3C0751845Middle Cerebral Artery Embolus1CTD_human
HgeneC3C0751846Left Middle Cerebral Artery Infarction1CTD_human
HgeneC3C0751847Embolic Infarction, Middle Cerebral Artery1CTD_human
HgeneC3C0751848Thrombotic Infarction, Middle Cerebral Artery1CTD_human
HgeneC3C0751849Right Middle Cerebral Artery Infarction1CTD_human
HgeneC3C0860207Drug-Induced Liver Disease1CTD_human
HgeneC3C0917805Transient Cerebral Ischemia1CTD_human
HgeneC3C0920563Insulin Sensitivity1CTD_human
HgeneC3C1262760Hepatitis, Drug-Induced1CTD_human
HgeneC3C1332655Complement component 3 deficiency1GENOMICS_ENGLAND
HgeneC3C1527335Transient Ischemic Attack, Anterior Circulation1CTD_human
HgeneC3C1565662Acute Kidney Insufficiency1CTD_human
HgeneC3C1704378Heymann Nephritis1CTD_human
HgeneC3C2609414Acute kidney injury1CTD_human
HgeneC3C2931788Atypical Hemolytic Uremic Syndrome1CTD_human;GENOMICS_ENGLAND
HgeneC3C3658290Drug-Induced Acute Liver Injury1CTD_human
HgeneC3C4087273C3 glomerulopathy1GENOMICS_ENGLAND
HgeneC3C4277682Chemical and Drug Induced Liver Injury1CTD_human
HgeneC3C4279912Chemically-Induced Liver Toxicity1CTD_human
TgeneC0041696Unipolar Depression4PSYGENET
TgeneC0011570Mental Depression3PSYGENET
TgeneC0011581Depressive disorder3PSYGENET
TgeneC0027051Myocardial Infarction3CTD_human
TgeneC1269683Major Depressive Disorder3PSYGENET
TgeneC0024530Malaria2CTD_human
TgeneC0525045Mood Disorders2PSYGENET
TgeneC0001723Affective Disorders, Psychotic1PSYGENET
TgeneC0002871Anemia1CTD_human
TgeneC0002895Anemia, Sickle Cell1CTD_human
TgeneC0003864Arthritis1CTD_human
TgeneC0004153Atherosclerosis1CTD_human
TgeneC0004364Autoimmune Diseases1CTD_human
TgeneC0006142Malignant neoplasm of breast1CTD_human
TgeneC0007222Cardiovascular Diseases1CTD_human
TgeneC0011854Diabetes Mellitus, Insulin-Dependent1CTD_human
TgeneC0011860Diabetes Mellitus, Non-Insulin-Dependent1CTD_human
TgeneC0011875Diabetic Angiopathies1CTD_human
TgeneC0013221Drug toxicity1CTD_human
TgeneC0016479Food Poisoning1CTD_human
TgeneC0017416Genital Neoplasms, Female1CTD_human
TgeneC0018995Hemochromatosis1CTD_human
TgeneC0019054Hemolysis (disorder)1CTD_human
TgeneC0019163Hepatitis B1CTD_human
TgeneC0019693HIV Infections1CTD_human
TgeneC0020517Hypersensitivity1CTD_human
TgeneC0020538Hypertensive disease1CTD_human
TgeneC0022660Kidney Failure, Acute1CTD_human
TgeneC0023418leukemia1CTD_human
TgeneC0025945Microangiopathy, Diabetic1CTD_human
TgeneC0035305Retinal Detachment1CTD_human
TgeneC0036341Schizophrenia1CTD_human
TgeneC0041296Tuberculosis1CTD_human
TgeneC0041755Adverse reaction to drug1CTD_human
TgeneC0085397Pasteurellaceae Infections1CTD_human
TgeneC0162323Polyarthritis1CTD_human
TgeneC0205734Diabetes, Autoimmune1CTD_human
TgeneC0235574Intravascular hemolysis1CTD_human
TgeneC0242339Dyslipidemias1CTD_human
TgeneC0312854Extravascular Hemolysis1CTD_human
TgeneC0339546Retinal Pigment Epithelial Detachment1CTD_human
TgeneC0341934Transient hypertension of pregnancy1CTD_human
TgeneC0342257Complications of Diabetes Mellitus1CTD_human
TgeneC0342302Brittle diabetes1CTD_human
TgeneC0392514Hereditary hemochromatosis1CTD_human
TgeneC0524909Hepatitis B, Chronic1CTD_human
TgeneC0524910Hepatitis C, Chronic1CTD_human
TgeneC0598784Dyslipoproteinemias1CTD_human
TgeneC0678222Breast Carcinoma1CTD_human
TgeneC0679360Foodborne Disease1CTD_human
TgeneC0852036Pregnancy associated hypertension1CTD_human
TgeneC1257931Mammary Neoplasms, Human1CTD_human
TgeneC1458155Mammary Neoplasms1CTD_human
TgeneC1527304Allergic Reaction1CTD_human
TgeneC1563937Atherogenesis1CTD_human
TgeneC1565662Acute Kidney Insufficiency1CTD_human
TgeneC2609414Acute kidney injury1CTD_human
TgeneC3279786ANHAPTOGLOBINEMIA1UNIPROT
TgeneC3837958Diabetes Mellitus, Ketosis-Prone1CTD_human
TgeneC4505456HIV Coinfection1CTD_human
TgeneC4554117Diabetes Mellitus, Sudden-Onset1CTD_human
TgeneC4704874Mammary Carcinoma, Human1CTD_human