Fusion Protein:HMGA1-LAMA4 |
Fusion Protein Summary |
Fusion gene summary |
Fusion partner gene information | Fusion gene name: HMGA1-LAMA4 | FusionPDB ID: 36806 | FusionGDB2.0 ID: 36806 |
| Hgene | Tgene |
Gene symbol | HMGA1 | LAMA4 |
Gene ID | 3159 | 3910 |
Gene name | high mobility group AT-hook 1 | laminin subunit alpha 4 | |
Synonyms | HMG-R, HMGA1A, HMGIY | CMD1JJ, LAMA3, LAMA4*-1 | |
Cytomap | 6p21.31 | 6q21 | |
Type of gene | protein-coding | protein-coding | |
Description | high mobility group protein HMG-I/HMG-Y; high mobility group protein A1; high mobility group protein R; high-mobility group (nonhistone chromosomal) protein isoforms I and Y; nonhistone chromosomal high-mobility group protein HMG-I/HMG-Y | laminin subunit alpha-4; laminin alpha 4 chain; laminin, alpha 4 | |
Modification date | 20200315 | 20200313 | |
UniProtAcc | P17096 | Q16363 | |
Ensembl transcripts involved in fusion gene | ENST ids | ENST00000478214, ENST00000311487, ENST00000347617, ENST00000374116, ENST00000401473, ENST00000447654, ENST00000395004 | ENST00000230538, ENST00000389463, ENST00000424408, ENST00000522006, ENST00000431543, ENST00000524032, ENST00000368638, ENST00000453937 |
Fusion gene scores for assessment (based on all fusion genes of FusionGDB 2.0) | * DoF score | 13 X 9 X 5=585 | 9 X 7 X 7=441 |
# samples | 12 | 9 | |
** MAII score | log2(12/585*10)=-2.28540221886225 possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0 | log2(9/441*10)=-2.29278174922785 possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0 | |
Context (manual curation of fusion genes in FusionPDB) | PubMed: HMGA1 [Title/Abstract] AND LAMA4 [Title/Abstract] AND fusion [Title/Abstract] | ||
Most frequent breakpoint (based on all fusion genes of FusionGDB 2.0) | |||
Anticipated loss of major functional domain due to fusion event. |
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10) |
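The two score definitions above can be checked against the values reported for HMGA1 and LAMA4 in the score table. The following is a minimal sketch, not FusionPDB's own code: the function names are hypothetical, and the "possibly effective gene" rule is taken directly from the "DoF>8 and MAII<0" note in the table.

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency = # partners x # break points x # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index = log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def is_possibly_effective(dof: int, maii: float) -> bool:
    """Rule quoted in the score table: DoF > 8 and MAII < 0."""
    return dof > 8 and maii < 0


# Values copied from the score table for the two partner genes.
for gene, factors, n_samples in [("HMGA1", (13, 9, 5), 12), ("LAMA4", (9, 7, 7), 9)]:
    dof = dof_score(*factors)
    maii = maii_score(n_samples, dof)
    print(gene, dof, round(maii, 4), is_possibly_effective(dof, maii))
```

Both genes satisfy the quoted rule, which agrees with the peGinPCFGs label shown in the table.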
Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez |
Partner | Gene | GO ID | GO term | PubMed ID |
Hgene | HMGA1 | GO:0035986 | senescence-associated heterochromatin focus assembly | 16901784 |
Hgene | HMGA1 | GO:0090402 | oncogene-induced cell senescence | 16901784 |
Fusion gene breakpoints across HMGA1 (5'-gene) |
Fusion gene breakpoints across LAMA4 (3'-gene) |
Fusion Gene Sample Information |
Fusion gene information from FusionGDB2.0. |
Fusion gene information from two resources (ChiTaRS 5.0 and ChimerDB 4.0). All genome coordinates were lifted over to hg19. |
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
ChimerKB3 | . | . | HMGA1 | chr6 | 34211295 | + | LAMA4 | chr6 | 112537670 | - |
Fusion ORF Analysis |
Fusion information from ORFfinder translation of the full-length fusion transcript sequences from FusionPDB. |
Henst | Hgene | Hchr | Hbp | Hstrand | Tenst | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids) |
ENST00000447654 | HMGA1 | chr6 | 34211295 | + | ENST00000230538 | LAMA4 | chr6 | 112537670 | - | 6713 | 759 | 270 | 6035 | 1921 |
ENST00000447654 | HMGA1 | chr6 | 34211295 | + | ENST00000522006 | LAMA4 | chr6 | 112537670 | - | 6592 | 759 | 270 | 6014 | 1914 |
ENST00000447654 | HMGA1 | chr6 | 34211295 | + | ENST00000389463 | LAMA4 | chr6 | 112537670 | - | 6578 | 759 | 270 | 6014 | 1914 |
ENST00000447654 | HMGA1 | chr6 | 34211295 | + | ENST00000424408 | LAMA4 | chr6 | 112537670 | - | 6159 | 759 | 270 | 6014 | 1914 |
ENST00000401473 | HMGA1 | chr6 | 34211295 | + | ENST00000230538 | LAMA4 | chr6 | 112537670 | - | 6440 | 486 | 39 | 5762 | 1907 |
ENST00000401473 | HMGA1 | chr6 | 34211295 | + | ENST00000522006 | LAMA4 | chr6 | 112537670 | - | 6319 | 486 | 39 | 5741 | 1900 |
ENST00000401473 | HMGA1 | chr6 | 34211295 | + | ENST00000389463 | LAMA4 | chr6 | 112537670 | - | 6305 | 486 | 39 | 5741 | 1900 |
ENST00000401473 | HMGA1 | chr6 | 34211295 | + | ENST00000424408 | LAMA4 | chr6 | 112537670 | - | 5886 | 486 | 39 | 5741 | 1900 |
ENST00000311487 | HMGA1 | chr6 | 34211295 | + | ENST00000230538 | LAMA4 | chr6 | 112537670 | - | 6473 | 519 | 39 | 5795 | 1918 |
ENST00000311487 | HMGA1 | chr6 | 34211295 | + | ENST00000522006 | LAMA4 | chr6 | 112537670 | - | 6352 | 519 | 39 | 5774 | 1911 |
ENST00000311487 | HMGA1 | chr6 | 34211295 | + | ENST00000389463 | LAMA4 | chr6 | 112537670 | - | 6338 | 519 | 39 | 5774 | 1911 |
ENST00000311487 | HMGA1 | chr6 | 34211295 | + | ENST00000424408 | LAMA4 | chr6 | 112537670 | - | 5919 | 519 | 39 | 5774 | 1911 |
ENST00000347617 | HMGA1 | chr6 | 34211295 | + | ENST00000230538 | LAMA4 | chr6 | 112537670 | - | 6326 | 372 | 39 | 5648 | 1869 |
ENST00000347617 | HMGA1 | chr6 | 34211295 | + | ENST00000522006 | LAMA4 | chr6 | 112537670 | - | 6205 | 372 | 39 | 5627 | 1862 |
ENST00000347617 | HMGA1 | chr6 | 34211295 | + | ENST00000389463 | LAMA4 | chr6 | 112537670 | - | 6191 | 372 | 39 | 5627 | 1862 |
ENST00000347617 | HMGA1 | chr6 | 34211295 | + | ENST00000424408 | LAMA4 | chr6 | 112537670 | - | 5772 | 372 | 39 | 5627 | 1862 |
ENST00000374116 | HMGA1 | chr6 | 34211295 | + | ENST00000230538 | LAMA4 | chr6 | 112537670 | - | 6402 | 448 | 190 | 5724 | 1844 |
ENST00000374116 | HMGA1 | chr6 | 34211295 | + | ENST00000522006 | LAMA4 | chr6 | 112537670 | - | 6281 | 448 | 190 | 5703 | 1837 |
ENST00000374116 | HMGA1 | chr6 | 34211295 | + | ENST00000389463 | LAMA4 | chr6 | 112537670 | - | 6267 | 448 | 190 | 5703 | 1837 |
ENST00000374116 | HMGA1 | chr6 | 34211295 | + | ENST00000424408 | LAMA4 | chr6 | 112537670 | - | 5848 | 448 | 190 | 5703 | 1837 |
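The transcript coordinates in the table above appear to be related to the amino acid lengths by simple codon arithmetic. The sketch below rests on assumptions inferred from the table values rather than stated by FusionPDB: that the predicted start and stop are 1-based inclusive transcript positions, that the ORF span includes the stop codon, and that the `BP` value in the sequence headers is the breakpoint locus converted to whole codons from the predicted start. The function names are hypothetical.

```python
def orf_aa_length(pred_start: int, pred_stop: int) -> int:
    """Amino acid count of an ORF given 1-based inclusive transcript coordinates.

    Assumes the span ends on the last base of the stop codon, so one codon
    is subtracted from the total.
    """
    n_nt = pred_stop - pred_start + 1
    if n_nt % 3 != 0:
        raise ValueError("ORF span is not a whole number of codons")
    return n_nt // 3 - 1


def breakpoint_aa(bp_locus: int, pred_start: int) -> int:
    """Number of complete codons between the predicted start and the breakpoint."""
    return (bp_locus - pred_start + 1) // 3


# First table row: ENST00000447654 / ENST00000230538 (start 270, stop 6035).
print(orf_aa_length(270, 6035))   # table reports 1921 amino acids

# ENST00000311487 rows: breakpoint locus 519, start 39; headers report BP=160.
print(breakpoint_aa(519, 39))
```

Under these assumptions every row of the table reproduces its reported amino acid length, which gives a quick sanity check when the table is parsed programmatically.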
DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on a convolutional neural network that uses real Ribo-seq data for comparison. If the no-coding score is < 0.5 and the coding score is > 0.5, the in-frame fusion transcript is predicted as likely to be translated. |
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score |
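The decision rule described for the DeepORF scores can be written as a one-line predicate. This is only a sketch of the stated thresholds; the function name is hypothetical, and the example scores are made up for illustration because the table lists no rows for this fusion gene.

```python
def likely_translated(no_coding_score: float, coding_score: float) -> bool:
    """Apply the stated rule: no-coding score < 0.5 and coding score > 0.5."""
    return no_coding_score < 0.5 and coding_score > 0.5


# Hypothetical scores, not taken from FusionPDB.
print(likely_translated(0.12, 0.88))  # True
print(likely_translated(0.60, 0.40))  # False
```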
Fusion Amino Acid Sequences |
For each individual full-length fusion transcript sequence from FusionPDB, we ran ORFfinder and chose the longest ORF among all the predicted ones. |
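The sequence records in this section carry their metadata in an underscore-delimited header. The following is a minimal sketch of a parser for that header, assuming the field order seen in the records (two numeric IDs, an isoform number, the fusion gene name, then gene, chromosome, breakpoint and ENST ID for each partner). The regular expression and the function name are hypothetical, and gene symbols containing an underscore would not match.

```python
import re

# Pattern modelled on the record headers in this section; "\s+" tolerates a
# line wrap inside "length(amino acids)".
HEADER_RE = re.compile(
    r"^>?(?P<fusion_id>\d+)_(?P<fusion_id2>\d+)_(?P<isoform>\d+)_"
    r"(?P<fgname>[^_]+)_"
    r"(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)_"
    r"length\(amino\s+acids\)=(?P<aa_len>\d+)AA_BP=(?P<bp_aa>\d+)$"
)

INT_FIELDS = ("fusion_id", "fusion_id2", "isoform", "hbp", "tbp", "aa_len", "bp_aa")


def parse_header(header: str) -> dict:
    """Split one record header into named fields, converting numbers to int."""
    match = HEADER_RE.match(header.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {header!r}")
    fields = match.groupdict()
    for name in INT_FIELDS:
        fields[name] = int(fields[name])
    return fields


example = (
    ">36806_36806_1_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000311487_"
    "LAMA4_chr6_112537670_ENST00000230538_length(amino acids)=1918AA_BP=160"
)
record = parse_header(example)
print(record["henst"], record["tenst"], record["aa_len"], record["bp_aa"])
```

Parsing the headers this way makes it straightforward to join each sequence back to the matching row of the ORF table by its `henst` and `tenst` values.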
>FusionGDB ID_FusionGDB isoform ID_FGname_Hgene_Hchr_Hbp_Henst_Tgene_Tchr_Tbp_Tenst_length(fusion AA) seq_BP >36806_36806_1_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000311487_LAMA4_chr6_112537670_ENST00000230538_length(amino acids)=1918AA_BP=160 MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDG TEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCN GNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYG NPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTG MDCPTISCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEE LVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDE EADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLS TSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASN VYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDA VKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQK RPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYM GLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVG GVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRF DIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEK MKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFD GFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQ TQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKD APSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLA HGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFS 
GCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQV IVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRH -------------------------------------------------------------- >36806_36806_2_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000311487_LAMA4_chr6_112537670_ENST00000389463_length(amino acids)=1911AA_BP=160 MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDG TEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCN GNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYG NPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTG CDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQ ASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYE LLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLT TPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVN YVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAA ERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVS ASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKND NLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFK LPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTP ADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTD IYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFN FRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKK FYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPV 
ALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYM FNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQ LNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNG IRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHP -------------------------------------------------------------- >36806_36806_3_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000311487_LAMA4_chr6_112537670_ENST00000424408_length(amino acids)=1911AA_BP=160 MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDG TEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCN GNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYG NPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTG CDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQ ASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYE LLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLT TPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVN YVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAA ERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVS ASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKND NLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFK LPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTP ADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTD IYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFN 
FRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKK FYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPV ALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYM FNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQ LNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNG IRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHP -------------------------------------------------------------- >36806_36806_4_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000311487_LAMA4_chr6_112537670_ENST00000522006_length(amino acids)=1911AA_BP=160 MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDG TEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCN GNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYG NPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTG CDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQ ASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYE LLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLT TPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVN YVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAA ERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVS ASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKND NLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFK LPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTP 
ADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTD IYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFN FRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKK FYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPV ALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYM FNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQ LNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNG IRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHP -------------------------------------------------------------- >36806_36806_5_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000347617_LAMA4_chr6_112537670_ENST00000230538_length(amino acids)=1869AA_BP=111 MGLRAGAISGAGAAPRRPSQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGA AKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPC PLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERC APGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGMDCPTISCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEI NATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLY YGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQ ALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSH DLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLN QARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQ NLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDD LKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGK 
VFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVP CARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLED TLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNL LEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVD KQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEK VHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEH LKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRV LEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESF NIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHV -------------------------------------------------------------- >36806_36806_6_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000347617_LAMA4_chr6_112537670_ENST00000389463_length(amino acids)=1862AA_BP=111 MGLRAGAISGAGAAPRRPSQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGA AKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPC PLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERC APGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLL KTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHEL SPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRD AEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAI DHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQA KAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDS SAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSL 
SLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPS LSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLA FTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQI NDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETL GVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGL SHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYE CPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGA KSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPP TEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFE IAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPK -------------------------------------------------------------- >36806_36806_7_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000347617_LAMA4_chr6_112537670_ENST00000424408_length(amino acids)=1862AA_BP=111 MGLRAGAISGAGAAPRRPSQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGA AKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPC PLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERC APGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLL KTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHEL SPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRD AEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAI DHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQA KAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDS 
SAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSL SLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPS LSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLA FTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQI NDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETL GVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGL SHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYE CPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGA KSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPP TEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFE IAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPK -------------------------------------------------------------- >36806_36806_8_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000347617_LAMA4_chr6_112537670_ENST00000522006_length(amino acids)=1862AA_BP=111 MGLRAGAISGAGAAPRRPSQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGA AKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPC PLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERC APGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLL KTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHEL SPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRD AEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAI DHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQA 
KAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDS SAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSL SLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPS LSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLA FTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQI NDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETL GVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGL SHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYE CPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGA KSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPP TEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFE IAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPK -------------------------------------------------------------- >36806_36806_9_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000374116_LAMA4_chr6_112537670_ENST00000230538_length(amino acids)=1844AA_BP=86 MLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNA GFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNE NYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCD SVTGECLEEGFEPPTGMDCPTISCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQI NNAENTMKSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEE IRSRQPFFTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQ QERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLH 
SSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVG GALARKSALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLT EVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETAD QFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGD DSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAV VRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKM ILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYF NGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVD KSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNL SKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIF YVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAP GKAVKNVQINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGH SVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLL -------------------------------------------------------------- >36806_36806_10_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000374116_LAMA4_chr6_112537670_ENST00000389463_length(amino acids)=1837AA_BP=86 MLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNA GFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNE NYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCD SVTGECLEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTM KSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPF FTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQ 
MEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGL VQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKS ALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLL DQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLG SKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLD PEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRR GKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRR HVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIA SIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSK NPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQ NKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEE NDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNV QINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYL NVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPS -------------------------------------------------------------- >36806_36806_11_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000374116_LAMA4_chr6_112537670_ENST00000424408_length(amino acids)=1837AA_BP=86 MLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNA GFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNE NYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCD SVTGECLEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTM KSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPF 
FTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQ MEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGL VQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKS ALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLL DQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLG SKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLD PEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRR GKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRR HVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIA SIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSK NPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQ NKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEE NDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNV QINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYL NVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPS -------------------------------------------------------------- >36806_36806_12_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000374116_LAMA4_chr6_112537670_ENST00000522006_length(amino acids)=1837AA_BP=86 MLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNA GFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNE NYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCD SVTGECLEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTM 
KSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPF FTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQ MEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGL VQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKS ALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLL DQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLG SKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLD PEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRR GKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRR HVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIA SIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSK NPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQ NKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEE NDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNV QINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYL NVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPS -------------------------------------------------------------- >36806_36806_13_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000401473_LAMA4_chr6_112537670_ENST00000230538_length(amino acids)=1907AA_BP=149 MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDG TEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSG YCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLLIGSTCKK 
CDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGMDCPTISCDKC VWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRK GQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQ AESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRL TLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSE ANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGD AQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQ RIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVY VYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTS LNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNG LILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIG GAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTL QPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFG GSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKL PERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVG HKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGA SITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDF STSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFS -------------------------------------------------------------- >36806_36806_14_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000401473_LAMA4_chr6_112537670_ENST00000389463_length(amino acids)=1900AA_BP=149 MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDG 
TEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSG YCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLLIGSTCKK CDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGCDKCVWDLTDA LRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKE SMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRL HNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDD IIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEF ALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQ SRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIA QTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTK DVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFV GCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVN GSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEIL QSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLF YYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQ YANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPR NSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIR SQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASITSASQ TFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPK QSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFSKAALVSG -------------------------------------------------------------- >36806_36806_15_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000401473_LAMA4_chr6_112537670_ENST00000424408_length(amino 
acids)=1900AA_BP=149 MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDG TEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSG YCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLLIGSTCKK CDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGCDKCVWDLTDA LRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKE SMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRL HNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDD IIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEF ALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQ SRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIA QTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTK DVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFV GCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVN GSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEIL QSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLF YYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQ YANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPR NSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIR SQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASITSASQ TFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPK QSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFSKAALVSG -------------------------------------------------------------- 
>36806_36806_16_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000401473_LAMA4_chr6_112537670_ENST00000522006_length(amino acids)=1900AA_BP=149 MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDG TEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSG YCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLLIGSTCKK CDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGCDKCVWDLTDA LRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKE SMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRL HNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDD IIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEF ALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQ SRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIA QTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTK DVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFV GCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVN GSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEIL QSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLF YYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQ YANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPR NSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIR SQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASITSASQ TFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPK 
QSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFSKAALVSG -------------------------------------------------------------- >36806_36806_17_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000447654_LAMA4_chr6_112537670_ENST00000230538_length(amino acids)=1921AA_BP=163 MGGEHAAAAVSERLCSLPVSDPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQE KDGTEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPC DCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPG YYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEP PTGMDCPTISCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSD VEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQREL VDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNM SLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALD ASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRL SDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTV EQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKK EYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVF YVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQV TRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMD NEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKIS FFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGK IEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGK SKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTL 
FLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIY SFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKN GQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGC -------------------------------------------------------------- >36806_36806_18_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000447654_LAMA4_chr6_112537670_ENST00000389463_length(amino acids)=1914AA_BP=163 MGGEHAAAAVSERLCSLPVSDPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQE KDGTEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPC DCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPG YYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEP PTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEK ENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADE AYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSAD SLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYEN IVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQL QAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPAS NVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAI KNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPS NFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEV RTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIP FTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEG GFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQAS 
EKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSW DPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRL VYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLS NLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKV NNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVID -------------------------------------------------------------- >36806_36806_19_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000447654_LAMA4_chr6_112537670_ENST00000424408_length(amino acids)=1914AA_BP=163 MGGEHAAAAVSERLCSLPVSDPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQE KDGTEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPC DCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPG YYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEP PTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEK ENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADE AYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSAD SLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYEN IVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQL QAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPAS NVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAI KNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPS NFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEV RTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIP 
FTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEG GFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQAS EKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSW DPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRL VYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLS NLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKV NNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVID -------------------------------------------------------------- >36806_36806_20_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000447654_LAMA4_chr6_112537670_ENST00000522006_length(amino acids)=1914AA_BP=163 MGGEHAAAAVSERLCSLPVSDPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQE KDGTEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPC DCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPG YYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEP PTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEK ENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADE AYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSAD SLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYEN IVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQL QAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPAS NVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAI KNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPS 
NFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEV RTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIP FTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEG GFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQAS EKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSW DPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRL VYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLS NLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKV NNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVID -------------------------------------------------------------- |
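The sequence headers above pack several fields into one underscore-separated string. The following is a minimal sketch of how such a header could be split into named fields. The regular expression, the function name `parse_header`, and the field names (`id1`, `id2`, `index`, and so on) are illustrative assumptions and not part of FusionPDB; the sketch also assumes that gene symbols contain no underscores and that `BP` is the breakpoint position within the amino acid sequence.

```python
import re

# Hypothetical pattern inferred from the headers shown on this page.
HEADER_RE = re.compile(
    r"^>(?P<id1>\d+)_(?P<id2>\d+)_(?P<index>\d+)_"
    r"(?P<fusion>[^_]+)_"
    r"(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)_"
    r"length\(amino\s+acids\)=(?P<length>\d+)AA_BP=(?P<bp>\d*)$"
)


def parse_header(header):
    """Split one FASTA-style header into a dict of named fields."""
    # Collapse any line wrapping or repeated whitespace into single spaces.
    normalized = " ".join(header.split())
    match = HEADER_RE.match(normalized)
    if match is None:
        raise ValueError("unrecognized header: " + normalized)
    record = match.groupdict()
    # Convert the numeric fields to integers.
    for key in ("index", "hbp", "tbp", "length"):
        record[key] = int(record[key])
    # The breakpoint residue may be absent, so keep None in that case.
    record["bp"] = int(record["bp"]) if record["bp"] else None
    return record


example = (
    ">36806_36806_13_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000401473_"
    "LAMA4_chr6_112537670_ENST00000230538_length(amino acids)=1907AA_BP=149"
)
fields = parse_header(example)
print(fields["hgene"], fields["tgene"], fields["length"], fields["bp"])
```

Parsing the header into a dictionary makes it straightforward to group isoforms by head or tail transcript, or to compare the stated length against the length of the sequence that follows.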
Fusion Protein Functional Features |
Four levels of functional features of fusion genes
Go to the FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr6:/chr6:)
- FGviewer provides online visualization of which protein functional features are retained, across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the Tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to find the hyperlink to the search result.
5. Click the hyperlink.
6. View the FGviewer result for your fusion gene. |
Main function of each fusion partner protein. (from UniProt) |
Hgene | Tgene |
HMGA1 | LAMA4 |
FUNCTION: HMG-I/Y bind preferentially to the minor groove of A+T rich regions in double-stranded DNA. It is suggested that these proteins could function in nucleosome phasing and in the 3'-end processing of mRNA transcripts. They are also involved in the transcription regulation of genes containing, or in close proximity to A+T-rich regions. | FUNCTION: Binding to cells via a high affinity receptor, laminin is thought to mediate the attachment, migration and organization of cells into tissues during embryonic development by interacting with other extracellular matrix components. |
Retention analysis results for each fusion partner protein across 39 UniProt protein features: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because viewing space is limited, only the retention information for the 13 regional features is shown here. All retention annotation results can be downloaded at
* A negative BPloci value means that the breakpoint is located before the CDS. |
- Retained protein feature among the 13 regional features. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
- Not-retained protein feature among the 13 regional features. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
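The retained and not-retained tables above classify each protein feature by comparing its position with the breakpoint. The sketch below shows one plausible way to express that comparison. It is an assumption, not FusionPDB's documented method: it supposes that the head gene (Hgene) contributes the residues before the breakpoint, that the tail gene (Tgene) contributes the residues from the breakpoint onward, and that a feature counts as retained only when it lies entirely within the contributed part. The function name `is_retained` and the treatment of the boundary residue are hypothetical.

```python
def is_retained(partner, feature_start, feature_end, bp_loci):
    """Return True if a feature spanning [feature_start, feature_end]
    is fully kept in the fusion protein (assumed rule, see lead-in).

    partner  -- "Hgene" (5' partner) or "Tgene" (3' partner)
    bp_loci  -- breakpoint position in the partner protein; a negative
                value means the breakpoint lies before the CDS
    """
    if partner == "Hgene":
        # The head gene keeps what lies before the breakpoint.
        return feature_end <= bp_loci
    if partner == "Tgene":
        # The tail gene keeps what lies at or after the breakpoint.
        return feature_start >= bp_loci
    raise ValueError("partner must be 'Hgene' or 'Tgene'")


# Hypothetical feature coordinates, for illustration only.
print(is_retained("Hgene", 21, 31, 149))    # feature ends before the breakpoint
print(is_retained("Hgene", 140, 160, 149))  # feature straddles the breakpoint
print(is_retained("Tgene", 1, 10, -5))      # breakpoint before the CDS
```

Under this rule a negative BPloci leaves no head-gene feature retained and every tail-gene feature retained, which matches the note that a negative value places the breakpoint before the CDS.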
Fusion Protein Structures |
PDB and CIF files of the predicted fusion proteins * Here we show the 3D structures of the fusion proteins using Mol*. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100, and model confidence is displayed per residue from these pLDDT values. pLDDT corresponds to the model’s prediction of its score on the local Distance Difference Test and is a measure of local accuracy (from the AlphaFold website). To color-code individual residues, we converted the individual PDB files into CIF format. |
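As a rough illustration of how the per-residue pLDDT values could be read from a downloaded PDB file, the sketch below takes them from the B-factor column of the C-alpha `ATOM` records. It assumes that the files follow the usual AlphaFold convention of storing pLDDT in the B-factor field, which this page does not state explicitly. The function names are hypothetical, the confidence bands follow the labels used on the AlphaFold website, and the handling of exact boundary values (50, 70, 90) is a choice made here.

```python
def plddt_per_residue(pdb_text):
    """Map residue number -> pLDDT, read from C-alpha ATOM records.

    Assumes fixed-column PDB format with pLDDT stored in the
    B-factor field (columns 61-66).
    """
    scores = {}
    for line in pdb_text.splitlines():
        # Columns 13-16 hold the atom name; keep only C-alpha atoms.
        if line.startswith("ATOM") and line[12:16].strip() == "CA":
            residue_number = int(line[22:26])   # columns 23-26
            scores[residue_number] = float(line[60:66])
    return scores


def confidence_band(plddt):
    """Translate a pLDDT value into an AlphaFold-style confidence label."""
    if plddt > 90:
        return "very high"
    if plddt > 70:
        return "confident"
    if plddt > 50:
        return "low"
    return "very low"


def summarize(pdb_text):
    """Count residues per confidence band."""
    counts = {"very high": 0, "confident": 0, "low": 0, "very low": 0}
    for value in plddt_per_residue(pdb_text).values():
        counts[confidence_band(value)] += 1
    return counts
```

Calling `summarize(open("1047.pdb").read())` on a downloaded file would then give the number of residues in each band, which is a quick way to judge how much of a predicted fusion structure is modeled with confidence.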
Fusion protein PDB link (fusion AA seq ID in FusionPDB) | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | AA seq | Len(AA seq) |
PDB file (1042)CIF file (1042) >>>1042.cif | HMGA1 | chr6 | 34211295 | + | LAMA4 | chr6 | 112537670 | - | MLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTP KRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGEC VPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFC QPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLL IGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYY GDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGCDKCVWDLTDALRL AALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRK IQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVE QAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPF FTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLS DLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLST SADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDL VQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSE ANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESS SDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQSRL ITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVR NLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSM MFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLG SKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKI ERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGG VPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCA RDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNG LILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKY HEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSR ALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRR AYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISL DNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSK NPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVE DFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDA PSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLK GDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVG HKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATW KIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASITSASQTFS VTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLV HGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITV IRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPS | 1837 |
3D view using mol* of 1042 (AA BP:) | ||||||||||
PDB file (1044)CIF file (1044) >>>1044.cif | HMGA1 | chr6 | 34211295 | + | LAMA4 | chr6 | 112537670 | - | MLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTP KRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGEC VPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFC QPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLL IGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYY GDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGMDCPTISCDKCVWD LTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERE NQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKESMDTIN HASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEE IRSRQPFFTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLD DYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEV VNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNL SNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYEN IVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQAREL QAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQ RLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVN SARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVA SKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETAD QFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPA YFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPED TVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDP STSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEV RTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKA QINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAP PEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPE DSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLFYYASGS DVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVD KSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRV DRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKK GGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSR QEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRL VYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESL PPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASIT SASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPR SSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDG RWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLL | 1844 |
3D view using mol* of 1044 (AA BP:) | ||||||||||
PDB file (1047) >>>1047.pdbFusion protein BP residue: 111 CIF file (1047) >>>1047.cif | HMGA1 | chr6 | 34211295 | + | LAMA4 | chr6 | 112537670 | - | MGLRAGAISGAGAAPRRPSQPSLFHLLLREGKMSESSSKSSQPLASKQEK DGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTP GRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRN TTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVR CICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDE VTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGE CLEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHV NEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKEN QASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHEL SPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQ RLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAA RQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGI YAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSS DMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQ IIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTR LSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTN WSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNV SASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSL SLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLG TKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFI KKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATL NNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVR DITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFY DFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSM DNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKD FNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGF NFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGL SHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPIS AQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFL LHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSN SPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYV SDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRE RSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSI YSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLD ESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNN GIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPK 
PIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFSKAALV | 1862 |
3D view using mol* of 1047 (AA BP:111) | ||||||||||
PDB file (1048)CIF file (1048) >>>1048.cif | HMGA1 | chr6 | 34211295 | + | LAMA4 | chr6 | 112537670 | - | MGLRAGAISGAGAAPRRPSQPSLFHLLLREGKMSESSSKSSQPLASKQEK DGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTP GRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRN TTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVR CICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDE VTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGE CLEEGFEPPTGMDCPTISCDKCVWDLTDALRLAALSIEEGKSGVLSVSSG AAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVE ELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLY YGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELL SQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAED MNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDI IKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANEL SRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDA VSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALAR KSALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAP MANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQ KRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDD LKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNL VYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSS TAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVG CLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDG SGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRN GYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVD RRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKG FQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFF DGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVD KQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFY FGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPI ESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRN SHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSS HGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWH DVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVK NVQINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTE GGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQ VIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHV VGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVS | 1869 
|
3D view using mol* of 1048 (AA BP:) | ||||||||||
PDB file (1054)CIF file (1054) >>>1054.cif | HMGA1 | chr6 | 34211295 | + | LAMA4 | chr6 | 112537670 | - | MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQA GGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPR KQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLK CNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDG YIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNC ERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNT TGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGC DKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKT KLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKE SMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLA QKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPV VLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERV REQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQ VKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDA SNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLL NQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAE RGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSA YNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIA QTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPE LTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKP VSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLL DLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKH IYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVT RFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLE DTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDI YIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGV GYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLF YYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTR YELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISN AYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPK ASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYG GTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLF LAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLR VLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQL NGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIA FEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPK QSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGG 
VPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFSKAALVSGAVSINSCPAA | 1900 |
3D view using mol* of 1054 (AA BP:) | ||||||||||
PDB file (1056) | HMGA1 | chr6 | 34211295 | + | LAMA4 | chr6 | 112537670 | - | MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQA GGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPR KQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLK CNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDG YIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNC ERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNT TGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGM DCPTISCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINA TIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRK GQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEI SEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRLHNE TRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDH EKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEID GAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGL VQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHK DESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAV KQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNL QHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQ RIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMK PPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVE IPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEF SGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVI SLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRR GKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFS GGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKM KIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLE QTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTL QPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVI SSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYAN FTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKG KNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAI EHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEE NDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGR LVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSG CLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNI GLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDF STSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHR 
EPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFSKAALVSGAVS | 1907 |
PDB file (1058) | HMGA1 | chr6 | 34211295 | + | LAMA4 | chr6 | 112537670 | - | MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQA GGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPR KQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPG RKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNT TGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRC ICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEV TGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGEC LEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVN EINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQ ASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELS PKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQR LHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAAR QRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIY AEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSD MNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQI IYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRL SDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNW SQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVS ASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLS LYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGT KDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIK KGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLN NDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRD ITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYD FGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMD NEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDF NLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFN FRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLS HFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISA QYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLL HKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNS PRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVS DQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRER SSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIY SFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDE SFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNG IRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKP 
IDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFSKAALVS | 1911 |
PDB file (1062) >>>1062.pdbFusion protein BP residue: 163 CIF file (1062) >>>1062.cif | HMGA1 | chr6 | 34211295 | + | LAMA4 | chr6 | 112537670 | - | MGGEHAAAAVSERLCSLPVSDPHLLPAAAAAEPGRSSAPSTAAPGNPERA PQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRG RPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTT TPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQ RNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGA VRCICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDC DEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVT GECLEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHR HVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEK ENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEH ELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAES WQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRAT AARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNAS GIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLH SSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGID TQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALK TRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNL TNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPAS NVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFT SLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYN LGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEK FIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELA TLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAV VRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHV FYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVK SMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQK KDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEG GFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYND GLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSP ISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPL FLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHL SNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIF YVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFI RERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQIN SIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVV LDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKV 
NNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLN PKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFSKAA | 1914 |
3D view using Mol* of 1062 (AA BP: 163) |
PDB file (1064) | HMGA1 | chr6 | 34211295 | + | LAMA4 | chr6 | 112537670 | - | MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQA GGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPR KQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPG RKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNT TGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRC ICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEV TGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGEC LEEGFEPPTGMDCPTISCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGA AAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEE LVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYY GEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLS QAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDM NRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDII KNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELS RKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAV SGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARK SALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPM ANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQK RPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDL KAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLV YVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSST AEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGC LELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGS GYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNG YLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDR RHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGF QFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFD GFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDK QYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYF GGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIE SSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNS HCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSH GMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHD VIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKN VQINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEG GYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQV IVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVV 
GPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSF | 1918 |
PDB file (1065) | HMGA1 | chr6 | 34211295 | + | LAMA4 | chr6 | 112537670 | - | MGGEHAAAAVSERLCSLPVSDPHLLPAAAAAEPGRSSAPSTAAPGNPERA PQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRG RPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTT TPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQ RNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGA VRCICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDC DEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVT GECLEEGFEPPTGMDCPTISCDKCVWDLTDALRLAALSIEEGKSGVLSVS SGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSD VEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKM LYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYE LLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDA EDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELD DIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEAN ELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIY DAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGAL ARKSALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQAT APMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTV EQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSM DDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKND NLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSL SSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGF VGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFF DGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEM RNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILV VDRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCM KGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKIS FFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQS VDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKK FYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYEC PIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTP RNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTR SSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGL WHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKA VKNVQINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFS TEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKN GQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVN 
HVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHP | 1921 |
pLDDT score distribution |
pLDDT score distribution of the AlphaFold2-predicted wild-type structures of the two partner proteins * AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. |
HMGA1_pLDDT.png |
LAMA4_pLDDT.png |
pLDDT score distribution of the predicted fusion protein structures from AlphaFold2 * AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. |
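As a rough illustration of how a per-residue pLDDT distribution can be tabulated, the sketch below reads the B-factor field of a PDB file, which is where AlphaFold models store pLDDT. The band cutoffs (90, 70, 50) follow the commonly quoted AlphaFold confidence bands and are an assumption here, not something stated on this page. The function names and the `demo_line` helper are hypothetical.

```python
from collections import Counter

# Assumed confidence bands (cutoffs 90 / 70 / 50); not taken from this page.
BANDS = [(90.0, "very high"), (70.0, "confident"), (50.0, "low"), (0.0, "very low")]


def plddt_per_residue(pdb_text):
    """Return {residue_number: pLDDT} read from the B-factor field of CA atoms."""
    scores = {}
    for line in pdb_text.splitlines():
        # Fixed-column PDB layout: atom name in columns 13-16,
        # residue number in columns 23-26, B-factor in columns 61-66.
        if line.startswith("ATOM") and line[12:16].strip() == "CA":
            scores[int(line[22:26])] = float(line[60:66])
    return scores


def band(score):
    """Map a pLDDT value to its (assumed) confidence band label."""
    for cutoff, label in BANDS:
        if score >= cutoff:
            return label
    return "very low"


def band_counts(scores):
    """Count residues per confidence band."""
    return Counter(band(s) for s in scores.values())


def demo_line(serial, name, resname, resseq, bfactor):
    """Hypothetical helper: build one fixed-column ATOM record for testing."""
    return "ATOM  %5d %-4s %3s A%4d    %8.3f%8.3f%8.3f%6.2f%6.2f" % (
        serial, name, resname, resseq, 0.0, 0.0, 0.0, 1.0, bfactor)


if __name__ == "__main__":
    text = "\n".join([
        demo_line(1, "N", "MET", 1, 95.0),
        demo_line(2, "CA", "MET", 1, 95.0),
        demo_line(3, "CA", "GLY", 2, 72.5),
    ])
    print(band_counts(plddt_per_residue(text)))
```

In practice the text passed to `plddt_per_residue` would be the contents of one of the downloaded fusion-protein PDB files.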
Ramachandran Plot of Fusion Protein Structure |
Ramachandran plot of the backbone torsion angles, phi (φ) and psi (ψ), of the amino acid residues in this fusion protein. |
Fusion AA seq ID in FusionPDB and their Ramachandran plots |
HMGA1_LAMA4_1047.png |
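For readers who want to reproduce the angles behind such a plot, here is a minimal sketch of a signed dihedral calculation from four atom coordinates. It uses the standard definitions phi = C(i-1), N(i), CA(i), C(i) and psi = N(i), CA(i), C(i), N(i+1). The function names and the dictionary layout of the `prev`, `cur` and `nxt` arguments are hypothetical, and this is not necessarily how the plots on this page were generated.

```python
import math


def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dihedral(p0, p1, p2, p3):
    """Signed torsion angle in degrees for four points (cis = 0, trans = 180)."""
    b1 = _sub(p1, p0)
    b2 = _sub(p2, p1)
    b3 = _sub(p3, p2)
    n1 = _cross(b1, b2)          # normal of the plane through p0, p1, p2
    n2 = _cross(b2, b3)          # normal of the plane through p1, p2, p3
    b2_len = math.sqrt(_dot(b2, b2))
    x = _dot(n1, n2)
    y = _dot(_cross(n1, n2), b2) / b2_len
    return math.degrees(math.atan2(y, x))


def phi_psi(prev, cur, nxt):
    """phi and psi for residue `cur`.

    Each argument is a dict with keys "N", "CA", "C" holding (x, y, z) tuples
    for the previous, current and next residue (hypothetical layout).
    """
    phi = dihedral(prev["C"], cur["N"], cur["CA"], cur["C"])
    psi = dihedral(cur["N"], cur["CA"], cur["C"], nxt["N"])
    return phi, psi
```

Terminal residues lack one of the two neighbours, so a caller would need to skip phi for the first residue and psi for the last.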
Potential Active Site Information |
The potential binding sites of these fusion proteins were identified using SiteMap, a module of the Schrödinger suite. |
Fusion AA seq ID in FusionPDB | Site score | Size | D score | Volume | Exposure | Enclosure | Contact | Phobic | Philic | Balance | Don/Acc | Residues |
1042 | 1.1 | 353 | 1.202 | 962.115 | 0.555 | 0.673 | 0.883 | 1.677 | 0.436 | 3.844 | 3.209 | Chain A: 572,575,576,578,579,580,582,583,585,586,587,589,604,605,607,608,611,640,641,643,644,645,647,648,650,651,654,655,657,658,659,661,662,663,664,665,668,729,732,733,735,736,737,738,740,741,742,744,745,747,748,749,751,752,754,755,758,759,762,763 |
1044 | 1.123 | 296 | 1.206 | 707.952 | 0.472 | 0.738 | 0.946 | 1.962 | 0.541 | 3.627 | 1.797 | Chain A: 651,652,655,656,658,659,660,662,663,665,666,667,748,751,752,754,755,756,758,759,760,762,763,766,816,817,819,820,821,823,824,825,826,827,828,829,832,835,836,839,840,843 |
1047 | 1.057 | 297 | 1.081 | 971.033 | 0.489 | 0.764 | 0.979 | 0.79 | 0.975 | 0.81 | 0.993 | Chain A: 179,181,182,195,196,198,199,200,210,211,213,214,215,221,222,223,224,225,250,251,1512,1513,1514,1515,1516,1517,1518,1542,1544,1635,1636,1637,1638,1639,1640,1642,1647,1648,1650,1651,1652,1654,1674,1675,1698,1699,1700,1701,1702,1805,1806,1807,1845,1846,1847,1848,1849,1850,1856,1857 |
1048 | 1.053 | 160 | 1.124 | 636.951 | 0.618 | 0.671 | 0.895 | 0.926 | 0.671 | 1.379 | 1.858 | Chain A: 395,399,402,403,405,406,409,410,413,414,416,417,499,502,503,504,506,507,509,510,513,514,517,811,812,814,815,816,817,818,819,820 |
1054 | 1.142 | 183 | 1.224 | 422.576 | 0.423 | 0.767 | 1.09 | 2.42 | 0.541 | 4.473 | 1.622 | Chain A: 621,624,625,628,632,739,742,743,745,746,749,750,752,753,756,771,774,775,777,778,781,782,785,786,789 |
1056 | 1.076 | 240 | 1.173 | 594.419 | 0.582 | 0.651 | 0.873 | 1.682 | 0.483 | 3.486 | 0.982 | Chain A: 444,447,448,451,452,454,455,457,458,459,461,462,464,465,468,469,471,472,473,475,476,479,480,520,523,524,527,528,530,531,533,534,535,537,538,540,541,544,545,547,548,549,551,552,555,559 |
1058 | 1.106 | 426 | 1.203 | 883.911 | 0.553 | 0.691 | 0.894 | 1.877 | 0.464 | 4.048 | 2.32 | Chain A: 607,610,611,614,615,617,618,619,621,622,625,628,629,631,632,633,635,636,638,735,738,739,742,746,749,750,753,754,756,757,758,760,761,763,764,767,768,770,771,773,774,776,778,779,782,783,785,786,789,790,792,793,795,796,797,799,800,801,803,804,806,807,808,810 |
1062 | 1.082 | 333 | 1.145 | 801 | 0.543 | 0.722 | 0.94 | 0.964 | 0.699 | 1.378 | 0.878 | Chain A: 102,103,104,105,106,568,572,645,649,652,653,655,656,659,660,662,663,664,666,724,725,727,728,731,732,734,735,736,738,739,742,745,746,809,810,812,813,814,816,817,818,821,822,824,825,826,828,829,831,832,833 |
1064 | 1.019 | 355 | 1.043 | 1165.514 | 0.612 | 0.713 | 0.853 | 0.63 | 0.995 | 0.633 | 0.615 | Chain A: 915,918,919,922,923,925,926,927,928,944,945,946,948,949,953,955,957,959,1028,1029,1030,1032,1041,1042,1043,1054,1056,1104,1105,1107,1108,1109,1110,1132,1146,1148,1150,1152,1273,1274,1275,1278,1292,1294,1310,1312,1313,1314,1315,1316,1317,1318,1319,1320,1323,1324,1325,1326,1328,1329,1330,1331,1357,1358,1359,1406,1407,1465,1466,1467,1468,1495,1497 |
1065 | 1.097 | 207 | 1.171 | 613.627 | 0.547 | 0.722 | 0.886 | 1.29 | 0.622 | 2.073 | 0.606 | Chain A: 134,135,136,137,416,419,420,422,423,425,426,427,429,430,433,434,437,597,600,601,604,605,607,608,611,826,828,829,830,832,833,836,837,839,840 |
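One way to use the Residues column is to check which fusion partner contributes the residues lining a predicted site. The sketch below parses a Residues cell into integers, tolerating stray spaces inside numbers, and splits the list around a breakpoint residue. The entry lists AA BP 163 for sequence 1062. Treating residues numbered at or below the breakpoint as HMGA1-derived is an assumption about the numbering convention, and both function names are hypothetical.

```python
import re


def parse_site_residues(cell):
    """Turn a Residues cell such as 'Chain A: 102,103,6 53' into sorted ints."""
    body = cell.split(":", 1)[-1]       # drop the 'Chain A' prefix if present
    body = re.sub(r"\s+", "", body)     # remove spaces, including ones inside numbers
    return sorted(int(tok) for tok in body.split(",") if tok)


def split_by_breakpoint(residues, bp_residue):
    """Split residues into (head-gene side, tail-gene side).

    Assumption: residue numbers <= bp_residue belong to the head gene.
    """
    head = [r for r in residues if r <= bp_residue]
    tail = [r for r in residues if r > bp_residue]
    return head, tail


if __name__ == "__main__":
    # First few residues of the site reported for sequence 1062.
    cell = "Chain A: 102,103,104,105,106,568,572,645,649,652,6 53"
    head, tail = split_by_breakpoint(parse_site_residues(cell), 163)
    print("head-side residues:", head)
    print("tail-side residues:", tail)
```

Under that assumption, a site whose list contains residues on both sides of the breakpoint would be formed jointly by the two partners rather than by LAMA4 alone.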
Potentially Interacting Small Molecules through Virtual Screening |
Molecules from the FDA-approved small-molecule library were subjected to virtual screening using Glide. |
Fusion AA seq ID in FusionPDB | ZINC ID | DrugBank ID | Drug name | Docking score | Glide gscore |
Drug information from DrugBank of the top 20 interacting small molecules. |
ZINC ID | DrugBank ID | Drug name | Drug type | SMILES | Drug group |
Biochemical Features of Small Molecules |
ADME (Absorption, Distribution, Metabolism, and Excretion) of drugs using QikProp(v3.9) |
ZINC ID | mol_MW | dipole | SASA | FOSA | FISA | PISA | WPSA | volume | donorHB | accptHB | IP | Human Oral Absorption | Percent Human Oral Absorption | Rule Of Five | Rule Of Three |
Drug Toxicity Information |
Toxicity information of individual drugs using eToxPred |
ZINC ID | SMILES | Surface Accessibility | Toxicity |
Fusion Protein-Protein Interaction |
Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in |
Validated protein-protein interactors of each wild-type fusion partner protein (BIOGRID-3.4.160) |
Gene | PPI interactors |
HMGA1 | EWSR1, SP1, CEBPB, ELF1, NFYA, POU3F1, ATF2, JUN, IRF1, BANF1, CHD1, PRMT6, NPM1, PARP1, PPARG, PRKCA, RARA, CREBBP, POU2F1, POU2F2, KAT2B, NFKB1, TP53, RN7SK, Stag2, ORC6, ORC2, HHV8GK18_gp81, CBX7, HDAC2, CDK2, TERF2, FN1, HSP90AA1, CSNK2A1, PAN2, CDK1, NCL, PSIP1, EIF2S2, RCC1, HNRNPD, PPP1R8, HNRNPAB, HNRNPDL, DNAJC8, HNRNPA1, HMGB1, HMGB3, HMGB2, SYNCRIP, CWC27, METAP2, SSB, PA2G4, OLA1, APEX1, ANXA2, GTF2F2, XRCC6, HNRNPL, HSPA1A, PTBP1, PCBP1, ANXA1, PCBP2, HTT, UPF2, UBE2I, NCOR2, CUL7, OBSL1, APP, SRPK2, AP4M1, EMC2, RACGAP1, ZSCAN5A, Csk, Fbl, Gspt1, Srp72, Kifc1, CREB1, NFATC1, FOXA1, FOXE1, HEMGN, TRIM29, PCGF1, DPPA4, NANOG, POU5F1, C4orf27, EGFR, CDC14B, G3BP1, EFTUD2, NKX2-1, RNF123, HIF1A, AGR2, EZH2, DCPS, GPC1, REST, PRKDC, TYK2, CDC25A, OXT, CSF3R, KCNQ4, CYP11B2, CDK5, EPHB2, ITGB7, MYC, HIC1, HIST1H3A, ATG16L1, RBX1, PRDM16, AGRN, VRK1, HIST1H4A, CMTR1, FANCD2, N, HCVgp1, ZC3H18, CAMK2A, FYN, MAP2K1, PEBP1, RALBP1, RPS6KA3, ARNTL, YKT6, CD72, TRIP10, HIST1H2AH, NSD1, AKAP9, RBFOX2, ARR3, ECT2L, ABCC8, CAMSAP1, COL13A1, ELP2, FBXO38, RNF214, PANX2, STRN4, KCND1, ROBO1, HIST2H3PS2, HIST2H2BC, ARHGEF10, DSP, HIST1H1B, HIST1H1D, SOX6, RGS6, LOC102724334, HIST2H2BE, MST1L, UHRF1BP1, ADCK1, HIST2H2AB, CFAP46, CNBD1, SEPT1, TTN, JPH2, ATXN3L, UNC79, PCLO, ACACB, NOP14, HIST1H2AB, RGS3, GTF2H3, HIST3H3, MURC, HIST2H2BF, PHF3, INPP5D, HIST1H2BH, SPEN, TXLNA, CIT, ANLN, CHMP4B, ECT2, KIF14, KIF20A, PRC1, C12orf65, C1QBP, GRSF1, ICT1, MRPL11, TSFM, ZNF263, MAFB, BRD4, FBP1, NEDD4, HMGB1P1, FKBP3, RBM8A, PARK7, TRIM37, CBX3, CENPA, HIST1H2BG, LMNB1, ZNF330, WDR5, NAA40, H2AFY, UBN2, HIST1H2AG, VRK3, WDR89, H2AFX, HIRA, PARP2, CABIN1, XPC, CD274, BTF3, SLFN11, JMJD6, Ube2i, Klc4, TAX1BP1, PER2, |
Protein-protein interactors based on sequence similarity (STRING) |
Gene | STRING network |
HMGA1 | |
LAMA4 |
- Retained interactions in fusion protein (protein functional feature from UniProt). |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with |
- Lost interactions due to fusion (protein functional feature from UniProt). |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with |
Related Drugs to HMGA1-LAMA4 |
Drugs used for patients positive for this fusion. (Manual curation of PubMed, 04-30-2022 + MyCancerGenome) |
Hgene | Tgene | Drug | Source | PMID |
Related Diseases to HMGA1-LAMA4 |
Diseases that have this fusion gene. (Manual curation of PubMed, 04-30-2022 + MyCancerGenome) |
Hgene | Tgene | Disease | Source | PMID |
Diseases associated with fusion partners. (DisGeNet 4.0) |
Partner | Gene | Disease ID | Disease name | # pubmeds | Source |
Hgene | HMGA1 | C0011860 | Diabetes Mellitus, Non-Insulin-Dependent | 1 | CTD_human |
Hgene | HMGA1 | C0020456 | Hyperglycemia | 1 | CTD_human |
Hgene | HMGA1 | C0021655 | Insulin Resistance | 1 | CTD_human |
Hgene | HMGA1 | C0023269 | leiomyosarcoma | 1 | CTD_human |
Hgene | HMGA1 | C0036341 | Schizophrenia | 1 | PSYGENET |
Hgene | HMGA1 | C0042138 | Uterine Neoplasms | 1 | CTD_human |
Hgene | HMGA1 | C0153567 | Uterine Cancer | 1 | CTD_human |
Hgene | HMGA1 | C0205815 | Leiomyosarcoma, Epithelioid | 1 | CTD_human |
Hgene | HMGA1 | C0205816 | Leiomyosarcoma, Myxoid | 1 | CTD_human |
Hgene | HMGA1 | C0524620 | Metabolic Syndrome X | 1 | CTD_human |
Hgene | HMGA1 | C0920563 | Insulin Sensitivity | 1 | CTD_human |
Hgene | HMGA1 | C1855520 | Hyperglycemia, Postprandial | 1 | CTD_human |
Tgene | LAMA4 | C0000786 | Spontaneous abortion | 1 | CTD_human |
Tgene | LAMA4 | C0000822 | Abortion, Tubal | 1 | CTD_human |
Tgene | LAMA4 | C0340427 | Familial dilated cardiomyopathy | 1 | ORPHANET |
Tgene | LAMA4 | C3808935 | CARDIOMYOPATHY, DILATED, 1JJ | 1 | CTD_human;UNIPROT |
Tgene | LAMA4 | C3830362 | Early Pregnancy Loss | 1 | CTD_human |
Tgene | LAMA4 | C4552766 | Miscarriage | 1 | CTD_human |