UTHEALTH HOME    ABOUT SBMI    A-Z    WEBMAIL    INSIDE THE UNIVERSITY
FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Terms of Use

Center for Computational Systems Medicine level3
leaf

Fusion Gene Summary

leaf

Fusion Gene Sample Information

leaf

Fusion ORF Analysis

leaf

Fusion Amino Acid Sequences

leaf

Fusion Protein Functional Features

leaf

Fusion Protein Structure

leaf

pLDDT scores

leaf

Ramachandran Plot of Fusion Protein Structure

leaf

Potential Active Site Information

leaf

Potentially Interacting Small Molecules through Virtual Screening

leaf

Biochemical Features of Small Molecules with ADME

leaf

Drug Toxicity Information

leaf

Fusion Protein-Protein Interaction

leaf

Related drugs with this fusion protein

leaf

Related disease with this fusion protein

Fusion Protein:HMGA1-LAMA4

Fusion Protein Summary

check button Fusion gene summary
Fusion partner gene informationFusion gene name: HMGA1-LAMA4
FusionPDB ID: 36806
FusionGDB2.0 ID: 36806
HgeneTgene
Gene symbol

HMGA1

LAMA4

Gene ID

3159

3910

Gene namehigh mobility group AT-hook 1laminin subunit alpha 4
SynonymsHMG-R|HMGA1A|HMGIYCMD1JJ|LAMA3|LAMA4*-1
Cytomap

6p21.31

6q21

Type of geneprotein-codingprotein-coding
Descriptionhigh mobility group protein HMG-I/HMG-Yhigh mobility group protein A1high mobility group protein Rhigh-mobility group (nonhistone chromosomal) protein isoforms I and Ynonhistone chromosomal high-mobility group protein HMG-I/HMG-Ylaminin subunit alpha-4laminin alpha 4 chainlaminin, alpha 4
Modification date2020031520200313
UniProtAcc

P17096

Q16363

Ensembl transtripts involved in fusion geneENST idsENST00000478214, ENST00000311487, 
ENST00000347617, ENST00000374116, 
ENST00000401473, ENST00000447654, 
ENST00000395004, 
ENST00000230538, 
ENST00000389463, ENST00000424408, 
ENST00000522006, ENST00000431543, 
ENST00000524032, ENST00000368638, 
ENST00000453937, 
Fusion gene scores for assessment (based on all fusion genes of FusionGDB 2.0)* DoF score13 X 9 X 5=5859 X 7 X 7=441
# samples 129
** MAII scorelog2(12/585*10)=-2.28540221886225
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(9/441*10)=-2.29278174922785
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context (manual curation of fusion genes in FusionPDB)

PubMed: HMGA1 [Title/Abstract] AND LAMA4 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint (based on all fusion genes of FusionGDB 2.0)
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneHMGA1

GO:0035986

senescence-associated heterochromatin focus assembly

16901784

HgeneHMGA1

GO:0090402

oncogene-induced cell senescence

16901784


check buttonFusion gene breakpoints across HMGA1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.

check buttonFusion gene breakpoints across LAMA4 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.


Top

Fusion Gene Sample Information

check buttonFusion gene information from FusionGDB2.0.
check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerKB3..HMGA1chr6

34211295

+LAMA4chr6

112537670

-


Top

Fusion ORF Analysis


check buttonFusion information from ORFfinder translation from full-length transcript sequence from FusionPDB.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000447654HMGA1chr634211295+ENST00000230538LAMA4chr6112537670-671375927060351921
ENST00000447654HMGA1chr634211295+ENST00000522006LAMA4chr6112537670-659275927060141914
ENST00000447654HMGA1chr634211295+ENST00000389463LAMA4chr6112537670-657875927060141914
ENST00000447654HMGA1chr634211295+ENST00000424408LAMA4chr6112537670-615975927060141914
ENST00000401473HMGA1chr634211295+ENST00000230538LAMA4chr6112537670-64404863957621907
ENST00000401473HMGA1chr634211295+ENST00000522006LAMA4chr6112537670-63194863957411900
ENST00000401473HMGA1chr634211295+ENST00000389463LAMA4chr6112537670-63054863957411900
ENST00000401473HMGA1chr634211295+ENST00000424408LAMA4chr6112537670-58864863957411900
ENST00000311487HMGA1chr634211295+ENST00000230538LAMA4chr6112537670-64735193957951918
ENST00000311487HMGA1chr634211295+ENST00000522006LAMA4chr6112537670-63525193957741911
ENST00000311487HMGA1chr634211295+ENST00000389463LAMA4chr6112537670-63385193957741911
ENST00000311487HMGA1chr634211295+ENST00000424408LAMA4chr6112537670-59195193957741911
ENST00000347617HMGA1chr634211295+ENST00000230538LAMA4chr6112537670-63263723956481869
ENST00000347617HMGA1chr634211295+ENST00000522006LAMA4chr6112537670-62053723956271862
ENST00000347617HMGA1chr634211295+ENST00000389463LAMA4chr6112537670-61913723956271862
ENST00000347617HMGA1chr634211295+ENST00000424408LAMA4chr6112537670-57723723956271862
ENST00000374116HMGA1chr634211295+ENST00000230538LAMA4chr6112537670-640244819057241844
ENST00000374116HMGA1chr634211295+ENST00000522006LAMA4chr6112537670-628144819057031837
ENST00000374116HMGA1chr634211295+ENST00000389463LAMA4chr6112537670-626744819057031837
ENST00000374116HMGA1chr634211295+ENST00000424408LAMA4chr6112537670-584844819057031837

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score

Top

Fusion Amino Acid Sequences


check button For individual full-length fusion transcript sequence from FusionPDB, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>FusionGDB ID_FusionGDB isoform ID_FGname_Hgene_Hchr_Hbp_Henst_Tgene_Tchr_Tbp_Tenst_length(fusion AA) seq_BP

>36806_36806_1_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000311487_LAMA4_chr6_112537670_ENST00000230538_length(amino acids)=1918AA_BP=160
MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDG
TEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCN
GNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYG
NPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTG
MDCPTISCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEE
LVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDE
EADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLS
TSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASN
VYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDA
VKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQK
RPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYM
GLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVG
GVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRF
DIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEK
MKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFD
GFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQ
TQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKD
APSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLA
HGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFS
GCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQV
IVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRH

--------------------------------------------------------------

>36806_36806_2_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000311487_LAMA4_chr6_112537670_ENST00000389463_length(amino acids)=1911AA_BP=160
MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDG
TEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCN
GNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYG
NPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTG
CDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQ
ASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYE
LLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLT
TPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVN
YVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAA
ERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVS
ASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKND
NLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFK
LPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTP
ADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTD
IYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFN
FRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKK
FYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPV
ALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYM
FNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQ
LNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNG
IRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHP

--------------------------------------------------------------

>36806_36806_3_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000311487_LAMA4_chr6_112537670_ENST00000424408_length(amino acids)=1911AA_BP=160
MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDG
TEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCN
GNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYG
NPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTG
CDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQ
ASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYE
LLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLT
TPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVN
YVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAA
ERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVS
ASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKND
NLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFK
LPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTP
ADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTD
IYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFN
FRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKK
FYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPV
ALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYM
FNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQ
LNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNG
IRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHP

--------------------------------------------------------------

>36806_36806_4_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000311487_LAMA4_chr6_112537670_ENST00000522006_length(amino acids)=1911AA_BP=160
MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDG
TEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCN
GNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYG
NPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTG
CDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQ
ASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYE
LLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLT
TPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVN
YVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAA
ERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVS
ASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKND
NLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFK
LPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTP
ADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTD
IYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFN
FRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKK
FYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPV
ALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYM
FNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQ
LNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNG
IRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHP

--------------------------------------------------------------

>36806_36806_5_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000347617_LAMA4_chr6_112537670_ENST00000230538_length(amino acids)=1869AA_BP=111
MGLRAGAISGAGAAPRRPSQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGA
AKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPC
PLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERC
APGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGMDCPTISCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEI
NATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLY
YGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQ
ALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSH
DLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLN
QARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQ
NLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDD
LKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGK
VFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVP
CARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLED
TLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNL
LEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVD
KQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEK
VHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEH
LKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRV
LEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESF
NIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHV

--------------------------------------------------------------

>36806_36806_6_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000347617_LAMA4_chr6_112537670_ENST00000389463_length(amino acids)=1862AA_BP=111
MGLRAGAISGAGAAPRRPSQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGA
AKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPC
PLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERC
APGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLL
KTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHEL
SPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRD
AEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAI
DHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQA
KAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDS
SAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSL
SLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPS
LSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLA
FTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQI
NDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETL
GVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGL
SHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYE
CPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGA
KSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPP
TEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFE
IAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPK

--------------------------------------------------------------

>36806_36806_7_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000347617_LAMA4_chr6_112537670_ENST00000424408_length(amino acids)=1862AA_BP=111
MGLRAGAISGAGAAPRRPSQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGA
AKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPC
PLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERC
APGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLL
KTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHEL
SPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRD
AEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAI
DHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQA
KAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDS
SAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSL
SLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPS
LSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLA
FTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQI
NDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETL
GVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGL
SHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYE
CPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGA
KSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPP
TEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFE
IAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPK

--------------------------------------------------------------

>36806_36806_8_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000347617_LAMA4_chr6_112537670_ENST00000522006_length(amino acids)=1862AA_BP=111
MGLRAGAISGAGAAPRRPSQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGA
AKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPC
PLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERC
APGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLL
KTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHEL
SPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRD
AEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAI
DHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQA
KAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDS
SAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSL
SLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPS
LSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLA
FTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQI
NDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETL
GVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGL
SHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYE
CPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGA
KSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPP
TEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFE
IAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPK

--------------------------------------------------------------

>36806_36806_9_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000374116_LAMA4_chr6_112537670_ENST00000230538_length(amino acids)=1844AA_BP=86
MLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNA
GFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNE
NYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCD
SVTGECLEEGFEPPTGMDCPTISCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQI
NNAENTMKSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEE
IRSRQPFFTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQ
QERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLH
SSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVG
GALARKSALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLT
EVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETAD
QFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGD
DSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAV
VRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKM
ILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYF
NGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVD
KSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNL
SKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIF
YVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAP
GKAVKNVQINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGH
SVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLL

--------------------------------------------------------------

>36806_36806_10_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000374116_LAMA4_chr6_112537670_ENST00000389463_length(amino acids)=1837AA_BP=86
MLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNA
GFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNE
NYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCD
SVTGECLEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTM
KSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPF
FTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQ
MEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGL
VQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKS
ALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLL
DQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLG
SKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLD
PEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRR
GKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRR
HVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIA
SIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSK
NPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQ
NKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEE
NDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNV
QINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYL
NVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPS

--------------------------------------------------------------

>36806_36806_11_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000374116_LAMA4_chr6_112537670_ENST00000424408_length(amino acids)=1837AA_BP=86
MLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNA
GFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNE
NYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCD
SVTGECLEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTM
KSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPF
FTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQ
MEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGL
VQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKS
ALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLL
DQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLG
SKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLD
PEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRR
GKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRR
HVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIA
SIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSK
NPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQ
NKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEE
NDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNV
QINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYL
NVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPS

--------------------------------------------------------------

>36806_36806_12_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000374116_LAMA4_chr6_112537670_ENST00000522006_length(amino acids)=1837AA_BP=86
MLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNA
GFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNE
NYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCD
SVTGECLEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTM
KSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPF
FTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQ
MEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGL
VQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKS
ALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLL
DQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLG
SKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLD
PEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRR
GKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRR
HVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIA
SIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSK
NPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQ
NKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEE
NDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNV
QINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYL
NVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPS

--------------------------------------------------------------

>36806_36806_13_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000401473_LAMA4_chr6_112537670_ENST00000230538_length(amino acids)=1907AA_BP=149
MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDG
TEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSG
YCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLLIGSTCKK
CDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGMDCPTISCDKC
VWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRK
GQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQ
AESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRL
TLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSE
ANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGD
AQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQ
RIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVY
VYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTS
LNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNG
LILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIG
GAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTL
QPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFG
GSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKL
PERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVG
HKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGA
SITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDF
STSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFS

--------------------------------------------------------------

>36806_36806_14_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000401473_LAMA4_chr6_112537670_ENST00000389463_length(amino acids)=1900AA_BP=149
MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDG
TEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSG
YCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLLIGSTCKK
CDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGCDKCVWDLTDA
LRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKE
SMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRL
HNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDD
IIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEF
ALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQ
SRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIA
QTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTK
DVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFV
GCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVN
GSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEIL
QSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLF
YYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQ
YANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPR
NSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIR
SQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASITSASQ
TFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPK
QSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFSKAALVSG

--------------------------------------------------------------

>36806_36806_15_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000401473_LAMA4_chr6_112537670_ENST00000424408_length(amino acids)=1900AA_BP=149
MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDG
TEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSG
YCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLLIGSTCKK
CDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGCDKCVWDLTDA
LRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKE
SMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRL
HNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDD
IIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEF
ALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQ
SRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIA
QTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTK
DVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFV
GCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVN
GSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEIL
QSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLF
YYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQ
YANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPR
NSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIR
SQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASITSASQ
TFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPK
QSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFSKAALVSG

--------------------------------------------------------------

>36806_36806_16_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000401473_LAMA4_chr6_112537670_ENST00000522006_length(amino acids)=1900AA_BP=149
MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDG
TEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSG
YCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLLIGSTCKK
CDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGCDKCVWDLTDA
LRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKE
SMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRL
HNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDD
IIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEF
ALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQ
SRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIA
QTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTK
DVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFV
GCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVN
GSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEIL
QSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLF
YYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQ
YANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPR
NSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIR
SQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASITSASQ
TFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPK
QSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFSKAALVSG

--------------------------------------------------------------

>36806_36806_17_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000447654_LAMA4_chr6_112537670_ENST00000230538_length(amino acids)=1921AA_BP=163
MGGEHAAAAVSERLCSLPVSDPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQE
KDGTEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPC
DCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPG
YYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEP
PTGMDCPTISCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSD
VEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQREL
VDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNM
SLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALD
ASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRL
SDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTV
EQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKK
EYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVF
YVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQV
TRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMD
NEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKIS
FFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGK
IEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGK
SKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTL
FLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIY
SFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKN
GQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGC

--------------------------------------------------------------

>36806_36806_18_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000447654_LAMA4_chr6_112537670_ENST00000389463_length(amino acids)=1914AA_BP=163
MGGEHAAAAVSERLCSLPVSDPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQE
KDGTEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPC
DCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPG
YYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEP
PTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEK
ENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADE
AYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSAD
SLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYEN
IVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQL
QAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPAS
NVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAI
KNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPS
NFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEV
RTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIP
FTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEG
GFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQAS
EKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSW
DPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRL
VYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLS
NLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKV
NNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVID

--------------------------------------------------------------

>36806_36806_19_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000447654_LAMA4_chr6_112537670_ENST00000424408_length(amino acids)=1914AA_BP=163
MGGEHAAAAVSERLCSLPVSDPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQE
KDGTEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPC
DCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPG
YYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEP
PTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEK
ENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADE
AYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSAD
SLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYEN
IVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQL
QAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPAS
NVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAI
KNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPS
NFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEV
RTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIP
FTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEG
GFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQAS
EKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSW
DPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRL
VYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLS
NLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKV
NNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVID

--------------------------------------------------------------

>36806_36806_20_HMGA1-LAMA4_HMGA1_chr6_34211295_ENST00000447654_LAMA4_chr6_112537670_ENST00000522006_length(amino acids)=1914AA_BP=163
MGGEHAAAAVSERLCSLPVSDPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQE
KDGTEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGECVPC
DCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPG
YYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEP
PTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEK
ENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADE
AYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSAD
SLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYEN
IVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQL
QAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPAS
NVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAI
KNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPS
NFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEV
RTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIP
FTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEG
GFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQAS
EKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSW
DPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRL
VYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLS
NLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKV
NNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVID

--------------------------------------------------------------

Top

Fusion Protein Functional Features


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr6:/chr6:)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
HMGA1

P17096

LAMA4

Q16363

FUNCTION: HMG-I/Y bind preferentially to the minor groove of A+T rich regions in double-stranded DNA. It is suggested that these proteins could function in nucleosome phasing and in the 3'-end processing of mRNA transcripts. They are also involved in the transcription regulation of genes containing, or in close proximity to A+T-rich regions.FUNCTION: Binding to cells via a high affinity receptor, laminin is thought to mediate the attachment, migration and organization of cells into tissues during embryonic development by interacting with other extracellular matrix components.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page

* Minus value of BPloci means that the break pointn is located before the CDS.
- Retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note

- Not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note


Top

Fusion Protein Structures

check button PDB and CIF files of the predicted fusion proteins
* Here we show the 3D structure of the fusion proteins using Mol*. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Model confidence is shown from the pLDDT values per residue. pLDDT corresponds to the model’s prediction of its score on the local Distance Difference Test. It is a measure of local accuracy (from AlphfaFold website). To color code individual residues, we transformed individual PDB files into CIF format.
Fusion protein PDB link (fusion AA seq ID in FusionPDB)HgeneHchrHbpHstrandTgeneTchrTbpTstrandAA seqLen(AA seq)
PDB file (1042)CIF file (1042) >>>1042.cifHMGA1chr634211295+LAMA4chr6112537670-
MLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTP
KRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGEC
VPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFC
QPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLL
IGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYY
GDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGCDKCVWDLTDALRL
AALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRK
IQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVE
QAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPF
FTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLS
DLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEVVNMSLST
SADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDL
VQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSE
ANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESS
SDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQSRL
ITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVR
NLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSM
MFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETADQFILYLG
SKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKI
ERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGG
VPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCA
RDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNG
LILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKY
HEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSR
ALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRR
AYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISL
DNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSK
NPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEVE
DFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDA
PSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLK
GDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVG
HKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESLPPTEATW
KIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASITSASQTFS
VTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLV
HGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITV
IRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPS
1837
3D view using mol* of 1042 (AA BP:)
PDB file (1044)CIF file (1044) >>>1044.cifHMGA1chr634211295+LAMA4chr6112537670-
MLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTP
KRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLKCNAGFFHTLSGEC
VPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDGYIGDSIRGAPQFC
QPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNCERCAPGYYGNPLL
IGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYY
GDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGMDCPTISCDKCVWD
LTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERE
NQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKESMDTIN
HASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLAQKMLEE
IRSRQPFFTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPVVLEQLD
DYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERVREQMEV
VNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQVKLSNL
SNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYEN
IVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQAREL
QAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQ
RLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVN
SARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVA
SKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPELTETAD
QFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKPVSSWPA
YFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPED
TVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDP
STSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEV
RTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLEDTLKKA
QINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAP
PEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGVGYGCPE
DSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLFYYASGS
DVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELIVD
KSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRV
DRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKK
GGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSR
QEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLFLAHGRL
VYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLRVLEESL
PPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQLNGASIT
SASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPR
SSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDG
RWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLL
1844
3D view using mol* of 1044 (AA BP:)
PDB file (1047) >>>1047.pdbFusion protein BP residue: 111
CIF file (1047) >>>1047.cif
HMGA1chr634211295+LAMA4chr6112537670-
MGLRAGAISGAGAAPRRPSQPSLFHLLLREGKMSESSSKSSQPLASKQEK
DGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTP
GRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRN
TTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVR
CICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDE
VTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGE
CLEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHV
NEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKEN
QASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHEL
SPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQ
RLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAA
RQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGI
YAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSS
DMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQ
IIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTR
LSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTN
WSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNV
SASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSL
SLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLG
TKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFI
KKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATL
NNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVR
DITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFY
DFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSM
DNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKD
FNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGF
NFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGL
SHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPIS
AQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFL
LHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSN
SPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYV
SDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRE
RSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSI
YSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLD
ESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNN
GIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPK
PIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFSKAALV
1862
3D view using mol* of 1047 (AA BP:111)
PDB file (1048)CIF file (1048) >>>1048.cifHMGA1chr634211295+LAMA4chr6112537670-
MGLRAGAISGAGAAPRRPSQPSLFHLLLREGKMSESSSKSSQPLASKQEK
DGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTP
GRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRN
TTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVR
CICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDE
VTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGE
CLEEGFEPPTGMDCPTISCDKCVWDLTDALRLAALSIEEGKSGVLSVSSG
AAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVE
ELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLY
YGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELL
SQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAED
MNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDI
IKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANEL
SRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDA
VSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALAR
KSALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAP
MANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQ
KRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDD
LKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNL
VYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSS
TAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVG
CLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDG
SGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRN
GYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVD
RRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKG
FQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFF
DGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVD
KQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFY
FGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPI
ESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRN
SHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSS
HGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWH
DVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVK
NVQINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTE
GGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQ
VIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHV
VGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVS
1869
3D view using mol* of 1048 (AA BP:)
PDB file (1054)CIF file (1054) >>>1054.cifHMGA1chr634211295+LAMA4chr6112537670-
MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQA
GGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPR
KQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLK
CNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDG
YIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNC
ERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNT
TGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGC
DKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKT
KLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRKGQLVQKE
SMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKLVLA
QKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRLHNETRTLFPV
VLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDHEKQQERV
REQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQ
VKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDA
SNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLL
NQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQAAE
RGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSA
YNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIA
QTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMKPPVKRPE
LTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDSKP
VSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLL
DLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKH
IYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVT
RFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFSGGPVHLE
DTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDI
YIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLEQTETLGV
GYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPNGLLF
YYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTR
YELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISN
AYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPK
ASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAIEHAYQYG
GTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEENDFMTLF
LAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGRLVIDGLR
VLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSNLQL
NGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIA
FEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPK
QSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGG
VPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFSKAALVSGAVSINSCPAA
1900
3D view using mol* of 1054 (AA BP:)
PDB file (1056)HMGA1chr634211295+LAMA4chr6112537670-
MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQA
GGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPR
KQPPKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLK
CNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNTTGEHCEKCLDG
YIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNENYAGPNC
ERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNT
TGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGM
DCPTISCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVNEINA
TIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQASRK
GQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEI
SEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQRLHNE
TRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAARQRDH
EKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEID
GAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGL
VQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHK
DESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAV
KQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNL
QHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQ
RIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSLYMK
PPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVE
IPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEF
SGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVI
SLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRDITRR
GKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYDFGFS
GGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMDNEKM
KIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDFNLLE
QTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTL
QPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVI
SSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYAN
FTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKG
KNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNSPRAI
EHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVSDQEE
NDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRERSSGR
LVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSG
CLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNI
GLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDF
STSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHR
EPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFSKAALVSGAVS
1907
PDB file (1058)HMGA1chr634211295+LAMA4chr6112537670-
MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQA
GGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPR
KQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPG
RKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNT
TGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRC
ICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEV
TGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGEC
LEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHRHVN
EINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKENQ
ASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEHELS
PKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAESWQR
LHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRATAAR
QRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIY
AEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSD
MNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDTQI
IYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRL
SDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNW
SQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPASNVS
ASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLS
LYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGT
KDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIK
KGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELATLN
NDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAVVRD
ITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHVFYD
FGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVKSMD
NEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKDF
NLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFN
FRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLS
HFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISA
QYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPLFLL
HKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHLSNS
PRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIFYVS
DQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFIRER
SSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIY
SFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDE
SFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNG
IRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLNPKP
IDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFSKAALVS
1911
PDB file (1062) >>>1062.pdbFusion protein BP residue: 163
CIF file (1062) >>>1062.cif
HMGA1chr634211295+LAMA4chr6112537670-
MGGEHAAAAVSERLCSLPVSDPHLLPAAAAAEPGRSSAPSTAAPGNPERA
PQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRG
RPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTT
TPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQ
RNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGA
VRCICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDC
DEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVT
GECLEEGFEPPTGCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGAAAHR
HVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEK
ENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYYGEEH
ELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQAES
WQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDMNRAT
AARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNAS
GIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLH
SSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGID
TQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALK
TRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNL
TNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQKRPAS
NVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFT
SLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYN
LGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEK
FIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGCLELA
TLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAV
VRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNGYLHV
FYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDRRHVK
SMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQK
KDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEG
GFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYND
GLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYFGGSP
ISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIESSPL
FLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNSHCHL
SNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSHGMIF
YVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFI
RERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQIN
SIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVV
LDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKV
NNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVVGPLN
PKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSFSKAA
1914
3D view using mol* of 1062 (AA BP:163)
PDB file (1064)HMGA1chr634211295+LAMA4chr6112537670-
MGLRAGAISGAGAAPRRHPHLLPAAAAAEPGRSSAPSTAAPGNPERAPQA
GGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRGRPR
KQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPG
RKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQRNT
TGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRC
ICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEV
TGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGEC
LEEGFEPPTGMDCPTISCDKCVWDLTDALRLAALSIEEGKSGVLSVSSGA
AAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEE
LVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKMLYY
GEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLS
QAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDAEDM
NRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDII
KNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEANELS
RKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAV
SGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARK
SALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQATAPM
ANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQK
RPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDL
KAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLV
YVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSST
AEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGFVGC
LELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGS
GYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEMRNG
YLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVVDR
RHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGF
QFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFD
GFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDK
QYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKKFYF
GGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYECPIE
SSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTPRNS
HCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTRSSH
GMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHD
VIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKN
VQINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEG
GYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKNGQV
IVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVNHVV
GPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHPVSF
1918
PDB file (1065)HMGA1chr634211295+LAMA4chr6112537670-
MGGEHAAAAVSERLCSLPVSDPHLLPAAAAAEPGRSSAPSTAAPGNPERA
PQAGGRARASQPSLFHLLLREGKMSESSSKSSQPLASKQEKDGTEKRGRG
RPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTT
TPGRKPRGRPKKLKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQ
RNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGA
VRCICNENYAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDC
DEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVT
GECLEEGFEPPTGMDCPTISCDKCVWDLTDALRLAALSIEEGKSGVLSVS
SGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSD
VEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEINNKM
LYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYE
LLSQAESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNYVRDA
EDMNRATAARQRDHEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELD
DIIKNASGIYAEIDGAKSELQVKLSNLSNLSHDLVQEAIDHAQDLQQEAN
ELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIY
DAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGAL
ARKSALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTMEVQQAT
APMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTV
EQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSM
DDLKAFTSLSLYMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKND
NLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSL
SSTAEEKFIKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPTSLNLPGF
VGCLELATLNNDVISLYNFKHIYNMDPSTSVPCARDKLAFTQSRAASYFF
DGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGSMFFRLEM
RNGYLHVFYDFGFSGGPVHLEDTLKKAQINDAKYHEISIIYHNDKKMILV
VDRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCM
KGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKIS
FFDGFEGGFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQS
VDKQYNDGLSHFVISSVSPTRYELIVDKSRVGSKNPTKGKIEQTQASEKK
FYFGGSPISAQYANFTGCISNAYFTRVDRDVEVEDFQRYTEKVHTSLYEC
PIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVALKLPERNTP
RNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRTR
SSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGL
WHDVIFIRERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKA
VKNVQINSIYSFSGCLSNLQLNGASITSASQTFSVTPCFEGPMETGTYFS
TEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLVHGHSVNGEYLNVHMKN
GQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLDVDSEVN
HVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDGHP
1921


Top

pLDDT score distribution

check button pLDDT score distribution of the predicted wild-type structures of two partner proteins from AlphaFold2
* AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100.
HMGA1_pLDDT.png
all structure
all structure
LAMA4_pLDDT.png
all structure
all structure

check button pLDDT score distribution of the predicted fusion protein structures from AlphaFold2
* AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100.
HMGA1_LAMA4_1042_pLDDT_and_active_sites.png (AA BP:)
all structure
HMGA1_LAMA4_1042_violinplot.png (AA BP:)
all structure
HMGA1_LAMA4_1044_pLDDT_and_active_sites.png (AA BP:)
all structure
HMGA1_LAMA4_1044_violinplot.png (AA BP:)
all structure
HMGA1_LAMA4_1047_pLDDT.png (AA BP:111)
all structure
HMGA1_LAMA4_1047_pLDDT_and_active_sites.png (AA BP:111)
all structure
HMGA1_LAMA4_1047_violinplot.png (AA BP:111)
all structure
HMGA1_LAMA4_1048_pLDDT_and_active_sites.png (AA BP:)
all structure
HMGA1_LAMA4_1048_violinplot.png (AA BP:)
all structure
HMGA1_LAMA4_1062_pLDDT.png (AA BP:163)
all structure
HMGA1_LAMA4_1062_violinplot.png (AA BP:163)
all structure


Top

Ramachandran Plot of Fusion Protein Structure


check button Ramachandran plot of the torsional angles - phi (φ)and psi (ψ) - of the residues (amino acids) contained in this fusion protein peptide.
Fusion AA seq ID in FusionPDB and their Ramachandran plots
HMGA1_LAMA4_1047.png
all structure

Top

Potential Active Site Information


check button The potential binding sites of these fusion proteins were identified using SiteMap, a module of the Schrodinger suite.
Fusion AA seq ID in FusionPDBSite scoreSizeD scoreVolumeExposureEnclosureContactPhobicPhilicBalanceDon/AccResidues
10421.13531.202962.1150.5550.6730.8831.6770.4363.8443.209Chain A: 572,575,576,578,579,580,582,583,585,586,5
87,589,604,605,607,608,611,640,641,643,644,645,647
,648,650,651,654,655,657,658,659,661,662,663,664,6
65,668,729,732,733,735,736,737,738,740,741,742,744
,745,747,748,749,751,752,754,755,758,759,762,763
10421.13531.202962.1150.5550.6730.8831.6770.4363.8443.209Chain A: 572,575,576,578,579,580,582,583,585,586,5
87,589,604,605,607,608,611,640,641,643,644,645,647
,648,650,651,654,655,657,658,659,661,662,663,664,6
65,668,729,732,733,735,736,737,738,740,741,742,744
,745,747,748,749,751,752,754,755,758,759,762,763
10441.1232961.206707.9520.4720.7380.9461.9620.5413.6271.797Chain A: 651,652,655,656,658,659,660,662,663,665,6
66,667,748,751,752,754,755,756,758,759,760,762,763
,766,816,817,819,820,821,823,824,825,826,827,828,8
29,832,835,836,839,840,843
10471.0572971.081971.0330.4890.7640.9790.790.9750.810.993Chain A: 179,181,182,195,196,198,199,200,210,211,2
13,214,215,221,222,223,224,225,250,251,1512,1513,1
514,1515,1516,1517,1518,1542,1544,1635,1636,1637,1
638,1639,1640,1642,1647,1648,1650,1651,1652,1654,1
674,1675,1698,1699,1700,1701,1702,1805,1806,1807,1
845,1846,1847,1848,1849,1850,1856,1857
10481.0531601.124636.9510.6180.6710.8950.9260.6711.3791.858Chain A: 395,399,402,403,405,406,409,410,413,414,4
16,417,499,502,503,504,506,507,509,510,513,514,517
,811,812,814,815,816,817,818,819,820
10541.1421831.224422.5760.4230.7671.092.420.5414.4731.622Chain A: 621,624,625,628,632,739,742,743,745,746,7
49,750,752,753,756,771,774,775,777,778,781,782,785
,786,789
10561.0762401.173594.4190.5820.6510.8731.6820.4833.4860.982Chain A: 444,447,448,451,452,454,455,457,458,459,4
61,462,464,465,468,469,471,472,473,475,476,479,480
,520,523,524,527,528,530,531,533,534,535,537,538,5
40,541,544,545,547,548,549,551,552,555,559
10581.1064261.203883.9110.5530.6910.8941.8770.4644.0482.32Chain A: 607,610,611,614,615,617,618,619,621,622,6
25,628,629,631,632,633,635,636,638,735,738,739,742
,746,749,750,753,754,756,757,758,760,761,763,764,7
67,768,770,771,773,774,776,778,779,782,783,785,786
,789,790,792,793,795,796,797,799,800,801,803,804,8
06,807,808,810
10621.0823331.1458010.5430.7220.940.9640.6991.3780.878Chain A: 102,103,104,105,106,568,572,645,649,652,6
53,655,656,659,660,662,663,664,666,724,725,727,728
,731,732,734,735,736,738,739,742,745,746,809,810,8
12,813,814,816,817,818,821,822,824,825,826,828,829
,831,832,833
10641.0193551.0431165.5140.6120.7130.8530.630.9950.6330.615Chain A: 915,918,919,922,923,925,926,927,928,944,9
45,946,948,949,953,955,957,959,1028,1029,1030,1032
,1041,1042,1043,1054,1056,1104,1105,1107,1108,1109
,1110,1132,1146,1148,1150,1152,1273,1274,1275,1278
,1292,1294,1310,1312,1313,1314,1315,1316,1317,1318
,1319,1320,1323,1324,1325,1326,1328,1329,1330,1331
,1357,1358,1359,1406,1407,1465,1466,1467,1468,1495
,1497
10651.0972071.171613.6270.5470.7220.8861.290.6222.0730.606Chain A: 134,135,136,137,416,419,420,422,423,425,4
26,427,429,430,433,434,437,597,600,601,604,605,607
,608,611,826,828,829,830,832,833,836,837,839,840

Top

Potentially Interacting Small Molecules through Virtual Screening


check button The FDA-approved small molecule library molecules were subjected to virtual screening using the Glide.
Fusion AA seq ID in FusionPDBZINC IDDrugBank IDDrug nameDocking scoreGlide gscore

Top

check button Drug information from DrugBank of the top 20 interacting small molecules.
ZINC IDDrugBank IDDrug nameDrug typeSMILESDrug group

Top

Biochemical Features of Small Molecules


check button ADME (Absorption, Distribution, Metabolism, and Excretion) of drugs using QikProp(v3.9)
ZINC IDmol_MWdipoleSASAFOSAFISAPISAWPSAvolumedonorHBaccptHBIPHuman Oral AbsorptionPercent Human Oral AbsorptionRule Of FiveRule Of Three


Top

Drug Toxicity Information


check button Toxicity information of individual drugs using eToxPred
ZINC IDSmileSurface AccessibilityToxicity


Top

Fusion Protein-Protein Interaction


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type from validated records (BIOGRID-3.4.160)
GenePPI interactors
HMGA1EWSR1, SP1, CEBPB, ELF1, NFYA, POU3F1, ATF2, JUN, IRF1, BANF1, CHD1, PRMT6, NPM1, PARP1, PPARG, PRKCA, RARA, CREBBP, POU2F1, POU2F2, KAT2B, NFKB1, TP53, RN7SK, Stag2, ORC6, ORC2, HHV8GK18_gp81, CBX7, HDAC2, CDK2, TERF2, FN1, HSP90AA1, CSNK2A1, PAN2, CDK1, NCL, PSIP1, EIF2S2, RCC1, HNRNPD, PPP1R8, HNRNPAB, HNRNPDL, DNAJC8, HNRNPA1, HMGB1, HMGB3, HMGB2, SYNCRIP, CWC27, METAP2, SSB, PA2G4, OLA1, APEX1, ANXA2, GTF2F2, XRCC6, HNRNPL, HSPA1A, PTBP1, PCBP1, ANXA1, PCBP2, HTT, UPF2, UBE2I, NCOR2, CUL7, OBSL1, APP, SRPK2, AP4M1, EMC2, RACGAP1, ZSCAN5A, Csk, Fbl, Gspt1, Srp72, Kifc1, CREB1, NFATC1, FOXA1, FOXE1, HEMGN, TRIM29, PCGF1, DPPA4, NANOG, POU5F1, C4orf27, EGFR, CDC14B, G3BP1, EFTUD2, NKX2-1, RNF123, HIF1A, AGR2, EZH2, DCPS, GPC1, REST, PRKDC, TYK2, CDC25A, OXT, CSF3R, KCNQ4, CYP11B2, CDK5, EPHB2, ITGB7, MYC, HIC1, HIST1H3A, ATG16L1, RBX1, PRDM16, AGRN, VRK1, HIST1H4A, CMTR1, FANCD2, N, HCVgp1, ZC3H18, CAMK2A, FYN, MAP2K1, PEBP1, RALBP1, RPS6KA3, ARNTL, YKT6, CD72, TRIP10, HIST1H2AH, NSD1, AKAP9, RBFOX2, ARR3, ECT2L, ABCC8, CAMSAP1, COL13A1, ELP2, FBXO38, RNF214, PANX2, STRN4, KCND1, ROBO1, HIST2H3PS2, HIST2H2BC, ARHGEF10, DSP, HIST1H1B, HIST1H1D, SOX6, RGS6, LOC102724334, HIST2H2BE, MST1L, UHRF1BP1, ADCK1, HIST2H2AB, CFAP46, CNBD1, SEPT1, TTN, JPH2, ATXN3L, UNC79, PCLO, ACACB, NOP14, HIST1H2AB, RGS3, GTF2H3, HIST3H3, MURC, HIST2H2BF, PHF3, INPP5D, HIST1H2BH, SPEN, TXLNA, CIT, ANLN, CHMP4B, ECT2, KIF14, KIF20A, PRC1, C12orf65, C1QBP, GRSF1, ICT1, MRPL11, TSFM, ZNF263, MAFB, BRD4, FBP1, NEDD4, HMGB1P1, FKBP3, RBM8A, PARK7, TRIM37, CBX3, CENPA, HIST1H2BG, LMNB1, ZNF330, WDR5, NAA40, H2AFY, UBN2, HIST1H2AG, VRK3, WDR89, H2AFX, HIRA, PARP2, CABIN1, XPC, CD274, BTF3, SLFN11, JMJD6, Ube2i, Klc4, TAX1BP1, PER2,


check button Protein-protein interactors based on sequence similarity (STRING)
GeneSTRING network
HMGA1all structure
LAMA4all structure


check button - Retained interactions in fusion protein (protein functional feature from UniProt).
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost interactions due to fusion (protein functional feature from UniProt).
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs to HMGA1-LAMA4


check button Drugs used for this fusion-positive patient.
(Manual curation of PubMed, 04-30-2022 + MyCancerGenome)
HgeneTgeneDrugSourcePMID

Top

Related Diseases to HMGA1-LAMA4


check button Diseases that have this fusion gene.
(Manual curation of PubMed, 04-30-2022 + MyCancerGenome)
HgeneTgeneDiseaseSourcePMID

check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource
HgeneHMGA1C0011860Diabetes Mellitus, Non-Insulin-Dependent1CTD_human
HgeneHMGA1C0020456Hyperglycemia1CTD_human
HgeneHMGA1C0021655Insulin Resistance1CTD_human
HgeneHMGA1C0023269leiomyosarcoma1CTD_human
HgeneHMGA1C0036341Schizophrenia1PSYGENET
HgeneHMGA1C0042138Uterine Neoplasms1CTD_human
HgeneHMGA1C0153567Uterine Cancer1CTD_human
HgeneHMGA1C0205815Leiomyosarcoma, Epithelioid1CTD_human
HgeneHMGA1C0205816Leiomyosarcoma, Myxoid1CTD_human
HgeneHMGA1C0524620Metabolic Syndrome X1CTD_human
HgeneHMGA1C0920563Insulin Sensitivity1CTD_human
HgeneHMGA1C1855520Hyperglycemia, Postprandial1CTD_human
TgeneLAMA4C0000786Spontaneous abortion1CTD_human
TgeneLAMA4C0000822Abortion, Tubal1CTD_human
TgeneLAMA4C0340427Familial dilated cardiomyopathy1ORPHANET
TgeneLAMA4C3808935CARDIOMYOPATHY, DILATED, 1JJ1CTD_human;UNIPROT
TgeneLAMA4C3830362Early Pregnancy Loss1CTD_human
TgeneLAMA4C4552766Miscarriage1CTD_human