Fusion Gene Studies
in Kim Lab

FusionBase FusionGDB FusionGDB2 FusionPDB FusionNeoAntigen FusionAI FusionNW FGviewer Publication Contact
FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:ATG4B-SEC23IP (FusionGDB2 ID:HG23192TG11196)

Fusion Gene Summary for ATG4B-SEC23IP

check button Fusion gene summary
Fusion gene informationFusion gene name: ATG4B-SEC23IP
Fusion gene ID: hg23192tg11196
HgeneTgene
Gene symbol

ATG4B

SEC23IP

Gene ID

23192

11196

Gene nameautophagy related 4B cysteine peptidaseSEC23 interacting protein
SynonymsAPG4B|AUTL1MSTP053|P125|P125A
Cytomap('ATG4B')('SEC23IP')

2q37.3

10q26.11-q26.12

Type of geneprotein-codingprotein-coding
Descriptioncysteine protease ATG4BAPG4 autophagy 4 homolog BATG4 autophagy related 4 homolog BAUT-like 1 cysteine endopeptidaseautophagin-1autophagy-related cysteine endopeptidase 1autophagy-related protein 4 homolog BhAPG4BSEC23-interacting protein
Modification date2020032920200313
UniProtAcc..
Ensembl transtripts involved in fusion geneENST00000396411, ENST00000402096, 
ENST00000404914, ENST00000405546, 
ENST00000474739, ENST00000491867, 
Fusion gene scores* DoF score5 X 4 X 4=807 X 9 X 4=252
# samples 47
** MAII scorelog2(4/80*10)=-1
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(7/252*10)=-1.84799690655495
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: ATG4B [Title/Abstract] AND SEC23IP [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointATG4B(242606253)-SEC23IP(121678956), # samples:1
Anticipated loss of major functional domain due to fusion event.ATG4B-SEC23IP seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF.
ATG4B-SEC23IP seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF.
ATG4B-SEC23IP seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF.
ATG4B-SEC23IP seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneATG4B

GO:0006508

proteolysis

15169837|18387192

HgeneATG4B

GO:0006914

autophagy

18387192

HgeneATG4B

GO:0051697

protein delipidation

25327288


check buttonFusion gene breakpoints across ATG4B (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure
check buttonFusion gene breakpoints across SEC23IP (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4STADTCGA-BR-8058-01AATG4Bchr2

242606253

+SEC23IPchr10

121678956

+


Top

Fusion Gene ORF analysis for ATG4B-SEC23IP

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000396411ENST00000475542ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
5CDS-intronENST00000402096ENST00000475542ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
5CDS-intronENST00000404914ENST00000475542ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
5CDS-intronENST00000405546ENST00000475542ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
5CDS-intronENST00000474739ENST00000475542ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
In-frameENST00000396411ENST00000369075ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
In-frameENST00000396411ENST00000543134ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
In-frameENST00000402096ENST00000369075ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
In-frameENST00000402096ENST00000543134ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
In-frameENST00000404914ENST00000369075ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
In-frameENST00000404914ENST00000543134ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
In-frameENST00000405546ENST00000369075ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
In-frameENST00000405546ENST00000543134ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
In-frameENST00000474739ENST00000369075ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
In-frameENST00000474739ENST00000543134ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
intron-3CDSENST00000491867ENST00000369075ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
intron-3CDSENST00000491867ENST00000543134ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+
intron-intronENST00000491867ENST00000475542ATG4Bchr2

242606253

+SEC23IPchr10

121678956

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000405546ATG4Bchr2242606253+ENST00000369075SEC23IPchr10121678956+394612342382364708
ENST00000405546ATG4Bchr2242606253+ENST00000543134SEC23IPchr10121678956+253412342382364708
ENST00000402096ATG4Bchr2242606253+ENST00000369075SEC23IPchr10121678956+35478351031965620
ENST00000402096ATG4Bchr2242606253+ENST00000543134SEC23IPchr10121678956+21358351031965620
ENST00000404914ATG4Bchr2242606253+ENST00000369075SEC23IPchr10121678956+35478351031965620
ENST00000404914ATG4Bchr2242606253+ENST00000543134SEC23IPchr10121678956+21358351031965620
ENST00000396411ATG4Bchr2242606253+ENST00000369075SEC23IPchr10121678956+35298173071947546
ENST00000396411ATG4Bchr2242606253+ENST00000543134SEC23IPchr10121678956+21178173071947546
ENST00000474739ATG4Bchr2242606253+ENST00000369075SEC23IPchr10121678956+3557845591975638
ENST00000474739ATG4Bchr2242606253+ENST00000543134SEC23IPchr10121678956+2145845591975638

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000405546ENST00000369075ATG4Bchr2242606253+SEC23IPchr10121678956+0.0009553390.9990446
ENST00000405546ENST00000543134ATG4Bchr2242606253+SEC23IPchr10121678956+0.0047034620.99529654
ENST00000402096ENST00000369075ATG4Bchr2242606253+SEC23IPchr10121678956+0.000269990.99973005
ENST00000402096ENST00000543134ATG4Bchr2242606253+SEC23IPchr10121678956+0.0010464930.99895346
ENST00000404914ENST00000369075ATG4Bchr2242606253+SEC23IPchr10121678956+0.000269990.99973005
ENST00000404914ENST00000543134ATG4Bchr2242606253+SEC23IPchr10121678956+0.0010464930.99895346
ENST00000396411ENST00000369075ATG4Bchr2242606253+SEC23IPchr10121678956+0.0011023640.99889755
ENST00000396411ENST00000543134ATG4Bchr2242606253+SEC23IPchr10121678956+0.0030710280.996929
ENST00000474739ENST00000369075ATG4Bchr2242606253+SEC23IPchr10121678956+0.0005990430.999401
ENST00000474739ENST00000543134ATG4Bchr2242606253+SEC23IPchr10121678956+0.0021787490.9978212

Top

Fusion Genomic Features for ATG4B-SEC23IP


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
ATG4Bchr2242606253+SEC23IPchr10121678955+4.19E-081
ATG4Bchr2242606253+SEC23IPchr10121678955+4.19E-081

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.
genomic feature of top 1%

Top

Fusion Protein Features for ATG4B-SEC23IP


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr2:242606253/chr10:121678956)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
..
FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneSEC23IPchr2:242606253chr10:121678956ENST00000369075919644_7076241443.0DomainNote=SAM
TgeneSEC23IPchr2:242606253chr10:121678956ENST00000369075919779_9896241443.0DomainDDHD

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneSEC23IPchr2:242606253chr10:121678956ENST00000369075919142_2596241443.0Compositional biasNote=Pro-rich


Top

Fusion Gene Sequence for ATG4B-SEC23IP


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>7568_7568_1_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000396411_SEC23IP_chr10_121678956_ENST00000369075_length(transcript)=3529nt_BP=817nt
AGTCGGCGGCCGGACTGGGAAGATGGACGCAGCTACTCTGACCTACGACACTCTCCGGTTTGCTGAGTTTGAAGATTTTCCTGAGACCTC
AGAGCCCGTTTGGATACTGGGTAGAAAATACAGCATTTTCACAGAAAAGGACGAGATCTTGTCTGATGTGGCATCTAGACTTTGGTTTAC
ATACAGGAAAAACTTTCCAGCCATTGGAGGTTGTCGGGACATTTCACTCTCCAGTAAATGTGGCCCTTGGCGTTACTGAGTCCTAAGAGG
GGGGACAGGCCCCACCTCGGACACAGGCTGGGGCTGCATGCTGCGGTGTGGACAGATGATCTTTGCCCAAGCCCTGGTGTGCCGGCACCT
AGGCCGAGATTGGAGGTGGACACAAAGGAAGAGGCAGCCAGACAGCTACTTCAGCGTCCTCAACGCATTCATCGACAGGAAGGACAGTTA
CTACTCCATTCACCAGATAGCGCAAATGGGAGTTGGCGAAGGCAAGTCCATAGGCCAGTGGTACGGGCCCAACACTGTCGCCCAGGTCCT
GAAGAAGCTTGCTGTCTTCGATACGTGGAGCTCCTTGGCGGTCCACATTGCAATGGACAACACTGTTGTGATGGAGGAAATCAGAAGGTT
GTGCAGGACCAGCGTTCCCTGTGCAGGCGCCACTGCGTTTCCTGCAGATTCCGACCGGCACTGCAACGGATTCCCTGCCGGAGCTGAGGT
CACCAACAGGCCGTCGCCATGGAGACCCCTGGTACTTCTCATTCCCCTGCGCCTGGGGCTCACGGACATCAACGAGGCCTACGTGGAGAC
GCTGAAGATGCCTGAAGAGCCAAAGCTGACTTTGGATGAGTCGTATGACCTTGTTGTTGAAAATAAAGAAGTCCTAACTTTGCAAGAAAC
TCTGGAAGCACTTAGCCTCTCTGAATATTTTAGCACTTTTGAAAAGGAAAAGATTGATATGGAGTCCCTGCTTATGTGTACAGTTGATGA
CCTGAAGGAAATGGGGATACCCCTTGGACCCAGAAAGAAGATAGCTAACTTTGTAGAACATAAAGCAGCCAAACTGAAAAAAGCAGCGTC
AGAAAAGAAGGCAGTGGCGGCCACTTCTACAAAAGGACAAGAGCAAAGTGCCCAGAAGACTAAAGACATGGCTTCCCTCCCCTCAGAATC
CAATGAGCCAAAGAGGAAACTTCCAGTTGGTGCTTGCGTGTCTTCTGTGTGTGTGAATTATGAATCTTTTGAAGTTGGCGCCGGACAGGT
TTCTGTTGCTTACAACTCATTAGATTTTGAACCAGAGATATTCTTTGCCTTGGGGTCTCCAATTGCTATGTTTCTCACTATTCGAGGAGT
TGATAGGATAGATGAGAATTACAGCCTTCCTACCTGTAAAGGGTTCTTCAATATTTATCATCCGCTTGATCCAGTGGCATATAGATTAGA
ACCTATGATTGTTCCAGATTTGGACCTAAAAGCTGTTCTCATTCCACATCACAAAGGCAGAAAAAGACTTCATTTAGAATTGAAAGAGAG
TCTCTCTCGTATGGGATCTGATTTGAAGCAGGGTTTTATTAGCTCTCTCAAAAGTGCTTGGCAGACATTAAATGAGTTTGCCCGTGCTCA
TACGTCTTCAACCCAGTTGCAAGAAGAATTGGAGAAGGTGGCCAATCAGATCAAAGAAGAAGAAGAAAAGCAAGTAGTTGAAGCAGAAAA
GGTTGTTGAAAGTCCAGATTTTTCCAAGGATGAGGACTACTTAGGAAAGGTTGGAATGTTAAATGGAGGCCGCCGAATTGACTACGTTCT
CCAAGAAAAACCAATAGAGAGTTTTAATGAATACCTTTTCGCTCTTCAGAGTCACTTATGCTATTGGGAATCTGAAGATACTGCTCTGTT
ACTACTTAAAGAAATTTATCGAACAATGAACATTAGTCCAGAACAGCCCCAGCATTGATCAAACTTCAGTTTTACTGTACTTTCTTGTCT
GCACAGAAAGTCCCAGTACAACTTCCATTGCTGAGAAAATCCTCAGAGGACTTTCCCACTTCGCTCCTGTGATGGATGACAGAAGAGTGA
TTCATTAACAATTGCTCAGCCACAATTCTCGGATATAGGGATTCAAAAGACAGGACACAGAACTAACACAGTGAAAAAAATCAGTACCAC
ATTTGGACAGTATAGGTGAGAAAACATAATTATAAAAATGATGCCATGAAAAATTCCACAGATCAGTTTAGTTGTATAGTTGTCAAAGTT
ATATGTGATATCAATGAAGAAATATTTGTAGCATGTAAACGGTTATTTCTGTTTCTTAAAAAGTATTGTTAGTGGGCTATTAAACTTGGA
TTTTTCTTTTTATTAATGCAGTATGTTCTTTTTATTCAAGTATGAACTTGTTGAGAAACTATAGTAATATGATTTTTAAGAGATTTATGT
TCTACTTAAAATGTGAATTGTACTTCTGAGCTGCCTTAATGCAAGGTCATTTATATTTGTTAAGAGGAAATAATCAAGATCACTCATATC
CCAACTGAATCTGAGGTTTTATAAATCCCTCAAACGATTGCTGAGAGCCTGATTGTGGAAAGAAGTGAGATGCACCTTATTTTCAAGAAG
TCCTGGGAAGCGCTCTCCTAGCACGTCCATTTCCAGGAGGAGAAGCAAGCAGATGAGAGGTTTTCCATTTTGTCATCCAAGGTAGCTGTG
CACTTGCCTTGTTGCTGAAGTTCCAATAATGTGAAAACCAAAGTAGAGGTTTTTTTCTTCTTCTTTTTGTTTTCTATTAATTTCACTTAT
ACCAAAGTGTTTGAAAGTATGAAATGTGTTGCTTCTGAGTTATATAAGGCTACTTCATGACAAGACTGCTTTGTAATATTTCACTTTGTT
TTACTACAAATTCAGATCACTTTGTTTTACTATAAATTCAGATTATCCAAATATTTTCCTAATACTATGTGGGAATGCTGATTTTCTTTT
GTTACGTAGTGGAAACATTTTGCATTGTTTACATAGTTCTCATGGAACATGGAAATTTTTGAAAGTGATATATGATACACATTTTTTGTG
TATGTATTCTAATTAGTGTGAATAAAGCAGTAACATTAATGCATTTTTTAAGCAGCAAACTTATGTATTTCTCTTGTCTTCCTTAAAAGT
GTCCCCATGAACTCAGTGTTTATTCCCTTTTCATTTTGAGTACCTGCTTATATGGTCAGTATGTAACGTTAGCATTGGCTCCTAATGGTA
GAATTAGAACAGCAAGATTGTAGAGCTGTAATTGACTCCAGACAACATAGATTTCAGCCACCTCATTCTACAGCTGAGGCCAGGACAATA
AATGCCTTTCCCAGACTGGGTAGTGGCAGATCTGGGATGGAATATGGTTTTCTTGATTCCCTTTCAGCCTTCATTTCTCTCTCTCAGGAC
TACTACTTTTTAATTACTTTTCACTTAATTTCCCAATACTGATGAAATAAAGAAAAATGAGGGTTATTTATATACATTTCAATAAAATCC

>7568_7568_1_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000396411_SEC23IP_chr10_121678956_ENST00000369075_length(amino acids)=546AA_BP=168
MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSL
AVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKMPEEPKLTLD
ESYDLVVENKEVLTLQETLEALSLSEYFSTFEKEKIDMESLLMCTVDDLKEMGIPLGPRKKIANFVEHKAAKLKKAASEKKAVAATSTKG
QEQSAQKTKDMASLPSESNEPKRKLPVGACVSSVCVNYESFEVGAGQVSVAYNSLDFEPEIFFALGSPIAMFLTIRGVDRIDENYSLPTC
KGFFNIYHPLDPVAYRLEPMIVPDLDLKAVLIPHHKGRKRLHLELKESLSRMGSDLKQGFISSLKSAWQTLNEFARAHTSSTQLQEELEK
VANQIKEEEEKQVVEAEKVVESPDFSKDEDYLGKVGMLNGGRRIDYVLQEKPIESFNEYLFALQSHLCYWESEDTALLLLKEIYRTMNIS

--------------------------------------------------------------
>7568_7568_2_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000396411_SEC23IP_chr10_121678956_ENST00000543134_length(transcript)=2117nt_BP=817nt
AGTCGGCGGCCGGACTGGGAAGATGGACGCAGCTACTCTGACCTACGACACTCTCCGGTTTGCTGAGTTTGAAGATTTTCCTGAGACCTC
AGAGCCCGTTTGGATACTGGGTAGAAAATACAGCATTTTCACAGAAAAGGACGAGATCTTGTCTGATGTGGCATCTAGACTTTGGTTTAC
ATACAGGAAAAACTTTCCAGCCATTGGAGGTTGTCGGGACATTTCACTCTCCAGTAAATGTGGCCCTTGGCGTTACTGAGTCCTAAGAGG
GGGGACAGGCCCCACCTCGGACACAGGCTGGGGCTGCATGCTGCGGTGTGGACAGATGATCTTTGCCCAAGCCCTGGTGTGCCGGCACCT
AGGCCGAGATTGGAGGTGGACACAAAGGAAGAGGCAGCCAGACAGCTACTTCAGCGTCCTCAACGCATTCATCGACAGGAAGGACAGTTA
CTACTCCATTCACCAGATAGCGCAAATGGGAGTTGGCGAAGGCAAGTCCATAGGCCAGTGGTACGGGCCCAACACTGTCGCCCAGGTCCT
GAAGAAGCTTGCTGTCTTCGATACGTGGAGCTCCTTGGCGGTCCACATTGCAATGGACAACACTGTTGTGATGGAGGAAATCAGAAGGTT
GTGCAGGACCAGCGTTCCCTGTGCAGGCGCCACTGCGTTTCCTGCAGATTCCGACCGGCACTGCAACGGATTCCCTGCCGGAGCTGAGGT
CACCAACAGGCCGTCGCCATGGAGACCCCTGGTACTTCTCATTCCCCTGCGCCTGGGGCTCACGGACATCAACGAGGCCTACGTGGAGAC
GCTGAAGATGCCTGAAGAGCCAAAGCTGACTTTGGATGAGTCGTATGACCTTGTTGTTGAAAATAAAGAAGTCCTAACTTTGCAAGAAAC
TCTGGAAGCACTTAGCCTCTCTGAATATTTTAGCACTTTTGAAAAGGAAAAGATTGATATGGAGTCCCTGCTTATGTGTACAGTTGATGA
CCTGAAGGAAATGGGGATACCCCTTGGACCCAGAAAGAAGATAGCTAACTTTGTAGAACATAAAGCAGCCAAACTGAAAAAAGCAGCGTC
AGAAAAGAAGGCAGTGGCGGCCACTTCTACAAAAGGACAAGAGCAAAGTGCCCAGAAGACTAAAGACATGGCTTCCCTCCCCTCAGAATC
CAATGAGCCAAAGAGGAAACTTCCAGTTGGTGCTTGCGTGTCTTCTGTGTGTGTGAATTATGAATCTTTTGAAGTTGGCGCCGGACAGGT
TTCTGTTGCTTACAACTCATTAGATTTTGAACCAGAGATATTCTTTGCCTTGGGGTCTCCAATTGCTATGTTTCTCACTATTCGAGGAGT
TGATAGGATAGATGAGAATTACAGCCTTCCTACCTGTAAAGGGTTCTTCAATATTTATCATCCGCTTGATCCAGTGGCATATAGATTAGA
ACCTATGATTGTTCCAGATTTGGACCTAAAAGCTGTTCTCATTCCACATCACAAAGGCAGAAAAAGACTTCATTTAGAATTGAAAGAGAG
TCTCTCTCGTATGGGATCTGATTTGAAGCAGGGTTTTATTAGCTCTCTCAAAAGTGCTTGGCAGACATTAAATGAGTTTGCCCGTGCTCA
TACGTCTTCAACCCAGTTGCAAGAAGAATTGGAGAAGGTGGCCAATCAGATCAAAGAAGAAGAAGAAAAGCAAGTAGTTGAAGCAGAAAA
GGTTGTTGAAAGTCCAGATTTTTCCAAGGATGAGGACTACTTAGGAAAGGTTGGAATGTTAAATGGAGGCCGCCGAATTGACTACGTTCT
CCAAGAAAAACCAATAGAGAGTTTTAATGAATACCTTTTCGCTCTTCAGAGTCACTTATGCTATTGGGAATCTGAAGATACTGCTCTGTT
ACTACTTAAAGAAATTTATCGAACAATGAACATTAGTCCAGAACAGCCCCAGCATTGATCAAACTTCAGTTTTACTGTACTTTCTTGTCT
GCACAGAAAGTCCCAGTACAACTTCCATTGCTGAGAAAATCCTCAGAGGACTTTCCCACTTCGCTCCTGTGATGGATGACAGAAGAGTGA

>7568_7568_2_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000396411_SEC23IP_chr10_121678956_ENST00000543134_length(amino acids)=546AA_BP=168
MLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSL
AVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKMPEEPKLTLD
ESYDLVVENKEVLTLQETLEALSLSEYFSTFEKEKIDMESLLMCTVDDLKEMGIPLGPRKKIANFVEHKAAKLKKAASEKKAVAATSTKG
QEQSAQKTKDMASLPSESNEPKRKLPVGACVSSVCVNYESFEVGAGQVSVAYNSLDFEPEIFFALGSPIAMFLTIRGVDRIDENYSLPTC
KGFFNIYHPLDPVAYRLEPMIVPDLDLKAVLIPHHKGRKRLHLELKESLSRMGSDLKQGFISSLKSAWQTLNEFARAHTSSTQLQEELEK
VANQIKEEEEKQVVEAEKVVESPDFSKDEDYLGKVGMLNGGRRIDYVLQEKPIESFNEYLFALQSHLCYWESEDTALLLLKEIYRTMNIS

--------------------------------------------------------------
>7568_7568_3_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000402096_SEC23IP_chr10_121678956_ENST00000369075_length(transcript)=3547nt_BP=835nt
GCGCCGGCACACCTATTGGCCCCCGCGGCGTCCCGTCGCCGCGTCGCGTTGCTGGCCCGTCGGAGCGACGCCGCTCGGGTCAGTCGGCGG
CCGGACTGGGAAGATGGACGCAGCTACTCTGACCTACGACACTCTCCGGTTTGCTGAGTTTGAAGATTTTCCTGAGACCTCAGAGCCCGT
TTGGATACTGGGTAGAAAATACAGCATTTTCACAGAAAAGGACGAGATCTTGTCTGATGTGGCATCTAGACTTTGGTTTACATACAGGAA
AAACTTTCCAGCCATTGGGGGGACAGGCCCCACCTCGGACACAGGCTGGGGCTGCATGCTGCGGTGTGGACAGATGATCTTTGCCCAAGC
CCTGGTGTGCCGGCACCTAGGCCGAGATTGGAGGTGGACACAAAGGAAGAGGCAGCCAGACAGCTACTTCAGCGTCCTCAACGCATTCAT
CGACAGGAAGGACAGTTACTACTCCATTCACCAGATAGCGCAAATGGGAGTTGGCGAAGGCAAGTCCATAGGCCAGTGGTACGGGCCCAA
CACTGTCGCCCAGGTCCTGAAGAAGCTTGCTGTCTTCGATACGTGGAGCTCCTTGGCGGTCCACATTGCAATGGACAACACTGTTGTGAT
GGAGGAAATCAGAAGGTTGTGCAGGACCAGCGTTCCCTGTGCAGGCGCCACTGCGTTTCCTGCAGATTCCGACCGGCACTGCAACGGATT
CCCTGCCGGAGCTGAGGTCACCAACAGGCCGTCGCCATGGAGACCCCTGGTACTTCTCATTCCCCTGCGCCTGGGGCTCACGGACATCAA
CGAGGCCTACGTGGAGACGCTGAAGATGCCTGAAGAGCCAAAGCTGACTTTGGATGAGTCGTATGACCTTGTTGTTGAAAATAAAGAAGT
CCTAACTTTGCAAGAAACTCTGGAAGCACTTAGCCTCTCTGAATATTTTAGCACTTTTGAAAAGGAAAAGATTGATATGGAGTCCCTGCT
TATGTGTACAGTTGATGACCTGAAGGAAATGGGGATACCCCTTGGACCCAGAAAGAAGATAGCTAACTTTGTAGAACATAAAGCAGCCAA
ACTGAAAAAAGCAGCGTCAGAAAAGAAGGCAGTGGCGGCCACTTCTACAAAAGGACAAGAGCAAAGTGCCCAGAAGACTAAAGACATGGC
TTCCCTCCCCTCAGAATCCAATGAGCCAAAGAGGAAACTTCCAGTTGGTGCTTGCGTGTCTTCTGTGTGTGTGAATTATGAATCTTTTGA
AGTTGGCGCCGGACAGGTTTCTGTTGCTTACAACTCATTAGATTTTGAACCAGAGATATTCTTTGCCTTGGGGTCTCCAATTGCTATGTT
TCTCACTATTCGAGGAGTTGATAGGATAGATGAGAATTACAGCCTTCCTACCTGTAAAGGGTTCTTCAATATTTATCATCCGCTTGATCC
AGTGGCATATAGATTAGAACCTATGATTGTTCCAGATTTGGACCTAAAAGCTGTTCTCATTCCACATCACAAAGGCAGAAAAAGACTTCA
TTTAGAATTGAAAGAGAGTCTCTCTCGTATGGGATCTGATTTGAAGCAGGGTTTTATTAGCTCTCTCAAAAGTGCTTGGCAGACATTAAA
TGAGTTTGCCCGTGCTCATACGTCTTCAACCCAGTTGCAAGAAGAATTGGAGAAGGTGGCCAATCAGATCAAAGAAGAAGAAGAAAAGCA
AGTAGTTGAAGCAGAAAAGGTTGTTGAAAGTCCAGATTTTTCCAAGGATGAGGACTACTTAGGAAAGGTTGGAATGTTAAATGGAGGCCG
CCGAATTGACTACGTTCTCCAAGAAAAACCAATAGAGAGTTTTAATGAATACCTTTTCGCTCTTCAGAGTCACTTATGCTATTGGGAATC
TGAAGATACTGCTCTGTTACTACTTAAAGAAATTTATCGAACAATGAACATTAGTCCAGAACAGCCCCAGCATTGATCAAACTTCAGTTT
TACTGTACTTTCTTGTCTGCACAGAAAGTCCCAGTACAACTTCCATTGCTGAGAAAATCCTCAGAGGACTTTCCCACTTCGCTCCTGTGA
TGGATGACAGAAGAGTGATTCATTAACAATTGCTCAGCCACAATTCTCGGATATAGGGATTCAAAAGACAGGACACAGAACTAACACAGT
GAAAAAAATCAGTACCACATTTGGACAGTATAGGTGAGAAAACATAATTATAAAAATGATGCCATGAAAAATTCCACAGATCAGTTTAGT
TGTATAGTTGTCAAAGTTATATGTGATATCAATGAAGAAATATTTGTAGCATGTAAACGGTTATTTCTGTTTCTTAAAAAGTATTGTTAG
TGGGCTATTAAACTTGGATTTTTCTTTTTATTAATGCAGTATGTTCTTTTTATTCAAGTATGAACTTGTTGAGAAACTATAGTAATATGA
TTTTTAAGAGATTTATGTTCTACTTAAAATGTGAATTGTACTTCTGAGCTGCCTTAATGCAAGGTCATTTATATTTGTTAAGAGGAAATA
ATCAAGATCACTCATATCCCAACTGAATCTGAGGTTTTATAAATCCCTCAAACGATTGCTGAGAGCCTGATTGTGGAAAGAAGTGAGATG
CACCTTATTTTCAAGAAGTCCTGGGAAGCGCTCTCCTAGCACGTCCATTTCCAGGAGGAGAAGCAAGCAGATGAGAGGTTTTCCATTTTG
TCATCCAAGGTAGCTGTGCACTTGCCTTGTTGCTGAAGTTCCAATAATGTGAAAACCAAAGTAGAGGTTTTTTTCTTCTTCTTTTTGTTT
TCTATTAATTTCACTTATACCAAAGTGTTTGAAAGTATGAAATGTGTTGCTTCTGAGTTATATAAGGCTACTTCATGACAAGACTGCTTT
GTAATATTTCACTTTGTTTTACTACAAATTCAGATCACTTTGTTTTACTATAAATTCAGATTATCCAAATATTTTCCTAATACTATGTGG
GAATGCTGATTTTCTTTTGTTACGTAGTGGAAACATTTTGCATTGTTTACATAGTTCTCATGGAACATGGAAATTTTTGAAAGTGATATA
TGATACACATTTTTTGTGTATGTATTCTAATTAGTGTGAATAAAGCAGTAACATTAATGCATTTTTTAAGCAGCAAACTTATGTATTTCT
CTTGTCTTCCTTAAAAGTGTCCCCATGAACTCAGTGTTTATTCCCTTTTCATTTTGAGTACCTGCTTATATGGTCAGTATGTAACGTTAG
CATTGGCTCCTAATGGTAGAATTAGAACAGCAAGATTGTAGAGCTGTAATTGACTCCAGACAACATAGATTTCAGCCACCTCATTCTACA
GCTGAGGCCAGGACAATAAATGCCTTTCCCAGACTGGGTAGTGGCAGATCTGGGATGGAATATGGTTTTCTTGATTCCCTTTCAGCCTTC
ATTTCTCTCTCTCAGGACTACTACTTTTTAATTACTTTTCACTTAATTTCCCAATACTGATGAAATAAAGAAAAATGAGGGTTATTTATA

>7568_7568_3_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000402096_SEC23IP_chr10_121678956_ENST00000369075_length(amino acids)=620AA_BP=242
MDAATLTYDTLRFAEFEDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCR
HLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIR
RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKMPEEPKLTLDESYDLVVENKEVLTLQ
ETLEALSLSEYFSTFEKEKIDMESLLMCTVDDLKEMGIPLGPRKKIANFVEHKAAKLKKAASEKKAVAATSTKGQEQSAQKTKDMASLPS
ESNEPKRKLPVGACVSSVCVNYESFEVGAGQVSVAYNSLDFEPEIFFALGSPIAMFLTIRGVDRIDENYSLPTCKGFFNIYHPLDPVAYR
LEPMIVPDLDLKAVLIPHHKGRKRLHLELKESLSRMGSDLKQGFISSLKSAWQTLNEFARAHTSSTQLQEELEKVANQIKEEEEKQVVEA

--------------------------------------------------------------
>7568_7568_4_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000402096_SEC23IP_chr10_121678956_ENST00000543134_length(transcript)=2135nt_BP=835nt
GCGCCGGCACACCTATTGGCCCCCGCGGCGTCCCGTCGCCGCGTCGCGTTGCTGGCCCGTCGGAGCGACGCCGCTCGGGTCAGTCGGCGG
CCGGACTGGGAAGATGGACGCAGCTACTCTGACCTACGACACTCTCCGGTTTGCTGAGTTTGAAGATTTTCCTGAGACCTCAGAGCCCGT
TTGGATACTGGGTAGAAAATACAGCATTTTCACAGAAAAGGACGAGATCTTGTCTGATGTGGCATCTAGACTTTGGTTTACATACAGGAA
AAACTTTCCAGCCATTGGGGGGACAGGCCCCACCTCGGACACAGGCTGGGGCTGCATGCTGCGGTGTGGACAGATGATCTTTGCCCAAGC
CCTGGTGTGCCGGCACCTAGGCCGAGATTGGAGGTGGACACAAAGGAAGAGGCAGCCAGACAGCTACTTCAGCGTCCTCAACGCATTCAT
CGACAGGAAGGACAGTTACTACTCCATTCACCAGATAGCGCAAATGGGAGTTGGCGAAGGCAAGTCCATAGGCCAGTGGTACGGGCCCAA
CACTGTCGCCCAGGTCCTGAAGAAGCTTGCTGTCTTCGATACGTGGAGCTCCTTGGCGGTCCACATTGCAATGGACAACACTGTTGTGAT
GGAGGAAATCAGAAGGTTGTGCAGGACCAGCGTTCCCTGTGCAGGCGCCACTGCGTTTCCTGCAGATTCCGACCGGCACTGCAACGGATT
CCCTGCCGGAGCTGAGGTCACCAACAGGCCGTCGCCATGGAGACCCCTGGTACTTCTCATTCCCCTGCGCCTGGGGCTCACGGACATCAA
CGAGGCCTACGTGGAGACGCTGAAGATGCCTGAAGAGCCAAAGCTGACTTTGGATGAGTCGTATGACCTTGTTGTTGAAAATAAAGAAGT
CCTAACTTTGCAAGAAACTCTGGAAGCACTTAGCCTCTCTGAATATTTTAGCACTTTTGAAAAGGAAAAGATTGATATGGAGTCCCTGCT
TATGTGTACAGTTGATGACCTGAAGGAAATGGGGATACCCCTTGGACCCAGAAAGAAGATAGCTAACTTTGTAGAACATAAAGCAGCCAA
ACTGAAAAAAGCAGCGTCAGAAAAGAAGGCAGTGGCGGCCACTTCTACAAAAGGACAAGAGCAAAGTGCCCAGAAGACTAAAGACATGGC
TTCCCTCCCCTCAGAATCCAATGAGCCAAAGAGGAAACTTCCAGTTGGTGCTTGCGTGTCTTCTGTGTGTGTGAATTATGAATCTTTTGA
AGTTGGCGCCGGACAGGTTTCTGTTGCTTACAACTCATTAGATTTTGAACCAGAGATATTCTTTGCCTTGGGGTCTCCAATTGCTATGTT
TCTCACTATTCGAGGAGTTGATAGGATAGATGAGAATTACAGCCTTCCTACCTGTAAAGGGTTCTTCAATATTTATCATCCGCTTGATCC
AGTGGCATATAGATTAGAACCTATGATTGTTCCAGATTTGGACCTAAAAGCTGTTCTCATTCCACATCACAAAGGCAGAAAAAGACTTCA
TTTAGAATTGAAAGAGAGTCTCTCTCGTATGGGATCTGATTTGAAGCAGGGTTTTATTAGCTCTCTCAAAAGTGCTTGGCAGACATTAAA
TGAGTTTGCCCGTGCTCATACGTCTTCAACCCAGTTGCAAGAAGAATTGGAGAAGGTGGCCAATCAGATCAAAGAAGAAGAAGAAAAGCA
AGTAGTTGAAGCAGAAAAGGTTGTTGAAAGTCCAGATTTTTCCAAGGATGAGGACTACTTAGGAAAGGTTGGAATGTTAAATGGAGGCCG
CCGAATTGACTACGTTCTCCAAGAAAAACCAATAGAGAGTTTTAATGAATACCTTTTCGCTCTTCAGAGTCACTTATGCTATTGGGAATC
TGAAGATACTGCTCTGTTACTACTTAAAGAAATTTATCGAACAATGAACATTAGTCCAGAACAGCCCCAGCATTGATCAAACTTCAGTTT
TACTGTACTTTCTTGTCTGCACAGAAAGTCCCAGTACAACTTCCATTGCTGAGAAAATCCTCAGAGGACTTTCCCACTTCGCTCCTGTGA

>7568_7568_4_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000402096_SEC23IP_chr10_121678956_ENST00000543134_length(amino acids)=620AA_BP=242
MDAATLTYDTLRFAEFEDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCR
HLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIR
RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKMPEEPKLTLDESYDLVVENKEVLTLQ
ETLEALSLSEYFSTFEKEKIDMESLLMCTVDDLKEMGIPLGPRKKIANFVEHKAAKLKKAASEKKAVAATSTKGQEQSAQKTKDMASLPS
ESNEPKRKLPVGACVSSVCVNYESFEVGAGQVSVAYNSLDFEPEIFFALGSPIAMFLTIRGVDRIDENYSLPTCKGFFNIYHPLDPVAYR
LEPMIVPDLDLKAVLIPHHKGRKRLHLELKESLSRMGSDLKQGFISSLKSAWQTLNEFARAHTSSTQLQEELEKVANQIKEEEEKQVVEA

--------------------------------------------------------------
>7568_7568_5_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000404914_SEC23IP_chr10_121678956_ENST00000369075_length(transcript)=3547nt_BP=835nt
GCGCCGGCACACCTATTGGCCCCCGCGGCGTCCCGTCGCCGCGTCGCGTTGCTGGCCCGTCGGAGCGACGCCGCTCGGGTCAGTCGGCGG
CCGGACTGGGAAGATGGACGCAGCTACTCTGACCTACGACACTCTCCGGTTTGCTGAGTTTGAAGATTTTCCTGAGACCTCAGAGCCCGT
TTGGATACTGGGTAGAAAATACAGCATTTTCACAGAAAAGGACGAGATCTTGTCTGATGTGGCATCTAGACTTTGGTTTACATACAGGAA
AAACTTTCCAGCCATTGGGGGGACAGGCCCCACCTCGGACACAGGCTGGGGCTGCATGCTGCGGTGTGGACAGATGATCTTTGCCCAAGC
CCTGGTGTGCCGGCACCTAGGCCGAGATTGGAGGTGGACACAAAGGAAGAGGCAGCCAGACAGCTACTTCAGCGTCCTCAACGCATTCAT
CGACAGGAAGGACAGTTACTACTCCATTCACCAGATAGCGCAAATGGGAGTTGGCGAAGGCAAGTCCATAGGCCAGTGGTACGGGCCCAA
CACTGTCGCCCAGGTCCTGAAGAAGCTTGCTGTCTTCGATACGTGGAGCTCCTTGGCGGTCCACATTGCAATGGACAACACTGTTGTGAT
GGAGGAAATCAGAAGGTTGTGCAGGACCAGCGTTCCCTGTGCAGGCGCCACTGCGTTTCCTGCAGATTCCGACCGGCACTGCAACGGATT
CCCTGCCGGAGCTGAGGTCACCAACAGGCCGTCGCCATGGAGACCCCTGGTACTTCTCATTCCCCTGCGCCTGGGGCTCACGGACATCAA
CGAGGCCTACGTGGAGACGCTGAAGATGCCTGAAGAGCCAAAGCTGACTTTGGATGAGTCGTATGACCTTGTTGTTGAAAATAAAGAAGT
CCTAACTTTGCAAGAAACTCTGGAAGCACTTAGCCTCTCTGAATATTTTAGCACTTTTGAAAAGGAAAAGATTGATATGGAGTCCCTGCT
TATGTGTACAGTTGATGACCTGAAGGAAATGGGGATACCCCTTGGACCCAGAAAGAAGATAGCTAACTTTGTAGAACATAAAGCAGCCAA
ACTGAAAAAAGCAGCGTCAGAAAAGAAGGCAGTGGCGGCCACTTCTACAAAAGGACAAGAGCAAAGTGCCCAGAAGACTAAAGACATGGC
TTCCCTCCCCTCAGAATCCAATGAGCCAAAGAGGAAACTTCCAGTTGGTGCTTGCGTGTCTTCTGTGTGTGTGAATTATGAATCTTTTGA
AGTTGGCGCCGGACAGGTTTCTGTTGCTTACAACTCATTAGATTTTGAACCAGAGATATTCTTTGCCTTGGGGTCTCCAATTGCTATGTT
TCTCACTATTCGAGGAGTTGATAGGATAGATGAGAATTACAGCCTTCCTACCTGTAAAGGGTTCTTCAATATTTATCATCCGCTTGATCC
AGTGGCATATAGATTAGAACCTATGATTGTTCCAGATTTGGACCTAAAAGCTGTTCTCATTCCACATCACAAAGGCAGAAAAAGACTTCA
TTTAGAATTGAAAGAGAGTCTCTCTCGTATGGGATCTGATTTGAAGCAGGGTTTTATTAGCTCTCTCAAAAGTGCTTGGCAGACATTAAA
TGAGTTTGCCCGTGCTCATACGTCTTCAACCCAGTTGCAAGAAGAATTGGAGAAGGTGGCCAATCAGATCAAAGAAGAAGAAGAAAAGCA
AGTAGTTGAAGCAGAAAAGGTTGTTGAAAGTCCAGATTTTTCCAAGGATGAGGACTACTTAGGAAAGGTTGGAATGTTAAATGGAGGCCG
CCGAATTGACTACGTTCTCCAAGAAAAACCAATAGAGAGTTTTAATGAATACCTTTTCGCTCTTCAGAGTCACTTATGCTATTGGGAATC
TGAAGATACTGCTCTGTTACTACTTAAAGAAATTTATCGAACAATGAACATTAGTCCAGAACAGCCCCAGCATTGATCAAACTTCAGTTT
TACTGTACTTTCTTGTCTGCACAGAAAGTCCCAGTACAACTTCCATTGCTGAGAAAATCCTCAGAGGACTTTCCCACTTCGCTCCTGTGA
TGGATGACAGAAGAGTGATTCATTAACAATTGCTCAGCCACAATTCTCGGATATAGGGATTCAAAAGACAGGACACAGAACTAACACAGT
GAAAAAAATCAGTACCACATTTGGACAGTATAGGTGAGAAAACATAATTATAAAAATGATGCCATGAAAAATTCCACAGATCAGTTTAGT
TGTATAGTTGTCAAAGTTATATGTGATATCAATGAAGAAATATTTGTAGCATGTAAACGGTTATTTCTGTTTCTTAAAAAGTATTGTTAG
TGGGCTATTAAACTTGGATTTTTCTTTTTATTAATGCAGTATGTTCTTTTTATTCAAGTATGAACTTGTTGAGAAACTATAGTAATATGA
TTTTTAAGAGATTTATGTTCTACTTAAAATGTGAATTGTACTTCTGAGCTGCCTTAATGCAAGGTCATTTATATTTGTTAAGAGGAAATA
ATCAAGATCACTCATATCCCAACTGAATCTGAGGTTTTATAAATCCCTCAAACGATTGCTGAGAGCCTGATTGTGGAAAGAAGTGAGATG
CACCTTATTTTCAAGAAGTCCTGGGAAGCGCTCTCCTAGCACGTCCATTTCCAGGAGGAGAAGCAAGCAGATGAGAGGTTTTCCATTTTG
TCATCCAAGGTAGCTGTGCACTTGCCTTGTTGCTGAAGTTCCAATAATGTGAAAACCAAAGTAGAGGTTTTTTTCTTCTTCTTTTTGTTT
TCTATTAATTTCACTTATACCAAAGTGTTTGAAAGTATGAAATGTGTTGCTTCTGAGTTATATAAGGCTACTTCATGACAAGACTGCTTT
GTAATATTTCACTTTGTTTTACTACAAATTCAGATCACTTTGTTTTACTATAAATTCAGATTATCCAAATATTTTCCTAATACTATGTGG
GAATGCTGATTTTCTTTTGTTACGTAGTGGAAACATTTTGCATTGTTTACATAGTTCTCATGGAACATGGAAATTTTTGAAAGTGATATA
TGATACACATTTTTTGTGTATGTATTCTAATTAGTGTGAATAAAGCAGTAACATTAATGCATTTTTTAAGCAGCAAACTTATGTATTTCT
CTTGTCTTCCTTAAAAGTGTCCCCATGAACTCAGTGTTTATTCCCTTTTCATTTTGAGTACCTGCTTATATGGTCAGTATGTAACGTTAG
CATTGGCTCCTAATGGTAGAATTAGAACAGCAAGATTGTAGAGCTGTAATTGACTCCAGACAACATAGATTTCAGCCACCTCATTCTACA
GCTGAGGCCAGGACAATAAATGCCTTTCCCAGACTGGGTAGTGGCAGATCTGGGATGGAATATGGTTTTCTTGATTCCCTTTCAGCCTTC
ATTTCTCTCTCTCAGGACTACTACTTTTTAATTACTTTTCACTTAATTTCCCAATACTGATGAAATAAAGAAAAATGAGGGTTATTTATA

>7568_7568_5_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000404914_SEC23IP_chr10_121678956_ENST00000369075_length(amino acids)=620AA_BP=242
MDAATLTYDTLRFAEFEDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCR
HLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIR
RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKMPEEPKLTLDESYDLVVENKEVLTLQ
ETLEALSLSEYFSTFEKEKIDMESLLMCTVDDLKEMGIPLGPRKKIANFVEHKAAKLKKAASEKKAVAATSTKGQEQSAQKTKDMASLPS
ESNEPKRKLPVGACVSSVCVNYESFEVGAGQVSVAYNSLDFEPEIFFALGSPIAMFLTIRGVDRIDENYSLPTCKGFFNIYHPLDPVAYR
LEPMIVPDLDLKAVLIPHHKGRKRLHLELKESLSRMGSDLKQGFISSLKSAWQTLNEFARAHTSSTQLQEELEKVANQIKEEEEKQVVEA

--------------------------------------------------------------
>7568_7568_6_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000404914_SEC23IP_chr10_121678956_ENST00000543134_length(transcript)=2135nt_BP=835nt
GCGCCGGCACACCTATTGGCCCCCGCGGCGTCCCGTCGCCGCGTCGCGTTGCTGGCCCGTCGGAGCGACGCCGCTCGGGTCAGTCGGCGG
CCGGACTGGGAAGATGGACGCAGCTACTCTGACCTACGACACTCTCCGGTTTGCTGAGTTTGAAGATTTTCCTGAGACCTCAGAGCCCGT
TTGGATACTGGGTAGAAAATACAGCATTTTCACAGAAAAGGACGAGATCTTGTCTGATGTGGCATCTAGACTTTGGTTTACATACAGGAA
AAACTTTCCAGCCATTGGGGGGACAGGCCCCACCTCGGACACAGGCTGGGGCTGCATGCTGCGGTGTGGACAGATGATCTTTGCCCAAGC
CCTGGTGTGCCGGCACCTAGGCCGAGATTGGAGGTGGACACAAAGGAAGAGGCAGCCAGACAGCTACTTCAGCGTCCTCAACGCATTCAT
CGACAGGAAGGACAGTTACTACTCCATTCACCAGATAGCGCAAATGGGAGTTGGCGAAGGCAAGTCCATAGGCCAGTGGTACGGGCCCAA
CACTGTCGCCCAGGTCCTGAAGAAGCTTGCTGTCTTCGATACGTGGAGCTCCTTGGCGGTCCACATTGCAATGGACAACACTGTTGTGAT
GGAGGAAATCAGAAGGTTGTGCAGGACCAGCGTTCCCTGTGCAGGCGCCACTGCGTTTCCTGCAGATTCCGACCGGCACTGCAACGGATT
CCCTGCCGGAGCTGAGGTCACCAACAGGCCGTCGCCATGGAGACCCCTGGTACTTCTCATTCCCCTGCGCCTGGGGCTCACGGACATCAA
CGAGGCCTACGTGGAGACGCTGAAGATGCCTGAAGAGCCAAAGCTGACTTTGGATGAGTCGTATGACCTTGTTGTTGAAAATAAAGAAGT
CCTAACTTTGCAAGAAACTCTGGAAGCACTTAGCCTCTCTGAATATTTTAGCACTTTTGAAAAGGAAAAGATTGATATGGAGTCCCTGCT
TATGTGTACAGTTGATGACCTGAAGGAAATGGGGATACCCCTTGGACCCAGAAAGAAGATAGCTAACTTTGTAGAACATAAAGCAGCCAA
ACTGAAAAAAGCAGCGTCAGAAAAGAAGGCAGTGGCGGCCACTTCTACAAAAGGACAAGAGCAAAGTGCCCAGAAGACTAAAGACATGGC
TTCCCTCCCCTCAGAATCCAATGAGCCAAAGAGGAAACTTCCAGTTGGTGCTTGCGTGTCTTCTGTGTGTGTGAATTATGAATCTTTTGA
AGTTGGCGCCGGACAGGTTTCTGTTGCTTACAACTCATTAGATTTTGAACCAGAGATATTCTTTGCCTTGGGGTCTCCAATTGCTATGTT
TCTCACTATTCGAGGAGTTGATAGGATAGATGAGAATTACAGCCTTCCTACCTGTAAAGGGTTCTTCAATATTTATCATCCGCTTGATCC
AGTGGCATATAGATTAGAACCTATGATTGTTCCAGATTTGGACCTAAAAGCTGTTCTCATTCCACATCACAAAGGCAGAAAAAGACTTCA
TTTAGAATTGAAAGAGAGTCTCTCTCGTATGGGATCTGATTTGAAGCAGGGTTTTATTAGCTCTCTCAAAAGTGCTTGGCAGACATTAAA
TGAGTTTGCCCGTGCTCATACGTCTTCAACCCAGTTGCAAGAAGAATTGGAGAAGGTGGCCAATCAGATCAAAGAAGAAGAAGAAAAGCA
AGTAGTTGAAGCAGAAAAGGTTGTTGAAAGTCCAGATTTTTCCAAGGATGAGGACTACTTAGGAAAGGTTGGAATGTTAAATGGAGGCCG
CCGAATTGACTACGTTCTCCAAGAAAAACCAATAGAGAGTTTTAATGAATACCTTTTCGCTCTTCAGAGTCACTTATGCTATTGGGAATC
TGAAGATACTGCTCTGTTACTACTTAAAGAAATTTATCGAACAATGAACATTAGTCCAGAACAGCCCCAGCATTGATCAAACTTCAGTTT
TACTGTACTTTCTTGTCTGCACAGAAAGTCCCAGTACAACTTCCATTGCTGAGAAAATCCTCAGAGGACTTTCCCACTTCGCTCCTGTGA

>7568_7568_6_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000404914_SEC23IP_chr10_121678956_ENST00000543134_length(amino acids)=620AA_BP=242
MDAATLTYDTLRFAEFEDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCR
HLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIR
RLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKMPEEPKLTLDESYDLVVENKEVLTLQ
ETLEALSLSEYFSTFEKEKIDMESLLMCTVDDLKEMGIPLGPRKKIANFVEHKAAKLKKAASEKKAVAATSTKGQEQSAQKTKDMASLPS
ESNEPKRKLPVGACVSSVCVNYESFEVGAGQVSVAYNSLDFEPEIFFALGSPIAMFLTIRGVDRIDENYSLPTCKGFFNIYHPLDPVAYR
LEPMIVPDLDLKAVLIPHHKGRKRLHLELKESLSRMGSDLKQGFISSLKSAWQTLNEFARAHTSSTQLQEELEKVANQIKEEEEKQVVEA

--------------------------------------------------------------
>7568_7568_7_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000405546_SEC23IP_chr10_121678956_ENST00000369075_length(transcript)=3946nt_BP=1234nt
CTCGCCAGCCGCGGGTTCGGGCGTCTTCGGACCAGCGGGGCGCCGCAGGCCCCTCGCAGCGTCCGTCGGCAGGCGGGCAGACGGGCGGGG
GAGTCGCCCCGGCGGGGCAAGTCCGTACCGCGACATGGGCGCGCCGAGCACGTCCGTACCGCAAGATGGCTGCTCGGACGGGGACAGAGC
TCGCCTCTGCCGCCTCGACAACTGCTCCTGGGTCCTCTAAGAGGAGGAAGCGCCACCCATGGCACACAGTGTCCCGTCGGACAGCAGAAC
CAGCCGTCGTCCCACGACACGACCCCATGCCGCCCGCAGGGCGCCCCGGGGCTCGCGTCGGCCCGGCCGTACGCCAAAATGGCGGCTCCC
GCGTATTTCCGCTCGCGCGCCGTATCGTCTTCGCCGCCTGCGCCGGCACACCTATTGGCCCCCGCGGCGTCCCGTCGCCGCGTCGCGTTG
CTGGCCCGTCGGAGCGACGCCGCTCGGGTCAGTCGGCGGCCGGACTGGGAAGATGGACGCAGCTACTCTGACCTACGACACTCTCCGGTT
TGCTGAGTTTGAAGATTTTCCTGAGACCTCAGAGCCCGTTTGGATACTGGGTAGAAAATACAGCATTTTCACAGAAAAGGACGAGATCTT
GTCTGATGTGGCATCTAGACTTTGGTTTACATACAGGAAAAACTTTCCAGCCATTGGGGGGACAGGCCCCACCTCGGACACAGGCTGGGG
CTGCATGCTGCGGTGTGGACAGATGATCTTTGCCCAAGCCCTGGTGTGCCGGCACCTAGGCCGAGATTGGAGGTGGACACAAAGGAAGAG
GCAGCCAGACAGCTACTTCAGCGTCCTCAACGCATTCATCGACAGGAAGGACAGTTACTACTCCATTCACCAGATAGCGCAAATGGGAGT
TGGCGAAGGCAAGTCCATAGGCCAGTGGTACGGGCCCAACACTGTCGCCCAGGTCCTGAAGAAGCTTGCTGTCTTCGATACGTGGAGCTC
CTTGGCGGTCCACATTGCAATGGACAACACTGTTGTGATGGAGGAAATCAGAAGGTTGTGCAGGACCAGCGTTCCCTGTGCAGGCGCCAC
TGCGTTTCCTGCAGATTCCGACCGGCACTGCAACGGATTCCCTGCCGGAGCTGAGGTCACCAACAGGCCGTCGCCATGGAGACCCCTGGT
ACTTCTCATTCCCCTGCGCCTGGGGCTCACGGACATCAACGAGGCCTACGTGGAGACGCTGAAGATGCCTGAAGAGCCAAAGCTGACTTT
GGATGAGTCGTATGACCTTGTTGTTGAAAATAAAGAAGTCCTAACTTTGCAAGAAACTCTGGAAGCACTTAGCCTCTCTGAATATTTTAG
CACTTTTGAAAAGGAAAAGATTGATATGGAGTCCCTGCTTATGTGTACAGTTGATGACCTGAAGGAAATGGGGATACCCCTTGGACCCAG
AAAGAAGATAGCTAACTTTGTAGAACATAAAGCAGCCAAACTGAAAAAAGCAGCGTCAGAAAAGAAGGCAGTGGCGGCCACTTCTACAAA
AGGACAAGAGCAAAGTGCCCAGAAGACTAAAGACATGGCTTCCCTCCCCTCAGAATCCAATGAGCCAAAGAGGAAACTTCCAGTTGGTGC
TTGCGTGTCTTCTGTGTGTGTGAATTATGAATCTTTTGAAGTTGGCGCCGGACAGGTTTCTGTTGCTTACAACTCATTAGATTTTGAACC
AGAGATATTCTTTGCCTTGGGGTCTCCAATTGCTATGTTTCTCACTATTCGAGGAGTTGATAGGATAGATGAGAATTACAGCCTTCCTAC
CTGTAAAGGGTTCTTCAATATTTATCATCCGCTTGATCCAGTGGCATATAGATTAGAACCTATGATTGTTCCAGATTTGGACCTAAAAGC
TGTTCTCATTCCACATCACAAAGGCAGAAAAAGACTTCATTTAGAATTGAAAGAGAGTCTCTCTCGTATGGGATCTGATTTGAAGCAGGG
TTTTATTAGCTCTCTCAAAAGTGCTTGGCAGACATTAAATGAGTTTGCCCGTGCTCATACGTCTTCAACCCAGTTGCAAGAAGAATTGGA
GAAGGTGGCCAATCAGATCAAAGAAGAAGAAGAAAAGCAAGTAGTTGAAGCAGAAAAGGTTGTTGAAAGTCCAGATTTTTCCAAGGATGA
GGACTACTTAGGAAAGGTTGGAATGTTAAATGGAGGCCGCCGAATTGACTACGTTCTCCAAGAAAAACCAATAGAGAGTTTTAATGAATA
CCTTTTCGCTCTTCAGAGTCACTTATGCTATTGGGAATCTGAAGATACTGCTCTGTTACTACTTAAAGAAATTTATCGAACAATGAACAT
TAGTCCAGAACAGCCCCAGCATTGATCAAACTTCAGTTTTACTGTACTTTCTTGTCTGCACAGAAAGTCCCAGTACAACTTCCATTGCTG
AGAAAATCCTCAGAGGACTTTCCCACTTCGCTCCTGTGATGGATGACAGAAGAGTGATTCATTAACAATTGCTCAGCCACAATTCTCGGA
TATAGGGATTCAAAAGACAGGACACAGAACTAACACAGTGAAAAAAATCAGTACCACATTTGGACAGTATAGGTGAGAAAACATAATTAT
AAAAATGATGCCATGAAAAATTCCACAGATCAGTTTAGTTGTATAGTTGTCAAAGTTATATGTGATATCAATGAAGAAATATTTGTAGCA
TGTAAACGGTTATTTCTGTTTCTTAAAAAGTATTGTTAGTGGGCTATTAAACTTGGATTTTTCTTTTTATTAATGCAGTATGTTCTTTTT
ATTCAAGTATGAACTTGTTGAGAAACTATAGTAATATGATTTTTAAGAGATTTATGTTCTACTTAAAATGTGAATTGTACTTCTGAGCTG
CCTTAATGCAAGGTCATTTATATTTGTTAAGAGGAAATAATCAAGATCACTCATATCCCAACTGAATCTGAGGTTTTATAAATCCCTCAA
ACGATTGCTGAGAGCCTGATTGTGGAAAGAAGTGAGATGCACCTTATTTTCAAGAAGTCCTGGGAAGCGCTCTCCTAGCACGTCCATTTC
CAGGAGGAGAAGCAAGCAGATGAGAGGTTTTCCATTTTGTCATCCAAGGTAGCTGTGCACTTGCCTTGTTGCTGAAGTTCCAATAATGTG
AAAACCAAAGTAGAGGTTTTTTTCTTCTTCTTTTTGTTTTCTATTAATTTCACTTATACCAAAGTGTTTGAAAGTATGAAATGTGTTGCT
TCTGAGTTATATAAGGCTACTTCATGACAAGACTGCTTTGTAATATTTCACTTTGTTTTACTACAAATTCAGATCACTTTGTTTTACTAT
AAATTCAGATTATCCAAATATTTTCCTAATACTATGTGGGAATGCTGATTTTCTTTTGTTACGTAGTGGAAACATTTTGCATTGTTTACA
TAGTTCTCATGGAACATGGAAATTTTTGAAAGTGATATATGATACACATTTTTTGTGTATGTATTCTAATTAGTGTGAATAAAGCAGTAA
CATTAATGCATTTTTTAAGCAGCAAACTTATGTATTTCTCTTGTCTTCCTTAAAAGTGTCCCCATGAACTCAGTGTTTATTCCCTTTTCA
TTTTGAGTACCTGCTTATATGGTCAGTATGTAACGTTAGCATTGGCTCCTAATGGTAGAATTAGAACAGCAAGATTGTAGAGCTGTAATT
GACTCCAGACAACATAGATTTCAGCCACCTCATTCTACAGCTGAGGCCAGGACAATAAATGCCTTTCCCAGACTGGGTAGTGGCAGATCT
GGGATGGAATATGGTTTTCTTGATTCCCTTTCAGCCTTCATTTCTCTCTCTCAGGACTACTACTTTTTAATTACTTTTCACTTAATTTCC

>7568_7568_7_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000405546_SEC23IP_chr10_121678956_ENST00000369075_length(amino acids)=708AA_BP=330
MAHSVPSDSRTSRRPTTRPHAARRAPRGSRRPGRTPKWRLPRISARAPYRLRRLRRHTYWPPRRPVAASRCWPVGATPLGSVGGRTGKMD
AATLTYDTLRFAEFEDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHL
GRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRL
CRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKMPEEPKLTLDESYDLVVENKEVLTLQET
LEALSLSEYFSTFEKEKIDMESLLMCTVDDLKEMGIPLGPRKKIANFVEHKAAKLKKAASEKKAVAATSTKGQEQSAQKTKDMASLPSES
NEPKRKLPVGACVSSVCVNYESFEVGAGQVSVAYNSLDFEPEIFFALGSPIAMFLTIRGVDRIDENYSLPTCKGFFNIYHPLDPVAYRLE
PMIVPDLDLKAVLIPHHKGRKRLHLELKESLSRMGSDLKQGFISSLKSAWQTLNEFARAHTSSTQLQEELEKVANQIKEEEEKQVVEAEK

--------------------------------------------------------------
>7568_7568_8_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000405546_SEC23IP_chr10_121678956_ENST00000543134_length(transcript)=2534nt_BP=1234nt
CTCGCCAGCCGCGGGTTCGGGCGTCTTCGGACCAGCGGGGCGCCGCAGGCCCCTCGCAGCGTCCGTCGGCAGGCGGGCAGACGGGCGGGG
GAGTCGCCCCGGCGGGGCAAGTCCGTACCGCGACATGGGCGCGCCGAGCACGTCCGTACCGCAAGATGGCTGCTCGGACGGGGACAGAGC
TCGCCTCTGCCGCCTCGACAACTGCTCCTGGGTCCTCTAAGAGGAGGAAGCGCCACCCATGGCACACAGTGTCCCGTCGGACAGCAGAAC
CAGCCGTCGTCCCACGACACGACCCCATGCCGCCCGCAGGGCGCCCCGGGGCTCGCGTCGGCCCGGCCGTACGCCAAAATGGCGGCTCCC
GCGTATTTCCGCTCGCGCGCCGTATCGTCTTCGCCGCCTGCGCCGGCACACCTATTGGCCCCCGCGGCGTCCCGTCGCCGCGTCGCGTTG
CTGGCCCGTCGGAGCGACGCCGCTCGGGTCAGTCGGCGGCCGGACTGGGAAGATGGACGCAGCTACTCTGACCTACGACACTCTCCGGTT
TGCTGAGTTTGAAGATTTTCCTGAGACCTCAGAGCCCGTTTGGATACTGGGTAGAAAATACAGCATTTTCACAGAAAAGGACGAGATCTT
GTCTGATGTGGCATCTAGACTTTGGTTTACATACAGGAAAAACTTTCCAGCCATTGGGGGGACAGGCCCCACCTCGGACACAGGCTGGGG
CTGCATGCTGCGGTGTGGACAGATGATCTTTGCCCAAGCCCTGGTGTGCCGGCACCTAGGCCGAGATTGGAGGTGGACACAAAGGAAGAG
GCAGCCAGACAGCTACTTCAGCGTCCTCAACGCATTCATCGACAGGAAGGACAGTTACTACTCCATTCACCAGATAGCGCAAATGGGAGT
TGGCGAAGGCAAGTCCATAGGCCAGTGGTACGGGCCCAACACTGTCGCCCAGGTCCTGAAGAAGCTTGCTGTCTTCGATACGTGGAGCTC
CTTGGCGGTCCACATTGCAATGGACAACACTGTTGTGATGGAGGAAATCAGAAGGTTGTGCAGGACCAGCGTTCCCTGTGCAGGCGCCAC
TGCGTTTCCTGCAGATTCCGACCGGCACTGCAACGGATTCCCTGCCGGAGCTGAGGTCACCAACAGGCCGTCGCCATGGAGACCCCTGGT
ACTTCTCATTCCCCTGCGCCTGGGGCTCACGGACATCAACGAGGCCTACGTGGAGACGCTGAAGATGCCTGAAGAGCCAAAGCTGACTTT
GGATGAGTCGTATGACCTTGTTGTTGAAAATAAAGAAGTCCTAACTTTGCAAGAAACTCTGGAAGCACTTAGCCTCTCTGAATATTTTAG
CACTTTTGAAAAGGAAAAGATTGATATGGAGTCCCTGCTTATGTGTACAGTTGATGACCTGAAGGAAATGGGGATACCCCTTGGACCCAG
AAAGAAGATAGCTAACTTTGTAGAACATAAAGCAGCCAAACTGAAAAAAGCAGCGTCAGAAAAGAAGGCAGTGGCGGCCACTTCTACAAA
AGGACAAGAGCAAAGTGCCCAGAAGACTAAAGACATGGCTTCCCTCCCCTCAGAATCCAATGAGCCAAAGAGGAAACTTCCAGTTGGTGC
TTGCGTGTCTTCTGTGTGTGTGAATTATGAATCTTTTGAAGTTGGCGCCGGACAGGTTTCTGTTGCTTACAACTCATTAGATTTTGAACC
AGAGATATTCTTTGCCTTGGGGTCTCCAATTGCTATGTTTCTCACTATTCGAGGAGTTGATAGGATAGATGAGAATTACAGCCTTCCTAC
CTGTAAAGGGTTCTTCAATATTTATCATCCGCTTGATCCAGTGGCATATAGATTAGAACCTATGATTGTTCCAGATTTGGACCTAAAAGC
TGTTCTCATTCCACATCACAAAGGCAGAAAAAGACTTCATTTAGAATTGAAAGAGAGTCTCTCTCGTATGGGATCTGATTTGAAGCAGGG
TTTTATTAGCTCTCTCAAAAGTGCTTGGCAGACATTAAATGAGTTTGCCCGTGCTCATACGTCTTCAACCCAGTTGCAAGAAGAATTGGA
GAAGGTGGCCAATCAGATCAAAGAAGAAGAAGAAAAGCAAGTAGTTGAAGCAGAAAAGGTTGTTGAAAGTCCAGATTTTTCCAAGGATGA
GGACTACTTAGGAAAGGTTGGAATGTTAAATGGAGGCCGCCGAATTGACTACGTTCTCCAAGAAAAACCAATAGAGAGTTTTAATGAATA
CCTTTTCGCTCTTCAGAGTCACTTATGCTATTGGGAATCTGAAGATACTGCTCTGTTACTACTTAAAGAAATTTATCGAACAATGAACAT
TAGTCCAGAACAGCCCCAGCATTGATCAAACTTCAGTTTTACTGTACTTTCTTGTCTGCACAGAAAGTCCCAGTACAACTTCCATTGCTG
AGAAAATCCTCAGAGGACTTTCCCACTTCGCTCCTGTGATGGATGACAGAAGAGTGATTCATTAACAATTGCTCAGCCACAATTCTCGGA

>7568_7568_8_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000405546_SEC23IP_chr10_121678956_ENST00000543134_length(amino acids)=708AA_BP=330
MAHSVPSDSRTSRRPTTRPHAARRAPRGSRRPGRTPKWRLPRISARAPYRLRRLRRHTYWPPRRPVAASRCWPVGATPLGSVGGRTGKMD
AATLTYDTLRFAEFEDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHL
GRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRL
CRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKMPEEPKLTLDESYDLVVENKEVLTLQET
LEALSLSEYFSTFEKEKIDMESLLMCTVDDLKEMGIPLGPRKKIANFVEHKAAKLKKAASEKKAVAATSTKGQEQSAQKTKDMASLPSES
NEPKRKLPVGACVSSVCVNYESFEVGAGQVSVAYNSLDFEPEIFFALGSPIAMFLTIRGVDRIDENYSLPTCKGFFNIYHPLDPVAYRLE
PMIVPDLDLKAVLIPHHKGRKRLHLELKESLSRMGSDLKQGFISSLKSAWQTLNEFARAHTSSTQLQEELEKVANQIKEEEEKQVVEAEK

--------------------------------------------------------------
>7568_7568_9_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000474739_SEC23IP_chr10_121678956_ENST00000369075_length(transcript)=3557nt_BP=845nt
AGTCGGCGGCCGGACTGGGAAGATGGACGCAGCTACTCTGACCTACGACACTCTCCGGTTTGCTGAGTTTGAAGATTTTCCTGAGACCTC
AGAGCCCGTTTGGATACTGGGTAGAAAATACAGCATTTTCACAGAAAAGGACGAGATCTTGTCTGATGTGGCATCTAGACTTTGGTTTAC
ATACAGGAAAAACTTTCCAGCCATTGCCATTCATCATTGTTGTATACCCTGGAGCTGGAAGGAGATGGGGACTGGTTCTCAGCCTTGCCT
CTCACCGGCGGAGAACTGAGGCCGGAGGGGGGACAGGCCCCACCTCGGACACAGGCTGGGGCTGCATGCTGCGGTGTGGACAGATGATCT
TTGCCCAAGCCCTGGTGTGCCGGCACCTAGGCCGAGATTGGAGGTGGACACAAAGGAAGAGGCAGCCAGACAGCTACTTCAGCGTCCTCA
ACGCATTCATCGACAGGAAGGACAGTTACTACTCCATTCACCAGATAGCGCAAATGGGAGTTGGCGAAGGCAAGTCCATAGGCCAGTGGT
ACGGGCCCAACACTGTCGCCCAGGTCCTGAAGAAGCTTGCTGTCTTCGATACGTGGAGCTCCTTGGCGGTCCACATTGCAATGGACAACA
CTGTTGTGATGGAGGAAATCAGAAGGTTGTGCAGGACCAGCGTTCCCTGTGCAGGCGCCACTGCGTTTCCTGCAGATTCCGACCGGCACT
GCAACGGATTCCCTGCCGGAGCTGAGGTCACCAACAGGCCGTCGCCATGGAGACCCCTGGTACTTCTCATTCCCCTGCGCCTGGGGCTCA
CGGACATCAACGAGGCCTACGTGGAGACGCTGAAGATGCCTGAAGAGCCAAAGCTGACTTTGGATGAGTCGTATGACCTTGTTGTTGAAA
ATAAAGAAGTCCTAACTTTGCAAGAAACTCTGGAAGCACTTAGCCTCTCTGAATATTTTAGCACTTTTGAAAAGGAAAAGATTGATATGG
AGTCCCTGCTTATGTGTACAGTTGATGACCTGAAGGAAATGGGGATACCCCTTGGACCCAGAAAGAAGATAGCTAACTTTGTAGAACATA
AAGCAGCCAAACTGAAAAAAGCAGCGTCAGAAAAGAAGGCAGTGGCGGCCACTTCTACAAAAGGACAAGAGCAAAGTGCCCAGAAGACTA
AAGACATGGCTTCCCTCCCCTCAGAATCCAATGAGCCAAAGAGGAAACTTCCAGTTGGTGCTTGCGTGTCTTCTGTGTGTGTGAATTATG
AATCTTTTGAAGTTGGCGCCGGACAGGTTTCTGTTGCTTACAACTCATTAGATTTTGAACCAGAGATATTCTTTGCCTTGGGGTCTCCAA
TTGCTATGTTTCTCACTATTCGAGGAGTTGATAGGATAGATGAGAATTACAGCCTTCCTACCTGTAAAGGGTTCTTCAATATTTATCATC
CGCTTGATCCAGTGGCATATAGATTAGAACCTATGATTGTTCCAGATTTGGACCTAAAAGCTGTTCTCATTCCACATCACAAAGGCAGAA
AAAGACTTCATTTAGAATTGAAAGAGAGTCTCTCTCGTATGGGATCTGATTTGAAGCAGGGTTTTATTAGCTCTCTCAAAAGTGCTTGGC
AGACATTAAATGAGTTTGCCCGTGCTCATACGTCTTCAACCCAGTTGCAAGAAGAATTGGAGAAGGTGGCCAATCAGATCAAAGAAGAAG
AAGAAAAGCAAGTAGTTGAAGCAGAAAAGGTTGTTGAAAGTCCAGATTTTTCCAAGGATGAGGACTACTTAGGAAAGGTTGGAATGTTAA
ATGGAGGCCGCCGAATTGACTACGTTCTCCAAGAAAAACCAATAGAGAGTTTTAATGAATACCTTTTCGCTCTTCAGAGTCACTTATGCT
ATTGGGAATCTGAAGATACTGCTCTGTTACTACTTAAAGAAATTTATCGAACAATGAACATTAGTCCAGAACAGCCCCAGCATTGATCAA
ACTTCAGTTTTACTGTACTTTCTTGTCTGCACAGAAAGTCCCAGTACAACTTCCATTGCTGAGAAAATCCTCAGAGGACTTTCCCACTTC
GCTCCTGTGATGGATGACAGAAGAGTGATTCATTAACAATTGCTCAGCCACAATTCTCGGATATAGGGATTCAAAAGACAGGACACAGAA
CTAACACAGTGAAAAAAATCAGTACCACATTTGGACAGTATAGGTGAGAAAACATAATTATAAAAATGATGCCATGAAAAATTCCACAGA
TCAGTTTAGTTGTATAGTTGTCAAAGTTATATGTGATATCAATGAAGAAATATTTGTAGCATGTAAACGGTTATTTCTGTTTCTTAAAAA
GTATTGTTAGTGGGCTATTAAACTTGGATTTTTCTTTTTATTAATGCAGTATGTTCTTTTTATTCAAGTATGAACTTGTTGAGAAACTAT
AGTAATATGATTTTTAAGAGATTTATGTTCTACTTAAAATGTGAATTGTACTTCTGAGCTGCCTTAATGCAAGGTCATTTATATTTGTTA
AGAGGAAATAATCAAGATCACTCATATCCCAACTGAATCTGAGGTTTTATAAATCCCTCAAACGATTGCTGAGAGCCTGATTGTGGAAAG
AAGTGAGATGCACCTTATTTTCAAGAAGTCCTGGGAAGCGCTCTCCTAGCACGTCCATTTCCAGGAGGAGAAGCAAGCAGATGAGAGGTT
TTCCATTTTGTCATCCAAGGTAGCTGTGCACTTGCCTTGTTGCTGAAGTTCCAATAATGTGAAAACCAAAGTAGAGGTTTTTTTCTTCTT
CTTTTTGTTTTCTATTAATTTCACTTATACCAAAGTGTTTGAAAGTATGAAATGTGTTGCTTCTGAGTTATATAAGGCTACTTCATGACA
AGACTGCTTTGTAATATTTCACTTTGTTTTACTACAAATTCAGATCACTTTGTTTTACTATAAATTCAGATTATCCAAATATTTTCCTAA
TACTATGTGGGAATGCTGATTTTCTTTTGTTACGTAGTGGAAACATTTTGCATTGTTTACATAGTTCTCATGGAACATGGAAATTTTTGA
AAGTGATATATGATACACATTTTTTGTGTATGTATTCTAATTAGTGTGAATAAAGCAGTAACATTAATGCATTTTTTAAGCAGCAAACTT
ATGTATTTCTCTTGTCTTCCTTAAAAGTGTCCCCATGAACTCAGTGTTTATTCCCTTTTCATTTTGAGTACCTGCTTATATGGTCAGTAT
GTAACGTTAGCATTGGCTCCTAATGGTAGAATTAGAACAGCAAGATTGTAGAGCTGTAATTGACTCCAGACAACATAGATTTCAGCCACC
TCATTCTACAGCTGAGGCCAGGACAATAAATGCCTTTCCCAGACTGGGTAGTGGCAGATCTGGGATGGAATATGGTTTTCTTGATTCCCT
TTCAGCCTTCATTTCTCTCTCTCAGGACTACTACTTTTTAATTACTTTTCACTTAATTTCCCAATACTGATGAAATAAAGAAAAATGAGG

>7568_7568_9_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000474739_SEC23IP_chr10_121678956_ENST00000369075_length(amino acids)=638AA_BP=260
MLSLKIFLRPQSPFGYWVENTAFSQKRTRSCLMWHLDFGLHTGKTFQPLPFIIVVYPGAGRRWGLVLSLASHRRRTEAGGGTGPTSDTGW
GCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWS
SLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKMPEEPKLT
LDESYDLVVENKEVLTLQETLEALSLSEYFSTFEKEKIDMESLLMCTVDDLKEMGIPLGPRKKIANFVEHKAAKLKKAASEKKAVAATST
KGQEQSAQKTKDMASLPSESNEPKRKLPVGACVSSVCVNYESFEVGAGQVSVAYNSLDFEPEIFFALGSPIAMFLTIRGVDRIDENYSLP
TCKGFFNIYHPLDPVAYRLEPMIVPDLDLKAVLIPHHKGRKRLHLELKESLSRMGSDLKQGFISSLKSAWQTLNEFARAHTSSTQLQEEL
EKVANQIKEEEEKQVVEAEKVVESPDFSKDEDYLGKVGMLNGGRRIDYVLQEKPIESFNEYLFALQSHLCYWESEDTALLLLKEIYRTMN

--------------------------------------------------------------
>7568_7568_10_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000474739_SEC23IP_chr10_121678956_ENST00000543134_length(transcript)=2145nt_BP=845nt
AGTCGGCGGCCGGACTGGGAAGATGGACGCAGCTACTCTGACCTACGACACTCTCCGGTTTGCTGAGTTTGAAGATTTTCCTGAGACCTC
AGAGCCCGTTTGGATACTGGGTAGAAAATACAGCATTTTCACAGAAAAGGACGAGATCTTGTCTGATGTGGCATCTAGACTTTGGTTTAC
ATACAGGAAAAACTTTCCAGCCATTGCCATTCATCATTGTTGTATACCCTGGAGCTGGAAGGAGATGGGGACTGGTTCTCAGCCTTGCCT
CTCACCGGCGGAGAACTGAGGCCGGAGGGGGGACAGGCCCCACCTCGGACACAGGCTGGGGCTGCATGCTGCGGTGTGGACAGATGATCT
TTGCCCAAGCCCTGGTGTGCCGGCACCTAGGCCGAGATTGGAGGTGGACACAAAGGAAGAGGCAGCCAGACAGCTACTTCAGCGTCCTCA
ACGCATTCATCGACAGGAAGGACAGTTACTACTCCATTCACCAGATAGCGCAAATGGGAGTTGGCGAAGGCAAGTCCATAGGCCAGTGGT
ACGGGCCCAACACTGTCGCCCAGGTCCTGAAGAAGCTTGCTGTCTTCGATACGTGGAGCTCCTTGGCGGTCCACATTGCAATGGACAACA
CTGTTGTGATGGAGGAAATCAGAAGGTTGTGCAGGACCAGCGTTCCCTGTGCAGGCGCCACTGCGTTTCCTGCAGATTCCGACCGGCACT
GCAACGGATTCCCTGCCGGAGCTGAGGTCACCAACAGGCCGTCGCCATGGAGACCCCTGGTACTTCTCATTCCCCTGCGCCTGGGGCTCA
CGGACATCAACGAGGCCTACGTGGAGACGCTGAAGATGCCTGAAGAGCCAAAGCTGACTTTGGATGAGTCGTATGACCTTGTTGTTGAAA
ATAAAGAAGTCCTAACTTTGCAAGAAACTCTGGAAGCACTTAGCCTCTCTGAATATTTTAGCACTTTTGAAAAGGAAAAGATTGATATGG
AGTCCCTGCTTATGTGTACAGTTGATGACCTGAAGGAAATGGGGATACCCCTTGGACCCAGAAAGAAGATAGCTAACTTTGTAGAACATA
AAGCAGCCAAACTGAAAAAAGCAGCGTCAGAAAAGAAGGCAGTGGCGGCCACTTCTACAAAAGGACAAGAGCAAAGTGCCCAGAAGACTA
AAGACATGGCTTCCCTCCCCTCAGAATCCAATGAGCCAAAGAGGAAACTTCCAGTTGGTGCTTGCGTGTCTTCTGTGTGTGTGAATTATG
AATCTTTTGAAGTTGGCGCCGGACAGGTTTCTGTTGCTTACAACTCATTAGATTTTGAACCAGAGATATTCTTTGCCTTGGGGTCTCCAA
TTGCTATGTTTCTCACTATTCGAGGAGTTGATAGGATAGATGAGAATTACAGCCTTCCTACCTGTAAAGGGTTCTTCAATATTTATCATC
CGCTTGATCCAGTGGCATATAGATTAGAACCTATGATTGTTCCAGATTTGGACCTAAAAGCTGTTCTCATTCCACATCACAAAGGCAGAA
AAAGACTTCATTTAGAATTGAAAGAGAGTCTCTCTCGTATGGGATCTGATTTGAAGCAGGGTTTTATTAGCTCTCTCAAAAGTGCTTGGC
AGACATTAAATGAGTTTGCCCGTGCTCATACGTCTTCAACCCAGTTGCAAGAAGAATTGGAGAAGGTGGCCAATCAGATCAAAGAAGAAG
AAGAAAAGCAAGTAGTTGAAGCAGAAAAGGTTGTTGAAAGTCCAGATTTTTCCAAGGATGAGGACTACTTAGGAAAGGTTGGAATGTTAA
ATGGAGGCCGCCGAATTGACTACGTTCTCCAAGAAAAACCAATAGAGAGTTTTAATGAATACCTTTTCGCTCTTCAGAGTCACTTATGCT
ATTGGGAATCTGAAGATACTGCTCTGTTACTACTTAAAGAAATTTATCGAACAATGAACATTAGTCCAGAACAGCCCCAGCATTGATCAA
ACTTCAGTTTTACTGTACTTTCTTGTCTGCACAGAAAGTCCCAGTACAACTTCCATTGCTGAGAAAATCCTCAGAGGACTTTCCCACTTC

>7568_7568_10_ATG4B-SEC23IP_ATG4B_chr2_242606253_ENST00000474739_SEC23IP_chr10_121678956_ENST00000543134_length(amino acids)=638AA_BP=260
MLSLKIFLRPQSPFGYWVENTAFSQKRTRSCLMWHLDFGLHTGKTFQPLPFIIVVYPGAGRRWGLVLSLASHRRRTEAGGGTGPTSDTGW
GCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWS
SLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKMPEEPKLT
LDESYDLVVENKEVLTLQETLEALSLSEYFSTFEKEKIDMESLLMCTVDDLKEMGIPLGPRKKIANFVEHKAAKLKKAASEKKAVAATST
KGQEQSAQKTKDMASLPSESNEPKRKLPVGACVSSVCVNYESFEVGAGQVSVAYNSLDFEPEIFFALGSPIAMFLTIRGVDRIDENYSLP
TCKGFFNIYHPLDPVAYRLEPMIVPDLDLKAVLIPHHKGRKRLHLELKESLSRMGSDLKQGFISSLKSAWQTLNEFARAHTSSTQLQEEL
EKVANQIKEEEEKQVVEAEKVVESPDFSKDEDYLGKVGMLNGGRRIDYVLQEKPIESFNEYLFALQSHLCYWESEDTALLLLKEIYRTMN

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for ATG4B-SEC23IP


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with
TgeneSEC23IPchr2:242606253chr10:121678956ENST000003690759191_367624.01443.0SEC23A


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for ATG4B-SEC23IP


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for ATG4B-SEC23IP


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource