Center for Computational Systems Medicine
Fusion gene:SLC22A23-FOXP4 (FusionGDB2 ID:82458)

Fusion Gene Summary for SLC22A23-FOXP4

Fusion gene summary
Fusion gene information
Fusion gene name: SLC22A23-FOXP4
Fusion gene ID: 82458

Field | Hgene | Tgene
Gene symbol | SLC22A23 | FOXP4
Gene ID | 63027 | 116113
Gene name | solute carrier family 22 member 23 | forkhead box P4
Synonyms | C6orf85 | hFKHLA
Cytomap | 6p25.2 | 6p21.1
Type of gene | protein-coding | protein-coding
Description | solute carrier family 22 member 23; ion transporter protein | forkhead box protein P4; fork head-related protein like A; winged-helix repressor FOXP4
Modification date | 20200313 | 20200313
UniProtAcc | . | Q8IVH2
Ensembl transcripts involved in fusion gene | ENST00000380298, ENST00000406686, ENST00000436008, ENST00000380302, ENST00000433689, ENST00000490273 | ENST00000307972, ENST00000373057, ENST00000373060, ENST00000373063, ENST00000409208
Fusion gene scores: * DoF score | 12 X 5 X 9 = 540 | 5 X 6 X 4 = 120
# samples | 14 | 6
** MAII score | log2(14/540*10) = -1.94753258010586 | log2(6/120*10) = -1
Classification | possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs); DoF>8 and MAII<0 | possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs); DoF>8 and MAII<0
Context

PubMed: SLC22A23 [Title/Abstract] AND FOXP4 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint: SLC22A23(3456140)-FOXP4(41545724), # samples: 1
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
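
The two score definitions above can be checked with a few lines of Python. This is a minimal sketch: the function names are illustrative and are not part of FusionGDB, and the input counts are the ones listed in the summary for SLC22A23 and FOXP4.

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency = # partners x # break points x # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index = log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def is_peginpcfg(dof: int, maii: float) -> bool:
    """Criterion quoted on this page: DoF > 8 and MAII < 0."""
    return dof > 8 and maii < 0


# Counts listed on this page for the 5' partner (SLC22A23) and 3' partner (FOXP4).
slc22a23_dof = dof_score(12, 5, 9)
foxp4_dof = dof_score(5, 6, 4)
slc22a23_maii = maii_score(14, slc22a23_dof)
foxp4_maii = maii_score(6, foxp4_dof)

print("SLC22A23", slc22a23_dof, slc22a23_maii, is_peginpcfg(slc22a23_dof, slc22a23_maii))
print("FOXP4", foxp4_dof, foxp4_maii, is_peginpcfg(foxp4_dof, foxp4_maii))
```

Both partners satisfy DoF>8 and MAII<0, which matches the peGinPCFGs label reported for each gene.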

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
Partner | Gene | GO ID | GO term | PubMed ID

Fusion gene breakpoints across SLC22A23 (5'-gene)
[Figure: all structure]

Fusion gene breakpoints across FOXP4 (3'-gene)
[Figure: all structure]

Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinates were lifted over to hg19.
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
ChimerDB4 | STAD | TCGA-BR-A4PD-01A | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +



Fusion Gene ORF analysis for SLC22A23-FOXP4

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
In-frame | ENST00000380298 | ENST00000307972 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
In-frame | ENST00000380298 | ENST00000373057 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
In-frame | ENST00000380298 | ENST00000373060 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
In-frame | ENST00000380298 | ENST00000373063 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
In-frame | ENST00000380298 | ENST00000409208 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
In-frame | ENST00000406686 | ENST00000307972 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
In-frame | ENST00000406686 | ENST00000373057 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
In-frame | ENST00000406686 | ENST00000373060 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
In-frame | ENST00000406686 | ENST00000373063 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
In-frame | ENST00000406686 | ENST00000409208 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
In-frame | ENST00000436008 | ENST00000307972 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
In-frame | ENST00000436008 | ENST00000373057 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
In-frame | ENST00000436008 | ENST00000373060 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
In-frame | ENST00000436008 | ENST00000373063 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
In-frame | ENST00000436008 | ENST00000409208 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
intron-3CDS | ENST00000380302 | ENST00000307972 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
intron-3CDS | ENST00000380302 | ENST00000373057 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
intron-3CDS | ENST00000380302 | ENST00000373060 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
intron-3CDS | ENST00000380302 | ENST00000373063 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
intron-3CDS | ENST00000380302 | ENST00000409208 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
intron-3CDS | ENST00000433689 | ENST00000307972 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
intron-3CDS | ENST00000433689 | ENST00000373057 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
intron-3CDS | ENST00000433689 | ENST00000373060 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
intron-3CDS | ENST00000433689 | ENST00000373063 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
intron-3CDS | ENST00000433689 | ENST00000409208 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
intron-3CDS | ENST00000490273 | ENST00000307972 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
intron-3CDS | ENST00000490273 | ENST00000373057 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
intron-3CDS | ENST00000490273 | ENST00000373060 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
intron-3CDS | ENST00000490273 | ENST00000373063 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +
intron-3CDS | ENST00000490273 | ENST00000409208 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | +

ORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
Henst | Hgene | Hchr | Hbp | Hstrand | Tenst | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids)
ENST00000436008 | SLC22A23 | chr6 | 3456140 | - | ENST00000373060 | FOXP4 | chr6 | 41545724 | + | 6403 | 1117 | 307 | 2955 | 882
ENST00000436008 | SLC22A23 | chr6 | 3456140 | - | ENST00000373063 | FOXP4 | chr6 | 41545724 | + | 6365 | 1117 | 307 | 2916 | 869
ENST00000436008 | SLC22A23 | chr6 | 3456140 | - | ENST00000409208 | FOXP4 | chr6 | 41545724 | + | 3943 | 1117 | 307 | 2919 | 870
ENST00000436008 | SLC22A23 | chr6 | 3456140 | - | ENST00000373057 | FOXP4 | chr6 | 41545724 | + | 3977 | 1117 | 307 | 2949 | 880
ENST00000436008 | SLC22A23 | chr6 | 3456140 | - | ENST00000307972 | FOXP4 | chr6 | 41545724 | + | 4646 | 1117 | 307 | 2955 | 882
ENST00000406686 | SLC22A23 | chr6 | 3456140 | - | ENST00000373060 | FOXP4 | chr6 | 41545724 | + | 5940 | 654 | 0 | 2492 | 830
ENST00000406686 | SLC22A23 | chr6 | 3456140 | - | ENST00000373063 | FOXP4 | chr6 | 41545724 | + | 5902 | 654 | 0 | 2453 | 817
ENST00000406686 | SLC22A23 | chr6 | 3456140 | - | ENST00000409208 | FOXP4 | chr6 | 41545724 | + | 3480 | 654 | 0 | 2456 | 818
ENST00000406686 | SLC22A23 | chr6 | 3456140 | - | ENST00000373057 | FOXP4 | chr6 | 41545724 | + | 3514 | 654 | 0 | 2486 | 828
ENST00000406686 | SLC22A23 | chr6 | 3456140 | - | ENST00000307972 | FOXP4 | chr6 | 41545724 | + | 4183 | 654 | 0 | 2492 | 830
ENST00000380298 | SLC22A23 | chr6 | 3456140 | - | ENST00000373060 | FOXP4 | chr6 | 41545724 | + | 5940 | 654 | 0 | 2492 | 830
ENST00000380298 | SLC22A23 | chr6 | 3456140 | - | ENST00000373063 | FOXP4 | chr6 | 41545724 | + | 5902 | 654 | 0 | 2453 | 817
ENST00000380298 | SLC22A23 | chr6 | 3456140 | - | ENST00000409208 | FOXP4 | chr6 | 41545724 | + | 3480 | 654 | 0 | 2456 | 818
ENST00000380298 | SLC22A23 | chr6 | 3456140 | - | ENST00000373057 | FOXP4 | chr6 | 41545724 | + | 3514 | 654 | 0 | 2486 | 828
ENST00000380298 | SLC22A23 | chr6 | 3456140 | - | ENST00000307972 | FOXP4 | chr6 | 41545724 | + | 4183 | 654 | 0 | 2492 | 830
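
The predicted start, predicted stop, and amino acid length columns above appear to be related by simple arithmetic. The sketch below assumes that both coordinates are inclusive and that the span ends with the stop codon; this is an observation that fits the rows of this table, not a documented ORFfinder convention, and `orf_length_aa` is a hypothetical helper name.

```python
def orf_length_aa(start: int, stop: int) -> int:
    """Protein length implied by an ORF span.

    Assumes `start` and `stop` are inclusive transcript coordinates and that
    the final codon of the span is the stop codon (an assumption that fits
    this table; it is not taken from ORFfinder documentation).
    """
    n_nt = stop - start + 1
    if n_nt % 3 != 0:
        raise ValueError("ORF span is not a multiple of three nucleotides")
    return n_nt // 3 - 1  # subtract the stop codon


# (predicted start, predicted stop, reported amino acid length) from the table
rows = [
    (307, 2955, 882),
    (307, 2916, 869),
    (307, 2919, 870),
    (307, 2949, 880),
    (0, 2492, 830),
    (0, 2453, 817),
    (0, 2456, 818),
    (0, 2486, 828),
]

mismatches = [r for r in rows if orf_length_aa(r[0], r[1]) != r[2]]
print("rows that disagree with the reported length:", mismatches)
```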

DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on a convolutional neural network, built by comparison with real Ribo-seq data. If the no-coding score is < 0.5 and the coding score is > 0.5, the in-frame fusion transcript is predicted as likely to be translated.
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score
ENST00000436008 | ENST00000373060 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | + | 0.018746123 | 0.98125386
ENST00000436008 | ENST00000373063 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | + | 0.018386856 | 0.98161316
ENST00000436008 | ENST00000409208 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | + | 0.052142903 | 0.9478571
ENST00000436008 | ENST00000373057 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | + | 0.046982415 | 0.95301765
ENST00000436008 | ENST00000307972 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | + | 0.04155933 | 0.95844066
ENST00000406686 | ENST00000373060 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | + | 0.01753779 | 0.98246217
ENST00000406686 | ENST00000373063 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | + | 0.017014826 | 0.9829852
ENST00000406686 | ENST00000409208 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | + | 0.055613503 | 0.9443864
ENST00000406686 | ENST00000373057 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | + | 0.050419804 | 0.94958025
ENST00000406686 | ENST00000307972 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | + | 0.041234367 | 0.9587657
ENST00000380298 | ENST00000373060 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | + | 0.01753779 | 0.98246217
ENST00000380298 | ENST00000373063 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | + | 0.017014826 | 0.9829852
ENST00000380298 | ENST00000409208 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | + | 0.055613503 | 0.9443864
ENST00000380298 | ENST00000373057 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | + | 0.050419804 | 0.94958025
ENST00000380298 | ENST00000307972 | SLC22A23 | chr6 | 3456140 | - | FOXP4 | chr6 | 41545724 | + | 0.041234367 | 0.9587657
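
The DeepORF decision rule stated above (no-coding score < 0.5 and coding score > 0.5) is easy to apply to the scores in this table. The following is a minimal sketch of that rule only; it does not run DeepORF itself, and the function and variable names are illustrative.

```python
def likely_translated(no_coding: float, coding: float, threshold: float = 0.5) -> bool:
    """Apply the rule quoted on this page: no-coding < 0.5 and coding > 0.5."""
    return no_coding < threshold and coding > threshold


# (Henst, Tenst) -> (no-coding score, coding score), copied from the table above
deeporf_scores = {
    ("ENST00000436008", "ENST00000373060"): (0.018746123, 0.98125386),
    ("ENST00000436008", "ENST00000409208"): (0.052142903, 0.9478571),
    ("ENST00000406686", "ENST00000373063"): (0.017014826, 0.9829852),
    ("ENST00000380298", "ENST00000307972"): (0.041234367, 0.9587657),
}

calls = {pair: likely_translated(*scores) for pair, scores in deeporf_scores.items()}
for pair, call in calls.items():
    print(pair, "likely translated" if call else "not predicted as translated")
```

Every transcript pair listed for this fusion gene has a coding score above 0.94, so all of them pass the rule.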


Fusion Genomic Features for SLC22A23-FOXP4


FusionAI prediction of the potential fusion gene breakpoint based on the pre-mRNA sequence context (+/- 5 kb for each partner gene, 20 kb of sequence in total). FusionAI is a fusion gene breakpoint classifier based on a convolutional neural network, built by comparing the fusion-positive and fusion-negative sequence contexts of ~20K fusion gene data. From this, we can estimate how likely each sequence within the 20 kb genomic context is to be used as a gene fusion breakpoint.
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint)
SLC22A23 | chr6 | 3456139 | - | FOXP4 | chr6 | 41545723 | + | 0.013489949 | 0.9865101
SLC22A23 | chr6 | 3456139 | - | FOXP4 | chr6 | 41545723 | + | 0.013489949 | 0.9865101
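
To make the 20 kb sequence context concrete, the sketch below derives a 10 kb window around each partner's breakpoint. It assumes the +/- 5 kb flank is centered on the breakpoint coordinate and uses a half-open interval; the exact coordinate convention FusionAI uses is not stated on this page, and `breakpoint_window` is a hypothetical helper, not a FusionAI function.

```python
from typing import NamedTuple

FLANK = 5_000  # +/- 5 kb on each side of the breakpoint (assumed centering)


class Window(NamedTuple):
    chrom: str
    start: int  # inclusive
    end: int    # exclusive

    @property
    def length(self) -> int:
        return self.end - self.start


def breakpoint_window(chrom: str, bp: int, flank: int = FLANK) -> Window:
    """Return the genomic window spanning `flank` bases on both sides of `bp`."""
    return Window(chrom, max(0, bp - flank), bp + flank)


# Breakpoints listed in the FusionAI table for this fusion gene
head = breakpoint_window("chr6", 3456139)    # SLC22A23 (5' partner)
tail = breakpoint_window("chr6", 41545723)   # FOXP4 (3' partner)
total_length = head.length + tail.length

print(head, tail, total_length)
```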

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions. We integrated 44 different types of human genomic feature loci across five broad categories: virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are on the help page.
[Figure: genomic feature]

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions that overlap the top 1% feature importance score regions. More details are on the help page.
[Figure: genomic feature of top 1%]


Fusion Protein Features for SLC22A23-FOXP4


Four levels of functional features of fusion genes
Go to the FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr6:3456140/chr6:41545724)
- FGviewer provides online visualization of the retention of protein functional features across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the Tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to find the hyperlink to the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.

Main function of each fusion partner protein (from UniProt)
Hgene | Tgene
. | FOXP4, Q8IVH2

FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.
FUNCTION: Transcriptional repressor that represses lung-specific expression. {ECO:0000250}.

Retention analysis result of each fusion partner protein across 39 protein features of UniProt: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, we show only the retention information for the 13 regional features. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the break point is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Tgene | FOXP4 | chr6:3456140 | chr6:41545724 | ENST00000307972 |  | 0 | 16 | 65_219 | 68 | 681.0 | Compositional bias | Note=Gln-rich
Tgene | FOXP4 | chr6:3456140 | chr6:41545724 | ENST00000373057 |  | 1 | 17 | 65_219 | 68 | 679.0 | Compositional bias | Note=Gln-rich
Tgene | FOXP4 | chr6:3456140 | chr6:41545724 | ENST00000373060 |  | 1 | 17 | 65_219 | 68 | 681.0 | Compositional bias | Note=Gln-rich
Tgene | FOXP4 | chr6:3456140 | chr6:41545724 | ENST00000373063 |  | 1 | 17 | 65_219 | 68 | 668.0 | Compositional bias | Note=Gln-rich
Tgene | FOXP4 | chr6:3456140 | chr6:41545724 | ENST00000307972 |  | 0 | 16 | 467_559 | 68 | 681.0 | DNA binding | Fork-head
Tgene | FOXP4 | chr6:3456140 | chr6:41545724 | ENST00000373057 |  | 1 | 17 | 467_559 | 68 | 679.0 | DNA binding | Fork-head
Tgene | FOXP4 | chr6:3456140 | chr6:41545724 | ENST00000373060 |  | 1 | 17 | 467_559 | 68 | 681.0 | DNA binding | Fork-head
Tgene | FOXP4 | chr6:3456140 | chr6:41545724 | ENST00000373063 |  | 1 | 17 | 467_559 | 68 | 668.0 | DNA binding | Fork-head
Tgene | FOXP4 | chr6:3456140 | chr6:41545724 | ENST00000307972 |  | 0 | 16 | 349_370 | 68 | 681.0 | Region | Note=Leucine-zipper
Tgene | FOXP4 | chr6:3456140 | chr6:41545724 | ENST00000373057 |  | 1 | 17 | 349_370 | 68 | 679.0 | Region | Note=Leucine-zipper
Tgene | FOXP4 | chr6:3456140 | chr6:41545724 | ENST00000373060 |  | 1 | 17 | 349_370 | 68 | 681.0 | Region | Note=Leucine-zipper
Tgene | FOXP4 | chr6:3456140 | chr6:41545724 | ENST00000373063 |  | 1 | 17 | 349_370 | 68 | 668.0 | Region | Note=Leucine-zipper
Tgene | FOXP4 | chr6:3456140 | chr6:41545724 | ENST00000307972 |  | 0 | 16 | 307_332 | 68 | 681.0 | Zinc finger | Note=C2H2-type
Tgene | FOXP4 | chr6:3456140 | chr6:41545724 | ENST00000373057 |  | 1 | 17 | 307_332 | 68 | 679.0 | Zinc finger | Note=C2H2-type
Tgene | FOXP4 | chr6:3456140 | chr6:41545724 | ENST00000373060 |  | 1 | 17 | 307_332 | 68 | 681.0 | Zinc finger | Note=C2H2-type
Tgene | FOXP4 | chr6:3456140 | chr6:41545724 | ENST00000373063 |  | 1 | 17 | 307_332 | 68 | 668.0 | Zinc finger | Note=C2H2-type

- In-frame and not-retained protein feature among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380298 | - | 1 | 4 | 234_254 | 218 | 362.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380298 | - | 1 | 4 | 258_278 | 218 | 362.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380298 | - | 1 | 4 | 288_308 | 218 | 362.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380298 | - | 1 | 4 | 315_335 | 218 | 362.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380298 | - | 1 | 4 | 344_364 | 218 | 362.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380298 | - | 1 | 4 | 467_487 | 218 | 362.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380298 | - | 1 | 4 | 494_514 | 218 | 362.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380298 | - | 1 | 4 | 538_558 | 218 | 362.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380298 | - | 1 | 4 | 569_589 | 218 | 362.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380298 | - | 1 | 4 | 598_618 | 218 | 362.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380302 | - | 1 | 10 | 234_254 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380302 | - | 1 | 10 | 258_278 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380302 | - | 1 | 10 | 288_308 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380302 | - | 1 | 10 | 315_335 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380302 | - | 1 | 10 | 344_364 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380302 | - | 1 | 10 | 467_487 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380302 | - | 1 | 10 | 494_514 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380302 | - | 1 | 10 | 538_558 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380302 | - | 1 | 10 | 569_589 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000380302 | - | 1 | 10 | 598_618 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000406686 | - | 1 | 10 | 234_254 | 218 | 687.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000406686 | - | 1 | 10 | 258_278 | 218 | 687.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000406686 | - | 1 | 10 | 288_308 | 218 | 687.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000406686 | - | 1 | 10 | 315_335 | 218 | 687.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000406686 | - | 1 | 10 | 344_364 | 218 | 687.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000406686 | - | 1 | 10 | 467_487 | 218 | 687.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000406686 | - | 1 | 10 | 494_514 | 218 | 687.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000406686 | - | 1 | 10 | 538_558 | 218 | 687.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000406686 | - | 1 | 10 | 569_589 | 218 | 687.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000406686 | - | 1 | 10 | 598_618 | 218 | 687.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000490273 | - | 1 | 11 | 234_254 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000490273 | - | 1 | 11 | 258_278 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000490273 | - | 1 | 11 | 288_308 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000490273 | - | 1 | 11 | 315_335 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000490273 | - | 1 | 11 | 344_364 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000490273 | - | 1 | 11 | 467_487 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000490273 | - | 1 | 11 | 494_514 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000490273 | - | 1 | 11 | 538_558 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000490273 | - | 1 | 11 | 569_589 | 0 | 406.0 | Transmembrane | Helical
Hgene | SLC22A23 | chr6:3456140 | chr6:41545724 | ENST00000490273 | - | 1 | 11 | 598_618 | 0 | 406.0 | Transmembrane | Helical



Fusion Gene Sequence for SLC22A23-FOXP4


For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To obtain the fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among all the predicted ones.
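
The "longest ORF" selection described above can be illustrated with a simplified scan. This sketch is not ORFfinder: it assumes ATG-only start codons, the standard stop codons, and the forward strand only, and it uses a short toy sequence rather than one of the fusion transcripts. The names `find_orfs` and `longest_orf` are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq: str) -> list[tuple[int, int]]:
    """Return (start, end) spans of ATG...stop ORFs in the three forward frames.

    Spans are 0-based and half-open, and include the stop codon.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first in-frame ATG opens the ORF
            elif start is not None and codon in STOP_CODONS:
                orfs.append((start, i + 3))
                start = None  # look for the next ORF in this frame
    return orfs


def longest_orf(seq: str):
    """Pick the longest ORF span, or None if no complete ORF is found."""
    orfs = find_orfs(seq)
    return max(orfs, key=lambda span: span[1] - span[0]) if orfs else None


toy_transcript = "GGATGAAACCCGGGTAGATGTTTTAA"  # toy example, not from FusionGDB
all_orfs = find_orfs(toy_transcript)
best = longest_orf(toy_transcript)
print(all_orfs, best)
```

A real pipeline would also translate the selected span and could consider alternative start codons, which this sketch leaves out.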
>82458_82458_1_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000380298_FOXP4_chr6_41545724_ENST00000307972_length(transcript)=4183nt_BP=654nt
ATGGCCATAGACCGGCGGCGCGAGGCGGCGGGCGGCGGGCCTGGGCGGCAGCCGGCCCCGGCCGAGGAGAACGGCTCCCTGCCGCCCGGG
GACGCGGCGGCCTCGGCGCCCCTCGGGGGACGCGCGGGCCCCGGCGGCGGCGCGGAGATCCAGCCGCTGCCCCCACTGCATCCTGGAGGC
GGCCCGCACCCGAGCTGCTGCTCCGCGGCTGCGGCCCCGAGCCTCTTGTTGCTGGACTATGACGGGTCGGTGCTGCCCTTCCTCGGGGGC
CTGGGCGGGGGCTATCAGAAGACCCTCGTGCTGCTCACCTGGATCCCGGCGCTGTTCATCGGCTTCAGCCAGTTCTCGGACTCGTTCCTC
CTGGACCAGCCCAACTTCTGGTGCCGCGGGGCCGGCAAAGGCACCGAGCTGGCAGGGGTCACCACCACAGGCCGGGGCGGGGACATGGGC
AACTGGACCAGCCTCCCCACCACCCCCTTCGCCACTGCCCCCTGGGAGGCTGCGGGCAACCGGAGCAACAGCAGCGGCGCGGACGGAGGC
GACACACCACCCCTGCCATCCCCTCCGGACAAGGGGGACAACGCCTCCAACTGTGACTGCCGCGCATGGGACTACGGCATCCGCGCCGGC
CTCGTCCAGAACGTGGTCAGCAAGGCTCTCCAAGTGGCCCGGCAGTTCCTGCTGCAGCAGGCCTCAGGCCTGAGCTCCCCAGGGAACAAT
GACAGCAAACAGTCTGCCTCTGCTGTGCAGGTGCCTGTGTCGGTGGCCATGATGTCGCCGCAGATGCTTACCCCGCAACAGATGCAGCAG
ATCCTGTCGCCCCCGCAGCTGCAGGCCTTGCTCCAGCAGCAGCAAGCCCTCATGCTCCAGCAGCTACAGGAGTACTACAAGAAGCAGCAG
GAGCAGCTCCACCTGCAGCTCCTCACCCAGCAGCAGGCTGGGAAACCGCAGCCCAAAGAGGCACTGGGGAACAAGCAGCTGGCCTTCCAG
CAGCAGCTCCTGCAAATGCAACAGTTGCAGCAGCAGCACCTGCTCAACCTGCAGAGGCAGGGGCTGGTCAGCCTGCAGCCCAACCAAGCC
TCGGGGCCCCTCCAGACCCTTCCGCAAGCAGCTGTTTGCCCAACAGACCTGCCCCAGCTGTGGAAGGGCGAGGGTGCCCCCGGGCAGCCT
GCCGAGGACAGCGTCAAGCAGGAGGGGCTGGACCTCACTGGCACGGCCGCCACCGCTACCTCGTTTGCCGCTCCCCCCAAGGTCTCACCC
CCCCTCTCCCACCATACCCTGCCCAACGGACAGCCTACTGTGCTCACATCTCGGAGAGACAGCTCTTCCCACGAGGAGACCCCCGGCTCC
CACCCCCTGTACGGACACGGAGAGTGCAAGTGGCCAGGCTGTGAGACCCTGTGTGAAGACCTGGGCCAGTTTATCAAACACCTCAACACA
GAGCACGCCCTGGATGACCGGAGTACAGCCCAGTGCCGGGTACAGATGCAGGTGGTGCAGCAGCTGGAGATCCAGCTCGCCAAGGAGAGC
GAGCGGCTGCAGGCCATGATGGCCCACCTGCACATGCGGCCCTCGGAGCCCAAGCCCTTCAGCCAGCCACTGAACCCGGTCCCCGGCTCC
TCCTCATTCTCCAAGGTGACCGTCTCTGCAGCAGACTCATTCCCAGATGGTCTCGTGCACCCCCCGACCTCGGCCGCAGCCCCTGTCACC
CCTCTACGGCCCCCTGGCCTGGGCTCTGCCTCCCTGCATGGTGGGGGCCCAGCCCGTCGGAGAAGCAGTGACAAGTTCTGCTCCCCCATC
TCCTCAGAGCTGGCCCAGAATCATGAGTTCTACAAGAACGCCGACGTCCGGCCCCCCTTCACCTACGCCTCCCTCATCCGCCAGGCCATC
CTGGAAACCCCTGACAGGCAGCTGACCCTGAATGAGATCTATAACTGGTTCACCAGGATGTTCGCCTATTTCCGCAGAAACACTGCCACC
TGGAAGAACGCCGTGCGCCACAACCTCAGCCTGCACAAGTGCTTCGTCCGCGTGGAGAACGTCAAGGGTGCCGTGTGGACTGTGGACGAG
CGGGAGTATCAGAAGCGGAGACCGCCAAAGATGACAGGGAGCCCCACCCTGGTGAAGAACATGATCTCTGGCCTCAGCTATGGAGCACTT
AATGCCAGCTACCAGGCCGCCCTGGCCGAGAGCAGCTTCCCCCTCCTCAACAGCCCTGGCATGCTGAACCCTGGCTCCGCCAGCAGCCTG
CTGCCCCTCAGCCACGATGACGTGGGTGCCCCCGTGGAGCCGCTGCCCAGCAACGGCAGCAGCAGCCCTCCTCGCCTCTCCCCGCCCCAG
TACAGCCACCAGGTGCAGGTGAAGGAGGAGCCAGCAGAGGCAGAGGAAGACAGGCAGCCCGGGCCTCCCCTGGGCGCCCCTAACCCCAGC
GCCTCGGGGCCTCCGGAAGACAGGGACCTGGAGGAGGAGCTGCCGGGAGAAGAACTGTCCTAAGGGCCTGTAGTGACCGGCAGGGCTGGG
GTGAGACCCCTCCCTTCCAGAATCCAGGCCCCATCTCCCCCAACTCCACAGCCCCTCCCGAGCCTCAAGGCAAGTCCAGGACTCAGACCG
GGGAGGCCCGGGCCAGCAGCTCCCAGTGTGACCTGACAAAAACACGTAGGGGCAGGGACGGTCCCCACCCCCAGGGACACAACCCCTGGT
CTTGGACCAGTAGAGGACACGGAGGGTTCAGACCCCTCCTCAGACCCTCCCCACATCTGAAACTGCCTCCCCCCAACCACCAGCAGCAGC
AGGGCCCTCCTCCCCCACCAGCTCTCCCCACAGGGCCCCTCAGCATCATGGAGACCCGCAGGCGGGGCTTAGCCACCCCTCAAACCCAGG
GCCCCCTGGCACCTGGCTCTGGCCGTGTTTTCTGGCCAGAGGCCCCCACTTTCCTAACTCGTGCTCCCTTCCGCCTTCTTTTCCGTACTG
TGAAGAAAGAACTCTCCACCCCAGCTCCCACCCTGCCCTGGCCTGGGTGGAGGAACTGTGCCTCCATCCCCAGAAGAAACAGCCCCCTCT
GCTGCTGGGGTGGGACTGTCTGTGTGCCCTGTGGGGGTCCGTGTGAGCAGGCCCACCTGGCTCCAGACCCGCCCCCAACCTGAGACAGAA
CCAGGCTGAGCCAGGCCTCCACCCCCACCCCCGTTTGCTGGGGGCTCCTCCAGCCGCCCCCATGGGAAGAGGCCTGGTACCGCCTCACCC
ACAGAGGTCTGTGCCAGGTGCGCTTCTGCAGGTGGAGCCAAGCTCTCCCTGAGGCCAGAGGCGGGGCCTGGGCCGGGAGCCCAGGGGAAG
GCCAGGCTGGACCCCGGCTCCACACCCACATCCAGCCTGCAGGCCTCTCTGCAGTCCTCTCACCCTCCCTCAGCTCCCCTTCCTCTGCAG
TCACCCTCAGCTCCCCTTCCTTGCCCGCCTCTCCCCCCGCCGCCCCACCAGTTAAACGGATGACCAAAGACCTTTCTTATGCCGGAAGCA
AAAACCAAAACTTTTTGTTGGCTTTTTCCTTTGTCGCCTCCCCAGCACCTGCCCTCCCAGTCTCCCACCCCGGCCCCAGGCTGGAAGCCC
TCCCTCCACTTAAGTTATTGTTTTAAACCAAAGTTTACAGTGTCTGTTGGTGGCCAAGACCTTCTCTCTCCACCCCTCCTCCATCCACCC
TGAGGACCCTGGGGCTCAGTGGAGGCAGGGCCCTGCCCCCCTCCCTTCCGCTCCTGCCCAGCCTGGGGGAAGGAGAAAGGAGGGGAGAAA
GCGGGCTCTCACCCCCTCAGGAGTGGGCACGGGAGCCCTTCTCCCTGACCCTGGGCTGCTTCCTGGGGGCTCTCCAGACCCCTCTCTAGG
ACCAAGTCACCCGTCGTGCTGGGAGTGTGGATTCTAGCAAAAGAGCTGGAAAAAAGTCAGACTCTCCACAGACCCCCTATGGGGGACCCC
CAACTCAAGGCCAAGGACTGGGCGTATCGGATGCTCATAACACCCCTGGCCTGGCCCCTTTACTGAGAAGACTCCTTGGATATTTCCCAA
GAACCCCCCACATACACCCCTCACAAGCCACCCCTCCTGAGAGGCAGGGGGCCCTCCGCCCCCTCCCCATGTATTCCCCACCTGTGTTCC

>82458_82458_1_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000380298_FOXP4_chr6_41545724_ENST00000307972_length(amino acids)=830AA_BP=218
MAIDRRREAAGGGPGRQPAPAEENGSLPPGDAAASAPLGGRAGPGGGAEIQPLPPLHPGGGPHPSCCSAAAAPSLLLLDYDGSVLPFLGG
LGGGYQKTLVLLTWIPALFIGFSQFSDSFLLDQPNFWCRGAGKGTELAGVTTTGRGGDMGNWTSLPTTPFATAPWEAAGNRSNSSGADGG
DTPPLPSPPDKGDNASNCDCRAWDYGIRAGLVQNVVSKALQVARQFLLQQASGLSSPGNNDSKQSASAVQVPVSVAMMSPQMLTPQQMQQ
ILSPPQLQALLQQQQALMLQQLQEYYKKQQEQLHLQLLTQQQAGKPQPKEALGNKQLAFQQQLLQMQQLQQQHLLNLQRQGLVSLQPNQA
SGPLQTLPQAAVCPTDLPQLWKGEGAPGQPAEDSVKQEGLDLTGTAATATSFAAPPKVSPPLSHHTLPNGQPTVLTSRRDSSSHEETPGS
HPLYGHGECKWPGCETLCEDLGQFIKHLNTEHALDDRSTAQCRVQMQVVQQLEIQLAKESERLQAMMAHLHMRPSEPKPFSQPLNPVPGS
SSFSKVTVSAADSFPDGLVHPPTSAAAPVTPLRPPGLGSASLHGGGPARRRSSDKFCSPISSELAQNHEFYKNADVRPPFTYASLIRQAI
LETPDRQLTLNEIYNWFTRMFAYFRRNTATWKNAVRHNLSLHKCFVRVENVKGAVWTVDEREYQKRRPPKMTGSPTLVKNMISGLSYGAL
NASYQAALAESSFPLLNSPGMLNPGSASSLLPLSHDDVGAPVEPLPSNGSSSPPRLSPPQYSHQVQVKEEPAEAEEDRQPGPPLGAPNPS

--------------------------------------------------------------
>82458_82458_2_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000380298_FOXP4_chr6_41545724_ENST00000373057_length(transcript)=3514nt_BP=654nt
ATGGCCATAGACCGGCGGCGCGAGGCGGCGGGCGGCGGGCCTGGGCGGCAGCCGGCCCCGGCCGAGGAGAACGGCTCCCTGCCGCCCGGG
GACGCGGCGGCCTCGGCGCCCCTCGGGGGACGCGCGGGCCCCGGCGGCGGCGCGGAGATCCAGCCGCTGCCCCCACTGCATCCTGGAGGC
GGCCCGCACCCGAGCTGCTGCTCCGCGGCTGCGGCCCCGAGCCTCTTGTTGCTGGACTATGACGGGTCGGTGCTGCCCTTCCTCGGGGGC
CTGGGCGGGGGCTATCAGAAGACCCTCGTGCTGCTCACCTGGATCCCGGCGCTGTTCATCGGCTTCAGCCAGTTCTCGGACTCGTTCCTC
CTGGACCAGCCCAACTTCTGGTGCCGCGGGGCCGGCAAAGGCACCGAGCTGGCAGGGGTCACCACCACAGGCCGGGGCGGGGACATGGGC
AACTGGACCAGCCTCCCCACCACCCCCTTCGCCACTGCCCCCTGGGAGGCTGCGGGCAACCGGAGCAACAGCAGCGGCGCGGACGGAGGC
GACACACCACCCCTGCCATCCCCTCCGGACAAGGGGGACAACGCCTCCAACTGTGACTGCCGCGCATGGGACTACGGCATCCGCGCCGGC
CTCGTCCAGAACGTGGTCAGCAAGGCTCTCCAAGTGGCCCGGCAGTTCCTGCTGCAGCAGGCCTCAGGCCTGAGCTCCCCAGGGAACAAT
GACAGCAAACAGTCTGCCTCTGCTGTGCAGGTGCCTGTGTCGGTGGCCATGATGTCGCCGCAGATGCTTACCCCGCAACAGATGCAGCAG
ATCCTGTCGCCCCCGCAGCTGCAGGCCTTGCTCCAGCAGCAGCAAGCCCTCATGCTCCAGCAGGAGTACTACAAGAAGCAGCAGGAGCAG
CTCCACCTGCAGCTCCTCACCCAGCAGCAGGCTGGGAAACCGCAGCCCAAAGAGGCACTGGGGAACAAGCAGCTGGCCTTCCAGCAGCAG
CTCCTGCAAATGCAACAGTTGCAGCAGCAGCACCTGCTCAACCTGCAGAGGCAGGGGCTGGTCAGCCTGCAGCCCAACCAAGCCTCGGGG
CCCCTCCAGACCCTTCCGCAAGCAGCTGTTTGCCCAACAGACCTGCCCCAGCTGTGGAAGGGCGAGGGTGCCCCCGGGCAGCCTGCCGAG
GACAGCGTCAAGCAGGAGGGGCTGGACCTCACTGGCACGGCCGCCACCGCTACCTCGTTTGCCGCTCCCCCCAAGGTCTCACCCCCCCTC
TCCCACCATACCCTGCCCAACGGACAGCCTACTGTGCTCACATCTCGGAGAGACAGCTCTTCCCACGAGGAGACCCCCGGCTCCCACCCC
CTGTACGGACACGGAGAGTGCAAGTGGCCAGGCTGTGAGACCCTGTGTGAAGACCTGGGCCAGTTTATCAAACACCTCAACACAGAGCAC
GCCCTGGATGACCGGAGTACAGCCCAGTGCCGGGTACAGATGCAGGTGGTGCAGCAGCTGGAGATCCAGCTCGCCAAGGAGAGCGAGCGG
CTGCAGGCCATGATGGCCCACCTGCACATGCGGCCCTCGGAGCCCAAGCCCTTCAGCCAGCCACTGAACCCGGTCCCCGGCTCCTCCTCA
TTCTCCAAGGTGACCGTCTCTGCAGCAGACTCATTCCCAGATGGTCTCGTGCACCCCCCGACCTCGGCCGCAGCCCCTGTCACCCCTCTA
CGGCCCCCTGGCCTGGGCTCTGCCTCCCTGCATGGTGGGGGCCCAGCCCGTCGGAGAAGCAGTGACAAGTTCTGCTCCCCCATCTCCTCA
GAGCTGGCCCAGAATCATGAGTTCTACAAGAACGCCGACGTCCGGCCCCCCTTCACCTACGCCTCCCTCATCCGCCAGGCCATCCTGGAA
ACCCCTGACAGGCAGCTGACCCTGAATGAGATCTATAACTGGTTCACCAGGATGTTCGCCTATTTCCGCAGAAACACTGCCACCTGGAAG
AACGCCGTGCGCCACAACCTCAGCCTGCACAAGTGCTTCGTCCGCGTGGAGAACGTCAAGGGTGCCGTGTGGACTGTGGACGAGCGGGAG
TATCAGAAGCGGAGACCGCCAAAGATGACAGGGAGCCCCACCCTGGTGAAGAACATGATCTCTGGCCTCAGCTATGGAGCACTTAATGCC
AGCTACCAGGCCGCCCTGGCCGAGAGCAGCTTCCCCCTCCTCAACAGCCCTGGCATGCTGAACCCTGGCTCCGCCAGCAGCCTGCTGCCC
CTCAGCCACGATGACGTGGGTGCCCCCGTGGAGCCGCTGCCCAGCAACGGCAGCAGCAGCCCTCCTCGCCTCTCCCCGCCCCAGTACAGC
CACCAGGTGCAGGTGAAGGAGGAGCCAGCAGAGGCAGAGGAAGACAGGCAGCCCGGGCCTCCCCTGGGCGCCCCTAACCCCAGCGCCTCG
GGGCCTCCGGAAGACAGGGACCTGGAGGAGGAGCTGCCGGGAGAAGAACTGTCCTAAGGGCCTGTAGTGACCGGCAGGGCTGGGGTGAGA
CCCCTCCCTTCCAGAATCCAGGCCCCATCTCCCCCAACTCCACAGCCCCTCCCGAGCCTCAAGGCAAGTCCAGGACTCAGACCGGGGAGG
CCCGGGCCAGCAGCTCCCAGTGTGACCTGACAAAAACACGTAGGGGCAGGGACGGTCCCCACCCCCAGGGACACAACCCCTGGTCTTGGA
CCAGTAGAGGACACGGAGGGTTCAGACCCCTCCTCAGACCCTCCCCACATCTGAAACTGCCTCCCCCCAACCACCAGCAGCAGCAGGGCC
CTCCTCCCCCACCAGCTCTCCCCACAGGGCCCCTCAGCATCATGGAGACCCGCAGGCGGGGCTTAGCCACCCCTCAAACCCAGGGCCCCC
TGGCACCTGGCTCTGGCCGTGTTTTCTGGCCAGAGGCCCCCACTTTCCTAACTCGTGCTCCCTTCCGCCTTCTTTTCCGTACTGTGAAGA
AAGAACTCTCCACCCCAGCTCCCACCCTGCCCTGGCCTGGGTGGAGGAACTGTGCCTCCATCCCCAGAAGAAACAGCCCCCTCTGCTGCT
GGGGTGGGACTGTCTGTGTGCCCTGTGGGGGTCCGTGTGAGCAGGCCCACCTGGCTCCAGACCCGCCCCCAACCTGAGACAGAACCAGGC
TGAGCCAGGCCTCCACCCCCACCCCCGTTTGCTGGGGGCTCCTCCAGCCGCCCCCATGGGAAGAGGCCTGGTACCGCCTCACCCACAGAG
GTCTGTGCCAGGTGCGCTTCTGCAGGTGGAGCCAAGCTCTCCCTGAGGCCAGAGGCGGGGCCTGGGCCGGGAGCCCAGGGGAAGGCCAGG
CTGGACCCCGGCTCCACACCCACATCCAGCCTGCAGGCCTCTCTGCAGTCCTCTCACCCTCCCTCAGCTCCCCTTCCTCTGCAGTCACCC
TCAGCTCCCCTTCCTTGCCCGCCTCTCCCCCCGCCGCCCCACCAGTTAAACGGATGACCAAAGACCTTTCTTATGCCGGAAGCAAAAACC

>82458_82458_2_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000380298_FOXP4_chr6_41545724_ENST00000373057_length(amino acids)=828AA_BP=218
MAIDRRREAAGGGPGRQPAPAEENGSLPPGDAAASAPLGGRAGPGGGAEIQPLPPLHPGGGPHPSCCSAAAAPSLLLLDYDGSVLPFLGG
LGGGYQKTLVLLTWIPALFIGFSQFSDSFLLDQPNFWCRGAGKGTELAGVTTTGRGGDMGNWTSLPTTPFATAPWEAAGNRSNSSGADGG
DTPPLPSPPDKGDNASNCDCRAWDYGIRAGLVQNVVSKALQVARQFLLQQASGLSSPGNNDSKQSASAVQVPVSVAMMSPQMLTPQQMQQ
ILSPPQLQALLQQQQALMLQQEYYKKQQEQLHLQLLTQQQAGKPQPKEALGNKQLAFQQQLLQMQQLQQQHLLNLQRQGLVSLQPNQASG
PLQTLPQAAVCPTDLPQLWKGEGAPGQPAEDSVKQEGLDLTGTAATATSFAAPPKVSPPLSHHTLPNGQPTVLTSRRDSSSHEETPGSHP
LYGHGECKWPGCETLCEDLGQFIKHLNTEHALDDRSTAQCRVQMQVVQQLEIQLAKESERLQAMMAHLHMRPSEPKPFSQPLNPVPGSSS
FSKVTVSAADSFPDGLVHPPTSAAAPVTPLRPPGLGSASLHGGGPARRRSSDKFCSPISSELAQNHEFYKNADVRPPFTYASLIRQAILE
TPDRQLTLNEIYNWFTRMFAYFRRNTATWKNAVRHNLSLHKCFVRVENVKGAVWTVDEREYQKRRPPKMTGSPTLVKNMISGLSYGALNA
SYQAALAESSFPLLNSPGMLNPGSASSLLPLSHDDVGAPVEPLPSNGSSSPPRLSPPQYSHQVQVKEEPAEAEEDRQPGPPLGAPNPSAS

--------------------------------------------------------------
>82458_82458_3_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000380298_FOXP4_chr6_41545724_ENST00000373060_length(transcript)=5940nt_BP=654nt
ATGGCCATAGACCGGCGGCGCGAGGCGGCGGGCGGCGGGCCTGGGCGGCAGCCGGCCCCGGCCGAGGAGAACGGCTCCCTGCCGCCCGGG
GACGCGGCGGCCTCGGCGCCCCTCGGGGGACGCGCGGGCCCCGGCGGCGGCGCGGAGATCCAGCCGCTGCCCCCACTGCATCCTGGAGGC
GGCCCGCACCCGAGCTGCTGCTCCGCGGCTGCGGCCCCGAGCCTCTTGTTGCTGGACTATGACGGGTCGGTGCTGCCCTTCCTCGGGGGC
CTGGGCGGGGGCTATCAGAAGACCCTCGTGCTGCTCACCTGGATCCCGGCGCTGTTCATCGGCTTCAGCCAGTTCTCGGACTCGTTCCTC
CTGGACCAGCCCAACTTCTGGTGCCGCGGGGCCGGCAAAGGCACCGAGCTGGCAGGGGTCACCACCACAGGCCGGGGCGGGGACATGGGC
AACTGGACCAGCCTCCCCACCACCCCCTTCGCCACTGCCCCCTGGGAGGCTGCGGGCAACCGGAGCAACAGCAGCGGCGCGGACGGAGGC
GACACACCACCCCTGCCATCCCCTCCGGACAAGGGGGACAACGCCTCCAACTGTGACTGCCGCGCATGGGACTACGGCATCCGCGCCGGC
CTCGTCCAGAACGTGGTCAGCAAGGCTCTCCAAGTGGCCCGGCAGTTCCTGCTGCAGCAGGCCTCAGGCCTGAGCTCCCCAGGGAACAAT
GACAGCAAACAGTCTGCCTCTGCTGTGCAGGTGCCTGTGTCGGTGGCCATGATGTCGCCGCAGATGCTTACCCCGCAACAGATGCAGCAG
ATCCTGTCGCCCCCGCAGCTGCAGGCCTTGCTCCAGCAGCAGCAAGCCCTCATGCTCCAGCAGCTACAGGAGTACTACAAGAAGCAGCAG
GAGCAGCTCCACCTGCAGCTCCTCACCCAGCAGCAGGCTGGGAAACCGCAGCCCAAAGAGGCACTGGGGAACAAGCAGCTGGCCTTCCAG
CAGCAGCTCCTGCAAATGCAACAGTTGCAGCAGCAGCACCTGCTCAACCTGCAGAGGCAGGGGCTGGTCAGCCTGCAGCCCAACCAAGCC
TCGGGGCCCCTCCAGACCCTTCCGCAAGCAGCTGTTTGCCCAACAGACCTGCCCCAGCTGTGGAAGGGCGAGGGTGCCCCCGGGCAGCCT
GCCGAGGACAGCGTCAAGCAGGAGGGGCTGGACCTCACTGGCACGGCCGCCACCGCTACCTCGTTTGCCGCTCCCCCCAAGGTCTCACCC
CCCCTCTCCCACCATACCCTGCCCAACGGACAGCCTACTGTGCTCACATCTCGGAGAGACAGCTCTTCCCACGAGGAGACCCCCGGCTCC
CACCCCCTGTACGGACACGGAGAGTGCAAGTGGCCAGGCTGTGAGACCCTGTGTGAAGACCTGGGCCAGTTTATCAAACACCTCAACACA
GAGCACGCCCTGGATGACCGGAGTACAGCCCAGTGCCGGGTACAGATGCAGGTGGTGCAGCAGCTGGAGATCCAGCTCGCCAAGGAGAGC
GAGCGGCTGCAGGCCATGATGGCCCACCTGCACATGCGGCCCTCGGAGCCCAAGCCCTTCAGCCAGCCACTGAACCCGGTCCCCGGCTCC
TCCTCATTCTCCAAGGTGACCGTCTCTGCAGCAGACTCATTCCCAGATGGTCTCGTGCACCCCCCGACCTCGGCCGCAGCCCCTGTCACC
CCTCTACGGCCCCCTGGCCTGGGCTCTGCCTCCCTGCATGGTGGGGGCCCAGCCCGTCGGAGAAGCAGTGACAAGTTCTGCTCCCCCATC
TCCTCAGAGCTGGCCCAGAATCATGAGTTCTACAAGAACGCCGACGTCCGGCCCCCCTTCACCTACGCCTCCCTCATCCGCCAGGCCATC
CTGGAAACCCCTGACAGGCAGCTGACCCTGAATGAGATCTATAACTGGTTCACCAGGATGTTCGCCTATTTCCGCAGAAACACTGCCACC
TGGAAGAACGCCGTGCGCCACAACCTCAGCCTGCACAAGTGCTTCGTCCGCGTGGAGAACGTCAAGGGTGCCGTGTGGACTGTGGACGAG
CGGGAGTATCAGAAGCGGAGACCGCCAAAGATGACAGGGAGCCCCACCCTGGTGAAGAACATGATCTCTGGCCTCAGCTATGGAGCACTT
AATGCCAGCTACCAGGCCGCCCTGGCCGAGAGCAGCTTCCCCCTCCTCAACAGCCCTGGCATGCTGAACCCTGGCTCCGCCAGCAGCCTG
CTGCCCCTCAGCCACGATGACGTGGGTGCCCCCGTGGAGCCGCTGCCCAGCAACGGCAGCAGCAGCCCTCCTCGCCTCTCCCCGCCCCAG
TACAGCCACCAGGTGCAGGTGAAGGAGGAGCCAGCAGAGGCAGAGGAAGACAGGCAGCCCGGGCCTCCCCTGGGCGCCCCTAACCCCAGC
GCCTCGGGGCCTCCGGAAGACAGGGACCTGGAGGAGGAGCTGCCGGGAGAAGAACTGTCCTAAGGGCCTGTAGTGACCGGCAGGGCTGGG
GTGAGACCCCTCCCTTCCAGAATCCAGGCCCCATCTCCCCCAACTCCACAGCCCCTCCCGAGCCTCAAGGCAAGTCCAGGACTCAGACCG
GGGAGGCCCGGGCCAGCAGCTCCCAGTGTGACCTGACAAAAACACGTAGGGGCAGGGACGGTCCCCACCCCCAGGGACACAACCCCTGGT
CTTGGACCAGTAGAGGACACGGAGGGTTCAGACCCCTCCTCAGACCCTCCCCACATCTGAAACTGCCTCCCCCCAACCACCAGCAGCAGC
AGGGCCCTCCTCCCCCACCAGCTCTCCCCACAGGGCCCCTCAGCATCATGGAGACCCGCAGGCGGGGCTTAGCCACCCCTCAAACCCAGG
GCCCCCTGGCACCTGGCTCTGGCCGTGTTTTCTGGCCAGAGGCCCCCACTTTCCTAACTCGTGCTCCCTTCCGCCTTCTTTTCCGTACTG
TGAAGAAAGAACTCTCCACCCCAGCTCCCACCCTGCCCTGGCCTGGGTGGAGGAACTGTGCCTCCATCCCCAGAAGAAACAGCCCCCTCT
GCTGCTGGGGTGGGACTGTCTGTGTGCCCTGTGGGGGTCCGTGTGAGCAGGCCCACCTGGCTCCAGACCCGCCCCCAACCTGAGACAGAA
CCAGGCTGAGCCAGGCCTCCACCCCCACCCCCGTTTGCTGGGGGCTCCTCCAGCCGCCCCCATGGGAAGAGGCCTGGTACCGCCTCACCC
ACAGAGGTCTGTGCCAGGTGCGCTTCTGCAGGTGGAGCCAAGCTCTCCCTGAGGCCAGAGGCGGGGCCTGGGCCGGGAGCCCAGGGGAAG
GCCAGGCTGGACCCCGGCTCCACACCCACATCCAGCCTGCAGGCCTCTCTGCAGTCCTCTCACCCTCCCTCAGCTCCCCTTCCTCTGCAG
TCACCCTCAGCTCCCCTTCCTTGCCCGCCTCTCCCCCCGCCGCCCCACCAGTTAAACGGATGACCAAAGACCTTTCTTATGCCGGAAGCA
AAAACCAAAACTTTTTGTTGGCTTTTTCCTTTGTCGCCTCCCCAGCACCTGCCCTCCCAGTCTCCCACCCCGGCCCCAGGCTGGAAGCCC
TCCCTCCACTTAAGTTATTGTTTTAAACCAAAGTTTACAGTGTCTGTTGGTGGCCAAGACCTTCTCTCTCCACCCCTCCTCCATCCACCC
TGAGGACCCTGGGGCTCAGTGGAGGCAGGGCCCTGCCCCCCTCCCTTCCGCTCCTGCCCAGCCTGGGGGAAGGAGAAAGGAGGGGAGAAA
GCGGGCTCTCACCCCCTCAGGAGTGGGCACGGGAGCCCTTCTCCCTGACCCTGGGCTGCTTCCTGGGGGCTCTCCAGACCCCTCTCTAGG
ACCAAGTCACCCGTCGTGCTGGGAGTGTGGATTCTAGCAAAAGAGCTGGAAAAAAGTCAGACTCTCCACAGACCCCCTATGGGGGACCCC
CAACTCAAGGCCAAGGACTGGGCGTATCGGATGCTCATAACACCCCTGGCCTGGCCCCTTTACTGAGAAGACTCCTTGGATATTTCCCAA
GAACCCCCCACATACACCCCTCACAAGCCACCCCTCCTGAGAGGCAGGGGGCCCTCCGCCCCCTCCCCATGTATTCCCCACCTGTGTTCC
GTTTGACCAGCACAGAAATATTAAACGTCCTCTATTCACCGGGCCCTGTGTGTGTCACCGAGGTGCGGGAGGGGAGGAGCATTAAAGCTG
AAAGATCGCTCTGCTCGGGGAGCCTGGGCAGAGCAGCAGCAACGTGAGGGTCGCTGTGGTGGTGGTTTCTGTGAGTGGATGGAATGAGCA
GCCCTGCAGGGGCGCTGGGCATGTGCCCTCACTGTGGACAGGGCCCACCCACCTCCGGTTCCCCTGTGCCTCCTGTCCCTTCCACGCTTA
AACAGGGTTCTCTGTCATTTTCCTGTTTTCTTCCAGAGTCCCAATCCTTTGCCCTAGTTCTTTCACTAGTTTGAAATCCAAGTTCTTGCC
AGAGTGTTGGAGCAAGGCAGCTGATTTGCTGCAGGGATGGAGAGGACCACCGCCGCAGGGTTCTTTTACCTGTGCCACCAGCTCTGGAGA
CATCCACACCCATTCCCAGGTGTCCTTCCAGGCCCATTGTCCACGTCTTCAAGGGGGCTGCCTGGATAGGCGTGTGTGTGTGTAGTGCCC
AGGTGTGGTGGTGGCATCCTGAAGCAGTAGGACCATTAGTGTGCGTGCACACCCACGTGGCACACTGTGTGGTGACCATGGTCATCATAG
TGGGCTCACGGCGGAAACGGGATCATTCAACCTATACAAAGGGGACCCTGGATAGGCTGCAAGGAAAGAGGCCAAGGGCCAAGCTGCTAA
GCCAAAGATGGCCCCTGACACCTCCCCCAGCCCCAGCACACCAGCCTGCACCTCTTATCCCGTGTTAAGTCCTGGTTCCCCACCTGCTGC
CCCTCCTCAACACAGAGCCCCTCCCGCCGCCTCAGACCCCTGTGCACATCCCCAGGGCCTCAGCCGTCTCATTGGTCTTTATTTTTTATT
TTTTTTAAGATGGAGTTTCGCTCTTATTGCCCAGACTGGAGTGCAGTGATGCTATCTCGGCTCACTGCAACCTTTGCCTCCCAGGTTCAA
GTGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCGTGCACCACCACGCCGGGCTAAATTTTTTTGTATTTTTAGTAGAGA
CGGGGTTTCTCTATGTTGGTCAGGCTGATCTCGAACTCCCGACCTCAGGTGATCCGCCAGCCTCAGCCTCCCAAAGTGCTGGGATTACAG
GCGTGAGCCACTGCACCCGGCTCTCACTGGTCTTACGCCACCTTCTGGACACTCCCTCCTTGAGGGCAGAAAGGAGTCCCAGGCCTGTCC
CTAGGGACAAGGCCCAGGGAAGAGTGTATTTGGGGAGCAGGGGAGGGGAGGGTGTTGAGAAAGCTGAACTGGAGTCAATCACCCTTCCCA
CAAATCACCAAACTGCTGGAACTCTCCAGCCAAATGCTGGGAGAAGGACCTGGAGGGTGAGTCTTTGCTGACCTCTCTCTACTCTCAGGC
ATGTCTTTTGTCCTTTTCGTCCATCTATTTCTGTCTGTCGCTCACTCGCCCCGCTTTCTCTGTCTCACCTTCATCCACTCTGCAGGCCTG
CTCCACCACAGCCCTAATCCTCTGGACGCTTGTGTAGGGCCTGGGGTGAATTCCCTGTCCCCCATGGTACCTCGAGAGGGGCTGGGGAGC
TCAGCTTGGTCTCAGAGTCTCCCCACCAGATACTGTTTAAAAAAGTAGCACTGATGTGTTTTGTAATCTGCCCCTCCCAGCCCTCCGTGG
AGGCTGCCAGGGCCTTGTACGGTAAACCTAGCTGCATGTAATCTGTGGACAATGGCATTCTCTACAATGCAATAAAAACAATTACCCATG

>82458_82458_3_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000380298_FOXP4_chr6_41545724_ENST00000373060_length(amino acids)=830AA_BP=218
MAIDRRREAAGGGPGRQPAPAEENGSLPPGDAAASAPLGGRAGPGGGAEIQPLPPLHPGGGPHPSCCSAAAAPSLLLLDYDGSVLPFLGG
LGGGYQKTLVLLTWIPALFIGFSQFSDSFLLDQPNFWCRGAGKGTELAGVTTTGRGGDMGNWTSLPTTPFATAPWEAAGNRSNSSGADGG
DTPPLPSPPDKGDNASNCDCRAWDYGIRAGLVQNVVSKALQVARQFLLQQASGLSSPGNNDSKQSASAVQVPVSVAMMSPQMLTPQQMQQ
ILSPPQLQALLQQQQALMLQQLQEYYKKQQEQLHLQLLTQQQAGKPQPKEALGNKQLAFQQQLLQMQQLQQQHLLNLQRQGLVSLQPNQA
SGPLQTLPQAAVCPTDLPQLWKGEGAPGQPAEDSVKQEGLDLTGTAATATSFAAPPKVSPPLSHHTLPNGQPTVLTSRRDSSSHEETPGS
HPLYGHGECKWPGCETLCEDLGQFIKHLNTEHALDDRSTAQCRVQMQVVQQLEIQLAKESERLQAMMAHLHMRPSEPKPFSQPLNPVPGS
SSFSKVTVSAADSFPDGLVHPPTSAAAPVTPLRPPGLGSASLHGGGPARRRSSDKFCSPISSELAQNHEFYKNADVRPPFTYASLIRQAI
LETPDRQLTLNEIYNWFTRMFAYFRRNTATWKNAVRHNLSLHKCFVRVENVKGAVWTVDEREYQKRRPPKMTGSPTLVKNMISGLSYGAL
NASYQAALAESSFPLLNSPGMLNPGSASSLLPLSHDDVGAPVEPLPSNGSSSPPRLSPPQYSHQVQVKEEPAEAEEDRQPGPPLGAPNPS

--------------------------------------------------------------
>82458_82458_4_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000380298_FOXP4_chr6_41545724_ENST00000373063_length(transcript)=5902nt_BP=654nt
ATGGCCATAGACCGGCGGCGCGAGGCGGCGGGCGGCGGGCCTGGGCGGCAGCCGGCCCCGGCCGAGGAGAACGGCTCCCTGCCGCCCGGG
GACGCGGCGGCCTCGGCGCCCCTCGGGGGACGCGCGGGCCCCGGCGGCGGCGCGGAGATCCAGCCGCTGCCCCCACTGCATCCTGGAGGC
GGCCCGCACCCGAGCTGCTGCTCCGCGGCTGCGGCCCCGAGCCTCTTGTTGCTGGACTATGACGGGTCGGTGCTGCCCTTCCTCGGGGGC
CTGGGCGGGGGCTATCAGAAGACCCTCGTGCTGCTCACCTGGATCCCGGCGCTGTTCATCGGCTTCAGCCAGTTCTCGGACTCGTTCCTC
CTGGACCAGCCCAACTTCTGGTGCCGCGGGGCCGGCAAAGGCACCGAGCTGGCAGGGGTCACCACCACAGGCCGGGGCGGGGACATGGGC
AACTGGACCAGCCTCCCCACCACCCCCTTCGCCACTGCCCCCTGGGAGGCTGCGGGCAACCGGAGCAACAGCAGCGGCGCGGACGGAGGC
GACACACCACCCCTGCCATCCCCTCCGGACAAGGGGGACAACGCCTCCAACTGTGACTGCCGCGCATGGGACTACGGCATCCGCGCCGGC
CTCGTCCAGAACGTGGTCAGCAAGGCTCTCCAAGTGGCCCGGCAGTTCCTGCTGCAGCAGGCCTCAGGCCTGAGCTCCCCAGGGAACAAT
GACAGCAAACAGTCTGCCTCTGCTGTGCAGGTGCCTGTGTCGGTGGCCATGATGTCGCCGCAGATGCTTACCCCGCAACAGATGCAGCAG
ATCCTGTCGCCCCCGCAGCTGCAGGCCTTGCTCCAGCAGCAGCAAGCCCTCATGCTCCAGCAGCTACAGGAGTACTACAAGAAGCAGCAG
GAGCAGCTCCACCTGCAGCTCCTCACCCAGCAGCAGGCTGGGAAACCGCAGCCCAAAGAGGCACTGGGGAACAAGCAGCTGGCCTTCCAG
CAGCAGCTCCTGCAAATGCAACAGTTGCAGCAGCAGCACCTGCTCAACCTGCAGAGGCAGGGGCTGGTCAGCCTGCAGCCCAACCAAGCC
TCGGGGCCCCTCCAGACCCTTCCGCAAGCTGTTTGCCCAACAGACCTGCCCCAGCTGTGGAAGGGCGAGGGTGCCCCCGGGCAGCCTGCC
GAGGACAGCGTCAAGCAGGAGGGGCTGGACCTCACTGGCACGGCCGCCACCGCTACCTCGTTTGCCGCTCCCCCCAAGGTCTCACCCCCC
CTCTCCCACCATACCCTGCCCAACGGACAGCCTACTGTGCTCACATCTCGGAGAGACAGCTCTTCCCACGAGGAGACCCCCGGCTCCCAC
CCCCTGTACGGACACGGAGAGTGCAAGTGGCCAGGCTGTGAGACCCTGTGTGAAGACCTGGGCCAGTTTATCAAACACCTCAACACAGAG
CACGCCCTGGATGACCGGAGTACAGCCCAGTGCCGGGTACAGATGCAGGTGGTGCAGCAGCTGGAGATCCAGCTCGCCAAGGAGAGCGAG
CGGCTGCAGGCCATGATGGCCCACCTGCACATGCGGCCCTCGGAGCCCAAGCCCTTCAGCCAGCCAGTGACCGTCTCTGCAGCAGACTCA
TTCCCAGATGGTCTCGTGCACCCCCCGACCTCGGCCGCAGCCCCTGTCACCCCTCTACGGCCCCCTGGCCTGGGCTCTGCCTCCCTGCAT
GGTGGGGGCCCAGCCCGTCGGAGAAGCAGTGACAAGTTCTGCTCCCCCATCTCCTCAGAGCTGGCCCAGAATCATGAGTTCTACAAGAAC
GCCGACGTCCGGCCCCCCTTCACCTACGCCTCCCTCATCCGCCAGGCCATCCTGGAAACCCCTGACAGGCAGCTGACCCTGAATGAGATC
TATAACTGGTTCACCAGGATGTTCGCCTATTTCCGCAGAAACACTGCCACCTGGAAGAACGCCGTGCGCCACAACCTCAGCCTGCACAAG
TGCTTCGTCCGCGTGGAGAACGTCAAGGGTGCCGTGTGGACTGTGGACGAGCGGGAGTATCAGAAGCGGAGACCGCCAAAGATGACAGGG
AGCCCCACCCTGGTGAAGAACATGATCTCTGGCCTCAGCTATGGAGCACTTAATGCCAGCTACCAGGCCGCCCTGGCCGAGAGCAGCTTC
CCCCTCCTCAACAGCCCTGGCATGCTGAACCCTGGCTCCGCCAGCAGCCTGCTGCCCCTCAGCCACGATGACGTGGGTGCCCCCGTGGAG
CCGCTGCCCAGCAACGGCAGCAGCAGCCCTCCTCGCCTCTCCCCGCCCCAGTACAGCCACCAGGTGCAGGTGAAGGAGGAGCCAGCAGAG
GCAGAGGAAGACAGGCAGCCCGGGCCTCCCCTGGGCGCCCCTAACCCCAGCGCCTCGGGGCCTCCGGAAGACAGGGACCTGGAGGAGGAG
CTGCCGGGAGAAGAACTGTCCTAAGGGCCTGTAGTGACCGGCAGGGCTGGGGTGAGACCCCTCCCTTCCAGAATCCAGGCCCCATCTCCC
CCAACTCCACAGCCCCTCCCGAGCCTCAAGGCAAGTCCAGGACTCAGACCGGGGAGGCCCGGGCCAGCAGCTCCCAGTGTGACCTGACAA
AAACACGTAGGGGCAGGGACGGTCCCCACCCCCAGGGACACAACCCCTGGTCTTGGACCAGTAGAGGACACGGAGGGTTCAGACCCCTCC
TCAGACCCTCCCCACATCTGAAACTGCCTCCCCCCAACCACCAGCAGCAGCAGGGCCCTCCTCCCCCACCAGCTCTCCCCACAGGGCCCC
TCAGCATCATGGAGACCCGCAGGCGGGGCTTAGCCACCCCTCAAACCCAGGGCCCCCTGGCACCTGGCTCTGGCCGTGTTTTCTGGCCAG
AGGCCCCCACTTTCCTAACTCGTGCTCCCTTCCGCCTTCTTTTCCGTACTGTGAAGAAAGAACTCTCCACCCCAGCTCCCACCCTGCCCT
GGCCTGGGTGGAGGAACTGTGCCTCCATCCCCAGAAGAAACAGCCCCCTCTGCTGCTGGGGTGGGACTGTCTGTGTGCCCTGTGGGGGTC
CGTGTGAGCAGGCCCACCTGGCTCCAGACCCGCCCCCAACCTGAGACAGAACCAGGCTGAGCCAGGCCTCCACCCCCACCCCCGTTTGCT
GGGGGCTCCTCCAGCCGCCCCCATGGGAAGAGGCCTGGTACCGCCTCACCCACAGAGGTCTGTGCCAGGTGCGCTTCTGCAGGTGGAGCC
AAGCTCTCCCTGAGGCCAGAGGCGGGGCCTGGGCCGGGAGCCCAGGGGAAGGCCAGGCTGGACCCCGGCTCCACACCCACATCCAGCCTG
CAGGCCTCTCTGCAGTCCTCTCACCCTCCCTCAGCTCCCCTTCCTCTGCAGTCACCCTCAGCTCCCCTTCCTTGCCCGCCTCTCCCCCCG
CCGCCCCACCAGTTAAACGGATGACCAAAGACCTTTCTTATGCCGGAAGCAAAAACCAAAACTTTTTGTTGGCTTTTTCCTTTGTCGCCT
CCCCAGCACCTGCCCTCCCAGTCTCCCACCCCGGCCCCAGGCTGGAAGCCCTCCCTCCACTTAAGTTATTGTTTTAAACCAAAGTTTACA
GTGTCTGTTGGTGGCCAAGACCTTCTCTCTCCACCCCTCCTCCATCCACCCTGAGGACCCTGGGGCTCAGTGGAGGCAGGGCCCTGCCCC
CCTCCCTTCCGCTCCTGCCCAGCCTGGGGGAAGGAGAAAGGAGGGGAGAAAGCGGGCTCTCACCCCCTCAGGAGTGGGCACGGGAGCCCT
TCTCCCTGACCCTGGGCTGCTTCCTGGGGGCTCTCCAGACCCCTCTCTAGGACCAAGTCACCCGTCGTGCTGGGAGTGTGGATTCTAGCA
AAAGAGCTGGAAAAAAGTCAGACTCTCCACAGACCCCCTATGGGGGACCCCCAACTCAAGGCCAAGGACTGGGCGTATCGGATGCTCATA
ACACCCCTGGCCTGGCCCCTTTACTGAGAAGACTCCTTGGATATTTCCCAAGAACCCCCCACATACACCCCTCACAAGCCACCCCTCCTG
AGAGGCAGGGGGCCCTCCGCCCCCTCCCCATGTATTCCCCACCTGTGTTCCGTTTGACCAGCACAGAAATATTAAACGTCCTCTATTCAC
CGGGCCCTGTGTGTGTCACCGAGGTGCGGGAGGGGAGGAGCATTAAAGCTGAAAGATCGCTCTGCTCGGGGAGCCTGGGCAGAGCAGCAG
CAACGTGAGGGTCGCTGTGGTGGTGGTTTCTGTGAGTGGATGGAATGAGCAGCCCTGCAGGGGCGCTGGGCATGTGCCCTCACTGTGGAC
AGGGCCCACCCACCTCCGGTTCCCCTGTGCCTCCTGTCCCTTCCACGCTTAAACAGGGTTCTCTGTCATTTTCCTGTTTTCTTCCAGAGT
CCCAATCCTTTGCCCTAGTTCTTTCACTAGTTTGAAATCCAAGTTCTTGCCAGAGTGTTGGAGCAAGGCAGCTGATTTGCTGCAGGGATG
GAGAGGACCACCGCCGCAGGGTTCTTTTACCTGTGCCACCAGCTCTGGAGACATCCACACCCATTCCCAGGTGTCCTTCCAGGCCCATTG
TCCACGTCTTCAAGGGGGCTGCCTGGATAGGCGTGTGTGTGTGTAGTGCCCAGGTGTGGTGGTGGCATCCTGAAGCAGTAGGACCATTAG
TGTGCGTGCACACCCACGTGGCACACTGTGTGGTGACCATGGTCATCATAGTGGGCTCACGGCGGAAACGGGATCATTCAACCTATACAA
AGGGGACCCTGGATAGGCTGCAAGGAAAGAGGCCAAGGGCCAAGCTGCTAAGCCAAAGATGGCCCCTGACACCTCCCCCAGCCCCAGCAC
ACCAGCCTGCACCTCTTATCCCGTGTTAAGTCCTGGTTCCCCACCTGCTGCCCCTCCTCAACACAGAGCCCCTCCCGCCGCCTCAGACCC
CTGTGCACATCCCCAGGGCCTCAGCCGTCTCATTGGTCTTTATTTTTTATTTTTTTTAAGATGGAGTTTCGCTCTTATTGCCCAGACTGG
AGTGCAGTGATGCTATCTCGGCTCACTGCAACCTTTGCCTCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTA
CAGGCGTGCACCACCACGCCGGGCTAAATTTTTTTGTATTTTTAGTAGAGACGGGGTTTCTCTATGTTGGTCAGGCTGATCTCGAACTCC
CGACCTCAGGTGATCCGCCAGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGCACCCGGCTCTCACTGGTCTTACGCC
ACCTTCTGGACACTCCCTCCTTGAGGGCAGAAAGGAGTCCCAGGCCTGTCCCTAGGGACAAGGCCCAGGGAAGAGTGTATTTGGGGAGCA
GGGGAGGGGAGGGTGTTGAGAAAGCTGAACTGGAGTCAATCACCCTTCCCACAAATCACCAAACTGCTGGAACTCTCCAGCCAAATGCTG
GGAGAAGGACCTGGAGGGTGAGTCTTTGCTGACCTCTCTCTACTCTCAGGCATGTCTTTTGTCCTTTTCGTCCATCTATTTCTGTCTGTC
GCTCACTCGCCCCGCTTTCTCTGTCTCACCTTCATCCACTCTGCAGGCCTGCTCCACCACAGCCCTAATCCTCTGGACGCTTGTGTAGGG
CCTGGGGTGAATTCCCTGTCCCCCATGGTACCTCGAGAGGGGCTGGGGAGCTCAGCTTGGTCTCAGAGTCTCCCCACCAGATACTGTTTA
AAAAAGTAGCACTGATGTGTTTTGTAATCTGCCCCTCCCAGCCCTCCGTGGAGGCTGCCAGGGCCTTGTACGGTAAACCTAGCTGCATGT

>82458_82458_4_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000380298_FOXP4_chr6_41545724_ENST00000373063_length(amino acids)=817AA_BP=218
MAIDRRREAAGGGPGRQPAPAEENGSLPPGDAAASAPLGGRAGPGGGAEIQPLPPLHPGGGPHPSCCSAAAAPSLLLLDYDGSVLPFLGG
LGGGYQKTLVLLTWIPALFIGFSQFSDSFLLDQPNFWCRGAGKGTELAGVTTTGRGGDMGNWTSLPTTPFATAPWEAAGNRSNSSGADGG
DTPPLPSPPDKGDNASNCDCRAWDYGIRAGLVQNVVSKALQVARQFLLQQASGLSSPGNNDSKQSASAVQVPVSVAMMSPQMLTPQQMQQ
ILSPPQLQALLQQQQALMLQQLQEYYKKQQEQLHLQLLTQQQAGKPQPKEALGNKQLAFQQQLLQMQQLQQQHLLNLQRQGLVSLQPNQA
SGPLQTLPQAVCPTDLPQLWKGEGAPGQPAEDSVKQEGLDLTGTAATATSFAAPPKVSPPLSHHTLPNGQPTVLTSRRDSSSHEETPGSH
PLYGHGECKWPGCETLCEDLGQFIKHLNTEHALDDRSTAQCRVQMQVVQQLEIQLAKESERLQAMMAHLHMRPSEPKPFSQPVTVSAADS
FPDGLVHPPTSAAAPVTPLRPPGLGSASLHGGGPARRRSSDKFCSPISSELAQNHEFYKNADVRPPFTYASLIRQAILETPDRQLTLNEI
YNWFTRMFAYFRRNTATWKNAVRHNLSLHKCFVRVENVKGAVWTVDEREYQKRRPPKMTGSPTLVKNMISGLSYGALNASYQAALAESSF
PLLNSPGMLNPGSASSLLPLSHDDVGAPVEPLPSNGSSSPPRLSPPQYSHQVQVKEEPAEAEEDRQPGPPLGAPNPSASGPPEDRDLEEE

--------------------------------------------------------------
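Note on the record headers: each `>` line in this section appears to pack the fusion ID, a record index, the fusion name, the head and tail gene (symbol, chromosome, breakpoint coordinate, Ensembl transcript), the sequence length, and a `BP=` breakpoint offset into one underscore-delimited string. A minimal sketch of how such a header could be split into fields is shown below. The field names (`fusion_id`, `record_index`, `bp_offset`, and so on) and the helper `parse_header` are hypothetical labels chosen for illustration, not names used by FusionGDB, and the pattern assumes gene symbols contain no underscores.

```python
import re

# Pattern inferred from the headers shown in this section (an assumption,
# not an official FusionGDB specification).
HEADER_RE = re.compile(
    r"^>(?P<fusion_id>\d+)_\d+_(?P<record_index>\d+)_"
    r"(?P<fusion_name>[^_]+)_"
    r"(?P<hgene>[^_]+)_(?P<hgene_chr>chr[^_]+)_(?P<hgene_bp>\d+)_(?P<hgene_tx>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<tgene_chr>chr[^_]+)_(?P<tgene_bp>\d+)_(?P<tgene_tx>ENST\d+)_"
    r"length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?:nt|AA)_"
    r"BP=(?P<bp_offset>\d+)(?:nt)?$"
)

# Fields that hold integers rather than text.
INT_FIELDS = ("fusion_id", "record_index", "hgene_bp", "tgene_bp", "length", "bp_offset")


def parse_header(line):
    """Split one header line into a dict of named fields; raise ValueError if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognized header: " + line)
    fields = match.groupdict()
    for key in INT_FIELDS:
        fields[key] = int(fields[key])
    return fields


nt_header = (
    ">82458_82458_4_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000380298_"
    "FOXP4_chr6_41545724_ENST00000373063_length(transcript)=5902nt_BP=654nt"
)
aa_header = (
    ">82458_82458_4_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000380298_"
    "FOXP4_chr6_41545724_ENST00000373063_length(amino acids)=817AA_BP=218"
)

nt_record = parse_header(nt_header)
aa_record = parse_header(aa_header)
```

In the records listed here, the transcript header gives `BP=654nt` and the matching amino-acid header gives `BP=218`, which is consistent with the nucleotide offset divided by three (one codon per residue); that relationship is an observation about these records rather than a documented rule.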
>82458_82458_5_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000380298_FOXP4_chr6_41545724_ENST00000409208_length(transcript)=3480nt_BP=654nt
ATGGCCATAGACCGGCGGCGCGAGGCGGCGGGCGGCGGGCCTGGGCGGCAGCCGGCCCCGGCCGAGGAGAACGGCTCCCTGCCGCCCGGG
GACGCGGCGGCCTCGGCGCCCCTCGGGGGACGCGCGGGCCCCGGCGGCGGCGCGGAGATCCAGCCGCTGCCCCCACTGCATCCTGGAGGC
GGCCCGCACCCGAGCTGCTGCTCCGCGGCTGCGGCCCCGAGCCTCTTGTTGCTGGACTATGACGGGTCGGTGCTGCCCTTCCTCGGGGGC
CTGGGCGGGGGCTATCAGAAGACCCTCGTGCTGCTCACCTGGATCCCGGCGCTGTTCATCGGCTTCAGCCAGTTCTCGGACTCGTTCCTC
CTGGACCAGCCCAACTTCTGGTGCCGCGGGGCCGGCAAAGGCACCGAGCTGGCAGGGGTCACCACCACAGGCCGGGGCGGGGACATGGGC
AACTGGACCAGCCTCCCCACCACCCCCTTCGCCACTGCCCCCTGGGAGGCTGCGGGCAACCGGAGCAACAGCAGCGGCGCGGACGGAGGC
GACACACCACCCCTGCCATCCCCTCCGGACAAGGGGGACAACGCCTCCAACTGTGACTGCCGCGCATGGGACTACGGCATCCGCGCCGGC
CTCGTCCAGAACGTGGTCAGCAAGGCTCTCCAAGTGGCCCGGCAGTTCCTGCTGCAGCAGGCCTCAGGCCTGAGCTCCCCAGGGAACAAT
GACAGCAAACAGTCTGCCTCTGCTGTGCAGGTGCCTGTGTCGGTGGCCATGATGTCGCCGCAGATGCTTACCCCGCAACAGATGCAGCAG
ATCCTGTCGCCCCCGCAGCTGCAGGCCTTGCTCCAGCAGCAGCAAGCCCTCATGCTCCAGCAGCTACAGGAGTACTACAAGAAGCAGCAG
GAGCAGCTCCACCTGCAGCTCCTCACCCAGCAGCAGGCTGGGAAACCGCAGCCCAAAGAGGCACTGGGGAACAAGCAGCTGGCCTTCCAG
CAGCAGCTCCTGCAAATGCAACAGTTGCAGCAGCAGCACCTGCTCAACCTGCAGAGGCAGGGGCTGGTCAGCCTGCAGCCCAACCAAGCC
TCGGGGCCCCTCCAGACCCTTCCGCAAGCAGCTGTTTGCCCAACAGACCTGCCCCAGCTGTGGAAGGGCGAGGGTGCCCCCGGGCAGCCT
GCCGAGGACAGCGTCAAGCAGGAGGGGCTGGACCTCACTGGCACGGCCGCCACCGCTACCTCGTTTGCCGCTCCCCCCAAGGTCTCACCC
CCCCTCTCCCACCATACCCTGCCCAACGGACAGCCTACTGTGCTCACATCTCGGAGAGACAGCTCTTCCCACGAGGAGACCCCCGGCTCC
CACCCCCTGTACGGACACGGAGAGTGCAAGTGGCCAGGCTGTGAGACCCTGTGTGAAGACCTGGGCCAGTTTATCAAACACCTCAACACA
GAGCACGCCCTGGATGACCGGAGTACAGCCCAGTGCCGGGTACAGATGCAGGTGGTGCAGCAGCTGGAGATCCAGCTCGCCAAGGAGAGC
GAGCGGCTGCAGGCCATGATGGCCCACCTGCACATGCGGCCCTCGGAGCCCAAGCCCTTCAGCCAGCCAGTGACCGTCTCTGCAGCAGAC
TCATTCCCAGATGGTCTCGTGCACCCCCCGACCTCGGCCGCAGCCCCTGTCACCCCTCTACGGCCCCCTGGCCTGGGCTCTGCCTCCCTG
CATGGTGGGGGCCCAGCCCGTCGGAGAAGCAGTGACAAGTTCTGCTCCCCCATCTCCTCAGAGCTGGCCCAGAATCATGAGTTCTACAAG
AACGCCGACGTCCGGCCCCCCTTCACCTACGCCTCCCTCATCCGCCAGGCCATCCTGGAAACCCCTGACAGGCAGCTGACCCTGAATGAG
ATCTATAACTGGTTCACCAGGATGTTCGCCTATTTCCGCAGAAACACTGCCACCTGGAAGAACGCCGTGCGCCACAACCTCAGCCTGCAC
AAGTGCTTCGTCCGCGTGGAGAACGTCAAGGGTGCCGTGTGGACTGTGGACGAGCGGGAGTATCAGAAGCGGAGACCGCCAAAGATGACA
GGGAGCCCCACCCTGGTGAAGAACATGATCTCTGGCCTCAGCTATGGAGCACTTAATGCCAGCTACCAGGCCGCCCTGGCCGAGAGCAGC
TTCCCCCTCCTCAACAGCCCTGGCATGCTGAACCCTGGCTCCGCCAGCAGCCTGCTGCCCCTCAGCCACGATGACGTGGGTGCCCCCGTG
GAGCCGCTGCCCAGCAACGGCAGCAGCAGCCCTCCTCGCCTCTCCCCGCCCCAGTACAGCCACCAGGTGCAGGTGAAGGAGGAGCCAGCA
GAGGCAGAGGAAGACAGGCAGCCCGGGCCTCCCCTGGGCGCCCCTAACCCCAGCGCCTCGGGGCCTCCGGAAGACAGGGACCTGGAGGAG
GAGCTGCCGGGAGAAGAACTGTCCTAAGGGCCTGTAGTGACCGGCAGGGCTGGGGTGAGACCCCTCCCTTCCAGAATCCAGGCCCCATCT
CCCCCAACTCCACAGCCCCTCCCGAGCCTCAAGGCAAGTCCAGGACTCAGACCGGGGAGGCCCGGGCCAGCAGCTCCCAGTGTGACCTGA
CAAAAACACGTAGGGGCAGGGACGGTCCCCACCCCCAGGGACACAACCCCTGGTCTTGGACCAGTAGAGGACACGGAGGGTTCAGACCCC
TCCTCAGACCCTCCCCACATCTGAAACTGCCTCCCCCCAACCACCAGCAGCAGCAGGGCCCTCCTCCCCCACCAGCTCTCCCCACAGGGC
CCCTCAGCATCATGGAGACCCGCAGGCGGGGCTTAGCCACCCCTCAAACCCAGGGCCCCCTGGCACCTGGCTCTGGCCGTGTTTTCTGGC
CAGAGGCCCCCACTTTCCTAACTCGTGCTCCCTTCCGCCTTCTTTTCCGTACTGTGAAGAAAGAACTCTCCACCCCAGCTCCCACCCTGC
CCTGGCCTGGGTGGAGGAACTGTGCCTCCATCCCCAGAAGAAACAGCCCCCTCTGCTGCTGGGGTGGGACTGTCTGTGTGCCCTGTGGGG
GTCCGTGTGAGCAGGCCCACCTGGCTCCAGACCCGCCCCCAACCTGAGACAGAACCAGGCTGAGCCAGGCCTCCACCCCCACCCCCGTTT
GCTGGGGGCTCCTCCAGCCGCCCCCATGGGAAGAGGCCTGGTACCGCCTCACCCACAGAGGTCTGTGCCAGGTGCGCTTCTGCAGGTGGA
GCCAAGCTCTCCCTGAGGCCAGAGGCGGGGCCTGGGCCGGGAGCCCAGGGGAAGGCCAGGCTGGACCCCGGCTCCACACCCACATCCAGC
CTGCAGGCCTCTCTGCAGTCCTCTCACCCTCCCTCAGCTCCCCTTCCTCTGCAGTCACCCTCAGCTCCCCTTCCTTGCCCGCCTCTCCCC

>82458_82458_5_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000380298_FOXP4_chr6_41545724_ENST00000409208_length(amino acids)=818AA_BP=218
MAIDRRREAAGGGPGRQPAPAEENGSLPPGDAAASAPLGGRAGPGGGAEIQPLPPLHPGGGPHPSCCSAAAAPSLLLLDYDGSVLPFLGG
LGGGYQKTLVLLTWIPALFIGFSQFSDSFLLDQPNFWCRGAGKGTELAGVTTTGRGGDMGNWTSLPTTPFATAPWEAAGNRSNSSGADGG
DTPPLPSPPDKGDNASNCDCRAWDYGIRAGLVQNVVSKALQVARQFLLQQASGLSSPGNNDSKQSASAVQVPVSVAMMSPQMLTPQQMQQ
ILSPPQLQALLQQQQALMLQQLQEYYKKQQEQLHLQLLTQQQAGKPQPKEALGNKQLAFQQQLLQMQQLQQQHLLNLQRQGLVSLQPNQA
SGPLQTLPQAAVCPTDLPQLWKGEGAPGQPAEDSVKQEGLDLTGTAATATSFAAPPKVSPPLSHHTLPNGQPTVLTSRRDSSSHEETPGS
HPLYGHGECKWPGCETLCEDLGQFIKHLNTEHALDDRSTAQCRVQMQVVQQLEIQLAKESERLQAMMAHLHMRPSEPKPFSQPVTVSAAD
SFPDGLVHPPTSAAAPVTPLRPPGLGSASLHGGGPARRRSSDKFCSPISSELAQNHEFYKNADVRPPFTYASLIRQAILETPDRQLTLNE
IYNWFTRMFAYFRRNTATWKNAVRHNLSLHKCFVRVENVKGAVWTVDEREYQKRRPPKMTGSPTLVKNMISGLSYGALNASYQAALAESS
FPLLNSPGMLNPGSASSLLPLSHDDVGAPVEPLPSNGSSSPPRLSPPQYSHQVQVKEEPAEAEEDRQPGPPLGAPNPSASGPPEDRDLEE

--------------------------------------------------------------
>82458_82458_6_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000406686_FOXP4_chr6_41545724_ENST00000307972_length(transcript)=4183nt_BP=654nt
ATGGCCATAGACCGGCGGCGCGAGGCGGCGGGCGGCGGGCCTGGGCGGCAGCCGGCCCCGGCCGAGGAGAACGGCTCCCTGCCGCCCGGG
GACGCGGCGGCCTCGGCGCCCCTCGGGGGACGCGCGGGCCCCGGCGGCGGCGCGGAGATCCAGCCGCTGCCCCCACTGCATCCTGGAGGC
GGCCCGCACCCGAGCTGCTGCTCCGCGGCTGCGGCCCCGAGCCTCTTGTTGCTGGACTATGACGGGTCGGTGCTGCCCTTCCTCGGGGGC
CTGGGCGGGGGCTATCAGAAGACCCTCGTGCTGCTCACCTGGATCCCGGCGCTGTTCATCGGCTTCAGCCAGTTCTCGGACTCGTTCCTC
CTGGACCAGCCCAACTTCTGGTGCCGCGGGGCCGGCAAAGGCACCGAGCTGGCAGGGGTCACCACCACAGGCCGGGGCGGGGACATGGGC
AACTGGACCAGCCTCCCCACCACCCCCTTCGCCACTGCCCCCTGGGAGGCTGCGGGCAACCGGAGCAACAGCAGCGGCGCGGACGGAGGC
GACACACCACCCCTGCCATCCCCTCCGGACAAGGGGGACAACGCCTCCAACTGTGACTGCCGCGCATGGGACTACGGCATCCGCGCCGGC
CTCGTCCAGAACGTGGTCAGCAAGGCTCTCCAAGTGGCCCGGCAGTTCCTGCTGCAGCAGGCCTCAGGCCTGAGCTCCCCAGGGAACAAT
GACAGCAAACAGTCTGCCTCTGCTGTGCAGGTGCCTGTGTCGGTGGCCATGATGTCGCCGCAGATGCTTACCCCGCAACAGATGCAGCAG
ATCCTGTCGCCCCCGCAGCTGCAGGCCTTGCTCCAGCAGCAGCAAGCCCTCATGCTCCAGCAGCTACAGGAGTACTACAAGAAGCAGCAG
GAGCAGCTCCACCTGCAGCTCCTCACCCAGCAGCAGGCTGGGAAACCGCAGCCCAAAGAGGCACTGGGGAACAAGCAGCTGGCCTTCCAG
CAGCAGCTCCTGCAAATGCAACAGTTGCAGCAGCAGCACCTGCTCAACCTGCAGAGGCAGGGGCTGGTCAGCCTGCAGCCCAACCAAGCC
TCGGGGCCCCTCCAGACCCTTCCGCAAGCAGCTGTTTGCCCAACAGACCTGCCCCAGCTGTGGAAGGGCGAGGGTGCCCCCGGGCAGCCT
GCCGAGGACAGCGTCAAGCAGGAGGGGCTGGACCTCACTGGCACGGCCGCCACCGCTACCTCGTTTGCCGCTCCCCCCAAGGTCTCACCC
CCCCTCTCCCACCATACCCTGCCCAACGGACAGCCTACTGTGCTCACATCTCGGAGAGACAGCTCTTCCCACGAGGAGACCCCCGGCTCC
CACCCCCTGTACGGACACGGAGAGTGCAAGTGGCCAGGCTGTGAGACCCTGTGTGAAGACCTGGGCCAGTTTATCAAACACCTCAACACA
GAGCACGCCCTGGATGACCGGAGTACAGCCCAGTGCCGGGTACAGATGCAGGTGGTGCAGCAGCTGGAGATCCAGCTCGCCAAGGAGAGC
GAGCGGCTGCAGGCCATGATGGCCCACCTGCACATGCGGCCCTCGGAGCCCAAGCCCTTCAGCCAGCCACTGAACCCGGTCCCCGGCTCC
TCCTCATTCTCCAAGGTGACCGTCTCTGCAGCAGACTCATTCCCAGATGGTCTCGTGCACCCCCCGACCTCGGCCGCAGCCCCTGTCACC
CCTCTACGGCCCCCTGGCCTGGGCTCTGCCTCCCTGCATGGTGGGGGCCCAGCCCGTCGGAGAAGCAGTGACAAGTTCTGCTCCCCCATC
TCCTCAGAGCTGGCCCAGAATCATGAGTTCTACAAGAACGCCGACGTCCGGCCCCCCTTCACCTACGCCTCCCTCATCCGCCAGGCCATC
CTGGAAACCCCTGACAGGCAGCTGACCCTGAATGAGATCTATAACTGGTTCACCAGGATGTTCGCCTATTTCCGCAGAAACACTGCCACC
TGGAAGAACGCCGTGCGCCACAACCTCAGCCTGCACAAGTGCTTCGTCCGCGTGGAGAACGTCAAGGGTGCCGTGTGGACTGTGGACGAG
CGGGAGTATCAGAAGCGGAGACCGCCAAAGATGACAGGGAGCCCCACCCTGGTGAAGAACATGATCTCTGGCCTCAGCTATGGAGCACTT
AATGCCAGCTACCAGGCCGCCCTGGCCGAGAGCAGCTTCCCCCTCCTCAACAGCCCTGGCATGCTGAACCCTGGCTCCGCCAGCAGCCTG
CTGCCCCTCAGCCACGATGACGTGGGTGCCCCCGTGGAGCCGCTGCCCAGCAACGGCAGCAGCAGCCCTCCTCGCCTCTCCCCGCCCCAG
TACAGCCACCAGGTGCAGGTGAAGGAGGAGCCAGCAGAGGCAGAGGAAGACAGGCAGCCCGGGCCTCCCCTGGGCGCCCCTAACCCCAGC
GCCTCGGGGCCTCCGGAAGACAGGGACCTGGAGGAGGAGCTGCCGGGAGAAGAACTGTCCTAAGGGCCTGTAGTGACCGGCAGGGCTGGG
GTGAGACCCCTCCCTTCCAGAATCCAGGCCCCATCTCCCCCAACTCCACAGCCCCTCCCGAGCCTCAAGGCAAGTCCAGGACTCAGACCG
GGGAGGCCCGGGCCAGCAGCTCCCAGTGTGACCTGACAAAAACACGTAGGGGCAGGGACGGTCCCCACCCCCAGGGACACAACCCCTGGT
CTTGGACCAGTAGAGGACACGGAGGGTTCAGACCCCTCCTCAGACCCTCCCCACATCTGAAACTGCCTCCCCCCAACCACCAGCAGCAGC
AGGGCCCTCCTCCCCCACCAGCTCTCCCCACAGGGCCCCTCAGCATCATGGAGACCCGCAGGCGGGGCTTAGCCACCCCTCAAACCCAGG
GCCCCCTGGCACCTGGCTCTGGCCGTGTTTTCTGGCCAGAGGCCCCCACTTTCCTAACTCGTGCTCCCTTCCGCCTTCTTTTCCGTACTG
TGAAGAAAGAACTCTCCACCCCAGCTCCCACCCTGCCCTGGCCTGGGTGGAGGAACTGTGCCTCCATCCCCAGAAGAAACAGCCCCCTCT
GCTGCTGGGGTGGGACTGTCTGTGTGCCCTGTGGGGGTCCGTGTGAGCAGGCCCACCTGGCTCCAGACCCGCCCCCAACCTGAGACAGAA
CCAGGCTGAGCCAGGCCTCCACCCCCACCCCCGTTTGCTGGGGGCTCCTCCAGCCGCCCCCATGGGAAGAGGCCTGGTACCGCCTCACCC
ACAGAGGTCTGTGCCAGGTGCGCTTCTGCAGGTGGAGCCAAGCTCTCCCTGAGGCCAGAGGCGGGGCCTGGGCCGGGAGCCCAGGGGAAG
GCCAGGCTGGACCCCGGCTCCACACCCACATCCAGCCTGCAGGCCTCTCTGCAGTCCTCTCACCCTCCCTCAGCTCCCCTTCCTCTGCAG
TCACCCTCAGCTCCCCTTCCTTGCCCGCCTCTCCCCCCGCCGCCCCACCAGTTAAACGGATGACCAAAGACCTTTCTTATGCCGGAAGCA
AAAACCAAAACTTTTTGTTGGCTTTTTCCTTTGTCGCCTCCCCAGCACCTGCCCTCCCAGTCTCCCACCCCGGCCCCAGGCTGGAAGCCC
TCCCTCCACTTAAGTTATTGTTTTAAACCAAAGTTTACAGTGTCTGTTGGTGGCCAAGACCTTCTCTCTCCACCCCTCCTCCATCCACCC
TGAGGACCCTGGGGCTCAGTGGAGGCAGGGCCCTGCCCCCCTCCCTTCCGCTCCTGCCCAGCCTGGGGGAAGGAGAAAGGAGGGGAGAAA
GCGGGCTCTCACCCCCTCAGGAGTGGGCACGGGAGCCCTTCTCCCTGACCCTGGGCTGCTTCCTGGGGGCTCTCCAGACCCCTCTCTAGG
ACCAAGTCACCCGTCGTGCTGGGAGTGTGGATTCTAGCAAAAGAGCTGGAAAAAAGTCAGACTCTCCACAGACCCCCTATGGGGGACCCC
CAACTCAAGGCCAAGGACTGGGCGTATCGGATGCTCATAACACCCCTGGCCTGGCCCCTTTACTGAGAAGACTCCTTGGATATTTCCCAA
GAACCCCCCACATACACCCCTCACAAGCCACCCCTCCTGAGAGGCAGGGGGCCCTCCGCCCCCTCCCCATGTATTCCCCACCTGTGTTCC

>82458_82458_6_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000406686_FOXP4_chr6_41545724_ENST00000307972_length(amino acids)=830AA_BP=218
MAIDRRREAAGGGPGRQPAPAEENGSLPPGDAAASAPLGGRAGPGGGAEIQPLPPLHPGGGPHPSCCSAAAAPSLLLLDYDGSVLPFLGG
LGGGYQKTLVLLTWIPALFIGFSQFSDSFLLDQPNFWCRGAGKGTELAGVTTTGRGGDMGNWTSLPTTPFATAPWEAAGNRSNSSGADGG
DTPPLPSPPDKGDNASNCDCRAWDYGIRAGLVQNVVSKALQVARQFLLQQASGLSSPGNNDSKQSASAVQVPVSVAMMSPQMLTPQQMQQ
ILSPPQLQALLQQQQALMLQQLQEYYKKQQEQLHLQLLTQQQAGKPQPKEALGNKQLAFQQQLLQMQQLQQQHLLNLQRQGLVSLQPNQA
SGPLQTLPQAAVCPTDLPQLWKGEGAPGQPAEDSVKQEGLDLTGTAATATSFAAPPKVSPPLSHHTLPNGQPTVLTSRRDSSSHEETPGS
HPLYGHGECKWPGCETLCEDLGQFIKHLNTEHALDDRSTAQCRVQMQVVQQLEIQLAKESERLQAMMAHLHMRPSEPKPFSQPLNPVPGS
SSFSKVTVSAADSFPDGLVHPPTSAAAPVTPLRPPGLGSASLHGGGPARRRSSDKFCSPISSELAQNHEFYKNADVRPPFTYASLIRQAI
LETPDRQLTLNEIYNWFTRMFAYFRRNTATWKNAVRHNLSLHKCFVRVENVKGAVWTVDEREYQKRRPPKMTGSPTLVKNMISGLSYGAL
NASYQAALAESSFPLLNSPGMLNPGSASSLLPLSHDDVGAPVEPLPSNGSSSPPRLSPPQYSHQVQVKEEPAEAEEDRQPGPPLGAPNPS

--------------------------------------------------------------
>82458_82458_7_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000406686_FOXP4_chr6_41545724_ENST00000373057_length(transcript)=3514nt_BP=654nt
ATGGCCATAGACCGGCGGCGCGAGGCGGCGGGCGGCGGGCCTGGGCGGCAGCCGGCCCCGGCCGAGGAGAACGGCTCCCTGCCGCCCGGG
GACGCGGCGGCCTCGGCGCCCCTCGGGGGACGCGCGGGCCCCGGCGGCGGCGCGGAGATCCAGCCGCTGCCCCCACTGCATCCTGGAGGC
GGCCCGCACCCGAGCTGCTGCTCCGCGGCTGCGGCCCCGAGCCTCTTGTTGCTGGACTATGACGGGTCGGTGCTGCCCTTCCTCGGGGGC
CTGGGCGGGGGCTATCAGAAGACCCTCGTGCTGCTCACCTGGATCCCGGCGCTGTTCATCGGCTTCAGCCAGTTCTCGGACTCGTTCCTC
CTGGACCAGCCCAACTTCTGGTGCCGCGGGGCCGGCAAAGGCACCGAGCTGGCAGGGGTCACCACCACAGGCCGGGGCGGGGACATGGGC
AACTGGACCAGCCTCCCCACCACCCCCTTCGCCACTGCCCCCTGGGAGGCTGCGGGCAACCGGAGCAACAGCAGCGGCGCGGACGGAGGC
GACACACCACCCCTGCCATCCCCTCCGGACAAGGGGGACAACGCCTCCAACTGTGACTGCCGCGCATGGGACTACGGCATCCGCGCCGGC
CTCGTCCAGAACGTGGTCAGCAAGGCTCTCCAAGTGGCCCGGCAGTTCCTGCTGCAGCAGGCCTCAGGCCTGAGCTCCCCAGGGAACAAT
GACAGCAAACAGTCTGCCTCTGCTGTGCAGGTGCCTGTGTCGGTGGCCATGATGTCGCCGCAGATGCTTACCCCGCAACAGATGCAGCAG
ATCCTGTCGCCCCCGCAGCTGCAGGCCTTGCTCCAGCAGCAGCAAGCCCTCATGCTCCAGCAGGAGTACTACAAGAAGCAGCAGGAGCAG
CTCCACCTGCAGCTCCTCACCCAGCAGCAGGCTGGGAAACCGCAGCCCAAAGAGGCACTGGGGAACAAGCAGCTGGCCTTCCAGCAGCAG
CTCCTGCAAATGCAACAGTTGCAGCAGCAGCACCTGCTCAACCTGCAGAGGCAGGGGCTGGTCAGCCTGCAGCCCAACCAAGCCTCGGGG
CCCCTCCAGACCCTTCCGCAAGCAGCTGTTTGCCCAACAGACCTGCCCCAGCTGTGGAAGGGCGAGGGTGCCCCCGGGCAGCCTGCCGAG
GACAGCGTCAAGCAGGAGGGGCTGGACCTCACTGGCACGGCCGCCACCGCTACCTCGTTTGCCGCTCCCCCCAAGGTCTCACCCCCCCTC
TCCCACCATACCCTGCCCAACGGACAGCCTACTGTGCTCACATCTCGGAGAGACAGCTCTTCCCACGAGGAGACCCCCGGCTCCCACCCC
CTGTACGGACACGGAGAGTGCAAGTGGCCAGGCTGTGAGACCCTGTGTGAAGACCTGGGCCAGTTTATCAAACACCTCAACACAGAGCAC
GCCCTGGATGACCGGAGTACAGCCCAGTGCCGGGTACAGATGCAGGTGGTGCAGCAGCTGGAGATCCAGCTCGCCAAGGAGAGCGAGCGG
CTGCAGGCCATGATGGCCCACCTGCACATGCGGCCCTCGGAGCCCAAGCCCTTCAGCCAGCCACTGAACCCGGTCCCCGGCTCCTCCTCA
TTCTCCAAGGTGACCGTCTCTGCAGCAGACTCATTCCCAGATGGTCTCGTGCACCCCCCGACCTCGGCCGCAGCCCCTGTCACCCCTCTA
CGGCCCCCTGGCCTGGGCTCTGCCTCCCTGCATGGTGGGGGCCCAGCCCGTCGGAGAAGCAGTGACAAGTTCTGCTCCCCCATCTCCTCA
GAGCTGGCCCAGAATCATGAGTTCTACAAGAACGCCGACGTCCGGCCCCCCTTCACCTACGCCTCCCTCATCCGCCAGGCCATCCTGGAA
ACCCCTGACAGGCAGCTGACCCTGAATGAGATCTATAACTGGTTCACCAGGATGTTCGCCTATTTCCGCAGAAACACTGCCACCTGGAAG
AACGCCGTGCGCCACAACCTCAGCCTGCACAAGTGCTTCGTCCGCGTGGAGAACGTCAAGGGTGCCGTGTGGACTGTGGACGAGCGGGAG
TATCAGAAGCGGAGACCGCCAAAGATGACAGGGAGCCCCACCCTGGTGAAGAACATGATCTCTGGCCTCAGCTATGGAGCACTTAATGCC
AGCTACCAGGCCGCCCTGGCCGAGAGCAGCTTCCCCCTCCTCAACAGCCCTGGCATGCTGAACCCTGGCTCCGCCAGCAGCCTGCTGCCC
CTCAGCCACGATGACGTGGGTGCCCCCGTGGAGCCGCTGCCCAGCAACGGCAGCAGCAGCCCTCCTCGCCTCTCCCCGCCCCAGTACAGC
CACCAGGTGCAGGTGAAGGAGGAGCCAGCAGAGGCAGAGGAAGACAGGCAGCCCGGGCCTCCCCTGGGCGCCCCTAACCCCAGCGCCTCG
GGGCCTCCGGAAGACAGGGACCTGGAGGAGGAGCTGCCGGGAGAAGAACTGTCCTAAGGGCCTGTAGTGACCGGCAGGGCTGGGGTGAGA
CCCCTCCCTTCCAGAATCCAGGCCCCATCTCCCCCAACTCCACAGCCCCTCCCGAGCCTCAAGGCAAGTCCAGGACTCAGACCGGGGAGG
CCCGGGCCAGCAGCTCCCAGTGTGACCTGACAAAAACACGTAGGGGCAGGGACGGTCCCCACCCCCAGGGACACAACCCCTGGTCTTGGA
CCAGTAGAGGACACGGAGGGTTCAGACCCCTCCTCAGACCCTCCCCACATCTGAAACTGCCTCCCCCCAACCACCAGCAGCAGCAGGGCC
CTCCTCCCCCACCAGCTCTCCCCACAGGGCCCCTCAGCATCATGGAGACCCGCAGGCGGGGCTTAGCCACCCCTCAAACCCAGGGCCCCC
TGGCACCTGGCTCTGGCCGTGTTTTCTGGCCAGAGGCCCCCACTTTCCTAACTCGTGCTCCCTTCCGCCTTCTTTTCCGTACTGTGAAGA
AAGAACTCTCCACCCCAGCTCCCACCCTGCCCTGGCCTGGGTGGAGGAACTGTGCCTCCATCCCCAGAAGAAACAGCCCCCTCTGCTGCT
GGGGTGGGACTGTCTGTGTGCCCTGTGGGGGTCCGTGTGAGCAGGCCCACCTGGCTCCAGACCCGCCCCCAACCTGAGACAGAACCAGGC
TGAGCCAGGCCTCCACCCCCACCCCCGTTTGCTGGGGGCTCCTCCAGCCGCCCCCATGGGAAGAGGCCTGGTACCGCCTCACCCACAGAG
GTCTGTGCCAGGTGCGCTTCTGCAGGTGGAGCCAAGCTCTCCCTGAGGCCAGAGGCGGGGCCTGGGCCGGGAGCCCAGGGGAAGGCCAGG
CTGGACCCCGGCTCCACACCCACATCCAGCCTGCAGGCCTCTCTGCAGTCCTCTCACCCTCCCTCAGCTCCCCTTCCTCTGCAGTCACCC
TCAGCTCCCCTTCCTTGCCCGCCTCTCCCCCCGCCGCCCCACCAGTTAAACGGATGACCAAAGACCTTTCTTATGCCGGAAGCAAAAACC

>82458_82458_7_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000406686_FOXP4_chr6_41545724_ENST00000373057_length(amino acids)=828AA_BP=218
MAIDRRREAAGGGPGRQPAPAEENGSLPPGDAAASAPLGGRAGPGGGAEIQPLPPLHPGGGPHPSCCSAAAAPSLLLLDYDGSVLPFLGG
LGGGYQKTLVLLTWIPALFIGFSQFSDSFLLDQPNFWCRGAGKGTELAGVTTTGRGGDMGNWTSLPTTPFATAPWEAAGNRSNSSGADGG
DTPPLPSPPDKGDNASNCDCRAWDYGIRAGLVQNVVSKALQVARQFLLQQASGLSSPGNNDSKQSASAVQVPVSVAMMSPQMLTPQQMQQ
ILSPPQLQALLQQQQALMLQQEYYKKQQEQLHLQLLTQQQAGKPQPKEALGNKQLAFQQQLLQMQQLQQQHLLNLQRQGLVSLQPNQASG
PLQTLPQAAVCPTDLPQLWKGEGAPGQPAEDSVKQEGLDLTGTAATATSFAAPPKVSPPLSHHTLPNGQPTVLTSRRDSSSHEETPGSHP
LYGHGECKWPGCETLCEDLGQFIKHLNTEHALDDRSTAQCRVQMQVVQQLEIQLAKESERLQAMMAHLHMRPSEPKPFSQPLNPVPGSSS
FSKVTVSAADSFPDGLVHPPTSAAAPVTPLRPPGLGSASLHGGGPARRRSSDKFCSPISSELAQNHEFYKNADVRPPFTYASLIRQAILE
TPDRQLTLNEIYNWFTRMFAYFRRNTATWKNAVRHNLSLHKCFVRVENVKGAVWTVDEREYQKRRPPKMTGSPTLVKNMISGLSYGALNA
SYQAALAESSFPLLNSPGMLNPGSASSLLPLSHDDVGAPVEPLPSNGSSSPPRLSPPQYSHQVQVKEEPAEAEEDRQPGPPLGAPNPSAS

--------------------------------------------------------------
>82458_82458_8_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000406686_FOXP4_chr6_41545724_ENST00000373060_length(transcript)=5940nt_BP=654nt
ATGGCCATAGACCGGCGGCGCGAGGCGGCGGGCGGCGGGCCTGGGCGGCAGCCGGCCCCGGCCGAGGAGAACGGCTCCCTGCCGCCCGGG
GACGCGGCGGCCTCGGCGCCCCTCGGGGGACGCGCGGGCCCCGGCGGCGGCGCGGAGATCCAGCCGCTGCCCCCACTGCATCCTGGAGGC
GGCCCGCACCCGAGCTGCTGCTCCGCGGCTGCGGCCCCGAGCCTCTTGTTGCTGGACTATGACGGGTCGGTGCTGCCCTTCCTCGGGGGC
CTGGGCGGGGGCTATCAGAAGACCCTCGTGCTGCTCACCTGGATCCCGGCGCTGTTCATCGGCTTCAGCCAGTTCTCGGACTCGTTCCTC
CTGGACCAGCCCAACTTCTGGTGCCGCGGGGCCGGCAAAGGCACCGAGCTGGCAGGGGTCACCACCACAGGCCGGGGCGGGGACATGGGC
AACTGGACCAGCCTCCCCACCACCCCCTTCGCCACTGCCCCCTGGGAGGCTGCGGGCAACCGGAGCAACAGCAGCGGCGCGGACGGAGGC
GACACACCACCCCTGCCATCCCCTCCGGACAAGGGGGACAACGCCTCCAACTGTGACTGCCGCGCATGGGACTACGGCATCCGCGCCGGC
CTCGTCCAGAACGTGGTCAGCAAGGCTCTCCAAGTGGCCCGGCAGTTCCTGCTGCAGCAGGCCTCAGGCCTGAGCTCCCCAGGGAACAAT
GACAGCAAACAGTCTGCCTCTGCTGTGCAGGTGCCTGTGTCGGTGGCCATGATGTCGCCGCAGATGCTTACCCCGCAACAGATGCAGCAG
ATCCTGTCGCCCCCGCAGCTGCAGGCCTTGCTCCAGCAGCAGCAAGCCCTCATGCTCCAGCAGCTACAGGAGTACTACAAGAAGCAGCAG
GAGCAGCTCCACCTGCAGCTCCTCACCCAGCAGCAGGCTGGGAAACCGCAGCCCAAAGAGGCACTGGGGAACAAGCAGCTGGCCTTCCAG
CAGCAGCTCCTGCAAATGCAACAGTTGCAGCAGCAGCACCTGCTCAACCTGCAGAGGCAGGGGCTGGTCAGCCTGCAGCCCAACCAAGCC
TCGGGGCCCCTCCAGACCCTTCCGCAAGCAGCTGTTTGCCCAACAGACCTGCCCCAGCTGTGGAAGGGCGAGGGTGCCCCCGGGCAGCCT
GCCGAGGACAGCGTCAAGCAGGAGGGGCTGGACCTCACTGGCACGGCCGCCACCGCTACCTCGTTTGCCGCTCCCCCCAAGGTCTCACCC
CCCCTCTCCCACCATACCCTGCCCAACGGACAGCCTACTGTGCTCACATCTCGGAGAGACAGCTCTTCCCACGAGGAGACCCCCGGCTCC
CACCCCCTGTACGGACACGGAGAGTGCAAGTGGCCAGGCTGTGAGACCCTGTGTGAAGACCTGGGCCAGTTTATCAAACACCTCAACACA
GAGCACGCCCTGGATGACCGGAGTACAGCCCAGTGCCGGGTACAGATGCAGGTGGTGCAGCAGCTGGAGATCCAGCTCGCCAAGGAGAGC
GAGCGGCTGCAGGCCATGATGGCCCACCTGCACATGCGGCCCTCGGAGCCCAAGCCCTTCAGCCAGCCACTGAACCCGGTCCCCGGCTCC
TCCTCATTCTCCAAGGTGACCGTCTCTGCAGCAGACTCATTCCCAGATGGTCTCGTGCACCCCCCGACCTCGGCCGCAGCCCCTGTCACC
CCTCTACGGCCCCCTGGCCTGGGCTCTGCCTCCCTGCATGGTGGGGGCCCAGCCCGTCGGAGAAGCAGTGACAAGTTCTGCTCCCCCATC
TCCTCAGAGCTGGCCCAGAATCATGAGTTCTACAAGAACGCCGACGTCCGGCCCCCCTTCACCTACGCCTCCCTCATCCGCCAGGCCATC
CTGGAAACCCCTGACAGGCAGCTGACCCTGAATGAGATCTATAACTGGTTCACCAGGATGTTCGCCTATTTCCGCAGAAACACTGCCACC
TGGAAGAACGCCGTGCGCCACAACCTCAGCCTGCACAAGTGCTTCGTCCGCGTGGAGAACGTCAAGGGTGCCGTGTGGACTGTGGACGAG
CGGGAGTATCAGAAGCGGAGACCGCCAAAGATGACAGGGAGCCCCACCCTGGTGAAGAACATGATCTCTGGCCTCAGCTATGGAGCACTT
AATGCCAGCTACCAGGCCGCCCTGGCCGAGAGCAGCTTCCCCCTCCTCAACAGCCCTGGCATGCTGAACCCTGGCTCCGCCAGCAGCCTG
CTGCCCCTCAGCCACGATGACGTGGGTGCCCCCGTGGAGCCGCTGCCCAGCAACGGCAGCAGCAGCCCTCCTCGCCTCTCCCCGCCCCAG
TACAGCCACCAGGTGCAGGTGAAGGAGGAGCCAGCAGAGGCAGAGGAAGACAGGCAGCCCGGGCCTCCCCTGGGCGCCCCTAACCCCAGC
GCCTCGGGGCCTCCGGAAGACAGGGACCTGGAGGAGGAGCTGCCGGGAGAAGAACTGTCCTAAGGGCCTGTAGTGACCGGCAGGGCTGGG
GTGAGACCCCTCCCTTCCAGAATCCAGGCCCCATCTCCCCCAACTCCACAGCCCCTCCCGAGCCTCAAGGCAAGTCCAGGACTCAGACCG
GGGAGGCCCGGGCCAGCAGCTCCCAGTGTGACCTGACAAAAACACGTAGGGGCAGGGACGGTCCCCACCCCCAGGGACACAACCCCTGGT
CTTGGACCAGTAGAGGACACGGAGGGTTCAGACCCCTCCTCAGACCCTCCCCACATCTGAAACTGCCTCCCCCCAACCACCAGCAGCAGC
AGGGCCCTCCTCCCCCACCAGCTCTCCCCACAGGGCCCCTCAGCATCATGGAGACCCGCAGGCGGGGCTTAGCCACCCCTCAAACCCAGG
GCCCCCTGGCACCTGGCTCTGGCCGTGTTTTCTGGCCAGAGGCCCCCACTTTCCTAACTCGTGCTCCCTTCCGCCTTCTTTTCCGTACTG
TGAAGAAAGAACTCTCCACCCCAGCTCCCACCCTGCCCTGGCCTGGGTGGAGGAACTGTGCCTCCATCCCCAGAAGAAACAGCCCCCTCT
GCTGCTGGGGTGGGACTGTCTGTGTGCCCTGTGGGGGTCCGTGTGAGCAGGCCCACCTGGCTCCAGACCCGCCCCCAACCTGAGACAGAA
CCAGGCTGAGCCAGGCCTCCACCCCCACCCCCGTTTGCTGGGGGCTCCTCCAGCCGCCCCCATGGGAAGAGGCCTGGTACCGCCTCACCC
ACAGAGGTCTGTGCCAGGTGCGCTTCTGCAGGTGGAGCCAAGCTCTCCCTGAGGCCAGAGGCGGGGCCTGGGCCGGGAGCCCAGGGGAAG
GCCAGGCTGGACCCCGGCTCCACACCCACATCCAGCCTGCAGGCCTCTCTGCAGTCCTCTCACCCTCCCTCAGCTCCCCTTCCTCTGCAG
TCACCCTCAGCTCCCCTTCCTTGCCCGCCTCTCCCCCCGCCGCCCCACCAGTTAAACGGATGACCAAAGACCTTTCTTATGCCGGAAGCA
AAAACCAAAACTTTTTGTTGGCTTTTTCCTTTGTCGCCTCCCCAGCACCTGCCCTCCCAGTCTCCCACCCCGGCCCCAGGCTGGAAGCCC
TCCCTCCACTTAAGTTATTGTTTTAAACCAAAGTTTACAGTGTCTGTTGGTGGCCAAGACCTTCTCTCTCCACCCCTCCTCCATCCACCC
TGAGGACCCTGGGGCTCAGTGGAGGCAGGGCCCTGCCCCCCTCCCTTCCGCTCCTGCCCAGCCTGGGGGAAGGAGAAAGGAGGGGAGAAA
GCGGGCTCTCACCCCCTCAGGAGTGGGCACGGGAGCCCTTCTCCCTGACCCTGGGCTGCTTCCTGGGGGCTCTCCAGACCCCTCTCTAGG
ACCAAGTCACCCGTCGTGCTGGGAGTGTGGATTCTAGCAAAAGAGCTGGAAAAAAGTCAGACTCTCCACAGACCCCCTATGGGGGACCCC
CAACTCAAGGCCAAGGACTGGGCGTATCGGATGCTCATAACACCCCTGGCCTGGCCCCTTTACTGAGAAGACTCCTTGGATATTTCCCAA
GAACCCCCCACATACACCCCTCACAAGCCACCCCTCCTGAGAGGCAGGGGGCCCTCCGCCCCCTCCCCATGTATTCCCCACCTGTGTTCC
GTTTGACCAGCACAGAAATATTAAACGTCCTCTATTCACCGGGCCCTGTGTGTGTCACCGAGGTGCGGGAGGGGAGGAGCATTAAAGCTG
AAAGATCGCTCTGCTCGGGGAGCCTGGGCAGAGCAGCAGCAACGTGAGGGTCGCTGTGGTGGTGGTTTCTGTGAGTGGATGGAATGAGCA
GCCCTGCAGGGGCGCTGGGCATGTGCCCTCACTGTGGACAGGGCCCACCCACCTCCGGTTCCCCTGTGCCTCCTGTCCCTTCCACGCTTA
AACAGGGTTCTCTGTCATTTTCCTGTTTTCTTCCAGAGTCCCAATCCTTTGCCCTAGTTCTTTCACTAGTTTGAAATCCAAGTTCTTGCC
AGAGTGTTGGAGCAAGGCAGCTGATTTGCTGCAGGGATGGAGAGGACCACCGCCGCAGGGTTCTTTTACCTGTGCCACCAGCTCTGGAGA
CATCCACACCCATTCCCAGGTGTCCTTCCAGGCCCATTGTCCACGTCTTCAAGGGGGCTGCCTGGATAGGCGTGTGTGTGTGTAGTGCCC
AGGTGTGGTGGTGGCATCCTGAAGCAGTAGGACCATTAGTGTGCGTGCACACCCACGTGGCACACTGTGTGGTGACCATGGTCATCATAG
TGGGCTCACGGCGGAAACGGGATCATTCAACCTATACAAAGGGGACCCTGGATAGGCTGCAAGGAAAGAGGCCAAGGGCCAAGCTGCTAA
GCCAAAGATGGCCCCTGACACCTCCCCCAGCCCCAGCACACCAGCCTGCACCTCTTATCCCGTGTTAAGTCCTGGTTCCCCACCTGCTGC
CCCTCCTCAACACAGAGCCCCTCCCGCCGCCTCAGACCCCTGTGCACATCCCCAGGGCCTCAGCCGTCTCATTGGTCTTTATTTTTTATT
TTTTTTAAGATGGAGTTTCGCTCTTATTGCCCAGACTGGAGTGCAGTGATGCTATCTCGGCTCACTGCAACCTTTGCCTCCCAGGTTCAA
GTGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCGTGCACCACCACGCCGGGCTAAATTTTTTTGTATTTTTAGTAGAGA
CGGGGTTTCTCTATGTTGGTCAGGCTGATCTCGAACTCCCGACCTCAGGTGATCCGCCAGCCTCAGCCTCCCAAAGTGCTGGGATTACAG
GCGTGAGCCACTGCACCCGGCTCTCACTGGTCTTACGCCACCTTCTGGACACTCCCTCCTTGAGGGCAGAAAGGAGTCCCAGGCCTGTCC
CTAGGGACAAGGCCCAGGGAAGAGTGTATTTGGGGAGCAGGGGAGGGGAGGGTGTTGAGAAAGCTGAACTGGAGTCAATCACCCTTCCCA
CAAATCACCAAACTGCTGGAACTCTCCAGCCAAATGCTGGGAGAAGGACCTGGAGGGTGAGTCTTTGCTGACCTCTCTCTACTCTCAGGC
ATGTCTTTTGTCCTTTTCGTCCATCTATTTCTGTCTGTCGCTCACTCGCCCCGCTTTCTCTGTCTCACCTTCATCCACTCTGCAGGCCTG
CTCCACCACAGCCCTAATCCTCTGGACGCTTGTGTAGGGCCTGGGGTGAATTCCCTGTCCCCCATGGTACCTCGAGAGGGGCTGGGGAGC
TCAGCTTGGTCTCAGAGTCTCCCCACCAGATACTGTTTAAAAAAGTAGCACTGATGTGTTTTGTAATCTGCCCCTCCCAGCCCTCCGTGG
AGGCTGCCAGGGCCTTGTACGGTAAACCTAGCTGCATGTAATCTGTGGACAATGGCATTCTCTACAATGCAATAAAAACAATTACCCATG

>82458_82458_8_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000406686_FOXP4_chr6_41545724_ENST00000373060_length(amino acids)=830AA_BP=218
MAIDRRREAAGGGPGRQPAPAEENGSLPPGDAAASAPLGGRAGPGGGAEIQPLPPLHPGGGPHPSCCSAAAAPSLLLLDYDGSVLPFLGG
LGGGYQKTLVLLTWIPALFIGFSQFSDSFLLDQPNFWCRGAGKGTELAGVTTTGRGGDMGNWTSLPTTPFATAPWEAAGNRSNSSGADGG
DTPPLPSPPDKGDNASNCDCRAWDYGIRAGLVQNVVSKALQVARQFLLQQASGLSSPGNNDSKQSASAVQVPVSVAMMSPQMLTPQQMQQ
ILSPPQLQALLQQQQALMLQQLQEYYKKQQEQLHLQLLTQQQAGKPQPKEALGNKQLAFQQQLLQMQQLQQQHLLNLQRQGLVSLQPNQA
SGPLQTLPQAAVCPTDLPQLWKGEGAPGQPAEDSVKQEGLDLTGTAATATSFAAPPKVSPPLSHHTLPNGQPTVLTSRRDSSSHEETPGS
HPLYGHGECKWPGCETLCEDLGQFIKHLNTEHALDDRSTAQCRVQMQVVQQLEIQLAKESERLQAMMAHLHMRPSEPKPFSQPLNPVPGS
SSFSKVTVSAADSFPDGLVHPPTSAAAPVTPLRPPGLGSASLHGGGPARRRSSDKFCSPISSELAQNHEFYKNADVRPPFTYASLIRQAI
LETPDRQLTLNEIYNWFTRMFAYFRRNTATWKNAVRHNLSLHKCFVRVENVKGAVWTVDEREYQKRRPPKMTGSPTLVKNMISGLSYGAL
NASYQAALAESSFPLLNSPGMLNPGSASSLLPLSHDDVGAPVEPLPSNGSSSPPRLSPPQYSHQVQVKEEPAEAEEDRQPGPPLGAPNPS

--------------------------------------------------------------
>82458_82458_9_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000406686_FOXP4_chr6_41545724_ENST00000373063_length(transcript)=5902nt_BP=654nt
ATGGCCATAGACCGGCGGCGCGAGGCGGCGGGCGGCGGGCCTGGGCGGCAGCCGGCCCCGGCCGAGGAGAACGGCTCCCTGCCGCCCGGG
GACGCGGCGGCCTCGGCGCCCCTCGGGGGACGCGCGGGCCCCGGCGGCGGCGCGGAGATCCAGCCGCTGCCCCCACTGCATCCTGGAGGC
GGCCCGCACCCGAGCTGCTGCTCCGCGGCTGCGGCCCCGAGCCTCTTGTTGCTGGACTATGACGGGTCGGTGCTGCCCTTCCTCGGGGGC
CTGGGCGGGGGCTATCAGAAGACCCTCGTGCTGCTCACCTGGATCCCGGCGCTGTTCATCGGCTTCAGCCAGTTCTCGGACTCGTTCCTC
CTGGACCAGCCCAACTTCTGGTGCCGCGGGGCCGGCAAAGGCACCGAGCTGGCAGGGGTCACCACCACAGGCCGGGGCGGGGACATGGGC
AACTGGACCAGCCTCCCCACCACCCCCTTCGCCACTGCCCCCTGGGAGGCTGCGGGCAACCGGAGCAACAGCAGCGGCGCGGACGGAGGC
GACACACCACCCCTGCCATCCCCTCCGGACAAGGGGGACAACGCCTCCAACTGTGACTGCCGCGCATGGGACTACGGCATCCGCGCCGGC
CTCGTCCAGAACGTGGTCAGCAAGGCTCTCCAAGTGGCCCGGCAGTTCCTGCTGCAGCAGGCCTCAGGCCTGAGCTCCCCAGGGAACAAT
GACAGCAAACAGTCTGCCTCTGCTGTGCAGGTGCCTGTGTCGGTGGCCATGATGTCGCCGCAGATGCTTACCCCGCAACAGATGCAGCAG
ATCCTGTCGCCCCCGCAGCTGCAGGCCTTGCTCCAGCAGCAGCAAGCCCTCATGCTCCAGCAGCTACAGGAGTACTACAAGAAGCAGCAG
GAGCAGCTCCACCTGCAGCTCCTCACCCAGCAGCAGGCTGGGAAACCGCAGCCCAAAGAGGCACTGGGGAACAAGCAGCTGGCCTTCCAG
CAGCAGCTCCTGCAAATGCAACAGTTGCAGCAGCAGCACCTGCTCAACCTGCAGAGGCAGGGGCTGGTCAGCCTGCAGCCCAACCAAGCC
TCGGGGCCCCTCCAGACCCTTCCGCAAGCTGTTTGCCCAACAGACCTGCCCCAGCTGTGGAAGGGCGAGGGTGCCCCCGGGCAGCCTGCC
GAGGACAGCGTCAAGCAGGAGGGGCTGGACCTCACTGGCACGGCCGCCACCGCTACCTCGTTTGCCGCTCCCCCCAAGGTCTCACCCCCC
CTCTCCCACCATACCCTGCCCAACGGACAGCCTACTGTGCTCACATCTCGGAGAGACAGCTCTTCCCACGAGGAGACCCCCGGCTCCCAC
CCCCTGTACGGACACGGAGAGTGCAAGTGGCCAGGCTGTGAGACCCTGTGTGAAGACCTGGGCCAGTTTATCAAACACCTCAACACAGAG
CACGCCCTGGATGACCGGAGTACAGCCCAGTGCCGGGTACAGATGCAGGTGGTGCAGCAGCTGGAGATCCAGCTCGCCAAGGAGAGCGAG
CGGCTGCAGGCCATGATGGCCCACCTGCACATGCGGCCCTCGGAGCCCAAGCCCTTCAGCCAGCCAGTGACCGTCTCTGCAGCAGACTCA
TTCCCAGATGGTCTCGTGCACCCCCCGACCTCGGCCGCAGCCCCTGTCACCCCTCTACGGCCCCCTGGCCTGGGCTCTGCCTCCCTGCAT
GGTGGGGGCCCAGCCCGTCGGAGAAGCAGTGACAAGTTCTGCTCCCCCATCTCCTCAGAGCTGGCCCAGAATCATGAGTTCTACAAGAAC
GCCGACGTCCGGCCCCCCTTCACCTACGCCTCCCTCATCCGCCAGGCCATCCTGGAAACCCCTGACAGGCAGCTGACCCTGAATGAGATC
TATAACTGGTTCACCAGGATGTTCGCCTATTTCCGCAGAAACACTGCCACCTGGAAGAACGCCGTGCGCCACAACCTCAGCCTGCACAAG
TGCTTCGTCCGCGTGGAGAACGTCAAGGGTGCCGTGTGGACTGTGGACGAGCGGGAGTATCAGAAGCGGAGACCGCCAAAGATGACAGGG
AGCCCCACCCTGGTGAAGAACATGATCTCTGGCCTCAGCTATGGAGCACTTAATGCCAGCTACCAGGCCGCCCTGGCCGAGAGCAGCTTC
CCCCTCCTCAACAGCCCTGGCATGCTGAACCCTGGCTCCGCCAGCAGCCTGCTGCCCCTCAGCCACGATGACGTGGGTGCCCCCGTGGAG
CCGCTGCCCAGCAACGGCAGCAGCAGCCCTCCTCGCCTCTCCCCGCCCCAGTACAGCCACCAGGTGCAGGTGAAGGAGGAGCCAGCAGAG
GCAGAGGAAGACAGGCAGCCCGGGCCTCCCCTGGGCGCCCCTAACCCCAGCGCCTCGGGGCCTCCGGAAGACAGGGACCTGGAGGAGGAG
CTGCCGGGAGAAGAACTGTCCTAAGGGCCTGTAGTGACCGGCAGGGCTGGGGTGAGACCCCTCCCTTCCAGAATCCAGGCCCCATCTCCC
CCAACTCCACAGCCCCTCCCGAGCCTCAAGGCAAGTCCAGGACTCAGACCGGGGAGGCCCGGGCCAGCAGCTCCCAGTGTGACCTGACAA
AAACACGTAGGGGCAGGGACGGTCCCCACCCCCAGGGACACAACCCCTGGTCTTGGACCAGTAGAGGACACGGAGGGTTCAGACCCCTCC
TCAGACCCTCCCCACATCTGAAACTGCCTCCCCCCAACCACCAGCAGCAGCAGGGCCCTCCTCCCCCACCAGCTCTCCCCACAGGGCCCC
TCAGCATCATGGAGACCCGCAGGCGGGGCTTAGCCACCCCTCAAACCCAGGGCCCCCTGGCACCTGGCTCTGGCCGTGTTTTCTGGCCAG
AGGCCCCCACTTTCCTAACTCGTGCTCCCTTCCGCCTTCTTTTCCGTACTGTGAAGAAAGAACTCTCCACCCCAGCTCCCACCCTGCCCT
GGCCTGGGTGGAGGAACTGTGCCTCCATCCCCAGAAGAAACAGCCCCCTCTGCTGCTGGGGTGGGACTGTCTGTGTGCCCTGTGGGGGTC
CGTGTGAGCAGGCCCACCTGGCTCCAGACCCGCCCCCAACCTGAGACAGAACCAGGCTGAGCCAGGCCTCCACCCCCACCCCCGTTTGCT
GGGGGCTCCTCCAGCCGCCCCCATGGGAAGAGGCCTGGTACCGCCTCACCCACAGAGGTCTGTGCCAGGTGCGCTTCTGCAGGTGGAGCC
AAGCTCTCCCTGAGGCCAGAGGCGGGGCCTGGGCCGGGAGCCCAGGGGAAGGCCAGGCTGGACCCCGGCTCCACACCCACATCCAGCCTG
CAGGCCTCTCTGCAGTCCTCTCACCCTCCCTCAGCTCCCCTTCCTCTGCAGTCACCCTCAGCTCCCCTTCCTTGCCCGCCTCTCCCCCCG
CCGCCCCACCAGTTAAACGGATGACCAAAGACCTTTCTTATGCCGGAAGCAAAAACCAAAACTTTTTGTTGGCTTTTTCCTTTGTCGCCT
CCCCAGCACCTGCCCTCCCAGTCTCCCACCCCGGCCCCAGGCTGGAAGCCCTCCCTCCACTTAAGTTATTGTTTTAAACCAAAGTTTACA
GTGTCTGTTGGTGGCCAAGACCTTCTCTCTCCACCCCTCCTCCATCCACCCTGAGGACCCTGGGGCTCAGTGGAGGCAGGGCCCTGCCCC
CCTCCCTTCCGCTCCTGCCCAGCCTGGGGGAAGGAGAAAGGAGGGGAGAAAGCGGGCTCTCACCCCCTCAGGAGTGGGCACGGGAGCCCT
TCTCCCTGACCCTGGGCTGCTTCCTGGGGGCTCTCCAGACCCCTCTCTAGGACCAAGTCACCCGTCGTGCTGGGAGTGTGGATTCTAGCA
AAAGAGCTGGAAAAAAGTCAGACTCTCCACAGACCCCCTATGGGGGACCCCCAACTCAAGGCCAAGGACTGGGCGTATCGGATGCTCATA
ACACCCCTGGCCTGGCCCCTTTACTGAGAAGACTCCTTGGATATTTCCCAAGAACCCCCCACATACACCCCTCACAAGCCACCCCTCCTG
AGAGGCAGGGGGCCCTCCGCCCCCTCCCCATGTATTCCCCACCTGTGTTCCGTTTGACCAGCACAGAAATATTAAACGTCCTCTATTCAC
CGGGCCCTGTGTGTGTCACCGAGGTGCGGGAGGGGAGGAGCATTAAAGCTGAAAGATCGCTCTGCTCGGGGAGCCTGGGCAGAGCAGCAG
CAACGTGAGGGTCGCTGTGGTGGTGGTTTCTGTGAGTGGATGGAATGAGCAGCCCTGCAGGGGCGCTGGGCATGTGCCCTCACTGTGGAC
AGGGCCCACCCACCTCCGGTTCCCCTGTGCCTCCTGTCCCTTCCACGCTTAAACAGGGTTCTCTGTCATTTTCCTGTTTTCTTCCAGAGT
CCCAATCCTTTGCCCTAGTTCTTTCACTAGTTTGAAATCCAAGTTCTTGCCAGAGTGTTGGAGCAAGGCAGCTGATTTGCTGCAGGGATG
GAGAGGACCACCGCCGCAGGGTTCTTTTACCTGTGCCACCAGCTCTGGAGACATCCACACCCATTCCCAGGTGTCCTTCCAGGCCCATTG
TCCACGTCTTCAAGGGGGCTGCCTGGATAGGCGTGTGTGTGTGTAGTGCCCAGGTGTGGTGGTGGCATCCTGAAGCAGTAGGACCATTAG
TGTGCGTGCACACCCACGTGGCACACTGTGTGGTGACCATGGTCATCATAGTGGGCTCACGGCGGAAACGGGATCATTCAACCTATACAA
AGGGGACCCTGGATAGGCTGCAAGGAAAGAGGCCAAGGGCCAAGCTGCTAAGCCAAAGATGGCCCCTGACACCTCCCCCAGCCCCAGCAC
ACCAGCCTGCACCTCTTATCCCGTGTTAAGTCCTGGTTCCCCACCTGCTGCCCCTCCTCAACACAGAGCCCCTCCCGCCGCCTCAGACCC
CTGTGCACATCCCCAGGGCCTCAGCCGTCTCATTGGTCTTTATTTTTTATTTTTTTTAAGATGGAGTTTCGCTCTTATTGCCCAGACTGG
AGTGCAGTGATGCTATCTCGGCTCACTGCAACCTTTGCCTCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTA
CAGGCGTGCACCACCACGCCGGGCTAAATTTTTTTGTATTTTTAGTAGAGACGGGGTTTCTCTATGTTGGTCAGGCTGATCTCGAACTCC
CGACCTCAGGTGATCCGCCAGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGCACCCGGCTCTCACTGGTCTTACGCC
ACCTTCTGGACACTCCCTCCTTGAGGGCAGAAAGGAGTCCCAGGCCTGTCCCTAGGGACAAGGCCCAGGGAAGAGTGTATTTGGGGAGCA
GGGGAGGGGAGGGTGTTGAGAAAGCTGAACTGGAGTCAATCACCCTTCCCACAAATCACCAAACTGCTGGAACTCTCCAGCCAAATGCTG
GGAGAAGGACCTGGAGGGTGAGTCTTTGCTGACCTCTCTCTACTCTCAGGCATGTCTTTTGTCCTTTTCGTCCATCTATTTCTGTCTGTC
GCTCACTCGCCCCGCTTTCTCTGTCTCACCTTCATCCACTCTGCAGGCCTGCTCCACCACAGCCCTAATCCTCTGGACGCTTGTGTAGGG
CCTGGGGTGAATTCCCTGTCCCCCATGGTACCTCGAGAGGGGCTGGGGAGCTCAGCTTGGTCTCAGAGTCTCCCCACCAGATACTGTTTA
AAAAAGTAGCACTGATGTGTTTTGTAATCTGCCCCTCCCAGCCCTCCGTGGAGGCTGCCAGGGCCTTGTACGGTAAACCTAGCTGCATGT

>82458_82458_9_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000406686_FOXP4_chr6_41545724_ENST00000373063_length(amino acids)=817AA_BP=218
MAIDRRREAAGGGPGRQPAPAEENGSLPPGDAAASAPLGGRAGPGGGAEIQPLPPLHPGGGPHPSCCSAAAAPSLLLLDYDGSVLPFLGG
LGGGYQKTLVLLTWIPALFIGFSQFSDSFLLDQPNFWCRGAGKGTELAGVTTTGRGGDMGNWTSLPTTPFATAPWEAAGNRSNSSGADGG
DTPPLPSPPDKGDNASNCDCRAWDYGIRAGLVQNVVSKALQVARQFLLQQASGLSSPGNNDSKQSASAVQVPVSVAMMSPQMLTPQQMQQ
ILSPPQLQALLQQQQALMLQQLQEYYKKQQEQLHLQLLTQQQAGKPQPKEALGNKQLAFQQQLLQMQQLQQQHLLNLQRQGLVSLQPNQA
SGPLQTLPQAVCPTDLPQLWKGEGAPGQPAEDSVKQEGLDLTGTAATATSFAAPPKVSPPLSHHTLPNGQPTVLTSRRDSSSHEETPGSH
PLYGHGECKWPGCETLCEDLGQFIKHLNTEHALDDRSTAQCRVQMQVVQQLEIQLAKESERLQAMMAHLHMRPSEPKPFSQPVTVSAADS
FPDGLVHPPTSAAAPVTPLRPPGLGSASLHGGGPARRRSSDKFCSPISSELAQNHEFYKNADVRPPFTYASLIRQAILETPDRQLTLNEI
YNWFTRMFAYFRRNTATWKNAVRHNLSLHKCFVRVENVKGAVWTVDEREYQKRRPPKMTGSPTLVKNMISGLSYGALNASYQAALAESSF
PLLNSPGMLNPGSASSLLPLSHDDVGAPVEPLPSNGSSSPPRLSPPQYSHQVQVKEEPAEAEEDRQPGPPLGAPNPSASGPPEDRDLEEE

--------------------------------------------------------------
>82458_82458_10_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000406686_FOXP4_chr6_41545724_ENST00000409208_length(transcript)=3480nt_BP=654nt
ATGGCCATAGACCGGCGGCGCGAGGCGGCGGGCGGCGGGCCTGGGCGGCAGCCGGCCCCGGCCGAGGAGAACGGCTCCCTGCCGCCCGGG
GACGCGGCGGCCTCGGCGCCCCTCGGGGGACGCGCGGGCCCCGGCGGCGGCGCGGAGATCCAGCCGCTGCCCCCACTGCATCCTGGAGGC
GGCCCGCACCCGAGCTGCTGCTCCGCGGCTGCGGCCCCGAGCCTCTTGTTGCTGGACTATGACGGGTCGGTGCTGCCCTTCCTCGGGGGC
CTGGGCGGGGGCTATCAGAAGACCCTCGTGCTGCTCACCTGGATCCCGGCGCTGTTCATCGGCTTCAGCCAGTTCTCGGACTCGTTCCTC
CTGGACCAGCCCAACTTCTGGTGCCGCGGGGCCGGCAAAGGCACCGAGCTGGCAGGGGTCACCACCACAGGCCGGGGCGGGGACATGGGC
AACTGGACCAGCCTCCCCACCACCCCCTTCGCCACTGCCCCCTGGGAGGCTGCGGGCAACCGGAGCAACAGCAGCGGCGCGGACGGAGGC
GACACACCACCCCTGCCATCCCCTCCGGACAAGGGGGACAACGCCTCCAACTGTGACTGCCGCGCATGGGACTACGGCATCCGCGCCGGC
CTCGTCCAGAACGTGGTCAGCAAGGCTCTCCAAGTGGCCCGGCAGTTCCTGCTGCAGCAGGCCTCAGGCCTGAGCTCCCCAGGGAACAAT
GACAGCAAACAGTCTGCCTCTGCTGTGCAGGTGCCTGTGTCGGTGGCCATGATGTCGCCGCAGATGCTTACCCCGCAACAGATGCAGCAG
ATCCTGTCGCCCCCGCAGCTGCAGGCCTTGCTCCAGCAGCAGCAAGCCCTCATGCTCCAGCAGCTACAGGAGTACTACAAGAAGCAGCAG
GAGCAGCTCCACCTGCAGCTCCTCACCCAGCAGCAGGCTGGGAAACCGCAGCCCAAAGAGGCACTGGGGAACAAGCAGCTGGCCTTCCAG
CAGCAGCTCCTGCAAATGCAACAGTTGCAGCAGCAGCACCTGCTCAACCTGCAGAGGCAGGGGCTGGTCAGCCTGCAGCCCAACCAAGCC
TCGGGGCCCCTCCAGACCCTTCCGCAAGCAGCTGTTTGCCCAACAGACCTGCCCCAGCTGTGGAAGGGCGAGGGTGCCCCCGGGCAGCCT
GCCGAGGACAGCGTCAAGCAGGAGGGGCTGGACCTCACTGGCACGGCCGCCACCGCTACCTCGTTTGCCGCTCCCCCCAAGGTCTCACCC
CCCCTCTCCCACCATACCCTGCCCAACGGACAGCCTACTGTGCTCACATCTCGGAGAGACAGCTCTTCCCACGAGGAGACCCCCGGCTCC
CACCCCCTGTACGGACACGGAGAGTGCAAGTGGCCAGGCTGTGAGACCCTGTGTGAAGACCTGGGCCAGTTTATCAAACACCTCAACACA
GAGCACGCCCTGGATGACCGGAGTACAGCCCAGTGCCGGGTACAGATGCAGGTGGTGCAGCAGCTGGAGATCCAGCTCGCCAAGGAGAGC
GAGCGGCTGCAGGCCATGATGGCCCACCTGCACATGCGGCCCTCGGAGCCCAAGCCCTTCAGCCAGCCAGTGACCGTCTCTGCAGCAGAC
TCATTCCCAGATGGTCTCGTGCACCCCCCGACCTCGGCCGCAGCCCCTGTCACCCCTCTACGGCCCCCTGGCCTGGGCTCTGCCTCCCTG
CATGGTGGGGGCCCAGCCCGTCGGAGAAGCAGTGACAAGTTCTGCTCCCCCATCTCCTCAGAGCTGGCCCAGAATCATGAGTTCTACAAG
AACGCCGACGTCCGGCCCCCCTTCACCTACGCCTCCCTCATCCGCCAGGCCATCCTGGAAACCCCTGACAGGCAGCTGACCCTGAATGAG
ATCTATAACTGGTTCACCAGGATGTTCGCCTATTTCCGCAGAAACACTGCCACCTGGAAGAACGCCGTGCGCCACAACCTCAGCCTGCAC
AAGTGCTTCGTCCGCGTGGAGAACGTCAAGGGTGCCGTGTGGACTGTGGACGAGCGGGAGTATCAGAAGCGGAGACCGCCAAAGATGACA
GGGAGCCCCACCCTGGTGAAGAACATGATCTCTGGCCTCAGCTATGGAGCACTTAATGCCAGCTACCAGGCCGCCCTGGCCGAGAGCAGC
TTCCCCCTCCTCAACAGCCCTGGCATGCTGAACCCTGGCTCCGCCAGCAGCCTGCTGCCCCTCAGCCACGATGACGTGGGTGCCCCCGTG
GAGCCGCTGCCCAGCAACGGCAGCAGCAGCCCTCCTCGCCTCTCCCCGCCCCAGTACAGCCACCAGGTGCAGGTGAAGGAGGAGCCAGCA
GAGGCAGAGGAAGACAGGCAGCCCGGGCCTCCCCTGGGCGCCCCTAACCCCAGCGCCTCGGGGCCTCCGGAAGACAGGGACCTGGAGGAG
GAGCTGCCGGGAGAAGAACTGTCCTAAGGGCCTGTAGTGACCGGCAGGGCTGGGGTGAGACCCCTCCCTTCCAGAATCCAGGCCCCATCT
CCCCCAACTCCACAGCCCCTCCCGAGCCTCAAGGCAAGTCCAGGACTCAGACCGGGGAGGCCCGGGCCAGCAGCTCCCAGTGTGACCTGA
CAAAAACACGTAGGGGCAGGGACGGTCCCCACCCCCAGGGACACAACCCCTGGTCTTGGACCAGTAGAGGACACGGAGGGTTCAGACCCC
TCCTCAGACCCTCCCCACATCTGAAACTGCCTCCCCCCAACCACCAGCAGCAGCAGGGCCCTCCTCCCCCACCAGCTCTCCCCACAGGGC
CCCTCAGCATCATGGAGACCCGCAGGCGGGGCTTAGCCACCCCTCAAACCCAGGGCCCCCTGGCACCTGGCTCTGGCCGTGTTTTCTGGC
CAGAGGCCCCCACTTTCCTAACTCGTGCTCCCTTCCGCCTTCTTTTCCGTACTGTGAAGAAAGAACTCTCCACCCCAGCTCCCACCCTGC
CCTGGCCTGGGTGGAGGAACTGTGCCTCCATCCCCAGAAGAAACAGCCCCCTCTGCTGCTGGGGTGGGACTGTCTGTGTGCCCTGTGGGG
GTCCGTGTGAGCAGGCCCACCTGGCTCCAGACCCGCCCCCAACCTGAGACAGAACCAGGCTGAGCCAGGCCTCCACCCCCACCCCCGTTT
GCTGGGGGCTCCTCCAGCCGCCCCCATGGGAAGAGGCCTGGTACCGCCTCACCCACAGAGGTCTGTGCCAGGTGCGCTTCTGCAGGTGGA
GCCAAGCTCTCCCTGAGGCCAGAGGCGGGGCCTGGGCCGGGAGCCCAGGGGAAGGCCAGGCTGGACCCCGGCTCCACACCCACATCCAGC
CTGCAGGCCTCTCTGCAGTCCTCTCACCCTCCCTCAGCTCCCCTTCCTCTGCAGTCACCCTCAGCTCCCCTTCCTTGCCCGCCTCTCCCC

>82458_82458_10_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000406686_FOXP4_chr6_41545724_ENST00000409208_length(amino acids)=818AA_BP=218
MAIDRRREAAGGGPGRQPAPAEENGSLPPGDAAASAPLGGRAGPGGGAEIQPLPPLHPGGGPHPSCCSAAAAPSLLLLDYDGSVLPFLGG
LGGGYQKTLVLLTWIPALFIGFSQFSDSFLLDQPNFWCRGAGKGTELAGVTTTGRGGDMGNWTSLPTTPFATAPWEAAGNRSNSSGADGG
DTPPLPSPPDKGDNASNCDCRAWDYGIRAGLVQNVVSKALQVARQFLLQQASGLSSPGNNDSKQSASAVQVPVSVAMMSPQMLTPQQMQQ
ILSPPQLQALLQQQQALMLQQLQEYYKKQQEQLHLQLLTQQQAGKPQPKEALGNKQLAFQQQLLQMQQLQQQHLLNLQRQGLVSLQPNQA
SGPLQTLPQAAVCPTDLPQLWKGEGAPGQPAEDSVKQEGLDLTGTAATATSFAAPPKVSPPLSHHTLPNGQPTVLTSRRDSSSHEETPGS
HPLYGHGECKWPGCETLCEDLGQFIKHLNTEHALDDRSTAQCRVQMQVVQQLEIQLAKESERLQAMMAHLHMRPSEPKPFSQPVTVSAAD
SFPDGLVHPPTSAAAPVTPLRPPGLGSASLHGGGPARRRSSDKFCSPISSELAQNHEFYKNADVRPPFTYASLIRQAILETPDRQLTLNE
IYNWFTRMFAYFRRNTATWKNAVRHNLSLHKCFVRVENVKGAVWTVDEREYQKRRPPKMTGSPTLVKNMISGLSYGALNASYQAALAESS
FPLLNSPGMLNPGSASSLLPLSHDDVGAPVEPLPSNGSSSPPRLSPPQYSHQVQVKEEPAEAEEDRQPGPPLGAPNPSASGPPEDRDLEE

--------------------------------------------------------------
>82458_82458_11_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000436008_FOXP4_chr6_41545724_ENST00000307972_length(transcript)=4646nt_BP=1117nt
GGAGGCGCCCGAAGTCGGAGAAATACAGCGCGGCCACCCCGGGAGCAGCAGCCTTTGCCCGCACCGCCGCCCGCAGGGCCCTGAACCGCC
GGCAAGGGCGCGGGCCGGGCGCGGCGCCGGCTCCTCCTCCTGCCGCGTCCGGCGCGGAGCAGCGGAGAGCGGCGGGCTCGGCAGCCGGCG
CCCGGGCCGCGGAACTGATGAGCCGCGGCGGCTGAGGGCCCGGCCGGCCTCTGCCCGCCTCTGCCCGGCGCCCGAGCGGGGCCGCAGCGC
CCGGGGCGCGGGGTCCGGCGGGTGACATGGGGGCGGCCTGACGCACCCGGAGCCGCCGAGCGCTCTCTCTCGAGCCGCGGGCCCTCCTCG
GCGGCAGCGGCTGCAGGCGCCTGGCCCGGCGCGCTGTGCCCGGGGCGCCCGCGGAGCCTCCGCCGCGCTCTATGCGCCTCTGCGGGAGCC
GCGGGCCCGGGCCATGGCCATAGACCGGCGGCGCGAGGCGGCGGGCGGCGGGCCTGGGCGGCAGCCGGCCCCGGCCGAGGAGAACGGCTC
CCTGCCGCCCGGGGACGCGGCGGCCTCGGCGCCCCTCGGGGGACGCGCGGGCCCCGGCGGCGGCGCGGAGATCCAGCCGCTGCCCCCACT
GCATCCTGGAGGCGGCCCGCACCCGAGCTGCTGCTCCGCGGCTGCGGCCCCGAGCCTCTTGTTGCTGGACTATGACGGGTCGGTGCTGCC
CTTCCTCGGGGGCCTGGGCGGGGGCTATCAGAAGACCCTCGTGCTGCTCACCTGGATCCCGGCGCTGTTCATCGGCTTCAGCCAGTTCTC
GGACTCGTTCCTCCTGGACCAGCCCAACTTCTGGTGCCGCGGGGCCGGCAAAGGCACCGAGCTGGCAGGGGTCACCACCACAGGCCGGGG
CGGGGACATGGGCAACTGGACCAGCCTCCCCACCACCCCCTTCGCCACTGCCCCCTGGGAGGCTGCGGGCAACCGGAGCAACAGCAGCGG
CGCGGACGGAGGCGACACACCACCCCTGCCATCCCCTCCGGACAAGGGGGACAACGCCTCCAACTGTGACTGCCGCGCATGGGACTACGG
CATCCGCGCCGGCCTCGTCCAGAACGTGGTCAGCAAGGCTCTCCAAGTGGCCCGGCAGTTCCTGCTGCAGCAGGCCTCAGGCCTGAGCTC
CCCAGGGAACAATGACAGCAAACAGTCTGCCTCTGCTGTGCAGGTGCCTGTGTCGGTGGCCATGATGTCGCCGCAGATGCTTACCCCGCA
ACAGATGCAGCAGATCCTGTCGCCCCCGCAGCTGCAGGCCTTGCTCCAGCAGCAGCAAGCCCTCATGCTCCAGCAGCTACAGGAGTACTA
CAAGAAGCAGCAGGAGCAGCTCCACCTGCAGCTCCTCACCCAGCAGCAGGCTGGGAAACCGCAGCCCAAAGAGGCACTGGGGAACAAGCA
GCTGGCCTTCCAGCAGCAGCTCCTGCAAATGCAACAGTTGCAGCAGCAGCACCTGCTCAACCTGCAGAGGCAGGGGCTGGTCAGCCTGCA
GCCCAACCAAGCCTCGGGGCCCCTCCAGACCCTTCCGCAAGCAGCTGTTTGCCCAACAGACCTGCCCCAGCTGTGGAAGGGCGAGGGTGC
CCCCGGGCAGCCTGCCGAGGACAGCGTCAAGCAGGAGGGGCTGGACCTCACTGGCACGGCCGCCACCGCTACCTCGTTTGCCGCTCCCCC
CAAGGTCTCACCCCCCCTCTCCCACCATACCCTGCCCAACGGACAGCCTACTGTGCTCACATCTCGGAGAGACAGCTCTTCCCACGAGGA
GACCCCCGGCTCCCACCCCCTGTACGGACACGGAGAGTGCAAGTGGCCAGGCTGTGAGACCCTGTGTGAAGACCTGGGCCAGTTTATCAA
ACACCTCAACACAGAGCACGCCCTGGATGACCGGAGTACAGCCCAGTGCCGGGTACAGATGCAGGTGGTGCAGCAGCTGGAGATCCAGCT
CGCCAAGGAGAGCGAGCGGCTGCAGGCCATGATGGCCCACCTGCACATGCGGCCCTCGGAGCCCAAGCCCTTCAGCCAGCCACTGAACCC
GGTCCCCGGCTCCTCCTCATTCTCCAAGGTGACCGTCTCTGCAGCAGACTCATTCCCAGATGGTCTCGTGCACCCCCCGACCTCGGCCGC
AGCCCCTGTCACCCCTCTACGGCCCCCTGGCCTGGGCTCTGCCTCCCTGCATGGTGGGGGCCCAGCCCGTCGGAGAAGCAGTGACAAGTT
CTGCTCCCCCATCTCCTCAGAGCTGGCCCAGAATCATGAGTTCTACAAGAACGCCGACGTCCGGCCCCCCTTCACCTACGCCTCCCTCAT
CCGCCAGGCCATCCTGGAAACCCCTGACAGGCAGCTGACCCTGAATGAGATCTATAACTGGTTCACCAGGATGTTCGCCTATTTCCGCAG
AAACACTGCCACCTGGAAGAACGCCGTGCGCCACAACCTCAGCCTGCACAAGTGCTTCGTCCGCGTGGAGAACGTCAAGGGTGCCGTGTG
GACTGTGGACGAGCGGGAGTATCAGAAGCGGAGACCGCCAAAGATGACAGGGAGCCCCACCCTGGTGAAGAACATGATCTCTGGCCTCAG
CTATGGAGCACTTAATGCCAGCTACCAGGCCGCCCTGGCCGAGAGCAGCTTCCCCCTCCTCAACAGCCCTGGCATGCTGAACCCTGGCTC
CGCCAGCAGCCTGCTGCCCCTCAGCCACGATGACGTGGGTGCCCCCGTGGAGCCGCTGCCCAGCAACGGCAGCAGCAGCCCTCCTCGCCT
CTCCCCGCCCCAGTACAGCCACCAGGTGCAGGTGAAGGAGGAGCCAGCAGAGGCAGAGGAAGACAGGCAGCCCGGGCCTCCCCTGGGCGC
CCCTAACCCCAGCGCCTCGGGGCCTCCGGAAGACAGGGACCTGGAGGAGGAGCTGCCGGGAGAAGAACTGTCCTAAGGGCCTGTAGTGAC
CGGCAGGGCTGGGGTGAGACCCCTCCCTTCCAGAATCCAGGCCCCATCTCCCCCAACTCCACAGCCCCTCCCGAGCCTCAAGGCAAGTCC
AGGACTCAGACCGGGGAGGCCCGGGCCAGCAGCTCCCAGTGTGACCTGACAAAAACACGTAGGGGCAGGGACGGTCCCCACCCCCAGGGA
CACAACCCCTGGTCTTGGACCAGTAGAGGACACGGAGGGTTCAGACCCCTCCTCAGACCCTCCCCACATCTGAAACTGCCTCCCCCCAAC
CACCAGCAGCAGCAGGGCCCTCCTCCCCCACCAGCTCTCCCCACAGGGCCCCTCAGCATCATGGAGACCCGCAGGCGGGGCTTAGCCACC
CCTCAAACCCAGGGCCCCCTGGCACCTGGCTCTGGCCGTGTTTTCTGGCCAGAGGCCCCCACTTTCCTAACTCGTGCTCCCTTCCGCCTT
CTTTTCCGTACTGTGAAGAAAGAACTCTCCACCCCAGCTCCCACCCTGCCCTGGCCTGGGTGGAGGAACTGTGCCTCCATCCCCAGAAGA
AACAGCCCCCTCTGCTGCTGGGGTGGGACTGTCTGTGTGCCCTGTGGGGGTCCGTGTGAGCAGGCCCACCTGGCTCCAGACCCGCCCCCA
ACCTGAGACAGAACCAGGCTGAGCCAGGCCTCCACCCCCACCCCCGTTTGCTGGGGGCTCCTCCAGCCGCCCCCATGGGAAGAGGCCTGG
TACCGCCTCACCCACAGAGGTCTGTGCCAGGTGCGCTTCTGCAGGTGGAGCCAAGCTCTCCCTGAGGCCAGAGGCGGGGCCTGGGCCGGG
AGCCCAGGGGAAGGCCAGGCTGGACCCCGGCTCCACACCCACATCCAGCCTGCAGGCCTCTCTGCAGTCCTCTCACCCTCCCTCAGCTCC
CCTTCCTCTGCAGTCACCCTCAGCTCCCCTTCCTTGCCCGCCTCTCCCCCCGCCGCCCCACCAGTTAAACGGATGACCAAAGACCTTTCT
TATGCCGGAAGCAAAAACCAAAACTTTTTGTTGGCTTTTTCCTTTGTCGCCTCCCCAGCACCTGCCCTCCCAGTCTCCCACCCCGGCCCC
AGGCTGGAAGCCCTCCCTCCACTTAAGTTATTGTTTTAAACCAAAGTTTACAGTGTCTGTTGGTGGCCAAGACCTTCTCTCTCCACCCCT
CCTCCATCCACCCTGAGGACCCTGGGGCTCAGTGGAGGCAGGGCCCTGCCCCCCTCCCTTCCGCTCCTGCCCAGCCTGGGGGAAGGAGAA
AGGAGGGGAGAAAGCGGGCTCTCACCCCCTCAGGAGTGGGCACGGGAGCCCTTCTCCCTGACCCTGGGCTGCTTCCTGGGGGCTCTCCAG
ACCCCTCTCTAGGACCAAGTCACCCGTCGTGCTGGGAGTGTGGATTCTAGCAAAAGAGCTGGAAAAAAGTCAGACTCTCCACAGACCCCC
TATGGGGGACCCCCAACTCAAGGCCAAGGACTGGGCGTATCGGATGCTCATAACACCCCTGGCCTGGCCCCTTTACTGAGAAGACTCCTT
GGATATTTCCCAAGAACCCCCCACATACACCCCTCACAAGCCACCCCTCCTGAGAGGCAGGGGGCCCTCCGCCCCCTCCCCATGTATTCC

>82458_82458_11_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000436008_FOXP4_chr6_41545724_ENST00000307972_length(amino acids)=882AA_BP=270
MTHPEPPSALSRAAGPPRRQRLQAPGPARCARGARGASAALYAPLREPRARAMAIDRRREAAGGGPGRQPAPAEENGSLPPGDAAASAPL
GGRAGPGGGAEIQPLPPLHPGGGPHPSCCSAAAAPSLLLLDYDGSVLPFLGGLGGGYQKTLVLLTWIPALFIGFSQFSDSFLLDQPNFWC
RGAGKGTELAGVTTTGRGGDMGNWTSLPTTPFATAPWEAAGNRSNSSGADGGDTPPLPSPPDKGDNASNCDCRAWDYGIRAGLVQNVVSK
ALQVARQFLLQQASGLSSPGNNDSKQSASAVQVPVSVAMMSPQMLTPQQMQQILSPPQLQALLQQQQALMLQQLQEYYKKQQEQLHLQLL
TQQQAGKPQPKEALGNKQLAFQQQLLQMQQLQQQHLLNLQRQGLVSLQPNQASGPLQTLPQAAVCPTDLPQLWKGEGAPGQPAEDSVKQE
GLDLTGTAATATSFAAPPKVSPPLSHHTLPNGQPTVLTSRRDSSSHEETPGSHPLYGHGECKWPGCETLCEDLGQFIKHLNTEHALDDRS
TAQCRVQMQVVQQLEIQLAKESERLQAMMAHLHMRPSEPKPFSQPLNPVPGSSSFSKVTVSAADSFPDGLVHPPTSAAAPVTPLRPPGLG
SASLHGGGPARRRSSDKFCSPISSELAQNHEFYKNADVRPPFTYASLIRQAILETPDRQLTLNEIYNWFTRMFAYFRRNTATWKNAVRHN
LSLHKCFVRVENVKGAVWTVDEREYQKRRPPKMTGSPTLVKNMISGLSYGALNASYQAALAESSFPLLNSPGMLNPGSASSLLPLSHDDV

--------------------------------------------------------------
>82458_82458_12_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000436008_FOXP4_chr6_41545724_ENST00000373057_length(transcript)=3977nt_BP=1117nt
GGAGGCGCCCGAAGTCGGAGAAATACAGCGCGGCCACCCCGGGAGCAGCAGCCTTTGCCCGCACCGCCGCCCGCAGGGCCCTGAACCGCC
GGCAAGGGCGCGGGCCGGGCGCGGCGCCGGCTCCTCCTCCTGCCGCGTCCGGCGCGGAGCAGCGGAGAGCGGCGGGCTCGGCAGCCGGCG
CCCGGGCCGCGGAACTGATGAGCCGCGGCGGCTGAGGGCCCGGCCGGCCTCTGCCCGCCTCTGCCCGGCGCCCGAGCGGGGCCGCAGCGC
CCGGGGCGCGGGGTCCGGCGGGTGACATGGGGGCGGCCTGACGCACCCGGAGCCGCCGAGCGCTCTCTCTCGAGCCGCGGGCCCTCCTCG
GCGGCAGCGGCTGCAGGCGCCTGGCCCGGCGCGCTGTGCCCGGGGCGCCCGCGGAGCCTCCGCCGCGCTCTATGCGCCTCTGCGGGAGCC
GCGGGCCCGGGCCATGGCCATAGACCGGCGGCGCGAGGCGGCGGGCGGCGGGCCTGGGCGGCAGCCGGCCCCGGCCGAGGAGAACGGCTC
CCTGCCGCCCGGGGACGCGGCGGCCTCGGCGCCCCTCGGGGGACGCGCGGGCCCCGGCGGCGGCGCGGAGATCCAGCCGCTGCCCCCACT
GCATCCTGGAGGCGGCCCGCACCCGAGCTGCTGCTCCGCGGCTGCGGCCCCGAGCCTCTTGTTGCTGGACTATGACGGGTCGGTGCTGCC
CTTCCTCGGGGGCCTGGGCGGGGGCTATCAGAAGACCCTCGTGCTGCTCACCTGGATCCCGGCGCTGTTCATCGGCTTCAGCCAGTTCTC
GGACTCGTTCCTCCTGGACCAGCCCAACTTCTGGTGCCGCGGGGCCGGCAAAGGCACCGAGCTGGCAGGGGTCACCACCACAGGCCGGGG
CGGGGACATGGGCAACTGGACCAGCCTCCCCACCACCCCCTTCGCCACTGCCCCCTGGGAGGCTGCGGGCAACCGGAGCAACAGCAGCGG
CGCGGACGGAGGCGACACACCACCCCTGCCATCCCCTCCGGACAAGGGGGACAACGCCTCCAACTGTGACTGCCGCGCATGGGACTACGG
CATCCGCGCCGGCCTCGTCCAGAACGTGGTCAGCAAGGCTCTCCAAGTGGCCCGGCAGTTCCTGCTGCAGCAGGCCTCAGGCCTGAGCTC
CCCAGGGAACAATGACAGCAAACAGTCTGCCTCTGCTGTGCAGGTGCCTGTGTCGGTGGCCATGATGTCGCCGCAGATGCTTACCCCGCA
ACAGATGCAGCAGATCCTGTCGCCCCCGCAGCTGCAGGCCTTGCTCCAGCAGCAGCAAGCCCTCATGCTCCAGCAGGAGTACTACAAGAA
GCAGCAGGAGCAGCTCCACCTGCAGCTCCTCACCCAGCAGCAGGCTGGGAAACCGCAGCCCAAAGAGGCACTGGGGAACAAGCAGCTGGC
CTTCCAGCAGCAGCTCCTGCAAATGCAACAGTTGCAGCAGCAGCACCTGCTCAACCTGCAGAGGCAGGGGCTGGTCAGCCTGCAGCCCAA
CCAAGCCTCGGGGCCCCTCCAGACCCTTCCGCAAGCAGCTGTTTGCCCAACAGACCTGCCCCAGCTGTGGAAGGGCGAGGGTGCCCCCGG
GCAGCCTGCCGAGGACAGCGTCAAGCAGGAGGGGCTGGACCTCACTGGCACGGCCGCCACCGCTACCTCGTTTGCCGCTCCCCCCAAGGT
CTCACCCCCCCTCTCCCACCATACCCTGCCCAACGGACAGCCTACTGTGCTCACATCTCGGAGAGACAGCTCTTCCCACGAGGAGACCCC
CGGCTCCCACCCCCTGTACGGACACGGAGAGTGCAAGTGGCCAGGCTGTGAGACCCTGTGTGAAGACCTGGGCCAGTTTATCAAACACCT
CAACACAGAGCACGCCCTGGATGACCGGAGTACAGCCCAGTGCCGGGTACAGATGCAGGTGGTGCAGCAGCTGGAGATCCAGCTCGCCAA
GGAGAGCGAGCGGCTGCAGGCCATGATGGCCCACCTGCACATGCGGCCCTCGGAGCCCAAGCCCTTCAGCCAGCCACTGAACCCGGTCCC
CGGCTCCTCCTCATTCTCCAAGGTGACCGTCTCTGCAGCAGACTCATTCCCAGATGGTCTCGTGCACCCCCCGACCTCGGCCGCAGCCCC
TGTCACCCCTCTACGGCCCCCTGGCCTGGGCTCTGCCTCCCTGCATGGTGGGGGCCCAGCCCGTCGGAGAAGCAGTGACAAGTTCTGCTC
CCCCATCTCCTCAGAGCTGGCCCAGAATCATGAGTTCTACAAGAACGCCGACGTCCGGCCCCCCTTCACCTACGCCTCCCTCATCCGCCA
GGCCATCCTGGAAACCCCTGACAGGCAGCTGACCCTGAATGAGATCTATAACTGGTTCACCAGGATGTTCGCCTATTTCCGCAGAAACAC
TGCCACCTGGAAGAACGCCGTGCGCCACAACCTCAGCCTGCACAAGTGCTTCGTCCGCGTGGAGAACGTCAAGGGTGCCGTGTGGACTGT
GGACGAGCGGGAGTATCAGAAGCGGAGACCGCCAAAGATGACAGGGAGCCCCACCCTGGTGAAGAACATGATCTCTGGCCTCAGCTATGG
AGCACTTAATGCCAGCTACCAGGCCGCCCTGGCCGAGAGCAGCTTCCCCCTCCTCAACAGCCCTGGCATGCTGAACCCTGGCTCCGCCAG
CAGCCTGCTGCCCCTCAGCCACGATGACGTGGGTGCCCCCGTGGAGCCGCTGCCCAGCAACGGCAGCAGCAGCCCTCCTCGCCTCTCCCC
GCCCCAGTACAGCCACCAGGTGCAGGTGAAGGAGGAGCCAGCAGAGGCAGAGGAAGACAGGCAGCCCGGGCCTCCCCTGGGCGCCCCTAA
CCCCAGCGCCTCGGGGCCTCCGGAAGACAGGGACCTGGAGGAGGAGCTGCCGGGAGAAGAACTGTCCTAAGGGCCTGTAGTGACCGGCAG
GGCTGGGGTGAGACCCCTCCCTTCCAGAATCCAGGCCCCATCTCCCCCAACTCCACAGCCCCTCCCGAGCCTCAAGGCAAGTCCAGGACT
CAGACCGGGGAGGCCCGGGCCAGCAGCTCCCAGTGTGACCTGACAAAAACACGTAGGGGCAGGGACGGTCCCCACCCCCAGGGACACAAC
CCCTGGTCTTGGACCAGTAGAGGACACGGAGGGTTCAGACCCCTCCTCAGACCCTCCCCACATCTGAAACTGCCTCCCCCCAACCACCAG
CAGCAGCAGGGCCCTCCTCCCCCACCAGCTCTCCCCACAGGGCCCCTCAGCATCATGGAGACCCGCAGGCGGGGCTTAGCCACCCCTCAA
ACCCAGGGCCCCCTGGCACCTGGCTCTGGCCGTGTTTTCTGGCCAGAGGCCCCCACTTTCCTAACTCGTGCTCCCTTCCGCCTTCTTTTC
CGTACTGTGAAGAAAGAACTCTCCACCCCAGCTCCCACCCTGCCCTGGCCTGGGTGGAGGAACTGTGCCTCCATCCCCAGAAGAAACAGC
CCCCTCTGCTGCTGGGGTGGGACTGTCTGTGTGCCCTGTGGGGGTCCGTGTGAGCAGGCCCACCTGGCTCCAGACCCGCCCCCAACCTGA
GACAGAACCAGGCTGAGCCAGGCCTCCACCCCCACCCCCGTTTGCTGGGGGCTCCTCCAGCCGCCCCCATGGGAAGAGGCCTGGTACCGC
CTCACCCACAGAGGTCTGTGCCAGGTGCGCTTCTGCAGGTGGAGCCAAGCTCTCCCTGAGGCCAGAGGCGGGGCCTGGGCCGGGAGCCCA
GGGGAAGGCCAGGCTGGACCCCGGCTCCACACCCACATCCAGCCTGCAGGCCTCTCTGCAGTCCTCTCACCCTCCCTCAGCTCCCCTTCC
TCTGCAGTCACCCTCAGCTCCCCTTCCTTGCCCGCCTCTCCCCCCGCCGCCCCACCAGTTAAACGGATGACCAAAGACCTTTCTTATGCC

>82458_82458_12_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000436008_FOXP4_chr6_41545724_ENST00000373057_length(amino acids)=880AA_BP=270
MTHPEPPSALSRAAGPPRRQRLQAPGPARCARGARGASAALYAPLREPRARAMAIDRRREAAGGGPGRQPAPAEENGSLPPGDAAASAPL
GGRAGPGGGAEIQPLPPLHPGGGPHPSCCSAAAAPSLLLLDYDGSVLPFLGGLGGGYQKTLVLLTWIPALFIGFSQFSDSFLLDQPNFWC
RGAGKGTELAGVTTTGRGGDMGNWTSLPTTPFATAPWEAAGNRSNSSGADGGDTPPLPSPPDKGDNASNCDCRAWDYGIRAGLVQNVVSK
ALQVARQFLLQQASGLSSPGNNDSKQSASAVQVPVSVAMMSPQMLTPQQMQQILSPPQLQALLQQQQALMLQQEYYKKQQEQLHLQLLTQ
QQAGKPQPKEALGNKQLAFQQQLLQMQQLQQQHLLNLQRQGLVSLQPNQASGPLQTLPQAAVCPTDLPQLWKGEGAPGQPAEDSVKQEGL
DLTGTAATATSFAAPPKVSPPLSHHTLPNGQPTVLTSRRDSSSHEETPGSHPLYGHGECKWPGCETLCEDLGQFIKHLNTEHALDDRSTA
QCRVQMQVVQQLEIQLAKESERLQAMMAHLHMRPSEPKPFSQPLNPVPGSSSFSKVTVSAADSFPDGLVHPPTSAAAPVTPLRPPGLGSA
SLHGGGPARRRSSDKFCSPISSELAQNHEFYKNADVRPPFTYASLIRQAILETPDRQLTLNEIYNWFTRMFAYFRRNTATWKNAVRHNLS
LHKCFVRVENVKGAVWTVDEREYQKRRPPKMTGSPTLVKNMISGLSYGALNASYQAALAESSFPLLNSPGMLNPGSASSLLPLSHDDVGA

--------------------------------------------------------------
>82458_82458_13_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000436008_FOXP4_chr6_41545724_ENST00000373060_length(transcript)=6403nt_BP=1117nt
GGAGGCGCCCGAAGTCGGAGAAATACAGCGCGGCCACCCCGGGAGCAGCAGCCTTTGCCCGCACCGCCGCCCGCAGGGCCCTGAACCGCC
GGCAAGGGCGCGGGCCGGGCGCGGCGCCGGCTCCTCCTCCTGCCGCGTCCGGCGCGGAGCAGCGGAGAGCGGCGGGCTCGGCAGCCGGCG
CCCGGGCCGCGGAACTGATGAGCCGCGGCGGCTGAGGGCCCGGCCGGCCTCTGCCCGCCTCTGCCCGGCGCCCGAGCGGGGCCGCAGCGC
CCGGGGCGCGGGGTCCGGCGGGTGACATGGGGGCGGCCTGACGCACCCGGAGCCGCCGAGCGCTCTCTCTCGAGCCGCGGGCCCTCCTCG
GCGGCAGCGGCTGCAGGCGCCTGGCCCGGCGCGCTGTGCCCGGGGCGCCCGCGGAGCCTCCGCCGCGCTCTATGCGCCTCTGCGGGAGCC
GCGGGCCCGGGCCATGGCCATAGACCGGCGGCGCGAGGCGGCGGGCGGCGGGCCTGGGCGGCAGCCGGCCCCGGCCGAGGAGAACGGCTC
CCTGCCGCCCGGGGACGCGGCGGCCTCGGCGCCCCTCGGGGGACGCGCGGGCCCCGGCGGCGGCGCGGAGATCCAGCCGCTGCCCCCACT
GCATCCTGGAGGCGGCCCGCACCCGAGCTGCTGCTCCGCGGCTGCGGCCCCGAGCCTCTTGTTGCTGGACTATGACGGGTCGGTGCTGCC
CTTCCTCGGGGGCCTGGGCGGGGGCTATCAGAAGACCCTCGTGCTGCTCACCTGGATCCCGGCGCTGTTCATCGGCTTCAGCCAGTTCTC
GGACTCGTTCCTCCTGGACCAGCCCAACTTCTGGTGCCGCGGGGCCGGCAAAGGCACCGAGCTGGCAGGGGTCACCACCACAGGCCGGGG
CGGGGACATGGGCAACTGGACCAGCCTCCCCACCACCCCCTTCGCCACTGCCCCCTGGGAGGCTGCGGGCAACCGGAGCAACAGCAGCGG
CGCGGACGGAGGCGACACACCACCCCTGCCATCCCCTCCGGACAAGGGGGACAACGCCTCCAACTGTGACTGCCGCGCATGGGACTACGG
CATCCGCGCCGGCCTCGTCCAGAACGTGGTCAGCAAGGCTCTCCAAGTGGCCCGGCAGTTCCTGCTGCAGCAGGCCTCAGGCCTGAGCTC
CCCAGGGAACAATGACAGCAAACAGTCTGCCTCTGCTGTGCAGGTGCCTGTGTCGGTGGCCATGATGTCGCCGCAGATGCTTACCCCGCA
ACAGATGCAGCAGATCCTGTCGCCCCCGCAGCTGCAGGCCTTGCTCCAGCAGCAGCAAGCCCTCATGCTCCAGCAGCTACAGGAGTACTA
CAAGAAGCAGCAGGAGCAGCTCCACCTGCAGCTCCTCACCCAGCAGCAGGCTGGGAAACCGCAGCCCAAAGAGGCACTGGGGAACAAGCA
GCTGGCCTTCCAGCAGCAGCTCCTGCAAATGCAACAGTTGCAGCAGCAGCACCTGCTCAACCTGCAGAGGCAGGGGCTGGTCAGCCTGCA
GCCCAACCAAGCCTCGGGGCCCCTCCAGACCCTTCCGCAAGCAGCTGTTTGCCCAACAGACCTGCCCCAGCTGTGGAAGGGCGAGGGTGC
CCCCGGGCAGCCTGCCGAGGACAGCGTCAAGCAGGAGGGGCTGGACCTCACTGGCACGGCCGCCACCGCTACCTCGTTTGCCGCTCCCCC
CAAGGTCTCACCCCCCCTCTCCCACCATACCCTGCCCAACGGACAGCCTACTGTGCTCACATCTCGGAGAGACAGCTCTTCCCACGAGGA
GACCCCCGGCTCCCACCCCCTGTACGGACACGGAGAGTGCAAGTGGCCAGGCTGTGAGACCCTGTGTGAAGACCTGGGCCAGTTTATCAA
ACACCTCAACACAGAGCACGCCCTGGATGACCGGAGTACAGCCCAGTGCCGGGTACAGATGCAGGTGGTGCAGCAGCTGGAGATCCAGCT
CGCCAAGGAGAGCGAGCGGCTGCAGGCCATGATGGCCCACCTGCACATGCGGCCCTCGGAGCCCAAGCCCTTCAGCCAGCCACTGAACCC
GGTCCCCGGCTCCTCCTCATTCTCCAAGGTGACCGTCTCTGCAGCAGACTCATTCCCAGATGGTCTCGTGCACCCCCCGACCTCGGCCGC
AGCCCCTGTCACCCCTCTACGGCCCCCTGGCCTGGGCTCTGCCTCCCTGCATGGTGGGGGCCCAGCCCGTCGGAGAAGCAGTGACAAGTT
CTGCTCCCCCATCTCCTCAGAGCTGGCCCAGAATCATGAGTTCTACAAGAACGCCGACGTCCGGCCCCCCTTCACCTACGCCTCCCTCAT
CCGCCAGGCCATCCTGGAAACCCCTGACAGGCAGCTGACCCTGAATGAGATCTATAACTGGTTCACCAGGATGTTCGCCTATTTCCGCAG
AAACACTGCCACCTGGAAGAACGCCGTGCGCCACAACCTCAGCCTGCACAAGTGCTTCGTCCGCGTGGAGAACGTCAAGGGTGCCGTGTG
GACTGTGGACGAGCGGGAGTATCAGAAGCGGAGACCGCCAAAGATGACAGGGAGCCCCACCCTGGTGAAGAACATGATCTCTGGCCTCAG
CTATGGAGCACTTAATGCCAGCTACCAGGCCGCCCTGGCCGAGAGCAGCTTCCCCCTCCTCAACAGCCCTGGCATGCTGAACCCTGGCTC
CGCCAGCAGCCTGCTGCCCCTCAGCCACGATGACGTGGGTGCCCCCGTGGAGCCGCTGCCCAGCAACGGCAGCAGCAGCCCTCCTCGCCT
CTCCCCGCCCCAGTACAGCCACCAGGTGCAGGTGAAGGAGGAGCCAGCAGAGGCAGAGGAAGACAGGCAGCCCGGGCCTCCCCTGGGCGC
CCCTAACCCCAGCGCCTCGGGGCCTCCGGAAGACAGGGACCTGGAGGAGGAGCTGCCGGGAGAAGAACTGTCCTAAGGGCCTGTAGTGAC
CGGCAGGGCTGGGGTGAGACCCCTCCCTTCCAGAATCCAGGCCCCATCTCCCCCAACTCCACAGCCCCTCCCGAGCCTCAAGGCAAGTCC
AGGACTCAGACCGGGGAGGCCCGGGCCAGCAGCTCCCAGTGTGACCTGACAAAAACACGTAGGGGCAGGGACGGTCCCCACCCCCAGGGA
CACAACCCCTGGTCTTGGACCAGTAGAGGACACGGAGGGTTCAGACCCCTCCTCAGACCCTCCCCACATCTGAAACTGCCTCCCCCCAAC
CACCAGCAGCAGCAGGGCCCTCCTCCCCCACCAGCTCTCCCCACAGGGCCCCTCAGCATCATGGAGACCCGCAGGCGGGGCTTAGCCACC
CCTCAAACCCAGGGCCCCCTGGCACCTGGCTCTGGCCGTGTTTTCTGGCCAGAGGCCCCCACTTTCCTAACTCGTGCTCCCTTCCGCCTT
CTTTTCCGTACTGTGAAGAAAGAACTCTCCACCCCAGCTCCCACCCTGCCCTGGCCTGGGTGGAGGAACTGTGCCTCCATCCCCAGAAGA
AACAGCCCCCTCTGCTGCTGGGGTGGGACTGTCTGTGTGCCCTGTGGGGGTCCGTGTGAGCAGGCCCACCTGGCTCCAGACCCGCCCCCA
ACCTGAGACAGAACCAGGCTGAGCCAGGCCTCCACCCCCACCCCCGTTTGCTGGGGGCTCCTCCAGCCGCCCCCATGGGAAGAGGCCTGG
TACCGCCTCACCCACAGAGGTCTGTGCCAGGTGCGCTTCTGCAGGTGGAGCCAAGCTCTCCCTGAGGCCAGAGGCGGGGCCTGGGCCGGG
AGCCCAGGGGAAGGCCAGGCTGGACCCCGGCTCCACACCCACATCCAGCCTGCAGGCCTCTCTGCAGTCCTCTCACCCTCCCTCAGCTCC
CCTTCCTCTGCAGTCACCCTCAGCTCCCCTTCCTTGCCCGCCTCTCCCCCCGCCGCCCCACCAGTTAAACGGATGACCAAAGACCTTTCT
TATGCCGGAAGCAAAAACCAAAACTTTTTGTTGGCTTTTTCCTTTGTCGCCTCCCCAGCACCTGCCCTCCCAGTCTCCCACCCCGGCCCC
AGGCTGGAAGCCCTCCCTCCACTTAAGTTATTGTTTTAAACCAAAGTTTACAGTGTCTGTTGGTGGCCAAGACCTTCTCTCTCCACCCCT
CCTCCATCCACCCTGAGGACCCTGGGGCTCAGTGGAGGCAGGGCCCTGCCCCCCTCCCTTCCGCTCCTGCCCAGCCTGGGGGAAGGAGAA
AGGAGGGGAGAAAGCGGGCTCTCACCCCCTCAGGAGTGGGCACGGGAGCCCTTCTCCCTGACCCTGGGCTGCTTCCTGGGGGCTCTCCAG
ACCCCTCTCTAGGACCAAGTCACCCGTCGTGCTGGGAGTGTGGATTCTAGCAAAAGAGCTGGAAAAAAGTCAGACTCTCCACAGACCCCC
TATGGGGGACCCCCAACTCAAGGCCAAGGACTGGGCGTATCGGATGCTCATAACACCCCTGGCCTGGCCCCTTTACTGAGAAGACTCCTT
GGATATTTCCCAAGAACCCCCCACATACACCCCTCACAAGCCACCCCTCCTGAGAGGCAGGGGGCCCTCCGCCCCCTCCCCATGTATTCC
CCACCTGTGTTCCGTTTGACCAGCACAGAAATATTAAACGTCCTCTATTCACCGGGCCCTGTGTGTGTCACCGAGGTGCGGGAGGGGAGG
AGCATTAAAGCTGAAAGATCGCTCTGCTCGGGGAGCCTGGGCAGAGCAGCAGCAACGTGAGGGTCGCTGTGGTGGTGGTTTCTGTGAGTG
GATGGAATGAGCAGCCCTGCAGGGGCGCTGGGCATGTGCCCTCACTGTGGACAGGGCCCACCCACCTCCGGTTCCCCTGTGCCTCCTGTC
CCTTCCACGCTTAAACAGGGTTCTCTGTCATTTTCCTGTTTTCTTCCAGAGTCCCAATCCTTTGCCCTAGTTCTTTCACTAGTTTGAAAT
CCAAGTTCTTGCCAGAGTGTTGGAGCAAGGCAGCTGATTTGCTGCAGGGATGGAGAGGACCACCGCCGCAGGGTTCTTTTACCTGTGCCA
CCAGCTCTGGAGACATCCACACCCATTCCCAGGTGTCCTTCCAGGCCCATTGTCCACGTCTTCAAGGGGGCTGCCTGGATAGGCGTGTGT
GTGTGTAGTGCCCAGGTGTGGTGGTGGCATCCTGAAGCAGTAGGACCATTAGTGTGCGTGCACACCCACGTGGCACACTGTGTGGTGACC
ATGGTCATCATAGTGGGCTCACGGCGGAAACGGGATCATTCAACCTATACAAAGGGGACCCTGGATAGGCTGCAAGGAAAGAGGCCAAGG
GCCAAGCTGCTAAGCCAAAGATGGCCCCTGACACCTCCCCCAGCCCCAGCACACCAGCCTGCACCTCTTATCCCGTGTTAAGTCCTGGTT
CCCCACCTGCTGCCCCTCCTCAACACAGAGCCCCTCCCGCCGCCTCAGACCCCTGTGCACATCCCCAGGGCCTCAGCCGTCTCATTGGTC
TTTATTTTTTATTTTTTTTAAGATGGAGTTTCGCTCTTATTGCCCAGACTGGAGTGCAGTGATGCTATCTCGGCTCACTGCAACCTTTGC
CTCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCGTGCACCACCACGCCGGGCTAAATTTTTTTGTA
TTTTTAGTAGAGACGGGGTTTCTCTATGTTGGTCAGGCTGATCTCGAACTCCCGACCTCAGGTGATCCGCCAGCCTCAGCCTCCCAAAGT
GCTGGGATTACAGGCGTGAGCCACTGCACCCGGCTCTCACTGGTCTTACGCCACCTTCTGGACACTCCCTCCTTGAGGGCAGAAAGGAGT
CCCAGGCCTGTCCCTAGGGACAAGGCCCAGGGAAGAGTGTATTTGGGGAGCAGGGGAGGGGAGGGTGTTGAGAAAGCTGAACTGGAGTCA
ATCACCCTTCCCACAAATCACCAAACTGCTGGAACTCTCCAGCCAAATGCTGGGAGAAGGACCTGGAGGGTGAGTCTTTGCTGACCTCTC
TCTACTCTCAGGCATGTCTTTTGTCCTTTTCGTCCATCTATTTCTGTCTGTCGCTCACTCGCCCCGCTTTCTCTGTCTCACCTTCATCCA
CTCTGCAGGCCTGCTCCACCACAGCCCTAATCCTCTGGACGCTTGTGTAGGGCCTGGGGTGAATTCCCTGTCCCCCATGGTACCTCGAGA
GGGGCTGGGGAGCTCAGCTTGGTCTCAGAGTCTCCCCACCAGATACTGTTTAAAAAAGTAGCACTGATGTGTTTTGTAATCTGCCCCTCC
CAGCCCTCCGTGGAGGCTGCCAGGGCCTTGTACGGTAAACCTAGCTGCATGTAATCTGTGGACAATGGCATTCTCTACAATGCAATAAAA

>82458_82458_13_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000436008_FOXP4_chr6_41545724_ENST00000373060_length(amino acids)=882AA_BP=270
MTHPEPPSALSRAAGPPRRQRLQAPGPARCARGARGASAALYAPLREPRARAMAIDRRREAAGGGPGRQPAPAEENGSLPPGDAAASAPL
GGRAGPGGGAEIQPLPPLHPGGGPHPSCCSAAAAPSLLLLDYDGSVLPFLGGLGGGYQKTLVLLTWIPALFIGFSQFSDSFLLDQPNFWC
RGAGKGTELAGVTTTGRGGDMGNWTSLPTTPFATAPWEAAGNRSNSSGADGGDTPPLPSPPDKGDNASNCDCRAWDYGIRAGLVQNVVSK
ALQVARQFLLQQASGLSSPGNNDSKQSASAVQVPVSVAMMSPQMLTPQQMQQILSPPQLQALLQQQQALMLQQLQEYYKKQQEQLHLQLL
TQQQAGKPQPKEALGNKQLAFQQQLLQMQQLQQQHLLNLQRQGLVSLQPNQASGPLQTLPQAAVCPTDLPQLWKGEGAPGQPAEDSVKQE
GLDLTGTAATATSFAAPPKVSPPLSHHTLPNGQPTVLTSRRDSSSHEETPGSHPLYGHGECKWPGCETLCEDLGQFIKHLNTEHALDDRS
TAQCRVQMQVVQQLEIQLAKESERLQAMMAHLHMRPSEPKPFSQPLNPVPGSSSFSKVTVSAADSFPDGLVHPPTSAAAPVTPLRPPGLG
SASLHGGGPARRRSSDKFCSPISSELAQNHEFYKNADVRPPFTYASLIRQAILETPDRQLTLNEIYNWFTRMFAYFRRNTATWKNAVRHN
LSLHKCFVRVENVKGAVWTVDEREYQKRRPPKMTGSPTLVKNMISGLSYGALNASYQAALAESSFPLLNSPGMLNPGSASSLLPLSHDDV

--------------------------------------------------------------
>82458_82458_14_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000436008_FOXP4_chr6_41545724_ENST00000373063_length(transcript)=6365nt_BP=1117nt
GGAGGCGCCCGAAGTCGGAGAAATACAGCGCGGCCACCCCGGGAGCAGCAGCCTTTGCCCGCACCGCCGCCCGCAGGGCCCTGAACCGCC
GGCAAGGGCGCGGGCCGGGCGCGGCGCCGGCTCCTCCTCCTGCCGCGTCCGGCGCGGAGCAGCGGAGAGCGGCGGGCTCGGCAGCCGGCG
CCCGGGCCGCGGAACTGATGAGCCGCGGCGGCTGAGGGCCCGGCCGGCCTCTGCCCGCCTCTGCCCGGCGCCCGAGCGGGGCCGCAGCGC
CCGGGGCGCGGGGTCCGGCGGGTGACATGGGGGCGGCCTGACGCACCCGGAGCCGCCGAGCGCTCTCTCTCGAGCCGCGGGCCCTCCTCG
GCGGCAGCGGCTGCAGGCGCCTGGCCCGGCGCGCTGTGCCCGGGGCGCCCGCGGAGCCTCCGCCGCGCTCTATGCGCCTCTGCGGGAGCC
GCGGGCCCGGGCCATGGCCATAGACCGGCGGCGCGAGGCGGCGGGCGGCGGGCCTGGGCGGCAGCCGGCCCCGGCCGAGGAGAACGGCTC
CCTGCCGCCCGGGGACGCGGCGGCCTCGGCGCCCCTCGGGGGACGCGCGGGCCCCGGCGGCGGCGCGGAGATCCAGCCGCTGCCCCCACT
GCATCCTGGAGGCGGCCCGCACCCGAGCTGCTGCTCCGCGGCTGCGGCCCCGAGCCTCTTGTTGCTGGACTATGACGGGTCGGTGCTGCC
CTTCCTCGGGGGCCTGGGCGGGGGCTATCAGAAGACCCTCGTGCTGCTCACCTGGATCCCGGCGCTGTTCATCGGCTTCAGCCAGTTCTC
GGACTCGTTCCTCCTGGACCAGCCCAACTTCTGGTGCCGCGGGGCCGGCAAAGGCACCGAGCTGGCAGGGGTCACCACCACAGGCCGGGG
CGGGGACATGGGCAACTGGACCAGCCTCCCCACCACCCCCTTCGCCACTGCCCCCTGGGAGGCTGCGGGCAACCGGAGCAACAGCAGCGG
CGCGGACGGAGGCGACACACCACCCCTGCCATCCCCTCCGGACAAGGGGGACAACGCCTCCAACTGTGACTGCCGCGCATGGGACTACGG
CATCCGCGCCGGCCTCGTCCAGAACGTGGTCAGCAAGGCTCTCCAAGTGGCCCGGCAGTTCCTGCTGCAGCAGGCCTCAGGCCTGAGCTC
CCCAGGGAACAATGACAGCAAACAGTCTGCCTCTGCTGTGCAGGTGCCTGTGTCGGTGGCCATGATGTCGCCGCAGATGCTTACCCCGCA
ACAGATGCAGCAGATCCTGTCGCCCCCGCAGCTGCAGGCCTTGCTCCAGCAGCAGCAAGCCCTCATGCTCCAGCAGCTACAGGAGTACTA
CAAGAAGCAGCAGGAGCAGCTCCACCTGCAGCTCCTCACCCAGCAGCAGGCTGGGAAACCGCAGCCCAAAGAGGCACTGGGGAACAAGCA
GCTGGCCTTCCAGCAGCAGCTCCTGCAAATGCAACAGTTGCAGCAGCAGCACCTGCTCAACCTGCAGAGGCAGGGGCTGGTCAGCCTGCA
GCCCAACCAAGCCTCGGGGCCCCTCCAGACCCTTCCGCAAGCTGTTTGCCCAACAGACCTGCCCCAGCTGTGGAAGGGCGAGGGTGCCCC
CGGGCAGCCTGCCGAGGACAGCGTCAAGCAGGAGGGGCTGGACCTCACTGGCACGGCCGCCACCGCTACCTCGTTTGCCGCTCCCCCCAA
GGTCTCACCCCCCCTCTCCCACCATACCCTGCCCAACGGACAGCCTACTGTGCTCACATCTCGGAGAGACAGCTCTTCCCACGAGGAGAC
CCCCGGCTCCCACCCCCTGTACGGACACGGAGAGTGCAAGTGGCCAGGCTGTGAGACCCTGTGTGAAGACCTGGGCCAGTTTATCAAACA
CCTCAACACAGAGCACGCCCTGGATGACCGGAGTACAGCCCAGTGCCGGGTACAGATGCAGGTGGTGCAGCAGCTGGAGATCCAGCTCGC
CAAGGAGAGCGAGCGGCTGCAGGCCATGATGGCCCACCTGCACATGCGGCCCTCGGAGCCCAAGCCCTTCAGCCAGCCAGTGACCGTCTC
TGCAGCAGACTCATTCCCAGATGGTCTCGTGCACCCCCCGACCTCGGCCGCAGCCCCTGTCACCCCTCTACGGCCCCCTGGCCTGGGCTC
TGCCTCCCTGCATGGTGGGGGCCCAGCCCGTCGGAGAAGCAGTGACAAGTTCTGCTCCCCCATCTCCTCAGAGCTGGCCCAGAATCATGA
GTTCTACAAGAACGCCGACGTCCGGCCCCCCTTCACCTACGCCTCCCTCATCCGCCAGGCCATCCTGGAAACCCCTGACAGGCAGCTGAC
CCTGAATGAGATCTATAACTGGTTCACCAGGATGTTCGCCTATTTCCGCAGAAACACTGCCACCTGGAAGAACGCCGTGCGCCACAACCT
CAGCCTGCACAAGTGCTTCGTCCGCGTGGAGAACGTCAAGGGTGCCGTGTGGACTGTGGACGAGCGGGAGTATCAGAAGCGGAGACCGCC
AAAGATGACAGGGAGCCCCACCCTGGTGAAGAACATGATCTCTGGCCTCAGCTATGGAGCACTTAATGCCAGCTACCAGGCCGCCCTGGC
CGAGAGCAGCTTCCCCCTCCTCAACAGCCCTGGCATGCTGAACCCTGGCTCCGCCAGCAGCCTGCTGCCCCTCAGCCACGATGACGTGGG
TGCCCCCGTGGAGCCGCTGCCCAGCAACGGCAGCAGCAGCCCTCCTCGCCTCTCCCCGCCCCAGTACAGCCACCAGGTGCAGGTGAAGGA
GGAGCCAGCAGAGGCAGAGGAAGACAGGCAGCCCGGGCCTCCCCTGGGCGCCCCTAACCCCAGCGCCTCGGGGCCTCCGGAAGACAGGGA
CCTGGAGGAGGAGCTGCCGGGAGAAGAACTGTCCTAAGGGCCTGTAGTGACCGGCAGGGCTGGGGTGAGACCCCTCCCTTCCAGAATCCA
GGCCCCATCTCCCCCAACTCCACAGCCCCTCCCGAGCCTCAAGGCAAGTCCAGGACTCAGACCGGGGAGGCCCGGGCCAGCAGCTCCCAG
TGTGACCTGACAAAAACACGTAGGGGCAGGGACGGTCCCCACCCCCAGGGACACAACCCCTGGTCTTGGACCAGTAGAGGACACGGAGGG
TTCAGACCCCTCCTCAGACCCTCCCCACATCTGAAACTGCCTCCCCCCAACCACCAGCAGCAGCAGGGCCCTCCTCCCCCACCAGCTCTC
CCCACAGGGCCCCTCAGCATCATGGAGACCCGCAGGCGGGGCTTAGCCACCCCTCAAACCCAGGGCCCCCTGGCACCTGGCTCTGGCCGT
GTTTTCTGGCCAGAGGCCCCCACTTTCCTAACTCGTGCTCCCTTCCGCCTTCTTTTCCGTACTGTGAAGAAAGAACTCTCCACCCCAGCT
CCCACCCTGCCCTGGCCTGGGTGGAGGAACTGTGCCTCCATCCCCAGAAGAAACAGCCCCCTCTGCTGCTGGGGTGGGACTGTCTGTGTG
CCCTGTGGGGGTCCGTGTGAGCAGGCCCACCTGGCTCCAGACCCGCCCCCAACCTGAGACAGAACCAGGCTGAGCCAGGCCTCCACCCCC
ACCCCCGTTTGCTGGGGGCTCCTCCAGCCGCCCCCATGGGAAGAGGCCTGGTACCGCCTCACCCACAGAGGTCTGTGCCAGGTGCGCTTC
TGCAGGTGGAGCCAAGCTCTCCCTGAGGCCAGAGGCGGGGCCTGGGCCGGGAGCCCAGGGGAAGGCCAGGCTGGACCCCGGCTCCACACC
CACATCCAGCCTGCAGGCCTCTCTGCAGTCCTCTCACCCTCCCTCAGCTCCCCTTCCTCTGCAGTCACCCTCAGCTCCCCTTCCTTGCCC
GCCTCTCCCCCCGCCGCCCCACCAGTTAAACGGATGACCAAAGACCTTTCTTATGCCGGAAGCAAAAACCAAAACTTTTTGTTGGCTTTT
TCCTTTGTCGCCTCCCCAGCACCTGCCCTCCCAGTCTCCCACCCCGGCCCCAGGCTGGAAGCCCTCCCTCCACTTAAGTTATTGTTTTAA
ACCAAAGTTTACAGTGTCTGTTGGTGGCCAAGACCTTCTCTCTCCACCCCTCCTCCATCCACCCTGAGGACCCTGGGGCTCAGTGGAGGC
AGGGCCCTGCCCCCCTCCCTTCCGCTCCTGCCCAGCCTGGGGGAAGGAGAAAGGAGGGGAGAAAGCGGGCTCTCACCCCCTCAGGAGTGG
GCACGGGAGCCCTTCTCCCTGACCCTGGGCTGCTTCCTGGGGGCTCTCCAGACCCCTCTCTAGGACCAAGTCACCCGTCGTGCTGGGAGT
GTGGATTCTAGCAAAAGAGCTGGAAAAAAGTCAGACTCTCCACAGACCCCCTATGGGGGACCCCCAACTCAAGGCCAAGGACTGGGCGTA
TCGGATGCTCATAACACCCCTGGCCTGGCCCCTTTACTGAGAAGACTCCTTGGATATTTCCCAAGAACCCCCCACATACACCCCTCACAA
GCCACCCCTCCTGAGAGGCAGGGGGCCCTCCGCCCCCTCCCCATGTATTCCCCACCTGTGTTCCGTTTGACCAGCACAGAAATATTAAAC
GTCCTCTATTCACCGGGCCCTGTGTGTGTCACCGAGGTGCGGGAGGGGAGGAGCATTAAAGCTGAAAGATCGCTCTGCTCGGGGAGCCTG
GGCAGAGCAGCAGCAACGTGAGGGTCGCTGTGGTGGTGGTTTCTGTGAGTGGATGGAATGAGCAGCCCTGCAGGGGCGCTGGGCATGTGC
CCTCACTGTGGACAGGGCCCACCCACCTCCGGTTCCCCTGTGCCTCCTGTCCCTTCCACGCTTAAACAGGGTTCTCTGTCATTTTCCTGT
TTTCTTCCAGAGTCCCAATCCTTTGCCCTAGTTCTTTCACTAGTTTGAAATCCAAGTTCTTGCCAGAGTGTTGGAGCAAGGCAGCTGATT
TGCTGCAGGGATGGAGAGGACCACCGCCGCAGGGTTCTTTTACCTGTGCCACCAGCTCTGGAGACATCCACACCCATTCCCAGGTGTCCT
TCCAGGCCCATTGTCCACGTCTTCAAGGGGGCTGCCTGGATAGGCGTGTGTGTGTGTAGTGCCCAGGTGTGGTGGTGGCATCCTGAAGCA
GTAGGACCATTAGTGTGCGTGCACACCCACGTGGCACACTGTGTGGTGACCATGGTCATCATAGTGGGCTCACGGCGGAAACGGGATCAT
TCAACCTATACAAAGGGGACCCTGGATAGGCTGCAAGGAAAGAGGCCAAGGGCCAAGCTGCTAAGCCAAAGATGGCCCCTGACACCTCCC
CCAGCCCCAGCACACCAGCCTGCACCTCTTATCCCGTGTTAAGTCCTGGTTCCCCACCTGCTGCCCCTCCTCAACACAGAGCCCCTCCCG
CCGCCTCAGACCCCTGTGCACATCCCCAGGGCCTCAGCCGTCTCATTGGTCTTTATTTTTTATTTTTTTTAAGATGGAGTTTCGCTCTTA
TTGCCCAGACTGGAGTGCAGTGATGCTATCTCGGCTCACTGCAACCTTTGCCTCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCAA
GTAGCTGGGATTACAGGCGTGCACCACCACGCCGGGCTAAATTTTTTTGTATTTTTAGTAGAGACGGGGTTTCTCTATGTTGGTCAGGCT
GATCTCGAACTCCCGACCTCAGGTGATCCGCCAGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGCACCCGGCTCTCA
CTGGTCTTACGCCACCTTCTGGACACTCCCTCCTTGAGGGCAGAAAGGAGTCCCAGGCCTGTCCCTAGGGACAAGGCCCAGGGAAGAGTG
TATTTGGGGAGCAGGGGAGGGGAGGGTGTTGAGAAAGCTGAACTGGAGTCAATCACCCTTCCCACAAATCACCAAACTGCTGGAACTCTC
CAGCCAAATGCTGGGAGAAGGACCTGGAGGGTGAGTCTTTGCTGACCTCTCTCTACTCTCAGGCATGTCTTTTGTCCTTTTCGTCCATCT
ATTTCTGTCTGTCGCTCACTCGCCCCGCTTTCTCTGTCTCACCTTCATCCACTCTGCAGGCCTGCTCCACCACAGCCCTAATCCTCTGGA
CGCTTGTGTAGGGCCTGGGGTGAATTCCCTGTCCCCCATGGTACCTCGAGAGGGGCTGGGGAGCTCAGCTTGGTCTCAGAGTCTCCCCAC
CAGATACTGTTTAAAAAAGTAGCACTGATGTGTTTTGTAATCTGCCCCTCCCAGCCCTCCGTGGAGGCTGCCAGGGCCTTGTACGGTAAA

>82458_82458_14_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000436008_FOXP4_chr6_41545724_ENST00000373063_length(amino acids)=869AA_BP=270
MTHPEPPSALSRAAGPPRRQRLQAPGPARCARGARGASAALYAPLREPRARAMAIDRRREAAGGGPGRQPAPAEENGSLPPGDAAASAPL
GGRAGPGGGAEIQPLPPLHPGGGPHPSCCSAAAAPSLLLLDYDGSVLPFLGGLGGGYQKTLVLLTWIPALFIGFSQFSDSFLLDQPNFWC
RGAGKGTELAGVTTTGRGGDMGNWTSLPTTPFATAPWEAAGNRSNSSGADGGDTPPLPSPPDKGDNASNCDCRAWDYGIRAGLVQNVVSK
ALQVARQFLLQQASGLSSPGNNDSKQSASAVQVPVSVAMMSPQMLTPQQMQQILSPPQLQALLQQQQALMLQQLQEYYKKQQEQLHLQLL
TQQQAGKPQPKEALGNKQLAFQQQLLQMQQLQQQHLLNLQRQGLVSLQPNQASGPLQTLPQAVCPTDLPQLWKGEGAPGQPAEDSVKQEG
LDLTGTAATATSFAAPPKVSPPLSHHTLPNGQPTVLTSRRDSSSHEETPGSHPLYGHGECKWPGCETLCEDLGQFIKHLNTEHALDDRST
AQCRVQMQVVQQLEIQLAKESERLQAMMAHLHMRPSEPKPFSQPVTVSAADSFPDGLVHPPTSAAAPVTPLRPPGLGSASLHGGGPARRR
SSDKFCSPISSELAQNHEFYKNADVRPPFTYASLIRQAILETPDRQLTLNEIYNWFTRMFAYFRRNTATWKNAVRHNLSLHKCFVRVENV
KGAVWTVDEREYQKRRPPKMTGSPTLVKNMISGLSYGALNASYQAALAESSFPLLNSPGMLNPGSASSLLPLSHDDVGAPVEPLPSNGSS

--------------------------------------------------------------
>82458_82458_15_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000436008_FOXP4_chr6_41545724_ENST00000409208_length(transcript)=3943nt_BP=1117nt
GGAGGCGCCCGAAGTCGGAGAAATACAGCGCGGCCACCCCGGGAGCAGCAGCCTTTGCCCGCACCGCCGCCCGCAGGGCCCTGAACCGCC
GGCAAGGGCGCGGGCCGGGCGCGGCGCCGGCTCCTCCTCCTGCCGCGTCCGGCGCGGAGCAGCGGAGAGCGGCGGGCTCGGCAGCCGGCG
CCCGGGCCGCGGAACTGATGAGCCGCGGCGGCTGAGGGCCCGGCCGGCCTCTGCCCGCCTCTGCCCGGCGCCCGAGCGGGGCCGCAGCGC
CCGGGGCGCGGGGTCCGGCGGGTGACATGGGGGCGGCCTGACGCACCCGGAGCCGCCGAGCGCTCTCTCTCGAGCCGCGGGCCCTCCTCG
GCGGCAGCGGCTGCAGGCGCCTGGCCCGGCGCGCTGTGCCCGGGGCGCCCGCGGAGCCTCCGCCGCGCTCTATGCGCCTCTGCGGGAGCC
GCGGGCCCGGGCCATGGCCATAGACCGGCGGCGCGAGGCGGCGGGCGGCGGGCCTGGGCGGCAGCCGGCCCCGGCCGAGGAGAACGGCTC
CCTGCCGCCCGGGGACGCGGCGGCCTCGGCGCCCCTCGGGGGACGCGCGGGCCCCGGCGGCGGCGCGGAGATCCAGCCGCTGCCCCCACT
GCATCCTGGAGGCGGCCCGCACCCGAGCTGCTGCTCCGCGGCTGCGGCCCCGAGCCTCTTGTTGCTGGACTATGACGGGTCGGTGCTGCC
CTTCCTCGGGGGCCTGGGCGGGGGCTATCAGAAGACCCTCGTGCTGCTCACCTGGATCCCGGCGCTGTTCATCGGCTTCAGCCAGTTCTC
GGACTCGTTCCTCCTGGACCAGCCCAACTTCTGGTGCCGCGGGGCCGGCAAAGGCACCGAGCTGGCAGGGGTCACCACCACAGGCCGGGG
CGGGGACATGGGCAACTGGACCAGCCTCCCCACCACCCCCTTCGCCACTGCCCCCTGGGAGGCTGCGGGCAACCGGAGCAACAGCAGCGG
CGCGGACGGAGGCGACACACCACCCCTGCCATCCCCTCCGGACAAGGGGGACAACGCCTCCAACTGTGACTGCCGCGCATGGGACTACGG
CATCCGCGCCGGCCTCGTCCAGAACGTGGTCAGCAAGGCTCTCCAAGTGGCCCGGCAGTTCCTGCTGCAGCAGGCCTCAGGCCTGAGCTC
CCCAGGGAACAATGACAGCAAACAGTCTGCCTCTGCTGTGCAGGTGCCTGTGTCGGTGGCCATGATGTCGCCGCAGATGCTTACCCCGCA
ACAGATGCAGCAGATCCTGTCGCCCCCGCAGCTGCAGGCCTTGCTCCAGCAGCAGCAAGCCCTCATGCTCCAGCAGCTACAGGAGTACTA
CAAGAAGCAGCAGGAGCAGCTCCACCTGCAGCTCCTCACCCAGCAGCAGGCTGGGAAACCGCAGCCCAAAGAGGCACTGGGGAACAAGCA
GCTGGCCTTCCAGCAGCAGCTCCTGCAAATGCAACAGTTGCAGCAGCAGCACCTGCTCAACCTGCAGAGGCAGGGGCTGGTCAGCCTGCA
GCCCAACCAAGCCTCGGGGCCCCTCCAGACCCTTCCGCAAGCAGCTGTTTGCCCAACAGACCTGCCCCAGCTGTGGAAGGGCGAGGGTGC
CCCCGGGCAGCCTGCCGAGGACAGCGTCAAGCAGGAGGGGCTGGACCTCACTGGCACGGCCGCCACCGCTACCTCGTTTGCCGCTCCCCC
CAAGGTCTCACCCCCCCTCTCCCACCATACCCTGCCCAACGGACAGCCTACTGTGCTCACATCTCGGAGAGACAGCTCTTCCCACGAGGA
GACCCCCGGCTCCCACCCCCTGTACGGACACGGAGAGTGCAAGTGGCCAGGCTGTGAGACCCTGTGTGAAGACCTGGGCCAGTTTATCAA
ACACCTCAACACAGAGCACGCCCTGGATGACCGGAGTACAGCCCAGTGCCGGGTACAGATGCAGGTGGTGCAGCAGCTGGAGATCCAGCT
CGCCAAGGAGAGCGAGCGGCTGCAGGCCATGATGGCCCACCTGCACATGCGGCCCTCGGAGCCCAAGCCCTTCAGCCAGCCAGTGACCGT
CTCTGCAGCAGACTCATTCCCAGATGGTCTCGTGCACCCCCCGACCTCGGCCGCAGCCCCTGTCACCCCTCTACGGCCCCCTGGCCTGGG
CTCTGCCTCCCTGCATGGTGGGGGCCCAGCCCGTCGGAGAAGCAGTGACAAGTTCTGCTCCCCCATCTCCTCAGAGCTGGCCCAGAATCA
TGAGTTCTACAAGAACGCCGACGTCCGGCCCCCCTTCACCTACGCCTCCCTCATCCGCCAGGCCATCCTGGAAACCCCTGACAGGCAGCT
GACCCTGAATGAGATCTATAACTGGTTCACCAGGATGTTCGCCTATTTCCGCAGAAACACTGCCACCTGGAAGAACGCCGTGCGCCACAA
CCTCAGCCTGCACAAGTGCTTCGTCCGCGTGGAGAACGTCAAGGGTGCCGTGTGGACTGTGGACGAGCGGGAGTATCAGAAGCGGAGACC
GCCAAAGATGACAGGGAGCCCCACCCTGGTGAAGAACATGATCTCTGGCCTCAGCTATGGAGCACTTAATGCCAGCTACCAGGCCGCCCT
GGCCGAGAGCAGCTTCCCCCTCCTCAACAGCCCTGGCATGCTGAACCCTGGCTCCGCCAGCAGCCTGCTGCCCCTCAGCCACGATGACGT
GGGTGCCCCCGTGGAGCCGCTGCCCAGCAACGGCAGCAGCAGCCCTCCTCGCCTCTCCCCGCCCCAGTACAGCCACCAGGTGCAGGTGAA
GGAGGAGCCAGCAGAGGCAGAGGAAGACAGGCAGCCCGGGCCTCCCCTGGGCGCCCCTAACCCCAGCGCCTCGGGGCCTCCGGAAGACAG
GGACCTGGAGGAGGAGCTGCCGGGAGAAGAACTGTCCTAAGGGCCTGTAGTGACCGGCAGGGCTGGGGTGAGACCCCTCCCTTCCAGAAT
CCAGGCCCCATCTCCCCCAACTCCACAGCCCCTCCCGAGCCTCAAGGCAAGTCCAGGACTCAGACCGGGGAGGCCCGGGCCAGCAGCTCC
CAGTGTGACCTGACAAAAACACGTAGGGGCAGGGACGGTCCCCACCCCCAGGGACACAACCCCTGGTCTTGGACCAGTAGAGGACACGGA
GGGTTCAGACCCCTCCTCAGACCCTCCCCACATCTGAAACTGCCTCCCCCCAACCACCAGCAGCAGCAGGGCCCTCCTCCCCCACCAGCT
CTCCCCACAGGGCCCCTCAGCATCATGGAGACCCGCAGGCGGGGCTTAGCCACCCCTCAAACCCAGGGCCCCCTGGCACCTGGCTCTGGC
CGTGTTTTCTGGCCAGAGGCCCCCACTTTCCTAACTCGTGCTCCCTTCCGCCTTCTTTTCCGTACTGTGAAGAAAGAACTCTCCACCCCA
GCTCCCACCCTGCCCTGGCCTGGGTGGAGGAACTGTGCCTCCATCCCCAGAAGAAACAGCCCCCTCTGCTGCTGGGGTGGGACTGTCTGT
GTGCCCTGTGGGGGTCCGTGTGAGCAGGCCCACCTGGCTCCAGACCCGCCCCCAACCTGAGACAGAACCAGGCTGAGCCAGGCCTCCACC
CCCACCCCCGTTTGCTGGGGGCTCCTCCAGCCGCCCCCATGGGAAGAGGCCTGGTACCGCCTCACCCACAGAGGTCTGTGCCAGGTGCGC
TTCTGCAGGTGGAGCCAAGCTCTCCCTGAGGCCAGAGGCGGGGCCTGGGCCGGGAGCCCAGGGGAAGGCCAGGCTGGACCCCGGCTCCAC
ACCCACATCCAGCCTGCAGGCCTCTCTGCAGTCCTCTCACCCTCCCTCAGCTCCCCTTCCTCTGCAGTCACCCTCAGCTCCCCTTCCTTG

>82458_82458_15_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000436008_FOXP4_chr6_41545724_ENST00000409208_length(amino acids)=870AA_BP=270
MTHPEPPSALSRAAGPPRRQRLQAPGPARCARGARGASAALYAPLREPRARAMAIDRRREAAGGGPGRQPAPAEENGSLPPGDAAASAPL
GGRAGPGGGAEIQPLPPLHPGGGPHPSCCSAAAAPSLLLLDYDGSVLPFLGGLGGGYQKTLVLLTWIPALFIGFSQFSDSFLLDQPNFWC
RGAGKGTELAGVTTTGRGGDMGNWTSLPTTPFATAPWEAAGNRSNSSGADGGDTPPLPSPPDKGDNASNCDCRAWDYGIRAGLVQNVVSK
ALQVARQFLLQQASGLSSPGNNDSKQSASAVQVPVSVAMMSPQMLTPQQMQQILSPPQLQALLQQQQALMLQQLQEYYKKQQEQLHLQLL
TQQQAGKPQPKEALGNKQLAFQQQLLQMQQLQQQHLLNLQRQGLVSLQPNQASGPLQTLPQAAVCPTDLPQLWKGEGAPGQPAEDSVKQE
GLDLTGTAATATSFAAPPKVSPPLSHHTLPNGQPTVLTSRRDSSSHEETPGSHPLYGHGECKWPGCETLCEDLGQFIKHLNTEHALDDRS
TAQCRVQMQVVQQLEIQLAKESERLQAMMAHLHMRPSEPKPFSQPVTVSAADSFPDGLVHPPTSAAAPVTPLRPPGLGSASLHGGGPARR
RSSDKFCSPISSELAQNHEFYKNADVRPPFTYASLIRQAILETPDRQLTLNEIYNWFTRMFAYFRRNTATWKNAVRHNLSLHKCFVRVEN
VKGAVWTVDEREYQKRRPPKMTGSPTLVKNMISGLSYGALNASYQAALAESSFPLLNSPGMLNPGSASSLLPLSHDDVGAPVEPLPSNGS

--------------------------------------------------------------
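The record headers above pack several fields into one underscore-separated line: fusion ID, record index, fusion name, then gene, chromosome, breakpoint coordinate and Ensembl transcript for each partner, followed by the declared length and a BP value. The following is a minimal sketch of how such a header could be split into named fields. The field layout and every name in it (`HEADER_RE`, `parse_fusion_header`, the dictionary keys) are assumptions inferred from the headers on this page, not a documented FusionGDB format.

```python
import re

# Layout inferred from the header lines on this page (an assumption, not an
# official specification). Gene symbols containing "_" would not match.
HEADER_RE = re.compile(
    r"^>(?P<fusion_id>\d+)_\d+_(?P<record_index>\d+)_"
    r"(?P<fusion_name>[^_]+)_"
    r"(?P<hgene>[^_]+)_(?P<hgene_chr>chr[^_]+)_(?P<hgene_bp>\d+)_(?P<hgene_enst>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<tgene_chr>chr[^_]+)_(?P<tgene_bp>\d+)_(?P<tgene_enst>ENST\d+)_"
    r"length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?:nt|AA)"
    r"_BP=(?P<bp_offset>\d+)(?:nt)?$"
)

INT_FIELDS = ("fusion_id", "record_index", "hgene_bp", "tgene_bp", "length", "bp_offset")


def parse_fusion_header(line):
    """Return the header fields as a dict, or None if the line does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        return None
    record = match.groupdict()
    for key in INT_FIELDS:
        record[key] = int(record[key])  # numeric fields become integers
    return record


example = (
    ">82458_82458_14_SLC22A23-FOXP4_SLC22A23_chr6_3456140_ENST00000436008_"
    "FOXP4_chr6_41545724_ENST00000373063_length(transcript)=6365nt_BP=1117nt"
)
fields = parse_fusion_header(example)
```

Once parsed, the declared `length` can be compared against the number of residues actually listed under a header, which is one way to spot records whose sequence was cut short when the page was saved.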


Fusion Gene PPI Analysis for SLC22A23-FOXP4


Go to the ChiPPI (Chimeric Protein-Protein Interactions) page to see the chimeric PPIs.


Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene | Hgene's interactors | Tgene | Tgene's interactors


Retained PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with


Lost PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with


Retained PPIs, but lost function due to frame-shift fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with



Related Drugs for SLC22A23-FOXP4


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status


Related Diseases for SLC22A23-FOXP4


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source