Tải bản đầy đủ (.pdf) (13 trang)

TWO FUNCTIONAL LUPUS-ASSOCIATED BLK PROMOTER VARIANTS CONTROL CELL-TYPE- AND DEVELOPMENTAL-STAGE-SPECIFIC TRANSCRIPTION

Bạn đang xem bản rút gọn của tài liệu. Xem và tải ngay bản đầy đủ của tài liệu tại đây (1.12 MB, 13 trang )

<span class="text_page_counter">Trang 1</span><div class="page_container" data-page="1">

Two Functional Lupus-Associated

BLK Promoter Variants Control

Cell-Type-and Developmental-Stage-Specific Transcription

Joel M. Guthridge,

<small>1,33,</small>

*Rufei Lu,

<small>1,2,33</small>

Harry Sun,

<small>3</small>

Celi Sun,

<small>1</small>

Graham B. Wiley,

<small>1</small>

Nicolas Dominguez,

<small>1</small>

Kenneth M. Kaufman,

<small>4,5</small>

Jennifer A. Kelly,

<small>1</small>

Carl D. Langefeld,

<small>6</small>

Adam J. Adler,

<small>1</small>

Isaac T.W. Harley,

<small>7</small>

Joan T. Merrill,

<small>8</small>

Gary S. Gilkeson,

<small>9</small>

Diane L. Kamen,

<small>9</small>

Timothy B. Niewold,

<small>10</small>

Elizabeth E. Brown,

<small>11,12</small>

Jeffery C. Edberg,

<small>13</small>

Michelle A. Petri,

<small>14</small>

Rosalind Ramsey-Goldman,

<small>15</small>

John D. Reveille,

<small>16</small>

Luis M. Vila´,

<small>17</small>

Robert P. Kimberly,

<small>13</small>

Barry I. Freedman,

<small>18</small>

Anne M. Stevens,

<small>19</small>

Susan A. Boackle,

<small>20</small>

Marta E. Alarco´n-Riquelme,

<small>1,24,35</small>

Kathy L. Sivils,

<small>1</small>

Jiyoung Choi,

<small>25</small>

Young Bin Joo,

<small>25</small>

So-Young Bang,

<small>25</small>

Hye-Soon Lee,

<small>25</small>

Sang-Cheol Bae,

<small>25</small>

Nan Shen,

<small>26</small>

Xiaoxia Qian,

<small>26</small>

Betty P. Tsao,

<small>27</small>

R. Hal Scofield,

<small>1,31,32</small>

John B. Harley,

<small>4,5</small>

Carol F. Webb,

<small>28,29</small>

Edward K. Wakeland,

<small>30</small>

Judith A. James,

<small>1,2,31</small>

Swapan K. Nath,

<small>1,34</small>

Robert R. Graham,

<small>3,34</small>

and Patrick M. Gaffney

<small>1,34</small>

Efforts to identify lupus-associated causal variants in the FAM167A/BLK locus on 8p21 are hampered by highly associated noncausalvariants. In this report, we used a trans-population mapping and sequencing strategy to identify a common variant (rs922483) in theproximal BLK promoter and a tri-allelic variant (rs1382568) in the upstream alternative BLK promoter as putative causal variants forassociation with systemic lupus erythematosus. The risk allele (T) at rs922483 reduced proximal promoter activity and modulated alter-native promoter usage. Allelic differences at rs1382568 resulted in altered promoter activity in B progenitor cell lines. Thus, our resultsdemonstrated that both lupus-associated functional variants contribute to the autoimmune disease association by modulating transcrip-tion of BLK in B cells and thus potentially altering immune responses.

The gene structures of BLK (MIM 191305), a member of thesrc-family tyrosine kinases, have been described in B cellspreviously.

<sup>1</sup>

More recently, the BLK-deficiency-inducedunderdevelopment of IL-17-producing gd T cells has impli-

cated a critical role of expression-altering BLK variants inthe pathogenesis of autoimmune diseases.

<sup>2</sup>

Studies withBlk-deficient mice suggest that BLK influences both B andT cell development and proliferation.

<sup>2,3</sup>

This locus is asso-ciated with multiple autoimmune diseases, including sys-temic lupus erythematosus (SLE [MIM 152700]), systemic

<small>1</small>Arthritis and Clinical Immunology Program, Oklahoma Medical Research Foundation, Oklahoma City, OK 73104, USA;<small>2</small>Department of Pathology, versity of Oklahoma Health Sciences Center, Oklahoma City, OK 73104, USA;<sup>3</sup>Immune and Tissue Growth and Repair and Human Genetics Department,Genentech, South San Francisco, CA 94080, USA;<small>4</small>Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229, USA;<small>5</small>Cincinnati Veterans AffairsMedical Center, Cincinnati, OH 45220, USA;<small>6</small>Department of Biostatistical Sciences, Wake Forest University, Winston-Salem, NC 27106, USA;<small>7</small>Division ofMolecular Immunology and Graduate Program in Immunobiology, Cincinnati Children’s Hospital Research Foundation, Cincinnati, OH 45229, USA;

<small>Uni-8</small>Department of Clinical Pharmacology, Oklahoma Medical Research Foundation, Oklahoma City, OK 73104, USA;<small>9</small>Department of Medicine, Divisionof Rheumatology, Medical University of South Carolina, Charleston, SC 29425, USA;<small>10</small>Division of Rheumatology and Department of Immunology,Mayo Clinic, Rochester, MN 55902, USA;<small>11</small>Department of Epidemiology, University of Alabama-Birmingham, Birmingham, AL 35294, USA;<small>12</small>Departmentof Medicine, University of Alabama-Birmingham, Birmingham, AL 35294, USA;<small>13</small>Division of Clinical Immunology and Rheumatology, University ofAlabama-Birmingham School of Medicine, Birmingham, AL 35294, USA;<small>14</small>Department of Medicine, Johns Hopkins University School of Medicine, Balti-more, MD 21205, USA;<small>15</small>Division of Rheumatology, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA;<small>16</small>Rheumatology andClinical Immunogenetics, University of Texas Health Science Center at Houston, Houston, TX.77030, USA;<small>17</small>Department of Medicine, Division of Rheu-matology, University of Puerto Rico Medical Sciences Campus, San Juan 00921, Puerto Rico;<sup>18</sup>Department of Internal Medicine, Wake Forest School ofMedicine, Winston-Salem, NC 27106, USA;<small>19</small>Division of Rheumatology, Department of Pediatrics, University of Washington Center for Immunity andImmunotherapies, Seattle Children’s Research Institute, Seattle, WA 98101, USA;<small>20</small>Division of Rheumatology, University of Colorado Denver, Aurora,CO 80045, USA;<small>21</small>Rosalind Russell Medical Research Center for Arthritis, University of California San Francisco, San Francisco, CA 94143, USA;<small>22</small>Divisionof Medicine, Imperial College of London, London SW7 2AZ, UK;<small>23</small>Department of Medicine, University of Southern California, Los Angeles, CA 90089,USA; <small>24</small>Centro de Geno´mica e Investigaciones Oncolo´gicas (GENYO). Pfizer-Universidad de Granada-Junta de Andalucı´a, Granada 18016, Spain;

<small>25</small>Department of Rheumatology, Hanyang University Hospital for Rheumatic Diseases, Seoul 133-791, Korea;<small>26</small>Molecular Rheumatology Laboratory, tute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences and Shanghai Jiao Tong University School of Medicine,Shanghai 200025, China;<small>27</small>Department of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA;<small>28</small>Immunobiology and CancerProgram, Oklahoma Medical Research Foundation, Oklahoma City, OK 73104, USA;<sup>29</sup>Department of Cell Biology and Department of Microbiologyand Immunology, University of Oklahoma Health Sciences Center, Oklahoma City, OK 73104, USA;<small>30</small>Department of Immunology, University of TexasSouthwestern Medical Center, Dallas, TX 75235, USA;<small>31</small>Department of Medicine, University of Oklahoma Health Sciences Center, Oklahoma City, OK73105, USA;<small>32</small>United States Department of Veterans Affairs Medical Center, Oklahoma City, OK 73105, USA

<small>Insti-33</small>These authors contributed equally to this work

<small>34</small>These authors contributed equally to this work

<small>35</small>On behalf of the BIOLUPUS Network; members of BIOLUPUS are listed in the Consortia section*Correspondence:

by The American Society of Human Genetics. All rights reserved.

586 The American Journal of Human Genetics94, 586–598, April 3, 2014

</div><span class="text_page_counter">Trang 2</span><div class="page_container" data-page="2">

sclerosis (MIM 181750), rheumatoid arthritis (MIM180300) and Sjoăgrens syndrome (MIM 270150).

<sup>4–11</sup>

Ana-lyses of expression in transformed B cell lines demonstratethat risk-conferring variants within FAM167A (MIM610085) and BLK are associated with altered mRNA expres-sion of both FAM167A and BLK; however, the causal allelesand mechanisms remain undefined.

<sup>7</sup>

Like other genes with TATA-less promoters, the genomicDNA upstream of exon 1 of BLK has two transcription startsites and promoters that drive BLK transcription: a ubiqui-tous proximal promoter (P1) and a B-lymphocyte-specificpromoter (P2).

<small>1</small>

Recent evidence suggests that immatureB cells from individuals carrying lupus risk alleles havelower amounts of BLK than such cells from individualswithout lupus risk alleles.

<small>12</small>

In this study, we leveraged the difference in linkagedisequilibrium (LD) structure across populations toexamine the FAM167A/BLK locus in a multiethnic popula-tion of SLE cases and controls and then used focused rese-quencing to identify additional lupus-associated variants.Functional assessment revealed the molecular mechanismimpacted by the variant alleles. Using this approach, wesuccessfully identified two functional variants that regu-late transcription from the promoters in a cell-type- anddevelopmental-stage-specific fashion.

Subjects and Methods

Study Subjects

Approval by the institutional review boards of the OklahomaMedical Research Foundation and the collaborators’ institutionswas obtained prior to sample collection. All study participantsprovided written consent at the time of sample collection. De-identified genomic DNA samples from individuals with SLE andcontrol subjects were analyzed from 6,658 unrelated individuals(3,980 individuals of European ancestry [EA], 1,272 of Asianancestry [AS], and 1,406 of African American ancestry [AA]) and6,550 unrelated controls (3,546 EA, 1,270 of AS, and 1,734 AA)(Table 1). These samples were obtained through the LupusFamily Registry and Repository (LFRR) as part of the OklahomaRheumatic Disease Research and Cores Center (ORDRCC) andthrough collaborators from 24 additional study sites. Collabora-tors and the sources of all case and control individuals used inthese studies are shown inTable S1 in the Supplemental Dataavailable online.

For resequencing experiments, deidentified genomic DNAsamples from individuals with SLE and controls were obtained

(ABCoN) of the New York Cancer Project (NYCP) (191 EA SLEindividuals and 96 EA controls) courtesy of Dr. Gregersen forthe discovery cohort (Table S2). All individuals with SLE metclassification criteria<small>13</small>(American College of Rheumatology). Allsamples were independent. Only one randomly selected SLE sam-ple was included if multiple affected individuals were availablefrom a multiplex lupus pedigree. DNA was obtained from bloodsamples.

Genotyping and Quality Control

All samples were genotyped as a part of a joint effort of more than40 investigators from around the world. These investigatorscontributed samples, funding, and hypotheses used for designinga custom, highly multiplexed Illumina-bead-based array methodon a BeadStation system.<small>14</small>Select SNPs were also assayed for geno-type confirmation via TaqMan methods (Applied Biosystems).Genotyping facilities are located at the Oklahoma MedicalResearch Foundation, and data were sent to a central data centerat Wake Forest Medical Center for quality control. These datawere then distributed back to the investigators who had requestedspecific SNPs for final analysis and publication.

Genotype data were only used from samples with a call rategreater than 90% of the SNPs screened (98.05% of the samples).For analyses, only genotype data from SNPs with a call frequencygreater than 90% in the samples tested and an Illumina Gen-Train score greater than 0.7 (96.74% of all SNPs screened) wereused. In addition, at least one previously genotyped samplewas randomly placed on each assay plate and used for trackingsamples through the genotyping process. More information onIllumina genotyping can be found at the Illumina website(Web Resourcessection).

Correction for Population Stratification

Following best practices in genome-wide association studies, weused all of the genotype data from all SNPs that passed quality con-trol, including the published set of ancestry-informative makers(AIMs),<small>15</small>and computed the principal components and admixtureestimates. Regions of known extended LD were removed. Thecombination of 12,000 SNPs, including published sets of AIMsand the principal-component analysis computed across all ethnic-ities, generated principal components that separated ethnicities.To minimize the inflation of the test statistics, we included popu-lation-specific principal components in the logistic regressionmodels as covariates.<small>15,16</small>Population clustering based upon thethree-dimensional plot of principal component 1 (PC1), PC2,and PC3 of the final samples used in these studies is presented(Figure S1).

Imputation-Based Association Analysis

Initially, we genotyped 372 SNPs within the FAM167A/BLK region(11,033,737–11,618,107 bp, hg19), and after performing qualitycontrol (HWE > 0.001 in controls and minor allele frequency[MAF]> 0.01), we had 329 SNPs in AA samples, 259 SNPs in EAsamples, and 201 SNPs in AS available for imputation. To investi-gate the new variants in the FAM167A/BLK region, we used the1000 Genomes project<sup>17</sup>as a reference panel for imputation toestimate missing genotypes. After quality control measures(HWE> 0.001 in controls and MAF > 0.01) for the 1000 Genomesproject reference panel, which contains 11,528 SNPs within theFAM167A/BLK region, we used 246 AA samples with 4,813 SNPs,381 EA samples with 2,508 SNPs, and 286 AS samples withTable 1. Demographics of SLE Populations Studied

Affected Individuals Control Individuals

</div><span class="text_page_counter">Trang 3</span><div class="page_container" data-page="3">

1,847 SNPs for imputation. Imputation was carried out withMACH,<small>17,18</small>which provided a quantitative assessment of estimateuncertainty (Rsq). All imputed SNPs were filtered with the qualitycontrols (HWE> 0.001, MAF > 0.01, and Rsq > 0.6), and 2,137SNPs in AA samples, 1,199 SNPs in EA samples, and 738 SNPs inAS samples were used for further analysis. At each SNP, p value,odds ratio (OR), and 95% confidence interval (CI) were calculatedwith gPLINK.<small>19</small> We calculated allelic association results (Table 2andTable S3) to account for imputation uncertainty with mach2dat;<sup>20</sup>genotyped and imputed SNPs with p values% 0.05 fromat least one population are shown.

For each ethnic population, we used WHAP<sup>19</sup>to calculate wise conditional analysis for each pair of SNPs (the most signifi-cant SNP plus each other SNP) and identify the independenteffects for each SNP. We assessed whether the joint effect isexplained by a single SNP. If a haplotype was significant andremained significant after we conditioned on a SNP, then thatSNP did not independently account for the association. However,if the p value was no longer significant after we conditioned on aSNP, then we considered that SNP to be the source of theassociation.

pair-Resequencing ofFAM167A/BLK Exons and theUpstream Promoter Region

Resequencing was performed on 191 individuals with SLE and 96controls from ABCoN, as detailed above (Table S2). All 13 exonsand the 2.5 kb upstream promoter sequence were resequencedwith whole-genome amplified genomic DNA (Cat#150045,QIAGEN). Primers for resequencing were designed to target the13 exon regions and 2.5 kb upstream promoter region. PCR ampli-fication was performed on genomic DNA via high-fidelity Taqpolymerase according to standard protocols. PCR product purityand size were assessed on 2% agarose gels. Sanger sequencingwas performed per the manufacture’s protocol. Sequence trace fileswere manually analyzed for variations.

Haplotype Analysis

We used the expectation-maximization algorithm in the WHAPprogram<sup>19</sup> to estimate haplotype frequencies. WHAP directlycalculates likelihood estimates, likelihood ratios, and p valuesby taking into account the information loss due to haplotype-phase uncertainty and missing genotypes. Association betweeninferred haplotypes and SLE was tested with an omnibus test.We used both conditional analysis and global haplotype analysisto disentangle the correlation structure in which SNPs are trulyassociated with phenotype. To test which of the associatedSNPs were causal and which were significantly associated byLD, we performed haplotype conditional analysis on each SNP.If the global haplotype association disappeared, then the specificSNP on which we had conditioned accounted for the wholeassociation.

Nuclear Extract Preparation

Nuclear extracts from the human Jurkat T cell line, RS4;11 pro-Bcell line, Nalm-6 and Reh pre-B cell lines, Ramos immature B cellline, and Daudi mature B cell line (American Type Tissue CultureCollection) were obtained. Cells were maintained in RPMI with10% heat-inactivated fetal bovine serum, L-glutamine (2 mM),and penicillin and streptomycin (100 units/ml). Nuclear proteinextracts were prepared from cells, dialyzed against a buffercomposed of 20 mM HEPES, 20% glycerol, 0.1M potassium chlo-

ride, and 0.2 mM EDTA (pH 7.9), and used in nuclear bindingassays (Figures S2andS3).<small>21</small>

Electrophoretic Mobility-Shift Assay

A forward and reverse 21 base pair synthetic oligonucleotide fromthe BLK promoter flanking the rs922483 polymorphism was pur-chased from Integrated DNA Technologies. All oligos were purifiedwith polyacrylamide gel electrophoresis. Probes carrying the riskallele (T) and nonrisk allele (C) were generated, and pairs of oneforward and one reverse oligonucleotide were mixed in equalmolar ratios, heated, and then allowed to anneal to generatethe 21 bp, double-stranded probes. T4 polynucleotide kinase(Invitrogen) was used for labeling the end of each DNA probewith (g-<sup>32</sup>P) adenosine triphosphate (Amersham). The nuclear ex-tracts prepared as discussed above were incubated for 25 min at37<sup></sup>C with labeled probes in binding buffer (1 mg poly(dI-dC),20 mM HEPES, 10% glycerol, 100 mM KCl, and 0.2 mM EDTA[pH 7.9]). DNA-protein complexes were resolved on denaturing5% acrylamide gels. For supershift assays, varying concentrationsof anti-pol II antibody (clone 8A7 and clone H-224, Santa Cruz)were added to the DNA-protein complexes; this was followed byincubation for 15 min prior to resolution on denaturing 5% acryl-amide gels (Figure S3).

Luciferase Reporter Assay

We amplified the upstream sequence (2,256 to ỵ55 bp) of BLK byusing genomic DNA from individuals with nonrisk haplotypes.PCR products were cloned into pCR2.1-TOPO vector (Invitrogen,Cat# K4500-01) and subcloned into pGL4 luciferase reporter vec-tors (Promega, Cat# E6651, Madison, WI). The construct carryingthe nonrisk haplotype was used as a template for mutagenesis(Stratagene) to create other allelic haplotypes.

An internal control reporter vector, pRL-TK, containing Renillaluciferase driven by the thymidine kinase promoter was simulta-neously transfected with our experimental vectors as a controlfor assay-to-assay variability. The Renilla luciferase activity ex-pressed by the internal control vector was used for normalizationof transfection efficiency. One to 5 mg of each vector was trans-fected into the Jurkat (13 10<small>6</small>

/sample in triplicate), RS4;11 (2310<small>6</small>/sample in triplicate), Nalm-6 (33 10<small>6</small>/sample in triplicate),Ramos (33 10<small>6</small>

/sample in triplicate), and Daudi (53 10<small>6</small>/samplein triplicate) cell lines. Cells were then incubated at 37<sup></sup>C for 16 hr.Luciferase activity was measured with the Dual-Luciferase ReporterAssay System (Promega, Cat# E1960). Luciferase activity wasnormalized through division of BLK risk or nonrisk construct re-porter activity by the reporter activity of the pRL-TK construct.The mean and standard error of measurement were calculatedon the basis of the normalized luciferase activities and used forfurther analysis.

Trans-Population Association Testing Identifiedrs922483 as the Predominant SLE-Associated CausalVariant

To identify the causal variants responsible for the tion of FAM167A/BLK with SLE, we genotyped 372SNPs selected from the phase II HapMap in the regionspanning 584.37 kb (11,033,737–11,618,107 bp, hg19) inchromosomal region 8p21 in three ethnic populations.

associa-588 The American Journal of Human Genetics94, 586–598, April 3, 2014

</div><span class="text_page_counter">Trang 4</span><div class="page_container" data-page="4">

Chr. dsSNP

OR(95% CI)

Freq_Allele1(Case/Control) Adj. p

OR(95% CI)

rs1478901Freq_Allele1(Case/Control) Adj. p

OR(95% CI)

0.00000411 0.73 (0.63–0.84)

0.79 (0.73–0.85)

0.00000218 0.73 (0.64–0.83)

</div><span class="text_page_counter">Trang 5</span><div class="page_container" data-page="5">

Table 2. Continued

Chr. dsSNP

OR(95% CI)

Freq_Allele1(Case/Control) Adj. p

OR(95% CI)

rs1478901Freq_Allele1(Case/Control) Adj. p

OR(95% CI)

0.00008384 1.23 (1.11–1.36)

0.00000525 0.73 (0.64–0.84)

</div><span class="text_page_counter">Trang 6</span><div class="page_container" data-page="6">

After applying quality-control measures and adjusting foradmixture within and across populations (Figure S1), weanalyzed a total of 6,658 independent cases and 6,550 in-dependent controls (Table 1andTable S1).

To enrich the genotyped data set for nongenotyped SNPs,we imputed variants located between 11,033,737 bp and11,618,107 bp (hg19) by using population-specific refer-ence panels derived from the 1000 Genomes Project.

<small>22</small>

SNP-association results for each population are shown orlisted inFigures 1A–1C,Table 2, andTable S3). Consideringthe correlated variants that had r

<sup>2</sup>

> 0.6 with the peak asso-ciated SNP in each population, we observed 30 SNPsdemonstrating association in the AS population (peakSNP rs1478901, p¼ 1.32 3 10

<small>11</small>

, OR¼ 0.64, 95% CI ¼0.56–0.73) and 20 SNPs demonstrating association in theEA population (peak SNP rs998683, p¼ 5.22 3 10

<small>14</small>

,OR¼ 0.76, 95% CI ¼ 0.71–0.82) (Table 2). However, weobserved only two associated SNPs (SNP rs2736345, p¼1.493 10

<small>6</small>

, OR¼ 1.28, 95% CI ¼ 1.15-1.42 and peakSNP rs922483, p¼ 1.15 3 10

<small>6</small>

, OR¼ 1.31, 95% CI ¼1.17–1.47) in the AA population because of the reducedLD in this region. Both variants identified in the AApopulation are within the subset of variants that were iden-tified in the EA and AS samples as having r

<small>2</small>

> 0.6 relativeto the peak SNPs, suggesting that the same causal variantsare present in all three populations. Conditional associa-tion tests performed within each population validatedrs998683, rs1478901, and rs922483 as the main SLE-associ-ated variant for EA, AS, and AA, respectively (Table 2). Thus,rs922483 is likely to be the predominant SLE-associatedvariant.

We concluded that, of the common associated variants,rs922483 was the stronger functional candidate given thatit is located near a putative transcript initiator (INR) site

<sup>23</sup>

(Figure S4) in a region predicted to bind RNA polymerase II(RNAPII), and its association with SLE remained significantwhen conditioned on rs2736345 (Table 2).

Resequencing Identified an Additional SLE-Associatedtriallelic SNP, rs1382568, Located within the

B-Cell-Specific Promoter

To ensure identification of other uncommon and allelic genetic variation in this region, we resequencedall 13 BLK exons and the 2.5 kb upstream promoterregions in 191 EA SLE individuals and 96 EA controlsfrom the Autoimmune Biomarkers Collaborative Network(ABCoN) and the New York Cancer Project (NYCP), respec-tively. Although no additional nongenotyped or nonim-puted biallelic variants were detected, an SLE-associatedtri-allelic variant, rs1382568 (A/G/C), that is highly corre-lated with the variant (rs922483) identified in our trans-population association study was identified (Table 3andTable S3).

multi-To confirm the association of these two variants, we useddata obtained for these two SNPs from additional rese-quencing efforts on 960 subjects (710 affected individualsand 250 control individuals). Association analysis results

</div><span class="text_page_counter">Trang 7</span><div class="page_container" data-page="7">

from these data demonstrate that both C and A alleles atrs1382568 individually contributed to the increased SLErisk when compared to the G allele (OR 1.70, p¼ 4 310

<sup>3</sup>

; and OR 2.53, p¼ 6.66 3 10

<small>4</small>

, respectively). Associ-ation analysis using the combined C/A risk allele atrs1382568 had an OR¼ 1.90 and p ¼ 6.66 3 10

<small>4</small>

. Thistri-allelic variant is located within the alternative BLKpromoter (P2)

<small>1</small>

(Figure 1D). These data, and previouslypublished results demonstrating that endogenous BLKexpression varies with B cell developmental stage,

<sup>24</sup>

ledus to hypothesize that the SLE-associated P2 variant mightcontribute to disease risk by promoting functionaleffects in B cells at discrete stages of development. Wefunctionally characterized both variants (rs1382568 andrs922483) in B cell lines that phenotypically representdifferent stages of B cell development.

Both Risk Alleles at rs922483 (T) and rs1382568 (C)Alter BLK Transcription

To investigate the impact of the SLE-associated promotervariants on BLK transcription, we cloned the BLK promoterregion (2256 to ỵ55 bp) into a firefly luciferase re-porter vector and performed site-directed mutagenesis togenerate all six possible haplotype combinations of thers1382568 (P2) and rs922483 (P1) variants. B lymphomacell lines with distinct phenotypes representing variousB cell developmental stages were transfected with the re-porter constructs. RS4;11 and Nalm-6 cell lines are repre-sentative of early stages of B cell development (pre- andpro-B cells), whereas Ramos and Daudi lines representmore mature B cells. The allelic effects of both BLK pro-moter variants were also tested in Jurkat cells, which arephenotypically similar to mature T cells. EndogenousBLK protein expression in each of these lines wasconfirmed to be as previously described (Figure S2).

<small>1,12</small>

Because of the small numbers of SLE-affected individualscarrying both risk alleles P1 and P2, we utilized in vitroassays to better isolate the influence of the P1 variant onBLK promoter activity. We assessed the average of lucif-erase activities of all P1-risk-allele- (T)-containing vectors,including T(P1)-C(P2), T(P1)-A(P2), and T(P1)-G(P2), aswell as all P1-nonrisk-allele-containing vectors. The riskallele (T) at the P1 variant resulted in reductions of normal-ized luciferase expression in mature B (35%, Daudi) andmature T (32%, Jurkat) cell lines regardless of the allele atthe P2 variant (p value< 0.05) (Figure 2A). The effect ofthe risk allele at the P1 variant on BLK-promoter-driventranscription was less pronounced in RS4;11 (pro-B) andNalm-6 (pre-B) cells. Nuclear-factor binding assays demon-strated that the allelic variants at the P1 site alterednuclear-factor recruitment to the P1 promoter (Figure S3A),most likely as a result of changes in either the recruitmentor the affinity of binding of the complement of nuclear fac-tors and RNA-polymerase-complex components to this re-gion of the BLK promoter, as suggested by a super-shiftbinding assay (Figure S3B). However, the complex natureof nuclear-factor binding to this site hampered our ability

FAM167A/BLK Gene Locus in SLE-Affected Individuals

SNPs in and around the FAM167A/BLK gene locus in individualswith SLE with (A) European ancestry, (B) Asian ancestry (C), andAfrican American ancestry are shown. All SNPs with an r<small>2</small>> 0.6(correlation with previously reported peak SNP rs13277113) aredisplayed. The solid blue line represents recombination ratesacross the region. The most significantly associated SNP in eachpopulation is colored purple, and the SNP number is indicated.(D) A schematic with key features of the BLK proximal promoteris shown. Probe P2 and P1 represent the 100 bp probe flankingthe candidate variants, rs1382568 and rs922483. P2 and commonqPCR products represent the products from luciferase gene-spe-cific reverse transcription using product-specific primers (repre-sented by red arrows).

592 The American Journal of Human Genetics94, 586–598, April 3, 2014

</div><span class="text_page_counter">Trang 8</span><div class="page_container" data-page="8">

to define the exact molecular interaction affected by thenucleotide variation at this site.

In order to explore the effect of P2, we compared theaveraged luciferase activities from all vectors containingthe P2 risk allele (C) with other vectors containing the P2risk allele (C). We observed the most significant alleliceffect at the P2 site in early B cells (RS4;11 and Nalm-6),where risk alleles A or C at the P2 site reduced luciferaseexpression in comparison to the nonrisk allele (G) at thisvariant (p value< 0.05) (Figure 2B). However, the impactof the P2 variant became insignificant when this variantwas transfected into more mature B cell lines. Nuclear-factor binding assays showed that the risk allele (C)reduced the binding affinity of multiple nuclear-factorcomplexes to the probe containing the P2 allelic variant(Figure S3C).

The results from these assays demonstrate that thelupus-associated risk alleles at both the P1 site (rs922483)and the P2 site (rs1382568) reduce the transcriptional

activity of the BLK promoter in vitro. However, the effectof the risk allele at the P1 site most significantly affectsBLK transcription in more mature B cells, whereas theeffects of the risk alleles at the P2 site most significantlyaffect BLK transcription in more immature B cells.P1 Variant Modulates Promoter Usage

Genes such as BLK that have multiple TSSs (transcriptionstart sites) represent a class of genes in which changes ingene expression might be attributed to polymorphisms atmultiple promoter sites. Selection of promoter use canvary on the basis of the organization of specific nuclear-factor binding sites and/or the epigenetic conformationof the genomic DNA in the promoters surrounding theseTSSs. In addition, the organization of the promoters and/or TSSs and the dynamics of the transcription initiationand elongation steps of the RNA polymerase from eachpromoter influence which transcripts predominate withina cell. Differential promoter and TSS usage has been

Case, Control Ratio

r<small>2</small>(withrs13277113)

</div><span class="text_page_counter">Trang 9</span><div class="page_container" data-page="9">

elegantly demonstrated in the regulation of expression ofthe human c-myc gene (MIM 190080).

<sup>25</sup>

In this case, apreferred downstream promoter normally impedes (atten-uates) the transcription initiated from the upstreampromoter. However, inhibition of binding of the transcrip-tional machinery (e.g., RNA polymerase complex) preventstranscription initiation at the downstream c-myc pro-moter, removing attenuation of the upstream promoterand resulting in the upstream promoter’s becoming thepreferred promoter.

To determine whether such a mechanism controls BLKpromoter selection and whether lupus-disease-associatedvariants in the BLK promoter P1 site can alter this mecha-nism, we used a transcript-specific luciferase reporter RT-qPCR assay to quantitate the percentage of the total BLKreporter transcripts in the B cell panel representing variouscell stages of development. The usage of P2 and TSS2 wassignificantly higher in a majority of the B cell lines than

in the mature T (Jurkat) cell line (p< 0.05) (Figure 3).This finding is consistent with the observations made byLin et.al.,

<small>1</small>

who showed that the P2 promoter is primarilyused by B cells. The risk allele (T) at the P1 variant reducedthe P1 and TSS1 contribution to the overall BLK-luciferase-reporter transcript levels in all cells, independent of the P2variant (p< 0.05) (Figure 2A). However, the usage of P2and TSS2 was increased by 21% and 12% in the immatureB cell lines (RS4;11 and Nalm-6, respectively) in the pres-ence of a risk-allele (T) at the P1 variant (Figure 3). Theseresults suggest that lupus-associated risk alleles at the P1variant decrease the effective initiation of the BLK-reportertranscription from P1 and TSS1. This might lower theattenuation of P2 and TSS2 in early B cells, presumablyby a mechanism similar to that observed with the c-mycgene. These findings provide mechanistic insights as tohow multiple disease-associated variants in different pro-moters can have a collective effect modulating expressionof disease-associated genes.

Previous studies have linked multiple genetic variants atmany loci with the development of autoimmune dis-ease.

<sup>26–31</sup>

Genetic variants found at the FAM167A/BLKlocus are associated with multiple autoimmune diseases,including SLE, systemic sclerosis, rheumatoid arthritis,and Sjoăgrens syndrome.

<small>411</small>

Although risk-conferring var-iants within FAM167A/BLK have been shown to be associ-ated with altered mRNA expression of both FAM167A andBLK,

<small>7</small>

the causal allele or alleles remain undefined as aresult of the strong association between potential causalalleles and noncausal variants. Using the trans-populationmapping and sequencing strategy, we focused on two com-mon associated variants (rs922483 and rs1382568) located

Figure 2. Both P1 and P2 Variants AffectBLK-Promoter-DrivenTranscriptional Activity

Mean and standard error of measure (SEM) are displayed in thecenter, and probability density functions are represented by thesides. The effect of P1 variant with either risk or nonrisk P2 haplo-type on overall luciferase expression (A) and the transcriptionalactivity in cell lines transfected with reporter vectors carryingone of the three SLE-associated P2 variants with a nonrisk P1 (B)is shown. Nine transfections of each vector carrying the P1 allelebeing compared were performed in each model cell line (n¼ 9),and triplicates were assessed for luciferase activity to give normal-ized means for each transfection. P1 risk [R(T)] and nonrisk[NR(C)] variants are compared (mean5 SEM). P2 variants ofeach allele (G, A, or C) were assessed in six experiments. Normal-ized luciferase ratio¼ (normalized luciferase activity of the haplo-type)/(normalized luciferase activity of the T allele at P1 theluciferase activity of the C allele at P2). The normalized luciferaseactivity for the haplotype¼ luciferase activity of BLK:pGL4/lucif-erase activity of TK:pRL. *p< 0.05 in a paired t test. Means 5 SEMare shown.

Figure 3. P1 Variant Altered Promoter Usage in RS4;11 andNalm-6 Cell Lines

Percentages of the total BLK promoter-luciferase derived scripts initiated from the P2 were determined using gene-specificRT-qPCR 16 hr post-transfection. *p-value < 0.05 using pairedt test. Mean5 SEM are shown.

tran-594 The American Journal of Human Genetics94, 586–598, April 3, 2014

</div><span class="text_page_counter">Trang 10</span><div class="page_container" data-page="10">

within the two promoter regions of BLK for additionalfunctional analysis.

Previously published data defined the two BLK moters and TSSs as a ubiquitously expressed TSS1 and aB cell-specific TSS2 located approximately 400 bp upstreamof the ubiquitous promoter.

<small>1</small>

Because both candidatelupus-associated variants were located in functionallyimportant loci of the BLK promoter, we hypothesizedthat they might alter unique aspects of BLK transcriptionalregulation. The rs922483 SNP resides in the ubiquitousP1 and TSS1 site within a putative initiator of transcription(INR) site.

<sup>23</sup>

The other lupus-associated variant, rs1382568,is located in an upstream P2 region that is highly enrichedfor several B-cell-specific nuclear-factor binding sites.Because rs922483 and rs1382568 have a high degree ofassociation with SLE and are located in key regions of pro-moters, our results confirm the possibility that these vari-ants contribute to disease development through regulationof BLK promoter activity.

pro-We used reporter assays and nuclear-factor binding inB cell lines with phenotypes representative of differentdevelopmental stages to study the effects of variants onpromoter activity. We cannot exclude the possibility thatfresh B cells might behave differently; it is possible that pri-mary lymphocytes might have different expression levelsand activity levels of transcription factors and that thesedifferent levels might result in altered BLK transcriptionnot observed in cell lines. However, our data directlycompared the effects of promoter alleles within varioustypes of developmental stages of B cell lines characterizedto represent different stages of B cell development to givea clearer picture of BLK transcription in early B cell devel-opment. Isolating sufficient numbers of primary progeni-tor B cells with all haplotypes would be prohibitive.Despite its limitations, this reporter assay allowed assess-ment of both the allelic and haplotype effects of thesevariants on BLK promoter activity within multiple repre-sentative cell types.

Our results demonstrated that both variants play a rolein regulating BLK transcription. Risk alleles at these sitesmost likely alter the affinity and/or specificity of bindingof critical nuclear factors and their interactions with RNApolymerase II subunits. Our results indicate that the degreeof impact of a particular risk allele on BLK transcription de-pends both upon cell type and, in the cases of B cells, uponthe developmental stage. This is consistent with observa-tions made by Simpfendorfer et.al. in primary cells, wherethey reported that a risk allele at rs922483 (P1 variant) ledto an overall reduction in BLK mRNA expression in T cellsfrom human peripheral-blood and umbilical-cord B cells.

<small>12</small>

Although the transcription of BLK was affected by thevariant in early B and T cells, BLK protein level was onlysignificantly reduced in umbilical-cord B cells.

<small>12</small>

On the basis of our results and the previously publishedinformation, we propose a molecular mechanistic modeldepicting the cell-type- and developmental-stage-specificeffect of both lupus-associated variants on the overall

BLK promoter activity (Figure 4A). In this model, the P1promoter is the predominant promoter. When the RNApolymerase II complex binds and initiates transcriptionfrom this promoter, the P2 B-cell-specific promoter isstochastically inhibited or P2-initiated transcription isprematurely terminated by RNA polymerase complexesbound to the P1 site. Because P1 is the only active pro-moter in non-B cells, a switch to a risk allele at the P1site alone will lead to a significant reduction in overallBLK promoter activity.

Alternatively, in B cells, production of BLK transcriptswould be derived from both the P1 and TSS1 site and theP2 and TSS2 site. In mature B cells, P1 and TSS1 remainthe preferred promoters, possibly as a result of nuclear fac-tors and chromatin conformation at that site, which favorhigh-affinity RNA polymerase II binding and transcriptionfrom P1 and TSS1. When a lupus risk allele is present at theP1 site, possibly lowering the affinity of nuclear factorbinding or efficiency of RNA polymerase transcriptioninitiation, the obstruction and attenuation of P2 initiatedtranscription would be diminished resulting in more P2derived transcripts. In this environment, an additionalrisk allele at the P2 site would result in altered nuclear-factor binding and RNA-polymerase-complex bindingand initiation of transcription from this promoter. Fromthis model, one would predict that the most dramaticdecrease in BLK expression in immature B cells wouldoccur when risk alleles were found at both the P1 and P2sites and that this would result in increased risk for devel-oping lupus.

Information accumulated from this and other studies isbeginning to shape our overall understanding of how var-iations in BLK transcription expression and BLK proteinlevels contribute to development and/or progression oflupus.

<small>2,3,12,32</small>

The emerging picture suggests that the varia-tion of BLK expression is likely to result in varyingfunctional consequences at different stages of B cell devel-opment and in different cell types (Figure 4B). Reductionin BLK expression by risk haplotypes could directly affectB lymphocyte development and/or impair functional re-sponses in B cells early in development. Indeed, severalpreviously published results indicate that the knockoutof one allele of Blk leads to increased splenic marginalzone and peritoneal B1 B cells in older mice,

<sup>3</sup>

suggestinga regulatory role for BLK. Because BLK is capable of inter-acting with both pre-B cell receptors and mature B cellreceptors, it could play a critical role in regulating B cellselection and immune responses. Recently, BLK has alsobeen shown to enhance BANK1 (MIM 610292) andPLCg1 (MIM 172420) interactions upon BCR activationto modulate B cell responses.

<small>33</small>

Other lupus-associatedrisk alleles in coding SNPs of BLK have been shown toresult in reduced BLK protein stability.

<small>10</small>

In addition, BLKdeficiency can impair early T cell development as well asthe development of IL-17-producing gd T cells.

<sup>2</sup>

Althoughthere has been a suggestion that BLK is also an importantsignal transduction molecule in plasmacytoid dendritic

The American Journal of Human Genetics94, 586–598, April 3, 2014 595

</div>

×