| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
(Hypertension. 2006;48:105.)
© 2006 American Heart Association, Inc.
Original Articles |
From the Clinical Pharmacology and Barts and the London Genome Centre (P.B.M., C.W., M-Z.X., A.C.M., R.J.D., A.K.P., B.B., J.G., S.J.N., J.P., C.A.M., M.C.), William Harvey Research Institute Barts and the London School of Medicine, Charterhouse Square, London, United Kingdom; Clinical Pharmacology and the Cambridge Institute of Medical Research (M.B., D.C.), University of Cambridge, Addenbrookes Hospital, Cambridge, United Kingdom; BHF Glasgow Cardiovascular Research Centre, Division of Cardiovascular and Medical Sciences (A.F.D., J.C.), Department of Cardiovascular Medicine, University of Glasgow, Western Infirmary, Glasgow, United Kingdom; Department of Cardiovascular Sciences (N.J.S.), University of Leicester, Glenfield Hospital, Leicester, United Kingdom; Centre National de Genotype (M.L.), Evry, France; Medicine and Therapeutics (J.W.), Aberdeen Royal Infirmary, Aberdeen, United Kingdom; University of Oxford (M.F.), Wellcome Trust Centre for Human Genetics, Oxford, United Kingdom.
Correspondence to Mark Caulfield, Centre for Clinical Pharmacology, William Harvey Research Institute, Barts and The London, Queen Marys School of Medicine and Dentistry, John Vane Science Centre, Charterhouse Square, London, EC1M 6BQ UK. E-mail m.j.caulfield{at}qmul.ac.uk
| Abstract |
|---|
|
|
|---|
Key Words: genetics hypertension, essential polymorphism
| Introduction |
|---|
|
|
|---|
A "first pass" genome scan from the BRItish Genetics of HyperTension (BRIGHT) Study in 2010 severely affected, white European sib pairs indicated 4 regions that attained genome-wide significance using a locus counting method.7 In any such experiment, the results of an initial sparse genome scan (markers every 10 cM) requires further validation by placement of an additional dense "grid" of genetic markers across the region of interest to assess support for linkage (markers every 2 cM for the most promising signals or 5 cM for regions with lesser support).8 This process, known as "grid tightening," is a key stage in defining linkage peaks worthy of detailed interrogation.8 In the event that additional support for linkage accrues, it would be ideal to replicate these findings in a second separate resource with the same white European ancestry and phenotyped using the same strategy. The BRIGHT Study is able to do this in 712 parentoffspring trios, where the proband meets the same phenotypic criteria and has the same ancestry as our affected sib pairs.9 This distinct family-based resource offers the opportunity to corroborate findings from the affected sib pairs by using a family-based test of linkage and association known as the Transmission Disequilibrium Test (TDT).10 The BRIGHT consortium now reports results from the grid tightening analysis in 2142 hypertensive sib pairs and follow-up of a linked region in 712 distinct TDT trios.
| Methods |
|---|
|
|
|---|
2 severely affected hypertensive sibs. Both affected sibs had onset of hypertension before 60 years of age and blood pressure (BP) recordings of 150/100 mm Hg using 1 reading or 145/95 mm Hg, the mean of 3 readings, while seated. BPs were recorded with the Omron HEM-705CP portable semiautomated oscillometric device (Omron Healthcare). Individuals who were diabetic, had intrinsic renal disease, and those with a history of secondary hypertension or coexisting illness were excluded from the study. As part of the BRIGHT Study strategy to characterize complimentary resources for replication and association studies, we have recruited 712 distinct hypertensive families of white European ancestry (ascertained as for affected sib pairs to the level of grandparental ancestry) to allow transmission disequilibrium testing.10 These families consist of 371 standard trios (affected proband and both parents) and 341 trios of single parent, affected proband, and a sibling. The ascertainment and phenotyping of the proband was the same as for the sib-pair recruitment.9 Ethics committee approval was obtained from multi-research and local research committees of all of the participating institutes for both family collections, and all of the participants gave written informed consent. Further details of the phenotyping, data quality assurance, and the biochemical variables collected for both family collections are available online at http://www.brightstudy.ac.uk.
Genotyping
The BRIGHT investigators have reported previously a 10 cM genome scan using Linkage Marker set MD10 (Applied Biosystems).7 Thirty-four additional microsatellite markers were genotyped in the sib-pair resource to tighten the grid on regions of interest with the highest multipoint or single point logarithm of odds (LOD) scores. Markers spaced, on average, every 2 cM were genotyped in primary regions of interest from the first-pass genome screen on chromosomes 2 (D2S2275, D2S2241, D2S2299, D2S306, D2S354, D2S2345, and D2S2284), 5 (D5S398, D5S427, D5S1359, D5S2019, D5S1988, D5S1977, D5S646, and D5S672), 6 (D6S503 and D6S2522), and 9 (D9S1881, D9S1821, D9S1795, D9S1863, D9S1818, and D9S312). In addition, markers spaced every 5 cM were genotyped on chromosome 8p (D8S1706), chromosome 11q (D11S1974), chromosome 13q (D13S1812), chromosome 15q (D15S986, D15S1019, D15S1023, D15S979, D15S652, D15S657, and D15S212), and chromosome 21q (D21S1256). These regions were selected for further analysis if LOD scores were either >1.0 in the first-pass genome scan using Cooperative Human Linkage Center (CHLC) or deCODE Genetics maps (http://www.decode.com) or if quality control checks had revealed that additional markers were required. In our primary screen of chromosome 5, we had not maximized data capture for all of the markers; additional genotyping data are provided here so that a comprehensive analysis is presented. The majority of markers were selected from the Sequence Tag Site database at the National Centre of Biotechnology Information (http://www.ncbi.nlm.nih.gov/dbSTS/).
Genotype Quality Assurance
After genotyping we performed extensive additional quality control checks on all of the markers (the original 400 plus 34 new markers). Allele frequency distributions were compared between phases of the project to detect differences in allele binning over time. Furthermore, we identified alleles outside of the expected range and tested for deviations from Mendelian inheritance (using the Pedcheck program) and from HardyWeinberg equilibrium.11
A further novel approach that we took was to search for "sudden death" (or "birth") markers, which is analogous to identification of outliers that have excessive leverage on results in a regression analysis. We reanalyzed the data for each chromosome, in turn dropping the data for a single marker to establish any markers that exert a large influence on the LOD score. The rationale being that identity by descent estimates should change smoothly along the chromosome and that a single marker with a large influence on the multipoint LOD score could suggest either genotyping or map errors.
For any markers highlighted by these tests, the raw genotype data were re-examined manually to identify errors in genotype calling. Any markers exhibiting unequivocal evidence of genotyping error that could be resolved were rescored; those that showed clear errors but could not reliably be scored were removed from further analysis. Once these additional quality checks were completed, kinship analysis was repeated to identify any misreported relationships.11
Statistical Analysis of Affected Sib Pairs
As for our primary genome screen, we estimated allele frequencies by maximum likelihood using the SPLINK computer program, version 1.9.8 We used marker map positions given in the Rutgers combined linkage-physical map and estimated by identity by descent probabilities for all of the affected full- and half-sib pairs using Merlin.12,13 From these, we computed single and multipoint LOD scores using MLSix, which maximizes the likelihood of the observed data as a function of the sib recurrence risk ratio,
S. We computed confidence intervals for the associated
S estimates by bootstrap resampling of the identity by descent vectors.
To evaluate the genomewide significance of our previous results, we applied a locus-counting approach.14 This compares the number of distinct regions showing evidence for linkage at different thresholds to that expected if no hypertension locus existed. The latter is calculated by simulation using gene dropping, which preserves family structure and missing data patterns. Although some new genotype data have been included, we have used thresholds from our primary scan that remain broadly applicable to our current results.
Linkage and Association Analysis Using the Transmission Disequilibrium Test
To replicate our linkage findings, we genotyped 3 markers, which yielded the strongest evidence for linkage in 712 distinct TDT trios. When an allele is associated with disease, it is expected to be inherited more frequently by affected offspring than the alternate allele carried by a heterozygous parent. The TDT approach tests for preferential transmission of particular alleles to affected children from their parents.10 The untransmitted allele acts as a matched control, so this test is robust to problems caused by population stratification. We used TRANSMIT (version 2.5.4) to conduct the analysis, which allows for missing parental genotypes to be reconstructed from unaffected siblings of the affected proband.15 Significance was calculated by 2 methods. First, the score statistic was assumed to asymptotically follow a
2 distribution and, second, by comparison to its empirical distribution estimated using 10 000 bootstrap replicates. The
2 approximation may not hold for rare alleles, and so the bootstrap P values are favored in this instance.
| Results |
|---|
|
|
|---|
|
Grid Tightening Analysis in Affected Sib Pairs
The maximum LOD scores are presented in Figure 1. No chromosomal region reached genome-wide significance according to the multipoint LOD score. The maximum multipoint LOD score genomewide is 2.50 at marker D5S2019 on chromosome 5q13. This maximum LOD score reflects an increase from the LOD score of 1.85 found in the first pass genome scan and includes 3 neighboring markers with single point LOD scores >2.5, 1 of which was >3 (see Figure 2). However, the LOD score fell in the other 3 regions of primary interest from the first pass genome scan (see Table 2). LOD scores were augmented by additional data on chromosomes 1 and 11, both achieving multipoint LOD scores >1.5, with 2 markers under the chromosome 11 peak (D11S1974 and D11S4175) achieving single-point LOD scores >2. We present the "top 10" maximum multipoint LOD scores >1.1 and the associated estimate of the recurrence risk (
S) for all of the regions in Table 2.
|
|
|
No single locus achieves genomewide significance, and a locus-counting analysis of the linked regions with the largest scores also suggests that we do not reach a genomewide significance level of 5%. However, more regions achieve modest LODs than expected (eg, 9 exceed 1.16 when only 8 would be expected to reach this threshold), suggesting that there seem to be too many modest LOD scores for them all to be attributable to chance.
Effect of Different Marker Maps on LOD Scores
Genetic maps of the markers used in linkage analysis continue to improve in accuracy as markers are typed in more families16 and different maps are merged.12 Multipoint LOD scores can depend heavily on the marker map when parents are not available for genotyping, which is common in studies of late-onset diseases, such as hypertension. Results must, therefore, be interpreted against an evolving genetic marker map. Our initial study used the CHLC genetic map. Since then, further information has been used to create more precise genetic maps, and the current analysis was conducted under the more recently published Rutgers map, which incorporates sequence data to ensure the correct ordering of markers.12 We detected several differences between these maps. The most marked difference was in the telomeric region of 6q, where 2 markers originally placed 16 cM apart by CHLC were, in fact, in very close proximity and their positions reversed in Rutgers. When analyzed under the 2 maps, the evidence for linkage differed considerably (LOD=3.2, CHLC map; LOD=0.7, Rutgers map). Further exploration of the 6q locus with additional microsatellites reduced support for linkage.
Between the CHLC and Rutgers maps there were 3 further map order differences between pairs of markers used in our study (D1S2896 and D1SS206, D5S1977 and D5S424, and D19S894 and D19S209). Whereas the changes at the first 2 pairs of markers had a minimal effect on the LOD score, the maximum multipoint LOD score on chromosome 19 increased from 0.9 using the CHLC map to 1.4 using Rutgers.12 This highlights the need to recognize that genetic maps are still in evolution as marker positions are revalidated.
Analysis using the sudden death approach found 2 points of interest. The first was an inversion of a 12 cM region on chromosome 8 in the genetic marker map, which has since been published,17 and 1 marker, D5S1359, which led to a change in the maximum multipoint LOD score on chromosome 5 (LOD=2.12 with D5S1359 included; LOD=2.50 with D5S1359 excluded). This marker is not in the Rutgers map, and had been selected according to position at Southampton (http://cedar.genetics.soton.ac.uk/public_html/LDB2000/release.html), which we have not been able to further validate. Because of its uncertain position and effect on the LOD score, we removed it from further analysis.
Transmission Disequilibrium Testing
Of the 3 chromosome 5 markers displaying single-point LOD scores >3, which were genotyped in the TDT resource, D5S2019 showed borderline evidence for linkage and association (P<0.07; see Table 3).
|
| Discussion |
|---|
|
|
|---|
1 susceptibility gene arises from our borderline support for linkage and association of 1 of these markers in a complimentary but distinct parentoffspring trio resource. The use of multiallelic markers with higher degrees of freedom causes a marked loss of power compared with biallelic markers. Under these circumstances, this borderline result is very encouraging. TDT replication of linkage has been used to extend support for loci in a limited number of diverse disease states18,19; but this is the first such finding in hypertension. It is encouraging to note a recent genome scan of myocardial infarction and stroke that demonstrated modest support for linkage that was followed up using a similar strategy of genotyping microsatellites in casecontrol design. This approach led to identification of an associated haplotype of the 5 lipoxygenase gene.20 Our grid tightening results also indicate diminished support for loci on chromosomes 2q, 6q, and 9q. The locus at the telomere of 6q exhibited the strongest support for linkage in our first-pass genome scan based on marker positions derived from the Genethon and Marshfield Maps (CHLC map). All of the genomic maps, linkage and sequence based, have undergone revisions and improvements as new data have accumulated, and the CHLC map has now been superseded by deCODE and Rutgers map. The Rutgers maps fixes marker order on the basis of physical sequence data and includes a substantial increase in the number of informative meioses, which allows more precise estimates of intermarker genetic distances.12 As recombination increases toward the ends of chromosomes, we have always been cautious about the 6q linkage peak. Multipoint linkage analysis is very sensitive to intermarker distance and order when parents are unavailable. Therefore, evidence for linkage can be inflated by tightly juxtaposed markers being assigned an inaccurate position.21 Our observation that the order and position of several markers has changed with appreciable effects on the resulting linkage statistics suggests that other complex trait researchers should consider reanalyzing their genome scan data using the most accurate contemporary map.
Recent data evaluating the behavior of LOD scores during a grid tightening process implies that false-negative results, as well as false-positive results, can arise in these studies.22 In the BRIGHT Study, the contribution of multiple markers with positive single-point LOD scores and augmentation of the multipoint LOD score on chromosome 5q13 makes this unlikely to be a false-positive linkage produced by genotyping or map errors. In addition, the evidence of simultaneous borderline support for association in our TDT resource increases the chance of it being a true positive locus.
The chromosome 5q13 interval (D5S398-D5S672) spans &22 Mb of DNA, and there is no other genome scan with significant linkage to this region; however, there is 1 report of suggestive linkage with BP in Mexican Americans.23 Human chromosome 5q13 has synteny with rat chromosome 2 (23 to 41 Mb) and mouse chromosome 13 (90 to 170 Mb; http://nar.oupjournals.org/cgi/content/full/30/1/38). A number of BP quantitative trait loci have been mapped to rat chromosome 2. Although a congenic mapping approach has been used to narrow some of these regions, none have been shown to be syntenic with human 5q13.24,25 Bioinformatic analysis of the region reveals: 131 reference sequence genes, 1001 messenger ribonucleic acids, 34 386 expressed sequence tags, and nearly 7000 HapMap (http://www.hapmapg.org) single nucleotide polymorphisms ([SNPs] http://genome.ucsc.edu/ and http://www.hapmap.org). There are several potential candidate genes within this interval (phosphatidylinositol 3-kinase regulatory
subunit, 3-hydroxy-3-methylglutaryl-coenzyme A reductase, and 5-hydroxytryptamine 1A receptor), which could be investigated in casecontrol studies using SNPs, which tag the combined genotypes or haplotypes.26,27 Although this is a reasonable strategy, it is somewhat constrained by our current understanding of the biological plausibility of candidates. A complementary strategy that we will use is the interrogation of the entire interval under our linkage peak with a dense group of tag SNPs derived from the evolving public domain HapMap data set that enable definition of the genetic architecture of this locus in a casecontrol design.26,27 For the other loci where LOD scores have diminished, we cannot make a compelling case for extensive survey using dense SNP maps of these intervals.
Perspectives
Despite many genome scans and candidate gene studies, the genetic factors underpinning hypertension remain elusive. In 2003, we reported linkage between loci on chromosomes 2, 5, 6, and 9, and hypertension in the BRIGHT Study affected sib-pair resource. After analysis of further families and markers, we now present enhanced support for a locus on chromosome 5q13, which achieves genomewide significance according to the single-point LOD score. Additional credibility comes from the rise in the multipoint LOD score with increasing information content in the region and borderline significant evidence for replication from our distinct resource of parentoffspring trios. These data provide an important step forward in the identification of a hypertension susceptibility locus in the BRIGHT Study. We are undertaking detailed fine mapping of this locus using >3000 SNPs selected from HapMap and which we are genotyping in 1600 hypertensive cases and normotensive controls to expedite localization and gene discovery. We also expect further loci of interest to arise from genegene interaction studies and analysis of covariate phenotypes within the sibpair resource.
| Acknowledgments |
|---|
Sources of Funding
This study was supported by UK Medical Research Council (grants G921010 and G000666), the British Heart Foundation (grant PG/02/128), and the British Hypertension Society. C.W. and M.-Z.X. were supported by grants from St Barts and The Royal London Charitable Foundation Research Advisory Board (RAB03/PJ/01 and XMNL, respectively).
Disclosures
None.
Received March 4, 2006; first decision April 4, 2006; accepted May 12, 2006.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
P.-a. B. Shih and D. T. O'Connor Hereditary Determinants of Human Hypertension: Strategies in the Setting of Genetic Complexity Hypertension, June 1, 2008; 51(6): 1456 - 1464. [Full Text] [PDF] |
||||
![]() |
M. Tomaszewski, F. J. Charchar, M. D. Lynch, S. Padmanabhan, W. Y.S. Wang, W. H. Miller, W. Grzeszczak, C. Maric, E. Zukowska-Szczechowska, and A. F. Dominiczak Fibroblast Growth Factor 1 Gene and Hypertension: From the Quantitative Trait Locus to Positional Analysis Circulation, October 23, 2007; 116(17): 1915 - 1924. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Y. Deng Genetic basis of polygenic hypertension Hum. Mol. Genet., October 15, 2007; 16(R2): R195 - R202. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Hypertension Home | Subscriptions | Archives | Feedback | Authors | Help | AHA Journals Home | Search Copyright © 2006 American Heart Association, Inc. All rights reserved. Unauthorized use prohibited. |