doi:10.1086/520769


626 The American Journal of Human Genetics Volume 81 September 2007 www.ajhg.org

REPORT

Recent Genetic Selection in the Ancestral Admixture
of Puerto Ricans
Hua Tang, Shweta Choudhry, Rui Mei, Martin Morgan, William Rodriguez-Cintron,
Esteban González Burchard, and Neil J. Risch

Recent studies have used dense markers to examine the human genome in ancestrally homogeneous populations for
hallmarks of selection. No genomewide studies have focused on recently admixed groups—populations that have ex-
perienced admixing among continentally divided ancestral populations within the past 200–500 years. New World
admixed populations are unique in that they represent the sudden confluence of geographically diverged genomes with
novel environmental challenges. Here, we present a novel approach for studying selection by examining the genomewide
distribution of ancestry in the genetically admixed Puerto Ricans. We find strong statistical evidence of recent selection
in three chromosomal regions, including the human leukocyte antigen region on chromosome 6p, chromosome 8q, and
chromosome 11q. Two of these regions harbor genes for olfactory receptors. Interestingly, all three regions exhibit
deficiencies in the European-ancestry proportion.

From the Department of Genetics, Stanford University School of Medicine, Stanford (H.T.); Departments of Biopharmaceutical Sciences and Medicine
(S.C.; E.G.B.) and Epidemiology and Biostatistics (N.J.R.) and Institute for Human Genetics (S.C.; E.G.B.; N.J.R.), University of California–San Francisco,
San Francisco; Affymetrix, Santa Clara, CA (R.M.); Fred Hutchinson Cancer Research Center, Seattle (M.M.); Veterans Caribbean Health Care System
(W.R.-C.) and University of Puerto Rico School of Medicine (W.R.-C.), San Juan; and Division of Research, Kaiser Permanente, Oakland (N.J.R.)

Received March 8, 2007; accepted for publication May 23, 2007; electronically published August 1, 2007.
Address for correspondence and reprints: Dr. Neil J. Risch, Institute for Human Genetics, University of California, 513 Parnassus Avenue, S965, San

Francisco, CA 94143. E-mail: RischN@humgen.ucsf.edu
Am. J. Hum. Genet. 2007;81:626–633. � 2007 by The American Society of Human Genetics. All rights reserved. 0002-9297/2007/8103-0020$15.00
DOI: 10.1086/520769

Whereas the vast majority of DNA sequence–level varia-
tion evolves neutrally and is maintained by a balance be-
tween the mutation process and random drift,1 some parts
of the human genome have been subjected to natural se-
lection. With use of dense genotype data on human pop-
ulations, recent studies have identified regions that appear
to have undergone positive selection.2–5 These results pro-
vide information about population history, as well as clues
for the identification of genes with important biological
functions.

Admixed populations offer special opportunities for
studying recent selection. Before admixing, the ancestral
populations had typically been isolated by distance, and
their genomes had evolved in distinct environments. In
contrast, in admixed populations of the Americas, the en-
vironmental changes have been sudden. For example, His-
panic populations resulted from recent admixing between
genomes from the Old World (Africans and Europeans)
and those from the New World (Native Americans). In-
tercontinental migration brought genomes of some an-
cestral populations to an unfamiliar environment in the
New World and, at the same time, disturbed the local
environment, introducing life-style changes as well as
pathogens, such as smallpox, from the Old World. Thus,
genomes from each ancestral population were presented
with new challenges. This kind of selection pressure may
be quite different from that faced by stationary popula-
tions, in which the local environmental changes may oc-
cur gradually, allowing for rare advantageous alleles to in-
crease in frequency. At the same time, it is possible that

a region (or regions) of the genome has been dispropor-
tionately included in the newly admixed population be-
cause of selection. Therefore, in theory, it is possible, with
use of dense genotype data, to detect a signature of selec-
tion by looking in an admixed population for genomic
regions that exhibit unusually large deviations in ancestry
proportions compared with what is typically observed
elsewhere in the genome.

The idea of detecting signatures of selection in an ad-
mixed population has a considerable history and has been
explored in various studies, which compare admixture es-
timates at a small number of loci.6–8 Workman et al.6 ex-
amined 12 genetic markers in a sample of 1,287 African
Americans from Claxton, Georgia. The average European
admixture proportion at most markers was estimated
to be ∼.10. A few markers, however, exhibited higher
European ancestry; those included the loci for G6PD, he-
moglobin, haptoglobin, and phenylthiocarbamide taste
blindness. Those authors concluded that the markers had
been subject to natural selection. However, evolutionary
fluctuations due to both small population size (genetic
drift) and sampling variability can also generate differ-
ences in ancestry proportions among loci.9 Subsequent
reanalysis of these and related data argued that the ob-
served variation in European admixture among markers
can be attributed largely to evolutionary and statistical
noise.9,10 In a rigorous analysis, Long9 showed that genetic
drift alone could generate an interlocus ancestry distri-
bution with SD of .044 in the Claxton population. Fur-
thermore, because these early studies estimated ancestry


www.ajhg.org The American Journal of Human Genetics Volume 81 September 2007 627

Figure 1. Genomewide variation of ancestry in Puerto Ricans. The X-axis denotes the physical location of SNPs; the Y-axis indicates
the excess/deficiency in ancestry at the corresponding SNP, averaged for 192 Puerto Ricans. The red, blue, and green curves represent
African, European, and Native American ancestries, respectively.

proportions on the basis of a single marker at each locus,
the ancestry estimates were sensitive to bias because of
erroneous estimates of allele frequencies in the ancestral
populations.

The recent development of high-density SNP markers
now allows for accurate ancestry estimation on a global,
genomewide scale.11 This allows us to revisit the question
of ancestral selection in admixed populations. To do so,
we employ a recently proposed statistical approach for
locus-by-locus ancestry estimation on a genomewide
scale.11

The Puerto Rican population arose as a result of admix-
ing among Europeans, West Africans, and Native Ameri-
cans. The indigenous people on the island were the Taı́no
Indians, who came from South America. In November
1493, Christopher Columbus discovered the island on his
second voyage to the New World. In 1513, Africans were
introduced to the island as slaves.12 Today, most Puerto
Rican individuals have genomes derived from all three
ancestral populations. The average genomewide individ-
ual ancestry proportions have been estimated as .66, .18,
and .16, for European, West African, and Native American,
respectively.13 Under neutral evolution, we expect each
individual’s genome to represent an ensemble of ancestry
blocks, randomly sampled, with a probability similar to
that of the genomewide average. Therefore, at each ge-
nomic location, the regional ancestry proportions are also

expected to follow the same probability distribution. Here,
we examine the distributions of these three ancestry pro-
portions across the genome in a sample of 192 Puerto
Rican individuals, in a search for outlier regions.

The 192 Puerto Ricans were recruited from six primary-
care clinics in Puerto Rico as part of the Genetics of
Asthma in Latino Americans study.14 All study participants
were aged 8–40 years. Individuals of European (from the
United States), West African (from Nigeria), and Native
American (Pima and Mayan) descent were included, to
approximate the ancestral populations. With use of the
Affymetrix Human Mapping 100K GeneChip set,15 192
Puerto Ricans, 42 Europeans, 37 Africans, and 30 Native
Americans were genotyped. Details of data-quality assess-
ment and statistical analysis are provided in appendix A.

Under the assumption of a trihybrid population model
and of 112,584 SNPs, marker-specific ancestries were es-
timated, with the program SABER,11 for all autosomes for
each individual. Averaged over 192 Puerto Ricans, the ge-
nomewide mean estimated European ancestry was .67, Af-
rican ancestry was .18, and Native American ancestry was
.15.18 For each individual, we then computed D ancestry
by subtracting the genomewide ancestry from locus-spe-
cific ancestry for each of the three ancestry components.
We then calculated average locus-specific D ancestry val-
ues for the 192 subjects.

Figure 1 shows the average excess ancestry at each SNP


628 The American Journal of Human Genetics Volume 81 September 2007 www.ajhg.org

Figure 2. Comparison of observed (histogram) and simulated (line) variation in ancestry across the genome. A population model was
assumed in which, for the first 5 generations, Europeans and Native Americans admix at a ratio of 0.82:0.18, with a total population
size of 1,000. At generation 5, Africans enter the gene pool, bringing N to 1,250. This trihybrid population is then allowed to mate
randomly for 10 generations at constant population size. The density curve for each population is based on 107 independent simulations.

location for the 192 Puerto Rican individuals, with the
genomewide ancestry as the baseline reference. At most
locations, deviations are within the range of evolutionary
drift and statistical fluctuation. However, a few regions
exhibit extreme fluctuations that are unlikely to have oc-
curred by chance. Most prominently, on chromosome
6p21-6p22, an excess of African ancestry reaches .14
around SNP rs169679; this level of excess was not observed
in 20,000 permutations (see appendix A) and represents
6.4 SD from the mean of the distribution. Correspond-
ingly, the European ancestry at this location is depressed
by .14, whereas Native American ancestry remains un-
changed. Additionally, chromosome 8 (peak at SNP
rs896760) and chromosome 11 (peak at SNP rs637249)
show an excess of .13 in Native American ancestry and
a deficit of European ancestry ( for each by per-P ! .001
mutation test). This level of excess in Native American
ancestry represents 4.6 SD from the mean of the
distribution.

Figure 2 compares the distribution of estimated locus-
specific ancestry with that simulated under genetic drift

(see appendix A for population parameters). The histo-
grams indicate that, under the assumption of a reasonable
population-history model, the distribution of the locus-
specific ancestry matches well with the observed distri-
bution except for the few outliers noted (chromosomes
6p, 8q, and 11q). There is modest skewness in these dis-
tributions, but it is insufficient to account for the outliers
observed in any of the three ancestry distributions. No-
ticeably, in 107 simulations, the African ancestry deviation
exceeded .14 (in either direction) once. The Native Amer-
ican ancestry deviation exceeded .13 35 times in 107 sim-
ulations. Because ancestry blocks extend, on average, sev-
eral centimorgans,19 these results suggest that it is unlikely
that the outliers we observed are due to genetic drift, even
allowing for testing of multiple chromosomal regions.

In previous studies, locus-specific ancestry was esti-
mated using a single marker at each locus; however, the
high-density genotyping platform we used allows for in-
ference based on multiple neighboring SNPs. Since most
SNPs in the genome are not completely informative, com-
bining information at neighboring SNPs produces more-


www.ajhg.org The American Journal of Human Genetics Volume 81 September 2007 629

Figure 3. Extreme variations of ancestry on chromosomes 6 (a), 8 (b), and 11 (c). SNPs were divided into two sets (even-numbered
vs. odd-numbered markers), which were analyzed separately with use of SABER. The red and green points are the excess African and
Native American ancestries, respectively, on the two marker subsets, whereas the black points are the results with use of all available
markers on that chromosome.

accurate estimates than does using single markers. Fur-
thermore, an ancestry estimate based on a single marker
is sensitive to mutation or misspecification of ancestral-
allele frequencies. In contrast, under evolutionary neu-
trality, it is unlikely that these factors affect several neigh-

boring markers in a way that leads to biased ancestry
estimates. Thus, the estimation approach we use provides
some safeguards against inaccurate ancestral-allele fre-
quencies.11,18 Similarly, our method of estimating locus-
specific ancestry is more robust to systematic genotyp-


630 The American Journal of Human Genetics Volume 81 September 2007 www.ajhg.org

Table 1. Maximum-Likelihood Estimates of
Ancestry Proportions in Puerto Ricans

Marker Region European African
Native

American

HLA-A .51 .32 .17
HLA-B .59 .28 .13
HLA-DRB .55 .24 .21
HLA-DQB .43 .37 .20
Combined (HLA) .51 .30 .20
Genomewide .76 .17 .07

NOTE.—The top five rows are estimates for the MHC region
on chromosome 6; the last row is the genomewide average
ancestry estimated from 15 microsatellites.13

Figure 4. Display of estimated ancestry on chromosome 6. Each pair of horizontal strips represents an (unphased) individual, with
the vertical height proportional to the marker-specific ancestry estimates of African (red), European (blue), and Native American (green)
ancestries. Left, Ancestry estimates around the peak on 6p21-22. Right, Randomly selected region of chromosome 6.

ing error or deviation from Hardy-Weinberg equilibrium
(HWE) at a single marker.

We investigated more closely the three chromosome
regions that achieved statistical significance. To exclude
the possibility that the excess in ancestry is related to the
inclusion of individuals with asthma, we examined the
asthmatic cases ( ) and controls ( ) separately,n p 96 n p 96
and we found that the excess of African (or Native Amer-
ican) ancestry and the deficiency of European ancestry
occurred identically in both groups. To eliminate the
possibility that the peaks are an artifact due to a few outli-
er markers, we repeated the analysis separately with the
even-numbered and odd-numbered markers. The result-
ing excess ancestry along chromosomes 6, 8, and 11 is
shown in figure 3; both marker sets show comparable ex-
cess ancestry in all three regions. Figure 4 illustrates that
the estimated African ancestry (shown in red) is indeed

higher around chromosome 6p21-6p22 by comparing the
estimated ancestries in two regions on chromosome 6: on
the left is the 10-Mb window around the peak of excess
African ancestry on chromosome 6p21-6p22, whereas on


www.ajhg.org The American Journal of Human Genetics Volume 81 September 2007 631

the right is a randomly selected region of the same phys-
ical width. The areas that are colored blue, red, and green
correspond to the estimated European, African and Native
American ancestries, respectively.

The region on chromosome 6p harbors numerous
genes; some of the most studied are those of the major
histocompatibility complex (MHC). Since the human leu-
kocyte antigen (HLA) loci have been well studied in a
variety of populations, we sought independent confir-
matory evidence using existing published HLA data. We
compiled allele frequencies for Puerto Ricans, West Eu-
ropeans, sub-Saharan Africans, and Native Amerindians
for four HLA loci (HLA-A, -B, -DRB, and -DQB) from lit-
erature sources16 and the HLA Frequency database from
the American Society for Histocompatibility and Immu-
nogenetics (ASHI). The data were used to determine the
maximum-likelihood estimates of Puerto Rican ancestries
at the HLA loci (table 1). For all four HLA loci examined
separately and combined, ancestry proportions are similar
to our previous estimates for the corresponding region,
with elevated African ancestral proportion and decreased
European ancestral proportion, in particular, compared
with genomewide ancestry estimates obtained in another
study of Puerto Ricans that used 15 microsatellite markers
randomly distributed across the genome.17

The region underneath our observed peak on chromo-
some 6p22 includes, in addition to HLA, an olfactory gene
cluster. Similarly, the peak on chromosome 11q11 also
harbors an olfactory gene cluster, which has been iden-
tified elsewhere as a target of ongoing positive selection.4,20

The peak on 8q23.3 harbors less obvious candidates. There
is only one gene in the region, CSMD3 (CUB and sushi
multiple domains 3), of which the highest expression is
found in fetal brain and intermediate expression is found
in adult pancreas, spleen, testis, spinal cord, and all spe-
cific adult brain regions examined. Little to no expression
was detected in the other tissues examined.21

Admixed populations offer unique opportunities for the
detection of signatures of selection by examination of the
genomewide distribution of ancestry. It is reasonable to
assume that selection occurred in recent times, subsequent
to the intercontinental migration. Because the peaks are
quite broad, it is difficult to identify the precise gene(s)
that has been the target of selection, nor can we state with
confidence the nature of the selective agent. However, we
can postulate two scenarios that led to apparent regional
admixture bias. First, the Africans and Europeans were ex-
posed to an unfamiliar environment, with various new
pathogens that are native to the New World. Because they
are genetically more diverse, some African-admixed alleles
might have conferred an advantageous phenotype. The
European-derived alleles are less diverse and thus may
have been less advantageous. This would have led to an
excess of either African admixture (in the case of chro-
mosome 6) or Native American admixture (in the case of
chromosomes 8 and 11) and a corresponding deficiency
in European admixture. Alternatively, it is possible that

the massive intercontinental migration altered environ-
mental conditions in the New World sufficiently that all
three ancestral populations were challenged. Again, spe-
cific African- or Native American–derived alleles rose in
frequency.

A second possibility is that various genes contributed
to specific phenotypes, which have been influenced either
by natural selection or by mating preferences. In the latter
case, the admixture bias may reflect some type of social
selection. In terms of the target of selection, it is intriguing
that two regions exhibiting the strongest selection signals
both harbor olfactory genes. This is consistent with the
findings of Voight et al.4—namely, that genes related to
olfaction are overrepresented in the regions under recent
positive selection. But olfactory genes may not be the
sole target. The region on chromosome 6p harbors various
other genes; noticeably, some HLA loci are !2 Mb away
from the peak. Because of their known role in im-
mune defense, the HLA loci are certainly also strong
candidates.22

It is also important to consider that the genetic inter-
mixing that has occurred in this population over the past
5 centuries has brought together new combinations of
diverged genomes and therefore created a novel genetic
background on which selection may have operated. In this
regard, as opposed to nonadmixed groups, admixed pop-
ulations offer novel and complementary information
about complex interactions involving both genes and en-
vironment. Further investigations are required to reveal
the targets and agents of selection that have played im-
portant roles in shaping the admixed gene pool.

Acknowledgments

We acknowledge the families and the patients for their partici-
pation. We thank Dr. Mark Shriver for contributing samples of
European, African, and Native American descent, and we thank
Dr. Eduardo Santiago-Delpin for providing the HLA frequencies
in Puerto Ricans. This work is supported by National Institute of
General Medical Sciences grants GM073059 (to H.T.) and
HL078885 (to E.G.B.), American Thoracic Society “Breakthrough
Opportunities in Lung Disease” Award (to S.C.), the Sandler Cen-
ter for Basic Research in Asthma, and the Sandler Family Sup-
porting Foundation.

Appendix A
Genotyping and Assessing the Data Quality

A total of 192 Puerto Ricans, 42 Europeans, 37 Africans,
and 30 Native Americans were genotyped with the Affy-
metrix Human Mapping 100K GeneChip set,15 which
comprises two arrays, each capable of genotyping 150,000
SNPs (58,960 for XbaI array and 57,244 for HindIII array).
Approximately 250 ng of genomic DNA was processed for
each chip, in accordance with the Affymetrix protocol.

In all analyses reported in this work, only autosomal
SNPs were included. For quality control, SNPs with call


632 The American Journal of Human Genetics Volume 81 September 2007 www.ajhg.org

rates !.85 were eliminated from the analysis. To further
eliminate SNPs with possible genotyping errors, we ex-
cluded SNPs with only heterozygous calls. We chose not
to use HWE as a quality-control criterion, because SNPs
with large allele-frequency differences between major ra-
cial/ethnic groups would tend to deviate from HWE if
there is nonrandom mating with respect to ancestry in
this population.23 These exclusions left us with 112,984
autosomal SNPs for analysis. Among the ancestral sam-
ples, two Africans have allele sharing comparable to a sib
pair, as do two Native Americans. In the subsequent anal-
yses, we excluded one of the individuals from each pair.

Statistical Analysis

Marker-specific ancestry estimates were produced using
the program SABER.11 We assumed a trihybrid population
model and used the genotypes of the European, African,
and Native American subjects to estimate the ancestral-
allele frequencies, as well as the two-marker haplotype
frequencies within each ancestral population. In this anal-
ysis, we excluded 400 SNPs, which were monomorphic in
the entire sample. The ancestral-allele frequencies were
further refined using the program FRAPPE, which esti-
mates the ancestral-allele frequencies with use of both the
ancestral and the admixed individuals.18 The starting val-
ues of genomewide average individual admixture fractions
(IA), which are required input parameters in SABER, were
also estimated using FRAPPE. The SABER output was used
to re-estimate IA. We then reran SABER with updated IA
estimates. We observed that new IA estimates from SABER
did not change much in subsequent iterations. Both
FRAPPE and SABER are available at the Tang Lab Web site.

Once the locus-specific ancestry was estimated, we com-
puted the “D ancestry,” deficiency or excess at each locus,
using the genomewide IA as a baseline. Specifically, the D
ancestry for ancestral population k at marker m is defined
as

I
1

m i,m i m˜¯ ¯d p (q � q ) p q � q ,�k k k k kI ip1
where is the locus-specific ancestry of individual i ati,mqk
marker m, estimated from SABER; is the IA of individualiq̄k
i; is the mean ancestry at marker m averaged over allmq̃k
individuals; and is the proportion from ancestral pop-q̄k
ulation k, averaged over all individuals and the entire
genome.

As shown by Long,9 genetic drift can create intermarker
variation in ancestry proportions. Therefore, we per-
formed a simulation to evaluate its potential impact on
our analyses. We assumed a simple population model: at
time 0, Europeans and Native Americans admix at a ratio
of 0.82:0.18. The population size is kept at forN p 1,000

generations. At , African individuals enter thet p 5 t p 5
gene pool, bringing N to 1,250, achieving ancestry pro-
portions similar to those estimated in the Puerto Ricans

(i.e., .18), and reducing the European and Native American
proportions to .67 and .15, respectively. This trihybrid
population is then allowed to mate randomly for 10 ad-
ditional generations at the constant population size. Each
simulation generates the evolutionary history of one locus
independently, with no linkage. The parameters of the
population model were chosen on the basis of our un-
derstanding of the history of this population and to match
the resulting locus-specific–ancestry distribution with the
corresponding empirical distributions. Whereas 15 gen-
erations of admixing time may seem short (when the fact
that Europeans arrived on the island 1500 years ago is
considered), we note that the effect of genetic drift di-
minishes as the size of the population grows, as has been
the case in Puerto Rico for the past several hundred years.

We note that Kimura analytically studied the question
of allele-frequency distributions over time under drift.24

He showed that the derived distribution is a complex func-
tion of , where t is the number of generations and Nt/2N
is the effective population size. Although his results would
be directly relevant to the ancestry distribution in a dihy-
drid population over time, they are not directly applicable
to our more complex scenario of three populations ad-
mixing at different time points, with the exception of Af-
rican ancestry, which we assumed entered at a single time
point 10 generations ago, when the population size was
1,250. Results with use of his analytic formulas closely
mirror our simulation results.

Since the evolutionary simulation does not account for
statistical sampling error and error associated with esti-
mating ancestry, we performed a permutation test. In each
individual, we randomly concatenated the ancestry esti-
mates of 22 autosomes. This linear, long piece of per-
mutated genome was circularized and was cut at a location
chosen uniformly. A permuted sample was formed by re-
peating this process for each individual, with the order of
concatenation and the location of linearization chosen
independently in each individual. We then computed the
excess/deficiency in ancestry in each permuted sample
and recorded the extreme values (maximum and mini-
mum) achieved with respect to European, African, and
Native American components. When markers are evenly
spaced and linkage disequilibrium between neighboring
markers is similar, an advantage of this permutation pro-
cedure is that it preserves the correlation structure be-
tween markers. In our data, this assumption does not hold
exactly. Further, the estimation in ancestry with use of the
same set of ancestral individuals induces a correlation at
each site, which is not preserved in the permutation pro-
cedure. To cope with these complexities, we further added
a quantile transformation step—specifically, in each per-
mutation, we multiplied the permuted statistics by a scale
factor, so that the SD of the distribution of the permuted
statistics (trimming .05 on each end) matched that in the
corresponding observed distribution. In total, 20,000 per-
mutations were performed, and the maximum absolute
deviations (excess or deficiency) from these permuted ge-


www.ajhg.org The American Journal of Human Genetics Volume 81 September 2007 633

nomes were treated as the null distribution, with which
we computed the significance levels of the observed an-
cestry deviation.

One region of potential interest was the HLA region on
chromosome 6p. To estimate the ancestry proportion in
Puerto Ricans at the HLA loci, we used published and pub-
licly available data. For Puerto Ricans, the gene frequencies
at HLA-A, -B, -DRB, and -DQB were taken from 461 in-
dividuals who were genotyped during 1997 and 1998.16

The corresponding gene frequencies in non-Hispanic Eu-
ropeans, sub-Saharan Americans, and Native Americans
were compiled using the HLA Frequency database from
ASHI. Population-level ancestry proportions were then es-
timated using maximum likelihood. For the genomewide
estimate of admixture in Puerto Ricans, we used the data
of Zuniga et al.,17 who studied a random collection of 15
microsatellites.

Web Resources

The URLs for data presented herein are as follows:

ASHI, http://www.ashi-hla.org/ (for the HLA Frequency database)
Tang Lab Web site, http://www.fhcrc.org/labs/tang (for FRAPPE

and SABER software)

References

1. Kimura M (2003) The neutral theory of molecular evolution.
Cambridge University Press, Cambridge, United Kingdom

2. Sabeti PC, Reich DE, Higgins JM, Levine HZ, Richter DJ,
Schaffner SF, Gabriel SB, Platko JV, Patterson NJ, McDonald
GJ, et al (2002) Detecting recent positive selection in the hu-
man genome from haplotype structure. Nature 419:832–837

3. Carlson CS, Thomas DJ, Eberle MA, Swanson JE, Livingston
RJ, Rieder MJ, Nickerson DA (2005) Genomic regions exhib-
iting positive selection identified from dense genotype data.
Genome Res 15:1553–1565

4. Voight BF, Kudaravalli S, Wen X, Pritchard JK (2006) A map
of recent positive selection in the human genome. PLoS Biol
4:e72

5. Tishkoff SA, Reed FA, Ranciaro A, Voight BF, Babbitt CC, Sil-
verman JS, Powell K, Mortensen HM, Hirbo JB, Osman M, et
al (2007) Convergent adaptation of human lactase persistence
in Africa and Europe. Nat Genet 39:31–40

6. Workman PL, Blumberg BS, Cooper AJ (1963) Selection, gene
migration and polymorphic stability in a US white and negro
population. Am J Hum Genet 15:429–437

7. Reed TE (1969) Caucasian genes in American negroes. Science
165:762–768

8. Cavalli-Sforza LL, Bodmer WF (1971) The genetics of human
populations. Freeman, San Francisco

9. Long J (1991) The genetic structure of admixed populations.
Genetics 127:417–428

10. Adams J, Ward RH (1973) Admixture studies and detection
of section. Science 180:1137–1143

11. Tang H, Coram M, Wang P, Zhu X, Risch N (2006) Recon-
structing genetic ancestry blocks in admixed individuals. Am
J Hum Genet 79:1–12

12. Carrion AM (1984) Puerto Rico: a political and cultural his-
tory. Norton, New York

13. Burchard EG, Borrell LN, Choudhry S, Naqvi M, Tsai HJ, Rod-
riguez-Santana JR, Chapela R, Rogers SD, Mei R, Rodriguez-
Cintron W, et al (2005) Latino populations: a unique oppor-
tunity for the study of race, genetics, and social environment
in epidemiological research. Am J Public Health 95:2161–2168

14. Burchard EG, Avila PC, Nazario S, Casal J, Torres A, Rodriguez-
Santana JR, Sylvia JS, Fagan JK, Salas J, Lilly CM, et al (2004)
Lower bronchodilator responsiveness in Puerto Rican than
in Mexican asthmatic subjects. Am J Respir Crit Care Med
169:386–392

15. Kennedy GC, Matsuzaki H, Dong S, Liu WM, Huang J, Liu
G, Su X, Cao M, Chen W, Zhang J, et al (2003) Large-scale
genotyping of complex DNA. Nat Biotechnol 21:1233–1237

16. Santiago-Delpin EA, de Echegaray S, Rivera-Cruz F, Rodriguez-
Trinidad A (2002) The histocompatibility profile of the Puerto
Rican population. Transplant Proc 34:3075–3078

17. Zuniga J, Ilzarbe M, Acunha-Alonzo V, Rosetti F, Herbert Z,
Romero V, Almeciga I, Clavijo O, Stern JN, Granados J, et al
(2006) Allele frequencies for 15 autosomal STR loci and ad-
mixture estimates in Puerto Rican Americans. Forensic Sci Int
164:266–270

18. Tang H, Peng J, Wang P, Risch NJ (2005) Estimation of in-
dividual admixture: analytical and study design considerations.
Genet Epidemiol 28:289–301

19. Lautenberger JA, Stephens JD, O’Brien SJ, Smith MW (2000)
Significant admixture linkage disequilibrium across 30 cM
around the FY locus in African Americans. Am J Hum Genet
66:969–978

20. Gilad Y, Bustamante CD, Lancet D, Pääbo S (2003) Natural
selection on the olfactory receptor gene family in humans
and chimpanzees. Am J Hum Genet 73:489–501

21. Shimizu A, Asakawa S, Sasaki T, Yamazaki S, Yamagata H,
Kudoh J, Minoshima S, Kondo I, Shimizu N (2003) A novel
giant gene CSMD3 encoding a protein with CUB and sushi
multiple domains: a candidate gene for benign adult familial
myoclonic epilepsy on human chromosome 8q23.3-q24.1.
Biochem Biophys Res Commun 309:143–154

22. Meyer D, Single RM, Mack SJ, Erlich HA, Thomson G (2006)
Signatures of demographic history and natural selection in
the human major histocompatibility complex loci. Genetics
173:2121–2142

23. Choudhry S, Coyle NE, Tang H, Salari K, Lind D, Clark SL,
Tsai HJ, Naqvi M, Phong A, Ung N, et al (2006) Population
stratification confounds genetic association studies among
Latinos. Hum Genet 118:652–664

24. Kimura M (1955) Solution of a process of random genetic
drift with a continuous model. Proc Natl Acad Sci USA 41:
144–150


	Recent Genetic Selection in the Ancestral Admixture of Puerto Ricans
	Acknowledgments
	References