<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1475-2875-8-117</ui>
   <ji>1475-2875</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Inferring selection in the <it>Anopheles gambiae </it>species complex: an example from immune-related serine protease inhibitors</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Obbard</snm>
               <mi>J</mi>
               <fnm>Darren</fnm>
               <insr iid="I1"/>
               <email>darren.obbard@ed.ac.uk</email>
            </au>
            <au id="A2">
               <snm>Welch</snm>
               <mi>J</mi>
               <fnm>John</fnm>
               <insr iid="I1"/>
               <email>j.j.welch@ed.ac.uk</email>
            </au>
            <au id="A3">
               <snm>Little</snm>
               <mi>J</mi>
               <fnm>Tom</fnm>
               <insr iid="I1"/>
               <email>tom.little@ed.ac.uk</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Institute of Evolutionary Biology, University of Edinburgh, Kings Buildings, West Mains Road, Edinburgh, UK</p>
            </ins>
         </insg>
         <source>Malaria Journal</source>
         <issn>1475-2875</issn>
         <pubdate>2009</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>117</fpage>
         <url>http://www.malariajournal.com/content/8/1/117</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">19497100</pubid>
               <pubid idtype="doi">10.1186/1475-2875-8-117</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>19</day>
               <month>2</month>
               <year>2009</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>04</day>
               <month>6</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>04</day>
               <month>6</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Obbard et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Mosquitoes of the <it>Anopheles gambiae </it>species complex are the primary vectors of human malaria in sub-Saharan Africa. Many host genes have been shown to affect <it>Plasmodium </it>development in the mosquito, and so are expected to engage in an evolutionary arms race with the pathogen. However, there is little conclusive evidence that any of these mosquito genes evolve rapidly, or show other signatures of adaptive evolution.</p>
            </sec>
            <sec>
               <st>
                  <p>Methods</p>
               </st>
               <p>Three serine protease inhibitors have previously been identified as candidate immune system genes mediating mosquito-Plasmodium interaction, and serine protease inhibitors have been identified as hot-spots of adaptive evolution in other taxa. Population-genetic tests for selection, including a recent multi-gene extension of the McDonald-Kreitman test, were applied to 16 serine protease inhibitors and 16 other genes sampled from the <it>An. gambiae </it>species complex in both East and West Africa.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Serine protease inhibitors were found to show a marginally significant trend towards higher levels of amino acid diversity than other genes, and display extensive genetic structuring associated with the 2La chromosomal inversion. However, although serpins are candidate targets for strong parasite-mediated selection, no evidence was found for rapid adaptive evolution in these genes.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>It is well known that phylogenetic and population history in the <it>An. gambiae </it>complex can present special problems for the application of standard population-genetic tests for selection, and this may explain the failure of this study to detect selection acting on serine protease inhibitors. The pitfalls of uncritically applying these tests in this species complex are highlighted, and the future prospects for detecting selection acting on the <it>An. gambiae </it>genome are discussed.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>By vectoring <it>Plasmodium </it>parasites, <it>Anopheles </it>mosquitoes are a central component of the Malaria crisis. Consequently, there has been a substantial effort to identify the genes involved in the mosquito immune response against <it>Plasmodium</it>, including studies to identify genes associated with variation in vector competence <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. It has been widely hypothesized that these immune response genes may be subject to strong parasite-mediated selection, such as that which occurs in a coevolutionary 'arms-race' <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. Such arms-races involve strong reciprocally-antagonistic selection, leading to the frequent and rapid fixation of new alleles. This reduces within-species diversity, while driving between-species protein divergence, and leaves a genomic signature of past selection that can be identified through DNA sequence analysis <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. Thus, DNA sequence analysis and the tools of population genetics can augment understanding of immune gene function in host-parasite interaction by identifying genes that are the target of parasite adaptation, and even distinguish between forms of parasite-mediated selection <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B9">9</abbr></abbrgrp>.</p>
         <p>Population genetic methods have previously shed light on the nature and intensity of selection in both mammalian and <it>Drosophila </it>immune systems. For example, <it>Drosophila </it>studies have suggested that pathogens which manipulate signal transduction pathways or the antiviral RNAi pathway have been a major selective force <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. In <it>Anopheles </it>mosquitoes, the potential for immune-related genes to determine vector competence provides a clear incentive to elucidate the selective forces that drive evolution. Serine protease inhibitors (serpins, or SRPNs) are prime candidates for such parasite-mediated selection in <it>Anopheles </it>mosquitoes. Serpins comprise a large and rapidly evolving super-family of proteins (reviewed in <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>) with key roles in the immune systems of vertebrates <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and invertebrates <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. In particular, <it>Drosophila </it>serpins, such as Nec and SRPN27A, modulate two of the most important defense pathways: the Toll-pathway <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>, and the melanization cascade <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>, and many are up-regulated on septic injury (Spn28D, SRPN27A, Spn5, CG6687 and Spn4, see <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>). Moreover, some <it>Drosophila </it>serpins display very high rates of amino acid substitution, and/or other signatures of adaptive evolution, e.g. <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>.</p>
         <p>Three <it>Anopheles </it>serpins have been experimentally associated with <it>Plasmodium</it>-interaction phenotypes (see Table <tblr tid="T1">1</tblr>). In <it>Anopheles gambiae </it>and <it>Anopheles stephensi </it>SRPN10 is expressed in the mosquito midgut and in haemocytes <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, and during <it>Plasmodium berghei </it>(a rodent parasite) invasion of the midgut epithelium SRPN10 moves from the nucleus to the cytoplasm, and its expression is strongly induced <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. SRPN6 is also expressed in infected midgut epithelial cells and in haemocytes, and again its expression is strongly induced by <it>P. berghei </it>invasion in both <it>An. gambiae </it>and <it>An. stephensi</it>. The expression of SRPN6 is also induced by the human parasite <it>Plasmodium falciparum </it><abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. RNAi knockdown of SRPN6 in <it>An. stephensi </it>resulted in a significant increase in the number of developing <it>P. berghei </it>oocysts, and although knockdown had no effect on oocyst numbers in susceptible strains of <it>An. gambiae</it>, in a resistant strain, the number of melanized <it>P. berghei </it>ookinetes was significantly increased <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. More recently it has also been shown that SRPN6 is induced in the salivary glands of <it>An. gambiae </it>in response to <it>P. berghei </it>sporozoite invasion, and knock-down of SRPN6 by RNAi significantly increases the number of sprozoites reaching the salivary glands <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Finally, knockdown of SRPN2 in <it>P. berghei</it>-susceptible <it>An. gambiae </it>has a broadly opposite effect, resulting in a 97% decrease in oocyst formation through increased lysis and melanization, following midgut invasion <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Locus Details and location</p>
            </caption>
            <tblbdy cols="4">
               <r>
                  <c ca="left">
                     <p>NAME</p>
                  </c>
                  <c ca="left">
                     <p>Identifier</p>
                  </c>
                  <c ca="left">
                     <p>Putative function</p>
                  </c>
                  <c ca="left">
                     <p>Genomic location</p>
                  </c>
               </r>
               <r>
                  <c cspan="4">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SRPN1</p>
                  </c>
                  <c ca="left">
                     <p>AGAP006909</p>
                  </c>
                  <c ca="left">
                     <p>Inhibitory Serine Protease inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>2L:39892128-39893864</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SRPN2</p>
                  </c>
                  <c ca="left">
                     <p>AGAP006911</p>
                  </c>
                  <c ca="left">
                     <p>Plasmodium-related Inhibitory Serine Protease inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>2L:39897002-39899744</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SRPN3</p>
                  </c>
                  <c ca="left">
                     <p>AGAP006910</p>
                  </c>
                  <c ca="left">
                     <p>Inhibitory Serine Protease inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>2L:39895229-39896338</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SRPN4C</p>
                  </c>
                  <c ca="left">
                     <p>AGAP009670</p>
                  </c>
                  <c ca="left">
                     <p>Inhibitory Serine Protease inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>3R:38145527-38154288</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SRPN5</p>
                  </c>
                  <c ca="left">
                     <p>AGAP009221</p>
                  </c>
                  <c ca="left">
                     <p>Inhibitory Serine Protease inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>3R:28858000-28859778</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SRPN6</p>
                  </c>
                  <c ca="left">
                     <p>AGAP009212</p>
                  </c>
                  <c ca="left">
                     <p>Plasmodium-related Inhibitory Serine Protease inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>3R:28811997-28818217</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SRPN7</p>
                  </c>
                  <c ca="left">
                     <p>AGAP007693</p>
                  </c>
                  <c ca="left">
                     <p>Inhibitory Serine Protease inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>2L:49090665-49091915</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SRPN8</p>
                  </c>
                  <c ca="left">
                     <p>AGAP003194</p>
                  </c>
                  <c ca="left">
                     <p>Inhibitory Serine Protease inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>2R:33744972-33746720</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SRPN9</p>
                  </c>
                  <c ca="left">
                     <p>AGAP003139</p>
                  </c>
                  <c ca="left">
                     <p>Inhibitory Serine Protease inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>2R:33148444-33154607</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SRPN10</p>
                  </c>
                  <c ca="left">
                     <p>AGAP005246</p>
                  </c>
                  <c ca="left">
                     <p>Plasmodium-related Inhibitory Serine Protease inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>2L:12996143-13001508</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SRPN11</p>
                  </c>
                  <c ca="left">
                     <p>AGAP001377</p>
                  </c>
                  <c ca="left">
                     <p>Non-inhibitory Serine Protease inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>2R:4017728-4019706</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SRPN12</p>
                  </c>
                  <c ca="left">
                     <p>AGAP001375</p>
                  </c>
                  <c ca="left">
                     <p>Non-inhibitory Serine Protease inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>2R:4010431-4012512</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SRPN14</p>
                  </c>
                  <c ca="left">
                     <p>AGAP007692</p>
                  </c>
                  <c ca="left">
                     <p>Non-inhibitory Serine Protease inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>2L:49084812-49086463</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SRPN16</p>
                  </c>
                  <c ca="left">
                     <p>AGAP009213</p>
                  </c>
                  <c ca="left">
                     <p>Inhibitory Serine Protease inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>3R:28824548-28826209</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SRPN17</p>
                  </c>
                  <c ca="left">
                     <p>AGAP001376</p>
                  </c>
                  <c ca="left">
                     <p>Inhibitory Serine Protease inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>2R:4015617-4016537</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SRPN18</p>
                  </c>
                  <c ca="left">
                     <p>AGAP007691</p>
                  </c>
                  <c ca="left">
                     <p>Non-inhibitory Serine Protease inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>2L:49086842-49088278</p>
                  </c>
               </r>
               <r>
                  <c cspan="4">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Control1</p>
                  </c>
                  <c ca="left">
                     <p>AGAP006906</p>
                  </c>
                  <c ca="left">
                     <p>Adenosine deaminase-related growth factor</p>
                  </c>
                  <c ca="left">
                     <p>2L:39852471-39854636</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Control2</p>
                  </c>
                  <c ca="left">
                     <p>AGAP006904</p>
                  </c>
                  <c ca="left">
                     <p>Matrix metalloproteinase</p>
                  </c>
                  <c ca="left">
                     <p>2L:39831595-39836700</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Control3</p>
                  </c>
                  <c ca="left">
                     <p>AGAP006918</p>
                  </c>
                  <c ca="left">
                     <p>Putative NADH:ubiquinone dehydrogenase</p>
                  </c>
                  <c ca="left">
                     <p>2L:39995907-39997095</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Control4</p>
                  </c>
                  <c ca="left">
                     <p>AGAP009673</p>
                  </c>
                  <c ca="left">
                     <p>glutaminyl-peptide cyclotransferase</p>
                  </c>
                  <c ca="left">
                     <p>3R:38248845-38249780</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Control5</p>
                  </c>
                  <c ca="left">
                     <p>ENSANGG8091</p>
                  </c>
                  <c ca="left">
                     <p>(retrotransposon)</p>
                  </c>
                  <c ca="left">
                     <p>3R:28965847-28968579</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Control6</p>
                  </c>
                  <c ca="left">
                     <p>AGAP009207</p>
                  </c>
                  <c ca="left">
                     <p>Mitogen-activated protein kinase ERK</p>
                  </c>
                  <c ca="left">
                     <p>3R:28697030-28708787</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Control7</p>
                  </c>
                  <c ca="left">
                     <p>AGAP007712</p>
                  </c>
                  <c ca="left">
                     <p>Putative RHO guanyl-nucleotide exchange factor</p>
                  </c>
                  <c ca="left">
                     <p>2L:49181235-49190516</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Control8</p>
                  </c>
                  <c ca="left">
                     <p>AGAP003205</p>
                  </c>
                  <c ca="left">
                     <p>Similar to Drosophila CG8468</p>
                  </c>
                  <c ca="left">
                     <p>2R:33825401-33827998</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Control9</p>
                  </c>
                  <c ca="left">
                     <p>AGAP003143</p>
                  </c>
                  <c ca="left">
                     <p>Similar to Drosophila CG9904</p>
                  </c>
                  <c ca="left">
                     <p>2R:33211906-33213476</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Control10</p>
                  </c>
                  <c ca="left">
                     <p>AGAP005247</p>
                  </c>
                  <c ca="left">
                     <p>no annotation</p>
                  </c>
                  <c ca="left">
                     <p>2L:13062962-13067750</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Control11</p>
                  </c>
                  <c ca="left">
                     <p>AGAP001384</p>
                  </c>
                  <c ca="left">
                     <p>cAMP-dependent protein kinase, beta-catalytic subunit</p>
                  </c>
                  <c ca="left">
                     <p>2R:4098545-4103634</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Control12</p>
                  </c>
                  <c ca="left">
                     <p>AGAP001371</p>
                  </c>
                  <c ca="left">
                     <p>Similar to Drosophila CG18643</p>
                  </c>
                  <c ca="left">
                     <p>2R:3885127-3885956</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Control14</p>
                  </c>
                  <c ca="left">
                     <p>AGAP007713</p>
                  </c>
                  <c ca="left">
                     <p>Similar to human solute carrier family 39</p>
                  </c>
                  <c ca="left">
                     <p>2L:49196817-49198177</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Control16</p>
                  </c>
                  <c ca="left">
                     <p>AGAP900209</p>
                  </c>
                  <c ca="left">
                     <p>DNA-directed RNA polymerase II subunit J</p>
                  </c>
                  <c ca="left">
                     <p>3R:28746535-28747425</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Control17</p>
                  </c>
                  <c ca="left">
                     <p>AGAP001388</p>
                  </c>
                  <c ca="left">
                     <p>Similar to human mab-3-related transcription factor 3</p>
                  </c>
                  <c ca="left">
                     <p>2R:4120810-4122586</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Control18</p>
                  </c>
                  <c ca="left">
                     <p>AGAP007717</p>
                  </c>
                  <c ca="left">
                     <p>Similar to Drosophila CAP CG18408-PE</p>
                  </c>
                  <c ca="left">
                     <p>2L:49212258-49224889</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <p>Here, population-genetic approaches are used to search for evidence of natural selection acting on 16 serpin genes in the <it>An. gambiae </it>species complex, including those implicated in immune function. First, by comparing serpins to other nearby genes, patterns of genetic diversity within and between populations of <it>An. gambiae</it>, <it>Anopheles arabiensis</it>, and <it>Anopheles melas </it>are used to identify loci that deviate strongly from neutral predictions. Second, a recent extension of the McDonald-Kreitman test is used to test for evidence of adaptive substitution between species <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. The data are then discussed in terms of on-going population processes in the <it>An. gambiae </it>complex, many of which have important implications for the robust inference of selection.</p>
         <p>Serpins were found to have slightly higher levels of amino acid diversity than other genes, consistent with either reduced constraint, or potentially with balancing selection. In common with previous analyses. <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, considerable structuring of genetic diversity in the SRPN1-2-3 cluster was found in association with the 2La chromosomal inversion. However, although serpins are good <it>a priori </it>candidates as targets for strong 'arms-race' selection, as with similar studies on other <it>Anopheles gambiae </it>immune-related genes (e.g. <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>), the tests for adaptive evolution presented here are largely inconclusive. The results show how standard population-genetic tests for selection may be difficult to apply in the <it>An. gambiae </it>species complex; this is due to for both demographic and phylogenetic factors that are already widely known, and further supported by the present data.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Samples</p>
            </st>
            <p><it>Anopheles gambiae </it>individuals were collected from West Africa ('BK': Burkina Faso, Koubri village, 12&#176;11'54 N; 1&#176;23'43W) and East Africa ('KY': Kenya, Mbita, Suba District). <it>Anopheles arabiensis </it>individuals were also collected from West Africa ('BK', in the same collections as <it>An. gambiae</it>, above) and East Africa ('TZ', Tanzania, Ifakara). All <it>An. gambiae </it>and <it>An. arabiensis </it>used in this study were provided by H. M. Ferguson (University of Glasgow, UK). <it>Anopheles melas </it>individuals were collected from Coastal Ghana (Ghana, Essiama, 4&#176;57.4 N; 2&#176;24.1W) by N. Tuno (Institute of Tropical Medicine, Nagasaki University, Japan). <it>Anopheles merus </it>and <it>Anopheles quadriannulatus </it>species <it>A </it>(hereafter <it>An. quadriannulatus</it>) were both obtained from laboratory colonies, maintained by the Medical Research Council of South Africa (provided by R. Maharaj; MRC, Durban, South Africa) and the University of Wageningen (Strain 'Sangqua', Zimbabwe, provided by W. Takken), respectively.</p>
            <p>The species identity of all <it>gambiae </it>complex members was verified by diagnostic PCR <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, and the <it>M </it>and <it>S </it>molecular forms of <it>An. gambiae </it>were distinguished by PCR-RFLP [e.g. <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>]. As expected from their known geographic distributions <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, all KY <it>An. gambiae </it>individuals were <it>S </it>form, and all but two of the BK <it>An. gambiae </it>sample were <it>M</it>-form. All <it>An. gambiae </it>individuals were also surveyed for 2La/+ chromosomal inversion status, using the PCR assay of White et al <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, derived from the sequenced breakpoints <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. As reported previously, in addition to the expected diagnostic 207 bp and 492 bp fragment lengths, these primers were found to amplify fragments of lengths <it>ca</it>. 687 bp, 672 bp, 760 bp and 1020 bp in some individuals <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. Direct sequencing of these fragments from a subset of individuals suggest they are insertion/deletion derivatives of expected assay products <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B40">40</abbr></abbrgrp>, and within the polymorphic KY population the 2La/2L+<sup>a </sup>amplification fragments were in Hardy-Weinberg equilibrium (51 individuals, &#967;<sup>2 </sup>= 0.44, 1df., <it>p </it>= 0.51), allowing us to tentatively assign 2La/2L+<sup>a </sup>inversion status to all individuals <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Loci</p>
            </st>
            <p>Thirty-two loci were selected for sequencing and analysis, including 16 of the 18 serpins currently identified in the <it>An. gambiae </it>genome (M. Kanost and K. Michel, pers. comm.), and 16 other protein-coding loci chosen to match the genomic position of the serpins without regard to function. SRPN19 (a non-inhibitory serpin with 1:1:1 orthologs in <it>Drosophila melanogaster</it>, <it>Aedes aegypti </it>and <it>An. gambiae</it>) and SRPN13, which does not appear in the current <it>An. gambiae </it>assembly, were not sequenced. The total sequenced length was ~19 Kbp of coding sequence per individual (i.e. approximately 600 bp from each locus; range 240 bp&#8211;800 bp). Not all loci were sequenced from the same individuals within populations, and not all loci were amplified from <it>An. melas</it>. A full summary of gene names, locations, and classification is presented in Table <tblr tid="T1">1</tblr>.</p>
            <p>The 'control' loci should represent an unbiased sample of <it>Anopheles </it>genes, to which serpins can be compared. Because these control genes are position-matched, each lying ~90 Kbp (range 40&#8211;125 Kbp) from a 'partner' serpin, they should control for the effects of large-scale position-based variation in recombination and mutation rates. Note that improvements to the <it>An. gambiae </it>annotation have subsequently identified control locus 5 (previously annotated as ENSANGG000008091) as deriving from a retrotransposon (AgamP3.4, July, 2007).</p>
         </sec>
         <sec>
            <st>
               <p>PCR and sequencing</p>
            </st>
            <p>Genomic DNA was extracted from single mosquitoes using DNeasy kits (QIAgen). PCR primers were designed from the published <it>An. gambiae </it>genome sequence <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, and the final primer sequences selected after trouble-shooting (sequences are given in Additional file <supplr sid="S1">1</supplr>). Only one PCR amplicon was used per locus, thus sequences do not represent entire genes. Following PCR, unincorporated primers and dNTPs were removed using exonuclease I (New England BioLabs) and shrimp alkaline phosphatase (Amersham). PCR products were sequenced in both directions using BigDye&#8482; reagents (v3.1, Applied BioSystems) and an ABI capillary sequencer. In some amplicons, indel polymorphism required the use of additional sequencing primers. The sequence chromatograms were assembled using SeqManII (DNAstar Inc., Madison USA) then inspected by eye to confirm the validity of all differences within and between species and all heterozygous base-calls. The heterozygous sequence from each diploid individual was decomposed into two pseudohaplotypes for analysis using PHASE <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. However, the presented analyses should be highly robust to any errors in phase assignment, as only explicitly tree-based results, such as Hudson's nearest neighbour statistic (Snn), are affected by allelic phase. All unphased sequences have been submitted to GenBank as population sets, using ambiguity codes to indicate heterozygous sites. Sequence accession numbers span the range <ext-link ext-link-type="gen" ext-link-id="GQ146469">GQ146469</ext-link>&#8211;<ext-link ext-link-type="gen" ext-link-id="GQ148534">GQ148534</ext-link>.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>PCR primers</b>. Locus names, identifiers, genomic locations and PCR primer sequences.</p>
               </text>
               <file name="1475-2875-8-117-S1.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Divergence, diversity and differentiation</p>
            </st>
            <p>The number of synonymous and non-synonymous polymorphisms and substitutions, and the average pairwise genetic diversity (<it>&#960;</it>) were calculated using DnaSP [version 4.50.3, ref <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>]. Diversity was calculated separately for synonymous (<it>&#960;</it><sub><it>s</it></sub>) and non-synonymous (<it>&#960;</it><sub><it>a</it></sub>) sites, and used a Jukes-Cantor correction for multiple substitutions, as implemented in DnaSP. Departures from the allele frequency spectrum expected under the standard neutral model were quantified using Tajima's <it>D </it>statistic <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>, also calculated using DnaSP. Tajima's <it>D </it>(which measures departures from the expected allele frequency distribution under a standard neutral model) was calculated using synonymous sites only, and was calculated separately for both populations of <it>An. gambiae </it>and <it>An. arabiensis </it>(but not for <it>An. melas</it>, where sample sizes were too small to give meaningful results). The significance of departures from the expected allele frequency distribution were tested using 10,000 rounds of coalescent simulation (as implemented in DnaSP) conditional on the number of segregating sites and conservatively assuming no recombination within loci.</p>
            <p>Genetic differentiation between populations was quantified in DnaSP using Hudson's <it>K</it><sub><it>ST </it></sub>statistic [equations 7 to 9 in reference 46] which is calculated from the average number of pairwise differences between sequences taken within populations and between all populations, and is identical to Nei's <it>&#947;</it><sub><it>ST </it></sub><abbrgrp><abbr bid="B47">47</abbr></abbrgrp> except for the population-size weighting scheme [see <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>]. Significant departures from zero population differentiation were inferred by permuting sequences between populations to create a null distribution of <it>K</it><sub><it>ST </it></sub>values. All non-parametric statistical tests on diversity and differentiation were performed using the R statistical language (R Development Core Team, 2008 <url>http://www.R-project.org</url>). Although non-parametric tests (Spearman's rank correlation, paired Wilcoxon tests) are presented below, except where noted explicitly, parametric equivalents (Pearson's correlation, paired t-tests) gave qualitatively identical results.</p>
         </sec>
         <sec>
            <st>
               <p>The proportion of adaptive substitutions</p>
            </st>
            <p>If it is assumed that synonymous mutations are effectively neutral, and that the fixation or loss of selected amino-acid variants is so rapid that the vast majority of non-synonymous polymorphisms are also effectively neutral, then the relative numbers of polymorphisms (<it>P</it>, within species) and fixed differences (<it>D</it>, between species) at synonymous and non-synonymous sites can be used to identify the action of selection [see <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> for an introduction]. This forms the basis of the McDonald-Kreitman test [MK test, <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>], which seeks to detect a departure from independence in a simple 2 &#215; 2 contingency table of polymorphisms (<it>P</it><sub><it>N </it></sub>and <it>P</it><sub><it>S</it></sub>) and fixed differences (<it>D</it><sub><it>N </it></sub>and <it>D</it><sub><it>S</it></sub>). For a single gene, the departure from neutrality can easily be quantified by summary statistic such as the neutrality index (N.I.= (<it>P</it><sub><it>N</it></sub>/<it>P</it><sub><it>S</it></sub>)/(<it>D</it><sub><it>N</it></sub>/<it>D</it><sub><it>S</it></sub>) <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>), or by the estimated proportion of adaptive substitutions (&#945; = 1 - (<it>D</it><sub><it>S</it></sub><it>P</it><sub><it>N</it></sub>)/(<it>D</it><sub><it>N</it></sub><it>P</it><sub><it>S</it></sub>) = 1 - N.I., <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>). This approach can be extended to multiple genes using <it>D</it><sub><it>S</it></sub>, <it>P</it><sub><it>N</it></sub>, <it>D</it><sub><it>N </it></sub>and <it>P</it><sub><it>S </it></sub>averaged across genes <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>, or using a more sophisticated maximum-likelihood estimator of <it>&#945;</it>, such as that of Bierne and Eyre-Walker <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>.</p>
            <p>Here, an extension of the maximum-likelihood method of Welch <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> was used. This method is very closely related to that of Bierne and Eyre-Walker <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>, but additionally allows for the possibility that some apparent fixed differences may actually be polymorphisms that only appear fixed due to small sample size <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. The method was extended to include polymorphism values from two species simultaneously e.g. <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B51">51</abbr></abbrgrp>. Using this approach, models were fitted in which expected neutral divergence, &#955; = &#956;<it>t</it>, took a single value at all loci; expected neutral diversity, &#952; = 4<it>N</it><sub><it>e</it></sub>&#956;, was also shared between all loci, but free to vary between species; and selective constraint, <it>f</it>, was free to vary between loci [see <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> for other details of the model]. Three nested models were fitted, in which (1) &#945; was constrained to zero at all loci, i.e. no adaptive evolution, (2) a single &#945; was shared by all genes, and (3) &#945; was free to differ between serpins and 'control' genes. In this way, it was possible to test both for evidence of adaptive evolution, and whether serpins have a different rate of adaptive evolution to other genes. Model fit was tested using both likelihood ratio tests and Akaike weighting (derived from the Akaike information criterion) <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. Confidence intervals on &#945; were obtained by adjusting &#945; away from its maximum likelihood value, and allowing the other parameters to take their maximum likelihood value, conditional on that &#945;, until log likelihood decreased by 2 units <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. C code to fit these models is available on request from the authors, or from <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Synonymous site diversity</p>
            </st>
            <p>Across all 32 loci, average pairwise genetic diversity at synonymous sites (<it>&#960;</it><sub><it>s</it></sub>) was highest in <it>An. gambiae </it>(<it>&#960;</it><sub><it>s </it></sub>= 2.82%, 95% bootstrap interval 2.25&#8211;3.41%), lowest in <it>An. melas </it>(<it>&#960;</it><sub><it>s </it></sub>= 0.86%, 0.53&#8211;1.19%), and intermediate in <it>An. arabiensis </it>(<it>&#960;</it><sub><it>s </it></sub>= 1.97%, [1.50, 2.46]; Figure <figr fid="F1">1a</figr>). The difference in <it>&#960;</it><sub><it>s </it></sub>between <it>An. gambiae </it>and <it>An. arabiensis </it>was highly significant (Paired Wilcox test using 32 loci, V = 444, <it>p </it>= 0.0004), and <it>&#960;</it><sub><it>s </it></sub>correlated strongly between these species (Spearman's rho = 0.72, S = 1504, <it>p </it>&lt; 6 &#215; 10<sup>-6</sup>, Additional file <supplr sid="S2">2</supplr>). Diversity in <it>An. gambiae </it>and <it>An. arabiensis </it>did not correlate with diversity in <it>An. melas </it>(<it>p </it>> 0.5 in both cases, 26 loci). For a full summary of synonymous diversity, and all other summary stats that follow, see Additional file <supplr sid="S3">3</supplr>.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Genetic diversity at synonymous and non-synonymous sites</p>
               </caption>
               <text>
                  <p><b>Genetic diversity at synonymous and non-synonymous sites</b>. Genetic diversity (the percentage of sites that differ on average between haplotypes) at synonymous (<it>&#960;</it><sub><it>s</it></sub>) and nonsynonymous (<it>&#960;</it><sub><it>a</it></sub>) sites measured at 32 loci in populations of <it>An. gambiae</it>, <it>An. arabiensis </it>and <it>An. melas</it>. Diversity is shown separately for control loci (dark bars) and serpins (pale bars), and is shown for the species as a whole, and for each population separately. Note that although only two individuals (4 haplotypes) were sampled for S-form <it>An. gambiae </it>in population BK, ~19 Kbp of sequence will provide a good estimate of <it>&#960; </it>if mating within the population is random. Diversity was significantly higher in <it>An. gambiae </it>than in <it>An. arabiensis</it>, and significantly lower in <it>An. melas</it>. For non-synonymous sites, serpins had significantly higher diversity than control loci, but this trend was non-significant at synonymous sites. See main text for details, and Additional File <supplr sid="S1">1</supplr> for the raw data.</p>
               </text>
               <graphic file="1475-2875-8-117-1"/>
            </fig>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p><b>Correlations in genetic diversity between species</b>. The correlation in genetic diversity for loci sampled from <it>Anopheles gambiae </it>and <it>Anopheles arabiensis</it>.</p>
               </text>
               <file name="1475-2875-8-117-S2.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p><b>Genetic diversity and differentiation</b>. Sheet 1: genetic diversity at synonymous and non-synonymous sites, and Tajima's D statistic for synonymous sites, for all loci and populations. Sheet 2: genetic differentiation (Fst, Kst and Snn) between populations for all loci.</p>
               </text>
               <file name="1475-2875-8-117-S3.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>For <it>An. gambiae</it>, genetic diversity was slightly higher in West Africa (<it>&#960;</it><sub><it>s </it></sub>= 2.74%; BK M-form) than in East Africa (<it>&#960;</it><sub><it>s </it></sub>= 2.17%; KY S-form), and although the effect was small, it was statistically significant (Paired Wilcox V = 380, <it>p </it>= 0.03, Figure <figr fid="F1">1a</figr>). Diversity was also highly correlated between East- and West-African populations (Spearman's rho = 0.72, S = 1520, <it>p </it>= 3 &#215; 10<sup>-6</sup>). Although only two S-form individuals were sampled from West Africa, they displayed higher diversity than the either of the other two <it>An. gambiae </it>populations (BK S-form; <it>&#960;</it><sub><it>s </it></sub>= 2.98%). For <it>An. arabiensis</it>, diversity correlated even more strongly between East and West Africa (rho = 0.87, S = 729, <it>p </it>&lt; 2 &#215; 10<sup>-10</sup>) and did not differ significantly between the populations (<it>&#960;</it><sub><it>s </it></sub>= 1.96% in BK vs. <it>&#960;</it><sub><it>s </it></sub>= 1.79% in TZ, Paired Wilcoxon test V = 316, <it>p </it>= 0.19).</p>
            <p>Synonymous site diversity did not differ significantly between serpins and other genes in either <it>An. gambiae </it>(3.10% vs. 2.53%, Paired Wilcoxon V = 48, <it>p </it>= 0.32, Figure <figr fid="F1">1a</figr>) or <it>An. arabiensis </it>(2.00% vs. 1.93%, Paired Wilcox V = 60, <it>p </it>= 0.71). Although position-matched, no correlation in diversity could be detected between serpins and their corresponding control genes (<it>p </it>> 0.1 in both cases), suggesting that the effect of genomic location on neutral diversity was relatively weak.</p>
         </sec>
         <sec>
            <st>
               <p>Non-synonymous site diversity</p>
            </st>
            <p>Non-synonymous diversity (<it>&#960;</it><sub><it>a</it></sub>) was very similar between <it>An. gambiae </it>(&#960;<sub><it>a </it></sub>= 0.22%; 95% bootstrap interval 0.13&#8211;0.31%) and <it>An. arabiensis </it>(<it>&#960;</it><sub><it>a </it></sub>= 0.18%; bootstrap interval: 0.12&#8211;0.24%, Figure <figr fid="F1">1b</figr>), and did not differ significantly between the species (Paired Wilcoxon test V = 285, <it>p </it>= 0.29). Although <it>&#960;</it><sub><it>a </it></sub>did not correlate significantly with <it>&#960;</it><sub><it>s </it></sub>in either <it>An. gambiae </it>or <it>An. arabiensis </it>(correlation coefficients were 0.17 and 0.07 respectively, <it>p </it>> 0.3 in both cases), there was a strong correlation in <it>&#960;</it><sub><it>a </it></sub>between the two species (rho = 0.81, S = 1037, <it>p </it>= 2 &#215; 10<sup>-8</sup>, Additional file <supplr sid="S2">2</supplr>). East and West African populations did not differ significantly in <it>&#960;</it><sub><it>a </it></sub>for either <it>An. gambiae </it>or <it>An. arabiensis </it>(<it>p </it>> 0.1 in both cases), but <it>&#960;</it><sub><it>a </it></sub>did correlate very highly between East and West African populations (rho = 0.80 <it>p </it>= 6. &#215; 10<sup>-8</sup>, and rho = 0.88, p = 2 &#215; 10<sup>-11</sup>, respectively). Interestingly, &#960;<sub><it>a </it></sub>was higher for serpins than for other genes in both <it>An. gambiae </it>and <it>An. arabiensis </it>(<it>&#960;</it><sub><it>a </it></sub>= 0.30% vs. 0.13%, and <it>&#960;</it><sub><it>a </it></sub>= 0.25% vs. 0.11%), although statistical significance was marginal for <it>An. gambiae </it>(Paired Wilcoxon V = 30 <it>p </it>= 0.051, and V = 24, <it>p </it>= 0.024 for <it>An. gambiae </it>and <it>An. arabiensis </it>respectively, Figure <figr fid="F1">1b</figr>). Despite fewer loci being sequenced, this effect could also be detected in <it>An. melas </it>(&#960;<sub>a </sub>= 0.15% vs. 0.06%, unpaired Wilcoxon test W = 25, <it>p </it>= 0.034, Figure <figr fid="F1">1b</figr>).</p>
         </sec>
         <sec>
            <st>
               <p>Allele frequency spectra</p>
            </st>
            <p>Tajima's <it>D </it>statistic for synonymous sites was negative in both populations of <it>An. arabiensis </it>(<it>D </it>= -0.29 and <it>D </it>= -0.38, averages across loci in TZ and BK respectively) and in <it>An. gambiae </it>BK (<it>D </it>= -0.71, <it>M</it>-form individuals only), and did not differ between serpins and other genes (Wilcoxon tests, <it>p </it>> 0.5 in all cases). Tajima's <it>D </it>did not differ significantly between the two populations of <it>An. arabiensis </it>(Paired Wilcoxon V = 193, <it>p </it>= 0.42), but did correlate between the populations (rho = 0.43 S = 2543.131, <it>p </it>= 0.017). In population BK, Tajima's <it>D </it>was correlated between <it>An. gambiae </it>and <it>An. arabiensis </it>(rho = 0.490 S = 2293, <it>p </it>= 0.006), but was significantly more negative in <it>An. gambiae </it>(Paired Wilcoxon V = 115, <it>p </it>= 0.015). Strikingly, Tajima's <it>D </it>was generally positive in <it>An. gambiae </it>population KY (mean across loci 0.77, 22 out of the 28 genes with non-zero diversity had <it>D </it>> 0, with overall 95% bootstrap interval [0.40, 1.15]). This was significantly higher than in BK (Paired Wilcoxon test V = 15, <it>p </it>= 1 &#215; 10<sup>-6</sup>).</p>
            <p>In <it>An. arabiensis</it>, five genes had individually significantly negative Tajima's D statistics (<it>p </it>&lt; 0.05 in all cases, no correction for multiple tests): control loci 1 (BK and TZ) and 5 (BK), and serpins 10 (BK), 6 and 7 (TZ). In <it>An. gambiae </it>population BK (<it>M</it>-form only) 7 genes had significantly negative <it>D </it>values: serpins 7, 9 and 14, and control loci 5, 6, 10 and 11 (<it>p </it>&lt; 0.05 in all cases).</p>
         </sec>
         <sec>
            <st>
               <p>Genetic differentiation between populations</p>
            </st>
            <p>In <it>An. arabiensis</it>, differentiation between East and West Africa was very low (<it>K</it><sub><it>ST </it></sub>= 0.08) and not significantly different from zero at 15 of the 32 loci examined. In <it>An. gambiae</it>, differentiation between East and West Africa was much higher (<it>K</it><sub><it>ST </it></sub>= 0.14; all loci except control locus 9 were individually significantly differentiated) and this difference between the species was statistically significant (paired Wilcoxon test V = 392, <it>p </it>= 0.016). Note that this differentiation reflects not only geographic separation, but also differentiation between <it>M </it>and <it>S </it>molecular forms of <it>An. gambiae</it>. In West Africa (BK) differentiation between <it>M</it>- and <it>S-</it>form was very low (<it>K</it><sub><it>ST </it></sub>= 0.016), and significantly lower than differentiation between <it>S</it>-form sampled from East Africa and <it>S</it>-form sampled from West Africa (<it>K</it><sub><it>ST </it></sub>= 0.073, paired Wilcoxon test V = 442, <it>p </it>= 0.0005). Only two S-form <it>An. gambiae </it>individuals were sampled from BK, making estimates of differentiation potentially poor and reducing the power of the test. However, assuming random mating, the estimates should not be biased by the small number of individuals sampled, and the large number of loci (32, ~19 Kbp of sequence) will reduce sampling error. For an overview of genetic differentiation see Figure <figr fid="F2">2</figr>.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Genetic differentiation between populations</p>
               </caption>
               <text>
                  <p><b>Genetic differentiation between populations</b>. Arrows indicate approximate sampling locations within Africa, and letters identify species (A- <it>An. arabiensis</it>, G- <it>An. gambiae</it>, M-form and S-form). Dashed lines indicate pairs of populations for which genetic differentiation was calculated, and numbers are <it>K</it><sub><it>ST </it></sub>statistics, averaged across all 32 loci.</p>
               </text>
               <graphic file="1475-2875-8-117-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>The 2La chromosomal inversion</p>
            </st>
            <p><it>Anopheles gambiae </it>population KY was highly polymorphic for the 2La/+ chromosomal inversion, allowing us to test for differentiation between the two inversion states. Dividing population KY into two groups on the basis of on inversion status (2La homozygotes <it>versus </it>2L+<sup>a </sup>homozygotes, heterozygotes excluded) identified very strong population structure associated with the inversion. At the six loci sampled from within the inversion (serpins 1&#8211;3 and control loci 1&#8211;3) mean differentiation between inversion-groups was <it>K</it><sub><it>ST </it></sub>= 0.25 (all six loci were significantly differentiated: permutation <it>p </it>&lt; 0.01 for each locus). Across the six other polymorphic loci sequenced from chromosome arm 2L (serpins 7, 10, 14 and 18, control loci 10 and 18), differentiation between these groupings was <it>K</it><sub><it>ST </it></sub>= 0.03, and was not significantly different from zero in any locus (<it>p </it>> 0.14 by permutation, in each locus).</p>
         </sec>
         <sec>
            <st>
               <p>Divergence and differentiation between species</p>
            </st>
            <p>Genetic divergence (substitutions per site) between <it>An. gambiae </it>and <it>An. arabiensis </it>was very low: <it>K</it><sub><it>S </it></sub>= 3.5% (averaged across loci), and when corrected for diversity (i.e. <inline-formula><graphic file="1475-2875-8-117-i1.gif"/></inline-formula>, page 220 in <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>), <it>K</it><sub><it>S </it></sub>= 1.2%. No synonymous fixed differences were identified between these species, and only three non-synonymous fixed differences (all in control locus 5, derived from a transposable element). Divergence from <it>An. melas </it>was higher for both <it>An. gambiae </it>(<it>K</it><sub><it>S </it></sub>= 6.4%, corrected <it>K</it><sub><it>S </it></sub>= 4.6%) and <it>An. arabiensis </it>(<it>K</it><sub><it>S </it></sub>= 6.3%, corrected <it>K</it><sub><it>S </it></sub>= 4.9%), as was divergence from <it>An. merus </it>(uncorrected <it>K</it><sub><it>S </it></sub>= 5.3% and <it>K</it><sub><it>S </it></sub>= 4.8%, respectively); divergence between these species and <it>An. quadriannulatus </it>was intermediate (uncorrected): <it>K</it><sub><it>S </it></sub>= 4.01% and <it>K</it><sub><it>S </it></sub>= 4.30% respectively. For an overview of interspecies divergence and an illustration of gene trees versus species trees, see Figure <figr fid="F3">3</figr>.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Genetic divergence within the <it>gambiae </it>species complex</p>
               </caption>
               <text>
                  <p><b>Genetic divergence within the <it>gambiae </it>species complex</b>. (a) An un-rooted neighbour-joining tree, calculated from pairwise <it>K</it><sub><it>S </it></sub>between species averaged across loci. Branch lengths are to scale. The filled triangles illustrate the relative scale of diversity and divergence within the complex, such that the length of the triangle is half the divergence between haplotypes within species (i.e <it>&#960;</it><sub><it>s</it></sub>/2) and net divergence (<it>K</it><sub><it>S</it></sub>-<it>&#960;</it><sub><it>s</it></sub>) corresponds to branch-lengths that are not part of the triangle. (Note that population samples and thus <it>&#960;</it><sub><it>s </it></sub>were not available for <it>An. quadriannulatus A </it>and <it>An. merus</it>). (b)-(d) Neighbour-joining cladograms (i.e. topology only, branch-lengths uninformative) showing the unique alleles sequenced from three loci. Note that in all cases <it>An. gambiae </it>and <it>An. arabiensis </it>alleles are intermixed. (b) to (d) are control locus 14, SRPN7 and SRPN11, selected to illustrate a wide range of <it>K</it><sub><it>ST </it></sub>values between <it>An. gambiae </it>and <it>An. arabiensis </it>(<it>K</it><sub><it>ST </it></sub>= 0.51, 0.10 and 0.09, respectively).</p>
               </text>
               <graphic file="1475-2875-8-117-3"/>
            </fig>
            <p>Genetic differentiation (the proportion of total diversity attributable to between-species differences) between <it>An. gambiae </it>and <it>An. arabiensis </it>species was <it>K</it><sub><it>ST </it></sub>= 0.19 (95% bootstrap interval across loci [0.16, 0.24]). Differentiation between <it>An. gambiae </it>and <it>An. melas </it>was much higher (<it>K</it><sub><it>ST </it></sub>= 0.33 [0.27, 0.40]), as was differentiation between <it>An. arabiensis </it>and <it>An. melas </it>(<it>K</it><sub><it>ST </it></sub>= 0.50 [0.44, 0.57]). Differentiation between <it>An. gambiae </it>and <it>An. arabiensis </it>was lower for serpins than for control loci (<it>K</it><sub><it>ST </it></sub>= 0.14 vs. <it>K</it><sub><it>ST </it></sub>= 0.25, paired Wilcox test V = 113, <it>p </it>= 0.02). However, although the trend was in the same direction in East and West Africa, this effect was only significant in West Africa (V = 108, <it>p </it>= 0.04, as compared to V = 99, <it>p </it>= 0.12).</p>
         </sec>
         <sec>
            <st>
               <p>Inference of adaptive substitutions</p>
            </st>
            <p>Due to the lack of fixed differences between <it>An. gambiae</it>, <it>An. arabiensis </it>and <it>An. quadriannulatus</it>, McDonald-Kreitman based approaches were not applied to these data (see Discussion). However, it was possible to apply the method of Welch <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> using <it>An. gambiae</it>/<it>An. arabiensis</it>, and their divergence from <it>An. melas </it>and/or <it>An. merus</it>. According to the likelihood ratio test, neither analysis using <it>An. melas </it>provided any evidence supporting adaptive substitutions between the lineages (there was no significant improvement in model fit between fixing &#945; = 0 and allowing &#945; to take its maximum likelihood value; <it>p </it>> 0.19 in all cases: Table <tblr tid="T2">2</tblr>). This was also true for the test using <it>An. gambiae </it>and <it>An. merus </it>(<it>p </it>> 0.2: Table <tblr tid="T2">2</tblr>). For the test using <it>An. arabiensis </it>and <it>An. merus </it>there was some evidence that &#945; was significantly greater than zero, (&#945; = 0.34 [0.08, 0.53]; 2&#916;lnL = 6.16, 1d.f., <it>p </it>= 0.013, significance lost if a correction is made for multiple tests), but no significant improvement in model fit was obtained by allowing &#945; to differ between serpins and control loci, or between Plasmodium-related serpins and all other loci. This suggests that approximately 8&#8211;53% of amino-acid substitutions between <it>An. arabiensis </it>and <it>An. merus </it>were adaptive, but this value did not differ significantly between serpins and the control loci. Analysis of Akaike weights gives a qualitatively identical result: in each species-pair no model is strongly preferred, but in the <it>merus</it>-<it>arabiensis </it>comparison, &#945; = 0 receives relatively little weight. Full results of the McDonald-Kreitman analysis are presented in table <tblr tid="T2">2</tblr>, and raw data are given in Additional file <supplr sid="S4">4</supplr>.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Estimates of the proportion of adaptive substitutions</p>
               </caption>
               <tblbdy cols="9">
                  <r>
                     <c ca="right">
                        <p>
                           <b>&#945; model</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>Par</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>log<it>(L)</it></b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>2&#916;log<it>(L)</it></b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>&#967;<sup>2 </sup><it>p</it>-value</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>AIC</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>Akaike weight</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>&#945;<sub>a</sub></b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>&#945;<sub>b</sub></b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="9" ca="left">
                        <p>
                           <it>An. gambiae vs. An. melas</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="right">
                        <p>&#945; = 0</p>
                     </c>
                     <c ca="right">
                        <p>29</p>
                     </c>
                     <c ca="right">
                        <p>-336.49</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>730.98</p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>0.424</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>[0]</p>
                     </c>
                     <c ca="right">
                        <p>[0]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="right">
                        <p>&#945; ~ (all loci)</p>
                     </c>
                     <c ca="right">
                        <p>30</p>
                     </c>
                     <c ca="right">
                        <p>-335.76</p>
                     </c>
                     <c ca="right">
                        <p>1.46</p>
                     </c>
                     <c ca="right">
                        <p>0.23</p>
                     </c>
                     <c ca="right">
                        <p>731.52</p>
                     </c>
                     <c ca="right">
                        <p>0.323</p>
                     </c>
                     <c ca="right">
                        <p>0.23</p>
                     </c>
                     <c ca="right">
                        <p>[0.23]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="right">
                        <p>&#945; ~ (control, serpin)</p>
                     </c>
                     <c ca="right">
                        <p>31</p>
                     </c>
                     <c ca="right">
                        <p>-335.66</p>
                     </c>
                     <c ca="right">
                        <p>0.19</p>
                     </c>
                     <c ca="right">
                        <p>0.91</p>
                     </c>
                     <c ca="right">
                        <p>733.33</p>
                     </c>
                     <c ca="right">
                        <p>0.131</p>
                     </c>
                     <c ca="right">
                        <p>0.18</p>
                     </c>
                     <c ca="right">
                        <p>0.25</p>
                     </c>
                  </r>
                  <r>
                     <c ca="right">
                        <p>&#945; ~ (other, immune)</p>
                     </c>
                     <c ca="right">
                        <p>31</p>
                     </c>
                     <c ca="right">
                        <p>-335.73</p>
                     </c>
                     <c ca="right">
                        <p>0.06</p>
                     </c>
                     <c ca="right">
                        <p>0.97</p>
                     </c>
                     <c ca="right">
                        <p>733.46</p>
                     </c>
                     <c ca="right">
                        <p>0.123</p>
                     </c>
                     <c ca="right">
                        <p>0.24</p>
                     </c>
                     <c ca="right">
                        <p>0.07</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="9" ca="left">
                        <p>
                           <it>An. arabiensis vs. An. melas</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="right">
                        <p>&#945; = 0</p>
                     </c>
                     <c ca="right">
                        <p>29</p>
                     </c>
                     <c ca="right">
                        <p>-329.36</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>716.73</p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>0.495</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>[0]</p>
                     </c>
                     <c ca="right">
                        <p>[0]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="right">
                        <p>&#945; ~ (all loci)</p>
                     </c>
                     <c ca="right">
                        <p>30</p>
                     </c>
                     <c ca="right">
                        <p>-329.31</p>
                     </c>
                     <c ca="right">
                        <p>0.10</p>
                     </c>
                     <c ca="right">
                        <p>0.75</p>
                     </c>
                     <c ca="right">
                        <p>718.63</p>
                     </c>
                     <c ca="right">
                        <p>0.191</p>
                     </c>
                     <c ca="right">
                        <p>-0.05</p>
                     </c>
                     <c ca="right">
                        <p>[-0.05]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="right">
                        <p>&#945; ~ (control, serpin)</p>
                     </c>
                     <c ca="right">
                        <p>31</p>
                     </c>
                     <c ca="right">
                        <p>-328.07</p>
                     </c>
                     <c ca="right">
                        <p>2.48</p>
                     </c>
                     <c ca="right">
                        <p>0.29</p>
                     </c>
                     <c ca="right">
                        <p>718.15</p>
                     </c>
                     <c ca="right">
                        <p>0.243</p>
                     </c>
                     <c ca="right">
                        <p>-0.54</p>
                     </c>
                     <c ca="right">
                        <p>0.10</p>
                     </c>
                  </r>
                  <r>
                     <c ca="right">
                        <p>&#945; ~ (other, immune)</p>
                     </c>
                     <c ca="right">
                        <p>31</p>
                     </c>
                     <c ca="right">
                        <p>-329.30</p>
                     </c>
                     <c ca="right">
                        <p>0.02</p>
                     </c>
                     <c ca="right">
                        <p>0.99</p>
                     </c>
                     <c ca="right">
                        <p>720.61</p>
                     </c>
                     <c ca="right">
                        <p>0.071</p>
                     </c>
                     <c ca="right">
                        <p>-0.03</p>
                     </c>
                     <c ca="right">
                        <p>-0.15</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="9" ca="left">
                        <p>
                           <it>An. gambiae vs. An. merus</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="right">
                        <p>&#945; = 0</p>
                     </c>
                     <c ca="right">
                        <p>34</p>
                     </c>
                     <c ca="right">
                        <p>-333.36</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>734.72</p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>0.349</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>[0]</p>
                     </c>
                     <c ca="right">
                        <p>[0]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="right">
                        <p>&#945; ~ (all loci)</p>
                     </c>
                     <c ca="right">
                        <p>35</p>
                     </c>
                     <c ca="right">
                        <p>-332.60</p>
                     </c>
                     <c ca="right">
                        <p>1.52</p>
                     </c>
                     <c ca="right">
                        <p>0.22</p>
                     </c>
                     <c ca="right">
                        <p>735.21</p>
                     </c>
                     <c ca="right">
                        <p>0.274</p>
                     </c>
                     <c ca="right">
                        <p>0.15</p>
                     </c>
                     <c ca="right">
                        <p>[0.15]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="right">
                        <p>&#945; ~ (control, serpin)</p>
                     </c>
                     <c ca="right">
                        <p>36</p>
                     </c>
                     <c ca="right">
                        <p>-331.59</p>
                     </c>
                     <c ca="right">
                        <p>2.02</p>
                     </c>
                     <c ca="right">
                        <p>0.36</p>
                     </c>
                     <c ca="right">
                        <p>735.18</p>
                     </c>
                     <c ca="right">
                        <p>0.277</p>
                     </c>
                     <c ca="right">
                        <p>0.34</p>
                     </c>
                     <c ca="right">
                        <p>0.03</p>
                     </c>
                  </r>
                  <r>
                     <c ca="right">
                        <p>&#945; ~ (other, immune)</p>
                     </c>
                     <c ca="right">
                        <p>36</p>
                     </c>
                     <c ca="right">
                        <p>-332.60</p>
                     </c>
                     <c ca="right">
                        <p>0.00</p>
                     </c>
                     <c ca="right">
                        <p>1.00</p>
                     </c>
                     <c ca="right">
                        <p>737.21</p>
                     </c>
                     <c ca="right">
                        <p>0.101</p>
                     </c>
                     <c ca="right">
                        <p>0.15</p>
                     </c>
                     <c ca="right">
                        <p>0.17</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="9" ca="left">
                        <p>
                           <it>An. arabiensis vs. An. merus</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="right">
                        <p>&#945; = 0</p>
                     </c>
                     <c ca="right">
                        <p>34</p>
                     </c>
                     <c ca="right">
                        <p>-311.04</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>690.07</p>
                     </c>
                     <c ca="right">
                        <p>0.053</p>
                     </c>
                     <c ca="right">
                        <p>[0]</p>
                     </c>
                     <c ca="right">
                        <p>[0]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="right">
                        <p>&#945; ~ (all loci)</p>
                     </c>
                     <c ca="right">
                        <p>35</p>
                     </c>
                     <c ca="right">
                        <p>-307.96</p>
                     </c>
                     <c ca="right">
                        <p>6.15</p>
                     </c>
                     <c ca="right">
                        <p>0.01</p>
                     </c>
                     <c ca="right">
                        <p>685.92</p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>0.422</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>0.34</p>
                     </c>
                     <c ca="right">
                        <p>[0.34]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="right">
                        <p>&#945; ~ (control, serpin)</p>
                     </c>
                     <c ca="right">
                        <p>36</p>
                     </c>
                     <c ca="right">
                        <p>-307.22</p>
                     </c>
                     <c ca="right">
                        <p>1.48</p>
                     </c>
                     <c ca="right">
                        <p>0.48</p>
                     </c>
                     <c ca="right">
                        <p>686.44</p>
                     </c>
                     <c ca="right">
                        <p>0.325</p>
                     </c>
                     <c ca="right">
                        <p>0.47</p>
                     </c>
                     <c ca="right">
                        <p>0.25</p>
                     </c>
                  </r>
                  <r>
                     <c ca="right">
                        <p>&#945; ~ (other, immune)</p>
                     </c>
                     <c ca="right">
                        <p>36</p>
                     </c>
                     <c ca="right">
                        <p>-307.71</p>
                     </c>
                     <c ca="right">
                        <p>0.50</p>
                     </c>
                     <c ca="right">
                        <p>0.78</p>
                     </c>
                     <c ca="right">
                        <p>687.42</p>
                     </c>
                     <c ca="right">
                        <p>0.200</p>
                     </c>
                     <c ca="right">
                        <p>0.36</p>
                     </c>
                     <c ca="right">
                        <p>0.10</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>&#945;<sub>a </sub>and &#945;<sub>b </sub>are estimates of the proportion of adaptive substitutions in each of the two classes of gene (control/serpin or non-immune/immune, respectively), Par is the number of parameters in the model. Where the value of &#945; is constrained by the model it is marked in square brackets. Negative values arise from an 'excess' of non-synonymous polymorphism, and could represent sampling error, or mildly deleterious polymorphisms <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. AIC is the Akaike Information Criterion. The Akaike weighting can be interpreted as the weight of evidence in favour of the corresponding model, given the relative support for all the available models <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>.</p>
               </tblfn>
            </tbl>
            <suppl id="S4">
               <title>
                  <p>Additional file 4</p>
               </title>
               <text>
                  <p><b>McDonald-Kreitman data</b>. Sample sizes, analysed gene lengths, polymorphisms and fixed differences for each locus, for each pair of species</p>
               </text>
               <file name="1475-2875-8-117-S4.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <sec>
            <st>
               <p>Population history and speciation in Anopheles</p>
            </st>
            <p>The <it>An. gambiae </it>complex falls within the <it>Pyretophorus </it>series of the subgenus <it>Cellia</it>, and comprises a closely-related group of approximately eight species (<it>An. gambiae s.s., An. arabiensis, Anopheles bwambae, An. quadriannulatus A and B, An. merus, An. melas and Anopheles comorensis</it>) <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp>, plus at least one case of incipient speciation (<it>M </it>and <it>S </it>molecular forms of <it>An. gambiae </it>s.s. <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr></abbrgrp>). Because lineages within the complex differ in their importance as <it>Plasmodium </it>vectors [e.g., <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>], in their ecological preferences <abbrgrp><abbr bid="B62">62</abbr><abbr bid="B63">63</abbr></abbrgrp>, and in their resistance to pesticides [e.g. <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>], there is considerable value in understanding both species relationships and how populations are structured. This may be of particular consequence if any attempt is ever made to genetically modify wild <it>Anopheles </it>populations to block or reduce <it>Plasmodium </it>transmission <abbrgrp><abbr bid="B64">64</abbr><abbr bid="B65">65</abbr></abbrgrp>. However, in addition to having important implications for vector control, as discussed below, understanding phylogeny and population history are also essential to the robust inference of selection.</p>
            <p>Previous analyses suggest that <it>An. gambiae </it>and <it>An. arabiensis </it>are sister taxa, and the data presented here from five of the eight species are in strong agreement, placing <it>An. gambiae </it>and <it>An. arabiensis </it>as the most closely related species pair, with greater divergence to <it>An. merus </it>and <it>An. melas </it>(Figure <figr fid="F3">3a</figr>; note that the tree is unrooted). However, in common with previous studies [e.g. <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B66">66</abbr><abbr bid="B67">67</abbr></abbrgrp>], these data suggest extensive shared polymorphism (Figure <figr fid="F3">3</figr>) and very low differentiation between <it>An. gambiae </it>and <it>An. arabiensis </it>(<it>K</it><sub><it>ST </it></sub>= 0.19). The inter-species differentiation between <it>An. gambiae </it>and <it>An. arabiensis </it>is approximately the same as inter-population differentiation between African and European <it>D. melanogaster </it>(<it>K</it><sub><it>ST </it></sub>= 0.16 to 0.24, depending on population; pers. comm. P. R. Haddrill, data from <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>), and is lower than inter-population differentiation in the predominantly selfing nematode <it>Caenorhabditis elegans </it>(<it>K</it><sub><it>ST </it></sub>= 0.38, pers. com. A. D. Cutter, data from <abbrgrp><abbr bid="B69">69</abbr></abbrgrp>). Thus, although <it>An. gambiae </it>and <it>An. arabiensis </it>are largely reproductively isolated and significantly differentiated <abbrgrp><abbr bid="B70">70</abbr><abbr bid="B71">71</abbr></abbrgrp>, these data confirm that either they share extensive ancestral polymorphism, or that there is considerable introgression between them [see also <abbrgrp><abbr bid="B67">67</abbr><abbr bid="B72">72</abbr><abbr bid="B73">73</abbr></abbrgrp>]. This is further supported by the very high correlation in neutral diversity across genes, between these two species (Additional file <supplr sid="S2">2</supplr>).</p>
            <p>Given a particular divergence time, effective population size is the primary determinant of the amount of shared ancestral polymorphism between taxa, because drift (and therefore lineage-sorting) is faster in small populations [e.g. <abbrgrp><abbr bid="B74">74</abbr></abbrgrp>]. Thus, although it is likely that <it>An. gambiae </it>and <it>An. arabiensis </it>share a more recent common ancestor that either does with <it>An. melas</it>, the lower differentiation between <it>An. gambiae </it>and <it>An. arabiensis </it>could also be explained (at least in part) by differences in effective population size within the complex. Since <it>&#960;</it><sub><it>s </it></sub>is an estimator of 4<it>N</it><sub><it>e</it></sub>&#956;, differences in neutral diversity imply that <it>An. gambiae </it>and <it>An. arabiensis </it>have larger effective population sizes than <it>An. merus</it>, consistent both with their wider geographic range, and with potentially higher levels of shared ancestral polymorphism (see above: <it>&#960;</it><sub><it>s</it></sub>~2.8%, 2.0%, and 0.9% for <it>An. gambiae</it>, <it>An. arabiensis </it>and <it>An. melas</it>, respectively). Diversity in <it>An. gambiae </it>and <it>An. arabiensis </it>is similar to that seen in African populations of <it>Drosophila simulans </it>and <it>D. melanogaster </it>(<it>&#960;</it><sub><it>s </it></sub>= 3.2% and <it>&#960;</it><sub><it>s </it></sub>= 1.7%, respectively) [e.g. <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>], and assuming the mutation rate (&#956;) is similar between mosquitoes and <it>Drosophila</it>, this also suggests a long-term effective population size for <it>An. gambiae </it>that is about 70% larger than <it>D. melanogaster </it><abbrgrp><abbr bid="B76">76</abbr></abbrgrp>, i.e. well in excess of 1 million. This is broadly consistent with previous estimates from mitochondrial sequence, but is much larger than estimates based on microsatellites and allozyme variants [reviewed in <abbrgrp><abbr bid="B77">77</abbr></abbrgrp>].</p>
            <p>Incipient speciation between the <it>M </it>and <it>S </it>molecular forms of <it>An. gambiae </it>is a major focus of ongoing research <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr></abbrgrp>, culminating in the recently completed sequencing of the <it>M </it>and <it>S </it>molecular-form genomes <abbrgrp><abbr bid="B78">78</abbr></abbrgrp>. Although differentiation between the <it>M </it>and <it>S </it>form of <it>An. gambiae </it>in West Africa is extremely low (<it>K</it><sub><it>ST </it></sub>= 0.02; Figure <figr fid="F2">2</figr>), it is consistently non-zero at some loci, even where the lineages are sympatric (see e.g. <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>). This unambiguously identifies <it>M </it>and <it>S </it>form <it>An. gambiae </it>as being (at least partly) reproductively isolated <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>, and it has been argued that different <it>M </it>and <it>S</it>-form niches may be distinguishable <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Moreover, differentiation is variable around the genome, being higher at so-called 'islands of speciation', potentially associated with adaptive differences <abbrgrp><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr></abbrgrp>. However, other studies have shown that in some geographic regions microclimate is a better predictor of population divergence than is molecular form <abbrgrp><abbr bid="B79">79</abbr></abbrgrp>, and it is clear that the S-form of <it>An. gambiae </it>is not a single homogenous lineage <abbrgrp><abbr bid="B66">66</abbr><abbr bid="B80">80</abbr></abbrgrp>. Indeed, it is well-established that there is extensive genetic differentiation associated with the Great Rift Valley <abbrgrp><abbr bid="B80">80</abbr></abbrgrp>, and consistent with this, the data presented here not only identify considerable differentiation between East and West Africa (KY S-form <it>versus </it>BK M-form, <it>K</it><sub><it>ST </it></sub>= 0.12), but also show that West African <it>M</it>-form and <it>S</it>-form are less differentiated from each other than either is from East African S-form (Figure <figr fid="F2">2</figr>). This suggests either that the West African <it>M </it>and <it>S </it>lineages share a more recent common ancestor, or alternatively that they have experienced considerable recent gene flow [<abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, e.g. <abbrgrp><abbr bid="B80">80</abbr></abbrgrp>, but see also <abbrgrp><abbr bid="B81">81</abbr></abbrgrp>]. For <it>An. arabiensis</it>, differentiation across the width of the continent is only <it>K</it><sub><it>ST </it></sub>= 0.08, which is very similar to that between Eastern and Western <it>S</it>-form <it>An. gambiae </it>(<it>K</it><sub><it>ST </it></sub>= 0.07; Figure <figr fid="F2">2</figr>), and approximately twice that between <it>D. melanogaster </it>populations sampled across a similar geographic range (Gabon vs. Kenya or Zimbabwe, <it>K</it><sub><it>ST </it></sub>= 0.04) [P. R. Haddrill pers. comm., data from <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>].</p>
            <p>Most populations showed a slight skew in the allele-frequency spectrum toward low-frequency variants (i.e., average Tajima's <it>D </it>was negative), consistent either with population growth, or with weak selection against some synonymous variants. In contrast, however, the data presented here also identify a strong skew toward intermediate frequency alleles in <it>An. gambiae </it>population KY (i.e., a positive Tajima's <it>D</it>, 0.77 averaged across loci, 95% bootstrap interval 0.40&#8211;1.15). One potential explanation for this is that population KY is admixed or contains cryptic population structure, for example as would be the case if two divergent lineages of <it>S</it>-form are coexisting there. Alternatively, a positive Tajima's <it>D </it>could also result from a recent decrease in effective population size.</p>
            <p>The phylogenetic and phylogeographic complexity of the <it>An. gambiae </it>species group means that inferences drawn from single individuals should be treated with caution, as the low differentiation between species and high diversity within species means that any one individual is not necessarily typical or representative [e.g. <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>]. For example, the recently sequenced <it>M </it>and <it>S</it>-form genomes <abbrgrp><abbr bid="B78">78</abbr></abbrgrp> were both obtained from mosquitoes sampled in Mali, but <it>S</it>-form divergence between East and West Africa is considerably greater than <it>M</it>-<it>S </it>divergence within West Africa (see Figure <figr fid="F2">2</figr> above, and <abbrgrp><abbr bid="B66">66</abbr><abbr bid="B80">80</abbr></abbrgrp>, cf. <abbrgrp><abbr bid="B81">81</abbr></abbrgrp>). It is therefore not clear that any conclusions regarding <it>M </it>- <it>S </it>genome divergence will generalize to S-form individuals from the east African coast. The same issue arises with inter-species comparisons. For example, the divergence between two randomly sampled <it>An. gambiae </it>genomes is <it>K</it><sub><it>S</it></sub>~3%, and that between one randomly selected <it>An. gambiae </it>genome and one randomly selected <it>An. quadriannulatus </it>genome is only <it>K</it><sub><it>S</it></sub>~4% (above, and Figure <figr fid="F3">3</figr>). This means that a single inbred strain of <it>An. quadriannulatus </it>is only marginally more informative about <it>An. quadriannulatus </it>than it is about <it>An. gambiae</it>, and without more extensive sampling it cannot reliably be used to identify genetic differences between the species [cf. <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>]. These issues will be of paramount importance in analysing the recently approved complete genomes of <it>An. arabiensis</it>, <it>An. quadriannulatus </it>and <it>An. merus </it><abbrgrp><abbr bid="B82">82</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Evidence for adaptive evolution in <it>Anopheles </it>serpins</p>
            </st>
            <p>There is considerable evidence from other taxa that serpins are an evolutionarily dynamic gene family, with high turnover between lineages and occasional lineage-specific expansions (e.g. <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, see <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> for a review). For example, although most <it>Anopheles </it>and <it>Aedes </it>serpins have 1:1 orthologs, there are 29 serpins in <it>D. melanogaster </it>but only 18 in <it>An. gambiae</it>, and very few mosquito serpins have 1:1 orthologs in <it>Drosophila </it><abbrgrp><abbr bid="B83">83</abbr></abbrgrp>. Some serpins also show very high rates of adaptive evolution, such as <it>Drosophila </it>Spn28D [CG7219 in ref. <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>], and it has been suggested that, in general, rapid turnover and strong selection in serpins may be associated with serpin immune function, and could be driven by an evolutionary 'arms-race' <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. In <it>An. gambiae</it>, three serpins are known to have immune-related function in response to <it>Plasmodium </it>infection (SRPN2, SRPN6 and SRPN10 in <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B26">26</abbr><abbr bid="B28">28</abbr></abbrgrp>).</p>
            <p>Strong selection can affect patterns of genetic diversity, both between populations, and between chromosomal inversions. In <it>An. gambiae </it>population KY, considerable differentiation was identified between 2La and 2L+<sup>a </sup>homozygotes around the SRPN1, 2 and 3 cluster (and control loci 1, 2 and 3). For these loci <it>K</it><sub><it>ST </it></sub>= 0.25 between inversion states, which is actually higher than the overall differentiation between <it>An. gambiae </it>and <it>An. arabiensis</it>, and twice as high as the differentiation between <it>M</it>-form and <it>S</it>-form <it>An. gambiae</it>. No such differentiation was seen for other loci on chromosome arm 2L, indicating that this is strongly associated with the inversion. Although chromosomal inversions are in general expected to suppress recombination, especially near breakpoints, this is unlikely to lead to extreme or long-term differentiation unless maintained by selection [e.g. <abbrgrp><abbr bid="B84">84</abbr><abbr bid="B85">85</abbr></abbrgrp>]. In particular, despite recombination being suppressed in heterozygous individuals, genetic exchange (including recombination and gene-conversion) within the region of the 2La/+ inversion is not zero <abbrgrp><abbr bid="B86">86</abbr></abbrgrp>, and this should allow such differentiation to break down rapidly if it is not selectively maintained. The finding of elevated differentiation around these loci agrees with previous analyses, which found the SRPN1-3 cluster to be close to the region of highest differentiation between 2La and 2L+<sup>a</sup>. <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. While this doesn't provide strong evidence that any of these serpins are being directly selected, it is interesting to note that SRPN2 is required for successful infection of <it>An. gambiae </it>by <it>P. berghei </it><abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, and that 2La inversion-status was identified with vector-competence in some early studies <abbrgrp><abbr bid="B87">87</abbr><abbr bid="B88">88</abbr></abbrgrp>. Thus it is possible that genetic structuring in serpins 1&#8211;3, introduced and maintained by the 2La inversion, may affect variation in vector competence, even if the underlying cause of 2La/+ differentiation is elsewhere.</p>
            <p>In contrast, using a McDonald-Kreitman based approach to detecting adaptive substitutions <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>, no strong or consistent evidence of selection could be detected, nor could differences in the rate of adaptive evolution between serpins and other genes (or between immune serpins and other genes; Table <tblr tid="T2">2</tblr>). Specifically, in three of the four species pairings that were analysed, the rate of adaptive evolution could not be distinguished from zero (Table <tblr tid="T2">2</tblr>). This may indicate that neither <it>Anopheles </it>immune-related serpins, nor <it>Anopheles </it>serpins as a family, are subject to selection for rapid change, and consequently that all selection acting on these genes is purifying. However, it may also be an artefactual result arising either from limitations of the McDonald-Kreitman framework, or from issues specific to the gambiae species complex (see next section). In particular, the McDonald-Kreitman approach assumes that non-synonymous substitutions can be divided into three classes: strongly advantageous mutations that fix rapidly, strongly deleterious mutations that are rapidly lost, and effectively neutral mutations that drift in frequency <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. If there is also a large class of weakly deleterious mutations that remain polymorphic for an extended period, but are lost by selection in the long term, then this will reduce estimates of &#945; <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>.</p>
            <p>In agreement with this, a trend toward higher amino-acid diversity was found in serpins, as compared to other genes, in all three species (<it>An. gambiae</it>, <it>An. arabiensis </it>and <it>An. melas</it>; Figure <figr fid="F1">1b</figr>). This could suggest that purifying selection on serpins is weak or intermittent (as compared to purifying selection on other genes), or that there is selection favouring diversity in serpins, such as balancing selection <abbrgrp><abbr bid="B89">89</abbr></abbrgrp>. However, if the latter were the case, then one might also expect to find an increase in diversity at linked synonymous sites, and although slightly higher, synonymous diversity in serpins did not differ significantly from other genes (Figure <figr fid="F1">1a</figr>), suggesting that they neither experience long-term selection for increased polymorphism, nor undergo more frequent selective sweeps [e.g. <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>]. Moreover, in no population or species did a serpin display the highest or lowest neutral genetic diversity, and there was no clear pattern in the allele frequency spectrum (as measured by Tajima's <it>D </it>statistic) or inter-population differentiation (measured by <it>K</it><sub><it>ST</it></sub>) that supports the notion of strong selection acting on <it>Anopheles </it>serpins. These data therefore fail to identify any serpins as candidates for recent strong selection within the gambiae complex.</p>
         </sec>
         <sec>
            <st>
               <p>Prospects and pitfalls in inferring adaptive evolution in <it>Anopheles</it></p>
            </st>
            <p>A null result in tests for selection may also result from low power or model violations associated with the phylogenetic and population history in the <it>An. gambiae </it>complex. Motivated by an interest in identifying targets of pathogen-mediated selection, several studies have now attempted to identify adaptive evolution in immune-related genes from the <it>Anopheles gambiae </it>species complex <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B90">90</abbr><abbr bid="B91">91</abbr><abbr bid="B92">92</abbr></abbrgrp>. However, in stark contrast to almost identical studies in taxa such as <it>Drosophila </it>[for example <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B21">21</abbr><abbr bid="B93">93</abbr><abbr bid="B94">94</abbr><abbr bid="B95">95</abbr></abbrgrp>], to date there is very little clear evidence supporting adaptive evolution in the <it>An. gambiae </it>complex [but see <abbrgrp><abbr bid="B90">90</abbr><abbr bid="B92">92</abbr></abbrgrp>].</p>
            <p>One likely reason for the difference between <it>Anopheles </it>and <it>Drosophila </it>studies is the dearth of suitable outgroups for <it>An. gambiae </it>[e.g. <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>]. First, well-studied <it>Anopheles </it>species outside of the gambiae complex appear to be too distantly related to reliably infer the divergence between them. For example, the <it>An. stephensi </it>SRPN6 cDNA sequence <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> suggests that <it>K</it><sub><it>S </it></sub>between with <it>An. stephensi </it>and <it>An. gambiae </it>is ~1.33 (&#177; 0.157) substitutions per site when estimated by maximum likelihood <abbrgrp><abbr bid="B96">96</abbr></abbrgrp>, or 0.92 (&#177; 0.117) by the method of Li <abbrgrp><abbr bid="B97">97</abbr></abbrgrp>. Second, low divergence within the complex means there is little power to infer the substitution rate between them <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>. For example, across all 8 serpins appearing in the extensive survey of Cohuet <it>et al</it>. <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, there are only two fixed amino acid differences between <it>An. arabiensis </it>and <it>An. gambiae</it>. Similarly, the present study found no fixed differences at all in the same genes, probably because wider geographic sampling provided greater power to distinguish between polymorphisms and fixed differences. The stochastic errors involved here mean that estimates of substitution rate are likely to be wildly variable, reducing the power to estimate the fraction of adaptive substitutions using a McDonald-Kreitman framework (Figure <figr fid="F4">4</figr>).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>The power to estimate &#945; using <it>An. gambiae </it>and <it>An. arabiensis</it></p>
               </caption>
               <text>
                  <p><b>The power to estimate &#945; using <it>An. gambiae </it>and <it>An. arabiensis</it></b>. The relative log-likelihood of &#945; (the proportion of amino-acid substitutions that are adaptive) estimated using the modified McDonald-Kreitman approach <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. The grey curve is calculated from all 102 genes for which both <it>An. arabiensis </it>and <it>An. gambiae </it>population samples were available in the dataset of Cohuet et al. <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. The black curve shows an equivalent dataset of 102 genes from <it>Drosophila melanogaster </it>and <it>D. simulans</it>, with genes selected to be the same average length as those in the Cohuet dataset (D. J. Obbard, J. J. Welch and F. M. Jiggins, unpublished data). Despite both pairs of species having similar levels of diversity (<it>&#960;</it><sub><it>s </it></sub>from 1.6% to 2.9%), for the <it>Anopheles </it>dataset the bounds (2 units of log Likelihood) stretch from -0.33 to 0.32 (and include zero) while for <it>Drosophila </it>the bounds only stretch from 0.26 to 0.44, and the maximum-likelihood estimate of &#945; is 35%. The low precision in the second estimate reflects the very low power available due to the low divergence in <it>An. gambiae</it>-<it>An. arabiensis </it>comparisons</p>
               </text>
               <graphic file="1475-2875-8-117-4"/>
            </fig>
            <p>In principle, other species from within the gambiae complex might be informative outgroups for <it>An. gambiae </it>and <it>An. arabiensis</it>: although divergence is small (1 to 5%; Figure <figr fid="F3">3a</figr>), this may be sufficient in other taxa, such as the human-chimp comparison [e.g. <abbrgrp><abbr bid="B98">98</abbr></abbrgrp>]. However, unlike the human-chimp case, diversity in <it>Anopheles </it>is very high compared to humans (&#960;<sub>s</sub>~3% in <it>An. gambiae</it>, ~2% in <it>An. arabiensis</it>, Figure <figr fid="F1">1</figr>, c.f. ~0.1% in humans, e.g. <abbrgrp><abbr bid="B99">99</abbr></abbrgrp>), and this leads to two potential problems. First, a large proportion of apparent substitutions will in reality be polymorphisms <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B51">51</abbr></abbrgrp>, and although this effect is small enough to be negligible for pairs of species in which <it>K</it><sub><it>S </it></sub>>> <it>&#960;</it><sub><it>S</it></sub>, it becomes a concern when comparing <it>An. gambiae </it>to <it>An. arabiensis</it>, for which <it>K</it><sub><it>S </it></sub>&lt;<it>&#960;</it><sub><it>S </it></sub>(Figure <figr fid="F3">3</figr>). However, as here, this effect can be accounted for using models which include the sample size and diversity, and thereby infer the 'true' number of substitutions <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B51">51</abbr></abbrgrp>. Second, and potentially more serious, is the opportunity for extensive shared polymorphism. The McDonald-Kreitman framework uses information from current diversity (i.e. &#960;<sub>s </sub>and &#960;<sub>a</sub>) to infer whether some proportion of historic substitutions (<it>K</it><sub><it>A</it></sub>) cannot be explained by purely neutral processes. This model implicitly assumes a time period when the two lineages were diverging from their common ancestor, during which selection (that is to be detected) was able to act. If there is extensive shared polymorphism between the species, for example if gene trees are rarely reciprocally monophyletic (as is the case with <it>An. gambiae </it>and <it>An. arabiensis</it>; see Figure <figr fid="F3">3b&#8211;d</figr> and e.g. <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>), then it is hard to see how the McDonald-Kreitman approach can ever be usefully applied. In other words, unlike the straight-forward cases where divergence is too high (the branch is too long) or divergence is too low (the branch is too short), for <it>An. gambiae </it>and <it>An. arabiensis</it>, in effect there is no branch at all (Figure <figr fid="F3">3b&#8211;d</figr>).</p>
            <p>Unfortunately, for similar reasons, there are also serious concerns about the application of other selection-inference methods to the gambiae complex, such as the phylogenetic methods implemented in PAML <abbrgrp><abbr bid="B96">96</abbr></abbrgrp> and HyPhy <abbrgrp><abbr bid="B100">100</abbr></abbrgrp>. These methods use multiple sequences related by a gene (or species) tree to infer relative rates of synonymous and non-synonymous substitution, allowing variable rates at different sites or in different parts of the gene. Most phylogenetic methods assume there is no recombination within loci [but see OmegaMap, <abbrgrp><abbr bid="B101">101</abbr></abbrgrp>], and simulation suggests false positives can reach >50% when 2<it>N</it><sub><it>e</it></sub><it>r </it>> 0.01 <abbrgrp><abbr bid="B102">102</abbr></abbrgrp>. Because 2<it>N</it><sub><it>e</it></sub><it>r </it>(i.e: 2 &#215; effective population size &#215; recombination rate per codon per generation) in <it>An. gambiae </it>is likely to be of the order 0.01 &#8211; 0.1 or higher &#8211; primarily due to the large effective population size &#8211; such phylogenetic approaches cannot be applied reliably to within-species <it>Anopheles </it>data.</p>
            <p>Additionally, where the McDonald-Kreitman framework assumes that between-species <it>K</it><sub><it>A </it></sub>results from the joint action of selection and drift, and within-species <it>&#960;</it><sub><it>a </it></sub>results only from drift, the phylogenetic approaches (as they are most commonly applied) assume either that all amino acid variants have the same cause (i.e. <it>K</it><sub><it>A </it></sub>and <it>&#960;</it><sub><it>a </it></sub>do not provide independent information about selection and drift) or that most differences are fixed between species, (i.e. <it>K</it><sub><it>A </it></sub>>> <it>&#960;</it><sub><it>a</it></sub>, such that <it>&#960;</it><sub><it>a </it></sub>is negligible). Thus the phylogenetic and McDonald-Kreitman approaches constitute very different models that lend themselves to different datasets, and it is not clear that any single dataset can reasonably be analysed using both. Instead, to analyse a dataset that includes substantial within-species sampling using a phylogenetic approach, it may be more rational to fit a model which allows the relative rates of synonymous and non-synonymous substitution to differ between within- and between-species branches [e.g. <abbrgrp><abbr bid="B103">103</abbr></abbrgrp>]. However, in the case of the <it>gambiae </it>complex, the low power associated with the low inter-species divergence will then be encountered again.</p>
            <p>If McDonald-Kreitman tests have relatively low power within the gambiae complex, and phylogenetic methods cannot easily be applied to within-species data, how can selection be inferred from <it>Anopheles </it>population genetic data? One possibility is to use approaches based solely on within-population diversity to identify recent selective sweeps or regions of elevated polymorphism, and this will work for some loci [e.g. TEP1, <abbrgrp><abbr bid="B92">92</abbr></abbrgrp>], though the possibility of introgression, and/or chromosomal inversions that might affect the distribution of diversity, should then be taken into account. Nevertheless, at present it seems the best options for outgroup-based analyses are <it>An. merus </it>and/or <it>An. melas</it>, and both the data presented here (Figure <figr fid="F3">3</figr>) and previous studies [e.g. <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B66">66</abbr></abbrgrp>] suggest that their divergence from the <it>An. gambiae</it>/<it>An. arabiensis </it>clade should be sufficient in the case of genes evolving under very strong selection. However, the recently approved genome sequences from 13 more species of <it>Anopheles </it>mosquitoes <abbrgrp><abbr bid="B82">82</abbr></abbrgrp> may hold the solution, and particularly the complete genome sequence of <it>Anopheles sundaicus </it>(also subgenus <it>Cellia</it>, <it>Pyretophorous </it>series).</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The authors declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>DJO selected the loci, designed the primers, and performed all PCR, sequencing and analysis. JJW developed the likelihood-based methods for inferring the proportion of adaptive substitutions, and provided statistical support. TJL conceived the project, and all authors contributed to the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank H. M. Ferguson and N. Tuno for providing field samples of <it>Anopheles </it>mosquitoes, R. Maharaj and W. Takken for providing preserved mosquitoes from laboratory culture, and G. Yan for ongoing logistical support with fieldwork. We thank Abraham Eappen and Marcelo Jacobs-Lorena for providing the cDNA sequence of <it>An. Stephensi </it>SRPN6, Asher Cutter and Penny Haddrill for unpublished <it>K</it><sub><it>ST </it></sub>statistics, and Mike Kanost and Kristin Michel for sharing pre-publication data on <it>Anopheles </it>serpin classification and function. We thank an anonymous reviewer for helpful comments on the manuscript, and Marcel Hommel for careful copyediting. This work was funded by Wellcome Trust Grant 073210 to TJL, and DJO also received Wellcome Trust funding in the form of 'Value In People' bridging salary and Wellcome Trust Research Career Development Fellowship 085064/Z/08/Z. JJW is funded by BBSRC grant DO17750 awarded to Andrew Rambaut.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>A major genetic locus controlling natural <it>Plasmodium falciparum </it>infection is shared by East and West African <it>Anopheles gambiae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Riehle</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Markianos</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Lambrechts</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Xia</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sharakhov</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Koella</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vernick</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Malar J</source>
            <pubdate>2007</pubdate>
            <volume>6</volume>
            <fpage>87</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1936428</pubid>
                  <pubid idtype="pmpid" link="fulltext">17612409</pubid>
                  <pubid idtype="doi">10.1186/1475-2875-6-87</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p><it>Anopheles gambiae </it>immune responses to human and rodent <it>Plasmodium </it>parasite species</p>
            </title>
            <aug>
               <au>
                  <snm>Dong</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Aguilar</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Xi</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Warr</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Mongin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Dimopoulos</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>PLoS Pathog</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <fpage>e52</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1475661</pubid>
                  <pubid idtype="pmpid" link="fulltext">16789837</pubid>
                  <pubid idtype="doi">10.1371/journal.ppat.0020052</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Mosquito immunity against <it>Plasmodium </it></p>
            </title>
            <aug>
               <au>
                  <snm>Michel</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kafatos</snm>
                  <fnm>FC</fnm>
               </au>
            </aug>
            <source>Insect Bioch Mol Biol</source>
            <pubdate>2005</pubdate>
            <volume>35</volume>
            <fpage>677</fpage>
            <lpage>689</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.ibmb.2005.02.009</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Effects of mosquito genes on <it>Plasmodium </it>development</p>
            </title>
            <aug>
               <au>
                  <snm>Osta</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Christophides</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Kafatos</snm>
                  <fnm>FC</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>303</volume>
            <fpage>2030</fpage>
            <lpage>2032</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1091789</pubid>
                  <pubid idtype="pmpid" link="fulltext">15044804</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>The evolution of immune-related genes from disease carrying mosquitoes: diversity in a peptidoglycan- and a thioester-recognizing protein</p>
            </title>
            <aug>
               <au>
                  <snm>Little</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Cobbe</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Insect Mol Biol</source>
            <pubdate>2005</pubdate>
            <volume>14</volume>
            <fpage>599</fpage>
            <lpage>605</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2583.2005.00588.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">16313560</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>SNP discovery and molecular evolution in <it>Anopheles gambiae</it>, with special emphasis on innate immune system</p>
            </title>
            <aug>
               <au>
                  <snm>Cohuet</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Krishnakumar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Simard</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Morlais</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Koutsos</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fontenille</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Mindrinos</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kafatos</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <fpage>227</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2405807</pubid>
                  <pubid idtype="pmpid" link="fulltext">18489733</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-9-227</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The genomic rate of adaptive evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Eyre-Walker</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Trends Ecol Evolution  Trends Ecol Evol. 2006 Oct;21(10):569-75</source>
            <pubdate>2006</pubdate>
            <volume>21</volume>
            <issue>10</issue>
            <fpage>569</fpage>
            <lpage>575</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tree.2006.06.015</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Molecular signatures of natural selection</p>
            </title>
            <aug>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Ann Rev Genetics</source>
            <pubdate>2005</pubdate>
            <volume>39</volume>
            <fpage>197</fpage>
            <lpage>218</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1146/annurev.genet.39.073003.112420</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p><it>Anopheles </it>mosquitoes: not just flying malaria vectors especially in the field</p>
            </title>
            <aug>
               <au>
                  <snm>Bo&#235;te</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Trends Parasitol</source>
            <pubdate>2009</pubdate>
            <volume>25</volume>
            <fpage>53</fpage>
            <lpage>55</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.pt.2008.10.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">19095498</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Natural selection drives extremely rapid evolution in antiviral RNAi genes</p>
            </title>
            <aug>
               <au>
                  <snm>Obbard</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Jiggins</snm>
                  <fnm>FM</fnm>
               </au>
               <au>
                  <snm>Halligan</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Little</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Current Biol</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>580</fpage>
            <lpage>585</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.cub.2006.01.065</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Dynamic evolution of the innate immune system in <it>Drosophila </it></p>
            </title>
            <aug>
               <au>
                  <snm>Sackton</snm>
                  <fnm>TB</fnm>
               </au>
               <au>
                  <snm>Lazzaro</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Schlenke</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Hultmark</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>Nature Genetics</source>
            <pubdate>2007</pubdate>
            <volume>39</volume>
            <fpage>1461</fpage>
            <lpage>1468</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng.2007.60</pubid>
                  <pubid idtype="pmpid" link="fulltext">17987029</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Serpin structure, mechanism, and function</p>
            </title>
            <aug>
               <au>
                  <snm>Gettins</snm>
                  <fnm>PGW</fnm>
               </au>
            </aug>
            <source>Chem Rev</source>
            <pubdate>2002</pubdate>
            <volume>102</volume>
            <fpage>4751</fpage>
            <lpage>4803</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/cr010170+</pubid>
                  <pubid idtype="pmpid" link="fulltext">12475206</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Evolutionary mechanisms acting on proteinase inhibitor variability</p>
            </title>
            <aug>
               <au>
                  <snm>Christeller</snm>
                  <fnm>JT</fnm>
               </au>
            </aug>
            <source>FEBS J</source>
            <pubdate>2005</pubdate>
            <volume>272</volume>
            <fpage>5710</fpage>
            <lpage>5722</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1742-4658.2005.04975.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">16279937</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>The role of serpins in vertebrate immunity</p>
            </title>
            <aug>
               <au>
                  <snm>Mangan</snm>
                  <fnm>MSJ</fnm>
               </au>
               <au>
                  <snm>Kaiserman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bird</snm>
                  <fnm>PI</fnm>
               </au>
            </aug>
            <source>Tissue Antigens</source>
            <pubdate>2008</pubdate>
            <volume>72</volume>
            <fpage>1</fpage>
            <lpage>10</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1399-0039.2008.01059.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">18498291</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Cell-mediated immunity in arthropods: Hematopoiesis, coagulation, melanization and opsonization</p>
            </title>
            <aug>
               <au>
                  <snm>Jiravanichpaisal</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>S&#246;derh&#228;ll</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Immunobiology</source>
            <pubdate>2006</pubdate>
            <volume>211</volume>
            <fpage>213</fpage>
            <lpage>236</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.imbio.2005.10.015</pubid>
                  <pubid idtype="pmpid" link="fulltext">16697916</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Constitutive activation of toll-mediated antifungal defense in serpin-deficient <it>Drosophila </it></p>
            </title>
            <aug>
               <au>
                  <snm>Levashina</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Langley</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gubb</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hoffmann</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Reichhart</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1999</pubdate>
            <volume>285</volume>
            <fpage>1917</fpage>
            <lpage>1919</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.285.5435.1917</pubid>
                  <pubid idtype="pmpid" link="fulltext">10489372</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Activation of <it>Drosophila </it>Toll during fungal infection by a blood serine protease</p>
            </title>
            <aug>
               <au>
                  <snm>Ligoxygakis</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Pelte</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hoffmann</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Reichhart</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>297</volume>
            <fpage>114</fpage>
            <lpage>116</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1072391</pubid>
                  <pubid idtype="pmpid" link="fulltext">12098703</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>An immune-responsive serpin regulates the melanization cascade in <it>Drosophila </it></p>
            </title>
            <aug>
               <au>
                  <snm>De Gregorio</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Baek</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Osaki</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kawabata</snm>
                  <fnm>SI</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Iwanaga</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lemaitre</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Brey</snm>
                  <fnm>PT</fnm>
               </au>
            </aug>
            <source>Developmental Cell</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>581</fpage>
            <lpage>592</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1534-5807(02)00267-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">12408809</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>A serpin mutant links Toll activation to melanization in the host defence of <it>Drosophila </it></p>
            </title>
            <aug>
               <au>
                  <snm>Ligoxygakis</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Pelte</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Ji</snm>
                  <fnm>CY</fnm>
               </au>
               <au>
                  <snm>Leclerc</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Duvic</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Belvin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>HB</fnm>
               </au>
               <au>
                  <snm>Hoffmann</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Reichhart</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>2002</pubdate>
            <volume>21</volume>
            <fpage>6330</fpage>
            <lpage>6337</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">136964</pubid>
                  <pubid idtype="pmpid">12456640</pubid>
                  <pubid idtype="doi">10.1093/emboj/cdf661</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Genome-wide analysis of the <it>Drosophila </it>immune response by using oligonucleotide microarrays</p>
            </title>
            <aug>
               <au>
                  <snm>De Gregorio</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Spellman</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Lemaitre</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>12590</fpage>
            <lpage>12595</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">60098</pubid>
                  <pubid idtype="pmpid" link="fulltext">11606746</pubid>
                  <pubid idtype="doi">10.1073/pnas.221458698</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>A screen for immunity genes evolving under positive selection in <it>Drosophila </it></p>
            </title>
            <aug>
               <au>
                  <snm>Jiggins</snm>
                  <fnm>FM</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>KW</fnm>
               </au>
            </aug>
            <source>J Evolut Biol</source>
            <pubdate>2007</pubdate>
            <volume>20</volume>
            <fpage>965</fpage>
            <lpage>970</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/j.1420-9101.2007.01305.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Evolutionary rate analyses of orthologs and paralogs from 12 <it>Drosophila </it>genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Heger</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ponting</snm>
                  <fnm>CP</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <note>gr.6249707.</note>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Functional diversification of a protease inhibitor gene in the genus <it>Drosophila </it>and its molecular basis</p>
            </title>
            <aug>
               <au>
                  <snm>Borner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ragg</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2008</pubdate>
            <volume>415</volume>
            <fpage>23</fpage>
            <lpage>31</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.gene.2008.02.004</pubid>
                  <pubid idtype="pmpid" link="fulltext">18395367</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Cloning and characterization of four <it>Anopheles </it>gambiae serpin isoforms, differentially induced in the midgut by <it>Plasmodium berghei </it>invasion</p>
            </title>
            <aug>
               <au>
                  <snm>Danielli</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kafatos</snm>
                  <fnm>FC</fnm>
               </au>
               <au>
                  <snm>Loukeris</snm>
                  <fnm>TG</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2003</pubdate>
            <volume>278</volume>
            <fpage>4184</fpage>
            <lpage>4193</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M208187200</pubid>
                  <pubid idtype="pmpid" link="fulltext">12456678</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Overexpression and altered nucleocytoplasmic distribution of <it>Anopheles </it>ovalbumin-like SRPN10 serpins in <it>Plasmodium</it>-infected midgut cells</p>
            </title>
            <aug>
               <au>
                  <snm>Danielli</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Barillas-Mury</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kafatos</snm>
                  <fnm>FC</fnm>
               </au>
               <au>
                  <snm>Loukeris</snm>
                  <fnm>TG</fnm>
               </au>
            </aug>
            <source>Cell Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>7</volume>
            <fpage>181</fpage>
            <lpage>190</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15659062</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>An immune-responsive serpin, SRPN6, mediates mosquito defense against malaria parasites</p>
            </title>
            <aug>
               <au>
                  <snm>Abraham</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Pinto</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Ghosh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Vanlandingham</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Budd</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Higgs</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kafatos</snm>
                  <fnm>FC</fnm>
               </au>
               <au>
                  <snm>Jacobs-Lorena</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Michel</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>16327</fpage>
            <lpage>16332</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1283470</pubid>
                  <pubid idtype="pmpid" link="fulltext">16260729</pubid>
                  <pubid idtype="doi">10.1073/pnas.0508335102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>The parasite invasion marker SRPN6 reduces sporozoite numbers in salivary glands of <it>Anopheles gambiae </it></p>
            </title>
            <aug>
               <au>
                  <snm>Pinto</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Kafatos</snm>
                  <fnm>FC</fnm>
               </au>
               <au>
                  <snm>Michel</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Cell Microbiol</source>
            <pubdate>2008</pubdate>
            <volume>10</volume>
            <fpage>891</fpage>
            <lpage>898</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1462-5822.2007.01091.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">18005239</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p><it>Anopheles gambiae </it>SRPN2 facilitates midgut invasion by the malaria parasite <it>Plasmodium berghei</it></p>
            </title>
            <aug>
               <au>
                  <snm>Michel</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Budd</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pinto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Kafatos</snm>
                  <fnm>FC</fnm>
               </au>
            </aug>
            <source>EMBO Reports</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>891</fpage>
            <lpage>897</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1369158</pubid>
                  <pubid idtype="pmpid" link="fulltext">16113656</pubid>
                  <pubid idtype="doi">10.1038/sj.embor.7400478</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Adaptive protein evolution at the adh locus in <it>Drosophila </it></p>
            </title>
            <aug>
               <au>
                  <snm>McDonald</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Kreitman</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1991</pubdate>
            <volume>351</volume>
            <fpage>652</fpage>
            <lpage>654</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/351652a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">1904993</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Estimating the genomewide rate of adaptive protein evolution in <it>Drosophila </it></p>
            </title>
            <aug>
               <au>
                  <snm>Welch</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2006</pubdate>
            <volume>173</volume>
            <fpage>821</fpage>
            <lpage>837</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1526489</pubid>
                  <pubid idtype="pmpid">16582427</pubid>
                  <pubid idtype="doi">10.1534/genetics.106.056911</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Localization of candidate regions maintaining a common polymorphic inversion (2La) in <it>Anopheles gambiae </it></p>
            </title>
            <aug>
               <au>
                  <snm>White</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Hahn</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Pombi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cassone</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Lobo</snm>
                  <fnm>NF</fnm>
               </au>
               <au>
                  <snm>Simard</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Besansky</snm>
                  <fnm>NJ</fnm>
               </au>
            </aug>
            <source>Plos Genetics</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <fpage>2404</fpage>
            <lpage>2414</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1371/journal.pgen.0030217</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Population genetics of <it>Plasmodium </it>resistance genes in <it>Anopheles gambiae</it>: no evidence for strong selection</p>
            </title>
            <aug>
               <au>
                  <snm>Obbard</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Linton</snm>
                  <fnm>YM</fnm>
               </au>
               <au>
                  <snm>Jiggins</snm>
                  <fnm>FM</fnm>
               </au>
               <au>
                  <snm>Yan</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Little</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Molecular Ecology</source>
            <pubdate>2007</pubdate>
            <volume>16</volume>
            <fpage>3497</fpage>
            <lpage>3510</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-294X.2007.03395.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">17688548</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The molecular evolution of four anti-malarial immune genes in the <it>Anopheles gambiae </it>species complex</p>
            </title>
            <aug>
               <au>
                  <snm>Parmakelis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Slotman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Awono-Ambene</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Antonio-Nkondjio</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Simard</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Caccone</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Powell</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>BMC Evolutionary Biology</source>
            <pubdate>2008</pubdate>
            <volume>8</volume>
            <fpage>79</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2288592</pubid>
                  <pubid idtype="pmpid" link="fulltext">18325105</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-8-79</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Molecular Evolution of Immune Genes in the Malaria Mosquito <it>Anopheles gambiae </it></p>
            </title>
            <aug>
               <au>
                  <snm>Lehmann</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hume</snm>
                  <fnm>JCC</fnm>
               </au>
               <au>
                  <snm>Licht</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Burns</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Wollenberg</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Simard</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Ribeiro</snm>
                  <fnm>JMC</fnm>
               </au>
            </aug>
            <source>PLoS ONE</source>
            <pubdate>2009</pubdate>
            <volume>4</volume>
            <fpage>e4549</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2642720</pubid>
                  <pubid idtype="pmpid" link="fulltext">19234606</pubid>
                  <pubid idtype="doi">10.1371/journal.pone.0004549</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Identification of single specimens of the <it>Anopheles gambiae </it>complex by the polymerase chain reaction</p>
            </title>
            <aug>
               <au>
                  <snm>Scott</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Brogdon</snm>
                  <fnm>WG</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>FH</fnm>
               </au>
            </aug>
            <source>Am J Trop Med Hyg</source>
            <pubdate>1993</pubdate>
            <volume>49</volume>
            <fpage>520</fpage>
            <lpage>529</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8214283</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Molecular identification of sympatric chromosomal forms of <it>Anopheles gambiae </it>and further evidence of their reproductive isolation</p>
            </title>
            <aug>
               <au>
                  <snm>Favia</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>della Torre</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bagayoko</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lanfrancotti</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sagnon</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Toure</snm>
                  <fnm>YT</fnm>
               </au>
               <au>
                  <snm>Coluzzi</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Insect Mol Biol</source>
            <pubdate>1997</pubdate>
            <volume>6</volume>
            <fpage>377</fpage>
            <lpage>383</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2583.1997.00189.x</pubid>
                  <pubid idtype="pmpid">9359579</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>The molecular forms of <it>Anopheles gambiae</it>: A phenotypic perspective</p>
            </title>
            <aug>
               <au>
                  <snm>Lehmann</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Diabate</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Infection, Genetics and Evolution</source>
            <pubdate>2008</pubdate>
            <volume>8</volume>
            <fpage>737</fpage>
            <lpage>746</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.meegid.2008.06.003</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Molecular karyotyping of the 2La inversion in <it>Anopheles gambiae </it></p>
            </title>
            <aug>
               <au>
                  <snm>White</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Santolamazza</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kamau</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Pombi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Grushko</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Mouline</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Brengues</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Guelbeogo</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Coulibaly</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kayondo</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Sharakhov</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Simard</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Petrarca</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Della Torre</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Besansky</snm>
                  <fnm>NJ</fnm>
               </au>
            </aug>
            <source>Am J Trop Med Hyg</source>
            <pubdate>2007</pubdate>
            <volume>76</volume>
            <fpage>334</fpage>
            <lpage>339</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17297045</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Breakpoint structure reveals the unique origin of an interspecific chromosomal inversion (2<it>La</it>) in the <it>Anopheles gambiae </it>complex</p>
            </title>
            <aug>
               <au>
                  <snm>Sharakhov</snm>
                  <fnm>IV</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Sharakhova</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Kayondo</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lobo</snm>
                  <fnm>NF</fnm>
               </au>
               <au>
                  <snm>Santolamazza</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>della Torre</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Simard</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>FH</fnm>
               </au>
               <au>
                  <snm>Besansky</snm>
                  <fnm>NJ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>6258</fpage>
            <lpage>6262</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1458865</pubid>
                  <pubid idtype="pmpid" link="fulltext">16606844</pubid>
                  <pubid idtype="doi">10.1073/pnas.0509683103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Clarification of anomalies in the application of a 2La molecular karyotyping method for the malaria vector <it>Anopheles gambiae </it></p>
            </title>
            <aug>
               <au>
                  <snm>Ng'habi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Meneses</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Cornel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Slotman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Knols</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ferguson</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lanzaro</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Parasites &amp; Vectors</source>
            <pubdate>2008</pubdate>
            <volume>1</volume>
            <fpage>45</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2628339</pubid>
                  <pubid idtype="pmpid" link="fulltext">19091112</pubid>
                  <pubid idtype="doi">10.1186/1756-3305-1-45</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>The genome sequence of the malaria mosquito <it>Anopheles gambiae </it></p>
            </title>
            <aug>
               <au>
                  <snm>Holt</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Subramanian</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Halpern</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sutton</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Charlab</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nusskern</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Wincker</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Ribeiro</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Wides</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Loftus</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Yandell</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Majoros</snm>
                  <fnm>WH</fnm>
               </au>
               <au>
                  <snm>Rusch</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Lai</snm>
                  <fnm>ZW</fnm>
               </au>
               <au>
                  <snm>Kraft</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Abril</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Anthouard</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Arensburger</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Atkinson</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Baden</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>de Berardinis</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Baldwin</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Benes</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Biedler</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Blass</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bolanos</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Boscus</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Barnstead</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cai</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Center</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Chatuverdi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Christophides</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Chrystal</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Clamp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cravchik</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Curwen</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Dana</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Delcher</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dew</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Flanigan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Grundschober-Freimoser</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Friedli</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Gu</snm>
                  <fnm>ZP</fnm>
               </au>
               <au>
                  <snm>Guan</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Guigo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hillenmeyer</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Hladun</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Hogan</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Hong</snm>
                  <fnm>YS</fnm>
               </au>
               <au>
                  <snm>Hoover</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jaillon</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Ke</snm>
                  <fnm>ZX</fnm>
               </au>
               <au>
                  <snm>Kodira</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kokoza</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Koutsos</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Letunic</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Levitsky</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Lobo</snm>
                  <fnm>NF</fnm>
               </au>
               <au>
                  <snm>Lopez</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Malek</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>McIntosh</snm>
                  <fnm>TC</fnm>
               </au>
               <au>
                  <snm>Meister</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mobarry</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mongin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Murphy</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>O'Brochta</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Pfannkoch</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Qi</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Regier</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Remington</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shao</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>Sharakhova</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Sitter</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Shetty</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Strong</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Thomasova</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ton</snm>
                  <fnm>LQ</fnm>
               </au>
               <au>
                  <snm>Topalis</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Tu</snm>
                  <fnm>ZJ</fnm>
               </au>
               <au>
                  <snm>Unger</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Walenz</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>XL</fnm>
               </au>
               <au>
                  <snm>Woodford</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Wortman</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yao</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zdobnov</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>HY</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Zhimulev</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Coluzzi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>della Torre</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Roth</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Louis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kalush</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Mural</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>HO</fnm>
               </au>
               <au>
                  <snm>Broder</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gardner</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Brey</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Venter</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Weissenbach</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kafatos</snm>
                  <fnm>FC</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>FH</fnm>
               </au>
               <au>
                  <snm>Hoffman</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>129</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1076181</pubid>
                  <pubid idtype="pmpid" link="fulltext">12364791</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>A comparison of Bayesian methods for haplotype reconstruction from population genotype data</p>
            </title>
            <aug>
               <au>
                  <snm>Stephens</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet. </source>
            <pubdate>2003</pubdate>
            <volume>73</volume>
            <issue>5</issue>
            <fpage>1162</fpage>
            <lpage>1169</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/379378</pubid>
                  <pubid idtype="pmpid" link="fulltext">14574645</pubid>
                  <pubid idtype="pmcid">1180495</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>A new statistical method for haplotype reconstruction from population data</p>
            </title>
            <aug>
               <au>
                  <snm>Stephens</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet.</source>
            <pubdate>2001</pubdate>
            <volume>68</volume>
            <issue>4</issue>
            <fpage>978</fpage>
            <lpage>989</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/319501</pubid>
                  <pubid idtype="pmpid" link="fulltext">11254454</pubid>
                  <pubid idtype="pmcid">1275651</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>DnaSP, DNA polymorphism analyses by the coalescent and other methods</p>
            </title>
            <aug>
               <au>
                  <snm>Rozas</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sanchez-DelBarrio</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Messeguer</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Rozas</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <fpage>2496</fpage>
            <lpage>2497</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg359</pubid>
                  <pubid idtype="pmpid" link="fulltext">14668244</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Statistical-Method for Testing the Neutral Mutation Hypothesis by DNA Polymorphism</p>
            </title>
            <aug>
               <au>
                  <snm>Tajima</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1989</pubdate>
            <volume>123</volume>
            <fpage>585</fpage>
            <lpage>595</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1203831</pubid>
                  <pubid idtype="pmpid" link="fulltext">2513255</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>A statistical test for detecting geographic subdivision</p>
            </title>
            <aug>
               <au>
                  <snm>Hudson</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Boos</snm>
                  <fnm>DD</fnm>
               </au>
               <au>
                  <snm>Kaplan</snm>
                  <fnm>NL</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1992</pubdate>
            <volume>9</volume>
            <fpage>138</fpage>
            <lpage>151</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">1552836</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Evolution of human races at the gene level</p>
            </title>
            <aug>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Human genetics, part A: The unfolding genome</source>
            <publisher>New York.: Alan R. Liss</publisher>
            <editor>Bonne-Tamir B, Cohen T, Goodman RM</editor>
            <pubdate>1982</pubdate>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Excess amino acid polymorphism in mitochondrial DNA: contrasts among genes from <it>Drosophila</it>, mice, and humans</p>
            </title>
            <aug>
               <au>
                  <snm>Rand</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Kann</snm>
                  <fnm>LM</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1996</pubdate>
            <volume>13</volume>
            <fpage>735</fpage>
            <lpage>748</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8754210</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Adaptive protein evolution in <it>Drosophila </it></p>
            </title>
            <aug>
               <au>
                  <snm>Smith</snm>
                  <fnm>NGC</fnm>
               </au>
               <au>
                  <snm>Eyre-Walker</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>415</volume>
            <fpage>1022</fpage>
            <lpage>1024</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/4151022a</pubid>
                  <pubid idtype="pmpid" link="fulltext">11875568</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>The Genomic Rate of Adaptive Amino Acid Substitution in <it>Drosophila </it></p>
            </title>
            <aug>
               <au>
                  <snm>Bierne</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Eyre-Walker</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <fpage>1350</fpage>
            <lpage>1360</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh134</pubid>
                  <pubid idtype="pmpid" link="fulltext">15044594</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Population-Genetics Of Polymorphism And Divergence</p>
            </title>
            <aug>
               <au>
                  <snm>Sawyer</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Hartl</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1992</pubdate>
            <volume>132</volume>
            <fpage>1161</fpage>
            <lpage>1176</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1205236</pubid>
                  <pubid idtype="pmpid" link="fulltext">1459433</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <aug>
               <au>
                  <snm>Burnham</snm>
                  <fnm>KP</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Model Selection and Multi-model Inference: A Practical Information-theoretic Approach</source>
            <publisher>Springer</publisher>
            <edition>2</edition>
            <pubdate>2002</pubdate>
         </bibl>
         <bibl id="B53">
            <title>
               <p>MKtest</p>
            </title>
            <url>http://tree.bio.ed.ac.uk/software/mktest/</url>
         </bibl>
         <bibl id="B54">
            <aug>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Molecular Evolutionary Genetics</source>
            <publisher>New York: Columbia University Press</publisher>
            <pubdate>1987</pubdate>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Phylogeny of the Pyretophorus Series of <it>Anopheles </it>subgenus <it>Cellia </it>(Diptera: Culicidae)</p>
            </title>
            <aug>
               <au>
                  <snm>Anthony</snm>
                  <fnm>TG</fnm>
               </au>
               <au>
                  <snm>Harbach</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Kitching</snm>
                  <fnm>IJ</fnm>
               </au>
            </aug>
            <source>Systematic Entomol</source>
            <pubdate>1999</pubdate>
            <volume>24</volume>
            <fpage>193</fpage>
            <lpage>205</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-3113.1999.00092.x</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Review of the internal classification of the genus <it>Anopheles </it>(Diptera, Culicidae) &#8211; The foundation for comparative systematics and phylogenetic research</p>
            </title>
            <aug>
               <au>
                  <snm>Harbach</snm>
                  <fnm>RE</fnm>
               </au>
            </aug>
            <source>Bull Entomol Res</source>
            <pubdate>1994</pubdate>
            <volume>84</volume>
            <fpage>331</fpage>
            <lpage>342</lpage>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Evolution and Systematics of <it>Anopheles</it>: Insights from a Molecular Phylogeny of Australasian Mosquitoes</p>
            </title>
            <aug>
               <au>
                  <snm>Foley</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Bryan</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Yeates</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Saul</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Mol Phylogenet Evol</source>
            <pubdate>1998</pubdate>
            <volume>9</volume>
            <fpage>262</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/mpev.1997.0457</pubid>
                  <pubid idtype="pmpid" link="fulltext">9562985</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Locus- and Population-Specific Selection and Differentiation between Incipient Species of <it>Anopheles gambiae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Turner</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Hahn</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2007</pubdate>
            <volume>24</volume>
            <fpage>2132</fpage>
            <lpage>2138</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msm143</pubid>
                  <pubid idtype="pmpid" link="fulltext">17636041</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Genomic islands of speciation in <it>Anopheles gambiae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Turner</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Hahn</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Nuzhdin</snm>
                  <fnm>SV</fnm>
               </au>
            </aug>
            <source>PLoS Biology</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e285</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1182689</pubid>
                  <pubid idtype="pmpid" link="fulltext">16076241</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030285</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Speciation within <it>Anopheles gambiae </it>&#8211; the glass is half full</p>
            </title>
            <aug>
               <au>
                  <snm>della Torre</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Costantini</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Besansky</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Caccone</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Petrarca</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Powell</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Coluzzi</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>115</fpage>
            <lpage>117</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1078170</pubid>
                  <pubid idtype="pmpid" link="fulltext">12364784</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Transmission blocking immunity in the malaria non-vector mosquito <it>Anopheles quadriannulatus </it>species A</p>
            </title>
            <aug>
               <au>
                  <snm>Habtewold</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Povelones</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Blagborough</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Christophides</snm>
                  <fnm>GK</fnm>
               </au>
            </aug>
            <source>PLoS Pathog</source>
            <pubdate>2008</pubdate>
            <volume>4</volume>
            <fpage>8</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1371/journal.ppat.1000070</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Mapping the ranges and relative abundance of the two principal African malaria vectors, <it>Anopheles gambiae sensu stricto </it>and <it>An. arabiensis</it>, using climate data</p>
            </title>
            <aug>
               <au>
                  <snm>Lindsay</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Parson</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>CJ</fnm>
               </au>
            </aug>
            <source>Proc R Soc London Series B-Biol Sci</source>
            <pubdate>1998</pubdate>
            <volume>265</volume>
            <fpage>847</fpage>
            <lpage>854</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1098/rspb.1998.0369</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Insecticide resistance in <it>Anopheles gambiae </it>from south-western Chad, Central Africa</p>
            </title>
            <aug>
               <au>
                  <snm>Kerah-Hinzoumbe</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Peka</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nwane</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Donan-Gouni</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Etang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Same-Ekobo</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Simard</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Malar J</source>
            <pubdate>2008</pubdate>
            <volume>7</volume>
            <fpage>192</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2566574</pubid>
                  <pubid idtype="pmpid" link="fulltext">18823537</pubid>
                  <pubid idtype="doi">10.1186/1475-2875-7-192</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Multilevel analyses of genetic differentiation in <it>Anopheles gambiae </it>s.s. reveal patterns of gene flow important for malaria-fighting mosquito projects</p>
            </title>
            <aug>
               <au>
                  <snm>Tripet</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Dolo</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lanzaro</snm>
                  <fnm>GC</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2005</pubdate>
            <volume>169</volume>
            <fpage>313</fpage>
            <lpage>324</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1448890</pubid>
                  <pubid idtype="pmpid">15677750</pubid>
                  <pubid idtype="doi">10.1534/genetics.104.026534</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Immune system polymorphism: Implications for genetic engineering</p>
            </title>
            <aug>
               <au>
                  <snm>Little</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Genetically modified mosquitoes for malaria control</source>
            <publisher>Georgetown, Texas: Landes Bioscience</publisher>
            <editor>Boete C</editor>
            <pubdate>2006</pubdate>
            <fpage>36</fpage>
            <lpage>59</lpage>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Mosaic genome architecture of the <it>Anopheles gambiae </it>species complex</p>
            </title>
            <aug>
               <au>
                  <snm>Wang-Sattler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Blandin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ning</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Blass</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Dolo</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Toure</snm>
                  <fnm>YT</fnm>
               </au>
               <au>
                  <snm>Torre</snm>
                  <fnm>Ad</fnm>
               </au>
               <au>
                  <snm>Lanzaro</snm>
                  <fnm>GC</fnm>
               </au>
               <au>
                  <snm>Steinmetz</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Kafatos</snm>
                  <fnm>FC</fnm>
               </au>
               <au>
                  <snm>Zheng</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>PLoS ONE</source>
            <pubdate>2007</pubdate>
            <volume>2</volume>
            <fpage>e1249</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2082662</pubid>
                  <pubid idtype="pmpid" link="fulltext">18043756</pubid>
                  <pubid idtype="doi">10.1371/journal.pone.0001249</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Revisiting the role of introgression vs shared ancestral polymorphisms as key processes shaping genetic diversity in the recently separated sibling species of the <it>Anopheles gambiae </it>complex</p>
            </title>
            <aug>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Pinto</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Girod</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Besansky</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Lehmann</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Heredity</source>
            <pubdate>2004</pubdate>
            <volume>92</volume>
            <fpage>61</fpage>
            <lpage>68</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.hdy.6800377</pubid>
                  <pubid idtype="pmpid" link="fulltext">14666125</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Multilocus patterns of nucleotide variability and the demographic and selection history of <it>Drosophila melanogaster </it>populations</p>
            </title>
            <aug>
               <au>
                  <snm>Haddrill</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Thornton</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Andolfatto</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Research</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>790</fpage>
            <lpage>799</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1142469</pubid>
                  <pubid idtype="pmpid" link="fulltext">15930491</pubid>
                  <pubid idtype="doi">10.1101/gr.3541005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>Nucleotide polymorphism and linkage disequilibrium in wild populations of the partial selfer <it>Caenorhabditis elegans</it></p>
            </title>
            <aug>
               <au>
                  <snm>Cutter</snm>
                  <fnm>AD</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2006</pubdate>
            <volume>172</volume>
            <fpage>171</fpage>
            <lpage>184</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1456145</pubid>
                  <pubid idtype="pmpid">16272415</pubid>
                  <pubid idtype="doi">10.1534/genetics.105.048207</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>Differential introgression of chromsomal regions between <it>Anopheles gambiae </it>and <it>An. arabiensis</it></p>
            </title>
            <aug>
               <au>
                  <snm>Slotman</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Della Torre</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Calzetta</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Powell</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Am J Trop Med Hyg</source>
            <pubdate>2005</pubdate>
            <volume>73</volume>
            <fpage>326</fpage>
            <lpage>335</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16103599</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Female sterility in hybrids between <it>Anopheles gambiae </it>and <it>An. arabiensis</it>, and the causes of Haldane's rule</p>
            </title>
            <aug>
               <au>
                  <snm>Slotman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Della Torre</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Powell</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Evolution</source>
            <pubdate>2005</pubdate>
            <volume>59</volume>
            <fpage>1016</fpage>
            <lpage>1026</lpage>
            <xrefbib>
               <pubid idtype="pmpid">16136801</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>Semipermeable species boundaries between <it>Anopheles gambiae </it>and <it>Anopheles arabiensis</it>: Evidence from multilocus DNA sequence variation</p>
            </title>
            <aug>
               <au>
                  <snm>Besansky</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Krzywinski</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lehmann</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Simard</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kern</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mukabayire</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Fontenille</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Toure</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Sagnon</snm>
                  <fnm>NF</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>10818</fpage>
            <lpage>10823</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">196886</pubid>
                  <pubid idtype="pmpid" link="fulltext">12947038</pubid>
                  <pubid idtype="doi">10.1073/pnas.1434337100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>Distribution of genetic variation among chromosomal forms of <it>Anopheles gambiae </it>s.s.: introgressive hybridization, adaptive inversions, or recent reproductive isolation?</p>
            </title>
            <aug>
               <au>
                  <snm>Black</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Lanzaro</snm>
                  <fnm>GC</fnm>
               </au>
            </aug>
            <source>Insect Molecular Biology</source>
            <pubdate>2001</pubdate>
            <volume>10</volume>
            <fpage>3</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2583.2001.00234.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">11240631</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>The Probability of Topological Concordance of Gene Trees and Species Trees</p>
            </title>
            <aug>
               <au>
                  <snm>Rosenberg</snm>
                  <fnm>NA</fnm>
               </au>
            </aug>
            <source>Theoretical Population Biology</source>
            <pubdate>2002</pubdate>
            <volume>61</volume>
            <fpage>225</fpage>
            <lpage>247</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/tpbi.2001.1568</pubid>
                  <pubid idtype="pmpid" link="fulltext">11969392</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B75">
            <title>
               <p>Contrasting patterns of X-linked and autosomal nucleotide variation in <it>Drosophila melanogaster </it>and <it>Drosophila simulans</it></p>
            </title>
            <aug>
               <au>
                  <snm>Andolfatto</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <fpage>279</fpage>
            <lpage>290</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11230529</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B76">
            <title>
               <p>A Genome-wide departure from the standard neutral model in natural populations of <it>Drosophila</it></p>
            </title>
            <aug>
               <au>
                  <snm>Andolfatto</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Przeworski</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2000</pubdate>
            <volume>156</volume>
            <fpage>257</fpage>
            <lpage>172</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461228</pubid>
                  <pubid idtype="pmpid">10978290</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B77">
            <title>
               <p>The effective population size of <it>Anopheles gambiae </it>in Kenya: implications for population structure</p>
            </title>
            <aug>
               <au>
                  <snm>Lehmann</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hawley</snm>
                  <fnm>WA</fnm>
               </au>
               <au>
                  <snm>Grebert</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>FH</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1998</pubdate>
            <volume>15</volume>
            <fpage>264</fpage>
            <lpage>276</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9501493</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B78">
            <title>
               <p>White Paper: Proposal for the Eight Genomes Cluster for Genus <it>Anopheles</it></p>
            </title>
            <aug>
               <au>
                  <snm>Besansky</snm>
                  <fnm>NJ</fnm>
               </au>
            </aug>
            <publisher>NIH, National Human Genome Research Institute</publisher>
            <pubdate>2005</pubdate>
         </bibl>
         <bibl id="B79">
            <title>
               <p>Ecological zones rather than molecular forms predict genetic differentiation in the malaria vector <it>Anopheles gambiae </it>s.s. in Ghana</p>
            </title>
            <aug>
               <au>
                  <snm>Yawson</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Weetman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2007</pubdate>
            <volume>175</volume>
            <fpage>751</fpage>
            <lpage>761</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1800615</pubid>
                  <pubid idtype="pmpid">17110481</pubid>
                  <pubid idtype="doi">10.1534/genetics.106.065888</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B80">
            <title>
               <p>Population structure of <it>Anopheles gambiae </it>in Africa</p>
            </title>
            <aug>
               <au>
                  <snm>Lehmann</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Licht</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Elissa</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Maega</snm>
                  <fnm>BT</fnm>
               </au>
               <au>
                  <snm>Chimumbwa</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Watsenga</snm>
                  <fnm>FT</fnm>
               </au>
               <au>
                  <snm>Wondji</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Simard</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hawley</snm>
                  <fnm>WA</fnm>
               </au>
            </aug>
            <source>J Hered. </source>
            <pubdate>2003</pubdate>
            <volume>94</volume>
            <issue>2</issue>
            <fpage>133</fpage>
            <lpage>147</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/jhered/esg024</pubid>
                  <pubid idtype="pmpid" link="fulltext">12721225</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B81">
            <title>
               <p>High Genetic Differentiation between the M and S Molecular Forms of <it>Anopheles gambiae </it>in Africa</p>
            </title>
            <aug>
               <au>
                  <snm>Esnault</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Boulesteix</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Duchemin</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Koffi</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Chandre</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Dabir&#233;</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Simard</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Tripet</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Fontenille</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bi&#233;mont</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>PLoS ONE</source>
            <pubdate>2008</pubdate>
            <volume>3</volume>
            <fpage>e1968</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2278371</pubid>
                  <pubid idtype="pmpid" link="fulltext">18414665</pubid>
                  <pubid idtype="doi">10.1371/journal.pone.0001968</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B82">
            <title>
               <p>White Paper: Genome analysis of vectorial capacity in major Anopheles vectors of malaria parasites</p>
            </title>
            <aug>
               <au>
                  <snm>Besansky</snm>
                  <fnm>NJ</fnm>
               </au>
            </aug>
            <publisher>NIH, National Human Genome Research Institute</publisher>
            <pubdate>2008</pubdate>
         </bibl>
         <bibl id="B83">
            <title>
               <p>Evolutionary dynamics of immune-related genes and pathways in disease-vector mosquitoes</p>
            </title>
            <aug>
               <au>
                  <snm>Waterhouse</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Kriventseva</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Meister</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Xi</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Alvarez</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Bartholomay</snm>
                  <fnm>LC</fnm>
               </au>
               <au>
                  <snm>Barillas-Mury</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bian</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Blandin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Christensen</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kanost</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Koutsos</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Levashina</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ligoxygakis</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>MacCallum</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Mayhew</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Mendes</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Michel</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Osta</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Paskewitz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shin</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Vlachou</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Zheng</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Zou</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Severson</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Raikhel</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Kafatos</snm>
                  <fnm>FC</fnm>
               </au>
               <au>
                  <snm>Dimopoulos</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Zdobnov</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Christophides</snm>
                  <fnm>GK</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2007</pubdate>
            <volume>316</volume>
            <fpage>1738</fpage>
            <lpage>1743</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2042107</pubid>
                  <pubid idtype="pmpid" link="fulltext">17588928</pubid>
                  <pubid idtype="doi">10.1126/science.1139862</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B84">
            <title>
               <p>Inversion polymorphisms and nucleotide variability in <it>Drosophila</it></p>
            </title>
            <aug>
               <au>
                  <snm>Andolfatto</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Depaulis</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Navarro</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genetical Research</source>
            <pubdate>2001</pubdate>
            <volume>77</volume>
            <fpage>1</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1017/S0016672301004955</pubid>
                  <pubid idtype="pmpid">11279826</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B85">
            <title>
               <p>Mechanisms of genetic exchange within the chromosomal inversions of <it>Drosophila pseudoobscura</it></p>
            </title>
            <aug>
               <au>
                  <snm>Schaeffer</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>WW</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2005</pubdate>
            <volume>171</volume>
            <fpage>1729</fpage>
            <lpage>1739</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1456099</pubid>
                  <pubid idtype="pmpid">16143625</pubid>
                  <pubid idtype="doi">10.1534/genetics.105.041947</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B86">
            <title>
               <p>Genetic exchange in 2La inversion heterokaryotypes of <it>Anopheles gambiae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Stump</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Pombi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goeddel</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Ribeiro</snm>
                  <fnm>JMC</fnm>
               </au>
               <au>
                  <snm>Wilder</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Torre</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Besansky</snm>
                  <fnm>NJ</fnm>
               </au>
            </aug>
            <source>Insect Mol Biol</source>
            <pubdate>2007</pubdate>
            <volume>16</volume>
            <fpage>703</fpage>
            <lpage>709</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">18092999</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B87">
            <title>
               <p>Association of a <it>Plasmodium </it>refractory phenotype with an esterase locus in <it>Anopheles gambiae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Vernick</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>FH</fnm>
               </au>
            </aug>
            <source>Am J Trop Med Hyg</source>
            <pubdate>1989</pubdate>
            <volume>40</volume>
            <fpage>593</fpage>
            <lpage>597</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2742037</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B88">
            <title>
               <p>Intraspecific chromosomal polymorphism in the <it>Anopheles gambiae </it>complex as a factor affecting malaria transmission in the Kisumu area of Kenya</p>
            </title>
            <aug>
               <au>
                  <snm>Petrarca</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Beier</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Am J Trop Med Hyg</source>
            <pubdate>1992</pubdate>
            <volume>46</volume>
            <fpage>229</fpage>
            <lpage>237</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">1539757</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B89">
            <title>
               <p>Balancing selection and its effects on sequences in nearby genome regions</p>
            </title>
            <aug>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>PloS Genetics</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <issue>4</issue>
            <fpage>e64</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pgen.0020064</pubid>
                  <pubid idtype="pmpid" link="fulltext">16683038</pubid>
                  <pubid idtype="pmcid">1449905</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B90">
            <title>
               <p>Patterns of selection in anti-malarial immune genes in malaria vectors: evidence for adaptive evolution in LRIM1 in <it>Anopheles arabiensis</it></p>
            </title>
            <aug>
               <au>
                  <snm>Slotman</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Parmakelis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Awono-Ambene</snm>
                  <fnm>PH</fnm>
               </au>
               <au>
                  <snm>Antonio-Nkondjo</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Simard</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Caccone</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Powell</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>PLoS ONE</source>
            <pubdate>2007</pubdate>
            <volume>2</volume>
            <fpage>e793</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1945087</pubid>
                  <pubid idtype="pmpid" link="fulltext">17726523</pubid>
                  <pubid idtype="doi">10.1371/journal.pone.0000793</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B91">
            <title>
               <p>Polymorphism at the defensin gene in the <it>Anopheles gambiae </it>complex: Testing different selection hypotheses</p>
            </title>
            <aug>
               <au>
                  <snm>Simard</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Licht</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Besansky</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Lehmann</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Infect Genet Evol. </source>
            <pubdate>2007</pubdate>
            <volume>7</volume>
            <issue>2</issue>
            <fpage>285</fpage>
            <lpage>292</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.meegid.2006.11.004</pubid>
                  <pubid idtype="pmpid" link="fulltext">17161659</pubid>
                  <pubid idtype="pmcid">1876661</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B92">
            <title>
               <p>The evolution of TEP1, an exceptionally polymorphic immunity gene in <it>Anopheles gambiae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Obbard</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Callister</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jiggins</snm>
                  <fnm>FM</fnm>
               </au>
               <au>
                  <snm>Soares</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Yan</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Little</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>BMC Evolutionary Biology</source>
            <pubdate>2008</pubdate>
            <volume>8</volume>
            <fpage>274</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2576239</pubid>
                  <pubid idtype="pmpid" link="fulltext">18840262</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-8-274</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B93">
            <title>
               <p>Adaptive eolution of relish, a <it>Drosophila </it>NF-{kappa}B/I{kappa}B Protein</p>
            </title>
            <aug>
               <au>
                  <snm>Begun</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Whitley</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2000</pubdate>
            <volume>154</volume>
            <fpage>1231</fpage>
            <lpage>1238</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1460974</pubid>
                  <pubid idtype="pmpid">10757765</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B94">
            <title>
               <p>Natural selection drives <it>Drosophila </it>immune system evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Schlenke</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Begun</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2003</pubdate>
            <volume>164</volume>
            <fpage>1471</fpage>
            <lpage>1480</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1462669</pubid>
                  <pubid idtype="pmpid">12930753</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B95">
            <title>
               <p>Elevated polymorphism and divergence in the class C scavenger receptors of <it>Drosophila melanogaster </it>and <it>D. simulans</it></p>
            </title>
            <aug>
               <au>
                  <snm>Lazzaro</snm>
                  <fnm>BP</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2005</pubdate>
            <volume>169</volume>
            <fpage>2023</fpage>
            <lpage>2034</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1449580</pubid>
                  <pubid idtype="pmpid">15716507</pubid>
                  <pubid idtype="doi">10.1534/genetics.104.034249</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B96">
            <title>
               <p>PAML 4: Phylogenetic analysis by maximum likelihood</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>ZH</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2007</pubdate>
            <volume>24</volume>
            <fpage>1586</fpage>
            <lpage>1591</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msm088</pubid>
                  <pubid idtype="pmpid" link="fulltext">17483113</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B97">
            <title>
               <p>Unbiased estimation of the rates of synonymous and nonsynonymous substitution</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>WH</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1993</pubdate>
            <volume>36</volume>
            <fpage>96</fpage>
            <lpage>99</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF02407308</pubid>
                  <pubid idtype="pmpid">8433381</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B98">
            <title>
               <p>Positive selection in alternatively spliced exons of human genes</p>
            </title>
            <aug>
               <au>
                  <snm>Ramensky</snm>
                  <fnm>VE</fnm>
               </au>
               <au>
                  <snm>Nurtdinov</snm>
                  <fnm>RN</fnm>
               </au>
               <au>
                  <snm>Neverov</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Mironov</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Gelfand</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>Am J Human Genetics</source>
            <pubdate>2008</pubdate>
            <volume>83</volume>
            <fpage>94</fpage>
            <lpage>98</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.ajhg.2008.05.017</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B99">
            <title>
               <p>Low nucleotide diversity in man</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>WH</fnm>
               </au>
               <au>
                  <snm>Sadler</snm>
                  <fnm>LA</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1991</pubdate>
            <volume>129</volume>
            <fpage>513</fpage>
            <lpage>523</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1204640</pubid>
                  <pubid idtype="pmpid" link="fulltext">1743489</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B100">
            <title>
               <p>HyPhy: hypothesis testing using phylogenies</p>
            </title>
            <aug>
               <au>
                  <snm>Kosakovsky Pond</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Frost</snm>
                  <fnm>SDW</fnm>
               </au>
               <au>
                  <snm>Muse</snm>
                  <fnm>SV</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>676</fpage>
            <lpage>679</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti079</pubid>
                  <pubid idtype="pmpid" link="fulltext">15509596</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B101">
            <title>
               <p>Estimating diversifying selection and functional constraint in the presence of recombination</p>
            </title>
            <aug>
               <au>
                  <snm>Wilson</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>McVean</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2006</pubdate>
            <volume>172</volume>
            <fpage>1411</fpage>
            <lpage>1425</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1456295</pubid>
                  <pubid idtype="pmpid">16387887</pubid>
                  <pubid idtype="doi">10.1534/genetics.105.044917</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B102">
            <title>
               <p>Effect of recombination on the accuracy of the likelihood method for detecting positive selection at amino acid sites</p>
            </title>
            <aug>
               <au>
                  <snm>Anisimova</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>ZH</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2003</pubdate>
            <volume>164</volume>
            <fpage>1229</fpage>
            <lpage>1236</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1462615</pubid>
                  <pubid idtype="pmpid">12871927</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B103">
            <title>
               <p>Preponderance of slightly deleterious polymorphism in mitochondrial DNA: Nonsynonymous/synonymous rate ratio is much higher within species than between species</p>
            </title>
            <aug>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cao</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>ZH</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1998</pubdate>
            <volume>15</volume>
            <fpage>1499</fpage>
            <lpage>1505</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12572613</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

