Interview with the authors: Habitat light sets the boundaries for the rapid evolution of cichlid fish vision, while sexual selection can tune it within those limits

Non-model organisms provide an interesting avenue to explore evolution in real time in natural populations. Here, we speak to Ralf F. Schneider and Sina J. Rometsch of University of Konstanz, Germany about their co-authored Molecular Ecology article, where they investigate sex‐specific opsin expression of several cichlids from Africa and the Neotropics which they coupled with data sets on sex‐specific body coloration, species‐specific visual sensitivities, lens transmission and habitat light properties. They illustrate how integrative approaches can address specific questions on the factors and mechanisms driving diversification, and the evolution of cichlid vision in particular.  Read on to get a behind-the-scenes view of this study.

1. What led to your interest in this topic / what was the motivation for this study? 

Cichlid fishes are an amazing system to work with. They are one of the most species rich vertebrate families and adapted to a wide range of ecological niches. This is reflected in outstanding phenotypic diversity in numerous traits. Their striking variation in body coloration, which can sometimes differ considerably between the sexes, has even been acknowledged in their German name “Buntbarsche”, which translates to colorful perches. Moreover, their visual system is intriguingly complex, because cichlids possess a total of seven opsin genes that allow for color vision (humans only have three). While sexual selection has been recognized as a major driver in the evolution of cichlid body coloration (coloration being the “sender”), less is known about what shapes visual sensitivities (the receiving end of the system) in cichlids. Therefore, we were interested in whether the phenotypic diversity found across cichlid visual sensitivities is primarily driven by sexual selection (e.g., vision co-evolves with body colors), or whether environmental factors, such as light availability and water turbidity, turn out to have a stronger effect on the evolution of cichlid vision.

2. What difficulties did you run into along the way? 

The main challenge in addressing the question on whether the light environment (“ecological selection”) or conspecifics’ body colorations (“sexual selection”) are driving the diversification in cichlid vision was that both potential drivers are very complex and can be challenging to quantify. The visual environment across cichlids’ habitats varies tremendously, and is dependent on factors, such as water depth and turbidity. Coloration patterns often change across the fish’s body and only photospectrometric measurements across the whole range of visible light wavelengths can objectively quantify a color. Moreover, the fish’s visual system is highly complex. Visual sensitivity can be modified physiologically or by phenotypic plasticity by a number of factors including expressing different subsets of the seven opsin genes, changing their expression level, lens filtering etc. Combining all this information, either obtained from our own experiments or from published studies, in a common framework allowing for meaningful statistical analyses was challenging.

3. What is the biggest or most surprising innovation highlighted in this study? 

Most surprising, in terms of results, was to us that we did not find sexual dimorphism in opsin expression in any cichlid – not even those with very strong dimorphism in body coloration (such as in Pseudotropheus lombardoi (attached photo), where females are blue and males are yellow). While we did expect that evaluating mating partners based on their body color would favor associated sexual dimorphism in the visual system, this seemingly has not (yet?) happened in these fish. In terms of methodology, our study integrates a complex data-set on ecological and physiological parameters that can affect the visual sensitivity. This allowed us to evaluate the potential interactions of these parameters in a very comprehensive way.

4. Moving forward, what are the next steps in this area of research?

In our study, we show that a wide range of data can be integrated in a single model, which allowed us to investigate interactions among variables that are rarely used in a common framework. Thus, we encourage future studies to also consider comprehensive approaches when addressing questions concerning the visual ecology of these (or other) fish, if this information is available or obtainable. Additionally, while we have a relatively good understanding of how visual information is perceived by cichlids, there is only very little information on how visual information is processed in the neuronal circuitry of the eye and later in the brain. Understanding signal processing in cichlid eyes will provide a new information layer for evolutionary ecologists to work with.

5. What would your message be for students about to start developing or using novel techniques in Molecular Ecology? 

In the last decades, due to ever-increasing computational power, storage capacities and high-throughput techniques, such as next-gen sequencing, large amounts of data can be more reasonably collected and are accessible by more researchers. New methods can benefit from incorporating these data into analysis pipelines to consolidate them or broaden their scope. Being aware of available data can thus be very useful. However, it is also important to us to stress that approaching a scientific question from several angles and across biological disciplines that don’t frequently communicate is often the soundest approach. Classical lab methods, such as in situ hybridization or histology, as well as cutting edge techniques, such as Crispr/Cas9, can provide valuable validation/falsification of formulated hypotheses.

6. What have you learned about methods and resources development over the course of this project? 

It was great to see how this study evolved: one question and technique lead to another until we finally aimed at developing an analysis frame-work for the complex data-sets that are obtained in visual ecology of (cichlid) fishes. This comprised changing and further developing our pipeline while analyzing the data. Several preliminary pipelines had to be discarded as they did not properly address our core hypotheses. Thus, an important lesson for us was that it can take a while until a newly developed analysis pipeline does actually what one envisioned roughly at the beginning of the project. Overall, collaborating in a team with members of quite different backgrounds such as ecology, molecular biology and data science and working in a large and well-established lab made it possible to learn and apply new techniques.

7. Describe the significance of this research for the general scientific community in one sentence.

Evaluating the relative strengths of natural vs. sexual selection is a very interesting question and these two forces are often very hard to disentangle, but using a set of multidisciplinary approaches combined with a comprehensive statistical analysis allowed us to show that in narrow light environments visual sensitivity is tuned to exploit all available light, while broader light environments allow for more specialized visual sensitivities.

8. Describe the significance of this research for your scientific community in one sentence.

We show that ambient light is a prime driver for the evolution of visual sensitivities through natural selection in cichlid fishes, whereas sexual selection seems to finetune the observed diversity within the limits set by natural selection.

9. How has COVID-19 affected work in your group?

For the last two months we’ve all been confined to working from home which – on the plus side – allowed us to dedicate more time to data analyses and finishing manuscripts, but on the down side required lab experiments to be currently on hold – unfortunately.

Full paper: Schneider, R. F., Rometsch, S. J., Torres-Dowdall, J., & Meyer, A. (2020). Habitat light sets the boundaries for the rapid evolution of cichlid fish vision, while sexual selection can tune it within those limits. Molecular Ecology. https://onlinelibrary.wiley.com/doi/full/10.1111/mec.15416

Interview with the authors: Applying genomic data in wildlife monitoring

Massive parallel sequencing has led to an explosion of sequence data in recent years. However, the methods used to obtain such data are usually high-cost and time-intensive, and often require high-quality samples. This creates limits as to whether and how well such data can be used by researchers working in applied conservation science. Here, we speak to Alina von Thaden about her recent study in Molecular Ecology Resources. Using European wildcats as a case study, Alina and co-authors present a relatively low-cost and time-efficient workflow for the development and optimisation of microfluidic SNP panels, which can be used to obtain SNP data from minimally invasive samples. Beyond outlining the workflow and its applications, they go so far as to estimate the costs of their pipeline, providing valuable practical information for conservation scientists. Read on for an in-depth view of this study.

Monitoring elusive European wildcats (Felis silvestris) is heavily reliant on noninvasively collected DNA samples. Photo credit: Annsophie Schmidt.

What led to your interest in this topic / what was the motivation for this study? 

We are mainly working on genetic monitoring of large carnivores and most of our research is based on noninvasively collected wildlife samples such as hairs, faeces and saliva traces. The field demands for very fast and reliable genetic analyses of samples with degraded DNA. And since funding is generally sparse in applied conservation, our methods need to be cost-effective and suitable for high-throughput approaches.

Genomic tools, on the other hand, usually involve large amounts of data, complex bioinformatic pipelines and typically rely on samples with high-quality DNA. We have been looking into ways to combine the advantages of genomics with the challenges of conservation monitoring. For some years now, we have been working with microfluidic arrays combined with reduced SNP panels and wanted to share our experiences with other labs interested in applying them.

What difficulties did you run into along the way? 

Setting-up and optimizing methodological resources comes along with several challenges – but there is a lot to learn! Most important to me was to remain skeptical about the results and to constantly validate them through analyzing the data from several perspectives and with different software. The validation of the technology also took a lot of extra lab hours, but we are confident that the workflow and guidelines that we present now will save others a lot of hands-on time and costs when optimizing SNP panels for degraded samples.

What is the biggest or most surprising finding from this study? 

First of all, after years of developing the framework, we applied it to a new SNP panel designed for dog-wolf hybridization assessment (to be published) and found that the lab work for generating a new ready-to-use marker panel took us only a few weeks. To see the approach being proved effective was great and encouraged us to share it with the community.

Secondly, a large proportion of noninvasively collected samples could be run without or with only very few genotyping errors as compared to more traditional microsatellite-based genotyping (see also von Thaden et al. 2017). This has direct implications for genotyping costs and thus promotes the broader establishment of a genomic technology in applied conservation.

Alina von Thaden collecting reference samples of European wildcat (Felis silvestris) for testing a newly developed SNP panel. Photo credit: Annsophie Schmidt.

Moving forward, what are the next steps for this research? 

One of our next steps is to apply the technology to historical samples from museum collections. Additionally, we are going to implement the SNP panel from our current paper in routine genetic monitoring of European wildcats in Germany.

We currently develop other reduced SNP panels for a variety of endangered species in our lab, such as dormice and European bison. Besides neutral variation, we also aim to integrate functional markers, such as SNPs associated with disease susceptibility.

Further, we will test alternative platforms that will allow generating larger SNP sets for degraded samples. Ultimately, our long-term goal is the effective implementation of an “applied genomic wildlife monitoring” approach.

What would your message be for students about to start their first research projects in this topic? 

Get in contact with other groups working in this area! Sharing ideas and experience really helps to shape your project and refine the aims of your research. Most people are very cooperative and happy to contribute or answer questions.

What have you learned about science over the course of this project? 

Perseverance and tenacity. When exploring new directions the research journey may well become bumpy and lead you somewhere else than you initially expected. But it’s worth it – keep your goal in mind and be ready to rethink your strategy.

Describe the significance of this research for the general scientific community in one sentence.

Bridging the gap between genomics and applied conservation is a key prerequisite for effective wildlife management, especially in the light of rapid biodiversity declines.

Describe the significance of this research for your scientific community in one sentence.

We demonstrate how reduced SNP panels can be efficiently developed and optimized for genotyping based on degraded wildlife samples.

References

von Thaden, A., Cocchiararo, B., Jarausch, A., Jüngling, H., Karamanlidis, A. A., Tiesmeyer, A. … Muñoz-Fuentes, V. (2017). Assessing SNP genotyping of noninvasively collected wildlife samples using microfluidic arrays. Scientific Reports, 7, 83. https://doi.org/10.1038/s41598-017-10647-w

Full paper

von Thaden, A., Nowak, C., Tiesmeyer, A., Reiners, T. E., Alves, P. C., Lyons, L. A., … & Hegyeli, Z. (2020). Applying genomic data in wildlife monitoring: Development guidelines for genotyping degraded samples with reduced single nucleotide polymorphism (SNP) panels. Molecular Ecology Resources. https://doi.org/10.1111/1755-0998.13136

.

Interview with the authors: Strong divergent selection at multiple loci in two closely related species of ragworts adapted to high and low elevations on Mount Etna

Non-model organisms provide an interesting avenue to explore evolution in real time in natural populations.Here, we speak to Edgar Wong of Department of Plant Sciences, University of Oxford, UK about his Molecular Ecology article, which investigated speciation in two closely related Senecio species, S. aethnensis and S. chrysanthemifolius, which grow at high and low elevations, respectively, on Mount Etna, Sicily and form a hybrid zone at intermediate elevations.  Wong and his co-authors found an extremely strong selection (up to 0.78) against hybrids in the system. This estimate is one of the highest reported in literature, and much higher than the one reported in the same system in the past. Read on to get a behind-the-scenes view of this study.

1. What led to your interest in this topic / what was the motivation for this study?

Speciation and hybridisation have always been interesting topics to me. In the case of Senecio on Mount Etna, they have an especially fascinating story: first, Mount Etna is a relatively young mountain (less than half a million years old), and previous research hypothesized that the formation of the mountain led to the divergence of the two species, Senecio aethnensis and S. chrysanthemifolius. These species are thought to be a rare example of clear-cut, recent speciation subject to divergent selection – the formation of new species driven by adaptation to distinct conditions – high- and low-elevations in our study. Second, botanists around 300 years ago brought some live Senecio specimens of the plants from Mount Etna back to the UK, and led to hybrid speciation of S. squalidus that has since spread all over the UK (although crossing experiments using plants from Mount Etna suggested hybrid breakdown). A lot is still unknown about the plants both on Mount Etna and in the UK. Hence, I was intrigued to find out unknown aspects in the system and focused on the species on Mount Etna.

2. What difficulties did you run into along the way?

One big difficulty was that Asteraceae (which Senecio belongs to) is notorious for being hard to extract clean DNA. It was a struggle to extract good-quality DNA for this study, which was resolved in the end. Also, we only had a draft genome for the hybrid species, S. squalidus, which limited the scope of analyses we could carry out. Luckily, we managed to find some interesting, highly differentiated genes that might be underlying speciation and adaptation.

3. What is the biggest or most surprising innovation highlighted in this study?

The most surprising finding in our study is that we estimated an extremely strong selection (up to 0.78) against hybrids in the system. This estimate is one of the highest reported in literature, and much higher than the one reported in the same system in the past. Such strong selection was surprising to us because hybrids between the two species are (apparently) happily growing at intermediate elevations between the typical habitats of ‘pure’ S. aethnensis and S. chrysanthemifolius. We think this strong cumulative selection on multiple loci works together with intrinsic incompatibility to maintain the phenotypic and genotypic divergence between the two target species.

4. Moving forward, what are the next steps in this area of research?

In the future, we hope to identify the environmental and ecological selective forces that had shaped this system. We also hope to characterise the genetic aspect of the species by improving the genome assembly and study more in detail the intrinsic incompatibility between the two target species (such as hybrid breakdown). With more data on both extrinsic and intrinsic processes, we can integrate these findings to get a more comprehensive picture of reproduction isolation in this system.

5. What would your message be for students about to start developing or using novel techniques in Molecular Ecology?

I would say to spend enough time understanding the experimental techniques and different types of data analyses (and the theories behind them). Most importantly, make sure that the type of data you generate are suitable for answering your research questions. As a graduate student myself, I would also suggest not to rush your work and not get transfixed on certain issues/ problems along the way – taking a step back and asking for advice and opinions from other researchers are always helpful in getting another perspective, which often helps to find a solution.

6. What have you learned about methods and resources development over the course of this project?

The type of data I used and subsequent data analyses were all new to me when I started the project, so there is no doubt I learnt a great deal about handling new types of data and how to analyse it. Another thing I have learnt is that there are always newer or ‘better’ technologies and methods that give you more data and/ or data with higher accuracy. It is inevitable that sometimes you would be worried whether what you have is not good enough. However, I have come to realise that more isn’t always better and there will always be more advanced methods; the most important thing is to use what you have and try to answer your research questions.

7. Describe the significance of this research for the general scientific community in one sentence.

Non-model organisms inform us a lot about evolutionary processes such as hybridisation, adaptation and speciation.

8. Describe the significance of this research for your scientific community in one sentence.

Strong multifarious selection could be crucial in maintaining species divergence despite on-going gene flow.

Summary from the authors: Interdependent sensory systems regulate larval settlement in a marine sponge

In the ocean, pelagic larvae that settle onto the seafloor and metamorphose into an adult directly regulate the ecology and evolution of all benthic communities. To settle, larvae of most species need to encounter specific biochemical cues that indicate an optimal environment, and many also prefer to settle in the dark. It appears likely, then, that larval responses to light and to biochemical cues are closely linked, but exactly how this happens at a molecular level is largely unexplored.

We explored how changes in gene expression regulate larval settlement in a marine sponge. We find that these larvae naturally settle at twilight, and that this is directly related to the expression of receptors and signalling pathway components. Further, we find that constant light prevents larval settlement via blocking the ability of larvae to respond to biochemical cues. Our data provide the first suggestions of candidate genes and molecular pathways that may regulate the way in which light can directly affect larval settlement. Our findings in a sponge, one of the earliest branching extant animal lineages, raises the possibility that larval responses to light and to biochemical cues might be a mechanism regulating settlement across the animal kingdom.

(Left) Scanning electron micrograph of an Amphimedon queenslandica larva. A ring of very long cilia, which are associated with photosensory pigment cells, are clearly visible at the posterior end of the larva. Photo credit: Sally Leys. (Right) Tahsha Say in the field on Heron Island Reef flat, Great Barrier Reef, Australia.

Full article: Say, TE, Degnan, SM. Molecular and behavioural evidence that interdependent photo ‐ and chemosensory systems regulate larval settlement in a marine sponge. Mol Ecol. 2020; 29: 247– 261. https://doi.org/10.1111/mec.15318

This summary was written by the study’s first author,TE Say.

Victoria Sork awarded the 2020 Molecular Ecology Prize

The Molecular Ecology Prize Committee is pleased to announce that the 2020 Molecular Ecology prize has been awarded to Dr. Victoria Sork, Distinguished Professor in Ecology and Evolutionary Biology, Dean of Life Sciences, and Director of the Mildred E. Mathias Botanical Garden at University of California Los Angeles. Throughout her career, Dr. Sork has made substantial and diverse scientific contributions to the field of molecular ecology – from working to build the foundation of landscape genetics, to pioneering the use of molecular markers in tracking plant dispersal, to unraveling the genomic and epi-genomic basis of climate adaptation in non-model organisms. With well over 100 publications, she has proven herself to be a preeminent scholar in her field for decades, while serving as a role model and mentor for many early career scientists, and as a continual advocate for increasing diversity and inclusion in STEM.

Dr. Sork joins the previous winners of the Molecular Ecology Prize: Godfrey Hewitt, John Avise, Pierre Taberlet, Harry Smith, Terry Burke, Josephine Pemberton, Deborah Charlesworth, Craig Moritz, Laurent Excoffier, Johanna Schmitt, Fred Allendorf, Louis Bernatchez, Nancy Moran, Robin Waples, and Scott Edwards.

Summary from the authors: A metagenomic assessment of microbial eukaryotic diversity in the global ocean

Marine microbial eukaryotes are key components of planktonic ecosystems in all ocean biomes. They are, along with cyanobacteria, responsible for nearly half of the global primary production, and play important roles in food-web dynamics as grazers and parasites, carbon export to the deep ocean, and nutrient remineralization. Currently, one of the most common approaches to survey their diversity is sequencing marker genes amplified from genomic DNA extracted from microbial assemblages. However, this approach requires a PCR step, which is known to introduce biases in microbial diversity estimates. One alternative to overcome this issue involves exploiting the taxonomic information contained in metagenomes, which use massive shotgun sequencing of the same DNA extracts with the goal of assessing the putative functions of environmental microbes.

In this study we investigated the potential of metagenomics to provide taxonomic reports of marine microbial eukaryotes. The overall diversity reported by this approach was similar to that obtained by amplicon sequencing, although the latter performed poorly for some taxonomic groups. We then studied the diversity of picoeukaryotes and nanoeukaryotes using 91 metagenomes from surface down to bathypelagic layers in different oceans, unveiling a clear separation of taxonomic groups between size fractions and depth layers.

Overall, this study shows metagenomics as an excellent resource for taxonomic exploration of marine microbial eukaryotes.

Summary of the relevance of main eukaryotic taxonomic groups within two size fractions of marine plankton (picoeukaryotes [0.2-3 µm] and nanoeukaryotes [3-20µm]) and in two different layers of the global ocean (photic [0-200 m] and aphotic [200-4000m]) as seen by metagenomics. The median of the relative abundance was calculated for each taxonomic group with samples from the 4 categories (pico-photic, pico-aphotic, nano-photic, nano-aphotic) and dots represent these median values transformed to a 0-100 scale. Dots are then colored based on the category where the taxonomic group is most relevant.

This summary was written by the study’s first author, Aleix Obiol.

Full article:
Obiol, A., Giner, C. R., Sánchez, P., Duarte, C. M., Acinas, S. G., & Massana, R. (2020). A metagenomic assessment of microbial eukaryotic diversity in the global ocean. Molecular Ecology Resources. https://doi.org/10.1111/1755-0998.13147

Interview with the authors: Which software is best to use for de novo assembly?

Reduced representation sequencing (e.g. RAD and GBS) is becoming ever more popular, but for species which lack a reference genome, little work has been done to assess which software may be best suited to building de novo assemblies from this data. Here, we speak to Melanie LaCava of the University of Wyoming about her recent Molecular Ecology Resources article, which explores the accuracy of de novo assemblies built by various software programs using DNA generated from double-digest libraries. Melanie and her co-authors found highly variable degrees of accuracy of assemblies built by six different software programs, and discuss which programs are best suited to this application. They also highlight the importance of optimising parameter settings within any given software. Read on to get a behind-the-scenes view of this study.

The completeness of assemblies in simulations of unmutated genomes (a, d), in simulations of an equal number of SNPs and indels (b, e), and simulations of 1–5 base pair indels (c, f). Values are reported for five assemblers: CDHIT (green), STACkS (blue), STACkS2 (purple), VelVeT (pink) and VSeARCH (orange). The hue of each color corresponds to the percent match parameter setting used in the assembly. For more information on this figure go to Figure 1’s caption here.

What led to your interest in this topic / what was the motivation for this study? 

This study began as a research project in a graduate-level course on computational biology at the University of Wyoming led by the senior author on the paper, Alex Buerkle. Dr. Buerkle initiated the project and worked with the rest of the coauthors to pursue this de novo assembly software comparison. As reduced representation genotyping-by-sequencing has become more popular, new and repurposed software programs have been applied to each step in the bioinformatics pipeline. When a reference genome is unavailable for a study species, de novo assembly is essential, yet we recognized a gap in the evaluation of software used for this important step.

What difficulties did you run into along the way? 

Technology and software associated with genotyping-by-sequencing and de novo genome assembly are rapidly changing. During the course of our project, some of the software programs we tested were significantly updated, so we chose to rerun our analyses using the new software versions to ensure we were providing up-to-date information in our manuscript.

What is the biggest or most surprising finding from this study? 

We were surprised to find such a substantial difference in performance among these assembly programs. We were especially surprised at the variation in performance among software for our first simulation where no mutations were introduced. In this scenario, we made many identical copies of genome fragments and then performed de novo assembly using each software program. Without any mutations introduced, the job is basically to generate a list of unique sequences – it should be very straightforward. In some cases, however, these genome fragments were broken into shorter sequences and rearranged beyond recognition, leading to incorrect reconstruction of the simple, unmutated data.

Moving forward, what are the next steps for this research?

For our study, we selected a sample of assemblers from peer-reviewed literature that use different assembly algorithms, are freely available, and have updated user resources available online. However, this was not a comprehensive evaluation of all software capable of de novo assembly. Therefore, the evaluation of other programs would be valuable. Additionally, as new software programs are introduced or existing programs are updated, continued efforts to evaluate de novo assembly performance is warranted.

What would your message be for students about to start their first research projects in this topic? 

Reduced representation genotyping-by-sequencing is becoming less expensive and more accessible, making it a viable option for more research projects. While it is exciting to apply these emerging technologies and methods, it is important to recognize that approaches to filter and analyze these large datasets are still in development. Doing your background research to ensure you are applying the best available tools and using the most appropriate methods for your study is essential to doing good research in this field and in any field of research.

What have you learned about science over the course of this project? 

Doing this study has reaffirmed the importance of simulations to test how software works. Testing analyses on simulated data and altering parameters of the simulation or analysis can provide immense insight into how the software works and how variation in real data may affect software performance. Larger simulation projects like our study can provide information that many people can use, but I also find it incredibly helpful to run a simulated dataset through an analysis before analyzing my own data to ensure I understand what the software is doing. Taking advantage of simulated datasets available in vignettes for software is a great tool to get acquainted with the analyses you plan to do.

Describe the significance of this research for the general scientific community in one sentence.

Our study demonstrates the importance of ensuring that software you use is really doing what you think it is supposed to do; and simulations can help evaluate software performance.

Describe the significance of this research for your scientific community in one sentence.

Researchers who need to perform de novo assembly of reduced representation genotyping-by-sequencing data can use our study as a guide for which software to use and the importance of different parameter settings for assembly.

LaCava, M. E., Aikens, E. O., Megna, L. C., Randolph, G., Hubbard, C., & Buerkle, C. A. (2019). Accuracy of de novo assembly of DNA sequences from double‐digest libraries varies substantially among software. Molecular ecology resources. https://doi.org/10.1111/1755-0998.13108

Interview with the authors: How does invasiveness evolve? A look at feral pigs

Understanding how and why some species readily invade new habitats is an interesting view into the myriad ways species evolve. Limiting the expansion of such introduced species can be important for managing ecosystems, particularly when the invasive species is as ecologically destructive and economically costly as the feral swine in the US south. In a paper published recently in Molecular Ecology, researchers led by Dr. Tim Smyser investigated the origins of the invasive feral swine populations to determine how much the expanding footprint of this species was a due of recently escaped domesticated pigs. Surprisingly, they found that the expanding range was largely attributable to range expansion by the established invasive swine population. Read on to for more details from Dr. Smyser into this very interesting work!

Invasive feral swine originated from a combination of European feral pigs and domesticated stock. Photo by Dr. Mirte Bosse, dvdwphotography.

What led to your interest in this topic / what was the motivation for this study? 

Invasive feral swine have expanded rapidly throughout the United States over the past 30 years. The impetus for the study was to identify the drivers for that expansion, to ask: where are new feral swine populations coming from? Prior to our work, there was a hypothesis that domestic pigs had sufficient phenotypic plasticity that they would revert to a wild phenotype, resembling a wild boar, if living in the wild. Under this hypothesis, any pig farm could have served as a viable source population for invasive feral swine. With this study, we revealed that there is very little direct contribution to invasive feral swine populations from domestic pigs, potbellied pigs, or wild boar. Rather, the rapid expansion observed over the past 30 years has been driven by incremental range expansion of established invasive feral swine, which overwhelmingly represent animals of mixed European wild boar-heritage domestic breed ancestry, and long-distance translocation of feral swine from established populations to uninvaded habitats.

What difficulties did you run into along the way? 

The challenges were largely computational. We had amassed over 9,000 genotypes by the time we compiled the reference set and generated genotypes from invasive feral swine genotypes for this study. Such a large dataset required that we do everything we could to optimize runtime efficiency. Even with these efforts, the analysis still took about 4 months of runtime while using 30 CPUs with 60 threads.

What is the biggest or most surprising innovation highlighted in this study? 

I would say the most surprising result was the very high proportion of invasive feral swine that had a significant ancestry association to European wild boar. The historical record suggests wild boar releases have been far more limited than the potential for domestic pig releases, yet 97% of feral swine had significant European wild boar ancestry. This might suggest hybrid wild boar-domestic pig ancestry is biologically important for feral swine to establish self-sustaining populations and become invasive.

Moving forward, what are the next steps in this area of research?

Descending from this work, our next steps are multifaceted. With this analysis, we have identified the drivers of range expansion at a broad-scale with ancestry results pointing to the expansion of established populations. We are now interested in adding a fine-scale understanding of expansion to identify the specific sources of newly emergent populations and map the patterns of feral swine expansion. Also, this analysis has provided an understanding of the ancestral composition of invasive feral swine. Given the hybrid origin of these animals, we will identify elements of the genomes from their ancestral groups, that is heritage breeds of pig and European wild boar, that have been selectively retained in feral swine. By describing selective sweeps relative to ancestral groups, this analysis will allow us to describe the evolution of invasiveness among feral swine.   

What would your message be for students about to start developing or using novel techniques in Molecular Ecology?

The field of Molecular Ecology is changing so quickly that it is hard as a scientist to keep up, from both a computational/statistical standpoint and with all the new molecular techniques and analyses that allow us to dive deeper into the genome than we had previously imagined. My recommendation for students would be to not let the lack of a specific skill deter you from asking interesting questions – take the time to develop the needed skill sets or develop collaborations to facilitate your learning or use of those skills. Also, keep asking questions – don’t be content with the answers we are able to resolve today.

What have you learned about methods and resources development over the course of this project? 

Reflecting back on my answer immediately above, when I started asking the question of what are the drivers of invasive feral swine range expansion, I did not have the data or the skills to meaningfully address that question. Through the development of a great team of collaborators and independent learning, I was able to assemble the needed skills and then the data to pose this question and reveal interesting results. Through this project, I learned about the statistical tools used in the analyses, developed the coding skills necessary to execute those analyses, and identified strategies to maximize computational efficiency as was needed for working with such a large dataset.

Describe the significance of this research for the general scientific community in one sentence.

We have demonstrated that the recent and rapid expansion of feral swine, an ecologically destructive and economically costly invasive species distributed throughout much of the US and the world, has been facilitated by movement (in many cases anthropogenic movement) from established populations to uninvaded habitats as opposed to novel introductions of either domestic pigs or wild boar.

Describe the significance of this research for your scientific community in one sentence.

In identifying the admixed origins of invasive feral swine, descending from heritage domestic pig breeds and European wild boar ancestry, we can begin to gain an understanding of the evolution of invasiveness for this species and invasive species more broadly. 

Feral Swine are not native to the U.S. They are the result of recent and historical (1500’s Spanish explorers) releases of domestic swine and Eurasian boar. USDA APHIS photo Laurie Paulik.

Smyser TJ, Tabak MA, Slootmaker C, Robeson MS, Miller RS, Bosse M, Megens H-H, Groenen MAM, Rezende Paiva S, Assis de Faria D, Blackburn HD, Schmidt BS, Piaggio AJ. 2020. Mixed ancestry from wild and domestic lineages contributes to the rapid expansion of invasive feral swine. Molecular Ecology. https://doi.org/10.1111/mec.15392