New Resources for Mumps

Jennifer Chang, Richard Neher, Jover Lee, Kim Andrews

Expanding the collection of core pathogen datasets, we now provide regularly updated phylogenetic monitoring of mumps virus at:

These phylogenies are generated using genomic data from NCBI GenBank, and are updated daily when new sequences are uploaded to NCBI.

# Phylogenetic analysis

Mumps orthorubulavirus causes mumps, a viral disease spread primarily among humans that is highly contagious and spreads through respiratory droplets. The virus typically infects the salivary glands, causing characteristic swelling and inflammation, making it difficult to speak or move the jaw. The occurrence of mumps infections has been reduced by the widespread use of the MMR (measles, mumps, rubella) vaccine.

We provide two views of mumps for genomic analysis, a Global and a North-America filtered dataset.

Figure 1 Figure 1. Phylogenetic views of mumps viral genetic diversity. Views are split out into "global" and "north-america" focused phylogenies. Available at nextstrain.org/mumps/global and nextstrain.org/mumps/north-america.

Figure 2 Figure 2. Annotated by Mumps genotypes. Samples are annotated by Mumps genotypes based on both GenBank annotations and Nextclade assignments. Tree, map, and frequencies plots of the mumps genome tree are shown and available at nextstrain.org/mumps/global.

# Nextclade dataset

We provide two Nextclade datasets (SH region and genome) to assign genotypes (e.g. A to N) to mumps samples based on criteria outlined by the WHO and tree placement. Scaffold strains for the Nextclade dataset were pulled from the literature (Jin et al., 2005, WHO 2012, Jin et al., 2015) and augmented with some subsampling to fill in the tree. Mutations are called against a Jeryl-Lynn reference (D90232 for the SH region and HQ416907 for the full genome).

Scope Nextclade dataset Reference
SH region (~316 bp region) mumps/sh D90232 (Jeryl-Lynn)
Full genome mumps/genome HQ416907 (Jeryl-Lynn)

Figure 3 Figure 3. Nextclade datasets of mumps virus based on the ~316 SH region and the full genome. Scaffold phylogenetic trees were created for a Nextclade dataset and colored by genotype assignment. Below the trees, the SH region is highlighted in a magenta box to highlight the difference between the SH and full genome datasets. The Nextclade datasets are available as mumps/sh and mumps/genome.

# Nextstrain resources

We curate sequence data and metadata from NCBI as the starting point for our analyses. Curated sequences and metadata are available as flat files at:

# Acknowledgments & request for comments

We welcome comments or suggestions from mumps researchers to improve these Nextstrain datasets for their use case. Special thanks for feedback from Louise Moncla for answering questions and providing some biological context.

All source code is freely available under the terms of the GNU Affero General Public License. Screenshots may be used under a CC-BY-4.0 license and attribution to nextstrain.org must be provided.

This work is made possible by the open sharing of genetic data by research groups from all over the world. We gratefully acknowledge their contributions. Special thanks to Kristian Andersen, Josh Batson, David Blazes, Jesse Bloom, Peter Bogner, Anderson Brito, Matt Cotten, Ana Crisan, Tulio de Oliveira, Gytis Dudas, Vivien Dugan, Karl Erlandson, Nuno Faria, Jennifer Gardy, Nate Grubaugh, Becky Kondor, Dylan George, Ian Goodfellow, Betz Halloran, Christian Happi, Jeff Joy, Paul Kellam, Philippe Lemey, Nick Loman, Duncan MacCannell, Erick Matsen, Sebastian Maurer-Stroh, Placide Mbala, Danny Park, Oliver Pybus, Andrew Rambaut, Colin Russell, Pardis Sabeti, Katherine Siddle, Kristof Theys, Dave Wentworth, Shirlee Wohl and Cecile Viboud for comments, suggestions and data sharing.

Nextstrain is supported by

logologologologologologologologo