Nextstrain

Real-time tracking of pathogen evolution

About us

An open-source project to harness the scientific and public health potential of pathogen genome data

Core pathogens

Continually updated views of a range of pathogens maintained by the Nextstrain team

SARS-CoV-2

Up-to-date analyses and a range of resources for SARS-CoV-2, the virus responsible for COVID-19 disease

Open source tooling

Bioinformatic workflows, analysis tools and visualization apps for use by the community

Nextclade

In-browser phylogenetic placement, clade assignment, mutation calling and sequence quality checks

Nextstrain Groups

Datasets and narratives shared by research labs, public health entities and others

Featured analyses

Andes Hantavirus

Phylogenies of ANDV (L, G & S segments)

Measles outbreak

Spread of 2025-2026 North America measles outbreak

BDBV (Ebola)

Bundibugyo ebolavirus genomes

SARS-CoV-2

Ongoing evolution and spread of SARS-CoV-2

Mpox in the DRC

INRB analysis of ongoing mpox clade I outbreak in the DRC

Tuberculosis

Mycobacterium tuberculosis evolution and spread

All Ebola outbreaks

Evolutionary history of all Zaire ebolavirus outbreaks

HPAI outbreak

Highly pathogenic avian influenza in North America

Seasonal influenza

Seasonal influenza evolution and antigenic drift

H5N1 cattle outbreak

Influenza H5N1 cattle outbreak in the USA (genotype B3.13)

Avian influenza

Avian influenza A/H5N1 evolution and spread

SARS-CoV-2 lineages

Nextclade reference tree of SARS-CoV-2 Pango lineages

SARS-CoV-2 forecasts

SARS-CoV-2 clade and lineage frequency dynamics

Oropouche

Oropouche virus evolution and spread

Mpox

Evolution and spread of mpox clades I and II

Lassa

Lassa virus evolution and spread

Yersinia pestis

Historical analysis of geographic spread of plague

Mumps

Mumps virus evolution and spread

Nipah

Nipah virus evolution and spread

Norovirus

Norovirus evolution and spread

Rubella

Rubella virus evolution and spread

RSV

RSV evolution and spread

Ebola in the DRC

Genomic epidemiology of the 2018-20 Ebola epidemic in DRC

WNV in the Americas

Analysis of twenty years of West Nile virus spread in the Americas

Chikungunya

Molecular epidemiology of Chikungunya virus

HIV

Genetic diversity of HIV Env

Rabies

Rabies virus evolution and spread

Yellow Fever

Yellow fever virus evolution and spread

HMPV

Human metapneumovirus evolution and spread

Philosophy

Pathogen Phylogenies

In the course of an infection and over an epidemic, pathogens naturally accumulate random mutations to their genomes. This is an inevitable consequence of error-prone genome replication. Since different genomes typically pick up different mutations, mutations can be used as a marker of transmission in which closely related genomes indicate closely related infections. By reconstructing a phylogeny we can learn about important epidemiological phenomena such as spatial spread, introduction timings and epidemic growth rate.

Actionable Inferences

However, if pathogen genome sequences are going to inform public health interventions, then analyses have to be rapidly conducted and results widely disseminated. Current scientific publishing practices hinder the rapid dissemination of epidemiologically relevant results. We thought an open online system that implements robust bioinformatic pipelines to synthesize data from across research groups has the best capacity to make epidemiologically actionable inferences.

This Website

This website aims to provide a real-time snapshot of evolving pathogen populations and to provide interactive data visualizations to virologists, epidemiologists, public health officials and citizen scientists. Through interactive data visualizations, we aim to allow exploration of continually up-to-date datasets, providing a novel surveillance tool to the scientific and public health communities.

Future Directions

Nextstrain is under active development and we have big plans for its future, including visualization, bioinformatics analysis and an increasing number and variety of datasets. If you have any questions or ideas, please contact us.

A bioinformatics and data viz toolkit

Nextstrain provides an open-source toolkit enabling the bioinformatics and visualization you see on this site. Tweak our analyses and create your own using the same tools we do. We aim to empower the wider genomic epidemiology and public health communities.

Resources

Tools

Support

About

Hadfield et al., Nextstrain: real-time tracking of pathogen evolution, Bioinformatics (2018)

The core Nextstrain team is

Please see the team page for more details.

All source code is freely available under the terms of the GNU Affero General Public License. Screenshots may be used under a CC-BY-4.0 license and attribution to nextstrain.org must be provided.

This work is made possible by the open sharing of genetic data by research groups from all over the world. We gratefully acknowledge their contributions. Special thanks to Kristian Andersen, Josh Batson, David Blazes, Jesse Bloom, Peter Bogner, Anderson Brito, Matt Cotten, Ana Crisan, Tulio de Oliveira, Gytis Dudas, Vivien Dugan, Karl Erlandson, Nuno Faria, Jennifer Gardy, Nate Grubaugh, Becky Kondor, Dylan George, Ian Goodfellow, Betz Halloran, Christian Happi, Jeff Joy, Paul Kellam, Philippe Lemey, Nick Loman, Duncan MacCannell, Erick Matsen, Sebastian Maurer-Stroh, Placide Mbala, Danny Park, Oliver Pybus, Andrew Rambaut, Colin Russell, Pardis Sabeti, Katherine Siddle, Kristof Theys, Dave Wentworth, Shirlee Wohl and Cecile Viboud for comments, suggestions and data sharing.

Nextstrain is supported by