The H5N1 influenza subtype is enzootic in birds and can also infect other animals. Since 2018 the highly pathogenic subtype HPAI A(H5N1) has become the dominant strain in bird populations worldwide and has also caused widespread disease in cattle.
We assign clades based on the AF144305.1 HA segment, as defined in the community dataset: community/moncla-lab/iav-h5/ha/all-clades. For the other segments we use our own custom dataset based the GCF_000864105.1 assembly: genspectrum/flu/h5n1.
Genspectrum uses all open influenza A data that is available on the INSDC (taxonid: 197911). To classify influenza segments and subtypes we use nextclade sort (using half of all k-mers for each subtype defined in https://github.com/anna-parker/InfluenzaAReferenceDB ) to improve classification). Where available we use the assembly information to group segments that are from the same sample/isolate. For all remaining segments we use a heuristic grouping algorithm to group all segments from the same sample/isolate using the metadata available from each segment.
For each individual influenza subtype you can view the CDS of each protein in the genome data viewer.