H3N2 is one of the two major subtypes of influenza circulating in humans. Major outbreaks of A/H3N2 strains in humans include Hong Kong Flu (1968-1969), and Fujian flu (2003-2004).
We use the official nextclade dataset for sequence alignment and HA and NA clade assignment: nextstrain/flu/h3n2 (more specifically we use the CY121680.1 HA reference and our custom dataset genspectrum/h3n2/seg6/CY114383 for NA, for all other sequences we use the GCF_000865085.1. We have converted nextclade's clade assignments that are based on a non-open GISAID sequence to use a similar INSDC-available reference sequence.
Genspectrum uses all open influenza A data that is available on the INSDC (taxonid: 197911). To classify influenza segments and subtypes we use nextclade sort (using half of all k-mers for each subtype defined in https://github.com/anna-parker/InfluenzaAReferenceDB ) to improve classification). Where available we use the assembly information to group segments that are from the same sample/isolate. For all remaining segments we use a heuristic grouping algorithm to group all segments from the same sample/isolate using the metadata available from each segment.
For each individual influenza subtype you can view the CDS of each protein in the genome data viewer.