Hierarchical clustering of the actual and null datasets using the Chari et al. methodology. Counts were taken for SAGE tags that met a p-value cutoff of 0.05 against the null hypothesis that expression in never smokers equals expression in current smokers. The counts were row normalized and then clustered by single-linkage hierarchical clustering, with a distance metric based on the Pearson correlation. The left-hand tree represents the actual dataset and the right-hand tree represents the null dataset.
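
The clustering step described in this caption can be sketched as follows. This is a minimal illustration under stated assumptions, not the original analysis code. It assumes that "row normalized" means dividing each tag's counts by that tag's total, that the samples (columns) are the items being clustered, and that the distance is 1 minus the Pearson correlation. The function name `cluster_samples` and the synthetic counts are hypothetical.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage


def cluster_samples(counts):
    """Row-normalize a tags x samples count matrix and cluster the samples.

    Assumptions (not confirmed by the source): each row is divided by its
    row sum, and the columns (samples) are the items being clustered.
    """
    counts = np.asarray(counts, dtype=float)
    row_sums = counts.sum(axis=1, keepdims=True)
    row_sums[row_sums == 0] = 1.0  # avoid division by zero for empty rows
    normalized = counts / row_sums

    # SciPy's "correlation" metric is 1 - Pearson r between observation vectors.
    # linkage clusters the rows of its input, so transpose to cluster samples.
    tree = linkage(normalized.T, method="single", metric="correlation")
    return normalized, tree


# Hypothetical data: 20 tags (rows) by 6 samples (columns).
rng = np.random.default_rng(0)
example_counts = rng.poisson(lam=5.0, size=(20, 6)) + 1
normalized, tree = cluster_samples(example_counts)
```

The returned `tree` is a SciPy linkage matrix, which can be passed to `scipy.cluster.hierarchy.dendrogram` to draw a tree. Running the same function on the actual and the null count matrices would give the two trees for a side-by-side comparison.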