This data set contains for each name used in any of the studies, (not just those in WEFAT 2) and its the gender proportions in the US population. It was generated by the gender package, which uses US Social Security Administration data.

cbn_gender_name_stats

Format

An object of class data.frame with 210 rows and 6 columns.

Details

The columns of the data set are name, the name, proportion_male and proportion_female, gender (a best guess from the proportions), and the years within which the SSA search was performed. This data set can merged with several of the study item sets, but is most useful for replicating the second WEFAT study, as shown in the replication vignette.

This data should typically be joined e.g. using merge, to other item information using the columns 'name' and 'Word' (assuming that information comes from cbn_get_items). The replication vignette has an example.