cbn_gender_name_stats.Rd
This data set contains for each name used in any of the studies,
(not just those in WEFAT 2) and its the gender proportions in the US population.
It was generated by the gender
package, which uses US Social Security
Administration data.
cbn_gender_name_stats
An object of class data.frame
with 210 rows and 6 columns.
The columns of the data set are name
, the name,
proportion_male
and proportion_female
, gender (a best guess
from the proportions), and the years within which the SSA search was
performed. This data set can merged with several of the study item sets,
but is most useful for replicating the second WEFAT study, as shown in the
replication vignette.
This data should typically be joined e.g. using merge
, to other item
information using the columns 'name' and 'Word' (assuming that information
comes from cbn_get_items
). The replication vignette has an example.