Do you have data with just first names or even just first initials but no information on the person’s gender/sex? If you would like better insights on your customers, based on whether they are likely male or female, then this data download is a great way to maximize your ROI! Download it today and begin using it to tailor your messaging and improve future communications.
There are three licenses available for this data: individual, corporate, and corporate for multi-company consumers. The individual version is available free (with discount code) for a limited time. Simply select the Individual license above for purchase and use discount code discfreepers at the checkout page; this will deduct $3.99 from your purchase price.
The primary table in this data download is First names by Freakalytics, with 5,164 rows (distinct names and common misspellings). You can use this data to guess whether someone is male or female based on their first name, or to find the probability that they are male or female.
Here is the column information and simple summaries for this table:
Data Column | Max | Min | Average | Median | Mode |
---|---|---|---|---|---|
Name mixed case | Zulma | Aaron | N/A | N/A | James |
Most likely gender | Male | Female | N/A | N/A | Female |
Rank Overall | 4,019 | 1 | 2,354 | 2,397 | 4,019 |
Male Probability | 100% | 0% | 22% | 0% | 0% |
Female Probability | 100% | 0% | 78% | 100% | 100% |
Count Either Gender | 99,989 | 32 | 1,079 | 127 | 32 |
Male Count | 99,671 | 0 | 524 | 0 | 0 |
Female Count | 83,718 | 0 | 555 | 64 | 32 |
Male Probability Within | 3.68% | 0.00% | 0.08% | 0.01% | 0.00% |
Female Probability Within | 2.92% | 0.00% | 0.02% | 0.00% | 0.00% |
Male Rank | 1,054 | 1 | 584 | 608 | 1,054 |
Female Rank | 3,052 | 1 | 1,825 | 2,131 | 3,052 |
Name first initial | Z | A | N/A | N/A | J |
Name upper case | ZULMA | AARON | N/A | N/A | JAMES |
The top few rows from this table (as a snapshot of the data in Excel 2003 format and in text):
Name mixed case | Most likely gender | Rank Overall | Male Probability | Female Probability | Count Either Gender | Male Count | Female Count | Male Probability Within | Female Probability Within | Male Rank | Female Rank | Name first initial | Name upper case |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
James | Male | 1 | 99.7% | 0.3% | 99,989 | 99,671 | 318 | 3.6847% | 0.0111% | 1 | 867 | J | JAMES |
John | Male | 2 | 99.6% | 0.4% | 98,641 | 98,259 | 382 | 3.6325% | 0.0133% | 2 | 790 | J | JOHN |
Robert | Male | 3 | 99.7% | 0.3% | 94,669 | 94,414 | 255 | 3.4903% | 0.0089% | 3 | 982 | R | ROBERT |
Mary | Female | 4 | 0.3% | 99.7% | 83,988 | 270 | 83,718 | 0.0100% | 2.9225% | 699 | 1 | M | MARY |
Michael | Male | 5 | 99.5% | 0.5% | 79,356 | 78,974 | 382 | 2.9195% | 0.0133% | 4 | 790 | M | MICHAEL |
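As a rough illustration of the name lookup described above, the following Python sketch rebuilds the Male Probability and Female Probability columns from the Male Count and Female Count values in the five sample rows. The `SAMPLE_ROWS` list, the `COUNTS` dictionary, and the `guess_gender` function are hypothetical names chosen for this example and are not part of the download. Matching on the upper-case name is likewise an assumption, based on the Name upper case column.

```python
# Sample rows copied from the snapshot above: (name upper case, male count, female count).
# A real run would load every row of the table instead of these five.
SAMPLE_ROWS = [
    ("JAMES", 99_671, 318),
    ("JOHN", 98_259, 382),
    ("ROBERT", 94_414, 255),
    ("MARY", 270, 83_718),
    ("MICHAEL", 78_974, 382),
]

# Hypothetical lookup structure keyed on the upper-case name.
COUNTS = {name: (male, female) for name, male, female in SAMPLE_ROWS}


def guess_gender(first_name):
    """Return (most likely gender, male probability, female probability),
    or None when the name is not in the table."""
    key = first_name.strip().upper()
    if key not in COUNTS:
        return None
    male, female = COUNTS[key]
    total = male + female  # corresponds to the Count Either Gender column
    male_p = male / total
    female_p = female / total
    likely = "Male" if male_p >= female_p else "Female"
    return likely, round(male_p, 3), round(female_p, 3)


print(guess_gender("mary"))
print(guess_gender(" James "))
print(guess_gender("Zed"))  # not in the sample rows, so None
```

Returning `None` for unknown names keeps the "no match" case explicit, so a caller can fall back to the first-initial table rather than guessing silently.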
The second table in this data download is First initial by Freakalytics, with 26 rows. If you only have people’s first initials, you can use this table to guess whether each person is male or female, or use the probabilities of their being male or female.
Here is the column information and simple summaries for this table:
Data Column | Max | Min | Average | Median | Mode |
---|---|---|---|---|---|
First initial | Z | A | N/A | N/A | J |
Most likely gender | Male | Female | N/A | N/A | Female |
Disparity between genders | 93.9% | 0.2% | 29.1% | 23.7% | N/A |
Female Probability | 97.0% | 16.8% | 54.0% | 56.0% | N/A |
Male Probability | 83.2% | 3.0% | 46.0% | 44.0% | N/A |
Female Initial Probability Within | 12.4% | 0.0% | 3.8% | 3.0% | N/A |
Male Initial Probability Within | 17.6% | 0.0% | 3.8% | 3.2% | N/A |
Female Distinct Names | 424 | 5 | 164 | 122 | 238 |
Male Distinct Names | 100 | 1 | 47 | 45 | 1 |
Female Rank First Initial | 26 | 1 | 13 | 14 | 8 |
Male Rank First Initial | 26 | 1 | 14 | 14 | N/A |
Female Count in 1990 Census Analysis | 353,967 | 255 | 110,178 | 85,355 | N/A |
Male Count in 1990 Census Analysis | 477,081 | 300 | 104,039 | 87,530 | N/A |
First initial | Z | A | N/A | N/A | J |
The top few rows from this table (as a snapshot of the data in Excel 2003 format and in text):
First initial | Most likely gender | Disparity between genders | Female Probability | Male Probability | Female Initial Probability Within | Male Initial Probability Within | Female Distinct Names | Male Distinct Names | Female Rank First Initial | Male Rank First Initial | Female Count in 1990 Census Analysis | Male Count in 1990 Census Analysis | First initial |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Y | Female | 93.9% | 97.0% | 3.0% | 0.4% | 0.0% | 47 | 2 | 20 | 25 | 12,457 | 390 | Y |
W | Male | 66.5% | 16.8% | 83.2% | 0.9% | 4.7% | 43 | 44 | 22 | 8 | 25,483 | 126,642 | W |
V | Female | 53.1% | 76.6% | 23.4% | 2.2% | 0.7% | 104 | 17 | 15 | 19 | 64,183 | 19,645 | V |
U | Female | 52.2% | 76.1% | 23.9% | 0.0% | 0.0% | 7 | 1 | 24 | 26 | 956 | 300 | U |
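The first-initial fallback can be sketched the same way. This Python example uses only the four sample rows shown above and recomputes the probabilities from the two census count columns. The names `INITIAL_COUNTS` and `initial_probabilities` are hypothetical. Treating Disparity between genders as the absolute difference between the female and male probabilities is an assumption; it matches the sample rows but is not stated in the source.

```python
# Sample rows copied from the snapshot above: initial -> (female count, male count).
# A real run would load all 26 rows of the table.
INITIAL_COUNTS = {
    "Y": (12_457, 390),
    "W": (25_483, 126_642),
    "V": (64_183, 19_645),
    "U": (956, 300),
}


def initial_probabilities(initial):
    """Return gender probabilities for a first initial, or None if it is not loaded."""
    key = initial.strip().upper()[:1]
    if key not in INITIAL_COUNTS:
        return None
    female, male = INITIAL_COUNTS[key]
    total = female + male
    female_p = female / total
    male_p = male / total
    return {
        "most_likely": "Female" if female_p >= male_p else "Male",
        "female": round(female_p, 3),
        "male": round(male_p, 3),
        # Assumed definition: absolute gap between the two probabilities.
        "disparity": round(abs(female_p - male_p), 3),
    }


print(initial_probabilities("w"))
print(initial_probabilities("Yolanda"))  # only the first letter is used
```

A large disparity (as for Y) means the initial alone is a fairly strong signal, while a small one suggests treating the guess as close to a coin flip.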