Dear Mr. Schönberger,
Thank you for your message and suggestions.
I’m interested in the gender differences in the transition to parenthood. Maybe it was not clear in previous messages, but I don’t compare people between cohorts, but compare men and women within each cohort.
Based on your explanation, I checked the number of parents, children, share of people who became parents, average number of children per parent, and the average number of children per person:
Here is my code in r:
basic ← read_stata(„P:/NEPS_project 1/Data/Data_sc6/Stata14/SC6_Basics_D_14-0-0.dta“)
child ← read_stata(„P:/NEPS_project 1/Data/Data_sc6/Stata14/SC6_Children_D_14-0-0.dta“)
Keep info only on people’s year/month of birth, their ID and their gender
basic_merge ← subset(basic, select = c(1:4))
Merge basic info with data on children
child_parent ← merge(x = child, y = basic_merge, by=„ID_t“, all.x =T)
Keep only the biological children
child_parent ← subset(child_parent, tx27100==1)
Calculate the cohort
child_parent$cohort ← 10*(floor((child_parent$t70000y-1900)/10))
Calculate the total number of children per cohort per gender (since each row corresponds to one child, the number of rows is the number of children)
df1 ← child_parent%>%
group_by(t700001, cohort)%>%
summarise(n_kid=n())
Calculate the number of person who became a parent by cohort and gender
df2 ← child_parent %>%
group_by(t700001, cohort) %>%
summarise(parent = n_distinct(ID_t))
Merge the two tables to calculate the number of children per person by cohort and sex
df3 ← left_join(df1, df2, by = c(„cohort“, „t700001“))
df3$kid_per_parent ← df3$n_kid/df3$parent
check the number of respondent by gender per cohort
basic$cohort ← 10*(floor((basic$t70000y-1900)/10))
df5 ← basic %>%
group_by(t700001, cohort) %>%
summarise(people = n_distinct(ID_t))
df6 ← left_join(df3, df5, by =c(„cohort“, „t700001“))
df6$share_parent ← df6$parent/df6$people
df6$kid_per_person ← df6$n_kid / df6$people
################################
Then I got this:
t700001 |
|
cohort |
n_kid |
parent |
kid_per_parent |
people |
share_parent |
kid_per_person |
1 [[m] |
männlich] |
40 |
1640 |
806 |
2.03 |
1017 |
0.793 |
1.61 |
1 [[m] |
männlich] |
50 |
3962 |
1927 |
2.06 |
2360 |
0.817 |
1.68 |
1 [[m] |
männlich] |
60 |
3990 |
1961 |
2.03 |
2581 |
0.76 |
1.55 |
1 [[m] |
männlich] |
70 |
1814 |
942 |
1.93 |
1414 |
0.666 |
1.28 |
1 [[m] |
männlich] |
80 |
651 |
388 |
1.68 |
1117 |
0.347 |
0.583 |
2 [[w] |
weiblich] |
40 |
1556 |
775 |
2.01 |
927 |
0.836 |
1.68 |
2 [[w] |
weiblich] |
50 |
4058 |
1975 |
2.05 |
2357 |
0.838 |
1.72 |
2 [[w] |
weiblich] |
60 |
4834 |
2362 |
2.05 |
2813 |
0.84 |
1.72 |
2 [[w] |
weiblich] |
70 |
2553 |
1238 |
2.06 |
1557 |
0.795 |
1.64 |
2 [[w] |
weiblich] |
80 |
926 |
507 |
1.83 |
997 |
0.509 |
0.929 |
So the problem of the gender gap in the transition to the parenthood for the 70s and 80s (my first post here), is that more men born in the 80s cohort (who have not became fathers) are in the sample (1117 vs997). And the share of father is smaller than the share of mother (0.666 vs 0.795) for the 70s cohort.
Please correct me if you found any mistakes.
Best regards,
Chen