Alcohol Consumption

Good morning everyone,

For my master’s thesis, I am conducting a trend analysis of alcohol consumption in Germany in relation to socioeconomic status over time. While reviewing the SOEP dataset, I noticed that the variables related to alcohol consumption are either unavailable in some waves (for example, the early waves did not include any alcohol-related questions) or are inconsistently distributed across different waves. For instance, in 2006, the individual questionnaire included the question “How much alcohol do you drink?”, whereas in 2009, a different question—“Have you ever drunk alcohol?”—appeared only in the youth questionnaire.

My main challenge now is how to harmonize these scattered variables into consistent measures that can be used for longitudinal analysis.

My first question is whether there is a way to determine exactly in which waves alcohol-related questions were included.

My second question is whether you could provide any guidance or suggestions on how to harmonize these inconsistently measured variables across waves.

Thank you very much for your help.

Best regards,
Christian Carrillo Gonzalez

Hi Christian,

Thank you for your question.

The topic of alcohol consumption came up in the years 2006, 2008 and 2010 in the SOEP and then changed to another new form of surveying it in 2016. Since 2016, we ask it every four years with the same items at least. You can find all variables in the row “Alcohol consumption” in our SOEPcompanion topic list here: https://companion.soep.de/Topics_of_SOEPcore/Health%20and%20Care.html

Whenever possible, our Research Infrastructure team harmonizes items over time. When the colleagues don’t see a way to harmonize, they use different variable names. Then it’s up to the users whether to put together items for a trend analysis.

Best regards
Philipp