The data files on the Understanding America Study data pages contain cleaned versions of the raw data. Cleaned here means the following:
- Answers to questions that no longer apply at survey completion (for example, because a respondent went back in the survey and skipped over a previously answered question) are treated as if the questions were never asked. In the data files, all questions that were asked but not answered by the respondent are marked with ".e". All questions never seen by the respondent (or any dirty data) are marked with ".". The latter may mean either that the respondent skipped over the question and so never viewed it, or that the respondent never reached it because of a survey break-off.
- For incomplete surveys the end date variables are set to ".c".
- Raw standard data variables are processed to match the standard variables and basic demographics listed here. As such, their names may differ from those in the survey codebook.
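
The missing-value markers described above can be handled explicitly when reading the data. The following is a minimal sketch, assuming the data have been exported to text so that the markers appear as literal strings; the names `MISSING_CODES`, `classify_missing` and `to_number` are hypothetical and not part of any UAS tooling.

```python
from typing import Optional

# Descriptions paraphrase the cleaning rules listed above.
# The dictionary and function names are illustrative assumptions.
MISSING_CODES = {
    ".e": "asked but not answered",
    ".": "never seen by the respondent (or dirty data)",
    ".c": "end date of an incomplete survey",
}


def classify_missing(value: str) -> Optional[str]:
    """Return a description if value is a missing marker, else None."""
    return MISSING_CODES.get(value.strip())


def to_number(value: str) -> Optional[float]:
    """Convert a cell to float, mapping any missing marker to None."""
    if classify_missing(value) is not None:
        return None
    return float(value)
```

Keeping the markers distinct for as long as possible, rather than collapsing them all into one generic missing value, preserves the difference between a question that was skipped and one that was never shown.
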
Default variables
The mapping between the cleaned default data variables and the raw data is as follows:
| Cleaned data variable | Raw data variable |
|---|---|
| uasid | Equivalent of prim_key. |
| primary_respondent | No equivalent. Is derived from sampletype as follows: |
| uashhid | Equivalent of uashhid. |
| uashhid_current | Equivalent of uashhid_current. |
| survhhid | No equivalent. Calculated using cantor pairing of the uasid of all other members of the same UAS household at the time of the survey. |
| uasmembers | No equivalent. Calculated as the sum of all other members of the same UAS household at the time of the survey. Not available in data sets after October 8, 2024. |
| sampleframe | No equivalent. Is derived from sampletype as follows: |
| batch | No equivalent. Is derived from sampletype as follows: |
| language | Equivalent of language. |
| start_date (start_year, start_month, start_day, start_hour, start_min, start_sec) | No equivalent. Is derived from begintime. |
| end_date (end_year, end_month, end_day, end_hour, end_min, end_sec) | No equivalent. Is derived from endtime. For incomplete surveys the end date variables are set to ".c". |
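
The table describes survhhid as being calculated by Cantor pairing of uasid values. As an illustration of that technique only, here is a minimal sketch of the standard Cantor pairing function. How the pairing is extended to more than two household members is not stated in the source, so the sorted left fold in `pair_many` is an assumption, as is treating uasid values as non-negative integers; both function names are hypothetical.

```python
from functools import reduce
from typing import Iterable


def cantor_pair(a: int, b: int) -> int:
    """Standard Cantor pairing: maps two non-negative integers to one."""
    if a < 0 or b < 0:
        raise ValueError("Cantor pairing is defined for non-negative integers")
    s = a + b
    return s * (s + 1) // 2 + b


def pair_many(ids: Iterable[int]) -> int:
    """Fold Cantor pairing over sorted ids (assumed extension, not the UAS rule)."""
    ordered = sorted(ids)  # sorting makes the result independent of input order
    if not ordered:
        raise ValueError("need at least one id")
    return reduce(cantor_pair, ordered)
```

Because the pairing function is not symmetric (`cantor_pair(1, 2)` differs from `cantor_pair(2, 1)`), some fixed ordering of the identifiers is needed for a household to map to a single value; sorting is one simple choice.
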
Background demographics
The mapping between the cleaned basic demographic variables and the raw data is as follows:
| Cleaned data variable | Raw data variable |
|---|---|
| sex | Equivalent of sex (gender of the respondent, named gender in data sets prior to October 8, 2024). |
| genderid | Equivalent of newgender (empty if My Household Version 3 not answered yet). |
| sexualorientation | Equivalent of sexualorientation (empty if My Household Version 3 not answered yet). |
| dateofbirth_year | Equivalent of dateofbirth_year. |
| age | No equivalent. Calculated based on dateofbirth_year, dateofbirth_month and dateofbirth_day. |
| agerange | Equivalent of agerange. Is set to ".a" if age is known. |
| citizenus | Recoded equivalent of citizenus: |
| bornus | Recoded equivalent of bornus: |
| stateborn | Equivalent of stateborn. Is set to ".a" if bornus = 0. |
| countryborn | Equivalent of countryborn. Is set to ".a" if bornus = 1. |
| countryborn_other | Equivalent of countryborn_other. Is set to ".a" if bornus = 1 or countryborn != 300. Not available in data sets after October 8, 2024. |
| statereside | Equivalent of statereside. |
| maritalstatus | Equivalent of maritalstatus. |
| livewithpartner | Recoded equivalent of livewithpartner: |
| education | Equivalent of education. |
| hisplatino | Recoded equivalent of spanish: |
| hisplatinogroup | Equivalent of spanishgroup and spanishgroup_new. Is set to ".a" if hisplatino = 0. Available in data sets after October 8, 2024 on request. |
| white | No equivalent. Is derived from race as follows: |
| black | No equivalent. Is derived from race as follows: |
| nativeamer | No equivalent. Is derived from race as follows: |
| asian | No equivalent. Is derived from race as follows: |
| pacific | No equivalent. Is derived from race as follows: |
| mena | Is equivalent to mena. |
| race | No equivalent. Is derived from white, black, nativeamer, asian and pacific as follows: |
| working | No equivalent. Is derived from laborstatus_13 (laborstatus if My Household Version 3 not answered yet) as follows: |
| sick_leave | No equivalent. Is derived from laborstatus_13 (laborstatus if My Household Version 3 not answered yet) as follows: |
| unemp_layoff | No equivalent. Is derived from laborstatus_13 (laborstatus if My Household Version 3 not answered yet) as follows: |
| unemp_look | No equivalent. Is derived from laborstatus_13 (laborstatus if My Household Version 3 not answered yet) as follows: |
| retired | No equivalent. Is derived from laborstatus_13 (laborstatus if My Household Version 3 not answered yet) as follows: |
| disabled | No equivalent. Is derived from laborstatus_13 (laborstatus if My Household Version 3 not answered yet) as follows: |
| workemployer | No equivalent. Is derived from laborstatus_13 (empty if My Household Version 3 not answered yet) as follows: |
| workself | No equivalent. Is derived from laborstatus_13 (empty if My Household Version 3 not answered yet) as follows: |
| homemaker | No equivalent. Is derived from laborstatus_13 (empty if My Household Version 3 not answered yet) as follows: |
| student | No equivalent. Is derived from laborstatus_13 (empty if My Household Version 3 not answered yet) as follows: |
| notworking | No equivalent. Is derived from laborstatus_13 (empty if My Household Version 3 not answered yet) as follows: |
| lf_other | No equivalent. Is derived from laborstatus_13 (laborstatus if My Household Version 3 not answered yet) as follows: |
| laborstatus | No equivalent. Is derived from working, sick_leave, unemp_layoff, unemp_look, retired, disabled and lf_other as follows: |
| employmenttype | Equivalent of employmenttype. Available in data sets after October 8, 2024 on request. |
| workfullpart | Equivalent of workfullpart. Available in data sets after October 8, 2024 on request. |
| hourswork | Equivalent of hourswork. |
| hhincome | Equivalent of hhincome. |
| hhmembernumber | No equivalent. Is equal to hhcomp_total for My Household Version 3. Prior to October 8, 2024 it is derived from hhmemberactive_# by counting the number of instances with value equal to 1. |
| anyhhmember | No equivalent. Indicates whether hhcomp_total > 0 for My Household Version 3. Prior to October 8, 2024 it is derived from hhmembers_anyone, hhmembers_new and hhmemberactive_#. |
| hhcomp_male_0_4 to hhcomp_other_65plus | Are equivalent to their exact counterparts. |
| hhcomp_total_18_64, hhcomp_total_65plus, hhcomp_total_adults, hhcomp_total_children and hhcomp_total | Are equivalent to the sum of the relevant variables as calculated within the My Household survey. |
| hhmemberin_# | No equivalent. Asked up until October 8, 2024. Available in data sets after October 8, 2024 on request. Is derived from hhmemberactive_# as follows: |
| hhmembergen_# | Equivalent of hhmembergender_#. Asked up until October 8, 2024. Available in data sets after October 8, 2024 on request. |
| hhmemberage_# | Equivalent of hhmemberage_#. Asked up until October 8, 2024. Available in data sets after October 8, 2024 on request. |
| hhmemberrel_# | Equivalent of hhmemberrelationship_#. Asked up until October 8, 2024. Available in data sets after October 8, 2024 on request. |
| hhmemberuasid_# | Equivalent of hhmemberuasrtid_#. Asked up until October 8, 2024. Available in data sets after October 8, 2024 on request. |
| lastmyhh_date | Reformatted from lastmyhh. |
| endtime_previous_hh_version | Equivalent of endtime_previous_hh_version. |
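
Several rows above state conditional rules: a variable is set to ".a" depending on another answer, and hhmembernumber was historically a count of hhmemberactive_# entries equal to 1. The sketch below restates those rules in code, assuming each respondent is held as a dictionary keyed by variable name with integer-coded answers. The function names are hypothetical, and the handling of inputs that are themselves missing is not specified in the source.

```python
from typing import Dict, Mapping

NOT_APPLICABLE = ".a"


def apply_skip_rules(row: Mapping[str, object]) -> Dict[str, object]:
    """Return a copy of row with the ".a" rules from the table applied."""
    out = dict(row)
    bornus = row.get("bornus")
    if bornus == 0:
        out["stateborn"] = NOT_APPLICABLE
    if bornus == 1:
        out["countryborn"] = NOT_APPLICABLE
    # The condition uses the original countryborn value, before any recoding.
    if bornus == 1 or row.get("countryborn") != 300:
        out["countryborn_other"] = NOT_APPLICABLE
    if row.get("hisplatino") == 0:
        out["hisplatinogroup"] = NOT_APPLICABLE
    return out


def count_active_members(row: Mapping[str, object]) -> int:
    """Count hhmemberactive_# entries equal to 1 (the pre-October 8, 2024 rule)."""
    return sum(
        1
        for name, value in row.items()
        if name.startswith("hhmemberactive_") and value == 1
    )
```

For example, a respondent with `bornus` equal to 1 keeps their `stateborn` value, while `countryborn` and `countryborn_other` both become ".a".
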

