The data files that are provided on the Understanding America Study data pages provide cleaned versions of the raw data. Cleaned here means the followings:
- Answers given to questions that are not applicable anymore at survey completion (for example because a respondent went back in the survey and skipped over a previously answered question) are treated as if the questions were never asked. In the data files all questions that were asked, but not answered by the respondent are marked with ".e". All questions never seen by the respondent (or any dirty data) are marked with ".". The latter may mean that a respondent did not view the question because s/he skipped over it; or alternatively that s/he never reached that question in the survey due to a survey break off.
- For incomplete surveys the end date variables are set to ".c".
- Raw standard data variables are processed to match the standard variables and basic demographics listed here. As such, their names may differ from those in the survey codebook.
Default variables
The mapping between the cleaned default data variables and the raw data is as follows:
Cleaned data variable | Raw data variable |
---|---|
uasid | Equivalent of prim_key. |
primary_respondent |
No equivalent. Is derived from sampletype as follows:
|
uashhid | Equivalent of uashhid. |
survhhid | No equivalent. Calculated using cantor pairing of the uasid of all other members of the same UAS household at the time of the survey. |
uasmembers | No equivalent. Calculated as the sum of all other members of the same UAS household at the time of the survey. |
sampletype |
No equivalent. Is derived from sampletype as follows:
|
batch |
No equivalent. Is derived from sampletype as follows:
|
language | Equivalent of language. |
start_date (start_year, start_month, start_day, start_hour, start_min, start_sec) | No equivalent. Is derived from begintime. |
end_date (end_year, end_month, end_day, end_hour, end_min, end_sec) | No equivalent. Is derived from endtime. For incomplete surveys the end date variables are set to ".c". |
Background demographics
The mapping between the cleaned basic demographic variables and the raw data is as follows:
Cleaned data variable | Raw data variable |
---|---|
gender | Equivalent of gender. |
dateofbirth_year | Equivalent of dateofbirth_year. |
dateofbirth_month | Equivalent of dateofbirth_month. |
dateofbirth_day | Equivalent of dateofbirth_day. |
age | No equivalent. Calculated based on dateofbirth_year, dateofbirth_month and dateofbirth_day. |
agerange | Equivalent of agerange. Is set to ".a" if age is known. |
citizenus |
Recoded equivalent of citizenus:
|
bornus |
Recoded equivalent of bornus:
|
stateborn | Equivalent of stateborn. Is set to ".a" if bornus = 0. |
countryborn | Equivalent of countryborn. Is set to ".a" if bornus = 1. |
countryborn_other | Equivalent of countryborn_other. Is set to ".a" if bornus = 1 or countryborn != 300. |
statereside | Equivalent of statereside. |
maritalstatus | Equivalent of maritalstatus. |
livewithpartner |
Recoded equivalent of livewithpartner:
|
education | Equivalent of education. |
hisplatino |
Recoded equivalent of spanish:
|
hisplatinogroup | Equivalent of spanishgroup. Is set to ".a" if hisplatino = 0. |
white |
No equivalent. Is derived from race as follows:
|
black |
No equivalent. Is derived from race as follows:
|
nativeamer |
No equivalent. Is derived from race as follows:
|
asian |
No equivalent. Is derived from race as follows:
|
pacific |
No equivalent. Is derived from race as follows:
|
race |
No equivalent. Is derived from race as follows:
|
working |
No equivalent. Is derived from laborstatus as follows:
|
sick_leave |
No equivalent. Is derived from laborstatus as follows:
|
unemp_layoff |
No equivalent. Is derived from laborstatus as follows:
|
unemp_look |
No equivalent. Is derived from laborstatus as follows:
|
retired |
No equivalent. Is derived from laborstatus as follows:
|
disabled |
No equivalent. Is derived from laborstatus as follows:
|
lf_other |
No equivalent. Is derived from laborstatus as follows:
|
laborstatus |
No equivalent. Is derived from laborstatus as follows:
|
employmenttype | Equivalent of employmenttype. |
workfullpart | Equivalent of workfullpart. |
hourswork | Equivalent of hourswork. |
hhincome | Equivalent of hhincome. |
hhmembernumber | No equivalent. Is derived from hhmemberactive_# by counting the number of instances with value equal to 1. |
anyhhmember |
No equivalent. Is derived from hhmembers_anyone, hhmembers_new, hhmemberactive_# as follows:
|
hhmemberin_# |
No equivalent. Is derived from hhmemberactive_# as follows:
|
hhmembergen_# | Equivalent of hhmembergender_#. |
hhmemberage_# | Equivalent of hhmemberage_#. |
hhmemberrel_# | Equivalent of hhmemberrelationship_#. |
hhmemberuasid_# | Equivalent of hhmemberuasrtid_#. |
lastmhyhh_date | Reformatted from lastmyhh. |