Hey,
This is great!
Thank you so much for your guidance:)
Original Message:
Sent: Sun May 30, 2021 07:08 PM
From: Jon Peck
Subject: Data quality check for a multiple response question
Such conditions can be defined as standard COMPUTE commands, but if you have the Data Validation option (now included in Base), you can define single and cross-variable conditions and get a set of reports on violations.
Here is a compute that checks this particular condition.
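* Flags Q1 = yes with no Q2 item selected, or Q1 not yes (including blank) with a Q2 item selected.
compute invalid = (Q1 eq 1 and not any(1, Q2_1 to Q2_4)) or
    ((missing(Q1) or Q1 ne 1) and any(1, Q2_1 to Q2_4)).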
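To see how often each scenario occurs in each state, something along these lines might work. It is only a sketch: the variable name state is an assumption (substitute whatever identifies the state in your file), Q2_1 to Q2_4 are assumed to be adjacent in the file, and "an answer was provided for Q2" is taken to mean that at least one Q2 item is coded 1.

```
* Count how many Q2 items are coded 1 (the result is 0 when none are, including when all are missing).
count n_q2 = Q2_1 to Q2_4 (1).
* Classify each case into at most one scenario, with 0 meaning no violation.
compute scenario = 0.
if (Q1 eq 1 and n_q2 eq 0) scenario = 1.
if (Q1 eq 2 and n_q2 gt 0) scenario = 2.
if (Q1 eq 3 and n_q2 gt 0) scenario = 3.
if (missing(Q1) and n_q2 gt 0) scenario = 4.
value labels scenario
  0 "No violation"
  1 "Q1 yes, no Q2 item selected"
  2 "Q1 no, Q2 item selected"
  3 "Q1 don't know, Q2 item selected"
  4 "Q1 blank, Q2 item selected".
* Row percentages give the share of each state's cases in each scenario (state is a hypothetical name).
crosstabs /tables = state by scenario /cells = count row.
```

COUNT is used here instead of ANY because it always returns a non-missing number, so blank Q2 items do not turn the scenario tests into missing values.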
--
Original Message:
Sent: 5/30/2021 6:31:00 PM
From: Rose
Subject: Data quality check for a multiple response question
Hey,
I have a multiple response question which should be answered if a "yes" response was selected on a previous question. However, I have noticed data entry errors.
Let's say that Q1 asks - do you have a medical condition? (coded as 1-yes, 2-no, 3-don't know)
Q2 should be answered if 'yes' was selected for Q1 (each Q2 item is coded as 0-no, 1-yes)
1. back problems
2. high blood pressure
3. diabetes
4. other
Some scenarios I have noticed...
1. Yes was selected for Q1; however, no answer was provided for Q2
2. No was selected for Q1; however, an answer(s) was provided for Q2
3. Don't know was selected for Q1; however, an answer(s) was provided for Q2
4. Q1 was left blank; however, an answer(s) was provided for Q2
I am dealing with a large data set, so I would like to know how I can perform a quality check for the scenarios above. The data come from different states, so I would also like to find the percentage of cases affected by each scenario (if any) in each state. Based on that, I can set a threshold for disqualifying (DQ) a state from the analysis.
Any guidance is appreciated.
Thanks.
#SPSSStatistics