Significance tests vs total in custom tables

View Only

Expand all | Collapse all

Significance tests vs total in custom tables

1. Significance tests vs total in custom tables

1 Like
Christine L
Posted Fri April 05, 2024 10:22 AM

Reply
Hi team,
we are using significance tests in custom tables.
Next to countries as columns we also include a total column and want to test all other columns not only against each other but also against total.
Can you kindly advise how to do that in syntax?

CTABLES
/VLABELS VARIABLES=Q34 dcountry
DISPLAY=LABEL
/TABLE Q34 [MEAN] BY dcountry
/CATEGORIES VARIABLES=dcountry ORDER=A KEY=VALUE EMPTY=INCLUDE total=yes
/CRITERIA CILEVEL=95
/COMPARETEST TYPE=MEAN ALPHA=0.05 ADJUST=BONFERRONI ORIGIN=COLUMN INCLUDEMRSETS=YES
CATEGORIES=ALLVISIBLE MEANSVARIANCE=ALLCATS MERGE=NO SHOWSIG=NO
/TITLES title="Subscription Metrics".

Thanks much for your support.

------------------------------
Christine Loof
------------------------------
2. RE: Significance tests vs total in custom tables

0 Like
Kirill Orlov
Posted Fri April 05, 2024 12:15 PM
Edited by Kirill Orlov Fri April 05, 2024 01:55 PM

Reply
There is no such option. Because - tell us - which way to go to (say) "compare a percentage of a yes-response in a group with that in the total sample".

Treating the total's percent as a fixed number and do the one-sample test? But total sample isn't the population, its percent has its error. (And, besides, mind that the same every respondent who effected the group's percent participates one more time to form the total sample percent. This violates the assumption that the large population must be independent from the sample from it in the sense that it is infinitely larger.)

Alternatively, comparing a group with the total would be comparing that group with all the rest groups joint. This way, two groups are compared. It is legal. However, each group is thus gets compared with a nonfixed group, the group percent - compared with "total" percent which changes each time, each comparison.

The third option is to break - in advance - each of your groups (subsamples) randomly into two parts: one (a bigger part) will still represent that group; the other (smaller and proportional to its share in the population) - will be sent to join with the other such parts from the other groups - to form the "group representing the population". So, having k groups initially, you end with k+1 groups and compare the k ones with that quasi-population representative one. You are comparing each group with one fixed group, nice; but, as your groups are of smaller n now, you've lost some power.

Unfortunately, CTABLES does only all pairwise comparisons, and not each-with-one. But you could still request all pairwise without adjustment for multiple comparisons, and then apply the actual adjustment you need yourself.

P.S. I was speaking of comparing proportions - as an example - but the same logic applies to comparing means.

------------------------------
Kirill Orlov
------------------------------

Original Message
3. RE: Significance tests vs total in custom tables

0 Like
IBM Champion

Jon Peck
Posted Fri April 05, 2024 12:50 PM

Reply
As Kirill points out, the problem with testing against the total is multiple counting. Imagine that there are two groups. The total comparison would effectively be a two-group t test.

You could aggregate the columns with k-1 in one column and the remaining column and do the column means test if one column is of particular interest.

But you might want to think about this differently. Considering averages rather than totals, this is a search for outliers. There are some sig tests for outliers, but these are iffy. Rather, one might look at graphical displays for patterns without sig testing. You might also look for large residuals in a two-way crosstab.

------------------------------
Jon Peck
------------------------------

Original Message
4. RE: Significance tests vs total in custom tables

0 Like
Zhiying Na
Posted 8 days ago

Reply
I know what you mean, but too many non-professionals need significance tests - the differences between "total" and "every category".

I find a way, syntax is :

COMPUTE ttl = 0 .

EXECUTE .

MRSETS /MCGROUP NAME=$test VARIABLES=ttl test LABEL='test' .

CTABLES

/VLABELS VARIABLES=$anal $test DISPLAY=DEFAULT

/TABLE $anal [C][COLPCT.COUNT PCT40.1, TOTALS[COUNT F40.0]] BY $test [C]

/SLABELS POSITION=ROW

/CATEGORIES VARIABLES=$anal EMPTY=INCLUDE TOTAL=YES POSITION=BEFORE LABEL="BASE"

/CATEGORIES VARIABLES=$test EMPTY=INCLUDE

/COMPARETEST TYPE=PROP ALPHA=0.05 ADJUST=NONE ORIGIN=COLUMN INCLUDEMRSETS=YES CATEGORIES=ALLVISIBLE MERGE=NO .

But, the significance tests result has errors. One of the error examples: "Total" has 500 cases, and "success" is 249. "Target Category" has 193 cases, "success" is 107, and "CTABLE" tell me: they have significance difference .

By the way, I have another idea, "Merge File With Cases", and forge a category of "Total", a magical thing happened, and there was no significant difference in the results this time.

So, I am very confused about the testing method for "CTABLES". If possible, I hope you can list out the formulas. If necessary, I can also send you my data.

------------------------------
Zhiying Na
------------------------------

Original Message
5. RE: Significance tests vs total in custom tables

0 Like
Kirill Orlov
Posted Fri April 05, 2024 01:31 PM
Edited by Kirill Orlov Fri April 05, 2024 01:50 PM

Reply
Jon's mentioning departures from the central tendency, and especially chi-square residuals (if we speak of percentage tests) - is quite relevant. Imagine you have a frequency table with 2 rows: the yes-response of interest, and the "other responses" response, and several groups of respondents as the columns. Then the standardized or adjusted resuduals in the cells tell you the departure of each group from the total sample (which percent is the weighted average percent). Here is an answer on the two kind of residuals you can get in the Crosstabs procedure in SPSS: https://stats.stackexchange.com/a/178068/3277

Note that you may do approximate significance inference out of the residual magnitude.

I would append the "resisuals option" as the fourth one to the three I mentioned earlier.

P.S. I am putting an accent on comparing proportions here, though - as I see now - the OP has given a table syntax analyzing specifically means.

------------------------------
Kirill Orlov
------------------------------

Original Message
6. RE: Significance tests vs total in custom tables

0 Like
IBM Champion

Jon Peck
Posted Fri April 05, 2024 04:33 PM

Reply
For means, you could work off of the residuals from a two-way ANOVA.--
Jon K Peck
jkpeck@gmail.com

Original Message

AI and Data Science

Master the art of AI and Data Science.

SPSS Statistics

Significance tests vs total in custom tables

Christine LFri April 05, 2024 10:22 AM

Kirill OrlovFri April 05, 2024 12:15 PM

Jon PeckFri April 05, 2024 12:50 PM

Zhiying Na8 days ago

Kirill OrlovFri April 05, 2024 01:31 PM

Jon PeckFri April 05, 2024 04:33 PM

1. Significance tests vs total in custom tables

2. RE: Significance tests vs total in custom tables

3. RE: Significance tests vs total in custom tables

4. RE: Significance tests vs total in custom tables

5. RE: Significance tests vs total in custom tables

6. RE: Significance tests vs total in custom tables