"
I'm wondering if there is any reason to keep a feature from a dataset in order to perform prediction even if it doesn't have a significant correlation to the target?"There might be; it's hard to say, given this level of information. Personally (and assuming that all assumptions for such analyses are met and/or dealt with), if I am looking at some kind of regression model, I am looking not at the Pearson correlation but at the part and partial correlations. I encourage you to take a look at some sources for those if you are unfamiliar with them. For example:
https://www.statisticssolutions.com/what-are-zero-order-partial-and-part-correlationsYou can obtain these in the REGRESSION procedure of SPSS Statistics by requesting /STATISTICS ZPP (zero, part, and partial correlations)
Also, consider in your model the role of variable; whether a mediating or moderating variable, for example. See sources like
https://www.statisticshowto.com/mediator-variable/Apart from these, sometimes it's interesting to know what variables (that I thought would be) are NOT related in the way I thought they were, at least in my data sample.
Rick M
------------------------------
Rick Marcantonio Quality Assurance
Quality Assurance
IBM
IL
------------------------------
Original Message:
Sent: Mon November 02, 2020 07:19 AM
From: Marco Aurelio Sánchez Sorondo
Subject: Is low correlation (with a target) enough to dismiss a feature from a baseline model?
I'm wondering if there is any reason to keep a feature from a dataset in order to perform prediction even if it doesn't have a significant correlation to the target.
Has anyone been able to take advantage of such a feature?
Thanks
------------------------------
[Marco] [Sánchez Sorondo]
[UBA]
------------------------------
#GlobalAIandDataScience
#GlobalDataScience