Your data is already in categories which means you don't have to use clustering algorithms though you can re-categorize your data using DBSCAN and get noise points from that analysis, the more noise points, the more anomalies or outliners you can see in your data.
I can't be certain about "distance measurement" for DBSCAN because I don't have much idea about your data, is your categories based on your numerical scope then you might be able to tweak it for results.
------------------------------
Muhammad Tehseen
------------------------------
Original Message:
Sent: Wed March 04, 2020 11:23 PM
From: Zhi Guang Tan
Subject: DBSCAN on categorical data
Hi,
I trying to find anomalies from my Data and I have read that DBSCAN is one of the best clustering algorithm for anomalies detection. My data consists of categorical data, does distance measurement still works?? How should I go about doing it?
It would be appreciated if anyone can share their knowledge with me. Thank you in advance!!!
#GlobalAIandDataScience
#GlobalDataScience