Unsupervised clustering of categorical data
11 views (last 30 days)
Show older comments
Answered: Pratyush Roy on 1 Dec 2021
I wanted to cluster a time serie dataset which has 30 timepoints and more than 50'000 rows. The dataset is categorical (from 1 to 6) which represent different categories.
The problem with my current clustergram method using the euclidian distance metrics, is that it will cluster the category 5 closer to 6. I don't want that, those categories are not somehow related. How is it possible to remove this bias in the clustering?
Hope my question is clear, thanks for your further help!
Pratyush Roy on 1 Dec 2021
The link here might be helpful for clustering categorical or non-numeric data.
Hope this helps!
Find more on Descriptive Statistics and Visualization in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!Start Hunting!