Data normalization using robust scaling

13 views (last 30 days)
MByk
MByk on 4 Jun 2024
Commented: Voss on 4 Jun 2024
Hello all, I am trying to implement "Robust Scaling" but I am confused. Should I use "all" argument for "median" and "iqr" functions?
Thanks for the help.
DataSet = readtable('Datasets/Test.csv');
DataSet = table2array(DataSet); % Row:7195 x Colums:22
RScaling = (DataSet - median(DataSet))./iqr(DataSet)

Accepted Answer

Voss
Voss on 4 Jun 2024
If you want to normalize all columns the same way (i.e., using the median and inter-quartile range of the entire data set), then use "all".
If you want to normalize each column separately (i.e., using each column's own median and inter-quartile range), then do not use "all". And in this case, it's best to use the dim argument set to 1, to explicitly say you want the median and iqr by column, in order to properly handle the situation that your data set has only one row.
  4 Comments

Sign in to comment.

More Answers (0)

Categories

Find more on Mathematics in Help Center and File Exchange

Products


Release

R2024a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!