Removing outliers from a matrix

Seyed Navid Shoaiby on 11 Oct 2022
Edited: Bjorn Gustavsson on 11 Oct 2022
I removed the outliers from each column of my feature matrix, but now my feature vectors have different lengths. I cannot simply trim them to the same length, because I would lose the data from a certain range. If I shuffle them, the correlation between my features and my outcome becomes messy. How can I remove the outliers without running into these problems? My matrix of features and outcomes is 55000-by-14; the last column holds the outcomes.
  1 Comment
Davide Masiello on 11 Oct 2022
Maybe, instead of removing the outliers, you could replace them with an interpolated value.
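A minimal sketch of that suggestion, assuming the data sit in a matrix called `data` (a hypothetical name, filled here with synthetic values) whose last column is the outcome, and that the default median-based detection of `filloutliers` is acceptable. Linear interpolation between neighbouring rows is only meaningful if the rows have a natural order, such as time:

```matlab
% Synthetic stand-in for the real matrix (assumption: last column is the outcome).
rng(0);
data = randn(1000, 14);
data(10, 3) = 50;          % plant an obvious outlier in feature column 3

X = data(:, 1:end-1);      % feature columns
y = data(:, end);          % outcome column, left untouched

% Replace outliers column by column with linearly interpolated values.
% For a matrix, filloutliers treats each column separately by default.
Xfilled = filloutliers(X, 'linear');

cleaned = [Xfilled, y];    % same size as data, so rows stay aligned
```

Because every column keeps its original length, each feature row still lines up with its outcome.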


Answers (1)

Bjorn Gustavsson on 11 Oct 2022
Edited: Bjorn Gustavsson on 11 Oct 2022
Quick-and-dirty answer: for any single pair of columns, the best you can do is to use only the rows where neither value is an outlier. If you replace each outlier with NaN, the cov function handles this for you through the options 'omitrows' or 'partialrows'. The corrcoef function has a slightly different interface: it takes a parameter-value pair, 'Rows', to set the corresponding behaviour; see the help and documentation for a description.
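As a rough sketch of this approach, assuming synthetic data in a hypothetical matrix `data` with the outcome in the last column, and using `isoutlier` with its default settings to decide what counts as an outlier (the original post does not say which criterion was used):

```matlab
% Synthetic stand-in for the feature/outcome matrix (assumption: last column is the outcome).
rng(0);
data = randn(1000, 14);
data(10, 3) = 50;                 % plant an outlier in column 3
data(20, 7) = -40;                % and another in column 7

X = data(:, 1:end-1);
mask = isoutlier(X);              % logical matrix, outliers flagged per column
X(mask) = NaN;                    % keep the row, blank only the offending entry
marked = [X, data(:, end)];       % still 1000-by-14, rows stay aligned

% Covariance: 'partialrows' uses, for each pair of columns, the rows where
% both entries are non-NaN; 'omitrows' drops every row containing any NaN.
Cpair = cov(marked, 'partialrows');
Call  = cov(marked, 'omitrows');

% Correlation: the analogous choices go through the 'Rows' name-value pair.
Rpair = corrcoef(marked, 'Rows', 'pairwise');
Rall  = corrcoef(marked, 'Rows', 'complete');
```

The pairwise variants keep the most data per column pair, but the resulting matrix is not guaranteed to be positive semidefinite, which may matter for later processing.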
HTH
