Remove NaN Entries in a Dataset
7 views (last 30 days)
Show older comments
Allen Hammack
on 20 Jan 2023
Commented: Allen Hammack
on 30 Jan 2023
I have a data set with multiple variables (var1, var2, var3). Any of the variables can have a NaN values. If one value for one of the variables is a NaN, I'd like to remove that instance for each variable. I have the following code that does this, but is very slow:
count = 1;
for j = 1:length(var1)
if isnan(var1(j)) == 1 || ...
isnan(var2(j)) == 1 || ...
isnan(var3(j)) == 1
else
var1_cull(count,1) = var1(j);
var2_cull(count,1) = var2(j);
var3_cull(count,1) = var3(j);
count = count + 1;
end
end
How can I modify this routine to have the speed up the data removal?
I've tried using the find() function, but I haven't been able to get it to work when checking to see if each variable is a nan.
Thank you!
Accepted Answer
Vilém Frynta
on 20 Jan 2023
Edited: Vilém Frynta
on 20 Jan 2023
You do not need to go through every element of vector with for loop.
You can use isnan on the vector, which will give you logical vector (0s and 1s), which you can then use on all the other vectors. You can also combine logical vectors together.
I will try my best to demonstrate:
% Random vars for demonstration
var1 = [1, 2, 3, NaN, 5, 6];
var2 = [9, NaN, 7, 6, 5, 4];
% 1 = NaN values, we will invert this later (~)
idx1 = isnan(var1);
idx2 = isnan(var2)
% Combine logical vectors
idx = idx1 | idx2
% Apply logical vector to your variables
var1(~idx)
var2(~idx)
0 Comments
More Answers (0)
See Also
Categories
Find more on Resizing and Reshaping Matrices in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!