Difference between individual and cumulative oobMargin of TreeBagger

8 views (last 30 days)
Why aren't the following two plots the same?
b = TreeBagger(500,X,Y,'oobpred','on');
mc = oobMargin(b,'mode','cumulative');
mi = oobMargin(b,'mode','individual');
figure; plot(mc.')
figure; plot(bsxfun(@rdivide,cumsum(mi.'),(1:500).'))

Accepted Answer

Ilya
Ilya on 17 Jun 2011
When you ask for an OOB margin from one tree, you get zero if this observation was in bag for this tree. The margin is undefined in this case, and TreeBagger returns 0 by default. The cumulative calculation averages over trees for which this observation was out of bag only. Check this out:
>> load fisheriris
>> b = TreeBagger(10,meas,species,'oobpred','on');
>> mi = oobMargin(b,'mode','individual');
>> mi(1,:)
ans = 1 0 0 0 1 0 1 1 0 0
>> b.OOBIndices(1,:)
ans = 1 0 0 0 1 0 1 1 0 0
>> mc = oobMargin(b,'mode','cumulative');
>> mc(1,:)
ans = 1 1 1 1 1 1 1 1 1 1

More Answers (1)

K
K on 21 Jun 2011
Code using the individual mode that produces the same plot as the cumulative mode is the following.
load ionosphere
b = TreeBagger(500,X,Y,'oobpred','on');
mc = oobMargin(b,'mode','cumulative');
mi = oobMargin(b,'mode','individual');
figure; plot(mc.')
% figure; plot(bsxfun(@rdivide,cumsum(mi.'),(1:500).'))
cumavg = zeros(size(mc));
cumavg(:,1) = mi(:,1);
for ii = 1:size(mc,1)
for jj = 2:size(mc,2)
if sum(b.OOBIndices(ii,1:jj)) == 0
cumavg(ii,jj) = mi(ii,1);
else
micurrent = mi(ii,1:jj);
cumavg(ii,jj) = mean(micurrent(b.OOBIndices(ii,1:jj)));
end
end
end
figure; plot(cumavg.')

Tags

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!