What does sumd method in k-means clustering function exactly calculate?

Question

Onur Kapucu on 8 May 2018

0
Link

Direct link to this question

https://au.mathworks.com/matlabcentral/answers/399776-what-does-sumd-method-in-k-means-clustering-function-exactly-calculate

Commented: Onur Kapucu on 8 May 2018

I am doing basic experiments with kmeans function. As a real simple example, say that I have a data set of 4 items with 1 attribute and this attribute is their value:

Data=[1;2;3;4];

If I want to split this data set into 2 clusters I should get one centroid in 1.5 and another in 3.5:

[idx,C,sumd]=kmeans(Data,2)
C =     
1.5000
3.5000

and I get it. However to my understanding sumd in this case should be:

abs(1-1.5)+abs(2-1.5) or  abs(3-3.5)+abs(4-3.5)
ans =
       1

but I am getting sumd as:

sumd =
      0.5000
      0.5000

for both clusters. Instead of getting 1's for both.

My question is what exactly does sumd calculate?

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

Ameer Hamza on 8 May 2018

1
Link

Direct link to this answer

https://au.mathworks.com/matlabcentral/answers/399776-what-does-sumd-method-in-k-means-clustering-function-exactly-calculate#answer_319322

Edited: Ameer Hamza on 8 May 2018

Open in MATLAB Online

If you look at the documentation of kmeans(), you will know that it uses the square of the Euclidean distance, by default. So you should calculate it like this

abs(1-1.5).^2+abs(2-1.5).^2 or  abs(3-3.5).^2+abs(4-3.5).^2
ans = 
  0.5 (both cases)

1 Comment
Show -1 older commentsHide -1 older comments

Onur Kapucu on 8 May 2018

Thanks

Sign in to comment.

Answer 2

the cyclist on 8 May 2018

1
Link

Direct link to this answer

https://au.mathworks.com/matlabcentral/answers/399776-what-does-sumd-method-in-k-means-clustering-function-exactly-calculate#answer_319323

It's because the default distance metric used is the squared Euclidean distance (for minimization, and reporting). See the Distance input parameter.

1 Comment
Show -1 older commentsHide -1 older comments

Onur Kapucu on 8 May 2018

Thanks

Sign in to comment.

What does sumd method in k-means clustering function exactly calculate?

0 Comments
Show -2 older commentsHide -2 older comments

Accepted Answer

1 Comment
Show -1 older commentsHide -1 older comments

More Answers (1)

1 Comment
Show -1 older commentsHide -1 older comments

See Also

Categories

Tags

Community Treasure Hunt

What does sumd method in k-means clustering function exactly calculate?

0 Comments Show -2 older commentsHide -2 older comments

Accepted Answer

1 Comment Show -1 older commentsHide -1 older comments

More Answers (1)

1 Comment Show -1 older commentsHide -1 older comments

See Also

Categories

Tags

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

1 Comment
Show -1 older commentsHide -1 older comments

1 Comment
Show -1 older commentsHide -1 older comments