How do I get the program to display the most and least Freq used words? Tells me the # times used but I dont know how to get it to display the words. PLease Help!

1 view (last 30 days)
a =fileread('GreatExpectations.txt');
w = regexpi(a, ' ', 'split');
v = unique(w);
n = length(v);
counts = zeros(n,1);
for i=1:n
counts(i) = sum(strcmp(w, v{i}));
end
most_frequent =max(counts)
for i=1:n
counts(i) = sum(strcmp(w, v{i}));
end
least_frequent= min(counts)

Accepted Answer

Star Strider
Star Strider on 25 Apr 2015
The easiest way to do this is with accumarray:
a = fileread('GreatExpectations.txt');
w = regexpi(a, ' ', 'split');
[v,~,ic] = unique(w);
counts = accumarray(ic, 1);
imx = find(counts == max(counts));
imn = find(counts == min(counts));
fprintf(1,'\n\tMost frequent: “%s” appears %d times\n', v{imx}, counts(imx))
fprintf(1,'\n\tThere are %d that only appear one time\n\n', length(imn))
I used find rather than the second output of max and min because they only return the first index of the occurrence of the maximum or minimun while find returns them all.
The code produces:
Most frequent: “the” appears 25 times
There are 188 that only appear one time

More Answers (1)

Ahmet Cecen
Ahmet Cecen on 25 Apr 2015
add to the end:
v{counts==most_frequent}
v{counts==least_frequent}

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!