Word Frequency

2015-06-28 11:57

Write a bash script to calculate the frequency of each word in a text file words.txt.

For simplicity's sake, you may assume:

words.txt contains only lowercase characters and space ' ' characters.
Each word must consist of lowercase characters only.
Words are separated by one or more whitespace characters.

For example, assume that words.txt has the following content:

the day is sunny the the
the sunny is is

Your script should output the following, sorted by descending frequency:
the 4
is 3
sunny 2
day 1

Problem: given a file, output each word together with the number of times it occurs.

Approach: use awk, whose arrays are associative (key-value), with the word as the key and its count as the value.

awk '{for (i = 1; i <= NF; i++) words[$i]++} END {for (word in words) print word, words[word]}' words.txt | sort -k2,2nr
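For comparison, the same counting can also be sketched without awk arrays, using the usual tr / sort / uniq pipeline. This is only an alternative sketch, not the approach of the original post; the function name word_freq is hypothetical and chosen purely for illustration, and ties between equally frequent words are broken alphabetically, which the problem statement does not require.

```shell
#!/usr/bin/env bash
# word_freq FILE: print "word count" lines, most frequent first.
# NOTE: word_freq is a hypothetical helper name, not part of the original post.
word_freq() {
  # tr -s ' ' '\n'  : turn spaces into newlines and squeeze repeated newlines,
  #                   leaving one word per line
  # grep -v '^$'    : drop the empty line that leading whitespace would leave
  # sort | uniq -c  : group identical words and prefix each with its count
  # sort -k1,1nr -k2,2 : highest count first, ties in alphabetical order
  # awk             : swap the columns to get "word count"
  tr -s ' ' '\n' < "$1" \
    | grep -v '^$' \
    | sort \
    | uniq -c \
    | sort -k1,1nr -k2,2 \
    | awk '{print $2, $1}'
}
```

Calling word_freq words.txt on the sample file above should print the same four lines as the expected output. The awk version keeps all counts in memory in one pass, while this pipeline relies on sort to bring identical words together before counting.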