Frequencies of Letters vs. First Letters
Show that in an English dictionary the most frequent characters at the beginning of words do not coincide with the most frequent characters globally.
Get a list of common English words from WordList.
In[1]:=
data:image/s3,"s3://crabby-images/d7c3d/d7c3daff75a64edb7395cdb7d945a908f3df18b9" alt="Click for copyable input"
Length[words = WordList[]]
Out[1]=
data:image/s3,"s3://crabby-images/800bc/800bc775beff6a4bafdb18e147f7a9b6fc016a6d" alt=""
Take the first letter of each word.
In[2]:=
data:image/s3,"s3://crabby-images/5e3ef/5e3ef7a47a56da9eb193ae89499e0508f61a9253" alt="Click for copyable input"
firstchars = StringTake[words, 1];
Count the number of words starting with each of these letters.
In[3]:=
data:image/s3,"s3://crabby-images/f274d/f274d8e4cf270a78c5d8b48709c861d25dbd2552" alt="Click for copyable input"
Counts[firstchars]
Out[3]=
data:image/s3,"s3://crabby-images/17df2/17df2f2b135e286bda83914571042524cc5afa71" alt=""
Visualize their relative predominance by generating a WordCloud. The most frequent first letters are the consonants s, c, p, d.
In[4]:=
data:image/s3,"s3://crabby-images/d68ff/d68ff9eda6fb2e348312024b2c6bc0e87b5eec52" alt="Click for copyable input"
WordCloud[firstchars]
Out[4]=
data:image/s3,"s3://crabby-images/8ded5/8ded532214f2332bf3934cc3a400fb595154f1eb" alt=""
Compute the relative predominance of all the letters in all words by using LetterCounts.
In[5]:=
data:image/s3,"s3://crabby-images/5dc54/5dc54b2990f6efa1efb15cb87a76bc0026b6cf74" alt="Click for copyable input"
allchars = LetterCounts[StringJoin[words], IgnoreCase -> True]
Out[5]=
data:image/s3,"s3://crabby-images/6674e/6674e45b488c38042e107d922e03311806084cb4" alt=""
Now the most frequent letters are the vowels e, i, a.
In[6]:=
data:image/s3,"s3://crabby-images/6bd04/6bd04b79993753485925d83d562b8e38d6de516d" alt="Click for copyable input"
WordCloud[allchars]
Out[6]=
data:image/s3,"s3://crabby-images/15bab/15bab87ffeedb670893fe183afd739fd5cd0fbd6" alt=""