Solutions to Lab 5
------------------
1&2: Here is the three-line sed script:
s/
/\n/g
s/<[^>]*>//g
/The Tragedy/,$!d
3: Count misspelled words using:
sed -f othello.html | spell | wc -l
4: The 'tr' command transliterates, i.e., it replaces characters by other
specified characters. In its current usage, 'tr' replaces all characters
not A-Z or a-z with a new line, so that each word comes on one line. 'sort'
then sorts them alphabetically. 'uniq' expects sorted input and removes
duplicate lines (i.e., words). Then we sort again, but this time first from
most to least frequent, then alphabetically.
This approach is not perfect, but you get the basic idea.