Automatic identity recognition from text outputs
Johnny, Winnie, Manny, and Cathy use their
one-dimensional keyboard (as described in the handout) to
type the entire Biola
vision statement one word at a time and because of typos in the process we
have a collection of 8 text files A,
B, C, D, E, F, G and H. Each of these eight documents
was typed by one of Johnny,
Winnie, Manny, and Cathy. Your task is
to determine for each document d the most likely person who has
generated the document d when trying to type the Biola vision
statement.
·
For each of these 8 documents, who do
you think is the most likely author? How do you know?
·
Is there a systematic way for the
computer to conduct automatic identity recognition from text outputs?