A hapax legomenon (often abbreviated to hapax) is a word which occurs only once in a document. In the fields of computational linguistics and natural language processing, it is common to disregard hapax legomena (and sometimes other infrequent words), as they are likely to have little value for computational techniques. This disregard has the added benefit of significantly reducing the memory use of an application. Define a function that given the file name of a text will return all its hapaxes. Make sure your program ignores capitalization.
Already registered? Login
Not Account? Sign up
Enter your email address to reset your password
Back to Login? Click here