This paper proposes an iterative hybrid of available methods in a quest for a more flexible and robust machine-assisted content analysis system.
wf.sh is frequently used in introductory CompSci courses to get students to rethink how they imagine data. The classic puzzle is stated as: “Given a text file and an integer K, you are to print the K most common words in the file (and the number of their occurrences) [...]”
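One way to approach the puzzle as stated can be sketched in Python. This is an illustrative sketch, not the contents of wf.sh: the function name `top_k_words`, the definition of a "word" as a case-insensitive run of ASCII letters, and the alphabetical tie-breaking rule are all assumptions made for the example.

```python
import re
from collections import Counter


def top_k_words(text, k):
    """Return the k most common words in text as (word, count) pairs."""
    # Assumption: a word is a maximal run of ASCII letters, compared
    # case-insensitively. The puzzle statement does not fix this.
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(words)
    # Sort by descending count, then alphabetically, so that ties are
    # resolved the same way on every run.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:k]
```

To apply it to a file, read the file contents into a string and pass that string together with K, for example `top_k_words(open(path).read(), 10)`, where `path` is a placeholder for the input file.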
A longitudinal study of keyword frequencies in Section A of the New York Times between 2001 and 2008 supported the hypothesized typologies of catastrophic myths. Patterns of occurrence are consistent between natural and man-made disasters, and suggest a relationship between U.S. political elections and modern myth-creation that is worthy of further investigation.