A follow-up to the ngram.sh post with data sources for Wikipedia and Project Gutenberg ngrams. The ngram.sh script can easily be modified to extract keywords from these databases, and many others.
A follow-up to the ngram.sh post with data sources for Wikipedia and Project Gutenberg ngrams. The ngram.sh script can easily be modified to extract keywords from these databases, and many others.
The Google Ngram Viewer is a database browser used to chart the relative frequency of words or phrases. The data source is the Google Books database and the graphic engine is Google Charts. It’s cool. It’s pretty. It doesn’t easily give up the raw data. This script helps.