Sunday, October 18, 2015

Google Ngram Viewer

While considering some unstructured text analysis, the Google Ngram Viewer came to mind. The system "... charts frequencies of any word or short sentence using yearly count of n-grams found in the sources printed between 1800 and 2012 ...". It is based on Google Books or other collections of texts, and works in multiple languages. Addictive for the linguistically inquisitive. We had proposed something for a textual 'content analysis' project where we could have used the idea. It is described in the Wikipedia. Can anyone point me to its use for text analytics? This made me recall that it has been posted on here quite a few times before; see the tag below. See also the book Uncharted: Big Data as a Lens on Human Culture, by Erez Aiden and Jean-Baptiste Michel.
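
For anyone wanting to try the idea on their own texts, here is a minimal sketch of the "yearly count of n-grams" notion in Python. It is not how Google's system is implemented; the toy corpus, the whitespace tokenization, and the function names ngrams and yearly_frequency are all assumptions made up for illustration.

```python
from collections import Counter


def ngrams(text, n):
    """Return the word n-grams in text (lowercased, split on whitespace)."""
    words = text.lower().split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def yearly_frequency(corpus, phrase):
    """Map each year to the share of same-length n-grams equal to phrase."""
    n = len(phrase.split())
    target = phrase.lower()
    result = {}
    for year, texts in corpus.items():
        counts = Counter()
        for text in texts:
            counts.update(ngrams(text, n))
        total = sum(counts.values())
        # Relative frequency, so years with more text are comparable.
        result[year] = counts[target] / total if total else 0.0
    return result


# Hypothetical toy corpus: year -> list of texts printed that year.
corpus = {
    1900: ["the steam engine changed travel", "the steam engine is loud"],
    2000: ["the search engine changed research"],
}

freq = yearly_frequency(corpus, "steam engine")
for year in sorted(freq):
    print(year, freq[year])
```

Plotting the resulting values by year would give a rough analogue of one line in the viewer's chart. A real content analysis would need proper tokenization and a much larger collection of dated texts.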

