prosody and dr. syntax, 1832

There is an interesting series of articles in the new york times on the benefits and dangers of using large-scale corpora and statistical methods in the analysis of literary and other texts in the humanities. The first discusses some projects that are part of the digging-into-data challenge. The second article illustrates what race horses with conspicuous names can teach us about the pitfalls of the new windfall of data (hat-tip to Kate McCurdy).