Tag Archives: corpora

“Juvescence” and other poetical “Errors”

This morning on the Twitter came this from @nemoloris: OED says “juvescence” is “irregular”, not “erroneous”, but (notorious TSE fan) Robert Burchfield himself called it a malformation (in his Eliot memorial lectures, I believe). Eliot’s defensive letter, sourced by @rngould, is worth keeping in mind: irregular needn’t be erroneous, and sometimes poets are looking to […]

Englishing Non-European Words

My last post focussed on words that are formed within English from other English words with non-English origins. I mostly concentrated on European donor languages, because they make up the overwhelming majority, and show the most variation. But English Englishes wherever it goes, and non-European languages have contributed plenty of English words over the years. […]

The What and When of English’s Englishing

In my previous post, I used OED3’s etymologies to chart the languages that gave English its words, noting that most English words come from other English words.  I then dug deep into all the non-English sources of English. Today I’ll take a closer look at the etymological sources of English words developed within English. Lexical […]

European and Non-European English Etymons

This weekend I’ve been poking around in OED3’s etymologies, and it occurred to me that an interesting thing might be lay out all English words according to when they are first attested, and what language they come from. This morning I made a bunch of graphs, below. Before having a look, it’s worth mentioning that […]

Lurking Impactfuls

Today on the NPR I heard someone say, “even more impactfully” [link].  Knowing that anything to do with “impact” is peever-bait [of the “only teeth can be impacted” variety–see BBC Magazine: “Should “impact” ever be used as a verb?“], I was surprised to find that “impactfully” occurs unselfconsciously about once a day on Twitter, and […]

The Lifespan of Words (three ways)

Getting ready for DH2017 this morning, I found myself curious about the lifespan of English words–when they come into the language and when they fall out. So I got all the earliest and latest attestation dates for all the words in OED3, and plotted them out. Here are three graphs (“visualizations,” if you like), all […]

Three conferences this summer

After a baby-related travelling hiatus of a couple three years, TLOW is hitting the road this summer, with stops at Ryerson University in Toronto (just barely down the road, really) at the end of May, for the Canadian Society for Digital Humanities meeting at CFHSS Congress; then off to Barbados and the University of the […]

One last round with metadata from Hathi and Underwood

In “Hathi’s Automatic Genre Classifier” and “Hathi Genre Again – Zero Recall“, I ran a couple of experiments comparing genre categories assigned by human taggers working on the Life of Words OED mark-up project to two sources of genre metadata associated with the HathiTrust Digital Library. The first post looked at data from the automatic […]

Shakespeare’s Earliest Citations in the OED

No author’s representation in the OED has received more comment than Shakespeare’s: if you ever come across a mention of OED citation evidence, more than likely it’s being used to substantiate (sometimes challenge or qualify) a claim that Shakespeare invented the most English words, or made up the most new meanings for existing words, or […]

OED Subject Matter

In my last post I described using HathiTrust’s Solr Proxy API to fetch Hathi genre metadata for OED quotations. But genre is not the only metadata that Hathi sends back down the intertubes when I ask it a question. For most works, I also get a Library of Congress Classification code for the volume. This […]