Tag Archives: lexicography

European and Non-European English Etymons

This weekend I’ve been poking around in OED3’s etymologies, and it occurred to me that an interesting thing might be lay out all English words according to when they are first attested, and what language they come from. This morning I made a bunch of graphs, below. Before having a look, it’s worth mentioning that […]

Lurking Impactfuls

Today on the NPR I heard someone say, “even more impactfully” [link].  Knowing that anything to do with “impact” is peever-bait [of the “only teeth can be impacted” variety–see BBC Magazine: “Should “impact” ever be used as a verb?“], I was surprised to find that “impactfully” occurs unselfconsciously about once a day on Twitter, and […]

The Lifespan of Words (three ways)

Getting ready for DH2017 this morning, I found myself curious about the lifespan of English words–when they come into the language and when they fall out. So I got all the earliest and latest attestation dates for all the words in OED3, and plotted them out. Here are three graphs (“visualizations,” if you like), all […]

How Indigenous American words came into English

I’ve been deep in the OED documentation of borrowings and loanwords for my look at “tramlines” [see my previous post, and look out for a few more to come] and OED’s treatment of foreign, about to be naturalized, and naturalized words. I got curious about some of the Indigenous American words in my dataset, and […]

||-Tripping over tramlines-||

“Tramlines”, icydk, are those upright parallel bars that OED1 and OED2 editors used to indicate that a word was “alien or not fully naturalized”. So, for instance, zeitgeist you may recognize as a word of German origin, not infrequently heard in English. In OED1 (1928) it appeared as ||Zeitgeist, and this mark was preserved on […]

Guest Post: Cataloguing the Catalogue

Cosmin Dszurdsza is a research assistant at The Life of Words. In my last guest post I discussed problematic magazine classifications. Now, once again, a periodical publication proves to be an exciting and difficult genre identification challenge. The kind of text I will be dealing with today is the “catalogue” (filtered out of our data […]

Three conferences this summer

After a baby-related travelling hiatus of a couple three years, TLOW is hitting the road this summer, with stops at Ryerson University in Toronto (just barely down the road, really) at the end of May, for the Canadian Society for Digital Humanities meeting at CFHSS Congress; then off to Barbados and the University of the […]

Shakespeare’s Earliest Citations in the OED

No author’s representation in the OED has received more comment than Shakespeare’s: if you ever come across a mention of OED citation evidence, more than likely it’s being used to substantiate (sometimes challenge or qualify) a claim that Shakespeare invented the most English words, or made up the most new meanings for existing words, or […]

OED Subject Matter

In my last post I described using HathiTrust’s Solr Proxy API to fetch Hathi genre metadata for OED quotations. But genre is not the only metadata that Hathi sends back down the intertubes when I ask it a question. For most works, I also get a Library of Congress Classification code for the volume. This […]

Hathi Genre Again – Zero Recall

In “Hathi’s Automatic Genre Classifier” [17.01.06] I compared the consolidated automatic genre metadata for a subset of HathiTrust Digital Library texts (available here) to the genre classifications arrived at for human-inspected works as part of the OED quotation tagging project under-way at The Life of Words. My process there was pretty closely supervised, but the […]