[I posted this in 2014, but since so much of the EEBO TCP database came into the public domain in January 2015 I thought it worth updating.]
For those who come to this blog from academia, this is probably a post you can skip, but for people in other walks of life I thought it might be worth submitting a short piece on some of the basic tools of the trade.
When I first started researching the early modern period, in the 1970s, I spent nearly all my time in the Rare Books Room at Cambridge University Library with several ancient tomes in front of me, something like this:
The room has been completely redesigned since my postgraduate days, but it’s the same basic process. These days, I still do a lot of my real research in the same room, but a lot of the time I’m working on one of the computers in a glass-partitioned area at the back of the room. This is because much of the corpus of early modern books in English (which is what I mostly work on) is available online at Early English Books Online (EEBO). Yes, you need a password and log-in before you can actually access the database, and subscription is through institutions, not issued on an individual basis, which effectively locks the average person out, but there is some good news for Joe public, which I’ll come to later. Basically, what subscribers get from EEBO is a PDF image of the original text. When it first started going online (in phases, during the 1990s) it was a radical improvement on microfilm, which was fiddly to use, gave you headache and came a poor second to having the actual book in your hand. EEBO PDFs can be viewed page by page online or downloaded as a single PDF file, making it an acceptable – even, sometimes, a preferred – alternative to reading the actual printed book.Now, though, even that’s been superseded. EEBO PDFs, with all their advantages, were not text-searchable, meaning you had to – gasp! – read stuff to know whether it was relevant to what you were researching or not. The Early English Books Online Text Creation Partnership changed all that. Until January 2015, it too was only accessible via subscription, but these days some 25,000 texts are in the public domain. Yes, you read that right – 25,000! Go to one of the search pages (I usually use the Boolean search) and enter some search terms. For example, if you search for Shakespeare or Shaksper and plays you get these results. Even without logging in you can find out that (at the last count) there are 75 publications prior to 1700 which contain these terms. If you can log in, you’d be able to read the exact pages on which the search terms occur, as well as being able to search for other terms in any of the 75 search results. Even if you can’t log in you will still be able to see the full text of the public domain items and search for other lexical terms within those texts. When you realize that you can do this for any search terms you can envisage you see that this is a very powerful research tool. Something which would have taken a lifetime of research a couple of generations ago can now be clinched with a few mouse-clicks. For example, one can do a proximity search to find out something like this:
A search of EEBO TCP indicates that (including variant spellings) cruel/cruelty is closely collocated with unjust/injustice or iniquity – normally conveying the idea that an act is cruel if the intention behind it is unjust – in only about 80 texts during the whole of the sixteenth century. However, during the seventeenth century, there are over 1,500 such collocations, more than a third of which were published between 1680 and 1700. (Pain, Pleasure and Perversity, introduction, page 14; you can download the complete introduction here).
Did you find this article informative or useful? Post a comment or vote on it!
Further reading: Heather Froehlich, Richard J. Whitt and Jonathan Hope, ‘EEBO-TCP as a Tool for Integrating Teaching and Research’.