The Shire and Mirkwood compared to random text grabs.

From earlier this week: The Shire text uses 11,119 words, of which 1,484 do not appear in Mirkwood, this is counting every word used – “yes” counts as six words.  That’s 13.3% Shire words.

What we learned today: The Shire text compared to a random word grab of the same sample size – 1,339 Shire words do not match my random text.  That is basically indistinguishable from the Mirkwood difference.  Hmm, fascinating!  Yet most of our Lexos graphs which show both regions paint them as very different from one another at the word level.  Hold on…

Oho!  the Mirkwood text has more words – 16,400 – and only 1,265 are different from a random grab of 16,400 words in the whole novel.  That’s 7.7%.  Very different, my friends!

Let’s clean that up a bit:

  • Shire text: 11,119 words
  • Shire words not appearing in Mirkwood: 13.3%
  • Shire words not appearing in Random text: 12%
  • Mirkwood text: 16,400 words
  • Mirkwood words not appearing in the Shire text: 14.6
  • Mirkwood words not appearing in Random text: 7.7%

Well, well, well.  time to poke at Mirkwood a bit, friends.  Also, it’s time to use the newly-discovered Lexos feature “how many of these words are unique”!  See you soon!

 

 

Thank you, Tech Support

My very dear Tech Support has added a new tool to the Digital Humanities Toolkit (which is also linked on our About page).  It is random-choice.py and it will grab your choice of a number of words from a given text file as randomly as a computer can grab and present them on your Terminal window along with the ordinal number of that word in the text.  It will grab numbers as though they are words, but it will not grab things inside of square brackets (like our paragraph references) or double-x (like our phrase separator).  It’s a short little bit of code, so I simply copy/pasted it from github to a text file and named it random-choice.py.  Seems to have worked.

Thank you to Daroc Alden, who always has the time to write a little script for their Mama, even during mid-terms.

Then they came to lands where people spoke strangely

I am following a little rabbit-trail, Word Fans, about dialogue and narration in the Shire.  What are the characteristics of these bits which distinguish it from all the other bits?  Won’t this be fun!

[02.028] At first they had passed through hobbit-lands, a wide respectable country inhabited by decent folk, with good roads, an inn or two, and now and then a dwarf or a farmer ambling by on business. Then they came to lands where people spoke strangely, and sang songs Bilbo had never heard before.

It would be luxurious to include all the prose about the Shire as well, but my current project has made me stare at a deadline and hmph at it.  For our purposes, then, I am counting “In the Shire” as from [01.001] to [02.028], up to but not including the words in the title of this post, plus [19.028] to the end, [19.048], inclusive.

To pass on a tantalizing bit of my thought, I’m calling “In Mirkwood” from [07.154] through [09.069], inclusive.

The plan is to use the Mirkwood text as the stopwords to look at the Shire text and vice-versa…  I wonder if I need to do this for all regions and chart their differences from the Shire?  I may have to.  If I don’t come up for air in a few days, please send chocolate.

The Shire text uses 11,119 words, of which 1484 do not appear in Mirkwood, this is counting every word used – “yes” counts as six words.  That’s 13.3% Shire words.  There are 562 words used in the Shire which are not used anywhere else in the book – 5%.  And yes, I see the logical error there and am going to – soon! – compare the Shire Text with a similarly sized sample.  If I’m lucky, Tech Support can create a “grab a random sample of text from here of size N” script.

The Mirkwood text uses 16,400 words, of which 2,400 do not appear in the Shire, and variations on “spider” account for about 60 of these.  14.6% . Nearly identical.  I do find it odd that the Mirkwood text numbers come out on an even “400” – I will chase that for a while with your indulgence, Word Fans.

A Secret Vice

Today I am enjoying the Fimi & Higgins edition of Tolkien’s A Secret Vice. Thrilling to hear in the professor’s own words his thoughts on sound-play.

For us departed are the unsophisticated days, when even Homer could pervert a word to suit sound-music; or such merry freedom as one sees in the Kalevala, when a line can be adorned by words phonetic trills – as in enkä lähe Inkerelle, Penkerelle, pänkerelle (Kal. xi 55), or Ihveniä ahvenia, tuimenia, taimenia (Kal. xlviii 100), where pänkerelle, ihveniä, taimenia are ‘non-significant’, mere notes in a phonetic tune struck to harmonize with penkerelle, or tuimenia which do ‘mean’ something.

Tolkien, J. R. R.. A Secret Vice: Tolkien on Invented Languages, edited by Dimitra Fimi and Andrew Higgins. (Kindle Locations 1347-1352). HarperCollins Publishers. Kindle Edition.

What shall we do with Mountain-king?

In my mission to identify which hyphenated words are Tolkien original compositions, I have use the Oxford English Dictionary’s word on whether something like “Moss-green” is only ever found as “mossgreen” or “moss green” and if the hyphenated form is not attested, I’ve given it the “JRRT” tag.

Further, if the hyphenated form is found in OED, but the only example is from Tolkien’s work, I’m giving him credit for putting together this form as his own intentional style.

Mountain-king“,   however, has three examples, one of which is Tolkien’s and one of which comes earlier.

I would love to hear from you, Word Fans!  This is the type of art-work that has crept into what I thought would be the cut-and-dry list-making of this project.

Thanks for your notes, Word Fans – I have reached clarity.  Since the other examples of “Mountain king” do not have the hyphen (unbehyphenated?), I am giving JRRT credit for an original-ish spelling.

Forest compounds

Tolkien uses nine “forest-” compound words, of which three are attested in OED – forest-floor, forest-path, and forest-road.

OED has  61 “forest-” compound words, by the way:

forest-bred
forest-brother
forest-brown
forest-clad
forest-court
forest-craft
forest-crowned
forest-deep
forest-dove
forest-dweller
forest-dwelling
forest-feller
forest-felling
forest fire
forest-floor
forest-fly
forest-folk
forest-frowning
forest-fruit
forest-glade
forest-green
forest-hearse
forest-house
forest-kangaroo
forest-land
forest-lawn
forest-laws
forest-leaf
forest-life
forest-like
forest-lodge
forest-lord
forest marble
forest-matter
forest-nymph
forest-oak
forest-path
forest pathology
forest-peat
forest-pony
forest red gum
forest reserve
forest-ridge
forest-rights
forest-road
forest-rustling
forest-school
forest-shade
forest-sheriff
forest shrew
forest-side
forest-skirt
forest-sport
forest-steading
forest-stone
forest stream
forest-top
forest-tree
forest-walk
forest-wards
forest-wood

“forest, n.” OED Online, Oxford University Press, June 2017, http://www.oed.com/view/Entry/73187. Accessed 13 September 2017.

Elf compounds

In comparing the hyphenated words, I have reached the elf compounds.  OED attests all of the words below.  Only “elf-fire” and “elf-friend” overlap with the elf compounds of The Hobbit!

I am particularly intrigued by words of elven persons.  OED has the compound with folk, girl, kingdom, lady, queen, and woman, while The Hobbit has guard, host, king, lord, maiden, and prince.

Now… you know me, Word Fans.  I dug a little deeper.  “Elven” is a noun, obviously, meaning a female elf, like fox/fixin and monk/minchin.  In its second meaning, however, it is a combining appositive or attributive form:

 2. Comb. (referring to a kind of imaginary being in the works of J. R. R. Tolkien).

and Elf-king is attested therein.

To be thorough, “elvish” is the OED’s adjective for elf, also spelled “elfish”.  Not “elven”.  That’s pure JRRT.

elf-arrow
elf-bolt
elf-bore
elf-castle
elf-child
elf-craft
elf-cup
elf-dance
elf-dart
elf-dock
elf-fire – found in The Hobbit
elf-flame
elf-flower
elf-folk
elf-friend – found in The Hobbit
elf-girl
elf-god
elf-horn
elf-house
elf-key
elf-kingdom
elf-knight
elf-knot
elf-lady
elf-land
elf-light
elf-like
elf-lock
elf-queen
elf-rod
elf-shoot
elf-shot
elf-speech
elf-stone
elf-stricken
elf-striking
elf-struck
elf-taken
elf-twisted
elf-wing
elf-woman
elf-wort

“elf, n.1.” OED Online, Oxford University Press, June 2017, http://www.oed.com/view/Entry/60431. Accessed 13 September 2017.

“elf-lock, n.” OED Online, Oxford University Press, June 2017, http://www.oed.com/view/Entry/60439. Accessed 13 September

“ˈelf-shoot, v.” OED Online, Oxford University Press, June 2017, http://www.oed.com/view/Entry/60441. Accessed 13 September 2017.

“ˈelf-shot, n.” OED Online, Oxford University Press, June 2017, http://www.oed.com/view/Entry/60442. Accessed 13 September 2017.

“ˈelven, n.” OED Online, Oxford University Press, June 2017, http://www.oed.com/view/Entry/60661. Accessed 13 September 2017.

“elvish, adj.” OED Online, Oxford University Press, June 2017, http://www.oed.com/view/Entry/60664. Accessed 13 September 2017.