Grove Words are defined as words which appear at the beginning of a paragraph and begin with a gallows character: <k, t, f, p>. They make up the bulk of paragraph–initial words throughout the text. Many will have been present in the statistics I discussed concerning linefirst words. I wish to outline a few thoughts on Grove Words, in order that they might be better typified. This should help us to eventually explain them, and also feed in to our understanding of linefirst words.
For the following discussion I use a list containing all those words which begin with a gallows character and occur at the beginning of a paragraph at least once in the ‘Stars’ section of the Voynich manuscript. There are 183 such words according to my count. The initial characters of these words are: <f>, 13 words (7%); <k>, 17 words (9%); <p>, 106 words (58%); <t>, 47 words (26%).
Sometimes it is observed that Grove Words are unique. About 57% of the Grove Words in the Stars section are unique in the whole manuscript, and about 85% are unique in the Stars section. But about 65% of all words in the Stars section are unique, and a potentially higher percentage for the manuscript as a whole (the figures are hard to count due to words with difficult readings). We cannot consider Grove Words to be substantially different from the text as a whole on this count.
We also cannot consider Grove Words to be strictly paragraph–initial only. Some of the more common examples occur in various places, away from both the beginning of a paragraph and the beginning of a line. Together with the former point, it is clear that any given Grove Word could occur elsewhere in the text. There can be no assumptions about a word in these respects simply because it appears in one place as a Grove Word.
The main point we wish to answer is why do so many Grove Words occur? That is, why do the majority of paragraph–initial words begin with a gallows? In the Stars section every paragraph on some pages begin with a Grove Word. The gallows characters <f, p, t> (but interestingly not <k>) are overrepresented at the beginning of lines when compared with the text as a whole. The cause of this is Grove Words.
Grove Words do not come about by random chance, as the number of possible initial characters and paragraphs in the Voynich manuscript is too great, and the pattern for Grove Words is too comprehensive.
Nor is it likely that the first words in most paragraphs begin with the same few characters due to the underlying sounds. That the first word on a page might represent a specific kind of thing (such as topic or the name of a plant) is possible, but the presence of Grove Words make it seem as though all these thing begin with one of only four sounds. Although some languages, such as in the Bantu family, have noun classes with a small number of prefixes, we would expect to see this pattern throughout the text, which we do not.
There must be a process which creates Grove Words. We can think of that process broadly working in one of two possible ways. The first is that words which already begin with a gallows are brought to the start of a paragraph, the other is that word which already start a paragraph are made to begin with a gallows.
Considering the first, we would have to presume that the word order is flexible enough for a gallows–initial word to be moved to the desired position, and that such words occur in the first sentence or even in the paragraph as a whole. Yet there are many examples of Grove Words being the only gallows–initial word in the whole paragraph, and also some paragraphs without a Grove Word yet with one or more gallows–initial word.
Further, even though <p> is the most common character at the beginning of a Grove Word, it is no more common as a word–initial character than <t> in the Stars section, and less common than <k>. Indeed, <k> is only a little more common as the first character of a Grove Word than <f>, despite being over eight times more common as an word–initial character in the text of the Stars section as a whole. If there was a process for moving gallows–initial words to the beginning of paragraphs we would expect to see the statistics for them to be the same as the whole text, and we do not.
There seems not to be any particular evidence for the belief that gallows–initial words are moved to the beginning of a paragraph to make Grove Words.
Considering the alternative, that paragraph–initial words are made to begin with a gallows, we have to think how a word might be transformed for this to occur. One is that the first character (or characters) is replaced with a gallows, the next that one or more characters before a gallows is removed, or the third—which is the one commonly assumed—that a gallows character is added onto the beginning of a word.
That the initial gallows may be replacing another character is unlikely. The characters immediately after the gallows are so diverse that no one character could replace a gallows and result in a valid word. It would have to be that a single gallows can replace multiple different characters, in which case the linguistic information of that character is lost. How would the reader know what word was meant?
Further, the structure of some words, such as <fchoctheody, kchdaldy, pcholky, tchokedy> suggest that the characters immediately after the gallows are part of a ‘Fore’ section (as discussed in my post about high level word structure) as so should not have anything in front of them. The same goes for a small number of Grove Words where the second character is <y>, which usually does not come in the middle of a word.
Some of the same objections carry over into the idea that the gallows character has become initial after the removal of a preceding character. The longer words would look even more abnormal and occurrences of middle <y> would not be solved.
But it should further be noted that restoring an initial character makes most of the words look less acceptable. We can test the idea of restoration by adding <o> to the beginning of Grove Words, as that is the most common initial character, and perfectly acceptable before a gallows. In 70% of cases the resulting word does not exist in the whole of the text. A further 14% has only one or two occurrences. This idea, then, outcomes in an even odder set of words at the beginning of paragraphs with greater diversion from the main text.
However, about 11% of words with an added <o> are common, with five or more occurrences, and some even into the low hundreds. But almost all of these words perform even better under the third option, that Grove Words are made by adding a gallows to an existing word. If we test this option by removing the gallows nearly 40% of Grove Words have more than ten occurrences, and about half have five or more. The words thus become more regular and more like the main text.
Yet 31% of words with the gallows removed have no occurrences, which presses for an explanation. There may be two reasons for this. One is that such words are genuinely unique. The properly do not begin with a gallows character, but as their only occurrence is at the beginning of a paragraph their form as a Grove Word is the only one known.
The other reason is that some Grove Words properly begin with a gallows character. Whatever process adds a gallows to the beginning of Grove Words does not happen if that words already begins with one. It may be that removing the gallows from such a word does result in a more common word, even if both are still valid.
Indeed, we must not assume that a word becoming more common with the removal of an initial gallows is a great proof, as many words beginning with different characters show an alike pattern. Characters can often be removed from the beginning of words and result in a valid outcome. It must be borne in mind that the problem of Grove Words is not that the words themselves make more sense with the gallows removed, but that their paragraph–initial distribution makes is sensible to question where the initial gallows comes from. That they can often be reasonably removed leads us to a possible solution.
The existence of Grove Words calls for an explanation, which can only be arrived at once they have been clearly typified. We have looked at a few aspects of such words and found that neither their uniqueness nor their position are strictly useful for typing them. Many words in the text are unique, and some Grove Words also occur elsewhere in the text.
The main problem for Grove Words is the underlying process which consistently puts words beginning with a gallows at the beginning of a paragraph. We are able to say a few things about this process which furthers our thinking.
1) A gallows character is added to the beginning of a paragraph–initial word. There is no reason to suspect that a word beginning with a gallows is moved to the start of a paragraph. The word is presumably in its position due to the logic of the underlying sentence, and the Grove Word is made from it.
2) Some Grove Words already begin with a gallows character and are unmodified (although Neal Lines could play a role). Again, such words are paragraph–initial due to the logic of the sentence, and a certain percentage of such words will begin with a gallows regardless.
3) Some Grove Words are genuinely unique even without the gallows character. This is statistically unsurprising, but allows for the possibility that some Grove Words may represent the name of topics or plants on their page—albeit with the gallows removed.
There is scope for further research on those words which appear at the beginning of paragraphs but without an initial gallows. It would also be worthwhile seeking to understand which Grove Words may properly have an initial gallows, as opposed to those which do not.