Certain glyphs, bigrams, or words are more or less common depending on their position in a line or on the first line of a paragraph. The text exhibits some kind of structure based on the line as a whole. The physical line of text—the line on a page—was in some way important or meaningful to the writer.
- The glyphs [f, p] are much more likely to occur on the first line of a paragraph than elsewhere in the text.
- The first word on the first line of a paragraph often begins with a gallows glyph [k, t, f, p]. This word is typically rare in the text but the first glyph can often be removed to result in a more common word. The paragraph initial words which begin with a gallows glyph are known as Grove words after their discoverer.
- Words beginning with the glyphs [y, d, s] are more common at the start of a line than in the text as a whole. Some specific bigrams of these glyphs occur more in this position than elsewhere else.
- The bigrams [ych] and [ysh] have 78% and 90%, respectively, of their occurrences at the start of a line.
- The bigrams [dch] and [dsh] have 54% and 72%, respectively, of their occurrences at the start of a line.
- Conversely, words beginning with the glyph [a] are less common than expected at the start of a line. Only 1% occur in this position compared with the expected ~12%.
- While the gallows glyph [k] occurs at the beginning of a line start word 10% of the time, the glyph [t] occurs in the same position 41% of the time.
[More detail to follow.