Sunday 24 October 2010

Skrifenn Etek - Writing Eighteen - Tekstow Kernewe - Cornish Texts

I have recently acquired a book on "Natural Language Processing with Python" and have begun to apply its principles to a few Cornish texts. I have downloaded a number of the traditional texts from http://corpus.kernewek.cymru247.net/ as well as a couple of modern texts (the short story Solempnyta by Benjamin Bruch and an translation of a chapter of the Lord of the Rings into Cornish from Keskewsel. If anyone reading this has any further texts in electronic form that they'd be willing to let me use let me know.

So here are a few basic results:

['bmkk.txt', 'cwkdlkk.txt', 'omkkks.txt', 'pckk.txt', 'rdkk.txt', 'solempnyta_kk.txt', 'tolkien_kk.txt', 'tregkk.txt']

Text: Improved version 21 / 10 / 96 Bewnans... (Bewnans Meryasek)

Collocations: Building collocations list Collocations are words that tend to be more likely to occur together than would be suggested by their general frequency. This is a built in function of Python's Natural Language Toolkit

pur wir; Comes venetensis; heb falladow; Tertius tortor; Secundus
tortor; Primus tortor; pub eur; Yesu Krist; Episcopus Kernow; Rag
kerensa; heb ahwer; kuv kolonn; wosa hemma; deun alemma; pur dhiogel;
pub termyn; heb namm; heb wow; Primus exulator; dha vodh
None

number of words = 26815

number of different words = 4664

Lengths of words in descending order of frequency [(3, 5094), (2, 4813), (4, 3857), (5, 3270), (1, 3078), (6, 2636), (7, 1697), (8, 1180), (9, 612), (10, 385), (11, 115), (12, 57), (13, 18), (14, 2), (18, 1)]

Top 50 words: ['a', 'y', 'n', 'dhe', 'ha', 'yn', 'an', 'ow', 'my', 'yw', 'c', 'ny', 'na', 're', 's', 'dha', 'omma', 'pur', 'ni', 'm', 'rag', 'meryasek', 'ma', 'sur', 'krist', 'yesu', 'bys', 'th', 'hwi', 'mar', 'heb', 'arloedh', 'oll', 'ev', 'vynn', 'gans', 'yma', 'dyw', 'vydh', 'lemmyn', 'vy', 'maria', 'den', 'ty', 'wir', 'dell', 'eus', 'meriadocus', 'dhymm', 'sertan']

Top 50 words of 4 or more letters: ['omma', 'meryasek', 'krist', 'yesu', 'arloedh', 'vynn', 'gans', 'vydh', 'lemmyn', 'maria', 'dell', 'meriadocus', 'dhymm', 'sertan', 'meur', 'dhymmo', 'dhyn', 'dhis', 'finit', 'episcopus', 'agas', 'comes', 'primus', 'secundus', 'nyns', 'yredi', 'orth', 'henna', 'prest', 'syrr', 'agan', 'devri', 'tortor', 'dhywgh', 'nevra', 'gweres', 'alemma', 'hanow', 'bydh', 'bynytha', 'deun', 'dhodho', 'epskop', 'hemma', 'lies', 'descendit', 'dhiso', 'lowena', 'mones', 'aredy']

Text: # ------------------------------------------------------------------------ # # The text of _Gwreans... (Gwreans an Bys)

Collocations: Building collocations list (note that I really should have trimmed some of the comments out that are in the file but not part of the Cornish text)
KDL page; ### KDL; par dell; lever dhis; pub eur; pub tra; pur vras;
pub prys; wosa hemma; pur wir; Nyns eus; vynn mos; dhe vos; Der henna;
mos alemma; mar vras; heb falladow; warn ugens; FIRST DEVIL; myns eus
None

number of words = 15044

number of different words = 2539

Lengths of words in descending order of frequency [(3, 3450), (2, 3007), (4, 2420), (5, 1666), (1, 1663), (6, 1474), (7, 747), (8, 381), (9, 144), (10, 59), (11, 29), (12, 4)]

Top 50 words: ['a', 'ha', 'y', 'n', 'an', 'dhe', 'my', 'ow', 'yw', 'yn', 'ny', 'na', 'yth', 'adam', 'the', 'bys', 'pur', 'dha', 'ty', 'vydh', 'rag', 'hag', 'henna', 'm', 'pub', 'ma', 'oll', 'omma', 'kdl', 'page', 'th', 'dhymm', 'dyw', 'gans', 'ev', 'mar', 'tas', 'to', 'vynn', 'hwi', 'in', 'der', 'dhymmo', 'eus', 'heb', 'ms', 'and', 'eva', 'vy', 'gwrys']

Top 50 words of 4 or more letters: ['adam', 'vydh', 'henna', 'omma', 'page', 'dhymm', 'gans', 'vynn', 'dhymmo', 'gwrys', 'dhis', 'bras', 'dell', 'father', 'lemmyn', 'nevra', 'serpent', 'dout', 'genev', 'kaym', 'seth', 'keth', 'vras', 'nyns', 'rakhenna', 'hemma', 'meur', 'cain', 'prys', 'orth', 'abel', 'alemma', 'fydh', 'hwath', 'lever', 'yndella', 'ynwedh', 'sertan', 'woer', 'heaven', 'plas', 'agan', 'gwel', 'mayth', 'ragdho', 'wartha', 'genes', 'lavar', 'maga', 'wydhenn']

Text: # --------- ORIGO MUNDI --------- # Keith Syed...

Collocations: Building collocations list
DEUS PATER; REX SAL; pur wir; Tas Dyw; heb falladow; Nyns eus; teyr
gwelenn; may hallo; nyns eus; heb fall; heb wow; Lavar dhymmo; dell
vynni; dres puptra; Arloedh ker; pub huni; pub eur; kollenwel bodh;
verr dermyn; war bayn
None

number of words = 15533

number of different words = 2608

Lengths of words in descending order of frequency [(3, 3375), (2, 3033), (4, 2396), (1, 1898), (5, 1673), (6, 1421), (7, 956), (8, 469), (9, 171), (10, 97), (12, 21), (11, 19), (13, 2), (14, 2)]

Top 50 words: ['a', 'ha', 'y', 'an', 'n', 'yn', 'dhe', 'ow', 'my', 'dha', 'rag', 'yw', 'na', 'ny', 'dyw', 'oll', 'm', 're', 'bys', 'th', 'vydh', 'arloedh', 'war', 'hag', 'heb', 'may', 'dell', 'ev', 'gans', 'mar', 'dhis', 'ty', 'ma', 'tas', 'i', 'wra', 'ni', 'dhymm', 'lemmyn', 'deus', 'dre', 'nev', 'adam', 'vynn', 'moyses', 'pan', 'pur', 'bos', 'eus', 'pater']

Top 50 words of 4 or more letters: ['vydh', 'arloedh', 'dell', 'gans', 'dhis', 'dhymm', 'lemmyn', 'deus', 'adam', 'vynn', 'moyses', 'pater', 'dhymmo', 'dhyn', 'agan', 'skon', 'bras', 'hware', 'nevra', 'dhodho', 'gwrys', 'nyns', 'sertan', 'dhiso', 'dhyw', 'meur', 'keffrys', 'kyns', 'orth', 'dhywgh', 'henna', 'leun', 'fydh', 'hweg', 'vynytha', 'bydh', 'abel', 'omma', 'onan', 'awos', 'kemmer', 'ellas', 'gwel', 'gwra', 'bones', 'deun', 'nuncius', 'pyth', 'ynwedh', 'bennath']

Text: PASSIO CHRISTI - KK Version made from Norris...

Collocations: Building collocations list
Mab Dyw; Princeps Annas; IVs Tortor; IIs Tortor; IIIs Tortor; pur wir;
heb lettya; tri dydh; kepar dell; dha vodh; IIs Doctor; Dydh Breus;
dhis lowena; Pur wir; dhe wruthyl; kettep onan; may hallo; Arloedh
ker; Myghtern Yedhewon; Nyns eus
None

number of words = 21260

number of different words = 3604

Lengths of words in descending order of frequency [(3, 4249), (2, 4152), (4, 3066), (5, 2425), (1, 2406), (6, 2115), (7, 1408), (8, 779), (9, 366), (10, 218), (11, 47), (12, 18), (13, 5), (14, 5), (16, 1)]

Top 50 words: ['a', 'y', 'n', 'yn', 'my', 'an', 'ha', 'dhe', 'ow', 'yw', 'et', 'rag', 'ny', 're', 'na', 'dha', 'ev', 'oll', 'hag', 'm', 'dyw', 'tortor', 'bys', 'war', 'gans', 'hic', 'mar', 'ihc', 'th', 'cayphas', 'ma', 'mab', 'may', 'ad', 'ihesu', 'dell', 'lemmyn', 'dhis', 'hwi', 'ty', 'heb', 'pan', 'tunc', 'vydh', 'wra', 'ni', 'arloedh', 's', 'den', 'dre']

Top 50 words of 4 or more letters: ['tortor', 'gans', 'cayphas', 'ihesu', 'dell', 'lemmyn', 'dhis', 'tunc', 'vydh', 'arloedh', 'pilatus', 'dhodho', 'sertan', 'meur', 'vynn', 'dicit', 'dhymm', 'henna', 'dhyn', 'mara', 'annas', 'skon', 'kyns', 'dhywgh', 'dhymmo', 'hware', 'agan', 'agas', 'lowena', 'ellas', 'gwas', 'gwir', 'mars', 'petrus', 'princeps', 'syrr', 'bras', 'hweg', 'lavar', 'worth', 'bydh', 'hayl', 'myghtern', 'dhiso', 'fydh', 'lever', 'nyns', 'yndella', 'awos', 'kettep']

Text: Resurrectio Domini - KK Version made from Norris...

Collocations: Building collocations list
pur wir; Spyrys Sans; fem .?}; verr spys; tressa dydh; kepar dell;
Arloedh ker; Penn vyghternedh; dhe dhasserghi; Kepar dell; vos
dasserghys; osculatur eos; IVs Miles; Ihesu Cryst; Mab Maria; tri
dydh; januis clausis; IIIs Miles; hakkra mernans; heb lettya
None

number of words = 16209

number of different words = 2635

Lengths of words in descending order of frequency [(3, 3314), (2, 3146), (4, 2310), (1, 2142), (5, 1864), (6, 1476), (7, 990), (8, 497), (9, 239), (10, 171), (11, 40), (12, 16), (13, 3), (14, 1)]

Top 50 words: ['a', 'y', 'n', 'yn', 'ha', 'dhe', 'an', 'ow', 'my', 'yw', 'ny', 'na', 'ev', 'rag', 'arloedh', 'dha', 'mar', 'ty', 'm', 'bys', 'dell', 'ni', 'oll', 'th', 'sur', 're', 'vydh', 'gans', 'meur', 'hag', 'pur', 'ihesu', 'lemmyn', 'nev', 'thomas', 'et', 'dre', 'ma', 'heb', 'bedh', 'pan', 'cryst', 'dhymmo', 'ns', 'dhymm', 'hwi', 'maria', 'dhis', 'dyw', 'neb']

Top 50 words of 4 or more letters: ['arloedh', 'dell', 'vydh', 'gans', 'meur', 'ihesu', 'lemmyn', 'thomas', 'bedh', 'cryst', 'dhymmo', 'dhymm', 'maria', 'dhis', 'dhyn', 'skon', 'bydh', 'korf', 'imperator', 'nyns', 'agan', 'miles', 'sertan', 'marow', 'henna', 'ellas', 'ynwedh', 'tortor', 'bras', 'dhywgh', 'lavar', 'vernona', 'agas', 'leun', 'drog', 'grys', 'myghtern', 'pilatus', 'genen', 'golonn', 'dasserghys', 'genev', 'krysi', 'vynn', 'dhodho', 'gwir', 'tunc', 'hedhyw', 'kepar', 'nevra']

Text: Solempnyta Blackheath . 17 Metheven 1997 . Wel...

Collocations: Building collocations list (Note that this is quite a short text)
Pow Sows; yth esa; dro dhe; brassa rann; mos tre; Hag ythó; Pur dha;
rann anedha; Gov haâ; dell grysav; esov omma; eus saw; rag covhÃ; den
vyth; omma rag; dhe Loundres; ledhys gans; Nyns eus; nyns eus; Henry
ledhys
None

number of words = 1264

number of different words = 511

Lengths of words in descending order of frequency [(2, 279), (3, 229), (4, 171), (5, 138), (1, 129), (6, 118), (7, 82), (8, 51), (9, 27), (10, 20), (11, 18), (13, 1), (14, 1)]

Top 50 words: ['an', 'a', 'n', 'yn', 'yw', 'ha', 'y', 'dhe', 'sowsnek', 'ma', 'ow', 'rag', 'my', 'nyns', 'henry', 'mes', 'vy', 'gans', 're', 'hag', 'omma', 'hy', 'o', 'dell', 'shakespeare', 'sows', 'taves', 'war', 'yeth', 'yth', 'dhymm', 'le', 'na', 'nebes', 'ny', 'oll', 'pan', 'po', 'pow', 'yma', 'bos', 'genev', 'gov', 'gwari', 'heb', 'hi', 'may', 'tus', 'vyth', 'aga']

Top 50 words of 4 or more letters: ['sowsnek', 'nyns', 'henry', 'gans', 'omma', 'dell', 'shakespeare', 'sows', 'taves', 'yeth', 'dhymm', 'nebes', 'genev', 'gwari', 'vyth', 'avel', 'erel', 'hedhyw', 'henna', 'kernewek', 'orth', 'sowsneger', 'studhya', 'whath', 'wosa', 'anedha', 'aral', 'bedh', 'blackheath', 'clewes', 'dann', 'dherag', 'honan', 'ledhys', 'margh', 'martesen', 'mernans', 'nans', 'ogas', 'skila', 'tiwedh', 'vernans', 'yethow', 'arta', 'bothek', 'brassa', 'cales', 'cansblydhen', 'clappya', 'codhas']

Text: Osta karer Arloedh An Bysowyer ? Wel ,... (A chapter of Lord Of the Rings rendered in Cornish at www.keskewsel.com)

Collocations: Building collocations list
Yth esa; yth esa; dhe vos; haval orth; Unn Bysow; medh Gandalf; Bag
End; Nyns eus; dro dhe; leveris Gandalf; wovynnas Frodo; medh Frodo;
neb kas; fatell wrug; dell dybav; dann gel; res dhis; rewlya oll;
Parkow Gladen; dre dermyn
None

number of words = 11147

number of different words = 1966

Lengths of words in descending order of frequency [(2, 2309), (3, 2004), (1, 1526), (4, 1442), (5, 1342), (6, 976), (7, 772), (8, 395), (9, 185), (10, 129), (11, 36), (12, 10), (13, 9), (17, 5), (15, 3), (18, 2), (14, 1), (16, 1)]

Top 50 words: ['a', 'an', 'ev', 'y', 'yn', 'ha', 'n', 'dhe', 'hag', 'mes', 'o', 'ow', 'na', 'yw', 'ny', 'frodo', 'bysow', 'esa', 'vy', 'yth', 're', 'my', 'nyns', 'gans', 'wrug', 'dell', 'bos', 'rag', 'i', 'oll', 'gandalf', 'vos', 'bylbo', 'orth', 'po', 'mar', 'termyn', 'henna', 'dre', 'leveris', 'meur', 'dhodho', 'medh', 'aga', 'es', 'pan', 'pur', 'dres', 'ta', 'yma']

Top 50 words of 4 or more letters: ['frodo', 'bysow', 'nyns', 'gans', 'wrug', 'dell', 'gandalf', 'bylbo', 'orth', 'termyn', 'henna', 'leveris', 'meur', 'dhodho', 'medh', 'dres', 'arta', 'kever', 'nerth', 'dhymm', 'diworth', 'golum', 'shayr', 'tewl', 'haval', 'hobytow', 'hwir', 'nebes', 'wosa', 'henn', 'honan', 'lemmyn', 'yndella', 'arall', 'kyns', 'vydh', 'hwath', 'ganso', 'klywes', 'pyth', 'woer', 'drefenn', 'elfow', 'leverel', 'owth', 'ytho', 'dhis', 'nans', 'nevra', 'orto']

Text: THE TREGEAR HOMILIES KK Version made from Christopher...

Collocations: Building collocations list
dhe vos; kepar dell; Spyrys Sans; agan Savyour; katholik eglos; pub
eur; mar veur; heb diwedh; fatell wrug; Yesu Krist; dre reson; agan
honan; Savyour Yesu; res dhyn; agan Arloedh; Katholik Eglos; Sans
Eglos; dell wrug; dhe leverel; gan Savyour
None

number of words = 40897

number of different words = 5246

Lengths of words in descending order of frequency [(2, 8508), (3, 7334), (1, 5121), (4, 5001), (5, 4461), (6, 3516), (7, 2555), (8, 2112), (9, 1009), (10, 637), (11, 317), (12, 155), (13, 99), (14, 43), (15, 17), (16, 5), (17, 3), (19, 2), (18, 1), (20, 1)]

Top 50 words: ['a', 'ha', 'an', 'n', 'dhe', 'y', 'yn', 'yw', 'ow', 'ni', 'ev', 'ma', 'na', 'rag', 'krist', 'agan', 'wrug', 's', 'oll', 'dre', 'yma', 'eglos', 'dyw', 'gans', 'hag', 'bonner', 'fatell', 'henna', 'et', 'kepar', 'den', 'leverel', 'vos', 'aga', 'yth', 'mar', 'keth', 're', 'honan', 'dell', 'bos', 'i', 'in', 'vydh', 'folio', 'ny', 'o', 'de', 'homily', 'nyns']

Top 50 words of 4 or more letters: ['krist', 'agan', 'wrug', 'eglos', 'gans', 'bonner', 'fatell', 'henna', 'kepar', 'leverel', 'keth', 'honan', 'dell', 'vydh', 'folio', 'homily', 'nyns', 'dhyn', 'dhyw', 'ynwedh', 'korf', 'savyour', 'rakhenna', 'hemma', 'henn', 'dhiworth', 'katholik', 'onan', 'geryow', 'pyth', 'hwath', 'arloedh', 'peder', 'chaptra', 'gwrys', 'omma', 'yndella', 'skryptor', 'lemmyn', 'bobel', 'sans', 'arall', 'dhodho', 'goes', 'leveris', 'lies', 'spyrys', 'agas', 'powl', 'termyn']

Here we see what percentage of the text words of various lengths make up:

Text: Improved version 21 / 10 / 96 Bewnans...

3 letters : 14.05 %
2 letters : 13.27 %
4 letters : 10.63 %
5 letters : 9.020 %
1 letters : 8.490 %
6 letters : 7.271 %
7 letters : 4.681 %
8 letters : 3.255 %
9 letters : 1.688 %
10 letters : 1.062 %
11 letters : 0.317 %
12 letters : 0.157 %
13 letters : 0.049 %
14 letters : 0.005 %
18 letters : 0.002 %



Text: # ------------------------------------------------------------------------ # # The text of _Gwreans...

3 letters : 15.71 %
2 letters : 13.70 %
4 letters : 11.02 %
5 letters : 7.590 %
1 letters : 7.577 %
6 letters : 6.715 %
7 letters : 3.403 %
8 letters : 1.735 %
9 letters : 0.656 %
10 letters : 0.268 %
11 letters : 0.132 %
12 letters : 0.018 %



Text: # --------- ORIGO MUNDI --------- # Keith Syed...

3 letters : 16.67 %
2 letters : 14.98 %
4 letters : 11.83 %
1 letters : 9.376 %
5 letters : 8.264 %
6 letters : 7.020 %
7 letters : 4.722 %
8 letters : 2.316 %
9 letters : 0.844 %
10 letters : 0.479 %
12 letters : 0.103 %
11 letters : 0.093 %
13 letters : 0.009 %
14 letters : 0.009 %



Text: PASSIO CHRISTI - KK Version made from Norris...

3 letters : 15.49 %
2 letters : 15.14 %
4 letters : 11.18 %
5 letters : 8.844 %
1 letters : 8.774 %
6 letters : 7.713 %
7 letters : 5.135 %
8 letters : 2.841 %
9 letters : 1.334 %
10 letters : 0.795 %
11 letters : 0.171 %
12 letters : 0.065 %
13 letters : 0.018 %
14 letters : 0.018 %
16 letters : 0.003 %



Text: Resurrectio Domini - KK Version made from Norris...

3 letters : 15.66 %
2 letters : 14.86 %
4 letters : 10.91 %
1 letters : 10.12 %
5 letters : 8.808 %
6 letters : 6.974 %
7 letters : 4.678 %
8 letters : 2.348 %
9 letters : 1.129 %
10 letters : 0.808 %
11 letters : 0.189 %
12 letters : 0.075 %
13 letters : 0.014 %
14 letters : 0.004 %



Text: Solempnyta Blackheath . 17 Metheven 1997 . Wel...

2 letters : 16.55 %
3 letters : 13.59 %
4 letters : 10.14 %
5 letters : 8.189 %
1 letters : 7.655 %
6 letters : 7.002 %
7 letters : 4.866 %
8 letters : 3.026 %
9 letters : 1.602 %
10 letters : 1.186 %
11 letters : 1.068 %
13 letters : 0.059 %
14 letters : 0.059 %



Text: Osta karer Arloedh An Bysowyer ? Wel ,...

2 letters : 15.47 %
3 letters : 13.42 %
1 letters : 10.22 %
4 letters : 9.661 %
5 letters : 8.991 %
6 letters : 6.539 %
7 letters : 5.172 %
8 letters : 2.646 %
9 letters : 1.239 %
10 letters : 0.864 %
11 letters : 0.241 %
12 letters : 0.067 %
13 letters : 0.060 %
17 letters : 0.033 %
15 letters : 0.020 %
18 letters : 0.013 %
14 letters : 0.006 %
16 letters : 0.006 %



Text: THE TREGEAR HOMILIES KK Version made from Christopher...

2 letters : 16.19 %
3 letters : 13.95 %
1 letters : 9.745 %
4 letters : 9.517 %
5 letters : 8.489 %
6 letters : 6.691 %
7 letters : 4.862 %
8 letters : 4.019 %
9 letters : 1.920 %
10 letters : 1.212 %
11 letters : 0.603 %
12 letters : 0.294 %
13 letters : 0.188 %
14 letters : 0.081 %
15 letters : 0.032 %
16 letters : 0.009 %
17 letters : 0.005 %
19 letters : 0.003 %
18 letters : 0.001 %
20 letters : 0.001 %

Here we show what percentage individual words make up of a given text.

Text: Improved version 21 / 10 / 96 Bewnans...

a : 3.233 %
y : 1.613 %
n : 1.431 %
dhe : 1.359 %
ha : 1.274 %
yn : 1.150 %
an : 1.051 %
ow : 1.006 %
my : 0.976 %
yw : 0.846 %
c : 0.822 %
ny : 0.689 %
na : 0.590 %
re : 0.571 %
s : 0.513 %
dha : 0.499 %
omma : 0.499 %
pur : 0.496 %
ni : 0.463 %
m : 0.449 %



Text: # ------------------------------------------------------------------------ # # The text of _Gwreans...

a : 3.781 %
ha : 1.581 %
y : 1.476 %
n : 1.435 %
an : 1.362 %
dhe : 1.316 %
my : 1.152 %
ow : 1.075 %
yw : 0.952 %
yn : 0.947 %
ny : 0.701 %
na : 0.651 %
yth : 0.542 %
adam : 0.492 %
the : 0.482 %
bys : 0.473 %
pur : 0.469 %
dha : 0.446 %
ty : 0.441 %
vydh : 0.428 %



Text: # --------- ORIGO MUNDI --------- # Keith Syed...

a : 4.890 %
ha : 1.753 %
y : 1.719 %
an : 1.570 %
n : 1.526 %
yn : 1.511 %
dhe : 1.383 %
ow : 1.373 %
my : 1.249 %
dha : 0.844 %
rag : 0.755 %
yw : 0.755 %
na : 0.666 %
ny : 0.666 %
dyw : 0.573 %
oll : 0.568 %
m : 0.543 %
re : 0.533 %
bys : 0.508 %
th : 0.464 %



Text: PASSIO CHRISTI - KK Version made from Norris...

a : 3.924 %
y : 2.337 %
n : 1.531 %
yn : 1.385 %
my : 1.316 %
an : 1.283 %
ha : 1.276 %
dhe : 1.050 %
ow : 1.013 %
yw : 0.846 %
et : 0.751 %
rag : 0.707 %
ny : 0.689 %
re : 0.652 %
na : 0.649 %
dha : 0.576 %
ev : 0.503 %
oll : 0.452 %
hag : 0.415 %
m : 0.404 %



Text: Resurrectio Domini - KK Version made from Norris...

a : 4.853 %
y : 2.216 %
n : 1.989 %
yn : 1.606 %
ha : 1.356 %
dhe : 1.327 %
an : 1.256 %
ow : 1.063 %
my : 1.001 %
yw : 0.954 %
ny : 0.907 %
na : 0.803 %
ev : 0.666 %
rag : 0.619 %
arloedh : 0.567 %
dha : 0.533 %
mar : 0.496 %
ty : 0.472 %
m : 0.448 %
bys : 0.430 %



Text: Solempnyta Blackheath . 17 Metheven 1997 . Wel...

an : 3.738 %
a : 3.323 %
n : 2.195 %
yn : 1.721 %
yw : 1.661 %
ha : 1.602 %
y : 1.186 %
dhe : 1.127 %
sowsnek : 1.127 %
ma : 1.008 %
ow : 1.008 %
rag : 0.890 %
my : 0.830 %
nyns : 0.830 %
henry : 0.712 %
mes : 0.652 %
vy : 0.652 %
gans : 0.593 %
re : 0.593 %
hag : 0.534 %



Text: Osta karer Arloedh An Bysowyer ? Wel ,...

a : 4.783 %
an : 2.599 %
ev : 2.512 %
y : 2.445 %
yn : 2.070 %
ha : 1.862 %
n : 1.587 %
dhe : 1.226 %
hag : 1.038 %
mes : 1.011 %
o : 0.884 %
ow : 0.824 %
na : 0.676 %
yw : 0.663 %
ny : 0.636 %
frodo : 0.596 %
bysow : 0.589 %
esa : 0.509 %
vy : 0.502 %
yth : 0.495 %



Text: THE TREGEAR HOMILIES KK Version made from Christopher...

a : 4.411 %
ha : 3.016 %
an : 2.925 %
n : 2.148 %
dhe : 1.899 %
y : 1.821 %
yn : 1.288 %
yw : 1.157 %
ow : 1.073 %
ni : 0.858 %
ev : 0.839 %
ma : 0.749 %
na : 0.698 %
rag : 0.679 %
krist : 0.664 %
agan : 0.616 %
wrug : 0.603 %
s : 0.527 %
oll : 0.464 %
dre : 0.439 %

No comments:

Post a Comment