Friday, February 3, 2012

Lessons Learned: Lingua Compara 00

I tried running a few big samples, and it seems that my little device can't handle anything much longer than Beowulf, which is okay for the time being. I can cover a lot with that.

Internet Explorer continues to vex me with its native restrictions on input character length.

I think there may be something strange going on with the/to frequencies - more on that in a later post.

I think I may try something like MapReduce to speed up the analysis of really large texts.
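
To make that idea concrete, here is a rough sketch of what a map/reduce-style word count might look like. It is written in Python purely for illustration and is not the code behind Lingua Compara; the function names, the tokenizing regex, and the chunk size are all assumptions of mine.

```python
import re
from collections import Counter
from functools import reduce


def split_into_chunks(text, chunk_size=1000):
    """Lowercase the text, pull out the words, and group them into
    lists of at most chunk_size words (hypothetical helper)."""
    words = re.findall(r"[a-z']+", text.lower())
    return [words[i:i + chunk_size] for i in range(0, len(words), chunk_size)]


def map_chunk(words):
    """Map step: count the words in a single chunk."""
    return Counter(words)


def reduce_counts(left, right):
    """Reduce step: merge two partial counts into one."""
    return left + right


def word_frequencies(text, chunk_size=1000):
    """Count words chunk by chunk, then merge the partial counts."""
    chunks = split_into_chunks(text, chunk_size)
    partial_counts = map(map_chunk, chunks)
    return reduce(reduce_counts, partial_counts, Counter())


freqs = word_frequencies("The cat went to the mat to sit", chunk_size=3)
print(freqs["the"], freqs["to"])
```

As written, this runs serially, so it only shows the shape of the approach. Any real speedup would have to come from running the map step in parallel, for example by handing the chunks to `multiprocessing.Pool.map` instead of the built-in `map`.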

