J. Dzerins and K. Dzonsons. Harvesting national language text corpora from the Web. Proceedings of the 3rd Baltic Conference on Human Language Technologies (Baltic HLT), 2007.