Since the previous post we’ve succeeded in using tesseract and we now have a nice plain text version of the EB entry on shakespeare: http://knowledgeforge.net/shakespeare/svn/trunk/shksprdata/ancillary/britannica-11th.txt What we now need to do is ‘proof’ this to correct the OCR errors. This kind of think is perfect for distributed volunteers so if you’d like to help out […]
Don't miss a thing! Stay on top of what's happening in the #OpenMovement around the world.