Sunday, September 1, 2013

more training data

Still a bit irregular with the updates, but finally I have some more training data that is about 80% ready.
Also, I am working on a separate script to generate the test image for producing the test data and hence, the 20% incompleteness in the new data. A preliminary version of the same is already in the repository.
However, I expect this script to be complete by next Thursday, and then have the final matrix by next Sunday night.
After that, as I have been reminded by my mentor again, I should focus on preparing the final report.
It's sad, but harsh reality, that OCR projects never seem to turn out enough data. I plan to publish at least 2 sets of training data I generated during this summer in some way through the repository. Though, the stock data still seems to perform better.
Also, I need to start working on an outline plan for the final report. Lots of work to do in a seemingly short period of time, maybe more so because it coincides with some period of health issues and then re-opening of school. It is one of the points I would like to mention in the GSoC review, but of course as my personal opinion. 
The most important factor right now is not to loose motivation and carry on with the project. So Long.

No comments:

Post a Comment