As planned, I tested the algorithm with some image samples of bengali newspaper, found as a result of a simple Google Image search.
The original image i used had a lot of noise, here is a preview :
The original image i used had a lot of noise, here is a preview :
Here, we can see that it has a lot of salt-pepper style noise, ad the resolution is not particularly good.
However, trying to whitewash it as it is yielded poor results.
It can be seen that due to the noise, the algorithm fails to detect the individual lines. S I appliead an adaptive thresholding filter on the image, and the denoised image looked like :
Applying whitewashing to this as input resulted as expected,
This proves the generic working of the algorithm.
However, I still had to manually reset the padding values, which needs to be resolved.
As midterm approaches, i have a lot at hand and on my mind.
Here is my new plan, in brief, and in the order I plan to carry them out :
To Do :
- Document Everything done so far. (2 notebooks - ImageTest with filters and WhiteWashing algorithm)
- Update and Publish the documents.
- Document effects of domain-dependency (like this post deals with close-typed documents) and useful filters.
- Document summarising what I learnt during this period, and what made me change my initial plan.
- Test with different resolutions. (After mid-term)
- Merge the notebooks.
- Document.
I think I will stop coding for a while and get started with the documentation now. I will have to discuss the same with my mentor.
So Long.
No comments:
Post a Comment