<div dir="ltr"><div>@Lukas: I was having a bit of trouble getting the guest account permissions set up on my AWS but then Bryce went ahead and posted a direct link, thanks for that!</div><div><br></div><div>Isaac<br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Sat, Apr 10, 2021 at 5:52 AM Bryce Cogswell via talk <<a href="mailto:talk@openstreetmap.org">talk@openstreetmap.org</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div style="overflow-wrap: break-word;"><div dir="auto" style="overflow-wrap: break-word;"><br><div><blockquote type="cite"><div><div>@Bryce: Did you already make significant efforts regarding deduplicating / sorting or otherwise processing the images? If yes, maybe you could share this altered dataset with Isaac and other interested parties?<br></div></div></blockquote><br>I didn’t do any additional work on deduplicating the images. I’m not sure why you think this is important if you’re going to use it for ML training.<br><br><blockquote type="cite"><div><div>@Bryce: Congratulations! I already saw some correctly recognized specimens! That is certainly encouraging, isn't it? Do you already know if/how you would proceed further? If you would be okay with publishing with what you already have, maybe others could build upon that.<br><br>I remember one idea we had: If users of such a recognition feature would be willing to (automatically, with little/no effort) share the pictures to increase the pool of pictures you could create a virtuos cycle, especially if you can motivate them to either mark detections as correct or let them fix it as needed.<br></div></div></blockquote><br></div><div>Keep in mind I’m not doing any ML training, so having a larger sample size doesn’t benefit me. I wanted a large number of test images in order to measure the expected accuracy of the OCR and algorithm in a real-world settings. My plan now is to build a stand-alone app for testing during surveying, improve the recognition by building better spatial models of how the text is laid out, and then finally integrate it into Go Map!!</div><div><br></div><div>I’m working on this at <a href="https://github.com/bryceco/OpeningHoursPhoto" target="_blank">https://github.com/bryceco/OpeningHoursPhoto</a> but the code is super rough at this point.</div><div>The image set it is at <span style="font-family:Menlo;font-size:16px"><a href="http://gomaposm.com/opening_hours/opening_hours.zip" target="_blank">https://gomaposm.com/opening_hours/opening_hours.zip</a> (12.5GB download)</span></div><div><span style="font-family:Menlo;font-size:16px"><br></span></div><div><span style="font-family:Menlo;font-size:16px">Bryce</span></div><div><span style="font-family:Menlo;font-size:16px"><br></span></div></div></div>_______________________________________________<br>

talk mailing list<br>

<a href="mailto:talk@openstreetmap.org" target="_blank">talk@openstreetmap.org</a><br>

<a href="https://lists.openstreetmap.org/listinfo/talk" rel="noreferrer" target="_blank">https://lists.openstreetmap.org/listinfo/talk</a><br>

</blockquote></div>