Theo wrote: ↑
Why not use the native framework offered by Apple? The Vision framework offers multilingual character recognition. It is efficient, and even if it is not perfect (it has trouble with multicolumn texts), it is fully integrated into the system, evolves regularly, and avoids the cost and licensing problems of external solutions.
Many thanks for pointing out the text-recognition features of Apple's Vision framework! Using a built-in framework would indeed be ideal, and I should look into this.
That said, the typical use case for my app would be to add a text layer over the (scanned) text of an imported PDF. After the OCR process, users should be able to search, select, and copy the text of the PDF. So, ideally, the framework itself would understand how to deal with PDFs. Also, multi-column PDFs are very common in the academic world, so it would be good if these could be handled properly.
Anyway, I appreciate the pointer, thanks again!