I am using ABBYY Finereader 10.0 to OCR some material I've scanned. I would like to export the result as a PDF with the text "under" the image. Everything seems to be going well, except for the way soft hyphens (Finereader calls them "optional hyphens") are handled in the PDF output.
While the OCR successfully recognizes most end-of-line hyphens as soft hyphens, when the result is exported to PDF, the word remains split in two and therefore does not appear in search results. For example, if the word "digital" is broken across a line end as "digi-" and "tal", Finereader recognizes the hyphen as a soft hyphen. But if I export to PDF and search for the term "digital", it is not found (although "digi-" and "tal" are).
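To illustrate what I mean, here is a rough Python sketch of the behavior I am seeing versus the behavior I would expect. This is not anything Finereader does internally. The sample strings and the `searchable` helper are made up, and I am assuming the exported text layer ends up with a plain hyphen plus a line break where the soft hyphen (U+00AD) was.

```python
SOFT_HYPHEN = "\u00ad"  # Unicode soft hyphen, what Finereader calls an "optional hyphen"


def searchable(text: str) -> str:
    """Drop soft hyphens, along with a line break directly after one."""
    return text.replace(SOFT_HYPHEN + "\n", "").replace(SOFT_HYPHEN, "")


# Assumption: what the PDF text layer seems to contain (hard hyphen + line break).
exported = "the digi-\ntal archive"

# Assumption: what I would expect if the soft hyphen survived the export.
expected = "the digi" + SOFT_HYPHEN + "\ntal archive"

print("digital" in exported)              # False
print("digital" in searchable(expected))  # True
```

In other words, a search tool could rejoin the word if the hyphen stayed "soft," but once it is written out as an ordinary hyphen there is nothing to tell it apart from a real one.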
Any thoughts on how to handle this? I could just manually rejoin these words, but that seems absurd. After futzing about for quite a while, though, I've been unable to find a better solution.