I would like to use LSI (latent semantic indexing) to provide access, within the public library, to a local history collection that I have been scanning. The material is mostly newspaper pages and clippings, scanned with the PaperPort software that came with our Brother all-in-one scan/fax/copy machine.
I've read the "right tool(s) for an archive of newspaper columns" post, but I do not have copyright clearance for the scanned material, and I am not looking to put it online, which is what most of the suggested tools seem designed to do. DevonThink seems to be out, as we're strictly Windows (including a few Windows 7 machines), with no Macs.
We're also perpetually broke (who isn't?) and unlikely to spend anything on this project, including hiring IT help; we have none in-house. Although I'm motivated to do the text cleanup of the scanned corpus, I'm not too tech-savvy, nor am I a mathematician. (I have a bad case of English-major-brain; it gets all fuzzy somewhere around the word "vector".) I still hope we can transcend our limitations and those of keyword indexing. Any help? Any suggestions? Thanks! DH