Comment Re:Document Management Software and OCR (Score 3, Informative) 211
For an Open Source DMS that generates searchable PDF Files, try ArchivistaBox: http://sourceforge.net/projects/archivista/
Tesseract (including fracture / black-letter recognition) and the Linux port of Cuneiform (BSD licence) OCR engines are used for text recognition. The hocr2pdf module (see http://www.exactcode.de/ is used to generate the searchable PDF files.
(http://sourceforge.net/forum/forum.php?forum_id=868471)