Projects / openDIAS


openDIAS (Document Imaging Archive System) provides document imaging with OCR. You can scan documents (with SANE) or import office documents, then assign them tags. It can store all your letters, bills, statements, etc. in a convenient, safe, and easily retrievable way.

Operating Systems

Recent releases

  •  31 May 2014 11:00

    Release Notes: This release added auto tagging by detecting similar documents. Username authentication now uses a new session system. A Dutch translation was added and a number of German translations were fixed. Scanning color and performance were improved. The UI was improved UI with print now, date formats, file uploads, and error feedback. There were lots of internal refactors and improvements, new tests, and increased test performance.

    •  19 Nov 2012 18:03

      Release Notes: This release implement issues #7 (document linkage), issue #12 (better testing), and issue #13 (device locking). PDF, ODF, and image imports correctly have a thumbnail and OCR performed. Works on 64-bit machines. Migrated to tesseract v3. I/O is all UTF-8. The front end will now time out rather than hang on an error. The document list is now auto-loaded rather than using a paged table. A localization framework has been put in place (with English and German languages added). Defunct speech functionality has been removed. Various bugfixes and cleanups.

      •  11 Jan 2012 22:30

        Release Notes: This release introduces major new functionality and polishes the code and user interface. Overall, it is a solid increase in functionality and quality.

        •  15 May 2011 14:39

          Release Notes: The software was totally rewritten from the bottom up. It is now a Web based client, interfacing into a server backend that controls the SANE devices and the database.

          •  02 Jul 2008 11:02

            Release Notes: Threading was added to "slick up" the UI in places. More memory leaks were fixed. Things are handled when the loading image is not available. Compiler flags are used to set the data_dir. Cleanup was done. A "no OCR libs" error was fixed. Lots of memory leaks were fixed. Bind vars are now used for database updates and inserts. The "should we OCR" setting now defaults to on. Image processing was added to allow paged views of scanned images. An icon was added to the application. A build failure bug if tesseract is not installed was fixed.


            Project Spotlight


            A Fluent OpenStack client API for Java.


            Project Spotlight

            TurnKey TWiki Appliance

            A TWiki appliance that is easy to use and lightweight.