Babeldoc


Babeldoc is a framework and set of applications for processing documents in business-to-business and other Internet and integration applications. It is primarily intended for text documents, especially XML, but supports a wide range of operations and data types. It has a sophisticated journaling system that supports replaying and reprocessing documents. Babeldoc is pipeline-based, and its pipeline stages can be combined in numerous ways and reconfigured dynamically. It has a GUI and a Web-based console for document processing and monitoring, and comes with tools for transforming flat-file data to XML, for archival, and for cryptography. It can also scan various data sources based on sophisticated constraints.

Recent releases

  •  26 Dec 2003 19:35

    Release Notes: A directory scanning defect that stopped directories from being added to the Babeldoc classpath has been fixed. An error in segmented line processing in the conversion module has been fixed.

  •  17 Nov 2003 00:43

    Release Notes: This new codebase brings many major improvements as well as bug fixes. The module system has been reworked so that dynamic runtime and build-time behavior is simple, flexible, and powerful. The Scanner module has been rewritten almost completely. The J2EE module has returned in improved form. In addition, the multithreaded pipeline engine is more capable for online applications.

  •  20 Oct 2003 16:18

    Release Notes: Fixes were made to filters. A feature for accessing Velocity scripts from the filesystem was added. Other small fixes were made to the codebase.

  •  29 Sep 2003 16:11

    Release Notes: This version prepares for the major 1.2 release. The directory scanner is now protected against incomplete reads. The init module was updated so that bad classpath entries do not kill Babeldoc. It is now possible to exclude modules from the build. Documentation and javadoc improvements were made. More work was done on the GUI and J2EE modules, including an MDB feeder.

  •  04 Sep 2003 08:03

    Release Notes: Documentation updates and cleanups were made. IConfigInfo objects were implemented throughout the program, and the J2EE module was reinstated. PipelineErrorHandler bugs were fixed, and ReaderPipelineStage was added. Small tweaks were made to the build (modules can now be excluded), and fixes were made to the Journal and Scanner. The SQL and GUI modules were updated, a new null scanner was added, and PostgreSQL journal support was included.

