Infovore


Infovore is a map/reduce framework for processing large RDF data sets such as Freebase and DBpedia. It is based on Hadoop.

Recent releases

  •  14 Apr 2014 20:38

    Release Notes: This release adds a job cost accounting function.

  •  08 Apr 2014 21:31

    Release Notes: Haruhi now writes a tag with the Hadoop job ID to all line items for the job, so this release can add up the line items carrying this tag to calculate the cost of a job after the fact. When running a flow (multiple jobs), Haruhi now uses the command line arguments of the flow to determine the name of the flow.

  •  19 Mar 2014 15:47

    Release Notes: Tuning job parameters has sped up the weekly flow from 2.5 hours to about 57 minutes, with a small cost reduction. A job to smush objects has been created, so it is now possible to import DBpedia PageLinks into the :BaseKB space.

  •  12 Mar 2014 22:12

    Release Notes: This release adds the "sumRDF" tool, which sums up RDF values and is necessary for the conversion of DBpedia-derived subjective importance scores to :BaseKB-compatible scores.

  •  10 Mar 2014 14:03

    Release Notes: This release adds the "smushSubject" tool, which can change the vocabulary used in the subject field using a reduce-side join.
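
The reduce-side join mentioned for "smushSubject" can be illustrated with a small in-memory sketch. This is not Infovore's code: the function names, the tag values, the `ex:` identifiers, and the choice to leave unmapped subjects unchanged are all assumptions made for illustration. The idea is that both the mapping table and the triples are keyed by the old subject, so each reducer call sees a subject's mapping together with all triples that use it.

```python
from collections import defaultdict

# Hypothetical sketch of a reduce-side join; none of these names come from Infovore.

def map_mapping(old, new):
    # Key by the old subject. Tag "0" sorts before "1", so the mapping
    # record reaches the reducer ahead of the triples for the same key.
    yield old, ("0", new)

def map_triple(s, p, o):
    # Key each triple by its subject so it meets the mapping for that subject.
    yield s, ("1", (p, o))

def shuffle(pairs):
    # Stand-in for the framework's shuffle: group values by key, sort both.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    for key in sorted(groups):
        yield key, sorted(groups[key])

def reduce_smush(key, values):
    # Assumption: a subject without a mapping is passed through unchanged.
    new_subject = key
    for tag, payload in values:
        if tag == "0":
            new_subject = payload
        else:
            p, o = payload
            yield (new_subject, p, o)

def smush_subject(mappings, triples):
    pairs = []
    for old, new in mappings:
        pairs.extend(map_mapping(old, new))
    for s, p, o in triples:
        pairs.extend(map_triple(s, p, o))
    out = []
    for key, values in shuffle(pairs):
        out.extend(reduce_smush(key, values))
    return out

# Made-up identifiers, for illustration only.
mappings = [("ex:old/A", "ex:new/1")]
triples = [
    ("ex:old/A", "ex:links", "ex:old/B"),
    ("ex:old/C", "ex:links", "ex:old/A"),
]
result = smush_subject(mappings, triples)
```

Here only the subject field is rewritten; the object `ex:old/A` in the second triple is left alone, which is why a separate job for smushing objects is useful.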