Skip to main content.

SIREn (Semantic Information Retrieval Engine)

SIREn, the Information Retrieval system at the core of the Semantic Web Index Sindice, is now available for download and includes the full source under Apache License 2.0. SIREn is based on best practices and our own experience in solving large-scale semi-structured data search. Our goal is to bring the benefits of state-of-the-art techniques for semi-structured Information Retrieval into Lucene / Solr, and to provide a full-featured search engine for semi-structured data.
You can visit the project web site for more information.


Sindice is a lookup index for Semantic Web resources that enables humans or machines to find Semantic Web documents. Sindice allows to automatically locate documents containing information about a given resource through its URI, textual terms or structured queries such as attribute-value pairs.
You can read the journal article or the research publication to have a general description of the infrastructure.



LOD2 is a large-scale integrating project co-funded by the European Commission within the FP7 Information and Communication Technologies Work Programme (Grant Agreement No. 257943). Commencing in September 2010, this 4-year project comprises leading Linked Open Data technology researchers, companies, and service providers from across 7 European countries and is coordinated by the AKSW research group at the University of Leipzig.
You can find more information about the project on the LOD2 web site .



OKKAM is a Large-Scale Integrating Project funded by the European Commission under the 7th Framework Programme (FP7) until June 2010.
The OKKAM project aims at enabling the Web of Entities, namely a virtual space where any collection of data and information about any type of entities (e.g. people, locations, organisations, events, products, ...) published on the Web can be integrated into a single virtual, decentralised, open knowledge base (like the Web did for hypertexts).
You can find more information about the project on the Okkam web site .



ActiveRDF is a library for accessing RDF data from Ruby programs. It can be used as data layer in Ruby-on-Rails, similar to ActiveRecord (which provides an O/R mapping to relational databases).
You can read the ActiveRDF research publication to learn more about the mapping of RDF data to the Object-Oriented model and the advantages of a dynamic language such as Ruby or my internship report to find a more detailled description.
Official web site


BrowseRDF - Faceteer

BrowseRDF is a faceted browser that works on arbitrary RDF data. We analyse the data and apply heuristics to determine more and less interesting facets. We then generate a browser interface to navigate and query the data using those facets.
Faceteer is the public Ruby API that enables the automatic generation of a faceted interface for arbitrary RDF data.
You can read the research publication to learn more about how the facet theory can be extended for graph-based data or my internship report to find a more detailled description.