I am a French computer science engineer with a major in Sciences Cognitives et Informatique Avancée (Cognitive Science and Advanced Computer Science) since the summer 2006. My engineering school was EPITA and is located in Paris periphery.
In December 2010, I received a Ph.D. for my thesis on Information Retrieval techniques for Semantic Web data from DERI Galway at NUIG. The thesis were supervised by Giovanni Tummarello and examined by Jérôme Euzenat and Fabrizio Silvestri.
At present, I am a Postdoctoral researcher at DERI Galway in the DI2 research unit, where my main objective is to develop new methods for integrating, searching and exploring semi-structured data at web scale. I have recently been awarded a IRCSET EMPOWER 2012 Government of Ireland Postdoctoral Fellowship in Science, Engineering and Technology for my project on "Big Data Semantic Integration".
I have previously developed, with Eyal Oren, ActiveRDF, a library for accessing RDF data through programmatic objects and integrated with Ruby on Rails, and BrowseRDF, a faceted browser for arbitrary RDF data. You can find more details about these two projects in my internship report and the research publications.
I am a founder member of the Sindice project, a search engine for Semantic Web resources that enables humans or machines to find Semantic Web documents. I am actively contributing to the project, and I am in charge of the indexing, reasoning and querying aspects. I am leading the project SIREn, the Information Retrieval system at the core of Sindice, which is available for download and includes the full source under Apache License 2.0.
From 2008 to 2010, I was contributing to the DERI's part of the Okkam project, an EU-funded project focusing on enabling the Web of Entities. Since September 2010, I am co-responsible for DERI's part of the LOD2 project, an EU-funded project focusing on creating knowledge out of interlinked data.
Since January 2011, myself and Giovanni Tummarello have founded SindiceTech, a spin-out of NUI Galway. Based at the Digital Enterprise Research Institute in Galway, SindiceTech is commercialising a new data management middleware for knowledge intensive enterprises. This new data management solution is based on our expertise and on the novel technologies in large-scale semantic data management built during more than 5 years on the Sindice research project.^ TOP
- Research interests
- My research interests are focused on Semantic Web, Linked Data, Information Retrieval, Semantic Ranking, Big Data, Graph Analytics, Database, Compression, Data Structure, Logics, Context Awareness and Distributed systems.
- Technical interests
- My current technical interests are focussed on Apache Lucene, Solr, Hadoop, HBase, which are at the core of the Sindice infrastructure. During my free time, I am contributing to the Lucene project.
I am (or was) supervising:
- Nickolai Toupikov
- Distributed Ranking of Resources in a Large RDF Data Graph. June 2008 - June 2009. (equiv. to MSc. Thesis)
- Stephane Campinas
- Compression of Inverted Lists. February 2010 - December 2010. (equiv. to MSc. Thesis)
- Thomas E. Perry
- Data Graph Summarisation. May 2011 - December 2011. (MSc. Thesis)
- Pierre Bailly-Ferry
- RDF Graph Summary with application to Assisted SPARQL Formulation. February 2012 - October 2012. (equiv. to MSc. Thesis)
Digital Enterprise Research Institute
National University of Ireland