<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Automated Linking Data with Apache Stanbol (SemWeb.Pro) RSS Feed</title>
    <description></description>
    <link>https://cms.semweb.pro/talk/2479</link>
<item>
<guid isPermaLink="true">https://cms.semweb.pro/talk/2479</guid>
  <title>Automated Linking Data with Apache Stanbol</title>
  <link>https://cms.semweb.pro/talk/2479</link>
  <description>&lt;p&gt;This talk will introduce the Stanbol_ project and showcase how it can be integrated in traditional Enterprise Content Management solutions.&lt;/p&gt;
&lt;ul class=&quot;simple&quot;&gt;
&lt;li&gt;[Stanbol](&lt;a class=&quot;reference&quot; href=&quot;http://incubator.apache.org/stanbol&quot;&gt;http://incubator.apache.org/stanbol&lt;/a&gt;)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Stanbol is an Open Source project under incubation at the Apache Software Foundation. Its goal is to provide Web and CMS developers with a set of HTTP / RESTful services to help them integrate semantic technologies into their products and web sites.&lt;/p&gt;
&lt;p&gt;The following Stanbol services are currently under active developments:&lt;/p&gt;
&lt;ul class=&quot;simple&quot;&gt;
&lt;li&gt;Enhancement engines: use Natural Language Processing tools such as [Apache OpenNLP](&lt;a class=&quot;reference&quot; href=&quot;https://opennlp.apache.org/index.html&quot;&gt;https://opennlp.apache.org/index.html&lt;/a&gt;) to extract knowledge (topics, named entities, facts) from unstructured content and link it to unambiguous URIs from reference knowledge bases;&lt;/li&gt;
&lt;li&gt;Entity Hub: a Linked Data indexing cache built on top of [Apache Solr](&lt;a class=&quot;reference&quot; href=&quot;https://lucene.apache.org/solr/&quot;&gt;https://lucene.apache.org/solr/&lt;/a&gt;), [Clerezza](&lt;a class=&quot;reference&quot; href=&quot;https://incubator.apache.org/clerezza&quot;&gt;https://incubator.apache.org/clerezza&lt;/a&gt;) and [Jena](&lt;a class=&quot;reference&quot; href=&quot;https://incubator.apache.org/jena/&quot;&gt;https://incubator.apache.org/jena/&lt;/a&gt;) that comes with precomputed indexes and live connectors to popular knowledge bases such as [DBpedia](&lt;a class=&quot;reference&quot; href=&quot;http://dbpedia.org&quot;&gt;http://dbpedia.org&lt;/a&gt;), [Geonames ](&lt;a class=&quot;reference&quot; href=&quot;http://www.geonames.org/&quot;&gt;http://www.geonames.org/&lt;/a&gt;), [YAGO](&lt;a class=&quot;reference&quot; href=&quot;https://en.wikipedia.org/wiki/YAGO_%28ontology%29&quot;&gt;https://en.wikipedia.org/wiki/YAGO_%28ontology%29&lt;/a&gt;)...&lt;/li&gt;
&lt;li&gt;Content Hub: a faceted search engine based on Solr to search for content using the knowledge automatically extracted by the enhancement engines;&lt;/li&gt;
&lt;li&gt;CMS bridges to lift the structured content of document repositories using the JCR and [CMIS](&lt;a class=&quot;reference&quot; href=&quot;https://en.wikipedia.org/wiki&quot;&gt;https://en.wikipedia.org/wiki&lt;/a&gt;)&lt;/li&gt;
&lt;/ul&gt;
&lt;div class=&quot;system-message&quot;&gt;&lt;b&gt;ReST / HTML errors:&lt;/b&gt;System Message: WARNING/2 (&amp;amp;lt;string&amp;amp;gt; , line 16)&amp;lt;/p&amp;gt;
&lt;/div&gt;Bullet list ends without a blank line; unexpected unindent.&lt;p&gt;Content_Management_Interoperability_Services access protocols (using [Apache Chemistry](&lt;a class=&quot;reference&quot; href=&quot;https://chemistry.apache.org&quot;&gt;https://chemistry.apache.org&lt;/a&gt;)) and store the result into a triple store suitable for [SPARQL](&lt;a class=&quot;reference&quot; href=&quot;https://en.wikipedia.org/wiki/SPARQL&quot;&gt;https://en.wikipedia.org/wiki/SPARQL&lt;/a&gt;) access;&lt;/p&gt;
&lt;ul class=&quot;simple&quot;&gt;
&lt;li&gt;Rules engine based on [Apache Jena](&lt;a class=&quot;reference&quot; href=&quot;https://incubator.apache.org/jena/&quot;&gt;https://incubator.apache.org/jena/&lt;/a&gt;) for knowledge refactoring (e.g. convert extracted knowledge into the rich snippet vocabulary for SEO), integrity checks, merging rules, deductive inference...&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Automatically extracting and post-processing structured knowledge from semi-structured content it a key step towards better interoperability of the user intents and building smarter applications. [Apache Stanbol](&lt;a class=&quot;reference&quot; href=&quot;http://incubator.apache.org/stanbol&quot;&gt;http://incubator.apache.org/stanbol&lt;/a&gt;) aims to make it as easy as possible to achieve that goal.&lt;/p&gt;
&lt;div class=&quot;system-messages section&quot;&gt;
&lt;h3&gt;&lt;a&gt;Docutils System Messages&lt;/a&gt;&lt;/h3&gt;
&lt;div class=&quot;system-message&quot;&gt;&lt;b&gt;ReST / HTML errors:&lt;/b&gt;System Message: ERROR/3 (&amp;amp;lt;string&amp;amp;gt; , line 1); &amp;lt;em&amp;gt;backlink&amp;lt;/em&amp;gt;&amp;lt;/p&amp;gt;
&lt;/div&gt;Unknown target name: &quot;stanbol&quot;.&lt;/div&gt;</description>
  <dc:date>2024-05-03T14:05+00:00</dc:date>
  <dc:creator>Olivier Grisel</dc:creator>
</item>
  </channel>
</rss>