[iks-community] A Content and Knowledge Reference Infrastructure for IKS

Rupert Westenthaler rupert.westenthaler at salzburgresearch.at
Wed Apr 28 09:17:51 CEST 2010


Dear IKS team,

As you might have already noticed from the Twitter message John sent
yesterday evening, I published a proposal for a "Content and Knowledge
Reference Infrastructure" on the Wiki (see
http://wiki.iks-project.eu/index.php/Content-knowledge-reference-infrastructure).

To give a short impression of what such an infrastructure aims to provide:
    - an infrastructure for identification and management of entities within
the CMS (e.g. define and manage "Paris" as an entity within the local CMS
and provide a local entity identifier, called a Symbol)
    - support for referring to, and importing content and knowledge about,
entities from public data sets (e.g. referring to Paris at wikipedia and
Paris at geonames.org, and importing content and knowledge into the local
representation of "Paris")
    - support for organizations to establish and manage linked sites (e.g.
adding Wikipedia as a linked site would enable mappings to Paris at wikipedia)
    - services to search for entities known by the local CMS (e.g. a
journalist tagging a news story about "Paris" would find the corresponding
local symbol for "Paris")
    - services to suggest entities defined by linked sites (e.g. a journalist
tagging a news story about "Teotihuacan" would get the suggestion to use
"Teotihuacan at Wikipedia")
    - services to easily create new symbols (e.g. (1) a journalist tagging a
news story with "Teotihuacan at Wikipedia" would create a new symbol (local
entity identifier); (2) an NLP engine detects the named entity
"Teotihuacan" within a content item and creates a new symbol)
    - life cycle support for symbols and mappings (e.g. symbols created by
the NLP engine are initially in the "proposed" state and need to be
confirmed, e.g. by a journalist accepting them as a tag for the content)
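
To make the list above a bit more concrete, here is a minimal Python sketch
of how symbols, mappings to linked sites, search and the "proposed" life
cycle state could fit together. All names in it (Symbol, SymbolState,
SymbolRegistry, the "symbol:N" identifier format and the "active" state) are
assumptions made for illustration only; they are not taken from the Wiki
page or from the Symbol Service specification.

```python
from dataclasses import dataclass, field
from enum import Enum


class SymbolState(Enum):
    PROPOSED = "proposed"  # e.g. created by an NLP engine, not yet confirmed
    ACTIVE = "active"      # hypothetical name for a confirmed symbol


@dataclass
class Symbol:
    symbol_id: str  # local entity identifier
    label: str
    state: SymbolState = SymbolState.ACTIVE
    # linked site name -> reference to the entity on that site
    mappings: dict = field(default_factory=dict)


class SymbolRegistry:
    """Hypothetical in-memory store for the symbols known to the local CMS."""

    def __init__(self):
        self._symbols = {}
        self._next_id = 1

    def create(self, label, state=SymbolState.ACTIVE):
        # Assign a new local identifier and register the symbol.
        symbol = Symbol(f"symbol:{self._next_id}", label, state)
        self._next_id += 1
        self._symbols[symbol.symbol_id] = symbol
        return symbol

    def add_mapping(self, symbol, site, reference):
        # Record that this local symbol corresponds to an entity on a linked site.
        symbol.mappings[site] = reference

    def search(self, text):
        # Case-insensitive substring match on the labels of local symbols.
        needle = text.lower()
        return [s for s in self._symbols.values() if needle in s.label.lower()]

    def confirm(self, symbol):
        # Move a proposed symbol to the confirmed state.
        symbol.state = SymbolState.ACTIVE


registry = SymbolRegistry()

# A symbol managed locally and mapped to a linked site.
paris = registry.create("Paris")
registry.add_mapping(paris, "wikipedia", "Paris at wikipedia")

# A symbol proposed by an NLP engine, later confirmed by a journalist.
teotihuacan = registry.create("Teotihuacan", state=SymbolState.PROPOSED)
state_before_confirmation = teotihuacan.state
registry.confirm(teotihuacan)
```

In this sketch a journalist searching for "paris" would find the local
symbol, while the NLP-created symbol stays "proposed" until somebody
confirms it. A real service would of course need persistence and a proper
identifier scheme.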

For more information please see the Wiki page.

In addition, I published a specification of a "Symbol Service" that aims
to cover some of the functionality suggested for the Content and Knowledge
Reference Infrastructure. This specification can be viewed at
https://docs.google.com/Doc?docid=0AXkrJWeeMbEfZGd0OHI1anNfMTJnNW53bTdnNw&hl=en

Finally I would like to ask some questions:

From industry I would like to get feedback about the "big picture".
    - Are the assumptions about public data sets and the challenges when
using them correct?
    - Are important aspects missing?

I am quite excited about the work that the BBC is doing in this area. The
methodology they use to build web pages also looks appealing to me!
    - Do you agree with that?
    - Are there weaknesses that you can spot?
    - Any other good examples around? Examples of adding more data to linked
data are not so interesting, but usage scenarios of publicly available
content/knowledge for content management and (web) publishing would be
great!

Feedback and suggestions regarding the "Content and Knowledge Reference
Infrastructure" and the specification for the Symbol Service are also very
welcome. User stories and usage scenarios would be especially helpful.

If someone would like to add notes to the specification of the Symbol
Service, please send me a short message with your email address and
I will grant you editing privileges for the Google Docs document.

Hoping for a fruitful discussion on this topic,
Best,
Rupert Westenthaler



|--
| Rupert Westenthaler             rupert.westenthaler at salzburgresearch.at
| Salzburg Research Forschungsgesellschaft http://www.salzburgresearch.at
| Knowledge Based Information Systems                    +43 662 2288 413
| Jakob-Haringer Strasse 5/II                          Skype-Name: westei
| A-5020 Salzburg

