[iks-community] IKS workshop "Semantic Search - Fact and Fiction"

Stéphane Croisier scroisier at jahia.com
Fri Sep 4 14:19:17 CEST 2009


Hi John,

At 13:12 04.09.2009, you wrote:
>3. Who are the people doing great things in the 
>area of search within the CMS world that you would like to see at the workshop?
>4. Who would be on your wish list from industry 
>or academia beyond the CMS world to talk about 
>"Semantic Search" at the workshop. Maybe 
>personalities from Google, W3C, Bing, Yahoo, Lucene, Wolfram Alpha?

Regarding point 3 and 4, I would propose to you 
SalsaDev (http://www.salsadev.com/). This is a 
bit biaised as I am one of the Board Member of 
this company (http://www.salsadev.com/team). This 
is a new Swiss start-up which is developing some 
interesting things around the notion of vectors, 
LSA and other similar algorithms in order to 
automatically extract the meaning of any piece of 
information. We were speaking a lot during the 
last event about RDF; OWL; Linked Data & co but 
it would also be interesting to investigate a bit 
more the second facet of the semantic world 
driven by the automatic extraction of concepts 
and the representation of a vector-based 
knowledge map of the information based no more on 
keywords, associations and metadata but on the 
underlying abstracted senses and distances between vectors.

We had an interesting dinner yesterday with the 
SalsaDev folks and Bertrand Delacrétaz. They have 
a nice demo based on a Firefox plug-in which 
could automatically understand the different 
meaning of the web page your are currently 
browsing and can automatically suggest to you 
some related articles (they currently indexed 
wikipedia as a demo example but this could 
perfectly rather be related to articles stored in 
your private CMS repositories instead).

In the same perspective you could also perhaps 
try to invite Recommind 
(http://www.recommind.com/) which is also 
leveraging the PLSA algorithm 
(http://www.recommind.com/technology/core) and 
recently launched their new Categorization 
MindServer in order to automatically categorize 
doucments 
(http://www.cmswatch.com/Trends/1665-Recommind-Auto-categorization). 
Automony and Reuters OpenCalais look like also interesting people to invite.

I could also suggest Zemanta: 
http://www.zemanta.com/  which was not present 
last time and is located in Slovenia.

As everybody looke like interested by a semantic 
Lucene, it would be interesting to invite the 
newly created Lucid Imagination company which 
regroups some of the key committers of the Apache 
Lucene project (6 mio USD raised from the CIA end 
of last 
spring:http://www.cmswire.com/cms/enterprise-20/cia-invests-in-open-source-lucene-solr-search-004830.php). 
There is the ApacheCon US 2009 
(http://us.apachecon.com/c/acus2009/) the week 
before in California but perhaps they will be 
interested to come or to send someone. However I 
do not know if they really have any semantic 
initiatives for the moment. But perhaps it could 
be interesting to have them on board and help 
make some contacts between them and other 
Lucene-based projects. Another interesting Apache 
project currently under incubation is UIMA 
(<http://incubator.apache.org/uima/>http://incubator.apache.org/uima). 
I personally do not know any of the current 
committers 
(http://incubator.apache.org/uima/team-list.html) 
but it would be interesting to also get them on board.

In France, one of our customers (Manpower: 
http://www.cio-online.com/actualites/lire-manpower-procede-a-l-analyse-semantique-des-cv-non-structures-2353.html) 
integrated Lingway. They perhaps also have some 
interesting demo and use cases to show to us: 
http://www.lingway.com/index.php?lang=en

Some thoughts....
Stéphane

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.iks-project.eu/pipermail/iks-community/attachments/20090904/2c6dcd4c/attachment.htm>


More information about the iks-community mailing list