[iks-community] [Fwd: [Dbworld] Source-code/demo Releases on Fuzzy String Queries from UC Irvine]
Wernher Behrendt
wernher.behrendt at salzburgresearch.at
Wed Mar 31 09:50:16 CEST 2010
Dear all,
Some of you might find this relevant to CMS.
Note that they use an "academic" BSD license with restrictions for
commercial use - see their web site for details.
Best regards,
Wernher
-------- Original Message --------
Subject: [Dbworld] Source-code/demo Releases on Fuzzy String Queries
from UC Irvine
Date: Tue, 30 Mar 2010 14:20:44 -0500
From: Chen Li <chenli at ics.uci.edu>
Reply-To: dbworld_owner at yahoo.com
To: dbworld at cs.wisc.edu
We are very glad to announce three releases on fuzzy string matching.
1. Flamingo Source-Code Package (version 3.0) on approximate string
matching
http://flamingo.ics.uci.edu/releases/3.0/
URLs of earlier DBWorld messages:
Version 1.0:
http://www.cs.wisc.edu/dbworld/messages/2007-04/1176855447.html
Version 2.0:
http://www.cs.wisc.edu/dbworld/messages/2008-10/1224008939.html
Main changes in this version:
* Added Compressed Indexers based on the Techniques from:
"Space-Constrained Gram-Based Indexing for Efficient Approximate
String Search", by Alexander Behm, Shengyue Ji, Chen Li, and
Jiaheng Lu, in ICDE 2009
* Added Module for Top-K Approximate String Search from: "Efficient
top-k algorithms for fuzzy search in string collections", by Rares
Vernica, Chen Li, in KEYS 2009: 9-14. (Workshop on Keyword Search
on Structured Data, collocated with SIGMOD 2009)
* Added Disk-Based Inverted Index, Disk-Based StringContainer and
Efficient Search Algorithms using the Disk-Based Components from:
"Answering Set-Similarity Selection Queries on Large Disk-Resident
Data Sets", by Alexander Behm, Chen Li, Michael J. Carey, UCI
Technical Report 2010
* Added Some Auto-Tuning Features, e.g. Automatic Choice of
Partitioning Filter
Main contributors in this new release:
Alexander Behm, Rares Vernica, Shengyue Ji, and Chen Li,
2. Source code for Parallel Set-Similarity Joins Using MapReduce
http://asterix.ics.uci.edu/fuzzyjoin-mapreduce/
Its techniques are described in the SIGMOD 2010 paper titled:
"Efficient Parallel Set-Similarity Joins Using MapReduce", by Rares
Vernica, Michael J. Carey, Chen Li.
3. Demos on Fuzzy Keyword Search on Spatial Data (Maps)
http://flamingo.ics.uci.edu/localsearch/fuzzysearch/
Its techniques are described in the DASFAA 2010 demo paper titled
"Fuzzy Keyword Search on Spatial Data", by Sattam Alsubaiee and
Chen Li.
Chen Li
UC Irvine
_______________________________________________
Please do not post msgs that are not relevant to the database community
at large. Go to www.cs.wisc.edu/dbworld for guidelines and posting forms.
To unsubscribe, go to https://lists.cs.wisc.edu/mailman/listinfo/dbworld
--
Wernher Behrendt
Salzburg Research Forschungsgesellschaft
Knowledge Based Information Systems
Jakob-Haringer Strasse 5/II
5020 Salzburg
Austria
email wernher.behrendt at salzburgresearch.at
phone +43 (0)662 2288 409
fax +43 (0)662 2288 222
http://www.salzburgresearch.at
More information about the iks-community
mailing list