home > community > projects > duke - fast deduplication engine

close subject identifiers for duke - fast deduplication engine
  • http://www.topicmapslab.de/projects/duke
165987104_721b792428_m_1_

duke - fast deduplication engine

Project category: Utilities and Components
Project status: Alpha

http://code.google.com/p/duke/

Duke is a fast and flexible deduplication (or entity resolution, or record linkage) engine written in Java on top of Lucene. At the moment (2011-04-07) it can process 1,000,000 records in 11 minutes on a standard laptop in a single thread.

Project Leader

Lars Marius Garshol

No contact information available. 

Photoserv_1_

Lars Marius is project leader of TM Photo, Topic Maps Tools, and duke - fast deduplication.. .

 

Follow us on Twitter

maiana

Resellers can give their customers the opportunity to apply Web 3.0 technology to their web sites with topics maps-based searching, which is more suitable for single sites than Google-style navigation.

Kal2_bw
Kal Ahmed
TMCore
practical-semantics.com
Topic Maps Lab auf der Cebit 2011
Partners

Graduate from the Topic Maps Lab

onotoa