home > community > projects > duke - fast deduplication engine

close subject identifiers for duke - fast deduplication engine
  • http://www.topicmapslab.de/projects/duke
165987104_721b792428_m_1_

duke - fast deduplication engine

Project category: Utilities and Components
Project status: Alpha

http://code.google.com/p/duke/

Duke is a fast and flexible deduplication (or entity resolution, or record linkage) engine written in Java on top of Lucene. At the moment (2011-04-07) it can process 1,000,000 records in 11 minutes on a standard laptop in a single thread.

Project Leader

Lars Marius Garshol

No contact information available. 

Photoserv_1_

Lars Marius is project leader of TM Photo, Topic Maps Tools, and duke - fast deduplication.. .

 

Follow us on Twitter

maiana

Topic Maps provides a proven means for data integration scaling to the
web, as well as a core technology for our highly flexible applications
with largely autogenerated frontend structures.

Stekeivevk_1_
Stefan Kesberg
topicWorks Navigator
practical-semantics.com
Topic Maps Lab auf der Cebit 2011
Partners

Graduate from the Topic Maps Lab

onotoa