home > community > projects > duke - fast deduplication engine

close subject identifiers for duke - fast deduplication engine
  • http://www.topicmapslab.de/projects/duke
165987104_721b792428_m_1_

duke - fast deduplication engine

Project category: Utilities and Components
Project status: Alpha

http://code.google.com/p/duke/

Duke is a fast and flexible deduplication (or entity resolution, or record linkage) engine written in Java on top of Lucene. At the moment (2011-04-07) it can process 1,000,000 records in 11 minutes on a standard laptop in a single thread.

Project Leader

Lars Marius Garshol

No contact information available. 

Photoserv_1_

Lars Marius is project leader of TM Photo, Topic Maps Tools, and duke - fast deduplication.. .

 

Follow us on Twitter

maiana

Topic Maps is the only formal semantic model which is optimized for humans, not for computers. Applications and web portals based on Topic Maps are easy to use, without limitations for flexibility and creativity.

Benjamin-medium
Benjamin Bock
Ruby Topic Maps
practical-semantics.com
Topic Maps Lab auf der Cebit 2011
Partners

Graduate from the Topic Maps Lab

onotoa