duke - fast deduplication engine

Project category: Utilities and Components
Project status: Alpha

http://code.google.com/p/duke/

Duke is a fast and flexible deduplication engine (also known as entity resolution or record linkage) written in Java on top of Lucene. As of 2011-04-07 it can process 1,000,000 records in 11 minutes (about 1,500 records per second) on a standard laptop in a single thread.

Project Leader

Lars Marius Garshol

No contact information available. 

Lars Marius is the project leader of TM Photo, Topic Maps Tools, and duke - fast deduplication...

 
