home > news > corpora browser of the wortschatz project now based on topic maps

close subject identifiers for Corpora browser of the Wortschatz project now based on Topic Maps
  • http://www.topicmapslab.de/documents/Wortschatz_and_Topic_Maps
Wortschatz

Corpora browser of the Wortschatz project now based on Topic Maps

Published by Benjamin Bock and Lutz Maicher on 2009-11-24.

Abstract:

With the Wortschatz project the NLP research group at the University of Leipzig – which the Topic Maps Lab is affiliated with – provides one of the most important language resources in Germany. The new Wortschatz browser for statistical analyses is now based in Topic Maps.

The Wortschatz project is one of the most important language resources in Germany. It is provided by the NLP research group at the University of Leipzig, which is also the host of the Topic Maps Lab.

A few days ago a new browser for corpora and language statistics was launched by the Wortschatz group. Within this portal users can compare a bunch of statistical parameters of currently 27 different languages and of different corpora. These are parameters like “most frequent word beginning n-grams”, “distribution of sentence length in words or characters”, or “Language fingerprints”.

And the best news is: the browser as it is depicted in the figure is 100% based on Topic Maps. The data is collected from several thousand files and merged using JRTM with tinyTIM as backend. This project fairly shows, that integrating Topic Maps in a lightweight way into third party projects might be a appropriate path to a more Topic Mappish world. And with RTM the Topic Maps Lab provides a good “glue” facility for such purposes.

Authors of this document are

Benjamin Bock

http://twitter.com/bnjmnbck 

Benjamin-medium

Benjamin is project leader of Ruby Topic Maps and rtm-tmql. He is involved in Topic Maps Lab Community.. , TMQL4J, and yacca.me.

Lutz Maicher

http://www.lutzmaicher.de/ 

Foto_lutz

Lutz is project leader of Musica Migrans and Topic Maps Lab Community.. . He is involved in yacca.me and Maiana.

Subject Matter

Ruby Topic Maps

is a Topic Maps Engine.

Waffel

RTM is a Topic Maps Engine written in Ruby.

Visit homepage of Ruby Topic Maps

 
maiana

Topic Maps offers an information architecture for semantic portals with
highly networked content and access paths in support of the associative
human mind. It is our technology of choice for knowledge oriented
application systems.

Gerweivev1k_1_
Gerhard Weber
topicWorks Domains

Topic Maps

Academy

 

next course:

Grundlagen von Topic-Maps-Portalen

Start: Monday September 13 2010 17:00