StephanOepen edited this page Nov 24, 2012 · 20 revisions

Background

The Wikipedia Corpus Builder (WCB) is a toolkit for extracting relevant linguistic content from Wikipedia. It was used to create the 2012 versions of WeScience and WikiWoods as part of the MSc thesis of Lars Jørgen Solberg at the Department of Informatics, University of Oslo.

Installation

Running on the English Wikipedia

Adaptations to Other Languages

Construction of WeScience 2.0
