
LapDevelopment_Annotations

StephanOepen edited this page Jan 20, 2015 · 15 revisions

General Background

This page documents the structure and form of annotations in LAP, i.e. the adaptation of the Linguistic Annotation Framework (LAF) to LAP and its serialization in JSON (in the LAP Store).

Types of Annotations

Database Terminology

The MongoDB storage distinguishes three types of entities that nest hierarchically: databases, collections, and documents. As of January 2015, all LAP annotations are stored in the same database (called lapstore), but we envision moving to a set-up where each Galaxy user has their own MongoDB database (which will make it easier to track ownership of annotation records).
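
The nesting of databases, collections, and documents can be pictured with plain Python dictionaries and lists. The following is only a minimal sketch for illustration and does not talk to MongoDB: the database name lapstore comes from the text above, while the collection name example_collection, the document fields, and the helper find_documents are hypothetical and not part of LAP.

```python
import json

# Hypothetical layout: only the database name "lapstore" is taken from this page.
# The collection name and the document fields are placeholders for illustration.
store = {
    "lapstore": {                     # database
        "example_collection": [       # collection (hypothetical name)
            {"_id": 1, "text": "The cat chased the dog."},  # document
            {"_id": 2, "text": "Fido barked."},             # document
        ]
    }
}


def find_documents(store, database, collection):
    """Return the documents of one collection, or an empty list if the path is missing."""
    return store.get(database, {}).get(collection, [])


documents = find_documents(store, "lapstore", "example_collection")
print(json.dumps(documents, indent=2))
```

In this picture, giving each Galaxy user their own database would correspond to one top-level key per user instead of the single lapstore key.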

Running Example

The following are the annotations recorded in the LAP Store when processing the default English Dependency Parsing workflow, which in January 2015 comprised the tokenizer, REPP, HunPos, and MaltParser. We assume the following toy input file:

  The cat chased the dog.
  Fido barked.