This repository was archived by the owner on Apr 16, 2025. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 195
Information Extraction in Karma
Pedro Szekely edited this page Feb 10, 2014
·
10 revisions
This page presents the design to support information extraction in Karma. Suppose the user loads a worksheet that contains a column with text data, such as the biographies of artists.
To extract entities, the user invokes the Information Extraction command:
This command creates a JSON document containing the input data for the Information Extraction service.
The JSON document consists of an array of objects.
Each object has a rowHash
attribute, a Karma generated hash id for the worksheet row containing the text and a text
attribute, which contains the text where we want to run extraction:
[
{
rowHash: "5f0266c4c326",
text: "... Berninghaus attended the Saint Louis School of Fine Arts at night. ..."
}
,
{
rowHash: "c326b9a1ef9e",
text: "Paris was where the 20th century was. ... attributed to Gertrude Stein prove apocryphal ..."
}
,
{
rowHash: "1ef9e39cb78c3",
text: "The daughter of a furniture manufacturer, Anni Albers (Fleischmann) was born in Berlin. ..."
}
]