Skip to content

SemiRfc

MichaelGoodman edited this page Apr 20, 2016 · 12 revisions

SEM-I

A SEM-I, or SEMantic-Interface is a description of the semantic structures output by the grammar, and may include a number of sections:

  • Hierarchical relations among variable sorts and properties
  • A list of valid roles and allowed types for their values
  • A predicate inventory including the allowed roles for the predicate and further constraints on role values and properties
  • (aliases?)
  • A list of variable properties and their allowed values
  • (ges?)
  • (types?)

Additionally, a variable property mapping (VPM) is a component considered part of the SEM-I.

SEM-Is can be useful for validating the semantic output of grammars without having to load the entire grammar.

.smi file syntax

The .smi files (e.g. erg.smi, core.smi) are pseudo-TDL files. Here's a descriptive example:

   1 ; comments begin with semicolons
   2 
   3 signature:  ; sections begin at column 0 and are followed by a colon
   4   ; definitions are indented by 2 spaces
   5   u := top.  ; definition syntax depends on the section
   6   ;...
   7 
   8 properties:  ; the next section begins at column 0 again
   9   DIV [x] : bool
  10   ;...
  11 
  12 include: core.smi  ; external files can be included
  13 ; sections in included files are merged with sections in the main file

Proposal based on discussion at the Abbey on 2013-08-28

Woodley argues that the SEM-I should encode the hierarchy of predicates. This will allow a tool to calculate MRS subsumption with only the SEM-I and the MRSs (i.e. without having to parse the grammar). E.g. currently we have:

def_q_rel : ARG0 x, BODY h, RSTR h.

And we'd like something like:

def_q_rel (def_udef_a_q_rel) : ARG0 x, BODY h, RSTR h.

There is also talk of linking preds that differ by sense (e.g. number of arguments, like "he ate" vs "he ate a banana"), or mass/count distinctions ("every paper" vs "all the paper"). This is not trying to recreate something like WordNet.

Clone this wiki locally