You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: section/result.tex
+1-1Lines changed: 1 addition & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -29,7 +29,7 @@ \section{Preliminary Results}
29
29
30
30
An open-source implementation of the \href{https://github.com/constraintAutomaton/query-shape-detection}{algorithm} and an
31
31
\href{https://github.com/constraintAutomaton/comunica-feature-link-traversal/tree/feature/shapeIndex}{integration} in the query engine
32
-
Comunica \cite{taelman_iswc_resources_comunica_2018} is available online~\sepfootnote{fn:impl}.
32
+
Comunica \cite{taelman_iswc_resources_comunica_2018} is available online.~\sepfootnote{fn:impl}
33
33
We use the \href{https://github.com/SolidBench/SolidBench.js}{benchmark Solidbench} \cite{Taelman2023} to compare our approach with the current state-of-the-art (the \href{https://solid.github.io/type-indexes/}{type index} and the \href{https://www.w3.org/TR/ldp/}{LDP specification}~\sepfootnote{fn:ldp} as structural assumptions)~\cite{Taelman2023}.
34
34
We used the supported subset of SolidBench queries, skipping the currently unimplemented \href{https://www.w3.org/TR/sparql11-query/#propertypaths}{SPARQL property paths}~\sepfootnote{fn:propertyPath} and unions.
35
35
We executed each query 50 times with a timeout of 1 minute (6,000 ms).
Copy file name to clipboardExpand all lines: section/shape_index.tex
+5-5Lines changed: 5 additions & 5 deletions
Original file line number
Diff line number
Diff line change
@@ -31,11 +31,11 @@ \section{Shape Index and Query-Shape Containment}
31
31
32
32
\subsection{Shape Index}
33
33
34
-
Our method introduces the concept of a \emph{shape index} to reduce query execution time by minimizing unnecessary dereferencing of RDF documents within web subdomains (sets of URLs or URL patterns)~\sepfootnote{fn:domain}.
34
+
Our method introduces the concept of a \emph{shape index} to reduce query execution time by minimizing unnecessary dereferencing of RDF documents within web subdomains (sets of URLs or URL patterns).~\sepfootnote{fn:domain}
35
35
We define a shape index as a set of mappings between RDF data shapes and sets of resources.
36
36
This mapping concept shares similarities with shape mapping~\cite{Gayo2018} and target declarations~\cite{Gayo2018Shacl}.
37
37
However, instead of mapping shapes to RDF subgraphs, the shape index maps shapes to sets of documents.
38
-
The shape index also shares commonalities with \href{https://shapetrees.org/}{shape trees}~\sepfootnote{fn:shapetrees}, however, it is designed to be a simpler formulation focused on query optimization.
38
+
The shape index also shares commonalities with \href{https://shapetrees.org/}{shape trees},~\sepfootnote{fn:shapetrees} however, it is designed to be a simpler formulation focused on query optimization.
39
39
The shape index has a range of applications defined in a domain and a flag indicating if the index is \emph{complete}.
40
40
A shape index is complete when every resource in the domain is associated with a shape within the shape index.
41
41
In a shape index when a shape is mapped to a set of RDF resources then the shape \emph{must} validate those resources.
In a similar case with an \emph{incomplete} shape index, the query engine can only use the shape index for data discovery.
65
65
This case is similar to the usage of the type index but with a more reaching ability to match a query with the index because shapes in their definition describe the properties (RDF predicates) of the entities whereas the type index only provides the classes IRIs.
66
66
It is possible to dereference the class IRIs to get information about the properties (if available), however, it is not the current practice \cite{Taelman2023}.
67
-
A comparison of the RDF data shapes and RDF class approach due to their potential similarities is delegated to future works~\sepfootnote{fn:litShapeComparaison}.
67
+
A comparison of the RDF data shapes and RDF class approach due to their potential similarities is delegated to future works.~\sepfootnote{fn:litShapeComparaison}
68
68
69
69
\subsection{A Concrete Example}
70
70
@@ -76,7 +76,7 @@ \subsection{A Concrete Example}
76
76
When traversing the network the query engine cannot know the content of the documents encountered, therefore, the engine \emph{must} deference every reachable document as defined by a reachability criterion.
77
77
The presence of a shape index can change the state of affairs.
78
78
If the engine encounters a domain containing exclusively book data as indicated by a complete shape index, the engine can skip the documents of the domain.
79
-
Conversely, if a domain has comment and movie review data declared by a complete shape index, the query engine can safely limit its dereferencing operations to the set of documents related to comments without affecting results completeness.
79
+
If a domain has comment and movie review data declared by a complete shape index, the query engine can safely limit its dereferencing operations to the set of documents related to comments without affecting results completeness.
80
80
The engine can restrict its dereferencing operations because at least one star pattern is contained in the comment shape and none in the movie review and book shapes.
81
81
If the engine encounters a domain regardless of the completeness of the index, declaring comment, forum, and individual (moderators are individuals/people) data, among others, then the documents associated with the non-query-relevant part of the domain can be ignored with the same containment logic presented earlier.
82
82
Thus, we can consider that the traversal proceeds domain by domain ignoring documents known a priori to not content query-relevant data.
0 commit comments