OfficeGraph is a real-world dataset of measurements from 444 IoT devices taken over 11 months. The devices are made up of 17 different sensor models which measure different properties. The data was collected in a Dutch 7-story office building and consists of about 90 million RDF triples. See also the paper for more details.
The elements in the dataset are ordered from oldest to newest by the measurement time (saref:hasTimestamp predicate).
Additional data, such as metadata about the devices taking measurements and the rooms in the building, as well as a copy of this dataset, can be found on Zenodo.
This README is a snapshot of documentation for the latest development version of the dataset. Full documentation for all versions can be found on the website.
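As a rough illustration of working with the flat distributions listed below, the following sketch streams a gzip-compressed N-Triples file line by line without loading it fully into memory. The function name `iter_ntriples_lines` and the sample triples are hypothetical, and the sketch only treats each non-empty, non-comment line as one statement; a real RDF parser would be needed to interpret the terms.

```python
import gzip
import io


def iter_ntriples_lines(fileobj):
    """Yield each N-Triples statement (as a raw string) from a gzip stream."""
    with gzip.open(fileobj, mode="rt", encoding="utf-8") as lines:
        for line in lines:
            line = line.strip()
            # N-Triples puts one statement per line; skip blanks and comments.
            if line and not line.startswith("#"):
                yield line


# Hypothetical sample data standing in for a downloaded file
# such as flat_10K.nt.gz.
sample = (
    '<http://example.org/m1> <http://example.org/p> "1" .\n'
    "# a comment line\n"
    "\n"
    '<http://example.org/m2> <http://example.org/p> "2" .\n'
)
compressed = io.BytesIO(gzip.compress(sample.encode("utf-8")))
statements = list(iter_ntriples_lines(compressed))
print(len(statements))  # prints 2
```

To apply this to a downloaded file, open it in binary mode (for example `open("flat_10K.nt.gz", "rb")`) and pass the file object to the function.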
- Title: OfficeGraph (en)
- Identifier: officegraph
- Version: dev
- Theme:
- Data collection (eurovoc:6030)
- Internet of Things (eurovoc:c_b12a760a)
- Office space (eurovoc:c_4b5a18f8)
- Creator:
- Adam Skaskiewicz (1)
- Name: Adam Skaskiewicz
- Nickname: adamskas
- Comment: Author of benchmark dataset (en)
- Roderick van der Weerdt (2)
- Name: Roderick van der Weerdt
- Comment: Co-author of original dataset (en)
- Victor de Boer (3)
- Name: Victor de Boer
- Comment: Co-author of original dataset (en)
- Ronald Siebes (4)
- Name: Ronald Siebes
- Comment: Co-author of original dataset (en)
- Ronnie Groenewold (5)
- Name: Ronnie Groenewold
- Comment: Co-author of original dataset (en)
- Frank van Harmelen (6)
- Name: Frank van Harmelen
- Comment: Co-author of original dataset (en)
- License: https://spdx.org/licenses/CC-BY-4.0
- Source:
- van der Weerdt, R., de Boer, V., Siebes, R., Groenewold, R., & van Harmelen, F. (2024). OfficeGraph: A Knowledge Graph of Office Building IoT Measurements. The Semantic Web, 94–109. https://doi.org/10.1007/978-3-031-60635-9_6
- https://github.com/RoderickvanderWeerdt/OfficeGraph/tree/main
- Date Issued: 2025-01-18
- Date Modified: 2025-04-06
- Landing page: officegraph (dev)
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to one measurement from one sensor in the office building. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has stream element count: 14,930,478
- Has stream element split:
- Type:
- Stream elements split by time (rb:TimeStreamElementSplit)
- Stream elements split by topic (rb:TopicStreamElementSplit)
- Comment: Each stream element corresponds to one measurement from one sensor in the office building. (en)
- Has temporal property: http://www.w3.org/ns/sosa/resultTime
- Has subject shape:
- Comment: Target instances of class saref:Measurement. (en)
- Target class: https://saref.etsi.org/core/Measurement
- Type:
- Uses vocabulary:
- Conforms to W3C RDF 1.1 specification: yes
- Conforms to W3C RDF-star draft specification as of December 17, 2021: yes
- Uses generalized triples: no
- Uses generalized RDF datasets: no
- Uses RDF-star: no
- Temporal resolution: PT1M
- Title: Full flat distribution
- Identifier: flat-full
- Has file name: flat_full.nt.gz
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Full distribution (rb:fullDistribution)
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has stream element count: 14,930,478
- Byte size: 506.2 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/officegraph/dev/files/flat_full.nt.gz
- Title: Full stream distribution
- Identifier: stream-full
- Has file name: stream_full.tar.gz
- Has distribution type:
- Full distribution (rb:fullDistribution)
- Stream distribution (rb:streamDistribution)
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to one measurement from one sensor in the office building. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has stream element count: 14,930,478
- Byte size: 441.2 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/officegraph/dev/files/stream_full.tar.gz
- Title: Full Jelly distribution
- Identifier: jelly-full
- Has file name: jelly_full.jelly.gz
- Has distribution type:
- Full distribution (rb:fullDistribution)
- Jelly distribution (rb:jellyDistribution)
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to one measurement from one sensor in the office building. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has stream element count: 14,930,478
- Byte size: 338.3 MB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/officegraph/dev/files/jelly_full.jelly.gz
- Title: 10M elements flat distribution
- Identifier: flat-10m
- Has file name: flat_10M.nt.gz
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has stream element count: 10,000,000
- Byte size: 335.7 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/officegraph/dev/files/flat_10M.nt.gz
- Title: 10M elements stream distribution
- Identifier: stream-10m
- Has file name: stream_10M.tar.gz
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Stream distribution (rb:streamDistribution)
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to one measurement from one sensor in the office building. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has stream element count: 10,000,000
- Byte size: 293.0 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/officegraph/dev/files/stream_10M.tar.gz
- Title: 10M elements Jelly distribution
- Identifier: jelly-10m
- Has file name: jelly_10M.jelly.gz
- Has distribution type:
- Jelly distribution (rb:jellyDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to one measurement from one sensor in the office building. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has stream element count: 10,000,000
- Byte size: 222.4 MB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/officegraph/dev/files/jelly_10M.jelly.gz
- Title: 1M elements flat distribution
- Identifier: flat-1m
- Has file name: flat_1M.nt.gz
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has stream element count: 1,000,000
- Byte size: 31.4 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/officegraph/dev/files/flat_1M.nt.gz
- Title: 1M elements stream distribution
- Identifier: stream-1m
- Has file name: stream_1M.tar.gz
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Stream distribution (rb:streamDistribution)
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to one measurement from one sensor in the office building. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has stream element count: 1,000,000
- Byte size: 27.8 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/officegraph/dev/files/stream_1M.tar.gz
- Title: 1M elements Jelly distribution
- Identifier: jelly-1m
- Has file name: jelly_1M.jelly.gz
- Has distribution type:
- Jelly distribution (rb:jellyDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to one measurement from one sensor in the office building. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has stream element count: 1,000,000
- Byte size: 20.6 MB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/officegraph/dev/files/jelly_1M.jelly.gz
- Title: 100K elements flat distribution
- Identifier: flat-100k
- Has file name: flat_100K.nt.gz
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has stream element count: 100,000
- Byte size: 2.8 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/officegraph/dev/files/flat_100K.nt.gz
- Title: 100K elements stream distribution
- Identifier: stream-100k
- Has file name: stream_100K.tar.gz
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Stream distribution (rb:streamDistribution)
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to one measurement from one sensor in the office building. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has stream element count: 100,000
- Byte size: 2.5 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/officegraph/dev/files/stream_100K.tar.gz
- Title: 100K elements Jelly distribution
- Identifier: jelly-100k
- Has file name: jelly_100K.jelly.gz
- Has distribution type:
- Jelly distribution (rb:jellyDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to one measurement from one sensor in the office building. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has stream element count: 100,000
- Byte size: 1.8 MB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/officegraph/dev/files/jelly_100K.jelly.gz
- Title: 10K elements flat distribution
- Identifier: flat-10k
- Has file name: flat_10K.nt.gz
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has stream element count: 10,000
- Byte size: 315.9 KB
- Media type: application/n-triples
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/officegraph/dev/files/flat_10K.nt.gz
- Title: 10K elements stream distribution
- Identifier: stream-10k
- Has file name: stream_10K.tar.gz
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Stream distribution (rb:streamDistribution)
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to one measurement from one sensor in the office building. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has stream element count: 10,000
- Byte size: 289.7 KB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/officegraph/dev/files/stream_10K.tar.gz
- Title: 10K elements Jelly distribution
- Identifier: jelly-10k
- Has file name: jelly_10K.jelly.gz
- Has distribution type:
- Jelly distribution (rb:jellyDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to one measurement from one sensor in the office building. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has stream element count: 10,000
- Byte size: 180.6 KB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/officegraph/dev/files/jelly_10K.jelly.gz
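The stream distributions above are gzip-compressed tar archives of Turtle data. As a minimal sketch of reading one sequentially, the example below iterates over the regular files in a `.tar.gz` archive and yields their text. It assumes that each regular file in the archive holds one stream element; the member names (`0.ttl`, `1.ttl`), the sample content, and the helper names `iter_stream_elements` and `build_sample_archive` are hypothetical and not taken from the dataset.

```python
import io
import tarfile


def iter_stream_elements(fileobj):
    """Yield (member_name, text) for every regular file in a .tar.gz archive."""
    # Mode "r|gz" reads the archive as a forward-only stream, so the
    # whole archive never has to be extracted or held in memory.
    with tarfile.open(fileobj=fileobj, mode="r|gz") as archive:
        for member in archive:
            if not member.isfile():
                continue  # skip directories and other non-file entries
            extracted = archive.extractfile(member)
            yield member.name, extracted.read().decode("utf-8")


def build_sample_archive():
    """Build a tiny in-memory archive with hypothetical names and content."""
    buffer = io.BytesIO()
    with tarfile.open(fileobj=buffer, mode="w:gz") as archive:
        samples = [("0.ttl", "<urn:a> <urn:p> 1 ."), ("1.ttl", "<urn:b> <urn:p> 2 .")]
        for name, body in samples:
            data = body.encode("utf-8")
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            archive.addfile(info, io.BytesIO(data))
    buffer.seek(0)
    return buffer


elements = list(iter_stream_elements(build_sample_archive()))
print([name for name, _ in elements])  # prints ['0.ttl', '1.ttl']
```

For a downloaded archive, the same function can be given a file opened in binary mode, for example `open("stream_10K.tar.gz", "rb")`. Each yielded text would then be passed to a Turtle parser of your choice.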