docs/manual/usage.rst: 14 additions & 6 deletions
@@ -154,20 +154,20 @@ To enable auto-indexing, run with ``wayback -a`` or ``wayback -a --auto-interval
154
154
Creating a Web Archive
155
155
----------------------
156
156
157
-
Using Webrecorder
158
-
^^^^^^^^^^^^^^^^^
157
+
Using ArchiveWeb.page
158
+
^^^^^^^^^^^^^^^^^^^^^
159
159
160
-
If you do not have a web archive to test, one easy way to create one is to use `Webrecorder<https://webrecorder.io>`_
160
+
If you do not have a web archive to test, one easy way to create one is to use the `ArchiveWeb.page <https://archiveweb.page>`_ browser extension for Chrome and other Chromium-based browsers such as Brave Browser. ArchiveWeb.page records pages visited during an archiving session in the browser, and provides a means of both replaying and downloading the archived items created.
161
161
162
-
After recording, you can click **Stop** and then click `Download Collection` to receive a WARC (`.warc.gz`) file.
162
+
Follow the instructions in `How To Create Web Archives with ArchiveWeb.page <https://archiveweb.page/en/usage/>`_. After recording, press **Stop** and then `download your collection <https://archiveweb.page/en/download/>`_ to receive a WARC (``.warc.gz``) file. If you choose to download your collection in the WACZ format, the WARC files can be found inside the zipped WACZ in the ``archive/`` directory.
163
163
164
-
You can then use this with work with pywb.
164
+
You can then use your WARCs to work with pywb.
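If you downloaded a WACZ rather than a WARC, the WARC files can be pulled out of its ``archive/`` directory with any zip tool. The following is a minimal sketch using only the Python standard library; the function name ``extract_warcs`` and the example file names are illustrative assumptions, not part of pywb or ArchiveWeb.page.

```python
import shutil
import zipfile
from pathlib import Path


def extract_warcs(wacz_path, dest_dir):
    """Copy every WARC found under archive/ in a WACZ into dest_dir.

    Hypothetical helper for illustration; returns the extracted paths.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    extracted = []
    # A WACZ is a zip file, so the standard zipfile module can read it.
    with zipfile.ZipFile(wacz_path) as wacz:
        for name in wacz.namelist():
            # Only consider entries stored in the archive/ directory.
            if not name.startswith("archive/") or name.endswith("/"):
                continue
            # Keep plain and gzip-compressed WARC files, skip anything else.
            if not name.endswith((".warc", ".warc.gz")):
                continue
            target = dest / Path(name).name
            with wacz.open(name) as src, open(target, "wb") as out:
                shutil.copyfileobj(src, out)
            extracted.append(target)
    return extracted
```

For example, ``extract_warcs("my-collection.wacz", "warcs")`` would write the WARC files into a local ``warcs/`` directory (both names are placeholders). Each extracted file can then be added to a collection, for instance with ``wb-manager add my-web-archive <path/to/my_warc.warc.gz>``.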
165
165
166
166
167
167
Using pywb Recorder
168
168
^^^^^^^^^^^^^^^^^^^
169
169
170
-
The core recording functionality in Webrecorder is also part of :mod:`pywb`. If you want to create a WARC locally, this can be
170
+
Recording functionality is also part of :mod:`pywb`. If you want to create a WARC locally, this can be
171
171
done by directly recording into your pywb collection:
172
172
173
173
1. Create a collection: ``wb-manager init my-web-archive`` (if you haven't already created a web archive collection)
@@ -180,6 +180,14 @@ In this configuration, the indexing happens every 10 seconds.. After 10 seconds,
For a more automated browser-based web archiving experience, `Browsertrix <https://browsertrix.com/>`_ provides a web interface for configuring, scheduling, running, reviewing, and curating crawls of web content. Crawl activity is shown in a live screencast of the browsers used for crawling and all web archives created in Browsertrix can be easily downloaded from the application in the WACZ format.
187
+
188
+
`Browsertrix Crawler <https://crawler.docs.browsertrix.com/>`_, which provides the underlying crawling functionality of Browsertrix, can also be run standalone in a Docker container on your local computer.