(scripts|docs) Update scripts and documentation for the new operator's GUI and file storage workflows.

2025-10-05 21:22:39 +02:00 · 2023-08-01 22:47:37 +02:00
parent 8de3e6ab80
commit ba724bc1b2
4 changed files with 86 additions and 146 deletions
--- a/run/readme.md
+++ b/run/readme.md
@@ -51,7 +51,31 @@ $ docker-compose up

 6. Download Sample Data

-TODO: How?
+A script is available for downloading sample data. The script will download the
+data from https://downloads.marginalia.nu/ and extract it to the correct location.
+
+The system will pick the data up automatically.
+
+```shell
+$ run/download-samples l
+```
+
+Four sets are available:
+
+| Name | Description                     |
+|------|---------------------------------|
+| s    | Small set, 1,000 domains        |
+| m    | Medium set, 2,000 domains       |
+| l    | Large set, 5,000 domains        |
+| xl   | Extra large set, 50,000 domains |
+
+Warning: The XL set is intended to provide a large amount of data for
+setting up a pre-production environment. It may be hard to run on a smaller
+machine. It is barely runnable on a 32 GB machine, and total processing time
+is around 5 hours.
+
+The 'l' set is a good compromise between size and processing time and should
+work on most machines.

 ## Experiment Runner