1
1
mirror of https://github.com/MarginaliaSearch/MarginaliaSearch.git synced 2025-10-05 21:22:39 +02:00

(scripts|docs) Update scripts and documentations for the new operator's gui and file storage workflows.

This commit is contained in:
Viktor Lofgren
2023-08-01 22:47:37 +02:00
parent 8de3e6ab80
commit ba724bc1b2
4 changed files with 86 additions and 146 deletions

View File

@@ -51,7 +51,31 @@ $ docker-compose up
6. Download Sample Data
TODO: How?
A script is available for downloading sample data. The script will download the
data from https://downloads.marginalia.nu/ and extract it to the correct location.
The system will pick the data up automatically.
```shell
$ run/download-samples l
```
Four sets are available:
| Name | Description |
|------|---------------------------------|
| s | Small set, 1000 domains |
| m | Medium set, 2000 domains |
| l | Large set, 5000 domains |
| xl | Extra large set, 50,000 domains |
Warning: The XL set is intended to provide a large amount of data for
setting up a pre-production environment. It may be hard to run on a smaller
machine. It's barely runnable on a 32GB machine; and total processing time
is around 5 hours.
The 'l' set is a good compromise between size and processing time and should
work on most machines.
## Experiment Runner