You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: oonipipeline/Readme.md
+9-3Lines changed: 9 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,7 +8,7 @@ For historical context, these are the major revisions:
8
8
-`v1` - OONI Pipeline based on custom CLI scripts using mongodb as a backend. Used until ~2015.
9
9
-`v2` - OONI Pipeline based on [luigi](https://luigi.readthedocs.io/en/stable/). Used until ~2017.
10
10
-`v3` - OONI Pipeline based on [airflow](https://airflow.apache.org/). Used until ~2020.
11
-
-`v4` - OONI Pipeline basedon custom script and systemd units (aka fastpath). Currently in use in production.
11
+
-`v4` - OONI Pipeline based on custom script and systemd units (aka fastpath). Currently in use in production.
12
12
-`v5` - Next generation OONI Pipeline. What this readme is relevant to. Expected to become in production by Q4 2024.
13
13
14
14
## Setup
@@ -41,13 +41,19 @@ clickhouse server
41
41
42
42
Workflows are started by first scheduling them and then triggering a backfill operation on them. When they are scheduled they will also run on a daily basis.
43
43
44
+
44
45
```
45
-
hatch run oonipipeline schedule --probe-cc US --test-name signal --create-tables
46
+
hatch run oonipipeline schedule --probe-cc US --test-name signal
46
47
```
47
48
48
49
You can then trigger the backfill operation like so:
49
50
```
50
-
hatch run oonipipeline backfill --probe-cc US --test-name signal --workflow-name observations --start-at 2024-01-01 --end-at 2024-02-01
51
+
hatch run oonipipeline backfill --create-tables --probe-cc US --test-name signal --workflow-name observations --start-at 2024-01-01 --end-at 2024-02-01
52
+
```
53
+
54
+
If you need to re-create the database tables (because the schema has changed), you want to add the `--drop-tables` flag to the invocation:
55
+
```
56
+
hatch run oonipipeline backfill --create-tables --drop-tables --probe-cc US --test-name signal --workflow-name observations --start-at 2024-01-01 --end-at 2024-02-01
51
57
```
52
58
53
59
You will then need some workers to actually perform the task you backfilled, these can be started like so:
0 commit comments