@@ -19,16 +19,18 @@ Feature: Pipeline tests using the movies dataset
     Then the movies entity is stored as a parquet after the file_transformation phase
     And the latest audit record for the submission is marked with processing status data_contract
     When I run the data contract phase
-    Then there are 3 record rejections from the data_contract phase
+    Then there is 1 submission rejection from the data_contract phase
+    And there are 3 record rejections from the data_contract phase
     And there are errors with the following details and associated error_count from the data_contract phase
-      | Entity | ErrorCode | ErrorMessage | RecordIndex | error_count |
-      | movies | BLANKYEAR | year not provided | 2 | 1 |
-      | movies_rename_test | DODGYYEAR | year value (NOT_A_NUMBER) is invalid | 1 | 1 |
-      | movies | DODGYDATE | date_joined value is not valid: daft_date | 1 | 1 |
+      | Entity | ErrorCode | ErrorMessage | RecordIndex | error_count |
+      | movies | BLANKYEAR | year not provided | 2 | 1 |
+      | movies_rename_test | DODGYYEAR | year value (NOT_A_NUMBER) is invalid | 1 | 1 |
+      | movies | DODGYDATE | date_joined value is not valid: daft_date | 1 | 1 |
+      | movies | BLANKTITLE | title should not be blank | 4 | 1 |
     And the movies entity is stored as a parquet after the data_contract phase
     And the latest audit record for the submission is marked with processing status business_rules
     When I run the business rules phase
-    Then The rules restrict "movies" to 2 qualifying records
+    Then The rules restrict "movies" to 3 qualifying records
     And there are errors with the following details and associated error_count from the business_rules phase
       | ErrorCode | ErrorMessage | RecordIndex | error_count |
       | LIMITED_RATINGS | Movie has too few ratings ([6.5]) | 4 | 1 |
@@ -37,10 +39,11 @@ Feature: Pipeline tests using the movies dataset
     When I run the error report phase
     Then An error report is produced
     And The statistics entry for the submission shows the following information
-      | parameter | value |
-      | record_count | 5 |
-      | number_record_rejections | 4 |
-      | number_warnings | 1 |
+      | parameter | value |
+      | record_count | 5 |
+      | number_submission_rejections | 1 |
+      | number_record_rejections | 3 |
+      | number_warnings | 2 |
     And the error aggregates are persisted

   Scenario: Validate and filter movies (duckdb)
@@ -55,16 +58,18 @@ Feature: Pipeline tests using the movies dataset
     Then the movies entity is stored as a parquet after the file_transformation phase
     And the latest audit record for the submission is marked with processing status data_contract
     When I run the data contract phase
-    Then there are 3 record rejections from the data_contract phase
+    Then there is 1 submission rejection from the data_contract phase
+    And there are 3 record rejections from the data_contract phase
     And there are errors with the following details and associated error_count from the data_contract phase
       | Entity | ErrorCode | ErrorMessage | RecordIndex | error_count |
       | movies | BLANKYEAR | year not provided | 2 | 1 |
       | movies_rename_test | DODGYYEAR | year value (NOT_A_NUMBER) is invalid | 1 | 1 |
       | movies | DODGYDATE | date_joined value is not valid: daft_date | 1 | 1 |
+      | movies | BLANKTITLE | title should not be blank | 4 | 1 |
     And the movies entity is stored as a parquet after the data_contract phase
     And the latest audit record for the submission is marked with processing status business_rules
     When I run the business rules phase
-    Then The rules restrict "movies" to 2 qualifying records
+    Then The rules restrict "movies" to 3 qualifying records
     And there are errors with the following details and associated error_count from the business_rules phase
       | ErrorCode | ErrorMessage | RecordIndex | error_count |
       | LIMITED_RATINGS | Movie has too few ratings ([6.5]) | 4 | 1 |
@@ -73,9 +78,10 @@ Feature: Pipeline tests using the movies dataset
     When I run the error report phase
     Then An error report is produced
     And The statistics entry for the submission shows the following information
-      | parameter | value |
-      | record_count | 5 |
-      | number_record_rejections | 4 |
-      | number_warnings | 1 |
+      | parameter | value |
+      | record_count | 5 |
+      | number_submission_rejections | 1 |
+      | number_record_rejections | 3 |
+      | number_warnings | 2 |
     And the error aggregates are persisted
