Skip to content

Conversation

@VenkatSNarayanan
Copy link
Contributor

Adds support for custom partitioning patterns to MSCK repair table.

What changes were proposed in this pull request?
Adds support for custom partitioning patterns to MSCK repair table.

Why are the changes needed?
HCatStorer supports custom partitioning patterns when using dynamic partitioning, but Hive itself does not support this. This change adds support for non-pathological cases to Hive.

Does this PR introduce any user-facing change?
MSCK repair table with a configured custom partition pattern would previously ignore that pattern and error on finding nonstandard paths. With the code from this PR, it will respect the defined custom pattern when it extracts partition key values from the paths.

Is the change a dependency upgrade?
No

How was this patch tested?
A test was added to TestHiveMetastoreChecker to test the common kinds of custom patterns supported.

Adds support for custom partitioning patterns to MSCK repair table.

Adds support for custom partitioning patterns to MSCK
repair table.
@sonarqubecloud
Copy link

@VenkatSNarayanan
Copy link
Contributor Author

@deniskuzZ Addressed all the review comments from the old PR. The hcat.dynamic.partitioning.custom.pattern setting needs to be used in hive-standalone-metastore-server in one place, so I moved the string with the name of the setting there(hcatalog-core and hive-exec depend on that module). I've rebased it on current master and rerun the MSCK unit tests, which all pass.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants