[GLUTEN-11550][UT] Enable Python-related tests and disable flaky JobTagging suite for Spark 4.x #11908

Draft
baibaichen wants to merge 3 commits into apache:main from baibaichen:fix/11550-python

@baibaichen (Contributor) commented Apr 10, 2026

What changes were proposed in this pull request?

Enable Python-related test suites and fix issues for Spark 4.1. Also disable a flaky suite.

ArrowEvalPythonExecSuite — Remove testWithMaxSparkVersion("4.0") restriction on 3 arrow_udf tests. The underlying issues have been resolved.

GlutenRowQueueSuite — Enable (spark41-only, 7/7 passed).

GlutenPythonUDFSuite — Enable, with a testGluten override for SPARK-48706. The original test builds its ExpectedContext pattern from this.getClass.getSimpleName, which returns GlutenPythonUDFSuite when the test runs in the Gluten suite, while the actual callSite records PythonUDFSuite. The testGluten override fixes the pattern.
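
The class-name mismatch can be reproduced in isolation. The sketch below is only an illustration: `PythonUDFSuiteLike`, `GlutenPythonUDFSuiteLike`, `callSitePattern`, and the recorded call-site string are hypothetical stand-ins, and the exact shape of the pattern in the Spark test is an assumption.

```scala
// Hypothetical stand-ins; the real suites live in Spark and Gluten.
class PythonUDFSuiteLike {
  // Assumed shape of the pattern: derived from the runtime class name.
  def callSitePattern: String = s"${this.getClass.getSimpleName}.*"
}

// Inherits the test logic, but getClass now reports the subclass.
class GlutenPythonUDFSuiteLike extends PythonUDFSuiteLike

object CallSitePatternDemo {
  // Assumed call-site text: it names the file where the test body was written.
  val recordedCallSite = "PythonUDFSuiteLike.scala"

  def matchesRecorded(suite: PythonUDFSuiteLike): Boolean =
    recordedCallSite.matches(suite.callSitePattern)

  def main(args: Array[String]): Unit = {
    println(matchesRecorded(new PythonUDFSuiteLike))
    println(matchesRecorded(new GlutenPythonUDFSuiteLike))
  }
}
```

The pattern built in the subclass starts with the subclass name, so it no longer matches a call site recorded against the parent's source. An override that hard-codes the parent name avoids the dependency on `getClass`.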

GlutenPythonDataSourceSuite — Enable. All 23 tests cancel via assume(shouldTestPythonUDFs) when Python is not available (no CI failure).

GlutenPythonUDTFSuite — Fix to properly extend PythonUDTFSuite (was an empty class). Enable. Tests cancel without Python.
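
Why an empty class runs nothing can be shown with a toy model. `MiniSuite` below is a hypothetical miniature of a test framework, not ScalaTest, and the test names are made up for illustration.

```scala
import scala.collection.mutable.ListBuffer

// Hypothetical miniature of a test framework; not ScalaTest itself.
abstract class MiniSuite {
  private val registered = ListBuffer.empty[String]

  // Tests register themselves while the class body (constructor) runs.
  protected def test(name: String)(body: => Unit): Unit = {
    registered += name
    ()
  }

  def testNames: List[String] = registered.toList
}

// Stand-in for the upstream suite, with made-up test names.
class UdtfSuiteLike extends MiniSuite {
  test("simple UDTF") {}
  test("UDTF with lateral join") {}
}

// Before the fix: no parent suite, so nothing is registered.
class EmptyGlutenSuite extends MiniSuite

// After the fix: the parent's constructor registers its tests.
class FixedGlutenSuite extends UdtfSuiteLike

object InheritedTestsDemo {
  def main(args: Array[String]): Unit = {
    println((new EmptyGlutenSuite).testNames.size)
    println((new FixedGlutenSuite).testNames.size)
  }
}
```

In this model the empty class passes trivially because it has no tests, which is why enabling it only became meaningful once it extended the parent suite.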

GlutenSparkSessionJobTaggingAndCancellationSuite — Disable (flaky). The testGluten for "Tags set from session are prefixed with session UUID" suffers from InheritableThreadLocal + ForkJoinPool timing issues that cause an intermittent NoSuchElementException: head of empty list. This PR reverts that testGluten and disables the entire suite until the root cause is fully resolved.
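
One general way InheritableThreadLocal and a thread pool can interact badly is sketched below. This is not the confirmed root cause of the flakiness, only an illustration of the mechanism: `tag` and the session names are hypothetical, and the sketch uses a dedicated single-worker ForkJoinPool rather than whatever pool the suite actually runs on.

```scala
import java.util.concurrent.{Callable, ForkJoinPool}

object InheritableTagDemo {
  // Hypothetical stand-in for a per-session tag; not Spark's actual field.
  val tag = new InheritableThreadLocal[String]

  // Reads the tag from inside a pool worker thread.
  private def readOnWorker(pool: ForkJoinPool): Option[String] =
    pool.submit(new Callable[Option[String]] {
      override def call(): Option[String] = Option(tag.get())
    }).get()

  def run(): (Option[String], Option[String]) = {
    val pool = new ForkJoinPool(1) // one worker, so it is reused
    try {
      tag.set("session-A")
      // The worker thread is created for this task and copies the
      // submitter's inheritable value at creation time.
      val first = readOnWorker(pool)

      tag.set("session-B")
      // The existing worker is reused; it keeps the copy it was created with.
      val second = readOnWorker(pool)
      (first, second)
    } finally {
      pool.shutdown()
    }
  }

  def main(args: Array[String]): Unit = println(run())
}
```

An inheritable value is copied once, when the child thread is created, so what a pooled task sees depends on when its worker was started rather than on when the task was submitted. A lookup that filters by the expected tag can then come back empty, and calling `head` on an empty Scala `List` throws a NoSuchElementException.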

How was this patch tested?

  • GlutenRowQueueSuite verified on spark41 (7/7 passed)
  • GlutenPythonUDFSuite verified on spark41 (7 passed, 1 excluded with testGluten replacement, 2 canceled)
  • Other suites cancel gracefully without Python environment

Related issue: #11550

@github-actions github-actions bot added CORE works for Gluten Core VELOX labels Apr 10, 2026
@baibaichen baibaichen changed the title [GLUTEN-11550][UT] Enable Python-related tests for Spark 4.1 [GLUTEN-11550][UT] Enable Python-related tests and disable flaky JobTagging suite for Spark 4.x Apr 10, 2026
@github-actions github-actions bot added the INFRA label Apr 10, 2026
@baibaichen baibaichen force-pushed the fix/11550-python branch 3 times, most recently from e64b6b0 to 5773249 on April 11, 2026 03:36
baibaichen and others added 2 commits April 11, 2026 06:16
- ArrowEvalPythonExecSuite: remove testWithMaxSparkVersion("4.0") on 3 arrow_udf tests
- GlutenRowQueueSuite: enable (spark41-only, 7/7 passed)
- GlutenPythonUDFSuite: enable with testGluten for SPARK-48706 (class name pattern fix)
- GlutenPythonDataSourceSuite: enable (all tests cancel without Python, won't fail CI)

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
The slow jobs were missing Python/pyspark installation, causing
GlutenPythonUDTFSuite to fail with RuntimeException during suite loading.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
…shdown

Gluten replaces FilterExec with FilterExecTransformer and BatchScanExec with
BatchScanExecTransformer. Add testGluten matching the Gluten operator names.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>