[SPARK-55890] Check arrow memory at end of tests#54689
garlandz-db wants to merge 55 commits into apache:master from
Conversation
@garlandz-db can you file a JIRA ticket?

done
val attributes = attrs.map(exp => AttributeReference(exp.name, exp.dataType)())
val buffer = ArrowConverters
val iter = ArrowConverters
without this fix
SparkConnectProtoSuite:
[info] org.apache.spark.sql.connect.planner.SparkConnectProtoSuite *** ABORTED *** (8 seconds, 710 milliseconds)
[info] 16896 did not equal 0 Arrow rootAllocator memory leak: 16896 bytes still allocated
(ArrowAllocatorLeakCheck.scala:33)
[info] org.scalatest.exceptions.TestFailedException:
[info] at org.scalatest.Assertions.newAssertionFailedException(Assertions.scala:472)
[info] at org.scalatest.Assertions.newAssertionFailedException$(Assertions.scala:471)
[info] at org.scalatest.Assertions$.newAssertionFailedException(Assertions.scala:1231)
[info] at org.scalatest.Assertio
fileWriter.close()
fileWriterClosed = true
}
fileWriter.close()
this is not guaranteed if there's an exception midway
private var fileWriterClosed = false
override def close(): Unit = {
fileWriter.close()
this can lead to two close() calls if it was already closed midway through write() but we call .close() again; the fix is to add a flag
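The flag-based guard suggested here can be sketched as follows. This is a simplified illustration, not the actual SparkArrowFileWriter code; `GuardedWriter` and `underlying` are invented names, and the sketch uses an AtomicBoolean (as a later commit in this PR does) rather than a plain var so concurrent calls are also safe.

```scala
import java.util.concurrent.atomic.AtomicBoolean

// Hypothetical simplified writer; `GuardedWriter` and `underlying` are
// illustrative names, not the real SparkArrowFileWriter API.
final class GuardedWriter(underlying: java.io.Closeable) extends java.io.Closeable {
  private val closed = new AtomicBoolean(false)

  // Safe to call from both write()'s failure path and an explicit close():
  // compareAndSet ensures only the first caller reaches the resource.
  override def close(): Unit = {
    if (closed.compareAndSet(false, true)) {
      underlying.close()
    }
  }
}
```

With this guard, the double-close path (close inside write() on error, then close() again from the caller) touches the underlying resource exactly once.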
try {
writer.write(rdd.toLocalIterator)
} finally {
writer.close()
finished writing but missing a close
val batches = reader.read().toArray // materialize before reader.close() in finally
ArrowConverters.toDataFrame(batches.iterator, schema, spark, "UTC", true, false)
} finally {
reader.close()
finished reading but missing a close
ArrowConverters.toDataFrame(reader.read(), schema, spark, "UTC", true, false)
try {
val schema = ArrowUtils.fromArrowSchema(reader.schema)
ArrowConverters.toDataFrame(reader.read(), schema, spark, "UTC", true, false)
this materializes, so it's not lazy; we can close after this line finishes
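The materialize-before-close contract under discussion can be sketched generically. `open` here is a hypothetical stand-in for constructing the real ArrowFileReader; the point is only that the lazy iterator is forced while the resource is still open, so the close in `finally` cannot invalidate unread elements.

```scala
// Sketch of the materialize-before-close pattern: force the lazy iterator
// with .toVector while the resource is still open, then close in finally.
// `open` is a hypothetical stand-in for constructing the real reader.
def readAll[T](open: () => (Iterator[T], java.io.Closeable)): Vector[T] = {
  val (iter, resource) = open()
  try {
    iter.toVector // every element is read before the finally fires
  } finally {
    resource.close()
  }
}
```

This makes the ownership explicit at the call site instead of relying on the callee happening to consume the iterator eagerly.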
@ExtendedSQLTest
class ColumnarBatchSuite extends SparkFunSuite {
class ColumnarBatchSuite extends SparkFunSuite with ArrowAllocatorLeakCheck {
Does this use ArrowVectors?
there's a test that does
if (code == ArrowLeakExitCode) {
// Arrow leak detected in server JVM. halt() is the only way to propagate
// failure from inside a JVM shutdown hook.
Runtime.getRuntime.halt(code)
Is it clear enough that the test failed due to a memory leak when this happens? Or should we add a log line?
Also, using halt seems harsh. Is there a way to bubble this up more nicely to the test? We should be able to wait in afterAll() and throw an exception if a leak occurred.
you mean set a timer in afterAll() waiting for the service to close?
- SimpleSparkConnectService checks ArrowUtils.rootAllocator.getAllocatedMemory after shutdown and exits with code 77 (ArrowLeakExitCode) if non-zero
- RemoteSparkSession.stop() propagates exit code 77 via Runtime.getRuntime.halt() (the only way to fail CI from inside a JVM shutdown hook)
- ArrowLeakDetectionE2ETest verifies end-to-end detection using the SPARK_TEST_ARROW_LEAK env var to inject a synthetic unreleased allocation in the server process

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
- ArrowFileReadWrite.save/load: wrap writer/reader in try-finally so allocators are always released (fixes 3072-byte cross-suite leaks)
- ArrowFileReadWrite.write: remove redundant fileWriter.close() call (close() already closes it; the double-close was a no-op but confusing)
- ArrowAllocatorLeakCheck.afterAll: run super.afterAll() first so SparkSession/SparkConnect resources are released before the leak assertion (fixes false-positive 131072-byte failures in SessionEventsManagerSuite and related suites)

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…Batches

converter(rows.iterator) returns a CloseableIterator backed by ArrowBatchWithSchemaIterator, which holds a child allocator of ArrowUtils.rootAllocator. On the driver-side LocalTableScanExec path there is no TaskContext, so the task-completion listener that normally auto-closes the iterator never fires. Wrap in try-finally so the allocator is always released.

This fixes the 131072-byte Arrow leak seen in SparkConnectServiceE2ESuite and PipelineEventStreamSuite when ArrowAllocatorLeakCheck is mixed in.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…tProtoSuite Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…aladoc

- Add ArrowFileReadWriteSuite with the ArrowAllocatorLeakCheck mixin so the suite that directly exercises ArrowFileReadWrite.save/load also asserts no Arrow memory leaks after its own tests complete.
- Expand the ArrowAllocatorLeakCheck scaladoc with a mixin-order warning and correct/incorrect usage examples, since wrong ordering causes false-positive leak failures.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
The LocalTableScanExec branch in processAsArrowBatches previously used converter(rows.iterator), which returns a mapped iterator (not AutoCloseable). When sendBatch threw (e.g., on client disconnect), the underlying ArrowBatchWithSchemaIterator was never closed, leaking 131072 bytes into ArrowUtils.rootAllocator and contaminating subsequent test suites.

Fix: bypass the converter wrapper in the LocalTableScanExec branch and directly create an ArrowBatchWithSchemaIterator, closing it in a finally block. Arrow's close() is idempotent (isClosed guard), so a double-close when the iterator exhausts itself is safe.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
In SparkConnectServerTest.clearAllExecutions(), after calling close() on each ExecuteHolder, wait for the execution thread to terminate before proceeding.

Root cause: the LocalTableScanExec branch in processAsArrowBatches calls eventsManager.postFinished() before serializing Arrow batches. Tests that wait for ExecuteStatus.Finished (e.g. the SPARK-45133 local relation test) exit before the execution thread finishes processing. clearAllExecutions() interrupted the thread but did not wait for it to stop, leaving an open ArrowBatchWithSchemaIterator (131072 bytes) in ArrowUtils.rootAllocator. The ArrowAllocatorLeakCheck in afterAll() then detected the residual allocation.

Fix: capture the holder list before close(), then eventuallyWithTimeout on isExecuteThreadRunnerAlive() so the thread's finally block runs batches.close() before the leak check.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…ch loop
When holder.close() interrupts the execution thread, the LocalTableScanExec
branch of processAsArrowBatches would continue processing all rows (1M+ in
the SPARK-45133 test) because the tight loop had no interrupt check. This
left the ArrowBatchWithSchemaIterator open for seconds after clearAllExecutions()
returned, causing ArrowAllocatorLeakCheck to see 131072 bytes still allocated.
Fix: check Thread.currentThread().isInterrupted() at each loop iteration and
throw InterruptedException immediately, ensuring the finally { batches.close() }
runs before the execution thread terminates.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
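The per-iteration interrupt check this commit describes can be sketched as a generic loop helper. `copyUntilInterrupted` is an illustrative name, not the actual processAsArrowBatches code; the point is that a CPU-bound loop polls the interrupt flag so the enclosing finally block gets to run promptly.

```scala
// Illustrative helper (not the real processAsArrowBatches loop): check the
// interrupt flag on every iteration so a CPU-bound copy bails out promptly
// and the caller's finally block can close the Arrow iterator.
def copyUntilInterrupted[T](rows: Iterator[T])(sink: T => Unit): Unit = {
  while (rows.hasNext) {
    if (Thread.currentThread().isInterrupted) {
      throw new InterruptedException("execution thread interrupted mid-batch")
    }
    sink(rows.next())
  }
}
```

Throwing InterruptedException (rather than silently returning) preserves the interruption semantics for callers that distinguish cancellation from normal completion.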
… thread interrupt

Move batch.close() into try-finally in ArrowBatchIterator.next() and ArrowBatchWithSchemaIterator.next() so retained Arrow buffers are always released even when MessageSerializer.serialize() or serializeBatch() throws ClosedByInterruptException (triggered when the NIO WritableByteChannelImpl channel is interrupted mid-write by clearAllExecutions()).

Also make ArrowBatchIterator.close() idempotent and thread-safe using AtomicBoolean.compareAndSet, with try-finally to ensure allocator.close() runs even if root.close() throws.

Co-authored-by: Isaac
reader.read() returns a lazy Iterator backed by the open ArrowFileReader. The try/finally was safe today only because ArrowConverters.toDataFrame happens to call .toArray internally, but that is an undocumented implementation detail. Eagerly calling .toArray at the call site makes the close-after-read contract explicit and independent of toDataFrame's internals. Co-authored-by: Isaac
No test fails without this change. toDataFrame already calls .toArray internally, so the lazy iterator is fully consumed before finally fires. The fix was defensive-only with no observable correctness benefit today. Co-authored-by: Isaac
reader.read() returns a lazy Iterator over the open file channel. Passing it directly to toDataFrame works today because toDataFrame calls .toArray internally, but that is an undocumented implementation detail. Eagerly materializing at the call site makes the ownership explicit: all bytes are read before reader.close() fires in finally, regardless of toDataFrame internals. Co-authored-by: Isaac
If loader.load(batch) or fileWriter.writeBatch() throws, batch was not closed, leaking off-heap memory from the child allocator. Wrap in try/finally to guarantee batch.close() runs even on exception. Co-authored-by: Isaac
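The try/finally shape this commit describes can be sketched generically. `batch`, `load`, and `write` are stand-ins for the real Arrow batch, loader.load(batch), and fileWriter.writeBatch(); this is a sketch of the pattern, not the actual code.

```scala
// Sketch of the fix: ensure batch.close() runs even when load/write throws.
// `batch`, `load`, and `write` stand in for the real Arrow objects and calls.
def writeOneBatch(batch: java.io.Closeable)(load: () => Unit, write: () => Unit): Unit = {
  try {
    load()  // may throw (e.g. loader.load(batch))
    write() // may throw (e.g. fileWriter.writeBatch())
  } finally {
    batch.close() // buffers released on both the success and failure paths
  }
}
```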
Without this, a second call to close() would still invoke fileWriter.close() because the flag was only set in write()'s finally block, not in close() itself. Co-authored-by: Isaac
Co-authored-by: Isaac
When SparkConnectServerUtils.stop() is called directly (e.g., from @afterall in tests), bubble the Arrow leak detection as a RuntimeException so the test framework can report it properly. Keep halt() only for the shutdown hook path where exit() would deadlock. Co-authored-by: Isaac
- SparkArrowFileWriter: replace the fileWriterClosed var with an AtomicBoolean so concurrent close() calls are safe; use nested try/finally in close() so root/allocator are always released even if fileWriter.close() throws
- ArrowAllocatorLeakCheck: wrap super.afterAll() in try/finally so the leak assertion runs even if a sibling mixin's teardown throws

Co-authored-by: Isaac
When the server exits with ArrowLeakExitCode, throw RuntimeException unconditionally. This surfaces the leak as a test failure when stop() is called from afterAll(). Removes the stoppingFromShutdownHook flag and the halt() call. Co-authored-by: Isaac
The converter(rows.iterator) form returns a Scala-mapped iterator that is not AutoCloseable. When sendBatch throws (e.g., client disconnect), the underlying ArrowBatchWithSchemaIterator is never closed, leaking 131072 bytes into ArrowUtils.rootAllocator and failing ArrowAllocatorLeakCheck in SparkConnectServiceE2ESuite. Fix: use ArrowBatchWithSchemaIterator directly with try/finally to ensure close() is always called. This was previously reverted in the simplify commit; restoring it because SparkConnectServiceE2ESuite now catches it. Co-authored-by: Isaac
…Execution" This reverts commit 64d3c27.
rowToArrowConverter wraps ArrowBatchWithSchemaIterator in a Scala .map() which is not AutoCloseable. In the LocalTableScanExec branch, if sendBatch throws (e.g., client disconnect), the underlying iterator was never closed, leaking 131072 bytes into ArrowUtils.rootAllocator. Extract mkBatches to create the raw ArrowBatchWithSchemaIterator directly. LocalTableScanExec uses mkBatches with try/finally to guarantee close(). converter is rewritten to delegate to mkBatches, eliminating duplication of the toBatchWithSchemaIterator parameters. Co-authored-by: Isaac
Co-authored-by: Isaac
…ule import ArrowBatchWithSchemaIterator is private[sql] and not accessible across module boundaries when compiling connect/server against sql/core jar. Remove explicit import and type annotation; let the compiler infer the type from the return of toBatchWithSchemaIterator. The original rowToArrowConverter already accessed rowCountInLastBatch this way. Co-authored-by: Isaac
…x Arrow leak race condition

In local mode, a Spark job submitted by an execute thread may still have tasks running (and holding ArrowBatchWithSchemaIterator allocations from ArrowUtils.rootAllocator) after the execute thread itself terminates. clearAllExecutions() was only waiting for the execute thread to exit, not for executor tasks to release Arrow memory, causing SparkConnectServiceE2ESuite and related suites to fail ArrowAllocatorLeakCheck with 131072 bytes still allocated.

Fix: call spark.sparkContext.cancelAllJobs() after execute threads exit, then eventuallyWithTimeout-wait for rootAllocator memory to reach zero before proceeding. This ensures task completion listeners have run and Arrow memory is fully released before ArrowAllocatorLeakCheck fires.

Co-authored-by: Isaac
The memory wait in clearAllExecutions() was too timing-sensitive: cancelAllJobs() only signals cancellation; CPU-bound Arrow tasks from range(1000000) can hold 131072 bytes for >30 seconds post-cancel, causing the afterEach check to time out (CI build 67840409551). By afterAll() time all earlier tests' tasks have completed; only the last test's tasks may still be running, and they finish quickly after cancellation. Moving the check there avoids the race while still catching genuine leaks before ArrowAllocatorLeakCheck fires. Co-authored-by: Isaac
…st-stop
The eventuallyWithTimeout { assert memory == 0 } in afterAll() ran before
SparkContext.stop(), so tasks could still hold Arrow allocations (Executor.stop()
only calls threadPool.shutdown() without awaitTermination). This caused permanent
131072-byte leak failures in SparkConnectServiceE2ESuite CI.
ArrowAllocatorLeakCheck.afterAll() already checks memory in a `finally` block
after `super.afterAll()` (which calls SparkContext.stop()), so the assertion
fires at the correct time. Remove the redundant pre-stop wait entirely.
Co-authored-by: Isaac
Scalastyle enforces ASCII-only characters; the em dash introduced in the previous commit caused a nonascii.message lint failure in CI. Co-authored-by: Isaac
ArrowUtils is no longer referenced after removing the pre-stop Arrow memory assertion. Co-authored-by: Isaac
… upstream master" This reverts commit b6f681f.
…tor to prevent memory leak

ArrowBatchWithSchemaIterator allocates a child allocator from ArrowUtils.rootAllocator. In the RDD execution path, the converter lambda wraps it in a plain Scala map() iterator, so batches.close() was never called after the task completed or was cancelled, leaking 131072 bytes per partition into the root allocator.

Register a TaskCompletionListener so close() fires on task success, failure, or cancellation. Option(TaskContext.get()) safely handles the driver-thread case (sendCollectedRows) where there is no task context.

Co-authored-by: Garland Zhang <garland.zhang@databricks.com>
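The listener pattern from this commit can be sketched without Spark on the classpath. `TaskCtx` below is a hypothetical stand-in for Spark's TaskContext (whose real registration method is addTaskCompletionListener); only the shape of the pattern is shown.

```scala
// `TaskCtx` is a hypothetical stand-in for Spark's TaskContext.
trait TaskCtx { def addCompletionListener(f: () => Unit): Unit }

// Register cleanup only when a task context exists; on the driver thread
// (the sendCollectedRows case) ctxOpt is None and the caller must fall back
// to try/finally instead.
def closeOnTaskCompletion(ctxOpt: Option[TaskCtx], resource: java.io.Closeable): Unit =
  ctxOpt.foreach(_.addCompletionListener(() => resource.close()))
```

Wrapping the lookup in Option is what makes the same code path safe both inside a task and on the driver, which is exactly the distinction the commit message draws.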
The previous fix added a redundant TaskCompletionListener (TCL) to the
`converter` function, but `ArrowConverters.toBatchWithSchemaIterator`
already passes `TaskContext.get()` to `ArrowBatchWithSchemaIterator`,
which registers its own TCL internally. The double registration caused
`close()` to be called twice, and since `allocator.close()` is not
idempotent in Arrow, the second call threw `IllegalStateException`,
leaving 131072 bytes still reported as leaked.
This commit corrects the fix with two changes:
1. Remove the redundant TCL from `converter` — the iterator already
closes itself via its own internally-registered TCL.
2. Fix `sendCollectedRows` (used by CollectLimitExec/CollectTailExec) to
call `mkBatches` directly and wrap iteration in `try/finally { close() }`.
This path runs on the driver thread where `TaskContext.get() == null`,
so no TCL fires — without `try/finally`, any exception mid-iteration
left the Arrow child allocator open and leaked 131072 bytes.
…ing 0 bytes

Executor.stop() calls threadPool.shutdown() but does NOT call awaitTermination(), so Spark task threads (which hold Arrow child-allocator memory via ArrowBatchWithSchemaIterator) may still be running when SparkContext.stop() returns.

In the E2E suite, the test "Execute is sent eagerly upon iterator creation" submits a Spark job for range(1000000). The client disconnects after receiving the schema response; the execute thread exits. clearAllExecutions() calls cancelAllJobs() (non-blocking), but the task processing 1M rows is still running. SparkContext.stop() shuts down the thread pool without waiting, so the task completion listener (which calls batches.close() and frees 131072 bytes) has not yet fired when ArrowAllocatorLeakCheck runs.

Fix: in afterAll(), after cancelAllJobs(), use eventuallyWithTimeout to poll until ArrowUtils.rootAllocator.getAllocatedMemory == 0. This waits for in-flight task TCLs to fire and release all Arrow memory before the ArrowAllocatorLeakCheck assertion runs.
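The poll-until-zero wait can be sketched as a plain loop. The PR itself uses ScalaTest's eventuallyWithTimeout around ArrowUtils.rootAllocator.getAllocatedMemory; `awaitZeroAllocated` below is an illustrative dependency-free equivalent, not the actual helper.

```scala
// Plain-loop sketch of the afterAll wait: poll a memory gauge until it
// reaches zero, or fail with a leak message once the deadline passes.
def awaitZeroAllocated(allocated: () => Long, timeoutMs: Long, pollMs: Long = 10): Unit = {
  val deadline = System.nanoTime() + timeoutMs * 1000000L
  while (allocated() != 0L) {
    if (System.nanoTime() > deadline) {
      throw new AssertionError(s"Arrow memory leak: ${allocated()} bytes still allocated")
    }
    Thread.sleep(pollMs)
  }
}
```

The key design point, as the commit notes, is that cancellation is asynchronous: the wait gives in-flight task completion listeners time to fire before the leak assertion runs.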
…rAll The eventuallyWithTimeout helper uses a 30-second ceiling, which is too short on slow CI runners (observed 3x slowdowns where Arrow serialization of range(1000000) takes >30 seconds). Use Eventually.eventually with an explicit 3-minute timeout so the wait succeeds even on sluggish machines. Co-authored-by: Isaac
…CI timeout Arrow serialization of 1M rows is CPU-bound and cannot be interrupted. On slow CI runners (3.7x slower than normal), processing 1M rows via ArrowBatchWithSchemaIterator takes >3 minutes, exhausting the afterAll() timeout in ArrowAllocatorLeakCheck before Arrow memory is freed. 100K rows exercises the same case _ => RDD code path in processAsArrowBatches and execute holders persist after completion until periodicMaintenance, so tests that check for holder visibility are unaffected. Co-authored-by: Isaac
…ory leak

The 131072-byte Arrow leak persists even after reducing BIG_ENOUGH_QUERY because two other tests hold large datasets that prevent the execute thread from completing cleanup within the 30-second clearAllExecutions() window:

1. SparkConnectServiceE2ESuite: buildLocalRelation((1 to 1000000)) creates 1M rows processed via LocalTableScanExec on the execute thread. The test does not consume results, so the execute thread may block sending to gRPC before clearAllExecutions() can interrupt it. Reduce to 1000 rows; any non-empty local relation suffices to verify FINISHED state behavior.

2. SparkConnectServerTestSuite: range(1000000) with only iter.hasNext called (results not consumed). Reduce to range(100000) to match other tests.

Co-authored-by: Isaac
Upstream apache/spark master updated dev/gen-protos.sh to use ruff format (replacing black) and updated the workflow to install ruff==0.14.8. Sync our fork's workflow to match, fixing the 'ruff: command not found' failure in the Protobuf/CodeGen check. Co-authored-by: Isaac
Fixes Arrow off-heap memory leaks in Spark Connect and adds afterAll guards to detect future leaks pre-merge.
Leaks fixed:
iterator.
drained but never closed.
Detection guards added — afterAll assertions on ArrowUtils.rootAllocator.getAllocatedMemory == 0 in:
future subclass)
Why are the changes needed?
ArrowUtils.rootAllocator is a JVM-wide singleton. Every toBatchWithSchemaIterator and fromIPCStream call allocates a child allocator and Arrow buffers from it. If the iterator is not explicitly closed, those buffers are never freed, causing off-heap memory growth on the driver. A related leak in the deserialization path (fromIPCStream) was fixed in SPARK-54696. This PR closes the complementary serialization-path gap that SPARK-54696 missed, and adds test-time assertions to catch regressions before merge.
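The test-time assertion added by this PR reduces to a check that the JVM-wide root allocator reports zero bytes after teardown. The sketch below is a hypothetical minimal version: `Allocator` stands in for Arrow's BufferAllocator (whose real accessor is getAllocatedMemory), and a bare function replaces the ScalaTest afterAll hook of the actual ArrowAllocatorLeakCheck trait.

```scala
// `Allocator` is a stand-in for Arrow's BufferAllocator interface.
trait Allocator { def getAllocatedMemory: Long }

// Fail the suite if any Arrow buffers are still allocated after teardown,
// mirroring the failure message seen in the SparkConnectProtoSuite log above.
def assertNoArrowLeak(root: Allocator): Unit = {
  val bytes = root.getAllocatedMemory
  assert(bytes == 0L, s"Arrow rootAllocator memory leak: $bytes bytes still allocated")
}
```

Because the root allocator is a singleton shared across suites, the assertion must run after all session/context teardown, which is why mixin ordering matters in the real trait.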
Does this PR introduce any user-facing change?
No.
How was this patch tested?
build/sbt "connect/Test/testOnly org.apache.spark.sql.connect.planner.SparkConnectPlannerSuite
org.apache.spark.sql.connect.planner.SparkConnectProtoSuite"
build/sbt "sql/Test/testOnly org.apache.spark.sql.execution.arrow.ArrowConvertersSuite"
Was this patch authored or co-authored using generative AI tooling?
Generated-by: Claude Sonnet 4.6