Skip to content

Align master#7856

Open
vkarampudi wants to merge 161 commits into
tensorflow:masterfrom
vkarampudi:align-master
Open

Align master#7856
vkarampudi wants to merge 161 commits into
tensorflow:masterfrom
vkarampudi:align-master

Conversation

@vkarampudi

Copy link
Copy Markdown
Contributor

No description provided.

vkarampudi added 30 commits May 17, 2026 19:47
…traint files to resolve pip installer conflicts
…-build-isolation to fix build-isolation errors on Python 3.13
…che-beam wheels and transitive protobuf v6 conflict
…un on NIGHTLY and GIT_MASTER"

This reverts commit 2a8d8a5.
…e-beam's setup script under --no-build-isolation
@vkarampudi vkarampudi force-pushed the align-master branch 2 times, most recently from dbea6fd to 5505684 Compare June 12, 2026 23:47
@keerthanakadiri keerthanakadiri requested a review from nikelite June 15, 2026 14:41
@keerthanakadiri keerthanakadiri self-assigned this Jun 15, 2026

@rwitcher rwitcher left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Comments added

python-version: ['3.10', '3.11', '3.12', '3.13']
which-tests: ["not e2e", "e2e"]
dependency-selector: ["NIGHTLY", "DEFAULT"]
dependency-selector: ["DEFAULT"]

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why was NIGHTLY removed here?

@vkarampudi vkarampudi Jun 15, 2026

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nightly will run on tf 2.22.0.dev and failing with incompatability issues, since we are targeting tf 2.21 for this release, Default workflows would be fine, Nightly's are blocking rest of the workflows to build.

| 'WritePredictionLogs' >> beam.io.WriteToTFRecord(
os.path.join(inference_result.uri, _PREDICTION_LOGS_FILE_NAME),
file_name_suffix='.gz',
num_shards=self._get_num_shards(self._beam_pipeline_args),

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How is "num_shards" used? It's not clear to my why this was added.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

num_shards was added to configure the output sharding strategy dynamically based on the runner environment:

  1. Avoiding Local Runner Bugs: When executing the pipeline locally (e.g., using DirectRunner or PrismRunner), Apache Beam's dynamic sharding (num_shards=0) combined with local loopback worker setups can trigger file rename/cleanup bugs in Beam's FileBasedSink. Forcing num_shards=1 for local runs avoids these issues by ensuring a single deterministic file is written and renamed.
  2. Preserving Distributed Scaling: When running on distributed runners like Google Cloud Dataflow, _get_num_shards returns 0. This allows Apache Beam to dynamically scale the number of output shards based on the volume of data being processed, preventing write bottlenecks.

@rwitcher rwitcher left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@vkarampudi vkarampudi removed the request for review from nikelite June 16, 2026 07:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants