
feat(triggers): handle MSK events#1066

Open
joeyzhao2018 wants to merge 25 commits into main from joey/handle_msk

Conversation


@joeyzhao2018 joeyzhao2018 commented Mar 5, 2026

https://datadoghq.atlassian.net/browse/SLES-2739

In Kafka's wire protocol (KIP-82), header values are always byte[]. Every Kafka client library enforces this:

| Tracer | Injection code | Mechanism |
|---|---|---|
| dd-trace-java | `headers.add(key, value.getBytes(UTF_8))` | `String.getBytes()` → `byte[]` |
| dd-trace-go | `Value: []byte(val)` | Go type conversion → `[]byte` |
| dd-trace-dotnet | `_headers.Add(name, Encoding.UTF8.GetBytes(value))` | `UTF8.GetBytes()` → `byte[]` |

All three tracers accept string trace context values from the propagation layer, convert to UTF-8 bytes at the carrier adapter boundary, and hand byte[] to the Kafka client.
This isn't a quirk of Java's getBytes() — it's the only way Kafka headers work.
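As a concrete illustration of the conversion above (plain Python, not tracer code): the characters of a trace-id string become their ASCII/UTF-8 byte values, which is why a trace id beginning with "369" shows up as the bytes 51, 54, 57 in the serialized event.

```python
# A Kafka header value is always bytes on the wire (KIP-82).
# Tracers inject trace context as a string and UTF-8-encode it,
# so each digit character maps to its ASCII byte value.
trace_id_prefix = "369"  # hypothetical prefix of a trace id

encoded = trace_id_prefix.encode("utf-8")
print(list(encoded))  # [51, 54, 57]
```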

What MSK Lambda does

When MSK triggers a Lambda, AWS serializes the Kafka record to JSON. Since header values are byte[] on the wire, AWS encodes them as decimal byte values. However, the exact JSON
shape depends on the Lambda runtime:

  • Array format (observed in the existing msk_event.json test payloads; I didn't change the support for this, to be safe): byte values as a JSON array of integers

    "headers": [{"x-datadog-trace-id": [51, 54, 57, ...]}]

  • Object format (observed with the Java Lambda runtime): both the records list and the per-header byte values are JSON objects with numeric string keys, and byte values are decimal strings

    "records": {
      "topic-0": {
        "0": {
          "headers": {
            "0": {"someOtherHeader": ["70", "114", ...]},
            "2": {"x-datadog-trace-id": {"0": "52", "1": "54", ...}},
            "4": {"x-datadog-sampling-priority": ["49"]}
          }
        }
      }
    }

  • Note that Datadog headers can appear at any index; non-instrumentation headers may precede them.
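The two encodings above can be normalized back to a string with a small sketch (Python for illustration, not the PR's actual Rust implementation; the helper name is my own):

```python
import json

def decode_header_value(value):
    """Decode an MSK header value to a UTF-8 string.

    Handles both shapes described above:
      - array format: a JSON array of integer byte values
      - object format: a JSON object with numeric string keys
        and decimal-string byte values
    """
    if isinstance(value, dict):
        # Object format: order entries by their numeric key.
        items = sorted(value.items(), key=lambda kv: int(kv[0]))
        byte_values = [int(v) for _, v in items]
    else:
        # Array format: entries may be ints or decimal strings.
        byte_values = [int(v) for v in value]
    return bytes(byte_values).decode("utf-8")

array_form = json.loads('[51, 54, 57]')
object_form = json.loads('{"0": "51", "1": "54", "2": "57"}')
print(decode_header_value(array_form))   # 369
print(decode_header_value(object_form))  # 369
```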

What's the difference between msk_event.json and the newly added msk_event_with_headers.json?

  • msk_event.json represents a standard MSK trigger where the producer didn't attach any Kafka headers, i.e. no Datadog tracer was running on the producer side (or the producer is non-instrumented: a raw Kafka client, a Kinesis Firehose delivery stream, or a schema-registry message). In those cases Lambda still delivers the event, but with "headers": []. It's also the format you get when testing MSK triggers manually in the AWS console, which doesn't inject headers. (source: Claude Code)
  • msk_event_with_headers.json reflects the real-world object format produced by the Java Lambda runtime when a producer instrumented with a Datadog tracer injects trace context as Kafka headers. It includes non-Datadog headers at lower indices to verify that the carrier extraction finds Datadog headers regardless of their position. (source: verified against a real-world setup; testing evidence below)
Screenshot 2026-03-12 at 11 14 33 PM
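The position-independence that the fixture exercises can be sketched in Python (an illustration of the idea only, not the PR's Rust code; the function name is mine):

```python
def find_datadog_headers(headers):
    """Collect Datadog headers from either encoding of the headers field.

    Array format:  [{"name": <bytes>}, ...]
    Object format: {"0": {"name": <bytes>}, "2": {...}, ...}
    Datadog headers may sit at any index, after unrelated headers.
    """
    entries = headers.values() if isinstance(headers, dict) else headers
    found = {}
    for entry in entries:
        for name, value in entry.items():
            if name.startswith("x-datadog-"):
                found[name] = value
    return found

# Object-format headers mirroring the fixture: the Datadog headers
# sit at indices "2" and "4", behind a non-instrumentation header.
object_format_headers = {
    "0": {"someOtherHeader": ["70", "114"]},
    "2": {"x-datadog-trace-id": {"0": "52", "1": "54"}},
    "4": {"x-datadog-sampling-priority": ["49"]},
}
print(sorted(find_datadog_headers(object_format_headers)))
# ['x-datadog-sampling-priority', 'x-datadog-trace-id']
```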

joeyzhao2018 and others added 10 commits March 9, 2026 15:32
MSK event headers delivered by the Java Lambda runtime use a JSON object
with numeric string keys and decimal string values rather than an array
of integers. Records are similarly delivered as an object with numeric
string keys instead of an array.

Update deserialization and carrier extraction to support both formats,
and update the fixture and tests to reflect the real-world payload shape.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
- Replace `n as u8` cast with `u8::try_from(n).ok()` to avoid truncation
- Collapse nested `if let` blocks into a single `if let ... && let ...`
- Replace redundant closure `|o| o.len()` with `serde_json::Map::len`

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@joeyzhao2018 joeyzhao2018 marked this pull request as ready for review March 13, 2026 15:45
@joeyzhao2018 joeyzhao2018 requested a review from a team as a code owner March 13, 2026 15:45
@joeyzhao2018 joeyzhao2018 requested a review from duncanista March 13, 2026 15:45
@duncanista duncanista changed the title feature: handle MSK events feat(triggers): handle MSK events Mar 13, 2026
@duncanista duncanista requested a review from Copilot March 16, 2026 16:07


joeyzhao2018 and others added 4 commits March 16, 2026 12:26


joeyzhao2018 and others added 4 commits March 18, 2026 11:34
performance enhancement from codex

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>
performance gain during normalization

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Copilot AI left a comment


Pull request overview

Adds support for MSK Lambda events where Kafka headers are serialized into JSON in multiple runtime-dependent shapes, enabling trace context extraction from those headers.

Changes:

  • Add fixture payload representing the MSK “object format” headers shape (as observed in Java runtime).
  • Extend MSK trigger parsing to handle headers in both array and object encodings, and extract Datadog trace context from them.
  • Add tests covering object-format MSK events and selecting a record containing trace context.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.

| File | Description |
|---|---|
| bottlecap/tests/payloads/msk_event_with_headers.json | New test payload covering the Java runtime's object-shaped record/headers encoding. |
| bottlecap/src/lifecycle/invocation/triggers/msk_event.rs | Adds header decoding/extraction logic, chooses a record with trace context, and updates tests accordingly. |


Comment on lines +86 to +110
```rust
/// Scans all records in the records map and returns the `(topic_key, record_value)` of the first
/// record whose headers contain a trace context key. Returns `None` if none is found.
fn find_record_with_trace_context(
    records_map: &serde_json::Map<String, Value>,
) -> Option<(String, Value)> {
    for (key, group) in records_map {
        match group {
            // Array format: each topic maps to a JSON array of records.
            Value::Array(arr) => {
                for record in arr {
                    if let Some(headers) = record.get("headers")
                        && headers_has_trace_context(headers)
                    {
                        return Some((key.clone(), record.clone()));
                    }
                }
            }
            // Object format: each topic maps to an object keyed by numeric strings.
            Value::Object(obj) => {
                for record in obj.values() {
                    if let Some(headers) = record.get("headers")
                        && headers_has_trace_context(headers)
                    {
                        return Some((key.clone(), record.clone()));
                    }
                }
            }
            // Tail reconstructed for readability; the diff snippet was truncated at line +110.
            _ => {}
        }
    }
    None
}
```
Contributor Author


"The concern is real but the optimization is premature here. The Lambda payload cap is 6MB total, so even in the worst case the clone is bounded. The locator adds a non-trivial amount of complexity (new enum, more complex extraction logic) for a one-time cost per invocation on what's already a cold path. The current code is significantly easier to read and maintain. " by Claude Code
