Add QEMU Disk Image Extractor #1691

Closed
0xXA wants to merge 12 commits into google:main from 0xXA:qemu-plugin

Conversation

@0xXA (Contributor) commented Jan 20, 2026

Closes: #1213

@0xXA (Contributor, Author) commented Jan 20, 2026

Testbed: google/security-testbeds#187

Comment thread extractor/filesystem/embeddedfs/qcow2/qcow2.go Outdated
@alessandro-Doyensec (Collaborator):

Hello @0xXA

Thanks for the contribution!

I left a few comments that may need your attention.

Could you also consider splitting the qcow2.go file into multiple ones? For example, the structs related to IV generation could be moved to a separate file. This isn’t a strict requirement, so I’m open to your suggestions on the best approach.

@0xXA (Contributor, Author) commented Feb 5, 2026

Could you also consider splitting the qcow2.go file into multiple ones? For example, the structs related to IV generation could be moved to a separate file. This isn’t a strict requirement, so I’m open to your suggestions on the best approach.

Thanks for the comments, gentlemen.

I don't think it's a wise idea to split qcow2.go into multiple files, because it'll become harder to manage. In addition, if any future extractor requires IV generation, I'll move this logic to common.go so other extractors can use it. For now it's fine as it is.

@alessandro-Doyensec (Collaborator):

Hi @0xXA

I don't think it's a wise idea to split qcow2.go into multiple files, because it'll become harder to manage. In addition, if any future extractor requires IV generation, I'll move this logic to common.go so other extractors can use it. For now it's fine as it is.

I'm having trouble understanding the code flow right now. Please split this into separate files.

I noticed that some components implement complicated patterns such as encryption and raw binary data manipulation. To ensure correctness, please also test those components separately.

Note: Please, in the future, let me mark conversations as resolved so I can track what has and hasn't been addressed.

@erikvarga (Collaborator):

If there are specific larger categories of helper functions, those would be useful to move to separate files (ideally with separate tests for their major functions).
E.g. the Extractor interface functions (which mostly just call the other helper funcs) could stay in qcow2.go, and IV-related functions could go into an iv.go (and would thus be easier to move into common later). We can also have a common.go for anything else that doesn't fit elsewhere. All of this can use the same package name, so we'd only be moving code.

@erikvarga (Collaborator):

As a separate question, are there any helper libraries available for any of the qcow2 parsing logic being implemented here? Since we already need to import a couple new helper libs like github.com/emmansun/gmsm/sm4 it should be fine to introduce a few more if that helps reduce the SCALIBR code size.

@0xXA (Contributor, Author) commented Feb 5, 2026

Some encryption code related to grain and table parsing can't be tested individually, because we can't fully emulate or recreate the format's behavior on the spot. Doing so would complicate the plugin, since all we need is to read and understand the format.

I'll move the rest of the code to separate files.

As a separate question, are there any helper libraries available for any of the qcow2 parsing logic being implemented here? Since we already need to import a couple new helper libs like github.com/emmansun/gmsm/sm4 it should be fine to introduce a few more if that helps reduce the SCALIBR code size.

Good question. There are no known working qcow2 parsers in Golang. It took me 2 months to create this parser: reverse-engineering the format by hand from disk, slicing files sector by sector, testing those sectors individually by manually encrypting/decrypting them with master-key materials, manually recreating the master-key materials to match the behavior of qemu-img, recreating table and grain parsing, verifying structure definitions against the official documentation, and so on. In short, if a parsing library had been available, it would have taken around 3 minutes to build this parser.

But that's what makes it interesting. Just like a Patek Philippe, it's a masterpiece.

@0xXA (Contributor, Author) commented Feb 5, 2026

If there are specific larger categories of helper functions, those would be useful to move to separate files (ideally with separate tests for their major functions).
E.g. the Extractor interface functions (which mostly just call the other helper funcs) could stay in qcow2.go, and IV-related functions could go into an iv.go (and would thus be easier to move into common later). We can also have a common.go for anything else that doesn't fit elsewhere. All of this can use the same package name, so we'd only be moving code.

I will separate IV-related logic into iv.go, crypto-related logic into crypto.go, compression-related logic into compression.go, and format-related structures into format.go. Everything else will stay in qcow2.go.

Does this align with what you had in mind, Erik?

0xXA added 2 commits February 6, 2026 12:35
Closes: google#1213
Signed-off-by: Yuvraj Saxena <ysaxenax@gmail.com>
Signed-off-by: Yuvraj Saxena <ysaxenax@gmail.com>
@0xXA (Contributor, Author) commented Feb 6, 2026

@erikvarga Done!

Comment thread extractor/filesystem/embeddedfs/qcow2/qcow2.go Outdated
@alessandro-Doyensec (Collaborator):

Hello @0xXA

Thanks for the changes.

I noticed that the initESSIV function and the essivCipher field are never exercised by tests. Could you please add tests for them as well? A simple test like this would suffice:

  • Cipher some data using "aes"
  • Call initESSIV
  • Call config.Decrypt
  • Verify that gotData is equal to the one generated by aes

Alternatively, adding another .qcow2 which uses ivGenESSIV in the full TestExtractValidQCOW2 would suffice, but I assume it would be complicated.
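The bullet steps above could take roughly this standalone shape. The following is a minimal sketch of the common ESSIV construction (IV = encrypt(hash(key), sector number)); the `essivIV` name and the code are illustrative assumptions, not SCALIBR's actual initESSIV/config.Decrypt implementation:

```go
package main

import (
	"crypto/aes"
	"crypto/sha256"
	"encoding/binary"
	"fmt"
)

// essivIV sketches the common ESSIV construction: the IV for a sector is
// the sector number (little-endian, zero-padded to the block size)
// encrypted with AES keyed by a hash of the data key.
func essivIV(key []byte, sector uint64) []byte {
	salt := sha256.Sum256(key) // hashed key becomes the IV-cipher key
	block, err := aes.NewCipher(salt[:])
	if err != nil {
		panic(err)
	}
	plain := make([]byte, block.BlockSize())
	binary.LittleEndian.PutUint64(plain, sector)
	iv := make([]byte, block.BlockSize())
	block.Encrypt(iv, plain)
	return iv
}

func main() {
	key := []byte("0123456789abcdef")
	iv0 := essivIV(key, 0)
	iv1 := essivIV(key, 1)
	fmt.Println(len(iv0), len(iv1))         // 16 16
	fmt.Println(string(iv0) == string(iv1)) // false: sectors get distinct IVs
}
```

A unit test built on this shape would generate the IV, decrypt with config.Decrypt, and compare against data ciphered directly with "aes", as the bullets describe.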

@0xXA (Contributor, Author) commented Feb 6, 2026

Hello @0xXA

Thanks for the changes.

I noticed that the initESSIV function and the essivCipher field are never exercised by tests. Could you please add tests for them as well? A simple test like this would suffice:

  • Cipher some data using "aes"
  • Call initESSIV
  • Call config.Decrypt
  • Verify that gotData is equal to the one generated by aes

Alternatively, adding another .qcow2 which uses ivGenESSIV in the full TestExtractValidQCOW2 would suffice, but I assume it would be complicated.

Thanks for reviewing, @alessandro-Doyensec, I really appreciate the effort.

I have added the mentioned test cases.

Signed-off-by: Yuvraj Saxena <ysaxenax@gmail.com>
@alessandro-Doyensec (Collaborator):

Hello @0xXA,

To clarify my previous #1691 (comment): GetDiskPartitions returns nil, nil, err when it encounters an error. This would result in a nil pointer dereference in your current implementation. Ref:

I noticed that this pattern/bug is present in other filesystem/embeddedfs packages like vdi and vmdk.

I suggest updating GetDiskPartitions to close the disk directly before returning if an error occurs, removing the need to close it from the outside on the error path.

⚠️ Note: Please, in the future, let me mark conversations as resolved so I can track what has and hasn't been addressed.
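The suggested close-on-error pattern can be sketched as follows. The types and the `getDiskPartitions` stand-in below are hypothetical, not SCALIBR's actual GetDiskPartitions signature; the point is only that the function closes what it opened before returning an error, so callers never call Close() on a nil handle:

```go
package main

import (
	"errors"
	"fmt"
)

// disk is a hypothetical stand-in for the real disk handle.
type disk struct{ closed bool }

func (d *disk) Close() { d.closed = true }

// lastOpened lets the demo observe that the handle really was closed.
var lastOpened *disk

// getDiskPartitions opens a disk and parses its partitions. If parsing
// fails after the disk was opened, it closes the disk itself and returns
// nil handles, so the caller only has to check err.
func getDiskPartitions(fail bool) (*disk, []string, error) {
	d := &disk{}
	lastOpened = d
	if fail { // simulate a parse error after opening
		d.Close() // close here so the caller doesn't have to
		return nil, nil, errors.New("parse error")
	}
	return d, []string{"p1"}, nil
}

func main() {
	d, parts, err := getDiskPartitions(true)
	if err != nil {
		// No d.Close() here: d is nil, and the disk was already closed
		// inside getDiskPartitions, so there is no nil dereference.
		fmt.Println("error:", err, "disk closed:", lastOpened.closed)
		return
	}
	defer d.Close()
	fmt.Println(parts)
}
```

With this shape, the `disk.Close()` calls scattered through the callers (the pattern flagged in vdi and vmdk as well) can simply be deleted from the error paths.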

@erikvarga (Collaborator):

I will separate IV-related logic into iv.go, crypto-related logic into crypto.go, compression-related logic into compression.go, and format-related structures into format.go. Everything else will stay in qcow2.go.

Does this align with what you had in mind, Erik?

Yes, this should work for me. Regarding testing, as long as we trigger the code paths added to the business logic with either individual tests or qcow2 testdata files in the right format, I think that should be good enough for test coverage.

@0xXA (Contributor, Author) commented Feb 6, 2026

I suggest updating GetDiskPartitions to close the disk directly before returning if an error occurs, removing the need to close it from the outside on the error path.

I think it's best to remove the call to disk.Close() in that case. Also, I won't touch the code in the other extractors unless you count that as extra work, or you can do it yourself.

Yes, this should work for me. Regarding testing, as long as we trigger the code paths added to the business logic with either individual tests or qcow2 testdata files in the right format, I think that should be good enough for test coverage.

Sure thing.

Signed-off-by: Yuvraj Saxena <ysaxenax@gmail.com>
@0xXA (Contributor, Author) commented Feb 6, 2026

@erikvarga can you rerun the actions? I just addressed a comment made by @alessandro-Doyensec.

@erikvarga (Collaborator):

I think it's best to remove the call to disk.Close() in that case

IIUIC this is currently a problem in other extractors as well then? That they call Close() on disk which is nil if GetDiskPartitions returns an error?

If that's the case we can do a separate cleanup pass (e.g. moving the close into GetDiskPartitions) after this PR is merged.

@alessandro-Doyensec (Collaborator) commented Feb 6, 2026

IIUIC this is currently a problem in other extractors as well then? That they call Close() on disk which is nil if GetDiskPartitions returns an error?

👍

Basically every disk.Close call should be moved into the GetDiskPartitions function.

@alessandro-Doyensec alessandro-Doyensec added the lgtm The PR has been reviewed and approved ("Looks Good To Me") by vendors helping with code reviews. label Feb 6, 2026
@erikvarga (Collaborator):

Good question. There are no known working qcow2 parsers in Golang

Looking around I see a couple qcow2 implementations such as

https://github.com/zchee/go-qcow2
https://github.com/dypflying/go-qcow2lib
https://github.com/dpeckett/qcow2

Do these generally not parse the disk file in a format compatible with the fs interface we're using?

Just checking that we're doing our due diligence before adding a large amount of new code to SCALIBR.

@0xXA (Contributor, Author) commented Feb 6, 2026

Just checking that we're doing our due diligence before adding a large amount of new code to SCALIBR.

I highly doubt they're going to work. Besides, none of the above-mentioned libraries support the qcow2 v3 header. Moreover, some of them are experimental and don't support features encountered in everyday qcow2 files.

For example, dpeckett/qcow2's README states that it's experimental and doesn't support encryption or compression:

The library is not yet complete. It can read and write most QCOW2 images, but some features are not supported:

Compression (except for reading DEFLATE)
Encryption
Backing files
External data
You shouldn't use this library in any application that requires data integrity. It has not been tested thoroughly and definitely will result in data loss.

The best way to verify this is by comparing the qcow2 header against what I’ve implemented.

What I meant by that statement is that there are no known working parsers that fully adhere to the current qcow2 specification. Experimental or partially implemented features don’t count.

The whole point of this plugin is to handle regular files as well as edge cases (including encryption). If it doesn't even support basic features like compression (which almost all real-world qcow2 files use), then there's no point in adding this plugin; it would only take up more space.

@0xXA (Contributor, Author) commented Feb 9, 2026

@erikvarga I have addressed all the comments.

Comment thread extractor/filesystem/embeddedfs/qcow2/format.go
@0xXA (Contributor, Author) commented Feb 9, 2026

What's up with this internal server error?

@0xXA (Contributor, Author) commented Feb 10, 2026

I have added the required comment; could you rerun the actions, @erikvarga?

@0xXA (Contributor, Author) commented Feb 11, 2026

Hi @erikvarga

As previously agreed, all Salesforce extractors were to be considered as 2 validations and 2 detections, and the n-tuples plugin + pair plugin rework was to be treated as additional work.

However, the issued reward does not seem to align with the overall amount of work completed.

Could you please discuss this with the panel and consider revising the reward accordingly?

Since this PR is merged, can you please assign me something else to work on?

@erikvarga (Collaborator):

I brought up the payout amount for this submission with the rest of the panel and the consensus was that different secret types for a single platform should be grouped as one PRP and that reward amounts should be capped at the maximum amount for the category. For transparency, this was the only secret scanner contribution that received the maximum payout so far since the PRP has resumed this year - other contributions involving less work have received lower amounts.
We'll clarify the above panel decision in the PRP rules page with an update shortly.

@erikvarga (Collaborator):

Since this PR is merged, can you please assign me something else to work on?

We haven't finished evaluating all the new PRP submissions yet so we'll likely need until tomorrow to decide on new issue assignments.

@0xXA (Contributor, Author) commented Feb 12, 2026

I brought up the payout amount for this submission with the rest of the panel and the consensus was that different secret types for a single platform should be grouped as one PRP and that reward amounts should be capped at the maximum amount for the category. For transparency, this was the only secret scanner contribution that received the maximum payout so far since the PRP has resumed this year - other contributions involving less work have received lower amounts. We'll clarify the above panel decision in the PRP rules page with an update shortly.

Appreciate you looking into it, brother :)

From my understanding, the payout for 1 detection + 1 validation (complex) = n detections + n validations.

Is that true?

Capping the payout at the maximum for different secret types belonging to the same product makes sense, but Salesforce is not the only group of secret detectors that will benefit from the n-tuples plugin. I thought you'd increase the reward beyond the maximum payout if it benefits several other secret detectors, just like you did with the VMDK and embedded FS extractors. Or does the cap only apply to secret detectors?

We haven't finished evaluating all the new PRP submissions yet so we'll likely need until tomorrow to decide on new issue assignments.

Sure thing.

@erikvarga (Collaborator):

From my understanding, the payout for 1 detection + 1 validation (complex) = n detections + n validations. Is that true?

More specifically, detection for the N most relevant secrets of a single platform is accepted as one submission, and the payout for detection+validation of the secrets is capped to the amounts described. See the updated "Secret Detectors" section of https://bughunters.google.com/about/rules/open-source/osv-scalibr-patch-rewards-program-rules#reward-amounts

@erikvarga (Collaborator):

Hi @0xXA ,

Our internal crypto reviewer noticed that there are no unit tests for the usage of the XTS cipher mode and for Serpent.
IIUIC the two qcow2 testdata images do not use these encryptions, right? In that case, would it be possible to add unit tests for XTS and Serpent to crypto_test.go?
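As a rough shape for such tests: XTS mode and the Serpent cipher require external packages in Go (e.g. golang.org/x/crypto/xts and a third-party Serpent implementation), so the sketch below uses stdlib AES-CTR purely as a stand-in to show the round-trip structure a crypto_test.go test could follow. The `roundTrip` name is hypothetical, not from the PR:

```go
package main

import (
	"bytes"
	"crypto/aes"
	"crypto/cipher"
	"fmt"
)

// roundTrip encrypts and then decrypts data with AES-CTR and returns the
// recovered plaintext. Real tests would do the same with the XTS mode and
// Serpent cipher used in crypto.go, swapping in those constructions.
func roundTrip(key, iv, plaintext []byte) ([]byte, error) {
	block, err := aes.NewCipher(key)
	if err != nil {
		return nil, err
	}
	ct := make([]byte, len(plaintext))
	cipher.NewCTR(block, iv).XORKeyStream(ct, plaintext)
	pt := make([]byte, len(ct))
	cipher.NewCTR(block, iv).XORKeyStream(pt, ct) // CTR is its own inverse
	return pt, nil
}

func main() {
	key := []byte("0123456789abcdef") // 16-byte AES-128 key
	iv := make([]byte, aes.BlockSize)
	msg := []byte("grain cluster data")
	got, err := roundTrip(key, iv, msg)
	if err != nil {
		panic(err)
	}
	fmt.Println(bytes.Equal(got, msg)) // true
}
```

For XTS/Serpent specifically, the same round-trip check (optionally against published test vectors) verifies both the cipher wiring and the sector-number handling.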

@0xXA (Contributor, Author) commented Feb 17, 2026

would it be possible to add unit tests for XTS and Serpent to crypto_test.go?

Done!

@0xXA (Contributor, Author) commented Feb 18, 2026

@erikvarga Any update on this ?

@erikvarga (Collaborator):

@0xXA I added your changes to our internal import and am now waiting for the crypto reviewer to take another look.

@0xXA (Contributor, Author) commented Feb 18, 2026

@0xXA I added your changes to our internal import and am now waiting for the crypto reviewer to take another look.

Appreciate the update.
Looks like there are some import order issues; let me fix them quickly!

Signed-off-by: Yuvraj Saxena <ysaxenax@gmail.com>
@0xXA (Contributor, Author) commented Feb 20, 2026

Any update on this, @erikvarga ?

@erikvarga (Collaborator):

Still waiting for crypto review, I'll ping the reviewer on Monday if there's no progress.

@0xXA (Contributor, Author) commented Feb 20, 2026

Sure.

What does this mean in the context of Software Inventory Extractor payouts:

Up to XXXX for extraction capabilities that can be matched to a publicly available vulnerability feed (e.g. OSV.dev) to find a significant amount of new vulnerabilities.

Can you give an example of a plugin that satisfies this?
Also, has anyone ever received this reward?

@erikvarga (Collaborator):

For example, the plugin should find new vulns that we wouldn't previously have found when enabled together with osvdev/vulnmatch. If the OSV matcher doesn't work, it can also be fine if the results can be combined with other vuln matcher tools, though that's of course less immediately useful since special setup is needed.

I don't recall us having paid out this reward amount so far since most new package types don't have existing vuln feeds integrated into OSV.dev. One candidate for this might be #1775 depending on the number of vulns associated with MCP that we wouldn't have otherwise found.

@0xXA (Contributor, Author) commented Feb 20, 2026

I'm not sure if I understood it correctly, but let's say I add an extractor for X, where X can't be detected through standard package manager configurations because it wasn't installed with any standard package manager (say dpkg, etc.). If I add an extractor for X to osv-scalibr, what else do I need to add to get the full reward for this category?

copybara-service Bot pushed a commit that referenced this pull request Feb 23, 2026
PiperOrigin-RevId: 873956307
@erikvarga (Collaborator):

Merged in 0afbfa3

@erikvarga closed this Feb 23, 2026

Labels: lgtm — The PR has been reviewed and approved ("Looks Good To Me") by vendors helping with code reviews.

Successfully merging this pull request may close these issues: PRP: Extractor for QEMU disk images

3 participants