layers: support BigData #787

giuseppe · 2020-12-15T20:28:05Z

similarly to containers and images, add support for storing big data
also for layers, so that it is possible to store arbitrary data when
it is not feasible to embed it in the layers JSON metadata file.

Subset of: #775

Signed-off-by: Giuseppe Scrivano gscrivan@redhat.com

giuseppe · 2020-12-15T20:28:30Z

@nalind @rhatdan PTAL

rhatdan · 2020-12-15T21:04:29Z

LGTM
@vrothberg @mtrmac @TomSweeneyRedHat PTAL

cmd/containers-storage/layer.go

docs/containers-storage-get-layer-data.md

mtrmac

A literally one-minute impression, and design concerns like this are definitely decided by @nalind , not me:

This can make sense as an internal implementation mechanism; it’s not obvious to me that this must be exposed as a raw data to external callers. If c/image is only going to consume that data through a zstd:chunked - specific API, only that zstd:chunked - specific API needs to be public.

Making raw data public like this

effectively freezes the format of the raw data, slowing down future improvements.
invites other uses of raw data for unexpected purposes. That can be very useful and allow for unexpected innovations, OTOH it also further cements down the concept of individual layers and makes it even harder to significantly restructure (e.g. to a file-based approach like OSTree or the chunked format).

Again, definitely up to @nalind, just airing my first impression.

mtrmac

A few more notes. I didn’t read all of the code carefully, and I completely defer to actual c/storage maintainers for all the design decisions.

cmd/containers-storage/container.go

docs/containers-storage-get-layer-data.md

docs/containers-storage-set-layer-data.md

layers.go

mtrmac · 2021-01-06T15:54:56Z

layers.go

+
+	// BigDataSizes maps the names in BigDataNames to the sizes of the data
+	// that has been stored, if they're known.
+	BigDataSizes map[string]int64 `json:"big-data-sizes,omitempty"`


Just reading the API documentation, I would expect BigDataSizes to be included in ImageSize. (Otherwise it’s not even clear to me there needs to be a separate faster-to-access record of the sizes.)

I've only added it because it was present in the image API. I am fine with dropping it

Let's drop it.

Why shouldn’t the size be included in ImageSize?

Let's drop it.

dropped and pushed a new version.

Why shouldn’t the size be included in ImageSize?

it is not used at the moment, we can add it if there is any user of it

The Podman repo suggests that store.ImageSize is called for “disk usage statistics” (df?), “inspect”, “prune” and “tree”; I didn’t trace the full call stack, it may be exposed via other features or it might be dead code.

OTOH it’s true that c/image’s types.Image.Size() implementation does its own calculation which does not currently benefit from extending store.ImageSize.

giuseppe · 2021-01-14T15:11:23Z

is there anything more left to address?

rhatdan · 2021-01-19T18:28:33Z

@nalind @mtrmac PTAL

mtrmac · 2021-01-19T19:24:57Z

See the 2 points from my previous review.

giuseppe · 2021-01-20T13:33:57Z

See the 2 points from my previous review.

just replied to them

giuseppe · 2021-01-22T08:13:36Z

@mtrmac @rhatdan anything more holding this PR?

rhatdan · 2021-01-22T14:10:52Z

Not from me. Just waiting for @mtrmac comment.

mtrmac · 2021-01-22T15:51:41Z

*shrug* I don’t have anything to add WRT the implementation; I have some concerns but they are not quite blocking.

As for the more general design review, I know very little about c/storage , that would be up to @nalind I think.

giuseppe · 2021-01-27T19:26:37Z

how to move forward? This is just a subset of the changes I'd like to get merged for zstd:chunked support

rhatdan · 2021-01-29T14:18:31Z

@vrothberg @nalind PTAL so we can get this merged.

vrothberg · 2021-01-29T14:23:22Z

Added to my todo list. I won't make it today but promise to have a look on Monday.

Apologies for not having looked into it until now!

rhatdan

Setting approved

layers.go

vrothberg

LGTM

layers.go

similarly to containers and images, add support for storing big data also for layers, so that it is possible to store arbitrary data when it is not feasible to embed it in the layers JSON metadata file. Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>

Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>

rhatdan · 2021-02-02T11:07:37Z

LGTM

giuseppe mentioned this pull request Dec 15, 2020

Enable zstd:chunked support in containers/image #775

Merged

TomSweeneyRedHat reviewed Dec 15, 2020

View reviewed changes

cmd/containers-storage/layer.go Outdated Show resolved Hide resolved

TomSweeneyRedHat reviewed Dec 15, 2020

View reviewed changes

cmd/containers-storage/layer.go Outdated Show resolved Hide resolved

TomSweeneyRedHat reviewed Dec 15, 2020

View reviewed changes

docs/containers-storage-get-layer-data.md Outdated Show resolved Hide resolved

giuseppe force-pushed the big-data-layers branch from 24a89bc to 3a3b51b Compare December 16, 2020 08:57

giuseppe marked this pull request as draft December 16, 2020 09:02

giuseppe force-pushed the big-data-layers branch from 3a3b51b to 11ee17b Compare December 16, 2020 09:23

giuseppe marked this pull request as ready for review December 16, 2020 09:39

mtrmac reviewed Dec 16, 2020

View reviewed changes

giuseppe mentioned this pull request Jan 4, 2021

Support additional layer store (patch for containers/image) containers/image#1109

Merged

mtrmac reviewed Jan 6, 2021

View reviewed changes

giuseppe force-pushed the big-data-layers branch 3 times, most recently from 44b1af2 to 3e4cb85 Compare January 9, 2021 14:54

rhatdan approved these changes Feb 1, 2021

View reviewed changes

vrothberg reviewed Feb 1, 2021

View reviewed changes

layers.go Outdated Show resolved Hide resolved

layers.go Outdated Show resolved Hide resolved

layers.go Show resolved Hide resolved

giuseppe force-pushed the big-data-layers branch from 3e4cb85 to a975d8d Compare February 1, 2021 13:29

vrothberg reviewed Feb 1, 2021

View reviewed changes

giuseppe force-pushed the big-data-layers branch from a975d8d to 755130c Compare February 1, 2021 13:50

mtrmac reviewed Feb 1, 2021

View reviewed changes

layers.go Show resolved Hide resolved

giuseppe added 3 commits February 2, 2021 11:39

layers: support BigData

13260d4

similarly to containers and images, add support for storing big data also for layers, so that it is possible to store arbitrary data when it is not feasible to embed it in the layers JSON metadata file. Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>

containers-storage: support layer bigdata

9cefbae

Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>

tests: test layer bigdata

55bb805

Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>

giuseppe force-pushed the big-data-layers branch from 755130c to 55bb805 Compare February 2, 2021 10:39

rhatdan merged commit 94f18b8 into containers:master Feb 2, 2021

layers: support BigData #787

layers: support BigData #787

Uh oh!

Conversation

giuseppe commented Dec 15, 2020

Uh oh!

giuseppe commented Dec 15, 2020

Uh oh!

rhatdan commented Dec 15, 2020

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mtrmac left a comment

Choose a reason for hiding this comment

Uh oh!

mtrmac left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mtrmac Jan 6, 2021

Choose a reason for hiding this comment

Uh oh!

giuseppe Jan 7, 2021

Choose a reason for hiding this comment

Uh oh!

rhatdan Jan 8, 2021

Choose a reason for hiding this comment

Uh oh!

mtrmac Jan 8, 2021

Choose a reason for hiding this comment

Uh oh!

giuseppe Jan 9, 2021

Choose a reason for hiding this comment

Uh oh!

giuseppe Jan 20, 2021

Choose a reason for hiding this comment

Uh oh!

mtrmac Jan 20, 2021

Choose a reason for hiding this comment

Uh oh!

giuseppe commented Jan 14, 2021

Uh oh!

rhatdan commented Jan 19, 2021

Uh oh!

mtrmac commented Jan 19, 2021

Uh oh!

giuseppe commented Jan 20, 2021

Uh oh!

giuseppe commented Jan 22, 2021

Uh oh!

rhatdan commented Jan 22, 2021

Uh oh!

mtrmac commented Jan 22, 2021

Uh oh!

giuseppe commented Jan 27, 2021

Uh oh!

rhatdan commented Jan 29, 2021

Uh oh!

vrothberg commented Jan 29, 2021

Uh oh!

rhatdan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vrothberg left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rhatdan commented Feb 2, 2021

Uh oh!

Uh oh!