Story #6024
closedAs a plugin writer, I can iterate over repository versions in a memory efficient manner
100%
Description
Motivation¶
Plugins need to iterate over the content in a repository version to create a publication. Pulp
should offer a batching iterator over content in order to limit the memory footprint and to allow
efficient use of content batches in DB operations.
Solution¶
We could make batch_qs()
available to plugin writers. But this generator is hard to use correctly because of the ordering requirement (even in pulpcore it has not been used correctly).
Instead, add a generator to RepositoryVersion
, which does "the right thing" by default and is customizable by plugin writers if necessary.
For an example use case, see https://github.com/gmbnomis/pulp_cookbook/blob/batched_publication_upstream/pulp_cookbook/app/tasks/publishing.py#L87-L117
Updated by gmbnomis about 5 years ago
Updated by bmbouter almost 5 years ago
- Status changed from NEW to POST
PR available at: https://github.com/pulp/pulpcore/pull/509
Added by gmbnomis almost 5 years ago
Updated by gmbnomis almost 5 years ago
- Status changed from POST to MODIFIED
- % Done changed from 0 to 100
Applied in changeset pulpcore|2a9b28821b204329946536d878e0e1f7687ed521.
Updated by bmbouter almost 5 years ago
- Status changed from MODIFIED to CLOSED - CURRENTRELEASE
Add a batching content iterator
content_batch_qs()
toRepositoryVersion
Plugins need to iterate over the content in a repository version to create a publication.
RepositoryVersion.content_batch_qs()
generates content batches to efficiently iterate over all content.fixes: #6024 https://pulp.plan.io/issues/6024