Story #236
closedDon't re-download rpms if they exist on disk
100%
Description
If a content unit exists on disk but is not on the DB, Pulp re-downloads the content at sync time.
Pulp should instead recognize that a rpm plugin unit (rpm, srpm, drpm, distribution) is on disk and use that to create the unit in the database from. The result would be that content which exists on disk is not re-downloaded.
Steps to Reproduce:
1. Sync a large repo
2. Watch it take a while
3. Clear your mongo db
4. rerun pulp-manage-db
5. Sync the same repo
6. Watch it take the same amount of time (all the files are on the file system, so it should take the same amount of time)
Related issues
Updated by mhrivnak about 9 years ago
- Platform Release set to 2.8.0
- Groomed set to No
- Sprint Candidate set to Yes
Updated by bmbouter almost 9 years ago
- Subject changed from RFE: Don't re-download rpms if they exist on disk to Don't re-download rpms if they exist on disk
Updated by mhrivnak almost 9 years ago
- Priority changed from High to Normal
- Groomed changed from No to Yes
- Tags Documentation added
This is a documentation task with the checklist items above.
Document what to backup from mongodb, but we should probably avoid documenting specific commands that can change over time.
Updated by jortel@redhat.com almost 9 years ago
- Priority changed from Normal to High
- Platform Release deleted (
2.8.0)
Updated by bmbouter almost 9 years ago
- Blocked by Task #1600: Store content using consistent and deterministic paths added
Updated by bmbouter almost 9 years ago
- Blocked by Task #1601: Migrate /var/lib/pulp/content to new 2.8 storage paths. added
Updated by bmbouter almost 9 years ago
- Description updated (diff)
- Private changed from Yes to No
Updated by bmbouter almost 9 years ago
I want to know if there is a way this could be a platform functionality. Perhaps platform could provide a hook which plugin writers could call.
Updated by Anonymous over 8 years ago
- Sprint/Milestone set to 24
- Sprint Candidate changed from No to Yes
Updated by mhrivnak over 8 years ago
- Status changed from NEW to ASSIGNED
- Assignee set to mhrivnak
Updated by mhrivnak about 8 years ago
- Related to Issue #2177: download_repo task fails to mark some units as downloaded added
Updated by mhrivnak about 8 years ago
- Status changed from ASSIGNED to POST
Added by mhrivnak about 8 years ago
Added by mhrivnak about 8 years ago
Revision 3dd7cefc | View on GitHub
Adds docs for re-using files on disk when DB is lost
Updated by mhrivnak about 8 years ago
- Status changed from POST to MODIFIED
- % Done changed from 0 to 100
Applied in changeset pulp:pulp|3dd7cefc2343b2b7b901dce2ce92ebbf14d3e723.
Updated by semyers about 8 years ago
- Status changed from 5 to CLOSED - CURRENTRELEASE
Adds docs for re-using files on disk when DB is lost
fixes #236 https://pulp.plan.io/issues/236