Issue #7540: workers and resource-manager go missing during large migration - Migration Plugin - Pulp

Actions

Send by e-mail Copy link

Issue #7540

closed

workers and resource-manager go missing during large migration

Added by jsherril@redhat.com over 3 years ago. Updated over 3 years ago.

Status:

CLOSED - CURRENTRELEASE

Priority:

High

Assignee:

ttereshc

Sprint/Milestone:

Start date:

Due date:

Estimated time:

Severity:

2. Medium

Platform Release:

OS:

Triaged:

Yes

Groomed:

Sprint Candidate:

Tags:

Katello

Sprint:

Sprint 85

Quarter:

Description

When doing a large migration with ~300K rpms, my workers and resource-manager went missing. Upon further investigation, it appeared that postgresql was stuck in a large IO wait trying to commit a large transaction, for ~10-15 minutes.

My guess is that there is a very large transaction that needs to be broken up into smaller ones, probably around saving artifacts (although this is just a guess)

Actions

Copy link

Updated by jsherril@redhat.com over 3 years ago

Description updated (diff)

Actions

Copy link

Updated by ttereshc over 3 years ago

Triaged changed from No to Yes
Sprint set to Sprint 82

Actions

Copy link

Updated by jsherril@redhat.com over 3 years ago

Priority changed from Normal to High

Actions

Copy link

Updated by ttereshc over 3 years ago

Status changed from NEW to ASSIGNED
Assignee set to ttereshc

Actions

Copy link

Updated by rchan over 3 years ago

Sprint changed from Sprint 82 to Sprint 83

Actions

Copy link

Updated by rchan over 3 years ago

Sprint changed from Sprint 83 to Sprint 84

Actions

Copy link

Updated by rchan over 3 years ago

Sprint changed from Sprint 84 to Sprint 85

Actions

Copy link

Updated by ttereshc over 3 years ago

Status changed from ASSIGNED to CLOSED - CURRENTRELEASE

Resolved by multiple fixes released in 0.5.0 and 0.5.1.

The main problem was a memory leaking in createrepo_c which caused a system to use swap and slow everything down. Workers were going missing because heartbeat update was way too slow. Createrepo_ c fixes (dalley++):

Actions

Send by e-mail Copy link

Also available in: Atom PDF

Project

Profile

Help

Migration Plugin

Agile boards

Custom queries

Issue #7540

workers and resource-manager go missing during large migration

Updated by jsherril@redhat.com over 3 years ago

Updated by ttereshc over 3 years ago

Updated by jsherril@redhat.com over 3 years ago

Updated by ttereshc over 3 years ago

Updated by rchan over 3 years ago

Updated by rchan over 3 years ago

Updated by rchan over 3 years ago

Updated by ttereshc over 3 years ago