Issue #2835: Tasks stuck in waiting after restart of pulp services - Pulp

Actions

Send by e-mail Copy link

Issue #2835

closed

Tasks stuck in waiting after restart of pulp services

Added by gmbnomis over 7 years ago. Updated over 5 years ago.

Status:

CLOSED - CURRENTRELEASE

Priority:

High

Assignee:

dalley

Category:

Sprint/Milestone:

Start date:

Due date:

Estimated time:

Severity:

3. High

Version:

2.13.1

Platform Release:

2.15.2

OS:

RHEL 7

Triaged:

Yes

Groomed:

Sprint Candidate:

Tags:

Pulp 2

Sprint:

Sprint 31

Quarter:

Description

We observed tasks being stuck in "Waiting" state and not beginning to run multiple times in the past (often after upgrading Pulp (2.13.2 in this case). Sometimes also after a reboot.) This looks similar to issue #1838

For example, the task list looks like this when the problem occurs:

$ pulp-admin tasks list
+----------------------------------------------------------------------+
                                 Tasks
+----------------------------------------------------------------------+

Operations:  publish
Resources:   e2 (repository)
State:       Waiting
Start Time:  Unstarted
Finish Time: Incomplete
Task Id:     4a28ec8d-c02e-4a64-b00f-e39adc9e7184

Operations:  publish
Resources:   e3 (repository)
State:       Waiting
Start Time:  Unstarted
Finish Time: Incomplete
Task Id:     0c7e8a5a-d3da-401f-b6da-220812a51471

Operations:  publish
Resources:   e4 (repository)
State:       Waiting
Start Time:  Unstarted
Finish Time: Incomplete
Task Id:     c43f74ae-00a4-435c-89b8-74011e8f2fa7

Operations:  sync
Resources:   e1 (repository)
State:       Waiting
Start Time:  Unstarted
Finish Time: Incomplete
Task Id:     f5e26ee1-158d-4323-bb0e-d2d954ae07d5

Moreover, new tasks will not begin to run. In this situation, restarting the pulp services does not help. The only remedy we came up with is to cancel all waiting tasks and then restart the pulp services.

The situation seems to occur when pulp services are restarted with tasks being queued. The following script generates this situation for me in a vagrant pulp dev environment (docker, 2.13 dev branch). The number of workers is set to 1 in order to ensure that tasks get queued.

set -x

sudo bash -c "echo -e 'PULP_CONCURRENCY=1\nPYTHONIOENCODING="UTF-8"' > /etc/default/pulp_workers"

sudo systemctl restart pulp_celerybeat.service pulp_resource_manager.service pulp_workers.service
sleep 5

for i in e1 e2 e3 e4 ; do pulp-admin rpm repo create --repo-id $i --feed https://dl.fedoraproject.org/pub/epel/7/x86_64/ --serve-https true --relative-url $i ; done

pulp-admin rpm repo sync run --repo-id e1 --bg ; for i in e2 e3 e4 ; do pulp-admin rpm repo publish run --repo-id $i --bg ; done
sleep 10
pulp-admin tasks  list
sudo systemctl restart pulp_celerybeat.service pulp_resource_manager.service pulp_workers.service
sleep 2m
pulp-admin rpm repo sync run --repo-id e1 --bg
pulp-admin tasks  list

The end result is the task list shown above.

Files

Download all files

gdb_process_dumps.zip (9.24 KB) gdb_process_dumps.zip		dalley, 09/13/2017 05:42 AM
celery_startup_log_bootsteps_and_signal.txt (8.12 KB) celery_startup_log_bootsteps_and_signal.txt		dalley, 09/19/2017 02:10 AM
celery_startup_log_signal_only.txt (7.38 KB) celery_startup_log_signal_only.txt		dalley, 09/19/2017 02:10 AM

Related issues

Actions

Send by e-mail Copy link

Also available in: Atom PDF

Project

Profile

Help

Pulp

Agile boards

Custom queries

Issue #2835

Tasks stuck in waiting after restart of pulp services

Updated by bmbouter over 7 years ago

Updated by ttereshc over 7 years ago

Updated by ttereshc over 7 years ago

Updated by ttereshc over 7 years ago

Updated by dalley over 7 years ago

Updated by mhrivnak over 7 years ago

Updated by dalley over 7 years ago

Updated by mhrivnak over 7 years ago

Updated by ttereshc over 7 years ago

Updated by mhrivnak over 7 years ago

Updated by jortel@redhat.com about 7 years ago

Updated by dalley about 7 years ago

Updated by dalley about 7 years ago

Updated by dalley about 7 years ago

Summary:¶

Reproduction Steps:¶

Symptoms:¶

How to "fix":¶

Origin of this issue:¶

Mitigation:¶

Debugging:¶

Added by dalley about 7 years ago

Added by dalley about 7 years ago

Updated by dalley about 7 years ago

Updated by dalley about 7 years ago

Updated by mhrivnak about 7 years ago

Updated by jortel@redhat.com about 7 years ago

Updated by mhrivnak about 7 years ago

Updated by dalley about 7 years ago

Updated by rchan almost 7 years ago

Updated by dalley almost 7 years ago

Updated by rchan almost 7 years ago

Updated by rchan almost 7 years ago

Updated by dalley almost 7 years ago

Added by dalley almost 7 years ago

Updated by dalley almost 7 years ago

Updated by bmbouter almost 7 years ago

Added by dalley almost 7 years ago

Updated by dalley almost 7 years ago

Updated by daviddavis almost 7 years ago

Updated by pthomas@redhat.com over 6 years ago

Updated by bmbouter over 6 years ago

Updated by bmbouter over 6 years ago

Updated by bmbouter over 6 years ago

Updated by bmbouter over 5 years ago