Story #3421

As a plugin writer I have HTTPDownloaders which provide exponential backoff for HTTP 429 errors

Added by daviddavis almost 7 years ago. Updated about 5 years ago.

Status:
CLOSED - CURRENTRELEASE
Priority:
Normal
Assignee:
Category:
-
Sprint/Milestone:
Start date:
Due date:
% Done:

100%

Estimated time:
Platform Release:
Groomed:
Yes
Sprint Candidate:
No
Tags:
Sprint:
Sprint 35
Quarter:

Description

When HTTP downloads fail with 429 responses (Too Many Requests), the HttpDownloader could wait and retry later, possibly with success. This would allow downloads to complete over time instead of the first few completing and then all subsequent requests being rejected. For example, this is how GitHub rate-limits you if you try to download too many files from them at once.

Use Cases

  • The downloaders will back off exponentially and retry when an HTTP 429 response is encountered
  • As a plugin writer, I can disable the exponential backoff and retry feature

Implementation Options

A coordinated rate-limiting implementation is straightforward, but it would probably need to be configured site by site by either users or plugin writers, which is painful and prone to many problems. Instead, a dumb exponential backoff behavior can cause rejected requests to retry automatically, up to 10 times, when the server is overloaded.

We could do something like use backoff-async and pass those options through to the plugin writer, but I'm hoping for something really simple, hence the hardcoded 10 retries.
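
For illustration only, the "dumb" retry behavior described above might look roughly like the sketch below. The function name, the aiohttp usage, and the bare RuntimeError are assumptions made for this example, not the actual HttpDownloader code.

    import asyncio

    import aiohttp

    MAX_ATTEMPTS = 10  # hardcoded, per this story

    async def download(session: aiohttp.ClientSession, url: str) -> bytes:
        """Fetch url, backing off exponentially whenever the server answers 429."""
        for attempt in range(MAX_ATTEMPTS):
            async with session.get(url) as response:
                if response.status != 429:
                    response.raise_for_status()  # non-429 errors still fail immediately
                    return await response.read()
            # Too Many Requests: wait 1s, 2s, 4s, ... before trying again
            await asyncio.sleep(2 ** attempt)
        raise RuntimeError(f"Gave up on {url} after {MAX_ATTEMPTS} attempts (HTTP 429)")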

When to finally fail?

After 10 retries. If it backs off exponentially, it would back off for roughly 17 minutes or so.
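
That figure is just a back-of-the-envelope sum of plain doubling delays, assuming a 1-second base and no jitter:

    # 1s, 2s, 4s, ..., 512s across 10 retries
    total_seconds = sum(2 ** n for n in range(10))  # 1023
    print(total_seconds / 60)                       # ~17 minutes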

Actions #1

Updated by bmbouter almost 7 years ago

Yes, we need to add some sort of rate limiting to the HttpDownloaders.

What about having the downloaders do an exponential backoff and retry with the 429 errors?

Actions #2

Updated by daviddavis almost 7 years ago

+1. That's a good option.

Actions #3

Updated by daviddavis almost 7 years ago

  • Tracker changed from Issue to Story
  • % Done set to 0
Actions #4

Updated by bmbouter almost 7 years ago

  • Project changed from Ansible Plugin to Pulp
  • Subject changed from Github rate limits requests to As a plugin writer I have HTTPDownloaders which provide exponential backoff for HTTP 429 errors
  • Description updated (diff)

Rewriting to be a core story for the downloaders since all plugins would benefit from this feature.

Actions #5

Updated by bmbouter almost 7 years ago

  • Sprint Candidate changed from No to Yes
Actions #6

Updated by bmbouter almost 7 years ago

  • Description updated (diff)
Actions #7

Updated by daviddavis almost 7 years ago

  • Groomed changed from No to Yes
Actions #8

Updated by bmbouter almost 7 years ago

  • Status changed from NEW to ASSIGNED
  • Assignee set to bmbouter
  • Sprint set to Sprint 34

Moving onto the sprint since it's groomed and a core beta deliverable. I'm also assigning it to myself and will post something on it tomorrow.

Actions #9

Updated by bmbouter over 6 years ago

  • Status changed from ASSIGNED to CLOSED - NOTABUG
  • Assignee deleted (bmbouter)
  • Sprint Candidate changed from Yes to No
  • Sprint deleted (Sprint 34)
  • Tags deleted (Pulp 3 MVP)

I couldn't reproduce this issue, and it's possible that the remote servers were having some issues on the day it was a problem. In those situations, maybe the right thing to do is to fail and not continue retrying anyway?

Since we can't reproduce the problem, I think we should hold off on adding retry support for now. I'm removing this from the sprint and closing as notabug. We should reopen it if the need for 429 retry support comes up again.

Actions #10

Updated by bmbouter over 6 years ago

  • Status changed from CLOSED - NOTABUG to ASSIGNED
  • Assignee set to bmbouter
  • Sprint set to Sprint 35
  • Tags Pulp 3 MVP added

After working through some pulp_ansible bugs I can now reproduce this regularly. I'm reopening to add the 429 retry support. Currently the tasks get a lot of errors like this one:

        {
            "code": null,
            "description": "429, message='Too Many Requests'",
            "traceback": null
        },
Actions #11

Updated by bmbouter over 6 years ago

  • Description updated (diff)
  • Status changed from ASSIGNED to POST

Since we went with the 'backoff' library, really we just had to choose how many times we wanted to retry. Since we had already said 10, I just went with that. I updated the ticket description accordingly.

It got pulp_ansible syncing everything out of GitHub with no failures!

PR available at: https://github.com/pulp/pulp/pull/3425
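
For readers skimming this later: with the 'backoff' library, the retry wiring can be sketched roughly as below. The fetch coroutine and the giveup predicate are illustrative assumptions, not the code from the PR above.

    import aiohttp
    import backoff

    def not_a_429(exc: aiohttp.ClientResponseError) -> bool:
        # Give up (i.e. do not retry) unless the server answered 429 Too Many Requests.
        return exc.status != 429

    @backoff.on_exception(backoff.expo, aiohttp.ClientResponseError,
                          max_tries=10, giveup=not_a_429)
    async def fetch(session: aiohttp.ClientSession, url: str) -> bytes:
        async with session.get(url) as response:
            response.raise_for_status()  # raises ClientResponseError on 4xx/5xx
            return await response.read()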

Added by bmbouter over 6 years ago

Revision d8ea62f4 | View on GitHub

Adds 429 Retry Support

All retry support is provided by the 'backoff' library which also provides an asyncio compatible interface.

This updates the docs and docstrings to call out the new retry functionality.

https://pulp.plan.io/issues/3421 closes #3421

Actions #12

Updated by bmbouter over 6 years ago

  • Status changed from POST to MODIFIED
  • % Done changed from 0 to 100
Actions #13

Updated by dkliban@redhat.com over 6 years ago

  • Sprint/Milestone set to 3.0.0
Actions #14

Updated by bmbouter over 5 years ago

  • Tags deleted (Pulp 3, Pulp 3 MVP)
Actions #15

Updated by bmbouter about 5 years ago

  • Status changed from MODIFIED to CLOSED - CURRENTRELEASE
