Ensure the update_downloads job doesn't run concurrently #2157

jtgeibel · 2020-01-25T05:15:02Z

If multiple instances of this job are run concurrently then it is
possible to overcount downloads, at least temporarily. The job first
selects all matching version_downloads and later uses those values to
calculate how many downloads to add to versions and crates. If a
second job is run, it would select some rows from version_downloads
that were already queued for processing by the first task.

If an overcount were to occur, the next time the job is run it should
calculate a negative adjustment and correct the situation. There's no
point in doing extra work and if we eventually need concurrency we
should built that out intentionally. Therefore, this commit wraps the
entire job in a transaction and obtains an transaction level advisory
lock from the database.

If the lock has already been taken the job will fail and will be retried
by swirl. If the duration of this job begins to approach the scheduling
interval, then we will want to increase that interval to avoid
triggering alerts.

If multiple instances of this job are run concurrently then it is possible to overcount downloads, at least temporarily. The job first selects all matching `version_downloads` and later uses those values to calculate how many downloads to add to `versions` and `crates`. If a second job is run, it would select some rows from `version_downloads` that were already queued for processing by the first task. If an overcount were to occur, the next time the job is run it should calculate a negative adjustment and correct the situation. There's no point in doing extra work and if we eventually need concurrency we should built that out intentionally. Therefore, this commit wraps the entire job in a transaction and obtains an transaction level advisory lock from the database. If the lock has already been taken the job will fail and will be retried by swirl. If the duration of this job begins to approach the scheduling interval, then we will want to increase that interval to avoid triggering alerts.

rust-highfive · 2020-01-25T05:15:06Z

r? @sgrif

(rust_highfive has picked a reviewer for you, use r? to override)

jtgeibel · 2020-02-06T00:18:07Z

r? @smarnach

sgrif · 2020-02-06T02:22:43Z

@bors r+

bors · 2020-02-06T02:22:45Z

📌 Commit 0b03ae6 has been approved by sgrif

bors · 2020-02-06T02:22:54Z

⌛ Testing commit 0b03ae6 with merge c07223b...

Ensure the update_downloads job doesn't run concurrently If multiple instances of this job are run concurrently then it is possible to overcount downloads, at least temporarily. The job first selects all matching `version_downloads` and later uses those values to calculate how many downloads to add to `versions` and `crates`. If a second job is run, it would select some rows from `version_downloads` that were already queued for processing by the first task. If an overcount were to occur, the next time the job is run it should calculate a negative adjustment and correct the situation. There's no point in doing extra work and if we eventually need concurrency we should built that out intentionally. Therefore, this commit wraps the entire job in a transaction and obtains an transaction level advisory lock from the database. If the lock has already been taken the job will fail and will be retried by swirl. If the duration of this job begins to approach the scheduling interval, then we will want to increase that interval to avoid triggering alerts.

bors · 2020-02-06T02:36:49Z

☀️ Test successful - checks-travis
Approved by: sgrif
Pushing c07223b to master...

It appears that the additional wrapper transaction added around the update_downloads task causes delays and timeouts to download requests whenever the background job is run. Reverting so that master can be deployed. Revert "Auto merge of rust-lang#2157 - jtgeibel:add-lock-to-update-downloads-job, r=sgrif" This reverts commit c07223b, reversing changes made to c6d13eb.

@ghost

Revert changes to update_downloads task in #2157 It appears that the additional wrapper transaction added around the update_downloads task causes delays and timeouts to download requests whenever the background job is run. Reverting so that master can be deployed. r? @ghost

@ghost

Revert changes to update_downloads task in #2157 It appears that the additional wrapper transaction added around the update_downloads task causes delays and timeouts to download requests whenever the background job is run. Reverting so that master can be deployed. r? @ghost

This is an improved implementation of rust-lang#2157. The previous design relied on a transaction based lock to manage the lifetime of the lock. The wrapper transaction caused the `update_downloads` job to interfere with incoming download requests, and the changes had to be reverted. This implementation uses a session lock which is automatically released even if the callback panics. If multiple instances of the `update_downloads` job are run concurrently then it is possible to over count downloads, at least temporarily. The first job selects all matching `version_downloads` and later uses those values to calculate how many downloads to add to `versions` and `crates`. If a second job is run, it would select some rows from `version_downloads` that were already queued for processing by the first task. If an over count were to occur, the next time the job is run it should calculate a negative adjustment and correct the situation. There's no point in doing extra work and if we eventually need concurrency we should built that out intentionally. Therefore, this commit wraps the entire job in a transaction and obtains an transaction level advisory lock from the database. If the lock has already been taken the job will fail and will be retried by swirl. If the duration of this job begins to approach the scheduling interval, then we will want to increase that interval to avoid triggering alerts.

rust-highfive assigned sgrif Jan 25, 2020

rust-highfive added the S-waiting-on-review label Jan 25, 2020

rust-highfive assigned smarnach and unassigned sgrif Feb 6, 2020

bors merged commit 0b03ae6 into rust-lang:master Feb 6, 2020

jtgeibel deleted the add-lock-to-update-downloads-job branch March 8, 2020 22:37

jtgeibel mentioned this pull request Mar 10, 2020

Ensure the update_downloads job doesn't run concurrently #2266

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Ensure the update_downloads job doesn't run concurrently #2157

Ensure the update_downloads job doesn't run concurrently #2157

Uh oh!

jtgeibel commented Jan 25, 2020

Uh oh!

rust-highfive commented Jan 25, 2020

Uh oh!

jtgeibel commented Feb 6, 2020

Uh oh!

sgrif commented Feb 6, 2020

Uh oh!

bors commented Feb 6, 2020

Uh oh!

bors commented Feb 6, 2020

Uh oh!

bors commented Feb 6, 2020

Uh oh!

Uh oh!

Ensure the update_downloads job doesn't run concurrently #2157

Ensure the update_downloads job doesn't run concurrently #2157

Uh oh!

Conversation

jtgeibel commented Jan 25, 2020

Uh oh!

rust-highfive commented Jan 25, 2020

Uh oh!

jtgeibel commented Feb 6, 2020

Uh oh!

sgrif commented Feb 6, 2020

Uh oh!

bors commented Feb 6, 2020

Uh oh!

bors commented Feb 6, 2020

Uh oh!

bors commented Feb 6, 2020

Uh oh!

Uh oh!