I have a scenario here: when hosting-cron is killed (by a reboot, for example), it doesn't have time to clean up after itself and leaves a semaphore in the variable table that keeps it from running properly after the reboot.
The symptom looks like this:
dispatching queues [0.56 sec, 29.16 MB] [notice]
queue cron already running [0.57 sec, 30.36 MB] [notice]
The workaround is to manually update the system table:
mysql> update variable set value="i:0;" where name = 'hosting_queue_cron_running';
I think this should be handled gracefully: when drush is killed, it should send rollback signals if possible. Short of that, we should at least make sure certain cleanup hooks can be run on interruptions such as this.
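The "cleanup hooks on interruption" idea can be sketched with a plain POSIX signal trap; this is an illustrative sketch, not the actual drush mechanism, and note that a hard kill cannot be intercepted:

```shell
#!/bin/sh
# Hypothetical sketch: release the semaphore via a signal trap, analogous to
# what drush could do when interrupted. NB: SIGKILL (kill -9) and a hard
# power loss cannot be trapped, so those cases still need the stale-semaphore
# check discussed later in this thread.
result=$(
  sh -c '
    cleanup() { echo "semaphore released"; }
    trap cleanup EXIT TERM INT
    echo "dispatching"
  '
)
echo "$result"
```

Running the inner shell to completion (or killing it with TERM/INT) fires the trap, so cleanup runs in every case except an untrappable kill.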
This could be documented in the FAQ in the meantime.
Comments
Comment #1
j0nathan commented: Subscribing.
Comment #2
omega8cc commented: Here is a tiny patch that could help prevent overloading the system with too many crons fired at once, which is one of the possible reasons for the semaphore not being released. It is not a solution, of course (you submitted the semaphore issue I should probably have submitted a few weeks ago). http://github.com/omega8cc/hostmaster/commit/fd4c5413b47c86b4b8cd3d32179...
Comment #3
omega8cc commented: In the meantime we could also add a simple how-to (FAQ), like:
When cron for your sites stops working, it is possible that for some reason (system overload, broken site, timeout, etc.) Aegir failed to release a site's cron semaphore. To release it, use this simple recipe:
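The recipe itself appears to have been elided here; from the original report it is presumably the manual variable reset. A hedged restatement as a one-liner (database name and credentials below are placeholders, not from the thread):

```shell
# Release Aegir's stuck cron semaphore by hand (same fix as in the original
# report). Replace "root" and "aegir_db" with your own credentials/database.
mysql -u root -p aegir_db \
  -e "UPDATE variable SET value='i:0;' WHERE name='hosting_queue_cron_running';"
```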
Comment #4
Anonymous (not verified) commented: It's been added to the FAQ at least.
Comment #5
omega8cc commented: The attached patch should fix this issue. It is a simple port from core's drupal_cron_run() function.
Comment #6
omega8cc commented: This patch also just helped with a locked cron for tasks, as expected, since it releases any old "running" semaphore. Please review and test it in the meantime to make sure it doesn't break anything.
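The logic ported from drupal_cron_run() can be illustrated with a small sketch; the variable names and the 3600-second threshold mirror what this thread describes, but the code below is a stand-alone demonstration, not the patch itself:

```shell
#!/bin/sh
# Hypothetical sketch of the stale-semaphore check ported from Drupal core's
# drupal_cron_run(): if the recorded cron start time is more than an hour
# old, treat the lock as stale and release it.
now=$(date +%s)
semaphore=$((now - 4000))   # pretend cron was flagged as running 4000s ago
limit=3600                  # the 3600s limit mentioned later in this thread
msg=""
if [ $((now - semaphore)) -gt "$limit" ]; then
  # in Aegir this would be roughly variable_del('hosting_queue_cron_running')
  msg="releasing stale semaphore"
fi
echo "$msg"
```

With a semaphore older than the limit, the check fires and the lock is cleared; a fresh semaphore would leave `msg` empty and the queue untouched.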
Comment #7
omega8cc commented: Tested on a few "locked" hostmasters and it works fine.
Marking as RTBC.
Comment #8
DanielJohnston commented: Subscribe. I've run into this a few times now; looking forward to it popping up in the next beta.
Comment #9
Anonymous (not verified) commented: I was just talking in IRC and mused whether this code should live directly in dispatch.hosting.inc, just before the check for 'already running' is performed.
If the code is in hosting_get_queues(), it will run many times in other places, even when the queue summary block loads in the frontend, and that seems like a lot of mechanism just to load a page, even if the variable_get/del calls are cheap.
If it's meant to allow the dispatch to run, it should be specific to dispatch. That's just my opinion; I'm open to others telling me it's not that costly an operation.
Comment #10
omega8cc commented: I agree. The corrected patch: http://gitorious.org/aegir/hostmaster/commit/ce39134436f2b1c2fbd3b3bde05...
Comment #11
omega8cc commented: A minor side effect of moving this check to dispatch.hosting.inc is that, once the locked semaphore passes the 3600s limit, it will take two cron runs before the sites cron starts again, because the check runs after
$queues = hosting_get_queues();
so $info['running'] is still true on the first attempt (which is probably obvious).
Comment #12
anarcat commented: I reviewed your patch, but I think we can improve it by checking the process table. To do this, we need to:
1. store the process ID when dispatching
2. check whether that process ID still exists
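The two steps above can be sketched in plain shell; this is an illustrative sketch of the suggested approach, not code from the patch, and the variable names are made up:

```shell
#!/bin/sh
# Hypothetical sketch of anarcat's suggestion: record the dispatcher's PID
# alongside the semaphore, then test whether that process still exists
# before declaring the queue "already running". `kill -0` sends no signal;
# it only reports whether the PID is valid and visible to us.
pid=$$   # step 1: "store" a PID; we use our own so the alive branch runs
if kill -0 "$pid" 2>/dev/null; then
  status="running"          # process exists: honor the semaphore
else
  status="stale"            # process gone: safe to release the semaphore
fi
echo "$status"
```

Checking the process table this way releases the lock immediately after a crash instead of waiting out the 3600s timeout, at the cost of storing one extra value per dispatch.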
Comment #13
DanielJohnston commented: Does this still need work? I thought it had been fixed in more recent releases.
Comment #14
juliangb commented: Subscribe
Comment #15
crea commented: Subscribing
Comment #16
playfulwolf commented: Any progress? Got the same problem :(
Comment #17
playfulwolf commented: Sorry, wrong window!
Comment #18
playfulwolf commented: Sorry again
Comment #19
steven jones commented: Looking at this issue as part of office hours.
Comment #20
steven jones commented: Looking at the code, this really should be fixed. If you still have issues with this on 1.7 or higher, please re-open and we'll take a look.
Comment #21
steven jones commented: Actually, 'fixed' is probably a better status here.