Concurrently execute a single drush command on multiple sites (con'd) [#1053072]

Comment	File	Size	Author
#34	concurrent-invoke-34.patch	41.92 KB	greg.1.anderson
#26	concurrent-invoke-26.patch	40.75 KB	greg.1.anderson
#25	concurrent-invoke-25.patch	40.55 KB	greg.1.anderson
#21	concurrent-invoke-20.patch	40.22 KB	greg.1.anderson
#16	concurrent-invoke-16.patch	39.99 KB	greg.1.anderson
#15	concurrent-invoke.patch	31.59 KB	greg.1.anderson
#12	parallel-1053072-12.patch	22.95 KB	msonnabaum
#10	parallel-1053072-10.patch	23.21 KB	msonnabaum
#5	parallel-1053072-5.patch	18.75 KB	msonnabaum
#1	1053072-parallel-remote-commands.patch	9.16 KB	msonnabaum
	drush_concurrent.patch	8.75 KB	msonnabaum

Comment #1

msonnabaum commented 6 March 2011 at 19:33

Status	File	Size
new	1053072-parallel-remote-commands.patch	9.16 KB

Last patch was reversed for some reason. Here's a new one that applies against the latest master.

Log in or register to post comments

Comment #2

msonnabaum commented 9 September 2011 at 19:41

Think its time to resurrect this thread. I updated the async version of _drush_proc_open to put in drush deploy, which incorporates the changes that have occurred in drush 5 thus far:

http://drupalcode.org/project/drush_deploy.git/blob_plain/HEAD:/upstream...

This version (and the above patches) treat $cmd as an array, which works well if you have different commands you need to run, which is the case with deploy since each one ssh's into a different machine.

I think I'd also be useful to be able to do something like this however:

drush_invoke_process('@self', 'queue-cron', array(), array(), array('concurrency' => 4);

I'll probably start on a new patch soon as I keep finding myself needing this in other commands, but I wanted to restart the conversation on how we present this at the API level first.

Also, I'm not sure if I got the new integrate stuff right in this version, so I'd appreciate a bit of review there.

Log in or register to post comments

Comment #3

greg.1.anderson commented 9 September 2011 at 20:15

Hey, this looks awesome! Haven't tried the code yet, but the integrate stuff looks correct as you have it.

I feel encouraged to try to plug this in to the regular drush @sitelist some-command handler, so that it would run any time you needed it to. This would be a pretty simple matter of just gening up appropriate $backend_options for each site and calling the new function. I'd have to say that there should be an option to control output, because by default it will look something like this:

site1 >> Some output
site2 >> Different output
site1 >> Some more output
site1 >> Some even more output
site2 >> Different output continues

This is exactly what you'd want if the output was from some lengthy operation (e.g. status logs), but it would be mighty annoying for just a drush status. So by default, I'd say that when concurrency > 1, output would be held in the bucket until the command completed, and then output all at once when the pipe is closed (n.b. commands are still executing concurrently). This is for use from the cli; when called via the API, then the default should be to output everything as soon as it became available.

I note that your current implementation does not yet implement concurrency < N, but it seems this would be pretty easy. You'd want to change your main while loop to while ((sizeof($open_processes)) || (sizeof($cmd))), then move the call to proc_open inside the while loop, popping an item to be opened off of $cmd if concurrency > sizeof($open_processes).

Log in or register to post comments

Comment #4

msonnabaum commented 9 September 2011 at 20:38

Using this for the @sitelist group aliases was actually the original motivation as this is how deploy works, so I definitely want to get that working. I'm unsure about multi-concurrency as a default though, I feel like there are probably some commands where that would be problematic.

I'm actually ok with the output you have there. This is how both capistrano and pdsh work. I find it nicer to see the output in real time than have it sorted. When I do want to see output sorted by site with pdsh, I just pipe it to sort.

And yeah, I just started hacking on the concurrency option yesterday when the need came up. Will try to implement your suggestion there.

Log in or register to post comments

Comment #5

msonnabaum commented 11 September 2011 at 00:45

Status	File	Size
new	parallel-1053072-5.patch	18.75 KB

Ok, here's my first crack at this.

Concurrency option is supported in $backend_options and it also works with groups of site aliases. Don't yet have an option to set blocking.

Log in or register to post comments

Comment #6

moshe weitzman commented 17 September 2011 at 11:12

Priority:	Normal	» Critical
Issue tags:		+Release blocker

Log in or register to post comments

Comment #7

moshe weitzman commented 20 October 2011 at 18:36

Assigned:

Unassigned

» msonnabaum

Log in or register to post comments

Comment #8

moshe weitzman commented 28 October 2011 at 20:57

Whats the next step here?

Log in or register to post comments

Comment #9

greg.1.anderson commented 16 November 2011 at 18:46

@msonnabaum had an updated version at BADCamp, I think, that hasn't been posted here yet.

Log in or register to post comments

Comment #10

msonnabaum commented 16 November 2011 at 19:33

Status	File	Size
new	parallel-1053072-10.patch	23.21 KB

Here's where I left off after BADCAMP.

I think there are still issues with output-label.

Log in or register to post comments

Comment #11

moshe weitzman commented 18 December 2011 at 17:38

Status:

Active

» Needs work

No longer applies cleanly :(. We'd really like to get this in and hook up drush_make as per #1267228-28: Drush Make should use Drush core's native download abilities concurrently

Log in or register to post comments

Comment #12

msonnabaum commented 19 December 2011 at 00:47

Status	File	Size
new	parallel-1053072-12.patch	22.95 KB

New patch that applies. No changes other than resolving upstream conflicts.

Log in or register to post comments

Comment #13

greg.1.anderson commented 21 December 2011 at 01:50

Assigned:

msonnabaum

» greg.1.anderson

I've started to clean up #12 per #1267228-17. Still need to do a little more work on it before it's ready to post.

Log in or register to post comments

Comment #14

msonnabaum commented 21 December 2011 at 04:44

Hmm, I think I like this drush_invoke_concurrent(). That could simplify a path that was getting complex.

Look forward to seeing what you come up with.

Log in or register to post comments

Comment #15

greg.1.anderson commented 21 December 2011 at 08:00

Status	File	Size
new	concurrent-invoke.patch	31.59 KB

Here's a snapshot just for the curious. A lot of stuff works, but there are some variables not being passed through correctly, so a lot of tests are failing too. Will keep working on this.

Log in or register to post comments

Comment #16

greg.1.anderson commented 22 December 2011 at 01:09

Assigned:	greg.1.anderson	» msonnabaum
Status:	Needs work	» Needs review

Status	File	Size
new	concurrent-invoke-16.patch	39.99 KB

Here is a patch that passes all tests (except for one complete test case, which I expect is an ordering issue. :p).

Works really well, and has a lot of coverage with existing tests. In the site alias test, I had to sort the output, because when php-eval is executed multiple times concurrently, the output can appear in any order. The test's expected results look odd, but it works. :)

There is no way to disable concurrent execution right now, and backend invoke will always spawn as many processes as it can all at once. Perhaps we should put in some throttling, and throttle to 1 to disable concurrency? I don't know if it's important. Output is usually ordered sanely due to the fact that different sites take different amounts of time to respond, and output is sent in chunks. Efficiency and readability for free! We may find instances where this is not the case, but for now it seems to be working pretty well.

The other thing I didn't put in was Mark's 'concurrency' switch, that allows the same command to be executed on the same site multiple times. I'm not sure how you want to use that or put that in, but it might work to just add your array_fill to drush_invoke_process. Note that #output-label is now handled at a very low level, so something reasonable will be filled in if you omit it. Set 'no-label' to get rid of it.

Log in or register to post comments

Comment #17

greg.1.anderson commented 22 December 2011 at 01:12

Oh, note that the function drush_invoke_concurrent() was not doing anything at all except calling through to backend invoke, so I just renamed drush_backend_invoke_sitealias_command to drush_backend_invoke_concurrent, and left it at that.

drush_invoke_process still works with its previous function signature, so clients (e.g. features) won't have to rev again.

Log in or register to post comments

Comment #18

msonnabaum commented 22 December 2011 at 02:54

Still looking over this and playing with it a bit, but it doesn't look like I'm getting any output when I run a command across a group of aliases.

This is the command I usually use to test this patch, where @devcloud is a group of 3 aliases:

drush -v @devcloud -y status

Did you test it differently or am I doing something wrong here?

Log in or register to post comments

Comment #19

moshe weitzman commented 22 December 2011 at 03:29

I'm quite concerned about 'now way to turn concurrency off and no way to limit it'. that sounds like a recipe for swamping servers. imagine trying top run updatedb on a site list containing 1000 sites, you would swamp the server where drush is running, the DB server, and maybe more than i can't think of.

Log in or register to post comments

Comment #20

greg.1.anderson commented 22 December 2011 at 03:50

Assigned:	msonnabaum	» greg.1.anderson
Status:	Needs review	» Needs work

Regarding #18, I defined this:

$aliases['two'] = array(
  'site-list' => '@gk.dev,@sos.dev',
);

Then when I run this:

drush -v @two status

It works fine. I just started with master and applied the patch from #17 clean; not sure how your test might be different.

Limits won't be hard to put in; I'll take care of #19.

Log in or register to post comments

Comment #21

greg.1.anderson commented 22 December 2011 at 04:32

Status	File	Size
new	concurrent-invoke-20.patch	40.22 KB

Here is a patch for testing that hard-codes the process limit to two. Just add some intelligence, maybe a drush_get_option somewhere, and it'll be all set.

Still interested to find out why this isn't working for Mark.

Log in or register to post comments

Comment #22

msonnabaum commented 22 December 2011 at 05:00

Same thing if I setup my alias like you have there. I get some output if I use -d, but it looks like it's just giving me the debug output for the remote commands, not the normal stdout.

Log in or register to post comments

Comment #23

greg.1.anderson commented 22 December 2011 at 05:30

Ah, I see the difference. Local sites are working, remote sites emit no output.

Log in or register to post comments

Comment #24

greg.1.anderson commented 22 December 2011 at 16:56

Only had a little bit of time to look at this, but I verified that the cli options are the same with and without the patch (thank you, git stash, for rapid regression capability). It seems that I have a really old drush on my remote server; old drush puts the backend output in the json-encoded return variable, and new drush emits it in real time, prior to the output of the json-encoded results. Prior to this patch, drush would emit the output from the json results if there was no output before the start-marker; just need to put this logic back in.

As a workaround, try upgrading drush to HEAD on your remote machine and see if it starts working.

Log in or register to post comments

Comment #25

greg.1.anderson commented 22 December 2011 at 20:04

Status:

Needs work

» Needs review

Status	File	Size
new	concurrent-invoke-25.patch	40.55 KB

Here is an update that works better with old versions of Drush. Output from the json 'output' field will be displayed if Drush does not emit any real-time output before the json start-marker. Note that there is a limitation in that --debug mode with old versions of remote Drush interfere with the backwards compatibility feature, as debug messages printed out by old Drush are interpreted as regular incremental output by new Drush. It would be possible to improve this, but I don't know how much effort we want to spend on backwards compatibility.

I also altered the return value of drush_invoke_process. Previously (in #10), if there was one process, the return value would be the same as it had been in previous versions of Drush (a $values array with 'output', etc.), but if there were two processes, all of the $value arrays would be merged together with an array + operation. This causes significant data loss, so I changed it to return an array with an element 'multiple' that contains an associative array of $site => $value mappings. If clients call drush_invoke_process with a single site, then they will get a compatible result returned, but they will need to know to test for 'multiple' in the result if a multi-site operation was desired.

If there was a reasonable way to merge together the results, then I'd say maybe we need a flag in $backend_options that causes the 'multiple' return value. I'm not sure if there's anything we can do to make code written to work with a single site happy with the return values from multiple concurrent processes. Thoughts on this?

Log in or register to post comments

Comment #26

greg.1.anderson commented 22 December 2011 at 20:06

Status	File	Size
new	concurrent-invoke-26.patch	40.75 KB

Wrong patch in #25; here's the right one.

Log in or register to post comments

Comment #27

moshe weitzman commented 22 December 2011 at 21:20

I think we could get more test coverage here, no? If system level tests are hard to think of, test a few key functions with unit tests.

Mark or Greg is welcome to commit this when you think it is ready. I'm not able to provide more helpful feedback.

Log in or register to post comments

Comment #28

greg.1.anderson commented 22 December 2011 at 21:40

I'll add tests as time permits. I think this is ready to commit, but will wait for Mark's feedback.

I committed f41d4ba to make the complete test more stable; some code changes (such as this patch) changed the order of the completion candidates, causing spurious test failures. That shouldn't happen any more.

Log in or register to post comments

Comment #29

msonnabaum commented 22 December 2011 at 21:44

Status:

Needs review

» Needs work

THis is working for me so far, but I definitely don't like how it gathers the output for each process in series. For longer commands like drush deploy, you need to see the output as it's happening, even if it's a bit hard to read.

If you prefer sorted output, you can always pipe drush to sort, but you can't unsort the current output.

Log in or register to post comments

Comment #30

greg.1.anderson commented 22 December 2011 at 21:50

Are you still using an old version of drush on the remote end? Make sure that you're up to date on both sides of the communication.

Commands like status, that take a (relatively) long time to bootstrap and a comparatively short time to output will have their output grouped together, which is good. If you have long-running commands that output at arbitrary intervals, you should see the output come out as it is printed on the remote end -- unless you are running an old version of drush on the remote machine, in which case you are stuck waiting until the command is done before you will receive any output (except for log messages).

I suppose a unit test that runs concurrent commands that sleep and print output in a short loop would be useful and illustrative here.

Log in or register to post comments

Comment #31

greg.1.anderson commented 22 December 2011 at 21:58

Wait, in #29, did you mean the output returned in the $values array? In #30 I was referring to the output tha t appeared in the terminal window. We could provide terminal-window-ordered output in $values, if desired. I think that $values should also continue to have the output of each command separated out into its individual 'multiple' => '@sitename' bucket, though.

Log in or register to post comments

Comment #32

moshe weitzman commented 22 December 2011 at 22:22

Another option for a unit test might be to finish the make integration :) Then the make tests act as an implicit test that concurrency isn't totally broken.

Log in or register to post comments

Comment #33

greg.1.anderson commented 22 December 2011 at 22:55

Per #1267228: Drush Make should use Drush core's native download abilities concurrently, we should probably hook up the 'concurrency' feature before committing this.

Log in or register to post comments

Comment #34

greg.1.anderson commented 22 December 2011 at 23:42

Status:

Needs work

» Needs review

Status	File	Size
new	concurrent-invoke-34.patch	41.92 KB

Here's an implementation of 'concurrency' (limits number of processes) and 'invoke-multiple' (for concurrent batch processes, like your old 'concurrency' option).

Go ahead and set the status back if you're still not satisfied with output handling.

Log in or register to post comments

Comment #35

msonnabaum commented 23 December 2011 at 01:27

So I can see the interleaved output if I use -d on the remote command, but it appears that when I do that, I *only* get stderr back. Is that happening for you as well?

This is with drush 4.5 on the other end.

Also, for the tests, I think we just need straight up unit tests for all the functions we're touching here. I started on this last week. I'll commit what I have soon so we can start fleshing them out.

Log in or register to post comments

Comment #36

greg.1.anderson commented 23 December 2011 at 02:36

Drush 4.5, and very old versions of Drush-5, do not support incremental output. They horde all of their output until everything is done, and then emit it all at once. You need to use a recent version of Drush-5 to get incremental output. Furthermore, if you are using Drush 4.5, --debug breaks backwards compatibility, and hides the stdout output. There is nothing we can do to get incremental output out of Drush-4, but if we worked harder on Drush-5, we could fix the problem with --debug.

Log in or register to post comments

Comment #37

msonnabaum commented 23 December 2011 at 06:04

Status:

Needs review

» Reviewed & tested by the community

Ok, this is looking pretty good to me. I'm ready to just commit this and touch it up as necessary when we start converting commands to use it.

Log in or register to post comments

Comment #38

greg.1.anderson commented 23 December 2011 at 19:52

Status:

Reviewed & tested by the community

» Fixed

Committed. Test coverage is actually already pretty good, but we could use some tests for 'concurency', etc.

Log in or register to post comments

Comment #39

6 January 2012 at 20:00

Status:	Fixed	» Closed (fixed)
Issue tags:	-Release blocker

Automatically closed -- issue fixed for 2 weeks with no activity.

Log in or register to post comments

Comment #40

6 January 2012 at 20:00

Issue tags:

+Release blocker

Restoring issue tags, see #2125755: System messages removed all issue tags during D7 upgrade.

Log in or register to post comments

Concurrently execute a single drush command on multiple sites (con'd)

Comments