On a particular site we're using this module on, they have a back-log of around 130,000 articles that need to be tagged. Unless I'm missing a big clue-stick, the only way for me to import these is to start up a browser window on my computer, start the batch API process under the settings page, and then let it run over the course of a week. :P

It would be awesome if this could be done in a cron job instead from the shell so that there's not a risk of my browser crashing or me accidentally quitting it while it's in the middle of the job.

Comments

febbraro’s picture

Assigned: Unassigned » febbraro

hey webchick.

There are 2 options here.

#1 the bulk process will pickup where it left off so if it does fail you dont have to start all over, not ideal, but might work for some people in some cases.

#2 Someone else suggested this, but I have not had time to look into it. #409630: Create a way to do mass-importing without batch API any chance that code could be the basis of some of this or at least a test to see if you can make that happen for your specific use case faster than I can get around to implementing it?

Love the idea though, glaring hole in the whole bulk process thing.

webchick’s picture

Status: Active » Closed (duplicate)

Oh, sure. That would work smashingly! And go ahead and keep the browser processing in, since it's nice to give people a choice if they're on a site without 100,000 nodes.

I'll mark this issue as a duplicate. Sorry. I tried searching for "batch" and should've searched for "bulk." :)