I am assigning myself this task. I will make a patch that allows us to archive hadoop.log with a timestamp or delete the archives or current hadoop.log
This allows us to clean the hadoop log in the interface so we don't have to got to sshd. Also, gives the option to archive the log.
Comments
Comment #1
dstuart commentedHey maxmmize,
To note Nutch automatically does this on a daily basis do you envision it needs to be done more regularly than this
Regards,
Dave
Comment #2
maxmmize commentedWell, for me, I monitor every crawl. I have to run a small crawl at like a max of 10 links for 1 URL, then abotu 20-40 just to make sure the URL is being crawled correctly. I suppose once I get my fetch lists down after a month or so it won't be necessary.
Since I am always monitoring my crawls, I never ran into the function you described.
Now that I have your information it seems either redundant or a blessing depending on what part of the stage of crawling and indexing you are at.
On the other hand, for testing crawl scripts and such it seems like a good admin tool, not that a shell script couldn't achieve the same thing though.
Final thoughts?
Comment #3
dstuart commentedHey Maxmmize,
By all means, its a useful feature that has a good use case, I was ensuring you weren't wasn't effort if the above fit your requirements.
Regards,
David
Comment #4
maxmmize commentedHere it is, kind of heavy and out of shape a bit but functions more or less what I wanted to achieve. Probably a better way to display it though.
Start a crawl and click on hadoop.log. When you submit a new crawl the current hadoop is timestamped then moved and then hadoop is cleared. Needs more logic, like, delete log and perhaps a drop down list instead. Have to work with a bit to see what I really want. Anyway, gimme feedback when you get a chance.
nutch.admin.inc
Comment #5
maxmmize commentedHas anyone tried this out yet? It works for me but maybe others have some opinion on functionality.
Comment #6
avpadernoI am closing this issue, since Drupal 6 isn't supported anymore.