So, the story is, I maintian a multiple site install using CVS, which seemed helpful for the longest time making updates a snap, but as of late, it has caused me some grief... the latest being that I recently updated Drupal with CVS, and it must have dumped the Robots.txt file back in the root directoy. I had deleted it from their so I could use this module to manage robots for each site. It works great... until my stupid human error trick. So now, I have a site, that was a prank on a friend... but much to my dismay (and my friends), i recognized the robots.txt file mishap, and the site was cached.

My question to you all is, what would be the better strategy for getting the cached info off of Google's search results?

I already put up a proper "No Robots" file, and put in a request to have the cache cleared, but it has been "Pending" for a couple days now. I was wondering if it might be better to put up a page that is less, er, funny, and have let the Google Bot in to have a scan of it, then after that put the request back in to google to have it kill the cache.

Anyone have any opinions on the best method of getting down the cache. Allow bots, let scan again, then, when cached material is newer, disallow robots, OR, leave it as it is, and wait for Google to clear it. Google will not clear the cache if the site is still up and allowing robots access. So, I have to do one or the other...

Comments

hass’s picture

Status: Active » Fixed

Use nodewords module and disable caching for future (I'm also doing this). For the rest there are the webmaster tools... if you need to get out for a very long time this is the way - otherwise set all pages to 404 and your content is kicked out from SERPs in max ~4-12 weeks.

kpm’s picture

Status: Fixed » Closed (fixed)

Thanks for the info. It didn't actually take too long to get it cleared using Webmaster Tools... about a week.