Thank you Google, you twat
So, if you’d not noticed already, I converted my blog to WordPress yesterday. I know, I know, I told WordPress to suck my balls. Apparently things change, not least because of a half-dead install of Habari thanks to me installing an unstable night-release (don’t ask why, I don’t know).
It was actually quicker to migrate Habari back to WordPress than it was trying to wrangle my back-up onto the server, replacing the changed files etc and as time is a precious commodity these days that’s what I did.
Unfortunately, in my haste to get a working blog running, I forgot to block off the test install at /wordpress/. Google, while both magnificent and impressive in its speed, has pissed me right off by indexing shitloads of content under /wordpress/ … I don’t even begin to understand how it FOUND it, given that it wasn’t linked from anywhere.
Anyway, point being my SERPs and feed will be all over the shop for a while until things settled down. I can only apologise.
(Note: this is not the end of my love affair with Habari … but more coming on that soon ;) )
Rhys said:
On 16 Feb at 2:13 pm
Did you block search engines from your fresh install? Caught me out a couple of times myself.
Jem said:
On 16 Feb at 2:21 pm
I did, but not before Google had already crawled a load of pages. It was ridiculously quick .. I’m talking minutes!
Goosh said:
On 16 Feb at 2:23 pm
Jem,
I had that problem with a /dev/ copy of a site recently. Simple things like Google Analytics installed prior to launch will be enough to trigger an index request.
I recommend Simple 301 Redirects for WordPress (http://www.scottnelle.com/simple-301-redirects-plugin-for-wordpress/) to help you sync stuff up quickly if you haven’t already implemented a soft 301 in your .htacces. Follow that up with a Google Webmaster account and clear up the slops that drip back through.
Bonza! Pain in the arse, but will sort itself out in a few days.
Jem said:
On 16 Feb at 2:25 pm
I already have the following in my htaccess:
;)
Goosh said:
On 16 Feb at 2:26 pm
Perfect ;)
neal said:
On 16 Feb at 3:38 pm
you might want to use the url removal tool to remove the directory from google’s results – as i found out after i had a site hacked by spammers (i caught it and deleted the content within a day).
i had 12,500 articles posted and they still keep showing up in site:example.com searches because google must be picking up traces from within its own database. now moving all the real content to another directory and deleting the current /content/ dir from google.
google doesn’t realise it has gone because drupal returns 200 even for page not found, instead of a proper 404. be wary in case wordpress does the same!
Jem said:
On 16 Feb at 3:40 pm
Cheers Neal, will do
Have got my 404 with a proper 404 header though, which should hopefully help too. :)
Stephanie said:
On 16 Feb at 8:18 pm
Wait… the Jem that refused to use WordPress and once coded her own CMS gave in? WHAT!?! What is the world coming to?
Jem said:
On 16 Feb at 8:27 pm
I know. Blame Isabel :P
Cara said:
On 17 Feb at 12:43 am
Hey! You have an amazing site! I love your scripts!
OUch, I’m sorry to hear about that. Thankfully I’ve never encountered that problem *flew!* Best of luck!!!
Rhys said:
On 17 Feb at 3:34 am
At work we were having issues with Google indexing our test sites – so until we figured it out and starting adding ‘no-index’ meta tags, we had to set up a heap of 301 redirects for each of those sites.
One time customers started ordering things through a test site well before it was even finished! The developers were totally confused about the new “test” data.
Aimée said:
On 17 Feb at 8:45 am
Stop teasing us with Habari hints! I’m dying to know of this secret project.
Rebecca said:
On 22 Feb at 4:52 pm
As far as I know/can tell, Google implements a bunch of fancy heuristics/rules for common website directories. For example, /wordpress/ is pretty damn common so it will make a bee-line for that. Perhaps naming a wordpress install subdirectory to /monkeypants/ instead will give Mother Google a challenge! :)
Vera said:
On 28 Feb at 7:18 pm
Hm… interesting. Every time I finally start thinking that maybe (just maybe WP might be a bit bloated for me, and look Jem’s converted a loooong time ago)… you go back to WP.
Hm: what gives? :o
Also I’m eagerly interested in the Habari affair continuation. :D