Thank you Google, you twat

So, if you’d not noticed already, I converted my blog to WordPress yesterday. I know, I know, I told WordPress to suck my balls. Apparently things change, not least because of a half-dead install of Habari thanks to me installing an unstable night-release (don’t ask why, I don’t know).

It was actually quicker to migrate Habari back to WordPress than it was trying to wrangle my back-up onto the server, replacing the changed files etc and as time is a precious commodity these days that’s what I did.

Unfortunately, in my haste to get a working blog running, I forgot to block off the test install at /wordpress/. Google, while both magnificent and impressive in its speed, has pissed me right off by indexing shitloads of content under /wordpress/ … I don’t even begin to understand how it FOUND it, given that it wasn’t linked from anywhere.

Anyway, point being my SERPs and feed will be all over the shop for a while until things settled down. I can only apologise.

(Note: this is not the end of my love affair with Habari … but more coming on that soon ;) )

14 Comments

  1. Did you block search engines from your fresh install? Caught me out a couple of times myself.

  2. Jem

    16 Feb at 2:21 pm

    I did, but not before Google had already crawled a load of pages. It was ridiculously quick .. I’m talking minutes!

  3. Jem,

    I had that problem with a /dev/ copy of a site recently. Simple things like Google Analytics installed prior to launch will be enough to trigger an index request.

    I recommend Simple 301 Redirects for WordPress (http://www.scottnelle.com/simple-301-redirects-plugin-for-wordpress/) to help you sync stuff up quickly if you haven’t already implemented a soft 301 in your .htacces. Follow that up with a Google Webmaster account and clear up the slops that drip back through.

    Bonza! Pain in the arse, but will sort itself out in a few days.

  4. Jem

    16 Feb at 2:25 pm

    I already have the following in my htaccess:

    # thank you google, you piece of shit
    RedirectMatch 301 /wordpress/(.*) https://www.jemjabella.co.uk/$1

    ;)

  5. you might want to use the url removal tool to remove the directory from google’s results – as i found out after i had a site hacked by spammers (i caught it and deleted the content within a day).

    i had 12,500 articles posted and they still keep showing up in site:example.com searches because google must be picking up traces from within its own database. now moving all the real content to another directory and deleting the current /content/ dir from google.

    google doesn’t realise it has gone because drupal returns 200 even for page not found, instead of a proper 404. be wary in case wordpress does the same!

  6. Jem

    16 Feb at 3:40 pm

    Cheers Neal, will do

    Have got my 404 with a proper 404 header though, which should hopefully help too. :)

  7. Wait… the Jem that refused to use WordPress and once coded her own CMS gave in? WHAT!?! What is the world coming to?

  8. Jem

    16 Feb at 8:27 pm

    I know. Blame Isabel :P

  9. Hey! You have an amazing site! I love your scripts!

    OUch, I’m sorry to hear about that. Thankfully I’ve never encountered that problem *flew!* Best of luck!!!

  10. At work we were having issues with Google indexing our test sites – so until we figured it out and starting adding ‘no-index’ meta tags, we had to set up a heap of 301 redirects for each of those sites.

    One time customers started ordering things through a test site well before it was even finished! The developers were totally confused about the new “test” data.

  11. Stop teasing us with Habari hints! I’m dying to know of this secret project.

  12. As far as I know/can tell, Google implements a bunch of fancy heuristics/rules for common website directories. For example, /wordpress/ is pretty damn common so it will make a bee-line for that. Perhaps naming a wordpress install subdirectory to /monkeypants/ instead will give Mother Google a challenge! :)

  13. Hm… interesting. Every time I finally start thinking that maybe (just maybe WP might be a bit bloated for me, and look Jem’s converted a loooong time ago)… you go back to WP.

    Hm: what gives? :o

    Also I’m eagerly interested in the Habari affair continuation. :D