Category Archives: tech

Google Instant – UnBlacklist

Google Instant was rolled out earlier this month, allowing users to see the search results for words as they are typing them. I won’t bore with the details, as if you haven’t seen it already, there are plenty of blog posts out there showing it off in great detail! There has also been a rush of talk about the dreaded “blacklist” that Google has applied to Instant, and how seemingly reasonable searches will not show up unless you complete the search word and click “Search” or hit Enter.

In light of the “naughty” filter (as it’s being referred to), 2600 (the hacker group) have begun compiling a list of words that have been blacklisted by Google Instant, fairly or not. At the end of said blacklist, is a list of words which are not blacklisted, even though they would do well to be, given the guidelines Google seems to be following at the moment. You can, for example, search unhindered for the phrase “how to commit suicide” without the big G sticking its beak in.

Hungry to know more, I quickly thought of a word, to see which list it would fit in… out of curiosity. Imagine my surprise, when searching for “clung” – as in: “the climber clung to the cliff-face for dear life after his safety line snapped leaving him dangling above a great big gorge” – I was given the following suggestion from Google, as well as a full Instant result:
Google Instant Search - Clung

Note the lightly shaded letter “e” urging me to complete the search.

I also found the third result for the full, dirty, uncensored search reasonably amusing!
Google Instant Search - Clunge

But you can’t search for “teen”!

Worst Captcha Ever

I’ve never been one to submit my blog to any directories, link directories, etc, as I’ve never felt like monetising it. It’s a site I run for fun and it goes through phases of being updated fairly regularly, to not at all and back again.
I recently, however, was reading a blog post about a number of blog directories, and decided to look through a couple of them. After browsing around Blogged for a while, I decided to sign up and see if the site would send any traffic this way. It was here that I encountered possibly the worst CAPTCHA I’ve ever personally come across:
Worst ever CAPTCHA image

Incidentally, what I put in was accepted, so I’m either very lucky, or the second part of the image is ignored!

Can you beat it?

Telefonica Router – Comtrend CT-5365

Anyone in Spain who has an ADSL connection through Telefonica (and a slight amount of IT knowledge) will understand the frustrations caused by “Asistente De Configuración” the application to access your “Router Inalámbrico”.

This Telefonica specific application for configuring your Comtrend or Zyxel router is the only way of accessing the box, which can be frustrating as it is fairly limited. Those of you with a Zyxel can try a number of default username and password combinations (admin/1234, admin/admin, root/root) upon accessing your router’s IP through http and will usually find some success. Trying to access your Comtrend (mine’s a CT-5365) router’s IP address through http will prompt a login screen which none of the default passwords you’ll find online will work on.

It seems, however, that the password you set up through the configuration assistant is tied back into the Comtrend security, and after a moment of genius (read: luck) attempting the old password (1234) as a username, followed by the password you set through Asistente de Configuración (I’m not giving you mine) will let you in through in both your browser, and through SSH.

For those of you who have had enough of your telefonica router, there is absolutely nothing stopping you from buying a new one and configuring it. Something like a D-Link ADSL2/2+ would suit your needs perfectly, and will have better wireless connectivity to boot.

You will need to call Telefonica to get the settings for your connection on 900 502 010 and this must be from the telephone line your TelefonicaADSL runs on.

UPDATE I have now written a post on how to set up your replacement telefonica router.

IT Services on the Costa del Sol

IT World - IT Services on the Costa del SolA bit of flagrant promotion for some mates of mine, but Les and Darren are in the process of launching a new company offering IT products and services. Based near Duquesa, in Sabanillas on the Costa del Sol, they’re going to be offering all manner of IT services including new hardware and software, networks and cabling including the installation of broadband (which, believe you me, can be a nightmare in Spain) and setting up your wireless network.

Darren is also a qualified and experienced web designer so those looking for a website are serviced here too, and who knows, I may even jump on the bandwagon and help with SEO and PPC management ;)

Les has a serious IT background and has just moved over to the Costa bringing a wealth of knowledge with him, and I have no doubt that IT World is going to be a great addition to the area. There are very few people offering IT services properly out here (although their are a good number of shysters operating on the coast) so I really think the guys are filling a much needed gap in the market and I’m wishing them the best of luck for the future!

Robots.txt is case sensitive!

What are robots?

Robots, crawlers, spiders or agents are programs which are used to traverse the wobbly world wide web automatically, taking note on which content is where. Search engines use these programs to index content for their indices, spammers use them to scrape content for their own sites, or even to crawl the web for email addresses to spam you even more. Understanding of how search robots work is an intrinsic part of SEO!

What is robots.txt?

Robots.txt is a file which, when placed in the root of a publicly available webserver, tells search engine robots and agents which content they can and cannot/should and shouldn’t access. It can be used to block access to members’ only directories for example, or pages which nobody should really be finding through search engines, or, as I was trying to do earlier today, block search engines from indexing pages which are a part of your affiliate system or associated with an affiliate ID.

What is an affiliate system?

Many e-businesses run affiliate systems, which can be described as an semi-automated process by which the e-business takes referrals from partner websites, and remunerates them for any transactions that result from the referral. Having worked in the online gaming industry for over 10 years now, I’m quite familiar with many of them, as they an intricate part of just about every online gambling business model.

Most affiliate systems use standard html links which contain affiliate ID parameters in order to track referrals from their partner or affiliated sites. Anyone who clicks the link which contains the affiliate parameters will be associated with that affiliate’s account. If they go on to purchase something from the site, the affiliate will take their share of that revenue.

Search engine optimisation and brand protection

This is leading somewhere, I promise!

Many companies, when taking on an SEO consultant, agency or in-house employee, go straight for the proverbial jugular. They want to target the big, juicy keywords which will drive mountains of good, converting, valuable traffic. Because of this, they usually overlook the basics, ensuring you’re dominating the search results for your brand names. Imagine, if you will, the panic in the office this morning, when after no more than a few hours, I spot a discrepancy in our brand term search results: an affiliate tracked URL is ranking in second spot, taking a nice bounty per referral as well as a share of any future revenue from any clients that came through that link!

SEO, brand protection and affiliate URLs

Whoever was in here before me, had not taken the time to ensure that search engines, and especially Google (ye olde search dominator) were not allowed to index URLs tagged with affiliate IDs. Due to the high volume of traffic this affiliate was sending through his affiliate ID, that URL got indexed for our brand name, and is still ranking in second place. I immediately submitted a removal request, and asked for a change to the robots.txt to ensure that affiliate parameters were blocked from this point forward… the reply: “what robots.txt?”

Robots.txt to block affiliate IDs

Given the fact the content/web developer people here hadn’t implemented one, and the urgency required to get this resolved, I quickly typed up a robots.txt file for them to upload… ran it through the nicely-provided-by-Google testing utility:

Allowed by line 5: Disallow:

But what about line 6 you stupid test tool? The one that says:

Disallow: /?affid=*

No amount of fiddling would get it to work! I tried and tried and tried. And then I tested a second URL, one I typed up myself, and not copied and pasted:

Blocked by line 6: Disallow: /?affid=*

And that is when it struck me… affid and affId are two completely, and utterly different things according to robots.txt… why? Because ROBOTS.TXT IS BLOODY CASE SENSITIVE!