MonthMay 2012

Language Detection Algorithm

My thesis required dealing with large quantities of Twitter data and to make it easier for myself I decided to only use English language tweets. Due to the ‘interesting’ grammar and spelling that is frequently found on line a different algorithm to a standard dictionary test was required. This one checks for common English n-grams (letter groupings) and returns a value of how sure the input is English.

Here’s some examples:

Good morning everyone! : 66%
Im So Sick. Really in a bad Position : 50%
Thats the fear of unicorns : 100%
Gnt too com uma saudade daminha namorada vcs nao tem ideia :(((( : 0%
Ils testent les hologrammes pour faire des concerts par des morts. Bientôt même les pas morts feront des hologrammes et vous irez les voir. : 12%
il7ain ilnass ykhl9un men habat twitter o yntqlun to istagram=))!!! : 40%

 

Python Feed Forward Perceptron

I thought I would post a couple of useful bits of code from my final year of university.

The first is a feed forward perceptron written in Python. I tried to expand the examples on the internet as much as possible to help me understand how they work.

Lego Landrover

Just seen a fantastic video of a Landrover Defender 110 built entirely from lego; not just that but it has a full five speed sequential gear box, high and low range and a choice between two and four wheel drive (not entirely authentic that part)!

It’s really worth watching just to see the inner workings and fantastic little details such as moving pistons. Some more information can be found on Sheepo’s blog.

Also this seems like a pretty good excuse to post a picture of my full size version!

Pointless Error Messages

Gee, thanks Battlefield

So you’re sat there playing a game, about to pull off some spectacular moves and own some noobs when your game freezes, you fall through the floor and you are dumped back into the reality of the Windows Desktop. Quietly muttering some choice words about computers you discover a fantastic error message, informing you that ‘something went wrong’… Well, thanks error message, without your thoughtful take on everything I would have presumed to have teleported into a map that looks suspiciously like my desktop!

These error messages are becoming more and more common, and yes I fully understand that the alternative is the blue screen of death, which is equally unhelpful, but if you’re going to go to the effort of handling errors correctly without breaking everything why waste all that effort by just telling the user that something went wrong. How about, something went wrong and here is what it was?

Because I want a browser with sass

If it’s that important to try and hide the gory details of an error from the average user have a nice button called technical details. Average Joe isn’t going to click that, he still thinks an illegal operation involves the police.

How about some form of standard for displaying error messages? Maybe code red, yellow and green? Easy to understand for everyone.

Got any more examples of useless error messages?


Free Twitter Data Set

I needed a twitter data set for my thesis and struggled to find one that was freely available. I ended up downloading the data I needed so I thought I would release it here.  It does come in a slightly annoying format which is a MongoDB dump file but it should be easy enough to extract to there and use it. The structure of each record is shown below.

So if you wanted to go over all records in the database you would do it like this –

Download Twitter Mongo Dump

© 2026 Acodemics

Theme by Anders NorénUp ↑