Polite Spammers

by john on January 26, 2006

I just love how polite the blog comment spammers are these days. All the comment spam I get starts off with phrases like:

Very interesting! I liked it!
i like your blog. it is a very interesting one.
You have some really cool stuff at your site. I’m sure gonna come back here.
you have a very talented and skilled writting. i had a great time reading your comments.
Excellent! I enjoyed reading your material.
I really am impressed by your site. Very original
Nice post. I’ll return.
A very friendly site. Have a nice day!
Your website is wonderfull. I’ll come visit again.
Great blog. It’s nice to be here!
hi. just letting you know that i enjoyed your site.
Two thumbs up!!!
Some friends told me about this site, and now i’m glad they told me about it.
I’m asking myself: How can it be that I’ve never ran through your site before? It’s a great one!

Then that leads into lines of complete gibberish:

to Roll Grass you should be very Lazy
White Boy is always Industrious Pair
Greedy, Memorizing, Greedy nothing comparative to Superb
when Cosmos Compute Circle Hope
Big Soldier becomes Industrious Cards in final
when Round Destroy Grass Kill
Loose Anticipate Steal – that is all that Chair is capable of

In fact, now that I am looking at all this spam at once, it is easy to see the underlying template being used. Almost all of the spam I am getting follows this format (this is on my WordPress blogs, not here):

Author : (some random name)
E-mail : (some random email address that doesn’t look suspicious)
URI : (some popular URI like www.theonion.com or www.usatoday.com)

Comment:
(personalized friendly greeting such as the above) (complete gibberish) (URL #1 they are pushing) (More complete gibberish) (URL #2 they are pushing)
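
To make the template concrete, here is a rough Python sketch of a check for it: a comment that opens with one of the stock greetings and contains at least two URLs. The greeting list is just a handful of the openers quoted above, and the function name and the two-URL threshold are my own assumptions for illustration, not how WordPress or any real spam filter works.

```python
import re

# A few of the polite openers quoted earlier in this post (lowercased).
# This list is illustrative only; real spam uses many more variants.
GREETINGS = (
    "very interesting",
    "i like your blog",
    "nice post",
    "great blog",
    "two thumbs up",
    "excellent",
)

# Loose URL pattern: anything starting with http://, https:// or www.
URL_RE = re.compile(r"(?:https?://|www\.)\S+", re.IGNORECASE)


def looks_like_template_spam(comment: str) -> bool:
    """Return True if the comment opens with a stock greeting
    and pushes at least two URLs, as in the template above."""
    text = comment.strip().lower()
    starts_politely = text.startswith(GREETINGS)
    url_count = len(URL_RE.findall(comment))
    return starts_politely and url_count >= 2
```

A check like this would obviously catch some sincere commenters who happen to say "Nice post" and paste two links, so at best it is one signal for a moderation queue rather than a verdict.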

It is quite likely that I am the last blogger on earth to notice this.

{ 4 comments }

tom sherman January 27, 2006 at 10:30 am

You probably know this, but the “complete gibberish” comments are complete gibberish to fool Bayesian spam filters.

john January 27, 2006 at 10:55 pm

Actually I hadn’t considered that, but it makes sense, sort of. I say sort of because I would think a good Bayesian filter would recognize nonsense phrases and use that as an indicator. For example, the only hit on Google for “when Cosmos Compute Circle Hope” is this post. Ditto “Big Soldier becomes Industrious Cards in final” and probably all the others. A good filter would find no hits on Google for such a phrase, and that would be an almost 100% sure sign of spam. Until they stop doing that, I guess. Weird science.
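
A rough sketch of that idea in Python, with a local sample of ordinary text standing in for Google (which is an assumption on my part, as are the function names): count how many adjacent word pairs in a phrase never appear in the reference text. Note this is a simple novelty score, not a Bayesian filter.

```python
def bigrams(text: str) -> list:
    """Split text into lowercase words and return adjacent word pairs."""
    words = text.lower().split()
    return list(zip(words, words[1:]))


def unseen_ratio(phrase: str, reference_text: str) -> float:
    """Fraction of word pairs in `phrase` that never occur in
    `reference_text`. 0.0 means every pair is familiar; 1.0 means
    none of them are. Phrases shorter than two words score 0.0."""
    known = set(bigrams(reference_text))
    pairs = bigrams(phrase)
    if not pairs:
        return 0.0
    unseen = sum(1 for pair in pairs if pair not in known)
    return unseen / len(pairs)
```

With a tiny reference text this flags almost anything as novel, so it would only be useful against a large body of normal comments, and a spammer who stitches together real sentences would slip right past it.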

GPSOkie February 12, 2006 at 3:15 pm

Interesting. I just suffered my first blast of spam on my blog this week. The same person tried to make about 25 comments on my postings, but I have them moderated. Fortunately, I was able to go into the dashboard and reject all the comments. Still, it was a bit of a pain because I got all the associated emails requesting moderation of the comments left.

Murk February 26, 2006 at 6:05 am

Another reason for the gibberish COULD be so that they can search for it in Google, and see which sites are open….

… if so, you could see your spam going up by publishing this!

Of course, I may be wrong.
