192.ComAgent

February 8, 2008 15:39 by gareth

Whilst reviewing some error logfiles over the weekend, it became apparent that a rogue crawler called 192.comAgent was trawling through client websites. This crawler ignored the disallows in the robots.txt file.

To avoid the problems that this could cause, we have now added this to the list of banned user-agents. To complete this we detect the HTTP_USER_AGENT server variable and redirect undesirables to the custom error page.

 Problem over!


Currently rated 4.0 by 1 people

  • Currently 4/5 Stars.
  • 1
  • 2
  • 3
  • 4
  • 5

Related posts

Comments

February 8. 2008 16:28

Good Spot!!

With all the malicious internet usage out there today, you gotta think like a hacker, and who knows whether or not they adhere to the old crawler rules with things like robot.txt

wonder how many other people are experiencing something similar?

tim

February 8. 2008 16:53

Hard to say, but we control the SEO for this website, so the crawler has picked the website up from an external link and followed it in. In which case I would say this is very common.

gareth

August 14. 2009 14:49

thanks for this article!

inforger

October 3. 2009 07:31

What does this agent do?

games

Add comment


 

[b][/b] - [i][/i] - [u][/u]- [quote][/quote]



Live preview

September 3. 2010 14:06

Calendar

September 2010
SuMoTuWeThFrSa
2930311234
567891011
12131415161718
19202122232425
262728293012
3456789

Tags