View Full Version : Help with an annoying bot.
Ben E Lou
10-08-2011, 06:22 PM
I'm getting hit hard by a bot from MSN that's really gumming up my web server, and blocking the IP doesn't seem to help.
<table border="2" cellpadding="1" cellspacing="1" width="510"><tbody><tr><th align="center" bgcolor="#C0C0C0">#</th> <th colspan="2" align="center" bgcolor="#008040">Hits</th> <th colspan="2" align="center" bgcolor="#0080FF">Files</th> <th colspan="2" align="center" bgcolor="#FF0000">KBytes</th> <th colspan="2" align="center" bgcolor="#FFFF00">Visits</th> <th align="center" bgcolor="#00E0FF">Hostname</th></tr> <tr> <td align="center">1</td> <td align="right">31004</td> <td align="right">43.92%</td> <td align="right">0</td> <td align="right">0.00%</td> <td align="right">0</td> <td align="right">0.00%</td> <td align="right">61</td> <td align="right">9.55%</td> <td align="left" nowrap="nowrap">msnbot-65-52-108-68.search.msn.com</td></tr></tbody></table>
That's just in October. 31,000+ hits already. Looking at my MySQL logs, this bot is hitting dozens of php scripts every second when it's doing its thing. That, uh, slows things down tremendously. Any ideas as to how I can get rid of this thing?
Ben E Lou
10-08-2011, 06:23 PM
Oh, and I've already got a robots.txt file in place:
http://www.younglifenorthdekalb.com/robots.txt
That doesn't seem to be helping. :(
panerd
10-08-2011, 06:43 PM
You sure that just isn't Mizzou Basketball Fan in the NCAA Expansion thread? :)
MacroGuru
10-08-2011, 06:47 PM
You might have to get specific with the bot in your robots.txt file as it is known to ignore the wildcard command
User-agent: MSNBot
Disallow: /
Oh, and honestly, the MSNBot is the only one that tends to be a hog..
Ben E Lou
10-08-2011, 06:54 PM
OK, I'll try that.
Ben E Lou
10-08-2011, 07:03 PM
Heh. That was fast. Made that change and CPU Usage dropped from 100% to 3.6% in about 20 seconds. Thanks!
MacroGuru
10-08-2011, 07:05 PM
Your Welcome
RendeR
10-08-2011, 07:27 PM
Is Robots.txt supposed to sit in the root dir or in the www or Public_html dir?
I thought I had mine in there but it appears to be missing.
Any suggestions on a good all around robot stopper would be huge too.
DanGarion
10-08-2011, 07:52 PM
Is Robots.txt supposed to sit in the root dir or in the www or Public_html dir?
I thought I had mine in there but it appears to be missing.
Any suggestions on a good all around robot stopper would be huge too.
It has to site in the root of the website, a publicly viewable location.
CrimsonFox
10-08-2011, 08:31 PM
Awesome fix! Thanks for looking into that! Yeah the Werewolf forum has 30-40 guest bots in it every night :)
Hopefully that will clear up. Great tip Macro
Ben E Lou
11-04-2011, 05:03 AM
:mad:
<table border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="q7 label" nowrap="nowrap" width="180">CPU Usage</td> <td class="q7_" valign="top" width="16">https://108.59.255.76:4643/vz/skins/winxp.silver/icons/red_16.gif</td> <td class="q7_" nowrap="nowrap"><table border="0" cellpadding="0" cellspacing="0" width="170"><tbody><tr> <td class="qD">https://108.59.255.76:4643/vz/skins/winxp.silver/images/m10.gif</td> <td align="left" nowrap="nowrap" width="100%">100%</td> </tr></tbody></table></td> </tr> <tr> <td class="q8 label" nowrap="nowrap" width="180">CPU Load Average</td> <td class="q8_" valign="top" width="16">https://108.59.255.76:4643/vz/skins/winxp.silver/icons/green_16.gif</td> <td class="q8_" nowrap="nowrap">29.92, 31.53, 33.65</td></tr></tbody></table>
Same deal. Major robot attacks and I'm not sure how to block them. I've done IP Deny in cPanel, but it appears that they still hit the MySQL server. The process manager shows me that there are dozens of queries running right now, far more than can be attributed to human traffic. Any ideas?
Ben E Lou
11-04-2011, 05:24 AM
Dola:
To be clear, it's not msnbot this time. 88.131.106.22, which I have blocked in cPanel, is a major culprit.
Ben E Lou
11-04-2011, 05:30 AM
Several that start with 180.76.5 are also problematic.
MacroGuru
11-04-2011, 06:17 AM
Dola:
To be clear, it's not msnbot this time. 88.131.106.22, which I have blocked in cPanel, is a major culprit.
This is the entireweb bot, which I do not know why it would be hitting your DB, that is weird.
The 180.76.5 seems to be the baidu spider which is from China and pretty aggressive
I don't know if this article can help you set yours server to block them..
httpx://johannburkard.de/blog/www/spam/introduction-to-blocking-spambots-and-bad-bots.html
wade moore
11-04-2011, 07:36 AM
Someone please find a solution, this is painful! ;)
cuervo72
11-04-2011, 07:55 AM
Several that start with 180.76.5 are also problematic.
Yeah, I had to block this one in cPanel for FOBL just yesterday.
JediKooter
11-04-2011, 11:23 AM
:mad:
<table border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="q7 label" nowrap="nowrap" width="180">CPU Usage</td> <td class="q7_" valign="top" width="16">https://108.59.255.76:4643/vz/skins/winxp.silver/icons/red_16.gif</td> <td class="q7_" nowrap="nowrap"><table border="0" cellpadding="0" cellspacing="0" width="170"><tbody><tr> <td class="qD">https://108.59.255.76:4643/vz/skins/winxp.silver/images/m10.gif</td> <td align="left" nowrap="nowrap" width="100%">100%</td> </tr></tbody></table></td> </tr> <tr> <td class="q8 label" nowrap="nowrap" width="180">CPU Load Average</td> <td class="q8_" valign="top" width="16">https://108.59.255.76:4643/vz/skins/winxp.silver/icons/green_16.gif</td> <td class="q8_" nowrap="nowrap">29.92, 31.53, 33.65</td></tr></tbody></table>
Same deal. Major robot attacks and I'm not sure how to block them. I've done IP Deny in cPanel, but it appears that they still hit the MySQL server. The process manager shows me that there are dozens of queries running right now, far more than can be attributed to human traffic. Any ideas?
They're inside the house..GET OUT!
I've got nothing. Sorry. :)
Ben E Lou
11-05-2011, 05:39 AM
:mad:
<table border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="q7 label" nowrap="nowrap" width="180">CPU Usage</td> <td class="q7_" valign="top" width="16">https://108.59.255.76:4643/vz/skins/winxp.silver/icons/red_16.gif</td> <td class="q7_" nowrap="nowrap"><table border="0" cellpadding="0" cellspacing="0" width="170"><tbody><tr> <td class="qD">https://108.59.255.76:4643/vz/skins/winxp.silver/images/m10.gif</td> <td nowrap="nowrap" align="left" width="100%">100%</td> </tr></tbody></table></td> </tr> <tr> <td class="q8 label" nowrap="nowrap" width="180">CPU Load Average</td> <td class="q8_" valign="top" width="16">https://108.59.255.76:4643/vz/skins/winxp.silver/icons/green_16.gif</td> <td class="q8_" nowrap="nowrap">29.92, 31.53, 33.65</td></tr></tbody></table>
Same deal. Major robot attacks and I'm not sure how to block them. I've done IP Deny in cPanel, but it appears that they still hit the MySQL server. The process manager shows me that there are dozens of queries running right now, far more than can be attributed to human traffic. Any ideas?
Maybe it just took a while for the blocks I put in place to populate? *shurg*
<table border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="q7 label" nowrap="nowrap" width="180">CPU Usage</td> <td class="q7_" valign="top" width="16">https://108.59.255.76:4643/vz/skins/winxp.silver/icons/green_16.gif</td> <td class="q7_" nowrap="nowrap"><table border="0" cellpadding="0" cellspacing="0" width="170"><tbody><tr> <td class="qD">https://108.59.255.76:4643/vz/skins/winxp.silver/images/m0.gif</td> <td nowrap="nowrap" align="left" width="100%">4.2%</td> </tr></tbody></table></td> </tr> <tr> <td class="q8 label" nowrap="nowrap" width="180">CPU Load Average</td> <td class="q8_" valign="top" width="16">https://108.59.255.76:4643/vz/skins/winxp.silver/icons/green_16.gif</td> <td class="q8_" nowrap="nowrap">0.34, 0.26, 0.21</td></tr></tbody></table>
:thumbsup:
vBulletin v3.6.0, Copyright ©2000-2026, Jelsoft Enterprises Ltd.