Register Members List Search Today's Posts Mark Forums Read

Reply
 
Mod Options
Ban Spiders by User Agent Details »
Ban Spiders by User Agent
Mod Version: 3.1.3, by Simon Lloyd (Coder) Simon Lloyd is offline
Developer Last Online: Aug 2019 I like it Show Printable Version Email this Page

vB Version: 3.8.x Rating: (14 votes - 4.93 average) Installs: 134
Released: 09 Jun 2011 Last Update: 18 Dec 2014 Downloads: 548
Supported Uses Plugins  

What this mod does
With this mod you can enter User Agents to watch or ban, you can also recieve emails or have an Output.txt created and updated with time and date of visits. It doesn't just have to be spiders, you can watch, log or ban any useragent!

How to install
Simply import the product ban_spider, the mod is active by default but none of the other options are turned on.

What is a UserAgent?
http://en.wikipedia.org/wiki/User_agent

Understanding a UserAgent string
http://user-agent-string.info/parse

Genuine User Getting Blocked?
http://www.vbulletin.org/forum/showp...&postcount=105

Tools to help
http://whatsmyuseragent.com/SwitchingUserAgents.asp
http://www.botsvsbrowsers.com/SimulateUserAgent.asp

FAQ
http://www.vbulletin.org/forum/showp...&postcount=137

What's a bot?
http://en.wikipedia.org/wiki/Spambot

How do i ban a bot?
http://www.vbulletin.org/forum/showp...&postcount=318
http://www.vbulletin.org/forum/showp...7&postcount=51

Where's output.txt located?
http://www.vbulletin.org/forum/showp...&postcount=216

Bad bot lists
http://www.vbulletin.org/forum/showp...&postcount=259
http://www.vbulletin.org/forum/showp...&postcount=224
http://www.vbulletin.org/forum/showp...&postcount=281

VB4.x Version of Ban Spiders By User Agent

Tested on vb3.7.x, vB3.8.x but should work on any version.

____________________________________________________________________
Special thanks to:
Lior
KH99
BoP5
for helping me sort out a few issues

...and beta testers

ForceHSS (Special thanks to Force for latest testing)
ozzy47
GreyHost

If you use this please mark as INSTALLED

History
9th June 2011 Orginal xml added
12th June 2011 Added both email notification and text file logging
22nd June 2011 Version 2.0.0, Added create thread on activity
  1. Added match facility you can now use something like Yandex and it will match MOZILLA/5.0 (COMPATIBLE; YANDEXBOT/3.0; +HTTP://YANDEX.COM/BOTS)
  2. Added clickable link to visited thread
22nd September 2011 added user redirect url selection
08th October Beta testing started for thread creation.
20th October Beta testing started for emailing.
21st October Beta testing complete Ver 3.0.0 uploaded
29th October minor fix added to cope with empty userid on thread creation
30th October Beta testing automatic redirection to spiders/bots IP
31st October New xml uploaded with automatic redirect to IP
25th November Minor fix for blank forumid fixed
26th November 2011 Fixed version check & create thread Off by default
17th December 2014 Version 3.1.0 uploaded, Extra logging and statistics added by Ozzy47 (Chris)
18th December 2014 Version 3.1.2 uploaded, due to rogue process from other mod
18th December 2014 Version 3.1.3 uploaded, due to previous one being VB4 mistakingly uploaded

The Bad Bots list is now included in the product
Please prune out all those that you wish to be able to see your site (i suggest you definately prune out "DA" and "Custo" :

Support will now only be given to those who have this mod marked as INSTALLED

Download Now

Only licensed members can download files, Click Here for more information.

Supporters / CoAuthors

Show Your Support

  • To receive notifications regarding updates -> Click to Mark as Installed.
  • If you like this modification support the author by donating.
  • This modification may not be copied, reproduced or published elsewhere without author's permission.
  #136  
Old 26 Jul 2013, 17:57
Max Taxable's Avatar
Max Taxable Max Taxable is offline
 
Join Date: Feb 2011
Originally Posted by Dan49 View Post
Thanks.

Another general question. I understand that malicious bots can fake the UA. Why don't they all fake it to display as Google? This way anybody would be hesitant to block them? BTW I do use your IP ban mod also.
Some of them do so, but it's very easy to spot those fakes since Google only uses one or two IP addresses. I am pretty sure there's a Mod for blocking spoofed google UA's somewhere.
Reply With Quote
  #137  
Old 26 Jul 2013, 21:42
Dan49 Dan49 is offline
 
Join Date: Feb 2012
I found this information about verifying Google bot. And used this tool http://ipadmin.junkemailfilter.com/rdns.php. Is this the best way? Or is there a list of the IP addresses google uses?

I'd appreciate a link to the mod you mentioned, I searched and can't find it.
Reply With Quote
  #138  
Old 26 Jul 2013, 22:46
valdet's Avatar
valdet valdet is offline
 
Join Date: Feb 2007
Real name: Valdet
Hi Simon,

Hope you got your forum problems sorted

I noticed that the mod will not create new threads on private forums, where only admins can post/read.
I am using my admin account as thread creator, so it has permissions to create threads. The threads are created only on public forums where other members and guests can see them, which of course isn't nice.

The output.txt file writing is also not logging anything.

But from my server logs, I see this mod has been doing incredibly well in blocking some nasty spiders (mainly Magpie and Spinn3r).

For those who don't check their server logs, I would advise to check them and see if Magpie is sucking up to 40% of bandwidth as it was until recently in one of my sites.
Reply With Quote
  #139  
Old 26 Jul 2013, 22:48
Max Taxable's Avatar
Max Taxable Max Taxable is offline
 
Join Date: Feb 2011
Originally Posted by Dan49 View Post
I found this information about verifying Google bot. And used this tool http://ipadmin.junkemailfilter.com/rdns.php. Is this the best way? Or is there a list of the IP addresses google uses?

I'd appreciate a link to the mod you mentioned, I searched and can't find it.
I only vaguely remember seeing such, and it was sometime back. And I might even be wrong.
Reply With Quote
  #140  
Old 26 Jul 2013, 23:15
Simon Lloyd's Avatar
Simon Lloyd Simon Lloyd is offline
 
Join Date: Aug 2008
Real name: Simon
Originally Posted by valdet View Post
Hi Simon,

Hope you got your forum problems sorted

I noticed that the mod will not create new threads on private forums, where only admins can post/read.
I am using my admin account as thread creator, so it has permissions to create threads. The threads are created only on public forums where other members and guests can see them, which of course isn't nice.

The output.txt file writing is also not logging anything.

But from my server logs, I see this mod has been doing incredibly well in blocking some nasty spiders (mainly Magpie and Spinn3r).

For those who don't check their server logs, I would advise to check them and see if Magpie is sucking up to 40% of bandwidth as it was until recently in one of my sites.
Glad it's working for you, on my forum I simply added a forum where only staf had permissions to view and made the threads there, I also had no issue with logging.

I did get my forum issues sorted, turns out it was a half assed script kiddie attempt to kill it but got it sorted (one of my forums allowed html in the post environment....really bad idea!). When I get chance (which will be after Thursday as my wife is having an op) I will retest the mod from scratch and post specifics here.
__________________
Kind regards,
Simon Microsoft Office Help
My Mods: Find my modifications here
Please do not pm me for support unless i have invited you to!
Reply With Quote
  #141  
Old 06 Oct 2013, 07:00
jl255 jl255 is offline
 
Join Date: May 2007
i suppose it shld be pretty safe to use the default list of spider list to ban on this plugin? i've no idea which to prune.....
Reply With Quote
  #142  
Old 06 Oct 2013, 08:12
Simon Lloyd's Avatar
Simon Lloyd Simon Lloyd is offline
 
Join Date: Aug 2008
Real name: Simon
There are quite a few lists posted throughout this thread, take a look at them, if you use the built in one then
Originally Posted by Simon Lloyd in Mod Description
Please prune out all those that you wish to be able to see your site (i suggest you definately prune out "DA" and "Custo" :
__________________
Kind regards,
Simon Microsoft Office Help
My Mods: Find my modifications here
Please do not pm me for support unless i have invited you to!
Reply With Quote
  #143  
Old 06 Oct 2013, 15:59
Max Taxable's Avatar
Max Taxable Max Taxable is offline
 
Join Date: Feb 2011
Originally Posted by jl255 View Post
i suppose it shld be pretty safe to use the default list of spider list to ban on this plugin? i've no idea which to prune.....
See if the default one helps the issue you're having, first. Is my suggestion. The default list is a good list.
Reply With Quote
  #144  
Old 11 Oct 2013, 13:30
Wajow-community Wajow-community is offline
 
Join Date: Dec 2009
Is there a mod for vb 4.x.x
Reply With Quote
  #145  
Old 11 Oct 2013, 13:36
Max Taxable's Avatar
Max Taxable Max Taxable is offline
 
Join Date: Feb 2011
Found on the developer's profile:

http://www.vbulletin.org/forum/showthread.php?t=268208
Reply With Quote
  #146  
Old 09 Nov 2013, 15:58
qpurser qpurser is offline
 
Join Date: Aug 2011
Really love this mod a lot and works great on 4.2.1

Finally Baidu is not showing up anymore in my "online users" list.

I saw another bot recently searching my forum and from some research it seems to be a bad one also: AhrefsBot http://blocklistpro.com/content-scra...o-spybots.html

I added this to my list:
Mozilla/5.0 (compatible; AhrefsBot/2.0; +http://ahrefs.com/robot/)
and
Mozilla*Ahref

Was this the correct way to do it?
Reply With Quote
  #147  
Old 09 Nov 2013, 16:51
Max Taxable's Avatar
Max Taxable Max Taxable is offline
 
Join Date: Feb 2011
Originally Posted by qpurser View Post
I added this to my list:
Mozilla/5.0 (compatible; AhrefsBot/2.0; +http://ahrefs.com/robot/)
and
Mozilla*Ahref

Was this the correct way to do it?
NO!!!



Including "Mozilla" and "compatible" in this list blocks just about the entire world!

Get rid of that entry and simply put: "AhrefsBot" in instead!

Last edited by Max Taxable; 09 Nov 2013 at 17:04.
Reply With Quote
  #148  
Old 09 Nov 2013, 18:57
qpurser qpurser is offline
 
Join Date: Aug 2011
Originally Posted by Max Taxable View Post
NO!!!

Including "Mozilla" and "compatible" in this list blocks just about the entire world!
Get rid of that entry and simply put: "AhrefsBot" in instead!
Thank you for the quick reply.
I was reading post #127 and 128 and thought as long there is something behind the "mozilla" in the same line there wouldn't be an issue. My mistake I guess
Reply With Quote
  #149  
Old 09 Nov 2013, 20:10
Max Taxable's Avatar
Max Taxable Max Taxable is offline
 
Join Date: Feb 2011
Originally Posted by qpurser View Post
Thank you for the quick reply.
I was reading post #127 and 128 and thought as long there is something behind the "mozilla" in the same line there wouldn't be an issue. My mistake I guess
You might be right but 1.) I wouldn't risk it, and 2.) it's not necessary.
Reply With Quote
  #150  
Old 09 Nov 2013, 21:31
Simon Lloyd's Avatar
Simon Lloyd Simon Lloyd is offline
 
Join Date: Aug 2008
Real name: Simon
When banning whatever you add to the list will be looked for in its entirety, so if you enter "today" then it will ban:
Today
thisdayistoday
wheretodayis
one day today

if you entered "here today" then it will not ban:
one day today
thisdaytoday
today....etc

but it will ban:
we were here today
allhere todayagain....etc

When banning bots make sure you go to WOL and check out their UA as it may not contain their name in the UA.
__________________
Kind regards,
Simon Microsoft Office Help
My Mods: Find my modifications here
Please do not pm me for support unless i have invited you to!
Reply With Quote
Reply


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Mod Options

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off


New To Site? Need Help?

All times are GMT. The time now is 05:50.

Layout Options | Width: Wide Color: