Register Members List Search Today's Posts Mark Forums Read

Reply
 
Mod Options
Automatic Thread Tagger Details »
Automatic Thread Tagger
Mod Version: 1.2.0, by Phalynx (Coder) Phalynx is offline
Developer Last Online: Nov 2015 I like it Show Printable Version Email this Page

This modification is in the archives.
vB Version: 3.7.x Rating: (66 votes - 4.56 average) Installs: 841
Released: 16 Jul 2008 Last Update: 09 Jan 2009 Downloads: 5464
Not Supported DB Changes Uses Plugins Auto-Template Additional Files Translations  

Automatic Thread Tagger


Description
When a user submits a new thread this modification will automatically take keywords from the thread title and use these as tags. You can use Automatic Thread Tagger to propose the user AJAX tags for his new thread, or it assigns new tags after saving the new thread. It can add the translated thread prefix to the tags.
Additionally, you can tag existing threads via maintenance and also scheduled tasks.

This modification is a successor to the terminated Automatic Thread Tagger by MrEyes:
http://www.vbulletin.org/forum/showthread.php?t=179927

As an example, if a user submits a thread with a title of:
"Fish Food for Cats!"

The thread will be automatically tagged with:

- Fish
- Food
- Cats

If the user also submits an actual tag of "Fish" this will not be duplicated. Any rules you have setup for tagging will be respected.
If you choose to do so this product will also automatically tag threads created by incoming RSS feeds.

Demo
I cannot show you the process of creation, but here is a list of tags generated by Auto Thread Tagger:
http://www.insideearth.net/tags.php?langid=5
http://www.insidesupcom.de/tags.php?langid=1


Automatic Tagging of existing threads
You can tag existing threads via maintenance or scheduled task/cron. They will be created with a special flag so they can be easily identified and deleted. Manual assigned tags are not touched. Maintenance is also working if Automatic Tagging is disabled via settings. Great if you want to test some settings. Automatic Tagging will take the date of the thread creation and also the userid of the creator. This process can be automated by running a scheduled job once a night.

Please keep in mind that tags that were proposed via AJAX are not tagged as auto tagged and therefore cannot be identified as such (and therefore not deleted automatically). If you want to retain the auto tagged flag you should disable AJAX and enable the tagging after the thread has been saved. As an alternative way you can also disable this and let new threads be tagged in the night from the scheduled job.


Installation / Upgrade
1. Upload all files from "upload" to your server, take care of the directory structure
2. Import "product-auto_thread_tagger110.xml" as a product, overwrite if it's already installed
3. Check settings
4. Run maintenance / Auto Tag Threads to tag existing threads (needed if you want to use the cron)

After install, and by default the modification is disabled, this will allow you to play around with configuration before switching it on.


Troubleshooting
If you report a bug please post the thread title that created it, without this I cannot test it and improve the language parsers.

* If no threads are tagged you will have to check the following:
- Is the modification enabled? Is the action you are testing enabled? (vBulletin tagging, whole auto thread tagger system, AJAX, new threads)
- Are the words you are using badwords or filtered out?

* Cron/Scheduled Task is not tagging all threads.
- The cron is limited to 500 (you can change this via settings) threads per run to avoid heavy impact on server. Make sure you run maintenance auto tagger before this to tag old threads. You can check the scheduled tasks log to see if it is running correctly.
Important: If a thread title does not meet minimum requirements to be included in tags (f.e. one word thread titles, too short words), it will be forever in this queue.

* I'm using polish, arabic, turkish, etc.. language and the tagger is not working like it should.
- If not already replaced, replace the filter replacement '&'=>'and' with ' & '=>'and' (a space before and after &)



Todo
What comes next? You decide. Tell me what you are missing and I'll look if it can be integrated.


Why thread title and not thread text?
Parsing the thread text for tags is an extremely unlikely addition as this would require some fairly heavy processing to ensure quality of tags.


What are Stopwords?
Stopwords is the name given to words which are filtered out prior to processing of tags.
The user Hostboard on vBulletin.org posted some resources regardings this:
http://www.vbulletin.org/forum/showp...&postcount=380



History
1.2.0, 9th August 2008
- Fixed error with missing threadid's
- Fixed error with AJAX and prefix
- Fixed error with not indexing tags via cron
- Added polish, spain, english stopwords
- Compatibel with vBulletin 3.8

Download Now

Only licensed members can download files, Click Here for more information.

Show Your Support

  • To receive notifications regarding updates -> Click to Mark as Installed.
  • If you like this modification support the author by donating.
  • This modification may not be copied, reproduced or published elsewhere without author's permission.
Similar Mod
Mod Developer Type Replies Last Post
New Posting Features Automatic Thread Tagger (Project Terminated) MrEyes Modification Graveyard 146 16 Jul 2008 23:46

  #61  
Old 18 Jul 2008, 02:17
Rik Brown's Avatar
Rik Brown Rik Brown is offline
 
Join Date: Nov 2005
Excellent mod! Really love it. Thanks!

Wish list:

1) Remember the last thread # tagged and start from there ("resume mode") just tagging new threads rather than having to start over from scratch (which takes a very long time with 100,000+ threads) each time the mod is run.

2) Then, being able run the mod either by vb's scheduler or the server's cron (daily would be fine for me)

3) Would like to be able to ban tags with non-alphabetical characters unless they are in the vbulletin "Thread tag allowed list."

Thanks again. -- Rik
Reply With Quote
  #62  
Old 18 Jul 2008, 04:06
needaltuna needaltuna is offline
 
Join Date: Sep 2007
Would somebody -- anybody! -- please tell me whether this mod improves search engine rankings. Thank you.
Reply With Quote
  #63  
Old 18 Jul 2008, 04:12
glorify's Avatar
glorify glorify is offline
 
Join Date: Aug 2004
It gives you more pages, but it may take awhile to get saturation unless the php is rewritten to html. The best way to get Google to visit the pages without rewriting or waiting is to add adsense in your navbar. That way, adsense bot will look at each page and theoretically, you'll get a tad quicker results.
Reply With Quote
  #64  
Old 18 Jul 2008, 06:22
needaltuna needaltuna is offline
 
Join Date: Sep 2007
Thank you for that very informative answer.
Reply With Quote
  #65  
Old 18 Jul 2008, 09:13
Phalynx Phalynx is offline
 
Join Date: Feb 2004
Real name: Marius
Originally Posted by redlabour View Post
Example: bluewin.ch or aol.com should tagged as bluewin.ch and aol.com and not bluewinch and aolcom
Only if you remove the filter '.'=>''

Originally Posted by popowich View Post
I think I see the remaining part of the duplicates problem.

There are leading spaces in front of some the tags.

For example "tag" and " tag".
Thanks, I'll take a deeper look. If thats it, I will include the fix in the next version.

Originally Posted by popowich View Post
Is there a way to build phrases? "new york" instead of "new" and "york". I don't care if it's not perfect, just looking to teach it some common ones for my site.
Use "Composite Tags".

Originally Posted by Rik Brown View Post
Wish list:

1) Remember the last thread # tagged and start from there ("resume mode") just tagging new threads rather than having to start over from scratch (which takes a very long time with 100,000+ threads) each time the mod is run.

2) Then, being able run the mod either by vb's scheduler or the server's cron (daily would be fine for me)

3) Would like to be able to ban tags with non-alphabetical characters unless they are in the vbulletin "Thread tag allowed list."
Great ideas, will try to implement them in next version.
Reply With Quote
  #66  
Old 18 Jul 2008, 10:45
6piston's Avatar
6piston 6piston is offline
 
Join Date: Apr 2005
hi phalynx

thanks for the plugin. a real classic.
works like charm!

my only question is,
i've got 213156 threads to tag.

i ran maintenence and the thing is, it freezes at random numbers of posts. sometimes 1640 sometimes 12XXX posts. i've set the minium number of post tagging to 99999999999999 as well.

i also tried turning the board off but it still happens.

i understand that this could be a server issue too

but anyone out there with the same problem, please tell me what to do to get all posts tagged?

thanks!
Tom
Reply With Quote
  #67  
Old 18 Jul 2008, 13:29
Charlie98902 Charlie98902 is offline
 
Join Date: Dec 2006
So remove this : '-'=>'' to allow the use of this: -?
Reply With Quote
  #68  
Old 18 Jul 2008, 14:04
Phalynx Phalynx is offline
 
Join Date: Feb 2004
Real name: Marius
Yes
Reply With Quote
  #69  
Old 18 Jul 2008, 14:56
Britmovie Britmovie is offline
 
Join Date: Mar 2008
Exclude Search Words?
Is it best to set to yes or no? Little confused on this one?
Reply With Quote
  #70  
Old 18 Jul 2008, 17:53
tfw2005 tfw2005 is offline
 
Join Date: Oct 2004
I had an issue, not sure what if anything can be done.

Had the default vbulletin max tags by thread starter at 5, and user at 2.

When the auto tagger ran on long thread titles by Registerd Users, it tried to insert 10,15 words, etc.

They kept getting "thread tags were exceeded by X amount" error messages.

Pissing people off, because they don't even know its happening, or what to do to fix it.

Any way we can get a work around saying "if auto-tagger, allow unlimited tags -> else use tagging default"

My temp work around is allowing a super-high amount of tags for thread starter.
Reply With Quote
  #71  
Old 18 Jul 2008, 18:03
tfw2005 tfw2005 is offline
 
Join Date: Oct 2004
Second issue, Registered users can't post This for instance:

Thank you, Everyone.

It returns the error message:

The following errors occurred with your submission:
Tags cannot be overly common words (you).

How do I stop this from happening?
Reply With Quote
  #72  
Old 18 Jul 2008, 18:09
Lea Verou Lea Verou is offline
 
Join Date: Jul 2005
Real name: Lea Verou
Originally Posted by Phalynx View Post
Yes, parsing the whole posting would be a big performance issue. Beside that, I doubt there be will good things. You would need to maintain a big database of no-go words like "of", "we", "be", etc... - and this for every language.
This is why I mentioned the character limit
Reply With Quote
  #73  
Old 18 Jul 2008, 18:20
6piston's Avatar
6piston 6piston is offline
 
Join Date: Apr 2005
ooops i forgot to click "mark as installed"

so, did anyone experience the same thing on larger forums?


Originally Posted by 6piston View Post
hi phalynx

thanks for the plugin. a real classic.
works like charm!

my only question is,
i've got 213156 threads to tag.

i ran maintenence and the thing is, it freezes at random numbers of posts. sometimes 1640 sometimes 12XXX posts. i've set the minium number of post tagging to 99999999999999 as well.

i also tried turning the board off but it still happens.

i understand that this could be a server issue too

but anyone out there with the same problem, please tell me what to do to get all posts tagged?

thanks!
Tom
Reply With Quote
  #74  
Old 19 Jul 2008, 05:29
motowebmaster motowebmaster is offline
 
Join Date: Feb 2006
Thanks for this!
Reply With Quote
  #75  
Old 19 Jul 2008, 06:34
Skavenger Skavenger is offline
 
Join Date: Sep 2005
Everything works fine for me except one thing.
I ran "Delete Auto Tags" maintenance option, then "Auto Tag Threads" maintenance option and when I went to check tags, I found I have duplicated tags in a single thread.
Is there a way to fix this?

Thanks
Reply With Quote
Reply


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Mod Options

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off


New To Site? Need Help?

All times are GMT. The time now is 04:17.

Layout Options | Width: Wide Color: