Register Members List Search Today's Posts Mark Forums Read

Reply
 
Mod Options
Automatic Thread Tagger Details »
Automatic Thread Tagger
Mod Version: 1.2.0, by Phalynx (Coder) Phalynx is offline
Developer Last Online: Nov 2015 I like it Show Printable Version Email this Page

This modification is in the archives.
vB Version: 3.7.x Rating: (66 votes - 4.56 average) Installs: 841
Released: 16 Jul 2008 Last Update: 09 Jan 2009 Downloads: 5464
Not Supported DB Changes Uses Plugins Auto-Template Additional Files Translations  

Automatic Thread Tagger


Description
When a user submits a new thread this modification will automatically take keywords from the thread title and use these as tags. You can use Automatic Thread Tagger to propose the user AJAX tags for his new thread, or it assigns new tags after saving the new thread. It can add the translated thread prefix to the tags.
Additionally, you can tag existing threads via maintenance and also scheduled tasks.

This modification is a successor to the terminated Automatic Thread Tagger by MrEyes:
http://www.vbulletin.org/forum/showthread.php?t=179927

As an example, if a user submits a thread with a title of:
"Fish Food for Cats!"

The thread will be automatically tagged with:

- Fish
- Food
- Cats

If the user also submits an actual tag of "Fish" this will not be duplicated. Any rules you have setup for tagging will be respected.
If you choose to do so this product will also automatically tag threads created by incoming RSS feeds.

Demo
I cannot show you the process of creation, but here is a list of tags generated by Auto Thread Tagger:
http://www.insideearth.net/tags.php?langid=5
http://www.insidesupcom.de/tags.php?langid=1


Automatic Tagging of existing threads
You can tag existing threads via maintenance or scheduled task/cron. They will be created with a special flag so they can be easily identified and deleted. Manual assigned tags are not touched. Maintenance is also working if Automatic Tagging is disabled via settings. Great if you want to test some settings. Automatic Tagging will take the date of the thread creation and also the userid of the creator. This process can be automated by running a scheduled job once a night.

Please keep in mind that tags that were proposed via AJAX are not tagged as auto tagged and therefore cannot be identified as such (and therefore not deleted automatically). If you want to retain the auto tagged flag you should disable AJAX and enable the tagging after the thread has been saved. As an alternative way you can also disable this and let new threads be tagged in the night from the scheduled job.


Installation / Upgrade
1. Upload all files from "upload" to your server, take care of the directory structure
2. Import "product-auto_thread_tagger110.xml" as a product, overwrite if it's already installed
3. Check settings
4. Run maintenance / Auto Tag Threads to tag existing threads (needed if you want to use the cron)

After install, and by default the modification is disabled, this will allow you to play around with configuration before switching it on.


Troubleshooting
If you report a bug please post the thread title that created it, without this I cannot test it and improve the language parsers.

* If no threads are tagged you will have to check the following:
- Is the modification enabled? Is the action you are testing enabled? (vBulletin tagging, whole auto thread tagger system, AJAX, new threads)
- Are the words you are using badwords or filtered out?

* Cron/Scheduled Task is not tagging all threads.
- The cron is limited to 500 (you can change this via settings) threads per run to avoid heavy impact on server. Make sure you run maintenance auto tagger before this to tag old threads. You can check the scheduled tasks log to see if it is running correctly.
Important: If a thread title does not meet minimum requirements to be included in tags (f.e. one word thread titles, too short words), it will be forever in this queue.

* I'm using polish, arabic, turkish, etc.. language and the tagger is not working like it should.
- If not already replaced, replace the filter replacement '&'=>'and' with ' & '=>'and' (a space before and after &)



Todo
What comes next? You decide. Tell me what you are missing and I'll look if it can be integrated.


Why thread title and not thread text?
Parsing the thread text for tags is an extremely unlikely addition as this would require some fairly heavy processing to ensure quality of tags.


What are Stopwords?
Stopwords is the name given to words which are filtered out prior to processing of tags.
The user Hostboard on vBulletin.org posted some resources regardings this:
http://www.vbulletin.org/forum/showp...&postcount=380



History
1.2.0, 9th August 2008
- Fixed error with missing threadid's
- Fixed error with AJAX and prefix
- Fixed error with not indexing tags via cron
- Added polish, spain, english stopwords
- Compatibel with vBulletin 3.8

Download Now

Only licensed members can download files, Click Here for more information.

Show Your Support

  • To receive notifications regarding updates -> Click to Mark as Installed.
  • If you like this modification support the author by donating.
  • This modification may not be copied, reproduced or published elsewhere without author's permission.
Similar Mod
Mod Developer Type Replies Last Post
New Posting Features Automatic Thread Tagger (Project Terminated) MrEyes Modification Graveyard 146 16 Jul 2008 23:46

  #46  
Old 17 Jul 2008, 15:56
popowich popowich is offline
 
Join Date: Jun 2004
After doing some more clicking around it looks like if I had existing manual tags, the auto tagger will add a duplicate tag for the matching tags that it finds in the subject. Can the auto tagger ignore words that are existing tags for the thread?

-Raymond
Reply With Quote
  #47  
Old 17 Jul 2008, 16:08
Phalynx Phalynx is offline
 
Join Date: Feb 2004
Real name: Marius
It is already ignoring existing words, that's why I'm a little bit confused. Cannot reproduce this here with existing tags. I will try to do a fix.
Reply With Quote
  #48  
Old 17 Jul 2008, 16:14
popowich popowich is offline
 
Join Date: Jun 2004
Taking a wild educated guess, but does it have to do case sensitivity? The tag display lowercases everything, even if you enter them first or all letters capital. How does it look in the tag database? Is there a "lowercase" and a "Lowercase" causing the appearance of duplicates?

-Raymond
Reply With Quote
  #49  
Old 17 Jul 2008, 16:19
Phalynx Phalynx is offline
 
Join Date: Feb 2004
Real name: Marius
Maybe. Now I will do an strtolower before I do an array_unique. v1.0.1 will be released in few minutes.
Reply With Quote
  #50  
Old 17 Jul 2008, 16:38
Phalynx Phalynx is offline
 
Join Date: Feb 2004
Real name: Marius
Automatic Thread Tagger 1.0.1 has been released.


Changes:
1.0.1, 17th July 2008
- Added: Automatic Tags via maintenance are now associated with the UserID that created the Thread. Just remove auto tags and re-run auto tag to associate all tags to the users. Usefull if you use vBExperience and want to reward users.
- Added: Additionally to the configuration of the auto tagger the vBulletin tags badwords are also taken as blacklist
- Added: Workaround for vBulletin 3.7.0 for non existing function "split_tag_list"
- Added: New setting to filter out dates like 01/02/2008, 05/06/08, 01.02.2008
- Changed: Location of settings, moved below Tagging Options. The new name of the setting group is "Tagging Options (Automatic Thread Tagger)"
- Changed: Behaviour of auto tagging: It is not deleting old tags anymore, please delete old tags before this.

Upgrade:
1. Upload the functions_autotagger.php to your includes folder (the same directory as your config.php)
2. Import "product-auto_thread_tagger101.xml" as a product, overwrite if it's already installed
Reply With Quote
  #51  
Old 17 Jul 2008, 16:50
popowich popowich is offline
 
Join Date: Jun 2004
Does adding a minimum letters in word for auto tagging makes sense?

This way admins can specify "Don't include worlds with less than 5 letters" ?

I did the delete tags, add tags under the update counters and still seem to have some duplicates, but they are different ones now.

I'll poke around and see if I can track down why.

-Raymond
Reply With Quote
  #52  
Old 17 Jul 2008, 16:52
Phalynx Phalynx is offline
 
Join Date: Feb 2004
Real name: Marius
Yes, auto tagging is obeying such limits.

@duplicates
Thanks in advance.
Reply With Quote
  #53  
Old 17 Jul 2008, 17:17
redlabour's Avatar
redlabour redlabour is offline
 
Join Date: Mar 2004
Real name: André
Works great now!
Reply With Quote
  #54  
Old 17 Jul 2008, 17:35
Charlie98902 Charlie98902 is offline
 
Join Date: Dec 2006
Thanks for the update.
Reply With Quote
  #55  
Old 17 Jul 2008, 17:40
drsli's Avatar
drsli drsli is offline
 
Join Date: Jan 2008
Real name: Dietmar
Yearning awaited modification! Thank you so much for this ingenious tool. Works like a charm!
Reply With Quote
  #56  
Old 17 Jul 2008, 17:45
redlabour's Avatar
redlabour redlabour is offline
 
Join Date: Mar 2004
Real name: André
Can someone give me a Hint for a additional Rule to tag Domainnames allways correctly?

Example: bluewin.ch or aol.com should tagged as bluewin.ch and aol.com and not bluewinch and aolcom
Reply With Quote
  #57  
Old 17 Jul 2008, 19:57
citeman citeman is offline
 
Join Date: Feb 2008
Okay... because of some server problem I wasn't able to try the mod out. It's working fantastic! Thanks Phalynx - you've done a fantastic job with this one!
Reply With Quote
  #58  
Old 17 Jul 2008, 21:28
Hornstar's Avatar
Hornstar Hornstar is offline
 
Join Date: Jun 2005
Real name: Matt
Yeah it just keeps on getting better. thanks again.
Reply With Quote
  #59  
Old 17 Jul 2008, 23:30
popowich popowich is offline
 
Join Date: Jun 2004
I think I see the remaining part of the duplicates problem.

There are leading spaces in front of some the tags.

For example "tag" and " tag".

-Raymond
Reply With Quote
  #60  
Old 17 Jul 2008, 23:55
popowich popowich is offline
 
Join Date: Jun 2004
Is there a way to build phrases? "new york" instead of "new" and "york". I don't care if it's not perfect, just looking to teach it some common ones for my site.

-Raymond
Reply With Quote
Reply


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Mod Options

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off


New To Site? Need Help?

All times are GMT. The time now is 23:28.

Layout Options | Width: Wide Color: