Can tags be pulled from titles? [#482106]

Comment #1

avpaderno

commented 2 August 2009 at 11:56

Title:	1)Can tags be pulled from titles?	» Can tags be pulled from titles?

Comment #2

Bèr Kessels commented 14 September 2009 at 20:45

Status:	Active	» Closed (works as designed)

A developer can write his/her own query and push that through the Tagadelic API.

Comment #3

gkatsanos commented 26 November 2009 at 17:26

Category:	support	» feature
Priority:	Normal	» Critical
Status:	Closed (works as designed)	» Active

I have exactly the same question. At first I thought it was my mistake and that I needed to retag my nodes, but then I understood that it's "by design".
As I don't have the skills to write snippets against the Drupal API, do you think this already exists as another module that complements Tagadelic, or is there a workaround, maybe by making a new vocabulary with tags that come from the titles? (I'm just guessing, I have no clue :( )
Otherwise, can we have such an option in the next release?

Comment #4

frank ralf commented 26 November 2009 at 17:38

You might have a look at the Keywords module. It seems to provide at least similar functionality:

Module can save node's keywords as taxonomy terms to selected vocabulary. Later you can use this vocabulary to tell Meta Tags module use this terms for meta-tag "keywords" or use as usual tags.

Frank

Comment #5

frank ralf commented 26 November 2009 at 17:40

Priority:	Critical	» Normal

Setting this back to "normal" as it might be beyond the scope of this module ;-)

Comment #6

gkatsanos commented 26 November 2009 at 18:39

Thanks for that reply. It kind of works, although I am still wrestling with it. I am describing what I did in case someone else has the same question.

1. Install and enable Tagadelic. (It adds a couple of blocks to your Blocks page, which you need to enable yourself, but wait with that.)

2. Install and enable Keywords. Create a new vocabulary and name it "Tags" or something similar.

3. Set up Keywords so that on each cron run it scans your nodes for frequently used keywords, turns them into tags, and adds them to the vocabulary you just created ("Tags").

4. Run cron 2-3 times with the setting to scan about 10 or 20 nodes per cron run. Not more, it's slow.

5. Go to the Blocks page and publish the "Tags to Tags" block, or whatever it's called.

So, just to make it foolproof, what you're doing is: using Tagadelic to present a cloud built from a special vocabulary that you created and that Keywords filled with terms.

(My problem is that I get bad keywords, since I have a Greek website and I need to fix my stop words.)

Oh, by the way: if you scan 20-40 nodes and you're not satisfied with the quality of the tags, find the stop words file (in the module folder), edit it, and add the words you want. Then disable, uninstall, and delete the Keywords module and reinstall it so that you get fresh keywords.

Comment #7

frank ralf commented 26 November 2009 at 18:49

Here are some stop word lists for different languages: http://www.ranks.nl/resources/stopwords.html

Comment #8

gkatsanos commented 26 November 2009 at 22:17

Unfortunately, there is no list of Greek stop words there. :(
Thanks anyway
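
When no ready-made stop word list exists for a language, one common fallback is to derive candidates from your own content: words that show up in most titles are unlikely to be useful tags. The following is only a minimal sketch of that idea, not something the Keywords module does; the function name, the `min_doc_share` parameter, and the sample titles are all made up for illustration.

```python
import re
from collections import Counter


def candidate_stop_words(texts, min_doc_share=0.5):
    """Return the words that occur in at least min_doc_share of the texts."""
    doc_freq = Counter()
    for text in texts:
        # \w+ matches Unicode letters, so Greek words are tokenized as well.
        # A set is used so each word counts once per text.
        doc_freq.update(set(re.findall(r"\w+", text.lower())))
    threshold = min_doc_share * len(texts)
    return sorted(word for word, n in doc_freq.items() if n >= threshold)


# Hypothetical ad titles, used only to demonstrate the function.
titles = [
    "Πωλείται αυτοκίνητο και μηχανή",
    "Ενοικιάζεται σπίτι και γκαράζ",
    "Πωλείται ποδήλατο και κράνος",
]
print(candidate_stop_words(titles, min_doc_share=0.9))  # ['και']
```

The resulting candidates would still need a manual review before being added to a stop words file, since a word that is frequent on one site can be a meaningful tag on another.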

Comment #9

gkatsanos commented 28 November 2009 at 10:01

I am afraid the "Keywords" workaround is kind of buggy. The reason is that building a vocabulary of tags ends up producing an enormous vocabulary (I have many nodes on my portal, since it's an ads site).
I think it would be 100 times more practical if Tagadelic, by design, took its tags from titles. It would save a lot of time, database size, speed, etc.

PS: I am trying to delete this 'Tags' vocabulary right now and my query is still running after 3 minutes ...

Comment #10

frank ralf commented 28 November 2009 at 14:49

Tagadelic is supposed to pull its tags from vocabularies. So IMO there are two possible solutions for your problem:

1. Restrict the "Keywords" module so it only scans the title of nodes but not the body.
2. Create your own module which automatically creates vocabulary items from all node titles.

Either way, this is not (and should not be) the direct responsibility of the Tagadelic module itself.

hth
Frank

Comment #11

gkatsanos commented 30 November 2009 at 09:05

I don't think this is a discussion about responsibilities, Frank, but mostly a question of whether we can improve community modules with features everyone seems to be asking for. We need to understand the needs before jumping to conclusions.

After some days of testing, I can confirm that the 'Keywords' workaround is not the best one.
'Keywords' also overwrites custom tags you try to add and replaces them with the one it thinks is best (with the available settings, the most frequently used one, which is usually a stop word or a word that doesn't represent the context of the node).

To describe our problem and what we need, I'd say:
An algorithm that scans all nodes (their titles, or their bodies) and saves the words that are used most frequently, not inside one node (as Keywords is designed to do) but across all of them. This can then produce a useful vocabulary that Tagadelic can display as a cloud.
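
The counting step described above could look roughly like the sketch below. It is written in Python purely to illustrate the idea and is not Drupal or Keywords module code; `count_title_words`, the tiny `STOP_WORDS` set, and the sample titles are hypothetical.

```python
import re
from collections import Counter

# Illustrative stop words only; a real list would be much longer.
STOP_WORDS = {"a", "for", "in", "the", "with"}


def count_title_words(titles, stop_words=STOP_WORDS):
    """Count how often each non-stop word occurs across all titles."""
    counts = Counter()
    for title in titles:
        for word in re.findall(r"\w+", title.lower()):
            if word not in stop_words:
                counts[word] += 1
    return counts


# Hypothetical node titles.
titles = ["Car for sale", "Bike for sale in Athens", "Flat for rent in Athens"]
counts = count_title_words(titles)
print(counts.most_common(2))
```

Because the counter is shared by all titles, a word such as "sale" is ranked by its total across nodes rather than by how often it appears inside any single node.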

Maybe this is more a job for "Keywords" than for Tagadelic, I admit. But it seems Keywords is kind of abandoned as a project.

Comment #12

frank ralf commented 1 December 2009 at 07:36

What about taking over the Keywords module as a (co-)maintainer and enhancing it in the way you described? That would make a very useful module indeed!

Frank

Comment #13

gkatsanos commented 1 December 2009 at 09:24

I would do it, but I'm afraid I'd need a couple of years of studying PHP and SQL in order to do that :/

Comment #14

frank ralf commented 2 December 2009 at 12:37

Will have a closer look at "Keywords" and see what I can do.

EDIT
AFAICS the Keywords module already provides most of the desired functionality. What's missing is some configurability, e.g. restricting node types (#568730: Choose content types) or adding stop word lists for other languages (#648180: Make stop-word lists configurable).

EDIT 2:
I posted a quick hack over at #648730: Cleaned up module code + updated .pot file which lets you make the module only scan titles.

Frank

Comment #15

gkatsanos commented 2 December 2009 at 14:57

Thank you for this contribution, Frank. I hope the Keywords maintainers add these fixes to HEAD asap.
In addition to the title-only scan patch, my idea was to save terms in the vocabulary only if they occur more than a given number of times (which we define) across all node titles.
This would help avoid saving all words from all titles in a vocabulary just to present them afterwards with Tagadelic.
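
As a rough illustration of that threshold idea, the sketch below keeps only the words whose total count across all titles reaches a configurable minimum. It is a Python sketch, not module code; `terms_to_save`, `min_occurrences`, and the sample counts are made up, and the sketch assumes "at least N times" as the cut-off.

```python
from collections import Counter


def terms_to_save(word_counts, min_occurrences=3):
    """Return the words seen at least min_occurrences times, sorted by name."""
    return sorted(
        word for word, n in word_counts.items() if n >= min_occurrences
    )


# Hypothetical totals gathered from all node titles.
word_counts = Counter({"sale": 12, "athens": 7, "bike": 2, "vintage": 1})
print(terms_to_save(word_counts, min_occurrences=3))  # ['athens', 'sale']
```

Only the words that pass the threshold would become vocabulary terms, which keeps the vocabulary small on a site with many nodes.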

Comment #16

frank ralf commented 2 December 2009 at 18:07

Hi George,

In the Keywords module you can set

Minimum number of times keywords used:

How many times keyword should be used in text to be saved as term in specified vocabulary?
Recomended: 2 times.

But that might be on a per node basis. Counting over all the nodes is difficult, as the Keywords module only scans a certain number of nodes with every cron run. So it won't capture all nodes in one go.
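
One way around the batch limitation would be to keep running totals between runs, so that every batch adds to the counts gathered so far. The sketch below shows the idea in Python with a JSON file as storage. It is an assumption about how this could be done, not how the Keywords module works; `process_batch` and the file name are hypothetical.

```python
import json
import re
import tempfile
from collections import Counter
from pathlib import Path


def process_batch(titles, state_file):
    """Add one batch of titles to the running totals stored in state_file."""
    path = Path(state_file)
    if path.exists():
        totals = Counter(json.loads(path.read_text(encoding="utf-8")))
    else:
        totals = Counter()
    for title in titles:
        totals.update(re.findall(r"\w+", title.lower()))
    # Persist the totals so the next run can continue from here.
    path.write_text(json.dumps(totals, ensure_ascii=False), encoding="utf-8")
    return totals


# Two simulated runs, each seeing a different batch of hypothetical titles.
with tempfile.TemporaryDirectory() as tmp:
    state = Path(tmp) / "totals.json"
    process_batch(["Car for sale"], state)
    totals = process_batch(["Bike for sale"], state)
    print(totals["sale"])  # 2
```

A threshold on the totals would then only be meaningful once all nodes have been scanned at least once, and a real implementation would also have to avoid counting the same node twice.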

Frank

PS:
The Keywords issue queue might be a more proper place to continue this discussion ;-)

Comment #17

gkatsanos commented 2 December 2009 at 18:15

Yeap you might be right about the issue queue :)
Actually I'm tempted to get into this and make a small module myself, about time!

peace

Comment #18

Bèr Kessels commented 14 December 2009 at 23:02

Closing this. Not a Tagadelic issue.

Comment #19

Bèr Kessels commented 14 December 2009 at 23:02

Status:	Active	» Closed (won't fix)
