| Project: | Extractor |
| Version: | 6.x-1.0-alpha7 |
| Component: | Code |
| Category: | feature request |
| Priority: | normal |
| Assigned: | Unassigned |
| Status: | active |
Issue Summary
I was having few ideas for extractor or some submodule so would like to know what others think.
1.Would like to have module where terms would be auto extracted from text. For example if you have text and word(s) that is most freqvently appearing should be extracted as term. Thinking of putting all words in array and then loop it and then find top 5 for example and add it to vocabulary of choice.
2.Text grouping, different feeds often have similar texts, so for example you would want to have same news items grouped. there are some sites that do that, dont know which method is used, some function that cheks similarity in bodies of nodes and titles and then groups?
3.Also when extractor is used, would be nice that similar option is also applied when tagging, is it skipped beacuse it would be to hard on server or just alex didnt have time to do it?
p.s.
i know that 2. is for feeds module, we can paste it there if its good idea.