I'm planning a new project which will essentially be a website front end to a large database. The data will be roughly hierarchical, which suggests to me using the Taxonomy system. I have the skills to programmatically insert taxonomy terms into a Drupal database and create nodes classified with those terms.

However, I am thinking of many thousands of taxonomy terms, possibly even a million or more.

What parts of the taxonomy system will break down? Presumably anything which tries to list all the taxonomy terms will need to be disabled, but apart from that, what should I be careful of?

Is there something new I've missed which would be more appropriate than the taxonomy module? I know about things like tagadelic, but are there more things like that?

Comments

gener-1

I don't think it will scale to a million terms either.

coreyp_1

I think the bigger issue is the limitations of the server and database (memory and storage space).

The taxonomy structure is sound. It is in sorting through all the records that you will have a bottleneck.

Then again, with a million terms, that would probably be an issue regardless of the CMS used.
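To illustrate the point about the bottleneck being in sorting through the records, here is a rough Python sketch using SQLite. It is not Drupal code: the `term` table, the `term_parent` index, and the helper names are all made up for the example, and the layout is only a simplified stand-in for a parent/child term hierarchy. The idea is that you never load the whole vocabulary; you fetch one indexed page of children at a time, and walk up the tree one parent at a time.

```python
import sqlite3


def build_demo_db(branching=10, depth=3):
    """Create a small in-memory tree of terms (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE term ("
        " tid INTEGER PRIMARY KEY,"
        " parent INTEGER NOT NULL,"  # 0 means 'no parent' in this sketch
        " name TEXT NOT NULL)"
    )
    # The index on (parent, name) lets a child listing be answered
    # without scanning or sorting the whole table.
    conn.execute("CREATE INDEX term_parent ON term (parent, name)")

    next_tid = 1
    level = [0]
    for _ in range(depth):
        rows = []
        for parent in level:
            for _ in range(branching):
                rows.append((next_tid, parent, f"term-{next_tid}"))
                next_tid += 1
        conn.executemany("INSERT INTO term VALUES (?, ?, ?)", rows)
        level = [row[0] for row in rows]
    conn.commit()
    return conn


def children(conn, parent, limit=50, offset=0):
    """Return one page of direct children of a term, ordered by name."""
    cur = conn.execute(
        "SELECT tid, name FROM term WHERE parent = ?"
        " ORDER BY name LIMIT ? OFFSET ?",
        (parent, limit, offset),
    )
    return cur.fetchall()


def ancestors(conn, tid):
    """Walk up the tree from a term, returning ancestor ids, nearest first."""
    path = []
    while tid != 0:
        row = conn.execute(
            "SELECT parent FROM term WHERE tid = ?", (tid,)
        ).fetchone()
        if row is None:
            break
        tid = row[0]
        if tid != 0:
            path.append(tid)
    return path


if __name__ == "__main__":
    conn = build_demo_db()
    print(children(conn, 0, limit=5))
    print(ancestors(conn, 111))
```

With this kind of access pattern the cost of each page view depends on the number of children under one term and the depth of the tree, not on the total number of terms. Anything that insists on building the full tree in memory would still be a problem at a million terms.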

- Corey