Hello,

After getting migrations working I've noticed there are duplicate taxonomy terms.

Is there a best-practice way to avoid duplicates?

This occurs when migrating from a Drupal6 site to a Drupal7 site. The Drupal6 site was copied and then worked on / upgraded to Drupal7. So now we need to sync the content from the live Drupal6 site to the new Drupal7 site.

Should all the content be dumped from the Drupal7 / destination site before running migrations? Or is there some way to merge, skip or update existing taxonomy terms without breaking node associations or anything else...

Thanks!

Comments

internets’s picture

After digging in further the Devel module along with Devel Generate ended up being the solution.

Using Devel I was able to delete all current taxonomy terms, nodes, and users.

Then running the migrate-import succeeded without duplicates. So far things seem to be working well.

mikeryan’s picture

Status: Active » Postponed (maintainer needs more info)

The Migrate module should be resolving existing terms - if the incoming term matches an existing one (exactly, or case-insensitively if you set ignore_case to TRUE), then it should be mapped to the existing tid in the migration map table instead of creating a new term. Is there anything else you can say about the circumstances where dupes were created?

mikeryan’s picture

Status: Postponed (maintainer needs more info) » Closed (cannot reproduce)