Hi,
With the recent versions of the biblio module, we have importation issues, which did were not happening with early versions of biblio v6. When importing a large number of references, the time taken is very very long. I join the problematic bibtex file for convenience. I tried to split it in two files to ease importation a little bit, but the final number of imported references was surprisingly far less than the real number. Any hint? Thanks
| Comment | File | Size | Author |
|---|---|---|---|
| #4 | tout.bib_.zip | 265.23 KB | sebos69 |
| colloques.bib_.zip | 74.98 KB | sebos69 |
Comments
Comment #1
ntripcevich commentedUsing the v5.7 version I've been having to limit my import (Endnote X1 XML) to less than 100 records per file.
Comment #2
rjerome commentedThere is a new routine in 6.x which converts "latex" encoded special characters to UTF8. This seems to be where the problem lies, but I'm on vacation for the next few weeks so don't expect a fix before then.
Ron.
Comment #3
sebos69 commentedOK, I will try to cope with that until you return. Have nice holidays in the meantime!
Comment #4
sebos69 commentedA followup: u just tried beta 9, and it imports the entries fine, but accessing the /biblio page takes a very long time (1 to 2 minutes).
I join an updated version of our .bib database if someone wants to test it...
Comment #5
sebos69 commentedComment #6
rjerome commentedI imported this file onto my test site and although it took quite a while to import (and it did revel some errors in my import code) once imported the page load (/biblio) times are normal.
Ron.
P.S. The errors I eluded to above have corrupted some of your entries so I'll let you know when it's fixed and you can re-import them.
Comment #7
sebos69 commentedThanks for your support!
Finally, I have found that the problem with the display comes from the fact that my site is a multilingual (French/English, default is French). The /fr/biblio page displays as fast as expected while the /en/biblio page takes ages...
On the server side, it is the mysqld process which is talking a long time. (shown by the 'top' command using linux)
Should I open a new bug? Do you need more info?
Comment #8
sebos69 commentedMore investigations : it seems that there is a problem with i18n :
the page /fr/biblio takes a few seconds to display, and the corresponding mysql request is :
SELECT DISTINCT n.*, b.*, bt.name as biblio_type_name FROM node n left join biblio b on n.vid=b.vid left join biblio_types bt on b.biblio_type=bt.tid WHERE (n.language ='fr' OR n.language ='' OR n.language IS NULL) AND ( n.type='biblio' )ORDER BY b.biblio_year DESC, SUBSTRING(n.title,1,3) ASC LIMIT 0, 25
whereas the page /en/biblio takes a long time. The corresponding mysql request is :
SELECT DISTINCT n.*, b.*, bt.name as biblio_type_name FROM node n left join biblio b on n.vid=b.vid left join biblio_types bt on b.biblio_type=bt.tid LEFT JOIN node i18n ON n.tnid = i18n.tnid AND i18n.language = 'en' WHERE (n.language ='en' OR n.language ='' OR n.language IS NULL OR (n.language = 'fr' AND i18n.nid IS NULL)) AND ( n.type='biblio' )ORDER BY b.biblio_year DESC, SUBSTRING(n.title,1,3) ASC LIMIT 0, 25;
it seems that removing the parts like:
"LEFT JOIN node i18n ON n.tnid = i18n.tnid AND i18n.language = 'en'"
"AND i18n.nid IS NULL"
makes the request faster (tested directly with the command line, sorry, no patch available)
should I bug the i18n guys?
Comment #9
Frank Steiner commentedNo, this is the correct behaviour and it's caused by your i18n settings. You have set "mixed", that means on the language selection page you have "Current language and default language". Therefore the query must consider all english and all french nodes which causes this more complicated query.
Consider if you really need the mixed mode. This will show you all pages which are either in french or in english. I'm not this makes sense.
You could set "current language only" to get rid of this, but then you will see always only french or only english pages.
If you have pages that are available in one language only but want them to be shown as long as there is no translation for the current language (an english page for a french speaking viewer might be better than no page), you can also try the "current language and no language" and define those page that are available in one language only as "language neutral" (selection box in the node edit page). Only you really translate a page into both languages, set it "french" or "englisch" language. Then you will always see "french and language neutral pages" or "english and language neutral pages" with the "current language and no language" setting.
Comment #10
sebos69 commentedNo, this is the correct behaviour and it's caused by your i18n settings. You have set "mixed", that means on the language selection page you have "Current language and default language". Therefore the query must consider all english and all french nodes which causes this more complicated query.
Thanks! I switched to "Current language and default language" and the biblio page is fast in both cases!
So my case is closed :)
I am thinking about other users who could face the same problems. Would there be a way to warn the user about the potential slowdown caused by choosing this option?
Anyway, thanks a lot, again!