I set the Boost cache to expire every 4 hours, and cron to run every 4 hours. The home page sets some session data, say $_SESSION['mydata'], and another page uses that $_SESSION['mydata']. I found that the crawler goes to the home page and then to the other pages, but $_SESSION['mydata'] is NOT set on the other pages. So does the cron crawler visit every page with a different session, so that the session data is lost? Thanks.

Patrick

Comments

patrickwang5’s picture

By the way, I just found one interesting issue. Is simple PHP in my AJAX callback faster than the Boost cache? For example, I put drupal_json($_SESSION['mydata']) in the PHP callback. If I run it as a logged-in user, the AJAX call returns in 1.2 seconds in Firebug. If I log out and run it, the AJAX call returns in 3.3 seconds in Firebug, and that response is a cached copy. I am wondering if the Boost cache takes some additional time for a handshake?

mikeytown2’s picture

The crawler doesn't keep the session data. Looks like I need to purge the session table from the crawler. I developed the crawler with Pressflow, which doesn't give anonymous users a session.

patrickwang5’s picture

I found another way around it. I save the session data to MySQL. If the session data is not set for some reason, the page loads it from MySQL. As for the speed: if I turn off the JSON cache with the .htaccess rule, an anonymous user's AJAX call finishes in 3.4 seconds in Firebug. If I turn on the JSON cache with the .htaccess rule, it finishes in 3.3 seconds. The two values are similar. One interesting thing is that when logged in as the admin user, the AJAX call finishes in 1.3 seconds (maybe being logged in cuts the handshake time). I ran each case a couple of times. If someone comes across this and figures out the speed difference, let me know.
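
For anyone who wants to try the same workaround, here is a minimal sketch of the "use the session, else fall back to the database" idea. It is not the exact code from my site: the function name get_with_fallback and the $loadFromStore callable are made up for illustration, and an in-memory array stands in for the MySQL table so the snippet runs on its own. In a real page, $session would be $_SESSION and the callable would run the actual database query.

```php
<?php
/**
 * Return the value for $key from the session, falling back to a persistent
 * store when the session does not have it (for example, on a crawler request
 * that arrives with a fresh session).
 *
 * $session is passed by reference so the sketch can run outside a web
 * request; in a real page it would be $_SESSION.
 * $loadFromStore is a hypothetical callable standing in for the MySQL lookup.
 */
function get_with_fallback(array &$session, string $key, callable $loadFromStore)
{
    // Fast path: the session already holds the value.
    if (isset($session[$key])) {
        return $session[$key];
    }

    // Slow path: ask the persistent store.
    $value = $loadFromStore($key);

    // Remember the value in the session so later lookups skip the store.
    if ($value !== null) {
        $session[$key] = $value;
    }

    return $value;
}

// Stand-in for the MySQL table (assumption: one value per key).
$store = ['mydata' => 'saved on the home page'];
$lookup = function (string $key) use ($store) {
    return $store[$key] ?? null;
};

// An empty session, as a request without the original session would see it.
$session = [];
echo get_with_fallback($session, 'mydata', $lookup), "\n";
```

The value is written back into the session after the first lookup, so the database is only hit when the session really is empty.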