My site is a member of a press release service. I already have current press releases uploaded (via Mailhandler) but I would like help incorporating content from their archive. It's a one-off import, so quick-and-dirty is fine. You'll probably need Perl or Python skills.

I'll pay $300 to get it done.
FYI, if you're good but busy right now, I'm not in a huge rush. But you'd have to explain why you're so good I should wait. :-)

Please read the particulars below and contact me if you're up to the job.

Thanks,
Patricia

Particulars
----------
What you need to do...
1. Write a script (perl? python?) to conduct the custom search, view each article, and store. FYI...
- I'll give you a login to get onto the archive site
- They have a "custom search" utility. I'll give you the search terms to plug in. They will then give you a paginated list with 10 teasers per page. There are approx 5,000 articles
- They use Lucene as their search engine and individual articles are rendered with URLs like: "http://media.prnewswire.com/en/jsp/myPRNJ.jsp?profileid=1148775&resource..."

2. Make some tweaks to tidy up each press release
- Content type = "News" for all nodes (a custom content type, really just an article - title, teaser, body)
- Authored on date: The first line always has date, time and time zone listed as "Jul 9, 2007 07:41 America/Los_Angeles". I only really care about the date, but if it's the same amount of work you can put in the full date, time and time zone.
- For any line beginning "CONTACT:" (all caps) delete that line and any subsequent lines
- Taxonomy: Each node gets flagged with term = PR
- Taxonomy: If one of approx. 20 Taxonomy terms is mentioned in the first 10 lines, the node should be flagged with that term

3. Import
- Import the nodes. I'll add whatever import module you need, just let me know. I'll give you a login to the site which will have rights to post News nodes, plus whatever other permissions will make the job easy.

Comments

flevour’s picture

If you are still looking for someone, please contact me using my personal contact form.
Cheers,
flevour
---------------------------
http://www.flevour.net

PRFB’s picture

I found someone to do the job.
Drupal.org rocks....