My site is a member of a press release service. I already have current press releases uploaded (via Mailhandler) but I would like help incorporating content from their archive. It's a one-off import, so quick-and-dirty is fine. You'll probably need Perl or Python skills.
I'll pay $300 to get it done.
FYI, if you're good but busy right now, I'm not in a huge rush. But you'd have to explain why you're so good I should wait. :-)
Please read the particulars below and contact me if you're up to the job.
Thanks,
Patricia
Particulars
----------
What you need to do...
1. Write a script (perl? python?) to conduct the custom search, view each article, and store. FYI...
- I'll give you a login to get onto the archive site
- They have a "custom search" utility. I'll give you the search terms to plug in. They will then give you a paginated list with 10 teasers per page. There are approx 5,000 articles
- They use Lucene as their search engine and individual articles are rendered with URLs like: "http://media.prnewswire.com/en/jsp/myPRNJ.jsp?profileid=1148775&resource..."
2. Make some tweaks to tidy up each press release
- Content type = "News" for all nodes (a custom content type, really just an article - title, teaser, body)
- Authored on date: The first line always has date, time and time zone listed as "Jul 9, 2007 07:41 America/Los_Angeles". I only really care about the date, but if it's the same amount of work you can put in the full date, time and time zone.
- For any line beginning "CONTACT:" (all caps) delete that line and any subsequent lines
- Taxonomy: Each node gets flagged with term = PR
- Taxonomy: If one of approx. 20 Taxonomy terms is mentioned in the first 10 lines, the node should be flagged with that term
3. Import
- Import the nodes. I'll add whatever import module you need, just let me know. I'll give you a login to the site which will have rights to post News nodes, plus whatever other permissions will make the job easy.
Comments
I am interested in the job
If you are still looking for someone, please contact me using my personal contact form.
Cheers,
flevour
---------------------------
http://www.flevour.net
Thanks! I'm all set.
I found someone to do the job.
Drupal.org rocks....