Hi, I'm a developer on an open source project called Docvert which converts OpenDocument (and MS Word if you've got OpenOffice or Abiword on the server) to DocBook, HTML, or any XML.

About a year ago there was a request for MS Word/Excel import. I'm not that familiar with Drupal's code but if anyone wants to collaborate on a drupal module that allows OpenDocument upload then let me know at docvert@holloway.co.nz . Cheers :)

Comments

pearcec’s picture

Have you started anything yet? I have a few ideas:

* Create a New Content type and allow people to associate uploaded files that get coverted with Docvert.
* I would give people the opportunity to associate the original format and other formats like PDF.
* Each new node of this Content type could have the choice of output.
* Administration controls for what ever needed to be set for Docvert to work
* Access controls that provide access to let people create a new node of this content type
* Access controls for administrative access to configuring this module.

Drop me a line, I emailed you but never heard back.

discursives’s picture

The more libraries we process into Drupal the greater the the spread of the CMS. I'd like to help with this.

A
http://peerproducers.com

mjohnq3’s picture

I hope this gets off the ground because a way to import MS Word documents (or documents from other MS Word compliant word processors like Open Office) without losing the formatting would really be very useful as so many people world-wide create and use such documents. And typing a long story or other type of content into any of the HTML editors is really too slow and cumbersome.

(I actually composed this comment in Word!)

cghobbs’s picture

We have also had a quote request to import approx 600 pages of a word doc into Drupal while making sure it is XHTML valid.

pearcec’s picture

Looks like we have some interest at this point . I tried contacting holloway. But never heard from him. See if one of you guys can get him back into this forum. Otherwise we can go about doing it ourselves. Though I would like holloway's input.

jstrunk’s picture

EZ Publish has this feature. It is written in PHP and is GPL licensed. They are bound to php4 and an older MySQL version, however it should be feasible to refactor that feature into Drupal.

This happens to be a blocking feature for certain corporate sites(including mine) adopting Open Source CMSs.

John98’s picture

has this feature. It is written in PHP and is GPL licensed. They are bound to php4 and an older MySQL version, however it should be feasible to refactor that feature into Drupal.
. . . . . . .

This happens to be a blocking feature for certain corporate sites(including mine) adopting Open Source CMSs.

holloway’s picture

Hi!

After checking back a few times to see if there was any interest I gave up so it's great to see all these posts. So yes, I'm still interested. I'm not that familiar with Drupal but I've written a few plugins for other CMSs (Python-based ones) and it seems that because Docvert is a REST-style web service we could add it to Drupal like that. I have a very small PHP http-client library (it does WebDAV and HTTP POST/GET) if drupal doesn't have one.

These are all just guesses though and I suppose we should talk about our general approach to this. I don't know Drupal so I'd been keen on your opinions.

(ps. I used to come here when people were leaving Slashdot for Kuro5hin/Drupal -- it's amazing to see how far Drupal has come)

markDrupal’s picture

Is anyone actively working on this project or made a module of it? I would like to view the progress, test, or offer the basic coding skills that i have. This is defiantly a needed addition to Drupal.

holloway’s picture

Hi Mark, I emailed pearcc a while back about this and he got back to me saying that he was busy -- he was going to email me again when he had some free time.

Nothing has happened yet, so if anyone is familiar with Drupal and wants to work on an MS Word import module then let me know. Email me on docvert@holloway.co.nz , Cheers

StevenSokulski’s picture

Would it not be a bad idea to start with some simple functionality such as this:

1. A user chooses a content type like "uploaded file" to create.
2. They attach a text file (txt, doc, rtf, etc.) and Drupal uses the document name (minus extension) as the title and the document content as the node body.
3. Form fields on the create content page allow for the user to select taxonomy and other options. -or- some sort of simple markup allows for taxonomy and other things to be imported from the text file.

schildi’s picture

Hello

did you have a look at
http://drupal.org/node/152647 ?

Regards
Schildi

holloway’s picture

If anyone who knows Drupal wants to work on this plugin please email me on docvert@holloway.co.nz . Despite some initial interest it didn't seem to go anywhere, so if anyone is keen let me know, cheers!

pearcec@drupal.org’s picture

Looks like there is some traction in a different location:

http://groups.drupal.org/node/9929