1

Is there a .NET component which can convert docx to HTML (e.g. as per Word->SaveAs->HTMLPage) but without MS Office installed?

Justin
flost
  • Does it have to be a .NET component? OpenOffice can do it: it runs on Windows and has a headless mode (i.e. it is usable from the command line) – Pekka Sep 27 '11 at 06:58
  • This is answered here. http://stackoverflow.com/questions/4824619/batch-conversion-of-docx-to-clean-html – Chris Rae Jan 31 '14 at 19:41
  • Mammoth.js (https://github.com/mwilliamson/mammoth.js) can be a good option for basic conversion – Yogen Darji Dec 17 '19 at 06:39
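
The headless route from the first comment could be scripted roughly as follows. This is a minimal sketch, not a tested recipe: it assumes LibreOffice's `soffice` binary is on the PATH and uses its `--headless`, `--convert-to` and `--outdir` switches (Apache OpenOffice may not accept the same flags). The function names are hypothetical.

```python
import shutil
import subprocess
from pathlib import Path


def build_convert_command(docx_path, out_dir, binary="soffice"):
    """Return the argument list for a headless docx -> HTML conversion.

    Assumes LibreOffice-style command-line switches.
    """
    return [
        binary,
        "--headless",            # no GUI, suitable for servers and scripts
        "--convert-to", "html",  # target format
        "--outdir", str(out_dir),
        str(docx_path),
    ]


def convert_docx_to_html(docx_path, out_dir, binary="soffice"):
    """Run the conversion and return the path where the HTML is expected."""
    if shutil.which(binary) is None:
        raise FileNotFoundError(f"{binary!r} was not found on PATH")
    subprocess.run(build_convert_command(docx_path, out_dir, binary), check=True)
    # The converter writes <input name>.html into out_dir.
    return Path(out_dir) / (Path(docx_path).stem + ".html")
```

From .NET the same command line could be launched as an external process, so no Office installation or interop is involved; the trade-off is a dependency on a separately installed office suite.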

2 Answers

0

You may want to try Mammoth, a docx-to-HTML library. This answer is seven years too late, it seems :) but anyway, it is a library for displaying .docx documents by converting them to HTML, and it can run in the browser as well as on the backend.

0

You should look at Aspose.Words.

Their library converts between various document formats in code, including DOC/DOCX to HTML. The API is simple and intuitive, and the libraries are under very active development.

SaguiItay
  • Do you know of any similar tools? I need the opposite direction, HTML to DOCX/PDF, but Aspose.Words is too expensive for the budget I got from my boss. – Boas Enkler Dec 06 '11 at 17:21
  • I'm not familiar with specific HTML to PDF libraries, though I'm sure there are a few. HTML to DOCX is something you can build yourself if you would like to save money (at the price of writing your own code); you just have to learn the DOCX format. – SaguiItay Dec 07 '11 at 10:06
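
To give a feel for what "learning the DOCX format" involves: a .docx file is a ZIP archive of XML parts. The sketch below writes what I believe is close to the smallest usable package, using only the Python standard library. It assumes the text has already been extracted from the HTML as plain paragraphs (the HTML parsing step is left out), it handles no formatting at all, and `build_docx` and `build_document_xml` are hypothetical names.

```python
import io
import zipfile
from xml.sax.saxutils import escape

# Declares the content type of each part in the package.
CONTENT_TYPES = (
    '<?xml version="1.0" encoding="UTF-8" standalone="yes"?>'
    '<Types xmlns="http://schemas.openxmlformats.org/package/2006/content-types">'
    '<Default Extension="rels" '
    'ContentType="application/vnd.openxmlformats-package.relationships+xml"/>'
    '<Default Extension="xml" ContentType="application/xml"/>'
    '<Override PartName="/word/document.xml" '
    'ContentType="application/vnd.openxmlformats-officedocument.'
    'wordprocessingml.document.main+xml"/>'
    '</Types>'
)

# Points the package at its main document part.
RELS = (
    '<?xml version="1.0" encoding="UTF-8" standalone="yes"?>'
    '<Relationships '
    'xmlns="http://schemas.openxmlformats.org/package/2006/relationships">'
    '<Relationship Id="rId1" '
    'Type="http://schemas.openxmlformats.org/officeDocument/2006/'
    'relationships/officeDocument" Target="word/document.xml"/>'
    '</Relationships>'
)

W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def build_document_xml(paragraphs):
    """Wrap each string in a paragraph (w:p) holding one run (w:r) of text."""
    body = "".join(
        f'<w:p><w:r><w:t xml:space="preserve">{escape(text)}</w:t></w:r></w:p>'
        for text in paragraphs
    )
    return (
        '<?xml version="1.0" encoding="UTF-8" standalone="yes"?>'
        f'<w:document xmlns:w="{W_NS}"><w:body>{body}</w:body></w:document>'
    )


def build_docx(paragraphs):
    """Return the bytes of a minimal .docx containing the given paragraphs."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        archive.writestr("[Content_Types].xml", CONTENT_TYPES)
        archive.writestr("_rels/.rels", RELS)
        archive.writestr("word/document.xml", build_document_xml(paragraphs))
    return buffer.getvalue()
```

Writing `build_docx(["First paragraph", "Second paragraph"])` to a file named `out.docx` should give something a word processor can open. Headings, bold text, lists, tables and images each need more WordprocessingML markup, which is where most of the effort of a home-grown converter would go.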