Questions tagged [doc]

Questions about the old Microsoft Word file format and how to use it.

The "doc" format is a proprietary file format that Microsoft used as its main format for text documents from the 1980s until 2007. Since the release of Word 2007, "docx" has been the main file format.

All questions about older (pre-.docx) Word files should use this tag. Questions about .docx files can use this tag, but should primarily use "docx". Questions about .odf, .txt, or .rdf files should not use this tag.

Wikipedia on "doc": http://en.wikipedia.org/wiki/DOC_%28computing%29

File format description: http://sc.openoffice.org/compdocfileformat.pdf

851 questions
113 votes, 16 answers

Is there a Java API that can create rich Word documents?

I have a new app I'll be working on where I have to generate a Word document that contains tables, graphs, a table of contents and text. What's a good API to use for this? How sure are you that it supports graphs, ToCs, and tables? What are some…
billjamesdev
111 votes, 12 answers

How can doc/docx files be converted to markdown or structured text?

Is there a program or workflow to convert .doc or .docx files to Markdown or similar text? PS: Ideally, I would welcome the option that a specific font (e.g. consolas) in the MS Word document will be rendered to text-code: ```....```.
Lorenz Lo Sauer
59 votes, 6 answers

How to extract just plain text from .doc & .docx files?

Can anyone recommend a way to extract just the plain text from a .doc or .docx file? I've found this, but wondered whether there are any other suggestions.
docextract
53 votes, 4 answers

Should I use @return self, this or the current class?

I have a method that returns the current object. How do I document this? /** * set something * * @return this */ public function setSomething(){ // ... return $this; } Or should I do @return self or @return…
lucaswxp
42 votes, 3 answers

How to use the Clojure doc function?

I'm just starting with Clojure and can't access the doc function. I'm using Clojure 1.3 with emacs24 and swank-clojure. user> *clojure-version* {:major 1, :minor 3, :incremental 0, :qualifier nil} But when I try: (doc doc) I get: Unable to…
zaforas
31 votes, 9 answers

How do I see the Python doc on Linux?

On Windows, Python ships its documentation as a .chm file, which is very convenient to read. Is there an equivalent document I can read on Linux?
Tanky Woo
30 votes, 6 answers

Looking up C++ documentation inside of Vim

I code in C++ using Vim. Oftentimes, I find myself wasting time (and breaking flow) looking up trivial things like: does std::string.substring take (start, length) or (start, end)? This often results in: open browser; wait; search on Google;…
anon
29 votes, 1 answer

Can Git be used for version control on non-text documents such as Word doc or xls files?

I've been learning about the Git version control system recently, and it seems to work very well for plain text documents, as you can add a single line, go back and fork it, revise the document and remove the line you just added. I program mostly in…
yoshiserry
28 votes, 4 answers

Convert doc to txt via command line

We're searching for a program that allows us to convert a doc or docx document to a txt file. We're working with Linux and want to start a website that converts user-uploaded doc files. We don't want to use OpenOffice/LibreOffice because we have bad…
user698601
25 votes, 7 answers

Upload DOC or PDF using PHP

I'm able to upload images fine, but when I change the types from image/jpg, image/gif to application/msword and application/pdf, it doesn't work. Here's my code. The exact same code works for images, but for uploading docs and PDFs, it outputs…
user961627
24 votes, 7 answers

Python: Convert PDF to DOC

How can I convert a PDF file to docx? Is there a way of doing this using Python? I've seen some pages that let a user upload a PDF and return a DOC file, like PdfToWord. Thanks in advance.
AlvaroAV
23 votes, 1 answer

How to read a Doc or Docx file in Java?

I want to read a Word file in Java: import org.apache.poi.poifs.filesystem.*; import org.apache.poi.hpsf.DocumentSummaryInformation; import org.apache.poi.hwpf.*; import org.apache.poi.hwpf.extractor.*; import…
Ninja
22 votes, 6 answers

Maven javadoc Search redirects to "/undefined/.." url

I have generated Javadoc with the javadoc Maven plugin 3.0.1 and Java 9. However, when I use the new Search capability and pick a class, it redirects to "File not found"... There is "undefined" in the URL (e.g.…
Radoslav Ivanov
22 votes, 6 answers

Nodejs: Convert Doc to PDF

I found some repos that do not look as if they are still maintained: https://github.com/gfloyd/node-unoconv https://github.com/skmp/node-msoffice-pdf ... I tried the approach with LibreOffice, but the PDF output is so bad that it is not usable…
Andi Giga
22 votes, 4 answers

Using Vim to edit Microsoft Word files

I've found ViEmu, a vi emulator for Microsoft Word. However, I wanted to use Vim to edit DOC or even RTF files. Is this possible? Are there any other formats that preserve page/paragraph layout and are compatible with both Microsoft Word and Vim? I am also…
Kilon