Why does FinalReleaseComObject cause "(InteropProgram) has stopped working"?

Question

I'm trying to read text and images from a Word document and close it. The problem is trying to close it without Word encountering any issues OR creating multiple WINWORD.exe instances. My problem is that when I call Marshal.FinalReleaseComObject(app); on the Word.ApplicationClass, Word fires a generic exception provided by Windows ("Word has stopped working"). I have read many of the solutions in How do I properly clean up Excel interop objects? and implemented the recommendations, but I still have the issue.

Here is my code. I am only reading one Word file with one page (you may want to skip to "// Cleanup:" where the exception occurs).

    private byte[] GetDocumentText(byte[] wordBytes, string path)
    {
        // Save bytes to word file in temp dir, open, copy info. Then delete the temp file after.

        object x = Type.Missing;
        string ext = Path.GetExtension(path).ToLower();
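        // Build the temp path directly: Path.GetTempFileName() would also create an empty .tmp file that is never deleted.
        string tmpPath = Path.ChangeExtension(Path.Combine(Path.GetTempPath(), Path.GetRandomFileName()), ext);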
        File.WriteAllBytes(tmpPath, wordBytes);

        // Open temp file with Word Interop:
        Word.ApplicationClass app = new Word.ApplicationClass();
        Word.Documents docs = app.Documents;
        Word.Document doc = docs.Open(tmpPath, x, x, x, x, x, x, x, x, x, x, x, x, x, x);

        doc.ActiveWindow.Selection.WholeStory();
        doc.ActiveWindow.Selection.Copy();
        IDataObject data = Clipboard.GetDataObject();
        string documentText = data.GetData(DataFormats.Text).ToString();

        // Add text to pages.
        byte[] wordDoc = null;
        using (MemoryStream myMemoryStream = new MemoryStream())
        {
            Document myDocument = new Document();
            PdfWriter myPDFWriter = PdfWriter.GetInstance(myDocument, myMemoryStream); // REQUIRED.
            PdfPTable table = new PdfPTable(1);
            myDocument.Open();

            // Create a font that will accept unicode characters.
            BaseFont bfArial = BaseFont.CreateFont(@"C:\Windows\Fonts\Arial.ttf", BaseFont.IDENTITY_H, BaseFont.EMBEDDED);
            Font arial = new Font(bfArial, 12);

            // If an Arabic or Hebrew character is found, render documentText right-to-left.
            PdfPCell page = new PdfPCell(new Paragraph(documentText, arial)) { Colspan = 1 };
            Match rgx = Regex.Match(documentText, @"\p{IsArabic}|\p{IsHebrew}");
            if (rgx.Success) page.RunDirection = PdfWriter.RUN_DIRECTION_RTL;

            table.AddCell(page);

            // Add image to document (Not in order with text...)
            foreach (Word.InlineShape ils in doc.InlineShapes)
            {
                if (ils != null && ils.Type == Word.WdInlineShapeType.wdInlineShapePicture)
                {
                    PdfPCell imageCell = new PdfPCell();
                    ils.Select();
                    doc.ActiveWindow.Selection.Copy();
                    System.Drawing.Image img = Clipboard.GetImage();
                    byte[] imgb = null;
                    using (MemoryStream ms = new MemoryStream())
                    {
                        img.Save(ms, System.Drawing.Imaging.ImageFormat.Jpeg);
                        imgb = ms.ToArray();
                    }

                    Image wordPic = Image.GetInstance(imgb);
                    imageCell.AddElement(wordPic);
                    table.AddCell(imageCell);
                }
            }

            myDocument.Add(table);
            myDocument.Close();
            myPDFWriter.Close();
            wordDoc = myMemoryStream.ToArray();
        }

        // Cleanup:
        Clipboard.Clear();

        (doc as Word._Document).Close(Word.WdSaveOptions.wdDoNotSaveChanges, x, x);
        Marshal.FinalReleaseComObject(doc);
        Marshal.FinalReleaseComObject(docs);
        (app as Word._Application).Quit(x, x, x);
        Marshal.FinalReleaseComObject(app); // Word encounters exception here.

        doc = null;
        docs = null;
        app = null;
        GC.Collect();
        GC.WaitForPendingFinalizers();
        GC.Collect();
        GC.WaitForPendingFinalizers();

        try { File.Delete(tmpPath); }
        catch { }

        return wordDoc;
    }

This doesn't always happen the first time I read the file. When I read it a second or third time, I usually get the error.

Is there any way I can prevent the error from showing?

score 1 · Accepted Answer · edited May 23 '17 at 11:43

Seeing this crash is fairly unusual; Word normally knows how to deal with this kind of sledgehammer approach to memory management. Nevertheless, it is a very bad practice, best described by this blog post from the Visual Studio team. It is worth a complete read; the "silent assassin" section is the most relevant.

Calling GC.Collect is enough to get all the COM references released, no additional help is required. That however doesn't work if you run your program with the debugger attached. This answer explains why.

To get GC.Collect() to work in the debugger as well, you need to move it into a separate method so that the debugger can't keep the references alive. That's easiest done like this:

    private byte[] GetDocumentText(byte[] wordBytes, string path) {
        var retval = GetDocumentTextImpl(wordBytes, path);
        GC.Collect();
        GC.WaitForPendingFinalizers();
        return retval;
    }

    private byte[] GetDocumentTextImpl(byte[] wordBytes, string path) {
        // etc...
    }

And move your original code into the GetDocumentTextImpl() method. Just delete all the Marshal and GC calls from the code since they are completely unnecessary. And dangerous.
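
The effect of the extra method can be checked without Word at all. The following is only a minimal sketch: `LifetimeDemo`, `AllocateInHelper` and `IsCollectedAfterReturn` are made-up names, and a plain `object` stands in for the COM wrapper. It tracks the object through a `WeakReference` and asks, after the helper has returned and the GC has run, whether it is still alive.

```csharp
using System;
using System.Runtime.CompilerServices;

static class LifetimeDemo
{
    // NoInlining keeps the allocation in its own stack frame, mimicking
    // the separate GetDocumentTextImpl() method (hypothetical stand-in).
    [MethodImpl(MethodImplOptions.NoInlining)]
    static WeakReference AllocateInHelper()
    {
        object payload = new object();      // stands in for a COM wrapper (RCW)
        return new WeakReference(payload);  // weak: does not keep payload alive
    }

    public static bool IsCollectedAfterReturn()
    {
        WeakReference tracker = AllocateInHelper();

        // The helper's frame is gone, so nothing in this method roots payload.
        GC.Collect();
        GC.WaitForPendingFinalizers();

        return !tracker.IsAlive;
    }

    static void Main()
    {
        Console.WriteLine("Collected after helper returned: " + IsCollectedAfterReturn());
    }
}
```

If the allocation and the GC.Collect() call lived in the same method instead, a Debug build or an attached debugger could keep the local alive until the end of that method, which is the situation the wrapper avoids.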

Your other answer about this subject is very well explained. Together with the Office interop recurring questions, it's gold! — acelent, Oct 18 '13 at 12:18
Hans, great article and response. I'm still having a couple of problems, but I don't think they're directly related to this issue, so I'm creating a separate question for them. Thanks! — Paul, Oct 18 '13 at 15:29

score 0 · Answer 2 · answered Oct 17 '13 at 22:38

You can try checking if IsObjectValid before calling FinalReleaseComObject.

Andra Ciorici

acelent · Answer 3 · 2013-10-18T12:30:57.367

You simply shouldn't use FinalReleaseComObject. It's a hammer for freeing an RCW when you know for sure that you're its only referrer (in .NET).

In this case, you drop the reference count of each RCW (doc, docs and app) all the way to zero, releasing not only the references you hold yourself but any others taken elsewhere in .NET.

Try ReleaseComObject instead, but note that this might be just as bad if e.g. there's still a .NET enumerator alive, in use and attached to one of the objects you're releasing from one of Word's collections.

Closing the document, quitting Word, setting the variables to null and GC'ing should be enough. Depending on the compiler, it may discard the variables from the stack and eliminate the code that sets them to null.
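
As a rough sketch of that cleanup order (close, quit, then collect, with no Marshal calls), something like the following might do. The class and method names are hypothetical, reading the text through `doc.Content.Text` is a stand-in for the clipboard code in the question, and the `Open`, `Close` and `Quit` calls simply mirror the argument lists used there, so they assume the same interop assembly version.

```csharp
using System;
using Word = Microsoft.Office.Interop.Word;

static class WordCleanupSketch
{
    // Hypothetical entry point: collect only after the method that
    // touched the COM objects has returned.
    public static string ReadAllText(string docPath)
    {
        string text = ReadAllTextImpl(docPath);
        GC.Collect();
        GC.WaitForPendingFinalizers();
        return text;
    }

    private static string ReadAllTextImpl(string docPath)
    {
        object x = Type.Missing;
        Word.ApplicationClass app = new Word.ApplicationClass();
        try
        {
            Word.Documents docs = app.Documents;
            // Same call shape as in the question (assumed to match your PIA).
            Word.Document doc = docs.Open(docPath, x, x, x, x, x, x, x, x, x, x, x, x, x, x);
            try
            {
                // Assumption: plain text via Range.Text instead of the clipboard.
                return doc.Content.Text;
            }
            finally
            {
                // Close the document even if reading it threw.
                (doc as Word._Document).Close(Word.WdSaveOptions.wdDoNotSaveChanges, x, x);
            }
        }
        finally
        {
            // Quit Word; the RCWs themselves are left to the garbage collector.
            (app as Word._Application).Quit(x, x, x);
        }
    }
}
```

The locals are never set to null here; they simply go out of scope when ReadAllTextImpl returns, which sidesteps the question of whether the compiler keeps those assignments.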
