TreeWalker does not traverse documents</a></h1> </div> <div class="grid fw-wrap pb8 mb16 bb bc-black-075"> <div class="grid--cell ws-nowrap mr16 mb8" title="2016-01-12 19:07:53Z"> <span class="fc-light mr2">Asked</span> <time itemprop="dateCreated" datetime="2014-01-28T10:37:01.693" class="fromnow">Jan 28 '14 at 10:37</time> </div> <div class="grid--cell ws-nowrap mr16 mb8"> <span class="fc-light mr2">Active</span> <time class="fromnow" title="2018-10-29T15:16:17.107" datetime="2018-10-29T15:16:17.107">Oct 29 '18 at 15:16</a> </div> <div class="grid--cell ws-nowrap mb8" title="Viewed 496 times"> <span class="fc-light mr2">Viewed</span> 496 times </div> </div> <div id="mainbar" role="main" aria-label="questions and answers"> <div id="question" class="question" data-questionid="21403230" data-ownerid="1149773" data-score="0"> <div class="post-layout"> <div class="votecell post-layout--left"> <div class="js-voting-container grid jc-center fd-column ai-stretch gs4 fc-black-200" data-post-id="21403230"> <button class="js-vote-up-btn grid--cell s-btn s-btnunset c-pointer"><svg aria-hidden="true" class="m0 svg-icon iconArrowUpLg" width="36" height="36" viewBox="0 0 36 36"><path d="M2 26h32L18 10 2 26z"></path></svg></button> <div class="js-vote-count grid--cell fc-black-500 fs-title grid fd-column ai-center" itemprop="upvoteCount" data-value="0">0</div> <button class="js-bookmark-btn s-btn s-btnunset c-pointer py4"> <svg aria-hidden="true" class="svg-icon iconBookmark" width="18" height="18" viewBox="0 0 18 18"><path d="M6 1a2 2 0 00-2 2v14l5-4 5 4V3a2 2 0 00-2-2H6zm3.9 3.83h2.9l-2.35 1.7.9 2.77L9 7.59l-2.35 1.7.9-2.76-2.35-1.7h2.9L9 2.06l.9 2.77z"></path></svg> <div class="js-bookmark-count mt4" data-value=""></div> </button> </div> </div> <div class="postcell post-layout--right"> <div class="s-prose js-post-body" itemprop="text"><p>I'm writing a script to retrieve text nodes (and other related elements) from an HTML document. 
Based on <a href="../../a/2579869#2579869">this answer</a>, I was using the following. (The definition for the <code>acceptTextNode</code> function is omitted for simplicity.)</p> <pre><code>var textNodes = []; var treeWalker = document.createTreeWalker( rootNode, NodeFilter.SHOW_ALL, { acceptNode: acceptTextNode }); while (treeWalker.nextNode()) textNodes.push(treeWalker.currentNode); </code></pre> <p>However, I discovered that this approach fails when the document contains other documents nested within <code><iframe></code> elements, such as the "Compose" facility in Outlook.com. (Assume that the domains of the <code><iframe></code> documents are the same as the parent document.)</p> <p>I managed to work around the issue by retrieving the descendant documents manually, using <code>getElementsByTagName</code>:</p> <pre><code>var textNodes = []; var rootNodes = [ rootNode ]; for (var i = 0; i < rootNodes.length; i++) { if (rootNodes[i].getElementsByTagName) { var childFrames = rootNodes[i].getElementsByTagName("iframe"); for (var j = 0; j < childFrames.length; j++) if (childFrames[j].contentDocument) rootNodes.push(childFrames[j].contentDocument); } } for (var i = 0; i < rootNodes.length; i++) { var treeWalker = document.createTreeWalker( rootNodes[i], NodeFilter.SHOW_ALL, { acceptNode: acceptTextNode }); while (treeWalker.nextNode()) textNodes.push(treeWalker.currentNode); } </code></pre> <p>However, this feels like a hack, since it's combining manual traversal with the built-in <code>TreeWalker</code>. 
Is there a better approach?</p></div> <div class="mt24 mb12"> <div class="post-taglist grid gs4 gsy fd-column"> <div class="grid ps-relative"> <a href="../../questions/tagged/javascript" class="post-tag js-gps-track" title="show questions tagged 'javascript'" rel="tag">javascript</a> <a href="../../questions/tagged/dom" class="post-tag js-gps-track" title="show questions tagged 'dom'" rel="tag">dom</a> <a href="../../questions/tagged/iframe" class="post-tag js-gps-track" title="show questions tagged 'iframe'" rel="tag">iframe</a> </div> </div> </div> <div class="mb0"> <div class="mt16 grid gs8 gsy fw-wrap jc-end ai-start pt4 mb16"> <div class="grid--cell mr16 fl1 w96"></div> <div class="post-signature grid--cell"> <div class="s-user-card s-user-card"> <time class="s-user-card--time" datetime="edited May 23 '17 at 12:03">edited May 23 '17 at 12:03</time> <a href="../../users/-1/community" class="s-avatar s-avatar32 s-user-card--avatar"> <img class="s-avatar--image" src="../../users/profiles/-1.webp" data-jdenticon-width="32" data-jdenticon-height="32" data-jdenticon-value="Community" /> </a> <div class="s-user-card--info"> <a href="../../users/-1/community" class="s-user-card--link">Community</a> <ul class="s-user-card--awards"> <li class="s-user-card--rep" title="reputation score">1</li> <li class="s-award-bling s-award-blingsilver" title="1 silver badges">1</li> </ul> </div> </div> </div> <div class="post-signature owner grid--cell"> <div class="s-user-card s-user-card"> <time class="s-user-card--time" datetime="asked Jan 28 '14 at 10:37">asked Jan 28 '14 at 10:37</time> <a href="../../users/1149773/douglas" class="s-avatar s-avatar32 s-user-card--avatar"> <img class="s-avatar--image" src="../../users/profiles/1149773.webp" data-jdenticon-width="32" data-jdenticon-height="32" data-jdenticon-value="Douglas" /> </a> <div class="s-user-card--info"> <a href="../../users/1149773/douglas" class="s-user-card--link">Douglas</a> <ul class="s-user-card--awards"> <li 
class="s-user-card--rep" title="reputation score">53,759</li> <li class="s-award-bling s-award-blinggold" title="13 gold badges">13</li> <li class="s-award-bling s-award-blingsilver" title="140 silver badges">140</li> <li class="s-award-bling s-award-blingbronze" title="188 bronze badges">188</li> </ul> </div> </div> </div> </div> </div> </div> <div class="post-layout--right js-post-comments-component"> <div id="comments-21403230" class="comments js-comments-container bt bc-black-075 mt12 " data-post-id="21403230" data-min-length="15"> <ul class="comments-list js-comments-list" data-remaining-comments-count="0" data-canpost="false" data-cansee="true" data-comments-unavailable="false" data-addlink-disabled="true"> <li id="comment-32284745" class="comment js-comment " data-comment-id="32284745" data-comment-owner-id="1256925" data-comment-score="1"> <div class="js-comment-actions comment-actions"> <div class="comment-score js-comment-edit-hide"> <span title="number of 'useful comment' votes received" class="warm">1</span> </div> </div> <div class="comment-text js-comment-text-and-form"> <a name="comment32284745_21403230"></a> <div class="comment-body js-comment-edit-hide"> <span class="comment-copy">Iframes are sandboxed, so this is supposed to happen. 
That's not an answer to your question, but just an FYI on why this happens</span> – <a href="../../users/1256925/joeytje50" title="18,636 reputation" class="comment-user ">Joeytje50</a> <span class="comment-date" dir="ltr"><a class="comment-link" href="../../questions/21403230/treewalker-does-not-traverse-iframe-documents#comment32284745_21403230"><span title="2014-01-28T10:40:22.767 License: CC BY-SA 3.0" class="relativetime-clean">Jan 28 '14 at 10:40</span></a></span> </div> </div> </li> <li id="comment-32284950" class="comment js-comment " data-comment-id="32284950" data-comment-owner-id="1149773" data-comment-score="0"> <div class="js-comment-actions comment-actions"> <div class="comment-score js-comment-edit-hide"> </div> </div> <div class="comment-text js-comment-text-and-form"> <a name="comment32284950_21403230"></a> <div class="comment-body js-comment-edit-hide"> <span class="comment-copy">@Joeytje50: I can understand the rationale you provide. On the other hand, I'm not sure I agree that it should apply for same-domain documents. `<iframe>` elements are commonly (ab)used as a way of structuring the visual layout of a page (like in Outlook.com). A developer seeking to get all the text nodes (or other category of nodes) for a page would typically want to traverse the `<iframe>` documents too. – Douglas Jan 28 '14 at 10:45

score 0 · Answer 1 · answered Oct 29 '18 at 15:16

Maybe you could create a separate TreeWalker for each iframe: call createTreeWalker on the iframe's own document and use that document's body as the root. For the first frame, for example:

 var treeWalker = window.frames[0].document.createTreeWalker(
    window.frames[0].document.body,
    NodeFilter.SHOW_ALL,
    { acceptNode: acceptTextNode });

Do the same for each of the other iframes.
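To apply this to every iframe rather than only the first, the per-frame walker can be combined with the document collection from the question. The following is only a rough sketch under some assumptions: the helper names collectDocuments and collectTextNodes are made up for illustration, every frame is assumed to be same-origin (contentDocument is null otherwise, and such frames are skipped), and NodeFilter.SHOW_TEXT is swapped in for SHOW_ALL on the assumption that only text nodes are wanted.

```javascript
// Hypothetical helper: gather the root document plus every reachable
// same-origin iframe document, including nested ones (breadth-first).
function collectDocuments(rootDoc) {
  const docs = [rootDoc];
  for (let i = 0; i < docs.length; i++) {
    for (const frame of docs[i].getElementsByTagName("iframe")) {
      // contentDocument is null for cross-origin frames, so those are skipped.
      if (frame.contentDocument) docs.push(frame.contentDocument);
    }
  }
  return docs;
}

// Hypothetical helper: run one TreeWalker per collected document, created
// from that document and rooted at its body, and pool the accepted nodes.
function collectTextNodes(rootDoc, acceptNode) {
  const textNodes = [];
  for (const doc of collectDocuments(rootDoc)) {
    if (!doc.body) continue; // nothing to walk in this document
    const walker = doc.createTreeWalker(
      doc.body,
      NodeFilter.SHOW_TEXT, // assumption: only text nodes are of interest
      acceptNode ? { acceptNode } : null);
    while (walker.nextNode()) textNodes.push(walker.currentNode);
  }
  return textNodes;
}
```

In a page this might be called as collectTextNodes(document, acceptTextNode), with acceptTextNode being the filter function from the question. It is still a manual traversal of the frames, since a TreeWalker does not descend into an iframe's document.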
