21

I am working on a swing application using JAVA FX controls . In my application i have to take print out the html page displayed in the webview . What I am trying is to load the html content of webview in a string with the help of HtmlDocuement.

To load the content of html file from web view,I am using the following code but its not working:

try
{
    String str=webview1.getEngine().getDocment().Body().outerHtml();
}
catch(Exception ex)
{
}
adesh singh
  • 1,727
  • 9
  • 38
  • 70

2 Answers2

45
String html = (String) webEngine.executeScript("document.documentElement.outerHTML");
iTake
  • 4,082
  • 3
  • 33
  • 26
  • 4
    this one liner wont work if not in a worker. Will return empty html. It also won't work for a site like google.com. Won't return live DOM, only underlying html/javascript. – Salim Oct 02 '15 at 15:18
24

WebEngine.getDocument returns org.w3c.dom.Document, not JavaScript document which you expect judging by your code.

Unfortunately, printing out org.w3c.dom.Document requires quite a bit of coding. You can try the solution from What is the shortest way to pretty print a org.w3c.dom.Document to stdout?, see code below.

Note that you need to wait until the document is loaded before working with Document. This is why LoadWorker is used here:

public void start(Stage primaryStage) {
    WebView webview = new WebView();
    final WebEngine webengine = webview.getEngine();
    webengine.getLoadWorker().stateProperty().addListener(
            new ChangeListener<State>() {
                public void changed(ObservableValue ov, State oldState, State newState) {
                    if (newState == Worker.State.SUCCEEDED) {
                        Document doc = webengine.getDocument();
                        try {
                            Transformer transformer = TransformerFactory.newInstance().newTransformer();
                            transformer.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "no");
                            transformer.setOutputProperty(OutputKeys.METHOD, "xml");
                            transformer.setOutputProperty(OutputKeys.INDENT, "yes");
                            transformer.setOutputProperty(OutputKeys.ENCODING, "UTF-8");
                            transformer.setOutputProperty("{http://xml.apache.org/xslt}indent-amount", "4");

                            transformer.transform(new DOMSource(doc),
                                    new StreamResult(new OutputStreamWriter(System.out, "UTF-8")));
                        } catch (Exception ex) {
                            ex.printStackTrace();
                        }
                    }
                }
            });
    webengine.load("http://stackoverflow.com");
    primaryStage.setScene(new Scene(webview, 800, 800));
    primaryStage.show();
}
Sergey Grinev
  • 34,078
  • 10
  • 128
  • 141
  • Your method will retrieve missing content if the site has errors html ex : I am very concerned about this topic if you have a fix please please help – Ma Tâm Oct 24 '16 at 08:07