I am trying to capture the h1, h2, and h3 tags for the following HTML pages, but H3 is only returned on the first URL and doesn't return for the second URL.
URL (returns H3) = https://docs.paloaltonetworks.com/prisma/prisma-access/prisma-access-panorama-release-notes/prisma-access-about/features-in-prisma-access
URL (doesn't return H3) = https://docs.paloaltonetworks.com/pan-os/10-2/pan-os-admin/authentication/configure-multi-factor-authentication/configure-mfa-between-rsa-securid-and-firewall
String url = "https://docs.paloaltonetworks.com/pan-os/10-2/pan-os-admin/authentication/configure-multi-factor-authentication/configure-mfa-between-rsa-securid-and-firewall";
try {
Document html = Jsoup.connect(url).userAgent("Mozilla").get();
Elements hTags = html.select("h1,h2,h3");
System.out.println(hTags);
} catch (IOException e) {
System.out.println("In exception " + e);
throw new RuntimeException(e);
}
If I View Page Source for both HTML files, the H3 headers do not show up, however, both HTML pages show the H3 headers when I inspect the page. Any help would be appreciated.