@KarelG's answer is the best answer, though I had specific issues with certain non-standard domains. The example issue is self contained below.
For certain "real world" input values, I had to add a check to the URI scheme to avoid mis-parsing of some addresses. This is the changed code.
import java.net.URI;
import java.net.*;
public class Domain {
public static String getDomainName(String url) {
try {
URI uri = new URI(url);
String domain = uri.getHost();
System.out.println("domain: " + domain);
if (uri.getScheme() != null) {
return domain.startsWith("www.") ? domain.substring(4) : domain;
} else {
return uri.getSchemeSpecificPart();
}
} catch (Exception e) {
e.printStackTrace();
return null;
}
}
}
Below is the test case and value that was failing.
Domain.java
javac Domain.java
java Domain
import java.net.URI;
import java.net.*;
import java.io.*;
public class Domain {
public static String longname = "https://www.3dprintingmedia.network/longform/page.html";
public static String name = "www.3dprintingmedia.network";
public static void getDomain(String url) {
try {
URI uri = new URI(url);
String domain = uri.getHost();
System.out.println("protocol: " + uri.getScheme());
System.out.println("path: " + uri.getPath());
System.out.println("name: " + name);
System.out.println("domain: " + domain);
System.out.println(domain.startsWith("www.") ? domain.substring(4) : domain);
} catch (Exception e) {
e.printStackTrace();
}
}
public static void main(String[] args) {
System.out.println("Parsing domain: " + name);
getDomain(longname);
getDomain(name);
System.exit(0);
}
}