I have a Servlet as simple as this:
public class MyServlet extends HttpServlet {
protected void doGet(HttpServletRequest request, HttpServletResponse response) throws ServletException, IOException {
request.setAttribute("processedName", request.getParameter("name"));
request.setAttribute("processedQueryString", request.getQueryString());
request.getRequestDispatcher("index.jsp").forward(request, response);
}
protected void doPost(HttpServletRequest req, HttpServletResponse resp) throws ServletException, IOException {
doGet(req, resp);
}
}
My index.jsp
is like this:
<%@ page language="java" contentType="text/html; charset=ISO-8859-1" pageEncoding="ISO-8859-1"%>
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
<title>Title</title>
</head>
<body>
<form action="MyServlet" method="get">
Name: <input type="text" name="name">
<input type="submit" value="Submit">
</form>
<br>
After: <%=request.getAttribute("processedName") %>
<br>
Query String: <%=request.getAttribute("processedQueryString") %>
</body>
</html>
If I type Thomás
in the form and submit it, Thom?s
is printed.
Both the URL in browser and the query string in JSP show name=Thom%E1s
The browser is correctly encoding the ISO-8859-1 á
char as %E1
, but for some reason the .getParameter
is not decoding it correctly.
I've seen tons of posts like this here when people try to submit non ISO-8859-1 chars (unlike á
). Shouldn't Tomcat 8 use ISO-8859-1 as default for decoding?
Strangely, if I put accept-charset="UTF-8"
in my form, it works. If I change the method to method="post"
it works. If I mix both the accept-charset="UTF-8"
and method="post"
it doesn't work (but now Thomás
is printed!).
It's as if Tomcat is waiting UTF-8 in get method and ISO-8859-1 in post method.