I am parsing the files in json format where i access through
http://api.crossref.org/works?rows=1000&offset=6000000
There are around 71 million distinct publication records and the link i gave is only presents 1000. In my java code gson parser mostly works well but sometimes it gives an error as following:
com.google.gson.JsonSyntaxException: com.google.gson.stream.MalformedJsonException: Unterminated string at line 2 column 1
at com.google.gson.internal.Streams.parse(Streams.java:56)
at com.google.gson.JsonParser.parse(JsonParser.java:84)
at com.google.gson.JsonParser.parse(JsonParser.java:59)
at com.google.gson.JsonParser.parse(JsonParser.java:45)
at connectingurl.CrossRefFullTextToTable.<init>(CrossRefFullTextToTable.java:84)
at connectingurl.CrossRefFullTextToTable.main(CrossRefFullTextToTable.java:181)
Caused by: com.google.gson.stream.MalformedJsonException: Unterminated string at line 2 column 1
at com.google.gson.stream.JsonReader.syntaxError(JsonReader.java:1310)
at com.google.gson.stream.JsonReader.nextString(JsonReader.java:1046)
at com.google.gson.stream.JsonReader.nextValue(JsonReader.java:784)
at com.google.gson.stream.JsonReader.objectValue(JsonReader.java:766)
at com.google.gson.stream.JsonReader.peek(JsonReader.java:380)
at com.google.gson.internal.bind.TypeAdapters$25.read(TypeAdapters.java:635)
at com.google.gson.internal.bind.TypeAdapters$25.read(TypeAdapters.java:658)
at com.google.gson.internal.bind.TypeAdapters$25.read(TypeAdapters.java:650)
at com.google.gson.internal.bind.TypeAdapters$25.read(TypeAdapters.java:658)
at com.google.gson.internal.bind.TypeAdapters$25.read(TypeAdapters.java:658)
at com.google.gson.internal.bind.TypeAdapters$25.read(TypeAdapters.java:633)
at com.google.gson.internal.Streams.parse(Streams.java:44)
As i searched for it, most probably there might be a memory issue.
Here is part of my java code:
if(code == 200){
String full_text_link = "";
String license = "";
BufferedReader in = new BufferedReader(new InputStreamReader(yc.getInputStream(), "UTF-8"));
StringBuilder sb = new StringBuilder();
String inputLine;
while ((inputLine = in.readLine()) != null) {
sb.append(inputLine + System.getProperty("line.separator"));
}
in.close();
String jsonLine = sb.toString();
JsonElement jelement = new JsonParser().parse(jsonLine);
JsonObject jobject = jelement.getAsJsonObject();
jobject = jobject.getAsJsonObject("message");
JsonArray jarray_items = jobject.getAsJsonArray("items");
...
}
The error comes from the line:
JsonElement jelement = new JsonParser().parse(jsonLine);
It seems gson might have such problems. is there any other suggestion to use another parser or is there something that i should add my code?
Thanks in advance...