quotation in string not respect by csv.read

Question

anyone can help with below?

(shlex.split or re could work, but no idea why codes below won't work)

s = 'hello, world, a, "b,c", d' 
list(csv.reader([s]))[0]

# ['hello', ' world', ' a', ' "b', 'c"', ' d'] - get this
# ['hello', ' world', ' a', 'b,c', ' d'] - i want this

as it marked as duplicated, but found the link can't answer the question, especially below which still some problem for quotation in csv.reader:

s3 = "self, c: hug.types.number, d='hello, world'"
list(csv.reader([s3], skipinitialspace=True))[0]

# ['self', 'c: hug.types.number', "d='hello", "world'"] - get this
# ['self', 'c: hug.types.number', "d='hello, world'"] - i want this

one more example added to show the case – Felix Liu Jun 02 '19 at 05:47 — Felix Liu, Jun 02 '19 at 05:47

score 0 · Answer 1 · answered Jun 02 '19 at 04:55

For the exact sample data you showed us, using re.split on the pattern ,\s+ would work:

s = 'hello, world, a, "b,c", d'
result = re.split(r',\s+', s)
print(result)

['hello', 'world', 'a', '"b,c"', 'd']

This answer hinges on that the CSV data contained inside double quotes would not have any whitespace along with the comma separator.

quotation in string not respect by csv.read

1 Answers1