Basically, I want to parse all emails body from my gmail using Python. There are few solutions on StackOverflow e.g. the closest I found is here where he use poplib
from Python to parse all emails. Another one is this one where he use .get_payload()
to get email body
However, when I use parse email using poplib
(as from above example) then each email message I do message.get_payload()
to get email body, the string I get is not the same as html from email itself i.e. it has string like 3D
, = \n
, amp;
etc. in my parsed email body.
I wonder if it's a problem with POP
protocol so I need to use other protocol such as imaplib
(but I don't know how to parse all email like poplib
). Or, I can modify given example to get correct email body.