As I need to proceed many pdfs with different styles, I have an assumptions that the main content will be under the most appeared/common span style.
Is there a way to find the most appeared span style in beautifulsoup python?
This is a command I used to find a specific span style:
font-family: CBCDEE+ArialMT;
font-size:12px':
spans = soup.find_all('span',
attrs={'style': 'font-family: CBCDEE+ArialMT; font-size:12px'})`
Any ways to find the most appeared/common one? or basically, is there a way to have the span style list and count the appearance of different styles?
Many thanks.