I am trying to use python and nltk to parse some doctor notes that describe a medication prescription. I'm looking for a method to identify a numerical value for # items taken and how often the items are taken.
1 TABLET DAILY
TAKE 1 TABLET DAILY
ONE TABLET TWICE DAILY
2 DAILY
TWO TABLETS DAILY
ONE PILL AT BEDTIME
1/2 PILL TWICE DAILY
ROLLING WALKER WITH SEAT ATTACHMENT AND HAND BRAKES
ONE PILL DAILY
1 TAB PO DAILY
ONE PILL TWICE A DAY WITH MEALS AS NEEDED
1 TABLET TWICE DAILY
300 MG BID
ONE DAILY
1 TABLET 3 TIMES DAILY AS NEEDED
1 DAILY
TAKE 1 CAPSULE BY MOUTH 4 (FOUR) TIMES A DAY.
1 TABLET EVERY 4 TO 6 HOURS AS NEEDED
1 TABLET BY MOUTH TWICE DAILY
INJECT 34 U TWICE A DAY
Any advice?