I have street address strings in different formats. I tried this old post, but it did not help much. My string formats are as follows:
format 1:
string_1 = ', landlord and tenant entered into a an agreement with respect to approximately 5,569 square feet of space in the building known as "the company" located at 788 e.7th street, st. louis, missouri 55605 ( capitalized terms used herein and not otherwise defined herein shall have the respective meanings given to them in the agreement); whereas, the term of the agreement expires on may 30, 2015;'
desired output:
788 e.7th street, st. louis, missouri 55605
format 2:
string_2 = 'first floor 824 6th avenue, chicago, il where the office is located'
desired output:
824 6th avenue, chicago, il
format 3:
string_3 = 'whose address is 90 south seventh street, suite 5400, dubuque, iowa, 55402.'
desired output:
90 south seventh street, suite 5400, dubuque, iowa, 55402
So far, I have tried this for string_1:
address_match_1 = re.findall(r'((\d*)\s+(\d{1,2})(th|nd|rd).*\s([a-z]))', string_1)
I get an empty list.
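A small check of why the pattern finds nothing in string_1, as a sketch on a shortened excerpt (the `snippet` variable is only for illustration): the pattern requires whitespace directly before the ordinal, but in `e.7th` the digit follows a dot.

```python
import re

# Shortened excerpt of string_1, used here only for illustration.
snippet = 'located at 788 e.7th street, st. louis, missouri 55605'

# Whitespace is required before the ordinal, so ".7th" is never reached.
strict = re.search(r'\s\d{1,2}(?:th|nd|rd)', snippet)

# Allowing a dot as well as whitespace before the ordinal finds it.
relaxed = re.search(r'[\s.]\d{1,2}(?:th|nd|rd)', snippet)

print(strict)
print(relaxed.group(0) if relaxed else None)
```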
For the second string I tried the same pattern and also get an empty list:
address_match_2 = re.findall(r'((\d*)\s+(\d{1,2})(th|nd|rd).*\s([a-z]))', string_2)
How can I match these addresses using re? They are all in different formats, so how can I also capture the suite in string_3? Any help would be appreciated.
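To make the goal concrete, here is a rough sketch of the kind of pattern I am after. It is not a general address parser, and it rests on assumptions that hold only for these three samples: the street type is `street` or `avenue`, the state is a single word, and the parts are separated by commas. The names `ADDRESS_RE` and `extract_address` are made up for this example.

```python
import re

# Assumed structure: number, street name, street type, optional suite,
# city, single-word state, optional 5-digit ZIP.
ADDRESS_RE = re.compile(
    r"""
    \d+\s+                    # house number
    [a-z0-9. ]+?              # street name, e.g. "e.7th" or "south seventh"
    \b(?:street|avenue)\b     # street type (assumed list, extend as needed)
    (?:,\s*suite\s+\d+)?      # optional suite
    ,\s*[a-z. ]+              # city, e.g. "st. louis"
    ,\s*[a-z]+                # state, single word only
    (?:,?\s*\d{5})?           # optional ZIP, with or without a comma
    """,
    re.VERBOSE,
)

def extract_address(text):
    """Return the first address-like span in text, or None."""
    match = ADDRESS_RE.search(text)
    return match.group(0) if match else None

samples = [
    ', landlord and tenant entered into a an agreement with respect to approximately 5,569 square feet of space in the building known as "the company" located at 788 e.7th street, st. louis, missouri 55605 ( capitalized terms used herein and not otherwise defined herein shall have the respective meanings given to them in the agreement); whereas, the term of the agreement expires on may 30, 2015;',
    'first floor 824 6th avenue, chicago, il where the office is located',
    'whose address is 90 south seventh street, suite 5400, dubuque, iowa, 55402.',
]

for sample in samples:
    print(extract_address(sample))
```

The groups are all non-capturing on purpose: `re.findall` returns the groups instead of the whole match when a pattern contains capturing groups, so `search` with `group(0)` is simpler here. A multi-word state such as "new york" or a street type outside the assumed list would not match.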