Questions tagged [webharvy]

WebHarvy lets you apply regular expressions to the selected text (or HTML) before scraping it. See more at https://www.webharvy.com/articles/regex.html.
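
To give a rough idea of what applying a regular expression to captured text means, here is a minimal Python sketch. The helper name `apply_regex`, the sample text, and the convention of keeping only the first capture group are assumptions made for illustration, not WebHarvy's documented behavior. WebHarvy uses the .NET regex flavor, which differs from Python's `re` in some details, so treat this as an approximation.

```python
import re

def apply_regex(selected_text: str, pattern: str) -> list[str]:
    """Return the first capture group of every match (hypothetical helper)."""
    return [m.group(1) for m in re.finditer(pattern, selected_text)]

# Hypothetical captured text, not taken from a real page.
captured = "Price: $19.99 (was $24.99)"

# Keep only the numeric part of each price; the group excludes the "$" sign.
prices = apply_regex(captured, r"\$(\d+\.\d{2})")
print(prices)
```

The capture group is what narrows the scraped value: the pattern locates the surrounding context, and only the grouped portion is kept.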

6 questions
3
votes
1 answer

Regex to extract emails in WebHarvy

I am trying to work up a regex to extract email addresses from a WP directory using WebHarvy (.NET). The emails could be in multiple formats, using dots and underscores, so I tried the following…
blackystrat
  • 109
  • 1
  • 5
2
votes
2 answers

Regex To Extract Second Line

Tool used: WebHarvy. Regex flavor: .NET. Looking for an expression to extract the second line (address) from the following blocks of text: Company: Acme associates & sons 99122 W. Charleston Blvd., Suite 555, Las Vegas, NV 89135 Phone : (702)…
blackystrat
  • 109
  • 1
  • 5
2
votes
1 answer

Regex to extract city names (.NET)

Looking for an expression to extract city names from addresses. Trying to use this expression in WebHarvy, which uses the .NET flavor of regex. Example address: 1234 Savoy Dr Ste 123 New Houston, TX 77036-3320 or 1234 Savoy Dr Ste 510 Texas, TX…
blackystrat
  • 109
  • 1
  • 5
1
vote
1 answer

Regex to match phone and fax numbers for WebHarvy

Sample text: 5950 S Willow Dr Ste 304 Greenwood Village, CO 80111 P (123) 456-7890 F (123) 456-7890 Get Directions. Tried the following, but it grabbed the first line of the address as well: (.*)(?=(\n.*){2}$) Also…
pb_ng
  • 361
  • 1
  • 5
  • 19
1
vote
1 answer

Regex Positive Lookahead Guidance

I am trying to work up an expression to match the firm name in addresses like: Vice President of Compliance 10004 South 152nd St. #A Omaha. I tried using the following expression to match the Vice President of Compliance string, but it doesn't seem…
pb_ng
  • 361
  • 1
  • 5
  • 19
0
votes
0 answers

Regex To Match Parts of Address

Shante LaGarde 01718234 Hobby Unit 123 AB 890 Marlin, Texas 76661 Can someone please advise expressions to extract parts of the above, from Hobby Unit to the ZIP code, in separate lines? I was able to extract the first line using (?:\r\n?|\n){2}(.+)…
pb_ng
  • 361
  • 1
  • 5
  • 19