13

How can I remove "www", "http://", "https://" from strings using Ruby?

I tried this but it didn't work:

s.gsub('/(?:http?:\/\/)?(?:www\.)?(.*)\/?$/i', '')

Here what I'm doing in Rails:

<%= auto_link(job.description) do |url| url.truncate(25).gsub('http://', '') end %>

Url are truncated, but my goal is to remove the beginning of the links, such as "www" or "http://" so the link would look like "google.com/somepage/d...", not like "http://google.com/some..."

Martin Petrov
  • 2,633
  • 4
  • 30
  • 44

3 Answers3

44
s = s.sub(/^https?\:\/\//, '').sub(/^www./,'')

If you don't want to use s =, you should use sub!s instead of all subs.

The problems with your code are:

  1. Question mark always follows AFTER an optional character
  2. Always replace one pattern in a sub. You can "chain up" multiple operations.
  3. Use sub instead of gsub and ^ in the beginning of Regexp so it only replaces the http:// in the beginning but leaves the ones in the middle.
SwiftMango
  • 15,092
  • 13
  • 71
  • 136
5

This method should catch all 3 variations:

def strip_url(url)
  url.sub!(/https\:\/\/www./, '') if url.include? "https://www."

  url.sub!(/http\:\/\/www./, '')  if url.include? "http://www."

  url.sub!(/www./, '')            if url.include? "www."

  return url
end

strip_url("http://www.google.com")
   => "google.com" 
strip_url("https://www.facebook.com")
   => "facebook.com" 
strip_url("www.stackoverflow.com")
  => "stackoverflow.com" 
Josh
  • 5,631
  • 1
  • 28
  • 54
1
def strip_url(target_url)
  target_url.gsub("http://", "")
            .gsub("https://", "")
            .gsub("www.", "")
end

strip_url("http://www.google.com")
 => "google.com" 
strip_url("https://www.google.com")
 => "google.com" 
strip_url("http://google.com")
 => "google.com"
strip_url("https://google.com")
 => "google.com" 
strip_url("www.google.com")
 => "google.com" 
rusllonrails
  • 5,586
  • 3
  • 34
  • 27