34

Can anybody tell me the difference between the * and + operators in the example below:

[<>]+ [<>]*

Ismail Badawi
  • 36,054
  • 7
  • 85
  • 97
Few Tem
  • 673
  • 1
  • 7
  • 16

7 Answers7

60

Each of them are quantifiers, the star quantifier(*) means that the preceding expression can match zero or more times it is like {0,} while the plus quantifier(+) indicate that the preceding expression MUST match at least one time or multiple times and it is the same as {1,} .

So to recap :

a*  ---> a{0,}  ---> Match a or aa or aaaaa or an empty string
a+  ---> a{1,}  ---> Match a or aa or aaaa but not a string empty
aleroot
  • 71,077
  • 30
  • 176
  • 213
13

* means zero-or-more, and + means one-or-more. So the difference is that the empty string would match the second expression but not the first.

Ismail Badawi
  • 36,054
  • 7
  • 85
  • 97
  • 4
    Isn't this the other way around? `*` will match on empty, but `+` will not. – Joshua Plicque May 04 '15 at 19:57
  • 1
    I believe Ismail is referring to the OP's sample patterns (`[<>]+` and `[<>]*`) rather than the order of the quantifiers discussed in this answer. I agree, it is a little ambiguous. – mickmackusa Apr 08 '20 at 13:40
7

+ means one or more of the previous atom. ({1,})

* means zero or more. This can match nothing, in addition to the characters specified in your square-bracket expression. ({0,})

Note that + is available in Extended and Perl-Compatible Regular Expressions, and is not available in Basic RE. * is available in all three RE dialects. That dialect you're using depends most likely on the language you're in.

Pretty much, the only things in modern operating systems that still default to BRE are grep and sed (both of which have ERE capability as an option) and non-vim vi.

ghoti
  • 45,319
  • 8
  • 65
  • 104
5

* means zero or more of the previous expression.

In other words, the expression is optional.

You might define an integer like this:

-*[0-9]+

In other words, an optional negative sign followed by one or more digits.

jahroy
  • 22,322
  • 9
  • 59
  • 108
4

They are quantifiers.

  • + means 1 or many (at least one occurrence for the match to succeed)
  • * means 0 or many (the match succeeds regardless of the presence of the search string)
Darshana
  • 2,462
  • 6
  • 28
  • 54
dan radu
  • 2,772
  • 4
  • 18
  • 23
3

[<>]+ is same as [<>][<>]*

Darshana
  • 2,462
  • 6
  • 28
  • 54
detale
  • 12,482
  • 4
  • 41
  • 42
2

I'll bring some example to extend answers above. Let we have a text:

100test10
test10
test

if we write \d+test\d+, this expression matches 100test10 and test10 but \d*test\d* matches three of them

godot
  • 3,422
  • 6
  • 25
  • 42