How do I match any character across multiple lines in a regular expression?

Question

For example, this regex

(.*)<FooBar>

will match:

abcde<FooBar>

But how do I get it to match across multiple lines?

abcde
fghij<FooBar>

To clarify; I was originally using Eclipse to do a find and replace in multiple files. What I have discovered by the answers below is that my problem was the tool and not regex pattern. — andyuk, Oct 02 '08 at 15:45

score 664 · Answer 1 · answered Oct 01 '08 at 18:52

664

Try this:

((.|\n)*)<FooBar>

It basically says "any character or a newline" repeated zero or more times.

answered Oct 01 '08 at 18:52

levik

114,835
27
73
90

7

This is dependent on the language and/or tool you are using. Please let us know what you are using, eg Perl, PHP, CF, C#, sed, awk, etc. – Ben Doom Oct 01 '08 at 18:57
67

Depending on your line endings you might need `((.|\n|\r)*)` – Potherca Mar 09 '12 at 17:27
3

He said he is using Eclipse. This is correct solution in my opinion. I have same problem and this solved it. – Danubian Sailor Apr 18 '12 at 08:14
4

Right - the question is about eclipse and so are the tags. But the accepted solution is a PHP solution. Yours should be the accepted solution... – acme Jun 13 '12 at 12:04
2

`\R` matches line endings in a platform-independent manner. In eclipse, at least, and some other tools. – frIT Oct 05 '15 at 20:20
1

Very funny, I tried this on gedit and I got a segmentation fault. Murphy's law at its finest. – Manolis Agkopian Oct 13 '15 at 01:06
53

This is the worst regex for matching multiple line input. Please never use it unless you are using ElasticSearch. Use `[\s\S]*` or `(?s).*`. – Wiktor Stribiżew Jul 18 '16 at 11:05
4

Such needless alternation can result in catastrophic backtracking in some situations. This isn't a good general pattern. – Snow Apr 25 '19 at 02:24
I like this. This is more general. – lkahtz Sep 04 '20 at 04:59
In `xml` files, I use : `((.|\n|\r|\t)*)` pattern – Nolwennig Apr 20 '21 at 15:29
@Wiktor Stribiżew: *Why* is it the worst? Will the other match newlines without modifiers? – Peter Mortensen Nov 18 '21 at 23:08
4

@PeterMortensen Too many people have already reported peformance issues and even stack overflow errors when using this pattern, and I have even recorded a [YT video](https://www.youtube.com/watch?v=SEobSs-ZCSE) with explanation of why it is that bad. – Wiktor Stribiżew Nov 18 '21 at 23:34
This should be marked as best answer. – Thomas Easo Jul 20 '22 at 17:15
@ThomasEaso this should be removed from SO as it is one of the worst regex answers here. – Wiktor Stribiżew Mar 03 '23 at 10:20
@WiktorStribiżew Seems like the way you suggested it works even faster? Because when I run the original query in Intellij IDEA it freezed for some time :D – Frankie Drake May 04 '23 at 08:16
1

@FrankieDrake `(.|\n)*` will freeze with longer texts, so it is not even a question of speed, but safety. [My answer](https://stackoverflow.com/a/45981809/3832970) provides the right way to match newline-containing texts in most languages/environments. – Wiktor Stribiżew May 04 '23 at 08:57

score 305 · Accepted Answer · answered Oct 01 '08 at 18:52

305

It depends on the language, but there should be a modifier that you can add to the regex pattern. In PHP it is:

/(.*)<FooBar>/s

The s at the end causes the dot to match all characters including newlines.

answered Oct 01 '08 at 18:52

Paige Ruten

172,675
36
177
197

1

and what if i wanted _just_ a new line and not all characters ? – Grace Apr 11 '11 at 12:02
7

@Grace: use \n to match a newline – Paige Ruten Apr 11 '11 at 21:05
http://stackoverflow.com/questions/1331815/regular-expression-to-match-cross-platform-newline-characters – Josef Sábl Apr 30 '13 at 09:01
6

The s flag is (now?) invalid, at least in Chrome/V8. Instead use /([\s\S]*)/ character class (match space and non-space] instead of the period matcher. See other answers for more info. – Allen May 09 '13 at 15:37
17

@Allen - JavaScript doesn't support the `s` modifier. Instead, do `[^]*` for the same effect. – Derek 朕會功夫 Jul 12 '15 at 22:26
3

In Ruby, use the `m` modifier – Ryan Buckley Jul 15 '15 at 22:57
If there are multiple values of , it will ignore all the values in the middle and only match the last – Mohamad Hamouday Apr 21 '18 at 03:44
What to use for Powershell? – NealWalters Aug 10 '18 at 20:25
[Wiktor's](https://stackoverflow.com/questions/159118/how-do-i-match-any-character-across-multiple-lines-in-a-regular-expression/45981809#45981809:~:text=Non%2DPOSIX%2Dbased%20engines%3A) comprehensive answer below is a longer read, but includes ways to do multiline matches in many different languages. – mwfearnley May 24 '23 at 14:00

score 208 · Answer 3 · edited Jul 04 '23 at 16:49

The question is, can the . pattern match any character? The answer varies from engine to engine. The main difference is whether the pattern is used by a POSIX or non-POSIX regex library.

A special note about lua-patterns: they are not considered regular expressions, but . matches any character there, the same as POSIX-based engines.

Another note on matlab and octave: the . matches any character by default (demo): str = "abcde\n fghij<Foobar>"; expression = '(.*)<Foobar>*'; [tokens,matches] = regexp(str,expression,'tokens','match'); (tokens contain a abcde\n fghij item).

Also, in all of boost's regex grammars the dot matches line breaks by default. Boost's ECMAScript grammar allows you to turn this off with regex_constants::no_mod_m (source).

As for oracle (it is POSIX based), use the n option (demo): select regexp_substr('abcde' || chr(10) ||' fghij<Foobar>', '(.*)<Foobar>', 1, 1, 'n', 1) as results from dual

POSIX-based engines:

A mere . already matches line breaks, so there isn't a need to use any modifiers, see bash (demo).

The tcl (demo), postgresql (demo), r (TRE, base R default engine with no perl=TRUE, for base R with perl=TRUE or for stringr/stringi patterns, use the (?s) inline modifier) (demo) also treat . the same way.

However, most POSIX-based tools process input line by line. Hence, . does not match the line breaks just because they are not in scope. Here are some examples how to override this:

sed - There are multiple workarounds. The most precise, but not very safe, is sed 'H;1h;$!d;x; s/$.*$><Foobar>/\1/' (H;1h;$!d;x; slurps the file into memory). If whole lines must be included, sed '/start_pattern/,/end_pattern/d' file (removing from start will end with matched lines included) or sed '/start_pattern/,/end_pattern/{{//!d;};}' file (with matching lines excluded) can be considered.
perl - perl -0pe 's/(.*)<FooBar>/$1/gs' <<< "$str" (-0 slurps the whole file into memory, -p prints the file after applying the script given by -e). Note that using -000pe will slurp the file and activate 'paragraph mode' where Perl uses consecutive newlines (\n\n) as the record separator.
gnu-grep - grep -Poz '(?si)abc\K.*?(?=<Foobar>)' file. Here, z enables file slurping, (?s) enables the DOTALL mode for the . pattern, (?i) enables case insensitive mode, \K omits the text matched so far, *? is a lazy quantifier, (?=<Foobar>) matches the location before <Foobar>.
pcregrep - pcregrep -Mi "(?si)abc\K.*?(?=<Foobar>)" file (M enables file slurping here). Note pcregrep is a good solution for macOS grep users.

See demos.

Non-POSIX-based engines:

php - Use the s modifier PCRE_DOTALL modifier: preg_match('~(.*)<Foobar>~s', $s, $m) (demo)
c# - Use RegexOptions.Singleline flag (demo):
- var result = Regex.Match(s, @"(.*)<Foobar>", RegexOptions.Singleline).Groups[1].Value;
- var result = Regex.Match(s, @"(?s)(.*)<Foobar>").Groups[1].Value;
powershell - Use the (?s) inline option: $s = "abcde`nfghij<FooBar>"; $s -match "(?s)(.*)<Foobar>"; $matches[1]
perl - Use the s modifier (or (?s) inline version at the start) (demo): /(.*)<FooBar>/s
python - Use the re.DOTALL (or re.S) flags or (?s) inline modifier (demo): m = re.search(r"(.*)<FooBar>", s, flags=re.S) (and then if m:, print(m.group(1)))
java - Use Pattern.DOTALL modifier (or inline (?s) flag) (demo): Pattern.compile("(.*)<FooBar>", Pattern.DOTALL)
kotlin - Use RegexOption.DOT_MATCHES_ALL : "(.*)<FooBar>".toRegex(RegexOption.DOT_MATCHES_ALL)
groovy - Use (?s) in-pattern modifier (demo): regex = /(?s)(.*)<FooBar>/
scala - Use (?s) modifier (demo): "(?s)(.*)<Foobar>".r.findAllIn("abcde\n fghij<Foobar>").matchData foreach { m => println(m.group(1)) }
javascript - Use the s (dotAll) flag or workarounds [^] / [\d\D] / [\w\W] / [\s\S] (demo): s.match(/([\s\S]*)<FooBar>/)[1]
c++ (std::regex) Use [\s\S] or the JavaScript workarounds (demo): regex rex(R"(([\s\S]*)<FooBar>)");
vba vbscript - Use the same approach as in JavaScript, ([\s\S]*)<Foobar>. (NOTE: The MultiLine property of the RegExp object is sometimes erroneously thought to be the option to allow . match across line breaks, while, in fact, it only changes the ^ and $ behavior to match start/end of lines rather than strings, the same as in JavaScript regex)
ruby - Use the /m MULTILINE modifier (demo): s[/(.*)<Foobar>/m, 1]
r tre base-r - Base R PCRE regexps - use (?s): regmatches(x, regexec("(?s)(.*)<FooBar>",x, perl=TRUE))[[1]][2] (demo)
r icu stringr stringi - in stringr/stringi regex funtions that are powered with the ICU regex engine. Also use (?s): stringr::str_match(x, "(?s)(.*)<FooBar>")[,2] (demo)
go - Use the inline modifier (?s) at the start (demo): re: = regexp.MustCompile(`(?s)(.*)<FooBar>`)
swift - Use dotMatchesLineSeparators or (easier) pass the (?s) inline modifier to the pattern: let rx = "(?s)(.*)<Foobar>"
objective-c - The same as Swift. (?s) works the easiest, but here is how the option can be used: NSRegularExpression* regex = [NSRegularExpression regularExpressionWithPattern:pattern options:NSRegularExpressionDotMatchesLineSeparators error:&regexError];
re2, google-apps-script - Use the (?s) modifier (demo): "(?s)(.*)<Foobar>" (in Google Spreadsheets, =REGEXEXTRACT(A2,"(?s)(.*)<Foobar>"))

NOTES ON (?s):

In most non-POSIX engines, the (?s) inline modifier (or embedded flag option) can be used to enforce . to match line breaks.

If placed at the start of the pattern, (?s) changes the bahavior of all . in the pattern. If the (?s) is placed somewhere after the beginning, only those .s will be affected that are located to the right of it unless this is a pattern passed to Python's re. In Python re, regardless of the (?s) location, the whole pattern . is affected. The (?s) effect is stopped using (?-s). A modified group can be used to only affect a specified range of a regex pattern (e.g., Delim1(?s:.*?)\nDelim2.* will make the first .*? match across newlines and the second .* will only match the rest of the line).

POSIX note:

In non-POSIX regex engines, to match any character, [\s\S] / [\d\D] / [\w\W] constructs can be used.

In POSIX, [\s\S] is not matching any character (as in JavaScript or any non-POSIX engine), because regex escape sequences are not supported inside bracket expressions. [\s\S] is parsed as bracket expressions that match a single character, \ or s or S.

You should link to this excellent overview from your profile page or something (+1). — Jan, Oct 15 '17 at 20:15
You may want to add this to the _boost_ item: In the regex_constants namespace, flag_type_'s : perl = ECMAScript = JavaScript = JScript = ::boost::regbase::normal = 0 which defaults to Perl. Programmers will set a base flag definition `#define MOD regex_constants::perl | boost::regex::no_mod_s | boost::regex::no_mod_m` for thier regex flags to reflect that. And the arbitor is _always_ the inline modifiers. Where `(?-sm)(?s).*` resets. — , Apr 26 '18 at 21:30
@PasupathiRajamanickam Bash uses a POSIX regex engine, the `.` matches any char there (including line breaks). See [this online Bash demo](https://ideone.com/d1XTpR). — Wiktor Stribiżew, Dec 19 '18 at 07:33
Thanks for the ?s modifier for PowerShell; cleaned up my fragile regex to something a bit sturdier. — Adam Wenger, Mar 02 '20 at 19:46
You rock — this is the most exhaustive mini-tutorial on (relatively) complex regexp's that I've ever seen. You deserve that your answer becomes the accepted one! Kudos and extra votes for including `Go` in the answer! — Gwyneth Llewelyn, May 13 '20 at 00:29

score 73 · Answer 4 · answered Nov 25 '11 at 13:16

73

If you're using Eclipse search, you can enable the "DOTALL" option to make '.' match any character including line delimiters: just add "(?s)" at the beginning of your search string. Example:

(?s).*<FooBar>

answered Nov 25 '11 at 13:16

Paulo Merson

13,270
8
79
72

1

Not anywhere, only in regex flavors supporting inline modifiers, and certainly not in Ruby where `(?s)` => `(?m)` – Wiktor Stribiżew Jul 18 '16 at 11:06
Anything for bash? – Pasupathi Rajamanickam Dec 19 '18 at 02:12
What is the underlying regular expression engine for Eclipse? Something in Java/JDK? – Peter Mortensen Nov 18 '21 at 23:22

score 46 · Answer 5 · edited Mar 13 '23 at 10:40

46

([\s\S]*)<FooBar>

The dot matches all except newlines (\r\n). So use \s\S, which will match ALL characters.

edited Mar 13 '23 at 10:40

vvvvv

25,404
19
49
81

answered Jul 19 '12 at 17:59

samwize

25,675
15
141
186

This solve the problem if you are using the Objective-C `[text rangeOfString:regEx options:NSRegularExpressionSearch]`. Thanks! – J. Costa Aug 24 '12 at 22:29
3

This works in intelliJ's find&replace regex, thanks. – barclay Sep 16 '15 at 22:14
1

This works. But it needs to be the first occurrence of `` – Ozkan Sep 26 '17 at 14:16

score 43 · Answer 6 · edited Jan 25 '20 at 14:03

43

In many regex dialects, /[\S\s]*<Foobar>/ will do just what you want. Source

edited Jan 25 '20 at 14:03

fearless_fool

33,645
23
135
217

answered Jul 30 '11 at 13:03

Abbas Shahzadeh

431
4
2

4

From that link: "JavaScript and VBScript do not have an option to make the dot match line break characters. In those languages, you can use a character class such as [\s\S] to match any character." Instead of the . use [\s\S] (match spaces and non-spaces) instead. – Allen May 09 '13 at 15:34

score 23 · Answer 7 · edited Nov 19 '21 at 00:14

23

We can also use

(.*?\n)*?

to match everything including newline without being greedy.

This will make the new line optional

(.*?|\n)*?

edited Nov 19 '21 at 00:14

Peter Mortensen

30,738
21
105
131

answered Aug 06 '18 at 07:48

Nambi_0915

1,091
8
21

Never use `(.*?|\n)*?` unless you want to end up with a catastrophic backtracking. – Wiktor Stribiżew Jul 08 '20 at 20:39

score 18 · Answer 8 · edited Nov 18 '21 at 23:33

18

In Ruby you can use the 'm' option (multiline):

/YOUR_REGEXP/m

See the Regexp documentation on ruby-doc.org for more information.

edited Nov 18 '21 at 23:33

Peter Mortensen

30,738
21
105
131

answered Aug 03 '12 at 07:52

Are you sure it shouldn't be `s` instead of `m`? – Peter Mortensen Nov 18 '21 at 23:34

Markus Jarderot · Answer 9 · 2009-04-25T21:09:27.663

9

"." normally doesn't match line-breaks. Most regex engines allows you to add the S-flag (also called DOTALL and SINGLELINE) to make "." also match newlines. If that fails, you could do something like [\S\s].

edited Apr 25 '09 at 21:09

answered Oct 01 '08 at 18:52

Markus Jarderot

86,735
21
136
138

score 8 · Answer 10 · edited Nov 18 '21 at 23:43

8

For Eclipse, the following expression worked:

Foo

jadajada Bar"

Regular expression:

Foo[\S\s]{1,10}.*Bar*

edited Nov 18 '21 at 23:43

Peter Mortensen

30,738
21
105
131

answered Jan 03 '13 at 11:32

Gordon

81
1
1

score 6 · Answer 11 · answered Oct 02 '08 at 03:31

Note that (.|\n)* can be less efficient than (for example) [\s\S]* (if your language's regexes support such escapes) and than finding how to specify the modifier that makes . also match newlines. Or you can go with POSIXy alternatives like [[:space:][:^space:]]*.

TheTechGuy · Answer 12 · 2022-01-29T02:45:14.070

6

In notepad++ you can use this

<table (.|\r\n)*</table>

It will match the entire table starting from

rows and columns

You can make it greedy, using the following, that way it will match the first, second and so forth tables and not all at once

<table (.|\r\n)*?</table>

edited Jan 29 '22 at 02:45

answered Jan 29 '22 at 02:28

TheTechGuy

16,560
16
115
136

`(\r\n)*` - super answer. thanks – Just Me Apr 04 '22 at 06:47

score 6 · Answer 13 · answered Aug 25 '22 at 18:28

6

This works for me and is the simplest one:

(\X*)<FooBar>

answered Aug 25 '22 at 18:28

Mateusz Kaflowski

2,221
1
29
35

thanks...this helped me to create multiline regex for me i.e Pattern regex = Pattern.compile("(\\X*)From:*(\\X*)Sent:*(\\X*)To:*"); – Narendra Pandey Jun 08 '23 at 12:41

score 5 · Answer 14 · edited Nov 18 '21 at 23:10

5

Use:

/(.*)<FooBar>/s

The s causes dot (.) to match carriage returns.

edited Nov 18 '21 at 23:10

Peter Mortensen

30,738
21
105
131

answered Oct 01 '08 at 18:54

Bill

4,323
8
28
32

Seems like this is invalid (Chrome): text.match(/a/s) SyntaxError: Invalid flags supplied to RegExp constructor 's' – Allen May 09 '13 at 15:31
Because it is unsupported in JavaScript RegEx engines. The `s` flags exists in PCRE, the most complete engine (available in Perl and PHP). PCRE has 10 flags (and a lot of other features) while JavaScript has only 3 flags (`gmi`). – Morgan Touverey Quilling Apr 20 '16 at 18:51

score 5 · Answer 15 · edited Nov 18 '21 at 23:30

5

Use RegexOptions.Singleline. It changes the meaning of . to include newlines.

Regex.Replace(content, searchText, replaceText, RegexOptions.Singleline);

edited Nov 18 '21 at 23:30

Peter Mortensen

30,738
21
105
131

answered Apr 13 '10 at 00:42

shmall

51
1
1

This is specific to a particular platform. What programming language and platform is it? C# / .NET? – Peter Mortensen Nov 18 '21 at 23:19

score 4 · Answer 16 · edited Nov 18 '21 at 23:45

4

In a Java-based regular expression, you can use [\s\S].

edited Nov 18 '21 at 23:45

Peter Mortensen

30,738
21
105
131

answered Jun 03 '13 at 06:22

Kamahire

2,149
3
21
50

1

Shouldn't those be backslashes? – Paul Draper Oct 19 '13 at 06:48
They go at the end of the Regular Expression, not within in. Example: /blah/s – RandomInsano Dec 21 '13 at 20:12
I guess you mean JavaScript, not Java? Since you can just add the `s` flag to the pattern in Java and JavaScript doesn't have the `s` flag. – 3limin4t0r Sep 25 '18 at 17:47

score 3 · Answer 17 · edited Nov 18 '21 at 23:09

3

Generally, . doesn't match newlines, so try ((.|\n)*)<foobar>.

edited Nov 18 '21 at 23:09

Peter Mortensen

30,738
21
105
131

answered Oct 01 '08 at 18:52

tloach

8,009
1
33
44

5

No, don't do that. If you need to match anything including line separators, use the DOTALL (a.k.a. /s or SingleLine) modifier. Not only does the (.|\n) hack make the regex less efficient, it's not even correct. At the very least, it should match \r (carriage return) as well as \n (linefeed). There are other line separator characters, too, albeit rarely used. But if you use the DOTALL flag, you don't have to worry about them. – Alan Moore Apr 26 '09 at 03:17
2

\R is the platform-independent match for newlines in Eclipse. – opyate Nov 30 '09 at 11:13
1

@opyate You should post this as an answer as this little gem is incredibly useful. – jeckhart Oct 15 '12 at 21:29
You could try this instead. It won't match the inner brackets and also consider the optional`\r`.: `((?:.|\r?\n)*)` – ssc-hrep3 Nov 29 '16 at 09:52

Sian Lerk Lau · Answer 18 · 2021-11-30T09:30:01.400

2

Solution:

Use pattern modifier sU will get the desired matching in PHP.

Example:

preg_match('/(.*)/sU', $content, $match);

Sources:

Pattern Modifiers

edited Nov 30 '21 at 09:30

answered Apr 04 '12 at 11:00

Sian Lerk Lau

173
9

The first link somehow redirects to `www.facebook.com` (which I have blocked in the [hosts file](https://en.wikipedia.org/wiki/Hosts_(file))). Is that link broken or not? – Peter Mortensen Nov 18 '21 at 23:26
I guess the owner decided to redirect it to the facebook page. I will remove it. – Sian Lerk Lau Nov 30 '21 at 09:29

score 2 · Answer 19 · edited Nov 19 '21 at 00:17

In JavaScript you can use [^]* to search for zero to infinite characters, including line breaks.

$("#find_and_replace").click(function() {
  var text = $("#textarea").val();
  search_term = new RegExp("[^]*<Foobar>", "gi");;
  replace_term = "Replacement term";
  var new_text = text.replace(search_term, replace_term);
  $("#textarea").val(new_text);
});

<script src="https://cdnjs.cloudflare.com/ajax/libs/jquery/3.3.1/jquery.min.js"></script>
<button id="find_and_replace">Find and replace</button>
<br>
<textarea ID="textarea">abcde
fghij&lt;Foobar&gt;</textarea>

nsayer · Answer 20 · 2008-10-01T18:54:46.527

In the context of use within languages, regular expressions act on strings, not lines. So you should be able to use the regex normally, assuming that the input string has multiple lines.

In this case, the given regex will match the entire string, since "<FooBar>" is present. Depending on the specifics of the regex implementation, the $1 value (obtained from the "(.*)") will either be "fghij" or "abcde\nfghij". As others have said, some implementations allow you to control whether the "." will match the newline, giving you the choice.

Line-based regular expression use is usually for command line things like egrep.

score 1 · Answer 21 · answered Aug 28 '20 at 16:21

1

Try: .*\n*.*<FooBar> assuming you are also allowing blank newlines. As you are allowing any character including nothing before <FooBar>.

answered Aug 28 '20 at 16:21

hafiz031

2,236
3
26
48

1

It doesn't look right. Why two times "`.*`"? This may work for the sample input in the question, but what if "" is on line 42? – Peter Mortensen Nov 19 '21 at 00:27

score 1 · Answer 22 · edited Nov 18 '21 at 23:12

1

I had the same problem and solved it in probably not the best way but it works. I replaced all line breaks before I did my real match:

mystring = Regex.Replace(mystring, "\r\n", "")

I am manipulating HTML so line breaks don't really matter to me in this case.

I tried all of the suggestions above with no luck. I am using .NET 3.5 FYI.

edited Nov 18 '21 at 23:12

Peter Mortensen

30,738
21
105
131

answered Mar 26 '09 at 14:57

Slee

27,498
52
145
243

I am using .NET too and `(\s|\S)` seems to do the trick for me! – Vamshi Krishna May 18 '18 at 07:26
@VamshiKrishna In .NET, use `(?s)` to make `.` match any chars. Do not use `(\s|\S)` that will slow down performance. – Wiktor Stribiżew Sep 14 '18 at 20:35
There is a [multi-line mode for .NET regular expressions](https://stackoverflow.com/questions/9532340/how-do-i-remove-trailing-whitespace-using-a-regular-expression/30559093#30559093). – Peter Mortensen Nov 18 '21 at 23:14

score 0 · Answer 23 · edited Nov 18 '21 at 23:28

Often we have to modify a substring with a few keywords spread across lines preceding the substring. Consider an XML element:

<TASK>
  <UID>21</UID>
  <Name>Architectural design</Name>
  <PercentComplete>81</PercentComplete>
</TASK>

Suppose we want to modify the 81, to some other value, say 40. First identify .UID.21..UID., then skip all characters including \n till .PercentCompleted.. The regular expression pattern and the replace specification are:

String hw = new String("<TASK>\n  <UID>21</UID>\n  <Name>Architectural design</Name>\n  <PercentComplete>81</PercentComplete>\n</TASK>");
String pattern = new String ("(<UID>21</UID>)((.|\n)*?)(<PercentComplete>)(\\d+)(</PercentComplete>)");
String replaceSpec = new String ("$1$2$440$6");
// Note that the group (<PercentComplete>) is $4 and the group ((.|\n)*?) is $2.

String iw = hw.replaceFirst(pattern, replaceSpec);
System.out.println(iw);

<TASK>
  <UID>21</UID>
  <Name>Architectural design</Name>
  <PercentComplete>40</PercentComplete>
</TASK>

The subgroup (.|\n) is probably the missing group $3. If we make it non-capturing by (?:.|\n) then the $3 is (<PercentComplete>). So the pattern and replaceSpec can also be:

pattern = new String("(<UID>21</UID>)((?:.|\n)*?)(<PercentComplete>)(\\d+)(</PercentComplete>)");
replaceSpec = new String("$1$2$340$5")

and the replacement works correctly as before.

What programming language? Java? – Peter Mortensen Nov 18 '21 at 23:29 — Peter Mortensen, Nov 18 '21 at 23:29

score 0 · Answer 24 · edited Nov 18 '21 at 23:21

0

I wanted to match a particular if block in Java:

   ...
   ...
   if(isTrue){
       doAction();

   }
...
...
}

If I use the regExp

if \(isTrue(.|\n)*}

it included the closing brace for the method block, so I used

if \(!isTrue([^}.]|\n)*}

to exclude the closing brace from the wildcard match.

edited Nov 18 '21 at 23:21

Peter Mortensen

30,738
21
105
131

answered Jan 18 '11 at 09:31

Spangen

4,420
5
37
42

score 0 · Answer 25 · edited Nov 19 '21 at 00:20

Typically searching for three consecutive lines in PowerShell, it would look like:

$file = Get-Content file.txt -raw

$pattern = 'lineone\r\nlinetwo\r\nlinethree\r\n'     # "Windows" text
$pattern = 'lineone\nlinetwo\nlinethree\n'           # "Unix" text
$pattern = 'lineone\r?\nlinetwo\r?\nlinethree\r?\n'  # Both

$file -match $pattern

# output
True

Bizarrely, this would be Unix text at the prompt, but Windows text in a file:

$pattern = 'lineone
linetwo
linethree
'

Here's a way to print out the line endings:

'lineone
linetwo
linethree
' -replace "`r",'\r' -replace "`n",'\n'

# Output
lineone\nlinetwo\nlinethree\n

score -1 · Answer 26 · answered Oct 06 '19 at 19:41

Option 1

One way would be to use the s flag (just like the accepted answer):

/(.*)<FooBar>/s

Demo 1

Option 2

A second way would be to use the m (multiline) flag and any of the following patterns:

/([\s\S]*)<FooBar>/m

or

/([\d\D]*)<FooBar>/m

or

/([\w\W]*)<FooBar>/m

Demo 2

RegEx Circuit

jex.im visualizes regular expressions:

How do I match any character across multiple lines in a regular expression?

26 Answers26

Solution:

Example:

Sources:

Option 1

Demo 1

Option 2

Demo 2

RegEx Circuit

Linked

Related