Delete a pattern in a file and lines before it using some other pattern

Question

I have a text file containing this:

# Comment
# Comment
# Comment
property1

# Comment
# Comment
property2

I want to use a Unix command (awk, sed, etc.) to search for the line matching property2 and then delete that line together with all the comment lines immediately before it. After the operation, the output should be:

# Comment
# Comment
# Comment
property1

This is what I tried (using awk):

awk -v pat='^property2' -v comment='^#' '$1~pat{p=NR} p && NR>=p-3{del=($1~comment)} del{next} 1' test.txt

Basically, the logic I tried to use was:

Search for property2.
Loop over the previous 3 lines.
Check whether each of them is a comment (starts with #).
Delete those lines (the comments above, plus the line that matched property2).

Can someone help me achieve this? Thanks.

Please read [how-do-i-find-the-text-that-matches-a-pattern](https://stackoverflow.com/questions/65621325/how-do-i-find-the-text-that-matches-a-pattern) then [edit] your question to replace `pattern` with whatever you mean. Your example looks like you should be matching a string but your code is trying to match a regexp. — Ed Morton, Dec 08 '22 at 14:19
Your description does not accurately match your example; please clarify. — Andrew, Dec 08 '22 at 14:37

score 1 · Answer 1 · answered Dec 08 '22 at 14:47

This, using any awk, might be what you're trying to do but it's not clear from your question:

$ awk -v RS= -v ORS='\n\n' -F'\n' '$NF != "property2"' file
# Comment
# Comment
# Comment
property1
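
To see why that filter works, it can help to look at what awk treats as a record here. The following is only a small sketch: the temporary file recreates the sample input from the question, and the variable names `sample` and `last_lines` are made up for this illustration. With `RS` set to the empty string, awk reads blank-line-separated blocks as records, and with `-F'\n'` each line of a block becomes a field, so `$NF` is the last line of the block.

```shell
# Recreate the sample input from the question in a temporary file.
sample=$(mktemp)
printf '%s\n' '# Comment' '# Comment' '# Comment' 'property1' '' \
              '# Comment' '# Comment' 'property2' > "$sample"

# RS= (empty) selects paragraph mode: each blank-line-separated block is one record.
# -F'\n' makes every line of a block a field, so $NF is the last line of the block.
last_lines=$(awk -v RS= -F'\n' '{ print NR ": " $NF }' "$sample")
printf '%s\n' "$last_lines"

rm -f "$sample"
```

Because the test is on the whole block, any non-comment lines that share a block with property2 would be dropped along with it, which is fine for the sample input but may not be what is wanted in other files.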

Ed Morton

score 0 · Answer 2 · answered Dec 08 '22 at 13:46

You could use a scriptable editor such as ed to:

Search for the first match of property2 (anchored to the beginning of the line)
Search backwards from there for a non-empty line that does not start with #
Delete from the line after that one through the line that starts with property2
Write the file out to disk
Quit the editor

One way to write that would be:

#!/bin/sh

printf '%s\n'             \
        '/^property2'     \
        '?^[^#]'          \
        '+1,/^property2/d' \
        'w'               \
        'q'               \
  | ed input > /dev/null

I've dropped the stdout of ed to /dev/null because it will report the lines that it matches along the way, which we're not interested in. ed will make the changes to the file "in-place". This ed-script will fail if there is not a non-empty, non-commented line before property2 (the backwards search will fail).

In your sample input, this will delete the blank line between the stanzas as well, which seems to match your desired output.
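
If editing the file in place is not wanted, the same steps can be tried with a two-pass awk sketch that writes to stdout instead. This is a different tool from the ed script above and is shown only for illustration; the names `sample`, `result`, `target` and `keep` are hypothetical. One difference to be aware of: when no non-empty, non-comment line precedes property2, this sketch deletes from the first line instead of failing.

```shell
# Recreate the sample input from the question in a temporary file.
sample=$(mktemp)
printf '%s\n' '# Comment' '# Comment' '# Comment' 'property1' '' \
              '# Comment' '# Comment' 'property2' > "$sample"

# The file is read twice: pass 1 finds the range, pass 2 prints everything else.
result=$(awk '
  NR == FNR {
    if (!target && /^property2/) target = FNR  # first line starting with property2
    if (!target && /^[^#]/)      keep = FNR    # last non-empty, non-comment line before it
    next
  }
  !(target && FNR > keep && FNR <= target)     # pass 2: skip lines keep+1 through target
' "$sample" "$sample")
printf '%s\n' "$result"

rm -f "$sample"
```

If property2 never occurs, `target` stays unset and every line is printed unchanged.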

score 0 · Answer 3 · answered Dec 08 '22 at 14:41

It is not clear what you are trying to do; maybe this is it:

Mac_3.2.57$cat test.txt
# Comment1
# Comment2
# Comment3
property1

# Comment4
# Comment5
property2
Mac_3.2.57$awk '{if(NR==FNR){{if($0!~/^#/&&startFound==1){startFound=0;end=NR};if($0~/^#/&&startFound==0){startFound=1;start=NR}}}else {if(FNR<start||FNR>=end){print}}}' test.txt test.txt
# Comment1
# Comment2
# Comment3
property1

property2
Mac_3.2.57$

score 0 · Answer 4 · answered Dec 08 '22 at 23:23

This might work for you (GNU sed):

sed -E '/^#/{:a;N;/^property[^2]/Mb;/^property2/M!ba
             :b;/^#|^property2/!P;s/[^\n]*\n//;tb;d}' file

If a line is not a comment, let it be.

Otherwise, accumulate the lines in the pattern space.

If a subsequent line begins with a property that is not 2, print the accumulated lines and repeat.

If a subsequent line does not begin with property2, continue accumulating lines.

Otherwise, property2 has been reached: drop the comment lines, print any other accumulated lines, and delete the final line (property2 itself).
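
The accumulate-then-decide idea described above can also be sketched in portable awk, for systems without GNU sed. This is a minimal sketch under stated assumptions, not a drop-in equivalent of the sed script: it assumes the target line is exactly `property2`, that comments start with `#` in the first column, and that only the comments directly above the target (with no other line in between) should go. The names `sample`, `result` and `buf` are hypothetical.

```shell
# Recreate the sample input from the question in a temporary file.
sample=$(mktemp)
printf '%s\n' '# Comment' '# Comment' '# Comment' 'property1' '' \
              '# Comment' '# Comment' 'property2' > "$sample"

result=$(awk '
  /^#/             { buf = buf $0 ORS; next }             # hold comment lines back
  $0 == "property2" { buf = ""; next }                    # discard held comments and this line
                   { printf "%s", buf; buf = ""; print }  # any other line: flush, then print it
  END              { printf "%s", buf }                   # comments at end of file are kept
' "$sample")
printf '%s\n' "$result"

rm -f "$sample"
```

Since comments are only discarded when the very next non-comment line is the target, a blank line between a comment block and property2 would cause those comments to be printed.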

score 0 · Answer 5 · answered Dec 15 '22 at 21:11

Using GNU sed with the -z command-line option, which separates records on NUL instead of newline so that the whole input (assuming it contains no NUL bytes) is read as a single record, and replacing the match with an empty string:

sed -zE 's/(^|\n)#[^\n]*(\n#[^\n]*)*\nproperty2//g' test.txt

The pattern matches:

(^|\n)# Assert the start of the string or match a newline, followed by #
[^\n]* Match optional characters other than a newline
(\n#[^\n]*)* Optionally repeat: a newline, then #, then optional characters other than a newline
\nproperty2 Match a newline and property2

Output

# Comment
# Comment
# Comment
property1
