How to "grep" out specific line ranges of a file

Question

I often run grep -n whatever file to find what I am looking for. Say the output is:

1234: whatev 1
5555: whatev 2
6643: whatev 3

If I want to then just extract the lines between 1234 and 5555, is there a tool to do that? For static files I have a script that does wc -l of the file and then does the math to split it out with tail & head but that doesn't work out so well with log files that are constantly being written to.

See http://stackoverflow.com/questions/83329/how-can-i-extract-a-range-of-lines-from-a-text-file-on-unix — GreenMatt, May 26 '10 at 15:14

score 126 · Accepted Answer · edited Mar 15 '22 at 13:32

Try using sed as mentioned on http://linuxcommando.blogspot.com/2008/03/using-sed-to-extract-lines-in-text-file.html. For example use

sed '2,4!d' somefile.txt

to print from the second line to the fourth line of somefile.txt. (And don't forget to check http://www.grymoire.com/Unix/Sed.html, sed is a wonderful tool.)
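
As a quick check of the command above, here is a minimal sketch on a throwaway file (the temp file and its contents are made up for illustration). It also shows the equivalent `-n`/`p` form with a `q` command, which lets sed stop reading after the last wanted line; that may help on large files.

```shell
# Build a hypothetical 6-line sample file: "line 1" ... "line 6".
sample=$(mktemp)
printf 'line %s\n' 1 2 3 4 5 6 > "$sample"

# Delete every line that is NOT in the range 2-4.
range_a=$(sed '2,4!d' "$sample")

# Same range with -n/p; "4q" makes sed quit after line 4
# instead of reading the rest of the file.
range_b=$(sed -n '2,4p;4q' "$sample")

printf '%s\n' "$range_a"
rm -f "$sample"
```

Both variables end up holding lines 2 through 4 of the sample.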

edited Mar 15 '22 at 13:32 by vvvvv · answered May 26 '10 at 15:17 by Scorchio

One useful follow up bit of information is how to prepend the line numbers onto the sed result... pipe it into nl like so: `sed ''"$start"','"$end"'!d' somefile.txt | nl -ba -v$start` – phyatt Nov 07 '16 at 18:16
@Scorchio What does `!d` mean? – Manuel Jordan Jan 24 '22 at 19:43
@ManuelJordan `d` is the delete command of sed. `!` negates the address (i.e., in this case, the specified range). So `2,4!d` means dropping everything except lines 2-4. – Scorchio Feb 15 '22 at 11:54
@Scorchio thanks for the explanation. Normally `!` goes at the beginning, something like `!2,4d` – Manuel Jordan Feb 15 '22 at 12:09
Does it delete the other lines from the file or just from the standard output? – princess_hacker Jul 19 '22 at 11:28
@princess_hacker Sed doesn't touch the original file in itself. It just outputs the filtered and transformed parts of the file. – Scorchio Sep 17 '22 at 15:47
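
To tie the comments above together, here is a small sketch (sample file, `start` and `end` values are assumptions for illustration) that extracts a range, numbers it with `nl` so the numbers match the original file, and checks that the file itself is left untouched.

```shell
# Hypothetical sample file with 10 lines.
sample=$(mktemp)
printf 'entry %s\n' 1 2 3 4 5 6 7 8 9 10 > "$sample"
before=$(cksum < "$sample")

start=4
end=6
# Extract the range, then number it starting at $start so the
# numbers match the positions in the original file.
numbered=$(sed -n "${start},${end}p" "$sample" | nl -ba -v "$start")

# sed only wrote to standard output; the checksum should be unchanged.
after=$(cksum < "$sample")
printf '%s\n' "$numbered"
rm -f "$sample"
```

Without the `-i` option, sed never rewrites its input file, so `before` and `after` are expected to match.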

score 49 · Answer 2 · edited Mar 15 '22 at 13:29

The following command will do what you asked for "extract the lines between 1234 and 5555" in someFile.

sed -n '1234,5555p' someFile

edited Mar 15 '22 at 13:29 by vvvvv · answered Apr 29 '14 at 17:19 by javaPlease42


I had to add `/` delimiters to make it work: `sed -n '/1234/,/5555/p' someFile` – JB0x2D1 Jul 08 '16 at 18:25
small thing, but you don't need the quotes – Hawkeye Parker Apr 13 '18 at 19:43
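
Note that the two forms discussed here are not interchangeable: bare numbers are line-number addresses, while `/.../` addresses are regular expressions matched against the line text. A minimal sketch on a made-up sample file shows the difference:

```shell
# Hypothetical sample: six lines whose text does not line up with their position.
sample=$(mktemp)
printf '%s\n' 'id 30' 'id 10' 'id 20' 'id 40' 'id 50' 'id 60' > "$sample"

# Numeric addresses: lines 2 through 4 by position.
by_number=$(sed -n '2,4p' "$sample")

# Regex addresses: from the first line containing "20"
# through the next line containing "50".
by_pattern=$(sed -n '/20/,/50/p' "$sample")

printf '%s\n---\n%s\n' "$by_number" "$by_pattern"
rm -f "$sample"
```

So `sed -n '/1234/,/5555/p'` selects by content, which only coincides with lines 1234 to 5555 if those lines happen to contain that text.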

score 13 · Answer 3 · edited Jun 10 '21 at 04:07

If I understand correctly, you want to find a pattern between two line numbers. The awk one-liner could be

awk '/whatev/ && NR >= 1234 && NR <= 5555' file

You don't need to run grep followed by sed.

Perl one-liner:

perl -ne 'if (/whatev/ && $. >= 1234 && $. <= 5555) {print}' file
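
A variation on the awk one-liner, sketched here with made-up data and hypothetical variable names `first` and `last`: the bounds are passed in with `-v`, and awk exits once it is past the range rather than reading the whole file.

```shell
# Hypothetical sample file; only some lines contain the pattern.
sample=$(mktemp)
printf '%s\n' 'whatev 1' 'noise' 'whatev 2' 'noise' 'whatev 3' 'whatev 4' > "$sample"

first=2
last=5
matches=$(awk -v first="$first" -v last="$last" '
    NR > last { exit }              # stop reading once past the range
    NR >= first && /whatev/         # print matching lines inside the range
' "$sample")

printf '%s\n' "$matches"
rm -f "$sample"
```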

edited Jun 10 '21 at 04:07 by Andrew · answered Jul 07 '16 at 16:48 by Mark Lakata


score 6 · Answer 4 · answered May 02 '18 at 19:09

Line numbers are OK if you can guarantee the position of what you want. Over the years, my favorite flavor of this has been something like this:

sed "/First Line of Text/,/Last Line of Text/d" filename

which deletes all lines from the first line matching the first pattern through the next line matching the second pattern, including those two lines.

Use sed -n with "p" instead of "d" to print those lines instead. Way more useful for me, as I usually don't know where those lines are.
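
For illustration, a short sketch of both variants on a made-up file (the `BEGIN job` and `END job` markers are assumptions, not anything from a real log):

```shell
# Hypothetical sample with a marked block in the middle.
sample=$(mktemp)
printf '%s\n' 'boot' 'BEGIN job' 'step a' 'step b' 'END job' 'shutdown' > "$sample"

# Print from the first line matching BEGIN through the next line matching END.
block=$(sed -n '/BEGIN job/,/END job/p' "$sample")

# Delete that same block and keep everything else.
rest=$(sed '/BEGIN job/,/END job/d' "$sample")

printf '%s\n---\n%s\n' "$block" "$rest"
rm -f "$sample"
```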

Janus Troelsen · Answer 5 · 2021-12-20T17:10:07.503

Put this in a file and make it executable:

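#!/usr/bin/env bash
# Usage: script START_PATTERN STOP_PATTERN FILE
# Line number of the first line matching the start pattern.
start=$(grep -n -- "$1" "$3" | head -n1 | cut -d: -f1)
if [ -z "$start" ]; then
    echo "couldn't find start pattern!" 1>&2
    exit 1
fi
# Offset of the first stop-pattern match, counted from the start line.
stop=$(tail -n "+$start" "$3" | grep -n -- "$2" | head -n1 | cut -d: -f1)
if [ -z "$stop" ]; then
    echo "couldn't find end pattern!" 1>&2
    exit 1
fi

# Convert the offset back into an absolute line number.
stop=$(( stop + start - 1 ))

sed "$start,$stop!d" "$3"

Execute the file with these arguments, in order (quote any argument that contains spaces):

Starting grep pattern
Stopping grep pattern
File path

The arguments are grep patterns, not line numbers. To use it with the example from the question, pass the patterns that matched lines 1234 and 5555, for example: 'whatev 1' 'whatev 2' myfile.txt

The output includes the lines that match the starting and stopping patterns.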
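
The same idea can also be sketched as a single pass with awk, which reads the file once and stops at the end of the range. This is only an alternative sketch, not the script above: `extract_between` is a hypothetical helper name, the patterns are awk (extended) regular expressions rather than grep patterns, and `awk -v` interprets backslash escapes in the values it is given.

```shell
# Hypothetical sample file; note the second "stop here" that must be ignored.
sample=$(mktemp)
printf '%s\n' 'alpha' 'start here' 'middle' 'stop here' 'omega' 'stop here' > "$sample"

extract_between() {
    # $1 = start regex, $2 = stop regex, $3 = file
    awk -v start="$1" -v stop="$2" '
        !on && $0 ~ start { on = 1 }   # first start match opens the range
        on                { print }
        on && $0 ~ stop   { exit }     # first stop match at or after it closes the range
    ' "$3"
}

result=$(extract_between 'start here' 'stop here' "$sample")
printf '%s\n' "$result"
rm -f "$sample"
```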

score 1 · Answer 6 · answered Jan 06 '22 at 15:03

If I want to then just extract the lines between 1234 and 5555, is there a tool to do that?

There is also ugrep, a GNU/BSD grep compatible tool but one that offers a -K option (or --range) with a range of line numbers to do just that:

ugrep -K1234,5555 -n '' somefile.log

You can use the usual GNU/BSD grep options and regex patterns (but it also offers a lot more such as -K.)
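
Where ugrep is not installed, the same line range can be taken with the tail and head tools the question mentions, and no `wc -l` is needed. A minimal sketch, with a made-up file and hypothetical `first` and `last` values:

```shell
# Hypothetical 8-line sample file.
sample=$(mktemp)
printf 'log %s\n' 1 2 3 4 5 6 7 8 > "$sample"

first=3
last=6
# Start output at line $first, then keep only (last - first + 1) lines.
slice=$(tail -n "+$first" "$sample" | head -n "$(( last - first + 1 ))")

printf '%s\n' "$slice"
rm -f "$sample"
```

Because the count is relative to the start line, lines appended to the file while the command runs do not shift the result.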

score 0 · Answer 7 · edited Mar 23 '16 at 15:23

If you want individual lines instead of line ranges, you can do it with perl. For example, to get lines 1, 3 and 5 from a file, say /etc/passwd:

perl -e 'while(<>){if(++$l~~[1,3,5]){print}}' < /etc/passwd
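
For comparison, the same selection of individual lines can be sketched with sed or awk. The sample file below is made up; both commands stop reading at line 5.

```shell
# Hypothetical 7-line sample file.
sample=$(mktemp)
printf 'row %s\n' 1 2 3 4 5 6 7 > "$sample"

# Print lines 1, 3 and 5; quit at line 5 so the rest is never read.
picked_sed=$(sed -n '1p;3p;5{p;q;}' "$sample")

# awk equivalent: print the wanted lines, then exit after line 5.
picked_awk=$(awk '
    NR == 1 || NR == 3 || NR == 5
    NR == 5 { exit }
' "$sample")

printf '%s\n' "$picked_sed"
rm -f "$sample"
```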

edited Mar 23 '16 at 15:23 · answered Mar 23 '16 at 13:33 by dagelf

FYI, That `$l` is "dollar el" not "dollar one". A more perlish (i.e. shorter) command is `perl -ne 'if($.~~[1,3,5]){print}' /etc/passwd`. – Mark Lakata Jul 07 '16 at 16:41
