Extract lines from text file, using starting line number and amount of lines to extract, in bash?

Question

I have seen How can I extract a predetermined range of lines from a text file on Unix? but I have a slightly different use case: I want to specify a starting line number, and a count/amount/number of lines to extract, from a text file.

So, I tried to generate a text file, and then compose an awk command to extract a count of 10 lines starting from line number 100 - but it does not work:

$ seq 1 500 > test_file.txt
$ awk 'BEGIN{s=100;e=$s+10;} NR>=$s&&NR<=$e' test_file.txt
$

So, what would be an easy approach to extract lines from a text file using a starting line number, and count of lines, in bash? (I'm ok with awk, sed, or any such tool, for instance in coreutils)

remove all the `$` inside the `awk` code – Fravadona Jan 15 '23 at 21:07 — Fravadona, Jan 15 '23 at 21:07
`printf '%s\n' "${start}+$int" %p Q | ed -s file.txt` – Jetchisel Jan 17 '23 at 01:00 — Jetchisel, Jan 17 '23 at 01:00

J_H · Answer 1 · 2023-01-15T21:46:34.580

3

This gives you text that is inclusive of both end points (eleven output lines, here).

$ START=100
$
$ sed -n "${START},$((START + 10))p"  < test_file.txt

The -n says "no print by default".

And then the p says "print this line", for lines within the example range of 100,110

edited Jan 15 '23 at 21:46

answered Jan 15 '23 at 21:18

J_H

17,926
4
24
44

5

or with GNU sed: `sed -n "${START},+10p" test_file.txt` – Cyrus Jan 15 '23 at 21:31

score 2 · Answer 2 · answered Jan 15 '23 at 22:13

When you want to use awk, use something like

seq 1 500 | awk 'NR>=100 && NR<=110'

Advantage of awk is the flexibility for changing the requirements.
When you want to use a variable start and skip the endpoints, it will be

start=100
seq 1 500 | awk -v start="${start}" 'NR > start && NR < start + 10'

score 1 · Answer 3 · answered Jan 15 '23 at 22:39

Another alternative with tail and head:

tail -n +$START test_file.txt | head -n $NUMBER

If test_file.txt is very large and $START and $NUMBER are small, the following variant should be the fastest:

head -n $((START+NUMBER)) test_file.txt | tail -n +$START

Anyway, I prefer the sed solution noticed above for short input files:

sed -n "$START,$((START+NUMBER)) p" test_file.txt

score 0 · Answer 4 · answered Jan 15 '23 at 21:47

0

sed -n "$Start,$End p" file

is likely a better way to get those lines.

answered Jan 15 '23 at 21:47

TRCDev

1
3

score 0 · Answer 5 · answered Jan 16 '23 at 12:50

$ seq 1 500 > test_file.txt
$ awk 'BEGIN{s=100;e=$s+10;} NR>=$s&&NR<=$e' test_file.txt
$

$s in GNU AWK means value of s-th field, $e in GNU AWK means value of e-th field. There are not fields yet in BEGIN clause so $s for any s is not set, as you use in arithemtic context it will be assumed to be 0 and therefore e will be set to value 10. Output of seq is single number per line, so there is not 10th field, so GNU AWK assumes it to be zero when asked to compare it with number, as NR is always strictly bigger than 0 your condition never holds so output is empty.

Use Range if you are able to prepare condition which holds solely for starting line and condition which holds solely for ending line, in this case

awk 'BEGIN{s=100}NR==s,NR==s+10' test_file.txt

gives output

Keep in mind that this will process whole file, if you have huge file and area of interest is relatively near begin, then you might decrease time consumption by ending processing at end of area of interest following way

awk 'BEGIN{s=100}NR>=s{print}NR==s+10{exit}' test_file.txt

(tested in GNU Awk 5.0.1)

score 0 · Answer 6 · answered Jan 16 '23 at 16:30

0

This command extracts 30 lines starting from line 100

sed -n '100,$p' test_file.txt | head -30

answered Jan 16 '23 at 16:30

Francesco Gasparetto

1,819
16
20

Extract lines from text file, using starting line number and amount of lines to extract, in bash?

6 Answers6