How do I read the Nth line of a file and print it to a new file?

Question

I have a folder called foo. Foo has some other folders which might have sub folders and text files. I want to find every file which begins with the name year and and read its Nth line and print it to a new file. For example foo has a file called year1 and the sub folders have files called year2, year3 etc. The program will print the 1st line of year1 to a file called writeout, then it will print the 2nd line of year2 to the file writeout etc.

I also didn't really understand how to do a for loop for a file.

So far I have:

#!/bin/bash

for year* in ~/foo
do
  Here I tried writing some code using the sed command but I can't think of something       else.
done

I also get a message in the terminal which says `year*' not a valid identifier. Any ideas?

Could you please accept one of the answers below as I believe they provided enough information to this question? — Herpes Free Engineer, Feb 12 '18 at 15:01

shellter · Accepted Answer · 2011-11-03T16:35:51.357

38

Sed can help you.

Recall that sed will normally process all lines in a file AND print each line in the file.

You can turn off that feature, and have sed only print lines of interest by matching a pattern or line number.

So, to print the 2nd line of file 2, you can say

sed -n '2p' file2 > newFile2

To print the 2nd line and then stop processing add the q (for quit) command (you also need braces to group the 2 commands together), i.e.

sed -n '2{p;q;}' file2 > newFile2

(if you are processing large files, this can be quite a time saving).

To make that more general, you can change the number to a variable that will hold a number, i.e.

  lineNo=3
  sed -n "${lineNo}{p;q;}" file3 > newFile3

If you want all of your sliced lines to go into 1 file, then use the shells 'append-redirection', i.e.

 for lineNo in 1 2 3 4 5 ; do
     sed -n  "${lineNo}{p;q;}" file${lineNo} >> aggregateFile
 done

The other postings, with using the results of find ... to drive your filelist, are an excellent approach.

I hope this helps.

edited Nov 03 '11 at 16:35

answered Nov 03 '11 at 14:45

shellter

36,525
7
83
90

The grouping syntax works in GNU sed. – glenn jackman Nov 03 '11 at 16:29
@glennjackman : not sure of your point. grouping syntax works in sed on AIX and solaris too, and to my knowledge and belief is part of the original design of sed. Thanks for the feedback :-) – shellter Nov 03 '11 at 16:32
1

If you like Python over sed you can do... `python -c "import sys; print(sys.stdin.readlines()[int(sys.argv[1])-1]).strip()" ` (or of course define an alias for that big thing) – floer32 Jan 17 '14 at 20:26

score 6 · Answer 2 · answered Nov 03 '11 at 14:40

6

Here is one way to do it:

awk "NR==$YEAR" $file

answered Nov 03 '11 at 14:40

Karoly Horvath

94,607
11
117
176

I get the message: Unexpected newline or end of string. – captain Nov 03 '11 at 15:15
then $YEAR is an empty string or not a number... – Karoly Horvath Nov 03 '11 at 16:00

Emil Sit · Answer 3 · 2011-11-03T15:43:27.757

3

Use find to locate the files you want, and then sed to extract what you want:

find foo -type f -name year* |
while read file; do
    line=$(echo $file | sed 's/.*year\([0-9]*\)$/\1/')
    sed -n -e "$line {p; q}" $file
done

This approach:

Use find to produce a list of files with a name starting with the string "year".
Pipes the file list to a while loop to avoid long command lines
Uses sed to extract the desired line number from the name of the file
Uses sed to print just the desired line and then immediately quit. (You can leave out the q and just write ${line}p which would work but be potentially less efficient of $file is big. Also, q may not be fully supported on all versions of sed.)

It will not work properly for files with spaces in their names though.

edited Nov 03 '11 at 15:43

answered Nov 03 '11 at 14:44

Emil Sit

22,894
7
53
75

I get messages saying: sed: -e expression #1, char 7: unknown command: `k' – captain Nov 03 '11 at 14:58
Can you pastebin the output of the command after, running "set -x" to enable debugging? – Emil Sit Nov 03 '11 at 15:09
Sorry but I don't understand what I have to do. I'm a beginner. – captain Nov 03 '11 at 15:14
First type "set -x". Then run the command as above. Then select the text and go to pastebin.com and paste in the error. Then post the URL here as a comment. – Emil Sit Nov 03 '11 at 15:26
+1, but beware that '{p; q}' will not work in all versions of sed. – William Pursell Nov 03 '11 at 15:37
@captain that pastebin is for yi_H's answer... – Emil Sit Nov 03 '11 at 15:44
oops. Here is the right now. http://pastebin.com/uR2MdELB – captain Nov 03 '11 at 15:55
@WilliamPursell : To my knowledge, `{p;q}` will work in any sed. It's in the original O'Rielly book 'Sed' ;-! Some versions require extra semi-colons to work, i.e. `{;p;q;}` or `{p;q;}`. Which version of sed doesn't support this? Thanks for any info. – shellter Nov 03 '11 at 16:39
An alternative to `sed -n 'N{p;q}` is `sed 'N!d;q'` – potong Nov 03 '11 at 17:12

score 1 · Answer 4 · edited Sep 26 '14 at 10:46

1

1.time head -5 emp.lst tail -1
It has taken time for execution is
real 0m0.004s
user 0m0.001s
sys 0m0.001s

or

2.awk 'NR==5' emp.lst
It has taken time for execution is
real 0m0.003s
user 0m0.000s
sys 0m0.002s

or 

3.sed -n '5p' emp.lst
It has taken time for execution is
real 0m0.001s
user 0m0.000s
sys 0m0.001s

or 

4.using some cute trick we can get this with cut command
cut -d “
“ -f 5 emp.lst
# after -d press enter ,it means delimiter is newline
It has taken time for execution is
real 0m0.001s

edited Sep 26 '14 at 10:46

Sébastien

11,860
11
58
78

answered Sep 26 '14 at 10:41

parmeet

11
1

1

While your answer may solve the question, it is always better if you can provide a description of what the issue was and how your answer solves it. This is a suggestion for further improving this and future answers. – Luís Cruz Sep 26 '14 at 10:52
1

Can you elaborate how your answer is working & helpful? – Rajesh Ujade Sep 26 '14 at 11:04

score 1 · Answer 5 · answered Nov 29 '14 at 20:10

The best way that always works, provided you provide 2 arguments:

$ touch myfile
$ touch mycommand
$ chmod +x mycommand
$ touch yearfiles
$ find / -type f -name year* >> yearfiles
$ nano mycommand
$ touch foo

Type this:

#/bin/bash
head -n $1 $2 >> myfile
less -n 1 myfile >> foo

Use ^X, y, and enter to save. Then run mycommand:

$ ./mycommand 2 yearfiles
$ cat foo
year2

Presuming your year files are:

year1, year2, year3

Additionally, now you have setup, you just have to use $ ./mycommand LINENUMBER FILENAME from now on.

score 1 · Answer 6 · answered Jan 23 '15 at 15:53

1

Here you go

sed ${index}'q;d' ${input_file} > ${output_file}

answered Jan 23 '15 at 15:53

Karol Król

3,320
1
34
37

score 0 · Answer 7 · answered Nov 03 '11 at 14:36

Your task has two sub-tasks: Find the name of all the year files, and then extract the Nth line. Consider the following script:

for file in `find foo -name 'year*'`; do
     YEAR=`echo $file | sed -e 's/.*year\([0-9]*\)$/\1/'`
     head -n $YEAR $file | tail -n 1
done

The find call finds the matching files for you in the directory foo. The second line extracts only the digits at the end of the filename from the filename. The third line then extracts the first N lines from the file, keeping only the last of the first N lines (read: only the Nth line).

Am I supposed to see something on my screen? Because I get just a blank line. — captain, Nov 03 '11 at 14:59

How do I read the Nth line of a file and print it to a new file?

7 Answers7