How to convert DOS/Windows newline (CRLF) to Unix newline (LF)

Question

How can I programmatically (not using vi) convert DOS/Windows newlines to Unix newlines?

The dos2unix and unix2dos commands are not available on certain systems.
How can I emulate them with commands such as sed, awk, and tr?

In general, just install `dos2unix` using your package manager, it really is much simpler and does exist on most platforms. — Brad Koch, Oct 20 '15 at 20:15
Agreed! @BradKoch Simple as 'brew install dos2unix' on Mac OSX — SmileIT, Apr 03 '18 at 13:57
Not all users have root access, and thus cannot install packages. Maybe that's why the user asked the very specific question he asked. — bsd, Jan 30 '22 at 10:24

score 414 · Answer 1 · edited Mar 09 '20 at 16:26

You can use tr to convert from DOS to Unix; however, you can only do this safely if CR appears in your file only as the first byte of a CRLF byte pair. This is usually the case. You then use:

tr -d '\015' <DOS-file >UNIX-file

Note that the name DOS-file is different from the name UNIX-file; if you try to use the same name twice, you will end up with no data in the file.
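
The same-name pitfall can be avoided by writing to a temporary file first. The following is a minimal sketch, not part of the original answer; the function name `crlf_to_lf_inplace` is hypothetical, and it assumes `mktemp` is available (it is common but not guaranteed on every system).

```shell
# Hypothetical helper: delete every CR from a file "in place" via a temporary file.
crlf_to_lf_inplace() {
    file=$1
    # Assumption: mktemp exists; the temp file is created next to the original.
    tmp=$(mktemp "${file}.XXXXXX") || return 1
    # Replace the original only if tr succeeded; otherwise discard the temp file.
    if tr -d '\015' <"$file" >"$tmp"; then
        mv "$tmp" "$file"
    else
        rm -f "$tmp"
        return 1
    fi
}
```

Usage would be `crlf_to_lf_inplace DOS-file`. Note that the replaced file takes on the permissions of the temporary file, so this sketch may need adjusting where file modes matter.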

You can't do it the other way round (with standard 'tr').

If you know how to enter carriage return into a script (control-V, control-M to enter control-M), then:

sed 's/^M$//'     # DOS to Unix
sed 's/$/^M/'     # Unix to DOS

where the '^M' is the control-M character. You can also use the bash ANSI-C Quoting mechanism to specify the carriage return:

sed $'s/\r$//'     # DOS to Unix
sed $'s/$/\r/'     # Unix to DOS
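
For shells that do not support `$'...'` quoting, one possible variant is to capture the carriage return with `printf`, as suggested in the comments below. This is a sketch under that assumption; the function names `dos_to_unix` and `unix_to_dos` are made up for illustration.

```shell
# Capture a literal carriage return; command substitution strips only trailing newlines.
cr=$(printf '\r')

# Both functions are filters: they read stdin and write stdout.
dos_to_unix() { sed "s/${cr}\$//"; }   # remove one CR at end of line
unix_to_dos() { sed "s/\$/${cr}/"; }   # append one CR at end of line
```

For example: `dos_to_unix <DOS-file >UNIX-file`.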

However, if you're going to have to do this very often (more than once, roughly speaking), it is far more sensible to install the conversion programs (e.g. dos2unix and unix2dos, or perhaps dtou and utod) and use them.

If you need to process entire directories and subdirectories, you can use zip:

zip -r -ll zipfile.zip somedir/
unzip zipfile.zip

This will create a zip archive with line endings changed from CRLF to LF. unzip will then put the converted files back in place (and ask you file by file; you can answer "yes to all"). Credits to @vmsnomad for pointing this out.
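
Where zip is not available, a `find`-based loop is one alternative for whole directory trees. The sketch below is only an illustration: the function name `convert_tree` and the `*.txt` filter are assumptions, and it deletes every CR in the matched files, so it should only be pointed at text files.

```shell
# Hypothetical helper: convert every *.txt file under a directory, in place.
convert_tree() {
    dir=$1
    # find passes batches of file names to a small inline sh script.
    find "$dir" -type f -name '*.txt' -exec sh -c '
        for f do
            tmp="$f.tmp.$$"
            # Overwrite the original only if tr succeeded.
            tr -d "\015" <"$f" >"$tmp" && mv "$tmp" "$f"
        done
    ' sh {} +
}
```

Usage would be `convert_tree somedir/`. Because the file names are passed as arguments rather than parsed from text, names containing spaces are handled.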

edited Mar 09 '20 at 16:26

caram

answered Apr 10 '10 at 15:13

Jonathan Leffler

20

using `tr -d '\015' <DOS-file >UNIX-file` where `DOS-file` == `UNIX-file` just results in an empty file. The output file has to be a different file, unfortunately. – Buttle Butkus Nov 15 '13 at 01:50
3

@ButtleButkus: Well, yes; that's why I used two different names. If you zap the input file before the program reads it all, as you do when you use the same name twice, you end up with an empty file. That is uniform behaviour on Unix-like systems. It requires special code to handle overwriting an input file safely. Follow the instructions and you will be OK. – Jonathan Leffler Nov 15 '13 at 01:56
I seem to remember in-file search-replace functionality somewhere. – Buttle Butkus Nov 15 '13 at 02:08
4

There are places; you have to know where to find them. Within limits, the GNU `sed` option `-i` (for in-place) works; the limits are linked files and symlinks. The `sort` command has 'always' (since 1979, if not earlier) supported the `-o` option which can list one of the input files. However, that is in part because `sort` must read all its input before it can write any of its output. Other programs sporadically support overwriting one of their input files. You can find a general purpose program (script) to avoid problems in _'The UNIX Programming Environment'_ by Kernighan & Pike. – Jonathan Leffler Nov 15 '13 at 02:14
4

The third option worked for me, thanks. I did use the -i option: `sed -i $'s/\r$//' filename` - to edit in place. I am working on a machine that does not have access to the internet, so software installation is a problem. – Warren Dew Nov 24 '14 at 17:40
Here's how to do it without changing the filename: `tr -d '\015' < original_file > t && mv t original_file` - basically works by creating temp file, then overwriting the old one with it. – kethinov Apr 27 '16 at 00:21
@JonathanLeffler FYI, for macOS users: `sed` does not (by default; not sure whether this can be changed) recognise the escaped versions `\r`, `\015`, `\x0d` for carriage return; `sed` does recognise CR when entered with `ctrl-v ctrl-m` as described above, which is OK for the command line; for scripts try `sed "s/$(printf '\r')$//"` (hat tip @twm), or fall back to `tr`, which recognises `\r` and `\015`. – t0rst Sep 14 '16 at 08:40
3

@JonathanLeffler The general-purpose program is called `sponge` and can be found in [moreutils](https://joeyh.name/code/moreutils/): `tr -d '\015' < original_file | sponge original_file`. I use it daily. – eush77 Mar 24 '17 at 15:56
How do I do this recursively? – Aaron Franke Jan 27 '19 at 03:44
@AaronFranke: it depends on what your scenario looks like. In my book, if you need to modify a whole lot of files the same way, you use a script to encapsulate the processing (even if you throw it away after you've finished), and then use a tool such as `find` to identify the files that need changing (or otherwise create a list of file names — one hopes they don't have spaces and other unruly punctuation in the names) and then apply the script to the files. Using `find … -exec sh script.sh {} +` is pretty effective. The alternatives are legion. The `find` technique works with absurd names. – Jonathan Leffler Jan 28 '19 at 17:24
If you accidentally apply `sed $'s/$/\r/'` twice, the file will have the CR twice. For scripting solutions I recommend the following: `sed 's/^$/\r/;s/\([^\r]\)$/\1\r/g'` For simplicity I would state this as the third way to make the original idea point out. – Davidius Sep 01 '20 at 07:07
1

The zip file method works really well! – mbomb007 May 06 '22 at 13:44

score 90 · Answer 2 · edited Jan 28 '23 at 08:31

You can use Vim programmatically with the option -c {command}:

DOS to Unix:

vim file.txt -c "set ff=unix" -c ":wq"

Unix to DOS:

vim file.txt -c "set ff=dos" -c ":wq"

"set ff=unix/dos" means change fileformat (ff) of the file to Unix/DOS end of line format.

":wq" means write the file to disk and quit the editor (which allows the command to be used in a loop).

edited Jan 28 '23 at 08:31

EsmaeelE

answered Aug 31 '18 at 10:03

Johan Zicola

9

you can use ":x" instead of ":wq" – JosephConrad Jul 05 '19 at 11:19

score 82 · Answer 3 · edited Apr 08 '21 at 11:18

Use:

tr -d "\r" < file

Take a look here for examples using sed:

# In a Unix environment: convert DOS newlines (CR/LF) to Unix format.
sed 's/.$//'               # Assumes that all lines end with CR/LF
sed 's/^M$//'              # In Bash/tcsh, press Ctrl-V then Ctrl-M
sed 's/\x0D$//'            # Works on ssed, gsed 3.02.80 or higher

# In a Unix environment: convert Unix newlines (LF) to DOS format.
sed "s/$/`echo -e \\\r`/"            # Command line under ksh
sed 's/$'"/`echo \\\r`/"             # Command line under bash
sed "s/$/`echo \\\r`/"               # Command line under zsh
sed 's/$/\r/'                        # gsed 3.02.80 or higher

Use sed -i for in-place conversion, e.g., sed -i 's/..../' file.
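
Since the first one-liner assumes that every line ends with CR/LF, it can help to check that before converting. The sketch below is an illustrative addition, not from the linked one-liners; the function name `count_crlf_lines` is hypothetical.

```shell
# Capture a literal carriage return in a portable way.
cr=$(printf '\r')

# Hypothetical helper: print how many lines of a file end with CR.
count_crlf_lines() {
    # grep -c prints the number of matching lines (and exits non-zero when that is 0).
    grep -c "${cr}\$" "$1"
}
```

Comparing the result with `wc -l <file` shows whether all lines, some lines, or no lines carry a CR, which in turn indicates whether `sed 's/.$//'` is safe to use.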

edited Apr 08 '21 at 11:18

Peter Mortensen

answered Apr 10 '10 at 15:21

ghostdog74

11

I used a variant since my file only had `\r` : `tr "\r" "\n" < infile > outfile` – Matt Todd Nov 19 '10 at 00:29
1

@MattTodd could you post this as an answer? the `-d` is featured more frequently and will not help in the "only `\r`" situation. – n611x007 Oct 14 '13 at 15:20
5

Note that the proposed `\r` to `\n` mapping has the effect of double-spacing the files; each single CRLF line ending in DOS becomes `\n\n` in Unix. – Jonathan Leffler Apr 30 '14 at 13:58
Can I do this recursively? – Aaron Franke Jan 27 '19 at 03:45

Boris Verkhovskiy · Answer 4 · 2021-04-01T11:26:13.057

56

Install dos2unix, then convert a file in-place with

dos2unix <filename>

To output converted text to a different file use

dos2unix -n <input-file> <output-file>

You can install it on Ubuntu or Debian with

sudo apt install dos2unix

or on macOS using Homebrew

brew install dos2unix
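
For scripts that must run both on machines with and without dos2unix, a small wrapper is one option. This is a sketch with a hypothetical function name `to_unix`; note that the two branches are not identical in behaviour, since the `tr` fallback deletes every CR, not only those that are part of a CRLF pair.

```shell
# Hypothetical wrapper. Usage: to_unix INPUT OUTPUT (the two names must differ).
to_unix() {
    if command -v dos2unix >/dev/null 2>&1; then
        # -n writes the converted text to a new file instead of converting in place.
        dos2unix -n "$1" "$2"
    else
        # Fallback: remove all carriage returns.
        tr -d '\015' <"$1" >"$2"
    fi
}
```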

edited Apr 01 '21 at 11:26

answered Jul 18 '18 at 00:34

Boris Verkhovskiy

4

I know the question asks for alternatives to dos2unix but it's the first google result. – Boris Verkhovskiy Jun 23 '19 at 01:31

score 34 · Answer 5 · answered Apr 10 '10 at 15:09

Using AWK you can do:

awk '{ sub("\r$", ""); print }' dos.txt > unix.txt

Using Perl you can do:

perl -pe 's/\r$//' < dos.txt > unix.txt

answered Apr 10 '10 at 15:09

codaddict

2

A nice, _portable_ `awk` solution. – mklement0 Feb 28 '15 at 05:29

score 19 · Answer 6 · answered Apr 10 '10 at 22:32

This problem can be solved with standard tools, but there are sufficiently many traps for the unwary that I recommend you install the flip command, which was written over 20 years ago by Rahul Dhesi, the author of zoo. It does an excellent job converting file formats while, for example, avoiding the inadvertent destruction of binary files, which is a little too easy if you just race around altering every CRLF you see...
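
To illustrate the binary-file trap with standard tools only, here is a rough sketch. It is not how flip works; it assumes a simple heuristic (a file containing NUL bytes is treated as binary), and both function names are hypothetical.

```shell
# Heuristic (an assumption): a file is "probably text" if it has no NUL bytes.
is_probably_text() {
    # Deleting NULs leaves the data unchanged only when there were none.
    tr -d '\000' <"$1" | cmp -s - "$1"
}

# Hypothetical helper: convert a file in place unless it looks binary.
convert_if_text() {
    if is_probably_text "$1"; then
        tr -d '\015' <"$1" >"$1.tmp" && mv "$1.tmp" "$1"
    else
        echo "skipping binary file: $1" >&2
    fi
}
```

The heuristic is crude, so backups are still advisable before converting many files.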

answered Apr 10 '10 at 22:32

Norman Ramsey

Any way to do this in a streaming fashion, without modifying the original file? – augurar Dec 07 '13 at 22:08
@augurar you may check "similar packages" https://packages.debian.org/wheezy/flip – n611x007 Aug 19 '14 at 11:12
I had an experience of breaking half of my OS just by running texxto with a wrong flag. Be careful especially if you want to do it on entire folders. – A_P Sep 13 '18 at 13:21
The link seems to be broken (times out - *"504 Gateway Time-out"*). – Peter Mortensen Apr 08 '21 at 11:21

score 16 · Answer 7 · edited Apr 08 '21 at 11:29

If you don't have access to dos2unix, but can read this page, then you can copy/paste dos2unix.py from here.

#!/usr/bin/env python
"""\
convert dos linefeeds (crlf) to unix (lf)
usage: dos2unix.py <input> <output>
"""
import sys

if len(sys.argv[1:]) != 2:
  sys.exit(__doc__)

content = b''
outsize = 0
# Read and write in binary mode so that no newline translation happens implicitly.
with open(sys.argv[1], 'rb') as infile:
  content = infile.read()
with open(sys.argv[2], 'wb') as output:
  # Note: splitlines() also splits on a lone CR, not only on CRLF and LF.
  for line in content.splitlines():
    outsize += len(line) + 1
    output.write(line + b'\n')  # bytes literal, so this also runs on Python 3

print("Done. Saved %s bytes." % (len(content)-outsize))

(Cross-posted from Super User.)

edited Apr 08 '21 at 11:29

Peter Mortensen

answered Oct 31 '13 at 09:40

anatoly techtonik

2

The usage is misleading. The real `dos2unix` converts *all* input files by default. Your usage implies `-n` parameter. And the real `dos2unix` is a filter that reads from stdin, writes to stdout if the files are not given. – jfs Jul 06 '15 at 11:32
Also, this won't work on some platforms since there is no `python` -- they apparently can't be bothered with backward compatibility, so it is `python2` or `python3` or ... – user9645 Sep 01 '21 at 11:13

score 15 · Answer 8 · edited Apr 10 '10 at 20:20

The solutions posted so far only deal with part of the problem, converting DOS/Windows' CRLF into Unix's LF; the part they're missing is that DOS uses CRLF as a line separator, while Unix uses LF as a line terminator. The difference is that a DOS file (usually) won't have anything after the last line in the file, while Unix will. To do the conversion properly, you need to add that final LF (unless the file is zero-length, i.e. has no lines in it at all). My favorite incantation for this (with a little added logic to handle Mac-style CR-separated files, and not molest files that're already in unix format) is a bit of perl:

perl -pe 'if ( s/\r\n?/\n/g ) { $f=1 }; if ( $f || ! $m ) { s/([^\n])\z/$1\n/ }; $m=1' PCfile.txt

Note that this sends the Unixified version of the file to stdout. If you want to replace the file with a Unixified version, add perl's -i flag.
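
The "add the final LF" step on its own can also be sketched in plain shell, for cases where Perl is not available. This is an illustrative addition, separate from the Perl command above; the function name `ensure_final_newline` is hypothetical.

```shell
# Hypothetical helper: append an LF if a non-empty file does not already end with one.
ensure_final_newline() {
    # Command substitution strips a trailing newline, so the result is empty
    # when the last byte is already LF.
    if [ -s "$1" ] && [ -n "$(tail -c 1 "$1")" ]; then
        printf '\n' >>"$1"
    fi
}
```

It would typically run after the CR removal, for example after `tr -d '\015'` has produced the Unix file. Zero-length files are left untouched, as described above.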

edited Apr 10 '10 at 20:20

Jonathan Leffler

answered Apr 10 '10 at 17:50

Gordon Davisson

@LudovicZenohateLagouardette Was it a plain text file (i.e. csv or tab-delimited text), or something else? If it was in some database-ish format, manipulating it as if it was text is very likely to corrupt its internal structure. – Gordon Davisson Jan 23 '16 at 20:53
A plain text csv, but I think the encoding was strange. I think it messed up because of that. However don't worry. I am always collecting backups and this wasn't even the real dataset, just a 1gb one. The real is a 26gb. – Ludovic Zenohate Lagouardette Jan 24 '16 at 08:02

score 10 · Answer 9 · edited Apr 08 '21 at 11:55

It is super duper easy with PCRE;

As a script, or replace $@ with your files.

#!/usr/bin/env bash
perl -pi -e 's/\r\n/\n/g' -- "$@"

This will overwrite your files in place!

I recommend only doing this with a backup (version control or otherwise)

edited Apr 08 '21 at 11:55

Peter Mortensen

answered Jul 30 '15 at 17:38

ThorSummoner

Thank you! This works, although I'm writing the filename and no `--`. I chose this solution because it's easy to understand and adapt for me. FYI, this is what the switches do: `-p` assume a "while input" loop, `-i` edit input file in place, `-e` execute following command – Rolf Oct 11 '17 at 12:21
Strictly speaking, PCRE is a reimplementation of Perl's regex engine, not the regex engine from Perl. They both have this capability, though there are also differences, in spite of the implication in the name. – tripleee Oct 27 '17 at 08:24

score 6 · Answer 10 · edited Apr 08 '21 at 11:50

An even simpler AWK solution without a program:

awk -v ORS='\r\n' '1' unix.txt > dos.txt

Technically, '1' is your program, because AWK requires one when given an option.

Alternatively, a solution that uses only shell built-ins is:

while IFS= read -r line;
do printf '%s\n' "${line%$'\r'}";
done < dos.txt > unix.txt
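
One gap in the loop above is that a last line without a trailing newline is dropped, because `read` returns non-zero at end of file. A possible variant, written as a sketch with a hypothetical function name `strip_cr_lines` and with the CR obtained via `printf` so that it does not depend on `$'...'` quoting:

```shell
# Capture a literal carriage return in a portable way.
cr=$(printf '\r')

# Hypothetical filter: strip one trailing CR from each line of stdin.
strip_cr_lines() {
    # The extra test keeps a final line that has no terminating LF.
    while IFS= read -r line || [ -n "$line" ]; do
        printf '%s\n' "${line%"$cr"}"
    done
}
```

For example: `strip_cr_lines < dos.txt > unix.txt`. As noted in the comments, a shell loop like this is usually much slower than `awk` or `sed` on large files.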

edited Apr 08 '21 at 11:50

Peter Mortensen

answered Sep 04 '14 at 00:16

nawK

That's handy, but just to be clear: this translates Unix -> Windows/DOS, which is the _opposite direction_ of what the OP asked for. – mklement0 Feb 28 '15 at 06:01
5

It was done on purpose, left as an exercise for the author. _eyerolls_ `awk -v RS='\r\n' '1' dos.txt > unix.txt` – nawK Mar 01 '15 at 04:14
Great (and kudos to you for pedagogic finesse). – mklement0 Mar 01 '15 at 04:35
1

"b/c awk requires one when given option." - awk _always_ requires a program, whether options are specified or not. – mklement0 Mar 01 '15 at 04:37
2

The pure bash solution is interesting, but much slower than an equivalent `awk` or `sed` solution. Also, you must use `while IFS= read -r line` to faithfully preserve the input lines, otherwise leading and trailing whitespace is trimmed (alternatively, use no variable name in the `read` command and work with `$REPLY`). – mklement0 Mar 01 '15 at 06:14
Why clear $IFS? If you read in to one variable (or none, and implicitly `$REPLY`), read just splits on line endings, and you can just use echo instead of printf (echo is more likely to be a builtin, and it's generally faster). So, using ctrl-v+ctrl-m to type the \r, one can simply do `while read -r; do echo "${REPLY%^M}"; done < file > file.fixed` and it's about the same speed as sed. – dannysauer May 13 '15 at 02:59

score 6 · Answer 11 · edited Apr 08 '21 at 12:05

Interestingly, in my Git Bash on Windows, sed "" did the trick already:

$ echo -e "abc\r" >tst.txt
$ file tst.txt
tst.txt: ASCII text, with CRLF line terminators
$ sed -i "" tst.txt
$ file tst.txt
tst.txt: ASCII text

My guess is that sed ignores them when reading lines from the input and always writes Unix line endings to the output.

edited Apr 08 '21 at 12:05

Peter Mortensen

answered Jul 21 '17 at 09:21

user829755

1

On a LF type system like GNU/Linux, `sed ""` will not do the trick, though. – ndim May 27 '21 at 03:53

score 4 · Answer 12 · edited Apr 08 '21 at 11:32

For Mac OS X if you have Homebrew installed (http://brew.sh/):

brew install dos2unix

for csv in *.csv; do dos2unix -c mac "${csv}"; done;

Make sure you have made copies of the files, as this command will modify the files in place. The -c mac option makes the switch to be compatible with OS X.

edited Apr 08 '21 at 11:32

Peter Mortensen

answered May 19 '14 at 23:25

Ashley Raiteri

1

This answer really doesn't answer the original poster's question. – hlin117 Feb 07 '15 at 17:43
3

OS X users should not use `-c mac`, which is for converting pre-OS X `CR`-only newlines. You want to use that mode only for files to and from Mac OS 9 or before. – askewchan Apr 14 '16 at 13:20

score 4 · Answer 13 · edited Apr 08 '21 at 12:03

I just had to ponder that same question (on the Windows side, but equally applicable to Linux).

Surprisingly, nobody mentioned a very much automated way of doing CRLF <-> LF conversion for text-files using the good old zip -ll option (Info-ZIP):

zip -ll textfiles-lf.zip files-with-crlf-eol.*
unzip textfiles-lf.zip

NOTE: this would create a ZIP file preserving the original file names, but converting the line endings to LF. Then unzip would extract the files as zip'ed, that is, with their original names (but with LF-endings), thus prompting to overwrite the local original files if any.

The relevant excerpt from the zip --help:

zip --help
...
-l   convert LF to CR LF (-ll CR LF to LF)

Best answer, according to me, as it can process entire directories and sub-directories. I'm glad I dug that far down. – caram, Mar 09 '20 at 13:24

John Paul · Answer 14 · 2021-10-22T21:50:36.833

4

sed -i.bak --expression='s/\r\n/\n/g' <file_path>

Since the question mentions sed, this is the most straightforward way to use sed to achieve this. The expression says replace all carriage-returns and line-feeds with just line-feeds only. That is what you need when you go from Windows to Unix. I verified it works.

edited Oct 22 '21 at 21:50

answered Oct 18 '18 at 14:51

John Paul

Hey John Paul--this answer got flagged for deletion so came up in a review queue for me. In general, when you've got a question like this that's 8 years old, with 22 answers, you'll want to explain how your answer is useful in a way that other existing answers are not. – zzxyz Oct 18 '18 at 22:34
I could not get this to work when adding `--in-place mydosfile.txt` to the end (or piping to a file). The end result was the file still had CRLF. I was testing on a Graviton (AArch64) EC2 instance. – Neil C. Obremski Oct 21 '21 at 20:17
@NeilC.Obremski I updated with full command line, please try that. It will also make a backup before change. – John Paul Oct 22 '21 at 21:52
1

`sed 's/\r\n/\n/g'` does not match anything. Refer to [can-sed-replace-new-line-characters](https://unix.stackexchange.com/questions/114943/can-sed-replace-new-line-characters) – zhenguoli Jan 05 '22 at 06:52
It worked for me. – John Paul Jan 06 '22 at 07:34

Eduardo Lucio · Answer 15 · 2022-09-07T01:16:17.463

Just complementing @Jonathan Leffler's excellent answer, if you have a file with mixed line endings (LF and CRLF) and you need to normalize to CRLF (DOS), use the following commands in sequence...

# DOS to Unix
sed -i $'s/\r$//' "<YOUR_FILE>"

# Unix to DOS (normalized)
sed -i $'s/$/\r/' "<YOUR_FILE>"

NOTE: If you have a file with mixed line endings (LF and CRLF), the second command above alone will cause a mess.

If you need to convert to LF (Unix) the first command alone will be enough...

# DOS to Unix
sed -i $'s/\r$//' "<YOUR_FILE>"

Thanks!

[Ref(s).: https://stackoverflow.com/a/3777853/3223785 ]

score 3 · Answer 16 · edited Apr 08 '21 at 11:56

TIMTOWTDI!

perl -pe 's/\r\n/\n/; s/([^\n])\z/$1\n/ if eof' PCfile.txt

Based on Gordon Davisson's answer.

One must consider the possibility of [noeol]...

edited Apr 08 '21 at 11:56

Peter Mortensen

answered May 31 '16 at 17:15

lzc

score 3 · Answer 17 · edited Apr 08 '21 at 12:00

You can use AWK. Set the record separator (RS) to a regular expression that matches every possible newline character or character sequence, and set the output record separator (ORS) to the Unix-style newline character.

awk 'BEGIN{RS="\r|\n|\r\n|\n\r";ORS="\n"}{print}' windows_or_macos.txt > unix.txt

edited Apr 08 '21 at 12:00

Peter Mortensen

answered Nov 06 '16 at 23:30

kazmer

That's the one that worked for me (MacOS, `git diff` shows ^M, edited in vim) – Dorian Mar 01 '17 at 09:17
Your command put an extra blank line in between every line when converting a DOS file. Doing this `awk 'BEGIN{RS="\r\n";ORS=""}{print}' dosfile > unixfile` fixed that issue, but it still does not fix the missing EOL on the last line. – user9645 Sep 01 '21 at 11:04

score 2 · Answer 18 · answered Mar 12 '15 at 22:36

This worked for me

tr "\r" "\n" < sampledata.csv > sampledata2.csv

answered Mar 12 '15 at 22:36

Santosh

11

This will convert every _single_ DOS-newline into _two_ UNIX-newlines. – Melebius Aug 04 '15 at 06:11

score 2 · Answer 19 · edited Apr 08 '21 at 12:08

On Linux, it's easy to convert ^M (Ctrl + M) to *nix newlines (^J) with sed.

It will be something like this on the CLI, and there will actually be a line break in the text. However, the \ passes that ^J along to sed:

sed 's/^M/\
/g' < ffmpeg.log > new.log

You get this by using ^V (Ctrl + V), ^M (Ctrl + M) and \ (backslash) as you type:

sed 's/^V^M/\^V^J/g' < ffmpeg.log > new.log

edited Apr 08 '21 at 12:08

Peter Mortensen

answered Jul 13 '18 at 13:43

jet

score 0 · Answer 20 · edited Apr 08 '21 at 12:01

As an extension to Jonathan Leffler's Unix to DOS solution, to safely convert to DOS when you're unsure of the file's current line endings:

sed '/^M$/! s/$/^M/'

This checks that the line does not already end in CRLF before converting to CRLF.
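
The same idea can be written without typing a literal control-M. The sketch below takes a slightly different route (strip any trailing CRs, then add exactly one), so unlike the command above it also collapses repeated CRs at end of line; the function name `to_dos` is hypothetical.

```shell
# Capture a literal carriage return in a portable way.
cr=$(printf '\r')

# Hypothetical filter: normalise every line ending on stdin to CRLF.
to_dos() {
    # First remove any CRs already at end of line, then append exactly one.
    sed -e "s/${cr}*\$//" -e "s/\$/${cr}/"
}
```

Because the first expression removes existing CRs, running the filter twice gives the same result as running it once.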

edited Apr 08 '21 at 12:01

Peter Mortensen

answered Jan 24 '17 at 08:38

Gannet

score 0 · Answer 21 · edited Apr 08 '21 at 12:12

I made a script based on the accepted answer, so you can convert it directly without needing an additional file in the end and removing and renaming afterwards.

convert-crlf-to-lf() {
    file="$1"
    # Write to a temporary sibling file, then replace the original only if tr succeeded.
    tr -d '\015' <"$file" >"$file"2 && mv "$file"2 "$file"
}

Just make sure if you have a file like "file1.txt" that "file1.txt2" doesn't already exist or it will be overwritten. I use this as a temporary place to store the file in.

score 0 · Answer 22 · edited Apr 08 '21 at 12:13

With Bash 4.2 and newer you can use something like this to strip the trailing CR, which only uses Bash built-ins:

if [[ "${str: -1}" == $'\r' ]]; then
    str="${str:: -1}"
fi
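
For older Bash versions or other POSIX-style shells, suffix removal with `%` is one alternative that avoids negative substring lengths. A minimal sketch, with a hypothetical function name `strip_trailing_cr`:

```shell
# Capture a literal carriage return in a portable way.
cr=$(printf '\r')

# Hypothetical helper: print its argument without a single trailing CR, if present.
strip_trailing_cr() {
    printf '%s\n' "${1%"$cr"}"
}
```

If the string does not end in CR, `%` removes nothing, so no explicit test is needed.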

edited Apr 08 '21 at 12:13

Peter Mortensen

answered May 29 '20 at 17:45

glevand

score -3 · Answer 23 · edited Apr 08 '21 at 11:28

I tried

sed 's/^M$//' file.txt

on OS X as well as several other methods (Fixing Dos Line Endings or http://hintsforums.macworld.com/archive/index.php/t-125.html). None worked, and the file remained unchanged (by the way, Ctrl + V, Enter was needed to reproduce ^M). In the end I used TextWrangler. It's not strictly command line, but it works and it doesn't complain.

edited Apr 08 '21 at 11:28

Peter Mortensen

answered Sep 10 '13 at 13:08

mercergeoinfo

The hintsforums.macworld.com link is (effectively) broken - it redirects to the main page, "http://hints.macworld.com/" – Peter Mortensen Apr 08 '21 at 11:28
command is missing the -i option – david Jul 24 '22 at 12:20
