Remove a fixed prefix/suffix from a string in Bash

Question

I want to remove the prefix/suffix from a string. For example, given:

string="hello-world"
prefix="hell"
suffix="ld"

How do I get the following result?

"o-wor"

Be very wary when linking to the so-called Advanced Bash Scripting Guide; it contains a mixture of good advice and terrible. — tripleee, Oct 19 '16 at 03:38

score 1160 · Accepted Answer · edited Mar 14 '22 at 18:25

1160

$ prefix="hell"
$ suffix="ld"
$ string="hello-world"
$ foo=${string#"$prefix"}
$ foo=${foo%"$suffix"}
$ echo "${foo}"
o-wor

This is documented in the Shell Parameter Expansion section of the manual:

${parameter#word}
${parameter##word}

The word is expanded to produce a pattern and matched according to the rules described below (see Pattern Matching). If the pattern matches the beginning of the expanded value of parameter, then the result of the expansion is the expanded value of parameter with the shortest matching pattern (the # case) or the longest matching pattern (the ## case) deleted. […]

${parameter%word}
${parameter%%word}

The word is expanded to produce a pattern and matched according to the rules described below (see Pattern Matching). If the pattern matches a trailing portion of the expanded value of parameter, then the result of the expansion is the value of parameter with the shortest matching pattern (the % case) or the longest matching pattern (the %% case) deleted. […]

edited Mar 14 '22 at 18:25

cosbor11

14,709
10
54
69

answered May 18 '13 at 11:43

Adrian Frühwirth

42,970
10
60
71

44

Is there a way to combine the two in one line? I tried `${${string#prefix}%suffix}` but it doesn't work. – static_rtti Mar 05 '14 at 08:18
35

@static_rtti No, unfortunately you cannot nest parameter substitution like this. I know, it's a shame. – Adrian Frühwirth Mar 05 '14 at 08:34
139

@AdrianFrühwirth : the whole language is a shame, but it's so useful :) – static_rtti Mar 05 '14 at 09:24
3

@static_rtti , there is a workaround: echo `basename ${string/hell} ld` (where the grey part is between backticks) – ROMANIA_engineer Dec 27 '15 at 16:54
1

@AdrianFrühwirth: The replacement is case sensitive. Any way to make it case insensitive? – ccpizza Jul 25 '16 at 13:22
2

@ccpizza Parameter substitution does not know such modifiers but if it's a fixed string you could always do e.g. `${foo#[Bb][Aa][Rr]}`. Not pretty but still possibly better than an unnecessary subshell/fork, depending on the situation. – Adrian Frühwirth Jul 25 '16 at 18:22
OT: What @static_rtti wrote does work in ZSH using that exact syntax :-) – ntninja Apr 07 '19 at 16:40
In case anyone is looking for it, escape with \ also works here, i.e. `${parameter%\[*\]}` will remove [any-string-here] suffix. – nt86 Jul 02 '21 at 08:53
@JasonOwen Do consider deleting your above ABS link. Entire competing reference sources have written (notably including the [BashGuide](https://mywiki.wooledge.org/BashGuide)) because people (particularly, the elders of what was then the freenode #bash IRC channel) got fed up with trying to teach newbies to stop reusing bad practices they picked up from the ABS. – Charles Duffy Jun 19 '22 at 12:39

score 157 · Answer 2 · edited Aug 09 '16 at 17:01

Using sed:

$ echo "$string" | sed -e "s/^$prefix//" -e "s/$suffix$//"
o-wor

Within the sed command, the ^ character matches text beginning with $prefix, and the trailing $ matches text ending with $suffix.

Adrian Frühwirth makes some good points in the comments below, but sed for this purpose can be very useful. The fact that the contents of $prefix and $suffix are interpreted by sed can be either good OR bad- as long as you pay attention, you should be fine. The beauty is, you can do something like this:

$ prefix='^.*ll'
$ suffix='ld$'
$ echo "$string" | sed -e "s/^$prefix//" -e "s/$suffix$//"
o-wor

which may be what you want, and is both fancier and more powerful than bash variable substitution. If you remember that with great power comes great responsibility (as Spiderman says), you should be fine.

A quick introduction to sed can be found at http://evc-cit.info/cit052/sed_tutorial.html

A note regarding the shell and its use of strings:

For the particular example given, the following would work as well:

$ echo $string | sed -e s/^$prefix// -e s/$suffix$//

...but only because:

echo doesn't care how many strings are in its argument list, and
There are no spaces in $prefix and $suffix

It's generally good practice to quote a string on the command line because even if it contains spaces it will be presented to the command as a single argument. We quote $prefix and $suffix for the same reason: each edit command to sed will be passed as one string. We use double quotes because they allow for variable interpolation; had we used single quotes the sed command would have gotten a literal $prefix and $suffix which is certainly not what we wanted.

Notice, too, my use of single quotes when setting the variables prefix and suffix. We certainly don't want anything in the strings to be interpreted, so we single quote them so no interpolation takes place. Again, it may not be necessary in this example but it's a very good habit to get into.

Unfortunately, this is bad advice for several reasons: 1) Unquoted, `$string` is subject to word splitting and globbing. 2) `$prefix` and `$suffix` can contain expressions that `sed` will interpret, e.g. regular expressions or the character used as delimiter which will break the whole command. 3) Calling `sed` two times is not necessary (you can `-e 's///' -e '///'` instead) and the pipe could also be avoided. For example, consider `string='./ *'` and/or `prefix='./'` and see it break horribly due to `1)` and `2)`. — Adrian Frühwirth, May 19 '14 at 06:59
Fun note: sed can take almost anything as a delimiter. In my case, since I was parsing prefix-directories out of paths, I couldn't use `/`, so I used `sed "s#^$prefix##`, instead. (Fragility: filenames can't contain `#`. Since I control the files, we're safe, there.) — Olie, Oct 21 '14 at 21:24
@Olie Filenames can contain *any* character except the slash and null character so unless you're in control you cannot assume a filename not to contain certain characters. — Adrian Frühwirth, Feb 22 '15 at 23:53
Yeah, don't know what I was thinking there. iOS maybe? Dunno. Filenames can certainly contain "#". No idea why I said that. :) — Olie, Feb 23 '15 at 03:11
@Olie: As I understood your original comment, you were saying that the limitation of your choice to use `#` as sed's delimiter meant that you couldn't handle files containing that character. — P Daddy, Mar 04 '15 at 17:03

Vijayendar Gururaja · Answer 3 · 2018-09-08T10:13:59.313

28

$ string="hello-world"
$ prefix="hell"
$ suffix="ld"

$ #remove "hell" from "hello-world" if "hell" is found at the beginning.
$ prefix_removed_string=${string/#$prefix}

$ #remove "ld" from "o-world" if "ld" is found at the end.
$ suffix_removed_String=${prefix_removed_string/%$suffix}
$ echo $suffix_removed_String
o-wor

Notes:

#$prefix : adding # makes sure that substring "hell" is removed only if it is found in beginning. %$suffix : adding % makes sure that substring "ld" is removed only if it is found in end.

Without these, the substrings "hell" and "ld" will get removed everywhere, even it is found in the middle.

edited Sep 08 '18 at 10:13

answered Sep 08 '18 at 08:47

Vijayendar Gururaja

752
1
9
16

1

Thanks for the Notes! qq: in your code example you also have a forward slash `/` right after the string, what is that for? – DiegoSalazar May 15 '19 at 15:29
1

/ separates the current string and the sub string. sub-string here is the suffix in th posted question. – Vijayendar Gururaja May 16 '19 at 05:48

tommy.carstensen · Answer 4 · 2018-01-14T02:39:50.430

23

Do you know the length of your prefix and suffix? In your case:

result=$(echo $string | cut -c5- | rev | cut -c3- | rev)

Or more general:

result=$(echo $string | cut -c$((${#prefix}+1))- | rev | cut -c$((${#suffix}+1))- | rev)

But the solution from Adrian Frühwirth is way cool! I didn't know about that!

edited Jan 14 '18 at 02:39

answered Jan 16 '17 at 20:09

tommy.carstensen

8,962
15
65
108

score 22 · Answer 5 · edited Oct 16 '16 at 09:45

22

I use grep for removing prefixes from paths (which aren't handled well by sed):

echo "$input" | grep -oP "^$prefix\K.*"

\K removes from the match all the characters before it.

edited Oct 16 '16 at 09:45

dipdapdop

126
1
10

answered Sep 16 '16 at 11:28

Vladimir Petrakovich

4,184
1
30
46

2

`grep -P` is a nonstandard extension. More power to you if it's supported on your platform, but this is dubious advice if your code needs to be reasonably portable. – tripleee May 28 '19 at 10:32
@tripleee Indeed. But I think a system with GNU Bash installed also have a grep that supports PCRE. – Vladimir Petrakovich May 29 '19 at 08:49
3

No, MacOS for example has Bash out of the box but not GNU `grep`. Earlier versions actually had the `-P` option from BSD `grep` but they removed it. – tripleee May 29 '19 at 08:56

score 10 · Answer 6 · answered Jul 08 '17 at 23:34

10

Small and universal solution:

expr "$string" : "$prefix\(.*\)$suffix"

answered Jul 08 '17 at 23:34

Tosi Do

113
1
4

1

If you are using Bash, you should probably not be using `expr` at all. It was a *sort of* convenient kitchen sink utility back in the days of the original Bourne shell, but is now way past its best-before date. – tripleee May 28 '19 at 10:34
1

Uh, why? `expr` is old, but never changes, and will probably always be available. As long as you invoke an external binary (as opposed to using BASH expressions), grep, sed or expr are pretty much equivalent (perl / awk would be costlier). – usretc Mar 08 '21 at 06:36
Fantastic, this is simplest, I've ever seen. – OfusJK Jun 08 '23 at 07:28

Martin - マーチン · Answer 7 · 2018-10-19T18:08:20.557

9

Using the =~ operator:

$ string="hello-world"
$ prefix="hell"
$ suffix="ld"
$ [[ "$string" =~ ^$prefix(.*)$suffix$ ]] && echo "${BASH_REMATCH[1]}"
o-wor

edited Oct 19 '18 at 18:08

answered Jan 12 '18 at 05:14

Martin - マーチン

231
4
9

score 7 · Answer 8 · answered Oct 10 '21 at 13:56

NOTE: Not sure if this was possible back in 2013 but it's certainly possible today (10 Oct 2021) so adding another option ...

Since we're dealing with known fixed length strings (prefix and suffix) we can use a bash substring to obtain the desired result with a single operation.

Inputs:

string="hello-world"
prefix="hell"
suffix="ld"

Plan:

bash substring syntax: ${string:<start>:<length>}
skipping over prefix="hell" means our <start> will be 4
<length> will be total length of string (${#string}) minus the lengths of our fixed length strings (4 for hell / 2 for ld)

This gives us:

$ echo "${string:4:(${#string}-4-2)}"
o-wor

NOTE: the parens can be removed and still obtain the same result

If the values of prefix and suffix are unknown, or could vary, we can still use this same operation but replace 4 and 2 with ${#prefix} and ${#suffix}, respectively:

$ echo "${string:${#prefix}:${#string}-${#prefix}-${#suffix}}"
o-wor

Great option! Worth calling out: a key difference between this solution and the others is that if the source string does not start with prefix or end with suffix, then other solutions will not clip anything, where this solution will clip the length of the suffix away. This is not necessarily a problem, just a limitation to be aware of. If you're not sure if the string starts or ends with the prefix/suffix, simply wrap this statement in the appropriate if-statement to check before trimming. — Stobor, Mar 08 '22 at 23:18

score 6 · Answer 9 · answered Feb 08 '17 at 06:47

6

Using @Adrian Frühwirth answer:

function strip {
    local STRING=${1#$"$2"}
    echo ${STRING%$"$2"}
}

use it like this

HELLO=":hello:"
HELLO=$(strip "$HELLO" ":")
echo $HELLO # hello

answered Feb 08 '17 at 06:47

math2001

4,167
24
35

Remove a fixed prefix/suffix from a string in Bash

9 Answers9

Notes:

Linked

Related