13

I'm looking for a simple way to tokenize string input without using non default libraries such as Boost, etc.

For example, if the user enters forty_five, I would like to seperate forty and five using the _ as the delimiter.

Edge
  • 2,456
  • 6
  • 32
  • 57

3 Answers3

26

To convert a string to a vector of tokens (thread safe):

std::vector<std::string> inline StringSplit(const std::string &source, const char *delimiter = " ", bool keepEmpty = false)
{
    std::vector<std::string> results;

    size_t prev = 0;
    size_t next = 0;

    while ((next = source.find_first_of(delimiter, prev)) != std::string::npos)
    {
        if (keepEmpty || (next - prev != 0))
        {
            results.push_back(source.substr(prev, next - prev));
        }
        prev = next + 1;
    }

    if (prev < source.size())
    {
        results.push_back(source.substr(prev));
    }

    return results;
}
Mahmoud Al-Qudsi
  • 28,357
  • 12
  • 85
  • 125
1

You can use the strtok_r function, but read the man pages carefully so you understand how it maintains state.

Adam Liss
  • 47,594
  • 12
  • 108
  • 150
1

Look at this tutorial, which is by far the best tutorial on tokenization that I have found so far. It covers the best practices in the implementation of different methods that include using getline() and find_first_of() in C++ std, and strtok() in C.

Mahmoud Al-Qudsi
  • 28,357
  • 12
  • 85
  • 125
Jaime Ivan Cervantes
  • 3,579
  • 1
  • 40
  • 38