Cross Platform Support for sprintf's Format '-Flag

Question

The Single UNIX Specification Version 2 specifies the sprintf's format '-flag behavior as:

The integer portion of the result of a decimal conversion (%i, %d, %u, %f, %g or %G) will be formatted with thousands' grouping characters^[1]

I can't find the format '-flag in the c or the c++ specifications. g++ even warns:

ISO C++11 does not support the ' printf flag

The flag is not recognized to even warn about in Visual C; printf("%'d", foo) outputs:

'd

I'd like to be able to write C-standard compliant code that uses the behavior of the format '-flag. Thus the answer I'm looking for one of the following:

C-Standard specification of the format '-flag
A cross platform compatible extrapolation of gcc's format '-flag
Demonstration that a cross platform extrapolation is not possible

Not clear what you are asking. "I can't find the format '-flag in the c or the c++ specifications.". You have your answer already. What is your problem? — too honest for this site, Jun 13 '17 at 14:06
Oh man, I thought it was just C++ that had the mandatory downvote on all questions. I guess C also? Or was there an actual reason for the downvote on this carefully researched question? — Jonathan Mee, Jun 13 '17 at 14:06
@Olaf If you're agreeing with me that it is not in the C-standard, and that I didn't just over look the specification, I'm looking for a way to replicate libc's behavior in a cross platform manner. If I just missed it in the C-standard and this is an instance of Microsoft failing to fully implement the standard, I'd be OK using the format `'`-flag. — Jonathan Mee, Jun 13 '17 at 14:09
As you mention the specifications, I have to assume you mean the standards, not whatever Microsoft says. Last time I checked, MS was not the one defining the C specifications (`printf` etc. are not C++!). If that assumption is wrong, you have to be more specific. For the rest: we are not a coding service. If you want such a feater, the solution is obvious. — too honest for this site, Jun 13 '17 at 14:16
Possible duplicate of [How to format a number from 1123456789 to 1,123,456,789 in C?](https://stackoverflow.com/questions/1449805/how-to-format-a-number-from-1123456789-to-1-123-456-789-in-c) — Andre Kampling, Jun 13 '17 at 14:24
@Olaf I'm not certain that I understand your last comment. Is there an action that I need to take here? I understand from your comments that an answer of type **1** is out. I'll need to hope that either someone can recommend a workaround for **2**, or **3** explain why that's not possible. — Jonathan Mee, Jun 13 '17 at 14:25
@AndreKampling I looked through these answers before asking, but on second glance I noticed [Jerry Coffin's solution](https://stackoverflow.com/a/5346394/2642059) it looks like it may provide a cross platform solution that correctly implements locale based numeric separation... I'm inspecting now, thanks. — Jonathan Mee, Jun 13 '17 at 14:34
There isn't a cross-platform way to use the POSIX extension to the C standard. It isn't necessarily universally implemented on POSIX systems, let alone elsewhere. If you need the functionality, you'll have to implement it. The information you need is available in the `struct lconv` available from standard C [`localeconv()`](http://pubs.opengroup.org/onlinepubs/9699919799/functions/localeconv.html). Decide which of many functions you will use to do the formatting, but portability dictates you use a custom function to do the formatting, implemented as needed. — Jonathan Leffler, Jun 13 '17 at 14:35
@JonathanLeffler Thanks, this gives me a great starting point. I'll look at this and see what I can cook up... — Jonathan Mee, Jun 13 '17 at 14:38
I still can't see your problem. What do you do if there is no standard function for something you need? Obviously you write it on your own. That's actually what programming is about! Sorry, if I missed something more obvious, but from your reps, I had to assume it was clear which action you have to take. — too honest for this site, Jun 13 '17 at 14:48
@Olaf No, no you're not wrong. I wanted in **1** to make sure that writing this code was required by the standard, **3** to make sure that writing this code was possible, and **2** to ask if there was an effective workaround that I had missed. I have to admit, the downvote and closevote is frustrating. This is clearly a better question than possible dupe, which asked for a solution without even understanding the problem. But hey that was a more positive time in the life of http://stackoverflow.com I get it. — Jonathan Mee, Jun 13 '17 at 15:03

Jerry Coffin · Accepted Answer · 2017-06-14T15:46:33.823

Standard C doesn't provide the formatting capability directly, but it does provide the ability to retrieve a...specification of what the formatting should be, on a locale-specific basis. So, it's up to you to retrieve the locale's specification of proper formatting, then put it to use to format your data (but even then, it's somewhat non-trivial). For example, here's a version for formatting long data:

#include <stdlib.h>
#include <locale.h>
#include <string.h>
#include <limits.h>

static int next_group(char const **grouping) {
    if ((*grouping)[1] == CHAR_MAX)
        return 0;
    if ((*grouping)[1] != '\0')
        ++*grouping;
    return **grouping;
}

size_t commafmt(char   *buf,            /* Buffer for formatted string  */
                int     bufsize,        /* Size of buffer               */
                long    N)              /* Number to convert            */
{
    int i;
    int len = 1;
    int posn = 1;
    int sign = 1;
    char *ptr = buf + bufsize - 1;

    struct lconv *fmt_info = localeconv();
    char const *tsep = fmt_info->thousands_sep;
    char const *group = fmt_info->grouping;
    // char const *neg = fmt_info->negative_sign;
    size_t sep_len = strlen(tsep);
    size_t group_len = strlen(group);
    // size_t neg_len = strlen(neg);
    int places = (int)*group;

    if (bufsize < 2)
    {
ABORT:
        *buf = '\0';
        return 0;
    }

    *ptr-- = '\0';
    --bufsize;
    if (N < 0L)
    {
        sign = -1;
        N = -N;
    }

    for ( ; len <= bufsize; ++len, ++posn)
    {
        *ptr-- = (char)((N % 10L) + '0');
        if (0L == (N /= 10L))
            break;
        if (places && (0 == (posn % places)))
        {
            places = next_group(&group);
            for (int i=sep_len; i>0; i--) {
                *ptr-- = tsep[i-1];
                if (++len >= bufsize)
                    goto ABORT;
            }
        }
        if (len >= bufsize)
            goto ABORT;
    }

    if (sign < 0)
    {
        if (len >= bufsize)
            goto ABORT;
        *ptr-- = '-';
        ++len;
    }

    memmove(buf, ++ptr, len + 1);
    return (size_t)len;
}

#ifdef TEST
#include <stdio.h>

#define elements(x) (sizeof(x)/sizeof(x[0]))

void show(long i) {
    char buffer[32];

    commafmt(buffer, sizeof(buffer), i);
    printf("%s\n", buffer);
    commafmt(buffer, sizeof(buffer), -i);
    printf("%s\n", buffer);
}


int main() {

    long inputs[] = {1, 12, 123, 1234, 12345, 123456, 1234567, 12345678 };

    for (int i=0; i<elements(inputs); i++) {
        setlocale(LC_ALL, "");
        show(inputs[i]);
    }
    return 0;
}

#endif

This does have a bug (but one I'd consider fairly minor). On two's complement hardware, it won't convert the most-negative number correctly, because it attempts to convert a negative number to its equivalent positive number with N = -N; In two's complement, the maximally negative number doesn't have a corresponding positive number, unless you promote it to a larger type. One way to get around this is by promoting the number the corresponding unsigned type (but it's is somewhat non-trivial).

Implementing the same for other integer types is fairly trivial. For floating point types is a bit more work. Converting floating point types (even without formatting) correctly is enough more work that for them, I'd at least consider using something like sprintf to do the conversion, then inserting the formatting into the string that produced.

Another note from [`lconv`](http://en.cppreference.com/w/cpp/locale/lconv) it appears that the `negative_sign` is "a string used to indicate negative monetary quantity" so I think just `'-'` should be used for non-monetary integers. — Jonathan Mee, Jun 14 '17 at 15:32
@JonathanMee: Hm...quite right. Not sure how I missed that (or maybe I just didn't look). — Jerry Coffin, Jun 14 '17 at 15:43
So I ported the concepts in this answer to C++ and linked this question. Hopefully this meets with your approval. https://stackoverflow.com/a/44549637/2642059 — Jonathan Mee, Jun 14 '17 at 16:06
@JonathanMee: It doesn't bother me *but* it seems a lot less useful in C++. C++ has required that iostreams be locale-aware for a long time, so a stream will punctuate a number according to the locale with which it's been imbued. `std::cout.imbue(std::locale("")); std::cout << "1234567.89;` will print (on my machine) `1,234,567.89`, but if (since I'm using the anonymous locale) that would vary depending on how your environment was configured. At least in my experience, all the typical compilers/libraries have supported this for a long time. — Jerry Coffin, Jun 14 '17 at 17:11
Yeah I agree. The question was actually spawned by the requirement that I do this without using `stringstream`. Thus the need to jump through all these hoops. — Jonathan Mee, Jun 14 '17 at 17:15
@JonathanMee: Ah, I see--understandable, I guess--a stringstream does impose quite a bit of overhead if this is all you really want. — Jerry Coffin, Jun 14 '17 at 17:40

Cross Platform Support for sprintf's Format '-Flag

1 Answers1

Linked