view uptools/libcoding/utf8_decode.c @ 924:d452188587b4
rvinterf: begin change to backslash escape output format
Right now throughout the rvinterf suite, any time we emit output that
is expected to be ASCII, but may contain non-printable garbage, we use
'cat -v' form of garbage character representation. Unfortunately, this
transformation is lossy (can't be reversed 100% reliably in the user's
wetware), hence we would like to migrate to C-style backslash escapes,
including doubling of any already-present backslashes - this escape
mechanism is lossless. Begin this change by converting the output
of RV and L1 traces in rvinterf and rvtdump.
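
The escape scheme described above can be sketched as follows. This is only an illustration under stated assumptions, not the code used in rvinterf: the function name `escape_bytes`, the fixed two-digit `\xHH` form for non-printable bytes, and the buffer-sizing rule are all hypothetical choices made for this example. It shows why the scheme is lossless: every backslash in the input is doubled, so a backslash in the output always starts an escape.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/*
 * Hypothetical helper (not taken from rvinterf): write a backslash-escaped
 * rendering of the len bytes at src into dst and return the number of
 * characters written, not counting the terminating NUL.
 * Assumption: dst has room for at least 4*len + 1 bytes.
 */
size_t escape_bytes(const unsigned char *src, size_t len, char *dst)
{
	char *op = dst;
	size_t i;

	assert(src != NULL && dst != NULL);
	for (i = 0; i < len; i++) {
		unsigned c = src[i];

		if (c == '\\') {
			/* double any backslash already present in the input */
			*op++ = '\\';
			*op++ = '\\';
		} else if (c >= 0x20 && c <= 0x7E) {
			/* printable ASCII passes through unchanged */
			*op++ = (char) c;
		} else {
			/* everything else becomes \xHH, always two hex digits */
			op += sprintf(op, "\\x%02X", c);
		}
	}
	*op = '\0';
	return (size_t)(op - dst);
}
```

With this sketch, the four input bytes `a`, `\`, `b`, 0x01 come out as the eight characters `a\\b\x01`. A 'cat -v' style rendering would show the last byte as `^A`, which cannot be told apart from a literal caret followed by `A` in the input.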
author | Mychaela Falconia <falcon@freecalypso.org> |
---|---|
date | Tue, 23 May 2023 03:10:50 +0000 |
parents | ec0d6d58e043 |
children |
/*
 * This library module implements a function that converts text input
 * from UTF-8 to ISO 8859-1, rejecting any input Unicode characters
 * that aren't in the 8859-1 range.  The conversion is done in place.
 */

#include <sys/types.h>

utf8_to_latin1(buf)
	u_char *buf;
{
	u_char *ip = buf, *op = buf;
	int c, c2;

	while (c = *ip++) {
		/* plain ASCII is copied through unchanged */
		if (c < 0x80) {
			*op++ = c;
			continue;
		}
		/* only lead bytes 0xC2 and 0xC3 encode U+0080 through U+00FF */
		if (c != 0xC2 && c != 0xC3)
			return(-1);
		/* the second byte must be a continuation byte */
		c2 = *ip++;
		if (c2 < 0x80 || c2 > 0xBF)
			return(-1);
		*op++ = ((c & 3) << 6) | (c2 & 0x3F);
	}
	*op = '\0';
	return(0);
}
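
A minimal usage sketch follows. To keep the block compilable on its own, the function is restated with an ANSI prototype and `unsigned char` in place of `u_char`. That restatement, and the helper `convert_copy`, are assumptions of this example and are not part of the library. The helper copies its input into a scratch buffer first, because the conversion overwrites its argument.

```c
#include <assert.h>
#include <string.h>

/* Restatement of utf8_to_latin1() with an ANSI prototype (same logic). */
int utf8_to_latin1(unsigned char *buf)
{
	unsigned char *ip = buf, *op = buf;
	int c, c2;

	while ((c = *ip++) != 0) {
		if (c < 0x80) {
			*op++ = (unsigned char) c;
			continue;
		}
		if (c != 0xC2 && c != 0xC3)
			return -1;
		c2 = *ip++;
		if (c2 < 0x80 || c2 > 0xBF)
			return -1;
		*op++ = (unsigned char) (((c & 3) << 6) | (c2 & 0x3F));
	}
	*op = '\0';
	return 0;
}

/*
 * Hypothetical demo helper: copy a NUL-terminated UTF-8 string into out
 * and convert the copy in place.  Returns -1 if out is too small or the
 * input holds anything outside the ISO 8859-1 range, 0 on success.
 */
int convert_copy(const char *utf8, unsigned char *out, size_t outsize)
{
	size_t n;

	assert(utf8 != NULL && out != NULL);
	n = strlen(utf8);
	if (n + 1 > outsize)
		return -1;
	memcpy(out, utf8, n + 1);
	return utf8_to_latin1(out);
}
```

For example, the five bytes `c a f 0xC3 0xA9` (UTF-8 for "café") convert to the four bytes `c a f 0xE9`, while the three-byte encoding of the euro sign (`0xE2 0x82 0xAC`) is rejected with -1. When the function returns -1, the buffer may already be partly overwritten, so a caller that needs the original text on failure should convert a copy, as the helper does.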