6

I would like to use decodeURI or decodeURIComponent as in JavaScript in my Lua (Luvit) project.

JavaScript:

decodeURI('%D0%BF%D1%80%D0%B8%D0%B2%D0%B5%D1%82')
// result: привет

Luvit:

require('querystring').urldecode('%D0%BF%D1%80%D0%B8%D0%B2%D0%B5%D1%82')
-- result: '%D0%BF%D1%80%D0%B8%D0%B2%D0%B5%D1%82'
Yu Hao
  • 119,891
  • 44
  • 235
  • 294
Kosmetika
  • 20,774
  • 37
  • 108
  • 172

3 Answers3

13

This is trivial to do yourself in Lua if you understand the URI percent-encoded format. Each %XX substring represents UTF-8 data encoded with a % prefix and a hexadecimal octet.

local decodeURI
do
    local char, gsub, tonumber = string.char, string.gsub, tonumber
    local function _(hex) return char(tonumber(hex, 16)) end

    function decodeURI(s)
        s = gsub(s, '%%(%x%x)', _)
        return s
    end
end

print(decodeURI('%D0%BF%D1%80%D0%B8%D0%B2%D0%B5%D1%82'))
Community
  • 1
  • 1
Ryan Stein
  • 7,930
  • 3
  • 24
  • 38
3

Here is another take. This code will save you lots of function calls if you have to decode many strings.

local hex = {}
for i = 0, 255 do
    hex[string.format("%02x", i)] = string.char(i)
    hex[string.format("%02X", i)] = string.char(i)
end

local function decodeURI(s)
    return (s:gsub('%%(%x%x)', hex))
end

print(decodeURI('%D0%BF%D1%80%D0%B8%D0%B2%D0%B5%D1%82'))
joe pelletier
  • 390
  • 2
  • 12
lhf
  • 70,581
  • 9
  • 108
  • 149
  • 4
    Interesting. You're caching all the possible percent-encoded hex octets, upper and lowercase, into a lookup table for use by `gsub`. The only thing I notice about this is the corner case where you might get a mixed-case octet such as `Fe`, but I doubt that would happen in practice. – Ryan Stein Dec 05 '13 at 17:58
  • code doesn't work for the first 16 characters `hex[string.format("%02x",i)]=string.char(i)` add a 2 before the x – joe pelletier Nov 27 '19 at 01:34
0

URIs represent ' ' with '+' And other special characters are represented as a percent followed by the 2 digit hex character code '%0A' for '\n' for example

local function decodeCharacter(code)
    -- get the number for the hex code 
    --   then get the character for that number
    return string.char(tonumber(code, 16))
end

function decodeURI(s)
    -- first replace '+' with ' '
    --   then, on the resulting string, decode % encoding
    local str = s:gsub("+", " ")
        :gsub('%%(%x%x)', decodeCharacter)
    return str 
    -- assignment to str removes the second return value of gsub
end

print(decodeURI('he%79+there%21')) -- prints "hey there!"
joe pelletier
  • 390
  • 2
  • 12