0

up-front clarification of exactly what I need: I would like to know - are there any standards or conventions for encoding information like this within a url that I might try to use to read what it says?

I'm sure there are established methods for encoding binary information using non-binary character sets. In this case each location can be occupied by one of 66 different characters, so each "byte" is slightly longer than six bits if you look at it logarithmically.

For example, a link in a mail message I received from the IEEE was surprisingly long:

http://links.mail.spectrum.ieee.org/mpss/c/4wA/vCMaAA/t.1xp/MW0ThjonTUa... ...6Bg3tXEzxb1xlZ4zyB3umN-2BxkLQw6LtGSGH3ghSDqpqiUmxfSVUprd0HnHYipmddjyE-3D

876 characters in toto (I've deleted most of it here) - over half a kilobyte!

The encoding must conform to legal characters within a url, the coding in the url I received uses 66 ASCII characters only - see below.

Probably - of course - it is proprietary and there's little hope, but I'd like to know if there do exist some standards that allow these to be readable if so desired.

My math:

841 * (log(26+26+10+4) / log(2)) / 8 / 1024

~0.62 kb

26 uppers, 26 lowers, 10 digits and 4 punctuation characters for this estimate. ['-', '/', '.', '_']

Since urls have a de facto limit of 2000 characters, a crafted url can carry over a kilobyte of "individualized" information.

Community
  • 1
  • 1
uhoh
  • 3,713
  • 6
  • 42
  • 95
  • When presented with gibberish there is not much once can say. Maybe the cat walked over the keyboard maybe it's the US nukes launchcodes. – PeeHaa Jun 05 '16 at 11:46
  • Also on what is that math based? Isn't it just 850/1024? – PeeHaa Jun 05 '16 at 11:53
  • @PeeHaa It looks like the extra material in the URL is restricted to upper and lower case letters and numbers `(26+26+10)`, and ok maybe a few punctuations. Your math assumes each character is a byte of any combination of 8 bits. Some of those are probably illegal in a URL, so you have to *byte the bullet* and embrace logarithms. (but I fixed the 1000 -> 1024, thanks!) – uhoh Jun 05 '16 at 12:54
  • My assumption is based on my very first comment. – PeeHaa Jun 05 '16 at 13:02

0 Answers0