I am building a custom dataset to train and test the attention_ocr and street models, but I am unclear on what the function encode_utf8_string described here does. What is the purpose of the padding, and how is the null character used when building the padded and unpadded char id arrays?
Given the following charset, a length of 5, and a null char id of 3:
{'a': 0, 'b': 1, 'c': 2}
Are these the correct padded and unpadded results (note the space in the second text)?
'bc': padded: [1,2,3,3,3], unpadded: [1,2]
'a a': padded: [0,3,0,3,3], unpadded: [0,3,0]
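
To make the question concrete, here is a minimal sketch of how I currently understand the encoding. It is not the library's code: the name encode_sketch is hypothetical, and mapping characters that are missing from the charset (such as the space) to the null id is my own assumption. A plain dictionary lookup would instead raise a KeyError for the space, which is part of what I am unsure about.

```python
def encode_sketch(text, length, charset, null_char_id):
    """Hypothetical re-implementation used only to illustrate the question.

    Returns (padded, unpadded) lists of char ids.
    Assumption: characters not in `charset` map to `null_char_id`.
    """
    if len(text) > length:
        raise ValueError("text is longer than the fixed length")

    # Unpadded: one id per character of the text.
    unpadded = [charset.get(ch, null_char_id) for ch in text]

    # Padded: same ids, then filled with the null id up to `length`.
    padded = unpadded + [null_char_id] * (length - len(unpadded))
    return padded, unpadded


charset = {'a': 0, 'b': 1, 'c': 2}
print(encode_sketch('bc', 5, charset, 3))
print(encode_sketch('a a', 5, charset, 3))
```

Under these assumptions the sketch reproduces the two results listed above, so the question is really whether the actual function treats the space and the trailing padding the same way.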