I have two UTF-8 text files:
repr(file1.txt):
\nSTATEMENT OF WORK\n\n\nSTATEMENT OF WORK NO. 7\nEffective Date: February 15, 2015
repr(file2.txt):
RENEWAL/AMENDMENT\n\nTHIS agreement is entered as of July 25, 2014. b
Their respective Brat annotation files have the following annotation:
file1.ann:
T1 date 61 78 February 15, 2015
file2.ann:
T1 date 53 67 July 25, 2014.
But when I use python to retrieve the characters from .txt using above offsets, I get:
file1.read()[61:78]:
February 15, 2015
file2.read()[53:67]:
ly 25, 2014. b
Why does my offsetting work in the first case but not the second case?