2

As there is a collocation for numbers in nltk such as ('RS', '##number##')

I'm wondering if there is such a specifier for Roman numerals which I want to use for something like this: ('volume', '##roman number##')

If there is no way to do such a thing, it would be helpful if there was a viable way to convert Roman numerals to decimal for sentence segmentation, and then after segmentation convert the text back to the original Roman numerals.

Douglas
  • 1,304
  • 10
  • 26
eightnoteight
  • 234
  • 2
  • 11

0 Answers0