So far this regex expression found here works nice in almost every contest i'm working with.
(?<!\w\.\w.)(?<![A-Z][a-z]\.)(?<=[.?])\s
infect it's able to split properly even sentences like this one:
Mr. Daniel, who love cakes, is taking a trip to Nevada. Not gonna lie, i would go as well.
into:
Mr. Daniel, who love cakes, is taking a trip to Nevada.
Not gonna lie, i would go as well.
Unfortunately it doenst cover a case. if i've, for example, a sentence like this:
C. Daniel, who love cakes, is taking a trip to Nevada. Not gonna lie, i would go as well.
this regex epression makes the split in three sub sentences:
C.
Daniel, who love cakes, is taking a trip to Nevada.
Not gonna lie, i would go as well.
Instead of:
C. Daniel, who love cakes, is taking a trip to Nevada.
Not gonna lie, i would go as well.
What we're missing is this specific case is when we find a match that has a single Uppercase Chart followed by a dot (.) we dont have to split.
I still dont know how to proper use regex so if you can tell me also why your answer would work will be much appreciate