The File I am trying to process looks like this:
...
...
15 Apr 2014 22:05 - id: content
15 Apr 2014 22:09 - id: content
15 Apr 2014 22:09 - id: content
with new line
16 Apr 2014 06:56 - id: content
with new line
with new line
16 Apr 2014 06:57 - id: content
16 Apr 2014 06:58 - id: content
...
...
the regex I have come up with is this: \d{1,}[ ][A-Z][a-z]{2}[ ](?:\d{4}[ ]\d{2}[:]\d{2}|\d{2}[:]\d{2}).*
which results in:
This is almost right i just need to include newline characters, but if i include this [\s\S]*
instead of .*
only one match is returned.
What i would like to extract is a set of substrings where each string starts at the data sequence and ends at the next date sequence like so:
...
...
15 Apr 2014 22:05 - id: content //substring 1
15 Apr 2014 22:09 - id: content //substring 2
15 Apr 2014 22:09 - id: content //substring 3
with new line //substring 3
16 Apr 2014 06:56 - id: content //substring 4
with new line //substring 4
with new line //substring 4
16 Apr 2014 06:57 - id: content //substring 5
16 Apr 2014 06:58 - id: content //substring 6
...
...
Any help to what im missing?