Checking that Ragel matched the entire input

Question

Are there better ways to require that Ragel consume all of the input? Here is what I'm using now:

=begin
%%{
  machine my_lexer;
  # ...
  # extract tokens and store into `tokens`
  # ...
}%%
=end

class MyLexer

  %% write data;

  def self.run(string)
    data = string.unpack("c*")
    eof = data.length
    tokens = []
    %% write init;
    %% write exec;
    data.length == p ? tokens : nil
  end

end

Most of the above is boilerplate, except for the data.length == p test. It works -- except that it doesn't verify that the lexer ended in a final state. So, I have test cases that give me tokens back even if the entire input was not successfully parsed.

Is there a better way?

(Testing for the final state directly might work better. I'm looking into how to do that. Ideas?)

score 3 · Answer 1 · edited Sep 19 '15 at 18:06

3

You can handle errors using either global or local error actions.

For global error actions you can use this syntax:

$!action

For local error actions, which are local to your machine definition, you can use this syntax:

$^action

If you put a flag on your action, you can check the flag to detect an error.

edited Sep 19 '15 at 18:06

Martin Atkins

62,420
8
120
138

answered Jun 06 '13 at 10:10

bdnt

71
5

score 1 · Answer 2 · answered Aug 23 '12 at 07:14

1

I'm only starting out with ragel, but it's possible you want to look at EOF actions or Error actions, executed respectively when the input ends or when the next character satisfies no transition from the current state.

answered Aug 23 '12 at 07:14

joeln

3,563
25
31

Checking that Ragel matched the entire input

2 Answers2