applicative functor: <*> and partial application, how it works

Question

I am reading the book Programming in Haskell by Graham Hutton and I have some problem to understand how <*> and partial application can be used to parse a string.

I know that pure (+1) <*> Just 2 produces Just 3 because pure (+1) produces Just (+1) and then Just (+1) <*> Just 2 produces Just (2+1) and then Just 3

But in more complex case like this:

-- Define a new type containing a parser function
newtype Parser a = P (String -> [(a,String)])

-- This function apply the parser p on inp
parse :: Parser a -> String -> [(a,String)]
parse (P p) inp = p inp

-- A parser which return a tuple with the first char and the remaining string
item :: Parser Char
item = P (\inp -> case inp of
    []     -> []
    (x:xs) -> [(x,xs)])

-- A parser is a functor
instance Functor Parser where
  fmap g p = P (\inp -> case parse p inp of
    []              -> []
    [(v, out)]      -> [(g v, out)])

-- A parser is also an applicative functor
instance Applicative Parser where
  pure v = P (\inp -> [(v, inp)])
  pg <*> px = P (\inp -> case parse pg inp of
    []              -> []
    [(g, out)]      -> parse (fmap g px) out)

So, when I do:

parse (pure (\x y -> (x,y)) <*> item <*> item) "abc"

The answer is:

[(('a','b'),"c")]

But I don't understand what exactly happens. First:

pure (\x y -> (x,y)) => P (\inp1 -> [(\x y -> (x,y), inp1)])

I have now a parser with one parameter.

Then:

P (\inp1 -> [(\x y -> (x,y), inp1)]) <*> item 
=> P (\inp2 -> case parse (\inp1 -> [(\x y -> (x,y), inp1)]) inp2 of ???

I really don't understand what happens here. Can someone explain, step by step, what's happens now until the end please.

There is a little mistake in the definition of fmap. It is "case parse p inp" and not "case p inp" — yaa, Aug 04 '17 at 20:26
By examining the definition of `<*>`, can you see that first, the left-hand parser (`pg`) is applied to the input, and then the right hand-hand parser (`px`) is applied to the leftover string from applying the left-hand parser? Then, can you see that `item` is a parser which always consumes exactly one character? Then, can you see that `pure f` is a parser that consumes *no* input? I feel these three pieces are enough to put together the answer. — user2407038, Aug 04 '17 at 22:58
if you still have questions about this, if something is unclear, do tell us in the comments. otherwise, if the question have been answered to your satisfaction, it is customary to accept one of the answers that you feel was most helpful to you, to signal that the problem is resolved. if you don't, it means you're still hoping for answers that would clarify it for you. so what is still unclear? — Will Ness, Aug 16 '17 at 09:05
These instances can be derived via `StateT String []` including `Monad`, `MonadState String`, `Alternative`, `MonadPlus`, `MonadFail` etc. — Iceland_jack, Apr 26 '22 at 17:17

Alex · Answer 1 · 2017-08-06T00:28:01.147

Let's evaluate pure (\x y -> (x,y)) <*> item. The second application of <*> will be easy once we've seen the first:

P (\inp1 -> [(\x y -> (x,y), inp1)]) <*> item

We replace the <*> expression with its definition, substituting the expression's operands for the definition's parameters.

P (\inp2 -> case parse P (\inp1 -> [(\x y -> (x,y), inp1)]) inp2 of
    []              -> []
    [(g, out)]      -> parse (fmap g item) out)

Then we do the same for the fmap expression.

P (\inp2 -> case parse P (\inp1 -> [(\x y -> (x,y), inp1)]) inp2 of
    []              -> []
    [(g, out)]      -> parse P (\inp -> case parse item inp of
                           []              -> []
                           [(v, out)]      -> [(g v, out)]) out)

Now we can reduce the first two parse expressions (we'll leave parse item out for later since it's basically primitive).

P (\inp2 -> case [(\x y -> (x,y), inp2)] of
    []              -> []
    [(g, out)]      -> case parse item out of
                           []              -> []
                           [(v, out)]      -> [(g v, out)])

So much for pure (\x y -> (x,y)) <*> item. Since you created the first parser by lifting a binary function of type a -> b -> (a, b), the single application to a parser of type Parser Char represents a parser of type Parser (b -> (Char, b)).

We can run this parser through the parse function with input "abc". Since the parser has type Parser (b -> (Char, b)), this should reduce to a value of type [(b -> (Char, b), String)]. Let's evaluate that expression now.

parse P (\inp2 -> case [(\x y -> (x,y), inp2)] of
    []              -> []
    [(g, out)]      -> case parse item out of
                           []              -> []
                           [(v, out)]      -> [(g v, out)]) "abc"

By the definition of parse this reduces to

case [(\x y -> (x,y), "abc")] of
    []              -> []
    [(g, out)]      -> case parse item out of
                           []              -> []
                           [(v, out)]      -> [(g v, out)]

Clearly, the patterns don't match in the first case, but they do in the second case. We substitute the matches for the patterns in the second expression.

case parse item "abc" of
    []              -> []
    [(v, out)]      -> [((\x y -> (x,y)) v, out)]

Now we finally evaluate that last parse expression. parse item "abc" clearly reduces to [('a', "bc")] from the definition of item.

case [('a', "bc")] of
    []              -> []
    [(v, out)]      -> [((\x y -> (x,y)) v, out)]

Again, the second pattern matches and we do substitution

[((\x y -> (x,y)) 'a', "bc")]

which reduces to

[(\y -> ('a', y), "bc")] :: [(b -> (Char, b), String)] -- the expected type

If you apply this same process to evaluate a second <*> application, and put the result in the parse (result) "abc" expression, you'll see that the expression indeed reduces to[(('a','b'),"c")].

score 7 · Answer 2 · edited Aug 07 '17 at 12:12

What helped me a lot while learning these things was to focus on the types of the values and functions involved. It's all about applying a function to a value (or in your case applying a function to two values).

($)   ::                    (a -> b) ->   a ->   b
fmap  :: Functor     f =>   (a -> b) -> f a -> f b
(<*>) :: Applicative f => f (a -> b) -> f a -> f b

So with a Functor we apply a function on a value inside a "container/context" (i.e. Maybe, List, . .), and with an Applicative the function we want to apply is itself inside a "container/context".

The function you want to apply is (,), and the values you want to apply the function to are inside a container/context (in your case Parser a).

Using pure we lift the function (,) into the same "context/container" our values are in (note, that we can use pure to lift the function into any Applicative (Maybe, List, Parser, . . ):

(,) ::              a -> b -> (a, b)
pure (,) :: Parser (a -> b -> (a, b))

Using <*> we can apply the function (,) that is now inside the Parser context to a value that is also inside the Parser context. One difference to the example you provided with +1 is that (,) has two arguments. Therefore we have to use <*> twice:

(<*>) :: Applicative f => f (a -> b) -> f a -> f b

x :: Parser Int
y :: Parser Char

let p1 = pure (,) <*> x :: Parser (b -> (Int, b))
let v1 =      (,)     1 ::         b -> (Int, b)

let p2 = p1 <*> y  :: Parser (Int, Char)
let v2 = v1    'a' ::        (Int, Char)

We have now created a new parser (p2) that we can use just like any other parser!

. . and then there is more!

Have a look at this convenience function:

(<$>) :: Functor f => (a -> b) -> f a -> f b

<$> is just fmap but you can use it to write the combinators more beautifully:

data User = User {name :: String, year :: Int}
nameParser :: Parser String
yearParser :: Parser Int

let userParser = User <$> nameParser <*> yearParser -- :: Parser User

Ok, this answer got longer than I expected! Well, I hope it helps. Maybe also have a look at Typeclassopedia which I found invaluable while learning Haskell which is an endless beautiful process . . :)

Will Ness · Answer 3 · 2017-08-08T16:11:09.917

TL;DR: When you said you "[now] have a parser with one parameter" inp1, you got confused: inp1 is an input string to a parser, but the function (\x y -> (x,y)) - which is just (,) - is being applied to the output value(s), produced by parsing the input string. The sequence of values produced by your interim parsers is:

-- by (pure (,)):
(,)                     -- a function expecting two arguments

-- by the first <*> combination with (item):
(,) x                   -- a partially applied (,) function expecting one more argument

-- by the final <*> combination with another (item):
((,) x) y == (x,y)      -- the final result, a pair of `Char`s taken off the 
                        -- input string, first (`x`) by an `item`, 
                        -- and the second (`y`) by another `item` parser

Working by equational reasoning can oftentimes be easier:

 -- pseudocode definition of `fmap`:
 parse (fmap g p) inp = case (parse p inp) of    -- g :: a -> b , p :: Parser a
    []              -> []                        --        fmap g p :: Parser b
    [(v, out)]      -> [(g v, out)]              -- v :: a ,           g v :: b

(apparently this assumes any parser can only produce 0 or 1 results, as the case of a longer list isn't handled at all -- which is usually frowned upon, and with good reason);

 -- pseudocode definition of `pure`:
 parse (pure v) inp = [(v, inp)]                 -- v :: a , pure v :: Parser a

(parsing with pure v produces the v without consuming the input);

 -- pseudocode definition of `item`:
 parse (item) inp = case inp of                  -- inp :: ['Char']
    []              -> []
    (x:xs)          -> [(x,xs)]                  -- item :: Parser 'Char'

(parsing with item means taking one Char off the head of the input String, if possible); and,

 -- pseudocode definition of `(<*>)`:
 parse (pg <*> px) inp = case (parse pg inp) of    -- px :: Parser a
    []              -> []
    [(g, out)]      -> parse (fmap g px) out       -- g :: a -> b

(<*> combines two parsers with types of results that fit, producing a new, combined parser which uses the first parse to parse the input, then uses the second parser to parse the leftover string, combining the two results to produce the result of the new, combined parser);

Now, <*> associates to the left, so what you ask about is

parse ( pure (\x y -> (x,y)) <*> item <*> item ) "abc"
= parse ( (pure (,) <*> item1) <*> item2 ) "abc"             -- item_i = item

the rightmost <*> is the topmost, so we expand it first, leaving the nested expression as is for now,

= case (parse (pure (,) <*> item1) "abc") of                 -- by definition of <*>
    []              -> []
    [(g2, out2)]    -> parse (fmap g2 item2) out2
                       = case (parse item out2) of           -- by definition of fmap
                            []              -> []
                            [(v, out)]      -> [(g2 v, out)]
                       = case out2 of                        -- by definition of item
                            []              -> []
                            (y:ys)          -> [(g2 y, ys)]

Similarly, the nested expression is simplified as

parse (pure (,) <*> item1) "abc"
= case (parse (pure (\x y -> (x,y))) "abc") of               -- by definition of <*>
    []              -> []
    [(g1, out1)]    -> parse (fmap g1 item1) out1
                       = case (parse item out1) of ....
                       = case out1 of
                            []              -> []
                            (x:xs)          -> [(g1 x, xs)]
= case [((,), "abc")] of                                     -- by definition of pure
    [(g1, out1)]    -> case out1 of
                            []              -> []
                            (x:xs)          -> [(g1 x, xs)]
= let { out1 = "abc" 
      ; g1   = (,)
      ; (x:xs) = out1
      }
   in  [(g1 x, xs)]
= [( (,) 'a', "bc")]

and thus we get

= case [( (,) 'a', "bc")] of
    [(g2, out2)]    -> case out2 of
                            []              -> []
                            (y:ys)          -> [(g2 y, ys)]

I think you can see now why the result will be [( ((,) 'a') 'b', "c")].

score 2 · Answer 4 · answered Aug 07 '17 at 15:37

First, I want to emphasize one thing. I found that the crux of understanding lies in noticing the separation between the Parser itself and running the parser with parse.

In running the parser you give the Parser and input string to parse and it will give you the list of possible parses. I think that's probably easy to understand.

You will pass parse a Parser, which may be built using glue, <*>. Try to understand that when you pass parse the Parser, a, or the Parser, f <*> a <*> b, you will be passing it the same type of thing, i.e. something equivalent to (String -> [(a,String)]). I think this is probably easy to understand as well, but still it takes a while to "click".

That said, I'll talk a little about the nature of this applicative glue, <*>. An applicative, F a is a computation that yields data of type a. You can think of a term such as

... f <*> g <*> h

as a series of computations which return some data, say a then b then c. In the context of Parser, the computation involve f looking for a in the current string, then passing the remainder of the string to g, etc. If any of the computations/parses fails, then so does the whole term.

Its interesting to note that any applicative can be written with a pure function at the beginning to collect all those emitted values, so we can generally write,

 pure3ArgFunction <$> f <*> g <*> h

I personally find the mental model of emitting and collecting helpful.

So, with that long preamble over, onto the actual explanation. What does

parse (pure (\x y -> (x,y)) <*> item <*> item) "abc"

do? Well, parse (p::Parser (Char,Char) "abc" applies the parser, (which I renamed p) to "abc", yielding [(('a','b'),"c")]. This is a successful parse with the return value of ('a','b') and the leftover string, "c".

Ok, that's not the question though. Why does the parser work this way? Starting with:

.. <*> item <*> item

item takes the next character from the string, yields it as a result and passes the unconsumed input. The next item does the same. The beginning can be rewritten as:

fmap (\x y -> (x,y)) $ item <*> item

or

(\x y -> (x,y)) <$> item <*> item

which is my way of showing that the pure function does not do anything to the input string, it just collects the results. When looked at in this light I think the parser should be easy to understand. Very easy. Too easy. I mean that in all seriousness. Its not that the concept is so hard, but our normal frame of looking at programming is just too foreign for it to make much sense at first.

score 2 · Answer 5 · answered Aug 13 '17 at 15:56

Some people below did great jobs on "step-by-step" guides for you to easily understand the progress of computation to create the final result. So I don't replicate it here.

What I think is that, you really need to deeply understand about Functor and Applicative Functor. Once you understand these topics, the others will be easy as one two three (I means most of them ^^).

So: what is Functor, Applicative Functor and their applications in your problem?

Best tutorials on these:

Chapter 11 of "Learn You a Haskell for a great good": http://learnyouahaskell.com/functors-applicative-functors-and-monoids.
More visual "Functors, Applicatives, And Monads in Pictures": http://adit.io/posts/2013-04-17-functors,_applicatives,_and_monads_in_pictures.html.

First, when you think about Functor, Applicative Functor, think about "values in contexts": the values are important, and the computational contexts are important too. You have to deal with both of them.

The definitions of the types:

    -- Define a new type containing a parser function
    newtype Parser a = P (String -> [(a,String)])

    -- This function apply the parser p on inp
    parse :: Parser a -> String -> [(a,String)]
    parse (P p) inp = p inp

The value here is the value of type a, the first element of the tuple in the list.
The context here is the function, or the eventual value. You have to supply an input to get the final value.

Parser is a function wrapped in a P data constructor. So if you got a value b :: Parser Char, and you want to apply it to some input, you have to unwrap the inner function in b. That's why we have the function parse, it unwraps the inner function and applies it to the input value.

And, if you want to create Parser value, you have to use P data constructor wraps around a function.

Second, Functor: something that can be "mapped" over, specified by the function fmap:

    fmap :: (a -> b) -> f a -> f b

I often call the function g :: (a -> b) is a normal function because as you see no context wraps around it. So, to be able to apply g to f a, we have to extract the a from f a somehow, so that g can be apply to a alone. That "somehow" depends on the specific Functor and is the context you are working in:

    instance Functor Parser where
      fmap g p = P (\inp -> case parse p inp of
        []              -> []
        [(v, out)]      -> [(g v, out)])

g is the function of type (a -> b), p is of type f a.
To unwrap p, to get the value of of context, we have to pass some input value in: parse p inp, then the value is the 1st element of the tuple. Apply g to that value, get a value of type b.
The result of fmap is of type f b, so we have to wrap all the result in the same context, that why we have: fmap g p = P (\inp -> ...).

At this time, you might be wonder you could have an implementation of fmap in which the result, instead of [(g v, out)], is [(g v, inp)]. And the answer is Yes. You can implement fmap in any way you like, but the important thing is to be an appropriate Functor, the implementation must obey Functor laws. The laws are they way we deriving the implementation of those functions (http://mvanier.livejournal.com/4586.html). The implementation must satisfy at least 2 Functor laws:

fmap id = id.
fmap (f . g) = fmap f . fmap g.

fmap is often written as infix operator: <$>. When you see this, look at the 2nd operand to determine which Functor you are working with.

Third, Applicative Functor: you apply a wrapped function to a wrapped value to get another wrapped value:

    <*> :: f (a -> b) -> f a ->  f b

Unwrap the inner function.
Unwrap 1st value.
Apply the function and wrap the result.

In your case:

    instance Applicative Parser where
      pure v = P (\inp -> [(v, inp)])
      pg <*> px = P (\inp -> case parse pg inp of
        []              -> []
        [(g, out)]      -> parse (fmap g px) out)

pg is of type f (a -> b), px is of type f a.
Unwrap g from pg by parse pg inp, g is the 1st of the tuple.
Unwrap px and apply g to the value by using fmap g px. Attention, the result function only applies to out, in some case that is "bc" not "abc".
Wrap the whole result: P (\inp -> ...).

Like Functor, an implementation of Applicative Functor must obey Applicative Functor laws (in the tutorials above).

Fourth, apply to your problem:

    parse (pure (\x y -> (x,y)) <*> item <*> item) "abc"
           |         f1        |    |f2|     |f3|

Unwrap f1 <*> f2 by passing "abc" to it:
- Unwrap f1 by passing "abc" to it, we get [(g, "abc")].
- Then fmap g on f2 and passing out="abc" to it:
  - Unwrap f2 get [('a', "bc")].
  - Apply g on 'a' get a result: [(\y -> ('a', y), "bc")].
Then fmap 1st element of the result on f3 and passing out="bc" to it:
- Unwrap f3 get [('b', "c")].
- Apply the function on 'b' get final result: [(('a', 'b'), "c")].

In conclusion:

Take some time for the ideas to "dive" into you. Especially, the laws derives the implementations.
Next time, design your data structure to easier understand.
Haskell is one of my favorite languages and I thing it will be yours soon, so be patient, it needs a learning curve and then you go!

Happy Haskell hacking!

Redu · Answer 6 · 2017-08-10T18:18:10.070

0

Hmm I am not experienced with Haskell but my attempt on generating Functor and Applicative instances of the Parser type would be as follows;

-- Define a new type containing a parser function
newtype Parser a = P (String -> [(a,String)])

-- This function apply the parser p on inp
parse :: Parser a -> String -> [(a,String)]
parse (P p) inp = p inp

-- A parser which return a tuple with the first char and the remaining string
item :: Parser Char
item = P (\inp -> case inp of
    []     -> []
    (x:xs) -> [(x,xs)])

-- A parser is a functor
instance Functor Parser where
  fmap g (P f) = P (\str -> map (\(x,y) -> (g x, y)) $ f str)

-- A parser is also an applicative functor
instance Applicative Parser where
pure v = P (\str -> [(v, str)])

(P g) <*> (P f) = P (\str -> [(g' v, s) | (g',s) <- g str, (v,_) <- f str])

(P g) <*> (P f) = P (\str -> f str >>= \(v,s1) -> g s1 >>= \(g',s2) -> [(g' v,s2)])

(10x very much for the helping of @Will Ness on <*>)

Accordingly...

*Main> parse (P (\s -> [((+3), s)]) <*> pure 2) "test"
[(5,"test")]

*Main> parse (P (\s -> [((,), s ++ " altered")]) <*> pure 2 <*> pure 4) "test"
[((2,4),"test altered")]

edited Aug 10 '17 at 18:18

answered Aug 09 '17 at 08:39

Redu

25,060
6
56
76

I think the `q` in `p <*> q` is supposed to be parsing the leftover string, not the original. It is likely the definitions in the question are *simplified* for the sake of learning, allowing for only 0 or 1 outcomes instead of any number of them as the real thing would do. – Will Ness Aug 09 '17 at 20:48
@Will Ness Yes you are right. I tried to correct it accordingly so that now it takes the `String` portion into account as well. – Redu Aug 10 '17 at 09:27
1

You just needed to change your `(P g) <*> (P f) = P (\str -> [(g' v, s) | (g',s) <- g str, (v,_) <- f str])` to `(P g) <*> (P f) = P (\str -> [(g' v, s2) | (g',s) <- g str, (v,s2) <- f s])`. I personally dislike the primed names very much (but YMMV), so I'd write it as `(P pg) <*> (P pv) = P (\str -> [(g v, s2) | (g,s1) <- pg str, (v,s2) <- pv s1])`. or maybe `= P \str -> pg str >>= \(g,s1) -> pv s1 >>= \(v,s2) -> [(g v,s2)]` which has a more linear flow to it. – Will Ness Aug 10 '17 at 10:25
@Will Ness Thanks for reminding the monadic concise way.. However i am a little confused here about applying str to `pv` or `pg` first. Is it up to how one would like to implement the `<*>` operator for this applicative functor type? I feel like i have to do like `(P pg) <*> (P pv) = P \str -> pv str >>= \(v,s1) -> pg s1 >>= \(g,s2) -> [(g v,s2)]` – Redu Aug 10 '17 at 11:11
I think it is usually done left-to-right. Imagine `p = pure (,) <*> char <*> int` (assuming `char` is a parser producing a `Char` and `int` producing an `Int`). With right-to-left order it would process a string like `" 7 a"` and turn it into `('a', 7)`. It feels unnatural. Let-to-right order of processing would produce `('a', 7)` from `" a 7"`, which feels right. -- BTW, aren't the mnemonic var names much easier to deal with? :) – Will Ness Aug 10 '17 at 17:43
@Will Ness Yes sure.. `<*>` operators goes left associative and i guess my dilemma doesn't have an effect on that. What i mean is, this type is in fact a function and it seems to me such that, i have to apply the function inside the applicative functor (left of `<*>`) to the result of the right side like `g . f`. But as i said i am not very clear here. I will change my answer reflecting your monadic concise way though. – Redu Aug 10 '17 at 18:04
1

yeah this is not about the associativity. either `(p <*> q) <*> r` or `p <*> (q <*> r)` will parse the input string in the same order, according to the definition of `<*>`. With your definition, `parse (pure (,) <*> item <*> item) "123"` produces `[(('2','1'),"3")]`. `[(('1','2'),"3")]` seems more appropriate. – Will Ness Aug 10 '17 at 18:39

applicative functor: <*> and partial application, how it works

6 Answers6

Linked