Questions about Robots.txt regarding asterisk and forward slash

Question

I have few questions regarding robots.txt

If I have following line in robots.txt

Disallow: /catalog/category/view/id/6

will this block the url http://example.com/catalog/category/view/id/61 as well?
If I have

Disallow: /*education

will this block the url http://example.com/some/uri/education as well as http://example.com/some/uri/education/another/uri
what makes the difference whether I have / at the end of each rule?
Is * necessary in Disallow: /disallowme* if I want to disallow all url that starts with http://example.com/disallowme

score 0 · Answer 1 · edited May 23 '17 at 11:50

(Q1)

Disallow: /catalog/category/view/id/6

will block any URL whose path starts with /catalog/category/view/id/6. So yes, it will also block http://example.com/catalog/category/view/id/61.

(Q3) A slash is just another character, nothing special about it.

(Q2, Q4) The * character has no special meaning in the original robots.txt specification, it’s just another character, like / and a. Some parsers (for example, Google’s) use * for pattern matching. You’d have to check their documentation about it (each parser might implement this differently, as there is no specification about it).

So parsers that follow the original specification will not block http://example.com/disallowme when following Disallow: /disallowme*. They would block, for example: http://example.com/disallowme*foo. As explained above, whatever you specify in Disallow is always an URL path prefix.

Questions about Robots.txt regarding asterisk and forward slash

1 Answers1