Select siblings but without something in between

Question

Here's one tricky scenario:

#target ~ p {
  background: green;
  color: white;
}

<h1 id="target">Title</h1>
<span>Some description</span>
<p>Yes</p>
<p>Yes</p>

<h1>Another Title</h1>
<span>Some description</span>
<p>No</p>
<p>No</p>

Does xpath allow selecting siblings but stop at a point? I would like to select the two <p>s under the first <h1> but not the ones under the second <h1>. Modifying the HTML is not possible as I am doing some web scraping and I am looking for a quick and dirty way of extracting data from paragraphs under a certain heading:

paragraphs = target.select("~ p")

Your query is not clear "I would like to select the two `
`s under the first `
` but ones under the second `
`". — chriskirknielsen, Feb 16 '18 at 14:40
@chriskirknielsen Typo. It should say "but not the ones". Basically, only the ones that say "Yes" in the example should be selected. — Derek 朕會功夫, Feb 16 '18 at 14:41
are you sure it contains no parent div, so it could be easier to target? — Ylama, Feb 16 '18 at 14:49
Possible duplicate of [Is there a "previous sibling" CSS selector?](https://stackoverflow.com/questions/1817792/is-there-a-previous-sibling-css-selector) — she hates me, Feb 28 '18 at 16:03

Stickers · Accepted Answer · 2018-02-16T15:04:12.250

Try this:

#target ~ p:not(:nth-last-of-type(-n+2)) {
  background: green;
  color: white;
}

<h1 id="target">Title</h1>
<span>Some description</span>
<p>Yes</p>
<p>Yes</p>

<h1>Another Title</h1>
<span>Some description</span>
<p>No</p>
<p>No</p>

Or

#target + span + p,
#target + span + p + p {
  background: green;
  color: white;
}

<h1 id="target">Title</h1>
<span>Some description</span>
<p>Yes</p>
<p>Yes</p>

<h1>Another Title</h1>
<span>Some description</span>
<p>No</p>
<p>No</p>

Or

#target ~ p:nth-of-type(1),
#target ~ p:nth-of-type(2) {
  background: green;
  color: white;
}

<h1 id="target">Title</h1>
<span>Some description</span>
<p>Yes</p>
<p>Yes</p>

<h1>Another Title</h1>
<span>Some description</span>
<p>No</p>
<p>No</p>

score 1 · Answer 2 · answered Feb 16 '18 at 14:50

1

You can use below XPath expression to get required paragraphs:

//h1/following-sibling::p[count(preceding-sibling::h1)=1]

If you know text of each h1, then you also can try:

//h1[.="Title"]/following-sibling::p[following-sibling::h1[.="Another Title"]]

answered Feb 16 '18 at 14:50

Andersson

51,635
17
77
129

Select siblings but without something in between

` but ones under the second `

`".

2 Answers2