1

Is there a (client-side) JavaScript library for doing Readability/Instapaper-style uncluttering of HTML? ie Takes HTML for a full web page and unclutters it by removing headings, sidebars etc, to just leave the core article, using some heuristics. I'm looking for something like clean(htmlPageString).

I think readability itself is implemented in JavaScript, but there's no library for it.

mahemoff
  • 44,526
  • 36
  • 160
  • 222
  • 3
    you may want to re-word this entire question. I first assumed you were talking about code formatting (tidy) but in fact you want to have a 'print/read' view to all of your pages? but why do this in javascript - why not in a MVC 'View' and pretty it up with a lightbox or something . – rlemon Dec 22 '11 at 21:03

3 Answers3

4

I think this JavaScript library is along the lines of what you are looking for. It is a standalone version of the library used to generate Firefox's Reader View.

https://github.com/mozilla/readability

tlattimore
  • 41
  • 2
1

Here is readability, encapsulated in a node module: node-readability. It works fine, but not in every case. I am currently looking for alternatives.

bitstream
  • 1,108
  • 2
  • 19
  • 30
0

As per my reading, its built in javascript+JSON that stores the reading list of users in a queue in the form of JSON. It keeps track of the webpage requests along with time,date and some other specific information.

Umesh Patil
  • 10,475
  • 16
  • 52
  • 80