Questions tagged [html-to-text]
23 questions
14
votes
2 answers
Convert HTML to a proper plain text?
is there any way I can convert HTML into proper plain text? I tried everything from raw to sanitize and even the Mail gem with it's text_part method which is supposed to do exactly that but doesn't work for me.
My best shot so far was…

Cojones
- 2,930
- 4
- 29
- 41
5
votes
3 answers
Get the specific content from html and print to txt file in Perl
I have a html file which contains paper ID s and papers. So i want to print these ID s and papers sequencely. Here are the html file and example output.

Gunner1905
- 148
- 11
3
votes
2 answers
DOT NET code to convert HTML to text
I'm creating a little algo to fetch text from web sites.. then find answers (will post the script once completed).
To do that, I need to convert all HTML code within and into plain readable english text.
I've manually removed all html tags, but…

Arjun
- 961
- 2
- 11
- 18
2
votes
1 answer
Instead of rendering tables and other html tags in docx these are saved as plain text using docx4j-ImportXHTML
I want to render html code to docx. Instead of rendering html(i.e. tables in tabular format) it simply writes html code in it as plain text. I am using docx4j-ImportXHTML jar. I used the code from here and modified it to save in a file.
What am I…

nanosoft
- 2,913
- 4
- 41
- 61
2
votes
1 answer
htmlToText not found
I am trying to run an R script that I wrote some months ago. I would like to download online information which is why I use the htmlToText function. R can't find this function even though I have included the RCurl and XML…

PeterGerft
- 49
- 1
- 7
2
votes
1 answer
How do I remove all of the aligns and indents from a beautifulsoup output with python?
I'm trying to get information from many different tables from an HTML url without any of the HTML indent/tab formatting. I use get_text to generate the content I want, but it prints with a lot of white space and tabs. I've tried .strip and that…

Huessy
- 111
- 8
1
vote
3 answers
html-to-text conversion using Python standard library only
I'm looking for the best way to convert HTML to text, using only modules from the Python 2.7.x standard library. (I.e., no BeautifulSoup, etc.)
By HTML-to-text conversion I mean the moral equivalent of lynx -dump. In fact, just getting rid of HTML…

kjo
- 33,683
- 52
- 148
- 265
1
vote
3 answers
Serving Word document on button click on C# asp.net page
When code is placed onClick event it does not show open save dialog box and no exception is thrown but works fine onLoad event,opens a open save dialog box to save a word file..
string strDocBody;
strDocBody = "

Rachel
- 25
- 4
1
vote
1 answer
node.js \ why do I get RangeError: Maximum call stack size exceeded
The purpose of the below program is to crawl CNN, and write all its text to a single file (using couple of third parties)
I get
RangeError: Maximum call stack size exceeded
How to troubleshoot this, and how can I bypass that? is there a way I can…

user1025852
- 2,684
- 11
- 36
- 58
0
votes
1 answer
How can I get Teams Messages in text from Microsoft Graph API using Data Factory
In another context (Email) I've used a Header like "Prefer: outlook.body-content-type=text" so was wondering / hoping you could do similar for Teams Messages, but have not been able to find out ... an alternative is to convert the HTML to text…

JohnD
- 187
- 11
0
votes
2 answers
power automate flow - html to text - odd new line
Im struggling with a mail html to text flow.
It works fine normally, but all of a sudden it is inserting a "\n" that shouldnt be there and comparing the   to the other lines where it does not do this does not reveal any differences to me that…

Michael Christensen
- 46
- 5
0
votes
0 answers
Docker Image issues: TypeError: c.charCodeAt(...).toString(...).padStart is not a function for html-to-text
i have issues on nodemailer transport using smtp which i developed in nodejs, it has no issues in local server and email sent correctly, but when i'm dockerize them and deploy to AWS ECS, the API still return status 200 without any error but email…

Khairul Habib
- 452
- 1
- 12
- 29
0
votes
0 answers
Long links gets broken when arrived to gmail from node.js app
I have an html template in the server that I convert to text so I can send him to the user in email.
The problem is that when I use string.link() function for hyperlinks it breaks them in some length.
example: Google
Result in the mail:…

Omer
- 1
- 1
0
votes
2 answers
Converting HTML to plain text that looks like it was copied from a browser using Python
I want to convert HTML to plain text in Python, I hope the results to look like they were copied from the browser. I tried many libraries like html2text, html-text and BeautifulSoup, But none of them get the results I want. For example, the…

Luyu Huang
- 33
- 3
0
votes
1 answer
How to convert HTML into formatted Text so that the layout such as spacing, table borders, element positioning remains intact?
How to convert HTML into formatted Text, so that the layout such as spacing, table borders, element positioning remains intact? Is there any tool for the same?
Tried a few options already as follows but output formatting is not as…

Div22
- 63
- 1
- 7