Questions tagged [read-text]

51 questions
4
votes
3 answers

READ_TEXT similar function in CDS View

I need to read text stored for invoice, order, etc. In ABAP we can use READ_TEXT function module, but I need to read it within a CDS view. Is there a way to read text from database tables directly in a SELECT statement?
Eralper
  • 6,461
  • 2
  • 21
  • 27
1
vote
0 answers

Get package name of caller script from imported python file

I have a project that will be zipped and run from Spark, let's call it "client" project. I would like, on a library imported by a script within this client project, to read some static config file the "client" program will provide following a…
Mehdi
  • 765
  • 9
  • 19
1
vote
1 answer

I want to read a txt with infos of a movie in each line and save it in to a dynamic array of structures

I'm really new to C programming and I try to make this as an example of reading files and saving them to dynamic array of structs the infos of the txt are: Movie id:1448 title:The movie surname of director: lorez name of director: john date:…
1
vote
1 answer

Error running Cognitive Services Read Text container preview

When I am processing an image file, the container crashes with the following error message azureocr_1 | docker-entrypoint.sh: line 6: 9 Illegal instruction (core dumped) KMP_BLOCKTIME=1 KMP_HOT_TEAMS_MODE=1 OMP_WAIT_POLICY=PASSIVE…
1
vote
1 answer

When reading in pdf text using readtext is there a way to ensure that readtext respects columns?

The problem is that I have a PDF document formatted in landscape with three columns of text which I am attempting to read into R using readtext(). When it reads the text in, rather than reading down each column in order, it is reading between…
1
vote
2 answers

Extract specific words from the file

I am analyzing some text files and I want to extract a specific word every time that the word is found in the file. Imagine that I have 'Sports' in the file then I want to extract the Word 'SPORTS' based on a list. I've the following code: content =…
Pedro Alves
  • 1,004
  • 1
  • 21
  • 47
1
vote
1 answer

Text mining with tm in R antiword error

So I'm rather new to R, and I'm learning how to mine text from this handy website: https://eight2late.wordpress.com/2015/05/27/a-gentle-introduction-to-text-mining-using-r/ I do have my own text set of .doc, .docx, and .xlsx files and I'm trying to…
Zac
  • 329
  • 1
  • 3
  • 8
1
vote
1 answer

Java error in useDelimiter()

This is what I have been working so far for reading a text file, Scanner file = new Scanner(new File("sample.txt")).useDelimiter("."); ArrayList arr = new ArrayList<>(); while (file.hasNextLine()) { …
Mr.B
  • 29
  • 5
0
votes
1 answer

Import texts and docvars from XML file with readtext package

I'm trying to import texts from xml files with readtext package in order to then create and explore a corpus with quanteda. Reading the help page I've figured out how to import the texts, but I'd like to know if one can create docvars based on nodes…
Tetro
  • 5
  • 4
0
votes
1 answer

Using {readtext} to import text data from a PDF file containing text and images: R Studio aborts. No error message

I am trying to import text contained in a PDF file into R Studio, using {readtext}. In this past, this has worked smoothly and still does so for the most part. However, there are a handful of PDF files I struggle to import, meaning that R Studio…
Dr. Fabian Habersack
  • 1,111
  • 12
  • 30
0
votes
0 answers

How to remove lines after reading pdf files with readtext function in R

I am reading multiple pdf files with readtext () and I want to remove the first 10 or 20 lines of each pdf before creating the corpus and the tokens with quanteda library(readtext) testfiles <- readtext ("*.pdf", docvarsfrom = …
0
votes
1 answer

Extract pages and structured content from pdf and save it to data-frame

I have a PDF with several 100 pages. The pdf contains press releases that have a different length (from 1 page to several pages). Each press release, however, starts and ends with a similar structure: Example of the head of one press release:…
feder80
  • 1,195
  • 3
  • 13
  • 34
0
votes
0 answers

Need to decode CLUSTD column SAP Hana

i am looking to decode the CLUSTD column of SAP, which is in encoded form as binary. I want to decode it outside SAP, I know READ_TEXT helps decode it but it is in SAP environment. My CLUSTD is in SNowflake after migration and I want to decode it…
0
votes
0 answers

Reading from a .txt file and creating a nested list - Python

I have a txt. file that I need to read and create a nested list. However some of details needs to be removed from the nested list. Please see the example below: file1.txt One Type 1 2 3 4 Friend Sky House After Run the code I have to get: (’type’,…
0
votes
0 answers

readtext returns error when reading too many .rtf files

I am trying to use readtext in R to import over 13,000 .rtf files but received an error message below. uk <- readtext("/Users/path/*.rtf", docvarsfrom = "filenames", docvarnames = c("country", "year", "id"), dvsep = "_") Error in…
1
2 3 4