Questions tagged [textacy]

Reference Site: https://textacy.readthedocs.io/en/stable/

Features

Stream text, json, csv, and spaCy binary data to and from disk
Clean and normalize raw text, before analyzing it
Explore a variety of included datasets, with both text data and metadata
from Congressional speeches to historical literature to Reddit comments
Access and filter basic linguistic elements, such as words and ngrams, noun chunks and sentences
Extract named entities, acronyms and their definitions, direct quotations, key terms, and more from documents
Compare strings, sets, and documents by a variety of similarity metrics
Transform documents and corpora into vectorized and semantic network representations
Train, interpret, visualize, and save sklearn-style topic models using LSA, LDA, or NMF methods

40 questions

votes

2 answers

Calculate TD-IDF for a single word in Textacy

I'm trying to use Textacy to calculate the TF-IDF score for a single word across the standard corpus, but am a bit unclear about the result I am receiving. I was expecting a single float which represented the frequency of the word in the corpus. So…

asked Apr 19 '19 at 16:19

port5432

5,889
10
60
97

votes

2 answers

My question is about "module 'textacy' has no attribute 'Doc'"

Can't find module 'textacy' has no attribute 'Doc' I am trying to extract verb phrases from spacy but there is such no library. Please help me how can I extract the verb phrases or adjective phrases using spacy. I want to do full shallow…

spacy textacy

asked Jun 23 '19 at 01:00

Gul Jabeen

votes

1 answer

Create subject-verb-object model of complex, fragmented sentences from police reports

I am fairly new to spacy / textacy and I have a complicated task ahead. Your help is much appreciated. In a nutshell, from a sentence like "Did assault paramedic by kicking and pushing him", I want to establish whether the reported abuse was against…

python nlp spacy sentence textacy

asked Nov 28 '19 at 17:56

Kristin

votes

1 answer

multiprocessing with textacy or spacy

I am trying to speed up processing of large lists of texts via parallelisation of textacy. When I use Pool from multiprocessing the resulting textacy corpus comes out empty. I am not sure if the problem is in the way I use textacy or multiprocessing…

python multiprocessing spacy pool textacy

asked Oct 08 '19 at 22:05

Diego

votes

1 answer

More efficient implementation of Textacy / spacy 'subject_verb_object_triples'

I'm trying to implement the 'extract.subject_verb_object_triples' funcation from textacy on my dataset. However, the code I have written is very slow and memory intensive. Is there a more efficient implementation? import spacy import textacy def…

python pandas nlp spacy textacy

asked Dec 27 '18 at 13:11

W.R

votes

2 answers

How to initialize a `Doc` in textacy 0.6.2?

Trying to follow the simple Doc initialization in the docs in Python 2 doesn't work: >>> import textacy >>> content = ''' ... The apparent symmetry between the quark and lepton families of ... the Standard Model (SM) are, at the very least,…

python nlp textacy

asked Jul 19 '18 at 20:21

arturomp

28,790
10
43
72

votes

1 answer

Using spacy and textacy. Need to find tf-idf score across corpus of original tweets but cant import textacy vectorizer

I'm new to these frameworks as well as NLP. I am following an example which gives me the following code snippet to calculate the tf-idf score of all the tokens in the tweets. However I keep getting either import errors or Vectorizer undefined.…

python-3.x tf-idf spacy textacy

asked Apr 20 '18 at 15:01

aldmarj

votes

1 answer

Textacy with Jupyter Notebook: How to suppress multiple error warnings?

I am using Textacy (on top of Spacy) to process many snippets of text. Specifically I use Textacy´s Readability scores. Since I have a lot of short texts I get a warning that I need to suppress because it otherwise will crash my notebook. My…

python nlp jupyter-notebook spacy textacy

asked Sep 23 '17 at 17:54

petezurich

9,280
9
43
57

votes

0 answers

Find topic weight in part of the corpus

I am doing topic modeling with tweets on Python. I am working on two time periods. I want to extracts topics with Spacy's textacy training the model on the corpus of both the time periods. Then, I want to analyse the weight of the topics on the…

python nlp spacy topic-modeling textacy

asked Apr 28 '22 at 17:26

s12345

votes

1 answer

How to extract verb phrases today?

For a project on NLP I need to extract verb phrases from a list of sentences. I have read some older posts from StackOverflow and watched this video. All was very helpful in understanding my problem and learning about possible patterns, but all code…

python nlp spacy textacy

asked Apr 27 '21 at 17:22

Sam V

votes

1 answer

Spacy/Textacy not reading file contents from .txt (text) file

I am trying to read the contents (blog) from a text file using Python (SpaCy/Textacy/Textblob) but it has been in vain, so far. Following is the code that I have recently tried: import content as content import pattern as pattern import…

python text spacy textblob textacy

asked Dec 11 '19 at 15:47

user3438153

votes

0 answers

unable to install textacy in python 3.0

I am trying to install textacy to perform NLP tasks, but getting an error while trying to do: pip install textacy in Anaconda prompt. The error I am getting is error: Microsoft Visual C++ 14.0 is required. Get it with "Microsoft Visual C++ Build…

python anaconda nlp textacy

asked Mar 02 '19 at 04:19

harish fegade

vote

1 answer

Extract quotations and attribution from text

I am attempting to extract quotations and quotation attributions (i.e., the speaker) from text, but I am not obtaining the desired output. I am using textacy. Here is what I have tried so far: import textacy from textacy import extract from…

python function nlp text-extraction textacy

asked Jun 10 '22 at 18:19

jedmund

vote

1 answer

module 'thinc' has no attribute 'layers'

I am following this article for my work and in this article, under heading Verb Phrase Detection, I am following the instructions but after successfully installing the textacy library (It shows in pip list) when I use import textacy in jupyter…

python jupyter-notebook textacy

asked Sep 02 '21 at 11:13

ankit

vote

0 answers

spacy/textacy: subject_verb_object_triples(doc) not returning any triplets

My goal is to extract SVO-triplets from simple sentences. For example for the sentence "A person is standing in a kitchen making a sandwich." I want the output and . I tried to use spacy/textacy…

nlp spacy textacy

asked Jun 30 '21 at 21:55

josch14

2 3 Next