Questions tagged [part-of-speech]

Linguistic category of words

In grammar, a part of speech (also a word class, a lexical class, or a lexical category) is a linguistic category of words (or more precisely lexical items), which is generally defined by the syntactic or morphological behaviour of the lexical item in question.

From http://en.wikipedia.org/wiki/Parts_of_speech

194 questions
184
votes
10 answers

Java Stanford NLP: Part of Speech labels?

The Stanford NLP, demo'd here, gives an output like this: Colorless/JJ green/JJ ideas/NNS sleep/VBP furiously/RB ./. What do the Part of Speech tags mean? I am unable to find an official list. Is it Stanford's own system, or are they using…
Nick Heiner
  • 119,074
  • 188
  • 476
  • 699
20
votes
1 answer

Count verbs, nouns, and other parts of speech with python's NLTK

I have multiple texts and I would like to create profiles of them based on their usage of various parts of speech, like nouns and verbs. Basially, I need to count how many times each part of speech is used. I have tagged the text but am not sure how…
Zach
  • 4,624
  • 13
  • 43
  • 60
14
votes
8 answers

Strategies for recognizing proper nouns in NLP

I'm interested in learning more about Natural Language Processing (NLP) and am curious if there are currently any strategies for recognizing proper nouns in a text that aren't based on dictionary recognition? Also, could anyone explain or link to…
VirtuosiMedia
  • 52,016
  • 21
  • 93
  • 140
11
votes
1 answer

How to get the infinitive form of the verb using NLTK (pos tagging)

I'm trying to learn natural language processing (of English) using NLTK and Python. Is there a way to get the infinitive form of the verb during or after POS-tagging. For example: is (VBZ) => to be provided (VBN) => to provide using (VBG) => to…
Christian V
  • 2,010
  • 17
  • 26
10
votes
1 answer

What created `maxent_treebank_pos_tagger/english.pickle`?

The nltk package's built-in part-of-speech tagger does not seem to be optimized for my use-case (here, for instance). The source code here shows that it's using a saved, pre-trained classifier called maxent_treebank_pos_tagger. What created…
zkurtz
  • 3,230
  • 7
  • 28
  • 64
8
votes
2 answers

Uses/Applications of Part-of-speech-tagging (POS Tagging)

I understand the implicit value of part-of-speech tagging and have seen mentions about its use in parsing, text-to-speech conversion, etc. Could you tell me how is the output of a PoS tagger formated ? Also, could you explain how is such an output…
H W
  • 121
  • 2
  • 8
8
votes
1 answer

Is there any Part-Of-Speech tagger in C#?

My data pre-processing for data clustering needs part of speech (POS) tagging. I am wondering if there's some library in C# ready for this.
smwikipedia
  • 61,609
  • 92
  • 309
  • 482
7
votes
1 answer

WordNet - What does n and the number represent?

My question is related to WordNet Interface. >>> wn.synsets('cat') [Synset('cat.n.01'), Synset('guy.n.01'), Synset('cat.n.03'), Synset('kat.n.01'), Synset('cat-o'-nine-tails.n.01'), Synset('caterpillar.n.02'),…
malocho
  • 265
  • 3
  • 13
6
votes
1 answer

Is there a microformat for labeling sentences, words, parts-of-speech, etc

Is there a microformat for basic natural language process that has tags for sentences, words, parts-of-speech, etc...? I have searched the web but could not find any.
Jeff Kubina
  • 800
  • 4
  • 15
6
votes
1 answer

How can I convert CLAWS7 tags to Penn tags?

Does anyone of you know a way to convert a tag from CLAWS7 tagset to it's equivalent in Penn tagset? CLAWS7 tagset: http://ucrel.lancs.ac.uk/claws7tags.html Penn tagset: http://www.mozart-oz.org/mogul/doc/lager/brill-tagger/penn.html
Amin Y
  • 701
  • 1
  • 9
  • 15
6
votes
1 answer

How to use Keras to build a Part-of-Speech tagger?

I'm trying to implement a Part-of-Speech tagger using neural network with the help of Keras. I'm using a Sequential model, and training data from NLTK's Penn Treebank Corpus(i.e. from nltk.corpus import treebank). According to my understanding, to…
6
votes
2 answers

Confused about priority between stemmer and pos tagger

So I was analyzing a text corpus and I used stemmer for all the tokenized words. But I also have to find all the nouns in the corpus so I again did a nltk.pos_tag(stemmed_sentence) But my question is am I doing it right? A.]…
user4197202
5
votes
1 answer

What is the difference between parsing and Part Of Speech Tagging?

I know that POS tagging labels each and every word in a sentence with its appropriate Part Of Speech , But isn't that what a Parser does too ? i.e, break a sentence into its component parts? I've looked this up on the internet but couldn't find any…
Bharathi
  • 201
  • 1
  • 8
5
votes
1 answer

Part of Speech (POS) tag Feature Selection for Text Classification

I have the POS tag sentences obtain using Stanford POS tagger. Eg: The/DT island/NN was/VBD very/RB beautiful/JJ ./. I/PRP love/VBP it/PRP ./. (xml format also available) Can anyone explain how to perform feature selection from this POS tag…
5
votes
3 answers

Finding relations between Pronouns and Nouns in sentences

I am working on an NLP project and I need the following functionality illustrated by an example. Say there is a sentence Tell Sam that he will have to leave without Arthur, as he is sick. In this statement, the first he has to be tagged to Sam…
Ameet Deshpande
  • 496
  • 8
  • 22
1
2 3
12 13