Top "Nltk" questions

The Natural Language Toolkit is a Python library for computational linguistics.

NLTK tree data structure, finding a node, it's parent or children

I am using nltk's Tree data structure to work with parsetree strings. from nltk.tree import Tree parsed = Tree('(…

python tree nltk parse-tree
Python's NLTK vs. related Java Libraries?

I've used LingPipe, Stanford's NER, RiTa and various sentence similarity libraries for my previous Java projects that focused on text (…

java python information-retrieval nltk wordnet
Tokenization of Arabic words using NLTK

I'm using NLTK word_tokenizer to split a sentence into words. I want to tokenize this sentence: في_بيتنا كل شي لما تحتاجه يضيع ...ادور على شاحن فجأة يختفي ..لدرجة اني اسوي نفسي ادور شيء The code I'm …

python tokenize nltk
How to get all the hyponyms of a word/synset in python nltk and wordnet?

I have a list of all the nouns in wordnet now i want to leave only words which are vehicles …

python nltk wordnet
Named Entity Recognition with Regular Expression: NLTK

I have been playing with NLTK toolkit. I come across this problem a lot and searched for solution online but …

regex nlp nltk named-entity-recognition
Ngram model and perplexity in NLTK

To put my question in context, I would like to train and test/compare several (neural) language models. In order …

python nltk n-gram
How do I get a set of grammar rules from Penn Treebank using python & NLTK?

I'm fairly new to NLTK and Python. I've been creating sentence parses using the toy grammars given in the examples …

python parsing grammar nltk tagged-corpus
Setting NLTK with Stanford NLP (both StanfordNERTagger and StanfordPOSTagger) for Spanish

The NLTK documentation is rather poor in this integration. The steps I followed were: Download http://nlp.stanford.edu/software/…

python python-3.x nlp nltk stanford-nlp
How to get the WordNet synset given an offset ID?

I have a WordNet synset offset (for example id="n#05576222"). Given this offset, how can I get the synset using …

python python-2.7 nlp nltk wordnet
remove stopwords and tokenize for collocationbigramfinder NLTK

I keep getting this error sub return _compile(pattern, flags).sub(repl, string, count) TypeError: expected string or buffer when …

python nltk tokenize stop-words