Dictionary() is not supported anymore in tm package. How to emend code?

r tm
CptNemo picture CptNemo · Feb 14, 2014 · Viewed 11.6k times · Source

I just noticed that after updating to tm v. 0.5-10 the function Dictionary() is not supported anymore. Is this an mistake? Or was it deprecated? Am I suppose to use another function to create a dictionary?

Since I have many lines of code to emend now, what's the best way to proceed without engineering everything?

Answer

Ben picture Ben · Feb 15, 2014

As IShouldBuyABoat says, you haven't given us any clue about how you're using Dictionary so we can't really give you any specific answers (do update your question with more details).

In any case, the answer your question of 'how to update my code' is probably 'just delete Dictionary and it should be fine', as you can see here:

library(tm)
data(crude)

Find out what Dictionary did in earlier versions of the tm package:

methods(Dictionary)
getAnywhere(Dictionary.DocumentTermMatrix)
# function(x) structure(Terms(x), class = c("Dictionary", "character"))
getAnywhere(Dictionary.character)
# function (x)  structure(x, class = c("Dictionary", "character"))

Kind of a pointless function anyway, seems quite sensible to remove it. But how to update your code that depended on it?

You may have used Dictionary like this:

myDictionary <- Dictionary(c("some", "tokens", "that", "I", "am", "interested", "in"))
inspect(DocumentTermMatrix(crude, list(dictionary = myDictionary)))

Now that this function is not longer available, you'd do this instead, using a character vector: '

myTerms <- c("some", "tokens", "that", "I", "am", "interested", "in")
inspect(DocumentTermMatrix(crude, list(dictionary = myTerms)))

The output for these two examples is identical, first one was using tm version 0.5-9 and the second with version 0.5-10

The instruction in the NEWS to use Terms is if you want to get all the words in a document term matrix, like so

Terms(DocumentTermMatrix(crude))

If none of that helps you then you'll need to supply more detail about what you're trying to do.