What is difference between en_core_web_sm, en_core_web_md and en_core_web_lg model of spacy?

Anil Jagtap picture Anil Jagtap · May 23, 2018 · Viewed 8.8k times · Source

I installed spacy on my system and I want to parse/extract person name, organization for english. But I saw here, there is 4 model for english. And there is model versioning. I didn't get which model is large and which I have to choose for development?

Answer

AKX picture AKX · May 23, 2018

sm/md/lg refer to the sizes of the models (small, medium, large respectively).

As it says on the models page you linked to,

Model differences are mostly statistical. In general, we do expect larger models to be "better" and more accurate overall. Ultimately, it depends on your use case and requirements. We recommend starting with the default models (marked with a star below).

FWIW, the sm model is the default (as alluded to above)