✨ New features and improvements Improve Korean tokenizer speed. Add experimental character-based pretraining. Bug fixes Fix issue #5728: Fix French lemmatizer. Fix issue #5729: Fix lemmatizer for python 2.7. Fix issue #5751: Fix meta serialization in train CLI. Contributors Thanks to @graue70, @mikeizbicki, @jbesomi, @gandersen101 and @DeNeutoy for the pull requests and contributions
✨ New features and improvements Allow sourcing disabled components in config. Support Doc.spans in ...
✨ New features and improvements Move ud_train, ud_evaluate and other UD scripts from CLI to /bin in...
✨ New features and improvements The v3 of WandbLogger now supports optional run_name and entity par...
✨ New features and improvements NEW: Tokenizer.explain method to see which rule or pattern was matc...
✨ New features and improvements Modify blis and numpy build dependencies to simplify source install...
✨ New features and improvements NEW: Add alpha support for Nepali. Refactor Japanese tokenizer and ...
We had to release another update to the v2.0.x branch of spaCy to resolve a dependency issue, so we ...
Help us improve spaCy and take the User Survey 2018! ✨ New features and improvements NEW: Alpha Vi...
✨ New features and improvements NEW: Base language data for Marathi and Korean (via mecab-ko, mecab...
✨ New features and improvements NEW: Registered scoring functions for each component in the config....
✨ New features and improvements NEW: New apply CLI command to annotate new documents with a traine...
✨ New features and improvements Add Token.tensor and Span.tensor attributes. Support simple trainin...
✨ New features and improvements Improved parser and ner speeds on long documents (see technical det...
✨ New features and improvements Alpha tokenization support for Ancient Greek. Implementation of a n...
✨ New features and improvements NEW: Add alpha support for Macedonian and Sanskrit. Update language...
✨ New features and improvements Allow sourcing disabled components in config. Support Doc.spans in ...
✨ New features and improvements Move ud_train, ud_evaluate and other UD scripts from CLI to /bin in...
✨ New features and improvements The v3 of WandbLogger now supports optional run_name and entity par...
✨ New features and improvements NEW: Tokenizer.explain method to see which rule or pattern was matc...
✨ New features and improvements Modify blis and numpy build dependencies to simplify source install...
✨ New features and improvements NEW: Add alpha support for Nepali. Refactor Japanese tokenizer and ...
We had to release another update to the v2.0.x branch of spaCy to resolve a dependency issue, so we ...
Help us improve spaCy and take the User Survey 2018! ✨ New features and improvements NEW: Alpha Vi...
✨ New features and improvements NEW: Base language data for Marathi and Korean (via mecab-ko, mecab...
✨ New features and improvements NEW: Registered scoring functions for each component in the config....
✨ New features and improvements NEW: New apply CLI command to annotate new documents with a traine...
✨ New features and improvements Add Token.tensor and Span.tensor attributes. Support simple trainin...
✨ New features and improvements Improved parser and ner speeds on long documents (see technical det...
✨ New features and improvements Alpha tokenization support for Ancient Greek. Implementation of a n...
✨ New features and improvements NEW: Add alpha support for Macedonian and Sanskrit. Update language...
✨ New features and improvements Allow sourcing disabled components in config. Support Doc.spans in ...
✨ New features and improvements Move ud_train, ud_evaluate and other UD scripts from CLI to /bin in...
✨ New features and improvements The v3 of WandbLogger now supports optional run_name and entity par...