✨ New features and improvements NEW: span_finder pipeline component to identify overlapping, unlabeled spans (#12507). Language updates: Add initial support for Malay (#12602). Update Latin defaults to support noun chunks, update lexical/tokenizer defaults and add example sentences (#12538). Add option to return scores separately keyed by component name with spacy evaluate --per-component, Language.evaluate(per_component=True) and Scorer.score(per_component=True) (#12540). Support custom token/lexeme attribute for vectors (#12625). Support spancat_singlelabel in spacy debug data CLI (#12749). Typing updates for PhraseMatcher and SpanGroup (#12642, #12714). Bug fixes #12569: Require that all SpanGroup spans come from the current doc. ...
✨ New features and improvements Alpha tokenization support for Ancient Greek. Implementation of a n...
✨ New features and improvements NEW: Support multiprocessing in nlp.pipe via the n_process argument...
✨ New features and improvements NEW: Luganda language support (#10847). NEW: Latin language support...
✨ New features and improvements NEW: Trained pipelines for Catalan and a new transformer-based pipe...
✨ New features and improvements NEW: Provide scores for the SpanCategorizer predictions. NEW: Broad...
✨ New features and improvements Support for mypy 0.950+ and pydantic v1.9 (#10786). Prebuilt linux ...
We'd love to hear more about your experience with spaCy! Take our survey here. ✨ New features and im...
✨ New features and improvements NEW: New apply CLI command to annotate new documents with a traine...
✨ New features and improvements Add support for floret vectors in spacy pretrain (#12435). Save fin...
✨ New features and improvements NEW: Registered scoring functions for each component in the config....
✨ New features and improvements Allow sourcing disabled components in config. Support Doc.spans in ...
✨ New features and improvements Add the SpanRuler component. This component saves a list of matched...
✨ New features and improvements Improved speeds for many components, see speed benchmarks for train...
We had to release another update to the v2.0.x branch of spaCy to resolve a dependency issue, so we ...
✨ New features and improvements New assemble CLI command for assembling a pipeline from a config wi...
✨ New features and improvements Alpha tokenization support for Ancient Greek. Implementation of a n...
✨ New features and improvements NEW: Support multiprocessing in nlp.pipe via the n_process argument...
✨ New features and improvements NEW: Luganda language support (#10847). NEW: Latin language support...
✨ New features and improvements NEW: Trained pipelines for Catalan and a new transformer-based pipe...
✨ New features and improvements NEW: Provide scores for the SpanCategorizer predictions. NEW: Broad...
✨ New features and improvements Support for mypy 0.950+ and pydantic v1.9 (#10786). Prebuilt linux ...
We'd love to hear more about your experience with spaCy! Take our survey here. ✨ New features and im...
✨ New features and improvements NEW: New apply CLI command to annotate new documents with a traine...
✨ New features and improvements Add support for floret vectors in spacy pretrain (#12435). Save fin...
✨ New features and improvements NEW: Registered scoring functions for each component in the config....
✨ New features and improvements Allow sourcing disabled components in config. Support Doc.spans in ...
✨ New features and improvements Add the SpanRuler component. This component saves a list of matched...
✨ New features and improvements Improved speeds for many components, see speed benchmarks for train...
We had to release another update to the v2.0.x branch of spaCy to resolve a dependency issue, so we ...
✨ New features and improvements New assemble CLI command for assembling a pipeline from a config wi...
✨ New features and improvements Alpha tokenization support for Ancient Greek. Implementation of a n...
✨ New features and improvements NEW: Support multiprocessing in nlp.pipe via the n_process argument...
✨ New features and improvements NEW: Luganda language support (#10847). NEW: Latin language support...