notice
This is unreleased documentation for Rasa Open Source Documentation Master/Unreleased version.
For the latest released documentation, see the latest version (2.5.x).
rasa.nlu.featurizers.sparse_featurizer.count_vectors_featurizer
CountVectorsFeaturizer Objects
Creates a sequence of token counts features based on sklearn's CountVectorizer
.
All tokens which consist only of digits (e.g. 123 and 99 but not ab12d) will be represented by a single feature.
Set analyzer
to 'char_wb'
to use the idea of Subword Semantic Hashing
from https://arxiv.org/abs/1810.07150.
__init__
Construct a new count vectorizer using the sklearn framework.
train
Train the featurizer.
Take parameters from config and construct a new count vectorizer using the sklearn framework.
process
Process incoming message and compute and set features
persist
Persist this model into the passed directory.
Returns the metadata necessary to load the model again.
load
Loads trained component (see parent class for full docstring).