notice
This is documentation for Rasa & Rasa Pro Documentation v2.x, which is no longer actively maintained.
For up-to-date documentation, see the latest version (3.x).
rasa.nlu.featurizers.sparse_featurizer.count_vectors_featurizer
CountVectorsFeaturizer Objects
Creates a sequence of token counts features based on sklearn's CountVectorizer
.
All tokens which consist only of digits (e.g. 123 and 99 but not ab12d) will be represented by a single feature.
Set analyzer
to 'char_wb'
to use the idea of Subword Semantic Hashing
from https://arxiv.org/abs/1810.07150.
__init__
Construct a new count vectorizer using the sklearn framework.
train
Train the featurizer.
Take parameters from config and construct a new count vectorizer using the sklearn framework.
process
Process incoming message and compute and set features
persist
Persist this model into the passed directory.
Returns the metadata necessary to load the model again.
load
Loads trained component (see parent class for full docstring).