This is unreleased documentation for the Master/Unreleased version of Rasa Open Source.
For the latest released documentation, see the latest version (2.5.x).


LanguageModelTokenizer Objects

class LanguageModelTokenizer(WhitespaceTokenizer)

This tokenizer is deprecated and will be removed in the future.

Use the LanguageModelFeaturizer with any other Tokenizer instead.
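The migration described above can be sketched as a small rewrite of a pipeline configuration. This is only an illustration: the helper `migrate_pipeline` is hypothetical and not part of Rasa, the pipeline is assumed to be a list of dictionaries as it would look after loading `config.yml`, and `WhitespaceTokenizer` is chosen as the replacement only because it is the parent class of `LanguageModelTokenizer` (any other tokenizer should work equally well). The `model_name` value shown is an assumed example setting.

```python
from typing import Any, Dict, List

# Component names taken from this page; the helper itself is hypothetical.
DEPRECATED_TOKENIZER = "LanguageModelTokenizer"
REPLACEMENT_TOKENIZER = "WhitespaceTokenizer"


def migrate_pipeline(pipeline: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Return a copy of the pipeline with the deprecated tokenizer swapped out."""
    migrated = []
    for component in pipeline:
        entry = dict(component)  # shallow copy so the input is not mutated
        if entry.get("name") == DEPRECATED_TOKENIZER:
            entry["name"] = REPLACEMENT_TOKENIZER
        migrated.append(entry)
    return migrated


# Assumed example pipeline, as it might appear after parsing config.yml.
old_pipeline = [
    {"name": "LanguageModelTokenizer"},
    {"name": "LanguageModelFeaturizer", "model_name": "bert"},
]
new_pipeline = migrate_pipeline(old_pipeline)
print(new_pipeline)
```

The featurizer entry is left untouched; only the tokenizer in front of it changes.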


__init__(component_config: Dict[Text, Any] = None) -> None

Initializes LanguageModelTokenizer for tokenization.


  • component_config - Configuration for the component.