This is the documentation for Rasa Open Source v2.4.x, which is no longer actively maintained.
For up-to-date documentation, see the latest version (2.6.x).

Version: 2.4.x

rasa.nlu.tokenizers.lm_tokenizer

LanguageModelTokenizer Objects

class LanguageModelTokenizer(WhitespaceTokenizer)

This tokenizer is deprecated and will be removed in the future.

Use the LanguageModelFeaturizer with any other Tokenizer instead.
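As a rough illustration of the migration described above, the sketch below swaps the deprecated tokenizer for another one in a pipeline expressed as a list of component dictionaries (the Python equivalent of the `pipeline` section of `config.yml`). The helper `replace_deprecated_tokenizer` is hypothetical and not part of Rasa. The `model_name` value and the choice of `WhitespaceTokenizer` as the replacement are assumptions made for the example.

```python
from typing import Any, Dict, List

Pipeline = List[Dict[str, Any]]


def replace_deprecated_tokenizer(
    pipeline: Pipeline, replacement: str = "WhitespaceTokenizer"
) -> Pipeline:
    """Return a copy of `pipeline` with LanguageModelTokenizer swapped out.

    Hypothetical helper for illustration only; it is not a Rasa API.
    """
    migrated = []
    for component in pipeline:
        updated = dict(component)  # shallow copy, so the input is not mutated
        if updated.get("name") == "LanguageModelTokenizer":
            updated["name"] = replacement
        migrated.append(updated)
    return migrated


# Assumed example pipeline; the featurizer settings are illustrative.
old_pipeline: Pipeline = [
    {"name": "LanguageModelTokenizer"},
    {"name": "LanguageModelFeaturizer", "model_name": "bert"},
]

new_pipeline = replace_deprecated_tokenizer(old_pipeline)
for component in new_pipeline:
    print(component["name"])
```

The featurizer entry is left unchanged. Only the tokenizer in front of it is replaced.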

__init__

__init__(component_config: Dict[Text, Any] = None) -> None

Initializes LanguageModelTokenizer for tokenization.

Arguments:

  • component_config - Configuration for the component.
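A minimal sketch of passing `component_config` to the constructor follows. The keys shown are assumed to be the options inherited from `WhitespaceTokenizer` in this release, so check them against that class before relying on them. The import is guarded because the module exists only in Rasa versions that still ship this deprecated tokenizer.

```python
from typing import Any, Dict

# Assumed option names, taken to be inherited from WhitespaceTokenizer.
component_config: Dict[str, Any] = {
    "intent_tokenization_flag": False,  # do not split intent labels
    "intent_split_symbol": "_",         # separator used if the flag is True
}

try:
    from rasa.nlu.tokenizers.lm_tokenizer import LanguageModelTokenizer
except ImportError:
    # Rasa is not installed, or this version no longer includes the module.
    LanguageModelTokenizer = None

if LanguageModelTokenizer is not None:
    tokenizer = LanguageModelTokenizer(component_config)
```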