notice

This is documentation for Rasa Documentation v2.x, which is no longer actively maintained.
For up-to-date documentation, see the latest version (3.x).

Version: 2.x

rasa.nlu.tokenizers.whitespace_tokenizer

WhitespaceTokenizer Objects

class WhitespaceTokenizer(Tokenizer)

__init__

| __init__(component_config: Dict[Text, Any] = None) -> None

Construct a new tokenizer using the WhitespaceTokenizer framework.

remove_emoji

| remove_emoji(text: Text) -> Text

Remove emoji if the full text, aka token, matches the emoji regex.