This is the documentation for Rasa v2.x, which is no longer actively maintained.
For up-to-date documentation, see the latest version (3.x).
rasa.nlu.tokenizers.whitespace_tokenizer
WhitespaceTokenizer Objects
class WhitespaceTokenizer(Tokenizer)
__init__
| __init__(component_config: Dict[Text, Any] = None) -> None
Construct a new tokenizer using the WhitespaceTokenizer framework.
remove_emoji
| remove_emoji(text: Text) -> Text
Remove the emoji if the full text (that is, the whole token) matches the emoji regex. A token that only partly consists of emoji does not match in full.
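The full-match behaviour described above can be illustrated with a minimal standalone sketch. It does not import Rasa: `EMOJI_PATTERN` is a hypothetical pattern covering a few common Unicode emoji blocks, and returning an empty string for a matching token is an assumption about what "remove" means here. The pattern and return value used by the library may differ.

```python
import re

# Hypothetical emoji pattern covering a few common Unicode blocks;
# the pattern used by Rasa itself may differ.
EMOJI_PATTERN = re.compile(
    "["
    "\U0001F600-\U0001F64F"  # emoticons
    "\U0001F300-\U0001F5FF"  # symbols and pictographs
    "\U0001F680-\U0001F6FF"  # transport and map symbols
    "\U0001F1E0-\U0001F1FF"  # regional indicator symbols (flags)
    "]+"
)


def remove_emoji(text: str) -> str:
    """Return "" when the whole token is emoji, else the token unchanged.

    Assumption: "removing" a token means replacing it with an empty string.
    """
    # fullmatch succeeds only if the entire string matches the pattern.
    if EMOJI_PATTERN.fullmatch(text) is not None:
        return ""
    return text


print([remove_emoji(token) for token in ["hello", "😀", "hi😀"]])
```

Because the check uses `fullmatch`, a mixed token such as `hi😀` is left untouched in this sketch; only tokens made up entirely of emoji are dropped.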