Version: 3.x

rasa.nlu.tokenizers._whitespace_tokenizer

WhitespaceTokenizer Objects

class WhitespaceTokenizer(Tokenizer)

__init__

| __init__(component_config: Dict[Text, Any] = None) -> None

Construct a new tokenizer using the WhitespaceTokenizer framework.

remove_emoji

| remove_emoji(text: Text) -> Text

Remove emoji if the full text, aka token, matches the emoji regex.