This is documentation for Rasa Documentation v2.x, which is no longer actively maintained.
For up-to-date documentation, see the latest version (3.x).
Version: 2.x
rasa.nlu.tokenizers.tokenizer
Token Objects
class Token()
get
| get(prop: Text, default: Optional[Any] = None) -> Any
Returns the value stored on the token for the property prop, or default if that property is not set.
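To illustrate the lookup-with-default behaviour of get, here is a minimal sketch. SketchToken is a hypothetical stand-in written for this example and is not Rasa's Token class; the assumption that extra properties live in a dictionary called data is made only for illustration.

```python
from typing import Any, Dict, Optional


class SketchToken:
    """Hypothetical stand-in for a token; not Rasa's actual class."""

    def __init__(
        self, text: str, start: int, data: Optional[Dict[str, Any]] = None
    ) -> None:
        self.text = text
        self.start = start
        self.end = start + len(text)
        # Extra properties attached to the token (assumed to live in a dict).
        self.data = data if data is not None else {}

    def get(self, prop: str, default: Optional[Any] = None) -> Any:
        # Look up the property; fall back to `default` when it is missing.
        return self.data.get(prop, default)


token = SketchToken("hello", 0, data={"pos": "INTJ"})
pos_tag = token.get("pos")            # property that was set
missing = token.get("lemma", "n/a")   # property that was never set
```

With this sketch, a missing property yields the supplied default instead of raising an error, which is the pattern the get signature above suggests.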
Tokenizer Objects
class Tokenizer(Component)
Base class for tokenizers.
__init__
| __init__(component_config: Dict[Text, Any] = None) -> None
Constructs a new tokenizer from the optional component configuration.
tokenize
| tokenize(message: Message, attribute: Text) -> List[Token]
Tokenizes the text of the provided attribute of the incoming message.
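As a rough illustration of what a concrete tokenize implementation might compute, the sketch below splits a string on whitespace and records character offsets for each token. It works on a plain string rather than a Rasa Message, and SketchToken and whitespace_tokenize are hypothetical names introduced for this example only.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass
class SketchToken:
    """Hypothetical token record: the text plus its character offsets."""

    text: str
    start: int
    end: int


def whitespace_tokenize(text: str) -> List[SketchToken]:
    # Each run of non-whitespace characters becomes one token with its offsets.
    return [
        SketchToken(match.group(), match.start(), match.end())
        for match in re.finditer(r"\S+", text)
    ]


tokens = whitespace_tokenize("book a table")
```

Keeping the start and end offsets alongside the text lets later steps map each token back to its position in the original string.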
train
| train(training_data: TrainingData, config: Optional[RasaNLUModelConfig] = None, **kwargs: Any) -> None
Tokenize all training data.
process
| process(message: Message, **kwargs: Any) -> None
Tokenize the incoming message.
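The relationship between tokenize, train and process can be sketched as a template-method pattern: a subclass supplies tokenize, while train applies it to every training example and process applies it to a single incoming message. The code below is a simplified sketch under stated assumptions, not Rasa's implementation: messages are modelled as plain dictionaries, only a "text" attribute is tokenized, results are stored under a hypothetical "tokens" key, and SketchTokenizer and SplitTokenizer are names invented for this example.

```python
from typing import Dict, List


class SketchTokenizer:
    """Hypothetical base class; messages are modelled as plain dicts."""

    def tokenize(self, message: Dict[str, object], attribute: str) -> List[str]:
        # Subclasses provide the actual splitting logic.
        raise NotImplementedError

    def train(self, training_examples: List[Dict[str, object]]) -> None:
        # Tokenize every training example.
        for example in training_examples:
            self.process(example)

    def process(self, message: Dict[str, object]) -> None:
        # Tokenize a single message and store the result on it.
        message["tokens"] = self.tokenize(message, "text")


class SplitTokenizer(SketchTokenizer):
    def tokenize(self, message: Dict[str, object], attribute: str) -> List[str]:
        # Split the chosen attribute on whitespace.
        return str(message[attribute]).split()


tokenizer = SplitTokenizer()
examples = [{"text": "hi there"}, {"text": "good morning bot"}]
tokenizer.train(examples)

incoming = {"text": "book a table"}
tokenizer.process(incoming)
```

In this arrangement a new tokenizer only needs to override tokenize; the training-time and inference-time entry points stay in the base class.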