notice
This is documentation for Rasa Open Source Documentation v2.2.x, which is no longer actively maintained.
For up-to-date documentation, see the latest version (2.3.x).
Version Migration Guide
This page contains information about changes between major versions and how you can migrate from one version to another.
Rasa 2.1 to Rasa 2.2
General
TEDPolicy
's transformer_size
, number_of_transformer_layers
,
and dense_dimensions
parameters have been renamed.
Please update your configuration files using the following mapping:
Old Model Parameter | New Model Parameter |
---|---|
transformer_size | dictionary transformer_size with keys |
text , action_text , label_action_text , dialogue | |
number_of_transformer_layers | dictionary number_of_transformer_layers with keys |
text , action_text , label_action_text , dialogue | |
dense_dimension | dictionary dense_dimension with keys |
text , action_text , label_action_text , intent , | |
action_name , label_action_name , entities , slots , | |
active_loop |
For example:
Deprecations
Markdown Data
Training and test data in Markdown format is now deprecated. This includes:
- reading and writing of story files in Markdown format
- reading and writing of NLU data in Markdown format
- reading and writing of retrieval intent data in Markdown format
Support for Markdown data will be removed entirely in Rasa Open Source 3.0.0.
Please convert your existing Markdown data by using the commands described here.
Policies
Policies now require a **kwargs
argument in their constructor and load
method.
Policies without **kwargs
will be supported until Rasa version 3.0.0
.
However when using incremental training
**kwargs
must be included.
Other
Domain.random_template_for
is deprecated and will be removed in Rasa Open Source 3.0.0. You can alternatively use theTemplatedNaturalLanguageGenerator
.Domain.action_names
is deprecated and will be removed in Rasa Open Source 3.0.0. Please useDomain.action_names_or_texts
instead.
Rasa 2.0 to Rasa 2.1
Deprecations
ConveRTTokenizer
is now deprecated. ConveRTFeaturizer now implements
its behaviour. To migrate, replace ConveRTTokenizer
with any other tokenizer, for e.g.:
HFTransformersNLP
and LanguageModelTokenizer
components are now deprecated.
LanguageModelFeaturizer now implements their behaviour.
To migrate, replace both the above components with any tokenizer and specify the model architecture and model weights
as part of LanguageModelFeaturizer
, for e.g.:
Rasa 1.10 to Rasa 2.0
General
A lot has changed in version 2.0. Make sure you read through this guide thoroughly, to make sure all parts of your bot are updated. A lot of updates can be done automatically with inbuilt commands, others will need some manual conversion. If you have any feedback about these updates or the migration process, please post it in the forum.
Training data files
As of version 2.0, the new default training data format is yaml. Markdown is still supported, but this will be deprecated in Rasa Open Source 3.0.0.
You can convert existing NLU, Stories, and NLG (i.e. responses.md
) training data
files in the Markdown format to the new YAML format using following commands:
Converted files will have the same names as the original ones but with a
_converted.yml
suffix.
If you are using forms or response selectors, some additional changes will need to be made as described in their respective sections.
Policies
With the introduction of rules and the RulePolicy, the following policies are deprecated:
To migrate the policies automatically, you can run the following command:
This command will take care of updating your config.yml
and domain.yml
, while
making backups of your existing files using the .bak
suffix. It will also add a
rules.yml
if necessary.
Your forms will still function as normal in the old format after this update, but this command does not convert them into the new format automatically. This should be done manually, as described in the section on forms.
You can also migrate the individual policies manually, if you don't want to use the automatic conversion command.
Manually migrating from the Mapping Policy
If you previously used the Mapping Policy, you
can follow the documentation on FAQs to convert your mapped
intents to rules. Suppose you previously mapped an intent ask_is_bot
as follows:
This becomes the following rule:
And you can safely remove any triggers:
from your domain:
Finally, you can replace the Mapping Policy with the Rule Policy in your model configuration:
Manually migrating from the Fallback Policy
If you previously used the Fallback Policy, the following model configuration would translate as follows given a previous configuration like this:
The new configuration would then look like:
In addition, you need to add a rule to specify which action to run in case of low NLU confidence:
See the documentation on fallback for more information.
Manually migrating from the Two-Stage-Fallback Policy
If you previously used the Two-Stage-Fallback Policy, with a configuration like this for example:
The new configuration would look like this:
In addition you need to add a rule to activate the Two-Stage Fallback for messages with low NLU confidence.
Note that the previous parameters fallback_nlu_action_name
and
deny_suggestion_intent_name
are no longer configurable and have the fixed values
action_default_fallback
and out_of_scope
.
See the fallback documentation for more information.
Forms
As of version 2.0 the logic for forms has been moved from the Rasa SDK to Rasa Open Source to simplify implementation and make it easier to write action servers in other languages.
This means that forms are no longer implemented using a FormAction
, but instead
defined in the domain. Any customizations around requesting slots or
slot validation can be handled with a FormValidationAction
.
Consider a custom form action from 1.x like this:
Start the migration by removing the FormPolicy and adding the RulePolicy (if not there already) to your model configuration:
Then you need to define the form, required slots and their slot mappings in the domain as described in the documentation on forms:
If you ran the command to convert your stories, you will have a story that handles form activation and deactivation like this:
This will work fine, but the best way to handle form behavior is to remove this story and instead define two separate rules for form activation and submission:
The last step is to implement a custom action to validate the form slots. Start by adding the custom action to your domain:
Then add a custom action which validates the cuisine
slot:
You can also migrate forms from Rasa SDK to Rasa Open Source 2 iteratively. You can for
example migrate one form to the Rasa Open Source 2 implementation while continue using
the deprecated Rasa SDK implementation for another form. To continue to use
the deprecated Rasa SDK FormAction
s, add a custom action with the name of your form to your domain. Note that you should complete the migration as soon as possible as the deprecated FormAction
will be removed from the Rasa SDK in Rasa Open Source 3.
See the forms documentation for more details.
Response Selectors
Response Selectors are a stable feature as of version 2.0.
The conversion command will automatically
convert your responses.md
file, stories and nlu training data to the new yaml format.
It will also take care of adding the utter_
prefix to your responses.
Additionally you will need to rename the respond_
actions in your stories files to use the
utter_
prefix instead. Run the following command to apply these changes:
You can also apply these changes manually. For example:
becomes
and you will need to add the utter_
prefix to the response names in your responses.md
as well. For example:
becomes
Finally, you should remove any actions with the respond_
prefix from the actions
list in your domain.
This behavior will work fine when defined as a story, but even better when defined as a rule. You should consider transferring your retrieval stories to rules. More information on what that looks like in the chitchat and FAQs documentation.
Response Selectors are now trained on retrieval intent labels by default instead
of the actual response text. For most models, this should improve training time
and accuracy of the ResponseSelector
.
If you want to revert to the pre-2.0 default behavior, add the use_text_as_label: true
parameter to your ResponseSelector
component:
The output schema of ResponseSelector
has changed. An example output looks like this:
As a result of this, if you were previously querying for the key full_retrieval_intent
as:
you should instead now do this:
Unfeaturized Slots
Slots of type unfeaturized are
deprecated and will be removed in version 3.0. To ignore slot values during
a conversation, set the influence_conversation
property of the slot to false
.
The following snippet is an example of the deprecated unfeaturized slot usage:
To update this to the new format, you can specify the expected data type text
and
define that the slot should be ignored during the conversation.
If you don't require the slot to have a specific data type, you can use the new slot type any. This slot type is always ignored during a conversation and does not make any assumptions regarding the data type of the slot value.
Please see the updated slots documentation for more information.
Conversation sessions
Conversation sessions are now enabled by default if your Domain does not contain a session configuration. Previously a missing session configuration was treated as if conversation sessions were disabled. You can explicitly disable conversation sessions using the following snippet:
Dialogue Featurization
This section is only relevant if you explicitly defined featurizers in your policy configuration.
LabelTokenizerSingleStateFeaturizer is deprecated and will be removed in the future.
It should be replaced with SingleStateFeaturizer and some changes should be made to the NLU pipeline.
Add a Tokenizer
with the option intent_tokenization_flag: True
and CountVectorsFeaturizer
to the NLU pipeline.
For example:
BinarySingleStateFeaturizer is deprecated and will be removed in the future.
You should replace it with SingleStateFeaturizer
and a NLU pipeline
where intent_tokenization_flag
of a Tokenizer is set to False
.
For example:
Deprecations
The deprecated event brokers FileProducer, KafkaProducer, PikaProducer
and SQLProducer have been removed. If you used these brokers in your
endpoints.yml
make sure to use the renamed variants instead:
- FileProducer became FileEventBroker
- KafkaProducer became KafkaEventBroker
- PikaProducer became PikaEventBroker
- SQLProducer became SQLEventBroker
The deprecated EmbeddingIntentClassifier has been removed. If you used this
component in your pipeline configuration (config.yml
) you can replace it
with DIETClassifier.
It accepts the same configuration parameters.
The deprecated KerasPolicy has been removed. If you used this
component in your policies configuration (config.yml
) you can replace it
with TEDPolicy. It accepts the same configuration parameters.
Rasa 1.7 to Rasa 1.8
caution
This is a release breaking backwards compatibility. It is not possible to load previously trained models. Please make sure to retrain a model before trying to use it with this improved version.
General
The TED Policy replaced the
keras_policy
as recommended machine learning policy. New projects generated withrasa init
will automatically use this policy. In case you want to change your existing model configuration to use the TED Policy add this to thepolicies
section in yourconfig.yml
and remove potentially existingKerasPolicy
entries:policies:# - ... other policies- name: TEDPolicymax_history: 5epochs: 100The given snippet specifies default values for the parameters
max_history
andepochs
.max_history
is particularly important and strongly depends on your stories. Please see the docs of the TED Policy if you want to customize them.All pre-defined pipeline templates are deprecated. Any templates you use will be mapped to the new configuration, but the underlying architecture is the same. Take a look at Tuning Your Model to decide on what components you should use in your configuration file.
The Embedding Policy was renamed to TED Policy. The functionality of the policy stayed the same. Please update your configuration files to use
TEDPolicy
instead ofEmbeddingPolicy
.Most of the model options for
EmbeddingPolicy
,EmbeddingIntentClassifier
, andResponseSelector
got renamed. Please update your configuration files using the following mapping:Old model option New model option hidden_layers_sizes_a dictionary “hidden_layers_sizes” with key “text” hidden_layers_sizes_b dictionary “hidden_layers_sizes” with key “label” hidden_layers_sizes_pre_dial dictionary “hidden_layers_sizes” with key “dialogue” hidden_layers_sizes_bot dictionary “hidden_layers_sizes” with key “label” num_transformer_layers number_of_transformer_layers num_heads number_of_attention_heads max_seq_length maximum_sequence_length dense_dim dense_dimension embed_dim embedding_dimension num_neg number_of_negative_examples mu_pos maximum_positive_similarity mu_neg maximum_negative_similarity use_max_sim_neg use_maximum_negative_similarity C2 regularization_constant C_emb negative_margin_scale droprate_a droprate_dialogue droprate_b droprate_label evaluate_every_num_epochs evaluate_every_number_of_epochs evaluate_on_num_examples evaluate_on_number_of_examples Old configuration options will be mapped to the new names, and a warning will be thrown. However, these will be deprecated in a future release.
The Embedding Intent Classifier is now deprecated and will be replaced by DIETClassifier in the future.
DIETClassfier
performs intent classification as well as entity recognition. If you want to get the same model behavior as the currentEmbeddingIntentClassifier
, you can use the following configuration ofDIETClassifier
:pipeline:# - ... other components- name: DIETClassifierhidden_layers_sizes:text: [256, 128]number_of_transformer_layers: 0weight_sparsity: 0intent_classification: Trueentity_recognition: Falseuse_masked_language_model: FalseBILOU_flag: False# ... any other parametersSee DIETClassifier for more information about the new component. Specifying
EmbeddingIntentClassifier
in the configuration maps to the above component definition, the behavior is unchanged from previous versions.CRFEntityExtractor
is now deprecated and will be replaced byDIETClassifier
in the future. If you want to get the same model behavior as the currentCRFEntityExtractor
, you can use the following configuration:pipeline:# - ... other components- name: LexicalSyntacticFeaturizerfeatures: [["low", "title", "upper"],["BOS","EOS","low","prefix5","prefix2","suffix5","suffix3","suffix2","upper","title","digit",],["low", "title", "upper"],]- name: DIETClassifierintent_classification: Falseentity_recognition: Trueuse_masked_language_model: Falsenumber_of_transformer_layers: 0# ... any other parametersCRFEntityExtractor
featurizes user messages on its own, it does not depend on any featurizer. We extracted the featurization from the component into the new featurizer LexicalSyntacticFeaturizer. Thus, in order to obtain the same results as before, you need to add this featurizer to your pipeline before the DIETClassifier. SpecifyingCRFEntityExtractor
in the configuration maps to the above component definition, the behavior is unchanged from previous versions.If your pipeline contains
CRFEntityExtractor
andEmbeddingIntentClassifier
you can substitute both components with DIETClassifier. You can use the following pipeline for that:pipeline:# - ... other components- name: LexicalSyntacticFeaturizerfeatures: [["low", "title", "upper"],["BOS","EOS","low","prefix5","prefix2","suffix5","suffix3","suffix2","upper","title","digit",],["low", "title", "upper"],]- name: DIETClassifiernumber_of_transformer_layers: 0# ... any other parameters
Rasa 1.6 to Rasa 1.7
General
- By default, the
EmbeddingIntentClassifier
,EmbeddingPolicy
, andResponseSelector
will now normalize the top 10 confidence results if theloss_type
is"softmax"
(which has been default since 1.3, see Rasa 1.2 to Rasa 1.3). This is configurable via theranking_length
configuration parameter; to turn off normalization to match the previous behavior, setranking_length: 0
.
Rasa 1.2 to Rasa 1.3
caution
This is a release breaking backwards compatibility. It is not possible to load previously trained models. Please make sure to retrain a model before trying to use it with this improved version.
General
Default parameters of
EmbeddingIntentClassifier
are changed. See the Components page for details. Architecture implementation is changed as well, so old trained models cannot be loaded. Default parameters and architecture forEmbeddingPolicy
are changed. See Policies for details. It uses transformer instead of lstm. Old trained models cannot be loaded. They useinner
similarity andsoftmax
loss by default instead ofcosine
similarity andmargin
loss (can be set in config file). They usebalanced
batching strategy by default to counteract class imbalance problem. The meaning ofevaluate_on_num_examples
is changed. If it is non zero, random examples will be picked by stratified split and used as hold out validation set, so they will be excluded from training data. We suggest to set it to zero (default) if data set contains a lot of unique examples of dialogue turns. Removedlabel_tokenization_flag
andlabel_split_symbol
from component. Instead moved intent splitting toTokenizer
components viaintent_tokenization_flag
andintent_split_symbol
flag.Default
max_history
forEmbeddingPolicy
isNone
which means it'll use theFullDialogueTrackerFeaturizer
. We recommend to setmax_history
to some finite value in order to useMaxHistoryTrackerFeaturizer
for faster training. See Featurizers for details. We recommend to increasebatch_size
forMaxHistoryTrackerFeaturizer
(e.g."batch_size": [32, 64]
)Compare mode of
rasa train core
allows the whole core config comparison. Therefore, we changed the naming of trained models. They are named by config file name instead of policy name. Old naming style will not be read correctly when creating compare plots (rasa test core
). Please remove old trained models in comparison folder and retrain. Normal core training is unaffected.We updated the evaluation metric for our NER. We report the weighted precision and f1-score. So far we included
no-entity
in this report. However, as most of the tokens actually don't have an entity set, this will influence the weighted precision and f1-score quite a bit. From now on we excludeno-entity
from the evaluation. The overall metrics now only include proper entities. You might see a drop in the performance scores when running the evaluation again./
is reserved as a delimiter token to distinguish between retrieval intent and the corresponding response text identifier. Make sure you don't include/
symbol in the name of your intents.
Rasa NLU 0.14.x and Rasa Core 0.13.x to Rasa 1.0
caution
This is a release breaking backwards compatibility. It is not possible to load previously trained models. Please make sure to retrain a model before trying to use it with this improved version.
General
The scripts in
rasa.core
andrasa.nlu
can no longer be executed. To train, test, run, … an NLU or Core model, you should now use the command line interfacerasa
. The functionality is, for the most part, the same as before. Some changes in commands reflect the combined training and running of NLU and Core models, but NLU and Core can still be trained and used individually. If you attempt to run one of the old scripts inrasa.core
orrasa.nlu
, an error is thrown that points you to the command you should use instead. See all the new commands at Command Line Interface.If you have written a custom output channel, all
send_
methods subclassed from theOutputChannel
class need to take an additional\*\*kwargs
argument. You can use these keyword args from your custom action code or the templates in your domain file to send any extra parameters used in your channel's send methods.If you were previously importing the
Button
orElement
classes fromrasa_core.dispatcher
, these are now to be imported fromrasa_sdk.utils
.Rasa NLU and Core previously used separate configuration files. These two files should be merged into a single file either named
config.yml
, or passed via the--config
parameter.
Script parameters
All script parameter names have been unified to follow the same schema. Any underscores (
_
) in arguments have been replaced with dashes (-
). For example:--max_history
has been changed to--max-history
. You can see all of the script parameters in the--help
output of the commands in the Command Line Interface.The
--num_threads
parameter was removed from therun
command. The server will always run single-threaded, but will now run asynchronously. If you want to make use of multiple processes, feel free to check out the Sanic server documentation.To avoid conflicts in script parameter names, connectors in the
run
command now need to be specified with--connector
, as-c
is no longer supported. The maximum history in therasa visualize
command needs to be defined with--max-history
. Output paths and log files cannot be specified with-o
anymore;--out
and--log-file
should be used. NLU data has been standarized to be--nlu
and the name of any kind of data files or directory to be--data
.
HTTP API
- There are numerous HTTP API endpoint changes which can be found here.