-
Rafael Weingärtner authored
Ansible facts can have a large impact on the performance of the Ansible control host. This patch introduces some control over which facts are gathered (kolla_ansible_setup_gather_subset) and which facts are stored (kolla_ansible_setup_filter). By default we do not change the default values of these arguments to the setup module. The flexibility of these arguments is limited, but they do provide enough for a large performance improvement in a typical moderate to large OpenStack cloud. In particular, the large complex dict fact for each interface has a large effect, and on an OpenStack controller or hypervisor there may be many virtual interfaces. We can use the kolla_ansible_setup_filter variable to help: kolla_ansible_setup_filter: 'ansible_[!qt]*' This causes Ansible to collect but not store facts matching that pattern, which includes the virtual interface facts. Currently we are not referencing other facts matching the pattern within Kolla Ansible. Note that including the 'ansible_' prefix causes meta facts module_setup and gather_subset to be filtered, but this seems to be the only way to get a good match on the interface facts. To work around this, we use ansible_facts rather than module_setup to detect whether facts exist in the cache. The exact improvement will vary, but has been reported to be as large as 18x on systems with many virtual interfaces. For reference, here are some other tunings tried: * Increased the number of forks (great speedup depending of the size of the deployment) * Use `strategy = mitogen_linear` (cut processing time in half) * Ansible caching (little speed up) * SSH tunning (little speed up) Co-Authored-By:
Mark Goddard <mark@stackhpc.com> Closes-Bug: #1921538 Change-Id: Iae8ca4aae945892f1dc65e1b10381d2e26e88805
Rafael Weingärtner authoredAnsible facts can have a large impact on the performance of the Ansible control host. This patch introduces some control over which facts are gathered (kolla_ansible_setup_gather_subset) and which facts are stored (kolla_ansible_setup_filter). By default we do not change the default values of these arguments to the setup module. The flexibility of these arguments is limited, but they do provide enough for a large performance improvement in a typical moderate to large OpenStack cloud. In particular, the large complex dict fact for each interface has a large effect, and on an OpenStack controller or hypervisor there may be many virtual interfaces. We can use the kolla_ansible_setup_filter variable to help: kolla_ansible_setup_filter: 'ansible_[!qt]*' This causes Ansible to collect but not store facts matching that pattern, which includes the virtual interface facts. Currently we are not referencing other facts matching the pattern within Kolla Ansible. Note that including the 'ansible_' prefix causes meta facts module_setup and gather_subset to be filtered, but this seems to be the only way to get a good match on the interface facts. To work around this, we use ansible_facts rather than module_setup to detect whether facts exist in the cache. The exact improvement will vary, but has been reported to be as large as 18x on systems with many virtual interfaces. For reference, here are some other tunings tried: * Increased the number of forks (great speedup depending of the size of the deployment) * Use `strategy = mitogen_linear` (cut processing time in half) * Ansible caching (little speed up) * SSH tunning (little speed up) Co-Authored-By:
Mark Goddard <mark@stackhpc.com> Closes-Bug: #1921538 Change-Id: Iae8ca4aae945892f1dc65e1b10381d2e26e88805
Ansible tuning
In this section we cover some options for tuning Ansible for performance and scale.
SSH pipelining
SSH pipelining is disabled in Ansible by default, but is generally safe to enable, and provides a reasonable performance improvement.
Forks
By default Ansible executes tasks using a fairly conservative 5 process forks. This limits the parallelism that allows Ansible to scale. Most Ansible control hosts will be able to handle far more forks than this. You will need to experiment to find out the CPU, memory and IO limits of your machine.
For example, to increase the number of forks to 20:
Fact caching
By default, Ansible gathers facts for each host at the beginning of every play,
unless gather_facts
is set to false
. With a large number of hosts this
can result in a significant amount of time spent gathering facts.
One way to improve this is through Ansible's support for fact caching.
In order to make this work with Kolla Ansible, it is necessary to change
Ansible's gathering
configuration option to smart
.
Example
In the following example we configure Kolla Ansible to use fact caching using the jsonfile cache plugin.
You may also wish to set the expiration timeout for the cache via [defaults]
fact_caching_timeout
.
Fact variable injection
By default, Ansible injects a variable for every fact, prefixed with
ansible_
. This can result in a large number of variables for each host,
which at scale can incur a performance penalty. Ansible provides a
configuration option
that can be set to False
to prevent this injection of facts. In this case,
facts should be referenced via ansible_facts.<fact>
. In recent releases of
Kolla Ansible, facts are referenced via ansible_facts
, allowing users to
disable fact variable injection.
Fact filtering
Ansible facts filtering can be used to speed up Ansible. Environments with
many network interfaces on the network and compute nodes can experience very
slow processing with Kolla Ansible. This happens due to the processing of the
large per-interface facts with each task. To avoid storing certain facts, we
can use the kolla_ansible_setup_filter
variable, which is used as the
filter
argument to the setup
module. For example, to avoid collecting
facts for virtual interfaces beginning with q or t:
kolla_ansible_setup_filter: "ansible_[!qt]*"
This causes Ansible to collect but not store facts matching that pattern, which
includes the virtual interface facts. Currently we are not referencing other
facts matching the pattern within Kolla Ansible. Note that including the
ansible_
prefix causes meta facts module_setup
and gather_subset
to
be filtered, but this seems to be the only way to get a good match on the
interface facts.
The exact improvement will vary, but has been reported to be as large as 18x on systems with many virtual interfaces.
Fact gathering subsets
It is also possible to configure which subsets of facts are gathered, via
kolla_ansible_setup_gather_subset
, which is used as the gather_subset
argument to the setup
module. For example, if one wants to avoid collecting
facts via facter:
kolla_ansible_setup_gather_subset: "all,!facter"