  1. Nov 04, 2022
  2. Nov 02, 2022
  3. Oct 04, 2022
  4. Feb 14, 2022
    • Remove bslurp module · 2f6a8fa2
      Mark Goddard authored
      This module was used in the Ceph deployment support, which was removed
      long ago.
      
      Change-Id: I599ef47199bc68e8f5cf62709157d3f096ac68a9
      2f6a8fa2
  5. Jan 06, 2022
  6. Sep 23, 2021
  7. May 30, 2021
  8. Apr 27, 2021
    • Check config when checking the containers · c3afbd3c
      Radosław Piliszek authored
      The proposed approach allows for checking whether config
      files are current, e.g. when the deployment was aborted after
      config files were generated but before they were injected into the
      containers, which leads to old config staying in the containers.
      
      After this patch we can do:
        kolla-ansible genconfig
        kolla-ansible deploy-containers
      and it would do what we expected rather than being a noop
      in the second part.
      
      We also lose the need to have notifies
      and whens in config and handler sections respectively.
      This is optimised in a separate patch.
      
      Future work:
      - optimise for large files
        - could we get away with comparing timestamps and sizes?
          the container's copy should have a newer timestamp due to the
          copy; we could also preserve it
      
      Change-Id: I1d26e48e1958f13b854d8afded4bfba5021a2dec
      Closes-Bug: #1848775
      Depends-On: https://review.opendev.org/c/openstack/kolla/+/773257
      
      
      Co-Authored-By: Mark Goddard <mark@stackhpc.com>
      c3afbd3c
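The check described in this commit can be illustrated as a content comparison between a generated config file and the copy that was deployed. This is only a minimal sketch under stated assumptions: the names `file_digest` and `config_is_current` are hypothetical, it compares two local paths rather than files inside a container, and it does not claim to reflect how the module itself performs the check.

```python
import hashlib
from pathlib import Path


def file_digest(path):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def config_is_current(generated, deployed):
    """Return True if the deployed copy exists and matches the generated file.

    Hypothetical helper: a missing deployed file counts as "not current".
    """
    if not Path(deployed).is_file():
        return False
    return file_digest(generated) == file_digest(deployed)
```

Hashing whole files is the simple option; the "Future work" note in the commit message suggests timestamps and sizes as a cheaper comparison for large files.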
  9. Dec 16, 2020
  10. Oct 04, 2020
    • Coordinate haproxy and keepalived restarts · c2d0bf30
      Radosław Piliszek authored
      Keepalived and haproxy cooperate to provide control plane HA in
      kolla-ansible deployments.
      Certain care should be exerted to avoid prolonged availability
      loss during reconfigurations and upgrades.
      This patch aims to provide this care.
      There is nothing special about keepalived upgrade compared to
      reconfig, hence it is simplified to run the same code as for
      deploy.
      The broken safe-upgrade logic is replaced by common handler
      code whose goal is to ensure we take down the current master only
      after we have backups ready.
      
      This change introduces a switch in the kolla_docker module, named
      ignore_missing, that allows missing containers to be ignored (as they
      are logically stopped).
      All tests are included.
      
      Change-Id: I22ddec5f7ee4a7d3d502649a158a7e005fe29c48
      c2d0bf30
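The idea behind `ignore_missing` can be sketched as follows: stopping a container that does not exist is treated as a no-op rather than an error. Everything except the switch name is an assumption made for illustration; `ContainerNotFound`, `InMemoryClient`, and `stop_container` are hypothetical and are not the kolla_docker implementation.

```python
class ContainerNotFound(Exception):
    """Raised by the hypothetical client when a container does not exist."""


class InMemoryClient:
    """Stand-in for a container runtime client, used only for illustration."""

    def __init__(self, running):
        self.running = set(running)

    def stop(self, name):
        if name not in self.running:
            raise ContainerNotFound(name)
        self.running.remove(name)


def stop_container(client, name, ignore_missing=False):
    """Stop a container; return True only if something was actually stopped."""
    try:
        client.stop(name)
    except ContainerNotFound:
        if ignore_missing:
            # A missing container is logically already stopped.
            return False
        raise
    return True
```

Returning whether anything was stopped lets a caller report "changed" accurately when the container was already absent.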
  11. Sep 30, 2020
  12. May 17, 2020
    • Enable W503 for flake8 check · 22f7aecb
      gugug authored
      W503 and W504 are incompatible and we need to choose one of them.
      Existing code follows W503, so we disable W504.
      
      Change-Id: Ic745e956dd332eb0fa49b93c1e6acb12f8a7f26c
      22f7aecb
  13. Apr 04, 2020
    • Update hacking for Python3 · 45448976
      Andreas Jaeger authored
      The repo is Python 3 now, so update hacking to version 3.0 which
      supports Python 3.
      
      Fix problems found by updated hacking version.
      
      Remove hacking and friends from lower-constraints, they are not needed
      during installation.
      
      Change-Id: I7ef5ac8a89e94f5da97780198619b6facc86ecfe
      45448976
  14. Feb 11, 2020
  15. Jan 06, 2020
  16. Nov 28, 2019
    • Support configuration of Docker client timeout · 01050dc0
      Mark Goddard authored
      Adds support for configuration of the Docker client timeout via
      'docker_client_timeout'.
      
      This change also increases the default timeout to 120 seconds, as we
      sometimes see timeouts in CI and heavily loaded or underpowered
      environments. Increasing 'docker_client_timeout' further may be helpful
      in cases where Docker reports 'Read timed out'.
      
      Change-Id: I73745771078cb2c0ebae2b1d87ba2c4c12958d82
      Closes-Bug: #1809844
      01050dc0
  17. Nov 14, 2019
    • Attempt to pull image before stopping and removing container · 64d07c0b
      Mark Goddard authored
      Steps to reproduce:
      
      * Deploy services using kolla-ansible deploy
      * Reconfigure the image for one or more services to use an invalid
        config
      * Deploy/reconfigure services using kolla-ansible reconfigure
      
      The invalid config could be a wrong docker registry, wrong image name,
      wrong tag, etc.
      
      Expected result: the restart handler for the service fails, and the old
      container is left running.
      
      Actual result: the restart handler for the service fails, and the old
      container is stopped and removed. This leaves the service in a broken
      state.
      
      This change fixes the issue by pulling the image if necessary prior to
      stopping and removing the container.
      
      Change-Id: I85b2a1b224d4c4d85c32c4922a2cd2c41171a1dc
      Closes-Bug: #1852572
      64d07c0b
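The ordering this commit describes (pull first, then stop and remove) can be sketched with a stand-in client. The names `ImagePullError`, `RecordingClient`, and `recreate_container` are hypothetical and exist only to show why a failed pull leaves the old container untouched; this is not the module's actual code.

```python
class ImagePullError(Exception):
    """Raised by the hypothetical client when an image cannot be pulled."""


class RecordingClient:
    """Stand-in client that records the order of calls it receives."""

    def __init__(self, fail_pull=False):
        self.fail_pull = fail_pull
        self.calls = []

    def pull(self, image):
        self.calls.append("pull")
        if self.fail_pull:
            raise ImagePullError(image)

    def stop(self, name):
        self.calls.append("stop")

    def remove(self, name):
        self.calls.append("remove")

    def start(self, name, image):
        self.calls.append("start")


def recreate_container(client, name, image):
    """Recreate a container, pulling the new image before touching the old one."""
    # If the pull raises, nothing below runs and the old container keeps running.
    client.pull(image)
    client.stop(name)
    client.remove(name)
    client.start(name, image)
```

With a bad registry, image name, or tag, the exception from `pull` propagates before `stop` or `remove` is ever called.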
  18. Oct 01, 2019
    • Add service-rabbitmq role · 039cc2be
      Mark Goddard authored
      This role can be used by other roles to register RabbitMQ resources.
      Currently support is provided for creating virtual hosts and users.
      
      Change-Id: Ie1774a10b4d629508584af679b8aa9e372847804
      Partially Implements: blueprint support-nova-cells
      Depends-On: https://review.opendev.org/684742
      039cc2be
  19. Sep 30, 2019
  20. Sep 16, 2019
    • Catch errors and changes in kolla_toolbox module · 70b515bf
      Mark Goddard authored
      The kolla_toolbox Ansible module executes ad-hoc ansible commands in the
      kolla_toolbox container, and parses the output to make it look as if
      ansible-playbook executed the command. Currently, however, this module
      sometimes fails to catch failures of the underlying command, and also
      sometimes shows tasks as 'ok' when the underlying command made a change.
      This has been tested both before and after the upgrade to ansible 2.8.
      
      This change fixes this issue by configuring ansible to emit output in
      JSON format, to make parsing simpler. We can now pick up errors and
      changes, and signal them to the caller.
      
      This change also adds an ansible playbook, tests/test-kolla-toolbox.yml,
      that can be executed to test the module. It's not currently integrated
      with any CI jobs.
      
      Note that this change cannot be backported as the JSON output callback
      plugin was added in Ansible 2.5.
      
      Change-Id: I8236dd4165f760c819ca972b75cbebc62015fada
      Closes-Bug: #1844114
      70b515bf
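Parsing JSON output to detect errors and changes, as this commit describes, might look like the sketch below. The document shape used here (a top-level `plays` list whose `tasks` carry per-host results with `failed`, `unreachable`, and `changed` keys) is an assumption about the JSON callback output, and `summarise` is a hypothetical helper, not the module's parser.

```python
import json


def summarise(raw_output, host="localhost"):
    """Collapse JSON callback output into failed/changed flags for one host.

    Assumed input shape:
    {"plays": [{"tasks": [{"hosts": {"<host>": {...result...}}}]}]}
    """
    data = json.loads(raw_output)
    failed = False
    changed = False
    for play in data.get("plays", []):
        for task in play.get("tasks", []):
            result = task.get("hosts", {}).get(host, {})
            if result.get("failed") or result.get("unreachable"):
                failed = True
            if result.get("changed"):
                changed = True
    return {"failed": failed, "changed": changed}
```

Reading structured fields avoids the ambiguity of matching free-form text, which is the motivation the commit message gives for switching to JSON.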
  21. Aug 22, 2019
  22. Aug 07, 2019
  23. Aug 05, 2019
    • ceph: fixes to deployment and upgrade · 826f6850
      Radosław Piliszek authored
      1) ceph-nfs (ganesha-ceph) - use NFSv4 only
      This is recommended upstream.
      v3 and UDP require portmapper (aka rpcbind) which we
      do not want, except where Ubuntu ganesha version (2.6)
      forces it by requiring enabled UDP, see [1].
      The issue has been fixed in 2.8, included in CentOS.
      Additionally disable v3 helper protocols and kerberos
      to avoid meaningless warnings.
      
      2) ceph-nfs (ganesha-ceph) - do not export host dbus
      It is not in use. This avoids the temptation to try
      handling it on host.
      
      3) Properly handle ceph services deploy and upgrade
      Upgrade runs deploy.
      The order has been corrected - nfs goes after mds.
      Additionally upgrade takes care of rgw for keystone
      (for swift emulation).
      
      4) Enhance ceph keyring module with error detection
      Now it does not blindly try to create a keyring after
      any failure. This used to hide the real issue.
      
      5) Retry ceph admin keyring update until cluster works
      Reordering the deployment caused an issue with the ceph cluster not
      being fully operational before actions were taken on it.
      
      6) CI: Remove osd df from collected logs as it may hang CI
      Hangs are caused by healthy MON and no healthy MGR.
      A descriptive note is left in its place.
      
      7) CI: Add 5s timeout to ceph informational commands
      This decreases the timeout from the default 300s.
      
      [1] https://review.opendev.org/669315
      
      
      
      Change-Id: I1cf0ad10b80552f503898e723f0c4bd00a38f143
      Signed-off-by: Radosław Piliszek <radoslaw.piliszek@gmail.com>
      826f6850
  24. Jul 18, 2019
    • Fix handling of docker restart policy · 6a737b19
      Radosław Piliszek authored
      Docker has no restart policy named 'never'. It has 'no'.
      This has bitten us already (see [1]) and might bite us again whenever
      we want to change the restart policy to 'no'.
      
      This patch makes our docker integration honor all valid restart policies
      and only valid restart policies.
      All relevant docker restart policy usages are patched as well.
      
      I added some FIXMEs that are relevant to the kolla-ansible docker
      integration. They are not fixed here so as not to alter behavior.
      
      [1] https://review.opendev.org/667363
      
      
      
      Change-Id: I1c9764fb9bbda08a71186091aced67433ad4e3d6
      Signed-off-by: Radosław Piliszek <radoslaw.piliszek@gmail.com>
      6a737b19
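Honoring "all valid restart policies and only valid restart policies" can be sketched as a simple whitelist check. The four policy names are Docker's own; the constant and function names are hypothetical and not taken from the module.

```python
# Restart policy names accepted by Docker. Note there is no 'never'.
VALID_RESTART_POLICIES = ("no", "on-failure", "always", "unless-stopped")


def validate_restart_policy(policy):
    """Return the policy unchanged if Docker accepts it, else raise ValueError."""
    if policy not in VALID_RESTART_POLICIES:
        raise ValueError(
            "invalid restart policy %r, expected one of: %s"
            % (policy, ", ".join(VALID_RESTART_POLICIES))
        )
    return policy
```

Failing early on an unknown name such as `never` surfaces the mistake at validation time instead of silently applying a different policy.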
  25. Jun 07, 2019
  26. Mar 10, 2019
    • Fix kolla-docker possible undefined variable · 3bb4c199
      Victor Coutellier authored
      It is possible to reference an undefined variable in the kolla-docker
      module if DockerWorker object initialization fails. The current
      behaviour then crashes the playbook with this unwanted error message:
      
      UnboundLocalError: local variable 'dw' referenced before assignment
      
      Change-Id: Ic8d26b11f93255220888b5406f8ab4a6f81736c2
      Closes-Bug: #1819361
      3bb4c199
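The failure mode in this commit is a general Python pattern: a name assigned inside `try` is used in the `except` branch even though the assignment never happened. The sketch below reproduces it with a hypothetical `Worker` class standing in for the real object; `run_buggy` and `run_fixed` are illustrative names, not the module's functions.

```python
class Worker:
    """Hypothetical stand-in for an object whose constructor may fail."""

    def __init__(self, fail=False):
        if fail:
            raise RuntimeError("cannot connect")
        self.result = {"changed": False}


def run_buggy(fail):
    try:
        dw = Worker(fail)
        return dw.result
    except Exception as exc:
        # If Worker() raised, 'dw' was never bound, so reading it here raises
        # UnboundLocalError and hides the original error.
        return {"failed": True, "msg": repr(exc), **dw.result}


def run_fixed(fail):
    dw = None  # bind the name before the try block
    try:
        dw = Worker(fail)
        return dw.result
    except Exception as exc:
        result = {"failed": True, "msg": repr(exc)}
        if dw is not None:
            result.update(dw.result)
        return result
```

Binding the name to `None` up front keeps the error handler usable whether or not construction succeeded.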
  27. Jan 24, 2019
    • add ulimit support for kolla_docker · 3d3f5f16
      binhong.hua authored
      By default, docker containers inherit their ulimits from the limits of
      the docker daemon. On CentOS 7, the docker daemon's default NOFILE is
      1048576. It can be found in /usr/lib/systemd/system/docker.service.
      Such a large limit can cause many problems, so we should be able to
      control it in production environments.
      
      Change-Id: Iab962446a94ef092977728259d9818b86cfa7f68
      3d3f5f16
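As an illustration of per-container ulimit control, the sketch below converts a simple mapping into the list-of-objects form (`Name`, `Soft`, `Hard`) that the Docker Engine API uses for ulimits. The input shape and the function name `to_api_ulimits` are assumptions made for this example; they are not taken from the module.

```python
def to_api_ulimits(ulimits):
    """Convert {"nofile": {"soft": 1024, "hard": 4096}} into Docker API form.

    Hypothetical helper: a missing 'hard' value defaults to the soft value.
    """
    converted = []
    for name in sorted(ulimits):
        limit = ulimits[name]
        soft = int(limit["soft"])
        hard = int(limit.get("hard", soft))
        if soft > hard:
            raise ValueError("soft limit for %s exceeds hard limit" % name)
        converted.append({"Name": name, "Soft": soft, "Hard": hard})
    return converted
```

Sorting by name gives a stable order, which makes it easier to compare a requested configuration with what a running container reports.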
  28. Dec 27, 2018
    • Update hacking version · b41309ef
      confi-surya authored
      Use the latest release, 1.1.0, and make compatible changes w.r.t. pep8
      
      Change-Id: Ie5e5417a1d8481bb59d516a320bd7cc1dd89591e
      b41309ef
  29. Dec 14, 2018
    • Create cells before starting nova services · 365bb517
      Mark Goddard authored
      Nova services may reasonably expect cell databases to exist when they
      start. The current cell setup tasks in kolla run after the nova
      containers have started, meaning that cells may or may not exist in the
      database when they start, depending on timing. In particular, we are
      seeing issues in kolla CI currently with jobs timing out waiting for
      nova compute services to start. The following error is seen in the nova
      logs of these jobs, which may or may not be relevant:
      
      No cells are configured, unable to continue
      
      This change creates the cell0 and cell1 databases prior to starting nova
      services.
      
      In order to do this, we must create new containers in which to run the
      nova-manage commands, because the nova-api container may not yet exist.
      This required adding support to the kolla_docker module for specifying a
      command for the container to run that overrides the image's command.
      
      We also add the standard output and error to the module's result when a
      non-detached container is run. A secondary benefit of this is that the
      output of bootstrap containers is now displayed in the Ansible output if
      the bootstrapping command fails, which will help with debugging.
      
      Change-Id: I2c1e991064f9f588f398ccbabda94f69dc285e61
      Closes-Bug: #1808575
      365bb517
  30. Dec 05, 2018
    • Allow set tty for containers · 846c15d8
      Eduardo Gonzalez authored
      This change adds support for configuring tty.
      It was enabled by default, but a recent patch
      removed it. Some services, such as Karaf in OpenDaylight,
      require a TTY during startup.
      
      Closes-Bug: #1806662
      Change-Id: Ia4335523b727d0e45505cbb1efb40ccf04c27db7
      846c15d8
  31. Nov 03, 2018
  32. Oct 31, 2018
  33. Sep 14, 2018
  34. Aug 09, 2018
  35. Jul 25, 2018
  36. Jul 12, 2018
  37. Apr 26, 2018
    • Increase timeout for kolla_toolbox module. · 044e82cc
      Jorge Niedbalski authored
      
      This patch increases the default timeout for
      the kolla_toolbox ansible module when talking
      with the docker API from the default 60 to 180 secs.
      
      This is required on slower deployments,
      especially when bootstrapping an environment in which fernet
      tokens are in use. For faster deployments this will
      be harmless, but for slower deployments it will be
      beneficial.
      
      Bug: #1767136
      Change-Id: I0391715b16cf86d6c27fecf8a666de64f2735a7d
      Signed-off-by: Jorge Niedbalski <jorge.niedbalski@linaro.org>
      044e82cc
  38. Apr 06, 2018
    • Fix ipc_mode comparison in the latest docker · 193ce726
      wangwei authored
      In old docker, if you do not specify ipc_mode, the default value is empty,
      but in the latest docker, such as 17.09.0, if not specified, the default
      is "IpcMode": "shareable", which causes all containers to be deleted
      and re-created on redeploy or upgrade. This commit solves the
      problem.
      
      Change-Id: Ia8269b9c8066880e4aee23d6fdea8d9c04c41e44
      Closes-Bug: #1747586
      193ce726
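One way to avoid the spurious difference described in this commit is to treat an unspecified `ipc_mode` as matching whatever default the daemon reports. The sketch below is an assumption about such a comparison, using only the two default values named in the commit message; `ipc_mode_differs` is a hypothetical function and not the actual patch.

```python
# Values a daemon may report when no ipc_mode was requested, per the commit
# message: empty on older docker, "shareable" on e.g. 17.09.0.
DEFAULT_IPC_MODES = ("", "shareable")


def ipc_mode_differs(requested, current):
    """Return True if the container must be recreated because of ipc_mode."""
    current = current or ""
    if not requested:
        # Nothing was requested, so any reported default counts as a match.
        return current not in DEFAULT_IPC_MODES
    return requested != current
```

With this comparison, a container created without an explicit `ipc_mode` is not flagged as changed merely because a newer daemon reports `shareable`.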
  39. Mar 30, 2018