Skip to content
Snippets Groups Projects
  1. May 16, 2024
  2. Jul 21, 2023
    • Doug Szumski's avatar
      Fix OpenSearch Dashboards health check · bacd6c7f
      Doug Szumski authored
      The OpenSearch Dashboards container does not have a health
      check defined when created. This causes the container to always
      restart when reconfigured, even if no change has been made.
      
      Change-Id: I0b437a77aeb61bc5ae9238f900a1fa00cbc34e18
      Partial-Bug: #2028362
      bacd6c7f
  3. Jun 28, 2023
  4. Jun 21, 2023
  5. Jun 20, 2023
    • Dawud's avatar
      Use friendly prometheus instance labels · eef3ff30
      Dawud authored
      
      Replaces the instance label on prometheus metrics with the inventory
      hostname as opposed to the ip address. The ip address is still used as
      the target address which means that there is no issue of the hostname
      being unresolvable. Can be optionally enabled or set to FQDNs by
      changing the prometheus_instance_label variable as mentioned in the
      release notes.
      
      Co-Authored-By: default avatarWill Szumski <will@stackhpc.com>
      Change-Id: I387c9d8f5c01baf6054381834ecf4e554d0fff35
      eef3ff30
  6. Jun 17, 2023
  7. Jun 14, 2023
    • Michal Arbet's avatar
      Add support for multiple ceph files · fdf2385f
      Michal Arbet authored
      This patch is adding a feature for an option to copy different
      ceph configuration files and corresponding keyrings for cinder,
      glance, manila, gnocchi and nova services.
      
      This is especially useful when the deployment uses availability
      zones as below example.
      
        - Individual compute can read/write to individual ceph
          cluster in same AZ.
        - Cinder can write to several ceph clusters in several AZs.
        - Glance can use multistore and upload images to
          several ceph clusters in several AZs at once.
      
      Change-Id: Ie4d8ab5a3df748137835cae1c943b9180cd10eb1
      fdf2385f
  8. Jun 12, 2023
  9. Jun 07, 2023
  10. May 31, 2023
    • Maksim Malchuk's avatar
      Fix passwords.yml permissions · 5fd81170
      Maksim Malchuk authored
      
      The kolla-genpwd, kolla-mergepwd, kolla-readpwd and kolla-writepwd
      commands now creates or updates passwords.yml with correct
      permissions. Also they display warning message about incorrect
      permissions.
      
      Closes-Bug: #2018338
      Change-Id: I4b50053ced9150499d1d09fd4a0ec2e243cf938b
      Signed-off-by: default avatarMaksim Malchuk <maksim.malchuk@gmail.com>
      5fd81170
  11. May 26, 2023
    • OpenStack Release Bot's avatar
      Update master for stable/2023.1 · b26d25eb
      OpenStack Release Bot authored
      Add file to the reno documentation build to show release notes for
      stable/2023.1.
      
      Use pbr instruction to increment the minor version number
      automatically so that master versions are higher than the versions on
      stable/2023.1.
      
      Sem-Ver: feature
      Change-Id: I870c0569a1e175ac5df59fc495812ba81c5147e6
      b26d25eb
  12. May 19, 2023
  13. May 18, 2023
  14. May 16, 2023
    • Sean Mooney's avatar
      always add service_user section to nova.conf · ddadaa28
      Sean Mooney authored
      
      As of I3629b84d3255a8fe9d8a7cea8c6131d7c40899e8 nova
      now requires the service_user section to be configured
      to address CVE-2023-2088. This change adds
      the service user section to the nova.conf template in
      the nova and nova-cell roles.
      
      Related-Bug: #2004555
      Signed-off-by: default avatarSven Kieske <kieske@osism.tech>
      Change-Id: I2189dafca070accfd8efcd4b8cc4221c6decdc9f
      (cherry picked from commit a77ea13ef1991543df29b7eea14b1f91ef26f858)
      (cherry picked from commit 03c12abbcc107bfec451f4558bc97d14facae01c)
      (cherry picked from commit cb105dc293ff1cdb11ab63fa3e3bf39fd17e0ee0)
      (cherry picked from commit efe6650d09441b02cf93738a94a59723d84c5b19)
      ddadaa28
  15. May 04, 2023
  16. Apr 26, 2023
  17. Apr 20, 2023
  18. Apr 17, 2023
    • Michal Arbet's avatar
      Configure coordination in default for masakari-api · 842adf6d
      Michal Arbet authored
      This patch introduces distributed lock for masakari-api
      service when handle the concurrent notifications for the same
      host failure from multiple masakari-hostmonitor services.
      
      Change-Id: I46985202dc8da22601357eefe2727599e7a413e5
      842adf6d
  19. Apr 13, 2023
    • Michal Nasiadka's avatar
      ansible: bump min to 2.13 and max to 2.14 · 9867060b
      Michal Nasiadka authored
      Change-Id: Ibc9cc91f64b0450de3cae6e2830b4ff2c52c0395
      9867060b
    • Matt Crees's avatar
      Remove RabbitMQ ha-all policy when not required · c85b64d1
      Matt Crees authored
      With the addition of the variable
      `om_enable_rabbitmq_high_availability`, this feature in the upgrade
      task should be brought back. It is also now used in the deploy task. The
      `ha-all` policy is cleared only when
      `om_enable_rabbitmq_high_availability` is set to `false`.
      
      Change-Id: Ia056aa40e996b1f0fed43c0f672466c7e4a2f547
      c85b64d1
  20. Apr 12, 2023
  21. Apr 08, 2023
  22. Mar 29, 2023
  23. Mar 28, 2023
    • Matt Crees's avatar
      Use the upgraded image to run Nova upgrade checks · e34fbb17
      Matt Crees authored
      When upgrading Nova, we sometimes hit an error where an old hypervisor
      that hasn’t been upgraded recently (for example due to broken hardware)
      is preventing Nova API from starting properly. This can be detected
      using the tool ``nova-status upgrade check`` to make sure that there are
      no ``nova-compute`` that are older than N-1 releases. This is already
      used in the Kolla Ansible upgrade task for Nova. However, this task uses
      the current ``nova-api`` container, so computes which will be too old
      after the upgrade are not caught.
      
      This patch changes Kolla Ansible so that the upgraded ``nova-api`` image
      is used to run the upgrade checks, allowing computes that will be too
      old to be detected before the upgrades are performed.
      
      Depends-On: https://review.opendev.org/c/openstack/kolla/+/878744
      
      
      
      Closes-Bug: #1957080
      Co-Authored-By: default avatarPierre Riteau <pierre@stackhpc.com>
      Change-Id: I3a899411001834a0c88e37f45a756247ee11563d
      e34fbb17
  24. Mar 21, 2023
  25. Mar 06, 2023
  26. Mar 02, 2023
  27. Feb 14, 2023
    • Mark Goddard's avatar
      Fix deploy/genconfig in check mode · 572ff2f8
      Mark Goddard authored
      Previously, when running one of the following commands:
      
        kolla-ansible deploy --check
        kolla-ansible genconfig --check
      
      deployment or configuration generation fails for various reasons.
      
      MariaDB fails to lookup the existing cluster.
      
      Keystone fails to generate cron config.
      
      Nova-cell fails to get the cell settings.
      
      Closes-Bug: #2002661
      Change-Id: I5e765f498ae86d213d0a4379ca5d473db1499962
      572ff2f8
    • John Garbutt's avatar
      Improve RabbitMQ performance by reducing ha replicas · 6cf22b0c
      John Garbutt authored
      Currently we do not follow the RabbitMQ advice on replicas here:
      https://www.rabbitmq.com/ha.html#replication-factor
      
      Here we reduce the number of replicas to n // 2 + 1 as advised
      above. The hope it this helps speed up recovery from rabbit
      issues.
      
      Related-Bug: #1954925
      Change-Id: Ib6bcb26c499c9884faa4a0cd51abaec00cacb096
      6cf22b0c
    • Matt Crees's avatar
      Add flag to change RabbitMQ ha-mode definition · e13072a9
      Matt Crees authored
      Adds the flag `rabbitmq_ha_replica_count` to change how many different
      nodes a queue should be mirrored across. If the value is not set, then
      it defaults to "ha-mode":"all". This value is unset by default to avoid
      any unexpected changes to the RabbitMQ definitions.json file, as that
      would trigger an unexpected restart of RabbitMQ during the next deploy.
      
      Change-Id: Iee98cd937197a73a3b04aa8501fa325e8ecfff24
      e13072a9
    • Will Szumski's avatar
      Use loadbalancer to connect to etcd · e2c7dace
      Will Szumski authored
      Hardcoding the first etcd host creates a single point of failure.
      
      Change-Id: I0f83030fcd84ddcdc4bf2226e76605c7cab84cbb
      e2c7dace
  28. Feb 13, 2023
    • Will Szumski's avatar
      Put etcd behind HTTP loadbalancer · 6f536a4f
      Will Szumski authored
      
      etcd-compatible tooz drivers do not support multiple endpoints via
      backend_url. We can put a loadbalancer in front of etcd and configure
      backend_url to use the VIP instead. The issue with hard coding the first
      host is that we break coordination if we take this host offline. In the
      case of cinder, we would not be able to perform any volume related
      operations.
      
      Co-Authored-By: default avatarMark Goddard <mark@stackhpc.com>
      Change-Id: Ib684501ba03c386dc5ac71e5cbea05c99f191665
      6f536a4f
  29. Feb 09, 2023
    • John Garbutt's avatar
      RabbitMQ: Support setting ha-promote-on-shutdown · 94f3ce0c
      John Garbutt authored
      By default ha-promote-on-shutdown=when-synced. However we are seeing
      issues with RabbitMQ automatically recovering when nodes are restarted.
      https://www.rabbitmq.com/ha.html#cluster-shutdown
      
      Rather than waiting for operator interventions, it is better we allow
      recovery to happen, even if that means we may loose some messages.
      A few failed and timed out operations is better than a totaly broken
      cloud. This is achieved using ha-promote-on-shutdown=always.
      
      Note, when a node failure is detected, this is already the default
      behaviour from 3.7.5 onwards:
      https://www.rabbitmq.com/ha.html#promoting-unsynchronised-mirrors
      
      This patch adds the option to change the ha-promote-on-shutdown
      definition, using the flag `rabbitmq_ha_promote_on_shutdown`. This
      value is unset by default to avoid any unexpected changes to the
      RabbitMQ definitions.json file, as that would trigger an unexpected
      restart of RabbitMQ during the next deploy.
      
      Related-Bug: #1954925
      
      Change-Id: I2146bda2c72ddac2c9923c6941b0596395fd9ab5
      94f3ce0c
  30. Feb 04, 2023
    • Michal Arbet's avatar
      Fix kolla_docker module · 63b9fa56
      Michal Arbet authored
      This patch fixes kolla_docker module
      as it did not take into account common_options
      parameter. From patchset it's visible that module's
      default values are used always - even if user overrided
      some param in common_options dict.
      
      Closes-Bug: #2003079
      
      Change-Id: I677fde708dd004decaff4bd39f2173d8d81052fb
      63b9fa56
Loading