Skip to content
Snippets Groups Projects
  1. Apr 24, 2024
  2. Apr 23, 2024
  3. Apr 22, 2024
    • Sven Kieske's avatar
      mariadb: fix cluster recovery · 984cb0a7
      Sven Kieske authored
      
      sometimes cluster recovery didn't work
      because we only look for the sequence number in the last 200 lines
      of the log file.
      
      fix this by ingesting the complete file and only register the last
      sequence number we find.
      
      Closes-Bug: 1821173
      
      Change-Id: Iea2661c9d5d262cf99edd5f5b567f252607a0003
      Signed-off-by: default avatarSven Kieske <kieske@osism.tech>
      984cb0a7
  4. Mar 20, 2024
  5. Mar 19, 2024
  6. Mar 18, 2024
  7. Mar 15, 2024
  8. Mar 14, 2024
  9. Mar 13, 2024
    • Roman Krček's avatar
      Add conditionals for IPv6 sysctl settings · 9301e82d
      Roman Krček authored
      This way the playbooks won't try to set ipv6 systemctl options
      unless ipv6 is available on the system.
      
      Closes-bug: #1906306
      Change-Id: Icccfc1c509179c3cfd59650b7917a637f9af9646
      9301e82d
  10. Mar 12, 2024
  11. Mar 11, 2024
    • Uwe Jäger's avatar
      Skyline configure Prometheus · 13dd9309
      Uwe Jäger authored
      Change-Id: I0a086c59076120aa53e6a05526dbab88e393c1c7
      13dd9309
    • Michal Arbet's avatar
      Fix creation of ovs bridges · 8c760d38
      Michal Arbet authored
      This patch fixes the creation of the openvswitch
      bridge by fixing an ansible task that was rewritten
      to use an ansible module, but unfortunately, its loop
      was implemented incorrectly.
      
      Closes-Bug: #2056332
      Change-Id: Ia55a36c0f9b122b72d757ca973e7d8f76ae84344
      8c760d38
    • Michal Arbet's avatar
      Fix coordination when redis used · 59da0792
      Michal Arbet authored
      Tooz 6.0.1 includes commit [1], which introduced
      parsing the username from the Redis connection URL.
      As a result, services started authenticating as admin
      which, by the way, was incorrect even before, as either
      a created user or the default one should have been used.
      
      The reason it worked before is simply because the username
      'admin' wasn't parsed anywhere.
      
      This patch fixes the user being used and sets the correct
      'default' one.
      
      [1] https://review.opendev.org/c/openstack/tooz/+/907656
      
      Closes-Bug: #2056667
      Depends-On: https://review.opendev.org/c/openstack/kolla/+/911703
      Change-Id: I5568dba15fa98e009ad4a9e41756aba0fa659371
      59da0792
  12. Mar 05, 2024
  13. Mar 01, 2024
  14. Feb 28, 2024
  15. Feb 15, 2024
  16. Feb 12, 2024
    • Dawud's avatar
      Remove the `grafana` volume · 8962b408
      Dawud authored
      Fixes not being able to add additional plugins at build time due to the
      `grafana` volume being mounted over the existing `/var/lib/grafana`
      directory. This is fixed by copying the dashboards into the container
      from an existing bind mount instead of using the ``grafana`` volume.
      This however leaves behind the volume which should be removed with
      `docker volume rm grafana` or by setting `grafana_remove_old_volume` to
      `True`.
      
      Closes-Bug: #2039498
      Change-Id: Ibcffa5d8922c470f655f447558d4a9c73b1ba361
      Unverified
      8962b408
  17. Feb 07, 2024
  18. Feb 06, 2024
    • Michal Arbet's avatar
      Fix mariadb role when used with check mode · d0b93a63
      Michal Arbet authored
      This patch adds check_mode: false to tasks
      in restart_services.yml which just checking
      some WSREP status and if port is UP.
      
      Closes-Bug: #2052501
      Change-Id: I92a591900d85138a87991a18dd4339efd053ef1b
      d0b93a63
  19. Feb 05, 2024
  20. Jan 30, 2024
  21. Jan 29, 2024
    • Alex-Welsh's avatar
      Update keystone service user passwords · ffd6e3bf
      Alex-Welsh authored
      Service user passwords will now be updated in keystone if services are
      reconfigured with new passwords set in config. This behaviour can be
      overridden.
      
      Closes-Bug: #2045990
      Change-Id: I91671dda2242255e789b521d19348b0cccec266f
      ffd6e3bf
  22. Jan 22, 2024
    • hongbin's avatar
      Revert "zun: Deprecate Zun provisionally" · 1eeb3a53
      hongbin authored
      This reverts commit f8d4db54.
      
      Reason for revert: Zun was deprecated due to the hard dependency on old version of docker and etcd. Right now, the problem was fixed.
      
      Change-Id: I9fe760e2b9b774b5ea623e6b305de004fc0c6dd2
      1eeb3a53
  23. Jan 17, 2024
    • Piotr Parczewski's avatar
      Adjust Ceph metrics scrape interval in Prometheus · 03a1b992
      Piotr Parczewski authored
      Enables modifying the interval and sets the recommended default value.
      
      [1] https://docs.ceph.com/en/latest/mgr/prometheus/#configuration
      
      Change-Id: I4b91d184485aa52b3c06011f9dbb6b34bcad3ca8
      03a1b992
    • Matt Crees's avatar
      Fix OpenSearch upgrade tasks idempotency · e502b65b
      Matt Crees authored
      Shard allocation is disabled at the start of the OpenSearch upgrade
      task. This is set as a transient setting, meaning it will be removed
      once the containers are restarted. However, if there is not change in
      the OpenSearch container it will not be restarted so the cluster is left
      in a broken state: unable to allocate shards.
      
      This patch moves the pre-upgrade tasks to within the handlers, so shard
      allocation and the flush are only performed when the OpenSearch
      container is going to be restarted.
      
      Closes-Bug: #2049512
      Change-Id: Ia03ba23bfbde7d50a88dc16e4f117dec3c98a448
      e502b65b
  24. Jan 11, 2024
  25. Jan 08, 2024
    • Pierre Riteau's avatar
      Fix Nova scp failures on Debian Bookworm · bfa9dd97
      Pierre Riteau authored
      The addition of an instance resize operation [1] to CI testing is
      triggering a failure in kolla-ansible-debian-ovn jobs, which are using a
      nodeset with multiple nodes:
      
          oslo_concurrency.processutils.ProcessExecutionError: Unexpected error while running command.
          Command: scp -r /var/lib/nova/instances/8ca2c7e8-acae-404c-af7d-6cac38e354b8_resize/disk 192.0.2.2:/var/lib/nova/instances/8ca2c7e8-acae-404c-af7d-6cac38e354b8/disk
          Exit code: 255
          Stdout: ''
          Stderr: "Warning: Permanently added '[192.0.2.2]:8022' (ED25519) to the list of known hosts.\r\nsubsystem request failed on channel 0\r\nscp: Connection closed\r\n"
      
      This is not seen on Ubuntu Jammy, which uses OpenSSH 8.9, while Debian
      Bookworm uses OpenSSH 9.2. This is likely related to this change in
      OpenSSH 9.0 [2]:
      
          This release switches scp(1) from using the legacy scp/rcp protocol
          to using the SFTP protocol by default.
      
      Configure sftp subsystem like on RHEL9 derivatives. Even though it is
      not yet required for Ubuntu, we also configure it so we are ready for
      the Noble release.
      
      [1] https://review.opendev.org/c/openstack/kolla-ansible/+/904249
      [2] https://www.openssh.com/txt/release-9.0
      
      Closes-Bug: #2048700
      Change-Id: I9f1129136d7664d5cc3b57ae5f7e8d05c499a2a5
      bfa9dd97
    • Michal Arbet's avatar
      Enable glance proxying behaviour · 9ecfcf5a
      Michal Arbet authored
      This patch sets URL to glance worker.
      If this is set, other glance workers will know how to contact this one
      directly if needed. For image import, a single worker stages the image
      and other workers need to be able to proxy the import request to the
      right one.
      
      With current setup glance image import just not working.
      
      Closes-Bug: #2048525
      
      Change-Id: I4246dc8a80038358cd5b6e44e991b3e2ed72be0e
      9ecfcf5a
  26. Jan 05, 2024
    • Mark Goddard's avatar
      cadvisor: Set housekeeping interval to Prometheus scrape interval · 97e5c0e9
      Mark Goddard authored
      The prometheus_cadvisor container has high CPU usage. On various
      production systems I checked it sits around 13-16% on controllers,
      averaged over the prometheus 1m scrape interval. When viewed with top we
      can see it is a bit spikey and can jump over 100%.
      
      There are various bugs about this, but I found
      https://github.com/google/cadvisor/issues/2523 which suggests reducing
      the per-container housekeeping interval. This defaults to 1s, which
      provides far greater granularity than we need with the default
      prometheus scrape interval of 60s.
      
      Reducing the housekeeping interval to 60s on a production controller
      reduced the CPU usage from 13% to 3.5% average. This still seems high,
      but is more reasonable.
      
      Change-Id: I89c62a45b1f358aafadcc0317ce882f4609543e7
      Closes-Bug: #2048223
      97e5c0e9
    • Dawud's avatar
      Enable HAProxy Prometheus metrics endpoint · 140722f7
      Dawud authored
      
      HAProxy exposes a Prometheus metrics endpoint, it just needs to be
      enabled. Enable this and remove configuration for
      prometheus-haproxy-exporter. Remaining prometheus-haproxy-exporter
      containers will automatically be removed.
      
      Change-Id: If6e75691d2a996b06a9b95cb0aae772db54389fb
      Co-Authored-By: default avatarMatt Anson <matta@stackhpc.com>
      Unverified
      140722f7
Loading