Commits · 47ddac41312abf7d9f44a15b6ba6128e1eba582a · Very Demiurge Very Mindful / Kolla Ansible

Jan 22, 2024

Michal Arbet authored 1 year ago


The version that we were capping to is no longer compatible with latest
upper-constraints.txt, so let us free float again.

The resulting linting errors are included for now to unblock the gate,
these will still need to be discussed or fixed later.

NOTE(kevko): Temporarily disabling horizon deployment, as it's not
possible to unblock gates without it

Co-Authored-By: Michal Arbet <michal.arbet@ultimum.io>
Change-Id: Ib7f72b2663199ef80844a412bc436c6ef09322cc

47ddac41

Jan 18, 2024
- Merge "Drop more remnants of install_type" · 86ac8204
  Zuul authored 1 year ago
  
  86ac8204
Jan 17, 2024

Drop more remnants of install_type · 76f5d0cb
Pierre Riteau authored 1 year ago
```
Change-Id: I8e5e42db48c6235deb93dcb185e044fce983ba5a
```
76f5d0cb

use docker_custom_config override for Kolla CI upgrade jobs · 1d38ff5e

Bartosz Bezak authored 1 year ago

In Kolla CI K-A upgrade job needs docker_custom_config override
as docker_registry var is being used both for docker daemon
config - for kolla images build, and kolla-ansible container images
sources - where we're using quay.io mirror.
docker_custom_config gets precedence in docker daemon
configuration.

docker_custom_config was removed in [1].

[1] https://review.opendev.org/c/openstack/kolla-ansible/+/904067

Change-Id: I1e890223faf25b1169a49e22a9529f90806d2f3a

1d38ff5e

Jan 15, 2024
- Merge "CI: Use 2023.2 as previous_release" · 77c18fa6
  Zuul authored 1 year ago
  
  77c18fa6
Jan 12, 2024
- Merge "Test haproxy single external frontend" · 3490b0f1
  Zuul authored 1 year ago
  
  3490b0f1
- Merge "CI: Rework docker config vars" · aac86a92
  Zuul authored 1 year ago
  
  aac86a92
- Merge "Fix trove failed to discover swift endpoint" · 3ed60961
  Zuul authored 1 year ago
  
  3ed60961
Jan 11, 2024

Merge "Fix trove failed to connect rabbitmq - quorum queues support" · 1832eee3
Zuul authored 1 year ago

1832eee3
Merge "Fix trove failed to connect rabbitmq - durable queues support" · 781e3949
Zuul authored 1 year ago

781e3949

Fix trove failed to discover swift endpoint · 9eff4380

wu.chunyang authored 1 year ago

This change fixes the trove failed to discover swift endpoint
by adding service_credentials in guest-agent.conf

Closes-Bug: #2048829

Change-Id: I185484d2a0d0a2d4016df6acf8a6b0a7f934c237

9eff4380

Fix trove failed to connect rabbitmq - quorum queues support · 57b24f01

wu.chunyang authored 1 year ago

This change fixes the trove guest instance failed to connect to
RabbitMQ by adding quorum queues support to oslo_messaging_rabbit
section in guest-agent.conf.

Closes-Bug: #2048822
Change-Id: I94908f8e20981f20fbe4dc18e2091d3798f8b801

57b24f01

Fix trove failed to connect rabbitmq - durable queues support · 6b96d098

wu.chunyang authored 1 year ago

This change fixes the trove guest instance failed to connect to
RabbitMQ by adding durable queues support to oslo_messaging_rabbit
section in guest-agent.conf.

Partial-Bug: #2048822

Change-Id: I8efc3c92e861816385e6cda3b231a950a06bf57d

6b96d098

Jan 10, 2024
- Merge "Enable the Fluentd Plugin Systemd" · 357db524
  Zuul authored 1 year ago
  
  357db524
Jan 09, 2024
- Merge "CI: Test Nova server resize functionality" · e30ef79d
  Zuul authored 1 year ago
  
  e30ef79d
- Merge "Fix Nova scp failures on Debian Bookworm" · c78cedfa
  Zuul authored 1 year ago
  
  c78cedfa
- Merge "Update python classifier in setup.cfg" · 03ec1798
  Zuul authored 1 year ago
  
  03ec1798
- Merge "Enable glance proxying behaviour" · 6bbe0987
  Zuul authored 1 year ago
  
  6bbe0987
- Update python classifier in setup.cfg · 27f162cf
  Ghanshyam Mann authored 1 year ago
  
  As per the current release tested runtime, we test till python 3.11 so updating the same in python classifier in setup.cfg Change-Id: I241e77dbf6bb2085a5bf5d54f9e5b0d2af96fbf3
  27f162cf
Jan 08, 2024

CI: Test Nova server resize functionality · f86ed027

Pierre Riteau authored 1 year ago

This adds an extra resize operation to core OpenStack tests. This should
be fast since we are only increasing the number of cores of the VM and
could help catch additional errors in CI tests.

Change-Id: Ia61b995dbffcda4f1e6494548df457231cb67bd7

f86ed027

Fix Nova scp failures on Debian Bookworm · bfa9dd97

Pierre Riteau authored 1 year ago

The addition of an instance resize operation [1] to CI testing is
triggering a failure in kolla-ansible-debian-ovn jobs, which are using a
nodeset with multiple nodes:

    oslo_concurrency.processutils.ProcessExecutionError: Unexpected error while running command.
    Command: scp -r /var/lib/nova/instances/8ca2c7e8-acae-404c-af7d-6cac38e354b8_resize/disk 192.0.2.2:/var/lib/nova/instances/8ca2c7e8-acae-404c-af7d-6cac38e354b8/disk
    Exit code: 255
    Stdout: ''
    Stderr: "Warning: Permanently added '[192.0.2.2]:8022' (ED25519) to the list of known hosts.\r\nsubsystem request failed on channel 0\r\nscp: Connection closed\r\n"

This is not seen on Ubuntu Jammy, which uses OpenSSH 8.9, while Debian
Bookworm uses OpenSSH 9.2. This is likely related to this change in
OpenSSH 9.0 [2]:

    This release switches scp(1) from using the legacy scp/rcp protocol
    to using the SFTP protocol by default.

Configure sftp subsystem like on RHEL9 derivatives. Even though it is
not yet required for Ubuntu, we also configure it so we are ready for
the Noble release.

[1] https://review.opendev.org/c/openstack/kolla-ansible/+/904249
[2] https://www.openssh.com/txt/release-9.0

Closes-Bug: #2048700
Change-Id: I9f1129136d7664d5cc3b57ae5f7e8d05c499a2a5

bfa9dd97

Enable glance proxying behaviour · 9ecfcf5a

Michal Arbet authored 1 year ago

This patch sets URL to glance worker.
If this is set, other glance workers will know how to contact this one
directly if needed. For image import, a single worker stages the image
and other workers need to be able to proxy the import request to the
right one.

With current setup glance image import just not working.

Closes-Bug: #2048525

Change-Id: I4246dc8a80038358cd5b6e44e991b3e2ed72be0e

9ecfcf5a

Merge "CI: Use ControlPersist and ControlMaster" · 15380925
Zuul authored 1 year ago

15380925

Jan 06, 2024
- Merge "cadvisor: Set housekeeping interval to Prometheus scrape interval" · 205fd639
  Zuul authored 1 year ago
  
  205fd639
Jan 05, 2024

cadvisor: Set housekeeping interval to Prometheus scrape interval · 97e5c0e9

Mark Goddard authored 1 year ago

The prometheus_cadvisor container has high CPU usage. On various
production systems I checked it sits around 13-16% on controllers,
averaged over the prometheus 1m scrape interval. When viewed with top we
can see it is a bit spikey and can jump over 100%.

There are various bugs about this, but I found
https://github.com/google/cadvisor/issues/2523 which suggests reducing
the per-container housekeeping interval. This defaults to 1s, which
provides far greater granularity than we need with the default
prometheus scrape interval of 60s.

Reducing the housekeeping interval to 60s on a production controller
reduced the CPU usage from 13% to 3.5% average. This still seems high,
but is more reasonable.

Change-Id: I89c62a45b1f358aafadcc0317ce882f4609543e7
Closes-Bug: #2048223

97e5c0e9

Fix long service restarts while using systemd · b1fd2b40

Michal Arbet authored 1 year ago

Some containers exiting with 143 instead of 0, but
this is still OK. This patch just allows
ExitCode 143 (SIGTERM) as fix. Details in
bugreport.

Services which exited with 143 (SIGTERM):

kolla-cron-container.service
kolla-designate_producer-container.service
kolla-keystone_fernet-container.service
kolla-letsencrypt_lego-container.service
kolla-magnum_api-container.service
kolla-mariadb_clustercheck-container.service
kolla-neutron_l3_agent-container.service
kolla-openvswitch_db-container.service
kolla-openvswitch_vswitchd-container.service
kolla-proxysql-container.service

Partial-Bug: #2048130
Change-Id: Ia8c85d03404cfb368e4013066c67acd2a2f68deb

b1fd2b40

Jan 04, 2024
- Merge "post-deploy: add public-openrc.sh" · 39db9a04
  Zuul authored 1 year ago
  
  39db9a04
- Merge "ironic: Remove enable_ironic_pxe_uefi bits" · 288d2f08
  Zuul authored 1 year ago
  
  288d2f08
- ironic: Remove enable_ironic_pxe_uefi bits · d8700ad0
  Michal Nasiadka authored 1 year ago
  
  These were missed in I081aa1345603fa27c390e4e09231a5ff226bcb39 Change-Id: I2884bca3c06ff98004e318757a20b60c12375924
  d8700ad0
- CI: Use 2023.2 as previous_release · 6daadfdb
  Michal Nasiadka authored 1 year ago
  
  Change-Id: I30e9e8c6f59bf2b2f912d70178484ddcd657436e
  6daadfdb
Jan 03, 2024
- Use service-images-pull role for letsencrypt and venus · 498d3243
  Mark Goddard authored 1 year ago
  
  This reduces code duplication. Change-Id: Ie529875aaa42435835417468868250bbe4fcf649
  498d3243
- Merge "Remove nova cell sync comment" · 16928ced
  Zuul authored 1 year ago
  
  16928ced
- Merge "haproxy: Fix single frontend after LE cert path change" · 2712a7a6
  Zuul authored 1 year ago
  
  2712a7a6
- Merge "Persist Neutron agent state files in volume" · 3681427b
  Zuul authored 1 year ago
  
  3681427b
- Merge "magnum: Disable CAPI driver when kubeconfig missing" · dd784731
  Zuul authored 1 year ago
  
  dd784731
- Test haproxy single external frontend · 9bc99b94
  Michal Nasiadka authored 2 years ago
  
  Change-Id: Id25b4407a8170f69e4cd7278e0aff64c609ace7d
  9bc99b94
Jan 02, 2024
- haproxy: Fix single frontend after LE cert path change · 21e5b21f
  Michal Nasiadka authored 1 year ago
  
  I35317ea0343f0db74ddc0e587862e95408e9e106 changed certificate path but omitted single frontend template. Change-Id: I638ba32e97234900745df62056710dcc37e7db77
  21e5b21f
- magnum: Disable CAPI driver when kubeconfig missing · 48796560
  Michal Nasiadka authored 1 year ago
  
  Closes-Bug: #2047360 Change-Id: I73490d84da39a74ea7ac493c7dd41fe7bfe2f578
  48796560
- Merge "Fix wsrep sync status task while switched to TCP/IP" · 65886c1d
  Zuul authored 1 year ago
  
  65886c1d
- Merge "Remove after-Zed TODOs" · eb0e5bac
  Zuul authored 1 year ago
  
  eb0e5bac