- Oct 24, 2024
-
-
Michal Nasiadka authored
After switching to ProxySQL as default we see following logs: CRITICAL neutron [None req-c214fdae-5da7-402d-92b0-0572c278a5b5 - - - - - -] Unhandled error: sqlalchemy.exc.OperationalError: (pymysql.err.OperationalError) (9001, 'Max connect timeout reached while reaching hostgroup 0 after 10150ms') Mainly in upgrade jobs, which otherwise pass successfuly - just fail on this check. Change-Id: I4336ec62a0a2dfbe815842f1bacb02135bcf4c0e
-
- Oct 22, 2024
- Oct 18, 2024
-
-
Zuul authored
-
- Oct 17, 2024
-
-
Michal Arbet authored
The backup user was missing the necessary CREATE privilege for the mariadb_backup_history table within the mysql schema, causing backups to fail when attempting to create this table. This patch addresses the issue by granting the backup user the required CREATE permission specifically for the mariadb_backup_history table. With this change, the backup process can now complete successfully without manual intervention for user permissions. Closes-Bug: #2061889 Change-Id: Ic92c8959972329adbd4b89c521aa87678f25b4e4
-
jayjahns authored
In 2023.1, mariadb was at version 10.6, which meant that the PERCONA_SCHEMA.xtrabackup_history table was used to log historicals for backups. Starting in 2023.2 onwards, mariadb is at version 10.11, and the default table used is now mysql.mariadb_backup_history. Because the mysql database already exists, there is no reason to try and create it. Instead, we just need to update the defaults and ensure the permissions get added to the correct database. Related-Bug: #2061889 Change-Id: If146d8f896c70374884807d42ca0e12df3276d48
-
Michal Arbet authored
It's been some time since ProxySQL has been with us in Kolla. Let's switch the load balancer for MariaDB connections from HAProxy to ProxySQL. Depends-On: https://review.opendev.org/c/openstack/kolla/+/928956 Change-Id: I42ba4fb83b5bb31058e888f0d39d47c27b844de5
-
Michal Arbet authored
In single-node clusters, ProxySQL shuns the server on MySQL errors, causing failures during upgrades or container restarts. This change increases the timeout to 10 seconds, allowing the backend time to recover and preventing immediate errors in CI environments. Change-Id: I70becdc3fcb4ca8f7ae31d26097d95bdc6dd67eb
-
- Oct 16, 2024
-
-
Michal Nasiadka authored
Change-Id: Ie2b0b2d5fca7b9d7c613a67a134c4650de2a5af6
-
Zuul authored
-
Zuul authored
-
- Oct 15, 2024
-
-
Zuul authored
-
Jan Horstmann authored
Add missing logrotate config for redis. Closes-Bug: 2084523 Change-Id: Ic631a9c87f7be30f7694706928d9ede62015ed6d Signed-off-by:
Jan Horstmann <horstmann@osism.tech>
-
- Oct 14, 2024
-
-
Jakub Darmach authored
ubuntu-ceph is broken for now due to [1], also there are no download.ceph.com packages for Noble - so we're using Ubuntu provided ones from proposed - because current version in regular repos is built from git sha instead of a release and is not suitable for running outside of Ceph upstream CI. [1]: https://tracker.ceph.com/issues/66389 Depends-On: https://review.opendev.org/c/openstack/kolla/+/907589 Change-Id: I384068572d8a1a495c60b401dc4144a0a80802f1
-
Rafal Lewandowski authored
Closes-Bug: #2084128 Change-Id: I3b44c8f4ff3c55023d8bab4e9a88a86ca72cae5d
-
- Oct 09, 2024
- Oct 08, 2024
-
-
Michal Nasiadka authored
Since [1] Neutron puts requested-chassis entry with a name taken from the agent, which results in FQDN-based name on FQDN-based deployments. It does not match what we set in hostname in OVS. [1]: I4e3c001dd3bb37b86fda8b9495a3c5178c3e736d Closes-Bug: #2080552 Change-Id: I3ae03aa2e09bc445f0f5a95a43bf210f06685cc1
-
- Oct 07, 2024
- Oct 04, 2024
-
-
Zuul authored
-
Michal Nasiadka authored
Change-Id: I85982366d97e40d0e6ba97ce4c4c1ddd2c10b749
-
Michal Arbet authored
Change-Id: I972608b81edb2f5c118cf9a2f734f83ea399a849
-
- Oct 03, 2024
-
-
Zuul authored
-
- Oct 02, 2024
- Sep 30, 2024
-
-
Michal Arbet authored
This patch adds ansible-core as a dependency to requirements.txt since there is really no reason not to have it in the requirements.txt, given that kolla-ansible project even has ansible in its name and ansible-core is a crucial dependency. Change-Id: I4d01b7eb944c4edf6a440f2b775221986509f7eb
-
- Sep 29, 2024
-
-
Michal Arbet authored
This patch fixes an issue where backend related certificates are attempted to be copied when ``kolla_copy_ca_into_containers`` is enabled but ``kolla_enable_tls_backend`` is disabled. The fix consists of these specific tasks now being limited by the condition ``kolla_enable_tls_backend`` Closes-Bug: #2080381 Change-Id: I7ccae4c501ce332519edef336bcceefae9f9568b
-
- Sep 27, 2024
- Sep 26, 2024
-
-
Michal Arbet authored
Kolla-ansible itself requires ansible-core>=2.16,<2.18, but ansible-core in this version no longer supports python38 and python39 as per [1]. So let's just drop this old python support. [1] https://github.com/ansible/ansible/blob/v2.16.11/setup.cfg Change-Id: Ic8aaa57f75479a17c215c27ac5e6df0f18c74edc
-
- Sep 24, 2024
- Sep 23, 2024
-
-
Michal Arbet authored
This update enhances the monitoring of the databasecluster in ProxySQL. The default monitoring intervals were insufficient for reliably detecting failures in the Galera cluster environment. A detailed configuration for monitoring intervals has been introduced, providing better control over how quickly and accurately ProxySQL can identify issues. - Variables such as `mariadb_monitor_connect_interval`, `mariadb_monitor_galera_healthcheck_interval, and `mariadb_monitor_ping_interval` significantly reduce the time between connection checks. - Timeouts like `mariadb_monitor_galera_healthcheck_timeout` and `mariadb_monitor_ping_timeout` allow faster failure detection, while `mariadb_monitor_galera_healthcheck_max_timeout_count` sets the maximum number of allowed timeouts before marking a node as down. Calculation: - Galera healthcheck: 4 seconds (interval) + 1 second (timeout) + 4 seconds (interval) + 1 second (timeout) = 10 seconds. - Ping healthcheck: 3 seconds (interval) + 2 seconds (timeout) + 3 seconds (interval) + 2 seconds (timeout) = 10 seconds. Both the health check and ping check mechanisms will detect a node failure within a maximum of 10 seconds. Both processes (health check and ping) operate independently, and failure in either mechanism will mark the node as failed. Health Check Failure Detection: Up to 10 seconds. Ping Failure Detection: Up to 10 seconds. Connect Attempts: ProxySQL also tries to connect every 2 seconds, which helps monitor connectivity. These changes ensure that ProxySQL can detect issues in 10 seconds as haproxy, significantly reducing downtime compared to default settings. This adjustment enables faster and more reliable monitoring, improving system stability and reducing potential downtime in production environments. Change-Id: Ic28801519cdb35ed2387a1468b9df661847a5476
-
Michal Arbet authored
The ProxySQL startup script was incorrectly using the `--reload` flag, which only reloads/merges the configuration without initializing the database from the config file [1]. This change corrects it to use the `--initial` flag, ensuring that the database is always reloaded from the configuration at startup. [1] https://proxysql.com/documentation/configuring-proxysql/#initialstartup Change-Id: I9cc721555a6d19409b6ac24432b6b34a83efc42c
-
Zuul authored
-