apex-tripleo-heat-templates - Unnamed repository

Age	Commit message (Collapse)	Author	Files	Lines
2016-09-30	Fixed NoneType issue when logging-environment.yaml is used	Juan Badia Payno	1	-1/+1
	When you tried to use the environemnt/logging-environemnt.yaml as a part of the deployment on the overcloud you hit the following error and it stops the deploy of the overcloud. * Deploying templates in the directory /home/stack/tripleo-heat-templates 'NoneType' object does not support item assignment * Closes-Bug: #1629315 Change-Id: I55e5c7f20ddf30f3e48247b734f6fa47f5de3750 Signed-off-by: Juan Badia Payno <jbadiapa@redhat.com>
2016-09-30	Merge "Add option to specify Certmonger CA"	Jenkins	1	-0/+8

2016-09-30	Merge "Move the rest of static roles resource registry entries to j2"	Jenkins	4	-14/+4

2016-09-30	Add flag for internal TLS	Juan Antonio Osorio Robles	2	-0/+6
	This sets up a flag that tells the profiles to use TLS (this will happen in the internal network). bp tls-via-certmonger Change-Id: If47febb5b38b1c65f60f9de87a34cb31936a7c0d
2016-09-29	Merge "Use -L with chown and set crush map tunables when upgrading Ceph"	Jenkins	2	-4/+8

2016-09-29	Merge "Fix typo in fixing gnocchi upgrade."	Jenkins	1	-1/+1

2016-09-29	Merge "Add gateway_ip in OS::Neutron::Subnet"	Jenkins	11	-1/+24

2016-09-29	Add HAProxy TLS handled by certmonger as composable service	Juan Antonio Osorio Robles	7	-10/+183
	This adds some basic pieces to get certmonger to manage the certificates for HAProxy. The aim is to be flexible enough that we will be able to manage both public and internal certificates. This also adds a relevant environment to get the endpoints to have TLS everywhere. bp tls-via-certmonger Depends-On: I89001ae32f46c9682aecc118753ef6cd647baa62 Change-Id: Ife5f8c2f07233295bc15b4c605acf3d9bd62f162
2016-09-29	Add option to specify Certmonger CA	Juan Antonio Osorio Robles	1	-0/+8
	This will be used for internal (or even public) TLS, for when certmonger is generating the certificates. This same setting is used for the undercloud with the generate_service_certificate option. Change-Id: Ic54fe512b9ed5c71417a66491b7954e653f660b6
2016-09-29	Balance Rabbitmq Queue Master Location on queue declaration with min-masters ↵	Michele Baldessari	1	-0/+1
	strategy It may happen that one of the controllers may become unavailable and Queue Masters will be located on available controllers during queue declarations. Once a lost controller will be become available masters of newly declared queues are not placed with priority to such controller with obviously lower number of queue masters and thus the distribution may be unbalanced and one of the controllers may become under significantly higher load in some circumstances of multiple fail-overs. With rabbit 3.6.0 rabbitmq introduced a new HA feature of Queue masters distribution - one of the strategies is min-masters, which picks the node hosting the minimum number of masters. One of the ways how to turn such min-masters strategy on is by adding following into configuration file - rabbitmq.config {rabbit,[ .. {queue_master_locator, <<"min-masters">>}, .. ]}, Change-Id: I61bcab0e93027282b62f2a97bec87cbb0a6e6551 Closes-Bug: #1629010
2016-09-29	Set ceph osd max object name and namespace len on upgrade when on ext4	Giulio Fidente	1	-0/+10
	As per [1] we need to lower osd max object name and namespace len when upgrading from Hammer and the OSD is backed by ext4. These could also be given via ExtraConfig but on upgrade we only run puppet apply after this script is executed, so the values won't be effective unless the daemon is restarted. Yet we do not want puppet to restart the daemon because we can't bring all OSDs down unconditionally or guests will die. 1. http://tracker.ceph.com/issues/16187 Co-Authored-By: Michele Baldessari <michele@acksyn.org> Co-Authored-By: Dimitri Savineau <dsavinea@redhat.com> Change-Id: I7fec4e2426bdacd5f364adbebd42ab23dcfa523a Closes-Bug: 1628874
2016-09-29	Add parameters to run nova over httpd	Juan Antonio Osorio Robles	1	-0/+18
	This adds the necessary hieradata to run nova over httpd instead of eventlet. Change-Id: I57fb20cf0d58b3376243ba4aeb04e995e7152ce3
2016-09-29	Cinder volume service is not managed by Pacemaker on BlockStorage	Giulio Fidente	4	-2/+3
	We do not want cinder-volume to be managed by Pacemaker on BlockStorage nodes, where Pacemaker is not running at all. This change adds a new BlockStorageCinderVolume service name which can (and is, by default) mapped to the non Pacemaker implementation of the service. The error was: Could not find dependency Exec[wait-for-settle] for Pacemaker::Resource::Systemd[openstack-cinder-volume] Also moves cinder::host setting into the Pacemaker specific service definition because we only want to set a shared host= string when the service is managed by Pacemaker. Closes-Bug: #1628912 Change-Id: I2f7e82db4fdfd5f161e44d65d17893c3e19a89c9
2016-09-29	Move the rest of static roles resource registry entries to j2	Carlos Camacho	4	-14/+4
	Moving the rest of the static based resource registry entries to j2, this allows to extend the content of the template to the roles_list. Also moved the templates to correspond with the role name. Partial-Bug: #1626976 Change-Id: I1cbe101eb4ce5a89cba5f2cc45cace43d3380f22
2016-09-29	Merge "j2 template per-role things in default registry"	Jenkins	1	-58/+20

2016-09-29	Merge "Relax pre-upgrade check for failed actions"	Jenkins	2	-3/+5

2016-09-29	Merge "Fix races in major-upgrade-pacemaker Step2"	Jenkins	3	-17/+41

2016-09-29	Fix typo in fixing gnocchi upgrade.	Sofer Athlan-Guyot	1	-1/+1
	Change-Id: I44451a280dd928cd694dd6845d5d83040ad1f482 Related-Bug: #1626592
2016-09-29	Merge "Full HA->HA NG migration might fail setting maintenance-mode"	Jenkins	1	-8/+4

2016-09-29	Merge "Update gnocchi database during M/N upgrade."	Jenkins	1	-2/+3

2016-09-29	Use -L with chown and set crush map tunables when upgrading Ceph	Giulio Fidente	2	-4/+8
	Previously the chown command wasn't traversing symlinks, causing the new ownership to not be set for some needed files. This change also ensures the crush map tunables are set to the 'default' profile after the upgrade. Finally redirects the output of a pidof to /dev/null to avoid spurious logging. Change-Id: Id4865ffff207edfc727d729f9cc04e6e81ad19d8
2016-09-29	Merge "Move db::mysql into service_config_settings"	Jenkins	23	-105/+111

2016-09-29	j2 template per-role things in default registry	Steven Hardy	1	-58/+20
	The default resource-registry file contains a bunch of per-role things which mean you need to cut/paste into a custom environment file for custom roles, even if you only want the defaults like the built-in roles. Using j2 we can template these just like in the overcloud.j2.yaml and other files. Change-Id: I52a9bffd043ca8fb0f05077c8a401a68def82926 Partial-Bug: #1626976
2016-09-29	Use netapp_host_type instead of netapp_eseries_host_type	Giulio Fidente	2	-5/+15
	This patch deprecates netapp_eseries_host_type in favor of netapp_host_type. Change-Id: I113c770ca2e4dc54526d4262bacae48e223c54f4 Closes-Bug: 1579161
2016-09-29	Relax pre-upgrade check for failed actions	Michele Baldessari	2	-3/+5
	Before this change we checked the cluster for any failed actions and we stopped the upgrade process if there were any. This is likely eccessive as a failed action could have happened in the past and the cluster is now fully functional. Better to check if any of the resources are in Stopped state and break the upgrade process if any of them are. We also need to restrict this check to the bootstrap node because otherwise the following might happen: 1) Bootstrap node does the check, it is successful and it starts the full HA -> HA NG migration which will create failed actions and will start stopping resources 2) If the check now starts on a non-bootstrap node while 1) is ongoing, it will find either failed actions or stopped resources so it will fail. Change-Id: Ib091f6dd8884025d2e23bf2fa700169e2dec778f Closes-Bug: #1628653
2016-09-29	Fix races in major-upgrade-pacemaker Step2	Michele Baldessari	3	-17/+41
	tripleo-heat-templates/extraconfig/tasks/major_upgrade_controller_pacemaker_2.sh has the following code: ... check_resource mongod started 600 if [[ -n $(is_bootstrap_node) ]]; then ... tstart=$(date +%s) while ! clustercheck; do sleep 5 tnow=$(date +%s) if (( tnow-tstart > galera_sync_timeout )) ; then echo_error "ERROR galera sync timed out" exit 1 fi done # Run all the db syncs cinder-manage db sync ... fi start_or_enable_service rabbitmq check_resource rabbitmq started 600 start_or_enable_service redis check_resource redis started 600 start_or_enable_service openstack-cinder-volume check_resource openstack-cinder-volume started 600 systemctl_swift start for service in $(services_to_migrate); do manage_systemd_service start "${service%%-clone}" check_resource_systemd "${service%%-clone}" started 600 done """ The problem with the above code is that it is open to the following race condition: 1) Bootstrap node is busy checking the galera status via cluster check 2) Non-bootstrap node has already reached: start_or_enable_service rabbitmq and later lines. These lines will be skipped because start_or_enable_service is a noop on non-bootstrap nodes and check_resource rabbitmq only checks that pcs status \|grep rabbitmq returns true. 3) Non-bootstrap node can then reach the manage_systemd_service start and it will fail with stuff like: "Job for openstack-nova-scheduler.service failed because the control process exited with error code. See \"systemctl status openstack-nova-scheduler.service\" and \"journalctl -xe\" for details.\n" (because the db tables are not migrated yet) This happens because 3) was started on non-bootstrap nodes before the db-sync statements are complete on the bootstrap node. I did not feel like changing the semantics of check_resource and remove the noop on non-bootstrap nodes as other parts of the tree might rely on this behaviour. Depends-On: Ia016264b51f485b97fa150ebd357b109581342ed Change-Id: I663313e183bb05b35d0c5af016c2d1705c772bd9 Closes-Bug: #1627965
2016-09-28	Update gnocchi database during M/N upgrade.	Sofer Athlan-Guyot	1	-2/+3
	We call gnocchi-upgrade to make sure we update all the needed schemas during the major-upgrade-pacemaker step. We also make sure that redis is started before we call gnocchi-upgrade otherwise the command will be stuck in a loop trying to contact redis. Closes-Bug: #1626592 Change-Id: Ia016264b51f485b97fa150ebd357b109581342ed
2016-09-28	Merge "Fix predictable placement indexing"	Jenkins	1	-0/+14

2016-09-28	Move db::mysql into service_config_settings	Dan Prince	23	-105/+111
	This patch movs the various db::mysql hiera settings into a 'mysql' specific service_config_settings section for each service so that these will only get applied on the MySQL service node. This follows a similar puppet-tripleo change where we create the actual databases for all services locally on the MySQL service node to avoid permission issues. Change-Id: Ic0692b1f7aa8409699630ef3924c4be98ca6ffb2 Closes-bug: #1620595 Depends-On: I05cc0afa9373429a3197c194c3e8f784ae96de5f Depends-On: I5e1ef2dc6de6f67d7c509e299855baec371f614d
2016-09-28	Full HA->HA NG migration might fail setting maintenance-mode	Michele Baldessari	1	-8/+4
	Currently we do the following in the migration path: pcs property set maintenance-mode=true if ! timeout -k 10 300 crm_resource --wait; then echo_error "ERROR: cluster remained unstable after setting maintenance-mode for more than 300 seconds, exiting." exit 1 fi crm_resource --wait can actually take forever under certain conditions. The property will be set atomically across the cluster nodes so we should be good without this. Change-Id: I8f531d63479b81d65b572c4431c9db6f610f7e04 Closes-Bug: #1628393
2016-09-28	Fix "Not all flavors have been migrated to the API database"	Michele Baldessari	1	-0/+1
	After a successful upgrade to Newton, I ran the tripleo.sh --overcloud-pingtest and it failed with the following: resources.test_flavor: Not all flavors have been migrated to the API database (HTTP 409) The issue is the fact that some tables have migrated to the nova_api db and we need to migrate the data as well. Currently we do: nova-manage db sync nova-manage api_db sync We want to add: nova-manage db online_data_migrations After launching this command the overcloud-pingtest works correctly: tripleo.sh -- Overcloud pingtest SUCCEEDED Change-Id: Id2d5b28b5d4ade7dff6c5e760be0f509b4fe5096 Closes-Bug: #1628450
2016-09-28	Merge "Deprecate the NeutronL3HA parameter"	Jenkins	1	-7/+23

2016-09-27	Use correct password for keystone bootstrap	Alex Schultz	1	-0/+1
	In upstream puppet-keystone, the boostrap process should use an admin password not the admin token for the bootstrapping of keystone. The admin password option is being added to the upstream class so we will need to provide it to properly have keystone bootstrapped. Change-Id: Icab4b0cb70d6caf2f2792c4fe730f060b807fbc1 Depends-On: I7a706d93b43ec025bdb4b29667f64ff2f7dd52a0 Related-Bug: #1621959
2016-09-27	Fix NTP servers hieradata	Marius Cornea	1	-1/+1
	This patch enables correctly setting the NTP server passed via --ntp-server in the overcloud nodes' /etc/ntp.conf. Change-Id: Iff644b9da51fb8cd1946ad9d297ba0e94d3d782b
2016-09-27	Merge "Remove deprecated scheduler_driver settings"	Jenkins	1	-0/+2

2016-09-27	Merge "Add metricd workers support in gnocchi"	Jenkins	2	-0/+6

2016-09-27	Merge "Use parameter name to configure gmcast_listen_addr"	Jenkins	1	-0/+8

2016-09-27	Merge "Set manila::keystone::auth::tenant"	Jenkins	1	-0/+1

2016-09-27	Merge "Disable openstack-cinder-volume in step1 and reenable it in step2"	Jenkins	2	-0/+5

2016-09-27	Merge "Activate StorageMgmtPort on computes in HCI environment"	Jenkins	1	-5/+4

2016-09-26	Set manila::keystone::auth::tenant	Tom Barron	1	-0/+1
	Without setting this parameter, overcloud deploy fails and 'openstack stack failures list overcloud' reveals the following error: Error: Puppet::Type::Keystone_user_role::ProviderOpenstack: Could not find project with name [services] and domain [Default] Error: /Stage[main]/Manila::Keystone::Auth/Keystone::Resource::Service_identity[manilav2]/Keystone_user_role[manilav2@services]: Could not evaluate: undefined method `[]' for nil:NilClass When we set manila::keystone::auth::tenant to 'service', analogous to cinder, nova, etc., the overcloud deploy completes successfully. Change-Id: I996ac2ff602c632a9f9ea9c293472a6f2f92fd72
2016-09-27	Merge "Add FixedIPs parameter to from_service.yaml"	Jenkins	2	-0/+12

2016-09-27	Merge "Fix ignore warning on ceph major upgrade."	Jenkins	1	-1/+1

2016-09-27	Merge "Add integration with Manila CephFS Native driver"	Jenkins	4	-0/+81

2016-09-27	Merge "A few major-upgrade issues"	Jenkins	3	-25/+46

2016-09-27	Merge "Start mongod before calling ceilometer-dbsync"	Jenkins	1	-0/+7

2016-09-27	Merge "Reinstantiate parts of code that were accidentally removed"	Jenkins	2	-0/+9

2016-09-27	Merge "Neutron metadata agent worker count fix"	Jenkins	1	-3/+10

2016-09-27	Merge "Remove double definition of config_settings key in keystone"	Jenkins	1	-1/+0

2016-09-26	Fix predictable placement indexing	Ben Nemec	1	-0/+14
	As noted in the bug, predictable placement is broken right now because the %index% in the scheduler hint isn't being interpolated. This is because the parameter was moved from overcloud.yaml to the service-specific files, which doesn't provide the index value. Because the Compute role's parameter is named NovaCompute... we also have to include some backwards compatibility logic to handle the mismatch. Change-Id: Ibee2949fe4c6c707203d7250e2ce169c769b1dcd Closes-Bug: 1627858