Ansible errors - Httqm's Docs

This article refers to a situation I experienced when trying to spawn virtual machines with VMware and Ansible.

My VM boots fine but, whatever network settings I try, the network interface is disconnected
no action within the vSphere webUI allows connecting this network interface manually
no option within the vSphere webUI (template / VM / virtual networking) allows connecting this network interface after executing the playbook again

I found several posts, none of which fixed this issue.

The answer is subtle : customization of Linux guest operating systems requires that Perl is installed in the Linux guest operating system.

A minimal install of RHEL 8.4 does NOT install Perl.

An error message in the vm-tools logs may also have caught your attention (detail) :

/usr/bin/perl: bad interpreter or No such file or directory

Now that we know what's missing, it should be as simple as :

yum install perl

in the VM used to build the template (then clone it as a template again). But what if this VM :

has no internet access ?
is not registered to Red Hat and cannot get packages from online repositories ?

Details in the dedicated article : How to install software on RHEL with the install DVD only ?
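
Before cloning the VM as a template again, it may be worth checking that Perl is really there. This is a minimal sketch : require_cmd is a hypothetical helper name of mine, and it only tests that the command can be found in the PATH, nothing more.

```sh
# require_cmd NAME : succeed if NAME is found in the PATH, fail otherwise.
require_cmd() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "$1: found at $(command -v "$1")"
    else
        echo "$1: NOT found" >&2
        return 1
    fi
}
```

Run inside the VM used to build the template, for instance : require_cmd perl || echo "install perl before making the template"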

Full error message :

TASK [common : set timezone to Etc/UTC] ******************************************************************************************************************
 [WARNING]: timedatectl command was found but not usable: Failed to create bus connection: No such file or directory . using other method.

fatal: [myHost]: FAILED! => {"changed": false, "msg": "Error message:\ngiven timezone \"Etc/UTC\" is not available"}

There are 2 problems here :

timedatectl command was found but not usable

given timezone "Etc/UTC" is not available

I got this error with both Ansible 2.7.9 and 2.8.6.

The source code shows :

    def _verify_timezone(self):
        tz = self.value['name']['planned']
        tzfile = '/usr/share/zoneinfo/%s' % tz
        if not os.path.isfile(tzfile):
            self.abort('given timezone "%s" is not available' % tz)
        return tzfile

which was confirmed within the container, where this file is missing :
ls -l /usr/share/zoneinfo/Etc/UTC
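
The same test can be run by hand before launching the playbook. This is a minimal shell sketch mirroring the os.path.isfile check above : tz_available is a hypothetical helper name of mine (not part of Ansible), and the optional second argument only exists so that another zoneinfo directory can be checked.

```sh
# tz_available ZONE [ZONEINFO_DIR]
# Succeed if ZONE exists as a regular file below ZONEINFO_DIR
# (default : /usr/share/zoneinfo), like os.path.isfile does.
tz_available() {
    zoneinfo_dir=${2:-/usr/share/zoneinfo}
    [ -f "$zoneinfo_dir/$1" ]
}
```

Inside the container : tz_available Etc/UTC || echo "Etc/UTC is missing"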

In my case, both errors were caused by the fact that myHost is a Docker container.

For timedatectl command was found but not usable :

Stop containers :
docker-compose stop

Add to docker-compose.yml :

  s_myHost:
    hostname: myHost
    build: .
    container_name: c_myHost
    tty: true
    volumes:
     - /run/dbus/system_bus_socket:/run/dbus/system_bus_socket:ro

Rebuild + restart :
docker-compose up --build -d
update your Ansible inventory

for given timezone "Etc/UTC" is not available :

Same as above, with :

  s_myHost:
    hostname: myHost
    build: .
    container_name: c_myHost
    tty: true
    volumes:
     - /run/dbus/system_bus_socket:/run/dbus/system_bus_socket:ro
     - /usr/share/zoneinfo:/usr/share/zoneinfo:ro

Full error message :

failed: [myHost] (item=someItem) => {"item": "someItem", "msg": "Failed to connect to the host via ssh: ", "unreachable": true}

This error is very common and can have multiple causes. It can usually be fixed by checking the SSH basics : right host, right user, right key.

The error message :

ERROR! 'delegate_to' is not a valid attribute for a TaskInclude

refers to this task :

  - name: Create MySQL manager user
    include_tasks: mysql-users.yml
    when: create_mysql_manager | bool
    delegate_to: "{{ groups['mysql'][0] }}"
    run_once: true
    vars:
      mysql_user_name: "{{ mysql_manager_username }}"
      mysql_user_password: "{{ mysql_manager_password }}"
      mysql_user_state: 'present'
      mysql_user_priv: '*.*:ALL,GRANT'
      mysql_user_checkadmin: false
      mysql_user_updatepw: 'on_create'

This is legacy code I have to support and (try to) update so that it runs without errors. It was built for Ansible 2.7. Not sure this was (still is) the appropriate way of doing things.

ansible --version

ansible 2.8.4

  python version = 3.7.3 (default, Apr  3 2019, 05:39:12) [GCC 8.3.0]

This is not a solution but a workaround. Add to ansible.cfg :

[defaults]

invalid_task_attribute_failed=False

Full error message :

fatal: [myHost]: UNREACHABLE! => {
    "changed": false,
    "msg": "Data could not be sent to remote host \"12.34.56.78\". Make sure this host can be reached over ssh: ",
    "unreachable": true
}

This error is usually no big deal, but it can be pretty frustrating since it can have several (simultaneous) causes.

Make sure :

12.34.56.78 is really the host you'd like to manage with Ansible : is your inventory file up-to-date ?
you can ssh -i privateKey remoteUser@12.34.56.78 :
- does remoteUser exist on 12.34.56.78 ?
- have you copied remoteUser's public key to 12.34.56.78, with proper permissions ?
you are specifying the right user in the Ansible command line :
ansible-playbook [other options] -u username

Ansible is actually using the username you specified :

Triple v's are necessary for the verbosity level shown here.

ansible-playbook [other options] -u stuart -vvv

<12.34.56.78> ESTABLISH SSH CONNECTION FOR USER: stuart		the username it's actually trying to connect as
<12.34.56.78> SSH: EXEC ssh -C \					line broken for readability
	-o ForwardAgent=yes \
	-o ControlMaster=auto \
	-o ControlPersist=60s \
	-o StrictHostKeyChecking=no \
	-o KbdInteractiveAuthentication=no \
	-o PreferredAuthentications=gssapi-with-mic,gssapi-keyex,hostbased,publickey \
	-o PasswordAuthentication=no \
	-o 'User="stuart"' \
	-o ConnectTimeout=20 \
	-o ControlPath=$HOME/.ansible/cp/5aa7fea824 172.18.0.2 '/bin/sh -c '"'"'sudo -H -S -n  -u root /bin/sh -c '"'"'"	cp stands for ControlPath
'"'"'"'"'"'echo BECOME-SUCCESS-newtperqpfiryppbejvbytibvfvwafjg ; /usr/bin/python'"'"'"'"'"'"'"'"' && sleep 0'"'"''		the full ssh command with all options
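
The manual ssh check from the list above can be wrapped into a small helper, so that it fails fast instead of waiting for a password prompt. This is only a sketch : can_ssh and the SSH variable are hypothetical names of mine, and the 10 seconds timeout is an arbitrary value. BatchMode=yes and ConnectTimeout are standard OpenSSH client options.

```sh
# can_ssh KEY USER HOST
# Try a key-based login and run "true" on the remote host.
# BatchMode=yes makes ssh fail instead of prompting for a password.
# The SSH variable (assumption of this sketch) allows swapping the ssh binary.
can_ssh() {
    ${SSH:-ssh} -i "$1" -o BatchMode=yes -o ConnectTimeout=10 "$2@$3" true
}
```

For instance : can_ssh privateKey remoteUser 12.34.56.78 && echo "SSH is fine, look at the Ansible side"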

One of my playbooks fails miserably, complaining :

failed: [slave_n] (item=etc/iptables/rules.v4) => {"changed": false, "item": "etc/iptables/rules.v4", "msg": "AnsibleFilterError: The ipaddr filter requires python-netaddr be installed on the ansible controller"}

Just install the missing module :

apt install --no-install-recommends python-netaddr

Alternatively, with pip :

pip install netaddr
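
Whichever way you install it, the module must be importable by the Python interpreter that runs Ansible on the controller (ansible --version shows which one). A minimal sketch to check this, py_has_module being a hypothetical helper name of mine :

```sh
# py_has_module INTERPRETER MODULE
# Succeed if INTERPRETER can import MODULE, fail otherwise.
py_has_module() {
    "$1" -c "import $2" >/dev/null 2>&1
}
```

For instance, assuming Ansible runs with python3 : py_has_module python3 netaddr || echo "netaddr is still missing"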

The playbook execution fails with the error message : fatal: [server]: FAILED! => {"msg": "Incorrect sudo password"}
running the same playbook again makes it fail randomly (?) on different steps of the playbook
I have checked : the sudo password I provide when launching the playbook execution works fine manually

Steps to reproduce

ansible-playbook -i myInventoryFile -l *pattern* all.yml -t myTag -kK -DC
which prompts :
SSH password: _
so I enter my SSH password (I think the black magic lies here)
I'm then prompted :
SUDO password[defaults to SSH password]: _
and I just press Enter
the playbook execution begins, until it fails as described above

Technical environment

Debian 7 (Wheezy)
sudo
sssd

Welcome to The Twilight Zone

Things are getting weird, be prepared ! (See also the alternate solution)

My sudo password is a random string of characters generated by pwgen.

No idea whether this is related or not, but it contains special characters that may puzzle the shell such as ;, (, }, |, ? or ~.

When developing / running playbooks, I have to enter my password again and again but, since I'm a lazy guy, it's also in the *scratch* buffer of my editor, Emacs, so that I can copy-paste it into the shell window when prompted.
In editors, you can copy text up to :

its last character (i.e. end of the word / string / line)
or you can copy the whole line, including the trailing carriage return, which saves pressing Enter after pasting (lazy guy, told you !)

Copy-pasting the password without the trailing carriage return seems to fix it.
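
To see whether a copied string drags such a trailing character along, its last byte can be inspected. This is a minimal sketch, ends_with_eol being a hypothetical helper name of mine. It says nothing about why Ansible chokes on it, it only tells what was actually copied.

```sh
# ends_with_eol STRING
# Succeed if the last byte of STRING is a newline or a carriage return.
ends_with_eol() {
    # od -c prints the last byte as an escape sequence such as \n or \r
    last=$(printf '%s' "$1" | tail -c 1 | od -An -c | tr -d ' ')
    [ "$last" = '\n' ] || [ "$last" = '\r' ]
}
```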

As said earlier, this behavior is rather puzzling, and no formal solution has been found so far. However, it is possible to work around it by :

removing the need for a sudo password with the NOPASSWD directive
and running the playbook without the -K flag

One of my playbooks fails miserably, complaining :

failed: [slave_n] (item=someItem) => {"failed": true, "item": "someItem", "msg": "AnsibleError: Can't LOOKUP(dig): module dns.resolver is not installed"}

Looks like something's missing on the Ansible "master" host.

Just install the missing module :

As root :
apt --no-install-recommends install python-pip

As a standard user :

pip install --upgrade dnspython

Collecting dnspython
	Downloading dnspython-1.15.0-py2.py3-none-any.whl (177kB)
		100% |................................| 184kB 4.1MB/s
Installing collected packages: dnspython
Successfully installed dnspython-1.15.0

Alternatively, as root :

apt --no-install-recommends install python-dnspython
